PredictBias

Identification of genomic and pathogenicity islands in prokaryotic genomes
 
A) Input parameters
Genome: Ecoli.gbk
Threshold dinucleotide bias: 2
Threshold codon bias: 4
Threshold %GC bias: 3
E-value (RPSBlast): 0.05
Genome (non-pathogenic):
 
 
C) Potential GIs and PAIs in CMJKDNLE_1
S.No  Start           End             Bias  Virulence  Insertion elements  Prediction
1     CMJKDNLE_00002  CMJKDNLE_00017  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
CMJKDNLE_00002  2       31       -10.693509  sodium:H+ antiporter NhaA
CMJKDNLE_00003  2       31       -10.786836  NhaR transcriptional activator
CMJKDNLE_00004  -1      18       -6.063915   hypothetical protein
CMJKDNLE_00005  -2      15       -4.395918   putative outer membrane usher protein
CMJKDNLE_00006  -2      17       -2.491177   hypothetical protein
CMJKDNLE_00007  0       23       2.742355    30S ribosomal subunit protein S20
CMJKDNLE_00008  -1      20       3.262891    bifunctional riboflavin kinase / FMN
CMJKDNLE_00009  -1      21       3.390179    isoleucyl-tRNA synthetase
CMJKDNLE_00010  -2      13       2.437498    prolipoprotein signal peptidase II
CMJKDNLE_00011  0       22       3.203760    peptidyl-prolyl cis-trans isomerase
CMJKDNLE_00012  0       21       2.617063    1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate
CMJKDNLE_00013  -1      19       2.046186    ribonucleoside hydrolase 3
CMJKDNLE_00014  0       18       1.918372    dihydrodipicolinate reductase
CMJKDNLE_00015  0       19       1.971501    carbamoyl phosphate synthetase
CMJKDNLE_00016  0       17       1.519226    carbamoyl phosphate synthetase
CMJKDNLE_00017  2       14       -0.358564   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00005  PF00577       254    3e-79    Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 254 bits (650), Expect = 3e-79
Identities = 87/387 (22%), Positives = 162/387 (41%), Gaps = 20/387 (5%)

Query: 1 MAAWRYASQDYRTFSDHLYENDKINHQSDYD----------DFYNIG--RKNSLSANIMQ 48
+ +RY++ Y F+D Y + D D+YN+ ++ L + Q
Sbjct: 476 LVGYRYSTSGYFNFADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQ 535

Query: 49 PLSNNLGNVSLSALWRNYWGRSGNAKDYQFSYSNSWQRISYTFSASQSYDENDKEEER-F 107
L + LS + YWG S + +Q + +++ I++T S S + + K ++
Sbjct: 536 QL-GRTSTLYLSGSHQTYWGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQKGRDQML 594

Query: 108 NLFISIPF--YWGDDIAKTRHQINLSNSTSFSKDGYSSNNTGITGIAGEHDQLNYGI--- 162
L ++IPF + D + S S S +G +N G+ G E + L+Y +
Sbjct: 595 ALNVNIPFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTG 654

Query: 163 YVNQQQQNNDTSLGTNLSWRTPIAIIDGSYSHSKNAWQSGGSISSGLVVWSGGINITNQL 222
Y N+ ++ L++R + YSHS + Q +S G++ + G+ + L
Sbjct: 655 YAGGGDGNSGSTGYATLNYRGGYGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPL 714

Query: 223 SDTFAILDEPGLEGAHINGQKYNRTNSKGQVVYDPIIPHRENHLVLDIANSESETELQGN 282
+DT ++ PG + A + Q RT+ +G V +REN + LD +L
Sbjct: 715 NDTVVLVKAPGAKDAKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNA 774

Query: 283 RQIIAPYRGAVSYVQFTTDQRKPWYIQALRPDGSPLTFGYDVLDLQENNIGVVGQGSRLF 342
+ P RGA+ +F + L + PL FG V + G+V +++
Sbjct: 775 VANVVPTRGAIVRAEFKARVGIKLLMT-LTHNNKPLPFGAMVTSESSQSSGIVADNGQVY 833

Query: 343 IRVDEIPTGIKVALNDEQNLFCTITFQ 369
+ + ++V +E+N C +Q
Sbjct: 834 LSGMPLAGKVQVKWGEEENAHCVANYQ 860
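
The summary lines that precede each alignment in this report ("Score = ... bits (...), Expect = ..." and "Identities = ..., Positives = ..., Gaps = ...") can be pulled out programmatically when many hits need to be compared. The following is a minimal sketch, assuming the lines appear exactly as printed above; the function name `parse_hit_summary` and the dictionary keys are hypothetical and are not part of PredictBias or RPSBlast.

```python
import re

# Patterns for the two summary lines as they are printed in this report.
SCORE_RE = re.compile(r"Score = ([\d.]+) bits \((\d+)\), Expect = (\S+)")
FRACTION_RE = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_hit_summary(text):
    """Extract bit score, raw score, E-value and the reported fractions.

    Hypothetical helper for illustration only; it reads the first
    'Score = ...' line found in `text` and every fraction that follows.
    """
    score = SCORE_RE.search(text)
    if score is None:
        raise ValueError("no 'Score = ...' line found")
    summary = {
        "bits": float(score.group(1)),
        "raw_score": int(score.group(2)),
        "evalue": float(score.group(3)),
    }
    # Each fraction is stored as (numerator, denominator, reported percent).
    for name, num, den, pct in FRACTION_RE.findall(text):
        summary[name.lower()] = (int(num), int(den), int(pct))
    return summary


example = """Score = 254 bits (650), Expect = 3e-79
Identities = 87/387 (22%), Positives = 162/387 (41%), Gaps = 20/387 (5%)"""

hit = parse_hit_summary(example)
print(hit["bits"], hit["evalue"], hit["identities"])
```

Hits without gaps simply omit the "Gaps" entry, so callers may want to use `hit.get("gaps")` rather than indexing directly.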


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00011  INFPOTNTIATR  31     0.002    Macrophage infectivity potentiator signature.
		>INFPOTNTIATR#Macrophage infectivity potentiator signature.

Length = 233

Score = 30.7 bits (69), Expect = 0.002
Identities = 14/32 (43%), Positives = 19/32 (59%)

Query: 8 NSAVLVHFTLKLDDGTTAESTRNNGKPALFRL 39
+ V V +T L DGT +ST GKPA F++
Sbjct: 144 SDTVTVEYTGTLIDGTVFDSTEKAGKPATFQV 175


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00016  HTHFIS        34     0.004    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 33.7 bits (77), Expect = 0.004
Identities = 25/143 (17%), Positives = 49/143 (34%), Gaps = 20/143 (13%)

Query: 34 CKALREEGYRVILVNS-----------NPATIMTDPEMADATYIEPIHWEVVRKIIEKER 82
+AL GY V + ++ + ++TD M D + + I+K R
Sbjct: 20 NQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD------LLPRIKKAR 73

Query: 83 PDAVLPTMGGQTALNCALELERQGVLEEFGVTM-IGATADAIDKAEDRRRFDVAMKKIGL 141
PD + M Q A++ +G + + I +A + + +
Sbjct: 74 PDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRRPSKLEDDS 133

Query: 142 ETARSGIAHS--MEEALAVAAEV 162
+ + S M+E V A +
Sbjct: 134 QDGMPLVGRSAAMQEIYRVLARL 156


2     CMJKDNLE_00041  CMJKDNLE_00050  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
CMJKDNLE_00041  -2      18       3.684146    23S rRNA and tRNA pseudouridine synthase
CMJKDNLE_00042  -2      17       3.527184    RNA Polymerase (RNAP)-binding ATPase and RNAP
CMJKDNLE_00043  -2      15       3.571642    DNA polymerase II
CMJKDNLE_00044  -1      15       3.814267    L-ribulose 5-phosphate 4-epimerase monomer
CMJKDNLE_00047  0       17       4.458348    L-arabinose isomerase monomer
CMJKDNLE_00048  1       16       4.195701    L-ribulokinase monomer
CMJKDNLE_00049  0       16       3.395709    AraC-Arabinose DNA-binding transcriptional
CMJKDNLE_00050  1       16       3.620605    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00048  TCRTETOQM     32     0.006    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 32.1 bits (73), Expect = 0.006
Identities = 20/103 (19%), Positives = 40/103 (38%), Gaps = 18/103 (17%)

Query: 300 ILIADKQSVGERAVKGICGQVDGSVV------PGFIGLEAGQS-AFGDIYAWFGRVLGWP 352
+ I++K+ + + + ++G + G I + + + G P
Sbjct: 281 VRISEKEKIK---ITEMYTSINGELCKIDKAYSGEIVILQNEFLKLNSV---LGDTKLLP 334

Query: 353 L-EQLAAQHPELKTQINASQKQ----LLPALTEAWAKNPSLDH 390
E++ P L+T + S+ Q LL AL E +P L +
Sbjct: 335 QRERIENPLPLLQTTVEPSKPQQREMLLDALLEISDSDPLLRY 377


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00049  PF05616       29     0.022    Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 28.9 bits (64), Expect = 0.022
Identities = 26/118 (22%), Positives = 47/118 (39%), Gaps = 21/118 (17%)

Query: 82 YGRHPEAREWYHQWVYFRPRAYWHEWLNWPSIFANTGFFRPDEAHQPHFSDLFGQ-IINA 140
Y R PE +E + R YW + N P ++ +F+ + +F G ++
Sbjct: 158 YSRFPEVKELMESQMERLARPYWEKLRNRPDMY----YFKNYNFKRCYFGLNGGDCLVAK 213

Query: 141 G-----------QGEGRYSELLAINLLEQLLLRRMEA-----INESLHPPMDNRVREA 182
G QG +Y E + LE++L +++A I + +P +V A
Sbjct: 214 GDDGRTFISFSLQGNSKYKEEMDAKKLEEILSLKVDANPDKYIKATGYPGYSEKVEVA 271


3     CMJKDNLE_00098  CMJKDNLE_00104  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
CMJKDNLE_00098  3       27       1.668100    N-acetyl-anhydromuramyl-L-alanine-amidase
CMJKDNLE_00099  4       32       1.570523    putative inner membrane protein
CMJKDNLE_00100  3       29       1.748102    aromatic amino acid:H+ symporter AroP
CMJKDNLE_00101  4       32       2.369485    PdhR DNA-binding transcriptional dual regulator
CMJKDNLE_00103  4       33       2.186214    subunit of E1p component of pyruvate
CMJKDNLE_00104  2       26       1.806491    Dihydrolipoyllysine-residue acetyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00104  RTXTOXIND     32     0.007    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 32.1 bits (73), Expect = 0.007
Identities = 15/60 (25%), Positives = 29/60 (48%), Gaps = 2/60 (3%)

Query: 119 EVTEILVKVGDKV-EAEQSLITVEGDKASMEVPAPFAGTVKEIKVN-VGDKVSTGSLIMV 176
E+ + L + D + L E + + + AP + V+++KV+ G V+T +MV
Sbjct: 299 EILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMV 358



Score = 31.7 bits (72), Expect = 0.008
Identities = 16/63 (25%), Positives = 27/63 (42%), Gaps = 2/63 (3%)

Query: 26 DKVEAEQSLITVEGDKASMEVPSPQAGIVKEIKVSVGDKTQTGALIMIFDSADGAADAAP 85
+ V +T G S E+ + IVKEI V G+ + G +++ + AD
Sbjct: 81 EIVATANGKLTHSGR--SKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLK 138

Query: 86 AQA 88
Q+
Sbjct: 139 TQS 141



Score = 30.6 bits (69), Expect = 0.019
Identities = 14/60 (23%), Positives = 28/60 (46%), Gaps = 2/60 (3%)

Query: 220 EVTEVMVKVGDKVAA-EQSLITVEGDKASMEVPAPFAGVVKELKVN-VGDKVKTGSLIMI 277
E+ + + + D + L E + + + AP + V++LKV+ G V T +M+
Sbjct: 299 EILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMV 358



Score = 29.8 bits (67), Expect = 0.035
Identities = 20/95 (21%), Positives = 35/95 (36%), Gaps = 3/95 (3%)

Query: 230 DKVAAEQSLITVEGDKASMEVPAPFAGVVKELKVNVGDKVKTGSLIMIFEVEGAAPAAAP 289
+ VA +T G S E+ +VKE+ V G+ V+ G +++ GA A
Sbjct: 81 EIVATANGKLTHSGR--SKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAE-ADTL 137

Query: 290 AKQEAAAPAPAAKAEAPAAAPAAKAEGKSEFAEND 324
Q + A + + + + E D
Sbjct: 138 KTQSSLLQARLEQTRYQILSRSIELNKLPELKLPD 172


4     CMJKDNLE_00119  CMJKDNLE_00142  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
CMJKDNLE_00119  -1      17       -3.208443   putative polysaccharide deacetylase lipoprotein
CMJKDNLE_00120  0       21       -4.335054   Aspartate 1-decarboxylase
CMJKDNLE_00121  2       24       -5.317596   putative transposase
CMJKDNLE_00122  4       28       -6.215646   pantothenate synthetase monomer
CMJKDNLE_00123  4       33       -8.118236   3-methyl-2-oxobutanoate hydroxymethyltransferase
CMJKDNLE_00124  5       38       -9.399157   putative fimbrial-like adhesin protein
CMJKDNLE_00125  3       34       -8.661313   putative fimbrial-like adhesin protein
CMJKDNLE_00126  3       31       -7.879889   hypothetical protein
CMJKDNLE_00127  0       21       -5.122433   putative fimbrial-like adhesin protein
CMJKDNLE_00128  -1      18       -3.838670   putative outer membrane usher protein
CMJKDNLE_00129  -1      15       -0.638541   putative pilin chaperone similar to PapD
CMJKDNLE_00130  -1      15       0.190553    putative fimbrial-like adhesin protein
CMJKDNLE_00131  -1      14       1.604878    6-hydroxymethyl-7,8-dihydropterin
CMJKDNLE_00132  0       14       3.045758    poly(A) polymerase I
CMJKDNLE_00133  -1      14       2.887431    glutamyl-Q tRNAAsp synthetase
CMJKDNLE_00134  0       10       2.104256    RNA polymerase-binding transcription factor
CMJKDNLE_00135  0       11       2.420862    putative DNA-binding transcriptional regulator
CMJKDNLE_00136  -1      12       2.823244    2'-5' RNA ligase
CMJKDNLE_00137  -1      14       3.670980    putative ATP-dependent helicase
CMJKDNLE_00138  -2      15       3.410500    penicillin-binding protein 1B
CMJKDNLE_00139  -1      13       3.210257    ferrichrome / phage / antibiotic outer membrane
CMJKDNLE_00140  1       16       4.293210    iron (III) hydroxamate ABC transporter - ATP
CMJKDNLE_00141  1       15       3.969307    iron (III) hydroxamate ABC transporter -
CMJKDNLE_00142  0       14       3.772487    iron (III) hydroxamate ABC transporter -
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00123  FLGMRINGFLIF  29     0.022    Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 29.2 bits (65), Expect = 0.022
Identities = 26/99 (26%), Positives = 39/99 (39%), Gaps = 20/99 (20%)

Query: 110 MVKIEGGEWL----VETVQMLTERAVPVCGHLGLTPQSVNIFGGYKVQGRGDEAGDQL-L 164
V +E G L + V L AV GL P +V + D++G L
Sbjct: 176 TVTLEPGRALDEGQISAVVHLVSSAVA-----GLPPGNVTLV---------DQSGHLLTQ 221

Query: 165 SDALALEAAGAQLLVLECVPVELAKRITEALAIPVIGIG 203
S+ + AQL V + +RI L+ P++G G
Sbjct: 222 SNTSGRDLNDAQLKFANDVESRIQRRIEAILS-PIVGNG 259


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00128  PF00577       792    0.0      Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 792 bits (2048), Expect = 0.0
Identities = 271/870 (31%), Positives = 432/870 (49%), Gaps = 40/870 (4%)

Query: 14 RIATFCALLYCNTAFSAELVEYDHTFLMGQNASNIDLSRYSEGNPAIPGVYDVSVYVNDQ 73
R+ CA SAE + ++ FL + DLSR+ G PG Y V +Y+N+
Sbjct: 29 RLFVACAFAAQAPLSSAE-LYFNPRFLADDPQAVADLSRFENGQELPPGTYRVDIYLNNG 87

Query: 74 PIINQSITFVAIEGKKNAQACITLKNLLQFHINSPDINNEKAVLLARDETLGNCLNLTEI 133
+ + +TF + ++ C+T L +N+ + LLA D C+ LT +
Sbjct: 88 YMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASV--SGMNLLADDA----CVPLTSM 141

Query: 134 IPQASVRYDVNDQRLDIDVPQAWVMKNYQNYVDPSLWENGINAAMLSYNLNGYHSETP-G 192
I A+ + DV QRL++ +PQA++ + Y+ P LW+ GINA +L+YN +G + G
Sbjct: 142 IHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQNRIG 201

Query: 193 RKNESIYAAFNGGMNLGAWRLRASGNYNWMTDSGS-----NYDFKNRYVQRDIASLRSQL 247
+ Y G+N+GAWRLR + +++ + S + N +++RDI LRS+L
Sbjct: 202 GNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRSRL 261

Query: 248 ILGESYTTGETFDSVSIRGIRLYSDSRMLPPTLASFAPIIHGVANTNAKVTITQGGYKIY 307
LG+ YT G+ FD ++ RG +L SD MLP + FAP+IHG+A A+VTI Q GY IY
Sbjct: 262 TLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDIY 321

Query: 308 ETTVPPGAFVIDDLSPSGYGSDLIVTIEESDGSKRTFSQPFSSVVQMLRPGVGRWDISGG 367
+TVPPG F I+D+ +G DL VTI+E+DGS + F+ P+SSV + R G R+ I+ G
Sbjct: 322 NSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSITAG 381

Query: 368 QVLKDD-IQDEPNLFQASYYYGLNNYLTGYTGIQITDNNYTAGLLGLGLNT-SVGAFSFD 425
+ + Q++P FQ++ +GL T Y G Q+ D Y A G+G N ++GA S D
Sbjct: 382 EYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLAD-RYRAFNFGIGKNMGALGALSVD 440

Query: 426 VTHSNVRIPDDKTYQGQSYRVSWNKLFEETSTSLNIAAYRYSTQNYLGLNDALTLIDEVK 485
+T +N +PDD + GQS R +NK E+ T++ + YRYST Y D
Sbjct: 441 MTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGY 500

Query: 486 HPE-----QDLEPKFMRNYSRM---KNQVTVSINQPLKFEKKDYGSFYLSGSWSDYWASG 537
+ E ++PKF Y+ + ++ +++ Q L + YLSGS YW +
Sbjct: 501 NIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQL----GRTSTLYLSGSHQTYWGTS 556

Query: 538 QNRSNYSIGYSNSTSWGSYSVSAQRSWNE-DGDTDDSVYLSFTIPIEKLLGTEQRTS-GF 595
+ G + + ++++S + N D + L+ IP L ++ ++
Sbjct: 557 NVDEQFQAGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSDSKSQWRH 616

Query: 596 QSIDTQISSDFKGNNQLNVSSSGYS-DNARVSYSVNTGYTMNKASKDLSYVGGYASYESP 654
S +S D G G ++ +SYSV TGY S +Y
Sbjct: 617 ASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGG 676

Query: 655 WGTLAGSISANSDNSRQVSLSTDGGFVLHSGGLTFSNDSFSDSDTLAVVQAPGAQGARIN 714
+G S + D +Q+ GG + H+ G+T +DT+ +V+APGA+ A++
Sbjct: 677 YGNANIGYSHSDDI-KQLYYGVSGGVLAHANGVTLGQPL---NDTVVLVKAPGAKDAKVE 732

Query: 715 YGNST-IDRWGYGVTSALSPYHENRIALDINDLENDVELKSTSAVAVPRQGSVVFADFET 773
D GY V + Y ENR+ALD N L ++V+L + A VP +G++V A+F+
Sbjct: 733 NQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKA 792

Query: 774 VQGQSAIMNITRSDGKNIPFAADIYDEQGNVIGNVGQGGQAFVRGIEQQGNISIKWLEQS 833
G +M +T + K +PF A + E G V GQ ++ G+ G + +KW E+
Sbjct: 793 RVGIKLLMTLTH-NNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGKVQVKWGEEE 851

Query: 834 KPVSCLAHYQQSPEAEKIAQSIILNGIRCQ 863
C+A+YQ P + L+ C+
Sbjct: 852 NA-HCVANYQL-PPESQQQLLTQLSA-ECR 878


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00141  FERRIBNDNGPP  511    0.0      Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 511 bits (1318), Expect = 0.0
Identities = 296/296 (100%), Positives = 296/296 (100%)

Query: 1 MSGLPLISRRRLLTAMALSPLLWQMNTAHAAAIDPNRIVALEWLPVELLLALGIVPYGVA 60
MSGLPLISRRRLLTAMALSPLLWQMNTAHAAAIDPNRIVALEWLPVELLLALGIVPYGVA
Sbjct: 1 MSGLPLISRRRLLTAMALSPLLWQMNTAHAAAIDPNRIVALEWLPVELLLALGIVPYGVA 60

Query: 61 DTINYRLWVSEPPLPDSVIDVGLRTEPNLELLTEMKPSFMVWSAGYGPSPEMLARIAPGR 120
DTINYRLWVSEPPLPDSVIDVGLRTEPNLELLTEMKPSFMVWSAGYGPSPEMLARIAPGR
Sbjct: 61 DTINYRLWVSEPPLPDSVIDVGLRTEPNLELLTEMKPSFMVWSAGYGPSPEMLARIAPGR 120

Query: 121 GFNFSDGKQPLAMARKSLTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLT 180
GFNFSDGKQPLAMARKSLTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLT
Sbjct: 121 GFNFSDGKQPLAMARKSLTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLT 180

Query: 181 TLIDPRHMLVFGPNSLFQEILDEYGIPNAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDH 240
TLIDPRHMLVFGPNSLFQEILDEYGIPNAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDH
Sbjct: 181 TLIDPRHMLVFGPNSLFQEILDEYGIPNAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDH 240

Query: 241 DNSKDMDALMATPLWQAMPFVRAGRFQRVPAVWFYGATLSAMHFVRVLDNAIGGKA 296
DNSKDMDALMATPLWQAMPFVRAGRFQRVPAVWFYGATLSAMHFVRVLDNAIGGKA
Sbjct: 241 DNSKDMDALMATPLWQAMPFVRAGRFQRVPAVWFYGATLSAMHFVRVLDNAIGGKA 296


5     CMJKDNLE_00193  CMJKDNLE_00243  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
CMJKDNLE_00193  -1      22       -3.284696   *4-nitrobenzaldehyde reductase [multifunctional]
CMJKDNLE_00194  -2      25       -2.798778   putative DNA-binding transcriptional regulator
CMJKDNLE_00195  -1      24       -2.765165   hypothetical protein
CMJKDNLE_00196  -2      23       -3.224744   putative S-adenosylmethionine-dependent
CMJKDNLE_00197  -2      23       -4.265715   membrane-bound lytic murein transglycosylase D
CMJKDNLE_00198  -1      31       -7.577296   glyoxalase II
CMJKDNLE_00199  0       32       -8.026905   putative S-adenosyl-L-methionine-dependent
CMJKDNLE_00200  2       34       -9.722630   RNase HI, degrades RNA of DNA-RNA hybrids,
CMJKDNLE_00201  2       29       -6.620007   DNA polymerase III, epsilon subunit
CMJKDNLE_00203  2       19       -1.819333   *putative aminopeptidase
CMJKDNLE_00204  2       18       0.236219    hypothetical protein
CMJKDNLE_00205  2       20       2.433875    putative inner membrane protein
CMJKDNLE_00206  1       21       5.157087    hypothetical protein
CMJKDNLE_00207  2       22       5.722995    hypothetical protein
CMJKDNLE_00208  1       23       5.610145    hypothetical protein
CMJKDNLE_00209  1       24       5.632271    hypothetical protein
CMJKDNLE_00210  1       23       5.483265    hypothetical protein
CMJKDNLE_00211  2       21       4.520941    ClpB chaperone
CMJKDNLE_00212  2       19       2.448403    hypothetical protein
CMJKDNLE_00213  3       19       2.481933    hypothetical protein
CMJKDNLE_00214  2       19       1.677597    hypothetical protein
CMJKDNLE_00215  3       20       1.689749    hypothetical protein
CMJKDNLE_00216  3       21       0.646412    hypothetical protein
CMJKDNLE_00217  2       20       0.354199    hypothetical protein
CMJKDNLE_00218  4       20       1.299905    hypothetical protein
CMJKDNLE_00219  4       29       4.838073    hypothetical protein
CMJKDNLE_00220  4       32       4.645946    hypothetical protein
CMJKDNLE_00221  3       29       2.922587    hypothetical protein
CMJKDNLE_00222  3       28       3.128636    hypothetical protein
CMJKDNLE_00223  3       28       2.918454    Actin cross-linking toxin VgrG1
CMJKDNLE_00224  2       23       3.329575    RhsD protein in rhs element
CMJKDNLE_00225  -1      12       -1.127529   hypothetical protein
CMJKDNLE_00227  -1      12       0.387821    putative transposase
CMJKDNLE_00228  -2      16       1.999241    putative C-N hydrolase family amidase,
CMJKDNLE_00229  -1      15       1.114716    inhibitor of vertebrate C-type lysozyme
CMJKDNLE_00230  0       15       1.093964    acyl-CoA dehydrogenase
CMJKDNLE_00231  2       15       -1.852940   D-sedoheptulose 7-phosphate isomerase
CMJKDNLE_00233  3       20       -2.806900   putative amidotransferase
CMJKDNLE_00234  1       21       -0.049884   hypothetical protein
CMJKDNLE_00235  2       22       0.657498    toxin of the YafQ-DinJ toxin-antitoxin system
CMJKDNLE_00236  2       21       1.740960    DinJ antitoxin of YafQ-DinJ toxin-antitoxin
CMJKDNLE_00237  2       19       1.425207    putative lipoprotein and C40 family peptidase
CMJKDNLE_00239  2       18       0.737648    REP-associated tyrosine transposase
CMJKDNLE_00241  2       18       0.932297    flagellar biosynthesis protein FlhA
CMJKDNLE_00242  3       20       -2.027773   MotB protein, enables flagellar motor rotation,
CMJKDNLE_00243  2       20       -1.842991   DNA polymerase IV (Y-family DNA polymerase;
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00198  BINARYTOXINB  34     4e-04    Binary toxin B family signature.
		>BINARYTOXINB#Binary toxin B family signature.

Length = 764

Score = 34.3 bits (78), Expect = 4e-04
Identities = 12/55 (21%), Positives = 28/55 (50%), Gaps = 4/55 (7%)

Query: 186 NDYYRKVKELRAKNQITLPVILKNERQINVFLRT----EDIDLINVINEETLLQQ 236
+ ++ EL A N T+ +K ++N+ +R D + I V +E+++++
Sbjct: 589 QNIKNQLAELNATNIYTVLDKIKLNAKMNILIRDKRFHYDRNNIAVGADESVVKE 643


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00205  PF01206       30     0.001    SirA family protein
		>PF01206#SirA family protein

Length = 76

Score = 30.5 bits (69), Expect = 0.001
Identities = 15/58 (25%), Positives = 32/58 (55%), Gaps = 1/58 (1%)

Query: 58 PLTLDVAKKILSPITSFDYIHFITTHPSGIKDTLAWLVNAG-KLMTEFDDNGKIIFNL 114
PL + AKK L+ + + + ++ + T P +KD ++ G +L+ + +++G F L
Sbjct: 16 PLPILKAKKTLATMNAGEVLYVMATDPGSVKDFESFSKQTGHELLEQKEEDGTYHFRL 73


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00208  PF06580       32     0.016    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 31.8 bits (72), Expect = 0.016
Identities = 15/95 (15%), Positives = 29/95 (30%), Gaps = 9/95 (9%)

Query: 18 RPAMPRFKVSAFWLLILAWIFL-LVWIWWKGPTWTLYEEQWLKPLANRWLATAAWG---- 72
+ + ++ A + + +VW W L KP+A +
Sbjct: 66 QGWLKLNMGQIILRVLPACVVIGMVWFVANTSIWRLLAFINTKPVAFTLPLALSIIFNVV 125

Query: 73 IIALMW----LTVRVMKRLQQLEKMQKQQREEAVD 103
++ MW K +Q E Q + A +
Sbjct: 126 VVTFMWSLLYFGWHFFKNYKQAEIDQWKMASMAQE 160


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00224  OUTRSURFACE   37     4e-04    Outer surface protein signature.
		>OUTRSURFACE#Outer surface protein signature.

Length = 273

Score = 37.2 bits (86), Expect = 4e-04
Identities = 34/147 (23%), Positives = 60/147 (40%), Gaps = 21/147 (14%)

Query: 536 DGTGVRRRITRNRYGQLLAFTDCSGYTTRYEYDQYGQQIA---VHREEGISTYSSYNPRG 592
+G+GV ++ L D TT + + G+ + V ++ ST +N +G
Sbjct: 71 NGSGVLEGTKDDKSKAKLTIADDLSKTTFELFKEDGKTLVSRKVSSKDKTSTDEMFNEKG 130

Query: 593 QLISRKDAQGRETRYEYSAAGDLTATISPDGKRSATEYDKR----------GRPVSVTEG 642
+L ++ + T+ EY+ + DG A E K + V EG
Sbjct: 131 ELSAKTMTRENGTKLEYT-------EMKSDGTGKAKEVLKNFTLEGKVANDKVTLEVKEG 183

Query: 643 GLTRSMGYDAAGRITV-LTNENGSQST 668
+T S +G +TV L + N +Q+T
Sbjct: 184 TVTLSKEIAKSGEVTVALNDTNTTQAT 210


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00235  ENTSNTHTASED  27     0.011    Enterobactin synthetase component D signature.
		>ENTSNTHTASED#Enterobactin synthetase component D signature.

Length = 234

Score = 26.9 bits (59), Expect = 0.011
Identities = 6/23 (26%), Positives = 10/23 (43%)

Query: 45 AVYKDHPLQGSWKGYRDAHVEPD 67
+VYK + + G+ A V
Sbjct: 153 SVYKAFSDRVTLPGFNSAKVTSL 175


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00242  OMPADOMAIN    38     1e-05    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 38.0 bits (88), Expect = 1e-05
Identities = 30/118 (25%), Positives = 46/118 (38%), Gaps = 22/118 (18%)

Query: 71 FERGSAKIMPFFKTLLVELAPVFDSL---DNKIIITGHTDAM---AYKNNIYNNWNLSGD 124
F A + P + L +L +L D +++ G+TD + AY N LS
Sbjct: 223 FNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAY------NQGLSER 276

Query: 125 RALSARRVLEEAGMPEDKVMQVS-----AMADQMLLDSKNPQS-----AGNRRIEIMV 172
RA S L G+P DK+ + + K + A +RR+EI V
Sbjct: 277 RAQSVVDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIEV 334


6     CMJKDNLE_00252  CMJKDNLE_00286  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
CMJKDNLE_00252  -2      17       -4.302266   Crl transcriptional regulator
CMJKDNLE_00253  0       26       -7.503074   outer membrane porin E
CMJKDNLE_00254  1       33       -8.875021   gamma-glutamyl kinase-GP-reductase multienzyme
CMJKDNLE_00255  3       38       -10.395091  gamma-glutamyl kinase-GP-reductase multienzyme
CMJKDNLE_00257  7       47       -12.757865  *CPS-53 (KpLE1) prophage; prophage CPS-53
CMJKDNLE_00258  7       44       -12.832368  putative type I restriction enzymeP M protein
CMJKDNLE_00259  6       36       -10.161305  hypothetical protein
CMJKDNLE_00260  4       26       -5.032714   hypothetical protein
CMJKDNLE_00261  5       24       1.660867    CP4-44 prophage; predicted DNA repair protein
CMJKDNLE_00262  2       20       3.446854    CP4-44 prophage; predicted protein
CMJKDNLE_00263  1       21       5.371804    hypothetical protein
CMJKDNLE_00264  1       22       5.978538    CP4-57 prophage; predicted antitoxin of the
CMJKDNLE_00265  1       21       5.888319    CP4-57 prophage; toxin of the YpjF-YfjZ
CMJKDNLE_00266  1       19       5.301783    putative transcriptional regulator LYSR-type
CMJKDNLE_00267  0       19       5.111837    hypothetical protein
CMJKDNLE_00268  0       18       3.384215    aldehyde dehydrogenase: molybdenum
CMJKDNLE_00269  1       20       2.098773    aldehyde dehydrogenase, FAD-binding subunit
CMJKDNLE_00270  1       21       1.246132    aldehyde dehydrogenase, Fe-S subunit
CMJKDNLE_00271  2       21       1.056904    inner membrane protein that contributes to acid
CMJKDNLE_00273  3       22       0.639866    hypothetical protein
CMJKDNLE_00274  3       22       0.337894    hypothetical protein
CMJKDNLE_00275  3       22       0.200232    hypothetical protein
CMJKDNLE_00276  5       23       -4.293099   hypothetical protein
CMJKDNLE_00277  3       20       -1.810418   E. coli common pilus - major subunit; cryptic
CMJKDNLE_00278  3       23       -2.579545   MatA DNA-binding transcriptional dual regulator
CMJKDNLE_00279  2       24       -2.552507   putative ribosomal protein
CMJKDNLE_00280  2       23       -2.802855   putative ribosomal protein
CMJKDNLE_00281  0       23       -3.325041   small membrane protein
CMJKDNLE_00282  0       23       -2.801119   adhesin
CMJKDNLE_00283  1       26       -5.485161   hypothetical protein
CMJKDNLE_00284  1       24       -4.206519   putative DNA-binding transcriptional regulator
CMJKDNLE_00285  0       23       -2.935461   hypothetical protein
CMJKDNLE_00286  1       23       -3.931817   putative oxidoreductase with FAD/NAD(P)-binding
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00253  ECOLIPORIN    549    0.0      E.coli/Salmonella-type porin signature.
		>ECOLIPORIN#E.coli/Salmonella-type porin signature.

Length = 383

Score = 549 bits (1415), Expect = 0.0
Identities = 232/384 (60%), Positives = 269/384 (70%), Gaps = 34/384 (8%)

Query: 1 MKKSTLALVVMGIVASASVQAAEIYNKDGNKLDVYGKVKAMHYMSDNASKDGDQSYIRFG 60
MK+ LALV+ ++A+ + AAEIYNKDGNKLD+YGKV +HY SD++SKDGDQ+Y+R G
Sbjct: 1 MKRKVLALVIPALLAAGAAHAAEIYNKDGNKLDLYGKVDGLHYFSDDSSKDGDQTYMRVG 60

Query: 61 FKGETQINDQLTGYGRWEAEFAGNKAESDTAQQKTRLAFAGLKYKDLGSFDYGRNLGALY 120
FKGETQINDQLTGYG+WE N E + A TRLAFAGLK+ D GSFDYGRN G LY
Sbjct: 61 FKGETQINDQLTGYGQWEYNVQANTTEGEGANSWTRLAFAGLKFGDYGSFDYGRNYGVLY 120

Query: 121 DVEAWTDMFPEFGGDSSAQTDNFMTKRASGLATYRNTDFFGVIDGLNLTLQYQGKNEN-- 178
DVE WTDM PEFGGDS DN+MT RA+G+ATYRNTDFFG++DGLN LQYQGKNE+
Sbjct: 121 DVEGWTDMLPEFGGDSYTYADNYMTGRANGVATYRNTDFFGLVDGLNFALQYQGKNESQS 180

Query: 179 --------------RDVKKQNGDGFGTSLTYDFGGSDFSISGAYTNSDRTNEQNLQSR-- 222
D++ NGDGFG S TYD G FS AYT SDRTNEQ
Sbjct: 181 ADDVNIGTNNRNNGDDIRYDNGDGFGISTTYDI-GMGFSAGAAYTTSDRTNEQVNAGGTI 239

Query: 223 GTGKRAEAWATGLKYDANNIYLATFYSETRKMTP-------ITGGFANKTQNFEAVAQYQ 275
G +A+AW GLKYDANNIYLAT YSETR MTP GG ANKTQNFE AQYQ
Sbjct: 240 AGGDKADAWTAGLKYDANNIYLATMYSETRNMTPYGKTDKGYDGGVANKTQNFEVTAQYQ 299

Query: 276 FDFGLRPSLGYVLSKGKDIE----GIGDEDLINYIDVGATYYFNKNMSAFVDYKINQLDS 331
FDFGLRP++ +++SKGKD+ D+DL+ Y DVGATYYFNKN S +VDYKIN LD
Sbjct: 300 FDFGLRPAVSFLMSKGKDLTYNNVNGDDKDLVKYADVGATYYFNKNFSTYVDYKINLLDD 359

Query: 332 DNKL----NINNDDIVAVGMTYQF 351
D+ I+ DDIVA+GM YQF
Sbjct: 360 DDPFYKDAGISTDDIVALGMVYQF 383


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00254  CARBMTKINASE  37     6e-05    Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 37.5 bits (87), Expect = 6e-05
Identities = 28/127 (22%), Positives = 48/127 (37%), Gaps = 17/127 (13%)

Query: 119 DTLRALLDNNI---------VPVINENDAVATAEIKVGDNDNLSALAAILAGADKLLLLT 169
+T++ L++ + VPVI E+ + E V D D A AD ++LT
Sbjct: 177 ETIKKLVERGVIVIASGGGGVPVILEDGEIKGVE-AVIDKDLAGEKLAEEVNADIFMILT 235

Query: 170 DQKGLYTADPRSNPQAELIKDVYGIDDALRAIAGDSVSGLGTGGMSTKLQAA-DVACRAG 228
D G + + +++V +++ + G M K+ AA G
Sbjct: 236 DVNGAALY--YGTEKEQWLREV-KVEELRKYYEEG---HFKAGSMGPKVLAAIRFIEWGG 289

Query: 229 IDTIIAA 235
IIA
Sbjct: 290 ERAIIAH 296



Score = 30.2 bits (68), Expect = 0.013
Identities = 16/76 (21%), Positives = 33/76 (43%), Gaps = 13/76 (17%)

Query: 4 SQTLVVKLGTSVLTGGSRRLNRAHIVELVRQCAQ----LHAAGHRIVIVTSG-------- 51
+ +V+ LG + L ++ + +++ VR+ A+ + A G+ +VI
Sbjct: 2 GKRVVIALGGNALQQRGQKGSYEEMMDNVRKTARQIAEIIARGYEVVITHGNGPQVGSLL 61

Query: 52 -AIAAGREHLGYPELP 66
+ AG+ G P P
Sbjct: 62 LHMDAGQATYGIPAQP 77


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00275  PF00577       63     3e-12    Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 63.3 bits (154), Expect = 3e-12
Identities = 38/316 (12%), Positives = 93/316 (29%), Gaps = 33/316 (10%)

Query: 487 TLNLNSLWSKLGTFSISYNDDRRYNSHYYTADYYQNVYSGTFGSLGLRAGIQRYNNGDSN 546
L + + T +S + Y + +Q + F + N
Sbjct: 530 QLTVTQQLGRTSTLYLSG-SHQTYWGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQK 588

Query: 547 ANTGKYIALDLSLPLGNWFSAGMTHQNGYTMANLSARKQFDEGT------------IRTV 594
+ +AL++++P +W + Q + A+ S + +
Sbjct: 589 -GRDQMLALNVNIPFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNL 647

Query: 595 GANLSRAISGDTGDDKTLSGGAYAQFDARYASGTLNVNSAADGYVNTNLTANGSVGWQGK 654
++ +G + +G A + Y + + S +D +G V
Sbjct: 648 SYSVQTGYAGGGDGNSGSTGYATLNYRGGYGNANIGY-SHSDDIKQLYYGVSGGVLAHAN 706

Query: 655 NIAASGRTDGNAGVIFNTGLED---DGQISAKINGRIFPLNGKRNYLPLSPYGRYEVELQ 711
+ + ++ G +D + Q + + R G + Y V L
Sbjct: 707 GVTLGQPLNDTVVLVKAPGAKDAKVENQTGVRTDWR-----GYAVLPYATEYRENRVALD 761

Query: 712 NSKNSLDSYDIVSGRKSRLTLYPGNVAVIEPEVKQMVTVSGRIRAEDGTLLANARINNHI 771
+ + + D+ + + G + E + + + + + + L A +
Sbjct: 762 TNTLADN-VDLDNAVA-NVVPTRGAIVRAEFKARVGIKLLMTLTHNNKPLPFGAMV---- 815

Query: 772 GRTRTDENGEFVMDVD 787
T E+ + V
Sbjct: 816 ----TSESSQSSGIVA 827


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00282  INTIMIN       560    0.0      Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 560 bits (1443), Expect = 0.0
Identities = 233/818 (28%), Positives = 362/818 (44%), Gaps = 49/818 (5%)

Query: 41 PVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDS-----DATRNF 95
P++AA +L+ + VT N + ++AA L SQ S D ++
Sbjct: 131 PLVAAGGVAGHTNKLTKMSPDVTKSNMTDDKALNYAAQQAASLGSQLQSRSLNGDYAKDT 190

Query: 96 ITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAI 155
G+A +A+ ++Q WL YGTA V L +F SSL+ L P YD+ + F Q
Sbjct: 191 ALGIAGNQASSQLQAWLQHYGTAEVNLQSGNNFD--GSSLDFLLPFYDSEKMLAFGQVGA 248

Query: 156 HRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGY 215
D R +N+G G R F M G N FID D S +TR+G+G EYWRDY K S NGY
Sbjct: 249 RYIDSRFTANLGAGQRFFLPE-NMLGYNVFIDQDFSGDNTRLGIGGEYWRDYFKSSVNGY 307

Query: 216 IRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQ 275
R SGW +S + +DY ERPANG+DIR GYLP++P LGA LMYEQYYGD V LF DK Q
Sbjct: 308 FRMSGWHESYNKKDYDERPANGFDIRFNGYLPSYPALGAKLMYEQYYGDNVALFNSDKLQ 367

Query: 276 KDPHAISAEVTYTPVPLLTLSAGHKQGKSGENDTRFGLEVNYRIGEPLAKQLDTDSIRER 335
+P A + V YTP+PL+T+ ++ G END + ++ Y+ +P ++Q++ + E
Sbjct: 368 SNPGAATVGVNYTPIPLVTMGIDYRHGTGNENDLLYSMQFRYQFDKPWSQQIEPQYVNEL 427

Query: 336 RVLAGSRYDLVERNNNIVLEYRKSEVIRIALPERIEGKGGQTLSLGLVVSKATHGLKNVQ 395
R L+GSRYDLV+RNNNI+LEY+K +++ + +P I G T + L+V K+ +GL +
Sbjct: 428 RTLSGSRYDLVQRNNNIILEYKKQDILSLNIPHDINGTERSTQKIQLIV-KSKYGLDRIV 486

Query: 396 WEAPSLLAEGGKITGQGSQ----WQVTLPAYRPGKDNYYAISAVAYDNKGNASKRVQTEV 451
W+ +L ++GG+I GSQ +Q LPAY G N Y ++A AYD GN+S V +
Sbjct: 487 WDDSALRSQGGQIQHSGSQSAQDYQAILPAYVQGGSNVYKVTARAYDRNGNSSNNVLLTI 546

Query: 452 VITGAGMSADRTALTLDGQSRIQMLANGNEQRPLVLSLRDAEGQPVTGTKDQIKTELAFK 511
+ G D+ +T + A+G E +T T K +A
Sbjct: 547 TVLSNGQVVDQVGVTDFTADKTSAKADGTE--------------AITYTATVKKNGVA-- 590

Query: 512 PAGNIVTRSLKATKSQAKPTLGEFTETEAGVYQSVFTTGTQSGEATITVSVDGMSKTVTA 571
A V S A + +G + G+ ++ M+ + A
Sbjct: 591 QANVPV--SFNIVSGTAVLSANSANTNGSGKATVTLKSDK-PGQVVVSAKTAEMTSALNA 647

Query: 572 ELRATMMDVANSTLSANEPSGDVVADGQQAYTLTLTAVDSEGNPVTGEASRLRFVPQDTN 631
+ S VA+GQ A T T+ V PV+ + T
Sbjct: 648 NAVIFVDQTKASITEIKADKTTAVANGQDAITYTVK-VMKGDKPVSNQEVTF-----TTT 701

Query: 632 GVTVGAIS--EIKPGVYSATVSSTRAGNVVVRAFSEQYQLGTLQQTLKFVAGP-LDAAHS 688
+ + G T++ST G +V A + ++F +D +
Sbjct: 702 LGKLSNSTEKTDTNGYAKVTLTSTTPGKSLVSARVSDVAVDVKAPEVEFFTTLTIDDGNI 761

Query: 689 SITLNPDKPVVGGTVTAIWTAKDAYDNPVTSLTPE---APSLAGAAAVGSTASGWTNNGD 745
I V G + +W + + + + A+V +++ T
Sbjct: 762 EIVGTG----VKGKLPTVWLQYGQVNLKASGGNGKYTWRSANPAIASVDASSGQVTLKEK 817

Query: 746 GTWTAQITLGSTAGELEVMPKLNGQDAAANAAKVTVVADALSSNQSKVSVAEDHVKAGES 805
GT T + + N N +K DA+++ ++ E+
Sbjct: 818 GTTTISVISSDNQTATYTIATPNSL-IVPNMSKRVTYNDAVNTCKNFGGKLPSSQNELEN 876

Query: 806 TTVTLIAKDAHGNTISGLSLSASLTGTASEGATVSSWT 843
A + + S ++ + + TA + + + T
Sbjct: 877 VFKAWGAANKYEYYKSSQTIISWVQQTAQDAKSGVAST 914



Score = 76.6 bits (188), Expect = 4e-16
Identities = 75/372 (20%), Positives = 124/372 (33%), Gaps = 51/372 (13%)

Query: 882 TVIAGEMSSANSTLVADNKAPTVKMTTELTFTVKDAYGNPVTGLKPDAPVFSGAASTGSE 941
V + ++ ++ AD + T + PV+ + SG A
Sbjct: 557 QVGVTDFTADKTSAKADGTEA-ITYTATVKKNGVAQANVPVSFN-----IVSGTAV---- 606

Query: 942 RPSAGNWTEKGNGVYVATLTLGSAAGQLSVMPRVNGQNAVAQPLVLNVAGDASKAEIRDM 1001
SA + G+G TL + +A+ V+ V D +KA I ++
Sbjct: 607 -LSANSANTNGSGKATVTLKSDKPGQVVVSAKTAEMTSALNANAVIFV--DQTKASITEI 663

Query: 1002 TVKVNNQLANGQSANQITLTV-VDSYGNPLQGQEVTLTLPQGVTSKTGNTVTTNAAGKVD 1060
+ANGQ A IT TV V P+ QEVT T G S + T T+ G
Sbjct: 664 KADKTTAVANGQDA--ITYTVKVMKGDKPVSNQEVTFTTTLGKLSNS--TEKTDTNGYAK 719

Query: 1061 IELMSTVAGEHSITASVNNAQ---KTVTVKFKADFS--TGQATLE---VDGSTPKVANDN 1112
+ L ST G+ ++A V++ K V+F + G + V G P V
Sbjct: 720 VTLTSTTPGKSLVSARVSDVAVDVKAPEVEFFTTLTIDDGNIEIVGTGVKGKLPTVWLQY 779

Query: 1113 DAFTLTATVKDQYGNLLPGAVVVFNLPRGVKPLADGNIMVNADKEGKAELKVVSVTAGTY 1172
L A+ G + A+ I G+ LK GT
Sbjct: 780 GQVNLKAS----------GGNGKYTW-----RSANPAIASVDASSGQVTLK----EKGTT 820

Query: 1173 EITASAGNDQPSNAQSVTFVADKTTATISSIEVIGNRAVADGKTKQTYKVTVTDANNNLL 1232
I+ + ++Q T+ + I + D ++ N L
Sbjct: 821 TISVISSDNQT-----ATYTIATPNSLI-VPNMSKRVTYNDAVNTCKNFGGKLPSSQNEL 874

Query: 1233 KDSDVTLTASSE 1244
++ A+++
Sbjct: 875 ENVFKAWGAANK 886



Score = 54.7 bits (131), Expect = 3e-09
Identities = 57/366 (15%), Positives = 104/366 (28%), Gaps = 52/366 (14%)

Query: 779 VTVVADALSSNQSKV---SVAEDHVKAGESTTVTLIA------KDAHGNTISGLSLSASL 829
+TV+++ +Q V + + KA + +T A +S +S
Sbjct: 546 ITVLSNGQVVDQVGVTDFTADKTSAKADGTEAITYTATVKKNGVAQANVPVSFNIVS--- 602

Query: 830 TGTASEGATVSSWTEKGDCSYVATLTTGGKTGELRVMPLFNGQPAATEAAQLTVIAGEMS 889
GTA A +S G TL + + A A + +
Sbjct: 603 -GTAVLSA--NSANTNGSGKATVTLKSDKPGQVVVSAKTAEMTSALN--ANAVIFVDQTK 657

Query: 890 SANSTLVADNKAPTVKMTTELTFTVKDAY-GNPVTGLKPDAPVFSGAASTGSERPSAGNW 948
++ + + AD +T+TVK PV+ + +T + S
Sbjct: 658 ASITEIKADKTTAVANGQDAITYTVKVMKGDKPVSNQEV-------TFTTTLGKLSNSTE 710

Query: 949 TEKGNGVYVATLTLGSAAGQLSVMPRVNGQN-AVAQPLVLNVAG---DASKAEIRDMTVK 1004
NG TLT + G+ V RV+ V P V D EI
Sbjct: 711 KTDTNGYAKVTLT-STTPGKSLVSARVSDVAVDVKAPEVEFFTTLTIDDGNIEI------ 763

Query: 1005 VNNQLANGQSANQITLTV-VDSYGNPLQGQEVTLTLPQGVTSKTGNTVTTNAAGKVDIEL 1063
+ G T+ + G T ++ ++G+V ++
Sbjct: 764 ----VGTGVKGKLPTVWLQYGQVNLKASGGNGKYTW---RSANPAIASVDASSGQVTLK- 815

Query: 1064 MSTVAGEHSITASVNNAQKTVTVKFKADFSTGQATLEVDGSTPKVANDNDAFTLTATVKD 1123
G +I+ ++ Q T + + + +
Sbjct: 816 ---EKGTTTISVISSDNQ---TATYTIATPNSLIVPNMSKRV-TYNDAVNTCKNFGGKLP 868

Query: 1124 QYGNLL 1129
N L
Sbjct: 869 SSQNEL 874



Score = 51.2 bits (122), Expect = 3e-08
Identities = 38/178 (21%), Positives = 64/178 (35%), Gaps = 12/178 (6%)

Query: 1168 TAGTYEITASA----GNDQPSNAQSVTFVADKTTAT---ISSIEVIGNRAVADGKTKQTY 1220
+ Y++TA A GN + ++T +++ ++ A ADG TY
Sbjct: 521 GSNVYKVTARAYDRNGNSSNNVLLTITVLSNGQVVDQVGVTDFTADKTSAKADGTEAITY 580

Query: 1221 KVTVTDANNNLLKDSDVTLTASSENLVLDPKGTAKTNEQGQAVFTGSTTIAATYTLTAKV 1280
TV S + +A TN G+A T + ++AK
Sbjct: 581 TATVKKNGVAQANVPVSFNIVS--GTAVLSANSANTNGSGKATVTLKSDKPGQVVVSAK- 637

Query: 1281 EQANGQVSTKTAESKFVADDKNAVLAASPERVDSLVADGKTTATMTVTLMAGVNPVGG 1338
A + FV K ++ ++ + VA+G+ T TV +M G PV
Sbjct: 638 -TAEMTSALNANAVIFVDQTKASITEIKADK-TTAVANGQDAITYTVKVMKGDKPVSN 693


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00284  HTHTETR       28     0.023    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 28.4 bits (63), Expect = 0.023
Identities = 12/42 (28%), Positives = 19/42 (45%)

Query: 3 RQKILQQLLEWIECNLEHPISIEDIAQKSGYSRRNIQLLFRN 44
RQ IL L S+ +IA+ +G +R I F++
Sbjct: 13 RQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKD 54
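
Each alignment block in this report opens with a "Score = ..., Expect = ..." line followed by an "Identities = ..." line. The following is a minimal sketch, not part of PredictBias, of how those two lines could be read into numbers. The function names (parse_evalue, parse_hsps) are hypothetical, and reading an Expect value printed without a leading digit (such as e-156) as 1e-156 is an assumption based on the usual legacy BLAST shorthand.

```python
import re

# Patterns for the two statistics lines that open each alignment block.
SCORE_RE = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)"
)
IDENT_RE = re.compile(r"Identities\s*=\s*(\d+)/(\d+)")


def parse_evalue(text):
    """Convert an Expect string to float.

    Assumption: a value printed as 'e-156' stands for 1e-156.
    """
    return float("1" + text if text.startswith("e") else text)


def parse_hsps(report):
    """Return one dict per Score/Identities pair found in the report text."""
    hsps = []
    for line in report.splitlines():
        m = SCORE_RE.search(line)
        if m:
            hsps.append({
                "bits": float(m.group(1)),
                "raw": int(m.group(2)),
                "evalue": parse_evalue(m.group(3)),
            })
            continue
        m = IDENT_RE.search(line)
        if m and hsps:
            matched, length = int(m.group(1)), int(m.group(2))
            # Fraction of aligned positions that are identical.
            hsps[-1]["identity"] = matched / length
    return hsps


# Statistics lines copied from two hits in this report.
sample = """Score = 28.4 bits (63), Expect = 0.023
Identities = 12/42 (28%), Positives = 19/42 (45%)
Score = 435 bits (1119), Expect = e-156
Identities = 139/315 (44%), Positives = 201/315 (63%), Gaps = 3/315 (0%)"""

for hsp in parse_hsps(sample):
    print(hsp)
```

Parsed this way, hits can be sorted or filtered by E-value, for example against the 0.05 RPSBlast cutoff listed under the input parameters.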


7  CMJKDNLE_00297  CMJKDNLE_00336  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
CMJKDNLE_00297  2       17       0.780423   putative DNA-binding transcriptional regulator
CMJKDNLE_00298  2       18       1.683743   putative inner membrane protein
CMJKDNLE_00299  2       18       2.177886   putative transcriptional regulator with ankyrin
CMJKDNLE_00300  1       21       3.468673   hypothetical protein
CMJKDNLE_00301  2       22       4.232937   putative acyl-CoA synthetase with NAD(P)-binding
CMJKDNLE_00302  0       16       2.027247   hypothetical protein
CMJKDNLE_00303  -1      14       0.396739   hypothetical protein
CMJKDNLE_00307  -1      12       -0.669242  putative carbamate kinase-like protein
CMJKDNLE_00308  -2      12       -0.406490  putative deaminase with metallo-dependent
CMJKDNLE_00309  -1      15       -0.893681  putative oxidoreductase, Zn-dependent and
CMJKDNLE_00310  -1      17       -0.901394  hypothetical protein
CMJKDNLE_00312  0       15       2.144608   *putative neutral amino acid efflux system
CMJKDNLE_00313  0       20       3.231250   hypothetical protein
CMJKDNLE_00314  0       22       4.652287   PrpR DNA-binding transcriptional dual regulator
CMJKDNLE_00315  1       22       4.497027   2-methylisocitrate lyase
CMJKDNLE_00316  0       20       4.021186   hypothetical protein
CMJKDNLE_00319  0       20       3.890695   2-methylcitrate synthase
CMJKDNLE_00320  0       19       4.046472   2-methylcitrate dehydratase
CMJKDNLE_00321  -1      18       3.442232   propionyl-CoA synthetase
CMJKDNLE_00323  0       14       3.099144   cytosine transporter
CMJKDNLE_00324  -1      16       1.938529   cytosine deaminase
CMJKDNLE_00325  0       15       0.187202   CynR DNA-binding transcriptional repressor
CMJKDNLE_00326  -2      11       1.592268   carbonic anhydrase monomer
CMJKDNLE_00327  -2      10       1.655561   cyanase monomer
CMJKDNLE_00328  -2      10       1.845362   cyanate transporter
CMJKDNLE_00329  -2      11       1.826279   galactoside O-acetyltransferase monomer
CMJKDNLE_00330  -2      12       2.982607   lactose / melibiose:H+ symporter LacY
CMJKDNLE_00331  -2      14       4.209827   beta-galactosidase monomer
CMJKDNLE_00332  -1      17       4.022735   LacI DNA-binding transcriptional repressor
CMJKDNLE_00333  -1      14       3.729625   MhpR transcriptional activator
CMJKDNLE_00334  0       15       4.165908   3-(3-hydroxyphenyl)propionate 2-hydroxylase
CMJKDNLE_00335  0       13       4.183732   3-(2,3-dihydroxyphenyl)propionate dioxygenase
CMJKDNLE_00336  1       12       3.093415   2-hydroxy-6-ketonona-2,4-dienedioate hydrolase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00297  HTHFIS        31     0.009    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 30.6 bits (69), Expect = 0.009
Identities = 15/50 (30%), Positives = 24/50 (48%), Gaps = 2/50 (4%)

Query: 6 TEENLLAFTTAARFGSFSKAAEELGLTTSAISYTIKRMETGLDVVLFTRS 55
E L+ A G+ KAA+ LGL + + I+ + G+ V +RS
Sbjct: 436 MEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIREL--GVSVYRSSRS 483


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00307  CARBMTKINASE  435    e-156    Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 435 bits (1119), Expect = e-156
Identities = 139/315 (44%), Positives = 201/315 (63%), Gaps = 3/315 (0%)

Query: 1 MKELVVVAIGGNSIIKDNASQSIEHQAEAVKAVADTVLEMLASDYDIVLTHGNGPQVGLD 60
M + VV+A+GGN++ + S E + V+ A + E++A Y++V+THGNGPQVG
Sbjct: 1 MGKRVVIALGGNALQQRGQKGSYEEMMDNVRKTARQIAEIIARGYEVVITHGNGPQVGSL 60

Query: 61 LRRAEIAHKREGLPLTPLANCVADTQGGIGYLIQQALNNRLARHG-EKKAVTVVTQVEVD 119
L + G+P P+ A +QG IGY+IQQAL N L + G EKK VT++TQ VD
Sbjct: 61 LLHMDAGQATYGIPAQPMDVAGAMSQGWIGYMIQQALKNELRKRGMEKKVVTIITQTIVD 120

Query: 120 KNDPGFAHPTKPIGAFFSDSQRDELQKANPDWCFVEDAGRGYRRVVASPEPKRIVEAPAI 179
KNDP F +PTKP+G F+ + L + W ED+GRG+RRVV SP+PK VEA I
Sbjct: 121 KNDPAFQNPTKPVGPFYDEETAKRLAREK-GWIVKEDSGRGWRRVVPSPDPKGHVEAETI 179

Query: 180 KALIQQGFVVIGAGGGGIPVVRTDAGDYQSVDAVIDKDLSTALLAREIHADILVITTGVE 239
K L+++G +VI +GGGG+PV+ D G+ + V+AVIDKDL+ LA E++ADI +I T V
Sbjct: 180 KKLVERGVIVIASGGGGVPVILED-GEIKGVEAVIDKDLAGEKLAEEVNADIFMILTDVN 238

Query: 240 KVCIHFGKPQQQALDRVDIATMTRYMQEGHFPPGSMLPKIIASLTFLEQGGKEVIITTPE 299
+++G ++Q L V + + +Y +EGHF GSM PK++A++ F+E GG+ II E
Sbjct: 239 GAALYYGTEKEQWLREVKVEELRKYYEEGHFKAGSMGPKVLAAIRFIEWGGERAIIAHLE 298

Query: 300 CLPAALRGETGTHII 314
AL G+TGT ++
Sbjct: 299 KAVEALEGKTGTQVL 313


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00314  HTHFIS        342    e-114    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 342 bits (878), Expect = e-114
Identities = 121/401 (30%), Positives = 200/401 (49%), Gaps = 54/401 (13%)

Query: 164 DLAEEAGMTGIFIYSAATVRQAFSDALDMTRMSLRHNTHDATRNALRTRYVLGDMLGQSP 223
A +A G + Y ++ + + +L ++ ++G+S
Sbjct: 88 MTAIKASEKGAYDYLPKPFDL--TELIGIIGRALAEPKRRPSK-LEDDSQDGMPLVGRSA 144

Query: 224 QMEQVRQTILLYARSSAAVLIEGETGTGKELAAQAIHREYFARHDARQGKKSHPFVAVNC 283
M+++ + + ++ ++I GE+GTGKEL A+A+H + R+ PFVA+N
Sbjct: 145 AMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHD-----YGKRRNG---PFVAINM 196

Query: 284 GAIAESLLEAELFGYEEGAFTGSRRGGRAGLFEIAHGGTLFLDEIGEMPLPLQTRLLRVL 343
AI L+E+ELFG+E+GAFTG++ G FE A GGTLFLDEIG+MP+ QTRLLRVL
Sbjct: 197 AAIPRDLIESELFGHEKGAFTGAQTR-STGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVL 255

Query: 344 EEKEVTRVGGHQPVPVDVRVISATHCNLEEDMQQGRFRRDLFYRLSILRLQLPPLRERVA 403
++ E T VGG P+ DVR+++AT+ +L++ + QG FR DL+YRL+++ L+LPPLR+R
Sbjct: 256 QQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRAE 315

Query: 404 DILPLAESFLKVSLAALSAPFSAALRQGLQASETVLLHYDWPGNIRELRNMMERLALFLS 463
DI L F++ ++ L+ + + WPGN+REL N++ RL
Sbjct: 316 DIPDLVRHFVQ-QAEKEGLDVKRFDQEALEL----MKAHPWPGNVRELENLVRRLTALYP 370

Query: 464 VEP-TPDLTPQFMQLLLPELARESAKTPAPRLLTP------------------------- 497
+ T ++ ++ +P+ E A + L
Sbjct: 371 QDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLYD 430

Query: 498 -----------QQALEKFNGDKTAAANYLGISRTTFWRRLK 527
AL G++ AA+ LG++R T ++++
Sbjct: 431 RVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIR 471


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00319  PHPHTRNFRASE  30     0.022    Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 29.8 bits (67), Expect = 0.022
Identities = 11/33 (33%), Positives = 19/33 (57%), Gaps = 1/33 (3%)

Query: 65 LIHGKLPTRDE-LAAYKTKLKALRGLPANVRTV 96
+ +LPT +E AYK ++ + G P +RT+
Sbjct: 303 MDRDQLPTEEEQFEAYKEVVQRMDGKPVVIRTL 335


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00329  BCTERIALGSPD  30     0.006    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 30.3 bits (68), Expect = 0.006
Identities = 24/129 (18%), Positives = 53/129 (41%), Gaps = 22/129 (17%)

Query: 80 FYANFN----LTIVDDYTVTIGDNVLIAPNVTLSVTGHPVHHELRKNGEMYSFPITIGNN 135
F A+F ++ + + V+I P+V ++T +++ + Y F +++ +
Sbjct: 30 FSASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTIT--VRSYDMLNEEQYYQFFLSV-LD 86

Query: 136 VWIGSHVVINPGVTI---------------GDNSVIGAGSIVTKDIPPNVVAAGVPCRVI 180
V+ + + +N GV D + +VT+ +P VAA ++
Sbjct: 87 VYGFAVINMNNGVLKVVRSKDAKTAAVPVASDAAPGIGDEVVTRVVPLTNVAARDLAPLL 146

Query: 181 REINDRDKH 189
R++ND
Sbjct: 147 RQLNDNAGV 155


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00330  TCRTETA       36     3e-04    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 35.6 bits (82), Expect = 3e-04
Identities = 44/192 (22%), Positives = 72/192 (37%), Gaps = 22/192 (11%)

Query: 4 LKNTNFWMFGLFFFFYFFI-MGAYFPFFPIWLHDINHISK--SDTGIIFAAISLFSLLFQ 60
+K + L + +G P P L D+ H + + GI+ A +L
Sbjct: 1 MKPNRPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACA 60

Query: 61 PLFGLLSDKLGLRKYLLWIITGMLVMFAPFFIFIFGPLLQYNILVGSIVGGIYLGFCFNA 120
P+ G LSD+ G R LL + + I P L + + +G IV GI A
Sbjct: 61 PVLGALSDRFGRRPVLL---VSLAGAAVDYAIMATAPFL-WVLYIGRIVAGIT-----GA 111

Query: 121 GAPAVEAFIEKVSRRSNFEFGRARMFG----CVGWALCAS--IVGIMFTINNQFVFWLGS 174
A+I ++ RAR FG C G+ + A + G+M + F+ +
Sbjct: 112 TGAVAGAYIADITDGDE----RARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAA 167

Query: 175 GCALILAVLLFF 186
+ + F
Sbjct: 168 ALNGLNFLTGCF 179


8  CMJKDNLE_00438  CMJKDNLE_00461  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
CMJKDNLE_00438  -1      13       3.113076   putative multidrug transporter subunit of ABC
CMJKDNLE_00439  -2      12       1.925749   Nitrogen regulatory protein P-II 2
CMJKDNLE_00440  -2      12       0.315476   ammonia / ammonium transporter
CMJKDNLE_00441  -1      14       -1.499726  thioesterase II
CMJKDNLE_00442  0       17       -2.239973  putative outer membrane lipoprotein
CMJKDNLE_00443  2       19       -3.545567  DNA base-flipping protein
CMJKDNLE_00445  2       20       -5.120551  hypothetical protein
CMJKDNLE_00446  0       13       -1.674730  hypothetical protein
CMJKDNLE_00447  2       16       -0.377170  putative inner membrane protein
CMJKDNLE_00448  2       16       -0.810754  maltose acetyltransferase
CMJKDNLE_00449  1       14       -0.337300  hemolysin expression modulating protein
CMJKDNLE_00450  2       14       -0.131233  protein that modulates Hha toxicity
CMJKDNLE_00451  2       15       0.747252   AcrB RND-type permease
CMJKDNLE_00452  2       11       0.061224   AcrA membrane fusion protein
CMJKDNLE_00453  2       13       -0.103430  AcrR DNA-binding transcriptional repressor
CMJKDNLE_00454  3       14       2.245481   potassium dependent mechanosensitive channel
CMJKDNLE_00455  4       15       4.064154   small protein involved in the cell envelope
CMJKDNLE_00456  3       16       4.606577   primosomal replication protein N''
CMJKDNLE_00457  3       22       3.155718   hypothetical protein
CMJKDNLE_00458  4       27       2.986843   adenine phosphoribosyltransferase
CMJKDNLE_00459  2       21       2.882374   DNA polymerase III, gamma subunit
CMJKDNLE_00461  2       21       1.369132   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00438  ACRIFLAVINRP  33     0.003    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 33.3 bits (76), Expect = 0.003
Identities = 21/96 (21%), Positives = 42/96 (43%), Gaps = 4/96 (4%)

Query: 92 AAVGVVQQLRTDVMDAA--LRQPLSEFDTQ-PVGQVISRVTNDTEVIRDLYVTVVATVLR 148
A +G+ + +D A ++ L+E P G + + T ++ VV T+
Sbjct: 287 AGLGIKLATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFE 346

Query: 149 SAALVGAMLVAMFSLDWRMALVAIMIFPVVLVVMVI 184
+ LV +++ +F + R L+ + PVVL+
Sbjct: 347 AIMLV-FLVMYLFLQNMRATLIPTIAVPVVLLGTFA 381


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00451  ACRIFLAVINRP  1369   0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 1369 bits (3546), Expect = 0.0
Identities = 802/1033 (77%), Positives = 915/1033 (88%), Gaps = 1/1033 (0%)

Query: 1 MPNFFIDRPIFAWVIAIIIMLAGGLAILKLPVAQYPTIAPPAVTISASYPGADAKTVQDT 60
M NFFI RPIFAWV+AII+M+AG LAIL+LPVAQYPTIAPPAV++SA+YPGADA+TVQDT
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 61 VTQVIEQNMNGIDNLMYMSSNSDSTGTVQITLTFESGTDADIAQVQVQNKLQLAMPLLPQ 120
VTQVIEQNMNGIDNLMYMSS SDS G+V ITLTF+SGTD DIAQVQVQNKLQLA PLLPQ
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 121 EVQQQGVSVEKSSSSFLMVVGVINTDGTMTQEDISDYVAANMKDAISRTSGVGDVQLFGS 180
EVQQQG+SVEKSSSS+LMV G ++ + TQ+DISDYVA+N+KD +SR +GVGDVQLFG+
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 181 QYAMRIWMNPNELNKFQLTPVDVITAIKAQNAQVAAGQLGGTPPVKGQQLNASIIAQTRL 240
QYAMRIW++ + LNK++LTPVDVI +K QN Q+AAGQLGGTP + GQQLNASIIAQTR
Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240

Query: 241 TSTEEFGKILLKVNQDGSRVLLRDVAKIELGGENYDIIAEFNGQPASGLGIKLATGANAL 300
+ EEFGK+ L+VN DGS V L+DVA++ELGGENY++IA NG+PA+GLGIKLATGANAL
Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300

Query: 301 DTAAAIRAELAKMEPFFPSGLKIVYPYDTTPFVKISIHEVVKTLVEAIILVFLVMYLFLQ 360
DTA AI+A+LA+++PFFP G+K++YPYDTTPFV++SIHEVVKTL EAI+LVFLVMYLFLQ
Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360

Query: 361 NFRATLIPTIAVPVVLLGTFAVLAAFGFSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420
N RATLIPTIAVPVVLLGTFA+LAAFG+SINTLTMFGMVLAIGLLVDDAIVVVENVERVM
Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420

Query: 421 AEEGLPPKEATRKSMGQIQGALVGIAMVLSAVFVPMAFFGGSTGAIYRQFSITIVSAMAL 480
E+ LPPKEAT KSM QIQGALVGIAMVLSAVF+PMAFFGGSTGAIYRQFSITIVSAMAL
Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480

Query: 481 SVLVALILTPALCATMLKPIAKGDHGEGKKGFFGWFNRMFEKSTHHYTDSVGGILRSTGR 540
SVLVALILTPALCAT+LKP++ H E K GFFGWFN F+ S +HYT+SVG IL STGR
Sbjct: 481 SVLVALILTPALCATLLKPVSAE-HHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539

Query: 541 YLVLYLIIVVGMAYLFVRLPSSFLPDEDQGVFMTMVQLPAGATQERTQKVLNEVTHYYLT 600
YL++Y +IV GM LF+RLPSSFLP+EDQGVF+TM+QLPAGATQERTQKVL++VT YYL
Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599

Query: 601 KEKNNVESVFAVNGFGFAGRGQNTGIAFVSLKDWADRPGEENKVEAITMRATRAFSQIKD 660
EK NVESVF VNGF F+G+ QN G+AFVSLK W +R G+EN EA+ RA +I+D
Sbjct: 600 NEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRD 659

Query: 661 AMVFAFNLPAIVELGTATGFDFELIDQAGLGHEKLTQARNQLLAEAAKHPDMLTSVRPNG 720
V FN+PAIVELGTATGFDFELIDQAGLGH+ LTQARNQLL AA+HP L SVRPNG
Sbjct: 660 GFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNG 719

Query: 721 LEDTPQFKIDIDQEKAQALGVSINDINTTLGAAWGGSYVNDFIDRGRVKKVYVMSEAKYR 780
LEDT QFK+++DQEKAQALGVS++DIN T+ A GG+YVNDFIDRGRVKK+YV ++AK+R
Sbjct: 720 LEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFR 779

Query: 781 MLPDDIGDWYVRAADGQMVPFSAFSSSRWEYGSPRLERYNGLPSMEILGQAAPGKSTGEA 840
MLP+D+ YVR+A+G+MVPFSAF++S W YGSPRLERYNGLPSMEI G+AAPG S+G+A
Sbjct: 780 MLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDA 839

Query: 841 MELMEQLASKLPTGVGYDWTGMSYQERLSGNQAPSLYAISLIVVFLCLAALYESWSIPFS 900
M LME LASKLP G+GYDWTGMSYQERLSGNQAP+L AIS +VVFLCLAALYESWSIP S
Sbjct: 840 MALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVS 899

Query: 901 VMLVVPLGVIGALLAATFRGLTNDVYFQVGLLTTIGLSAKNAILIVEFAKDLMDKEGKGL 960
VMLVVPLG++G LLAAT NDVYF VGLLTTIGLSAKNAILIVEFAKDLM+KEGKG+
Sbjct: 900 VMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGV 959

Query: 961 IEATLDAVRMRLRPILMTSLAFILGVMPLVISTGAGSGAQNAVGTGVMGGMVTATVLAIF 1020
+EATL AVRMRLRPILMTSLAFILGV+PL IS GAGSGAQNAVG GVMGGMV+AT+LAIF
Sbjct: 960 VEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIF 1019

Query: 1021 FVPVFFVVVRRRF 1033
FVPVFFVV+RR F
Sbjct: 1020 FVPVFFVVIRRCF 1032


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00452  RTXTOXIND     44     6e-07    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 44.0 bits (104), Expect = 6e-07
Identities = 33/212 (15%), Positives = 71/212 (33%), Gaps = 23/212 (10%)

Query: 100 TYQATYDSAKGDLAKAQAAANIAQLTVNRYQKLLGTQYISKQEYDQALADAQQANAAVTA 159
+ Y A +L + + Q+ Q +++ ++ L +Q +
Sbjct: 256 EQENKYVEAVNELR--VYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGL 313

Query: 160 AKAAVETARINLAYTKVTSPISGRIGKSNV-TEGALVQNGQATALATVQQLDPIYVDVTQ 218
+ + + +P+S ++ + V TEG +V + T + V + D + V
Sbjct: 314 LTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAE-TLMVIVPEDDTLEVTALV 372

Query: 219 SSNDFLRLKQELA----------NGTLKQENGKAKVSLITSDGIKFPQDGTLEFSDVTVD 268
+ D + KV I D I+ + G + ++++
Sbjct: 373 QNKDIGFINVGQNAIIKVEAFPYTRYGYLV---GKVKNINLDAIEDQRLGLVFNVIISIE 429

Query: 269 QTTGSITLRAIFPNPDHTLLPGMFVRARLEEG 300
+ S + I L GM V A ++ G
Sbjct: 430 ENCLSTGNKNIP------LSSGMAVTAEIKTG 455



Score = 34.4 bits (79), Expect = 8e-04
Identities = 24/125 (19%), Positives = 43/125 (34%), Gaps = 13/125 (10%)

Query: 49 PLQITTELPGR-TSAYRIAEVRPQVSGIILKRNFKEGSDIEAGVSLYQIDPATYQATYDS 107
++I G+ T + R E++P + I+ + KEG + G L ++ +A
Sbjct: 79 QVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEA---- 134

Query: 108 AKGDLAKAQAAANIAQLTVNRYQKLLGTQYISKQEYDQALADAQQANAAVTAAKAAVETA 167
D K Q++ A+L RYQ L E ++
Sbjct: 135 ---DTLKTQSSLLQARLEQTRYQILS-----RSIELNKLPELKLPDEPYFQNVSEEEVLR 186

Query: 168 RINLA 172
+L
Sbjct: 187 LTSLI 191


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00453  HTHTETR       201    2e-67    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 201 bits (511), Expect = 2e-67
Identities = 189/192 (98%), Positives = 189/192 (98%)

Query: 36 FLTAGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELEYQAKFPG 95
F GVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELEYQAKFPG
Sbjct: 24 FSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELEYQAKFPG 83

Query: 96 DPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRNLCLESYDRIEQ 155
DPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRNLCLESYDRIEQ
Sbjct: 84 DPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRNLCLESYDRIEQ 143

Query: 156 TLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDLKKEARDYVAILLEMYL 215
TLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDLKKEARDYVAILLEMYL
Sbjct: 144 TLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDLKKEARDYVAILLEMYL 203

Query: 216 LCPTLRNPATNE 227
LCPTLRNPATNE
Sbjct: 204 LCPTLRNPATNE 215


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00454  RTXTOXIND     32     0.017    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 31.7 bits (72), Expect = 0.017
Identities = 19/125 (15%), Positives = 40/125 (32%), Gaps = 6/125 (4%)

Query: 28 QNTAFARASSNGDLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDKIDRIKEE 87
N RA L + + L L+ + A L++ ++ E
Sbjct: 207 LNLDKKRAERLTVLARINRYENLSRVEKSR--LDDFSSLLHKQAIAKHAVLEQENKYVEA 264

Query: 88 TVQLRQKVAEAPEKMRQATAALTALSDVDND--EETRKIL--STLSLRQLETRVAQALDD 143
+LR ++ + + +A V E L +T ++ L +A+ +
Sbjct: 265 VNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEER 324

Query: 144 LQNAQ 148
Q +
Sbjct: 325 QQASV 329


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00459  IGASERPTASE   40     4e-05    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 39.7 bits (92), Expect = 4e-05
Identities = 40/251 (15%), Positives = 77/251 (30%), Gaps = 31/251 (12%)

Query: 404 PLPETTSQVLAARQQLQRVQGATKAKKSEPAA----ATRARPVNNAALERLASVTDRVQA 459
P E +Q + + + P+ AR + A + A T
Sbjct: 983 PEVEKRNQTVDTTN----ITTPNNIQADVPSVPSNNEEIARV-DEAPVPPPAPATPSETT 1037

Query: 460 RPVPSALEKAPAKKEAYRWKATTPVMQQKE--------VVATPKALKKA---LEHEKTPE 508
V ++ E AT Q +E V A + + A E ++T
Sbjct: 1038 ETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQT 1097

Query: 509 LAAKLAA---------EAIERDPWAAQVSQLSLPKLVEQVALNAWKE-ESDNAVCLHLRS 558
K A E+ +V+ PK + + E +N ++++
Sbjct: 1098 TETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKE 1157

Query: 559 SQRHLNNRGAQQKLAEALS-MLKGSTVELTIVEDDNPAVRTPLEWRQAIYEEKLAQARES 617
Q N ++ A+ S ++ E T V N V P A + + +
Sbjct: 1158 PQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSN 1217

Query: 618 IIADNNIQTLR 628
+ + +++R
Sbjct: 1218 KPKNRHRRSVR 1228


9  CMJKDNLE_00474  CMJKDNLE_00492  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
CMJKDNLE_00474  1       12       3.174741   hypothetical protein
CMJKDNLE_00475  0       13       3.476575   putative DNA-binding transcriptional regulator
CMJKDNLE_00478  0       14       3.827571   Cu+ efflux ATPase
CMJKDNLE_00479  -1      14       0.977595   glutaminase
CMJKDNLE_00480  0       19       -0.236662  YbaT APC transporter
CMJKDNLE_00481  1       19       0.137633   CueR transcriptional dual regulator
CMJKDNLE_00482  -1      17       0.071713   hypothetical protein
CMJKDNLE_00483  -1      17       -0.258535  putative protease, membrane anchored
CMJKDNLE_00484  -1      18       0.275711   putative transporter subunit: ATP-binding
CMJKDNLE_00485  0       18       3.151766   putative metal resistance protein
CMJKDNLE_00486  0       15       2.735289   chaperone and weak protein oxidoreductase
CMJKDNLE_00487  0       16       1.007561   putative oxidoreductase with NAD(P)-binding
CMJKDNLE_00488  2       19       -0.349098  multifunctional acyl-CoA thioesterase I and
CMJKDNLE_00489  3       23       -1.716288  YbbA/YbbP ABC transporter
CMJKDNLE_00490  3       22       -2.585948  YbbA/YbbP ABC transporter
CMJKDNLE_00491  4       29       -6.072104  RhsD protein in rhs element
CMJKDNLE_00492  1       22       -4.147756  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00479  BLACTAMASEA   28     0.047    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 27.8 bits (62), Expect = 0.047
Identities = 11/43 (25%), Positives = 18/43 (41%)

Query: 38 GQLAAVAIVTCDGNVYSAGDSDYRFALESISKVCTLALALEDV 80
G++ + + G +A +D RF + S KV L V
Sbjct: 38 GRVGMIEMDLASGRTLTAWRADERFPMMSTFKVVLCGAVLARV 80


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00487  DHBDHDRGNASE  78     4e-19    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 77.8 bits (191), Expect = 4e-19
Identities = 49/212 (23%), Positives = 81/212 (38%), Gaps = 7/212 (3%)

Query: 3 KSVLITGCSSGIGLESALELKRQGFHVLAGCRKPDDVERMNS----MGFT--GVLIDLDS 56
K ITG + GIG A L QG H+ A P+ +E++ S D+
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 57 PESVDRAADEVIALTDNCLYGIFNNAGFGMYGPLSTISRAQMEQQFSANFFGAHQLTMRL 116
++D + + + N AG G + ++S + E FS N G + +
Sbjct: 69 SAAIDEITARIEREMGP-IDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127

Query: 117 LPAMLPHGEGRIVMTSSVMGLISTPGRGAYAASKYALEAWSDALRMELRHSGIKVSLIEP 176
M+ G IV S + AYA+SK A ++ L +EL I+ +++ P
Sbjct: 128 SKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187

Query: 177 GPIRTRFTDNVNQTQSDKPVENPGIAARFTLG 208
G T ++ ++ G F G
Sbjct: 188 GSTETDMQWSLWADENGAEQVIKGSLETFKTG 219


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00489  PF05272       29     0.014    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.3 bits (65), Expect = 0.014
Identities = 12/20 (60%), Positives = 13/20 (65%)

Query: 41 LVGESGSGKSTLLAILAGLD 60
L G G GKSTL+ L GLD
Sbjct: 601 LEGTGGIGKSTLINTLVGLD 620


10  CMJKDNLE_00502  CMJKDNLE_00526  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
CMJKDNLE_00502  2       15       -1.391314  hydroxypyruvate isomerase
CMJKDNLE_00503  2       15       -1.407096  tartronate semialdehyde reductase 2
CMJKDNLE_00504  3       13       -1.287747  YbbW NCS1 Transporter
CMJKDNLE_00505  3       13       -0.255652  allantoinase monomer
CMJKDNLE_00506  4       17       0.647675   putative uracil/xanthine transporter
CMJKDNLE_00507  4       15       1.698264   glycerate kinase II
CMJKDNLE_00509  3       14       1.755693   S-ureidoglycine aminohydrolase
CMJKDNLE_00510  2       16       3.041637   allantoate amidohydrolase monomer
CMJKDNLE_00511  1       15       3.919706   ureidoglycolate dehydrogenase
CMJKDNLE_00512  2       15       4.764172   putative acyl-CoA synthetase with NAD(P)-binding
CMJKDNLE_00513  2       15       4.832304   hypothetical protein
CMJKDNLE_00514  1       14       3.746974   hypothetical protein
CMJKDNLE_00515  1       18       3.229697   putative carbamate kinase
CMJKDNLE_00516  3       19       2.615914   N5-carboxyaminoimidazole ribonucleotide
CMJKDNLE_00517  3       19       2.127863   N5-carboxyaminoimidazole ribonucleotide mutase
CMJKDNLE_00518  3       18       1.582860   UDP-2,3-diacylglucosamine hydrolase
CMJKDNLE_00519  2       17       0.199130   peptidyl-prolyl cis-trans isomerase B (rotamase
CMJKDNLE_00520  0       15       -0.870480  cysteinyl-tRNA synthetase
CMJKDNLE_00521  1       21       -3.290317  hypothetical protein
CMJKDNLE_00522  1       23       -4.201058  putative RNA-binding protein
CMJKDNLE_00523  2       25       -4.258543  bifunctional 5,10-methylene-tetrahydrofolate
CMJKDNLE_00524  3       29       -6.395137  putative fimbrial-like adhesin protein
CMJKDNLE_00525  3       27       -6.212179  putative pilin chaperone, periplasmic
CMJKDNLE_00526  2       24       -4.950994  putative outer membrane export usher protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00505  UREASE        55     3e-10    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 55.1 bits (133), Expect = 3e-10
Identities = 39/163 (23%), Positives = 59/163 (36%), Gaps = 32/163 (19%)

Query: 4 DLIIKNGTVILENEARVVDIAVKGGKIAAIG-------QD-----LGDAKEVMDASGLVV 51
D +I N ++ DI +K G+IAAIG Q +G EV+ G +V
Sbjct: 69 DTVITNALILDHWGIVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAGEGKIV 128

Query: 52 SPGMVDAHTHISEPGRSHWEGYETGTRAAAKGGITTMIEMPLNQLPATVDRAS------- 104
+ G +D+H H P + A G+T M+ PA A+
Sbjct: 129 TAGGMDSHIHFICPQQIE---------EALMSGLTCMLGGGTG--PAHGTLATTCTPGPW 177

Query: 105 -IELKFDAAKGKLTIDAAQLGGLVSYNIDRLHELDEVGVVGFK 146
I +AA ++ A G + L E+ G K
Sbjct: 178 HIARMIEAADA-FPMNLAFAGKGNASLPGALVEMVLGGATSLK 219


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00515  CARBMTKINASE  386    e-138    Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 386 bits (992), Expect = e-138
Identities = 126/310 (40%), Positives = 176/310 (56%), Gaps = 16/310 (5%)

Query: 2 KTLVVALGGNALLQRGEALTAENQYRNIASAVPALARL-ARSYRLAIVHGNGPQVGLLAL 60
K +V+ALGGNAL QRG+ + E N+ +A + AR Y + I HGNGPQVG L L
Sbjct: 3 KRVVIALGGNALQQRGQKGSYEEMMDNVRKTARQIAEIIARGYEVVITHGNGPQVGSLLL 62

Query: 61 QNLAWKE---VEPYPLDVLVAESQGMIGYMLAQSLSAQPQM----PPVTTVLTRIEVSPD 113
A + + P+DV A SQG IGYM+ Q+L + + V T++T+ V +
Sbjct: 63 HMDAGQATYGIPAQPMDVAGAMSQGWIGYMIQQALKNELRKRGMEKKVVTIITQTIVDKN 122

Query: 114 DPAFLQPEKFIGPVYQPEEQEALEAAYGWQMKRD-GKYLRRVVASPQPRKILDSEAIELL 172
DPAF P K +GP Y E + L GW +K D G+ RRVV SP P+ +++E I+ L
Sbjct: 123 DPAFQNPTKPVGPFYDEETAKRLAREKGWIVKEDSGRGWRRVVPSPDPKGHVEAETIKKL 182

Query: 173 LKEGHVVICSGGGGVPVTDDG---AGSEAVIDKDLAAALLAEQINADGLVILTDADAVYE 229
++ G +VI SGGGGVPV + G EAVIDKDLA LAE++NAD +ILTD +
Sbjct: 183 VERGVIVIASGGGGVPVILEDGEIKGVEAVIDKDLAGEKLAEEVNADIFMILTDVNGAAL 242

Query: 230 NWGTPQQRAIRHATPDELAPFAKAD----GSMGPKVTAVSGYVRSRGKPAWIGALSRIEE 285
+GT +++ +R +EL + + GSMGPKV A ++ G+ A I L + E
Sbjct: 243 YYGTEKEQWLREVKVEELRKYYEEGHFKAGSMGPKVLAAIRFIEWGGERAIIAHLEKAVE 302

Query: 286 TLAGEAGTCI 295
L G+ GT +
Sbjct: 303 ALEGKTGTQV 312


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00520  RTXTOXIND     29     0.029    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 29.4 bits (66), Expect = 0.029
Identities = 16/150 (10%), Positives = 44/150 (29%), Gaps = 8/150 (5%)

Query: 299 RSQLNYSEENLKQARAALERLYTALRGTDKTVAPAGGEAFEARFIEAMDDDFNTP----- 353
+ ++ +L QAR R R + P E F +++
Sbjct: 133 EADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIK 192

Query: 354 EAYSVLFDMAREVNRLKAEDMAAANAMASHLRKLSAVLGLLEQEPEAFLQSGAQADDSEV 413
E +S + + + A + + + + + + + + F +
Sbjct: 193 EQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSS---LLHKQAI 249

Query: 414 AEIEALIQQRLDARKAKDWAAADAARDRLN 443
A+ L Q+ + + +++
Sbjct: 250 AKHAVLEQENKYVEAVNELRVYKSQLEQIE 279


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00526  PF00577       818    0.0      Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 818 bits (2115), Expect = 0.0
Identities = 402/855 (47%), Positives = 570/855 (66%), Gaps = 20/855 (2%)

Query: 20 ICYSSLAILPSFLSYAESYFNPAFLLENGTSVADLSRFERGNHQPAGVYRVDLWRNDEFI 79
+ A + LS AE YFNP FL ++ +VADLSRFE G P G YRVD++ N+ ++
Sbjct: 31 FVACAFA-AQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPGTYRVDIYLNNGYM 89

Query: 80 GSQDIVFESTTENTGDKSGGLMPCFNQVLLERIGLNSSAFPELAQQQNNKCINLLKAVPD 139
++D+ F NTGD G++PC + L +GLN+++ + ++ C+ L + D
Sbjct: 90 ATRDVTF-----NTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVPLTSMIHD 144

Query: 140 ATINFDFAAMRLNITIPQIALLSSAHGYIPPEEWDEGIPALLLNYNFTGN----RGNGND 195
AT D RLN+TIPQ + + A GYIPPE WD GI A LLNYNF+GN R GN
Sbjct: 145 ATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQNRIGGNS 204

Query: 196 SYFFSEL-SGINIGPWRLRNNGSWNYFRGNG--YHSEQWNNIGTWVQRAIIPLKSELVMG 252
Y + L SG+NIG WRLR+N +W+Y + +W +I TW++R IIPL+S L +G
Sbjct: 205 HYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRSRLTLG 264

Query: 253 DGNTGSDIFDGVGFRGVRLYSSDNMYPDSQQGFAPTVRGIARTAAQLTIRQNGFIIYQSY 312
DG T DIFDG+ FRG +L S DNM PDSQ+GFAP + GIAR AQ+TI+QNG+ IY S
Sbjct: 265 DGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDIYNST 324

Query: 313 VSPGAFEITDLHPTSSNGDLDVTIDERDGNQQNYTIPYSTVPILQREGRFKFDLTAGDFR 372
V PG F I D++ ++GDL VTI E DG+ Q +T+PYS+VP+LQREG ++ +TAG++R
Sbjct: 325 VPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSITAGEYR 384

Query: 373 SGNSQQSSPFFFQGTALGGLPQEFTAYGGTQLSANYTAFLLGLGRNLGNWGAVSLDVTHA 432
SGN+QQ P FFQ T L GLP +T YGGTQL+ Y AF G+G+N+G GA+S+D+T A
Sbjct: 385 SGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGALSVDMTQA 444

Query: 433 RSQLADASRHEGDSIRFLYAKSMNTFGTNFQLMGYRYSTQGFYTLDDVAYRRMEGY-EYD 491
S L D S+H+G S+RFLY KS+N GTN QL+GYRYST G++ D Y RM GY
Sbjct: 445 NSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGYNIET 504

Query: 492 YDGEHRDEPIIVNYHNLRFSRKDRLQLNVSQSLNDFGSLYISGTHQKYWNTSDSDTWYQV 551
DG + +P +Y+NL ++++ +LQL V+Q L +LY+SG+HQ YW TS+ D +Q
Sbjct: 505 QDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQTYWGTSNVDEQFQA 564

Query: 552 GYTSSWVGISYSLSFSWNESVGIPDNERIVGLNVSVPFNVLTKRRYTRENALDRAYASFN 611
G +++ I+++LS+S ++ ++++ LNV++PF+ R ++ A AS++
Sbjct: 565 GLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWL--RSDSKSQWRHASASYS 622

Query: 612 ANRNSNGQNSWLAGVGGTLLEGHNLSYHVSQG----DTSNNGYTGSATANWQAAYGTLGG 667
+ + NG+ + LAGV GTLLE +NLSY V G N+G TG AT N++ YG
Sbjct: 623 MSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYGNANI 682

Query: 668 GYNYDRDQHDVNWQLSGGVVGHENGITLSQPLGDTNVLIKAPGAGGVRIENQTGILTDWR 727
GY++ D + + +SGGV+ H NG+TL QPL DT VL+KAPGA ++ENQTG+ TDWR
Sbjct: 683 GYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDAKVENQTGVRTDWR 742

Query: 728 GYAVMLYATVYRYNRIALDTNTMGNSIDVEKNISSVVPTQGALVRANFDTRIGVRALITV 787
GYAV+ YAT YR NR+ALDTNT+ +++D++ +++VVPT+GA+VRA F R+G++ L+T+
Sbjct: 743 GYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKARVGIKLLMTL 802

Query: 788 TQGGKPVPFGSLVRENSTGITSMVGDDGQVYLSGAPLSGELLVQWGDGANSRCIAHYVLP 847
T KP+PFG++V S+ + +V D+GQVYLSG PL+G++ V+WG+ N+ C+A+Y LP
Sbjct: 803 THNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGKVQVKWGEEENAHCVANYQLP 862

Query: 848 KQSLQQAVTVISAVC 862
+S QQ +T +SA C
Sbjct: 863 PESQQQLLTQLSAEC 877


11  CMJKDNLE_00549  CMJKDNLE_00568  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias     %GCBias  Product
CMJKDNLE_00549       0       13    3.461191  MbtH-like protein that enhances the catalytic
CMJKDNLE_00550       1       13    3.819601  fatty acyl-CoA synthetase
CMJKDNLE_00551       1       14    3.204812  ferric enterobactin (enterochelin) transport
CMJKDNLE_00552       1       14    5.332125  ferric enterobactin ABC transporter - ATP
CMJKDNLE_00553       0       15    5.445210  ferric enterobactin ABC transporter - membrane
CMJKDNLE_00554      -1       16    5.057815  ferric enterobactin ABC transporter - membrane
CMJKDNLE_00555      -1       16    4.557550  enterobactin efflux transporter EntS
CMJKDNLE_00556      -2       15    4.239163  ferric enterobactin ABC transporter -
CMJKDNLE_00557      -1       19    4.699543  isochorismate synthase 1
CMJKDNLE_00558      -1       20    4.575888  enterobactin synthase multienzyme complex
CMJKDNLE_00559       0       19    4.425621  EntB monomer
CMJKDNLE_00560       0       17    4.080854  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase
CMJKDNLE_00561       0       17    2.785959  proofreading thioesterase in enterobactin
CMJKDNLE_00562       0       14    1.296284  peptide transporter induced by carbon
CMJKDNLE_00563      -1       18   -2.854092  hypothetical protein
CMJKDNLE_00564      -1       18   -3.242350  putative oxidoreductase
CMJKDNLE_00565      -1       17   -4.437979  methionine-oxo-acid transaminase, PLP-dependent
CMJKDNLE_00566      -1       17   -4.235058  hypothetical protein
CMJKDNLE_00567      -1       18   -3.963429  hypothetical protein
CMJKDNLE_00568      -1       15   -3.390789  putative DNA-binding transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
CMJKDNLE_00555  TCRTETA  35     6e-04    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 34.8 bits (80), Expect = 6e-04
Identities = 81/393 (20%), Positives = 144/393 (36%), Gaps = 38/393 (9%)

Query: 27 FISIVSLGLLGVAVPVQIQMMTHSTWQV---GLSVTLTGGAMFVGLMVGGVLADRYERKK 83
+ V +GL+ +P ++ + HS G+ + L F V G L+DR+ R+
Sbjct: 15 ALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRP 74

Query: 84 VILLARGTCGIGFIGLCLNALL--PEPSLLAIYLLGLWDGFFASLGVTALLAATPALVGR 141
V+L + G ++ + P L +Y+ + G + G A A +
Sbjct: 75 VLL-------VSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVA-GAYIADITDG 126

Query: 142 ENLMQAGAITMLTVRLGSVISPMIGGLLLATGGVAWNYGLAAAGTFITLLPLLSLPALPP 201
+ + G V P++GGL+ GG + + AA L L LP
Sbjct: 127 DERARHFGFMSACFGFGMVAGPVLGGLM---GGFSPHAPFFAAAALNGLNFLTGCFLLPE 183

Query: 202 PPQPREHPLK----SLLAGFRFLLASPLVGGIALLGGLLTMAS----AVRVLYPALADNW 253
+ PL+ + LA FR+ +V + + ++ + A+ V++ D +
Sbjct: 184 SHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFG--EDRF 241

Query: 254 QMSAAQIGFLYAAIP-LGAAIGALTSGKLAHSARPGLLMLLSTLGS---FLAIGLFGLMP 309
A IG AA L + A+ +G +A ++L + ++ +
Sbjct: 242 HWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGW 301

Query: 310 MWILGVVCLALFGWLSAVSSLLQYTMLQTQTPEAMLGRINGLWTAQNVTGDAIGAALLGG 369
M +V LA G ML Q E G++ G A +G L
Sbjct: 302 MAFPIMVLLASGGIGMPALQ----AMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTA 357

Query: 370 LGAMMTPVASASASGFGLLIIGVLLLLVLVELR 402
+ A + + +G+ + L LL L LR
Sbjct: 358 IYA----ASITTWNGWAWIAGAALYLLCLPALR 386
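
The percentages on the `Identities` / `Positives` / `Gaps` line are residue counts divided by the alignment length. The following is a minimal Python sketch for re-deriving them; the function name `parse_hsp_stats` is hypothetical and not part of PredictBias. It assumes the report truncates rather than rounds the percentage, which is consistent with the figures above (81/393 is about 20.6% and is shown as 20%).

```python
import re

# Matches e.g. "Identities = 81/393 (20%)" in a BLAST-style statistics line.
_STAT = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_hsp_stats(line):
    """Return counts, alignment length and percentages for one statistics line.

    Assumption: the reported percentage is floor(count * 100 / length).
    """
    stats = {}
    for name, count, length, shown in _STAT.findall(line):
        count, length = int(count), int(length)
        stats[name] = {
            "count": count,
            "length": length,
            "percent": count * 100 // length,  # truncated, not rounded
            "reported": int(shown),            # value printed in the report
        }
    return stats


line = "Identities = 81/393 (20%), Positives = 144/393 (36%), Gaps = 38/393 (9%)"
stats = parse_hsp_stats(line)
```

Comparing `percent` with `reported` for each field is a quick way to check that a statistics line was copied or parsed without damage.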


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00556  FERRIBNDNGPP  63     2e-13    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 62.7 bits (152), Expect = 2e-13
Identities = 60/280 (21%), Positives = 100/280 (35%), Gaps = 35/280 (12%)

Query: 40 HTLESQPQRIVSTSVTLTGSLLAIDAPVIASGATTPNNRVADDQGFLRQWSKVAKERKLQ 99
H P RIV+ LLA+ VAD + R W E L
Sbjct: 29 HAAAIDPNRIVALEWLPVELLLALGIVPYG---------VADTINY-RLW---VSEPPLP 75

Query: 100 RLYIG-----EPSAEAVAAQMPDLILISATGGDSALALYDQLSTIAPTLIINYDDKS--- 151
I EP+ E + P ++ SA G S + L+ IAP N+ D
Sbjct: 76 DSVIDVGLRTEPNLELLTEMKPSFMVWSAGYGPS----PEMLARIAPGRGFNFSDGKQPL 131

Query: 152 --WQSLLTQLGEITGHEKQAAERIAQFDKQLAAAKEQIKLPPQPVTAIVYTAAAHSANLW 209
+ LT++ ++ + A +AQ++ + + K + + ++
Sbjct: 132 AMARKSLTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDPRHMLVF 191

Query: 210 TPESAQGQMLEQLGFTLAKLPAGLNASQSQGKRHDIIQLGGENLAAGLNGESLFLFAGDQ 269
P S ++L++ G NA Q + + + LAA + + L +
Sbjct: 192 GPNSLFQEILDEYGIP--------NAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDHDNS 243

Query: 270 KDADAIYANPLLAHLPAVQNKQVYALGTETFRLDYYSAMQ 309
KD DA+ A PL +P V+ + + F SAM
Sbjct: 244 KDMDALMATPLWQAMPFVRAGRFQRVPAVWFYGATLSAMH 283


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00559  ISCHRISMTASE  440    e-159    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 440 bits (1134), Expect = e-159
Identities = 145/299 (48%), Positives = 194/299 (64%), Gaps = 18/299 (6%)

Query: 1 MAIPKLQAYALPESHDIPQNKVDWAFEPQRAALLIHDMQDYFVSFWGENCPMMEQVIANI 60
MAIP +Q Y +P + D+PQNKV W +P RA LLIHDMQ+YFV + + ++ ANI
Sbjct: 1 MAIPAIQPYQMPTASDMPQNKVSWVPDPNRAVLLIHDMQNYFVDAFTAGASPVTELSANI 60

Query: 61 AALRDYCKQHNIPVYYTAQPKEQSDEDRALLNDMWGPGLTRSPEQQKVVDRLTPDADDTV 120
L++ C Q IPV YTAQP Q+ +DRALL D WGPGL P ++K++ L P+ DD V
Sbjct: 61 RKLKNQCVQLGIPVVYTAQPGSQNPDDRALLTDFWGPGLNSGPYEEKIITELAPEDDDLV 120

Query: 121 LVKWRYSAFHRSPLEQMLKESGRNQLIITGVYAHIGCMTTATDAFMRDIKPFMVADALAD 180
L KWRYSAF R+ L +M+++ GR+QLIITG+YAHIGC+ TA +AFM DIK F V DA+AD
Sbjct: 121 LTKWRYSAFKRTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVAD 180

Query: 181 LSRDEHLMSLKYVAGRSGRVVMTEELL------PAPIPASKA-----------ALREVIL 223
S ++H M+L+Y AGR VMT+ LL PA + + A +R+ I
Sbjct: 181 FSLEKHQMALEYAAGRCAFTVMTDSLLDQLQNAPADVQKTSANTGKKNVFTCENIRKQIA 240

Query: 224 PLLDESDEPFDDD-NLIDYGLDSVRMMALAARWRKVHGDIDFVMLAKNPTIDAWWKLLS 281
LL E+ E D +L+D GLDSVR+M L +WR+ ++ FV LA+ PTI+ W KLL+
Sbjct: 241 ELLQETPEDITDQEDLLDRGLDSVRIMTLVEQWRREGAEVTFVELAERPTIEEWQKLLT 299
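
BLAST-style reports abbreviate very small E-values by dropping the leading 1, as in `Expect = e-159` above, so the string cannot be passed to a float parser directly. The sketch below, in Python, normalises such strings and compares them with the 0.05 RPSBlast E-value listed under the input parameters. The helper names `parse_evalue` and `passes_cutoff` are hypothetical, and treating the cutoff as inclusive is an assumption, not documented PredictBias behaviour.

```python
def parse_evalue(text):
    """Convert a BLAST-style E-value string to float.

    Handles the shorthand "e-159", which stands for 1e-159.
    """
    text = text.strip()
    if text.lower().startswith("e"):
        text = "1" + text  # restore the implicit leading 1
    return float(text)


def passes_cutoff(text, cutoff=0.05):
    """True if the E-value is at or below the cutoff (inclusive by assumption)."""
    return parse_evalue(text) <= cutoff


# E-values as they appear in the hit summaries of this report.
examples = ["e-159", "6e-04", "0.047", "0.0"]
results = {e: passes_cutoff(e) for e in examples}
```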


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00560  DHBDHDRGNASE  364    e-131    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 364 bits (935), Expect = e-131
Identities = 110/258 (42%), Positives = 149/258 (57%), Gaps = 20/258 (7%)

Query: 5 GKNVWVTGAGKGIGYATALAFVEAGAKVTGFD---------------QAFTQEQYPFATE 49
GK ++TGA +GIG A A GA + D +A E +P
Sbjct: 8 GKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFP---- 63

Query: 50 VMDVADAAQVAQVCQRLLAETERLDALVNAAGILRMGATDQLSKEDWQQTFAVNVGGAFN 109
DV D+A + ++ R+ E +D LVN AG+LR G LS E+W+ TF+VN G FN
Sbjct: 64 -ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFN 122

Query: 110 LFQQTMNQFRRQRGGAIVTVASDAAHTPRIGMSAYGASKAALKSLALSVGLELAGSGVRC 169
+ +R G+IVTV S+ A PR M+AY +SKAA +GLELA +RC
Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 170 NVVSPGSTDTDMQRTLWVSDDAEEQRIRGFGEQFKLGIPLGKIARPQEIANTILFLASDL 229
N+VSPGST+TDMQ +LW ++ EQ I+G E FK GIPL K+A+P +IA+ +LFL S
Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242

Query: 230 ASHITLQDIVVDGGSTLG 247
A HIT+ ++ VDGG+TLG
Sbjct: 243 AGHITMHNLCVDGGATLG 260


12  CMJKDNLE_00653  CMJKDNLE_00713  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias     %GCBias  Product
CMJKDNLE_00653       1       19   -3.220687  putative pectinase
CMJKDNLE_00654       1       18   -3.138173  hypothetical protein
CMJKDNLE_00655       0       14   -1.164438  putrescine:H+ symporter / putrescine:ornithine
CMJKDNLE_00656      -2       13    2.197815  ornithine decarboxylase, degradative
CMJKDNLE_00657       0       15    2.470879  ornithine decarboxylase, degradative
CMJKDNLE_00658       0       15    3.529879  ornithine decarboxylase, degradative
CMJKDNLE_00659      -2       15    4.068654  hypothetical protein
CMJKDNLE_00660      -1       15    4.317194  putative DNA-binding response regulator in
CMJKDNLE_00661      -1       13    3.458634  putative sensory kinase in two-component
CMJKDNLE_00662       0       10    1.056603  K+ transporting ATPase - KdpC subunit
CMJKDNLE_00663       1       12    0.660121  K+ transporting ATPase - ATP binding subunit
CMJKDNLE_00664       3       21   -2.902672  K+ transporting ATPase - K+ channel forming
CMJKDNLE_00665       6       34   -7.467703  hypothetical protein
CMJKDNLE_00666       6       35   -6.817888  RhsC protein in rhs element
CMJKDNLE_00667       5       38   -8.601837  putative inner membrane protein
CMJKDNLE_00668       4       37   -8.542126  putative Rhs-family protein
CMJKDNLE_00669       4       40  -11.575781  hypothetical protein
CMJKDNLE_00670       6       37  -11.142312  putative transposase; receptor protein
CMJKDNLE_00671       3       34  -10.024780  RhsD protein in rhs element
CMJKDNLE_00672      -1       27   -7.415660  hypothetical protein
CMJKDNLE_00673      -1       17   -4.512069  hypothetical protein
CMJKDNLE_00674      -1       16   -2.883192  hypothetical protein
CMJKDNLE_00675      -2       13   -0.351329  putative DNA ligase
CMJKDNLE_00676      -1       13    1.856347  hypothetical protein
CMJKDNLE_00677      -1       13    2.627216  deoxyribodipyrimidine photolyase
CMJKDNLE_00678      -1       14    2.589070  dipeptide:H+ symporter DtpD
CMJKDNLE_00679       0       17    3.568701  hypothetical protein
CMJKDNLE_00680      -1       14    1.704386  putative carboxylase
CMJKDNLE_00681       0       15    0.390609  putative carboxylase
CMJKDNLE_00682      -2       15   -0.892038  putative lactam utilization protein
CMJKDNLE_00683      -1       16   -1.994916  endonuclease VIII
CMJKDNLE_00684      -1       14   -2.129325  putative regulator
CMJKDNLE_00685       0       15   -3.477168  putative fimbrial-like adhesin protein
CMJKDNLE_00686       0       17   -2.751346  putative fimbrial chaperone
CMJKDNLE_00687       1       17   -2.435403  putative outer membrane usher protein
CMJKDNLE_00688       1       23    0.019088  putative fimbrial-like adhesin protein
CMJKDNLE_00689       1       23    0.604595  citrate synthase monomer
CMJKDNLE_00690       2       24    2.423301  hypothetical protein
CMJKDNLE_00691       3       26    2.986571  succinate dehydrogenase membrane protein
CMJKDNLE_00692       2       27    3.194475  succinate dehydrogenase membrane protein
CMJKDNLE_00693       2       29    3.210204  succinate dehydrogenase flavoprotein
CMJKDNLE_00694       2       26    2.242848  succinate dehydrogenase iron-sulfur protein
CMJKDNLE_00696       1       21    2.612402  subunit of E1(0) component of 2-oxoglutarate
CMJKDNLE_00697      -1       11    0.637976  Dihydrolipoyllysine-residue succinyltransferase
CMJKDNLE_00700      -2       10    0.238754  succinyl-CoA synthetase, beta subunit
CMJKDNLE_00701      -2       10    0.171892  succinyl-CoA synthetase, alpha subunit
CMJKDNLE_00702      -2        8   -0.004033  MngR DNA-binding transcriptional repressor
CMJKDNLE_00703      -2       10    0.265735  2-O-alpha-mannosyl-D-glycerate PTS permease
CMJKDNLE_00704       1       14   -0.742556  alpha-mannosidase
CMJKDNLE_00705       2       22    0.282396  cytochrome bd-I terminal oxidase subunit I
CMJKDNLE_00706       1       18   -0.139789  cytochrome bd-I terminal oxidase subunit II
CMJKDNLE_00707       7       19   -0.646917  small membrane protein
CMJKDNLE_00708       4       20   -0.085663  putative lipoprotein
CMJKDNLE_00709       4       23   -0.002425  esterase/thioesterase
CMJKDNLE_00710       4       22   -0.200435  The Colicin A Import System
CMJKDNLE_00711       4       21   -0.359584  The Colicin A Import System
CMJKDNLE_00712       3       20   -0.803401  The Colicin A Import System
CMJKDNLE_00713       2       16   -0.754609  The Colicin A Import System
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
CMJKDNLE_00660  HTHFIS  92     7e-24    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 92.2 bits (229), Expect = 7e-24
Identities = 35/125 (28%), Positives = 58/125 (46%), Gaps = 1/125 (0%)

Query: 2 TNVLIVEDEQAIRRFLRTALEGDGMRVFEAETLQRGLLEAATRKPDLIILDLGLPDGDGI 61
+L+ +D+ AIR L AL G V A DL++ D+ +PD +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 EFIRDLRQWSA-VPVIVLSARSEESDKIAALDAGADDYLSKPFGIGELQARLRVALRRHS 120
+ + +++ +PV+V+SA++ I A + GA DYL KPF + EL + AL
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 121 ATTAP 125
+
Sbjct: 124 RRPSK 128


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
CMJKDNLE_00661  PF06580  32     0.012    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 31.8 bits (72), Expect = 0.012
Identities = 10/48 (20%), Positives = 21/48 (43%), Gaps = 4/48 (8%)

Query: 785 LLENAVKYAGAQAE----IGIDAHVEGENLQLDVWDNGPGLPPGQEQT 828
L+EN +K+ AQ I + + + L+V + G +++
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKES 310


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
CMJKDNLE_00687  PF00577  602    0.0      Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 602 bits (1553), Expect = 0.0
Identities = 235/861 (27%), Positives = 379/861 (44%), Gaps = 63/861 (7%)

Query: 5 RLSFVSCLVMAMPCAMA-VEFNLNVLDKSMRDRIDISLLKEKGVIAPGEYFVSVAVNNNK 63
RL P + A + FN L + D+S + + PG Y V + +NN
Sbjct: 29 RLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPGTYRVDIYLNNGY 88

Query: 64 ISNGQ-KINWQKKGDKTIPCINDSLVDKFGLKPDIRQSLPQI--DRCIDFSSR-PEMLFN 119
++ N +PC+ + + GL + + D C+ +S +
Sbjct: 89 MATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVPLTSMIHDATAQ 148

Query: 120 FDQANQQLNISIPQAWLAWHSENWAPPSTWKEGVAGVLMDYNLFASSYRPQDGSSSTNLN 179
D Q+LN++IPQA+++ + + PP W G+ L++YN +S + + G +S
Sbjct: 149 LDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQNRIGGNSHYAY 208

Query: 180 AYGTAGINAGAWRLRSDYQLNKTDSEDNHDQSGGI--SRTYLFRPLPQLGSKLTLGETDF 237
+G+N GAWRLR + + S+ + T+L R + L S+LTLG+
Sbjct: 209 LNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRSRLTLGDGYT 268

Query: 238 SSNIFDGFSYTGAALASDDRMLPWELRGYAPQISGIAQTNATVTISQSGRVIYQKKVPPG 297
+IFDG ++ GA LASDD MLP RG+AP I GIA+ A VTI Q+G IY VPPG
Sbjct: 269 QGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDIYNSTVPPG 328

Query: 298 PFIIDDLNQ-SVQGTLDVKVTEEDGRVNNFQVSAASTPFLTRQGQVRYKLAAGQPRPSMS 356
PF I+D+ G L V + E DG F V +S P L R+G RY + AG+ R +
Sbjct: 329 PFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSITAGEYRSG-N 387

Query: 357 HQTENETFFSNEVSWGMLSNTSLYGGLLISDDDYHSAAMGIGQNMLWLGALSFDVTWASS 416
Q E FF + + G+ + ++YGG ++ D Y + GIG+NM LGALS D+T A+S
Sbjct: 388 AQQEKPRFFQSTLLHGLPAGWTIYGGTQLA-DRYRAFNFGIGKNMGALGALSVDMTQANS 446

Query: 417 HFDTQQDERGLSYRFNYSKQVDATNSTISLAAYRFSDRHFHSYANYLDHKYND------- 469
G S RF Y+K ++ + + I L YR+S + ++A+ + N
Sbjct: 447 TLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGYNIETQD 506

Query: 470 -------------SDAQDEKQTISLSVGQPITPLNLNLYANLLHQTWWNADASTTANITA 516
+ A +++ + L+V Q + LY + HQT+W A
Sbjct: 507 GVIQVKPKFTDYYNLAYNKRGKLQLTVTQQL-GRTSTLYLSGSHQTYWGTSNVDE-QFQA 564

Query: 517 GFNVDIGDWRDISISTSFNTTHYE-DKDRDNQIYLSISLPFGNGGR-----------VGY 564
G N + DI+ + S++ T K RD + L++++PF + R Y
Sbjct: 565 GLNT---AFEDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSDSKSQWRHASASY 621

Query: 565 DMQNSSHS-TIHRMSWNDTLDERN--SWGMSAGL-QSDRPDNGAQVSGNYQHLSSAGEWD 620
M + + + TL E N S+ + G ++G+ + G +
Sbjct: 622 SMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYGNAN 681

Query: 621 ISGTYAASDYSSVSSSWSGSFTATQYGAAFHRRSSTNEPRLMVSTDGVADIPVQGNLDY- 679
I ++ + D + SG A G + N+ ++V G D V+
Sbjct: 682 IGYSH-SDDIKQLYYGVSGGVLAHANGVTLGQP--LNDTVVLVKAPGAKDAKVENQTGVR 738

Query: 680 TNHFGIAVVPLISSYQPSTVAVNMNDLPDGVTVAENVIKETWIEGAIGYKSLASRSGKDV 739
T+ G AV+P + Y+ + VA++ N L D V + V GAI +R G +
Sbjct: 739 TDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKARVGIKL 798

Query: 740 NVIIRNASGQFPPLGADIRQDDSGISVGMVGEEGHAWLSGVAENQLFTVVWGE---QSCI 796
+ + + + P GA + +S S G+V + G +LSG+ V WGE C+
Sbjct: 799 LMTLT-HNNKPLPFGAMV-TSESSQSSGIVADNGQVYLSGMPLAGKVQVKWGEEENAHCV 856

Query: 797 IH--LPERLEDTT-KRLILPC 814
+ LP + +L C
Sbjct: 857 ANYQLPPESQQQLLTQLSAEC 877


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00688  FIMBRIALPAPE  35     9e-05    Escherichia coli: P pili tip fibrillum papE protein...
		>FIMBRIALPAPE#Escherichia coli: P pili tip fibrillum papE protein signature.

Length = 173

Score = 34.6 bits (79), Expect = 9e-05
Identities = 39/179 (21%), Positives = 78/179 (43%), Gaps = 26/179 (14%)

Query: 14 SLLFTAPVYAADEGSGEIHFKGEVIEAPCEIHPEDID-KNIDLGQVTTTHINREHHSNKV 72
++L + V+AAD + FKG++I C + +++ +I++ + + N++ +
Sbjct: 15 AVLMSQHVHAADN----LTFKGKLIIPACTVQNAEVNWGDIEIQNLVQSGGNQKDFT--- 67

Query: 73 AVDIRLINCDLPASDNGSGMPVSKVGVTFDSTAKTTGATPLLSNTSAGEATGVGVRLMDK 132
++ + P S + + VT S TG + L+ NTS G+ + L +
Sbjct: 68 ------VDMNCPYS-------LGTMKVTITSNG-QTGNSILVPNTSTASGDGLLIYLYNS 113

Query: 133 NDGNI----VLGSAAPDLDLDASSSEQTLNFFAWMEQIDNAVDVTAGEVTANATYVLDY 187
N+ I LGS + ++ + + +A + N + AG +A AT V Y
Sbjct: 114 NNSGIGNAVTLGSQVTPGKITGTAPARKITLYAKLGYKGNMQSLQAGTFSATATLVASY 172


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits       Score  E-value  Comments
CMJKDNLE_00694  TCRTETOQM  31     0.006    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 30.6 bits (69), Expect = 0.006
Identities = 11/41 (26%), Positives = 23/41 (56%), Gaps = 1/41 (2%)

Query: 14 VDDAPRMQDYTLEADEGRDM-MLLDALIQLKEKDPSLSFRR 53
+++ + T+E + + MLLDAL+++ + DP L +
Sbjct: 339 IENPLPLLQTTVEPSKPQQREMLLDALLEISDSDPLLRYYV 379


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits       Score  E-value  Comments
CMJKDNLE_00697  RTXTOXIND  30     0.020    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 29.8 bits (67), Expect = 0.020
Identities = 27/196 (13%), Positives = 56/196 (28%), Gaps = 12/196 (6%)

Query: 48 EVPASADGILDAVLEDEGTTVTSRQILGRLREGNSAGKETSAKSE-EKASTPAQRQQASL 106
E+ + I+ ++ EG +V +L +L + +S +A R Q
Sbjct: 98 EIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILS 157

Query: 107 EEQNNDAL----SPAIRRLLAEHNLDASAIKGTGVGGRLTRED----VEKHLAKAPAKES 158
+ L P + + T ++ E +L K A+
Sbjct: 158 RSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERL 217

Query: 159 APAAAAPAAQPALAARSEKRVPMTRLRKRVA---ERLLEAKNSTAMLTTFNEVNMKPIMD 215
A + + + L + A +LE +N V +
Sbjct: 218 TVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQ 277

Query: 216 LRKQYGEAFEKRHGIR 231
+ + A E+ +
Sbjct: 278 IESEILSAKEEYQLVT 293


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits         Score  E-value  Comments
CMJKDNLE_00712  IGASERPTASE  60     9e-12    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 60.1 bits (145), Expect = 9e-12
Identities = 34/199 (17%), Positives = 69/199 (34%), Gaps = 8/199 (4%)

Query: 99 EQERLKQLEKERLAAQEQKKQAEEAAKQAELKQKQAEEAAAKAAADAKAKAEADAKAAEE 158
E E+ Q QA+ + E A A ++ E
Sbjct: 984 EVEKRNQTVDTTNITTPNNIQADVPSVP----SNNEEIARVDEAPVPPPAPATPSETTET 1039

Query: 159 AAK--KAAADAKKKAEAEAAKAAAEAQKKAEAAAAALKKKAEAAEAA--AAEARKKAATE 214
A+ K + +K E +A + A+ ++ A+ A + +K + E A +E ++ TE
Sbjct: 1040 VAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTE 1099

Query: 215 AAEKAKAEAEKKAAAEKAAADKKAAAEKAAADKKAAEKAAAEKAAADKKAAAEKAAADKK 274
E A E E+KA E + + K+ + +A ++ + +
Sbjct: 1100 TKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQ 1159

Query: 275 AAAAKAAAEKAAAAKAAAE 293
+ A + A + ++
Sbjct: 1160 SQTNTTADTEQPAKETSSN 1178



Score = 57.0 bits (137), Expect = 9e-11
Identities = 30/236 (12%), Positives = 85/236 (36%), Gaps = 11/236 (4%)

Query: 68 QSQESSAKRSDEQRKMKEQQAAEELREKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQA 127
Q+ S ++E+ ++ +E ++ + +K E+ A +
Sbjct: 1004 QADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKN--EQDATET 1061

Query: 128 ELKQKQ-AEEAAAKAAAD------AKAKAEADAKAAEEAAKKAAADAKKKAEAEAAKAAA 180
+ ++ A+EA + A+ A++ +E E + A + ++KA+ E K
Sbjct: 1062 TAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQE 1121

Query: 181 EAQKKAEAAAAALKKKAEAAEAAAAEARKKAATEAAEKAKAEAEKKAAAEKAAADKKAAA 240
+ ++ + ++++E + A AR+ T ++ +++ A E+ A + +
Sbjct: 1122 VPKVTSQVSPK--QEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNV 1179

Query: 241 EKAAADKKAAEKAAAEKAAADKKAAAEKAAADKKAAAAKAAAEKAAAAKAAAEADD 296
E+ + + + A ++ K + ++ +
Sbjct: 1180 EQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVE 1235



Score = 56.2 bits (135), Expect = 2e-10
Identities = 28/228 (12%), Positives = 75/228 (32%), Gaps = 2/228 (0%)

Query: 66 RMQSQESSAKRSDEQRKMKEQQAAEELREKQAAEQERLKQLEKERLAAQEQKKQAEEAAK 125
R ++E+ + + + Q+ E +E Q E + +EKE A E +K E
Sbjct: 1066 REVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKV 1125

Query: 126 QAELKQKQAEEAAAKAAADAKAKAEADAKAAEEAAK--KAAADAKKKAEAEAAKAAAEAQ 183
+++ KQ + + A+ + + E ++ A + E + +
Sbjct: 1126 TSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTE 1185

Query: 184 KKAEAAAAALKKKAEAAEAAAAEARKKAATEAAEKAKAEAEKKAAAEKAAADKKAAAEKA 243
++ + E A + + + K + ++ ++ +++
Sbjct: 1186 STTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRS 1245

Query: 244 AADKKAAEKAAAEKAAADKKAAAEKAAADKKAAAAKAAAEKAAAAKAA 291
+D +A A+ A + A ++ ++ +
Sbjct: 1246 TVALCDLTSTNTNAVLSDARAKAQFVALNVGKAVSQHISQLEMNNEGQ 1293



Score = 55.5 bits (133), Expect = 2e-10
Identities = 32/265 (12%), Positives = 86/265 (32%), Gaps = 14/265 (5%)

Query: 51 DAVMVDSGAVVEQYKRMQSQESSAKRSDEQRKMKEQQAAE-ELREKQAAEQER------L 103
D V A + ++ ++K+ + + EQ A E + ++ A++ +
Sbjct: 1021 DEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANT 1080

Query: 104 KQLEKERLAAQEQKKQAEEAAKQAELKQKQAEEAAAKAAADAKAKAEADAKAAEEAAKKA 163
+ E + ++ ++ Q K+ +K+ KA + + E ++ + K+
Sbjct: 1081 QTNEVAQSGSETKETQ-TTETKETATVEKE-----EKAKVETEKTQEVPKVTSQVSPKQE 1134

Query: 164 AADA-KKKAEAEAAKAAAEAQKKAEAAAAALKKKAEAAEAAAAEARKKAATEAAEKAKAE 222
++ + +AE K+ ++ + A+ ++ +
Sbjct: 1135 QSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNS 1194

Query: 223 AEKKAAAEKAAADKKAAAEKAAADKKAAEKAAAEKAAADKKAAAEKAAADKKAAAAKAAA 282
+ A + +++ K + + + + A + A +
Sbjct: 1195 VVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTS 1254

Query: 283 EKAAAAKAAAEADDIFGELSSGKNA 307
A + A A F L+ GK
Sbjct: 1255 TNTNAVLSDARAKAQFVALNVGKAV 1279


13  CMJKDNLE_00803  CMJKDNLE_00814  Y  N  N  Genomic Island
LocusTag        DNBias  CDNBias     %GCBias  Product
CMJKDNLE_00803      -2       13    3.369949  putative pyruvate formate lyase
CMJKDNLE_00804      -1       11    3.121515  putative pyruvate formate lyase activating
CMJKDNLE_00805      -1       12    2.613899  Fsa
CMJKDNLE_00806      -1       12    2.522230  molybdopterin-synthase adenylyltransferase
CMJKDNLE_00807       0       14    2.278022  molybdenum::molybdopterin ligase
CMJKDNLE_00808       0       15    0.565132  Isoaspartyl peptidase
CMJKDNLE_00809       1       16   -2.299744  glutathione ABC transporter - ATP binding
CMJKDNLE_00810       1       16   -4.614467  glutathione ABC transporter - periplasmic
CMJKDNLE_00811       0       12   -4.676553  glutathione ABC transporter - membrane subunit
CMJKDNLE_00812       0       11   -5.505056  glutathione ABC transporter - membrane subunit
CMJKDNLE_00813       1       10   -5.530669  putative c-di-GMP-specific phosphodiesterase
CMJKDNLE_00814       0       10   -4.352104  putative c-di-GMP-specific phosphodiesterase
14  CMJKDNLE_00881  CMJKDNLE_00887  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias     %GCBias  Product
CMJKDNLE_00881       1       17   -3.231479  dimethyl sulfoxide reductase, chain C
CMJKDNLE_00882       1       16   -3.949592  putative hydrolase monomer
CMJKDNLE_00883       2       19   -2.756742  YcaD MFS transporter
CMJKDNLE_00884       3       23   -3.269846  YcaM predicted APC amino acid transporter
CMJKDNLE_00885       2       25   -2.269140  putative transcriptional regulator LYSR-type
CMJKDNLE_00886       3       26   -2.092600  hypothetical protein
CMJKDNLE_00887       2       27   -1.474437  pyruvate formate-lyase activating enzyme
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00882  ISCHRISMTASE  40     3e-06    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 40.0 bits (93), Expect = 3e-06
Identities = 30/159 (18%), Positives = 53/159 (33%), Gaps = 20/159 (12%)

Query: 7 RLDKNDAAVLLVDHQAGLLSLVRDIEP--DKFKNNVLALGDLAKYFNLPTILTT---SFE 61
D N A +L+ D Q + + N+ L + +P + T S
Sbjct: 25 VPDPNRAVLLIHDMQNYFVDAFTAGASPVTELSANIRKLKNQCVQLGIPVVYTAQPGSQN 84

Query: 62 TGPNGPLV----PELKAQFPDTPYIAR----PGNI-------NAWDNEDFVKAVKATGKK 106
L P L + + I ++ +A+ + ++ ++ G+
Sbjct: 85 PDDRALLTDFWGPGLNSGPYEEKIITELAPEDDDLVLTKWRYSAFKRTNLLEMMRKEGRD 144

Query: 107 QLIIAGVVTEVCVAFPALSAIEEGFDVFVVTDASGTFNE 145
QLII G+ + A A E F V DA F+
Sbjct: 145 QLIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVADFSL 183


15  CMJKDNLE_00921  CMJKDNLE_00926  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias     %GCBias  Product
CMJKDNLE_00921       2       21   -3.493542  aliphatic sulfonate ABC transporter -
CMJKDNLE_00922       1       23   -4.241005  NAD(P)H-dependent FMN reductase monomer
CMJKDNLE_00923       2       25   -4.893379  fimbrial-like adhesin protein
CMJKDNLE_00924       1       24   -3.913443  putative periplasmic pilin chaperone
CMJKDNLE_00925       0       21   -3.748567  putative outer membrane usher protein
CMJKDNLE_00926      -1       23   -3.427299  putative fimbrial-like adhesin protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00923  FIMBRIALPAPE  28     0.012    Escherichia coli: P pili tip fibrillum papE protein...
		>FIMBRIALPAPE#Escherichia coli: P pili tip fibrillum papE protein signature.

Length = 173

Score = 28.5 bits (63), Expect = 0.012
Identities = 26/92 (28%), Positives = 37/92 (40%), Gaps = 14/92 (15%)

Query: 6 LTAFITVVCATSSVMAADDNAITDGSVTFNGKVIAPACTLVAATKDSVVTLPDVSATKLQ 65
L + V + V AAD+ +TF GK+I PACT+ A V D+ L
Sbjct: 9 LPVMLGAVLMSQHVHAADN-------LTFKGKLIIPACTVQNAE----VNWGDIEIQNLV 57

Query: 66 TNGQVS---GVQIDVPIELKDCDTTVTKNATF 94
+G V ++ P L T+T N
Sbjct: 58 QSGGNQKDFTVDMNCPYSLGTMKVTITSNGQT 89


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
CMJKDNLE_00925  PF00577  827    0.0      Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 827 bits (2139), Expect = 0.0
Identities = 414/862 (48%), Positives = 569/862 (66%), Gaps = 18/862 (2%)

Query: 15 GVPSFIGGLVVFVSAAFNAQAETWFDPAFFKDDPSMVADLSRFEKGQKITPGVYRVDIVL 74
G + F + A + AE +F+P F DDP VADLSRFE GQ++ PG YRVDI L
Sbjct: 25 GFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPGTYRVDIYL 84

Query: 75 NQTIVDTRNVNFVEITPEKGIAACLTTESLDAMGVNTDAFPAFKQLDKQACVPLAEIIPD 134
N + TR+V F E+GI CLT L +MG+NT + L ACVPL +I D
Sbjct: 85 NNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVPLTSMIHD 144

Query: 135 ASVTFNVNKLRLEISVPQIAIKSNARGYVPPERWDEGINALLLGYSFSGANSIHSSADSD 194
A+ +V + RL +++PQ + + ARGY+PPE WD GINA LL Y+FSG + + +
Sbjct: 145 ATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQNRIGGNS 204

Query: 195 SGDSYFLNLNSGVNLGPWRLRNNSTWSR-----SSGQTAEWKNLSSYLQRAVIPLKGELT 249
+LNL SG+N+G WRLR+N+TWS SSG +W++++++L+R +IPL+ LT
Sbjct: 205 --HYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRSRLT 262

Query: 250 VGDDYTAGDFFDSVSFRGVQLASDDNMLPDSLKGFAPVVRGIAKSNAQITIKQNGYTIYQ 309
+GD YT GD FD ++FRG QLASDDNMLPDS +GFAPV+ GIA+ AQ+TIKQNGY IY
Sbjct: 263 LGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDIYN 322

Query: 310 TYVSPGAFEISDLYSTSSSGDLLVEIKEADGSVNSYSVPFSSVPLLQRQGRIKYAVTLAK 369
+ V PG F I+D+Y+ +SGDL V IKEADGS ++VP+SSVPLLQR+G +Y++T +
Sbjct: 323 STVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSITAGE 382

Query: 370 YRTNSNEQQESKFAQATLQWGGPWGTTWYGGGQYAEYYRAAMFGLGFNLGDFGAISFDAT 429
YR+ + +Q++ +F Q+TL G P G T YGG Q A+ YRA FG+G N+G GA+S D T
Sbjct: 383 YRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGALSVDMT 442

Query: 430 QAKSTLADQSEHKGQSYRFLYAKTLNHLGTNFQLMGYRYSTSGFYTLSDTMYKHMDGY-- 487
QA STL D S+H GQS RFLY K+LN GTN QL+GYRYSTSG++ +DT Y M+GY
Sbjct: 443 QANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGYNI 502

Query: 488 EFNDGDDEDTPMWSRYYNLFYTKRGKLQVNISQQLGEYGSFYLSGSQQTYWHTDQQDRLL 547
E DG + P ++ YYNL Y KRGKLQ+ ++QQLG + YLSGS QTYW T D
Sbjct: 503 ETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQTYWGTSNVDEQF 562

Query: 548 QFGYNTQIKDLSLGISWNYSKSRGQPDADQVFALNFSLPLNLLLPRSNDSYTRKKNYAWM 607
Q G NT +D++ +S++ +K+ Q DQ+ ALN ++P + L + S R +A
Sbjct: 563 QAGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSDSKSQWR---HASA 619

Query: 608 TSNTSIDNEGHTTQNLGLTETLLDDGNLSYSVQQGYNSEGKTANGS---ASMDYKGAFAD 664
+ + S D G T G+ TLL+D NLSYSVQ GY G +GS A+++Y+G + +
Sbjct: 620 SYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYGN 679

Query: 665 ARVGYNYSDNGSQQQLNYALSGSLVAHSQGITLGQSLGETNVLIAAPGAENTRVANSTGL 724
A +GY++SD+ +QL Y +SG ++AH+ G+TLGQ L +T VL+ APGA++ +V N TG+
Sbjct: 680 ANIGYSHSDD--IKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDAKVENQTGV 737

Query: 725 KTDWRGYTVVPYATSYRENRIALDAASLKRNVDLENAVVNVVPTKGALVLAEFNAHAGAR 784
+TDWRGY V+PYAT YRENR+ALD +L NVDL+NAV NVVPT+GA+V AEF A G +
Sbjct: 738 RTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKARVGIK 797

Query: 785 VLMKTSKQGIPLRFGAIATLDGVQANSGIIDDDGSLYMAGLPAKGTISVRWGEAPDQICH 844
+LM + PL FGA+ T + +SGI+ D+G +Y++G+P G + V+WGE + C
Sbjct: 798 LLMTLTHNNKPLPFGAMVTSES-SQSSGIVADNGQVYLSGMPLAGKVQVKWGEEENAHCV 856

Query: 845 INYELTEQQINSAITRMDAICR 866
NY+L + +T++ A CR
Sbjct: 857 ANYQLPPESQQQLLTQLSAECR 878


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00926  CLENTEROTOXN  32     0.004    Clostridium enterotoxin signature.
		>CLENTEROTOXN#Clostridium enterotoxin signature.

Length = 319

Score = 31.6 bits (71), Expect = 0.004
Identities = 13/48 (27%), Positives = 22/48 (45%)

Query: 295 VGVVVTDSQNNIISPAGGTLPLSIPDDADSIARMNVYPVSTTGVPPET 342
+ V TD + I+ A T L++ D +S N+Y ++ P T
Sbjct: 188 LTVPSTDIEKEILDLAAATERLNLTDALNSNPAGNLYDWRSSNSYPWT 235


16  CMJKDNLE_00984  CMJKDNLE_01026  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias     %GCBias  Product
CMJKDNLE_00984      -2       16   -4.656987  chaperone protein for trimethylamine-N-oxide
CMJKDNLE_00986      -1       16   -4.581890  chaperone modulator protein CbpM
CMJKDNLE_00987      -1       17   -4.910563  CbpA monomer
CMJKDNLE_00988       0       17   -4.118486  hypothetical protein
CMJKDNLE_00989       1       14    1.421186  3-phytase / glucose-1-phosphatase
CMJKDNLE_00990       2       18    2.715855  hypothetical protein
CMJKDNLE_00991       1       17    3.818979  WrbA monomer
CMJKDNLE_00992       0       18    3.925820  hypothetical protein
CMJKDNLE_00993      -1       18    4.031096  putative xanthine/uracil transporter
CMJKDNLE_00994      -1       20    4.488641  flavin reductase
CMJKDNLE_00995       0       16    3.413578  putative malonic semialdehyde reductase
CMJKDNLE_00996      -1       12    4.131664  putative aminoacrylate hydrolase
CMJKDNLE_00997       0       12    3.290372  putative aminoacrylate peracid reductase
CMJKDNLE_00998       0       13    3.051474  peroxyureidoacrylate / ureidoacrylate amido
CMJKDNLE_00999      -1       13    2.939867  pyrimidine oxygenase
CMJKDNLE_01000       0       15    2.399447  RutR DNA-binding transcriptional dual regulator
CMJKDNLE_01001      -1       14    2.536411  fused PutA transcriptional repressor / proline
CMJKDNLE_01002       0       13    0.456930  hypothetical protein
CMJKDNLE_01003       0       13    0.491776  proline:Na+ symporter
CMJKDNLE_01004      -2       13   -1.067217  hypothetical protein of the OFeT transport
CMJKDNLE_01005      -2       18   -3.241051  hypothetical protein
CMJKDNLE_01006      -2       24   -3.966138  heme-containing peroxidase/deferrochelatase
CMJKDNLE_01007      -1       28   -6.309730  ATP-binding protein
CMJKDNLE_01008       0       31   -7.327860  inner membrane protein involved in biofilm
CMJKDNLE_01009      -1       31   -7.783696  UDP-N-acetyl-D-glucosamine
CMJKDNLE_01010       0       32   -7.376628  poly-beta-1,6-N-acetyl-D-glucosamine
CMJKDNLE_01011      -1       29   -6.926177  partially N-deacetylated
CMJKDNLE_01012       0       27   -7.114105  diguanylate cyclase
CMJKDNLE_01013       1       21   -3.818666  hypothetical protein
CMJKDNLE_01014       0       18   -3.037830  putative inner membrane protein
CMJKDNLE_01016       0       18   -1.649975  *glyoxylate reductase / hydroxypyruvate
CMJKDNLE_01017      -1       17   -2.722332  zinc-binding phosphatase
CMJKDNLE_01018       0       19   -3.975074  protein involved in maturation of YcdX
CMJKDNLE_01019       1       23   -5.734777  putative inner membrane protein
CMJKDNLE_01020       0       25   -6.344107  curli secretion channel
CMJKDNLE_01021       2       33   -8.145961  curli assembly component
CMJKDNLE_01022       2       32   -7.955257  curli transport specificity factor
CMJKDNLE_01023       2       30   -6.324984  CsgD DNA-binding transcriptional dual regulator
CMJKDNLE_01024       0       25   -3.633390  curlin, minor subunit precursor
CMJKDNLE_01025       0       21   -4.169457  curlin, major subunit
CMJKDNLE_01026      -1       17   -3.809930  putative curli production protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00998  ISCHRISMTASE  75     3e-18    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 75.4 bits (185), Expect = 3e-18
Identities = 44/176 (25%), Positives = 71/176 (40%), Gaps = 23/176 (13%)

Query: 13 TFDPQQSALIVVDMQNAYATPGGYLDLAGFDVSTTRPVIANIQTAVTAARAAGMLIIWFQ 72
DP ++ L++ DMQN + +D S + ANI+ G+ +++
Sbjct: 25 VPDPNRAVLLIHDMQNYF------VDAFTAGASPVTELSANIRKLKNQCVQLGIPVVY-- 76

Query: 73 NGWDEQYVEAGGPGSPNFHKSNALKTMRKQPQLQGKLLAKGSWDYQLVDELVPQPGDIVL 132
PGS N L G L G ++ +++ EL P+ D+VL
Sbjct: 77 ---------TAQPGSQNPDDRALLTDF------WGPGLNSGPYEEKIITELAPEDDDLVL 121

Query: 133 PKPRYSGFFNTPLDSILRSRGIRHLVFTGIATNVCVESTLRDGFFLEYFGVVLEDA 188
K RYS F T L ++R G L+ TGI ++ T + F + + DA
Sbjct: 122 TKWRYSAFKRTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDA 177


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
CMJKDNLE_01000  HTHTETR  66     2e-15    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 65.8 bits (160), Expect = 2e-15
Identities = 30/165 (18%), Positives = 62/165 (37%), Gaps = 8/165 (4%)

Query: 10 GKRSRAVSAKKKAILSAALDTFSQFGFHGTRLEQIAELAGVSKTNLLYYFPSKEALYIAV 69
K + ++ IL AL FSQ G T L +IA+ AGV++ + ++F K L+ +
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 70 LRQILDIWLAPLKAFREDF--APLAAIKEYIRLKLEVSRDYPQASRLFCM-----EMLAG 122
++ F PL+ ++E + LE + + L + E +
Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGE 122

Query: 123 APLLMDELTGDLKALIDEKSALIAGWVKSGKL-APIDPQHLIFMI 166
++ D + +++ L A + + ++
Sbjct: 123 MAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIM 167


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01011  ARGDEIMINASE     30    0.047  Bacterial arginine deiminase signature.
		>ARGDEIMINASE#Bacterial arginine deiminase signature.

Length = 409

Score = 29.8 bits (67), Expect = 0.047
Identities = 27/183 (14%), Positives = 61/183 (33%), Gaps = 23/183 (12%)

Query: 450 WPRAAENELKK-AEVIEPRNINLEVEQAWTALTLQEWQQA--AVLTHDVVEREPQDPGVV 506
+ A E + A +++ + +E + + L ++ ++E E + +
Sbjct: 47 YLEVARQEHEVFASILKNNLVEIEYIEDLISEVLVSSVALENKFISQFILEAEIKTDFTI 106

Query: 507 -RLK---RAVDVHNLAELRIAGSTGIDAEGPDSGKHDVDLTTIVYS---PPLKDNWRGFA 559
LK ++ + N+ I+G E + DL P+ + F
Sbjct: 107 NLLKDYFSSLTIDNMISKMISGVVT--EELKNYTSSLDDLVNGANLFIIDPMPNVL--FT 162

Query: 560 GFGYADGQFSEGKGIVRDWLAGVEWRSRNIWLEAEYAERVFNHEHKPGARLSGWYDFNDN 619
D S G G+ + + + R E +AE +F + + W + +
Sbjct: 163 ----RDPFASIGNGVT---INKMFTKVRQ--RETIFAEYIFKYHPVYKENVPIWLNRWEE 213

Query: 620 WRI 622
+
Sbjct: 214 ASL 216


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01012  BINARYTOXINA     30    0.027  Clostridial binary toxin A signature.
		>BINARYTOXINA#Clostridial binary toxin A signature.

Length = 454

Score = 29.6 bits (66), Expect = 0.027
Identities = 22/77 (28%), Positives = 36/77 (46%), Gaps = 6/77 (7%)

Query: 335 DQVIKTVVNIIGKSIRPDDLLA--RVGGEEFGVLLTDIDTERAKALAERIRENVERLTGD 392
D + + N + + P +L+ R G +EFG+ LT + + K E I E+ G
Sbjct: 313 DSKVNNIENALKLTPIPSNLIVYRRSGPQEFGLTLTSPEYDFNK--IENIDAFKEKWEGK 370

Query: 393 NPEYAIPQKVTISIGAV 409
Y P ++ SIG+V
Sbjct: 371 VITY--PNFISTSIGSV 385


17  CMJKDNLE_01050  CMJKDNLE_01067  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias     %GCBias  Product
CMJKDNLE_01050       2       16    1.046103  hypothetical protein
CMJKDNLE_01051       2       15    1.265819  putative oxidoreductase, NAD(P)-binding
CMJKDNLE_01052       0       12    0.729312  lipid II flippase
CMJKDNLE_01053       1       20    0.631986  flagellar biosynthesis protein FlgN
CMJKDNLE_01054       2       16    0.790923  anti-sigma factor for FliA (sigma 28)
CMJKDNLE_01055       1       16    1.937720  flagellar biosynthesis; assembly of basal-body
CMJKDNLE_01056       2       15    2.166615  flagellar basal-body rod protein FlgB
CMJKDNLE_01057       3       13    2.133587  flagellar basal-body rod protein FlgC
CMJKDNLE_01058       2       11    2.256905  flagellar biosynthesis, initiation of hook
CMJKDNLE_01059       0       11    2.288691  flagellar hook protein FlgE
CMJKDNLE_01060      -1       12    2.157424  flagellar basal-body rod protein FlgF
CMJKDNLE_01061       0        9    1.029029  flagellar basal-body rod protein FlgG
CMJKDNLE_01062       1       13    1.953863  flagellar L-ring protein FlgH; basal-body
CMJKDNLE_01063       1       13    1.719456  flagellar P-ring protein FlgI
CMJKDNLE_01064       1       14    1.288816  FlgJ
CMJKDNLE_01065       2       15    0.876193  flagellar biosynthesis, hook-filament junction
CMJKDNLE_01066       3       18    1.171965  flagellar biosynthesis; hook-filament junction
CMJKDNLE_01067       3       17    1.384303  RNase E
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01059  FLGHOOKAP1       41    5e-06  Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 41.5 bits (97), Expect = 5e-06
Identities = 17/49 (34%), Positives = 29/49 (59%)

Query: 354 TLTNGALEASNVDLSKELVNMIVAQRNYQSNAQTIKTQDQILNTLVNLR 402
L+N S V+L +E N+ Q+ Y +NAQ ++T + I + L+N+R
Sbjct: 498 QLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR 546



Score = 36.9 bits (85), Expect = 1e-04
Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 4/56 (7%)

Query: 6 AVSGLNAAATNLDVIGNNIANSATYGFKSGTASFAD----MFAGSKVGLGVKVAGI 57
A+SGLNAA L+ NNI++ G+ T A + AG VG GV V+G+
Sbjct: 7 AMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVSGV 62


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01061  FLGHOOKAP1       44    4e-07  Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 43.8 bits (103), Expect = 4e-07
Identities = 18/81 (22%), Positives = 36/81 (44%), Gaps = 14/81 (17%)

Query: 3 SSLWIAKTGLDAQQTNMDVIANNLANVSTNGFKRQRAVFEDLLYQTIRQPGAQSSEQTTL 62
S + A +GL+A Q ++ +NN+++ + G+ RQ + + +TL
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTI--------------MAQANSTL 47

Query: 63 PSGLQIGTGVRPVATERLHSQ 83
+G +G GV +R +
Sbjct: 48 GAGGWVGNGVYVSGVQREYDA 68



Score = 41.1 bits (96), Expect = 3e-06
Identities = 11/41 (26%), Positives = 21/41 (51%)

Query: 220 ETSNVNVAEELVNMIQVQRAYEINSKAVSTTDQMLQKLTQL 260
S VN+ EE N+ + Q+ Y N++ + T + + L +
Sbjct: 505 SISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01062  FLGLRINGFLGH    349    e-126  Flagellar L-ring protein signature.
		>FLGLRINGFLGH#Flagellar L-ring protein signature.

Length = 232

Score = 349 bits (897), Expect = e-126
Identities = 232/232 (100%), Positives = 232/232 (100%)

Query: 1 MQKNAAHTYAISSLLVLSLTGCAWIPSTPLVQGATSAQPVPGPTPVANGSIFQSAQPINY 60
MQKNAAHTYAISSLLVLSLTGCAWIPSTPLVQGATSAQPVPGPTPVANGSIFQSAQPINY
Sbjct: 1 MQKNAAHTYAISSLLVLSLTGCAWIPSTPLVQGATSAQPVPGPTPVANGSIFQSAQPINY 60

Query: 61 GYQPLFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYLQGLFGNA 120
GYQPLFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYLQGLFGNA
Sbjct: 61 GYQPLFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYLQGLFGNA 120

Query: 121 RADVEASGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRF 180
RADVEASGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRF
Sbjct: 121 RADVEASGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRF 180

Query: 181 SGVVNPRTISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSPM 232
SGVVNPRTISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSPM
Sbjct: 181 SGVVNPRTISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSPM 232


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01063  FLGPRINGFLGI    427    e-152  Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 427 bits (1100), Expect = e-152
Identities = 157/363 (43%), Positives = 213/363 (58%), Gaps = 9/363 (2%)

Query: 4 FLSALILLLVTTAAQAERIRDLTSVQGVRQNSLIGYGLVVGLDGTGDQTTQTPFTTQTLN 63
F + L A RI+D+ S+Q R N LIGYGLVVGL GTGD +PFT Q++
Sbjct: 13 FSALPFLSTPPAQADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDSLRSSPFTEQSMR 72

Query: 64 NMLSQLGITVPTGTNMQLKNVAAVMVTASLPPFGRQGQTIDVVVSSMGNAKSLRGGTLLM 123
ML LGIT G + KN+AAVMVTA+LPPF G +DV VSS+G+A SLRGG L+M
Sbjct: 73 AMLQNLGITTQGGQS-NAKNIAAVMVTANLPPFASPGSRVDVTVSSLGDATSLRGGNLIM 131

Query: 124 TPLKGVDSQVYALAQGNILVGGAGASAGGSSVQVNQLNGGRITNGAVIERELPSQFGVGN 183
T L G D Q+YA+AQG ++V G A +++ R+ NGA+IERELPS+F
Sbjct: 132 TSLSGADGQIYAVAQGALIVNGFSAQGDAATLTQGVTTSARVPNGAIIERELPSKFKDSV 191

Query: 184 TLNLQLNDEDFSMAQQIADTINRVR----GYGSATALDARTIQVRVPSGNSSQVRFLADI 239
L LQL + DFS A ++AD +N G A D++ I V+ P + R +A+I
Sbjct: 192 NLVLQLRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQEIAVQKPRV-ADLTRLMAEI 250

Query: 240 QNMQVNVTPQDAKVVINSRTGSVVMNREVTLDSCAVAQGNLSVTVNRQANVSQPDTPFGG 299
+N+ V T AKVVIN RTG++V+ +V + AV+ G L+V V V QP PF
Sbjct: 251 ENLTVE-TDTPAKVVINERTGTIVIGADVRISRVAVSYGTLTVQVTESPQVIQP-APFSR 308

Query: 300 GQTVVTPQTQIDLRQSGGSLQSVRSSASLNNVVRALNALGATPMDLMSILQSMQSAGCLR 359
GQT V PQT I Q G + ++ L +V LN++G +++ILQ ++SAG L+
Sbjct: 309 GQTAVQPQTDIMAMQEGSKV-AIVEGPDLRTLVAGLNSIGLKADGIIAILQGIKSAGALQ 367

Query: 360 AKL 362
A+L
Sbjct: 368 AEL 370


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01064  FLGFLGJ         511      0.0  Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 511 bits (1318), Expect = 0.0
Identities = 313/313 (100%), Positives = 313/313 (100%)

Query: 1 MISDSKLLASAAWDAQSLNELKAKAGEDPAANIRPVARQVEGMFVQMMLKSMRDALPKDG 60
MISDSKLLASAAWDAQSLNELKAKAGEDPAANIRPVARQVEGMFVQMMLKSMRDALPKDG
Sbjct: 1 MISDSKLLASAAWDAQSLNELKAKAGEDPAANIRPVARQVEGMFVQMMLKSMRDALPKDG 60

Query: 61 LFSSEHTRLYTSMYDQQIAQQMTAGKGLGLAEMMVKQMTPEQPLPEESTPAAPMKFPLET 120
LFSSEHTRLYTSMYDQQIAQQMTAGKGLGLAEMMVKQMTPEQPLPEESTPAAPMKFPLET
Sbjct: 61 LFSSEHTRLYTSMYDQQIAQQMTAGKGLGLAEMMVKQMTPEQPLPEESTPAAPMKFPLET 120

Query: 121 VVRYQNQALSQLVQKAVPRNYDDSLPGDSKAFLAQLSLPAQLASQQSGVPHHLILAQAAL 180
VVRYQNQALSQLVQKAVPRNYDDSLPGDSKAFLAQLSLPAQLASQQSGVPHHLILAQAAL
Sbjct: 121 VVRYQNQALSQLVQKAVPRNYDDSLPGDSKAFLAQLSLPAQLASQQSGVPHHLILAQAAL 180

Query: 181 ESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRVYSSYL 240
ESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRVYSSYL
Sbjct: 181 ESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRVYSSYL 240

Query: 241 EALSDYVGLLTRNPRYAAVTTAASAEQGAQALQDAGYATDPHYARKLTNMIQQMKSISDK 300
EALSDYVGLLTRNPRYAAVTTAASAEQGAQALQDAGYATDPHYARKLTNMIQQMKSISDK
Sbjct: 241 EALSDYVGLLTRNPRYAAVTTAASAEQGAQALQDAGYATDPHYARKLTNMIQQMKSISDK 300

Query: 301 VSKTYSMNIDNLF 313
VSKTYSMNIDNLF
Sbjct: 301 VSKTYSMNIDNLF 313


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01065  FLGHOOKAP1      682      0.0  Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 682 bits (1762), Expect = 0.0
Identities = 545/546 (99%), Positives = 545/546 (99%)

Query: 2 SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMEQANSTLGAGGWVGNGVYVS 61
SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIM QANSTLGAGGWVGNGVYVS
Sbjct: 1 SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVS 60

Query: 62 GVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLV 121
GVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLV
Sbjct: 61 GVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLV 120

Query: 122 SNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLND 181
SNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLND
Sbjct: 121 SNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLND 180

Query: 182 QISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTA 241
QISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTA
Sbjct: 181 QISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTA 240

Query: 242 RQLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQL 301
RQLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQL
Sbjct: 241 RQLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQL 300

Query: 302 ALAFAEAFNTQHKAGFDANGDAGEDFFAIGKPAVLQNTKNKGDVAIGATVTDASAVLATD 361
ALAFAEAFNTQHKAGFDANGDAGEDFFAIGKPAVLQNTKNKGDVAIGATVTDASAVLATD
Sbjct: 301 ALAFAEAFNTQHKAGFDANGDAGEDFFAIGKPAVLQNTKNKGDVAIGATVTDASAVLATD 360

Query: 362 YKISFDNNQWQVTRLASNTTFTVTPDANGKVAFDGLELTFTGTPAVNDSFTLKPVSDAIV 421
YKISFDNNQWQVTRLASNTTFTVTPDANGKVAFDGLELTFTGTPAVNDSFTLKPVSDAIV
Sbjct: 361 YKISFDNNQWQVTRLASNTTFTVTPDANGKVAFDGLELTFTGTPAVNDSFTLKPVSDAIV 420

Query: 422 NMDVLITDEAKIAMASEEDAGDSDNRNGQALLDLQSNSKTVGGAKSFNDAYASLVSDIGN 481
NMDVLITDEAKIAMASEEDAGDSDNRNGQALLDLQSNSKTVGGAKSFNDAYASLVSDIGN
Sbjct: 421 NMDVLITDEAKIAMASEEDAGDSDNRNGQALLDLQSNSKTVGGAKSFNDAYASLVSDIGN 480

Query: 482 KTATLKTSSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFD 541
KTATLKTSSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFD
Sbjct: 481 KTATLKTSSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFD 540

Query: 542 ALINIR 547
ALINIR
Sbjct: 541 ALINIR 546


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01066  FLAGELLIN        45    2e-07  Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 45.4 bits (107), Expect = 2e-07
Identities = 30/132 (22%), Positives = 60/132 (45%), Gaps = 3/132 (2%)

Query: 7 MMYQQNMRGITNSQAEWMKYGEQMSTGKRVVNPSDDPIAASQAVVLSQAQAQNSQYTLAR 66
++ Q N+ +S + + E++S+G R+ + DD + A + +Q +
Sbjct: 11 LLTQNNLNKSQSSLSSAI---ERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQASRNA 67

Query: 67 TFATQKVSLEESVLSQVTTAIQNAQEKIVYASNGTLSDDDRASLATDIQGLRDQLLNLAN 126
E L+++ +Q +E V A+NGT SD D S+ +IQ +++ ++N
Sbjct: 68 NDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEIDRVSN 127

Query: 127 TTDGNGRYIFAG 138
T NG + +
Sbjct: 128 QTQFNGVKVLSQ 139


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01067  IGASERPTASE      65    2e-12  IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 64.7 bits (157), Expect = 2e-12
Identities = 47/288 (16%), Positives = 83/288 (28%), Gaps = 36/288 (12%)

Query: 513 PSEEEFAERKRPEQPALATFAMPDVPPAPT-PAEPAAPVVAPAPKAAPATPVAPAQPGLL 571
P E+ + DVP P+ E A AP P APATP
Sbjct: 983 PEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETT----- 1037

Query: 572 SRFFGALKALFSGGEETKPTEQPAPKAEAKPERQQDRRKPRQNNRRDRNERRDTRSER-- 629
ET + Q QN + + + ++
Sbjct: 1038 ---------------ETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQT 1082

Query: 630 TEGSDNREENRRNRRQAQQQTAETRESRQQAEVTEKARTADEQQAPRRERSRRRNDDKRQ 689
E + + E + + ++TA + + TEK + + + + + + Q
Sbjct: 1083 NEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQ 1142

Query: 690 AQ---QEAKALNVEEQSVQETEQEERVRPVQPRRKQRQLNQKVRYEQSV--AEEAVVAPV 744
A+ + +N++E Q + +P + + Q V +V V P
Sbjct: 1143 AEPARENDPTVNIKEPQSQTNTTADTEQPA--KETSSNVEQPVTESTTVNTGNSVVENPE 1200

Query: 745 VEETVAAEPIVQEAPA------PRTELVKVPLPVVAQTAPEQQEENNA 786
+P V + R + VP V T A
Sbjct: 1201 NTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVA 1248



Score = 63.2 bits (153), Expect = 5e-12
Identities = 46/261 (17%), Positives = 83/261 (31%), Gaps = 26/261 (9%)

Query: 551 VAPAPKAAPATPVAPAQPGLLSRFFGALKALFSGGEETKPTEQP-APKAEAKPERQQDRR 609
+ P + S E + E P P A A P
Sbjct: 991 TVDTTNITTPNNIQADVPSVPSN----------NEEIARVDEAPVPPPAPATPSETT--- 1037

Query: 610 KPRQNNRRDRNERRDTRSERTEGSDNREENRRNRRQAQQQTAETRESRQQAEV------T 663
N ++++++ D E +NR A++ + + + Q EV T
Sbjct: 1038 -----ETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSET 1092

Query: 664 EKARTADEQQAPRRERSRRRNDDKRQAQQEAKALNVEEQSVQETEQEERVRPVQPRRKQR 723
++ +T + ++ E+ + + + Q+ K + + QE + + + R
Sbjct: 1093 KETQTTETKETATVEKEEKAKVETEKTQEVPK-VTSQVSPKQEQSETVQPQAEPARENDP 1151

Query: 724 QLNQKVRYEQSVAEEAVVAPVVEETVAAEPIVQEAPAPRTELVKVPLPVVAQTAPEQQEE 783
+N K Q+ P E + E V E+ T V P A Q
Sbjct: 1152 TVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTV 1211

Query: 784 NNADNRDNGGMPRRSRRSPRH 804
N+ + RRS RS H
Sbjct: 1212 NSESSNKPKNRHRRSVRSVPH 1232


18  CMJKDNLE_01104  CMJKDNLE_01156  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias     %GCBias  Product
CMJKDNLE_01104      -2       19   -4.247856  protein deacetylase, Sir2
CMJKDNLE_01105      -1       21   -5.058064  putative inner membrane protein
CMJKDNLE_01106       1       20   -3.745888  putative inner membrane protein
CMJKDNLE_01107       1       22   -3.591601  putrescine / spermidine ABC transporter -
CMJKDNLE_01108       1       26   -4.578414  putrescine / spermidine ABC transporter -
CMJKDNLE_01109       2       30   -4.992716  e14 prophage; predicted integrase
CMJKDNLE_01110       2       26   -4.590210  hypothetical protein
CMJKDNLE_01111       3       27   -5.027542  exonuclease VIII, ds DNA exonuclease, 5' --> 3'
CMJKDNLE_01112       6       42   -8.288959  Qin prophage; predicted protein
CMJKDNLE_01113       5       29   -5.629619  Qin prophage; cell division inhibition protein
CMJKDNLE_01115       5       29   -1.471442  Qin prophage; predicted protein
CMJKDNLE_01116       4       27   -1.541777  hypothetical protein
CMJKDNLE_01117       4       30   -1.754749  LexA repressor
CMJKDNLE_01118       4       27   -1.745665  hypothetical protein
CMJKDNLE_01119       3       27   -1.893186  Rac prophage; predicted protein
CMJKDNLE_01120       2       30   -3.396881  hypothetical protein
CMJKDNLE_01121       3       32   -6.902362  hypothetical protein
CMJKDNLE_01122       2       28   -3.641054  hypothetical protein
CMJKDNLE_01123       2       26   -2.710141  hypothetical protein
CMJKDNLE_01124       2       25   -1.947458  hypothetical protein
CMJKDNLE_01125       2       26   -3.399944  hypothetical protein
CMJKDNLE_01127       2       25   -2.345480  Qin prophage; small toxic polypeptide
CMJKDNLE_01128       3       24   -2.161897  Qin prophage; predicted protein
CMJKDNLE_01129       3       31   -4.243302  endodeoxyribonuclease RUS (Holliday junction
CMJKDNLE_01130       3       30   -3.627833  Qin prophage; predicted antitermination protein
CMJKDNLE_01134       3       27   -3.205250  *inhibitor of sS proteolysis
CMJKDNLE_01135       2       25   -1.912147  hypothetical protein
CMJKDNLE_01136       2       25   -1.413959  DLP12 prophage; predicted phage lysis protein
CMJKDNLE_01137       0       24   -1.164672  Qin prophage; predicted protein
CMJKDNLE_01138       1       21   -0.521111  Qin prophage; predicted lysozyme
CMJKDNLE_01139       2       21   -1.048759  DLP12 prophage; predicted murein endopeptidase
CMJKDNLE_01140       2       22   -1.117383  bacteriophage lambda Bor protein
CMJKDNLE_01141       3       21   -0.206292  hypothetical protein
CMJKDNLE_01142       4       23   -0.557680  hypothetical protein
CMJKDNLE_01143       4       24   -0.939925  e14 prophage; predicted DNA-binding
CMJKDNLE_01144       5       28   -0.655588  hypothetical protein
CMJKDNLE_01145       5       30    0.076589  hypothetical protein
CMJKDNLE_01146       6       31    0.222908  hypothetical protein
CMJKDNLE_01147       5       35   -0.488659  hypothetical protein
CMJKDNLE_01148       4       36   -0.407792  hypothetical protein
CMJKDNLE_01149       5       35    0.198830  hypothetical protein
CMJKDNLE_01150       4       33    0.078180  hypothetical protein
CMJKDNLE_01151       4       32   -0.138290  hypothetical protein
CMJKDNLE_01152       3       29    0.070075  hypothetical protein
CMJKDNLE_01153       3       28    1.424717  hypothetical protein
CMJKDNLE_01154       3       27    2.119697  hypothetical protein
CMJKDNLE_01155       3       23    3.186283  hypothetical protein
CMJKDNLE_01156       4       25    4.573665  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01107  CHLAMIDIAOMP     28    0.044  Chlamydia major outer membrane protein signature.
		>CHLAMIDIAOMP#Chlamydia major outer membrane protein signature.

Length = 393

Score = 28.4 bits (63), Expect = 0.044
Identities = 19/67 (28%), Positives = 28/67 (41%), Gaps = 8/67 (11%)

Query: 137 GVNGDAVDPKSVTSWADL------WKPEYKGSLLLTDDAREVFQMALRKLGYSGNTTDPK 190
G GD DP T+W D + ++ +L D + FQM + +GN T P
Sbjct: 42 GFGGDPCDP--CTTWCDAISMRMGYYGDFVFDRVLKTDVNKEFQMGDKPTSTTGNATAPT 99

Query: 191 EIEAAYN 197
+ A N
Sbjct: 100 TLTAREN 106


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01127  HOKGEFTOXIC      67    3e-19  Hok/Gef cell toxic protein family signature.
		>HOKGEFTOXIC#Hok/Gef cell toxic protein family signature.

Length = 52

Score = 67.2 bits (164), Expect = 3e-19
Identities = 19/46 (41%), Positives = 32/46 (69%)

Query: 23 QKAMLIALIVICLTVIVTALVTRKDLCEVRIRTGQTEVAVFTAYEP 68
+ +++ ++++CLT+++ +TRK LCE+R R G EVA F AYE
Sbjct: 5 RSSLVWCVLIVCLTLLIFTYLTRKSLCEIRYRDGYREVAAFMAYES 50


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01140  PF06291         174    1e-60  Lambda prophage Bor protein
		>PF06291#Lambda prophage Bor protein

Length = 102

Score = 174 bits (441), Expect = 1e-60
Identities = 95/97 (97%), Positives = 96/97 (98%)

Query: 1 MKKMLFSAALAMLITGCAQQTFTVGNKPTAVTPKETITHHFFVSGIGQEKTVDAAKICGG 60
MKKMLFSAALAMLITGCAQQTFTVGNKPTAVTPKETITHHFFVSGIGQ+KTVDAAKICGG
Sbjct: 6 MKKMLFSAALAMLITGCAQQTFTVGNKPTAVTPKETITHHFFVSGIGQKKTVDAAKICGG 65

Query: 61 AENVVKTETQQTFVNGFLGFITLGIYTPLEARVYCSQ 97
AENVVKTETQQTFVNG LGFITLGIYTPLEARVYCSQ
Sbjct: 66 AENVVKTETQQTFVNGLLGFITLGIYTPLEARVYCSQ 102


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01147  cloacin          32    0.003  Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 32.4 bits (73), Expect = 0.003
Identities = 25/88 (28%), Positives = 37/88 (42%), Gaps = 6/88 (6%)

Query: 36 EWNRAKAELDALDEQIAREEELRRQDQAYVDESGPEERQNNEAENGKKAVEEKRAAAFNR 95
+ RA+AEL+ +E +AR +E QA + + +A N A FNR
Sbjct: 322 NYERARAELNQANEDVARNQE----RQAKAVQVYNSRKSELDAANKTLADAIAEIKQFNR 377

Query: 96 FLRAGFAELNAEERNLMRELRAQSVTTD 123
F A + + M L+AQ TD
Sbjct: 378 FAHDPMAGGHRMWQ--MAGLKAQRAQTD 403


19  CMJKDNLE_01175  CMJKDNLE_01197  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias     %GCBias  Product
CMJKDNLE_01175       0       20   -4.711129  HMP-PP hydrolase /thiamin pyrophosphate
CMJKDNLE_01176       0       22   -5.285560  23S rRNA pseudouridine 2457 synthase
CMJKDNLE_01177       1       25   -6.761075  isocitrate dehydrogenase
CMJKDNLE_01178       2       37   -9.487905  hypothetical protein
CMJKDNLE_01179       1       33   -8.154744  BluR DNA-binding transcriptional repressor
CMJKDNLE_01180       3       35   -9.355513  blue light-responsive regulator of BluR
CMJKDNLE_01181       4       41  -11.210131  hypothetical protein
CMJKDNLE_01182       4       35   -9.135412  regulator of acid resistance, influenced by
CMJKDNLE_01183       3       32   -8.583076  protein involved in biofilm formation
CMJKDNLE_01184       2       31   -8.141762  hypothetical protein
CMJKDNLE_01185       2       33   -7.715718  inner membrane protein that interacts with cell
CMJKDNLE_01186       1       32   -7.516607  hypothetical protein
CMJKDNLE_01187      -1       17   -2.504740  hypothetical protein
CMJKDNLE_01188       1       19   -2.647750  hypothetical protein
CMJKDNLE_01189       0       21   -3.420268  hypothetical protein
CMJKDNLE_01190      -2       20   -3.960170  hypothetical protein
CMJKDNLE_01191       0       21   -4.921651  cell division topological specificity factor and
CMJKDNLE_01192      -1       20   -3.581906  membrane ATPase of the MinC-MinD-MinE system
CMJKDNLE_01193      -2       21   -4.213402  cell division inhibitor of the MinC-MinD-MinE
CMJKDNLE_01194      -1       21   -6.938087  hypothetical protein
CMJKDNLE_01195      -1       19   -6.202418  inhibitor of g-type lysozyme
CMJKDNLE_01196      -2       18   -4.260973  hypothetical protein
CMJKDNLE_01197      -2       15   -3.362961  putative isomerase/hydrolase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01186  IGASERPTASE      42    4e-06  IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 42.4 bits (99), Expect = 4e-06
Identities = 30/115 (26%), Positives = 57/115 (49%), Gaps = 7/115 (6%)

Query: 387 GTTTLKLSENTIWNMKDDSVVTHLTNSDSIINL-SYDDGQTFTQGKTLTVKGNYVGNNGQ 445
G + ++L+EN+ W++ +S V L ++ I+L S D+ T+ TLTV N + NG
Sbjct: 842 GNSQVRLTENSHWHLTGNSDVHQLDLANGHIHLNSADNSNNVTKYNTLTV--NSLSGNGS 899

Query: 446 LNIRTVLGDDKSATDRLIVEGNTSGSTTVYVKNAGGSGAATLNGIELITVNGDES 500
T L + + D+++V + +G+ T+ V + G N + L + +
Sbjct: 900 FYYLTDLSNKQG--DKVVVTKSATGNFTLQVADKTGE--PNHNELTLFDASKAQR 950


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01187  PRTACTNFAMLY     62    1e-12  Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 62.4 bits (151), Expect = 1e-12
Identities = 53/225 (23%), Positives = 83/225 (36%), Gaps = 39/225 (17%)

Query: 110 WRLGVMAGYARDYNLTHSSVSDYRSKGSVRGYSAGLYATWFADDISKKGAYIDSWAQYSW 169
W LG +AGY R + G G YAT+ AD G Y+D+ + S
Sbjct: 690 WHLGGLAGYTR----GDRGFTGDGG-GHTDSVHVGGYATYIADS----GFYLDATLRASR 740

Query: 170 FKN----------SVKGDELAYESYSAKGATVSLEAGYGFALNKSFGLEAAKYTWIFQPQ 219
+N +VKG Y G SLEAG F W +PQ
Sbjct: 741 LENDFKVAGSDGYAVKGK------YRTHGVGASLEAGRRFTHADG---------WFLEPQ 785

Query: 220 AQAIWMGVDHNAHTEANGSRIENDANNNIQTRLGFRTFIRTQEKNSGPHGDDFEPFVEMN 279
A+ A+ ANG R+ ++ +++ RLG R + G +P+++ +
Sbjct: 786 AELAVFRAGGGAYRAANGLRVRDEGGSSVLGRLGLEVGKRIELAG----GRQVQPYIKAS 841

Query: 280 WIHNSK-DFAVSMNGVKVEQDGASNLGEIKLGVNGNLNPAASVWG 323
+ V NG+ + E+ LG+ L S++
Sbjct: 842 VLQEFDGAGTVHTNGIAHRTELRGTRAELGLGMAAALGRGHSLYA 886


20  CMJKDNLE_01260  CMJKDNLE_01286  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias     %GCBias  Product
CMJKDNLE_01260      -2       23   -3.940943  **formyltetrahydrofolate deformylase
CMJKDNLE_01261      -1       28   -4.607719  hypothetical protein
CMJKDNLE_01262      -1       23   -3.726888  hypothetical protein
CMJKDNLE_01263      -1       23   -3.975384  TorR transcriptional dual regulator
CMJKDNLE_01264       1       32   -2.764946  UTP--glucose-1-phosphate uridylyltransferase
CMJKDNLE_01265       1       31   -2.545305  H-NS DNA-binding transcriptional dual regulator
CMJKDNLE_01266       1       29   -2.804455  thymidine kinase / deoxyuridine kinase
CMJKDNLE_01267       2       31   -2.466571  IS1 predicted transposase
CMJKDNLE_01268       2       29   -3.519776  hypothetical protein
CMJKDNLE_01269       1       26   -4.075609  pyruvate formate-lyase deactivase
CMJKDNLE_01270       2       17   -4.829652  putative inner membrane protein
CMJKDNLE_01271       0       14   -3.927026  hypothetical protein
CMJKDNLE_01272       0       12   -2.896676  hypothetical protein
CMJKDNLE_01273       0       11   -2.526901  peptide ABC transporter - periplasmic binding
CMJKDNLE_01274       0       12   -1.327041  murein tripeptide ABC transporter / peptide ABC
CMJKDNLE_01275       0       13   -0.531717  murein tripeptide ABC transporter / peptide ABC
CMJKDNLE_01276       0       16   -2.851248  murein tripeptide ABC transporter / peptide ABC
CMJKDNLE_01277      -1       19   -3.493175  murein tripeptide ABC transporter / peptide ABC
CMJKDNLE_01278      -1       21   -3.312707  hypothetical protein
CMJKDNLE_01279      -2       23   -3.233075  cardiolipin synthase
CMJKDNLE_01280      -2       24   -4.855037  hypothetical protein
CMJKDNLE_01281      -1       21   -4.469466  K+ channel Kch monomer
CMJKDNLE_01282       2       18   -2.061223  putative enzyme
CMJKDNLE_01283       0       19   -3.287007  TonB energy transducing system - TonB subunit
CMJKDNLE_01284       0       22   -5.305540  acyl-CoA thioesterase
CMJKDNLE_01285       0       21   -5.271196  putative inner membrane protein
CMJKDNLE_01286      -1       21   -3.599536  putative inner membrane protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01261  SECA             57    2e-12  SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 57.2 bits (138), Expect = 2e-12
Identities = 16/28 (57%), Positives = 20/28 (71%)

Query: 125 IDGTRPQFGRNDPCPCGSGKKFKKCCGQ 152
+ GRNDPCPCGSGKK+K+C G+
Sbjct: 872 AQTGERKVGRNDPCPCGSGKKYKQCHGR 899


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01263  HTHFIS           90    7e-22  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 89.5 bits (222), Expect = 7e-22
Identities = 40/152 (26%), Positives = 64/152 (42%), Gaps = 3/152 (1%)

Query: 10 ILIVEDEQVFRSLLDSWFSSLGATTVLAADGVDALELLGGFTPDLMICDIAMPRMNGLKL 69
IL+ +D+ R++L+ S G + ++ + DL++ D+ MP N L
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 70 LEHIRNRGDQTPVLVISATENMADIAKALRLGVEDVLLKPVKDLNRLREMVFACLYPSMF 129
L I+ PVLV+SA KA G D L KP DL L ++ L +
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPF-DLTELIGIIGRAL--AEP 122

Query: 130 NSRVEEEERLFRDWDAMVDNPAAAAKLLQELQ 161
R + E +D +V AA ++ + L
Sbjct: 123 KRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLA 154


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01277  HTHFIS           31    0.008  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 30.6 bits (69), Expect = 0.008
Identities = 9/16 (56%), Positives = 11/16 (68%)

Query: 55 VVGESGCGKSTFARAI 70
+ GESG GK ARA+
Sbjct: 165 ITGESGTGKELVARAL 180


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01282  adhesinmafb      31    4e-04  Neisseria meningitidis: adhesin MafB signature.
		>adhesinmafb#Neisseria meningitidis: adhesin MafB signature.

Length = 467

Score = 31.2 bits (70), Expect = 4e-04
Identities = 16/57 (28%), Positives = 20/57 (35%), Gaps = 2/57 (3%)

Query: 41 GPMPAVDSNDPGAAGFTGSTVIAEFESLEAAQAWADADPYVAAGVYEHVSVKPFKKV 97
P+PA G GS E + EA W +P A V +V KV
Sbjct: 268 APLPA--EGKFAVIGGLGSVAGFEKNTREAVDRWIQENPNAAETVEAVFNVAAAAKV 322


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01283  TONBPROTEIN     262    6e-91  Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 262 bits (670), Expect = 6e-91
Identities = 239/239 (100%), Positives = 239/239 (100%)

Query: 6 MTLDLPRRFPWPTLLSVCIHGAVVAGLLYTSVHQVIELPAPAQPISVTMVTPADLEPPQA 65
MTLDLPRRFPWPTLLSVCIHGAVVAGLLYTSVHQVIELPAPAQPISVTMVTPADLEPPQA
Sbjct: 1 MTLDLPRRFPWPTLLSVCIHGAVVAGLLYTSVHQVIELPAPAQPISVTMVTPADLEPPQA 60

Query: 66 VQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPVESR 125
VQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPVESR
Sbjct: 61 VQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPVESR 120

Query: 126 PASPFENTAPARLTSSTATAATSKPVTSVASGPRALSRNQPQYPARAQALRIEGQVKVKF 185
PASPFENTAPARLTSSTATAATSKPVTSVASGPRALSRNQPQYPARAQALRIEGQVKVKF
Sbjct: 121 PASPFENTAPARLTSSTATAATSKPVTSVASGPRALSRNQPQYPARAQALRIEGQVKVKF 180

Query: 186 DVTPDGRVDNVQILSAKPANMFEREVKNAMRRWRYEPGKPGSGIVVNILFKINGTTEIQ 244
DVTPDGRVDNVQILSAKPANMFEREVKNAMRRWRYEPGKPGSGIVVNILFKINGTTEIQ
Sbjct: 181 DVTPDGRVDNVQILSAKPANMFEREVKNAMRRWRYEPGKPGSGIVVNILFKINGTTEIQ 239


21  CMJKDNLE_01332  CMJKDNLE_01342  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias     %GCBias  Product
CMJKDNLE_01332      -1       17    3.746699  glutamate-putrescine ligase
CMJKDNLE_01333       0       17    3.643409  gamma-glutamyl-gamma-aminobutyrate hydrolase
CMJKDNLE_01334       1       18    3.376315  DNA-binding transcriptional repressor
CMJKDNLE_01335       2       20    3.496690  gamma-glutamyl-gamma-aminobutyraldehyde
CMJKDNLE_01336       2       17    2.513465  gamma-glutamylputrescine oxidase
CMJKDNLE_01337       2       12    1.206133  4-aminobutyrate aminotransferase
CMJKDNLE_01338       3       11   -1.135693  PspF transcriptional dual regulator
CMJKDNLE_01339      -1       18   -4.807566  regulatory protein for the phage shock protein
CMJKDNLE_01340      -1       20   -5.088191  stimulates PspC-mediated transcriptional
CMJKDNLE_01341       0       16   -4.300640  PspC transcriptional regulator; toxin of a
CMJKDNLE_01342      -1       15   -3.292337  peripheral inner membrane phage-shock protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01338  HTHFIS          341    e-117  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 341 bits (877), Expect = e-117
Identities = 125/341 (36%), Positives = 182/341 (53%), Gaps = 23/341 (6%)

Query: 6 DNLLGEANSFLEVLEQVSHLAPLDKPVLIIGERGTGKELIASRLHYLSSRWQGPFISLNC 65
L+G + + E+ ++ L D ++I GE GTGKEL+A LH R GPF+++N
Sbjct: 137 MPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINM 196

Query: 66 AALNENLLDSELFGHEAGAFTGAQKRHPGRFERADGGTLFLDELATAPMMVQEKLLRVIE 125
AA+ +L++SELFGHE GAFTGAQ R GRFE+A+GGTLFLDE+ PM Q +LLRV++
Sbjct: 197 AAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQ 256

Query: 126 YGELERVGGSQPLQVNVRLVCATNADLPAMVNEGTFRADLLDRLAFDVVQLPPLRERESD 185
GE VGG P++ +VR+V ATN DL +N+G FR DL RL ++LPPLR+R D
Sbjct: 257 QGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRAED 316

Query: 186 IMLMAEYFAIQMCREIKLPLFPGFTERARETLLNYRWPGNIRELKNVVERSVYRHGTSDY 245
I + +F Q +E F + A E + + WPGN+REL+N+V R +
Sbjct: 317 IPDLVRHFVQQAEKEGLDVK--RFDQEALELMKAHPWPGNVRELENLVRRLTALYPQDVI 374

Query: 246 PLDDIIID---PFKRRPPEDAIAVSETTSLPTLPLD------------------LREFQM 284
+ I + P E A A S + S+ +
Sbjct: 375 TREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLYDRVLA 434

Query: 285 QQEKELLQLSLQQGKYNQKRAAELLGLTYHQFRALLKKHQI 325
+ E L+ +L + NQ +AA+LLGL + R +++ +
Sbjct: 435 EMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGV 475


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01340  MPTASEINHBTR     25    0.030  Metalloprotease inhibitor signature.
		>MPTASEINHBTR#Metalloprotease inhibitor signature.

Length = 122

Score = 24.6 bits (53), Expect = 0.030
Identities = 7/43 (16%), Positives = 17/43 (39%)

Query: 30 SGRSELSQSEQQRLAQLADEAKRMRERIQALESILDAEHPNWR 72
+G+ + + A A++A + + E L + +W
Sbjct: 37 AGQLGIEATGSGVCAGPAEQANALAGDVACAEQWLGDKPVSWS 79


22  CMJKDNLE_01362  CMJKDNLE_01367  Y  N  N  Genomic Island
LocusTag        DNBias  CDNBias     %GCBias  Product
CMJKDNLE_01362      -1       20   -4.449793  murein peptide amidase A
CMJKDNLE_01363      -1       21   -5.640753  putative oxidoreductase
CMJKDNLE_01364      -1       18   -4.731034  putative hydrolase
CMJKDNLE_01365      -1       16   -4.254195  putative transcriptional regulator LysR-type
CMJKDNLE_01366      -1       14   -3.512459  murein tripeptide ABC transporter OppBCDFMppA -
CMJKDNLE_01367      -1       13   -3.133112  mechanosensitive channel YnaI monomer
23  CMJKDNLE_01407  CMJKDNLE_01415  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias     %GCBias  Product
CMJKDNLE_01407       2       13    0.314715  beta-ketoadipyl-CoA thiolase
CMJKDNLE_01408       2       14   -1.872612  phenylacetate-CoA ligase
CMJKDNLE_01409       2       15   -1.840718  PaaX DNA-binding transcriptional repressor
CMJKDNLE_01410       2       18   -3.439412  putative hexapeptide repeat acetyltransferase
CMJKDNLE_01411       1       19   -3.677575  hypothetical protein
CMJKDNLE_01412       1       19   -4.032322  hypothetical protein
CMJKDNLE_01413      -1       21   -4.146784  hypothetical protein
CMJKDNLE_01414      -2       28   -4.248837  putative oxidoreductase, NAD(P)-binding
CMJKDNLE_01415      -2       26   -4.521945  protein involved in detoxification of
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01409  PF08280          30    0.016  M protein trans-acting positive regulator
		>PF08280#M protein trans-acting positive regulator

Length = 530

Score = 29.8 bits (67), Expect = 0.016
Identities = 10/36 (27%), Positives = 16/36 (44%)

Query: 185 RVEECWHLTEQNAMYETFIQSFRPLVPLLKEAADEL 220
+ +C L E+N + + L+P LKE L
Sbjct: 311 HIRQCCQLFEENDTFRLLLNPIITLLPNLKEQKASL 346


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
CMJKDNLE_01412    FLAGELLIN    38    8e-05    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 38.1 bits (88), Expect = 8e-05
Identities = 37/336 (11%), Positives = 76/336 (22%), Gaps = 9/336 (2%)

Query: 161 NGKTTVDGKDSTGTEINGNNGKVIQDGDLDVSGGGHGIDITGDSATVDNKGTMTVTDPES 220
NG + + ++ N+G+ I + + D V+ TV D
Sbjct: 132 NGVKVLSQDNQMKIQVGANDGETITIDLQKIDVK----SLGLDGFNVNGPKEATVGDL-- 185

Query: 221 MGIQIDGDKAIVNNEGESTITNGGTGTQINGDDATANN-NGKTTVDGKDSTGTEINGNNG 279
+ + D TA K V+ + T + N
Sbjct: 186 -KSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDDAENN 244

Query: 280 KVIQDGDLDVSGGGHGIDITGDSATVDNKGTMTVTDPESIGIQVDGDQAVVNNEGESAIT 339
+ S G + + + D + + +D N S
Sbjct: 245 TAVDLFKTTKSTAG-TAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVSTTI 303

Query: 340 NGGTGTQINGDDATANNNGKTTVDGKDSTGTEIAGNNGKVIQDGDLDVSGGGHGIDITGD 399
NG T D N N D + S ++
Sbjct: 304 NGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEANNA 363

Query: 400 SATVDNKGTMTVTDPESIGIQIDGDQAIVNNEGESTITNGGTGTQINGNDATANNSGKTT 459
+ ++ + + + +
Sbjct: 364 VKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAAKKSTANPLAS 423

Query: 460 VDGKDSTGTKIAGNIGIVNLDGSLTVTGGAHGVENI 495
+D S + ++G + +T + V N+
Sbjct: 424 IDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNL 459



Score = 35.0 bits (80), Expect = 7e-04
Identities = 43/277 (15%), Positives = 76/277 (27%), Gaps = 14/277 (5%)

Query: 248 QINGDDATANNNGKTTVDGKDSTGTEINGNNGKVIQ-----------DGDLDVSGGGHGI 296
+I+ NG + + ++ N+G+ I D G
Sbjct: 121 EIDRVSNQTQFNGVKVLSQDNQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKEA 180

Query: 297 DITGDSATVDNKGTMTVTDPESIGIQVDGDQAVVNNEGESAITNGGTGTQINGDDATANN 356
+ ++ N + +VD + V + + T ++
Sbjct: 181 TVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDD 240

Query: 357 NGKTTVDGKDSTGTEIAGNNGKVIQDGDLDVSGGGHGIDITGDSATVDNKGTMTVTDPES 416
T T AG G + G D G + T+D K S
Sbjct: 241 AENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVS 300

Query: 417 IGIQIDGDQAIVNNEGESTITNGGTGTQINGNDATANNSGKTTVDGK---DSTGTKIAGN 473
I + V + Q + N T+ +G+ T D K +S
Sbjct: 301 TTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEA 360

Query: 474 IGIVNLDGSLTVTGGAHGVENIGDNGTVNNKGVMTPT 510
V + +TV G + GD T+ K +
Sbjct: 361 NNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDK 397


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
CMJKDNLE_01413    PF03944    32    0.020    delta endotoxin
		>PF03944#delta endotoxin

Length = 633

Score = 32.0 bits (72), Expect = 0.020
Identities = 54/204 (26%), Positives = 75/204 (36%), Gaps = 31/204 (15%)

Query: 855 IMNISGTGAVAMEGDKNAQLVNNGTINLGTAGTTDTGMIGMQLDANATAD---AVIENNG 911
I NISG V D L N N+ + T G + + + AV EN
Sbjct: 423 IRNISGVPLVVRNEDLRRPLHYNEIRNIASPSGTPGGARAYMVSVHNRKNNIHAVHENGS 482

Query: 912 TINIFANDSFAFSVLGTVGHVVNNGTVVIADGVTGSGLIKQGDSINVEGMNGN------- 964
I++ ND F++ VNN T G+ QGDS+ E N
Sbjct: 483 MIHLAPNDYTGFTISPIHATQVNNQTRTFISEKFGN----QGDSLRFEQNNTTARYTLRG 538

Query: 965 NGNSSEVHYGDYTLPDVPKPNTVSVTSGSDEAGGSMNNLNGYV-VGTNVNGSAGKLKVNN 1023
NGNS Y Y +T+ VT +NG V TNVN + VN+
Sbjct: 539 NGNS----YNLYLRVSSIGNSTIRVT------------INGRVYTATNVNTTTNNDGVND 582

Query: 1024 ASMNGVEINTGFTAGTADTTVSFD 1047
+IN G ++++ V D
Sbjct: 583 NGARFSDINIGNVVASSNSDVPLD 606


24    CMJKDNLE_01499    CMJKDNLE_01514    Y    Y    N    Pathogenicity Island (biased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
CMJKDNLE_01499    -2    14    -3.356425    D-Ala-D-Ala dipeptidase
CMJKDNLE_01500    -1    13    -4.090852    c-di-GMP phosphodiesterase, heme-regulated
CMJKDNLE_01501    -1    15    -5.223824    diguanylate cyclase
CMJKDNLE_01502    -1    17    -5.785224    putative lipoprotein
CMJKDNLE_01503    -1    19    -6.823020    glutamic acid:4-aminobutyrate antiporter
CMJKDNLE_01504    -1    22    -7.694265    putative zinc peptidase
CMJKDNLE_01505    0    23    -8.769653    putative porin protein
CMJKDNLE_01506    0    25    -7.532077    YddA complex
CMJKDNLE_01507    1    27    -7.329528    putative anaerobic sulfatase maturation enzyme;
CMJKDNLE_01508    1    27    -6.232551    putative sulfatase
CMJKDNLE_01509    1    27    -5.804247    YdeO DNA-binding transcriptional dual regulator
CMJKDNLE_01510    3    28    -5.202446    two-component system connector protein
CMJKDNLE_01511    1    26    -4.657596    acid resistance protein
CMJKDNLE_01512    1    29    -6.294745    putative fimbrial-like adhesin protein
CMJKDNLE_01513    3    22    -3.374898    putative fimbrial-like adhesin protein
CMJKDNLE_01514    2    19    -2.621988    putative fimbrial-like adhesin protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
CMJKDNLE_01513    FIMBRIALPAPF    32    5e-04    Escherichia coli: P pili tip fibrillum papF protein...
		>FIMBRIALPAPF#Escherichia coli: P pili tip fibrillum papF protein signature.

Length = 167

Score = 32.0 bits (72), Expect = 5e-04
Identities = 28/93 (30%), Positives = 46/93 (49%), Gaps = 7/93 (7%)

Query: 16 LFTATLQAADVTITVNGRVVAKPCTIQT-KEANVNLGDLYTRNLQQPGSASGWHNITLSL 74
L T+ ADV I + G V PCTI + V+ G++ N + ++ G +S+
Sbjct: 11 LLTSVAVLADVQINIRGNVYIPPCTINNGQNIVVDFGNI---NPEHVDNSRGEVTKNISI 67

Query: 75 TDCPVETSAVTAIVTGSTDNTGYYKNEGTAENI 107
+ CP ++ ++ VTG+T G +N A NI
Sbjct: 68 S-CPYKSGSLWIKVTGNTMGVG--QNNVLATNI 97


25    CMJKDNLE_01534    CMJKDNLE_01554    Y    Y    N    Pathogenicity Island (biased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
CMJKDNLE_01534    0    17    -3.666533    putative DNA-binding transcriptional regulator,
CMJKDNLE_01535    1    20    -4.692002    hypothetical protein
CMJKDNLE_01536    2    17    -1.750596    arabinose efflux transporter
CMJKDNLE_01537    1    16    -2.168143    putative transporter
CMJKDNLE_01538    0    16    -2.787664    MarR transcriptional repressor
CMJKDNLE_01539    0    18    -3.899915    MarA DNA-binding transcriptional dual regulator
CMJKDNLE_01540    0    19    -3.714698    multiple antibiotic resistance protein
CMJKDNLE_01541    0    19    -3.704262    O-acetylserine/cysteine export protein
CMJKDNLE_01542    1    20    -3.305095    putative transport protein YdeE
CMJKDNLE_01543    1    23    -3.340109    small membrane protein
CMJKDNLE_01545    0    20    -2.962686    diguanylate cyclase
CMJKDNLE_01546    0    19    -2.020937    stress response protein
CMJKDNLE_01547    -1    17    -1.947458    hypothetical protein
CMJKDNLE_01548    -1    19    -2.564688    dipeptidyl carboxypeptidase II
CMJKDNLE_01549    -1    19    -3.616175    3-hydroxy acid dehydrogenase monomer
CMJKDNLE_01550    -1    18    -3.415086    putative DNA-binding transcriptional regulator
CMJKDNLE_01551    -1    19    -3.422424    hypothetical protein
CMJKDNLE_01552    -1    18    -3.386070    putative mannonate dehydrogenase
CMJKDNLE_01553    -1    15    -3.888145    putative transporter
CMJKDNLE_01554    -1    18    -4.286316    putative oxidoreductase, Zn-dependent and
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
CMJKDNLE_01536    TCRTETB    53    7e-10    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 53.3 bits (128), Expect = 7e-10
Identities = 41/192 (21%), Positives = 83/192 (43%), Gaps = 8/192 (4%)

Query: 36 LSDIAQSFHMQTAQVGIMLTIYAWVVALMSLPFMLMTSQVERRKLLICLFVVFIASHVLS 95
L DIA F+ A + T + ++ + + ++ Q+ ++LL+ ++ V+
Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIG 96

Query: 96 FLSWS-FTVLVISRIGVAFAHAIFWSITASLAIRMAPAGKRAQALSLIATGTALAMVLGL 154
F+ S F++L+++R A F ++ + R P R +A LI + A+ +G
Sbjct: 97 FVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGP 156

Query: 155 PLGRIVGQYFGWRMTFFAIGIGALITLLCLIKLLPLLPSEHSGSLKSLPLLFRRPALMSI 214
+G ++ Y W I + +IT+ L+KLL + LMS+
Sbjct: 157 AIGGMIAHYIHWSY-LLLIPMITIITVPFLMKLLK------KEVRIKGHFDIKGIILMSV 209

Query: 215 YLLTVVVVTAHY 226
++ ++ T Y
Sbjct: 210 GIVFFMLFTTSY 221


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
CMJKDNLE_01542    TCRTETA    43    1e-06    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 42.9 bits (101), Expect = 1e-06
Identities = 42/239 (17%), Positives = 82/239 (34%), Gaps = 18/239 (7%)

Query: 7 RSTSALLASSLLLTIGRGATLPFMTIYLSRQYSLSVDLI---GYAMTIALTIGVVFSLGF 63
R +L++ L +G G +P + L R S D+ G + + + +
Sbjct: 5 RPLIVILSTVALDAVGIGLIMPVLPGLL-RDLVHSNDVTAHYGILLALYALMQFACAPVL 63

Query: 64 GILADKFDKKRYMLLAITAFASGFIAIPLVNNVTLVVLFFALINCAYSVFATVLKAWFAD 123
G L+D+F ++ +L+++ A + + + ++ + + + A A+ AD
Sbjct: 64 GALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAG-AYIAD 122

Query: 124 NLSSTSKTKIFSINYTMLNIGWTIGPPLGTLLVMQSINLPFWLAAICSAFPMLFIQIWVK 183
+ + F G GP LG L+ S + PF+ AA + L +
Sbjct: 123 ITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLP 182

Query: 184 RSEK---------IIATETGSVWSPKVLLQDKALLWFTCSGFLASFVSGAFASCISQYV 233
S K + W + A L F+ V A+ +
Sbjct: 183 ESHKGERRPLRREALNPLASFRW--ARGMTVVAALMAV--FFIMQLVGQVPAALWVIFG 237



Score = 32.5 bits (74), Expect = 0.003
Identities = 22/155 (14%), Positives = 60/155 (38%), Gaps = 2/155 (1%)

Query: 7 RSTSALLASSLLLTIGRGATLPFMTIYLSRQYSLSVDLIGYAMTIALTIGVVF-SLGFGI 65
+AL+A ++ + I+ ++ IG ++ + + ++ G
Sbjct: 210 TVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGP 269

Query: 66 LADKFDKKRYMLLAITAFASGFIAIPLVNNVTLVVLFFALINCAYSVFATVLKAWFADNL 125
+A + ++R ++L + A +G+I + + L+ + L+A + +
Sbjct: 270 VAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASG-GIGMPALQAMLSRQV 328

Query: 126 SSTSKTKIFSINYTMLNIGWTIGPPLGTLLVMQSI 160
+ ++ + ++ +GP L T + SI
Sbjct: 329 DEERQGQLQGSLAALTSLTSIVGPLLFTAIYAASI 363


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
CMJKDNLE_01549    DHBDHDRGNASE    100    2e-27    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 100 bits (249), Expect = 2e-27
Identities = 70/244 (28%), Positives = 114/244 (46%), Gaps = 16/244 (6%)

Query: 2 IVLVTGATAGFGECITRRFIQQGHKVIATGRRQERLQELKDELGDNLYIAQ---LDVRNR 58
I +TGA G GE + R QG + A E+L+++ L A+ DVR+
Sbjct: 10 IAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDS 69

Query: 59 AAIEEMLASLPAEWCNIDILVNNAGLALGMEPAHKASVEDWETMIDTNNKGLVYMTRAVL 118
AAI+E+ A + E IDILVN AG+ L H S E+WE N+ G+ +R+V
Sbjct: 70 AAIDEITARIEREMGPIDILVNVAGV-LRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 119 PGMVERNHGHIINIGSTAGSWPYAGGNVYGATKAFVRQFSLNLRTDLHGTAVRVTDIEPG 178
M++R G I+ +GS P Y ++KA F+ L +L +R + PG
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188

Query: 179 LVGGTEFSNVRFKGDDGKAE------KTYQNTVALT----PEDVSEAV-WWVSTLPAHVN 227
T+ + ++G + +T++ + L P D+++AV + VS H+
Sbjct: 189 ST-ETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHIT 247

Query: 228 INTL 231
++ L
Sbjct: 248 MHNL 251


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
CMJKDNLE_01553    TCRTETB    49    3e-08    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 48.7 bits (116), Expect = 3e-08
Identities = 33/118 (27%), Positives = 55/118 (46%), Gaps = 16/118 (13%)

Query: 72 VGAFIFGKMGDRIGRKKVLFITITMMGICTTLIGVLPTYAQIGVFAPILLVTLRIIQGLG 131
+G ++GK+ D++G K++L I + + + V ++ + + A R IQG G
Sbjct: 64 IGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMA-------RFIQGAG 116

Query: 132 AGAEISGAGTMLAEYAPKGKR----GIISSFVAMGTNCGTLSATAI-----WAFMFFI 180
A A + ++A Y PK R G+I S VAMG G I W+++ I
Sbjct: 117 AAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLI 174


26    CMJKDNLE_01668    CMJKDNLE_01675    Y    Y    N    Pathogenicity Island (biased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
CMJKDNLE_01668    3    15    -3.678330    putative inner membrane protein
CMJKDNLE_01670    1    20    -5.096965    hypothetical protein
CMJKDNLE_01671    1    17    -4.180564    putative transporter
CMJKDNLE_01672    0    17    -4.460987    putative inner membrane protein YdiN
CMJKDNLE_01673    0    18    -4.191115    shikimate dehydrogenase / quinate dehydrogenase
CMJKDNLE_01674    -2    16    -3.674878    3-dehydroquinate dehydratase
CMJKDNLE_01675    -2    15    -3.058150    fused predicted acetyl-CoA:acetoacetyl-CoA
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
CMJKDNLE_01671    TCRTETA    31    0.010    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 30.6 bits (69), Expect = 0.010
Identities = 58/311 (18%), Positives = 107/311 (34%), Gaps = 16/311 (5%)

Query: 61 FAGLLSDRFGRRPFIMLGMCCYMAFFFGILQTNNIIIAYVFGFLAGMANSFLDAGTYPSL 120
G LSDRFGRRP +++ + + + + + Y+ +AG+ + A +
Sbjct: 62 VLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATG-AVAGAYI 120

Query: 121 MEAFPRSPGTANI-LIKAFVSSGQFLLPLIISLLVWAELWFGWSFMIAAGIMFINALFLY 179
+ + + A G P++ L+ F AA + +N L
Sbjct: 121 ADITDGDERARHFGFMSACFGFGMVAGPVLGGLM--GGFSPHAPFFAAAALNGLNFLTGC 178

Query: 180 RCTFPPHPGRRLPV---IKKTTSSTEHRCSIIDLASYTLYGYISMATFYLVSQWLAQYGQ 236
H G R P+ +S + +A+ +I + + +G+
Sbjct: 179 FLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGE 238

Query: 237 FVAGMSYTM-SIKLLSIYTVGSLLCVFITAPLIRNTVRPTTLLMLYTFISFIALFTVCLH 295
T I L + + SL IT P+ L++ IA T +
Sbjct: 239 DRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALML-----GMIADGTGYIL 293

Query: 296 PTFYVVIIFAF-VIGFTSAGGVVQIGLTLMAERF--PYAKGKATGIYYSAGSIATFTIPL 352
F AF ++ ++GG+ L M R +G+ G + S+ + PL
Sbjct: 294 LAFATRGWMAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPL 353

Query: 353 ITAHLSQRSIA 363
+ + SI
Sbjct: 354 LFTAIYAASIT 364


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
CMJKDNLE_01672    TCRTETB    31    0.011    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 30.6 bits (69), Expect = 0.011
Identities = 38/177 (21%), Positives = 75/177 (42%), Gaps = 9/177 (5%)

Query: 12 ILAVLCIYFSYFLHGISVITLAQNMSSLAEKFSTDNAGIAYLISGIGLGRLISILFFGVI 71
IL LCI F ++ + L ++ +A F+ A ++ + L I +G +
Sbjct: 15 ILIWLCIL--SFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKL 72

Query: 72 SDKFGRRAVILMAVIMY----LLFFFGIPACPNLTLAYGLAVCVGIANSALDTGGYPALM 127
SD+ G + ++L +I+ ++ F G L +A + A AL
Sbjct: 73 SDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARY- 131

Query: 128 ECFPKASGSAVILVKAMVSFGQMFYPMLVSYMLLNNIWYGYGLIIPGILFVLITLML 184
+ G A L+ ++V+ G+ P + M+ + I + Y L+IP I + + ++
Sbjct: 132 -IPKENRGKAFGLIGSIVAMGEGVGP-AIGGMIAHYIHWSYLLLIPMITIITVPFLM 186


27    CMJKDNLE_01695    CMJKDNLE_01739    Y    Y    N    Pathogenicity Island (biased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
CMJKDNLE_01695    4    29    0.747656    integration host factor (IHF), alpha subunit
CMJKDNLE_01696    3    28    -0.076191    phenylalanyl-tRNA synthetase beta-chain
CMJKDNLE_01697    2    24    -3.134445    phenylalanyl-tRNA synthetase alpha-chain
CMJKDNLE_01699    0    24    -6.736708    50S ribosomal subunit protein L20
CMJKDNLE_01700    0    22    -6.910987    50S ribosomal subunit protein L35
CMJKDNLE_01701    1    21    -6.407221    protein chain initiation factor IF-3
CMJKDNLE_01702    0    19    -5.107601    threonyl-tRNA synthetase
CMJKDNLE_01703    3    30    -6.806220    hypothetical protein
CMJKDNLE_01704    1    27    -4.341690    regulator of acetyl CoA synthetase
CMJKDNLE_01705    1    21    -2.026892    small predicted membrane protein
CMJKDNLE_01706    0    20    -1.627986    hypothetical protein
CMJKDNLE_01707    0    19    -1.306321    6-phosphofructokinase-2 monomer
CMJKDNLE_01708    1    16    -1.462942    hypothetical protein
CMJKDNLE_01709    0    20    -3.580874    putative phosphotransferase/kinase
CMJKDNLE_01710    0    20    -4.108529    putative inner membrane protein
CMJKDNLE_01711    -1    15    -2.129466    2-deoxyglucose-6-phosphatase
CMJKDNLE_01712    -1    13    -2.167796    putative inner membrane protein regulated by
CMJKDNLE_01713    -2    13    -2.871854    putative transporter
CMJKDNLE_01714    -1    16    -4.630145    hypothetical protein
CMJKDNLE_01715    -2    14    -2.788436    cell division modulator
CMJKDNLE_01716    -1    12    -2.859363    heme d synthase / hydroperoxidase
CMJKDNLE_01717    0    18    -4.586051    chito-oligosaccharide mono-deacetylase
CMJKDNLE_01718    0    17    -5.100839    monoacetylchitobiose-6-phosphate hydrolase
CMJKDNLE_01719    0    17    -4.561974    ChbR DNA-binding transcriptional dual regulator
CMJKDNLE_01720    1    18    -2.450571    chitobiose / cellobiose PTS permease - ChbA
CMJKDNLE_01721    2    16    -2.155710    chitobiose / cellobiose PTS permease - ChbC
CMJKDNLE_01722    1    15    -1.795800    chitobiose / cellobiose PTS permease - ChbB
CMJKDNLE_01723    0    13    -0.541613    osmotically inducible protein OsmE
CMJKDNLE_01724    0    12    0.690485    NAD synthetase, NH3-dependent
CMJKDNLE_01725    0    12    2.546230    endonuclease of nucleotide excision repair
CMJKDNLE_01726    0    12    3.258510    hypothetical protein
CMJKDNLE_01727    -1    11    3.520972    ATP independent periplasmic chaperone
CMJKDNLE_01728    -1    11    3.609370    succinylglutamate desuccinylase
CMJKDNLE_01729    0    11    3.128023    succinylarginine dihydrolase
CMJKDNLE_01730    0    11    2.243982    aldehyde dehydrogenase
CMJKDNLE_01731    -1    13    0.762272    arginine succinyltransferase
CMJKDNLE_01732    0    13    0.441298    succinylornithine transaminase
CMJKDNLE_01733    1    15    0.750191    exonuclease III
CMJKDNLE_01734    3    17    1.817522    putative inner membrane protein
CMJKDNLE_01735    3    16    2.245071    hypothetical protein
CMJKDNLE_01736    3    14    2.782213    hypothetical protein
CMJKDNLE_01737    3    15    3.032155    hypothetical protein
CMJKDNLE_01738    3    15    2.882374    hypothetical protein
CMJKDNLE_01739    2    14    2.233686    YnjC/YnjD ABC transporter
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
CMJKDNLE_01695    DNABINDINGHU    119    3e-39    Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 119 bits (301), Expect = 3e-39
Identities = 34/89 (38%), Positives = 55/89 (61%)

Query: 4 TKAEMSEYLFDKLGLSKRDAKELVELFFEEIRRALENGEQVKLSGFGNFDLRDKNQRPGR 63
K ++ + + L+K+D+ V+ F + L GE+V+L GFGNF++R++ R GR
Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62

Query: 64 NPKTGEDIPITARRVVTFRPGQKLKSRVE 92
NP+TGE+I I A +V F+ G+ LK V+
Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAVK 91


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
CMJKDNLE_01730    DNABINDINGHU    31    0.002    Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 31.2 bits (71), Expect = 0.002
Identities = 14/61 (22%), Positives = 28/61 (45%), Gaps = 5/61 (8%)

Query: 74 SNKAELTAIIARETGKPRWEAATEVTAMINKIAISIKAYHVRTGEQRSEMPDGAASLRHR 133
+NK +L A +A T + ++A V A+ + ++ + GE+ + G +R R
Sbjct: 2 ANKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAK-----GEKVQLIGFGNFEVRER 56

Query: 134 P 134

Sbjct: 57 A 57


28    CMJKDNLE_01750    CMJKDNLE_01778    Y    Y    N    Pathogenicity Island (biased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
CMJKDNLE_01750    -1    14    -3.159313    protease IV
CMJKDNLE_01751    0    20    -4.698687    asparaginase I
CMJKDNLE_01752    -1    22    -5.694268    pyrazinamidase / nicotinamidase
CMJKDNLE_01753    0    24    -6.348719    YdjE MFS transporter
CMJKDNLE_01754    0    24    -5.400128    putative DNA-binding transcriptional regulator
CMJKDNLE_01755    0    21    -4.329572    methylglyoxal reductase (NADH-dependent)
CMJKDNLE_01756    1    22    -4.493547    putative kinase
CMJKDNLE_01757    1    21    -4.011649    putative aldolase
CMJKDNLE_01758    -1    18    -3.204585    putative oxidoreductase, Zn-dependent and
CMJKDNLE_01759    0    17    -2.598714    putative transporter
CMJKDNLE_01760    0    19    -2.334369    putative oxidoreductase, Zn-dependent and
CMJKDNLE_01761    0    22    -1.912147    hypothetical protein
CMJKDNLE_01762    2    18    -1.637799    methionine sulfoxide reductase B
CMJKDNLE_01763    1    16    -1.748017    glyceraldehyde 3-phosphate dehydrogenase-A
CMJKDNLE_01764    -1    10    -4.126015    hypothetical protein
CMJKDNLE_01765    -1    12    -4.698485    methylglyoxal reductase
CMJKDNLE_01766    0    12    -4.796375    scaffolding protein that interacts with murein
CMJKDNLE_01767    -1    14    -4.847554    protein kinase
CMJKDNLE_01768    -2    18    -5.441461    hypothetical protein
CMJKDNLE_01769    -1    22    -4.758912    putative diguanylate cyclase
CMJKDNLE_01770    -1    20    -1.692986    putative diguanylate cyclase
CMJKDNLE_01771    1    22    0.239752    hypothetical protein
CMJKDNLE_01772    0    21    -0.698019    hypothetical protein
CMJKDNLE_01773    0    21    -1.435220    putative DNA-binding transcriptional regulator
CMJKDNLE_01774    -1    20    -1.557537    putative amino acid/amine MFS transporter
CMJKDNLE_01775    0    20    -2.962016    hypothetical protein
CMJKDNLE_01776    1    22    -3.837967    hypothetical protein
CMJKDNLE_01777    -1    20    -4.988142    diguanylate cyclase
CMJKDNLE_01778    -1    23    -3.358688    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
CMJKDNLE_01752    ISCHRISMTASE    37    3e-05    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 37.3 bits (86), Expect = 3e-05
Identities = 36/192 (18%), Positives = 56/192 (29%), Gaps = 58/192 (30%)

Query: 2 PPRALLLV-DLQNDFCAGGALAVPEGDSTVDVANRLIDWCQSRGEAVI-----ASQD--- 52
P RA+LL+ D+QN F +L + C G V+ SQ+
Sbjct: 28 PNRAVLLIHDMQNYFVDAFTAGASPVTELSANIRKLKNQCVQLGIPVVYTAQPGSQNPDD 87

Query: 53 -------WHPANHGSFASQHGVEPYTPGQLDGLPQTFWPDHCVQNSEGAQLHPLLHQKAI 105
W P + + + P D + T W
Sbjct: 88 RALLTDFWGPGLNSGPYEEKIITELAPEDDDLV-LTKW---------------------- 124

Query: 106 AAVFHKGENPLVDSYSAFFDNGRRQKTSLDDWLRDHEIDELIVMGLATDYCVKFTVLDAL 165
YSAF +T+L + +R D+LI+ G+ T +A
Sbjct: 125 -------------RYSAFK------RTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAF 165

Query: 166 QLGYKVNVITDG 177
K + D
Sbjct: 166 MEDIKAFFVGDA 177


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
CMJKDNLE_01753    TCRTETB    40    2e-05    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 39.9 bits (93), Expect = 2e-05
Identities = 30/129 (23%), Positives = 50/129 (38%), Gaps = 1/129 (0%)

Query: 65 ALMFGYFIGSLTGGFIGDYFGRRRAFRINLLIVGIAATGAAFVPDMY-WLIFFRFLMGTG 123
A M + IG+ G + D G +R ++I + + LI RF+ G G
Sbjct: 57 AFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAG 116

Query: 124 MGALIMVGYASFTEFIPATVRGKWSARLSFVGNWSPMLSAAIGVVVIAFFSWQIMFLLGG 183
A + +IP RGK + + + AIG ++ + W + L+
Sbjct: 117 AAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPM 176

Query: 184 IGILLAWFL 192
I I+ FL
Sbjct: 177 ITIITVPFL 185


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
CMJKDNLE_01759    TCRTETB    31    0.011    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 31.0 bits (70), Expect = 0.011
Identities = 33/142 (23%), Positives = 48/142 (33%), Gaps = 23/142 (16%)

Query: 71 MFLGALVGGIIGDKTGRRNAFILYEAIHIASMVVGAFSPNMDF-LIACRFVMGVGLGALL 129
+G V G + D+ G + + I+ V+G + LI RF+ G G A
Sbjct: 62 FSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFP 121

Query: 130 VTLFAGFTEYMPGRNR----GTWSSRVSFIGNWSYPLCSLIAMGLTPLISA----EWNWR 181
+ Y+P NR G S V+ + G+ P I +W
Sbjct: 122 ALVMVVVARYIPKENRGKAFGLIGSIVA------------MGEGVGPAIGGMIAHYIHWS 169

Query: 182 VQLLIPAILSLIATALAWRYFP 203
LLIP I I T
Sbjct: 170 YLLLIPMI--TIITVPFLMKLL 189


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
CMJKDNLE_01772    PRTACTNFAMLY    28    0.022    Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 27.7 bits (61), Expect = 0.022
Identities = 18/61 (29%), Positives = 26/61 (42%)

Query: 49 QGLSIGIIILTIGVMAPIASGTLPPSTLIHSFLNWKSLVAIAVGVIVSWLGGRGVTLMGS 108
Q +I L IG + + LPPS ++ N ++ A VS LG +TL G
Sbjct: 174 QRSAIVDGGLHIGALQSLQPEDLPPSRVVLRDTNVTAVPASGAPAAVSVLGASELTLDGG 233

Query: 109 Q 109

Sbjct: 234 H 234


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
CMJKDNLE_01778    HTHTETR    30    6e-04    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 30.0 bits (67), Expect = 6e-04
Identities = 9/37 (24%), Positives = 17/37 (45%), Gaps = 5/37 (13%)

Query: 4 LSWIIFGLIAGILAKWIMPG-----KDGGGFFMTILL 35
+ I+ G I+G++ W+ K ++ ILL
Sbjct: 163 AAIIMRGYISGLMENWLFAPQSFDLKKEARDYVAILL 199


29    CMJKDNLE_01909    CMJKDNLE_02004    Y    Y    Y    Pathogenicity Island (biased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
CMJKDNLE_01909    2    13    -2.239273    periplasmic cystine-binding protein; member of
CMJKDNLE_01910    2    14    -2.067568    FliZ DNA-binding transcriptional regulator
CMJKDNLE_01911    0    10    -1.795051    RNA polymerase, sigma 28 (sigma F) factor
CMJKDNLE_01912    -1    11    -1.789176    flagellar biosynthesis; flagellin, filament
CMJKDNLE_01913    -2    16    0.374323    flagellar cap protein FliD; filament capping
CMJKDNLE_01914    -1    13    0.163600    flagellar biosynthesis protein FliS
CMJKDNLE_01915    -1    12    0.393789    flagellar biosynthesis protein FliT
CMJKDNLE_01916    0    13    -0.438062    alpha-amylase
CMJKDNLE_01917    0    20    -3.615725    hypothetical protein
CMJKDNLE_01918    0    19    -4.088600    putative inner membrane protein
CMJKDNLE_01919    -2    21    -3.408535    hypothetical protein
CMJKDNLE_01920    0    15    -1.739659    hypothetical protein
CMJKDNLE_01921    1    15    -0.977266    hypothetical protein
CMJKDNLE_01922    1    14    0.072238    hypothetical protein
CMJKDNLE_01923    0    16    4.378306    flagellar basal-body protein FliE
CMJKDNLE_01924    1    16    4.228204    flagellar M-ring protein FliF; basal-body
CMJKDNLE_01925    3    18    4.428672    flagellar motor switch protein FliG
CMJKDNLE_01926    2    16    4.158658    flagellar biosynthesis protein FliH
CMJKDNLE_01927    -1    17    3.273808    flagellum-specific ATP synthase FliI
CMJKDNLE_01928    -2    16    3.149904    flagellum-specific ATP synthase FliI
CMJKDNLE_01929    0    16    2.221814    flagellar biosynthesis protein FliJ
CMJKDNLE_01930    -1    16    2.242372    flagellar hook-length control protein FliK
CMJKDNLE_01931    -2    21    1.740401    flagellar biosynthesis
CMJKDNLE_01932    0    16    0.365025    flagellar motor switch protein FliM
CMJKDNLE_01933    1    16    -2.600077    flagellar motor switch protein FliN
CMJKDNLE_01934    1    17    -3.308043    flagellar biosynthesis protein FliO
CMJKDNLE_01935    1    19    -4.222553    flagellar biosynthesis protein FliP
CMJKDNLE_01936    1    20    -4.435319    flagellar biosynthesis protein FliQ
CMJKDNLE_01937    -2    16    -2.903387    flagellar biosynthesis protein FliR
CMJKDNLE_01938    -2    17    -2.380688    positive DNA-binding transcriptional regulator
CMJKDNLE_01939    -3    15    0.110687    hypothetical protein; transcription is regulated
CMJKDNLE_01940    -2    16    0.534818    stress-induced protein
CMJKDNLE_01942    -2    16    1.065471    putative phosphatase
CMJKDNLE_01943    0    15    0.896842    putative diguanylate cyclase
CMJKDNLE_01944    1    16    1.321360    hypothetical protein
CMJKDNLE_01945    2    17    1.177554    hypothetical protein
CMJKDNLE_01946    -1    12    -0.478793    putative membrane transport protein
CMJKDNLE_01947    -1    13    -1.508750    DNA mismatch endonuclease of the very short
CMJKDNLE_01948    -2    17    -4.680993    DNA-cytosine methyltransferase
CMJKDNLE_01949    -1    25    -6.648141    putative phosphohydrolase
CMJKDNLE_01951    0    30    -8.172831    hypothetical protein
CMJKDNLE_01952    -1    26    -6.693202    outer membrane pore protein N, non-specific
CMJKDNLE_01953    0    29    -6.576437    glyoxalase III, Hsp31 molecular chaperone
CMJKDNLE_01954    0    34    -7.955876    putative sensory kinase in two-component
CMJKDNLE_01955    1    29    -6.262509    putative DNA-binding response regulator in
CMJKDNLE_01956    0    21    -3.936481    hypothetical protein
CMJKDNLE_01957    0    17    -1.823777    reductase
CMJKDNLE_01958    0    18    -0.176387    hypothetical protein
CMJKDNLE_01959    0    17    1.869011    cadmium-induced cadmium binding protein
CMJKDNLE_01961    1    18    3.764577    *Mlc titration factor
CMJKDNLE_01963    1    19    4.819879    *KpLE2 phage-like element; predicted integrase
CMJKDNLE_01964    0    20    6.730387    anthranilate synthase component I
CMJKDNLE_01965    1    23    7.807161    muropeptide:H+ symporter
CMJKDNLE_01966    0    23    7.875317    ATP-binding lipopolysaccharide transport
CMJKDNLE_01967    0    23    7.945757    ATP-binding lipopolysaccharide transport
CMJKDNLE_01968    0    24    8.098829    SoxS DNA-binding transcriptional dual regulator
CMJKDNLE_01969    0    24    7.931492    fatty acyl-CoA synthetase
CMJKDNLE_01970    0    23    7.440904    hypothetical protein
CMJKDNLE_01971    0    19    3.910182    hypothetical protein
CMJKDNLE_01972    0    17    2.561882    Thioesterase PikA5
CMJKDNLE_01973    -1    19    -0.227227    enterobactin synthase multienzyme complex
CMJKDNLE_01974    0    20    -2.108911    outer membrane receptor involved in uptake of
CMJKDNLE_01975    0    26    -4.295369    adhesin
CMJKDNLE_01976    -1    28    -5.248562    adhesin
CMJKDNLE_01977    -2    31    -6.434564    adhesin
CMJKDNLE_01978    -1    30    -5.980543    hypothetical protein
CMJKDNLE_01979    -2    27    -4.038100    shikimate:H+ symporter
CMJKDNLE_01980    -1    29    -4.713542    AMP nucleosidase
CMJKDNLE_01981    0    32    -5.009492    hypothetical protein
CMJKDNLE_01982    1    30    -2.465070    hypothetical protein
CMJKDNLE_01984    1    28    -1.497455    *YeeO MATE transporter
CMJKDNLE_01986    1    29    -1.641459    *Cbl DNA-binding transcriptional activator
CMJKDNLE_01987    2    28    -1.303272    Nac DNA-binding transcriptional dual regulator
CMJKDNLE_01989    2    31    -5.281152    *L,D-transpeptidase ErfK
CMJKDNLE_01990    3    34    -7.557142    nicotinate-nucleotide dimethylbenzimidazole
CMJKDNLE_01991    3    29    -6.784937    cobalamin 5'-phosphate synthase / cobalamin
CMJKDNLE_01992    6    23    -2.468444    cobinamide-P guanylyltransferase / cobinamide
CMJKDNLE_01993    5    24    -1.688395    hypothetical protein
CMJKDNLE_01994    5    23    -1.765897    hypothetical protein
CMJKDNLE_01995    7    24    1.542398    hypothetical protein
CMJKDNLE_01996    9    25    3.276237    CP4-44 prophage; predicted GTP-binding protein
CMJKDNLE_01998    8    24    2.830290    CP4-44 prophage; antigen 43 (Ag43)
CMJKDNLE_01999    7    27    1.421186    CP4-44 prophage; predicted membrane protein
CMJKDNLE_02000    9    28    2.288083    hypothetical protein
CMJKDNLE_02001    9    27    3.075065    CP4-57 prophage; predicted protein
CMJKDNLE_02002    7    27    0.838662    CP4-6 prophage; predicted protein
CMJKDNLE_02003    3    17    -0.333631    CP4-44 prophage; predicted protein
CMJKDNLE_02004    2    15    -1.187509    CP4-44 prophage; antitoxin of the CbtA-CbeA
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
CMJKDNLE_01912    FLAGELLIN    163    1e-45    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 163 bits (413), Expect = 1e-45
Identities = 177/418 (42%), Positives = 223/418 (53%), Gaps = 6/418 (1%)

Query: 2 AQVINTNSLSLITQNNINKNQSALSSSIERLSSGLRINSAKDDAAGQAIANRFTSNIKGL 61
AQVINTNSLSL+TQNN+NK+QS+LSS+IERLSSGLRINSAKDDAAGQAIANRFTSNIKGL
Sbjct: 1 AQVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGL 60

Query: 62 TQAARNANDGISVAQTTEGALSEINNNLQRIRELTVQATTGTNSDSDLDSIQDEIKSRLD 121
TQA+RNANDGIS+AQTTEGAL+EINNNLQR+REL+VQAT GTNSDSDL SIQDEI+ RL+
Sbjct: 61 TQASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLE 120

Query: 122 EIDRVSGQTQFNGVNVLSKDGSMKIQVGANDGETITIDLKKIDSDTLNLAGFNVNGKGSV 181
EIDRVS QTQFNGV VLS+D MKIQVGANDGETITIDL+KID +L L GFNVNG
Sbjct: 121 EIDRVSNQTQFNGVKVLSQDNQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKEA 180

Query: 182 ANTAATSDDLKLAGFTKGTTDTNGVTAYTNTISNDKAKASDLLANITDGSVITGGGANAF 241
S + G+ N +++ + D +
Sbjct: 181 TVGDLKSSFKNVTGYDTYAVGANKYRV---DVNSGAVVTDTTAPTVPDKVYVNAANGQLT 237

Query: 242 GVAAKNGYTYDAASKSYSFAADGADSAKTLSIINPNTGDSSQATVTIGGKEQKVNISQDG 301
A+N D + S A A +I GD+ + K +G
Sbjct: 238 TDDAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNG 297

Query: 302 KITAADDNATLYL---DKQGNLTKTNAGNDTAATWDGLISNSDSTGAVPVGVATTITITS 358
K++ + + L D +A ++ + + ++
Sbjct: 298 KVSTTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSD 357

Query: 359 GTASGMSVQSAGAGIQTSTNSQILAGGAFAAKVSIEGGAATDILVASNGNITAADGNA 416
A+ + + + + AG T V++ N AA
Sbjct: 358 LEANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAAKK 415



Score = 92.8 bits (230), Expect = 7e-22
Identities = 91/332 (27%), Positives = 125/332 (37%)

Query: 338 SNSDSTGAVPVGVATTITITSGTASGMSVQSAGAGIQTSTNSQILAGGAFAAKVSIEGGA 397
++T +T A G + V+ G
Sbjct: 176 GPKEATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQ 235

Query: 398 ATDILVASNGNITAADGNALYLDATTGGFTTTAGGNTAASLDNLIANSKDATLTVTSGTG 457
T +N + A T T G
Sbjct: 236 LTTDDAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDG 295

Query: 458 QNTVYSTTGSGAQFTSLAKVDTVNVTNAHVSAEGMANLTKSNFTIDMGGTGTVTYTVSNG 517
V +T ++A + + + N+ S +
Sbjct: 296 NGKVSTTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKL 355

Query: 518 DVKAAANADVYVEDGALSANATKDVTYFEQKNGAITNSTGGTIYETADGKLTTEATTASS 577
A NA ++ ++ A + +A A
Sbjct: 356 SDLEANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAAKK 415

Query: 578 STADPLKALDEAISSIDKFRSSLGAVQNRLDSAVTNLNNTTTNLSEAQSRIQDADYATEV 637
STA+PL ++D A+S +D RSSLGA+QNR DSA+TNL NT TNL+ A+SRI+DADYATEV
Sbjct: 416 STANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATEV 475

Query: 638 SNMSKAQIIQQAGNSVLAKANQVPQQVLSLLQ 669
SNMSKAQI+QQAG SVLA+ANQVPQ VLSLL+
Sbjct: 476 SNMSKAQILQQAGTSVLAQANQVPQNVLSLLR 507


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
CMJKDNLE_01913    TYPE3OMBPROT    33    0.003    Type III secretion system outer membrane B protein ...
		>TYPE3OMBPROT#Type III secretion system outer membrane B protein family signature.

Length = 538

Score = 32.7 bits (74), Expect = 0.003
Identities = 24/72 (33%), Positives = 37/72 (51%), Gaps = 2/72 (2%)

Query: 214 NGMEVSVAAQNAQLTVNNVAIENSSNTISDALENITLNLNDVTTGNQTLTITQDTSKAQT 273
N E +VAA+N + + A+ + +S AL T++L V+T LT T T ++
Sbjct: 236 NSSERAVAARNKAEELVSAALYSRPELLSQALSGKTVDLKIVSTS--LLTPTSLTGGEES 293

Query: 274 AIKDWVNAYNSL 285
+KD VNA L
Sbjct: 294 MLKDQVNALKGL 305


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
CMJKDNLE_01918    RTXTOXIND    30    0.017    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 30.2 bits (68), Expect = 0.017
Identities = 10/57 (17%), Positives = 17/57 (29%), Gaps = 2/57 (3%)

Query: 164 RFTLLPIFRIPVKMQKVSAASPLTQKPDQARRRF--RLGMLVFFGMLGWALLTAMNQ 218
R L R + + + A L + P R R M ++L +
Sbjct: 26 RKQLDTPVREKDENEFLPAHLELIETPVSRRPRLVAYFIMGFLVIAFILSVLGQVEI 82


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01919  PF01206       93     6e-29    SirA family protein
		>PF01206#SirA family protein

Length = 76

Score = 92.5 bits (230), Expect = 6e-29
Identities = 16/71 (22%), Positives = 37/71 (52%)

Query: 7 DYRLDMVGEPCPYPAVATLEAMPQLKKGEILEVVSDCPQSINNIPLDARNHGYTVLDIQQ 66
D LD G CP P + + + + GE+L V++ P S+ + ++ G+ +L+ ++
Sbjct: 5 DQSLDATGLNCPLPILKAKKTLATMNAGEVLYVMATDPGSVKDFESFSKQTGHELLEQKE 64

Query: 67 DGPTIRYLIQK 77
+ T + +++
Sbjct: 65 EDGTYHFRLKR 75


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01923  FLGHOOKFLIE   117    5e-38    Flagellar hook-basal body complex protein FliE signa...
		>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE signature.

Length = 103

Score = 117 bits (294), Expect = 5e-38
Identities = 103/103 (100%), Positives = 103/103 (100%)

Query: 2 SAIQGIEGVISQLQATAMSARAQESLPQPTISFAGQLHAALDRISDTQTAARTQAEKFTL 61
SAIQGIEGVISQLQATAMSARAQESLPQPTISFAGQLHAALDRISDTQTAARTQAEKFTL
Sbjct: 1 SAIQGIEGVISQLQATAMSARAQESLPQPTISFAGQLHAALDRISDTQTAARTQAEKFTL 60

Query: 62 GEPGVALNDVMTDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV 104
GEPGVALNDVMTDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV
Sbjct: 61 GEPGVALNDVMTDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV 103


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01924  FLGMRINGFLIF  756    0.0      Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 756 bits (1952), Expect = 0.0
Identities = 478/555 (86%), Positives = 514/555 (92%), Gaps = 5/555 (0%)

Query: 3 ATAAQTKSLEWLNRLRANPKIPLIVAGSAAVAVMVALILWAKAPDYRTLFSNLSDQDGGA 62
+TA Q K LEWLNRLRANP+IPLIVAGSAAVA++VA++LWAK PDYRTLFSNLSDQDGGA
Sbjct: 5 STATQPKPLEWLNRLRANPRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQDGGA 64

Query: 63 IVSQLTQMNIPYRFSEASGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQ 122
IV+QLTQMNIPYRF+ SGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQ
Sbjct: 65 IVAQLTQMNIPYRFANGSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQ 124

Query: 123 FSEQVNYQRALEGELSRTIETIGPVKGARVHLAMPKPSLFVREQKSPSASVTVNLLPGRA 182
FSEQVNYQRALEGEL+RTIET+GPVK ARVHLAMPKPSLFVREQKSPSASVTV L PGRA
Sbjct: 125 FSEQVNYQRALEGELARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRA 184

Query: 183 LDEGQISAIVHLVSSAVAGLPPGNVTLVDQGGHLLTQFNTSGRDLNDAQLKYASDVEGRI 242
LDEGQISA+VHLVSSAVAGLPPGNVTLVDQ GHLLTQ NTSGRDLNDAQLK+A+DVE RI
Sbjct: 185 LDEGQISAVVHLVSSAVAGLPPGNVTLVDQSGHLLTQSNTSGRDLNDAQLKFANDVESRI 244

Query: 243 QRRIEAILSPIVGNGNIHAQVTAQLDFASKEQTEEQYRPNGDESHAALRSRQLNESEQSG 302
QRRIEAILSPIVGNGN+HAQVTAQLDFA+KEQTEE Y PNGD S A LRSRQLN SEQ G
Sbjct: 245 QRRIEAILSPIVGNGNVHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNISEQVG 304

Query: 303 SGYPGGVPGALSNQPAPANNAPISTPPANQNNRQQ--QASTTSNS---GPRSTQRNETSN 357
+GYPGGVPGALSNQPAP N API+TPP NQ N Q Q ST++NS GPRSTQRNETSN
Sbjct: 305 AGYPGGVPGALSNQPAPPNEAPIATPPTNQQNAQNTPQTSTSTNSNSAGPRSTQRNETSN 364

Query: 358 YEVDRTIRHTKMNVGDVQRLSVAVVVNYKTLPDGKPLPLSNEQMKQIEDLTREAMGFSEK 417
YEVDRTIRHTKMNVGD++RLSVAVVVNYKTL DGKPLPL+ +QMKQIEDLTREAMGFS+K
Sbjct: 365 YEVDRTIRHTKMNVGDIERLSVAVVVNYKTLADGKPLPLTADQMKQIEDLTREAMGFSDK 424

Query: 418 RGDSLNVVNSPFNSSDESGGELPFWQQQAFIDQLLAAGRWLLVLLVAWLLWRKAVRPQLT 477
RGD+LNVVNSPF++ D +GGELPFWQQQ+FIDQLLAAGRWLLVL+VAW+LWRKAVRPQLT
Sbjct: 425 RGDTLNVVNSPFSAVDNTGGELPFWQQQSFIDQLLAAGRWLLVLVVAWILWRKAVRPQLT 484

Query: 478 RRAEAMKAVQQQAQAREEVEDAVEVRLSKDEQLQQRRANQRLGAEVMSQRIREMSDNDPR 537
RR E KA Q+QAQ R+E E+AVEVRLSKDEQLQQRRANQRLGAEVMSQRIREMSDNDPR
Sbjct: 485 RRVEEAKAAQEQAQVRQETEEAVEVRLSKDEQLQQRRANQRLGAEVMSQRIREMSDNDPR 544

Query: 538 VVALVIRQWINNDHE 552
VVALVIRQW++NDHE
Sbjct: 545 VVALVIRQWMSNDHE 559


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01925  FLGMOTORFLIG  341    e-119    Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 341 bits (876), Expect = e-119
Identities = 117/329 (35%), Positives = 197/329 (59%), Gaps = 2/329 (0%)

Query: 1 MSNLTGTDKSVILLMTIGEDRAAEVFKHLSQREVQTLSAAMANVTQISNKQLTDVLAEFE 60
+S LTG K+ ILL++IG + +++VFK+LSQ E+++L+ +A + I+++ +VL EF+
Sbjct: 12 VSALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITSELKDNVLLEFK 71

Query: 61 QEAEQFAALNINANDYLRSVLVKALGEERAASLLEDILETRDTASGIETLNFMEPQSAAD 120
+ + DY R +L K+LG ++A ++ + L + + E + +P + +
Sbjct: 72 ELMMAQEFIQKGGIDYARELLEKSLGTQKAVDIINN-LGSALQSRPFEFVRRADPANILN 130

Query: 121 LIRDEHPQIIATILVHLKRAQAADILALFDERLRHDVMLRIATFGGVQPAALAELTEVLN 180
I+ EHPQ IA IL +L +A+ IL+ ++ +V RIA P + E+ VL
Sbjct: 131 FIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEVVREVERVLE 190

Query: 181 GLLDGQ-NLKRSKMGGVRTAAEIINLMKTQQEEAVITAVREFDGELAQKIIDEMFLFENL 239
L + + GGV EIIN+ + E+ +I ++ E D ELA++I +MF+FE++
Sbjct: 191 KKLASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEIKKKMFVFEDI 250

Query: 240 VDVDDRSIQRLLQEVDSESLLIALKGAEQPLREKFLRNMSQRAADILRDDLANRGPVRLS 299
V +DDRSIQR+L+E+D + L ALK + P++EK +NMS+RAA +L++D+ GP R
Sbjct: 251 VLLDDRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKEDMEFLGPTRRK 310

Query: 300 QVENEQKAILLIVRRLAETGEMVIGSGED 328
VE Q+ I+ ++R+L E GE+VI G +
Sbjct: 311 DVEESQQKIVSLIRKLEEQGEIVISRGGE 339


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01926  FLGFLIH       373    e-135    Flagellar assembly protein FliH signature.
		>FLGFLIH#Flagellar assembly protein FliH signature.

Length = 228

Score = 373 bits (959), Expect = e-135
Identities = 226/228 (99%), Positives = 228/228 (100%)

Query: 1 MSDNLPWKTWTPDDLAPPQAEFVPMVEPEETIIEEAEPSLEQQLAQLQMQAHEQGYQAGI 60
MSDNLPWKTWTPDDLAPPQAEFVP+VEPEETIIEEAEPSLEQQLAQLQMQAHEQGYQAGI
Sbjct: 1 MSDNLPWKTWTPDDLAPPQAEFVPIVEPEETIIEEAEPSLEQQLAQLQMQAHEQGYQAGI 60

Query: 61 AEGRQQGHEQGYQEGLAQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTLDALDSVIASRL 120
AEGRQQGH+QGYQEGLAQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTLDALDSVIASRL
Sbjct: 61 AEGRQQGHKQGYQEGLAQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTLDALDSVIASRL 120

Query: 121 MQMALEAARQVIGQTPTVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRVDDMLGAT 180
MQMALEAARQVIGQTPTVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRVDDMLGAT
Sbjct: 121 MQMALEAARQVIGQTPTVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRVDDMLGAT 180

Query: 181 LSLHGWRLRGDPTLHPGGCKVSADEGDLDASVATRWQELCRLAAPGVV 228
LSLHGWRLRGDPTLHPGGCKVSADEGDLDASVATRWQELCRLAAPGVV
Sbjct: 181 LSLHGWRLRGDPTLHPGGCKVSADEGDLDASVATRWQELCRLAAPGVV 228


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01929  FLGFLIJ       202    2e-70    Flagellar FliJ protein signature.
		>FLGFLIJ#Flagellar FliJ protein signature.

Length = 147

Score = 202 bits (515), Expect = 2e-70
Identities = 146/147 (99%), Positives = 147/147 (100%)

Query: 1 MAEHGALATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAG 60
MAEHGALATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAG
Sbjct: 1 MAEHGALATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAG 60

Query: 61 MTSNRWINYQQFIQTLEKAITQHRQQLNQWTQKVDIALNSWREKKQRLQAWQTLQERQST 120
+TSNRWINYQQFIQTLEKAITQHRQQLNQWTQKVDIALNSWREKKQRLQAWQTLQERQST
Sbjct: 61 ITSNRWINYQQFIQTLEKAITQHRQQLNQWTQKVDIALNSWREKKQRLQAWQTLQERQST 120

Query: 121 AALLAENRLDQKKMDEFAQRAAMRKPE 147
AALLAENRLDQKKMDEFAQRAAMRKPE
Sbjct: 121 AALLAENRLDQKKMDEFAQRAAMRKPE 147


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01930  FLGHOOKFLIK   461    e-165    Flagellar hook-length control protein signature.
		>FLGHOOKFLIK#Flagellar hook-length control protein signature.

Length = 375

Score = 461 bits (1186), Expect = e-165
Identities = 361/375 (96%), Positives = 366/375 (97%)

Query: 1 MIRLAPLITADVDTTTLPGGKASDAAQDFLALLSEALAGETTTDKAAPQLLVATDKPTTK 60
MIRLAPLITADVDTTTLPGGKASDAAQDFLALLSEALAGETTTDKAAPQLLVATDKPTTK
Sbjct: 1 MIRLAPLITADVDTTTLPGGKASDAAQDFLALLSEALAGETTTDKAAPQLLVATDKPTTK 60

Query: 61 GEPLVSEILADAQQADLLIPVDETPPVINDEQSTSTPLTTAQTMTLAAVAGNNTAKDEKA 120
GEPL+S+I++DAQQA+LLIPVDETPPVINDEQSTSTPLTTAQTM LAAVA NT KDEKA
Sbjct: 61 GEPLISDIVSDAQQANLLIPVDETPPVINDEQSTSTPLTTAQTMALAAVADKNTTKDEKA 120

Query: 121 DDLNEDVTASLSALFAMLPGFDNTPKVTDVPSTVLPAEKPTLFTKLTSAQLTTAQPDDAP 180
DDLNEDVTASLSALFAMLPGFDNTPKVTD PSTVLP EKPTLFTKLTS QLTTAQPDDAP
Sbjct: 121 DDLNEDVTASLSALFAMLPGFDNTPKVTDAPSTVLPTEKPTLFTKLTSEQLTTAQPDDAP 180

Query: 181 GTPAQPLTPLVAEAQSKAEVISTPSPVTAAASPLITPHQTQPLPTVAAPVLSAPLGSHEW 240
GTPAQPLTPLVAEAQSKAEVISTPSPVTAAASPLITPHQTQPLPTVAAPVLSAPLGSHEW
Sbjct: 181 GTPAQPLTPLVAEAQSKAEVISTPSPVTAAASPLITPHQTQPLPTVAAPVLSAPLGSHEW 240

Query: 241 QQSLSQHISLFTRQGQQSAELRLHPQDLGEVQISLKVDDNQAQIQMVSPHQHVRAALEAA 300
QQSLSQHISLFTRQGQQSAELRLHPQDLGEVQISLKVDDNQAQIQMVSPHQHVRAALEAA
Sbjct: 241 QQSLSQHISLFTRQGQQSAELRLHPQDLGEVQISLKVDDNQAQIQMVSPHQHVRAALEAA 300

Query: 301 LPVLRTQLAESGIQLGQSNISGESFSGQQQAASQQQQSQRTANHEPLAGEDDDTLPVPVS 360
LPVLRTQLAESGIQLGQSNISGESFSGQQQAASQQQQSQRTANHEPLAGEDDDTLPVPVS
Sbjct: 301 LPVLRTQLAESGIQLGQSNISGESFSGQQQAASQQQQSQRTANHEPLAGEDDDTLPVPVS 360

Query: 361 LQVRVTGNSSVDIFA 375
LQ RVTGNS VDIFA
Sbjct: 361 LQGRVTGNSGVDIFA 375


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01932  FLGMOTORFLIM  381    e-135    Flagellar motor switch protein FliM signature.
		>FLGMOTORFLIM#Flagellar motor switch protein FliM signature.

Length = 344

Score = 381 bits (979), Expect = e-135
Identities = 85/324 (26%), Positives = 148/324 (45%), Gaps = 10/324 (3%)

Query: 5 ILSQAEIDALLNGDS--EVKDEPTASVSGESDIRPYDPNTQRRVVRERLQALEIINERFA 62
+LSQ EID LL S + E +S I YD + +E+++ L +++E FA
Sbjct: 4 VLSQDEIDQLLTAISSGDASIEDARPISDTRKITLYDFRRPDKFSKEQMRTLSLMHETFA 63

Query: 63 RHFRMGLFNLLRRSPDITVGAIRIQPYHEFARNLPVPTNLNLIHLKPLRGTGLVVFSPSL 122
R L LR + V ++ Y EF R++P P+ L +I + PL+G ++ PS+
Sbjct: 64 RLTTTSLSAQLRSMVHVHVASVDQLTYEEFIRSIPTPSTLAVITMDPLKGNAVLEVDPSI 123

Query: 123 VFIAVDNLFGGDGRFPTKVEGREFTHTEQRVINRMLKLALEGYSDAWKAINPLEVEYVRS 182
F +D LFGG G+ KV+ R+ T E V+ ++ L ++W + L +
Sbjct: 124 TFSIIDRLFGGTGQ-AAKVQ-RDLTDIENSVMEGVIVRILANVRESWTQVIDLRPRLGQI 181

Query: 183 EMQVKFTNITTSPNDIVVNTPFHVEIGNLTGEFNICLPFSMIEPLRELLVNPPLENS--R 240
E +F I P+++VV ++G G N C+P+ IEP+ L + +S R
Sbjct: 182 ETNPQFAQI-VPPSEMVVLVTLETKVGEEEGMMNFCIPYITIEPIISKLSSQFWFSSVRR 240

Query: 241 NEDQNWRDNLVRQVQHSQLELVANFADISLRLSQILKLKPGDVLPIEKP---DRIIAHVD 297
+ + L ++ +++VA + L + IL L+ GD++ + D + +
Sbjct: 241 SSTTQYMGVLRDKLSTVDMDVVAEVGSLRLSVRDILGLRVGDIIRLHDTHVGDPFVLSIG 300

Query: 298 GVPVLTSQYGTLNGQYALRIEHLI 321
Q G + + A +I I
Sbjct: 301 NRKKFLCQPGVVGKKIAAQILERI 324


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01933  FLGMOTORFLIN  212    1e-74    Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 212 bits (542), Expect = 1e-74
Identities = 125/137 (91%), Positives = 134/137 (97%)

Query: 1 MSDMNNPADDNNGAMDDLWAEALSEQKSTSSKSAADAVFQQFGGGDVSGTLQDIDLIMDI 60
MSDMNNP+D+N GA+DDLWA+AL+EQK+T++KSAADAVFQQ GGGDVSG +QDIDLIMDI
Sbjct: 1 MSDMNNPSDENTGALDDLWADALNEQKATTTKSAADAVFQQLGGGDVSGAMQDIDLIMDI 60

Query: 61 PVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYGV 120
PVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYGV
Sbjct: 61 PVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYGV 120

Query: 121 RITDIITPSERMRRLSR 137
RITDIITPSERMRRLSR
Sbjct: 121 RITDIITPSERMRRLSR 137


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01935  FLGBIOSNFLIP  334    e-119    Escherichia coli: Flagellar biosynthetic protein Fl...
		>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature.

Length = 245

Score = 334 bits (857), Expect = e-119
Identities = 244/245 (99%), Positives = 245/245 (100%)

Query: 1 MRRLLSVAPVLLWLVTPLAFAQLPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILLM 60
MRRLLSVAPVLLWL+TPLAFAQLPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILLM
Sbjct: 1 MRRLLSVAPVLLWLITPLAFAQLPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILLM 60

Query: 61 MTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEEK 120
MTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEEK
Sbjct: 61 MTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEEK 120

Query: 121 ISMQEALEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSELK 180
ISMQEALEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSELK
Sbjct: 121 ISMQEALEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSELK 180

Query: 181 TAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLA 240
TAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLA
Sbjct: 181 TAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLA 240

Query: 241 QSFYS 245
QSFYS
Sbjct: 241 QSFYS 245


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01936  TYPE3IMQPROT  67     1e-18    Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature.

Length = 86

Score = 67.1 bits (164), Expect = 1e-18
Identities = 22/78 (28%), Positives = 42/78 (53%)

Query: 4 ESVMMMGTEAMKVALALAAPLLLVALVTGLIISILQAATQINEMTLSFIPKIIAVFIAII 63
+ ++ G +A+ + L L+ +VA + GL++ + Q TQ+ E TL F K++ V + +
Sbjct: 2 DDLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLF 61

Query: 64 IAGPWMLNLLLDYVRTLF 81
+ W +LL Y R +
Sbjct: 62 LLSGWYGEVLLSYGRQVI 79


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01937  TYPE3IMRPROT  202    6e-67    Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature.

Length = 261

Score = 202 bits (516), Expect = 6e-67
Identities = 256/261 (98%), Positives = 259/261 (99%)

Query: 1 MMQVTSDQWLSWLSLYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPA 60
M+QVTS+QWLSWL+LYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPA
Sbjct: 1 MLQVTSEQWLSWLNLYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPA 60

Query: 61 NDVPVFSFFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPGSHL 120
NDVPVFSFFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDP SHL
Sbjct: 61 NDVPVFSFFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHL 120

Query: 121 NMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGSEPLNSNAFLALTKAGSLIF 180
NMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIG EPLNSNAFLALTKAGSLIF
Sbjct: 121 NMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLIF 180

Query: 181 LNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFC 240
LNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFC
Sbjct: 181 LNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFC 240

Query: 241 EHLFSEIFNLLADIISELPLI 261
EHLFSEIFNLLADIISELPLI
Sbjct: 241 EHLFSEIFNLLADIISELPLI 261


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01948  PF05272       29     0.045    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.3 bits (65), Expect = 0.045
Identities = 20/62 (32%), Positives = 29/62 (46%), Gaps = 15/62 (24%)

Query: 320 AKYILTPVLWKYLYRYAKKHQARGNGFGYGMVYPNNPQSVTRTLSARYYKDGAEILIDRG 379
A+Y + PVLW Y+ R+ K + G+ VY +R +DG+E RG
Sbjct: 166 ARYQVGPVLWGYVVRFIK---SDGDKLTLPYVY------------SRSQRDGSEAWKWRG 210

Query: 380 WD 381
WD
Sbjct: 211 WD 212


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01949  CARBMTKINASE  34     3e-04    Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 34.4 bits (79), Expect = 3e-04
Identities = 22/92 (23%), Positives = 35/92 (38%), Gaps = 9/92 (9%)

Query: 37 AQKLAADDDVDMLVILTACYFHDIVTLAKNHPQRQRSSILAAEETRRLLREEFVQFPA-- 94
+KLA + + D+ +ILT + L + Q + EE R+ E F A
Sbjct: 219 GEKLAEEVNADIFMILTDV---NGAALYYGTEKEQWLREVKVEELRKYYEEG--HFKAGS 273

Query: 95 --EKIEAVCHAIAAHSFSAQIAPLTTEAKIVQ 124
K+ A I A IA L + ++
Sbjct: 274 MGPKVLAAIRFIEWGGERAIIAHLEKAVEALE 305


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01952  ECOLIPORIN    540    0.0      E.coli/Salmonella-type porin signature.
		>ECOLIPORIN#E.coli/Salmonella-type porin signature.

Length = 383

Score = 540 bits (1393), Expect = 0.0
Identities = 264/399 (66%), Positives = 307/399 (76%), Gaps = 22/399 (5%)

Query: 1 MKRKVLAMLVPALLVAGAANAAEVYNKDGNKLDLYGKVAGLHYFSDDAGSDGDKSYARIG 60
MKRKVLA+++PALL AGAA+AAE+YNKDGNKLDLYGKV GLHYFSDD+ DGD++Y R+G
Sbjct: 1 MKRKVLALVIPALLAAGAAHAAEIYNKDGNKLDLYGKVDGLHYFSDDSSKDGDQTYMRVG 60

Query: 61 FKGETQIADQFTGYGQWEFNIGANGTESDKGNTATRLAFAGLGFGQNGTFDYGRNYGVVY 120
FKGETQI DQ TGYGQWE+N+ AN TE + N+ TRLAFAGL FG G+FDYGRNYGV+Y
Sbjct: 61 FKGETQINDQLTGYGQWEYNVQANTTEGEGANSWTRLAFAGLKFGDYGSFDYGRNYGVLY 120

Query: 121 DVEAWTDMLPEFGGDTYAGADNFMNGRANGVATYRNNGFFGQVDGLNFALQYQSNNEN-S 179
DVE WTDMLPEFGGD+Y ADN+M GRANGVATYRN FFG VDGLNFALQYQ NE+ S
Sbjct: 121 DVEGWTDMLPEFGGDSYTYADNYMTGRANGVATYRNTDFFGLVDGLNFALQYQGKNESQS 180

Query: 180 GGLFGQEGSGKGKGRDIAQENGDGFGMSTSYDFDFGLSLGAAYSNSDRTDNQVHKGWHNT 239
+ + G DI +NGDGFG+ST+YD G S GAAY+ SDRT+ QV
Sbjct: 181 ADDVNIGTNNRNNGDDIRYDNGDGFGISTTYDIGMGFSAGAAYTTSDRTNEQV------- 233

Query: 240 RDGDRSDTTAGGETAEAWTVGAKYDANNVYLAAMYAETRNMTGYGKVDA-----IANKTQ 294
+ T AGG+ A+AWT G KYDANN+YLA MY+ETRNMT YGK D +ANKTQ
Sbjct: 234 ---NAGGTIAGGDKADAWTAGLKYDANNIYLATMYSETRNMTPYGKTDKGYDGGVANKTQ 290

Query: 295 NFEVVAQYQFDFGLRPSIAYLQSKGKDLGGWAHDGNGDPRYTNKDLVKYVDIGATYYFNK 354
NFEV AQYQFDFGLRP++++L SKGKDL + +KDLVKY D+GATYYFNK
Sbjct: 291 NFEVTAQYQFDFGLRPAVSFLMSKGKDLTY------NNVNGDDKDLVKYADVGATYYFNK 344

Query: 355 NMSTYVDYKINLLDNDDDFYKENGIATDDIVAVGLVYQF 393
N STYVDYKINLLD+DD FYK+ GI+TDDIVA+G+VYQF
Sbjct: 345 NFSTYVDYKINLLDDDDPFYKDAGISTDDIVALGMVYQF 383


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01954  PF06580       38     7e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 37.5 bits (87), Expect = 7e-05
Identities = 29/147 (19%), Positives = 58/147 (39%), Gaps = 21/147 (14%)

Query: 308 DTLSLNKEVENLLDYL--EYLSDEKEIRFKVECNQQIFADKI---LLQRMLSNLIVNAIR 362
+SL E+ + YL + E ++F+ + N I ++ L+Q ++ N I + I
Sbjct: 214 RQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPMLVQTLVENGIKHGIA 273

Query: 363 YSPEKSRIHITSFLDANGSLNIDIASPGTKINEPEKLFRRFWRGDNSRHSVGQGLGLSLV 422
P+ +I + D NG++ +++ + G+ + K G GL V
Sbjct: 274 QLPQGGKILLKGTKD-NGTVTLEVENTGSLALKNTKE--------------STGTGLQNV 318

Query: 423 KA-IAELHGGSATYHYLSKHNVFRITL 448
+ + L+G A K +
Sbjct: 319 RERLQMLYGTEAQIKLSEKQGKVNAMV 345


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01955  HTHFIS        84     9e-21    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 84.5 bits (209), Expect = 9e-21
Identities = 30/117 (25%), Positives = 60/117 (51%), Gaps = 1/117 (0%)

Query: 39 KILLIEDNQRTQEWVTQGLSEAGYVIDAVSDGRDGLYLALKDDYALIILDIMLPGMDGWQ 98
IL+ +D+ + + Q LS AGY + S+ D L++ D+++P + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 99 ILQTLRTA-KQTPVICLTARDSVDDRVRGLDSGANDYLVKPFSFSELLARVRAQLRQ 154
+L ++ A PV+ ++A+++ ++ + GA DYL KPF +EL+ + L +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01969  ISCHRISMTASE  51     2e-08    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 51.2 bits (122), Expect = 2e-08
Identities = 22/70 (31%), Positives = 44/70 (62%)

Query: 22 QQLRERLIQELNLTPQQLHEESNLIQAGLDSIRLMRWLHWFRKNGYRLTLRELYAAPTLA 81
+ +R+++ + L TP+ + ++ +L+ GLDS+R+M + +R+ G +T EL PT+
Sbjct: 233 ENIRKQIAELLQETPEDITDQEDLLDRGLDSVRIMTLVEQWRREGAEVTFVELAERPTIE 292

Query: 82 AWNQLMLSRS 91
W +L+ +RS
Sbjct: 293 EWQKLLTTRS 302


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01970  DHBDHDRGNASE  46     1e-06    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 45.8 bits (108), Expect = 1e-06
Identities = 32/156 (20%), Positives = 55/156 (35%), Gaps = 19/156 (12%)

Query: 1561 LVTGAFGGLGRLAVNWLREKGARRIALLAPRVDESWLRDVEGGQTRVCR------CDVGD 1614
+TGA G+G L +GA + A + L V R DV D
Sbjct: 12 FITGAAQGIGEAVARTLASQGAH---IAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 1615 AGQLATVLDDLAAN-GGIAGAIHAAGVLADAPLQELDDHQLAAVFAVKAQAASQLLQTLR 1673
+ + + + G I ++ AGVL + L D + A F+V + +++
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 1674 NH-----DGRYLILYSSAAAT----LGAPGQSAHAL 1700
+ G + + S+ A + A S A
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAA 164


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01975  INTIMIN       73     9e-18    Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 73.2 bits (179), Expect = 9e-18
Identities = 20/60 (33%), Positives = 29/60 (48%), Gaps = 3/60 (5%)

Query: 86 QQIASTSQLIGSLLAEDMNSEQAANIARGWASSQASGVMTDWLSRFGTARITLGVNEDFS 145
QQ AS + S +N + A + A G A +QAS + WL +GTA + L +F
Sbjct: 168 QQAASLGSQLQS---RSLNGDYAKDTALGIAGNQASSQLQAWLQHYGTAEVNLQSGNNFD 224


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01976  INTIMIN       57     1e-10    Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 56.6 bits (136), Expect = 1e-10
Identities = 61/263 (23%), Positives = 90/263 (34%), Gaps = 20/263 (7%)

Query: 175 IAVKAHVNDQFGNPVTHQPATFSAAPSSQMIISQNTVSTNTQGVAEVTMTPERNGSYTVK 234
I A V G + P +F+ S ++S N+ +TN G A VT+ ++ G V
Sbjct: 578 ITYTATVKKN-GVAQANVPVSFNIV-SGTAVLSANSANTNGSGKATVTLKSDKPGQVVVS 635

Query: 235 ASLANGASLEKQLEAI---DEKLTLTSSPLIGVNAPKGATLTATLT---SANGTPVEGQV 288
A A S I K ++T A T T PV Q
Sbjct: 636 AKTAEMTSALNANAVIFVDQTKASITEIKADKTTAVANGQDAITYTVKVMKGDKPVSNQE 695

Query: 289 INFSVTPEGATLSGGKVRTNSSGQAPVVLTSNKVGTYTVTASFHNGVTIQTQTTVKVTGN 348
+ F+ T LS +T+++G A V LTS G V+A + V+
Sbjct: 696 VTFTTT--LGKLSNSTEKTDTNGYAKVTLTSTTPGKSLVSARVSDVAVDVKAPEVEFFTT 753

Query: 349 PSTTHVASFIADPSTIAATNSDLSTLKATVEDGSGNL-IEGLTVYFALKSGSTTLTSLTA 407
+ I T ++ G NL G + +S + + S
Sbjct: 754 LTID------DGNIEIVGTGVKGKLPTVWLQYGQVNLKASGGNGKYTWRSANPAIAS--- 804

Query: 408 VTDQNGIATTSVKGEITGSVTVS 430
V +G T KG T SV S
Sbjct: 805 VDASSGQVTLKEKGTTTISVISS 827



Score = 52.8 bits (126), Expect = 3e-09
Identities = 45/169 (26%), Positives = 62/169 (36%), Gaps = 5/169 (2%)

Query: 271 TLTATLTSANGTPVEGQVINFSVTPEGATLSGGKVRTNSSGQAPVVLTSNKVGTYTVTAS 330
T TAT+ NG ++F++ A LS TN SG+A V L S+K G V+A
Sbjct: 579 TYTATVKK-NGVAQANVPVSFNIVSGTAVLSANSANTNGSGKATVTLKSDKPGQVVVSAK 637

Query: 331 FHNGVTIQTQTTVKVTGNPSTTHVASFIADPSTIAATNSDLSTLKATVEDGSGNLIEGLT 390
+ V + AD +T A D T V G +
Sbjct: 638 TAEMTSALNANAVIFVDQTKASI-TEIKADKTTAVANGQDAITYTVKVMKG-DKPVSNQE 695

Query: 391 VYFALKSGSTTLTSLTAVTDQNGIATTSVKGEITGSVTVSAVTSAGGMQ 439
V F G + + T TD NG A ++ G VSA S +
Sbjct: 696 VTFTTTLGKLSNS--TEKTDTNGYAKVTLTSTTPGKSLVSARVSDVAVD 742



Score = 51.2 bits (122), Expect = 8e-09
Identities = 39/139 (28%), Positives = 59/139 (42%), Gaps = 3/139 (2%)

Query: 13 AVTDADGKAKVTLKGTKAGAHTVTASMVGGKS--EQLVVNFTADTLTAQVNLNVTEDNFI 70
A T+ GKA VTLK K G V+A S V F T + + + +
Sbjct: 612 ANTNGSGKATVTLKSDKPGQVVVSAKTAEMTSALNANAVIFVDQTKASITEIKADKTTAV 671

Query: 71 ANNIGMTRLQATVTDGNGNPVEGIKVNFRGTSVTLSSTSVETDDQVFAEILVTSTEVGLK 130
AN V PV +V F T LS+++ +TD +A++ +TST G
Sbjct: 672 ANGQDAITYTVKVMK-GDKPVSNQEVTFTTTLGKLSNSTEKTDTNGYAKVTLTSTTPGKS 730

Query: 131 TVSASLADKPTEVISRLLN 149
VSA ++D +V + +
Sbjct: 731 LVSARVSDVAVDVKAPEVE 749



Score = 39.7 bits (92), Expect = 2e-05
Identities = 35/213 (16%), Positives = 63/213 (29%), Gaps = 18/213 (8%)

Query: 4 NFTLSDGDKAVTDADGKAKVTLKGTKAGAHTVTASMVGGKSE--QLVVNFTADTLTAQVN 61
TD +G AKVTL T G V+A + + V F N
Sbjct: 701 TLGKLSNSTEKTDTNGYAKVTLTSTTPGKSLVSARVSDVAVDVKAPEVEFFTTLTIDDGN 760

Query: 62 LNVTEDNFIANNIGMTRLQATVTDGNGN-PVEGIKVNFRGTSVTLSSTSVETDDQVFAEI 120
+ + + + + G N G + S + SV+
Sbjct: 761 IEI-----VGTGVKGKLPTVWLQYGQVNLKASGGNGKYTWRSANPAIASVDASSGQ---- 811

Query: 121 LVTSTEVGLKTVSASLADKPTEVISRLLNAKVDVNSATITSQEIPEGQVMVAQDIAVKAH 180
VT E G T+S +D T + + + ++ + V ++
Sbjct: 812 -VTLKEKGTTTISVISSDNQT--ATYTIATPNSLIVPNMSKRVTYNDAVNTCKNFGG--- 865

Query: 181 VNDQFGNPVTHQPATFSAAPSSQMIISQNTVST 213
N + + + AA + S T+ +
Sbjct: 866 KLPSSQNELENVFKAWGAANKYEYYKSSQTIIS 898


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01977  INTIMIN       30     0.004    Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 30.0 bits (67), Expect = 0.004
Identities = 23/129 (17%), Positives = 47/129 (36%), Gaps = 6/129 (4%)

Query: 11 KISAIDYSQNINGDYKATVTGGGEGIATLIPVLNGVHQAGLSTTIEFISAETRPMTGTVS 70
K+S + NG K T+T G + + ++ V + +EF G +
Sbjct: 704 KLSNSTEKTDTNGYAKVTLTSTTPGKSLVSARVSDVAVDVKAPEVEFF-TTLTIDDGNIE 762

Query: 71 VNGANLPTASFPSQGFTGAYYQLNNDNFAPGKTAADYSFSSSASWVGVDATGKVTFKNDG 130
+ G + P+ L + G + ++ A ++G+VT K G
Sbjct: 763 IVGTGV-KGKLPTVWLQYGQVNL---KASGGNGKYTWRSANPAIASVDASSGQVTLKEKG 818

Query: 131 DSNTVIITA 139
+ T+ + +
Sbjct: 819 -TTTISVIS 826


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01979  TCRTETB       35     5e-04    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 35.2 bits (81), Expect = 5e-04
Identities = 42/262 (16%), Positives = 99/262 (37%), Gaps = 24/262 (9%)

Query: 79 LGGVIFGHFGDRLGRKRMLMLTVWMMGIATALIGILPSFSTIGWWAPILLVTLRAIQGFA 138
+G ++G D+LG KR+L+ + + + + + SF ++ L+ R IQG
Sbjct: 64 IGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSL-------LIMARFIQGAG 116

Query: 139 VGGEWGGAALLSVESAPENKK----AFYSSGVQVGYGVGLLLSTGLVSLISMMTTDEQFL 194
++ P+ + S V +G GVG + + I
Sbjct: 117 AAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYI---------- 166

Query: 195 SWGWRIPFLFSIVLVLGALWVRNGMEESAEFEQQQHYQAAAKKRIPVIEALLRHPGAFLK 254
W L ++ ++ ++ +++ + + + ++ +L +
Sbjct: 167 --HWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVFFMLFTTSYSIS 224

Query: 255 IIALRLCELLTMYIVTAFALNYSTQNMGLPRELFLNIGLLVGGLSCLTIPCFAWLADRFG 314
+ + + L +++ + + GL + + IG+L GG+ T+ F +
Sbjct: 225 FLIVSVLSFL-IFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMM 283

Query: 315 RRRVYITGALIGTLSAFPFFMA 336
+ ++ A IG++ FP M+
Sbjct: 284 KDVHQLSTAEIGSVIIFPGTMS 305


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01998  PRTACTNFAMLY  32     0.010    Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 32.3 bits (73), Expect = 0.010
Identities = 162/831 (19%), Positives = 253/831 (30%), Gaps = 125/831 (15%)

Query: 38 IALSLAAVTSVPALAAD----TVVQAGETVSGGTLTNHDNQIVLGTANGMTISTG----- 88
+A++L A+ + PA AD ++V+ GE G + D V TA+G TI
Sbjct: 19 LAMALGALGAAPAAHADWNNQSIVKTGERQHGIHIQGSDPGGVR-TASGTTIKVSGRQAQ 77

Query: 89 ---LEYGPDNEANTGGQWIQNGGIANNTTVTGGGLQRVNAGGSVSDTVISAGGGQSLQGQ 145
LE G +G ++++ G V AG V+D A G +
Sbjct: 78 GILLENPAAELQFRNGSVTSSGQLSDDGIRRFLGTVTVKAGKLVADHATLANVGDTWDDD 137

Query: 146 AVNTTLNGGEQWVHEGGIA---TGTVINEKGWQAVKSGAVATDTVVNTGAEGGPDAENGD 202
+ + G + G V E+G + D ++ GA E+
Sbjct: 138 GIALYVAGEQAQASIADSTLQGAGGVQIERGANVTVQRSAIVDGGLHIGALQSLQPEDLP 197

Query: 203 TGQTVYGDAVRTTINKNGRQIVAAEGTVNTTVVYAGGDQTVHGHALDTTLNGGYQYVHNG 262
+ V D T V A G V + T+ G + G +
Sbjct: 198 PSRVVLRDTNVTA--------VPASGAPAAVSVLGASELTLDGGHITGGRAAGVAAMQGA 249

Query: 263 GTASGTVVNSDGWQIIKEGGLADFTTVNQKGKLQVNAGGTATNVTLKQGGALVTSTAATV 322
G GG GG GGA+
Sbjct: 250 VVHLQRATIRRGDA--PAGGAV--------------PGGAV------PGGAVPGGFGPGG 287

Query: 323 TGSNRLGNFTVENGKADGVVLESGGRLDVLEGHSAEKTRVDDGGTLAVSAGGKA---TGV 379
G G + V+ + + +S L G +GG G
Sbjct: 288 FGPVLDGWYGVDVSGSSVELAQSIVEAPELGAAIRVGR-----GARVTVSGGSLSAPHGN 342

Query: 380 TMTSGGALI---ADSGATV---EGTNASGKFSIDGISGQASGLLLENG----GSFTVNAG 429
+ +GGA + ++ G +A GK + + + L L G G
Sbjct: 343 VIETGGARRFAPQAAPLSITLQAGAHAQGKALLYRVLPEPVKLTLTGGADAQGDIVATEL 402

Query: 430 GQASNTTVGHRGTLMLAAGGSLSGRTQLSKGASMVLNGDVVSTGDIVNAGEIRFDNQTTP 489
T++G + LA+ +G T+ S+ N V T + N G +R + +
Sbjct: 403 PSIPGTSIG-PLDVALASQARWTGATRAVDSLSID-NATWVMTDN-SNVGALRLASDGSV 459

Query: 490 DAVLSRAVAKGDSPVTFHKLTTSNLTGQGGTINMRVRLDGSNASDQLVINGGQATGKTWL 549
D + F LT + L G G M V D SD+LV+ A+G+ L
Sbjct: 460 D------FQQPAEAGRFKVLTVNTLAGS-GLFRMNVFAD-LGLSDKLVVMQD-ASGQHRL 510

Query: 550 AFTNVGNSNLGVATTGQGIRVVDAQNGATTEEGAFALSRPLQAGAFNYTLNRDSDEDWYL 609
N G+ T + +V G+ + G + Y L + + W L
Sbjct: 511 WVRNSGSEPASANT----LLLVQTPLGSAATFTLANKDGKVDIGTYRYRLAANGNGQWSL 566

Query: 610 RSENAYRAEVPLY-----------------------TSMLTQAMDYDRILAGSRSHQTGV 646
A A P L+ A + G T
Sbjct: 567 VGAKAPPAPKPAPQPGPQPPQPPQPQPEAPAPQPPAGRELSAAANAAVNTGGVGLASTLW 626

Query: 647 NGENNSVRLSIQGGHLGHDNNGGIARGATPKSSGSYGFVRLEGDLLRTEVAGMSL----- 701
E+N++ + L D G RG + R +VAG L
Sbjct: 627 YAESNALSKRLGELRLNPDAGGAWGRGFAQRQQLDNRAGR----RFDQKVAGFELGADHA 682

Query: 702 ---TTGVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGYLNLTHTSSGLWADIVAQGTRH 758
G + G + D + G D+ +GGY SG + D + +R
Sbjct: 683 VAVAGGRWHLGGLAGYTRGDRGFTGDGGGHTDSVHVGGYATYI-ADSGFYLDATLRASRL 741

Query: 759 SMKASSDNND-------FRARGWGWLGSLETGLPFSITDNLMLEPQLQYTW 802
+D +R G G SLE G F+ D LEPQ +
Sbjct: 742 ENDFKVAGSDGYAVKGKYRTHGVGA--SLEAGRRFTHADGWFLEPQAELAV 790


30  CMJKDNLE_02019  CMJKDNLE_02045  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
CMJKDNLE_02019  0       20       3.187140    YefM antitoxin of the YoeB-YefM toxin-antitoxin
CMJKDNLE_02021  -1      21       3.737421    ATP phosphoribosyltransferase
CMJKDNLE_02022  -1      21       3.457222    histidinal dehydrogenase / histidinol
CMJKDNLE_02023  -1      24       2.558393    histidinol-phosphate aminotransferase
CMJKDNLE_02024  -1      24       2.580041    imidazoleglycerol-phosphate dehydratase /
CMJKDNLE_02025  -1      21       1.625138    imidazole glycerol phosphate synthase, HisH
CMJKDNLE_02026  0       21       1.502539    N-(5'-phospho-L-ribosyl-formimino)-5-amino-1-
CMJKDNLE_02027  0       26       2.586195    imidazole glycerol phosphate synthase, HisF
CMJKDNLE_02028  1       32       3.670525    phosphoribosyl-AMP cyclohydrolase /
CMJKDNLE_02029  2       35       4.518153    Alpha-D-kanosaminyltransferase
CMJKDNLE_02030  3       31       3.117240    UDP-glucose 6-dehydrogenase
CMJKDNLE_02031  2       25       1.704472    dTDP-4-dehydrorhamnose 3,5-epimerase
CMJKDNLE_02032  2       21       -2.167771   dTDP-4-dehydrorhamnose reductase
CMJKDNLE_02033  4       24       -6.880507   dTDP-glucose pyrophosphorylase 2
CMJKDNLE_02034  3       35       -12.000081  dTDP-glucose 4,6-dehydratase 2
CMJKDNLE_02035  6       49       -16.533926  6-phosphogluconate dehydrogenase
CMJKDNLE_02036  7       60       -20.173883  hypothetical protein
CMJKDNLE_02037  8       60       -19.262256  hypothetical protein
CMJKDNLE_02038  7       59       -18.190906  hypothetical protein
CMJKDNLE_02039  7       59       -17.627244  hypothetical protein
CMJKDNLE_02040  6       54       -14.686598  UDP-Glc:alpha-D-GlcNAc-diphosphoundecaprenol
CMJKDNLE_02041  5       42       -10.641672  hypothetical protein
CMJKDNLE_02042  5       40       -9.518313   putative colanic acid biosynthsis UDP-glucose
CMJKDNLE_02043  5       33       -7.257729   autophosphorylating protein tyrosine kinase
CMJKDNLE_02044  3       21       -2.648645   tyrosine phosphatase
CMJKDNLE_02045  2       18       0.534558    putative exopolysaccharide export protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_02032  NUCEPIMERASE  54     2e-10    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 53.6 bits (129), Expect = 2e-10
Identities = 38/163 (23%), Positives = 65/163 (39%), Gaps = 29/163 (17%)

Query: 1 MKILLIGKNGQVGWELQRSLSTLGD-VVAVD----YFDKEL----------------CGD 39
MK L+ G G +G+ + + L G VV +D Y+D L D
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 40 LTNLEGIAQTVRTVRPDVVVNAAAHTAVDKA-ESERELSDLLNDKGVAVL--AAESAKLG 96
L + EG+ + + V + AV + E+ +D N G + K+
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYAD-SNLTGFLNILEGCRHNKIQ 119

Query: 97 ALMVHYSTDYVFDGAGSH--YRREDEATGPLNVYGETKRAGEL 137
L ++ S+ V+ G + +D P+++Y TK+A EL
Sbjct: 120 HL-LYASSSSVY-GLNRKMPFSTDDSVDHPVSLYAATKKANEL 160


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_02034  NUCEPIMERASE  182    8e-57    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 182 bits (463), Expect = 8e-57
Identities = 84/355 (23%), Positives = 147/355 (41%), Gaps = 46/355 (12%)

Query: 1 MKILVTGGAGFIGSAVVRHIIENTRDEVRVVDCLT--YAGNL-ESLAPVAGSERYSFSQT 57
MK LVTG AGFIG V + ++E +V +D L Y +L ++ + + F +
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAG-HQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKI 59

Query: 58 DITDAAAVAAQFSEFRPDIVMHLAAESHVDRSIDGPTAFIQTNVIGTFTLLEAARHYWSG 117
D+ D + F+ + V V S++ P A+ +N+ G +LE RH
Sbjct: 60 DLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHN--- 116

Query: 118 LGEEQKQAFRFHHISTDEVYGDLHGTDDLFTEETPYA-PSSPYSASKAGSDHLVRAWNRT 176
+ + S+ VYG F+ + P S Y+A+K ++ + ++
Sbjct: 117 ------KIQHLLYASSSSVYGL--NRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHL 168

Query: 177 YGLPVVVTNCSNNYGPYHFPEKLIPLTILNALAGKPLPVYGNGEQIRDWLYVEDHARALY 236
YGLP YGP+ P+ + L GK + VY G+ RD+ Y++D A A+
Sbjct: 169 YGLPATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAII 228

Query: 237 KV------------------ATEGKSGETYNIGGHNERKNIDVVRTICAILDKVVAQKPG 278
++ A YNIG + + +D ++ + L A+K
Sbjct: 229 RLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGI-EAKK-- 285

Query: 279 NITHFADLITFVTDRPGHDLRYAIDAAKIQRDLGWVPQETFESGIEKTVHWYLNN 333
+ L +PG L + D + +G+ P+ T + G++ V+WY +
Sbjct: 286 ---NMLPL------QPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRDF 331


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_02043  GPOSANCHOR    36     6e-04    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 35.8 bits (82), Expect = 6e-04
Identities = 28/162 (17%), Positives = 55/162 (33%), Gaps = 14/162 (8%)

Query: 241 LEKTLNSISNNYLAQNVARQAAQDAKSLEFLNQQLPKVRNDLDIAEDKLNQYRRQKDSVD 300
LEK L N A + + + L + + L+ A + + +++
Sbjct: 195 LEKALEGAMNFSTADSAKIKTLE--AEKAALAARKADLEKALEGAMNFSTADSAKIKTLE 252

Query: 301 LSLEAKSVLEQIVNVDNQLNELTFRESEISQLYTKEHPTYKALMEKRKTLQDEKAKLNER 360
+ ++ + EL T + K L ++ L+ EKA L +
Sbjct: 253 ---------AEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQ 303

Query: 361 VSAMPKTQQDILQLSRDVDSGQAVYMQLLNRQQELNIAKSSA 402
+ +Q L RD+D+ + QL Q+L +
Sbjct: 304 SQVLNANRQS---LRRDLDASREAKKQLEAEHQKLEEQNKIS 342


31  CMJKDNLE_02086  CMJKDNLE_02100  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
CMJKDNLE_02086  2       11       0.166926    fructose bisphosphate aldolase monomer
CMJKDNLE_02087  1       12       0.723181    YegT MFS transporter
CMJKDNLE_02088  1       14       1.588926    putative hydrolase
CMJKDNLE_02089  0       15       0.897864    putative kinase
CMJKDNLE_02090  -1      15       -0.337159   putative DNA-binding transcriptional regulator
CMJKDNLE_02091  -1      16       -0.430088   putative hydrolase
CMJKDNLE_02092  3       23       -2.970739   hydroxymethylpyrimidine kinase /
CMJKDNLE_02093  3       27       -5.308176   hydroxyethylthiazole kinase
CMJKDNLE_02095  3       28       -7.278174   RcnR DNA-binding transcriptional repressor
CMJKDNLE_02096  3       29       -7.829407   membrane protein conferring nickel and cobalt
CMJKDNLE_02097  4       32       -8.961633   periplasmic protein involved in nickel/cobalt
CMJKDNLE_02098  2       27       -7.331670   putative type-1 fimbrial protein
CMJKDNLE_02099  -1      13       -4.024428   putative outer membrane usher protein
CMJKDNLE_02100  -1      12       -3.219516   putative fimbrial chaperone
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_02087  TCRTETA       36     2e-04    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 36.0 bits (83), Expect = 2e-04
Identities = 53/268 (19%), Positives = 89/268 (33%), Gaps = 17/268 (6%)

Query: 29 LSKSGFSAGEIGWSYACTAIAAILSPILVGSITDRFFSAQKVLAVLMFAGALLMYFAAQQ 88
L S G A A+ ++G+++DRF ++ + ++ AGA + Y
Sbjct: 35 LVHSNDVTAHYGILLALYALMQFACAPVLGALSDRF--GRRPVLLVSLAGAAVDYAI--- 89

Query: 89 TTFAGFFPLLLAYSLTYMPTIALTNSIAFANVPDVERDFPRIRVMGTIG-WIASGLACGF 147
A F +L + T A T ++A A + D+ R R G + G+ G
Sbjct: 90 MATAPFLWVLYIGRIVAGITGA-TGAVAGAYIADITDGDERARHFGFMSACFGFGMVAG- 147

Query: 148 LPQILGY-ADISPTNIPLLITAGSSALLGVFAFFLPDTPPKSTGKMDIKVMLGLDALILL 206
P + G SP + P A + L + FL K + + L A
Sbjct: 148 -PVLGGLMGGFSP-HAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRW 205

Query: 207 RDKN------FLVFFFCSFLFAMPLAFYYIFANGYLTEVGMKNATGWMTLGQFSEIFFML 260
VFF + +P A + IF G + +
Sbjct: 206 ARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAM 265

Query: 261 ALPFFTKRFGIKKVLLLGLVTAAIRYGF 288
R G ++ L+LG++ Y
Sbjct: 266 ITGPVAARLGERRALMLGMIADGTGYIL 293



Score = 31.7 bits (72), Expect = 0.006
Identities = 32/153 (20%), Positives = 52/153 (33%), Gaps = 20/153 (13%)

Query: 253 FSEIFFMLALPFFTKRFGIKKVLLLGLVTAAIRYGFFIYGSADEYFTYALLFPGILLHGV 312
+ L + RFG + VLL+ L AA+ Y +L+ G ++ G+
Sbjct: 54 LMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAP-----FLWVLYIGRIVAGI 108

Query: 313 SYDFYYVTAYIYVDKKAPVHMRTAAQGLITLCCQGFGSLLGYRLGGVMMEKMFAYQEPVN 372
+ V D R G ++ C GFG + G LGG+M F+ P
Sbjct: 109 TGATGAVAGAYIAD-ITDGDERARHFGFMS-ACFGFGMVAGPVLGGLMGG--FSPHAP-- 162

Query: 373 GLTFNWSGMWTFGAVMIAIIAVLFMIFFRESDN 405
+ A + + + ES
Sbjct: 163 ---------FFAAAALNGLNFLTGCFLLPESHK 186


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_02097  TYPE3OMGPROT  26     0.029    Type III secretion system outer membrane G protein ...
		>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature.

Length = 607

Score = 26.4 bits (58), Expect = 0.029
Identities = 13/42 (30%), Positives = 21/42 (50%), Gaps = 1/42 (2%)

Query: 6 KMLLGVLLLVTSAAWAAPATAGSTNTSGISKYE-LSSFIADF 46
++L G LLL++S +WA ++K E L + DF
Sbjct: 11 RVLTGTLLLLSSYSWAQELDWLPIPYVYVAKGESLRDLLTDF 52


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_02099  PF00577       718    0.0      Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 718 bits (1855), Expect = 0.0
Identities = 243/843 (28%), Positives = 392/843 (46%), Gaps = 35/843 (4%)

Query: 2 LRMTPLASAI---VALLLGIEAYAAEETFDTHFMIGGMKDQQVANIRL--DDNQPLPGQY 56
R+ + A +AE F+ F+ Q VA++ + + PG Y
Sbjct: 21 HRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADD--PQAVADLSRFENGQELPPGTY 78

Query: 57 DIDIYVNKQWRGKYEIIVKDNPQET----CLSREVIKRLGIN-----SDNFASGKQCLTF 107
+DIY+N + ++ E CL+R + +G+N N + C+
Sbjct: 79 RVDIYLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVPL 138

Query: 108 EQLVQGGSYTWDIGVFRLDFSVPQAWVEELESGYVPPENWERGINAFYTSYYLSQYYSDY 167
++ + D+G RL+ ++PQA++ GY+PPE W+ GINA +Y S
Sbjct: 139 TSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQN 198

Query: 168 KASGNNKSTYVRFNSGLNLLGWQLHSDASFSKTNNNPGV-----WKSNTLYLERGFAQLL 222
+ GN+ Y+ SGLN+ W+L + ++S +++ W+ +LER L
Sbjct: 199 RIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLR 258

Query: 223 GTLRVGDMYTSSDIFDSVRFRGVRLFRDMQMLPNSKQNFTPRVQGIAQSNALVTIEQNGF 282
L +GD YT DIFD + FRG +L D MLP+S++ F P + GIA+ A VTI+QNG+
Sbjct: 259 SRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGY 318

Query: 283 VVYQKEVPPGPFAITDLQLAGGGADLDVSVKEADGSVTTYLVPYAAVPNMLQPGVSKYDF 342
+Y VPPGPF I D+ AG DL V++KEADGS + VPY++VP + + G ++Y
Sbjct: 319 DIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSI 378

Query: 343 AAGRSHIEGASKQSE-FVQVGYQYGFNNLLTLYGGSMVANNYYAFTLGTGWNT-RIGAIS 400
AG A ++ F Q +G T+YGG+ +A+ Y AF G G N +GA+S
Sbjct: 379 TAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGALS 438

Query: 401 VDATKSHSKQDNGDVFDGQSYQIAYNKFVSQTSTRFGLAAWRYSSRDYRTFNDHVWANNK 460
VD T+++S + DGQS + YNK ++++ T L +RYS+ Y F D ++
Sbjct: 439 VDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMN 498

Query: 461 DNYRRDENDVYDI----ADYYQNDFGRKNSFSANMSQSLPEGWGSVSLSTLWRDYWGRSG 516
++ V + DYY + ++ ++Q L ++ LS + YWG S
Sbjct: 499 GYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGR-TSTLYLSGSHQTYWGTSN 557

Query: 517 SSKDYQLSYSNNWRRISYTLAASQAYDENHHE-EKRFNIFISIPFD--WGDDVTTPRRQI 573
+ +Q + + I++TL+ S + ++ + ++IPF D + R
Sbjct: 558 VDEQFQAGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSDSKSQWRHA 617

Query: 574 YMSNSTTFDDQGFASNNTGLSGTVGSRDQFNYGVNLSYQHQGN---ETTAGANLTWNAPV 630
S S + D G +N G+ GT+ + +Y V Y G+ +T A L +
Sbjct: 618 SASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGY 677

Query: 631 ATVNGSYSQSSTYRQAGASVSGGIVAWSGGVNLANRLSETFAVMNAPGIKDAYVNGQKYR 690
N YS S +Q VSGG++A + GV L L++T ++ APG KDA V Q
Sbjct: 678 GNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDAKVENQTGV 737

Query: 691 TTNRNGVVVYDGMTPYRENHLMLDVSQSDSEAELRGNRKIAAPYRGAVVLVNFDTDQRKP 750
T+ G V T YREN + LD + +L P RGA+V F +
Sbjct: 738 RTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKA-RVGI 796

Query: 751 WFIKALRADGQPLMFGYEVNDIHGHNIGVVGQGSQLFIRTNEVPPSVNVAIDKQQGLSCT 810
+ L + +PL FG V + G+V Q+++ + V V +++ C
Sbjct: 797 KLLMTLTHNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGKVQVKWGEEENAHCV 856

Query: 811 ITF 813
+
Sbjct: 857 ANY 859


32  CMJKDNLE_02184  CMJKDNLE_02196  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
CMJKDNLE_02184  0       20       3.493535    FimZ transcriptional regulator
CMJKDNLE_02185  0       21       3.837401    CcmEFGH holocytochrome c synthetase
CMJKDNLE_02186  1       19       4.216745    CcmEFGH holocytochrome c synthetase
CMJKDNLE_02187  0       17       4.223679    cytochrome c-type biogenesis protein
CMJKDNLE_02188  0       15       2.828191    membrane anchored periplasmic heme chaperone
CMJKDNLE_02189  0       17       3.298091    protoheme IX ABC transporter - membrane subunit
CMJKDNLE_02190  0       15       3.372406    protoheme IX ABC transporter - membrane subunit
CMJKDNLE_02191  -1      17       4.097098    protoheme IX ABC transporter - membrane subunit
CMJKDNLE_02192  -1      18       3.894079    protoheme IX ABC transporter - ATP binding
CMJKDNLE_02193  -1      21       3.885765    cytochrome c protein
CMJKDNLE_02194  -1      19       4.214185    subunit of periplasmic nitrate reductase,
CMJKDNLE_02195  -1      18       3.629997    ferredoxin-type protein
CMJKDNLE_02196  0       18       3.070447    ferredoxin-type protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_02184  HTHFIS        64     3e-14    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 64.1 bits (156), Expect = 3e-14
Identities = 22/113 (19%), Positives = 47/113 (41%), Gaps = 2/113 (1%)

Query: 9 VMIVDDHPLMRRGVRQLLELDPGFEVVAEAGDGASAIDLANRLDIDVILLDLNMKGMSGL 68
+++ DD +R + Q L G++V + A+ D D+++ D+ M +
Sbjct: 6 ILVADDDAAIRTVLNQALS-RAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 69 DTLNALRRDGVTAQIIILTVSDASSDVFALIDAGADGYLLKDSDPEVLLEAIR 121
D L +++ +++++ + + GA YL K D L+ I
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116


33  CMJKDNLE_02255  CMJKDNLE_02281  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
CMJKDNLE_02255  1       15       3.211443    undecaprenyl-phosphate-alpha-L-Ara4N flippase -
CMJKDNLE_02256  0       13       3.512690    undecaprenyl-phosphate-alpha-L-Ara4N flippase -
CMJKDNLE_02257  1       14       4.744305    polymyxin B resistance protein
CMJKDNLE_02258  1       14       4.498109    o-succinylbenzoate-CoA ligase
CMJKDNLE_02259  0       14       4.349599    o-succinylbenzoate synthase
CMJKDNLE_02260  -1      13       3.132696    1,4-dihydroxy-2-naphthoyl-CoA synthase
CMJKDNLE_02261  0       13       2.319191    (1R,6R)-2-succinyl-6-hydroxy-2,
CMJKDNLE_02262  -1      17       -1.826852   2-succinyl-5-enolpyruvyl-6-hydroxy-3-
CMJKDNLE_02263  0       23       -4.944754   isochorismate synthase 2
CMJKDNLE_02264  1       27       -6.746821   hypothetical protein
CMJKDNLE_02265  1       27       -7.064918   putative acyltransferase with acyl-CoA
CMJKDNLE_02266  1       30       -7.980269   RNase BN
CMJKDNLE_02267  2       33       -9.394589   deubiquitinase
CMJKDNLE_02268  2       33       -7.563986   putative lipoprotein
CMJKDNLE_02269  2       21       -4.786411   putative peptidase
CMJKDNLE_02270  1       14       -2.322656   hypothetical protein
CMJKDNLE_02271  1       15       0.001623    hypothetical protein
CMJKDNLE_02272  2       18       1.577314    hypothetical protein
CMJKDNLE_02273  1       21       2.470597    hypothetical protein
CMJKDNLE_02274  1       28       3.953685    NADH:ubiquinone oxidoreductase, membrane subunit
CMJKDNLE_02275  1       29       3.347808    NADH:ubiquinone oxidoreductase, membrane subunit
CMJKDNLE_02276  1       29       3.931261    NADH:ubiquinone oxidoreductase, membrane subunit
CMJKDNLE_02277  0       29       3.643409    NADH:ubiquinone oxidoreductase, membrane subunit
CMJKDNLE_02278  0       29       3.658464    NADH:ubiquinone oxidoreductase, membrane subunit
CMJKDNLE_02279  1       28       3.816624    NADH:ubiquinone oxidoreductase, chain I
CMJKDNLE_02280  1       27       3.580987    NADH:ubiquinone oxidoreductase, membrane subunit
CMJKDNLE_02281  1       25       3.625574    NADH:ubiquinone oxidoreductase, chain G
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_02255  BCTERIALGSPC  28     0.008    Bacterial general secretion pathway protein C signa...
		>BCTERIALGSPC#Bacterial general secretion pathway protein C signature.

Length = 272

Score = 28.0 bits (62), Expect = 0.008
Identities = 12/31 (38%), Positives = 18/31 (58%), Gaps = 1/31 (3%)

Query: 34 KHIVLWLGLALACLGLAMVLWLLVL-QNVPV 63
+ I+ +L + L C LAM+ W + L N PV
Sbjct: 15 RRILFYLLMLLFCQQLAMIFWRIGLPDNAPV 45


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_02258  ACETATEKNASE  30     0.016    Acetate kinase family signature.
		>ACETATEKNASE#Acetate kinase family signature.

Length = 400

Score = 30.2 bits (68), Expect = 0.016
Identities = 19/124 (15%), Positives = 47/124 (37%), Gaps = 20/124 (16%)

Query: 339 EMHNGKLTIVG-----RLDNLFFSGGEGIQPEEVERVIAAHPAVLQVFIVPVADKEF--- 390
E +G + G +++ + + ++++ + H +++ + + + ++
Sbjct: 19 ESKDGNVLAKGLAERIGINDSLLTHNANGEKIKIKKDMKDHKDAIKLVLDALVNSDYGVI 78

Query: 391 ---------GHRPVAVMEYDHESVDLSEWVKDKLARFQQPVRWLTLPPELKNGGIKISRQ 441
GHR V EY SV +++ V + + L P + GIK Q
Sbjct: 79 KDMSEIDAVGHRVVHGGEYFTSSVLITDDVLKAITDC-IELAPLHNPANI--EGIKACTQ 135

Query: 442 ALKE 445
+ +
Sbjct: 136 IMPD 139


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_02265  AUTOINDCRSYN  35     6e-05    Autoinducer synthesis protein signature.
		>AUTOINDCRSYN#Autoinducer synthesis protein signature.

Length = 216

Score = 34.8 bits (80), Expect = 6e-05
Identities = 14/79 (17%), Positives = 32/79 (40%), Gaps = 12/79 (15%)

Query: 1 MIEWQDLHHSELSVSQLYALLQLRCAVFV--------VEQNCPYQDIDGDDLTGDNRHIL 52
M+E D++H+ LS ++ L LR F + D + + ++
Sbjct: 1 MLEIFDVNHTLLSETKSGELFTLRKETFKDRLNWAVQCTDGMEFDQYDNN----NTTYLF 56

Query: 53 GWKNDELVAYARILKSDDD 71
G K++ ++ R +++
Sbjct: 57 GIKDNTVICSLRFIETKYP 75


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_02268  PERTACTIN     30     0.035    Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 29.7 bits (66), Expect = 0.035
Identities = 31/125 (24%), Positives = 47/125 (37%), Gaps = 14/125 (11%)

Query: 21 PQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRL 80
PQP ++ + P P +++ AA AA+ A+ Y++ AL RL
Sbjct: 598 PQPPQRQPEAPAPQPPAGRELSAAANAAVNTGGVGLASTLW--------YAESNALSKRL 649

Query: 81 QEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFL 140
E A A G A+ QQ D+ ++ Q +A F L D R+
Sbjct: 650 GELRLNPDAGGAWGR-----GFAQRQQLDNRAGRRFDQK-VAGFELGADHAVAVAGGRWH 703

Query: 141 NQGLL 145
GL
Sbjct: 704 LGGLA 708


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_02273  SYCDCHAPRONE  30     0.007    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 29.9 bits (67), Expect = 0.007
Identities = 13/67 (19%), Positives = 27/67 (40%), Gaps = 3/67 (4%)

Query: 90 NGISIEDQDFAANLFRVARKCLSTGRLDDALPLLQRATEQLPEVSEYWLALAIQYRRCKK 149
N IS + + L+ +A +G+ +DA + Q S ++L L + +
Sbjct: 29 NEISSDTLE---QLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQ 85

Query: 150 TEAAAQA 156
+ A +
Sbjct: 86 YDLAIHS 92


34  CMJKDNLE_02340  CMJKDNLE_02359  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
CMJKDNLE_02340  -1      17       -3.205023   FadI monomer
CMJKDNLE_02341  0       18       -6.021005   hypothetical protein
CMJKDNLE_02342  1       20       -5.367041   long-chain fatty acid outer membrane porin;
CMJKDNLE_02343  1       22       -4.961792   hypothetical protein
CMJKDNLE_02344  0       20       -2.296762   MlaA
CMJKDNLE_02345  0       22       -2.674903   putative inner membrane protein
CMJKDNLE_02347  0       22       -2.215475   *lactose / melibiose:H+ symporter LacY
CMJKDNLE_02348  0       26       -2.913712   ribokinase
CMJKDNLE_02349  0       27       -4.600805   Sucrose-6-phosphate hydrolase
CMJKDNLE_02350  -1      30       -6.345137   CytR-cytidine
CMJKDNLE_02351  0       33       -9.027986   D-serine transporter
CMJKDNLE_02352  0       34       -8.863250   D-serine ammonia-lyase
CMJKDNLE_02353  1       36       -9.837283   EmrKY putative multidrug efflux transporter -
CMJKDNLE_02354  0       36       -8.959860   EmrKY-TolC multidrug efflux transport system -
CMJKDNLE_02355  1       33       -8.250023   FimZ transcriptional regulator
CMJKDNLE_02356  1       34       -7.881491   putative DNA-binding response regulator in
CMJKDNLE_02357  2       33       -6.234380   putative CoA transferase, NAD(P)-binding
CMJKDNLE_02358  2       32       -6.026761   YfdV AEC Transporter
CMJKDNLE_02359  1       27       -4.771282   oxalyl-CoA decarboxylase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_02344  VACJLIPOPROT  407    e-148    VacJ lipoprotein signature.
		>VACJLIPOPROT#VacJ lipoprotein signature.

Length = 251

Score = 407 bits (1048), Expect = e-148
Identities = 250/251 (99%), Positives = 250/251 (99%)

Query: 1 MKLRLSALALGTTLLVGCASSGTDQQGRSDPLEGFNRTMYNFNFNVLDPYIVRPVAVAWR 60
MKLRLSALALGTTLLVGCASSGTDQQGRSDPLEGFNRTMYNFNFNVLDPYIVRPVAVAWR
Sbjct: 1 MKLRLSALALGTTLLVGCASSGTDQQGRSDPLEGFNRTMYNFNFNVLDPYIVRPVAVAWR 60

Query: 61 DYVPQPARNGLSNFTGNLEEPAVMVNYFLQGDPYQGMVHFTRFFLNTILGMGGFIDVAGM 120
DYVPQPARNGLSNFTGNLEEPAVMVNYFLQGDPYQGMVHFTRFFLNTILGMGGFIDVAGM
Sbjct: 61 DYVPQPARNGLSNFTGNLEEPAVMVNYFLQGDPYQGMVHFTRFFLNTILGMGGFIDVAGM 120

Query: 121 ANPKLQRTEPHRFGSTLGHYGVGYGPYVQLPFYGSFTLRDDGGDMADGLYPVLSWLTWPM 180
ANPKLQRTEPHRFGSTLGHYGVGYGPYVQLPFYGSFTLRDDGGDMAD LYPVLSWLTWPM
Sbjct: 121 ANPKLQRTEPHRFGSTLGHYGVGYGPYVQLPFYGSFTLRDDGGDMADALYPVLSWLTWPM 180

Query: 181 SVGKWTLEGIETRAQLLDSDGLLRQSSDPYIMVREAYFQRHDFIANGGELKPQENPNAQA 240
SVGKWTLEGIETRAQLLDSDGLLRQSSDPYIMVREAYFQRHDFIANGGELKPQENPNAQA
Sbjct: 181 SVGKWTLEGIETRAQLLDSDGLLRQSSDPYIMVREAYFQRHDFIANGGELKPQENPNAQA 240

Query: 241 IQDDLKDIDSE 251
IQDDLKDIDSE
Sbjct: 241 IQDDLKDIDSE 251


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_02353  TCRTETB       121    4e-32    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 121 bits (306), Expect = 4e-32
Identities = 92/404 (22%), Positives = 167/404 (41%), Gaps = 17/404 (4%)

Query: 19 VTIALSLATFMQMLDSTISNVAIPTISGFLGASTDEGTWVITSFGVANAIAIPVTGRLAQ 78
+ I L + +F +L+ + NV++P I+ WV T+F + +I V G+L+
Sbjct: 15 ILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSD 74

Query: 79 RIGELRLFLLSVTFFSLSSLMCSLS-TNLDVLIFFRVVQGLMAGPLIPLSQSLLLRNYPP 137
++G RL L + S++ + + +LI R +QG A L ++ R P
Sbjct: 75 QLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPK 134

Query: 138 EKRTFALALWSMTVIIAPICGPILGGYICDNFSWGWIFLINVPMGIIVLTLCLTLLKGRE 197
E R A L V + GP +GG I W +L+ +PM I+ L L +E
Sbjct: 135 ENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWS--YLLLIPMITIITVPFLMKLLKKE 192

Query: 198 TETSPVKMNLPGLTLLVLGVGGLQIMLDKGRDLDWFNSSTIIILTVVSVISLISLVIWES 257
++ G+ L+ +G+ + ML F +S I +VSV+S + V
Sbjct: 193 VRIKG-HFDIKGIILMSVGI--VFFML--------FTTSYSISFLIVSVLSFLIFVKHIR 241

Query: 258 TSENPILDLSLFKSRNFTIGIVSITCAYLFYSGAIVLMPQLLQETMGYNAIWAGLAYAPI 317
+P +D L K+ F IG++ + +G + ++P ++++ + G
Sbjct: 242 KVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFP 301

Query: 318 GIMPLLIS-PLIGRYGNKIDMRLLVTFSFLMYAVCYYWRSVTFMPTIDFTGIILPQFFQG 376
G M ++I + G ++ ++ +V + S T F II+ G
Sbjct: 302 GTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGG 361

Query: 377 FAVACFFLPLTTISFSGLPDNKFANASSMSNFFRTLSGSVGTSL 420
+ ++TI S L + S+ NF LS G ++
Sbjct: 362 LSFTK--TVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAI 403


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_02354  RTXTOXIND     78     6e-18    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 78.3 bits (193), Expect = 6e-18
Identities = 62/412 (15%), Positives = 122/412 (29%), Gaps = 96/412 (23%)

Query: 13 RRKYFSLLAVVLFIAFSGAYAYWSMELEDMISTDDAYVT-GNADPISAQVSGSVTVVNHK 71
RR ++ F+ + + ++E + + + G + I + V + K
Sbjct: 55 RRPRLVAYFIMGFLVIAFILSVLG-QVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVK 113

Query: 72 DTNYVRQGDILVSLDKTDATIALNKA---------------------------------- 97
+ VR+GD+L+ L A K
Sbjct: 114 EGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDE 173

Query: 98 ------------------KNNLANIVRQTNKLYLQDKQYSAEVASARIQ---YQQSLEDY 136
K + Q + L + AE + + Y+
Sbjct: 174 PYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVE 233

Query: 137 NRRV----PLAKQGVISKE----------TLEHTKDTLISSKAALNAAIQAYKANKALVM 182
R+ L + I+K + S + + I + K LV
Sbjct: 234 KSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVT 293

Query: 183 N-------TPLNR-QPQVVEAADATKEAWLALKRTDIKSPVTGYIAQRSVQ-VGETVSPG 233
L + + + + + I++PV+ + Q V G V+
Sbjct: 294 QLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTA 353

Query: 234 QSLMAVVPARQ-MWVNANFKETQLTDVRIGQSVNIISDLYGENVVFHGRVTGINMGTGNA 292
++LM +VP + V A + + + +GQ+ I + F G +G
Sbjct: 354 ETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVE------AFPYTRYGYLVGK--- 404

Query: 293 FSLLPAQNATGNWIKIVQRVPVEVSLDPKELMEH----PLRIGLSMTATIDT 340
+ + +V V +S++ L PL G+++TA I T
Sbjct: 405 VKNINLDAIEDQRLGLVFNVI--ISIEENCLSTGNKNIPLSSGMAVTAEIKT 454


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_02355  HTHFIS        49     3e-09    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 49.1 bits (117), Expect = 3e-09
Identities = 22/148 (14%), Positives = 53/148 (35%), Gaps = 31/148 (20%)

Query: 4 IIIDDHPLAIAAIRNLLIKNDIEILAELTEGGSAVQRVETLKPDIVIIDVDIPGVNGIQV 63
++ DD + L + ++ + + + + D+V+ DV +P N +
Sbjct: 7 LVADDDAAIRTVLNQALSRAGYDVRIT-SNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 64 LETLRKRQYSGIIIIVSAKNDHFYGKHCADAGANGFVSKKEGMNNIIAAIEAAKNGYCYF 123
L ++K + ++++SA+N + AI+A++ G +
Sbjct: 66 LPRIKKARPDLPVLVMSAQNT------------------------FMTAIKASEKGAYDY 101

Query: 124 ---PFSLNRFVGSLTSDQQKLDSLSKQE 148
PF L + + L ++
Sbjct: 102 LPKPFDLTE---LIGIIGRALAEPKRRP 126


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_02356  HTHFIS        80     2e-17    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 79.9 bits (197), Expect = 2e-17
Identities = 30/105 (28%), Positives = 51/105 (48%)

Query: 960 SILIADDHPTNRLLLKRQLNLLGYDVDEATDGVQALHKVSMQHYDLLITDVNMPNMDGFE 1019
+IL+ADD R +L + L+ GYDV ++ ++ DL++TDV MP+ + F+
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 1020 LTRKLREQNSSLPIWGLTANAQANEREKGLSCGMNLCLFKPLTLD 1064
L ++++ LP+ ++A K G L KP L
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLT 109


35  CMJKDNLE_02422  CMJKDNLE_02436  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
CMJKDNLE_02422  -2      16       3.587480    coproporphyrinogen III oxidase
CMJKDNLE_02423  -2      17       4.580696    putative ARAC-type regulatory protein
CMJKDNLE_02424  -1      21       4.857135    putative structural protein, ethanolamine
CMJKDNLE_02425  -1      21       5.094838    putative structural protein, ethanolamine
CMJKDNLE_02426  0       21       5.356894    ethanolamine ammonia-lyase, beta subunit
CMJKDNLE_02427  1       21       5.403940    ethanolamine ammonia-lyase, alpha subunit
CMJKDNLE_02428  2       19       5.570525    reactivating factor for ethanolamine
CMJKDNLE_02429  1       18       5.170709    putative inner membrane protein
CMJKDNLE_02431  4       19       5.864305    putative alcohol dehydrogenase in ethanolamine
CMJKDNLE_02432  2       18       5.796725    putative chaperonin, ethanolamine utilization
CMJKDNLE_02433  3       19       5.351104    putative aldehyde dehydrogenase, ethanolamine
CMJKDNLE_02434  1       19       4.350057    putative carboxysome structural protein
CMJKDNLE_02435  2       20       4.074641    putative structural protein, ethanolamine
CMJKDNLE_02436  2       17       3.222219    phosphate acetyltransferase monomer
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_02432  SHAPEPROTEIN  51     2e-09    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 50.5 bits (121), Expect = 2e-09
Identities = 33/116 (28%), Positives = 50/116 (43%), Gaps = 9/116 (7%)

Query: 63 VRDGIVWDFFGAVTIVRRHLD-TLEQQFGRRFSHAATSFPPGTDP---RISINVLESAGL 118
++DG++ DFF +++ + F R P G R + AG
Sbjct: 76 MKDGVIADFFVTEKMLQHFIKQVHSNSFMRPSPRVLVCVPVGATQVERRAIRESAQGAGA 135

Query: 119 EVSHVLDEPTAVA---DLLQLDNAG--VVDIGGGTTGIAIVKKGKVTYSADEATGG 169
+++EP A A L + G VVDIGGGTT +A++ V YS+ GG
Sbjct: 136 REVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLNGVVYSSSVRIGG 191


36  CMJKDNLE_02481  CMJKDNLE_02489  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
CMJKDNLE_02481  -1      12       -3.195326   phosphoribosylglycinamide formyltransferase 1
CMJKDNLE_02482  -2      11       -3.793867   degradosome
CMJKDNLE_02483  0       15       -3.972507   exopolyphosphatase monomer
CMJKDNLE_02484  -1      13       -3.055279   cyclic di-GMP phosphodiesterase
CMJKDNLE_02485  3       29       0.007456    hypothetical protein
CMJKDNLE_02486  3       22       0.886294    hypothetical protein
CMJKDNLE_02487  2       21       0.812888    putative lipoprotein
CMJKDNLE_02488  2       21       1.135257    putative membrane protein
CMJKDNLE_02489  2       21       1.465118    GMP synthetase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_02488  IGASERPTASE   28     0.020    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 28.5 bits (63), Expect = 0.020
Identities = 19/124 (15%), Positives = 40/124 (32%), Gaps = 6/124 (4%)

Query: 34 QQGKNEEQRQHDEWVAERNREIQQEKQRRANAQAAANKRAATAAANKKARQDKLDAEASA 93
Q + ++ + + + E+ Q Q K AT +KA+ + +
Sbjct: 1064 QNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVET-EKTQEV 1122

Query: 94 DKKRDQSYEDELRSLEIQKQKLALAKEEARVKRENEFIDQELKHKAAQTDVVQSEADANR 153
K Q + +S +Q Q + + V I + D Q + +
Sbjct: 1123 PKVTSQVSPKQEQSETVQPQAEPARENDPTVN-----IKEPQSQTNTTADTEQPAKETSS 1177

Query: 154 NMTE 157
N+ +
Sbjct: 1178 NVEQ 1181


37  CMJKDNLE_02508  CMJKDNLE_02513  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
CMJKDNLE_02508  0       18       3.672959    aminopeptidase B
CMJKDNLE_02509  3       20       2.666000    protein with possible role in iron-sulfur
CMJKDNLE_02510  3       24       2.527257    ring 1,2-phenylacetyl-CoA epoxidase, reductase
CMJKDNLE_02511  3       24       2.625703    chaperone, member of Hsp70 protein family
CMJKDNLE_02512  2       24       1.246894    Hsc20 co-chaperone that acts with Hsc66 in IscU
CMJKDNLE_02513  2       28       1.623207    iron-sulfur cluster assembly protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_02511  SHAPEPROTEIN  114    5e-30    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 114 bits (288), Expect = 5e-30
Identities = 81/371 (21%), Positives = 144/371 (38%), Gaps = 74/371 (19%)

Query: 23 GIDLGTTNSLVATVRSGQAETLADHEGRHLLPSVVHYQQQGHS-------VGYDARTNAA 75
IDLGT N+L+ G + +E PSVV +Q VG+DA+
Sbjct: 14 SIDLGTANTLIYVKGQG----IVLNE-----PSVVAIRQDRAGSPKSVAAVGHDAK-QML 63

Query: 76 LDTANTISSVKRLMGRSLADIQQRYPHLPYQFQASENGLPMIETAAGLLNPVRVSADILK 135
T I++++ + +AD V+ +L+
Sbjct: 64 GRTPGNIAAIRPMKDGVIADF-------------------------------FVTEKMLQ 92

Query: 136 ALAARATEALAGE-LDGVVITVPAYFDDAQRQGTKDAARLAGLHVLRLLNEPTAAAIAYG 194
+ V++ VP +R+ +++A+ AG + L+ EP AAAI G
Sbjct: 93 HFIKQVHSNSFMRPSPRVLVCVPVGATQVERRAIRESAQGAGAREVFLIEEPMAAAIGAG 152

Query: 195 LDSGQEGVIAVYDLGGGTFDISILRLSRGVFEVLATGGDSALGGDDFDHLLADYIREQAG 254
L + V D+GGGT +++++ L+ V +GGD FD + +Y+R G
Sbjct: 153 LPVSEATGSMVVDIGGGTTEVAVISLNGVV-----YSSSVRIGGDRFDEAIINYVRRNYG 207

Query: 255 --IPDRSDNRVQRELLDAAIAAKIALSDADSVTVNVAG---WQG-----EISREQFNELI 304
I + + R++ E+ A + + V G +G ++ + E +
Sbjct: 208 SLIGEATAERIKHEI-------GSAYPGDEVREIEVRGRNLAEGVPRGFTLNSNEILEAL 260

Query: 305 APLVKRTLLACRRALKDAGVE-ADEVLE--VVMVGGSTRVPLVRERVGEFFGRPPLTSID 361
+ + A AL+ E A ++ E +V+ GG + + + E G P + + D
Sbjct: 261 QEPLTGIVSAVMVALEQCPPELASDISERGMVLTGGGALLRNLDRLLMEETGIPVVVAED 320

Query: 362 PDKVVAIGAAI 372
P VA G
Sbjct: 321 PLTCVARGGGK 331


38  CMJKDNLE_02596  CMJKDNLE_02629  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
CMJKDNLE_02596  2       11       -0.004790   protein component of the signal recognition
CMJKDNLE_02597  3       12       -0.712720   putative inner membrane protein
CMJKDNLE_02598  2       12       -0.894183   putative inner membrane protein
CMJKDNLE_02599  3       14       -1.523454   phage lambda replication; host DNA synthesis;
CMJKDNLE_02600  3       15       -1.331595   NAD kinase monomer
CMJKDNLE_02601  2       18       -1.217086   protein used in recombination and DNA repair
CMJKDNLE_02602  4       21       -2.626433   Outer Membrane Protein Assembly Complex - BamE
CMJKDNLE_02603  3       24       -4.297332   hypothetical protein
CMJKDNLE_02604  3       26       -5.470199   50S ribosomal subunit-binding toxin of a
CMJKDNLE_02605  4       27       -6.192818   small protein B
CMJKDNLE_02606  4       30       -7.210633   hypothetical protein
CMJKDNLE_02609  4       32       -8.132804   adhesin-like autotransporter
CMJKDNLE_02610  1       32       -11.666145  hypothetical protein
CMJKDNLE_02611  1       18       -5.897176   hypothetical protein
CMJKDNLE_02613  2       15       -1.476417   *hypothetical protein
CMJKDNLE_02614  2       15       1.201236    hypothetical protein
CMJKDNLE_02615  3       18       2.132483    hypothetical protein
CMJKDNLE_02616  2       22       4.060016    hypothetical protein
CMJKDNLE_02617  3       21       3.807184    L-2-hydroxyglutarate oxidase
CMJKDNLE_02618  3       21       3.205380    succinate-semialdehyde dehydrogenase (NADP+)
CMJKDNLE_02619  2       17       1.943048    4-aminobutyrate aminotransferase monomer
CMJKDNLE_02620  3       17       -1.254628   4-aminobutyrate:H+ symporter
CMJKDNLE_02621  0       19       -1.992083   CsiR DNA-binding transcriptional repressor
CMJKDNLE_02622  -1      20       -3.200919   hypothetical protein
CMJKDNLE_02623  0       24       -3.423136   putative membrane protein
CMJKDNLE_02624  0       23       -2.825098   putative DNA-binding transcriptional regulator
CMJKDNLE_02625  0       25       -2.889983   putative inner membrane protein with hydrolase
CMJKDNLE_02626  3       20       -0.837616   H-NS-like DNA-binding protein with RNA chaperone
CMJKDNLE_02627  1       12       -0.091419   L-alanine exporter
CMJKDNLE_02628  1       13       -0.593689   hypothetical protein
CMJKDNLE_02629  2       14       -0.826217   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_02602  BLACTAMASEA   26     0.032    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 26.3 bits (58), Expect = 0.032
Identities = 23/87 (26%), Positives = 36/87 (41%), Gaps = 11/87 (12%)

Query: 4 KTLTAAAAVLLMLTAGCSTLERVVYRPDINQGNYLTANDVSKIRV--GMTQQQVAYALGT 61
K + AVL + AG LER ++ Q + + + VS+ + GMT ++ A
Sbjct: 69 KVV-LCGAVLARVDAGDEQLERKIH---YRQQDLVDYSPVSEKHLADGMTVGELCAA--A 122

Query: 62 PLMSDPFGTNTWFYVFRQQPGHEGVTQ 88
MSD N + G G+T
Sbjct: 123 ITMSDNSAANL---LLATVGGPAGLTA 146


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_02609  PRTACTNFAMLY  232    5e-65    Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 232 bits (593), Expect = 5e-65
Identities = 220/891 (24%), Positives = 344/891 (38%), Gaps = 103/891 (11%)

Query: 722 NDGGTLDVREKGSATGIQQSSQGAL-VATTRATRVTGTRADGVAFSIEQGAANNILLANG 780
N+ + E+ IQ S G + A+ +V+G +A G+ + A + NG
Sbjct: 37 NNQSIVKTGERQHGIHIQGSDPGGVRTASGTTIKVSGRQAQGILL---ENPAAELQFRNG 93

Query: 781 GVLT----VESDTSSDKTQVNMGGREIVKTKATATGTTLTGGEQ----IVEGVANETTIN 832
V + + V + ++V AT T + V G + +I
Sbjct: 94 SVTSSGQLSDDGIRRFLGTVTVKAGKLVADHATLANVGDTWDDDGIALYVAGEQAQASIA 153

Query: 833 DGGIQTVSANGEAIKTKINEGGTLTVNDNGKATDIVQN--------SGAALQTSTANGIE 884
D +Q + + D G +Q+ S L+ + +
Sbjct: 154 DSTLQGAGGVQIERGANVTVQR-SAIVDGGLHIGALQSLQPEDLPPSRVVLRDTNVTAVP 212

Query: 885 ISGTHQY------------GTFSISGNLATNMLLENGGNLLVLAGTEARDSTVG------ 926
SG G G A ++ L A D+ G
Sbjct: 213 ASGAPAAVSVLGASELTLDGGHITGGRAAGVAAMQGAVVHLQRATIRRGDAPAGGAVPGG 272

Query: 927 --KGGAMQNLGQDSATKVNSGGQYTL---GRSKDEFQALARAEDLQVA-----GGTAIVY 976
GGA+ G Y + G S + Q++ A +L A G V
Sbjct: 273 AVPGGAVPGGFGPGGFGPVLDGWYGVDVSGSSVELAQSIVEAPELGAAIRVGRGARVTVS 332

Query: 977 AGTLA--DASVSGATGSLSLMTPRDNVTPVKLEGAVRITDSA----------TLTLGNGV 1024
G+L+ +V G+ P+ + L+ A LTL G
Sbjct: 333 GGSLSAPHGNVIETGGARRFA-PQAAPLSITLQAGAHAQGKALLYRVLPEPVKLTLTGGA 391

Query: 1025 DTTLADLTA----------ASRGSVWLNSNNSCAG---------------TSNCEYRVNS 1059
D D+ A V L S G V +
Sbjct: 392 DA-QGDIVATELPSIPGTSIGPLDVALASQARWTGATRAVDSLSIDNATWVMTDNSNVGA 450

Query: 1060 LLLNDGDVYLSAQTAAPATTNGIYNTLTTNELSGSGNFYLHTNVAGSRGDQLVVNNNATG 1119
L L D + Q A A G + LT N L+GSG F ++ D+LVV +A+G
Sbjct: 451 LRL-ASDGSVDFQQPAEA---GRFKVLTVNTLAGSGLFRMNVFADLGLSDKLVVMQDASG 506

Query: 1120 NFKIFVQDTGVSPQSDDAMTLVKT-GGGDASFTLGNTGGFVDLGTYEYVLKSDGNSNWNL 1178
+++V+++G P S + + LV+T G A+FTL N G VD+GTY Y L ++GN W+L
Sbjct: 507 QHRLWVRNSGSEPASANTLLLVQTPLGSAATFTLANKDGKVDIGTYRYRLAANGNGQWSL 566

Query: 1179 TNDVKPNPDPIPNPKPDPKPDPKPDPNPKPDPTPDPTPTPVPEKRITPSTAAVLNMA--A 1236
P P PKP P+P P+P P+P P P P P + ++ + A +N
Sbjct: 567 VGAKAP-----PAPKPAPQPGPQPPQPPQPQPEA-PAPQPPAGRELSAAANAAVNTGGVG 620

Query: 1237 TLPLVFDAELNSIRERLNIMKASPHNNNVWGATYNTRNNVTTDAGAGFEQTLTGMTVGID 1296
++ AE N++ +RL ++ +P WG + R + AG F+Q + G +G D
Sbjct: 621 LASTLWYAESNALSKRLGELRLNPDAGGAWGRGFAQRQQLDNRAGRRFDQKVAGFELGAD 680

Query: 1297 SRNDIPEGITTLGAFMGYSHSHIGFDRGGHGSVGSYSLGGYASWEHESGFYLDGVVKLNR 1356
+ G LG GY+ GF G G S +GGYA++ +SGFYLD ++ +R
Sbjct: 681 HAVAVAGGRWHLGGLAGYTRGDRGFTGDGGGHTDSVHVGGYATYIADSGFYLDATLRASR 740

Query: 1357 FKSNVAGKMSSGGAANGSYHSNGLGGHIETGMRFT-DGNWNLTPYASLTGFTADNPEYHL 1415
+++ S G A G Y ++G+G +E G RFT W L P A L F A Y
Sbjct: 741 LENDFKVAGSDGYAVKGKYRTHGVGASLEAGRRFTHADGWFLEPQAELAVFRAGGGAYRA 800

Query: 1416 SNGMKSKSVDTRSIYRELGATLSYNMRLGNGMEVEPWLKAAVRKEFVDDNRVKVNSDGNF 1475
+NG++ + S+ LG + + L G +V+P++KA+V +EF V N +
Sbjct: 801 ANGLRVRDEGGSSVLGRLGLEVGKRIELAGGRQVQPYIKASVLQEFDGAGTVHTNGIAHR 860

Query: 1476 VNYLSGRRGIYQAGIKASFSSTLSGHLGVGYSHSAGVESPWNAVAGVNWSF 1526
L G R G+ A+ S + YS + PW AG +S+
Sbjct: 861 TE-LRGTRAELGLGMAAALGRGHSLYASYEYSKGPKLAMPWTFHAGYRYSW 910
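
Each alignment block in this report opens with a "Score = ..., Expect = ..." line followed by an "Identities = ..." line. A minimal sketch of how those two lines could be read programmatically is given here; the function names and the returned dictionary layout are illustrative assumptions and are not part of PredictBias or BLAST. Note that this output abbreviates very small E-values as "e-127", which has to be read as 1e-127.

```python
import re

# Patterns for the two statistics lines that open each alignment block.
SCORE_RE = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)"
)
IDENT_RE = re.compile(
    r"Identities\s*=\s*(\d+)/(\d+)\s*\((\d+)%\)"
    r"(?:,\s*Positives\s*=\s*(\d+)/(\d+)\s*\((\d+)%\))?"
    r"(?:,\s*Gaps\s*=\s*(\d+)/(\d+)\s*\((\d+)%\))?"
)


def parse_evalue(text):
    """Convert an E-value string to float, accepting the 'e-127' shorthand."""
    text = text.rstrip(",")
    if text.startswith("e"):
        text = "1" + text  # 'e-127' is shorthand for '1e-127'
    return float(text)


def parse_hit_stats(block):
    """Return the statistics of one alignment block, or None if not found."""
    score = SCORE_RE.search(block)
    ident = IDENT_RE.search(block)
    if score is None or ident is None:
        return None
    stats = {
        "bits": float(score.group(1)),
        "raw_score": int(score.group(2)),
        "evalue": parse_evalue(score.group(3)),
        "identical": int(ident.group(1)),
        "aligned_length": int(ident.group(2)),
        "positives": int(ident.group(4)) if ident.group(4) else None,
        # The Gaps field is omitted from the line when there are no gaps.
        "gaps": int(ident.group(7)) if ident.group(7) else 0,
    }
    stats["identity_fraction"] = stats["identical"] / stats["aligned_length"]
    return stats


example = """Score = 232 bits (593), Expect = 5e-65
Identities = 220/891 (24%), Positives = 344/891 (38%), Gaps = 103/891 (11%)"""
stats = parse_hit_stats(example)
```

The identity fraction is recomputed from the raw counts rather than taken from the rounded percentage printed in parentheses.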


39  CMJKDNLE_02666  CMJKDNLE_02684  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
CMJKDNLE_02666  1       14       3.408318    GutR DNA-binding transcriptional repressor
CMJKDNLE_02667  1       14       3.843340    D-arabinose 5-phosphate isomerase
CMJKDNLE_02668  0       15       3.326311    NorR DNA-binding transcriptional dual regulator
CMJKDNLE_02669  0       15       3.179203    flavorubredoxin
CMJKDNLE_02670  0       14       2.849758    flavorubredoxin reductase
CMJKDNLE_02671  -1      15       2.906709    hydrogenase maturation protein,
CMJKDNLE_02672  -1      16       1.564393    putative electron transport protein HydN
CMJKDNLE_02673  0       17       2.081002    AscG DNA-binding transcriptional repressor
CMJKDNLE_02674  -1      19       2.421974    beta-glucoside PTS permease AscF - cryptic
CMJKDNLE_02675  -1      24       2.761276    6-phospho-beta-glucosidase; cryptic
CMJKDNLE_02676  -1      28       4.416836    hydrogenase 3 maturation protease
CMJKDNLE_02677  -1      26       5.147533    protein required for maturation of hydrogenase
CMJKDNLE_02678  0       26       5.450008    hydrogenase 3 and formate hydrogenlyase complex,
CMJKDNLE_02679  1       26       5.030926    formate hydrogenlyase complex iron-sulfur
CMJKDNLE_02680  1       25       4.917619    hydrogenase 3, large subunit
CMJKDNLE_02681  3       22       4.633704    hydrogenase 3, membrane subunit
CMJKDNLE_02682  3       22       4.158794    hydrogenase 3, membrane subunit
CMJKDNLE_02683  2       20       2.921322    hydrogenase 3, Fe-S subunit
CMJKDNLE_02684  1       19       3.146504    regulator of the transcriptional regulator FhlA
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_02666  ARGREPRESSOR  29     0.014    Bacterial arginine repressor signature.
		>ARGREPRESSOR#Bacterial arginine repressor signature.

Length = 149

Score = 28.7 bits (64), Expect = 0.014
Identities = 20/105 (19%), Positives = 35/105 (33%), Gaps = 17/105 (16%)

Query: 1 MKPRQRQAAILEYLQKQGKCSVEEL-----AQYFDTTGTTIRKDLVILEHAGTVIRTYGG 55
M QR I E + + +EL ++ T T+ +D+ E + T G
Sbjct: 1 MNKGQRHIKIREIITANEIETQDELVDILKKDGYNVTQATVSRDIK--ELHLVKVPTNNG 58

Query: 56 ---VVLNKEESDPPIDHKTLINTHKKELIAEAAVSFIHDGDSIIL 97
L ++ P+ K + +A V I+L
Sbjct: 59 SYKYSLPADQRFNPLS-------KLKRSLMDAFVKIDSASHLIVL 96


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
CMJKDNLE_02668  HTHFIS  374    e-127    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 374 bits (961), Expect = e-127
Identities = 125/388 (32%), Positives = 196/388 (50%), Gaps = 33/388 (8%)

Query: 149 IAALAAGALS----------NALLIEQLESQNMLPGDATPFEAVKQTQMIGLSPGMTQLK 198
I A GA +I + ++ ++ ++G S M ++
Sbjct: 91 IKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIY 150

Query: 199 KEIEIVAASDLNVLISGETGTGKELVAKAIHEASPRAVNPLVYLNCAALPESVAESELFG 258
+ + + +DL ++I+GE+GTGKELVA+A+H+ R P V +N AA+P + ESELFG
Sbjct: 151 RVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFG 210

Query: 259 HVKGAFTGAISNRSGKFEMADNGTLFLDEIGELSLALQAKLLRVLQYGDIQRVGDDRCLR 318
H KGAFTGA + +G+FE A+ GTLFLDEIG++ + Q +LLRVLQ G+ VG +R
Sbjct: 211 HEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIR 270

Query: 319 VDVRVLAATNRDLREEVLAGRFRADLFHRLSVFPLSVPPLRERGDDVILLAGYFCEQCRL 378
DVR++AATN+DL++ + G FR DL++RL+V PL +PPLR+R +D+ L +F +Q
Sbjct: 271 SDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAE- 329

Query: 379 RQGLSRVVLSAGARNLLQHYSFPGNVRELEHAIHRAVVLARATRSGDEVIL-----EAQH 433
++GL A L++ + +PGNVRELE+ + R L E+I E
Sbjct: 330 KEGLDVKRFDQEALELMKAHPWPGNVRELENLVRRLTALYPQDVITREIIENELRSEIPD 389

Query: 434 FAFPEVTLPTPEVAAVPVVKQNLR-----------------EATEAFQRETIRQALAQNH 476
+ + ++ V++N+R + I AL
Sbjct: 390 SPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATR 449

Query: 477 HNWAACARMLETDVANLHRLAKRLGLKD 504
N A +L + L + + LG+
Sbjct: 450 GNQIKAADLLGLNRNTLRKKIRELGVSV 477


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
CMJKDNLE_02673  HTHTETR  28     0.036    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 28.4 bits (63), Expect = 0.036
Identities = 17/93 (18%), Positives = 29/93 (31%), Gaps = 7/93 (7%)

Query: 3 TTMLEVAKRAGVSKATVSRVLSG-----NGYVSQETKDRVFQAVEESGYRPNLLARNLSA 57
T++ E+AK AGV++ + + + +E P L
Sbjct: 32 TSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELEYQAKFPGDPLSVLRE 91

Query: 58 KSTQTLGLVVTNTLYHGIYFSELLFHAARMAEE 90
L VT + E++FH E
Sbjct: 92 ILIHVLESTVTEERRRLLM--EIIFHKCEFVGE 122


40  CMJKDNLE_02713  CMJKDNLE_02740  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
CMJKDNLE_02713  0       36       -4.451209   alkaline phosphatase isozyme conversion protein
CMJKDNLE_02714  1       40       -5.057965   putative endoribonuclease
CMJKDNLE_02715  1       40       -5.901664   multifunctional nuclease
CMJKDNLE_02716  2       41       -7.115046   crRNA endonuclease
CMJKDNLE_02717  2       36       -6.539013   Cascade subunit D
CMJKDNLE_02718  0       23       -4.494830   Cascade subunit C
CMJKDNLE_02719  -2      16       -2.225323   Cascade subunit B
CMJKDNLE_02720  -1      14       -2.050651   Cascade subunit A
CMJKDNLE_02721  -2      11       -0.222681   protein involved in CRISPR R-loop formation and
CMJKDNLE_02723  0       14       2.960148    3'-phospho-adenylylsulfate reductase
CMJKDNLE_02724  0       13       2.663671    sulfite reductase, hemoprotein subunit
CMJKDNLE_02725  0       12       2.332401    sulfite reductase, flavoprotein subunit
CMJKDNLE_02726  1       17       1.780759    6-carboxy-5,6,7,8-tetrahydropterin synthase
CMJKDNLE_02727  3       16       1.944442    putative oxidoreductase with FAD/NAD(P)-binding
CMJKDNLE_02728  1       12       0.661314    putative 4Fe-4S cluster-containing protein
CMJKDNLE_02729  1       10       -0.378405   putative anti-terminator regulatory protein
CMJKDNLE_02730  1       9        -0.298753   putative flavoprotein
CMJKDNLE_02731  1       9        -1.327840   putative flavoprotein
CMJKDNLE_02732  1       10       -1.785028   YgcS MFS transporter
CMJKDNLE_02733  1       11       -3.159781   putative FAD-containing dehydrogenase
CMJKDNLE_02734  2       15       -4.905071   putative deoxygluconate dehydrogenase
CMJKDNLE_02735  -1      15       -3.967180   YqcE MFS transporter
CMJKDNLE_02736  0       19       -3.272357   putative kinase
CMJKDNLE_02737  0       22       -3.047881   hypothetical protein
CMJKDNLE_02738  1       27       -3.595549   small protein involved in the cell envelope
CMJKDNLE_02739  2       27       -2.924293   hypothetical protein
CMJKDNLE_02740  2       25       -0.240136   degradosome
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_02720  FLGMRINGFLIF  30     0.029    Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 29.5 bits (66), Expect = 0.029
Identities = 22/130 (16%), Positives = 40/130 (30%), Gaps = 16/130 (12%)

Query: 261 SAPSWTQISRVVVDKIIQNENGNRVAAVVNQ-FRNIAPQSPLELIMGGYRNNQASILERR 319
+A QI + + + ++ VVN F + EL Q S +++
Sbjct: 404 TADQMKQIEDLTREAMGFSDKRGDTLNVVNSPFSAVDNTGG-ELPF----WQQQSFIDQ- 457

Query: 320 HDVLMFNQGWQQYGNVINEIVTVGLGYKTALRKALYTFAEGFKNKDFKGAGVSVHETAER 379
G + +V + ++ A+R L E K + E A
Sbjct: 458 ---------LLAAGRWLLVLVVAWILWRKAVRPQLTRRVEEAKAAQEQAQVRQETEEAVE 508

Query: 380 HFYRQSELLI 389
+ E L
Sbjct: 509 VRLSKDEQLQ 518


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
CMJKDNLE_02724  PF07675  30     0.021    Cleaved Adhesin
		>PF07675#Cleaved Adhesin

Length = 1358

Score = 30.4 bits (68), Expect = 0.021
Identities = 20/92 (21%), Positives = 39/92 (42%), Gaps = 12/92 (13%)

Query: 206 ILGQTYLPRKFKTTVVIP---PQND--IDLHANDMNFVAIAENGKLVGFNLLVGGGLSIE 260
++ +P+ T +P PQN + A+ ++VAI+++G L G + G++
Sbjct: 240 VMPYRAMPKT--NTYTLPASLPQNQASYSIQASAGSYVAISKDGVLYGTGVANASGVATV 297

Query: 261 HGNK-----KTYARTASEFGYLPLEHTLAVAE 287
+ K Y + YLP+ + E
Sbjct: 298 NMTKQITENGNYDVVITRSNYLPVIKQIQAGE 329


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
CMJKDNLE_02732  TCRTETB  35     4e-04    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 35.2 bits (81), Expect = 4e-04
Identities = 45/314 (14%), Positives = 112/314 (35%), Gaps = 36/314 (11%)

Query: 93 LGSLVLGWISDHIGRQKIFTFSFLLITLASFLQFFATTP-EHLIGLRILIGIGLGGDYSV 151
+G+ V G +SD +G +++ F ++ S + F + LI R + G G ++
Sbjct: 64 IGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPAL 123

Query: 152 GHTLLAEFSPRRHRGILLGAFSVVWT----VGYVLASIAGHHFISENPEAWRWLLASAAL 207
++A + P+ +RG G + VG + + H+ W +LL +
Sbjct: 124 VMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYI------HWSYLLLIPMI 177

Query: 208 PALLITLLRWGTPESPRWLLRQGRFAEAHAIVHRYFGPHVLLGDEVVTATHKHIKTLF-- 265
+ + L + R +G F I+ +L + + + L
Sbjct: 178 TIITVPFLMKLLKKEVR---IKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFL 234

Query: 266 -SSRYWRRTA--------FNSVFFVCLVIPWFVIYT----WLPTIAQTIGLEDALTASLM 312
++ R+ ++ F+ V+ +I+ ++ + + L+ + +
Sbjct: 235 IFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEI 294

Query: 313 LNALLIVGALLGLV-------LTHLLAHRKFLLGSFLLLAATLVVMACLPSGSSLTLLLF 365
+ ++ G + ++ L L L+ + + + L +S + +
Sbjct: 295 GSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTII 354

Query: 366 VLFSTTISAVSNLV 379
++F + + V
Sbjct: 355 IVFVLGGLSFTKTV 368


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_02734  DHBDHDRGNASE  102    4e-28    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 102 bits (255), Expect = 4e-28
Identities = 73/257 (28%), Positives = 116/257 (45%), Gaps = 11/257 (4%)

Query: 11 MDFFSLKGKTAIVTGGNSGLGQAFAMALAKAGANIFIPSFVKDNGETKEMIEK-QGVEVD 69
M+ ++GK A +TG G+G+A A LA GA+I + + E K + +
Sbjct: 1 MNAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAE 60

Query: 70 FMQVGITAEGAPQKIIAACCERFGTVDILVNNAGICKLNKVLDFGRADWDPMIDVNLTAA 129
+ A +I A G +DILVN AG+ + + +W+ VN T
Sbjct: 61 AFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGV 120

Query: 130 FELSYEAAKIMIPQKSGKIINICSLFSYLGGQWSPAYSATKHALAGFTKAYCDELGQYNI 189
F S +K M+ ++SG I+ + S + + AY+++K A FTK EL +YNI
Sbjct: 121 FNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNI 180

Query: 190 QVNGIAPGYYATDI--TLATRSNPETNQRVLDH-------IPANRWGDTQDLMGAAVFLA 240
+ N ++PG TD+ +L N Q + IP + D+ A +FL
Sbjct: 181 RCNIVSPGSTETDMQWSLWADENGAE-QVIKGSLETFKTGIPLKKLAKPSDIADAVLFLV 239

Query: 241 SPASNYVNGHLLVVDGG 257
S + ++ H L VDGG
Sbjct: 240 SGQAGHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
CMJKDNLE_02735  TCRTETA  30     0.018    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 30.2 bits (68), Expect = 0.018
Identities = 22/103 (21%), Positives = 45/103 (43%), Gaps = 8/103 (7%)

Query: 48 GLIMSTFGIAAIILYAPSGVIADKFSHRKMITSAMIITGLLGLLMATYPPLWVMLCIQIA 107
G++++ + + G ++D+F R ++ ++ + +MAT P LWV+ +I
Sbjct: 46 GILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIV 105

Query: 108 FAITTILMLWSVSIKAASLLGD---HSEQGKIMGWMEGLRGVG 147
IT + A + + D E+ + G+M G G
Sbjct: 106 AGITG-----ATGAVAGAYIADITDGDERARHFGFMSACFGFG 143


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
CMJKDNLE_02739  cloacin  33     0.001    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 33.1 bits (75), Expect = 0.001
Identities = 15/36 (41%), Positives = 20/36 (55%)

Query: 253 ASGRSYHSDNSGSAGGSDSGGFSGGGGSSGGGGASG 288
+ G + S+N+ GGS SG GGG G GG +G
Sbjct: 34 SDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNG 69



Score = 30.8 bits (69), Expect = 0.006
Identities = 11/34 (32%), Positives = 14/34 (41%)

Query: 254 SGRSYHSDNSGSAGGSDSGGFSGGGGSSGGGGAS 287
SG H G G SGGG +GG ++
Sbjct: 50 SGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSA 83



Score = 30.1 bits (67), Expect = 0.012
Identities = 11/30 (36%), Positives = 11/30 (36%)

Query: 259 HSDNSGSAGGSDSGGFSGGGGSSGGGGASG 288
GGS G G G S GG G G
Sbjct: 50 SGSGIHWGGGSGHGNGGGNGNSGGGSGTGG 79



Score = 29.7 bits (66), Expect = 0.013
Identities = 12/34 (35%), Positives = 17/34 (50%), Gaps = 1/34 (2%)

Query: 255 GRSYHSDNSGSAGGSDSGGFSGGGGSSGGGGASG 288
GR H+ + S G+ +GG +G G G SG
Sbjct: 6 GRG-HNTGAHSTSGNINGGPTGLGVGGGASDGSG 38


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_02740  ANTHRAXTOXNA  29     0.038    Anthrax toxin LF subunit signature.
		>ANTHRAXTOXNA#Anthrax toxin LF subunit signature.

Length = 800

Score = 29.3 bits (65), Expect = 0.038
Identities = 31/132 (23%), Positives = 51/132 (38%), Gaps = 9/132 (6%)

Query: 211 GYAPNLGSNAEALAVIAEAVKAAGYELGKDITLAMDCAASEFYKDGKYVLA-----GEGN 265
P L N + A+ +E K YE+GK I+L + + ++ + +
Sbjct: 147 RETPKLIINIKDYAINSEQSKEVYYEIGKGISLDIISKDKSLDPEFLNLIKSLSDDSDSS 206

Query: 266 KAFTSEEFTHFLEELTKQYPIVSIEDGLDESDW---DGFAYQTKVLG-DKIQLVGDDLFV 321
S++F LE K I I++ L E F+Y ++L D+F
Sbjct: 207 DLLFSQKFKEKLELNNKSIDINFIKENLTEFQHAFSLAFSYYFAPDHRTVLELYAPDMFE 266

Query: 322 TNTKILKEGIEK 333
K+ K G EK
Sbjct: 267 YMNKLEKGGFEK 278


41  CMJKDNLE_02807  CMJKDNLE_02834  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
CMJKDNLE_02807  0       16       -3.190392   arabinose:H+ symporter
CMJKDNLE_02808  1       20       -4.466586   2-deoxy-D-gluconate 3-dehydrogenase
CMJKDNLE_02809  3       25       -6.534867   putative 5-keto 4-deoxyuronate isomerase
CMJKDNLE_02810  1       31       -8.802527   putative acyltransferase
CMJKDNLE_02811  3       36       -12.979514  YqeG STP transporter
CMJKDNLE_02812  5       47       -16.559766  hypothetical protein
CMJKDNLE_02813  5       49       -17.635804  hypothetical protein
CMJKDNLE_02814  5       47       -17.649910  putative transcriptional regulator
CMJKDNLE_02815  9       50       -18.667940  hypothetical protein
CMJKDNLE_02816  9       51       -18.385390  small protein
CMJKDNLE_02817  9       51       -18.219101  hypothetical protein
CMJKDNLE_02818  9       52       -17.863250  hypothetical protein
CMJKDNLE_02819  8       50       -17.501439  putative chaperone
CMJKDNLE_02820  10      51       -17.999369  putative transcriptional regulator
CMJKDNLE_02821  8       54       -18.417783  hypothetical protein
CMJKDNLE_02822  8       55       -17.712147  hypothetical protein
CMJKDNLE_02823  7       55       -17.091322  putative DNA-binding transcriptional regulator
CMJKDNLE_02824  8       54       -16.835894  hypothetical protein
CMJKDNLE_02825  7       54       -17.348044  hypothetical protein
CMJKDNLE_02826  6       49       -14.329146  hypothetical protein
CMJKDNLE_02827  3       34       -9.107200   Lipoprotein PrgK
CMJKDNLE_02828  3       28       -6.179308   hypothetical protein
CMJKDNLE_02829  5       27       -4.594139   Yop proteins translocation protein F
CMJKDNLE_02830  7       28       -3.956172   hypothetical protein
CMJKDNLE_02831  8       29       -2.485870   hypothetical protein
CMJKDNLE_02832  11      34       -1.316492   hypothetical protein
CMJKDNLE_02833  9       33       -1.771446   hypothetical protein
CMJKDNLE_02834  3       24       -0.911457   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
CMJKDNLE_02807  TCRTETB  56     2e-10    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 55.7 bits (134), Expect = 2e-10
Identities = 39/167 (23%), Positives = 69/167 (41%), Gaps = 1/167 (0%)

Query: 38 LDIGVIAGALPFITDHFVLTSRLQEWVVSSMMLGAAIGALFNGWLSFRLGRKYSLMAGAI 97
L+ V+ +LP I + F WV ++ ML +IG G LS +LG K L+ G I
Sbjct: 28 LNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGII 87

Query: 98 LFVLGSIGSAFATS-VEMLIAARVVLGIAVGIASYTAPLYLSEMASENVRGKMISMYQLM 156
+ GS+ S +LI AR + G + ++ + RGK + +
Sbjct: 88 INCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSI 147

Query: 157 VTLGIVLAFLSDTAFSYSGNWRAMLGVLALPAVLLIILVVFLPNSPR 203
V +G + ++ +W +L + + + + L+ L R
Sbjct: 148 VAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVR 194


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_02808  DHBDHDRGNASE  111    1e-31    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 111 bits (278), Expect = 1e-31
Identities = 72/257 (28%), Positives = 129/257 (50%), Gaps = 11/257 (4%)

Query: 3 LSAFSLEGKVAVVTGCDTGLGQGMALGLAQAGCDIVGIN-IVEPTETIEQ-VTALGRRFL 60
++A +EGK+A +TG G+G+ +A LA G I ++ E E + + A R
Sbjct: 1 MNAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAE 60

Query: 61 SLTADLRKIDGIPALLDRAVAEFGHIDILVNNAGLIRREDALEFSEKDWDDVMNLNIKSV 120
+ AD+R I + R E G IDILVN AG++R S+++W+ ++N V
Sbjct: 61 AFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGV 120

Query: 121 FFMSQAAAKHFIAQGNGGKIINIASMLSFQGGIRVPSYTASKSGVMGVTRLMANEWAKHN 180
F S++ +K+ + + G I+ + S + + +Y +SK+ + T+ + E A++N
Sbjct: 121 FNASRSVSKYMMDR-RSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYN 179

Query: 181 INVNAIAPGYMATNNTQQLRADEQRSAEILD--------RIPAGRWGLPSDLMGPIVFLA 232
I N ++PG T+ L ADE + +++ IP + PSD+ ++FL
Sbjct: 180 IRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLV 239

Query: 233 SSASDYVNGYTIAVDGG 249
S + ++ + + VDGG
Sbjct: 240 SGQAGHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_02819  SYCDCHAPRONE  71     4e-18    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 70.7 bits (173), Expect = 4e-18
Identities = 28/164 (17%), Positives = 65/164 (39%), Gaps = 9/164 (5%)

Query: 1 MSTETIEIFNNSDEWANQLKHALSKGENLALLHGLTPDILDRIYAYAFDYHEKGNITDAE 60
M ET + + E+ ++ L G +A+L+ ++ D L+++Y+ AF+ ++ G DA
Sbjct: 1 MQQETTD----TQEYQLAMESFLKGGGTIAMLNEISSDTLEQLYSLAFNQYQSGKYEDAH 56

Query: 61 IYYKFLCIYAFENHEYLKDFASVCQPKKKYQQAYDLYKLSYNYFPYDDYSVIYRMGQCQI 120
++ LC+ + + + Q +Y A Y + + +C +
Sbjct: 57 KVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDI-KEPRFPFHAAECLL 115

Query: 121 GAKNIDNAMQCFYH----IINNCEDDSVKSKAQAYIELLNDNSE 160
+ A + I + E + ++ + +E + E
Sbjct: 116 QKGELAEAESGLFLAQELIADKTEFKELSTRVSSMLEAIKLKKE 159


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_02827  FLGMRINGFLIF  35     3e-04    Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 34.6 bits (79), Expect = 3e-04
Identities = 22/126 (17%), Positives = 49/126 (38%), Gaps = 5/126 (3%)

Query: 4 ISLLLFILLLCGCKQQE-LLNHLDQQQANDVLAVLQRHNINAEKKDQGKTGFSIYVEPTD 62
+++++ ++L L ++L Q ++A L + NI + I V
Sbjct: 35 VAIVVAMVLWAKTPDYRTLFSNLSDQDGGAIVAQLTQMNIPYRFANGSGA---IEVPADK 91

Query: 63 FASAVDWLKIYNLPGKPDIQISQMFPADALVSSPRAEKARLYSAIEQRLEQSLKIMDGIV 122
L LP + + + S +E+ A+E L ++++ + +
Sbjct: 92 VHELRLRLAQQGLPKGGAVGFE-LLDQEKFGISQFSEQVNYQRALEGELARTIETLGPVK 150

Query: 123 SSRVHV 128
S+RVH+
Sbjct: 151 SARVHL 156


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits         Score  E-value  Comments
CMJKDNLE_02833  IGASERPTASE  28     0.012    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 28.1 bits (62), Expect = 0.012
Identities = 18/98 (18%), Positives = 31/98 (31%), Gaps = 2/98 (2%)

Query: 31 FTEIVVTSMLNGLPALSAGAHAILTSLHAAGLNANDYGAY--SRAWAESNAEARREAERQ 88
E V P+ + A + + + N+ A + E EA+ +
Sbjct: 1020 VDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKAN 1079

Query: 89 RIENEKDRQRIAAMYATPEEIAKEAAERKERKAELERR 126
NE + E + A KE KA++E
Sbjct: 1080 TQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETE 1117


42  CMJKDNLE_02940  CMJKDNLE_02983  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
CMJKDNLE_02940  4       26       -4.016202   putative transport protein
CMJKDNLE_02942  6       29       -4.146155   *KpLE2 phage-like element; predicted integrase
CMJKDNLE_02943  6       29       -4.420787   putative hydrolase, inner membrane
CMJKDNLE_02944  5       31       -4.911085   Outer membrane protein PagN
CMJKDNLE_02945  7       32       -4.548682   hypothetical protein
CMJKDNLE_02946  7       29       -3.634658   hypothetical protein
CMJKDNLE_02947  7       32       -4.554312   member of ATP-dependent helicase superfamily II
CMJKDNLE_02948  8       30       -4.400332   hypothetical protein
CMJKDNLE_02949  8       28       -3.299844   hypothetical protein
CMJKDNLE_02950  9       27       -1.608503   hypothetical protein
CMJKDNLE_02951  8       25       -1.466713   hypothetical protein
CMJKDNLE_02952  9       25       -0.986221   hypothetical protein
CMJKDNLE_02953  10      28       2.672735    CP4-44 prophage; predicted DNA repair protein
CMJKDNLE_02954  8       27       1.731578    CP4-44 prophage; predicted protein
CMJKDNLE_02955  9       28       -0.984900   hypothetical protein
CMJKDNLE_02956  7       27       -0.843263   CP4-44 prophage; antitoxin of the CbtA-CbeA
CMJKDNLE_02957  3       21       0.635815    CP4-44 prophage; toxin of the CbtA-CbeA
CMJKDNLE_02958  1       19       1.578587    CP4-44 prophage; predicted protein
CMJKDNLE_02959  1       16       2.404566    hypothetical protein
CMJKDNLE_02960  1       18       2.933717    hypothetical protein
CMJKDNLE_02961  1       17       4.447191    putative secretion pathway protein, M-type
CMJKDNLE_02962  0       14       4.366474    Type II secretion system protein L
CMJKDNLE_02963  1       18       4.764057    putative protein secretion protein for export
CMJKDNLE_02964  0       20       4.766340    putative protein secretion protein for export
CMJKDNLE_02965  0       23       5.474638    putative protein secretion protein for export
CMJKDNLE_02966  -1      19       4.475288    putative protein secretion protein for export
CMJKDNLE_02967  -1      16       3.894035    putative protein secretion protein for export
CMJKDNLE_02968  -1      15       3.374048    putative protein secretion protein for export
CMJKDNLE_02969  -2      13       2.249996    putative protein secretion protein for export
CMJKDNLE_02970  -1      13       1.207498    putative protein secretion protein for export
CMJKDNLE_02971  -1      13       0.489515    putative protein secretion protein for export
CMJKDNLE_02972  -3      12       0.210630    putative secretion pathway protein, C-type
CMJKDNLE_02973  -2      12       0.397690    putative lipoprotein
CMJKDNLE_02974  -3      12       1.076708    prepilin peptidase
CMJKDNLE_02975  -3      13       1.583348    putative lipoprotein
CMJKDNLE_02976  -2      13       3.033728    glycolate / lactate:H+ symporter
CMJKDNLE_02978  0       13       3.404611    malate synthase G
CMJKDNLE_02979  1       11       3.046802    hypothetical protein
CMJKDNLE_02980  0       10       2.671525    glycolate oxidase, predicted iron-sulfur
CMJKDNLE_02981  1       10       2.581837    glycolate oxidase, predicted FAD-binding
CMJKDNLE_02982  1       10       1.529795    glycolate oxidase, predicted FAD-linked subunit
CMJKDNLE_02983  2       11       0.430286    GlcC transcriptional dual regulator
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits        Score  E-value  Comments
CMJKDNLE_02944  OMPADOMAIN  43     4e-07    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 43.0 bits (101), Expect = 4e-07
Identities = 52/246 (21%), Positives = 79/246 (32%), Gaps = 55/246 (22%)

Query: 1 MNKVIAVSALAMAGMFSTQALADESKTGFYVTGKAGASVMSLADQRFLSGDGEETSKYKG 60
M K A+A+AG F+T A A +Y K G S D F++
Sbjct: 1 MKKTAIAIAVALAG-FATVAQAAPKDNTWYTGAKLGWS--QYHDTGFIN---------NN 48

Query: 61 GDGHDTVFSGGIAAGYDFYPQFSIPVRTELEFYARGKADSKYNVDKDSWSGGYWRDDLKN 120
G H+ G GY P E+ + G+ K +V+ +
Sbjct: 49 GPTHENQLGAGAFGGYQVNPYVGF----EMGYDWLGRMPYKGSVE-----------NGAY 93

Query: 121 EVSVNTLMLNAYYDFRNDSAFTPWVSAGIGYARIHQKTTGISTWDYGYGSSGRESLSRSG 180
+ L Y +D + G R K YG + +S
Sbjct: 94 KAQGVQLTAKLGYPITDD--LDIYTRLGGMVWRADTK-------SNVYGKNHDTGVS--- 141

Query: 181 SADNFAWSFGAGVRYDVTPDIALDLSYRYLDAGDSSVSYKDEWGDKYKSEVDVKSHDIML 240
F GV Y +TP+IA L Y++ + GD + + + L
Sbjct: 142 ------PVFAGGVEYAITPEIATRLEYQWT----------NNIGDAHTIGTRPDNGMLSL 185

Query: 241 GVTYNF 246
GV+Y F
Sbjct: 186 GVSYRF 191


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
CMJKDNLE_02945  TCRTETB  31     0.023    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 30.6 bits (69), Expect = 0.023
Identities = 25/149 (16%), Positives = 47/149 (31%), Gaps = 17/149 (11%)

Query: 483 HHVKADVDGVLVLFPAGERGRFSPSPEFITAVLTLRLGSVVALIDNS------------L 530
H + G +++FP +I +L R G + L L
Sbjct: 287 HQLSTAEIGSVIIFPGTMSVIIF---GYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFL 343

Query: 531 DQAEQKVLENAINNNASFSDDEKRSLHAYLTWQLHTPANMTGMK--SRIELLGAAEKSAV 588
+ + I K + ++ L GM + L A+
Sbjct: 344 LETTSWFMTIIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAI 403

Query: 589 GKVIVSVACSDGRITPAEIKQLEKIYTSL 617
++S+ D R+ P E+ Q +Y++L
Sbjct: 404 VGGLLSIPLLDQRLLPMEVDQSTYLYSNL 432


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_02964  BCTERIALGSPG  32     8e-04    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 31.8 bits (72), Expect = 8e-04
Identities = 18/48 (37%), Positives = 24/48 (50%), Gaps = 5/48 (10%)

Query: 1 MRRTR--AGFTLLEMLVAIAIFASLA-LMAQQVTNGVTR--VNSAVAD 43
MR T GFTLLE++V I I LA L+ + + AV+D
Sbjct: 1 MRATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSD 48


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_02965  BCTERIALGSPH  33     9e-05    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 33.4 bits (76), Expect = 9e-05
Identities = 13/24 (54%), Positives = 18/24 (75%)

Query: 2 KRGFTLLEVMLALAIFALAATAVL 25
+RGFTLLE+ML L + ++A VL
Sbjct: 3 QRGFTLLEMMLILLLMGVSAGMVL 26


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_02966  BCTERIALGSPH  74     4e-19    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 74.2 bits (182), Expect = 4e-19
Identities = 41/196 (20%), Positives = 69/196 (35%), Gaps = 41/196 (20%)

Query: 1 MPERGFTLLEIMLVIFLIGLASSGVVQTFATDSEPPAKKAAQDFLTRFAQFKDRAVIEGQ 60
M +RGFTLLE+ML++ L+G+++ V+ F + A + F + + R + GQ
Sbjct: 1 MRQRGFTLLEMMLILLLMGVSAGMVLLAFPASRDDSAAQTLARFEAQLRFVQQRGLQTGQ 60

Query: 61 TLGVLIDAPGYQFMQRRQGQWLPVSATRLSAQVTVPKQVQMLLQPGSDIWQKEYALELQR 120
GV + +QF+ + P D W L L+
Sbjct: 61 FFGVSVHPDRWQFLVLEARDGADPA-------------------PADDGWSGYRWLPLRA 101

Query: 121 RRL----TLHDIELEL-----QKEAKKKTPQIRFSPFEPATPFTLRFYSAAQNACWAVKL 171
R+ ++ +L L + P + P TPF L L
Sbjct: 102 GRVATSGSIAGGKLNLAFAQGEAWTPGDNPDVLIFPGGEMTPFRLT-------------L 148

Query: 172 AHDGALSLNQCDERMP 187
++ N E +P
Sbjct: 149 GEAPGIAFNARGESLP 164


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_02967  BCTERIALGSPG  218    2e-76    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 218 bits (556), Expect = 2e-76
Identities = 91/146 (62%), Positives = 109/146 (74%), Gaps = 3/146 (2%)

Query: 6 RTQKPRAGFTLLEVMVVIVILGVLASLVVPNLLGNKEKADRQKAISDIVALENALDMYRL 65
R + GFTLLE+MVVIVI+GVLASLVVPNL+GNKEKAD+QKA+SDIVALENALDMY+L
Sbjct: 2 RATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKL 61

Query: 66 DNGRYPTTEQGLEALIQQPANMADARNYRTGGYIKRLPKDPWGNDYQYLSPGEKGLFDVY 125
DN YPTT QGLE+L++ P A NY GYIKRLP DPWGNDY ++PGE G +D+
Sbjct: 62 DNHHYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDLL 121

Query: 126 TLGADGQENGEGAGADIGNWNLQEFQ 151
+ G DG+ E DI NW L + +
Sbjct: 122 SAGPDGEMGTED---DITNWGLSKKK 144


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_02968  BCTERIALGSPF  455    e-162    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 455 bits (1171), Expect = e-162
Identities = 226/406 (55%), Positives = 302/406 (74%), Gaps = 1/406 (0%)

Query: 1 MALFYYQALERNGRKTKGMIEADSARHARQLLRGKELIPVHI-EARMNTSSGGMLQRRRH 59
MA ++YQAL+ G+K +G EADSAR ARQLLR + L+P+ + E R + G
Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLR 60

Query: 60 AHRRVAAADLALFTRQLATLVQAAMPLETCLQAVSEQSEKLHVKSLGMALRSRIQEGYTL 119
R++ +DLAL TRQLATLV A+MPLE L AV++QSEK H+ L A+RS++ EG++L
Sbjct: 61 RKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSL 120

Query: 120 SDSLREHPRVFDSLFCSMVAAGEKSGHLDVVLNRLADYTEQRQRLKSRLLQAMLYPLVLL 179
+D+++ P F+ L+C+MVAAGE SGHLD VLNRLADYTEQRQ+++SR+ QAM+YP VL
Sbjct: 121 ADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLT 180

Query: 180 VVATGVVTILLTAVVPKIIEQFDHLGHALPVSTRTLIAMSDALQASGVYWLAGLLGLLVL 239
VVA VV+ILL+ VVPK++EQF H+ ALP+STR L+ MSDA++ G + L LL +
Sbjct: 181 VVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMA 240

Query: 240 GQRLLKNPAMRLRWDKTLLRLPVIGRVARGLNTARFSRTLSILTASSVPLLEGIQTAAAV 299
+ +L+ R+ + + LL LP+IGR+ARGLNTAR++RTLSIL AS+VPLL+ ++ + V
Sbjct: 241 FRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDV 300

Query: 300 SANRYVEQQLLLAADRVREGSSLRAALAELRLFPPMMLYMIASGEQSGELETMLEQAAVN 359
+N Y +L LA D VREG SL AL + LFPPMM +MIASGE+SGEL++MLE+AA N
Sbjct: 301 MSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADN 360

Query: 360 QEREFDTQVGLALGLFEPALVVMMAGVVLFIVIAILEPMLQLNNMV 405
Q+REF +Q+ LALGLFEP LVV MA VVLFIV+AIL+P+LQLN ++
Sbjct: 361 QDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLM 406


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_02971  BCTERIALGSPD  574    0.0      Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 574 bits (1482), Expect = 0.0
Identities = 295/668 (44%), Positives = 431/668 (64%), Gaps = 34/668 (5%)

Query: 24 LLPLVLAAALCSSPVWAEEATFTANFKDTDLKSFIETVGANLNKTIIMGPGVQGKVSIRT 83
L L++ AAL P AEE F+A+FK TD++ FI TV NLNKT+I+ P V+G +++R+
Sbjct: 11 SLTLLIFAALLFRPAAAEE--FSASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTITVRS 68

Query: 84 MTPLNERQYYQLFLNLLEAQGYAVVPMENDVLKVVKSSAAKVEPLPLVGEGSDNYAGDEM 143
LNE QYYQ FL++L+ G+AV+ M N VLKVV+S AK +P+ + + GDE+
Sbjct: 69 YDMLNEEQYYQFFLSVLDVYGFAVINMNNGVLKVVRSKDAKTAAVPVASDAAPG-IGDEV 127

Query: 144 VTKVVPVRNVSVRELAPILRQMIDSAGSGNVVNYDPSNVIMLTGRASVVERLTEVIQRVD 203
VT+VVP+ NV+ R+LAP+LRQ+ D+AG G+VV+Y+PSNV+++TGRA+V++RL +++RVD
Sbjct: 128 VTRVVPLTNVAARDLAPLLRQLNDNAGVGSVVHYEPSNVLLMTGRAAVIKRLLTIVERVD 187

Query: 204 HAGNRTEEVIPLDNASASEIARVLESLTKNSGENQ-PATLKSQIVADERTNSVIVSGDPA 262
+AG+R+ +PL ASA+++ +++ L K++ ++ P ++ + +VADERTN+V+VSG+P
Sbjct: 188 NAGDRSVVTVPLSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGEPN 247

Query: 263 TRDKMRRLIRRLDSEMERSGNSQVFYLKYSKAEDLVDVLKQVSGTLTAAKEEAEGTVGSG 322
+R ++ +I++LD + GN++V YLKY+KA DLV+VL +S T+ + K+ A+ +
Sbjct: 248 SRQRIIAMIKQLDRQQATQGNTKVIYLKYAKASDLVEVLTGISSTMQSEKQAAKPV-AAL 306

Query: 323 REVVSIAASKHSNALIVTAPQDIMQSLQSVIEQLDIRRAQVHVEALIVEVAEGSNINFGV 382
+ + I A +NALIVTA D+M L+ VI QLDIRR QV VEA+I EV + +N G+
Sbjct: 307 DKNIIIKAHGQTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLGI 366

Query: 383 QWASKDAGLMQFANGTQIPIGTLGAAISQAKPQKGSTVISENGATTINPDTNGDLST-LA 441
QWA+K+AG+ QF N + +PI T A + +G +S+ LA
Sbjct: 367 QWANKNAGMTQFTN-SGLPISTAIAG-------------------ANQYNKDGTVSSSLA 406

Query: 442 QLLSGFSGTAVGVVKGDWMALVQAVKNDSSSNVLSTPSITTLDNQEAFFMVGQDVPVLTG 501
LS F+G A G +G+W L+ A+ + + +++L+TPSI TLDN EA F VGQ+VPVLTG
Sbjct: 407 SALSSFNGIAAGFYQGNWAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVPVLTG 466

Query: 502 STVGSNNSNPFNTVERKKVGIMLKVTPQINEGNAVQMVIEQEVSKVEGQTS-----LDVV 556
S S N FNTVERK VGI LKV PQINEG++V + IEQEVS V S L
Sbjct: 467 SQTTSG-DNIFNTVERKTVGIKLKVKPQINEGDSVLLEIEQEVSSVADAASSTSSDLGAT 525

Query: 557 FGERKLKTTVLANDGELIVLGGLMDDQAGESVAKVPLLGDIPLIGNLFKSTADKKEKRNL 616
F R + VL GE +V+GGL+D ++ KVPLLGDIP+IG LF+ST+ K KRNL
Sbjct: 526 FNTRTVNNAVLVGSGETVVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRNL 585

Query: 617 MVFIRPTILRDGMAADGVSQRKYNYMRAEQIYR--DEQGLSLMPHTAQPVLPAQNQALPP 674
M+FIRPT++RD S +Y Q + E +++ + P Q+ A
Sbjct: 586 MLFIRPTVIRDRDEYRQASSGQYTAFNDAQSKQRGKENNDAMLNQDLLEIYPRQDTAAFR 645

Query: 675 EVRAFLNA 682
+V A ++A
Sbjct: 646 QVSAAIDA 653


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_02972  BCTERIALGSPC  101    6e-28    Bacterial general secretion pathway protein C signa...
		>BCTERIALGSPC#Bacterial general secretion pathway protein C signature.

Length = 272

Score = 101 bits (254), Expect = 6e-28
Identities = 66/278 (23%), Positives = 108/278 (38%), Gaps = 38/278 (13%)

Query: 1 MLLIISAKMAHSLWRYISFSAEYTA-VSQPVNKPSRVDAKTFDKNDVQLISQQNWFGKYQ 59
++L+ ++A WR A VS P++ + ND L FG
Sbjct: 22 LMLLFCQQLAMIFWR---IGLPDNAPVSSVQITPAQARQQPVTLNDFTL------FGVSP 72

Query: 60 PV--AAQVKQPEPVPVAETRLNVVLRGIAFG---ARPGAVIEEGGKQQVYLQGETLGSHN 114
A + + + + LN+ L G+ G +R A+I + +Q E + +N
Sbjct: 73 EKNKAGALDASQMSNLPPSTLNLSLTGVMAGDDDSRSIAIISKDNEQFSRGVNEEVPGYN 132

Query: 115 AVIEEINRDHVMLRYQGKIERLSLAEEERSTVAVTNKKAVSDEAKQAVAEPAVSVPVEIP 174
A I I D V+L+YQG+ E L L +E S SD A +
Sbjct: 133 AKIVSIRPDRVVLQYQGRYEVLGLYSQEDSG---------SDGVPGAQVNEQLQ------ 177

Query: 175 AAVRQALAKDPQKIFNYIQLTPVRKEG-IVGYAAKPGADRSLFDASGFKEGDIAIALNQQ 233
+ + +Y+ +P+ + + GY PG F G ++ D+A+ALN
Sbjct: 178 -------QRASTTMSDYVSFSPIMNDNKLQGYRLNPGPKSDSFYRVGLQDNDMAVALNGL 230

Query: 234 DFTDPRAMIALMRQLPSMDSIQLTVLRKGARHDISIAL 271
D D M ++ + + LTV R G R DI +
Sbjct: 231 DLRDAEQAKKAMERMADVHNFTLTVERDGQRQDIYMEF 268


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_02974  PREPILNPTASE  282    8e-98    Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 282 bits (723), Expect = 8e-98
Identities = 110/274 (40%), Positives = 150/274 (54%), Gaps = 12/274 (4%)

Query: 1 MLFDVFQQYPTAMPVLATVGGLIIGSFLNVVIWRYPIML-RQQMAEFHGEMSSAQSKI-- 57
+L ++ P L + L+IGSFLNVVI R PIML R+ AE+ + +
Sbjct: 3 LLLELAHGLPWLYFSLVFLFSLMIGSFLNVVIHRLPIMLEREWQAEYRSYFNPDDEGVDE 62

Query: 58 ---SLALPRSHCPHCQQTIRIRDNIPLFSWLMLKGRCRDCQAKISKRYPLVELLTALAFL 114
+L +PRS CPHC I +NIPL SWL L+GRCR CQA IS RYPLVELLTAL +
Sbjct: 63 PPYNLMVPRSCCPHCNHPITALENIPLLSWLWLRGRCRGCQAPISARYPLVELLTALLSV 122

Query: 115 LASLVWPESGWGLAVMILSAWLIAASVIDLDHQWLPDVFTQGVLWTGLIAAWAQQSPLTL 174
++ LA ++L+ L+A + IDLD LPD T +LW GL+ ++L
Sbjct: 123 AVAMTLAPGWGTLAALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNL-LGGFVSL 181

Query: 175 QDAVTGVLVGFITFYSLRWIAGIVLRKEALGMGDVLLFAALGGWVGALSLPNVALIASCC 234
DAV G + G++ +SL W ++ KE +G GD L AALG W+G +LP V L++S
Sbjct: 182 GDAVIGAMAGYLVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSLV 241

Query: 235 GLIYAVI-----TKRGSTTLPFGPCLSLGGIATL 263
G + S +PFGP L++ G L
Sbjct: 242 GAFMGIGLILLRNHHQSKPIPFGPYLAIAGWIAL 275


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_02975  PF03544       49     4e-08    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 49.2 bits (117), Expect = 4e-08
Identities = 24/60 (40%), Positives = 30/60 (50%), Gaps = 3/60 (5%)

Query: 32 SSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEP---IPDPEPTPEPEPEPVP 88
S T + V+P P P EP PEP P PEP E I P+P P+P+P+PV
Sbjct: 50 ISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVK 109



Score = 41.9 bits (98), Expect = 9e-06
Identities = 16/92 (17%), Positives = 27/92 (29%), Gaps = 2/92 (2%)

Query: 33 SDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPVPTKTG 92
+D P + PE +P P PEP PEP + E + V
Sbjct: 58 ADLEPPQAVQ-PPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKR 116

Query: 93 YLTLGGSQRVTGATCNGESSDGFTFKPGEDVT 124
+ S+ + N + + +
Sbjct: 117 DVKPVESRPASPFE-NTAPARPTSSTATAATS 147



Score = 40.7 bits (95), Expect = 2e-05
Identities = 18/59 (30%), Positives = 23/59 (38%), Gaps = 2/59 (3%)

Query: 35 TPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDP--EPTPEPEPEPVPTKT 91
P + +P +P PEP +PEP PEPIP+P E E K
Sbjct: 45 APAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKP 103



Score = 40.7 bits (95), Expect = 2e-05
Identities = 20/96 (20%), Positives = 28/96 (29%), Gaps = 7/96 (7%)

Query: 29 SGSSSDTPPVDSGTGSLPEVKPDPTPNPEPT---PEPTPDPEPTPEPIPDPEPTPEPEPE 85
+ P V PE +P P P E +P P P+P P+P+ E
Sbjct: 65 AVQPPPEPVV----EPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKP 120

Query: 86 PVPTKTGYLTLGGSQRVTGATCNGESSDGFTFKPGE 121
R T +T +S T
Sbjct: 121 VESRPASPFENTAPARPTSSTATAATSKPVTSVASG 156



Score = 35.0 bits (80), Expect = 0.001
Identities = 17/40 (42%), Positives = 17/40 (42%)

Query: 50 PDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPVPT 89
P P T D EP P PEP EPEPEP P
Sbjct: 44 PAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPI 83



Score = 30.7 bits (69), Expect = 0.039
Identities = 11/40 (27%), Positives = 13/40 (32%)

Query: 52 PTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPVPTKT 91
P P + + P P P P EPEP P
Sbjct: 44 PAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPI 83


43  CMJKDNLE_03049  CMJKDNLE_03065  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
CMJKDNLE_03049  0       25       -4.985981   heavy metal divalent cation transporter ZupT
CMJKDNLE_03050  2       30       -7.620239   hypothetical protein
CMJKDNLE_03051  2       32       -8.538653   3,4-dihydroxy-2-butanone 4-phosphate synthase
CMJKDNLE_03053  4       36       -10.371890  hypothetical protein
CMJKDNLE_03054  4       37       -10.680951  putative fimbrial-like adhesin protein
CMJKDNLE_03055  4       38       -10.781543  putative membrane protein
CMJKDNLE_03056  1       31       -7.873461   putative membrane protein
CMJKDNLE_03057  0       21       -4.733464   putative membrane protein
CMJKDNLE_03058  0       9        -0.744012   protein involved in detoxification of
CMJKDNLE_03059  0       8        1.445969    putative glycogen synthesis protein
CMJKDNLE_03060  1       8        1.586086    putative oxidoreductase
CMJKDNLE_03061  1       9        2.720230    putative membrane protein
CMJKDNLE_03064  -1      13       3.348780    fused heptose 7-phosphate kinase/heptose
CMJKDNLE_03065  0       15       3.036421    glutamine synthetase adenylyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03054  FIMBRIALPAPE  28     0.015    Escherichia coli: P pili tip fibrillum papE protein...
		>FIMBRIALPAPE#Escherichia coli: P pili tip fibrillum papE protein signature.

Length = 173

Score = 28.1 bits (62), Expect = 0.015
Identities = 36/163 (22%), Positives = 66/163 (40%), Gaps = 35/163 (21%)

Query: 14 AMILSNNVFADEGHGIVKFKGEVISAPCSIKPGDEDLTVNLGEVADTVLKSDQKSLAE-- 71
A+++S +V A + + FKG++I C++ ++ VN G++ L + +
Sbjct: 15 AVLMSQHVHAADN---LTFKGKLIIPACTV----QNAEVNWGDIEIQNLVQSGGNQKDFT 67

Query: 72 -----PFTIHLQDCMLSQGGTTYSKAKVTFTTANTMTGQSDLLKNTKETEIGGATGVGVR 126
P+++ ++ G T + V T+ + G L N+ + IG A
Sbjct: 68 VDMNCPYSLGTMKVTITSNGQTGNSILVPNTSTASGDGLLIYLYNSNNSGIGNA------ 121

Query: 127 ILDSQSGEVTLGTPVV---ITFNNTNS----YQELNFKARMES 162
VTLG+ V IT Y +L +K M+S
Sbjct: 122 --------VTLGSQVTPGKITGTAPARKITLYAKLGYKGNMQS 156


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03055  PF00577       201    2e-61    Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 201 bits (513), Expect = 2e-61
Identities = 66/251 (26%), Positives = 111/251 (44%), Gaps = 12/251 (4%)

Query: 22 CSLSVIIIGCA-------SAYAVEFNKDLIEAEDRENVNLSQFETDGQLPVGKYSLSTLI 74
+ + CA S+ + FN + + + +LS+FE +LP G Y + +
Sbjct: 25 GFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPGTYRVDIYL 84

Query: 75 NNKRTPIHLDLQWVLIDN--QTAVCVTPEQLTLLGFTDEFIEKTQQNLIDGCYPIEK-EK 131
NN D+ + D+ C+T QL +G + D C P+
Sbjct: 85 NNGYMA-TRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVPLTSMIH 143

Query: 132 QITTYLDKGKMQLSISAPQAWLKYKDANWTPPELWNHGIAGAFLDYNLYASHYAPHQGDN 191
T LD G+ +L+++ PQA++ + + PPELW+ GI L+YN + G N
Sbjct: 144 DATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQNRIGGN 203

Query: 192 SQNISSYGQAGVNLGAWRLRTDYQYDQSFNNGKS-QATNLDFPRIYLFRPIPAMNAKLTI 250
S Q+G+N+GAWRLR + + + ++ S +L R I + ++LT+
Sbjct: 204 SHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRSRLTL 263

Query: 251 GQYDTESSIFD 261
G T+ IFD
Sbjct: 264 GDGYTQGDIFD 274


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03056  PF00577       422    e-140    Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 422 bits (1086), Expect = e-140
Identities = 152/577 (26%), Positives = 266/577 (46%), Gaps = 51/577 (8%)

Query: 1 MLPPDLRGYAPQITGVAQTNAKVTVSQNNRIIYQENVPPGPFAITNLFNT-LQGQLDVKV 59
MLP RG+AP I G+A+ A+VT+ QN IY VPPGPF I +++ G L V +
Sbjct: 289 MLPDSQRGFAPVIHGIARGTAQVTIKQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTI 348

Query: 60 EEEDGRVTQWQVASNSIPYLTRKGQIRYTTAMGKPTSVGGDSLQQPFFWTGEFSWGWLNN 119
+E DG + V +S+P L R+G RY+ G+ S G ++P F+ G
Sbjct: 349 KEADGSTQIFTVPYSSVPLLQREGHTRYSITAGEYRS-GNAQQEKPRFFQSTLLHGLPAG 407

Query: 120 VSLYGGSVLTNRDYQSLAAGVGFNLNSLGSLSFDVTRSDAQLHNQDKETGYSYRANYSKR 179
++YGG+ L +R Y++ G+G N+ +LG+LS D+T++++ L + + G S R Y+K
Sbjct: 408 WTIYGGTQLADR-YRAFNFGIGKNMGALGALSVDMTQANSTLPDDSQHDGQSVRFLYNKS 466

Query: 180 FESTGSQLTFAGYRFSDKNFVTMNEYIND--------------------TNHYTNYQNEK 219
+G+ + GYR+S + + T++Y N++
Sbjct: 467 LNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKR 526

Query: 220 ESYIVTFNQYLESLRLNTYVSLARNTYWDAS-SNVNYSLSLSRDFDIGPLKNVSTSLTFS 278
+T Q L Y+S + TYW S + + L+ F ++++ +L++S
Sbjct: 527 GKLQLTVTQQLGR-TSTLYLSGSHQTYWGTSNVDEQFQAGLNTAF-----EDINWTLSYS 580

Query: 279 RIN--WEEDNQDQLYLNISIPWGTSR-----------TLSYGMQRNQDNEISHTASWYDS 325
W++ L LN++IP+ + SY M + + +++ A Y +
Sbjct: 581 LTKNAWQKGRDQMLALNVNIPFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGT 640

Query: 326 --SDRNNSWSVSASGDNDEFKDMKASLRASYQHNTENGRLYLSGTSQRDSYYSLNASWNG 383
D N S+SV + ++ A+ + G + + D L +G
Sbjct: 641 LLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYGNANIGYSHSDD-IKQLYYGVSG 699

Query: 384 SFTATRHGAAFHDYSGSADSRFMIDADGTEDIPLNNKRAV-TNRYGIGVIPSVSSYITTS 442
A +G D+ ++ A G +D + N+ V T+ G V+P + Y
Sbjct: 700 GVLAHANGVTLGQPLN--DTVVLVKAPGAKDAKVENQTGVRTDWRGYAVLPYATEYRENR 757

Query: 443 LSVDTRNLPENVDIENSVITTTLTEGAIGYAKLDTRKGYQIIGVIRLADGSHPPLGISVK 502
+++DT L +NVD++N+V T GAI A+ R G +++ + + P G V
Sbjct: 758 VALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKARVGIKLLMTLT-HNNKPLPFGAMVT 816

Query: 503 DETSHKELGLVADGGFVYLNGIQDDNKLALRWGDKSC 539
E+S + G+VAD G VYL+G+ K+ ++WG++
Sbjct: 817 SESS-QSSGIVADNGQVYLSGMPLAGKVQVKWGEEEN 852


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03061  IGASERPTASE   52     7e-09    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 51.6 bits (123), Expect = 7e-09
Identities = 47/287 (16%), Positives = 92/287 (32%), Gaps = 16/287 (5%)

Query: 197 PNNAFDAEGLTKLTQETERRRRERNEVEQDVEVAVREKNRDALSRKLEIEQQEAFMTLEQ 256
N A+ + + E R A + + ++ E +QE+ +
Sbjct: 999 TPNNIQAD-VPSVPSNNEEIARVDEAPVPPPAPATPSETTETVA---ENSKQESKTVEKN 1054

Query: 257 EQQVKTRTAEQNARIAAFEAERRREAE-QTRILAERQIQETEIDREQAVRSRKVEAEREV 315
EQ TA+ R A EA+ +A QT +A+ + E + + VE E +
Sbjct: 1055 EQDATETTAQN--REVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKA 1112

Query: 316 RIKEIEQQQVTEIANQTKSIAIAAKSEQ---QSQAEARANLALAEAVSAQQNVETTRQTA 372
+++ + Q+V ++ +Q +++ Q + E + + E S T Q A
Sbjct: 1113 KVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPA 1172

Query: 373 EADRAKQVALIAAAQDAET------KAVELTVRAKAEKEAAEMQAAAIVELAEATRKKGL 426
+ + + + T T +E + R
Sbjct: 1173 KETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPH 1232

Query: 427 AEAEAQRALNDAINVLSDEQTSLKFKLALLQALPAVIEKSVEPMKSI 473
A + ND V + TS L A ++ K++
Sbjct: 1233 NVEPATTSSNDRSTVALCDLTSTNTNAVLSDARAKAQFVALNVGKAV 1279


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03064  LPSBIOSNTHSS  29     0.028    Lipopolysaccharide core biosynthesis protein signat...
		>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein signature.

Length = 166

Score = 29.0 bits (65), Expect = 0.028
Identities = 10/37 (27%), Positives = 20/37 (54%)

Query: 347 GVFDILHAGHVSYLANARKLGDRLIVAVNSDASTKRL 383
G FD + GH+ + +L D++ VAV + + + +
Sbjct: 7 GSFDPITFGHLDIIERGCRLFDQVYVAVLRNPNKQPM 43


44  CMJKDNLE_03128  CMJKDNLE_03133  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
CMJKDNLE_03128  0       15       -3.808109   pyruvate formate-lyase (inactive)
CMJKDNLE_03129  1       25       -8.801466   propionate kinase
CMJKDNLE_03130  1       21       -7.837167   serine / threonine:H+ symporter TdcC
CMJKDNLE_03131  0       26       -7.813095   catabolic threonine dehydratase
CMJKDNLE_03132  0       23       -6.980640   TdcA DNA-binding transcriptional activator
CMJKDNLE_03133  0       16       -5.088778   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03129  ACETATEKNASE  537    0.0      Acetate kinase family signature.
		>ACETATEKNASE#Acetate kinase family signature.

Length = 400

Score = 537 bits (1385), Expect = 0.0
Identities = 173/397 (43%), Positives = 254/397 (63%), Gaps = 11/397 (2%)

Query: 7 VLVINCGSSSIKFSVLDASDCEVLMSGIADGINSENAFLSVN-GGEPAP--LAHHSYEGA 63
+LVINCGSSS+K+ ++++ D VL G+A+ I ++ L+ N GE ++ A
Sbjct: 3 ILVINCGSSSLKYQLIESKDGNVLAKGLAERIGINDSLLTHNANGEKIKIKKDMKDHKDA 62

Query: 64 LKAIAFELEKRNLN-----DSVALIGHRIAHGGSIFTESAIITDEVIDNIRRVSPLAPLH 118
+K + L + + +GHR+ HGG FT S +ITD+V+ I LAPLH
Sbjct: 63 IKLVLDALVNSDYGVIKDMSEIDAVGHRVVHGGEYFTSSVLITDDVLKAITDCIELAPLH 122

Query: 119 NYANLSGIESAQQLFPGVTQVAVFDTSFHQTMAPEAYLYGLPWKYYEELGVRRYGFHGTS 178
N AN+ GI++ Q+ P V VAVFDT+FHQTM AYLY +P++YY + +R+YGFHGTS
Sbjct: 123 NPANIEGIKACTQIMPDVPMVAVFDTAFHQTMPDYAYLYPIPYEYYTKYKIRKYGFHGTS 182

Query: 179 HRYVSQRAHSLLNLAEDDSGLVVAHLGNGASICAVRNGQSVDTSMGMTPLEGLMMGTRSG 238
H+YVSQRA +LN + ++ HLGNG+SI AV+NG+S+DTSMG TPLEGL MGTRSG
Sbjct: 183 HKYVSQRAAEILNKPIESLKIITCHLGNGSSIAAVKNGKSIDTSMGFTPLEGLAMGTRSG 242

Query: 239 DVDFGAMSWVASQTNQSLGDLERVVNKESGLLGISGLSSDLR-VLEKAWHEGHERAQLAI 297
+D +S++ + N S ++ ++NK+SG+ GISG+SSD R + + A+ G +RAQLA+
Sbjct: 243 SIDPSIISYLMEKENISAEEVVNILNKKSGVYGISGISSDFRDLEDAAFKNGDKRAQLAL 302

Query: 298 KTFVHRIARHIAGHAASLRRLDGIIFTGGIGENSSLIRRLVMEHLAVLGLEIDTEMNNRS 357
F +R+ + I +AA++ +D I+FT GIGEN IR +++ L LG ++D E N
Sbjct: 303 NVFAYRVKKTIGSYAAAMGGVDVIVFTAGIGENGPEIREFILDGLEFLGFKLDKEKNKVR 362

Query: 358 NSCGERIVSSENARVICAVIPTNEEKMIALDAIHLGK 394
E I+S+ +++V V+PTNEE MIA D + +
Sbjct: 363 GE--EAIISTADSKVNVMVVPTNEEYMIAKDTEKIVE 397


45  CMJKDNLE_03146  CMJKDNLE_03157  Y  N  N  Genomic Island
LocusTag        DNBias  CDNBias  %GCBias     Product
CMJKDNLE_03146  0       15       3.101236    PTS system, cytoplasmic,
CMJKDNLE_03147  0       13       3.007761    PTS system N-acetylgalactosameine-specific IIC
CMJKDNLE_03148  0       12       1.761198    galactosamine PTS permease - cryptic
CMJKDNLE_03149  1       13       1.867551    AgaX
CMJKDNLE_03150  -1      13       1.824671    putative truncated
CMJKDNLE_03151  0       15       -0.071351   putative tagatose-6-phosphate aldose/ketose
CMJKDNLE_03152  0       13       -1.898130   tagatose-1,6-bisphosphate aldolase 1
CMJKDNLE_03153  1       13       -3.294705   galactosamine PTS permease - cryptic
CMJKDNLE_03154  1       14       -3.536079   galactosamine PTS permease - cryptic
CMJKDNLE_03155  -1      17       -3.786514   galactosamine PTS permease - cryptic
CMJKDNLE_03156  0       18       -4.324428   galactosamine PTS permease - cryptic
CMJKDNLE_03157  0       19       -3.609032   putative galactosamine-6-phosphate isomerase
46  CMJKDNLE_03167  CMJKDNLE_03187  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
CMJKDNLE_03167  0       19       3.063194    putative permease
CMJKDNLE_03168  0       20       1.516717    putative nucleoside-diphosphate-sugar epimerase
CMJKDNLE_03169  0       20       2.265914    protein involved in stress response
CMJKDNLE_03170  1       19       3.250822    hypothetical protein
CMJKDNLE_03171  -1      18       3.556418    putative endonuclease
CMJKDNLE_03172  -1      17       3.022893    putative acyltransferase with acyl-CoA
CMJKDNLE_03173  0       17       3.154336    putative lipid carrier protein
CMJKDNLE_03174  0       14       2.527952    putative peptidase (collagenase-like)
CMJKDNLE_03175  2       22       2.122417    putative protease
CMJKDNLE_03176  3       26       1.813976    hypothetical protein
CMJKDNLE_03177  3       29       1.364206    tryptohan / indole:H+ symporter Mtr
CMJKDNLE_03178  5       33       1.396677    DeaD, DEAD-box RNA helicase
CMJKDNLE_03179  5       33       0.857652    lipoprotein involved in cell division
CMJKDNLE_03180  6       37       1.373170    polynucleotide phosphorylase monomer
CMJKDNLE_03182  6       32       0.776025    30S ribosomal subunit protein S15
CMJKDNLE_03184  4       28       0.791481    tRNA pseudouridine 55 synthase
CMJKDNLE_03185  3       22       -0.922048   30S ribosome binding factor
CMJKDNLE_03187  2       22       -1.086177   protein chain initiation factor IF2
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03168  NUCEPIMERASE  29     0.014    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 29.0 bits (65), Expect = 0.014
Identities = 8/22 (36%), Positives = 13/22 (59%)

Query: 4 VLITGATGLVGGHLLRMLINEP 25
L+TGA G +G H+ + L+
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAG 24


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03187  TCRTETOQM     73     2e-15    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 73.4 bits (180), Expect = 2e-15
Identities = 69/313 (22%), Positives = 109/313 (34%), Gaps = 77/313 (24%)

Query: 396 IMGHVDHGKTSLLDYI-----RSTKVASGEAG-------------GITQHIGAYHVETEN 437
++ HVD GKT+L + + T++ S + G GIT G + EN
Sbjct: 8 VLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITSFQWEN 67

Query: 438 GMITFLDTPGHAAFTSMRARGAQATDIVVLVVAADDGVMPQTIEAIQHAKAAQVPVVVAV 497
+ +DTPGH F + R D +L+++A DGV QT + +P + +
Sbjct: 68 TKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGIPTIFFI 127

Query: 498 NKIDKPEADPDRV----KNELSQYGI-----------------LPEEWG----------- 525
NKID+ D V K +LS + E+W
Sbjct: 128 NKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIEGNDDLLE 187

Query: 526 ---------------GESQFV---------HVSAKAGTGIDELLDAILLQAEVLELKAVR 561
ES H SAK GID L++ I +
Sbjct: 188 KYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNNIGIDNLIEVIT--NKFYSSTHRG 245

Query: 562 KGMASGAVIESFLDKGRGPVATVLVREGTLHKGDIVL-CGFEYGRVRAMRNELGQEVLEA 620
+ G V + + R +A + + G LH D V E ++ M + E+ +
Sbjct: 246 QSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKEKIKITEMYTSINGELCKI 305

Query: 621 GPSIPVEILGLSG 633
+ EI+ L
Sbjct: 306 DKAYSGEIVILQN 318


47  CMJKDNLE_03235  CMJKDNLE_03240  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
CMJKDNLE_03235  1       21       -4.383049   glutamate synthase, small subunit
CMJKDNLE_03236  5       33       -7.429546   periplasmic protein
CMJKDNLE_03237  4       34       -7.768628   hypothetical protein
CMJKDNLE_03238  3       33       -7.131002   putative periplasmic chaperone protein
CMJKDNLE_03239  2       30       -5.733179   putative outer membrane protein
CMJKDNLE_03240  2       27       -5.113171   putative outer membrane protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03239  PF00577       61     3e-14    Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 61.0 bits (148), Expect = 3e-14
Identities = 21/129 (16%), Positives = 33/129 (25%), Gaps = 12/129 (9%)

Query: 3 KKTLLAYTIGFAFSP-PANADGIEIAAVDFDRETLKSLGVDPNISHYFSRSARFLPGEYS 61
K L + + + A + A + F+ L F PG Y
Sbjct: 20 KHRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPGTYR 79

Query: 62 LIVSVNGEKKGNIATRFDENGD-----ICLDQAFLQQAGLKIPSEEK------NGCYDYI 110
+ + +N F+ CL +A L GL S + C
Sbjct: 80 VDIYLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVPLT 139

Query: 111 LSYPGTTIT 119
T
Sbjct: 140 SMIHDATAQ 148


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03240  PF00577       326    e-103    Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 326 bits (836), Expect = e-103
Identities = 146/697 (20%), Positives = 265/697 (38%), Gaps = 76/697 (10%)

Query: 2 SSRAEFSNGSSDYSQAALEGGININDWMLRSHQFLTQTNGTFSN------QNSSTYLQRT 55
+S G+S Y+ L+ G+NI W LR + + + S+ Q+ +T+L+R
Sbjct: 194 NSVQNRIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERD 253

Query: 56 FTDLKTLMRAGEVNLNNSVLEGASIYGIEIAPDNALQTS---GSGVQVTGIANTSQARVE 112
L++ + G+ + +G + G ++A D+ + G + GIA A+V
Sbjct: 254 IIPLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARG-TAQVT 312

Query: 113 IRQQGVLIHSILVPAGAFTIPDVPVRNGNSDLNVTVVETDGSSHNYIVP-STLFNQHVES 171
I+Q G I++ VP G FTI D+ + DL VT+ E DGS+ + VP S++ E
Sbjct: 313 IKQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREG 372

Query: 172 FQGYRFAIGRVDDDY--DESPWVISASSGWNLTRWSAMNGGVIVAENYQAASIRSSLVPL 229
Y G E P ++ L + GG +A+ Y+A +
Sbjct: 373 HTRYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMG 432

Query: 230 PDLTVSSQISTSQ---DTKDSLQGQKYRLDANYNLPFSLGLTTSLTR-----SDRHYREL 281
+S ++ + GQ R YN + T++ S Y
Sbjct: 433 ALGALSVDMTQANSTLPDDSQHDGQSVRF--LYNKSLN-ESGTNIQLVGYRYSTSGYFNF 489

Query: 282 SEAIDD------------------------DYTDPTKSTYALGLNWSNSILGGFNISGYK 317
++ + + L + +SG
Sbjct: 490 ADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSH 549

Query: 318 TYSYDGDNDSSNLNINWNKAFKHATVSVNWQHQLSASENNEDDGDLFYVNISIPFGR--- 374
+ N N AF+ ++++ +A + D +N++IPF
Sbjct: 550 QTYWGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQKGRDQM--LALNVNIPFSHWLR 607

Query: 375 --------SNTATLYTRHDDH-KTHYGTGVMGVV--SDEMSYYVNAERDHDER---ETSL 420
+A+ HD + + GV G + + +SY V ++
Sbjct: 608 SDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTG 667

Query: 421 NGSISSNLHYTQVSLAAGASGSDSRTYNGTMSGGIAVHDQGVTFSPWTINDTFAIAKMDN 480
+++ Y ++ S D + +SGG+ H GVT +NDT + K
Sbjct: 668 YATLNYRGGYGNANIGYSHS-DDIKQLYYGVSGGVLAHANGVTLGQ-PLNDTVVLVKAP- 724

Query: 481 NIAGVRITSQAGPVWTDFRGNAVIPSIQPWRTSGVEIDTASLPKNVDIGNGTKMIKQGRG 540
++ +Q G V TD+RG AV+P +R + V +DT +L NVD+ N + RG
Sbjct: 725 GAKDAKVENQTG-VRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRG 783

Query: 541 AVGKVGFSAITQRRALLNITLSDGKKLPRGVAIEDSEGNYLTTSVDDGVVFLNNIKPDMV 600
A+ + F A + L+ +T + K LP G + D+G V+L+ +
Sbjct: 784 AIVRAEFKARVGIKLLMTLT-HNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGK 842

Query: 601 LDIK---DEQQSCRIHLTFPEDAPKDVFYETATGECQ 634
+ +K +E C + P ++ + + + + EC+
Sbjct: 843 VQVKWGEEENAHCVANYQLPPESQQQLLTQL-SAECR 878


48  CMJKDNLE_03406  CMJKDNLE_03413  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
CMJKDNLE_03406  2       14       0.504713    phosphoglycolate phosphatase
CMJKDNLE_03407  2       14       0.933815    ribulose-5-phosphate 3-epimerase
CMJKDNLE_03408  2       14       1.125804    DNA adenine methyltransferase
CMJKDNLE_03409  2       15       1.833058    cell division protein DamX
CMJKDNLE_03410  1       18       1.796840    3-dehydroquinate synthase
CMJKDNLE_03411  2       21       2.463013    shikimate kinase I
CMJKDNLE_03412  -1      16       3.418487    protein involved in utilization of DNA as a
CMJKDNLE_03413  -1      16       3.087853    protein involved in utilization of DNA as a
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03409  IGASERPTASE   44     2e-06    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 43.5 bits (102), Expect = 2e-06
Identities = 39/199 (19%), Positives = 70/199 (35%), Gaps = 19/199 (9%)

Query: 143 DLAGNATDQANGVQPAPGTTSAENTQQDVSL-----------------PPISSTPTQGQT 185
DL ++ N T+ N Q DV PP +TP++
Sbjct: 979 DLYNPEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTE 1038

Query: 186 PVATDGQQRVEVQGDLNNALTQPQNQQQLNNVAVNSTLPTEPATVAPVRNGNASRDTAKT 245
VA + +Q + T+ Q + S + T ++G+ +++T T
Sbjct: 1039 TVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTT 1098

Query: 246 QTAERPSTTRPARQQAVIEPKKPQATVKTEPKPVAQTPKRTEPAAPVASTKAPAATSTPA 305
+T E + + + + E + V ++ P + + +P A A P
Sbjct: 1099 ETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEP 1158

Query: 306 PKETATTAPVQTASPAQTT 324
+T TTA T PA+ T
Sbjct: 1159 QSQTNTTA--DTEQPAKET 1175



Score = 42.0 bits (98), Expect = 4e-06
Identities = 41/203 (20%), Positives = 68/203 (33%), Gaps = 10/203 (4%)

Query: 126 APSTTSSDQTASGEKSIDLAGNATDQANGVQPAPGTTSAENTQQDVSLPPISST-PTQGQ 184
P+ +D + + ++A D+A PAP T S + S T Q
Sbjct: 999 TPNNIQADVPSVPSNNEEIA--RVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQ 1056

Query: 185 TPVATDGQQRVEVQGDLNNALTQPQN----QQQLNNVAVNSTLPTEPATVAPVRNGNASR 240
T Q R + +N Q Q +T E ATV + A
Sbjct: 1057 DATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVE--KEEKAKV 1114

Query: 241 DTAKTQTAERPSTTRPARQQAVIEPKKPQATVKTEPKPVAQTPKRTEPAAPVASTKAPAA 300
+T KTQ + ++ +Q+ E +PQA E P + A T+ PA
Sbjct: 1115 ETEKTQEVPKVTSQVSPKQEQS-ETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAK 1173

Query: 301 TSTPAPKETATTAPVQTASPAQT 323
++ ++ T + +
Sbjct: 1174 ETSSNVEQPVTESTTVNTGNSVV 1196


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03411  CARBMTKINASE  32     8e-04    Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 32.1 bits (73), Expect = 8e-04
Identities = 27/91 (29%), Positives = 40/91 (43%), Gaps = 18/91 (19%)

Query: 32 FYDSDQEIEKRTGADVGWVFDLEGEEGFRD----------REEKVINELTEKQGIVLATG 81
FYD + KR + GW+ + G+R E + I +L E+ IV+A+G
Sbjct: 136 FYDEETA--KRLAREKGWIVKEDSGRGWRRVVPSPDPKGHVEAETIKKLVERGVIVIASG 193

Query: 82 GGSVKSRETRNRLSARGVVVYLETTIEKQLA 112
GG V + +GV E I+K LA
Sbjct: 194 GGGVPVILEDGEI--KGV----EAVIDKDLA 218


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03412  TYPE3OMGPROT  287    1e-93    Type III secretion system outer membrane G protein ...
		>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature.

Length = 607

Score = 287 bits (735), Expect = 1e-93
Identities = 80/301 (26%), Positives = 132/301 (43%), Gaps = 18/301 (5%)

Query: 107 LENRSITLQYADAGELAKAGEKLLSAKGSMTVDKRTNRLLLRDNKTALSALEQWVAQMDL 166
L + +I D + +A SA+ + D N +++RD+ + ++ + +D
Sbjct: 219 LSDATIQQVTVDNQRIPQAAT-RASAQARVEADPSLNAIIVRDSPERMPMYQRLIHALDK 277

Query: 167 PVGQVELSAHIVTINEKSLRELGVKWTLADAQHAGGVGQVTTLGSDLSVATATTHVGFNI 226
P ++E++ IV IN L ELGV W + + T G ++A+ G
Sbjct: 278 PSARIEVALSIVDINADQLTELGVDWRVGIRTGNNHQVVIKTTGDQSNIASN----GALG 333

Query: 227 GRINGRLLDL---ELSALEQKQQLDIIASPRLLASHLQPASIKQGSEIPYQVSSGESGAT 283
++ R LD ++ LE + +++ P LL A I SE Y +G+ A
Sbjct: 334 SLVDARGLDYLLARVNLLENEGSAQVVSRPTLLTQENAQAVIDH-SETYYVKVTGKEVA- 391

Query: 284 SVEFKEAVLG--MEVTPTVLQKG---RIRLKLHISQNVPGQVLQQADGEVLAIDKQEIET 338
E K G + +TP VL +G I L LHI +G + I + ++T
Sbjct: 392 --ELKGITYGTMLRMTPRVLTQGDKSEISLNLHIEDGNQKPNSSGIEG-IPTISRTVVDT 448

Query: 339 QVEVKSGETLALGGIFTRKNKSGQDSVPLLGDIPWFGQLFRHDGKEDERRELVVFITPRL 398
V G++L +GGI+ + VPLLGDIP+ G LFR + R + I PR+
Sbjct: 449 VARVGHGQSLIIGGIYRDELSVALSKVPLLGDIPYIGALFRRKSELTRRTVRLFIIEPRI 508

Query: 399 V 399
+
Sbjct: 509 I 509



Score = 33.3 bits (76), Expect = 0.002
Identities = 23/116 (19%), Positives = 43/116 (37%), Gaps = 20/116 (17%)

Query: 10 KPQKVTLMVDDVPVAQVLQALAEQEKLNLVVSPDVSGTVSLHLTDVPWKQALQTVVKSAG 69
P + + +L +VVS ++ VS + LQ +
Sbjct: 32 LPIPYVYVAKGESLRDLLTDFGANYDATVVVSDKINDKVSGQFEHDNPQDFLQHIASLYN 91

Query: 70 LITRQEGNILSVHSIAWQNNNIARQEAEQARAQANLPLENRSITLQYADAGELAKA 125
L+ +GN+L + ++N+ +A +R I LQ ++A EL +A
Sbjct: 92 LVWYYDGNVLYI----FKNSEVA----------------SRLIRLQESEAAELKQA 127


49  CMJKDNLE_03457  CMJKDNLE_03510  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
CMJKDNLE_03457  0       15       -4.246698   D-gluconate kinase, thermostable
CMJKDNLE_03458  2       19       -6.398147   GntR DNA-binding transcriptional repressor
CMJKDNLE_03459  2       23       -8.511860   pirin-like protein
CMJKDNLE_03460  3       18       -5.842153   putative oxidoreductase with NAD(P)-binding
CMJKDNLE_03462  3       23       -6.811383   putative acetyltransferase
CMJKDNLE_03463  3       22       -7.137829   hypothetical protein
CMJKDNLE_03464  3       15       -1.318835   hypothetical protein
CMJKDNLE_03465  -1      17       3.013156    putative protein with chaperone-like activity
CMJKDNLE_03466  -1      20       3.464530    Gamma-glutamyltranspeptidase
CMJKDNLE_03467  -1      23       3.752013    hypothetical protein
CMJKDNLE_03468  -1      23       3.366757    hypothetical protein
CMJKDNLE_03469  -1      25       3.786778    glycerophosphodiester phosphodiesterase,
CMJKDNLE_03470  -1      25       3.601772    glycerol-3-phosphate / glycerol-2-phosphate ABC
CMJKDNLE_03471  -1      26       3.413959    glycerol-3-phosphate / glycerol-2-phosphate ABC
CMJKDNLE_03472  -2      26       3.688668    glycerol-3-phosphate / glycerol-2-phosphate ABC
CMJKDNLE_03473  -2      24       3.387166    glycerol-3-phosphate / glycerol-2-phosphate ABC
CMJKDNLE_03474  -2      23       3.868590    branched chain amino acid ABC transporter - ATP
CMJKDNLE_03475  -1      23       3.481858    branched chain amino acid transporter - ATP
CMJKDNLE_03476  -2      24       3.319384    branched chain amino acid transporter - membrane
CMJKDNLE_03477  -2      24       2.601550    branched chain amino acid transporter - membrane
CMJKDNLE_03478  -1      22       2.536801    leucine ABC transporter - periplasmic binding
CMJKDNLE_03479  1       20       2.319659    putative maturation factor for PanD
CMJKDNLE_03480  2       18       2.137697    branched chain amino acid ABC transporter -
CMJKDNLE_03481  2       16       1.587044    RNA polymerase, sigma 32 (sigma H) factor
CMJKDNLE_03482  2       13       1.268343    cell division protein FtsX
CMJKDNLE_03483  3       12       1.809536    cell division protein FtsE
CMJKDNLE_03484  3       12       3.288863    SRP receptor
CMJKDNLE_03485  0       15       3.821745    16S rRNA m2G966 methyltransferase
CMJKDNLE_03486  0       14       3.307633    hypothetical protein
CMJKDNLE_03487  0       15       3.500224    putative receptor
CMJKDNLE_03488  -1      15       3.955890    hypothetical protein
CMJKDNLE_03489  0       15       3.112979    zinc, cadmium and lead efflux system
CMJKDNLE_03490  1       17       1.678126    sulfur transfer protein
CMJKDNLE_03491  0       15       1.604027    hypothetical protein; gene is a predicted member
CMJKDNLE_03492  0       16       2.688155    hypothetical protein
CMJKDNLE_03493  0       18       3.766213    putative transport protein YhhS
CMJKDNLE_03494  -1      20       4.141353    putative inner membrane protein
CMJKDNLE_03495  -1      23       5.051944    holo-[acyl carrier protein] synthase 2
CMJKDNLE_03496  0       24       5.126141    nickel ABC transporter - periplasmic binding
CMJKDNLE_03497  1       21       4.130256    nickel ABC transporter - membrane subunit
CMJKDNLE_03498  1       19       1.155171    nickel ABC transporter - membrane subunit
CMJKDNLE_03499  2       21       -2.493542   nickel ABC transporter - ATP binding subunit
CMJKDNLE_03500  0       27       -6.439105   nickel ABC transporter - ATP binding subunit
CMJKDNLE_03501  2       25       -5.937246   NikR DNA-binding transcriptional repressor,
CMJKDNLE_03502  2       18       -1.474725   RhsB protein in rhs element
CMJKDNLE_03503  0       16       0.977605    hypothetical protein
CMJKDNLE_03504  0       15       -1.372773   putative lyase containing HEAT-repeat protein
CMJKDNLE_03505  1       17       -3.769062   putative transposase; receptor protein
CMJKDNLE_03506  1       17       -3.534854   putative transporter subunit: membrane component
CMJKDNLE_03507  -1      17       -3.424363   ribosome-associated ATPase
CMJKDNLE_03508  -1      21       -5.195822   putative HlyD family secretion protein
CMJKDNLE_03509  1       24       -6.862304   hypothetical protein
CMJKDNLE_03510  0       17       -4.577184   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03462  SACTRNSFRASE  38     4e-06    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 38.4 bits (89), Expect = 4e-06
Identities = 21/92 (22%), Positives = 34/92 (36%), Gaps = 16/92 (17%)

Query: 55 VACIDGDVVGHLTIDVQQRPRRSHVADFGICVDSRWKNRGVASALMREMIE------MCD 108
+ ++ + +G + I + + D + D R K GV +AL+ + IE C
Sbjct: 69 LYYLENNCIGRIKIR-SNWNGYALIEDIAVAKDYRKK--GVGTALLHKAIEWAKENHFCG 125

Query: 109 NWLRVDRIELTVFVDNAPAIKVYKKYGFEIEG 140
L I N A Y K+ F I
Sbjct: 126 LMLETQDI-------NISACHFYAKHHFIIGA 150


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03466  NAFLGMOTY     32     0.007    Sodium-type flagellar protein MotY precursor signature.
		>NAFLGMOTY#Sodium-type flagellar protein MotY precursor signature.

Length = 293

Score = 31.6 bits (71), Expect = 0.007
Identities = 27/82 (32%), Positives = 37/82 (45%), Gaps = 17/82 (20%)

Query: 275 RTPISGDYRGYQVYSMPPPSSGGIHIVQILNI--LENFDMKKYGF-GSADAMQIMAEAEK 331
R P+ G+ R + SMPPP G H +I N+ + FD G+ G A I++E EK
Sbjct: 77 RRPM-GETRNVSLISMPPPWRPGEHADRITNLKFFKQFD----GYVGGQTAWGILSELEK 131

Query: 332 YAYADRSEYLGDPDFVKVPWQA 353
Y P F WQ+
Sbjct: 132 GRY---------PTFSYQDWQS 144


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03469  PF04619       30     0.008    Dr-family adhesin
		>PF04619#Dr-family adhesin

Length = 160

Score = 29.5 bits (66), Expect = 0.008
Identities = 12/65 (18%), Positives = 23/65 (35%), Gaps = 4/65 (6%)

Query: 29 VGAKYGHKMIEFDAKLSKDGEIFLLHDDNLERTSNGWGVAGELNWQD----LLRVDAGSW 84
+G ++ D + G+ FL+ D+N ++ W + D GSW
Sbjct: 70 LGCDARQVALKADTDNFEQGKFFLISDNNRDKLYVNIRPTDNSAWTTDNGVFYKNDVGSW 129

Query: 85 YSKMF 89
+
Sbjct: 130 GGIIG 134


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03470  PF05272       32     0.003    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 32.4 bits (73), Expect = 0.003
Identities = 13/43 (30%), Positives = 20/43 (46%), Gaps = 7/43 (16%)

Query: 33 IVMVGPSGCGKSTLLRMVAGLERVTEGDIWINDQRVTEMEPKD 75
+V+ G G GKSTL+ + GL+ + +D KD
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLD-------FFSDTHFDIGTGKD 634


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03473  MALTOSEBP     39     2e-05    Maltose binding protein signature.
		>MALTOSEBP#Maltose binding protein signature.

Length = 396

Score = 39.3 bits (91), Expect = 2e-05
Identities = 39/160 (24%), Positives = 66/160 (41%), Gaps = 14/160 (8%)

Query: 134 GHLLSQPFNSSTPVLYYNKDAFKKAGLDPEQPPKTWQDLADYAAKLKASGMKCGYASGWQ 193
G L++ P L YNKD PPKTW+++ +LKA G + +
Sbjct: 127 GKLIAYPIAVEALSLIYNKDLLP-------NPPKTWEEIPALDKELKAKGKSALMFNLQE 179

Query: 194 GWIQLENFSAWNGLPFASKNNGFDGTDAVLEF--NKPEQVKHIAMLEEMNKKGDFSYVGR 251
+ +A G F +N +D D ++ K + +++ + D Y
Sbjct: 180 PYFTWPLIAADGGYAFKYENGKYDIKDVGVDNAGAKAGLTFLVDLIKNKHMNADTDY--- 236

Query: 252 KDESTEKFYNGDCAMTTASSGSLANIREYAKFNYGVGMMP 291
+ F G+ AMT + +NI + +K NYGV ++P
Sbjct: 237 -SIAEAAFNKGETAMTINGPWAWSNI-DTSKVNYGVTVLP 274


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03484  IGASERPTASE   54     1e-09    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 53.5 bits (128), Expect = 1e-09
Identities = 39/181 (21%), Positives = 62/181 (34%), Gaps = 14/181 (7%)

Query: 19 EQTPEKETEVQNEQPVVEEI---VQAQEPVKASEQAVEEQPQAHTEAEAETFAADVVEVT 75
TP + TE E E Q+ + + Q E +A + +A T EV
Sbjct: 1030 PATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTN---EVA 1086

Query: 76 EQVAESEKAQPEAEVVAQPEPVVEETPEPVAIEREELPLPEDVNAEAVSPEEWQAEAETV 135
+ +E+++ Q E E E +E E+ V ++ VSP++ Q+E
Sbjct: 1087 QSGSETKETQTT----ETKETATVEKEEKAKVETEKTQEVPKVTSQ-VSPKQEQSETVQP 1141

Query: 136 EIVEAAEEEA---AKEEITDEELETALAAEAAEEAVMVVPPAEEEQPVEEIAQEQEKPTK 192
+ A E + KE + A E + V P E V E P
Sbjct: 1142 QAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPEN 1201

Query: 193 E 193

Sbjct: 1202 T 1202



Score = 45.4 bits (107), Expect = 4e-07
Identities = 29/157 (18%), Positives = 53/157 (33%), Gaps = 10/157 (6%)

Query: 17 QKEQTPEKETEVQNEQPVVEEIVQAQEPVKASE------QAVEEQPQAHTEAEAETFAAD 70
Q +T E T + E+ VE + P S+ Q+ QPQA E +
Sbjct: 1096 QTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNI 1155

Query: 71 VVEVTEQVAESEKAQPEAEVVAQPEPVVEETPEPVAIEREELPLPEDVNAEAVSPEEWQA 130
++ ++ QP E + E V E+ V + PE+ P
Sbjct: 1156 KEPQSQTNTTADTEQPAKETSSNVEQPVTES-TTVNTGNSVVENPENTTPATTQP---TV 1211

Query: 131 EAETVEIVEAAEEEAAKEEITDEELETALAAEAAEEA 167
+E+ + + + + E T + + + A
Sbjct: 1212 NSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVA 1248



Score = 41.2 bits (96), Expect = 1e-05
Identities = 31/153 (20%), Positives = 57/153 (37%), Gaps = 7/153 (4%)

Query: 17 QKEQTPEKETEVQNEQPVVEEIVQAQEPVKASEQAVE-EQPQAHTEAEAETFAADVV--- 72
+ ++ P+ ++V +Q E + EP + ++ V ++PQ+ T A+T
Sbjct: 1118 KTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSS 1177

Query: 73 EVTEQVAESEKAQPEAEVVAQPEPVVEETPEPVAIEREELPLPEDVNAEAVSPEEWQAEA 132
V + V ES VV PE T +P E P++ + +V E
Sbjct: 1178 NVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNS-ESSNKPKNRHRRSVRSVPHNVEP 1236

Query: 133 ETVEIVEAAEEEAAKEEITDEELETALAAEAAE 165
T + A ++T L+ A+
Sbjct: 1237 ATTSSNDR--STVALCDLTSTNTNAVLSDARAK 1267



Score = 38.1 bits (88), Expect = 9e-05
Identities = 29/178 (16%), Positives = 53/178 (29%), Gaps = 7/178 (3%)

Query: 17 QKEQTPEKETEVQNEQPVVEEIVQAQEPVKASEQAVEEQPQAHTEAE-AETFAADVVEVT 75
+E E ++ V+ E+ Q+ K ++ ++ + E A+ EV
Sbjct: 1065 NREVAKEAKSNVKAN-TQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVP 1123

Query: 76 EQVAESEKAQPEAEVV-AQPEPVVEETPEPVAIEREELPLPEDVNAEAVSP-EEWQAEAE 133
+ ++ Q ++E V Q EP E P E + + A+ P +E + E
Sbjct: 1124 KVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQS---QTNTTADTEQPAKETSSNVE 1180

Query: 134 TVEIVEAAEEEAAKEEITDEELETALAAEAAEEAVMVVPPAEEEQPVEEIAQEQEKPT 191
E A P + V + E T
Sbjct: 1181 QPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPAT 1238



Score = 34.3 bits (78), Expect = 0.001
Identities = 30/144 (20%), Positives = 50/144 (34%), Gaps = 9/144 (6%)

Query: 52 VEEQPQAHTEAEAETFAADVVEVT-EQVAESEKAQPEAEVVAQPEPVVE-ETPEPVAIER 109
VE++ Q T +V E A+ + V P P ET E VA
Sbjct: 985 VEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENS 1044

Query: 110 EELPLPEDVNAEAVSPEEWQAEAETVEIVEAAEEEAAKEEITDEELETALAAEAAEEAVM 169
++ ++ V E A T + E A+E + + + E A + +E
Sbjct: 1045 KQ-------ESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQT 1097

Query: 170 VVPPAEEEQPVEEIAQEQEKPTKE 193
EE A+ + + T+E
Sbjct: 1098 TETKETATVEKEEKAKVETEKTQE 1121


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03487  SHIGARICIN    26     0.042    Ribosome inactivating protein family signature.
		>SHIGARICIN#Ribosome inactivating protein family signature.

Length = 289

Score = 25.9 bits (57), Expect = 0.042
Identities = 6/21 (28%), Positives = 13/21 (61%)

Query: 7 FFIVIIGLIVVAASFRFMQQR 27
+V+I AA ++F++Q+
Sbjct: 173 ALMVLIQSTSEAARYKFIEQQ 193


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03490  PF01206       105    3e-34    SirA family protein
		>PF01206#SirA family protein

Length = 76

Score = 105 bits (265), Expect = 3e-34
Identities = 24/72 (33%), Positives = 41/72 (56%)

Query: 9 DHTLDALGLRCPEPVMMVRKTVRNMQPGETLLIIADDPATTRDIPGFCTFMEHELVAKET 68
D +LDA GL CP P++ +KT+ M GE L ++A DP + +D F HEL+ ++
Sbjct: 5 DQSLDATGLNCPLPILKAKKTLATMNAGEVLYVMATDPGSVKDFESFSKQTGHELLEQKE 64

Query: 69 DGLPYRYLIRKG 80
+ Y + +++
Sbjct: 65 EDGTYHFRLKRA 76


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03493  TCRTETA       56     9e-11    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 56.0 bits (135), Expect = 9e-11
Identities = 80/398 (20%), Positives = 147/398 (36%), Gaps = 32/398 (8%)

Query: 24 LRLNLRIVSIVMFNFASYLTIGLPLAVLPGYVHDVM--GFSAFWAGLVISLQYFATLLSR 81
++ N ++ I+ + IGL + VLPG + D++ G++++L
Sbjct: 1 MKPNRPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACA 60

Query: 82 PHAGRYADSLGPKKIVVFGLCGCFLSGLGYLTAGLTASLPVISLLLLCLGRVILGI-GQS 140
P G +D G + +++ L G + + Y L V L +GR++ GI G +
Sbjct: 61 PVLGALSDRFGRRPVLLVSLAG---AAVDYAIMATAPFLWV-----LYIGRIVAGITGAT 112

Query: 141 FAGTGSTLWGVGVVGSL--HIGRVISWNGIVTYGAMAMGAPLGVVFYHWGGLQALALIIM 198
A G+ + + H G + + G +G +G H A AL +
Sbjct: 113 GAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGL 172

Query: 199 GVALVAILLAIPRPTVK--ASKGKPLPFRAVLGRVWLYGMALALA-----SAGFGVIATF 251
LL + + P + + +A +A V A
Sbjct: 173 NFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAAL 232

Query: 252 ITLFYDAK-GWDGAAFALTLFSCAFVGT---RLLFPNGINRIGGLNVAMICFSVEIIGLL 307
+F + + WD ++L + + + ++ R+G M+ + G +
Sbjct: 233 WVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYI 292

Query: 308 LVGVATMPWMAKIG-VLLAGAGFSLVFPALGVVAVKAVPQQNQGAALATYTVFMDLSLGV 366
L+ AT WMA VLLA G + PAL + + V ++ QG + L+ +
Sbjct: 293 LLAFATRGWMAFPIMVLLASGGIGM--PALQAMLSRQVDEERQGQLQGSLAALTSLT-SI 349

Query: 367 TGPLAGLVMSWAGVPV----IYLAAAGLVAIALLLTWR 400
GPL + A + ++A A L + L R
Sbjct: 350 VGPLLFTAIYAASITTWNGWAWIAGAALYLLCLPALRR 387


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03500  HTHFIS        29     0.020    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 29.0 bits (65), Expect = 0.020
Identities = 10/34 (29%), Positives = 19/34 (55%)

Query: 25 QAVLNNVSLTLKSGETVALLGRSGCGKSTLARLL 58
Q + ++ +++ T+ + G SG GK +AR L
Sbjct: 147 QEIYRVLARLMQTDLTLMITGESGTGKELVARAL 180


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03506  ABC2TRNSPORT  50     5e-09    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 49.9 bits (119), Expect = 5e-09
Identities = 41/171 (23%), Positives = 73/171 (42%), Gaps = 7/171 (4%)

Query: 200 REREHGTVEHLLVMPITPFEIMMAKI-WSMGLVVLVVSGLSLVLMVKGVLGVPIEGSIPL 258
R T E +L + +I++ ++ W+ L +G+ +V G + L
Sbjct: 93 RMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGAGIGVVAAALGY----TQWLSLL 148

Query: 259 FMLGV-ALSLFATTSIGIFMGTIARSMPQLGLLVILVLLPLQMLSGGSTPRESMPQMVQD 317
+ L V AL+ A S+G+ + +A S LV+ P+ LSG P + +P + Q
Sbjct: 149 YALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPIVFQT 208

Query: 318 IMLTMPTTHFVSLAQAILYRGAGFEIVWPQFLTLMAIGGAFF-TIALLRFR 367
+P +H + L + I+ ++ + I FF + ALLR R
Sbjct: 209 AARFLPLSHSIDLIRPIMLGHPVVDVCQHVGALCIYIVIPFFLSTALLRRR 259


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03507  PF05272       30     0.045    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.4 bits (68), Expect = 0.045
Identities = 9/26 (34%), Positives = 14/26 (53%)

Query: 37 ARCMVGLIGPDGVGKSSLLSLISGAR 62
V L G G+GKS+L++ + G
Sbjct: 595 FDYSVVLEGTGGIGKSTLINTLVGLD 620


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03508  RTXTOXIND     84     5e-20    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 84.1 bits (208), Expect = 5e-20
Identities = 73/409 (17%), Positives = 142/409 (34%), Gaps = 83/409 (20%)

Query: 6 RHLAWWVVGLLAVAAIVVWWLLRPAGVPEGFAVSNGRI--EATEVDIASKIAGRIDTILV 63
R +A++++G L +A + +L E A +NG++ +I + I+V
Sbjct: 58 RLVAYFIMGFLVIA--FILSVLGQV---EIVATANGKLTHSGRSKEIKPIENSIVKEIIV 112

Query: 64 KEGQFVREGEVLAKMDTRV----------------LQEQRLEAI---------------- 91
KEG+ VR+G+VL K+ L++ R + +
Sbjct: 113 KEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPD 172

Query: 92 --------------------AQIKEAQSAVAAAQALLEQRQSETRAAQSLVNQRQAELDS 131
Q Q+ + L+++++E + +N+ +
Sbjct: 173 EPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRV 232

Query: 132 VAKRHTRSRSLAQRGAISAQQLDDDRAAAESARAALESAKAQVSASKAAIEAARTNIIQ- 190
R SL + AI+ + + A L K+Q+ ++ I +A+
Sbjct: 233 EKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLV 292

Query: 191 ------------AQTRVEAAQATERRIAADID--DSELKAPRDGRV-QYRVAEPGEVLAA 235
QT T + S ++AP +V Q +V G V+
Sbjct: 293 TQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTT 352

Query: 236 GGRVLNMVDLSDVY-MTFFLPTEQAGTLKLGGEARLILDAAPDLRIPATISFVASVAQFT 294
++ +V D +T + + G + +G A + ++A P R V V
Sbjct: 353 AETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYG---YLVGKVKNIN 409

Query: 295 PKTVETSDERLKLMFRVKARIPPELLQQHLEYV--KTGLPGVAWVRVNE 341
+E D+RL L+F V I L + + +G+ A ++
Sbjct: 410 LDAIE--DQRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGM 456


50  CMJKDNLE_03520  CMJKDNLE_03536  Y        Y        N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias      Product
CMJKDNLE_03520  1       17       -3.644153    glutathione reductase (NADPH)
CMJKDNLE_03521  1       22       -5.865059    hypothetical protein
CMJKDNLE_03522  0       21       -6.761874    ArsR-antimonite
CMJKDNLE_03523  1       22       -7.135381    arsenite:H+ antiporter
CMJKDNLE_03524  1       27       -10.771006   arsenate reductase
CMJKDNLE_03525  2       31       -11.912147   hypothetical protein
CMJKDNLE_03526  2       21       -8.027255    starvation lipoprotein
CMJKDNLE_03527  1       22       -10.419091   putative DNA-binding transcriptional regulator
CMJKDNLE_03528  -2      21       -5.832877    putative Mg(2+) transport ATPase
CMJKDNLE_03529  -2      13       -2.201642    acid stress chaperone
CMJKDNLE_03530  -1      13       -2.045748    acid-resistance protein, possible chaperone
CMJKDNLE_03531  -1      13       -2.601370    acid-resistance membrane protein
CMJKDNLE_03532  -1      15       -3.404010    GadE DNA-binding transcriptional activator
CMJKDNLE_03533  -1      13       -2.325150    MdtEF-TolC multidrug efflux transport system -
CMJKDNLE_03534  -1      10       -2.162210    MdtEF-TolC multidrug efflux transport system -
CMJKDNLE_03535  0       19       -4.254046    hypothetical protein
CMJKDNLE_03536  0       16       -3.296314    GadW DNA-binding transcriptional dual regulator
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03533  RTXTOXIND     51     4e-09    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 51.0 bits (122), Expect = 4e-09
Identities = 41/218 (18%), Positives = 70/218 (32%), Gaps = 33/218 (15%)

Query: 97 LQAELNSAKGSLAKALSTASNARITFNRQASLLKTNYVSR-QDYDT-ARTQLNEAEANVT 154
+ + A L S + K Y Q + +L + N+
Sbjct: 257 QENKYVEAVNELRVYKSQLEQIE----SEILSAKEEYQLVTQLFKNEILDKLRQTTDNIG 312

Query: 155 VAKAAVEQATINLQYANVTSPITGVSGKSSV-TVGALVTANQADSLVTVQRLDPIYVDLT 213
+ + + Q + + +P++ + V T G +VT + +V V D + V
Sbjct: 313 LLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAET-LMVIVPEDDTLEVTAL 371

Query: 214 QSVQDFLRMKEEVASGQIKQVQGSTPVQLNLE--NGKRY-SQTGTLK--FSDPTVDETTG 268
+D I + + +E RY G +K D D+ G
Sbjct: 372 VQNKD------------IGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLG 419

Query: 269 SVT--LRAI------FPNPNGDLLPGMYVTALVDEGSR 298
V + +I N N L GM VTA + G R
Sbjct: 420 LVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGMR 457



Score = 32.1 bits (73), Expect = 0.004
Identities = 25/118 (21%), Positives = 47/118 (39%), Gaps = 7/118 (5%)

Query: 53 PGRTVPY-EVAEIRPQVGGIIIKRNFI-EGDKVNQGDSLYQIDPAPLQAELNSAKGSLAK 110
G+ EI+P I+ K + EG+ V +GD L ++ +A+ + SL +
Sbjct: 87 NGKLTHSGRSKEIKPIENSIV-KEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQ 145

Query: 111 ALSTASNARITFNRQASLLKTNYVSRQDYDTARTQLNEAEANVTVAKAAVEQATINLQ 168
A + +I +R L K + D + N +E V + +++ Q
Sbjct: 146 ARLEQTRYQIL-SRSIELNKLPELKLPDEPYFQ---NVSEEEVLRLTSLIKEQFSTWQ 199


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03534  ACRIFLAVINRP  1292   0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 1292 bits (3345), Expect = 0.0
Identities = 723/1032 (70%), Positives = 845/1032 (81%), Gaps = 1/1032 (0%)

Query: 1 MANYFIDRPVFAWVLAIIMMLAGGLAIMNLPVAQYPQIAPPTITVSATYPGADAQTVEDS 60
MAN+FI RP+FAWVLAII+M+AG LAI+ LPVAQYP IAPP ++VSA YPGADAQTV+D+
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 61 VTQVIEQNMNGLDGLMYMSSTSDAAGNASITLTFETGTSPDIAQVQVQNKLQLAMPSLPE 120
VTQVIEQNMNG+D LMYMSSTSD+AG+ +ITLTF++GT PDIAQVQVQNKLQLA P LP+
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 121 AVQQQGISVDKSSSNILMVAAFISDNGSLNQYDIADYVASNIKDPLSRTAGVGSVQLFGS 180
VQQQGISV+KSSS+ LMVA F+SDN Q DI+DYVASN+KD LSR GVG VQLFG+
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 181 EYAMRIWLDPQKLNKYNLVPSDVISQIKVQNNQISGGQLGGMPQAADQQLNASIIVQTRL 240
+YAMRIWLD LNKY L P DVI+Q+KVQN+QI+ GQLGG P QQLNASII QTR
Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240

Query: 241 QTPEEFGKILLKVQQDGSQVLLRDVARVELGAEDYSTVARYNGKPAAGIAIKLAAGANAL 300
+ PEEFGK+ L+V DGS V L+DVARVELG E+Y+ +AR NGKPAAG+ IKLA GANAL
Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300

Query: 301 DTSRAVKEELNRLSAYFPASLKTVYPYDTTPFIEISIQEVFKTLVEAIILVFLVMYLFLQ 360
DT++A+K +L L +FP +K +YPYDTTPF+++SI EV KTL EAI+LVFLVMYLFLQ
Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360

Query: 361 NFRATIIPTIAVPVVILGTFAILSAVGFTINTLTMFGMVLAIGLLVDDAIVVVENVERVI 420
N RAT+IPTIAVPVV+LGTFAIL+A G++INTLTMFGMVLAIGLLVDDAIVVVENVERV+
Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420

Query: 421 AEDKLPPKEATHKSMGQIQRALVGIAVVLSAVFMPMAFMSGATGEIYRQFSITLISSMLL 480
EDKLPPKEAT KSM QIQ ALVGIA+VLSAVF+PMAF G+TG IYRQFSIT++S+M L
Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480

Query: 481 SVFVAMSLTPALCATILKAAPEGGHK-PNALFARFNTLFEKSTQHYTDSTRSLLRCTGRY 539
SV VA+ LTPALCAT+LK H+ F FNT F+ S HYT+S +L TGRY
Sbjct: 481 SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540

Query: 540 MVVYLLICAGMAVLFLRTPTSFLPEEDQGVFMTTAQLPSGATMVNTTKVLQQVTDYYLTK 599
+++Y LI AGM VLFLR P+SFLPEEDQGVF+T QLP+GAT T KVL QVTDYYL
Sbjct: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600

Query: 600 EKDNVQSVFTVGGFGFSGQGQNNGLAFISLKPWSERVGEENSVTAIIQRAMIALSSINKA 659
EK NV+SVFTV GF FSGQ QN G+AF+SLKPW ER G+ENS A+I RA + L I
Sbjct: 601 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDG 660

Query: 660 VVFPFNLPAVAELGTASGFDMELLDNGNLGHEKLTQARNELLSLAAQSPNQVTGVRPNGL 719
V PFN+PA+ ELGTA+GFD EL+D LGH+ LTQARN+LL +AAQ P + VRPNGL
Sbjct: 661 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL 720

Query: 720 EDTPMFKVNVNAAKAEAMGVALSDINQTISTAFGSSYVNDFLNQGRVKKVYVQAGTPFRM 779
EDT FK+ V+ KA+A+GV+LSDINQTISTA G +YVNDF+++GRVKK+YVQA FRM
Sbjct: 721 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRM 780

Query: 780 LPDNINQWYVRNASGTMAPLSAYSSTEWTYGSPRLERYNGIPSMEILGEAAAGKSTGDAM 839
LP+++++ YVR+A+G M P SA++++ W YGSPRLERYNG+PSMEI GEAA G S+GDAM
Sbjct: 781 LPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAM 840

Query: 840 KFMADLVAKLPAGVGYSWTGLSYQEALSSNQAPALYAISLVVVFLALAALYESWSIPFSV 899
M +L +KLPAG+GY WTG+SYQE LS NQAPAL AIS VVVFL LAALYESWSIP SV
Sbjct: 841 ALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSV 900

Query: 900 MLVVPLGVVGALLATDLRGLSNDVYFQVGLLTTIGLSAKNAILIVEFAVEMMQKEGKTPI 959
MLVVPLG+VG LLA L NDVYF VGLLTTIGLSAKNAILIVEFA ++M+KEGK +
Sbjct: 901 MLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVV 960

Query: 960 EAIIEAARMRLRPILMTSLAFILGVLPLVISHGAGSGAQNAVGTGVMGGMFAATVLAIYF 1019
EA + A RMRLRPILMTSLAFILGVLPL IS+GAGSGAQNAVG GVMGGM +AT+LAI+F
Sbjct: 961 EATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFF 1020

Query: 1020 VPVFFVVVEHLF 1031
VPVFFVV+ F
Sbjct: 1021 VPVFFVVIRRCF 1032


51  CMJKDNLE_03578  CMJKDNLE_03591  Y        Y        Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias      Product
CMJKDNLE_03578  1       19       -3.325760    glyoxylate reductase / glyoxylate reductase B /
CMJKDNLE_03579  3       25       -6.477750    hypothetical protein
CMJKDNLE_03580  2       20       -1.732341    putative transcriptional regulator
CMJKDNLE_03582  2       26       0.000034     CspA DNA-binding transcriptional activator
CMJKDNLE_03583  1       24       -0.119410    small toxic membrane polypeptide
CMJKDNLE_03584  -1      20       -0.595848    IS150 protein InsA
CMJKDNLE_03585  -2      21       -0.947960    hypothetical protein
CMJKDNLE_03587  -2      18       -0.666595    glycyl-tRNA synthetase, beta subunit
CMJKDNLE_03588  1       15       -1.473254    glycyl-tRNA synthetase, alpha subunit
CMJKDNLE_03589  0       16       -2.447788    hypothetical protein
CMJKDNLE_03590  0       15       -3.345354    O-acetyltransferase
CMJKDNLE_03591  -2      16       -3.413870    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03583  HOKGEFTOXIC   65     8e-19    Hok/Gef cell toxic protein family signature.
		>HOKGEFTOXIC#Hok/Gef cell toxic protein family signature.

Length = 52

Score = 65.2 bits (159), Expect = 8e-19
Identities = 17/50 (34%), Positives = 32/50 (64%)

Query: 1 MPQKYRLLSLIVICFTLLFFTWMIRDSLCELHIKQESYELAAFLACKLKE 50
+P+ + ++++C TLL FT++ R SLCE+ + E+AAF+A + +
Sbjct: 3 LPRSSLVWCVLIVCLTLLIFTYLTRKSLCEIRYRDGYREVAAFMAYESGK 52


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03591  FLGBIOSNFLIP  27     0.017    Escherichia coli: Flagellar biosynthetic protein Fl...
		>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature.

Length = 245

Score = 27.5 bits (61), Expect = 0.017
Identities = 19/66 (28%), Positives = 26/66 (39%), Gaps = 1/66 (1%)

Query: 77 MTCLTVFIISVALLMVGLWNATLLLSEKGFYGLAFFLSLFGAVAVQKNIRDAGINPPKET 136
MT T II LL L + + GLA FL+ F V I P E
Sbjct: 61 MTSFTRIIIVFGLLRNALGTPSAP-PNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEE 119

Query: 137 QVTQEE 142
+++ +E
Sbjct: 120 KISMQE 125


52  CMJKDNLE_03647  CMJKDNLE_03659  Y        Y        N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias      Product
CMJKDNLE_03647  0       19       -4.884790    2-amino-3-ketobutyrate CoA ligase
CMJKDNLE_03648  1       25       -7.808833    ADP-L-glycero-D-mannoheptose-6-epimerase
CMJKDNLE_03649  2       34       -10.202920   ADP-heptose:LPS heptosyltransferase II
CMJKDNLE_03650  4       44       -13.530901   ADP-heptose:LPS heptosyltransferase I
CMJKDNLE_03651  5       50       -16.344177   hypothetical protein
CMJKDNLE_03652  4       48       -15.597361   lipopolysaccharide glucosyltransferase I
CMJKDNLE_03653  4       50       -17.401714   protein involved in KdoIII attachment during
CMJKDNLE_03654  4       49       -16.380422   lipopolysaccharide core heptose (II) kinase
CMJKDNLE_03655  3       46       -14.210768   UDP-glucose:(glucosyl)LPS
CMJKDNLE_03656  3       43       -12.669349   UDP-D-glucose:(glucosyl)LPS
CMJKDNLE_03657  2       34       -8.915147    UDP-D-galactose:(glucosyl)lipopolysaccharide-1,
CMJKDNLE_03658  2       29       -7.502588    UDP-glucose:(glucosyl)LPS
CMJKDNLE_03659  0       20       -3.386437    lipopolysaccharide core heptose (I) kinase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03648  NUCEPIMERASE  102    2e-27    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 102 bits (257), Expect = 2e-27
Identities = 76/348 (21%), Positives = 127/348 (36%), Gaps = 67/348 (19%)

Query: 2 IIVTGGAGFIGSNIVKALNDKGITDILVVDNLKD--------------GTKFVNLVDLNI 47
+VTG AGFIG ++ K L + G ++ +DNL D +++
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAG-HQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 48 ADYMDKEDFLIQIMAGEEFGDVEAIFHEGACSSTTEWDGKYMMDNNYQYSK-------EL 100
AD + + + A F E +F + +Y ++N + Y+ +
Sbjct: 62 ADR----EGMTDLFASGHF---ERVFISPHRLAV-----RYSLENPHAYADSNLTGFLNI 109

Query: 101 LHYCLEREIP-FLYASSAATYGGRTSD-FIESREYEKPLNVYGYSKFLFDEYVRQILPEA 158
L C +I LYASS++ YG F + P+++Y +K +
Sbjct: 110 LEGCRHNKIQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLY 169

Query: 159 NSQIVGFRYFNVYGPREGHKGSMASVAFHLNTQLNNGESPKLFEGSENFKRDFVYVGDVA 218
G R+F VYGP + MA F + G+S ++ KRDF Y+ D+A
Sbjct: 170 GLPATGLRFFTVYGPWG--RPDMA--LFKFTKAMLEGKSIDVY-NYGKMKRDFTYIDDIA 224

Query: 219 DVNL------------WFLENGVSG-------IFNLGTGRAESFQAVADATLAY-HKKGQ 258
+ + W +E G ++N+G A + +
Sbjct: 225 EAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAK 284

Query: 259 IEYIPFPDKLKGRYQAFTQADLTNLRAA-GYDKPFKTVAEGVTEYMAW 305
+P G T AD L G+ P TV +GV ++ W
Sbjct: 285 KNMLPLQ---PGDVL-ETSADTKALYEVIGF-TPETTVKDGVKNFVNW 327


53  CMJKDNLE_03679  CMJKDNLE_03703  Y        Y        Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias      Product
CMJKDNLE_03679  -2      13       3.086558     RNA polymerase, ω subunit
CMJKDNLE_03680  -2      12       2.980819     guanosine 3'-diphosphate 5'-triphosphate
CMJKDNLE_03681  -1      12       2.593512     tRNA (Gm18) 2'-O-methyltransferase
CMJKDNLE_03682  -1      11       1.793759     RecG DNA helicase
CMJKDNLE_03684  -1      9        0.061752     glutamate:sodium symporter
CMJKDNLE_03685  -2      9        -0.537750    xanthine:H+ symporter XanP
CMJKDNLE_03686  -3      10       -1.528454    hypothetical protein
CMJKDNLE_03687  -2      12       -2.778698    alpha-xylosidase
CMJKDNLE_03688  -1      15       -3.178403    YicJ GPH transporter
CMJKDNLE_03690  0       19       -2.990698    *inner membrane protein SetC - putative arabinose
CMJKDNLE_03691  0       20       -2.484817    inhibitor of heme biosynthesis
CMJKDNLE_03692  1       16       -0.820222    lipoprotein-28
CMJKDNLE_03693  -1      11       0.032297     hypothetical protein
CMJKDNLE_03694  0       12       0.925583     purine ribonucleoside efflux transporter
CMJKDNLE_03695  -1      12       1.742937     hypothetical protein
CMJKDNLE_03696  -1      12       2.184970     putative membrane protein with possible
CMJKDNLE_03697  -1      14       3.070370     cryptic adenine deaminase monomer
CMJKDNLE_03698  0       17       3.496046     hexose-6-phosphate:phosphate antiporter
CMJKDNLE_03699  1       17       3.923010     glycerol-3-phosphate:phosphate antiporter
CMJKDNLE_03700  1       18       4.416139     Signal transduction histidine-protein
CMJKDNLE_03701  1       18       3.842917     FimZ transcriptional regulator
CMJKDNLE_03702  2       17       2.922644     acetohydroxybutanoate synthase / acetolactate
CMJKDNLE_03703  1       15       3.132186     acetohydroxybutanoate synthase / acetolactate
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03682  SECA          42     9e-06    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 41.8 bits (98), Expect = 9e-06
Identities = 38/129 (29%), Positives = 56/129 (43%), Gaps = 18/129 (13%)

Query: 233 NLSMLALRAGAQRFHAQPLSANDTLKNKLLAALPFKPTGAQARVVAEIEHDM-ALDVPMM 291
LS L+ F A+ L + L+N + A A R ++ M DV ++
Sbjct: 37 KLSDEELKGKTAEFRAR-LEKGEVLENLIPEAF------AVVREASKRVFGMRHFDVQLL 89

Query: 292 ---RLVQGDV-----GSGKTLVAALAA-LRAIAHGKQVALMAPTELLAEQHANNFRNWFA 342
L + + G GKTL A L A L A+ GK V ++ + LA++ A N R F
Sbjct: 90 GGMVLNERCIAEMRTGEGKTLTATLPAYLNALT-GKGVHVVTVNDYLAQRDAENNRPLFE 148

Query: 343 PLGIKVGWL 351
LG+ VG
Sbjct: 149 FLGLTVGIN 157


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03690  TCRTETA       39     2e-05    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 39.0 bits (91), Expect = 2e-05
Identities = 71/391 (18%), Positives = 130/391 (33%), Gaps = 33/391 (8%)

Query: 20 LLVAFLTSIAGALQTPTLSIFLADELKARPIM--VGFFFTGSAIMGILVSQFLARHSDKQ 77
L L ++ L P L L D + + + G A+M + L SD+
Sbjct: 11 LSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRF 70

Query: 78 GDRKLLILLCCLFGVLACTLFAWNRNYFILLSTGVLLSSFASTANPQMFALAREHADRTG 137
G R ++L+ + + A ++L ++ A AD T
Sbjct: 71 GRR-PVLLVSLAGAAVDYAIMATAPFLWVLYIGRIV----AGITGATGAVAGAYIADITD 125

Query: 138 RET-VMFSTFLRAQISLAWVIGPPLAYELAMGFSFKVMYLTAAIAFVVCGLIVWLFLP-- 194
+ F+ A V GP L + GFS + AA + L LP
Sbjct: 126 GDERARHFGFMSACFGFGMVAGPVLGGLMG-GFSPHAPFFAAAALNGLNFLTGCFLLPES 184

Query: 195 --SIQRNIPVVT-QPVEILPSTHRKRDTRLLFVVCSMMWAANNLYMINMPLFIIDELHLT 251
+R + P+ L V +M + +F D H
Sbjct: 185 HKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWD 244

Query: 252 DKLAGEMI-GIAAGLEIPMMLIAGYYMKRIGKRLLMLIAIVSGMCFYASVLMATTPAVEL 310
G + + +I G R+G+R +++ +++ Y +L+A +
Sbjct: 245 ATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGY--ILLAFATRGWM 302

Query: 311 ELQILNAIFLGILCGIGMLYFQDLMPEKI---------GSATTLYANTSRVGWIIAGSVD 361
+ L GIGM Q ++ ++ GS L + TS VG ++ ++
Sbjct: 303 ---AFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIY 359

Query: 362 GIMVEIWSYHALFWLAIGMLGIAMICLLFIK 392
+ W+ W I + ++CL ++
Sbjct: 360 AASITTWNG----WAWIAGAALYLLCLPALR 386



Score = 33.3 bits (76), Expect = 0.002
Identities = 18/102 (17%), Positives = 34/102 (33%)

Query: 17 AAFLLVAFLTSIAGALQTPTLSIFLADELKARPIMVGFFFTGSAIMGILVSQFLARHSDK 76
AA + V F+ + G + IF D +G I+ L +
Sbjct: 213 AALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAA 272

Query: 77 QGDRKLLILLCCLFGVLACTLFAWNRNYFILLSTGVLLSSFA 118
+ + ++L + L A+ ++ VLL+S
Sbjct: 273 RLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASGG 314


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03694  TCRTETA       38     4e-05    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 38.3 bits (89), Expect = 4e-05
Identities = 33/208 (15%), Positives = 71/208 (34%), Gaps = 13/208 (6%)

Query: 49 IIVEFLPVSLLTP----MAQDLGISEGVA---GQSVTVTAFVAMFASLFITQTIQATDRR 101
+ ++ + + L+ P + +DL S V G + + A + + + RR
Sbjct: 14 VALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRR 73

Query: 102 YVVILFAVLLTLSCLLVSFANSFSLLLIGRACLGLALGGFWAMSASLTMRLVPPRTVPKA 161
V+++ + +++ A +L IGR G+ G A++ + + +
Sbjct: 74 PVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGIT-GATGAVAGAYIADITDGDERARH 132

Query: 162 LSVIFGAVSIALVIAAPLGSFLGELIGWRNVFNAAAVMG----VLCIFWIIKSLPSLPGE 217
+ +V LG +G F AAA + + F + +S
Sbjct: 133 FGFMSACFGFGMVAGPVLGGLMGG-FSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRP 191

Query: 218 PSHQKQNTFRLLQRPGVMAGMIAIFMSF 245
+ N + M + A+ F
Sbjct: 192 LRREALNPLASFRWARGMTVVAALMAVF 219


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03697  UREASE        38     9e-05    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 38.2 bits (89), Expect = 9e-05
Identities = 28/105 (26%), Positives = 41/105 (39%), Gaps = 17/105 (16%)

Query: 22 AVSRGDAVADYIIDNVSILDLINGGEISGPIVIKGRYIAGVG----------AEYTDAPA 71
V+R D +I N ILD + G + I +K IA +G P
Sbjct: 60 QVTREGGAVDTVITNALILD--HWGIVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPG 117

Query: 72 LQRIDARGATAVPGFIDAHLHIESSMMTPVTFETATLPRGLTTVI 116
+ I G G +D+H+H + P E A L GLT ++
Sbjct: 118 TEVIAGEGKIVTAGGMDSHIH----FICPQQIEEA-LMSGLTCML 157


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03698  TCRTETB       34     0.001    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 34.1 bits (78), Expect = 0.001
Identities = 28/168 (16%), Positives = 61/168 (36%), Gaps = 17/168 (10%)

Query: 49 FNIAQNDMISTYGLSMTQLGMIGLGFSITYGVGKTLVSYYADGKNTKQFLPFMLILSAIC 108
N++ D+ + + + F +T+ +G + +D K+ L F +I++ C
Sbjct: 33 LNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIIN--C 90

Query: 109 MLGFSASMGSGSVSLFLMIAFYALSGFFQSTGGSCSYSTI----TKWTPRRKRGTFLGFW 164
+G SL +M + F Q G + + + ++ P+ RG G
Sbjct: 91 FGSVIGFVGHSFFSLLIM------ARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLI 144

Query: 165 NISHNLGGAGAAGVALFGANYLFDGHVIGMFIFPSIIALIVGFIGLRY 212
+G + A+Y+ + + + P + I+ L
Sbjct: 145 GSIVAMGEGVGPAIGGMIAHYIHWSY---LLLIP--MITIITVPFLMK 187


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03699  TCRTETB       41     9e-06    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 40.6 bits (95), Expect = 9e-06
Identities = 65/408 (15%), Positives = 137/408 (33%), Gaps = 60/408 (14%)

Query: 29 RHILLTIWLGYALFY--FTRKSFNAAVPEILANGVLSRSDIGLLATLFYITYGVSKFVSG 86
RH + IWL F+ N ++P+I + + + T F +T+ + V G
Sbjct: 11 RHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYG 70

Query: 87 IVSDRSNARYFMGIGLIATGIINILFGFSTSLWAFAVLWVLNAFFQGWGS---PVCARLL 143
+SD+ + + G+I +++ S F L ++ F QG G+ P ++
Sbjct: 71 KLSDQLGIKRLLLFGIIINCFGSVIGFVGHS---FFSLLIMARFIQGAGAAAFPALVMVV 127

Query: 144 TAWY-SRTERGGWWALWNTAHNVGGALIPIVMAAAALHYGWRAGMMIAGCMAIVVGIFLC 202
A Y + RG + L + +G + P + A + W ++ M ++ +
Sbjct: 128 VARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHW--SYLLLIPMITIITVPF- 184

Query: 203 WRLRDRPQALGLPAVGEWRHDALEIAQQQEGAGLTRKEILTKYVLLNPYIWLLSFCYVLV 262
+ L +I G L I+ + Y VL
Sbjct: 185 --------LMKLLKKEVRIKGHFDIK----GIILMSVGIVFFMLFTTSYSISFLIVSVLS 232

Query: 263 YVV-----RAAINDWGNLYMSETLGVDLVTANTAVTMFELGGFI-----------GALVA 306
+++ R + + + + + + + + + GF+ A
Sbjct: 233 FLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTA 292

Query: 307 GWGSDKLFNGNRGPMNLIFAAGILL-SVGSLWLMPFASYVMQATCFFTIGFFVFGPQMLI 365
GS +F G + + GIL+ G L+++ + + F T F + +
Sbjct: 293 EIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFL-SVSFLTASFLLETTSWFM 351

Query: 366 ---------GMAAAECS---------HKEAAGAATGFVGLFAYLGASL 395
G++ + ++ AGA + ++L
Sbjct: 352 TIIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGT 399


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03700  PF06580       40     2e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 39.8 bits (93), Expect = 2e-05
Identities = 28/142 (19%), Positives = 56/142 (39%), Gaps = 11/142 (7%)

Query: 365 LRPRQLDDLTLEQAIRSLMREMELEGRGIVSHLEWRIDESALSENQRVTLFRVCQEGLNN 424
LR ++L + + ++L L++ + + +V + Q + N
Sbjct: 208 LRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPM-LVQTLVEN 266

Query: 425 IVKHA-----DASAVTLQGWQQDERLMLVIEDDGSGLPPGSGQ-QGFGLTGMRERVTALG 478
+KH + L+G + + + L +E+ GS + + G GL +RER+ L
Sbjct: 267 GIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGTGLQNVRERLQMLY 326

Query: 479 G---TLHISCLHG-TRVSVSLP 496
G + +S G V +P
Sbjct: 327 GTEAQIKLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03701  HTHFIS        61     2e-13    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 61.4 bits (149), Expect = 2e-13
Identities = 29/174 (16%), Positives = 59/174 (33%), Gaps = 20/174 (11%)

Query: 2 ITVALIDDHLIVRSGFAQLLGLEPDLQVVAEFGSGREALAGLPGRGVQVCICDISMPDIS 61
T+ + DD +R+ Q L V + + + + D+ MPD +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRA-GYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 62 GLELLSQLPK---GMATIMLSVHDSPALVEQALNAGARGFLSKRCSPDELIAAVHTVATG 118
+LL ++ K + +++S ++ +A GA +L K ELI +
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR---- 117

Query: 119 GCYLTPDIAIKLASGRQDPLTKRERQVAEKLAQG---MAVKEIAAELGLSPKTV 169
A+ R L + + + + + A L + T+
Sbjct: 118 --------ALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTL 163


54  CMJKDNLE_03749  CMJKDNLE_03776  Y        Y        N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias      Product
CMJKDNLE_03749  1       16       -4.222865    6-phosphogluconate phosphatase
CMJKDNLE_03750  1       16       -4.726601    putative inner membrane protein
CMJKDNLE_03751  2       13       -4.059619    colicin E2 tolerance protein
CMJKDNLE_03752  1       12       -3.005409    putative 6-phosphogluconolactonase
CMJKDNLE_03753  1       14       -3.820879    putative xylanase
CMJKDNLE_03754  0       14       -3.739904    carbohydrate-specific outer membrane porin,
CMJKDNLE_03755  -2      13       -2.867366    carbohydrate-specific outer membrane porin,
CMJKDNLE_03756  -2      11       -1.398265    6-phospho-beta-glucosidase B; cryptic
CMJKDNLE_03757  -2      14       -0.280100    beta-glucoside PTS permease BglF - cryptic
CMJKDNLE_03758  -3      20       -0.124042    BglG transcriptional antiterminator (monomer)
CMJKDNLE_03759  -1      29       1.607622     PhoU phosphate transport system protein
CMJKDNLE_03760  -1      27       1.531861     phosphate ABC transporter - ATP binding subunit
CMJKDNLE_03761  -1      26       1.780468     phosphate ABC transporter - membrane subunit
CMJKDNLE_03762  2       31       1.663870     phosphate ABC transporter - membrane subunit
CMJKDNLE_03763  2       31       1.496450     phosphate ABC transporter - periplasmic binding
CMJKDNLE_03764  3       36       1.745073     L-glutamine:D-fructose-6-phosphate
CMJKDNLE_03765  4       35       1.620728     fused N-acetylglucosamine-1-phosphate
CMJKDNLE_03767  5       39       1.789426     ATP synthase F1 complex - epsilon subunit
CMJKDNLE_03768  5       41       1.683403     ATP synthase F1 complex - beta subunit
CMJKDNLE_03769  4       34       0.718254     ATP synthase F1 complex - gamma subunit
CMJKDNLE_03770  5       35       0.497492     ATP synthase F1 complex - alpha subunit
CMJKDNLE_03771  3       20       -0.737979    ATP synthase F1 complex - delta subunit
CMJKDNLE_03772  2       20       0.530187     ATP synthase F0 complex - b subunit
CMJKDNLE_03773  2       18       0.238024     ATP synthase
CMJKDNLE_03774  1       15       -0.134947    ATP synthase F0 complex - a subunit
CMJKDNLE_03775  2       14       1.083052     AtpI
CMJKDNLE_03776  2       11       1.073182     16S rRNA m7G527 methyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03765  RTXTOXINA     29     0.048    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 29.2 bits (65), Expect = 0.048
Identities = 23/80 (28%), Positives = 31/80 (38%), Gaps = 10/80 (12%)

Query: 367 LGDAEIGDNVNIGAGTITCNYDGANKFKTIIGDDVFVGSDTQLVAPVTVGKGATIAAGTT 426
LGD + D V + AG+ N G DV T G AT A T
Sbjct: 616 LGDGD--DKVFLSAGSA--NIYAGK------GHDVVYYDKTDTGYLTIDGTKATEAGNYT 665

Query: 427 VTRNVGENALAISRVPQTQK 446
VTR +G + + V + Q+
Sbjct: 666 VTRVLGGDVKVLQEVVKEQE 685


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03772  IGASERPTASE   27     0.028    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 27.3 bits (60), Expect = 0.028
Identities = 20/101 (19%), Positives = 37/101 (36%), Gaps = 18/101 (17%)

Query: 31 AAIEKRQKEIADGLASAERAHKDLDLAKASATDQLKKAKAEAQVIIEQ--ANKRRSQILD 88
+EK +++ + A K+ + T + A++ ++ Q K + +
Sbjct: 1049 KTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEK 1108

Query: 89 EAKAEAEQERTKIVA----------------QAQAEIEAER 113
E KA+ E E+T+ V Q QAE E
Sbjct: 1109 EEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPAREN 1149


55  CMJKDNLE_03797  CMJKDNLE_03802  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
CMJKDNLE_03797  -2      14       3.050472    putative ATP-dependent protease
CMJKDNLE_03798  -1      19       4.265107    acetohydroxybutanoate synthase / acetolactate
CMJKDNLE_03799  -1      24       4.136704    acetohydroxybutanoate synthase / acetolactate
CMJKDNLE_03800  -1      26       4.221416    branched-chain amino-acid aminotransferase
CMJKDNLE_03801  -1      24       3.967859    dihydroxy acid dehydratase
CMJKDNLE_03802  1       21       3.089406    threonine deaminase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03797  HTHFIS        35     7e-04    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 34.8 bits (80), Expect = 7e-04
Identities = 38/196 (19%), Positives = 62/196 (31%), Gaps = 51/196 (26%)

Query: 170 KHALERPKPTDAVSRALQHDLSDVIGQEQG----KRGLEITAAGGHNILLIGPPGTGKTM 225
AL PK + D ++G+ R L +++ G GTGK +
Sbjct: 116 GRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKEL 175

Query: 226 LASRINGLLPDLSNEEALESAAILSLVNAESVQKQWRQRPFRSPHHSA--------SLTA 277
+A ++ R PF + + +A L
Sbjct: 176 VARALHDYGK-------------------------RRNGPFVAINMAAIPRDLIESELFG 210

Query: 278 MVGG---GAIP-GPGEISLAHNGVLFLDEL----PEFERRTLDALREPIESGQIHLSRTR 329
G GA G A G LFLDE+ + + R L L++ G+
Sbjct: 211 HEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQ----GEYT--TVG 264

Query: 330 AKITYPARFQLVAAMN 345
+ + ++VAA N
Sbjct: 265 GRTPIRSDVRIVAATN 280


56  CMJKDNLE_03881  CMJKDNLE_03903  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
CMJKDNLE_03881  0       21       -5.593267   molybdopterin-guanine dinucleotide biosynthesis
CMJKDNLE_03882  -2      19       -6.060709   molybdopterin guanine dinucleotide synthase
CMJKDNLE_03883  -1      14       -3.994592   hypothetical protein
CMJKDNLE_03884  -1      13       -3.615310   serine/threonine protein kinase
CMJKDNLE_03885  0       13       -3.364249   Thiol:disulfide interchange protein DsbA
CMJKDNLE_03886  0       10       -2.499001   putative GTP-binding protein
CMJKDNLE_03887  0       13       -0.714920   putative endonuclease
CMJKDNLE_03888  1       14       1.303970    DNA polymerase I, 5' --> 3' polymerase, 5' -->
CMJKDNLE_03890  0       16       2.411752    cell division protein; predicted checkpoint
CMJKDNLE_03893  0       17       2.358953    GAP-like protein that activates GTPase activity
CMJKDNLE_03894  2       23       2.287602    coproporphyrinogen III dehydrogenase
CMJKDNLE_03896  2       22       1.955256    small predicted membrane protein
CMJKDNLE_03897  2       18       0.673826    HyfR DNA-binding transcriptional activator
CMJKDNLE_03898  1       18       -1.497897   NtrB
CMJKDNLE_03899  3       18       -3.226599   glutamate-putrescine ligase
CMJKDNLE_03900  2       14       -4.841352   protein possibly involved in ribosome structure
CMJKDNLE_03901  0       13       -5.506309   YihL putative transcriptional regulator
CMJKDNLE_03902  0       10       -3.888457   hypothetical protein
CMJKDNLE_03903  1       10       -3.344793   YihN MFS transporter
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03893  SECA          30     0.004    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 30.2 bits (68), Expect = 0.004
Identities = 11/71 (15%), Positives = 30/71 (42%)

Query: 14 AKARRKTREELDQEARDRKRQKKRRGHAPGSRAAGGNTTSGSKGQNAPKDPRIGSKTPIP 73
+K + + EE+++ + R+ + +R ++ + + + ++G P P
Sbjct: 827 SKVQVRMPEEVEELEQQRRMEAERLAQMQQLSHQDDDSAAAAALAAQTGERKVGRNDPCP 886

Query: 74 LGVTEKVTKQH 84
G +K + H
Sbjct: 887 CGSGKKYKQCH 897


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03897  HTHFIS        602    0.0      FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 602 bits (1553), Expect = 0.0
Identities = 206/478 (43%), Positives = 300/478 (62%), Gaps = 11/478 (2%)

Query: 1 MQRGIVWVVDDDSSIRWVLERALAGAGLTCTTFENGAEVLEALASKTPDVLLSDIRMPGM 60
M + V DDD++IR VL +AL+ AG N A + +A+ D++++D+ MP
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 DGLALLKQIKQRHPMLPVIIMTAHSDLDAAVSAYQQGAFDYLPKPFDIDEAVALVERAIS 120
+ LL +IK+ P LPV++M+A + A+ A ++GA+DYLPKPFD+ E + ++ RA++
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 121 HYQEQQQPRNVQLNGPTTDIIGEAPAMQDVFRIIGRLSRSSISVLINGESGTGKELVAHA 180
+ + ++G + AMQ+++R++ RL ++ ++++I GESGTGKELVA A
Sbjct: 121 EPKRRPSKLEDDSQDGM-PLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARA 179

Query: 181 LHRHSPRAKAPFIALNMAAIPKDLIESELFGHEKGAFTGANTIRQGRFEQADGGTLFLDE 240
LH + R PF+A+NMAAIP+DLIESELFGHEKGAFTGA T GRFEQA+GGTLFLDE
Sbjct: 180 LHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDE 239

Query: 241 IGDMPLDVQTRLLRVLADGQFYRVGGYAPVKVDVRIIAATHQNLEQRVQEGKFREDLFHR 300
IGDMP+D QTRLLRVL G++ VGG P++ DVRI+AAT+++L+Q + +G FREDL++R
Sbjct: 240 IGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYR 299

Query: 301 LNVIRVHLPPLRERREDIPRLARHFLQVAARELGVEAKLLHPETEAALTRLAWPGNVRQL 360
LNV+ + LPPLR+R EDIP L RHF+Q A +E G++ K E + WPGNVR+L
Sbjct: 300 LNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKE-GLDVKRFDQEALELMKAHPWPGNVREL 358

Query: 361 ENTCRWLTVMAAGQEVLIQDLPGELFESTVAESTSQMQPDSWATLLAQWADRALRS---- 416
EN R LT + + + + EL + S + ++Q + +R
Sbjct: 359 ENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFAS 418

Query: 417 -----GHQNLLSEAQPELERTLLTTALRHTQGHKQEAARLLGWGRNTLTRKLKELGME 469
L E+E L+ AL T+G++ +AA LLG RNTL +K++ELG+
Sbjct: 419 FGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVS 476


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03898  PF06580       28     0.042    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 28.3 bits (63), Expect = 0.042
Identities = 34/190 (17%), Positives = 72/190 (37%), Gaps = 41/190 (21%)

Query: 171 IIEQADRLRNLVDRL---LGPQLPGTRVTE-SIHKVAERV---VTLVSMELPDNVRLIRD 223
I+E + R ++ L + L + + S+ V + L S++ D ++
Sbjct: 186 ILEDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQ 245

Query: 224 YDPSLPELAHDPDQIEQVLLN-IVRNALQ---ALGPEGGEIILRTRTAFQLTLHGERYRL 279
+P++ ++ Q+ +L+ +V N ++ A P+GG+I+L+
Sbjct: 246 INPAIMDV-----QVPPMLVQTLVENGIKHGIAQLPQGGKILLKGT------KDNGTVT- 293

Query: 280 AARIDVEDNGPGIPPHLQDTLFYPMVSGREGGTGLGLSIARNLIDQHSGK---IEFTSWP 336
++VE+ G + ++ TG GL R + G I+ +
Sbjct: 294 ---LEVENTGSLALKNTKE------------STGTGLQNVRERLQMLYGTEAQIKLSEKQ 338

Query: 337 GHTEFSVYLP 346
G V +P
Sbjct: 339 GKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03900  TCRTETOQM     180    4e-51    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 180 bits (458), Expect = 4e-51
Identities = 97/445 (21%), Positives = 170/445 (38%), Gaps = 81/445 (18%)

Query: 4 KLRNIAIIAHVDHGKTTLVDKLLQQSGTFDSRAETQE--RVMDSNDLEKERGITILAKNT 61
K+ NI ++AHVD GKTTL + LL SG + D+ LE++RGITI T
Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61

Query: 62 AIKWNDYRINIVDTPGHADFGGEVERVMSMVDSVLLVVDAFDGPMPQTRFVTKKAFAYGL 121
+ +W + ++NI+DTPGH DF EV R +S++D +L++ A DG QTR + G+
Sbjct: 62 SFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGI 121

Query: 122 KPIVVINKVDRPGARPDWVVDQVFD-------------LFVNLDATDEQLD--------- 159
I INK+D+ G V + + L+ N+ T+
Sbjct: 122 PTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIEG 181

Query: 160 --------------------------------FPIVYASALNGIAGLDHEDMAEDMTPLY 187
FP+ + SA N I G+D+ L
Sbjct: 182 NDDLLEKYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNNI-GIDN---------LI 231

Query: 188 QAIVDHVPAPDVDLDGPFQMQISQLDYNSYVGVIGIGRIKRGKVKPNQQVTIIDSEGKTR 247
+ I + + ++ +++Y+ + R+ G + V I + E
Sbjct: 232 EVITNKFYSSTHRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKEKI-- 289

Query: 248 NAKVGKVLGHLGLERIETDLAEAGDIVAITGLGELNISDTVCDTQNVEALPALSVDEPTV 307
K+ ++ + E + D A +G+IV + L ++ + DT+ + + P +
Sbjct: 290 --KITEMYTSINGELCKIDKAYSGEIVILQNEF-LKLNSVLGDTKLLPQRERIENPLPLL 346

Query: 308 SMFFCVNTSPFCGKEGKFVTSRQILDRLNKELVHNVALRVEETEDADAFRVSGRGELHLS 367
+ + D L LR +S G++ +
Sbjct: 347 QTTVEPSKPQQREMLLDALLEISDSDPL---------LRYYVDSATHEIILSFLGKVQME 397

Query: 368 VLIENMRRE-GFELAVSRPKVIFRE 391
V ++ + E+ + P VI+ E
Sbjct: 398 VTCALLQEKYHVEIEIKEPTVIYME 422



Score = 32.5 bits (74), Expect = 0.005
Identities = 13/75 (17%), Positives = 29/75 (38%), Gaps = 1/75 (1%)

Query: 398 EPYENVTLDVEEQHQGSVMQALGERKGDLKNMNPDGKGRVRLDYVIPSRGLIGFRSEFMT 457
EPY + + +++ + ++ + V L IP+R + +RS+
Sbjct: 537 EPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQLKN-NEVILSGEIPARCIQEYRSDLTF 595

Query: 458 MTSGTGLLYSTFSHY 472
T+G + + Y
Sbjct: 596 FTNGRSVCLTELKGY 610


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03903  TCRTETB       29     0.028    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 29.5 bits (66), Expect = 0.028
Identities = 31/161 (19%), Positives = 64/161 (39%), Gaps = 15/161 (9%)

Query: 227 NVFFVYAVYCGLTFFIPFLKNIYLLP----------VALVGAYGIINQYCLKMIGGPIGG 276
N+ F+ V CG F + ++P A +G+ I +I G IGG
Sbjct: 255 NIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGG 314

Query: 277 MISDKILKSPSKYLCYTFIISTAALVLLIMLPHESMPVYLGMACTLGFGAIVFTQRAVFF 336
++ D+ + P L + + + L E+ ++ + G + FT+
Sbjct: 315 ILVDR--RGPLYVLNIGVTFLSVSFLTASFLL-ETTSWFMTIIIVFVLGGLSFTK--TVI 369

Query: 337 APIGEAKIAENKTGAAMALGSFIGYAPAMFCFSLYGYILDL 377
+ I + + + + GA M+L +F + ++ G +L +
Sbjct: 370 STIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSI 410


57  CMJKDNLE_03948  CMJKDNLE_03955  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
CMJKDNLE_03948  1       27       -4.297496   putative DNA-binding response regulator in
CMJKDNLE_03949  1       30       -4.993228   regulator of the Cpx response and possible
CMJKDNLE_03950  1       29       -6.626932   Tyrosine recombinase XerD
CMJKDNLE_03951  2       31       -6.881944   hypothetical protein
CMJKDNLE_03952  2       32       -6.654244   hypothetical protein
CMJKDNLE_03953  0       27       -3.583984   hypothetical protein
CMJKDNLE_03954  1       28       -4.673488   hypothetical protein
CMJKDNLE_03955  -2      25       -3.713222   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03948  HTHFIS        92     9e-24    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 92.2 bits (229), Expect = 9e-24
Identities = 35/117 (29%), Positives = 62/117 (52%), Gaps = 2/117 (1%)

Query: 3 KILLVDDDRELTSLLKELLEMEGFNVIVAHDGEQALDLL-DDSIDLLLLDVMMPKKNGID 61
IL+ DDD + ++L + L G++V + + + DL++ DV+MP +N D
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 TLKALRQTH-QTPVIMLTARGSELDRVLGLELGADDYLPKPFNDRELVARIRAILRR 117
L +++ PV++++A+ + + + E GA DYLPKPF+ EL+ I L
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03953  SECA          28     0.015    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 28.3 bits (63), Expect = 0.015
Identities = 11/72 (15%), Positives = 26/72 (36%)

Query: 9 VEKQPAAMRRIIGKHLAVPRWQDTCDYYNQMMERERLTVCFHAQLKQRHATMRFEEMNDV 68
+ ++ L + W D ++ RER+ +++ + E M
Sbjct: 703 IPGLQERLKNDFDLDLPIAEWLDKEPELHEETLRERILAQSIEVYQRKEEVVGAEMMRHF 762

Query: 69 ERERLVCAIDEL 80
E+ ++ +D L
Sbjct: 763 EKGVMLQTLDSL 774


58  CMJKDNLE_03978  CMJKDNLE_03987  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
CMJKDNLE_03978  3       11       3.177122    peptidase component of the HslVU protease
CMJKDNLE_03980  2       11       3.221953    essential cell division protein FtsN
CMJKDNLE_03981  0       12       2.810980    CytR-cytidine
CMJKDNLE_03982  1       16       4.470371    primosome factor N'
CMJKDNLE_03983  0       17       3.358964    50S ribosomal subunit protein L31
CMJKDNLE_03984  0       17       3.665359    hypothetical protein
CMJKDNLE_03985  -1      18       3.740578    MetJ transcriptional repressor
CMJKDNLE_03986  -1      16       3.643409    O-succinylhomoserine lyase /
CMJKDNLE_03987  -1      17       3.614719    aspartate kinase / homoserine dehydrogenase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03980  IGASERPTASE   41     5e-06    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 40.8 bits (95), Expect = 5e-06
Identities = 31/155 (20%), Positives = 64/155 (41%), Gaps = 5/155 (3%)

Query: 79 LTPEQRQLLEQMQADMRQQPTQLVEVPWNEQTPEQRQQTLQRQRQAQQLAEQQRLAQQSR 138
+ +QAD+ P+ E+ ++ P + +AE + Q+S+
Sbjct: 992 VDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSK--QESK 1049

Query: 139 TTEQSWQQQT-RTSQAAPVQAQPRQSKPASSQQPYQDLLQTPAHTTAQSKPQQAAPVARA 197
T E++ Q T T+Q V + + + A++Q + T ++ ++ A V +
Sbjct: 1050 TVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKE 1109

Query: 198 ADAPKPTAEKKDERRWMVQCGSFRGAEQAETVRAQ 232
A T + ++ + Q + EQ+ETV+ Q
Sbjct: 1110 EKAKVETEKTQEVPKVTSQVSPKQ--EQSETVQPQ 1142


59  CMJKDNLE_04089  CMJKDNLE_04103  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
CMJKDNLE_04089  0       17       -5.078216   maltose ABC transporter - ATP binding subunit
CMJKDNLE_04090  1       19       -6.430766   maltose outer membrane porin / phage lambda
CMJKDNLE_04092  -1      25       -7.145887   maltose regulon periplasmic protein
CMJKDNLE_04093  0       16       -3.980542   hypothetical protein
CMJKDNLE_04094  1       16       -3.600918   hypothetical protein
CMJKDNLE_04095  0       14       -2.645381   hypothetical protein
CMJKDNLE_04096  1       13       2.178688    chorismate lyase
CMJKDNLE_04097  1       13       1.865172    4-hydroxybenzoate octaprenyltransferase
CMJKDNLE_04098  0       13       2.029914    glycerol-3-phosphate acyltransferase
CMJKDNLE_04099  2       18       -0.831477   diacylglycerol kinase
CMJKDNLE_04100  0       21       -3.733339   LexA DNA-binding transcriptional repressor
CMJKDNLE_04101  1       20       -2.886734   DinF MATE Transporter
CMJKDNLE_04102  1       23       -5.851541   putative stress response protein
CMJKDNLE_04103  1       20       -3.808515   Zur-Zn2+ DNA-binding transcriptional repressor
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_04089  PF05272       35     6e-04    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 34.7 bits (79), Expect = 6e-04
Identities = 13/35 (37%), Positives = 18/35 (51%)

Query: 32 VVFVGPSGCGKSTLLRMIAGLETITSGDLFIGEKR 66
VV G G GKSTL+ + GL+ + IG +
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGK 633


60  CMJKDNLE_04117  CMJKDNLE_04139  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
CMJKDNLE_04117  7       38       -9.909071   ssDNA-binding protein
CMJKDNLE_04118  7       41       -10.745680  Tyrosine recombinase XerC
CMJKDNLE_04119  8       41       -12.002209  hypothetical protein
CMJKDNLE_04120  9       43       -13.272494  hypothetical protein
CMJKDNLE_04121  8       42       -14.053638  hypothetical protein
CMJKDNLE_04122  8       40       -13.760908  hypothetical protein
CMJKDNLE_04123  2       27       -7.772496   hypothetical protein
CMJKDNLE_04124  0       17       -4.479435   hypothetical protein
CMJKDNLE_04125  -1      14       0.085025    putative inner membrane protein
CMJKDNLE_04126  0       15       -3.614051   putative c-di-GMP-specific phosphodiesterase
CMJKDNLE_04127  -1      14       -0.946609   SoxS DNA-binding transcriptional dual regulator
CMJKDNLE_04128  0       14       -1.027125   SoxR DNA-binding transcriptional dual regulator
CMJKDNLE_04130  0       16       0.345368    putative permease
CMJKDNLE_04131  0       15       -0.250297   YjcE CPA1 transporter
CMJKDNLE_04132  -1      15       -1.046616   hypothetical protein
CMJKDNLE_04133  -2      21       3.163611    acetate / glycolate transporter
CMJKDNLE_04134  -2      19       3.378187    hypothetical protein
CMJKDNLE_04136  -2      17       4.040729    acetyl-CoA synthetase (AMP-forming)
CMJKDNLE_04137  -1      15       3.477924    formate dependent nitrite reductase - NrfA
CMJKDNLE_04138  0       18       3.962853    formate-dependent nitrite reductase - penta-heme
CMJKDNLE_04139  0       19       3.559279    formate-dependent nitrite reductase, 4Fe-4S
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_04117  PERTACTIN     27     0.048    Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 27.0 bits (59), Expect = 0.048
Identities = 15/50 (30%), Positives = 19/50 (38%), Gaps = 4/50 (8%)

Query: 119 GGAPAGGNIGGGQPQGGWGQPQQPQGGNQFSG----GAQSRPQQSAPAAP 164
G APAGG + GG GG + + G + QS AP
Sbjct: 261 GDAPAGGAVPGGAVPGGAVPGGFGPLLDGWYGVDVSDSTVDLAQSIVEAP 310


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_04122  HTHFIS        32     0.021    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 32.1 bits (73), Expect = 0.021
Identities = 17/61 (27%), Positives = 32/61 (52%), Gaps = 3/61 (4%)

Query: 229 VIVSGEGGVGKTAVIKKIYEA-EKQYTPFYVFKASEFKKDSI-NELFGAHGLDDFSNAHQ 286
++++GE G GK V + +++ +++ PF + +D I +ELFG H F+ A
Sbjct: 163 LMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFG-HEKGAFTGAQT 221

Query: 287 D 287

Sbjct: 222 R 222


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_04134  RTXTOXIND     27     0.020    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 26.7 bits (59), Expect = 0.020
Identities = 5/33 (15%), Positives = 13/33 (39%), Gaps = 1/33 (3%)

Query: 17 ELVEKR-QRFATILSIIMLAVYIGFILLIAFAP 48
EL+E R +++ ++ + +L
Sbjct: 47 ELIETPVSRRPRLVAYFIMGFLVIAFILSVLGQ 79


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_04139  VACJLIPOPROT  30     0.006    VacJ lipoprotein signature.
		>VACJLIPOPROT#VacJ lipoprotein signature.

Length = 251

Score = 29.9 bits (67), Expect = 0.006
Identities = 6/21 (28%), Positives = 11/21 (52%)

Query: 179 FGNLDDPNSEISQLLRQKPTY 199
GNL++P ++ L+ P
Sbjct: 75 TGNLEEPAVMVNYFLQGDPYQ 95


61  CMJKDNLE_04160  CMJKDNLE_04172  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
CMJKDNLE_04160  2       29       6.550649    allose-6-phosphate isomerase /
CMJKDNLE_04161  2       34       7.805802    hypothetical protein
CMJKDNLE_04162  0       36       8.378913    5-phospho-alpha-D-ribosyl 1,2-cyclic phosphate
CMJKDNLE_04163  0       38       8.187491    putative acyltransferase with acyl-CoA
CMJKDNLE_04164  1       39       8.908305    ribose 1,5-bisphosphokinase
CMJKDNLE_04165  1       40       8.978845    RPnTP hydrolase
CMJKDNLE_04166  0       37       9.035677    PhnL subunit of methylphosphonate degradation
CMJKDNLE_04167  0       38       9.296694    putative carbon-phosphorous lyase subunit
CMJKDNLE_04168  0       39       9.141265    carbon-phosphorous lyase
CMJKDNLE_04169  3       40       8.348826    PhnI subunit of methylphosphonate degradation
CMJKDNLE_04170  2       39       7.366858    PhnH subunit of methylphosphonate degradation
CMJKDNLE_04171  2       36       6.223714    PhnG subunit of methylphosphonate degradation
CMJKDNLE_04172  2       36       4.621404    PhnF predicted transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_04163  SACTRNSFRASE  33     3e-04    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 32.6 bits (74), Expect = 3e-04
Identities = 20/84 (23%), Positives = 32/84 (38%), Gaps = 5/84 (5%)

Query: 50 HLALLDGEVVGMIGLHLQFHLHHVNWIGEIQELVVMPQARGLNVGSKLLAWAEEEARQAG 109
L L+ +G I + + N I+++ V R VG+ LL A E A++
Sbjct: 68 FLYYLENNCIGRIKIRSNW-----NGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENH 122

Query: 110 AEMTELSTNVKRHDAHRFYLREGY 133
L T A FY + +
Sbjct: 123 FCGLMLETQDINISACHFYAKHHF 146


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_04166  PF05272       29     0.013    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.3 bits (65), Expect = 0.013
Identities = 17/70 (24%), Positives = 25/70 (35%), Gaps = 8/70 (11%)

Query: 36 CVVLHGHSGSGKSTLLRSLYANYLPDEGQIQIKHGDEWVDLVTAPARKVVEI------RK 89
VVL G G GKSTL+ +L + I G + + + E+ R+
Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQIAGIV--AYELSEMTAFRR 655

Query: 90 TTVGWVSQFL 99
V F
Sbjct: 656 ADAEAVKAFF 665


62  CMJKDNLE_04201  CMJKDNLE_04216  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
CMJKDNLE_04201  2       13       -2.957986   dicarboxylate transporter DcuB
CMJKDNLE_04202  1       12       -4.406260   Transcriptional regulatory protein DcuR
CMJKDNLE_04203  -1      13       -3.696211   NtrB
CMJKDNLE_04204  -1      17       -4.401598   hypothetical protein
CMJKDNLE_04205  0       21       -4.081730   putative acyltransferase with acyl-CoA
CMJKDNLE_04206  1       20       -4.944094   hypothetical protein
CMJKDNLE_04207  1       19       -4.116486   lysyl-tRNA synthetase
CMJKDNLE_04208  1       14       -2.768898   dipeptide:H+ symporter YjdL
CMJKDNLE_04209  1       17       -3.084380   lysine decarboxylase 1
CMJKDNLE_04210  2       17       -1.963701   cadaverine:H+ symporter / lysine:cadaverine
CMJKDNLE_04211  2       17       -2.035496   CadC DNA-binding transcriptional activator
CMJKDNLE_04213  1       19       -0.119387   *putative transcriptional regulator
CMJKDNLE_04214  1       15       -0.176356   Thiol:disulfide interchange protein DsbD
CMJKDNLE_04215  1       21       -1.246264   copper binding protein CutA
CMJKDNLE_04216  2       34       -0.555739   dicarboxylate transporter DcuA
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_04202  HTHFIS        70     4e-16    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 70.2 bits (172), Expect = 4e-16
Identities = 31/109 (28%), Positives = 50/109 (45%), Gaps = 4/109 (3%)

Query: 4 VLIIDDDAMVAELNRRYVAQIPGFQCCGTASTLEKAKEIIFNSDTPIDLILLDIYMQKEN 63
+L+ DDDA + + + +++ G+ S I + DL++ D+ M EN
Sbjct: 6 ILVADDDAAIRTVLNQALSRA-GYDVR-ITSNAATLWRWI--AAGDGDLVVTDVVMPDEN 61

Query: 64 GLDLLPVLHNARCKSDVIVISSAADAATIKDSLHYGVVDYLIKPFQASR 112
DLLP + AR V+V+S+ T + G DYL KPF +
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTE 110


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_04203  PF06580       41     7e-06    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 41.0 bits (96), Expect = 7e-06
Identities = 21/99 (21%), Positives = 38/99 (38%), Gaps = 18/99 (18%)

Query: 442 LIENALE-ALGP-EPGGEISVTLHYRHGWLHCEVNDDGPGIAPDKIDHIFDKGVSTKGSE 499
L+EN ++ + GG+I + +G + EV + G +
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKN------------TKES 310

Query: 500 RGVGLALVKQQVENLGG---SIAVESEPGIFTQFFVQIP 535
G GL V+++++ L G I + + G V IP
Sbjct: 311 TGTGLQNVRERLQMLYGTEAQIKLSEKQGKVN-AMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_04205  SACTRNSFRASE  27     0.011    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 26.8 bits (59), Expect = 0.011
Identities = 9/28 (32%), Positives = 16/28 (57%)

Query: 32 LAIIEHTDVDESLKGQGIGKQLVAKVVE 59
A+IE V + + +G+G L+ K +E
Sbjct: 89 YALIEDIAVAKDYRKKGVGTALLHKAIE 116


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_04208  TCRTETA       30     0.023    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 29.8 bits (67), Expect = 0.023
Identities = 36/190 (18%), Positives = 66/190 (34%), Gaps = 14/190 (7%)

Query: 44 NHAISLFSAYA-SLVYVTPILGGWLADRLLGNRTAVIAGALLMTLGHVVLGIDTNSTFSL 102
H L + YA P+LG +DR G R ++ + + ++ + L
Sbjct: 43 AHYGILLALYALMQFACAPVLGAL-SDRF-GRRPVLLVSLAGAAVDYAIMAT-APFLWVL 99

Query: 103 YLALAIIICGYGLFKSNISCLLGELYDEND-HRRDGGFSLLYAAGNIGSIAAPIACGLAA 161
Y+ + G+ + + + D D R F + A G +A P+ GL
Sbjct: 100 YIGRIV----AGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMG 155

Query: 162 QWYGWHVGFALAGGGMFIGLLIFLSGHRHFQSTRSMDKKALTSVKF-ALPVWSWLVVMLC 220
+ H F A + L FL+G + +++ L L + W M
Sbjct: 156 G-FSPHAPFFAAA---ALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTV 211

Query: 221 LAPVFFTLLL 230
+A + +
Sbjct: 212 VAALMAVFFI 221


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_04211  SYCDCHAPRONE  37     7e-05    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 36.8 bits (85), Expect = 7e-05
Identities = 16/97 (16%), Positives = 36/97 (37%), Gaps = 7/97 (7%)

Query: 391 PLDEKQLAALNTEIDNIVTLPELNNLS-----IIYQIKAVSALVKGKTDESYQAINTGID 445
++ A+ + + T+ LN +S +Y + A + GK +++++
Sbjct: 6 TDTQEYQLAMESFLKGGGTIAMLNEISSDTLEQLYSL-AFNQYQSGKYEDAHKVFQALCV 64

Query: 446 LEMSWLNYVL-LGKVYEMKGMNREAADAYLTAFNLRP 481
L+ + L LG + G A +Y +
Sbjct: 65 LDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDI 101


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_04213  HTHTETR       45     5e-08    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 44.6 bits (105), Expect = 5e-08
Identities = 28/188 (14%), Positives = 51/188 (27%), Gaps = 13/188 (6%)

Query: 3 REDVLGEALKLLELQGIANTTLEMVAERVDYPLDELRRFWPDKEAILYDALRYLSQQIDV 62
R+ +L AL+L QG+++T+L +A+ + + DK + + I
Sbjct: 13 RQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGE 72

Query: 63 WRRQLMLDETQTAEQKLLARYQALSECVKNNRYPGCLFIAACTFYPDPGH----PIHQLA 118
+ L + E + F+ + Q
Sbjct: 73 LELEYQAKFPGDPLSVLREILIHVLEST--VTEERRRLLMEIIFHKCEFVGEMAVVQQAQ 130

Query: 119 DQQKSAAYDFTHELLTT-------LEVDDPAMVAKQMELVLEGCLSRMLVNRSQADVDTA 171
+YD + L A M + G + L D+
Sbjct: 131 RNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDLKKE 190

Query: 172 HRLAEDIL 179
R IL
Sbjct: 191 ARDYVAIL 198


63  CMJKDNLE_04239  CMJKDNLE_04267  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
CMJKDNLE_04239  -1      13       3.102447    mechanosensitive channel of miniconductance McsM
CMJKDNLE_04240  -2      12       2.445209    phosphatidylserine decarboxylase, proenzyme
CMJKDNLE_04241  -1      13       2.814467    ribosome small subunit-dependent GTPase A
CMJKDNLE_04242  -1      13       3.410410    oligoribonuclease monomer
CMJKDNLE_04246  -1      13       3.272121    ***epoxyqueuosine reductase
CMJKDNLE_04247  -1      12       3.436401    NAD(P)HX epimerase / NAD(P)HX dehydratase
CMJKDNLE_04248  -1      14       2.571615    protein involved in threonylcarbamoyladenosine
CMJKDNLE_04249  0       12       3.061375    N-acetylmuramoyl-L-alanine amidase 2
CMJKDNLE_04250  1       14       2.716648    MutHLS complex, methyl-directed mismatch repair
CMJKDNLE_04251  2       19       1.784895    tRNA(i6A37) synthase
CMJKDNLE_04252  5       25       1.806089    RNA-binding protein that affects many cellular
CMJKDNLE_04253  5       23       1.662547    GTPase associated with the 50S subunit of the
CMJKDNLE_04254  4       23       2.145168    regulator of FtsH protease
CMJKDNLE_04255  5       23       1.971825    regulator of FtsH protease
CMJKDNLE_04256  2       18       0.835854    hypothetical protein
CMJKDNLE_04257  1       14       -0.687440   adenylosuccinate synthetase
CMJKDNLE_04258  -1      12       -2.003388   NsrR-nitric oxide
CMJKDNLE_04259  -1      14       -2.570042   RNase R
CMJKDNLE_04260  0       23       -5.374394   23S rRNA 2'-O-ribose methyltransferase monomer
CMJKDNLE_04261  2       29       -7.232155   IclR transcriptional repressor
CMJKDNLE_04262  1       29       -7.059539   CP4-6 prophage; probable 2-keto-3-deoxygluconate
CMJKDNLE_04263  -1      28       -5.291038   putative transporter
CMJKDNLE_04264  1       23       -5.221304   D-altronate dehydratase
CMJKDNLE_04265  4       21       -5.948234   hypothetical protein
CMJKDNLE_04266  4       22       -5.016909   hypothetical protein
CMJKDNLE_04267  2       19       -2.393188   putative transcriptional regulator effector
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_04239  GPOSANCHOR    51     2e-08    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 50.8 bits (121), Expect = 2e-08
Identities = 50/312 (16%), Positives = 105/312 (33%), Gaps = 18/312 (5%)

Query: 121 SRQAQQEQERAREIADSLNQLPQQQTDARRQLNEIERRLGTLTGNTPLNQAQNFALQSDS 180
+ ++ QERA + N L + +D ++ LT L+ +
Sbjct: 49 TDTLEKVQERADKFEIENNTLKLKNSDLSFNNKALKDHNDELTE---ELSNAKEKLRKND 105

Query: 181 ARLKALVDEL-ELAQLSANNRQELARLRSELAEKES--QQLDAYLQALRNQLNSQRQLEA 237
L ++ EL A+ + L + + + L+A AL + + +
Sbjct: 106 KSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAAR-KADLEKAL 164

Query: 238 ERALESTELLAENSADLPKDIVAQFKINRELSAALNQQAQRMDLVASQQRQAASQTLQVR 297
E A+ + + L + A EL AL +++ + ++ +
Sbjct: 165 EGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALA 224

Query: 298 QALNTLREQSQWLGSSNLLGEALRAQVARLPEMPKPQQLDTEMAQLRVQRLRYEDLLNKQ 357
L + L + A A++ L + L+ A+L +
Sbjct: 225 ARKADLEKA---LEGAMNFSTADSAKIKTLEA--EKAALEARQAELEKALEGAMNFSTAD 279

Query: 358 PLLRQIHQADGQPLTAE------QNRILEAQLRTQRELLNSLLQGGDTLLLELTKLKVSN 411
+ +A+ L AE Q+++L A ++ R L++ + L E KL+ N
Sbjct: 280 SAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQN 339

Query: 412 GQLEDALKEVNE 423
E + + +
Sbjct: 340 KISEASRQSLRR 351



Score = 42.7 bits (100), Expect = 7e-06
Identities = 48/239 (20%), Positives = 92/239 (38%), Gaps = 23/239 (9%)

Query: 20 ATAPDSKQITQELEQAKAAKPAQPEVVEALQSALNALEERKGSLER-IKQYQQVIDNYPK 78
A A + + LE A A ++ L++ ALE R+ LE+ ++
Sbjct: 222 ALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSA 281

Query: 79 LSATLRAQLNNMRDEPRSVSPGMSTDALNQEILQVSSQLLDKSRQAQQEQERAREIADSL 138
TL A+ + E + + Q + LD SR+A+++ E + +
Sbjct: 282 KIKTLEAEKAALEAEKADL---EHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQ 338

Query: 139 NQLPQQQTDARRQLNEIERRLGTLTGNTPLNQAQNFALQSDSARLKAL--VDELELAQLS 196
N++ ++A RQ + R L L+++ +L+ + E L
Sbjct: 339 NKI----SEASRQ--SLRRDLDASR-------EAKKQLEAEHQKLEEQNKISEASRQSLR 385

Query: 197 AN---NRQELARLRSELAEKESQQLDAYLQALRNQLNSQRQLEAERALESTELLAENSA 252
+ +R+ ++ L E S +L A + + S++ E E+A +L AE A
Sbjct: 386 RDLDASREAKKQVEKALEEANS-KLAALEKLNKELEESKKLTEKEKAELQAKLEAEAKA 443


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_04250  PF03544       31     0.010    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 31.1 bits (70), Expect = 0.010
Identities = 27/147 (18%), Positives = 43/147 (29%), Gaps = 25/147 (17%)

Query: 329 VLQQQLETPLPLDDEPQPAPRAIPENRVAAGRNHFAEPAAREPVAPRYTPAPA------- 381
+ + P +P P P PE + A P+ P P
Sbjct: 53 TMVAPADLEPPQAVQPPPEPVVEPE-PEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKV 111

Query: 382 ---------SGSRPAAPWPNAQPGYQ---KQQGEVYRQLLQTPAPMQKLKAPEPQEPALA 429
SRPA+P+ N P + + + + L +PQ PA A
Sbjct: 112 EQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVASGPRALSRNQPQYPARA 171

Query: 430 ANSQSFGRVLTIVHSDCALLERDGNIS 456
+ G+V + DG +
Sbjct: 172 QALRIEGQVKVKFD-----VTPDGRVD 193


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_04253  SECA          32     0.005    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 32.2 bits (73), Expect = 0.005
Identities = 26/144 (18%), Positives = 54/144 (37%), Gaps = 6/144 (4%)

Query: 282 HVIDAADVRVQENIEAVNTVLEEIDAHEIPTLLVMNKIDMLEDFEPRIDRDEENK-PIRV 340
++D +DV N + IDA+ P L ++ + + R+ D + PI
Sbjct: 665 ELLDVSDVSETINSIREDVFKATIDAYIPPQSL--EEMWDIPGLQERLKNDFDLDLPIAE 722

Query: 341 WLSAQTGAGIPQLFQALTERLSGEVAQHTLRLPPQEGRLRSRFYQLQAIEKEWMEEDGSV 400
WL + L + + + + + + R + LQ ++ W E ++
Sbjct: 723 WLDKEPELHEETLRERILAQSIEVYQRKEEVVGAEMMRHFEKGVMLQTLDSLWKEHLAAM 782

Query: 401 SLQVRMPIVDWRRLCKQEPALIDY 424
+R I R +++P +Y
Sbjct: 783 D-YLRQGIH-LRGYAQKDP-KQEY 803


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_04254  cloacin       32     0.006    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 31.6 bits (71), Expect = 0.006
Identities = 25/81 (30%), Positives = 30/81 (37%), Gaps = 10/81 (12%)

Query: 17 GSSKPGGNSEGNGNKGGRDQGPPDLDDIFRKLSKKLGGLGGGKGTGSGGGSSSQGP---- 72
S G +SE N GG G G GGG GTG G S+ P
Sbjct: 33 ASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTG-GNLSAVAAPVAFG 91

Query: 73 -----RPQLGGRVVTIAAAAI 88
P GG V+I+A A+
Sbjct: 92 FPALSTPGAGGLAVSISAGAL 112


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_04259  RTXTOXIND     31     0.028    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 30.6 bits (69), Expect = 0.028
Identities = 12/55 (21%), Positives = 24/55 (43%), Gaps = 1/55 (1%)

Query: 165 VVPDDSRLSFDILIPPDQIMGARMGFVVVVELTQRPTRRTKAV-GKIVEVLGDNM 218
+VP+D L L+ I +G ++++ P R + GK+ + D +
Sbjct: 359 IVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAI 413


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_04263  TCRTETB       35     5e-04    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 34.9 bits (80), Expect = 5e-04
Identities = 30/150 (20%), Positives = 64/150 (42%), Gaps = 5/150 (3%)

Query: 29 IMYFVAFIDRVNVGFAKDAMKLDIGLSESAFALGAGIFFAAYALFGIPANLILNKIGAQK 88
I+ F + ++ + + + + D ++ F +++ + +++G ++
Sbjct: 21 ILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKR 80

Query: 89 WLSITTAIWGLLSAMMGFVTGETQFIIL---RFLLGLGEAGFYPGILLLASIYFPNKVRG 145
L I S ++GFV G + F +L RF+ G G A F ++++ + Y P + RG
Sbjct: 81 LLLFGIIINCFGS-VIGFV-GHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRG 138

Query: 146 SVVGIFVLGVPLALTLGSPISGALLELHGW 175
G+ V + +G I G + W
Sbjct: 139 KAFGLIGSIVAMGEGVGPAIGGMIAHYIHW 168


64  CMJKDNLE_04334  CMJKDNLE_04339  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
CMJKDNLE_04334  1       26       -5.821339   aspartate carbamoyltransferase, PyrB subunit
CMJKDNLE_04335  3       37       -8.847013   putative mRNA endoribonuclease
CMJKDNLE_04336  3       28       -7.217187   c-di-GMP binding protein involved in biofilm
CMJKDNLE_04337  2       27       -8.406114   putative transcriptional regulator
CMJKDNLE_04338  2       25       -8.281497   toxin-antitoxin biofilm protein
CMJKDNLE_04339  1       27       -9.178157   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_04336  DHBDHDRGNASE  88     4e-23    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 88.2 bits (218), Expect = 4e-23
Identities = 67/250 (26%), Positives = 113/250 (45%), Gaps = 24/250 (9%)

Query: 6 GKTVLILGGSRGIGAAIVRHFVTDGANVRFTYAGSKDAAERLAQETGATAVFT-----DS 60
GK I G ++GIG A+ R + GA++ + + E++ A A D
Sbjct: 8 GKIAFITGAAQGIGEAVARTLASQGAHI-AAVDYNPEKLEKVVSSLKAEARHAEAFPADV 66

Query: 61 ADRDAVIDVV----RKSGALDILVVNAGIGVFGDALELNADDIDRLFKINIHAPYHASVE 116
D A+ ++ R+ G +DILV AG+ G L+ ++ + F +N ++AS
Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRS 126

Query: 117 AARQMP--EGGRILIIGSVNGDRMPVAGMAAYAASKSALQGMARGLARDFGPRGITINVV 174
++ M G I+ +GS N +P MAAYA+SK+A + L + I N+V
Sbjct: 127 VSKYMMDRRSGSIVTVGS-NPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185

Query: 175 QPGPIDTDA--------NPANGPMRDMLHGF---MAIKRHGQPEEVAGMVAWLAGPEASF 223
PG +TD N A ++ L F + +K+ +P ++A V +L +A
Sbjct: 186 SPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGH 245

Query: 224 VTGAMHTIDG 233
+T +DG
Sbjct: 246 ITMHNLCVDG 255


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_04337  HTHTETR       51     3e-10    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 51.2 bits (122), Expect = 3e-10
Identities = 20/100 (20%), Positives = 38/100 (38%), Gaps = 6/100 (6%)

Query: 5 KQSRVPGRPRRFAPEQAVSAAKVLFHQKGFDAVSVAEVTDYLGINPPSLYAAFGSKAGLF 64
++++ + R + + A LF Q+G + S+ E+ G+ ++Y F K+ LF
Sbjct: 3 RKTKQEAQETR---QHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLF 59

Query: 65 SRVLNEYVGTEAIPLADILLDDRPVGECLVEVLKEAARRY 104
S + L P + VL+E
Sbjct: 60 SEIWELSESN-IGELELEYQAKFP--GDPLSVLREILIHV 96


65  CMJKDNLE_04351  CMJKDNLE_04387  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
CMJKDNLE_04351  -1      19       -3.158009   LptABCFG ABC transporter
CMJKDNLE_04352  0       28       -4.732083   putative ATPase
CMJKDNLE_04353  2       33       -6.084775   putative alcohol dehydrogenase, Zn-dependent and
CMJKDNLE_04355  2       41       -8.055541   *KpLE2 phage-like element; predicted integrase
CMJKDNLE_04356  2       34       -6.214473   hypothetical protein
CMJKDNLE_04357  2       28       -0.258535   hypothetical protein
CMJKDNLE_04358  1       21       4.475418    hypothetical protein
CMJKDNLE_04359  2       21       5.134833    hypothetical protein
CMJKDNLE_04360  1       21       4.844610    hypothetical protein
CMJKDNLE_04361  1       21       4.582477    hypothetical protein
CMJKDNLE_04362  1       21       0.438891    hypothetical protein
CMJKDNLE_04363  2       25       -3.138772   DNA primase TraC
CMJKDNLE_04364  6       38       -9.612569   hypothetical protein
CMJKDNLE_04365  8       44       -10.714325  hypothetical protein
CMJKDNLE_04366  8       43       -9.805389   DNA-binding transcriptional regulator, prophage
CMJKDNLE_04368  6       34       -6.239299   hypothetical protein
CMJKDNLE_04369  4       19       -0.300009   hypothetical protein
CMJKDNLE_04370  4       19       5.370566    KpLE2 phage-like element; predicted integrase
CMJKDNLE_04372  1       23       6.155486    KpLE2 phage-like element; predicted protein
CMJKDNLE_04373  2       23       6.515307    ferric dicitrate ABC transporter - ATP binding
CMJKDNLE_04374  2       25       6.706472    ferric dicitrate ABC transporter - membrane
CMJKDNLE_04375  2       26       6.016990    ferric dicitrate ABC transporter - membrane
CMJKDNLE_04376  1       24       4.149355    ferric dicitrate ABC transporter - periplasmic
CMJKDNLE_04377  -1      24       3.450486    ferric citrate outer membrane porin FecA
CMJKDNLE_04378  1       20       2.924454    regulator for fec operon, periplasmic
CMJKDNLE_04379  1       22       2.304720    RNA polymerase, sigma 19 factor
CMJKDNLE_04380  2       22       1.978043    hypothetical protein
CMJKDNLE_04381  2       22       1.857084    hypothetical protein
CMJKDNLE_04382  3       23       3.073383    hypothetical protein
CMJKDNLE_04383  2       24       1.549729    hypothetical protein
CMJKDNLE_04384  1       27       -4.170417   hypothetical protein
CMJKDNLE_04385  1       32       -6.281632   IS2 element protein InsA
CMJKDNLE_04386  2       34       -6.763693   hypothetical protein
CMJKDNLE_04387  2       19       -4.084103   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_04376  FERRIBNDNGPP  65     5e-14    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 64.6 bits (157), Expect = 5e-14
Identities = 44/240 (18%), Positives = 91/240 (37%), Gaps = 13/240 (5%)

Query: 36 TPQRIVVLELSFADALAAVDVSPIGIADDNDAKRILPEVRAHLKPWQSVGTRAQPSLEAI 95
P RIV LE + L A+ + P G+AD + + + E VG R +P+LE +
Sbjct: 34 DPNRIVALEWLPVELLLALGIVPYGVADTINYRLWVSE-PPLPDSVIDVGLRTEPNLELL 92

Query: 96 AALKPDLIIADSSRHAGVYIALQQIAPVLLLKSR--NETYAENLQSAAIIGEMVGKKREM 153
+KP ++ S+ + L +IAP + A +S + +++ +
Sbjct: 93 TEMKPSFMVW-SAGYGPSPEMLARIAPGRGFNFSDGKQPLAMARKSLTEMADLLNLQSAA 151

Query: 154 QARLEQHKERMAQWASQLPKGTR---VAFGTSREQQFNLHTQETWTGSVLASLGLNVPAA 210
+ L Q+++ + + K + + + + +L G +P A
Sbjct: 152 ETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDPRHMLVFGPNSLFQEILDEYG--IPNA 209

Query: 211 MAGSS----MPSIGLEQLLAVNPAWLLVAHYREESIVKRWQQDPLWQMLTAAQKQQVASV 266
G + ++ +++L A +L + + PLWQ + + + V
Sbjct: 210 WQGETNFWGSTAVSIDRLAAYKDVDVLCFDHDNSKDMDALMATPLWQAMPFVRAGRFQRV 269


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_04377  ECOLNEIPORIN  33     0.004    E.coli/Neisseria porin superfamily signature.
		>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature.

Length = 331

Score = 32.9 bits (75), Expect = 0.004
Identities = 19/89 (21%), Positives = 29/89 (32%), Gaps = 9/89 (10%)

Query: 546 GSFGTVQYSQIGKAVQSGNVEPEKARTWELGTRYDDGALTAEMGLFLINFNNQYDSNQTN 605
G F + NV EK + L + YD+ AL A + Q D+
Sbjct: 187 GFFVQYGGAYKRHHQVQENVNIEKYQIHRLVSGYDNDALYASV------AVQQQDAKLVE 240

Query: 606 DTVTARGKTRHTGLETQARYDLGTLTPTL 634
+ T + Y G +TP +
Sbjct: 241 E---NYSHNSQTEVAATLAYRFGNVTPRV 266


66  CMJKDNLE_04450  CMJKDNLE_04455  Y  N  N  Genomic Island
LocusTag        DNBias  CDNBias  %GCBias     Product
CMJKDNLE_04450  1       25       3.059304    thymidine phosphorylase / uracil phosphorylase
CMJKDNLE_04451  -1      25       3.368276    phosphopentomutase
CMJKDNLE_04452  -2      21       4.271309    purine nucleoside phosphorylase deoD-type
CMJKDNLE_04453  -2      20       3.433415    hypothetical protein
CMJKDNLE_04454  -2      20       3.637253    lipoyl-protein ligase A
CMJKDNLE_04455  -1      19       3.033451    membrane protein
67  CMJKDNLE_00381  CMJKDNLE_00387  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
CMJKDNLE_00381  1       14       1.458771    manno(fructo)kinase
CMJKDNLE_00383  1       11       1.555640    putative arabinose efflux transporter
CMJKDNLE_00384  0       11       1.864154    ATP-dependent dsDNA exonuclease
CMJKDNLE_00385  0       11       2.054870    ATP-dependent dsDNA exonuclease
CMJKDNLE_00386  -1      12       2.099152    putative DNA-binding response regulator in
CMJKDNLE_00387  0       12       2.075254    EnvZ sensory histidine kinase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00381  ACETATEKNASE  28     0.037    Acetate kinase family signature.
		>ACETATEKNASE#Acetate kinase family signature.

Length = 400

Score = 28.2 bits (63), Expect = 0.037
Identities = 17/69 (24%), Positives = 28/69 (40%), Gaps = 10/69 (14%)

Query: 187 FISGTGFAMDYRRLSGHALKGSEIIRLVEESDPVAELALRRYELRLAKSLAHVVNILDP- 245
+G + D+R L A + D A+LAL + R+ K++ +
Sbjct: 273 VYGISGISSDFRDLEDAAF---------KNGDKRAQLALNVFAYRVKKTIGSYAAAMGGV 323

Query: 246 DVIVLGGGM 254
DVIV G+
Sbjct: 324 DVIVFTAGI 332


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00383  TCRTETA       52     2e-09    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 52.1 bits (125), Expect = 2e-09
Identities = 74/356 (20%), Positives = 126/356 (35%), Gaps = 35/356 (9%)

Query: 33 ILSLALGTFGLGMAEFGIMGVLTELAHNVGISIPAAGH---MISYYALGVVVGAPIIALF 89
+ ++AL G+G+ IM VL L ++ S H +++ YAL AP++
Sbjct: 11 LSTVALDAVGIGL----IMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGAL 66

Query: 90 SSRYSLKHILLFLVALCVIGNAMFTLSSSYLMLAIGRLVSGFPHGAFFGVGAIVLSKIIK 149
S R+ + +LL +A + A+ + +L IGR+V+G GA + I
Sbjct: 67 SDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIAD--IT 124

Query: 150 PGKVTAAVAGMVSGMTVANLLGIPLGTYLSQEFSWRYTFLLIAVFNIAVMASVYFWVPDI 209
G A G +S ++ P+ L FS F A N + F +P+
Sbjct: 125 DGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPES 184

Query: 210 RDEAKGNLREQ----------FHFLRSPAPWLI--FAATMFGNAGVFAWFSYVKPYMMFI 257
+ LR + + A + F + G W +
Sbjct: 185 HKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFG------E 238

Query: 258 SGFSETAMTFIMMLVGLGM---VLGNMLSGRISGRYSPLRIAAVTDFIIVLALLMLFFCG 314
F A T + L G+ + M++G ++ R R + ++L F
Sbjct: 239 DRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFAT 298

Query: 315 GMKTTSLIFAFICCAGLFALSAPLQILLLQNAKGGELLGAAGGQIAF--NLGSAVG 368
I + G+ LQ +L + E G G +A +L S VG
Sbjct: 299 RGWMAFPIMVLLASGGIG--MPALQAMLSRQV-DEERQGQLQGSLAALTSLTSIVG 351


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00384  RTXTOXIND     41     3e-05    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 40.6 bits (95), Expect = 3e-05
Identities = 27/204 (13%), Positives = 61/204 (29%), Gaps = 18/204 (8%)

Query: 487 EARIKTLEAQRAQLQAGQPCPLCGSTSHPAVEAYQALEPGVNQSRLLALENEVKKLGEEG 546
EA ++ Q + Q ++E + E + +E + L
Sbjct: 133 EADTLKTQSSLLQARLEQ---TRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLT- 188

Query: 547 ATLRGQLDAITKQLQRDENETQSLRQDEQALTQQWQAVTASLNITLQPLDDIQPWLDAQD 606
+ ++ Q Q + E R + + + + LDD L Q
Sbjct: 189 SLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQA 248

Query: 607 -------EHERQL-RLLSQRHELQGQIAAHNQQIIQYQQQIEQRQQLLLTTLTGYALTLP 658
E E + +++ + Q+ +I+ +++ + QL L
Sbjct: 249 IAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLF------KNEILD 302

Query: 659 QEDEEESWLATRQQEAQSWQQRQN 682
+ + + E ++RQ
Sbjct: 303 KLRQTTDNIGLLTLELAKNEERQQ 326



Score = 35.6 bits (82), Expect = 0.001
Identities = 35/199 (17%), Positives = 72/199 (36%), Gaps = 13/199 (6%)

Query: 671 QQEAQSWQQRQNELTALQNRIQQLTPILETLPQSDELPHCEETVVLENWRQVHEQCLALH 730
+ + Q + Q R Q L+ +E + E + +V +
Sbjct: 133 EADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIK 192

Query: 731 SQQQTLQQQDVLAAQSLQKAQAQFDTAL--------QASVFDDQQAFLAALMDEQTLTQL 782
Q T Q Q +L K +A+ T L + V + ++L+ +Q + +
Sbjct: 193 EQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKH 252

Query: 783 EQLKQNLENQRRQAQTLVTQTAETLAQHQQHRPDDGLALTVTVEQIQQELAQTHQKLREN 842
L+Q EN+ +A + L Q + + + + E+ KLR+
Sbjct: 253 AVLEQ--ENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEIL---DKLRQT 307

Query: 843 TTSQGEIRQQLKQDADNRQ 861
T + G + +L ++ + +Q
Sbjct: 308 TDNIGLLTLELAKNEERQQ 326



Score = 33.6 bits (77), Expect = 0.004
Identities = 25/212 (11%), Positives = 65/212 (30%), Gaps = 19/212 (8%)

Query: 375 QTSDREHLRQWQQQLTHAEQKLNALAAITLTLTADEV------ATALAQHAEQRPLRQHL 428
+ Q L A + ++ ++ +++ Q+ + + +
Sbjct: 129 ALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLT 188

Query: 429 VALHGQIVPQQKRLAQLQVAIQNVTQEQTQRNAALNEMRQRYKEKTQQLADVKTICEQEA 488
+ Q Q + Q ++ + E+ A +N + + +L D ++ ++A
Sbjct: 189 SLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQA 248

Query: 489 --RIKTLEAQRAQLQAGQPCPLCGSTSHPAVEAYQALEPGVNQSRLLALENEVKKLGEEG 546
+ LE + ++A + Y++ + L A E
Sbjct: 249 IAKHAVLEQENKYVEAVN-----------ELRVYKSQLEQIESEILSAKEEYQLVTQLFK 297

Query: 547 ATLRGQLDAITKQLQRDENETQSLRQDEQALT 578
+ +L T + E + +QA
Sbjct: 298 NEILDKLRQTTDNIGLLTLELAKNEERQQASV 329


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00385  FRAGILYSIN    30     0.022    Fragilysin metallopeptidase (M10C) enterotoxin signat...
		>FRAGILYSIN#Fragilysin metallopeptidase (M10C) enterotoxin signature.

Length = 405

Score = 29.7 bits (66), Expect = 0.022
Identities = 13/70 (18%), Positives = 23/70 (32%), Gaps = 4/70 (5%)

Query: 149 KQQHLLAAITDYYQQHYADACKLRGDQPLPIIATGHLTTVGASKSDAVRDIYIGTLDAFP 208
K+ ++ I ++Y + + + I T D + + I A
Sbjct: 135 KEAQMMNEIAEFYAAPFKKTRAINEKEAFECI-YDSRTRSA--GKD-IVSVKINIDKAKK 190

Query: 209 AQNFPPADYI 218
N P DYI
Sbjct: 191 ILNLPECDYI 200


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00386  HTHFIS        95     1e-24    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 94.9 bits (236), Expect = 1e-24
Identities = 33/149 (22%), Positives = 62/149 (41%), Gaps = 9/149 (6%)

Query: 4 RILVVEDEAPIREMVCFVLEQNGFQPVEAEDYDSAVNQLNEPWPDLILLDWMLPGGSGIQ 63
ILV +D+A IR ++ L + G+ + + + DL++ D ++P +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 64 FIKHLKRESMTRDIPVVMLTARGEEEDRVRGLETGADDYITKPFSPKELVARIKAVMRRI 123
+ +K+ D+PV++++A+ ++ E GA DY+ KPF EL+ I +
Sbjct: 65 LLPRIKKARP--DLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA-- 120

Query: 124 SPMAVEEVIEMQGLSLDPTSHRVMAGEEP 152
E L D + G
Sbjct: 121 -----EPKRRPSKLEDDSQDGMPLVGRSA 144


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00387  PF06580       34     0.001    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 34.1 bits (78), Expect = 0.001
Identities = 19/105 (18%), Positives = 33/105 (31%), Gaps = 26/105 (24%)

Query: 325 LVYNAVNH----TPEGTHITVRWQRVPHGAEFSVEDNGPGIAPEHIPRLTERFYRVDKAR 380
LV N + H P+G I ++ + VE+ G
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKN---------------- 306

Query: 381 SRQTGGSGLGLAIVKHAVNH---HESRLNIESTVGKGTRFSFVIP 422
+G GL V+ + E+++ + GK +IP
Sbjct: 307 --TKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAM-VLIP 348


68  CMJKDNLE_00420  CMJKDNLE_00428  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
CMJKDNLE_00420  1       21       -0.134130   muropeptide:H+ symporter
CMJKDNLE_00421  3       27       -0.368794   putative lipoprotein
CMJKDNLE_00422  4       27       -0.454730   BolA DNA-binding transcriptional dual regulator
CMJKDNLE_00423  3       27       -0.119474   trigger factor; a molecular chaperone involved
CMJKDNLE_00424  1       21       0.162879    ClpAXP
CMJKDNLE_00425  1       21       -0.109519   ClpAXP
CMJKDNLE_00427  0       19       -0.077054   DNA-binding, ATP-dependent protease La
CMJKDNLE_00428  -1      12       0.073548    transcriptional dual regulator HU-beta, NS1
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00420  TCRTETA       39     3e-05    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 39.0 bits (91), Expect = 3e-05
Identities = 71/347 (20%), Positives = 135/347 (38%), Gaps = 20/347 (5%)

Query: 62 KFLWSPLMDRYTPPFFGRRRGWLLATQILLLVAIAAMGFLEPGTQLRWMAALAVVIAFCS 121
+F +P++ + F RR LL + V A M W+ + ++A +
Sbjct: 56 QFACAPVLGALSDRF--GRRPVLLVSLAGAAVDYAIMAT----APFLWVLYIGRIVAGIT 109

Query: 122 ASQDIVFDAWKTDVLPAEERGAGAAISVLGYRLGMLVSGGLALWLADKWLGWQGMYWLMA 181
+ V A+ D+ +ER + GM+ L + ++ A
Sbjct: 110 GATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGG--FSPHAPFFAAA 167

Query: 182 AL-LIPCIIATLLAPEP--TDTIPVPKTLEQAVVAPLRDFFGRNNAWLILLLIVLYKLGD 238
AL + + L PE + P+ + + + A L+ + ++ +G
Sbjct: 168 ALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQ 227

Query: 239 AFAMSLTTTFLIRGVGFDAGEVGVVNKTLGLLATIVGALYGGILMQRLSLFRALLIFGIL 298
A +L F +DA +G+ G+L ++ A+ G + RL RAL+ G++
Sbjct: 228 VPA-ALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALM-LGMI 285

Query: 299 QGASNAGYWLLSITDKHLYSMGAAVFFENLCGGMGTSAFVALLMTLCNKSFSATQFALLS 358
A GY LL+ + + V GG+G A A+L ++ L+
Sbjct: 286 --ADGTGYILLAFATRGWMAFPIMVLL--ASGGIGMPALQAMLSRQVDEERQGQLQGSLA 341

Query: 359 ALSAVGRVYVGPVAGWFVEAHGWSTF--YLFSVAAAVPGLILLLVCR 403
AL+++ + VGP+ + A +T+ + + AA+ L L + R
Sbjct: 342 ALTSLTSI-VGPLLFTAIYAASITTWNGWAWIAGAALYLLCLPALRR 387


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00421  PF06291       27     0.027    Lambda prophage Bor protein
		>PF06291#Lambda prophage Bor protein

Length = 102

Score = 26.5 bits (58), Expect = 0.027
Identities = 11/34 (32%), Positives = 18/34 (52%)

Query: 3 KKILFPLVALFMLAGCAKPPTTIEVSPTITLPQQ 36
KK+LF ++ GCA+ T+ PT P++
Sbjct: 7 KKMLFSAALAMLITGCAQQTFTVGNKPTAVTPKE 40


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00425  HTHFIS        29     0.043    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 29.0 bits (65), Expect = 0.043
Identities = 16/73 (21%), Positives = 29/73 (39%), Gaps = 13/73 (17%)

Query: 60 ERSALPTPHEIRNHLDDYVIGQEQAKKVLAVAVYNHYKRLRNGDTSNGVELGKSNILLIG 119
E P+ E + ++G+ A + +Y RL D +++ G
Sbjct: 121 EPKRRPSKLEDDSQDGMPLVGRSAAMQ----EIYRVLARLMQTD---------LTLMITG 167

Query: 120 PTGSGKTLLAETL 132
+G+GK L+A L
Sbjct: 168 ESGTGKELVARAL 180


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00427  GPOSANCHOR    34     0.002    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 34.3 bits (78), Expect = 0.002
Identities = 34/133 (25%), Positives = 68/133 (51%), Gaps = 15/133 (11%)

Query: 191 ERLEYLMAMMESEIDLLQVEKRIRNRVKKQMEKSQREYYLNEQMKAIQKELGEMDDAPD- 249
LE A +E + +L R +++ ++ S+ +Q++A ++L E + +
Sbjct: 291 AALEAEKADLEHQSQVLNAN---RQSLRRDLDASREAK---KQLEAEHQKLEEQNKISEA 344

Query: 250 ENEALKRKIDAAKMPKEAKEKAEAELQKLKMMSPMS-AEATVVRGYIDWMVQVPWNARSK 308
++L+R +DA++ EAK++ EAE QKL+ + +S A +R +D + A+ +
Sbjct: 345 SRQSLRRDLDASR---EAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASRE----AKKQ 397

Query: 309 VKKDLRQAQEILD 321
V+K L +A L
Sbjct: 398 VEKALEEANSKLA 410


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00428  DNABINDINGHU  117    3e-38    Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 117 bits (294), Expect = 3e-38
Identities = 49/88 (55%), Positives = 67/88 (76%)

Query: 2 NKSQLIDKIAAGADISKAAAGRALDAIIASVTESLKEGDDVALVGFGTFAVKERAARTGR 61
NK LI K+A +++K + A+DA+ ++V+ L +G+ V L+GFG F V+ERAAR GR
Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62

Query: 62 NPQTGKEITIAAAKVPSFRAGKALKDAV 89
NPQTG+EI I A+KVP+F+AGKALKDAV
Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAV 90


69  CMJKDNLE_00451  CMJKDNLE_00459  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
CMJKDNLE_00451  2       15       0.747252    AcrB RND-type permease
CMJKDNLE_00452  2       11       0.061224    AcrA membrane fusion protein
CMJKDNLE_00453  2       13       -0.103430   AcrR DNA-binding transcriptional repressor
CMJKDNLE_00454  3       14       2.245481    potassium dependent mechanosensitive channel
CMJKDNLE_00455  4       15       4.064154    small protein involved in the cell envelope
CMJKDNLE_00456  3       16       4.606577    primosomal replication protein N''
CMJKDNLE_00457  3       22       3.155718    hypothetical protein
CMJKDNLE_00458  4       27       2.986843    adenine phosphoribosyltransferase
CMJKDNLE_00459  2       21       2.882374    DNA polymerase III, gamma subunit
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00451  ACRIFLAVINRP  1369   0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 1369 bits (3546), Expect = 0.0
Identities = 802/1033 (77%), Positives = 915/1033 (88%), Gaps = 1/1033 (0%)

Query: 1 MPNFFIDRPIFAWVIAIIIMLAGGLAILKLPVAQYPTIAPPAVTISASYPGADAKTVQDT 60
M NFFI RPIFAWV+AII+M+AG LAIL+LPVAQYPTIAPPAV++SA+YPGADA+TVQDT
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 61 VTQVIEQNMNGIDNLMYMSSNSDSTGTVQITLTFESGTDADIAQVQVQNKLQLAMPLLPQ 120
VTQVIEQNMNGIDNLMYMSS SDS G+V ITLTF+SGTD DIAQVQVQNKLQLA PLLPQ
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 121 EVQQQGVSVEKSSSSFLMVVGVINTDGTMTQEDISDYVAANMKDAISRTSGVGDVQLFGS 180
EVQQQG+SVEKSSSS+LMV G ++ + TQ+DISDYVA+N+KD +SR +GVGDVQLFG+
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 181 QYAMRIWMNPNELNKFQLTPVDVITAIKAQNAQVAAGQLGGTPPVKGQQLNASIIAQTRL 240
QYAMRIW++ + LNK++LTPVDVI +K QN Q+AAGQLGGTP + GQQLNASIIAQTR
Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240

Query: 241 TSTEEFGKILLKVNQDGSRVLLRDVAKIELGGENYDIIAEFNGQPASGLGIKLATGANAL 300
+ EEFGK+ L+VN DGS V L+DVA++ELGGENY++IA NG+PA+GLGIKLATGANAL
Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300

Query: 301 DTAAAIRAELAKMEPFFPSGLKIVYPYDTTPFVKISIHEVVKTLVEAIILVFLVMYLFLQ 360
DTA AI+A+LA+++PFFP G+K++YPYDTTPFV++SIHEVVKTL EAI+LVFLVMYLFLQ
Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360

Query: 361 NFRATLIPTIAVPVVLLGTFAVLAAFGFSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420
N RATLIPTIAVPVVLLGTFA+LAAFG+SINTLTMFGMVLAIGLLVDDAIVVVENVERVM
Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420

Query: 421 AEEGLPPKEATRKSMGQIQGALVGIAMVLSAVFVPMAFFGGSTGAIYRQFSITIVSAMAL 480
E+ LPPKEAT KSM QIQGALVGIAMVLSAVF+PMAFFGGSTGAIYRQFSITIVSAMAL
Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480

Query: 481 SVLVALILTPALCATMLKPIAKGDHGEGKKGFFGWFNRMFEKSTHHYTDSVGGILRSTGR 540
SVLVALILTPALCAT+LKP++ H E K GFFGWFN F+ S +HYT+SVG IL STGR
Sbjct: 481 SVLVALILTPALCATLLKPVSAE-HHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539

Query: 541 YLVLYLIIVVGMAYLFVRLPSSFLPDEDQGVFMTMVQLPAGATQERTQKVLNEVTHYYLT 600
YL++Y +IV GM LF+RLPSSFLP+EDQGVF+TM+QLPAGATQERTQKVL++VT YYL
Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599

Query: 601 KEKNNVESVFAVNGFGFAGRGQNTGIAFVSLKDWADRPGEENKVEAITMRATRAFSQIKD 660
EK NVESVF VNGF F+G+ QN G+AFVSLK W +R G+EN EA+ RA +I+D
Sbjct: 600 NEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRD 659

Query: 661 AMVFAFNLPAIVELGTATGFDFELIDQAGLGHEKLTQARNQLLAEAAKHPDMLTSVRPNG 720
V FN+PAIVELGTATGFDFELIDQAGLGH+ LTQARNQLL AA+HP L SVRPNG
Sbjct: 660 GFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNG 719

Query: 721 LEDTPQFKIDIDQEKAQALGVSINDINTTLGAAWGGSYVNDFIDRGRVKKVYVMSEAKYR 780
LEDT QFK+++DQEKAQALGVS++DIN T+ A GG+YVNDFIDRGRVKK+YV ++AK+R
Sbjct: 720 LEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFR 779

Query: 781 MLPDDIGDWYVRAADGQMVPFSAFSSSRWEYGSPRLERYNGLPSMEILGQAAPGKSTGEA 840
MLP+D+ YVR+A+G+MVPFSAF++S W YGSPRLERYNGLPSMEI G+AAPG S+G+A
Sbjct: 780 MLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDA 839

Query: 841 MELMEQLASKLPTGVGYDWTGMSYQERLSGNQAPSLYAISLIVVFLCLAALYESWSIPFS 900
M LME LASKLP G+GYDWTGMSYQERLSGNQAP+L AIS +VVFLCLAALYESWSIP S
Sbjct: 840 MALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVS 899

Query: 901 VMLVVPLGVIGALLAATFRGLTNDVYFQVGLLTTIGLSAKNAILIVEFAKDLMDKEGKGL 960
VMLVVPLG++G LLAAT NDVYF VGLLTTIGLSAKNAILIVEFAKDLM+KEGKG+
Sbjct: 900 VMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGV 959

Query: 961 IEATLDAVRMRLRPILMTSLAFILGVMPLVISTGAGSGAQNAVGTGVMGGMVTATVLAIF 1020
+EATL AVRMRLRPILMTSLAFILGV+PL IS GAGSGAQNAVG GVMGGMV+AT+LAIF
Sbjct: 960 VEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIF 1019

Query: 1021 FVPVFFVVVRRRF 1033
FVPVFFVV+RR F
Sbjct: 1020 FVPVFFVVIRRCF 1032


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00452  RTXTOXIND     44     6e-07    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 44.0 bits (104), Expect = 6e-07
Identities = 33/212 (15%), Positives = 71/212 (33%), Gaps = 23/212 (10%)

Query: 100 TYQATYDSAKGDLAKAQAAANIAQLTVNRYQKLLGTQYISKQEYDQALADAQQANAAVTA 159
+ Y A +L + + Q+ Q +++ ++ L +Q +
Sbjct: 256 EQENKYVEAVNELR--VYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGL 313

Query: 160 AKAAVETARINLAYTKVTSPISGRIGKSNV-TEGALVQNGQATALATVQQLDPIYVDVTQ 218
+ + + +P+S ++ + V TEG +V + T + V + D + V
Sbjct: 314 LTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAE-TLMVIVPEDDTLEVTALV 372

Query: 219 SSNDFLRLKQELA----------NGTLKQENGKAKVSLITSDGIKFPQDGTLEFSDVTVD 268
+ D + KV I D I+ + G + ++++
Sbjct: 373 QNKDIGFINVGQNAIIKVEAFPYTRYGYLV---GKVKNINLDAIEDQRLGLVFNVIISIE 429

Query: 269 QTTGSITLRAIFPNPDHTLLPGMFVRARLEEG 300
+ S + I L GM V A ++ G
Sbjct: 430 ENCLSTGNKNIP------LSSGMAVTAEIKTG 455



Score = 34.4 bits (79), Expect = 8e-04
Identities = 24/125 (19%), Positives = 43/125 (34%), Gaps = 13/125 (10%)

Query: 49 PLQITTELPGR-TSAYRIAEVRPQVSGIILKRNFKEGSDIEAGVSLYQIDPATYQATYDS 107
++I G+ T + R E++P + I+ + KEG + G L ++ +A
Sbjct: 79 QVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEA---- 134

Query: 108 AKGDLAKAQAAANIAQLTVNRYQKLLGTQYISKQEYDQALADAQQANAAVTAAKAAVETA 167
D K Q++ A+L RYQ L E ++
Sbjct: 135 ---DTLKTQSSLLQARLEQTRYQILS-----RSIELNKLPELKLPDEPYFQNVSEEEVLR 186

Query: 168 RINLA 172
+L
Sbjct: 187 LTSLI 191


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00453  HTHTETR       201    2e-67    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 201 bits (511), Expect = 2e-67
Identities = 189/192 (98%), Positives = 189/192 (98%)

Query: 36 FLTAGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELEYQAKFPG 95
F GVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELEYQAKFPG
Sbjct: 24 FSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELEYQAKFPG 83

Query: 96 DPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRNLCLESYDRIEQ 155
DPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRNLCLESYDRIEQ
Sbjct: 84 DPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRNLCLESYDRIEQ 143

Query: 156 TLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDLKKEARDYVAILLEMYL 215
TLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDLKKEARDYVAILLEMYL
Sbjct: 144 TLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDLKKEARDYVAILLEMYL 203

Query: 216 LCPTLRNPATNE 227
LCPTLRNPATNE
Sbjct: 204 LCPTLRNPATNE 215


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00454  RTXTOXIND     32     0.017    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 31.7 bits (72), Expect = 0.017
Identities = 19/125 (15%), Positives = 40/125 (32%), Gaps = 6/125 (4%)

Query: 28 QNTAFARASSNGDLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDKIDRIKEE 87
N RA L + + L L+ + A L++ ++ E
Sbjct: 207 LNLDKKRAERLTVLARINRYENLSRVEKSR--LDDFSSLLHKQAIAKHAVLEQENKYVEA 264

Query: 88 TVQLRQKVAEAPEKMRQATAALTALSDVDND--EETRKIL--STLSLRQLETRVAQALDD 143
+LR ++ + + +A V E L +T ++ L +A+ +
Sbjct: 265 VNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEER 324

Query: 144 LQNAQ 148
Q +
Sbjct: 325 QQASV 329


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00459  IGASERPTASE   40     4e-05    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 39.7 bits (92), Expect = 4e-05
Identities = 40/251 (15%), Positives = 77/251 (30%), Gaps = 31/251 (12%)

Query: 404 PLPETTSQVLAARQQLQRVQGATKAKKSEPAA----ATRARPVNNAALERLASVTDRVQA 459
P E +Q + + + P+ AR + A + A T
Sbjct: 983 PEVEKRNQTVDTTN----ITTPNNIQADVPSVPSNNEEIARV-DEAPVPPPAPATPSETT 1037

Query: 460 RPVPSALEKAPAKKEAYRWKATTPVMQQKE--------VVATPKALKKA---LEHEKTPE 508
V ++ E AT Q +E V A + + A E ++T
Sbjct: 1038 ETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQT 1097

Query: 509 LAAKLAA---------EAIERDPWAAQVSQLSLPKLVEQVALNAWKE-ESDNAVCLHLRS 558
K A E+ +V+ PK + + E +N ++++
Sbjct: 1098 TETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKE 1157

Query: 559 SQRHLNNRGAQQKLAEALS-MLKGSTVELTIVEDDNPAVRTPLEWRQAIYEEKLAQARES 617
Q N ++ A+ S ++ E T V N V P A + + +
Sbjct: 1158 PQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSN 1217

Query: 618 IIADNNIQTLR 628
+ + +++R
Sbjct: 1218 KPKNRHRRSVR 1228


70  CMJKDNLE_00529  CMJKDNLE_00540  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
CMJKDNLE_00529  0       12       -0.357337   FimZ transcriptional regulator
CMJKDNLE_00531  -1      11       0.862850    *EnvY DNA-binding transcriptional activator
CMJKDNLE_00532  0       11       1.154603    hypothetical protein
CMJKDNLE_00533  -1      11       0.934730    bacteriophage N4 receptor, outer membrane
CMJKDNLE_00534  0       12       0.527546    bacteriophage N4 receptor, inner membrane
CMJKDNLE_00535  0       19       1.794266    putative sensory kinase in two-component
CMJKDNLE_00536  -1      18       1.898079    putative DNA-binding response regulator in
CMJKDNLE_00537  -1      17       0.995899    copper / silver efflux transport system - outer
CMJKDNLE_00538  -2      16       1.170560    copper / silver efflux transport system -
CMJKDNLE_00539  -1      16       1.327643    copper / silver efflux transport system -
CMJKDNLE_00540  -1      14       0.417083    copper / silver efflux transport system -
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00529  HTHFIS        61     4e-13    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 60.6 bits (147), Expect = 4e-13
Identities = 26/122 (21%), Positives = 55/122 (45%), Gaps = 2/122 (1%)

Query: 1 MKPTSVIIMDTHPIIRMSIEVLLQKNSELQIVLKTDDYRITIDYLRTRPVDLIIMDIDLP 60
M ++++ D IR + L + V T + ++ DL++ D+ +P
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGY--DVRITSNAATLWRWIAAGDGDLVVTDVVMP 58

Query: 61 GTDGFTFLKRIKQIQSTVKVLFLSSKSECFYAGRAIQAGANGFVSKCNDQNDIFHAVQMI 120
+ F L RIK+ + + VL +S+++ A +A + GA ++ K D ++ +
Sbjct: 59 DENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118

Query: 121 LS 122
L+
Sbjct: 119 LA 120


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00535  PF06580       31     0.007    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 31.4 bits (71), Expect = 0.007
Identities = 28/190 (14%), Positives = 66/190 (34%), Gaps = 46/190 (24%)

Query: 306 EELTRMAKMVSDML-FLAQADNNQLIPEKKMLNLADEVGKVFDFFEALAEDRGVELRFVG 364
+ M +S+++ + + N + + LADE+ V + + ++F
Sbjct: 191 TKAREMLTSLSELMRYSLRYSNARQVS------LADELTVVDSYLQLA------SIQF-E 237

Query: 365 DKCQV-------AGDPLMLRRALSNLLSNALRY----TPTGETIVVRCQTVDHLVQVIVE 413
D+ Q D + + L+ N +++ P G I+++ + V + VE
Sbjct: 238 DRLQFENQINPAIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVE 297

Query: 414 NPGTPIAPEHLPRLFDRFYRVDPSRQRKGEGSGIGLAIVK---SIVVAHKGTVAVTSDAR 470
N G+ E +G GL V+ ++ + + ++
Sbjct: 298 NTGSLALKN------------------TKESTGTGLQNVRERLQMLYGTEAQIKLSEKQG 339

Query: 471 GTRFVITLPA 480
++ +P
Sbjct: 340 KVNAMVLIPG 349


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
CMJKDNLE_00536  HTHFIS  86     2e-21    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 85.7 bits (212), Expect = 2e-21
Identities = 35/117 (29%), Positives = 62/117 (52%)

Query: 2 KLLIVEDEKKTGEYLTKGLTEAGFVVDLADNGLNGYHLAMTGDYDLIILDIMLPDVNGWD 61
+L+ +D+ L + L+ AG+ V + N + GD DL++ D+++PD N +D
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 IVRMLRSANKGMPILLLTALGTIEHRVKGLELGADDYLVKPFAFAELLARVRTLLRR 118
++ ++ A +P+L+++A T +K E GA DYL KPF EL+ + L
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits       Score  E-value  Comments
CMJKDNLE_00537  RTXTOXIND  39     4e-05    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 38.7 bits (90), Expect = 4e-05
Identities = 25/189 (13%), Positives = 60/189 (31%), Gaps = 13/189 (6%)

Query: 254 QAQTVNSDSLQSVKLPA-GLSSQILLQRPDIMEAEHALM-----AANANIGAARAAFFPS 307
+ +S + +K + +I+++ + + L+ A A+ ++
Sbjct: 87 NGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQS----- 141

Query: 308 ISLTSGISTASSDLSSLFNASSGMWNFIPKIEIPIFNAGRNQANLDIAEIRQQQSVVNYE 367
SL + + + + P F + L + + ++Q
Sbjct: 142 -SLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQN 200

Query: 368 QKIQNAFKEVADALALRQSLNDQISAQQRYLASLQITLQRARALYQHGAVSYLEVLDAER 427
QK Q + A R ++ +I+ + + L +L A++ VL+ E
Sbjct: 201 QKYQ-KELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQEN 259

Query: 428 SLFATRQTL 436
L
Sbjct: 260 KYVEAVNEL 268


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits         Score  E-value  Comments
CMJKDNLE_00538  BLACTAMASEA  26     0.033    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 25.9 bits (57), Expect = 0.033
Identities = 9/56 (16%), Positives = 24/56 (42%), Gaps = 1/56 (1%)

Query: 3 KALQVAMFSLFTVIGFNAQANEHHHETMSEAQPQVISATGVVKGIDLESKKITIHH 58
+ +++ + SL + A+ E + ++ Q+ G++ +DL S +
Sbjct: 2 RYIRLCIISLLATLPLAVHASPQPLEQIKLSESQLSGRVGMI-EMDLASGRTLTAW 56


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00540  ACRIFLAVINRP  695    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 695 bits (1794), Expect = 0.0
Identities = 214/1059 (20%), Positives = 440/1059 (41%), Gaps = 54/1059 (5%)

Query: 1 MIEWIIRRSVANRFLVLMGALFLSIWGTWTIINTPVDALPDLSDVQVIIKTSYPGQAPQI 60
M + IRR + A+ L + G I+ PV P ++ V + +YPG Q
Sbjct: 1 MANFFIRR----PIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQT 56

Query: 61 VENQVTYPLTTTMLSVPGAKTVRGFSQ-FGDSYVYVIFEDGTDPYWARSRVLEYLNQVQG 119
V++ VT + M + + S G + + F+ GTDP A+ +V L
Sbjct: 57 VQDTVTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATP 116

Query: 120 KLPAGVSAELGP-DATGVGWIYEYALVDRSGKHDLADLRSLQDWFLKYELKTIPDVAEVA 178
LP V + + + ++ V + D+ +K L + V +V
Sbjct: 117 LLPQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQ 176

Query: 179 SVGGVVKEYQVVIDPQRLAQYGISLAEVKSALDASNQEAGGSSIELA------EAEYMVR 232
G ++ +D L +Y ++ +V + L N + + + +
Sbjct: 177 LFGAQ-YAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASII 235

Query: 233 ASGYLQTLDDFNHIVLKASENGVPVYLRDVAKVQIGPEMRRGIAELNGEGEVAGGVVILR 292
A + ++F + L+ + +G V L+DVA+V++G E IA +NG+ AG + L
Sbjct: 236 AQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGK-PAAGLGIKLA 294

Query: 293 SGKNAREVIAAVKDKLETLKSSLPEGVEIVTTYDRSQLIDRAIDNLSGKLLEEFIVVAVV 352
+G NA + A+K KL L+ P+G++++ YD + + +I + L E ++V +V
Sbjct: 295 TGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLV 354

Query: 353 CALFLWHVRSALVAIISLPLGLCIAFIVMHFQGLNANIMSLGGIAIAVGAMVDAAIVMIE 412
LFL ++R+ L+ I++P+ L F ++ G + N +++ G+ +A+G +VD AIV++E
Sbjct: 355 MYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVE 414

Query: 413 NAHKRLEEWQHQHPDATLDNKTRWQVITDASVEVGPALFISLLIITLSFIPIFTLEGQEG 472
N + + E D + + ++ AL ++++ FIP+ G G
Sbjct: 415 NVERVMME----------DKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTG 464

Query: 473 RLFGPLAFTKTYAMAGAALLAIVVIPILMGYWIRGKIPPESSNPLNRF----------LI 522
++ + T AMA + L+A+++ P L ++ + E F +
Sbjct: 465 AIYRQFSITIVSAMALSVLVALILTPALCATLLKP-VSAEHHENKGGFFGWFNTTFDHSV 523

Query: 523 RVYHPLLLKVLHWPKTTLLVAALSVLTVLWPLNKVGGEFLPQINEGDLLYMPSTLPGISA 582
Y + K+L LL+ AL V ++ ++ FLP+ ++G L M G +
Sbjct: 524 NHYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQ 583

Query: 583 AEAASMLQKTDKLIM--SVPEVARVFGKTGKAETATDSAPLEMVETTIQLKPQEQW-RPG 639
+L + + V VF G + + + LKP E+
Sbjct: 584 ERTQKVLDQVTDYYLKNEKANVESVFTVNGFSFSGQAQN---AGMAFVSLKPWEERNGDE 640

Query: 640 MTMDKIIEELDNTVRLPGLANLWVPPIRNRIDMLSTGIKSPIGIKVSGTVLADI-DAMAE 698
+ + +I + + + +++ + I +G + A +
Sbjct: 641 NSAEAVIHRAKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQ 700

Query: 699 QIEEVARTVPGVASALAERLEGGRYINVEINREKAARYGMTVADVQLFVTSAVGGAMVGE 758
+ A+ + S LE +E+++EKA G++++D+ +++A+GG V +
Sbjct: 701 LLGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVND 760

Query: 759 TVEGIARYPINLRYPQSWRDSPQALRQLPILTPMKQQITLADVADIKVSTGPSMLKTENA 818
++ + ++ +R P+ + +L + + + + + G L+ N
Sbjct: 761 FIDRGRVKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNG 820

Query: 819 RPTSWIYIDARDRDMVSVVHDLQKAIAEKVQLKPGTSVAFSGQFELLERANHKLKLMVPM 878
P+ I +A L + +A K L G ++G + ++ +V +
Sbjct: 821 LPSMEIQGEAAPGTSSGDAMALMENLASK--LPAGIGYDWTGMSYQERLSGNQAPALVAI 878

Query: 879 TLMIIFVLLYLAFRRVGEALLIISSVPFALVGGIWLLWWMGFHLSVATGTGFIALAGVAA 938
+ +++F+ L + + ++ VP +VG + V G + G++A
Sbjct: 879 SFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSA 938

Query: 939 EFGVVMLMYLRHAIEAVPSLNNPQTFSEQKLDEALYHGAVLRVRPKAMTVAVIIAGLLPI 998
+ ++++ + + +E + + EA +R+RP MT I G+LP+
Sbjct: 939 KNAILIVEFAKDLMEK----------EGKGVVEATLMAVRMRLRPILMTSLAFILGVLPL 988

Query: 999 LWGTGAGSEVMSRIAAPMIGGMITAPLLSLFIIPAAYKL 1037
GAGS + + ++GGM++A LL++F +P + +
Sbjct: 989 AISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVV 1027


71  CMJKDNLE_00555  CMJKDNLE_00560  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
CMJKDNLE_00555-1164.557550enterobactin efflux transporter EntS
CMJKDNLE_00556-2154.239163ferric enterobactin ABC transporter -
CMJKDNLE_00557-1194.699543isochorismate synthase 1
CMJKDNLE_00558-1204.575888enterobactin synthase multienzyme complex
CMJKDNLE_005590194.425621EntB monomer
CMJKDNLE_005600174.0808542,3-dihydro-2,3-dihydroxybenzoate dehydrogenase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
CMJKDNLE_00555  TCRTETA  35     6e-04    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 34.8 bits (80), Expect = 6e-04
Identities = 81/393 (20%), Positives = 144/393 (36%), Gaps = 38/393 (9%)

Query: 27 FISIVSLGLLGVAVPVQIQMMTHSTWQV---GLSVTLTGGAMFVGLMVGGVLADRYERKK 83
+ V +GL+ +P ++ + HS G+ + L F V G L+DR+ R+
Sbjct: 15 ALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRP 74

Query: 84 VILLARGTCGIGFIGLCLNALL--PEPSLLAIYLLGLWDGFFASLGVTALLAATPALVGR 141
V+L + G ++ + P L +Y+ + G + G A A +
Sbjct: 75 VLL-------VSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVA-GAYIADITDG 126

Query: 142 ENLMQAGAITMLTVRLGSVISPMIGGLLLATGGVAWNYGLAAAGTFITLLPLLSLPALPP 201
+ + G V P++GGL+ GG + + AA L L LP
Sbjct: 127 DERARHFGFMSACFGFGMVAGPVLGGLM---GGFSPHAPFFAAAALNGLNFLTGCFLLPE 183

Query: 202 PPQPREHPLK----SLLAGFRFLLASPLVGGIALLGGLLTMAS----AVRVLYPALADNW 253
+ PL+ + LA FR+ +V + + ++ + A+ V++ D +
Sbjct: 184 SHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFG--EDRF 241

Query: 254 QMSAAQIGFLYAAIP-LGAAIGALTSGKLAHSARPGLLMLLSTLGS---FLAIGLFGLMP 309
A IG AA L + A+ +G +A ++L + ++ +
Sbjct: 242 HWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGW 301

Query: 310 MWILGVVCLALFGWLSAVSSLLQYTMLQTQTPEAMLGRINGLWTAQNVTGDAIGAALLGG 369
M +V LA G ML Q E G++ G A +G L
Sbjct: 302 MAFPIMVLLASGGIGMPALQ----AMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTA 357

Query: 370 LGAMMTPVASASASGFGLLIIGVLLLLVLVELR 402
+ A + + +G+ + L LL L LR
Sbjct: 358 IYA----ASITTWNGWAWIAGAALYLLCLPALR 386


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00556  FERRIBNDNGPP  63     2e-13    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 62.7 bits (152), Expect = 2e-13
Identities = 60/280 (21%), Positives = 100/280 (35%), Gaps = 35/280 (12%)

Query: 40 HTLESQPQRIVSTSVTLTGSLLAIDAPVIASGATTPNNRVADDQGFLRQWSKVAKERKLQ 99
H P RIV+ LLA+ VAD + R W E L
Sbjct: 29 HAAAIDPNRIVALEWLPVELLLALGIVPYG---------VADTINY-RLW---VSEPPLP 75

Query: 100 RLYIG-----EPSAEAVAAQMPDLILISATGGDSALALYDQLSTIAPTLIINYDDKS--- 151
I EP+ E + P ++ SA G S + L+ IAP N+ D
Sbjct: 76 DSVIDVGLRTEPNLELLTEMKPSFMVWSAGYGPS----PEMLARIAPGRGFNFSDGKQPL 131

Query: 152 --WQSLLTQLGEITGHEKQAAERIAQFDKQLAAAKEQIKLPPQPVTAIVYTAAAHSANLW 209
+ LT++ ++ + A +AQ++ + + K + + ++
Sbjct: 132 AMARKSLTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDPRHMLVF 191

Query: 210 TPESAQGQMLEQLGFTLAKLPAGLNASQSQGKRHDIIQLGGENLAAGLNGESLFLFAGDQ 269
P S ++L++ G NA Q + + + LAA + + L +
Sbjct: 192 GPNSLFQEILDEYGIP--------NAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDHDNS 243

Query: 270 KDADAIYANPLLAHLPAVQNKQVYALGTETFRLDYYSAMQ 309
KD DA+ A PL +P V+ + + F SAM
Sbjct: 244 KDMDALMATPLWQAMPFVRAGRFQRVPAVWFYGATLSAMH 283


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00559  ISCHRISMTASE  440    e-159    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 440 bits (1134), Expect = e-159
Identities = 145/299 (48%), Positives = 194/299 (64%), Gaps = 18/299 (6%)

Query: 1 MAIPKLQAYALPESHDIPQNKVDWAFEPQRAALLIHDMQDYFVSFWGENCPMMEQVIANI 60
MAIP +Q Y +P + D+PQNKV W +P RA LLIHDMQ+YFV + + ++ ANI
Sbjct: 1 MAIPAIQPYQMPTASDMPQNKVSWVPDPNRAVLLIHDMQNYFVDAFTAGASPVTELSANI 60

Query: 61 AALRDYCKQHNIPVYYTAQPKEQSDEDRALLNDMWGPGLTRSPEQQKVVDRLTPDADDTV 120
L++ C Q IPV YTAQP Q+ +DRALL D WGPGL P ++K++ L P+ DD V
Sbjct: 61 RKLKNQCVQLGIPVVYTAQPGSQNPDDRALLTDFWGPGLNSGPYEEKIITELAPEDDDLV 120

Query: 121 LVKWRYSAFHRSPLEQMLKESGRNQLIITGVYAHIGCMTTATDAFMRDIKPFMVADALAD 180
L KWRYSAF R+ L +M+++ GR+QLIITG+YAHIGC+ TA +AFM DIK F V DA+AD
Sbjct: 121 LTKWRYSAFKRTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVAD 180

Query: 181 LSRDEHLMSLKYVAGRSGRVVMTEELL------PAPIPASKA-----------ALREVIL 223
S ++H M+L+Y AGR VMT+ LL PA + + A +R+ I
Sbjct: 181 FSLEKHQMALEYAAGRCAFTVMTDSLLDQLQNAPADVQKTSANTGKKNVFTCENIRKQIA 240

Query: 224 PLLDESDEPFDDD-NLIDYGLDSVRMMALAARWRKVHGDIDFVMLAKNPTIDAWWKLLS 281
LL E+ E D +L+D GLDSVR+M L +WR+ ++ FV LA+ PTI+ W KLL+
Sbjct: 241 ELLQETPEDITDQEDLLDRGLDSVRIMTLVEQWRREGAEVTFVELAERPTIEEWQKLLT 299


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00560  DHBDHDRGNASE  364    e-131    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 364 bits (935), Expect = e-131
Identities = 110/258 (42%), Positives = 149/258 (57%), Gaps = 20/258 (7%)

Query: 5 GKNVWVTGAGKGIGYATALAFVEAGAKVTGFD---------------QAFTQEQYPFATE 49
GK ++TGA +GIG A A GA + D +A E +P
Sbjct: 8 GKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFP---- 63

Query: 50 VMDVADAAQVAQVCQRLLAETERLDALVNAAGILRMGATDQLSKEDWQQTFAVNVGGAFN 109
DV D+A + ++ R+ E +D LVN AG+LR G LS E+W+ TF+VN G FN
Sbjct: 64 -ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFN 122

Query: 110 LFQQTMNQFRRQRGGAIVTVASDAAHTPRIGMSAYGASKAALKSLALSVGLELAGSGVRC 169
+ +R G+IVTV S+ A PR M+AY +SKAA +GLELA +RC
Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 170 NVVSPGSTDTDMQRTLWVSDDAEEQRIRGFGEQFKLGIPLGKIARPQEIANTILFLASDL 229
N+VSPGST+TDMQ +LW ++ EQ I+G E FK GIPL K+A+P +IA+ +LFL S
Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242

Query: 230 ASHITLQDIVVDGGSTLG 247
A HIT+ ++ VDGG+TLG
Sbjct: 243 AGHITMHNLCVDGGATLG 260


72  CMJKDNLE_00771  CMJKDNLE_00776  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
CMJKDNLE_00771-1193.400932YbhF/YbhR/YbhS ABC transporter
CMJKDNLE_00772-1173.564718YbhF/YbhR/YbhS ABC transporter
CMJKDNLE_00773-1152.963952YbhF/YbhR/YbhS ABC transporter
CMJKDNLE_00774-1132.870978putative membrane fusion protein
CMJKDNLE_00775-1122.661292putative DNA-binding transcriptional regulator
CMJKDNLE_007760122.541136ATP-dependent RNA helicase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00771  ABC2TRNSPORT  47     3e-08    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 47.2 bits (112), Expect = 3e-08
Identities = 36/146 (24%), Positives = 63/146 (43%), Gaps = 5/146 (3%)

Query: 197 AREREQGTLDQLLVSPLTTWQIFIGKAVPALIVATFQATIVLAIGIWAYQIPFAGSLALF 256
R Q T + +L + L I +G+ A A IG+ A + + L+L
Sbjct: 92 GRMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGA---GIGVVAAALGYTQWLSLL 148

Query: 257 YFTMVI--YGLSLVGFGLLISSLCSTQQQAFIGVFVFMMPAILLSGYVSPVENMPVWLQN 314
Y VI GL+ G+++++L + + + P + LSG V PV+ +P+ Q
Sbjct: 149 YALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPIVFQT 208

Query: 315 LTWINPIRHFTDITKQIYLKDASLDI 340
P+ H D+ + I L +D+
Sbjct: 209 AARFLPLSHSIDLIRPIMLGHPVVDV 234


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
CMJKDNLE_00773  PF05272  32     0.012    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 31.6 bits (71), Expect = 0.012
Identities = 20/90 (22%), Positives = 28/90 (31%), Gaps = 21/90 (23%)

Query: 293 TPRFEDAFIDLLGGAGTSESPLGAILHTVEGTPGETVIEAKELTKKFGDFAATDHVNFAV 352
PR E + +LG P + + + K HV +
Sbjct: 547 VPRLEKWLVHVLGKTPDDYKP-------------RRLRYLQLVGKYI----LMGHVARVM 589

Query: 353 KRGEIFG----LLGPNGAGKSTTFKMMCGL 378
+ G F L G G GKST + GL
Sbjct: 590 EPGCKFDYSVVLEGTGGIGKSTLINTLVGL 619



Score = 29.7 bits (66), Expect = 0.046
Identities = 11/23 (47%), Positives = 13/23 (56%)

Query: 34 YVTGLVGPDGAGKTTLMRMLAGL 56
Y L G G GK+TL+ L GL
Sbjct: 597 YSVVLEGTGGIGKSTLINTLVGL 619


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits       Score  E-value  Comments
CMJKDNLE_00774  RTXTOXIND  62     6e-13    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 62.2 bits (151), Expect = 6e-13
Identities = 42/259 (16%), Positives = 92/259 (35%), Gaps = 25/259 (9%)

Query: 82 ALMQAKAGVSVAQAQYDLMLAGYRDEEIAQAAAAVKQAQAAYDYAQNFYNRQQGLWKSRT 141
Q + + +A+ +LA E + + + + L +
Sbjct: 201 QKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENK 260

Query: 142 ISA--NDLENARSSRDQAQATLKSAQDKLRQYRSGNREQ---DIAQAKASLEQAQAQLAQ 196
N+L +S +Q ++ + SA+++ + + + + Q ++ +LA+
Sbjct: 261 YVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAK 320

Query: 197 AELNLQDSTLIAPSDGTLLTRAV-EPGTVLNEGGTVFTVSLT-RPVWVRAYVDERNLDQA 254
E Q S + AP + V G V+ T+ + + V A V +++
Sbjct: 321 NEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFI 380

Query: 255 QPGRKVLLYTDGRPDKPYH---GQIGFVSPTAEFTPKTVETPDLRTDLVYRLRIVVT--- 308
G+ ++ + P Y G++ ++ A D R LV+ + I +
Sbjct: 381 NVGQNAIIKVEAFPYTRYGYLVGKVKNINLDA--------IEDQRLGLVFNVIISIEENC 432

Query: 309 ----DADDALRQGMPVTVQ 323
+ + L GM VT +
Sbjct: 433 LSTGNKNIPLSSGMAVTAE 451


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
CMJKDNLE_00775  HTHTETR  73     7e-18    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 72.7 bits (178), Expect = 7e-18
Identities = 33/214 (15%), Positives = 77/214 (35%), Gaps = 17/214 (7%)

Query: 9 KGEQAKKQLIAAALAQFGEYGMNATT-REIAAQAGQNIAAITYYFGSKEDLYLACAQWIA 67
+ ++ ++ ++ AL F + G+++T+ EIA AG AI ++F K DL+ +
Sbjct: 8 EAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSE 67

Query: 68 DFIGEQFRPHAEEAERLFAQPQPDRAAIRELILRACRNMIKLLTQDDTVNLSKFISREQL 127
IGE E + P + +RE+++ + + + + + F E +
Sbjct: 68 SNIGELEL---EYQAKFPGDP---LSVLREILIHVLESTVTEERRRLLMEII-FHKCEFV 120

Query: 128 SPTAAYHLVHEQVISPLHSHLTRLIAAWTGCDANDTRMILHTHALIGEILAFRLGKETIL 187
A + + + + + +A L T + + G
Sbjct: 121 GEMAVVQQAQRNLCLESYDRIEQTLKH--CIEAKMLPADLMTRRAAIIMRGYISG----- 173

Query: 188 LRTGWTAFDEEKTELINQTVTCHIDLILQGLSQR 221
L W + + + ++ ++L+
Sbjct: 174 LMENWLFAPQSFD--LKKEARDYVAILLEMYLLC 205


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits  Score  E-value  Comments
CMJKDNLE_00776  SECA  30     0.025    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 29.8 bits (67), Expect = 0.025
Identities = 20/67 (29%), Positives = 34/67 (50%), Gaps = 4/67 (5%)

Query: 246 QQVLVFTRTKHGANHLAEQLNKDGIRSAAIHG-NKSQGARTRALADFKSGDIRVLVATDI 304
Q VLV T + + ++ +L K GI+ ++ + A A A + + V +AT++
Sbjct: 450 QPVLVGTISIEKSELVSNELTKAGIKHNVLNAKFHANEAAIVAQAGYPAA---VTIATNM 506

Query: 305 AARGLDI 311
A RG DI
Sbjct: 507 AGRGTDI 513


73  CMJKDNLE_00820  CMJKDNLE_00829  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
CMJKDNLE_00820014-1.103880penicillin-binding protein 6
CMJKDNLE_00821114-0.187292DeoR DNA-binding transcriptional repressor
CMJKDNLE_00822013-0.038509undecaprenyl pyrophosphate phosphatase
CMJKDNLE_00823012-0.107252multidrug efflux transporter MdfA
CMJKDNLE_00824-114-0.419610hypothetical protein
CMJKDNLE_00825015-0.902361FMN phosphatase
CMJKDNLE_00826-1140.087853putative transporter
CMJKDNLE_00827012-0.889096putative DNA-binding transcriptional regulator
CMJKDNLE_008290110.157879putative transporter
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits         Score  E-value  Comments
CMJKDNLE_00820  BLACTAMASEA  43     8e-07    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 43.2 bits (102), Expect = 8e-07
Identities = 41/201 (20%), Positives = 64/201 (31%), Gaps = 34/201 (16%)

Query: 16 AFLFLFAPTAFAAEQTVEAPSVDARAW----------ILMDYASGKVLAEGNADEKLDPA 65
+ L A A P + I MD ASG+ L ADE+
Sbjct: 7 CIISLLATLPLAV-HASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRADERFPMM 65

Query: 66 SLTKIMTSYVVGQALKADKIKLTDMVTVGKDAWATGNPALRGSSVMFLKPGDQVSVADLN 125
S K++ V + A +L + + +P V D ++V +L
Sbjct: 66 STFKVVLCGAVLARVDAGDEQLERKIHYRQQDLVDYSP------VSEKHLADGMTVGELC 119

Query: 126 KGVIIQSGNDACIALADYVAGSQESFIGLMNGYAKKLGLTNTT---FQTVHGLDAPGQF- 181
I S N A L V G + + +++G T ++T PG
Sbjct: 120 AAAITMSDNSAANLLLATVGGPAG-----LTAFLRQIGDNVTRLDRWETELNEALPGDAR 174

Query: 182 --STARDMA------LLGKAL 194
+T MA L + L
Sbjct: 175 DTTTPASMAATLRKLLTSQRL 195


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
CMJKDNLE_00823  TCRTETA  40     1e-05    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 39.8 bits (93), Expect = 1e-05
Identities = 58/269 (21%), Positives = 106/269 (39%), Gaps = 23/269 (8%)

Query: 71 LLGPLSDRIGRRPVMLAGVVWFIVTCLAILLAQNIEQFTLLRFLQGISLCFIGAVGYAAI 130
+LG LSDR GRRPV+L + V + A + + R + GI+ GAV A I
Sbjct: 62 VLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGA-TGAVAGAYI 120

Query: 131 QESFEEAVCIKITALMANVALIAPLLGPLVG---AAWIHVLPWEGMFVLFAALAAISFFG 187
+ + + M+ + GP++G + P F AAL ++F
Sbjct: 121 ADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAP----FFAAAALNGLNFLT 176

Query: 188 LQRAMPETATRIGEKLSLKELGRDYKLVLKNG-RFVAGALALGFVSLPLLAWIAQSP--I 244
+PE+ L + L G VA +A+ F ++ + Q P +
Sbjct: 177 GCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFF----IMQLVGQVPAAL 232

Query: 245 IIITGEQLSSYEYGLLQVPIFGALIAGNL----LLARLTSRRTVRSLIIMGGWPIMIGLL 300
+I GE ++ + + + I +L + + +R R +++G G +
Sbjct: 233 WVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYI 292

Query: 301 VAAAATVISSHAYLWMTAGLSIYAFGIGL 329
+ A AT ++ + + + GIG+
Sbjct: 293 LLAFAT----RGWMAFPIMVLLASGGIGM 317


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
CMJKDNLE_00826  TCRTETB  34     0.001    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 33.7 bits (77), Expect = 0.001
Identities = 34/150 (22%), Positives = 65/150 (43%), Gaps = 6/150 (4%)

Query: 191 LLIGVVVLAMAFAEGSANDWL-PLLMVDGHGFSP-TSGSLIYAGFTLGMTVGRFTGGWFI 248
+IGV+ + F + + P +M D H S GS+I T+ + + + GG +
Sbjct: 258 FMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGILV 317

Query: 249 DRYSRVAVVR-ASALM--GALGIGLIIFVDSAWVA-GVSVVLWGLGASLGFPLTISAASD 304
DR + V+ + L ++ S ++ + VL GL + TI ++S
Sbjct: 318 DRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVISTIVSSSL 377

Query: 305 TGPDAPTRVSVVATTGYLAFLVGPPLLGYL 334
+A +S++ T +L+ G ++G L
Sbjct: 378 KQQEAGAGMSLLNFTSFLSEGTGIAIVGGL 407


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
CMJKDNLE_00827  HTHTETR  50     4e-10    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 50.4 bits (120), Expect = 4e-10
Identities = 14/81 (17%), Positives = 31/81 (38%)

Query: 2 RRANDPQRREKIIQATLEAVKLYGIHAVTHRKIATLAGVPLGSMTYYFSGIDELLLEAFS 61
+ + R+ I+ L G+ + + +IA AGV G++ ++F +L E +
Sbjct: 5 TKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWE 64

Query: 62 SFTEIMSRQYQAFFSDVSDAP 82
+ + + P
Sbjct: 65 LSESNIGELELEYQAKFPGDP 85


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
CMJKDNLE_00829  TCRTETA  32     0.006    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 32.1 bits (73), Expect = 0.006
Identities = 21/106 (19%), Positives = 34/106 (32%), Gaps = 6/106 (5%)

Query: 394 LMIGMITFQFSTFSFGMGNAAGLLFAGIML-GFMRANHPTFG-YIPQ--GALSMVKEFGL 449
L++ + +L+ G ++ G A G YI + FG
Sbjct: 76 LLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADITDGDERARHFGF 135

Query: 450 MVFMAGVGLSAGSGINNGLGAIGGQM--LIAGLIVSLVPVVICFLF 493
M G G+ AG + +G A + L + CFL
Sbjct: 136 MSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLL 181


74  CMJKDNLE_00847  CMJKDNLE_00852  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
CMJKDNLE_00847-1132.597657arginine ABC transporter - ATP binding subunit
CMJKDNLE_00848-1133.187501putative lipoprotein
CMJKDNLE_00849-1143.000050hypothetical protein
CMJKDNLE_00850-1133.097902anhydro-N-acetylmuramoyl-L-alanine amidase
CMJKDNLE_00851-2142.928634putative NAD(P)H-binding oxidoreductase with
CMJKDNLE_00852-3122.205653hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
CMJKDNLE_00847  PF05272  30     0.010    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.0 bits (67), Expect = 0.010
Identities = 9/18 (50%), Positives = 12/18 (66%)

Query: 31 LVLLGPSGAGKSSLLRVL 48
+VL G G GKS+L+ L
Sbjct: 599 VVLEGTGGIGKSTLINTL 616


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits        Score  E-value  Comments
CMJKDNLE_00850  ECOLIPORIN  28     0.041    E.coli/Salmonella-type porin signature.
		>ECOLIPORIN#E.coli/Salmonella-type porin signature.

Length = 383

Score = 28.0 bits (62), Expect = 0.041
Identities = 20/54 (37%), Positives = 26/54 (48%), Gaps = 9/54 (16%)

Query: 2 RRFFWLVAAALLLAGCAGEKGIVEKEGYQLDTRRQAQAAYPRIKVLVIHYTADD 55
R+ LV ALL AG A I K+G +LD Y ++ L HY +DD
Sbjct: 3 RKVLALVIPALLAAGAAHAAEIYNKDGNKLDL-------YGKVDGL--HYFSDD 47


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00851  NUCEPIMERASE  75     2e-17    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 75.2 bits (185), Expect = 2e-17
Identities = 70/363 (19%), Positives = 123/363 (33%), Gaps = 65/363 (17%)

Query: 1 MKVLVTGATSGLGRNAVEFLCQKGISVRA---------TGRNEAMGKLLEKMGAEFVPAD 51
MK LVTGA +G + + L + G V +A +LL + G +F D
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 52 LTELVSSQAKVMLAGIDTLWHCS-------SFTSPWGTQQAFDLANVRATRRLGEWAVAW 104
L + + ++ S +P A+ +N+ + E
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPH----AYADSNLTGFLNILEGCRHN 116

Query: 105 GVRNFIHISSPSLYFDYHHHRDIKEDFRPHRFANEFARSKAASEEVINMLSQANPQTRFT 164
+++ ++ SS S+Y + D + +A +K A+E + + S T
Sbjct: 117 KIQHLLYASSSSVYGL-NRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSH-LYGLPAT 174

Query: 165 ILRPQSLFGPHDK--VFIPRLAHMMHHYGSILLPHGGSALVDMTYYENAVHAMWLASQEA 222
LR +++GP + + + + M SI + + G D TY ++ A+
Sbjct: 175 GLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVI 234

Query: 223 CDKLPS--------------GRVYNITNGEHRTLRSIVQKLIDELNIDCRIRSVPYPMLD 268
RVYNI N L +Q L D L I+ + +P D
Sbjct: 235 PHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNMLPLQPGD 294

Query: 269 MIARSMERLGRKSAKEPPLTHYGVSKLNFDFTLDITRAQEELGYQPVITLDEGIEKTAAW 328
+ T D E +G+ P T+ +G++ W
Sbjct: 295 V----------------LETS-----------ADTKALYEVIGFTPETTVKDGVKNFVNW 327

Query: 329 LRD 331
RD
Sbjct: 328 YRD 330


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00852  NUCEPIMERASE  56     1e-10    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 55.6 bits (134), Expect = 1e-10
Identities = 29/125 (23%), Positives = 52/125 (41%), Gaps = 17/125 (13%)

Query: 4 RILVLGASGYIGQHLVRTLSQQGHQILA---------AARHVDRLAKLQLANVSCHKVDL 54
+ LV GA+G+IG H+ + L + GHQ++ + RL L HK+DL
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 55 SWPDNLPALLQD--IDTVYFLVH------SMGEGGDFIAQERQVALNVRDALREVPVKQL 106
+ + + L + V+ H S+ + LN+ + R ++ L
Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQHL 121

Query: 107 IFLSS 111
++ SS
Sbjct: 122 LYASS 126


75  CMJKDNLE_00923  CMJKDNLE_00928  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
CMJKDNLE_00923225-4.893379fimbrial-like adhesin protein
CMJKDNLE_00924124-3.913443putative periplasmic pilin chaperone
CMJKDNLE_00925021-3.748567putative outer membrane usher protein
CMJKDNLE_00926-123-3.427299putative fimbrial-like adhesin protein
CMJKDNLE_00927021-2.932098putative fimbrial-like adhesin protein
CMJKDNLE_00928-211-0.941515putative fimbrial-like adhesin protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00923  FIMBRIALPAPE  28     0.012    Escherichia coli: P pili tip fibrillum papE protein...
		>FIMBRIALPAPE#Escherichia coli: P pili tip fibrillum papE protein signature.

Length = 173

Score = 28.5 bits (63), Expect = 0.012
Identities = 26/92 (28%), Positives = 37/92 (40%), Gaps = 14/92 (15%)

Query: 6 LTAFITVVCATSSVMAADDNAITDGSVTFNGKVIAPACTLVAATKDSVVTLPDVSATKLQ 65
L + V + V AAD+ +TF GK+I PACT+ A V D+ L
Sbjct: 9 LPVMLGAVLMSQHVHAADN-------LTFKGKLIIPACTVQNAE----VNWGDIEIQNLV 57

Query: 66 TNGQVS---GVQIDVPIELKDCDTTVTKNATF 94
+G V ++ P L T+T N
Sbjct: 58 QSGGNQKDFTVDMNCPYSLGTMKVTITSNGQT 89


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
CMJKDNLE_00925  PF00577  827    0.0      Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 827 bits (2139), Expect = 0.0
Identities = 414/862 (48%), Positives = 569/862 (66%), Gaps = 18/862 (2%)

Query: 15 GVPSFIGGLVVFVSAAFNAQAETWFDPAFFKDDPSMVADLSRFEKGQKITPGVYRVDIVL 74
G + F + A + AE +F+P F DDP VADLSRFE GQ++ PG YRVDI L
Sbjct: 25 GFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPGTYRVDIYL 84

Query: 75 NQTIVDTRNVNFVEITPEKGIAACLTTESLDAMGVNTDAFPAFKQLDKQACVPLAEIIPD 134
N + TR+V F E+GI CLT L +MG+NT + L ACVPL +I D
Sbjct: 85 NNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVPLTSMIHD 144

Query: 135 ASVTFNVNKLRLEISVPQIAIKSNARGYVPPERWDEGINALLLGYSFSGANSIHSSADSD 194
A+ +V + RL +++PQ + + ARGY+PPE WD GINA LL Y+FSG + + +
Sbjct: 145 ATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQNRIGGNS 204

Query: 195 SGDSYFLNLNSGVNLGPWRLRNNSTWSR-----SSGQTAEWKNLSSYLQRAVIPLKGELT 249
+LNL SG+N+G WRLR+N+TWS SSG +W++++++L+R +IPL+ LT
Sbjct: 205 --HYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRSRLT 262

Query: 250 VGDDYTAGDFFDSVSFRGVQLASDDNMLPDSLKGFAPVVRGIAKSNAQITIKQNGYTIYQ 309
+GD YT GD FD ++FRG QLASDDNMLPDS +GFAPV+ GIA+ AQ+TIKQNGY IY
Sbjct: 263 LGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDIYN 322

Query: 310 TYVSPGAFEISDLYSTSSSGDLLVEIKEADGSVNSYSVPFSSVPLLQRQGRIKYAVTLAK 369
+ V PG F I+D+Y+ +SGDL V IKEADGS ++VP+SSVPLLQR+G +Y++T +
Sbjct: 323 STVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSITAGE 382

Query: 370 YRTNSNEQQESKFAQATLQWGGPWGTTWYGGGQYAEYYRAAMFGLGFNLGDFGAISFDAT 429
YR+ + +Q++ +F Q+TL G P G T YGG Q A+ YRA FG+G N+G GA+S D T
Sbjct: 383 YRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGALSVDMT 442

Query: 430 QAKSTLADQSEHKGQSYRFLYAKTLNHLGTNFQLMGYRYSTSGFYTLSDTMYKHMDGY-- 487
QA STL D S+H GQS RFLY K+LN GTN QL+GYRYSTSG++ +DT Y M+GY
Sbjct: 443 QANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGYNI 502

Query: 488 EFNDGDDEDTPMWSRYYNLFYTKRGKLQVNISQQLGEYGSFYLSGSQQTYWHTDQQDRLL 547
E DG + P ++ YYNL Y KRGKLQ+ ++QQLG + YLSGS QTYW T D
Sbjct: 503 ETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQTYWGTSNVDEQF 562

Query: 548 QFGYNTQIKDLSLGISWNYSKSRGQPDADQVFALNFSLPLNLLLPRSNDSYTRKKNYAWM 607
Q G NT +D++ +S++ +K+ Q DQ+ ALN ++P + L + S R +A
Sbjct: 563 QAGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSDSKSQWR---HASA 619

Query: 608 TSNTSIDNEGHTTQNLGLTETLLDDGNLSYSVQQGYNSEGKTANGS---ASMDYKGAFAD 664
+ + S D G T G+ TLL+D NLSYSVQ GY G +GS A+++Y+G + +
Sbjct: 620 SYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYGN 679

Query: 665 ARVGYNYSDNGSQQQLNYALSGSLVAHSQGITLGQSLGETNVLIAAPGAENTRVANSTGL 724
A +GY++SD+ +QL Y +SG ++AH+ G+TLGQ L +T VL+ APGA++ +V N TG+
Sbjct: 680 ANIGYSHSDD--IKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDAKVENQTGV 737

Query: 725 KTDWRGYTVVPYATSYRENRIALDAASLKRNVDLENAVVNVVPTKGALVLAEFNAHAGAR 784
+TDWRGY V+PYAT YRENR+ALD +L NVDL+NAV NVVPT+GA+V AEF A G +
Sbjct: 738 RTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKARVGIK 797

Query: 785 VLMKTSKQGIPLRFGAIATLDGVQANSGIIDDDGSLYMAGLPAKGTISVRWGEAPDQICH 844
+LM + PL FGA+ T + +SGI+ D+G +Y++G+P G + V+WGE + C
Sbjct: 798 LLMTLTHNNKPLPFGAMVTSES-SQSSGIVADNGQVYLSGMPLAGKVQVKWGEEENAHCV 856

Query: 845 INYELTEQQINSAITRMDAICR 866
NY+L + +T++ A CR
Sbjct: 857 ANYQLPPESQQQLLTQLSAECR 878


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_00926  CLENTEROTOXN  32     0.004    Clostridium enterotoxin signature.
		>CLENTEROTOXN#Clostridium enterotoxin signature.

Length = 319

Score = 31.6 bits (71), Expect = 0.004
Identities = 13/48 (27%), Positives = 22/48 (45%)

Query: 295 VGVVVTDSQNNIISPAGGTLPLSIPDDADSIARMNVYPVSTTGVPPET 342
+ V TD + I+ A T L++ D +S N+Y ++ P T
Sbjct: 188 LTVPSTDIEKEILDLAAATERLNLTDALNSNPAGNLYDWRSSNSYPWT 235


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
CMJKDNLE_00928  PF00577  28     0.025    Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 27.9 bits (62), Expect = 0.025
Identities = 19/90 (21%), Positives = 32/90 (35%), Gaps = 8/90 (8%)

Query: 25 LFLLGLTWGCELFAHDGTVNISGSFRRNTCVLAQDSKQINVQLGDVSLTRFSHGNYGPEK 84
F + L C A + F N LA D + L+RF +G P
Sbjct: 25 GFFVRLFVACAFAAQAPLSSAELYF--NPRFLADDPQ------AVADLSRFENGQELPPG 76

Query: 85 SFIINLQDCGTDVSTVDVTFSGTPDGVQSE 114
++ +++ ++T DVTF+
Sbjct: 77 TYRVDIYLNNGYMATRDVTFNTGDSEQGIV 106


76    CMJKDNLE_01059    CMJKDNLE_01067    N        Y        N        Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
CMJKDNLE_01059  0       11       2.288691   flagellar hook protein FlgE
CMJKDNLE_01060  -1      12       2.157424   flagellar basal-body rod protein FlgF
CMJKDNLE_01061  0       9        1.029029   flagellar basal-body rod protein FlgG
CMJKDNLE_01062  1       13       1.953863   flagellar L-ring protein FlgH; basal-body
CMJKDNLE_01063  1       13       1.719456   flagellar P-ring protein FlgI
CMJKDNLE_01064  1       14       1.288816   FlgJ
CMJKDNLE_01065  2       15       0.876193   flagellar biosynthesis, hook-filament junction
CMJKDNLE_01066  3       18       1.171965   flagellar biosynthesis; hook-filament junction
CMJKDNLE_01067  3       17       1.384303   RNase E
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits        Score  E-value  Comments
CMJKDNLE_01059  FLGHOOKAP1  41     5e-06    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 41.5 bits (97), Expect = 5e-06
Identities = 17/49 (34%), Positives = 29/49 (59%)

Query: 354 TLTNGALEASNVDLSKELVNMIVAQRNYQSNAQTIKTQDQILNTLVNLR 402
L+N S V+L +E N+ Q+ Y +NAQ ++T + I + L+N+R
Sbjct: 498 QLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR 546



Score = 36.9 bits (85), Expect = 1e-04
Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 4/56 (7%)

Query: 6 AVSGLNAAATNLDVIGNNIANSATYGFKSGTASFAD----MFAGSKVGLGVKVAGI 57
A+SGLNAA L+ NNI++ G+ T A + AG VG GV V+G+
Sbjct: 7 AMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVSGV 62


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits        Score  E-value  Comments
CMJKDNLE_01061  FLGHOOKAP1  44     4e-07    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 43.8 bits (103), Expect = 4e-07
Identities = 18/81 (22%), Positives = 36/81 (44%), Gaps = 14/81 (17%)

Query: 3 SSLWIAKTGLDAQQTNMDVIANNLANVSTNGFKRQRAVFEDLLYQTIRQPGAQSSEQTTL 62
S + A +GL+A Q ++ +NN+++ + G+ RQ + + +TL
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTI--------------MAQANSTL 47

Query: 63 PSGLQIGTGVRPVATERLHSQ 83
+G +G GV +R +
Sbjct: 48 GAGGWVGNGVYVSGVQREYDA 68



Score = 41.1 bits (96), Expect = 3e-06
Identities = 11/41 (26%), Positives = 21/41 (51%)

Query: 220 ETSNVNVAEELVNMIQVQRAYEINSKAVSTTDQMLQKLTQL 260
S VN+ EE N+ + Q+ Y N++ + T + + L +
Sbjct: 505 SISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01062  FLGLRINGFLGH  349    e-126    Flagellar L-ring protein signature.
		>FLGLRINGFLGH#Flagellar L-ring protein signature.

Length = 232

Score = 349 bits (897), Expect = e-126
Identities = 232/232 (100%), Positives = 232/232 (100%)

Query: 1 MQKNAAHTYAISSLLVLSLTGCAWIPSTPLVQGATSAQPVPGPTPVANGSIFQSAQPINY 60
MQKNAAHTYAISSLLVLSLTGCAWIPSTPLVQGATSAQPVPGPTPVANGSIFQSAQPINY
Sbjct: 1 MQKNAAHTYAISSLLVLSLTGCAWIPSTPLVQGATSAQPVPGPTPVANGSIFQSAQPINY 60

Query: 61 GYQPLFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYLQGLFGNA 120
GYQPLFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYLQGLFGNA
Sbjct: 61 GYQPLFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYLQGLFGNA 120

Query: 121 RADVEASGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRF 180
RADVEASGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRF
Sbjct: 121 RADVEASGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRF 180

Query: 181 SGVVNPRTISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSPM 232
SGVVNPRTISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSPM
Sbjct: 181 SGVVNPRTISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSPM 232


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01063  FLGPRINGFLGI  427    e-152    Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 427 bits (1100), Expect = e-152
Identities = 157/363 (43%), Positives = 213/363 (58%), Gaps = 9/363 (2%)

Query: 4 FLSALILLLVTTAAQAERIRDLTSVQGVRQNSLIGYGLVVGLDGTGDQTTQTPFTTQTLN 63
F + L A RI+D+ S+Q R N LIGYGLVVGL GTGD +PFT Q++
Sbjct: 13 FSALPFLSTPPAQADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDSLRSSPFTEQSMR 72

Query: 64 NMLSQLGITVPTGTNMQLKNVAAVMVTASLPPFGRQGQTIDVVVSSMGNAKSLRGGTLLM 123
ML LGIT G + KN+AAVMVTA+LPPF G +DV VSS+G+A SLRGG L+M
Sbjct: 73 AMLQNLGITTQGGQS-NAKNIAAVMVTANLPPFASPGSRVDVTVSSLGDATSLRGGNLIM 131

Query: 124 TPLKGVDSQVYALAQGNILVGGAGASAGGSSVQVNQLNGGRITNGAVIERELPSQFGVGN 183
T L G D Q+YA+AQG ++V G A +++ R+ NGA+IERELPS+F
Sbjct: 132 TSLSGADGQIYAVAQGALIVNGFSAQGDAATLTQGVTTSARVPNGAIIERELPSKFKDSV 191

Query: 184 TLNLQLNDEDFSMAQQIADTINRVR----GYGSATALDARTIQVRVPSGNSSQVRFLADI 239
L LQL + DFS A ++AD +N G A D++ I V+ P + R +A+I
Sbjct: 192 NLVLQLRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQEIAVQKPRV-ADLTRLMAEI 250

Query: 240 QNMQVNVTPQDAKVVINSRTGSVVMNREVTLDSCAVAQGNLSVTVNRQANVSQPDTPFGG 299
+N+ V T AKVVIN RTG++V+ +V + AV+ G L+V V V QP PF
Sbjct: 251 ENLTVE-TDTPAKVVINERTGTIVIGADVRISRVAVSYGTLTVQVTESPQVIQP-APFSR 308

Query: 300 GQTVVTPQTQIDLRQSGGSLQSVRSSASLNNVVRALNALGATPMDLMSILQSMQSAGCLR 359
GQT V PQT I Q G + ++ L +V LN++G +++ILQ ++SAG L+
Sbjct: 309 GQTAVQPQTDIMAMQEGSKV-AIVEGPDLRTLVAGLNSIGLKADGIIAILQGIKSAGALQ 367

Query: 360 AKL 362
A+L
Sbjct: 368 AEL 370


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
CMJKDNLE_01064  FLGFLGJ  511    0.0      Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 511 bits (1318), Expect = 0.0
Identities = 313/313 (100%), Positives = 313/313 (100%)

Query: 1 MISDSKLLASAAWDAQSLNELKAKAGEDPAANIRPVARQVEGMFVQMMLKSMRDALPKDG 60
MISDSKLLASAAWDAQSLNELKAKAGEDPAANIRPVARQVEGMFVQMMLKSMRDALPKDG
Sbjct: 1 MISDSKLLASAAWDAQSLNELKAKAGEDPAANIRPVARQVEGMFVQMMLKSMRDALPKDG 60

Query: 61 LFSSEHTRLYTSMYDQQIAQQMTAGKGLGLAEMMVKQMTPEQPLPEESTPAAPMKFPLET 120
LFSSEHTRLYTSMYDQQIAQQMTAGKGLGLAEMMVKQMTPEQPLPEESTPAAPMKFPLET
Sbjct: 61 LFSSEHTRLYTSMYDQQIAQQMTAGKGLGLAEMMVKQMTPEQPLPEESTPAAPMKFPLET 120

Query: 121 VVRYQNQALSQLVQKAVPRNYDDSLPGDSKAFLAQLSLPAQLASQQSGVPHHLILAQAAL 180
VVRYQNQALSQLVQKAVPRNYDDSLPGDSKAFLAQLSLPAQLASQQSGVPHHLILAQAAL
Sbjct: 121 VVRYQNQALSQLVQKAVPRNYDDSLPGDSKAFLAQLSLPAQLASQQSGVPHHLILAQAAL 180

Query: 181 ESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRVYSSYL 240
ESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRVYSSYL
Sbjct: 181 ESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRVYSSYL 240

Query: 241 EALSDYVGLLTRNPRYAAVTTAASAEQGAQALQDAGYATDPHYARKLTNMIQQMKSISDK 300
EALSDYVGLLTRNPRYAAVTTAASAEQGAQALQDAGYATDPHYARKLTNMIQQMKSISDK
Sbjct: 241 EALSDYVGLLTRNPRYAAVTTAASAEQGAQALQDAGYATDPHYARKLTNMIQQMKSISDK 300

Query: 301 VSKTYSMNIDNLF 313
VSKTYSMNIDNLF
Sbjct: 301 VSKTYSMNIDNLF 313


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits        Score  E-value  Comments
CMJKDNLE_01065  FLGHOOKAP1  682    0.0      Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 682 bits (1762), Expect = 0.0
Identities = 545/546 (99%), Positives = 545/546 (99%)

Query: 2 SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMEQANSTLGAGGWVGNGVYVS 61
SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIM QANSTLGAGGWVGNGVYVS
Sbjct: 1 SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVS 60

Query: 62 GVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLV 121
GVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLV
Sbjct: 61 GVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLV 120

Query: 122 SNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLND 181
SNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLND
Sbjct: 121 SNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLND 180

Query: 182 QISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTA 241
QISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTA
Sbjct: 181 QISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTA 240

Query: 242 RQLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQL 301
RQLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQL
Sbjct: 241 RQLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQL 300

Query: 302 ALAFAEAFNTQHKAGFDANGDAGEDFFAIGKPAVLQNTKNKGDVAIGATVTDASAVLATD 361
ALAFAEAFNTQHKAGFDANGDAGEDFFAIGKPAVLQNTKNKGDVAIGATVTDASAVLATD
Sbjct: 301 ALAFAEAFNTQHKAGFDANGDAGEDFFAIGKPAVLQNTKNKGDVAIGATVTDASAVLATD 360

Query: 362 YKISFDNNQWQVTRLASNTTFTVTPDANGKVAFDGLELTFTGTPAVNDSFTLKPVSDAIV 421
YKISFDNNQWQVTRLASNTTFTVTPDANGKVAFDGLELTFTGTPAVNDSFTLKPVSDAIV
Sbjct: 361 YKISFDNNQWQVTRLASNTTFTVTPDANGKVAFDGLELTFTGTPAVNDSFTLKPVSDAIV 420

Query: 422 NMDVLITDEAKIAMASEEDAGDSDNRNGQALLDLQSNSKTVGGAKSFNDAYASLVSDIGN 481
NMDVLITDEAKIAMASEEDAGDSDNRNGQALLDLQSNSKTVGGAKSFNDAYASLVSDIGN
Sbjct: 421 NMDVLITDEAKIAMASEEDAGDSDNRNGQALLDLQSNSKTVGGAKSFNDAYASLVSDIGN 480

Query: 482 KTATLKTSSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFD 541
KTATLKTSSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFD
Sbjct: 481 KTATLKTSSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFD 540

Query: 542 ALINIR 547
ALINIR
Sbjct: 541 ALINIR 546


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits       Score  E-value  Comments
CMJKDNLE_01066  FLAGELLIN  45     2e-07    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 45.4 bits (107), Expect = 2e-07
Identities = 30/132 (22%), Positives = 60/132 (45%), Gaps = 3/132 (2%)

Query: 7 MMYQQNMRGITNSQAEWMKYGEQMSTGKRVVNPSDDPIAASQAVVLSQAQAQNSQYTLAR 66
++ Q N+ +S + + E++S+G R+ + DD + A + +Q +
Sbjct: 11 LLTQNNLNKSQSSLSSAI---ERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQASRNA 67

Query: 67 TFATQKVSLEESVLSQVTTAIQNAQEKIVYASNGTLSDDDRASLATDIQGLRDQLLNLAN 126
E L+++ +Q +E V A+NGT SD D S+ +IQ +++ ++N
Sbjct: 68 NDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEIDRVSN 127

Query: 127 TTDGNGRYIFAG 138
T NG + +
Sbjct: 128 QTQFNGVKVLSQ 139


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits         Score  E-value  Comments
CMJKDNLE_01067  IGASERPTASE  65     2e-12    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 64.7 bits (157), Expect = 2e-12
Identities = 47/288 (16%), Positives = 83/288 (28%), Gaps = 36/288 (12%)

Query: 513 PSEEEFAERKRPEQPALATFAMPDVPPAPT-PAEPAAPVVAPAPKAAPATPVAPAQPGLL 571
P E+ + DVP P+ E A AP P APATP
Sbjct: 983 PEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETT----- 1037

Query: 572 SRFFGALKALFSGGEETKPTEQPAPKAEAKPERQQDRRKPRQNNRRDRNERRDTRSER-- 629
ET + Q QN + + + ++
Sbjct: 1038 ---------------ETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQT 1082

Query: 630 TEGSDNREENRRNRRQAQQQTAETRESRQQAEVTEKARTADEQQAPRRERSRRRNDDKRQ 689
E + + E + + ++TA + + TEK + + + + + + Q
Sbjct: 1083 NEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQ 1142

Query: 690 AQ---QEAKALNVEEQSVQETEQEERVRPVQPRRKQRQLNQKVRYEQSV--AEEAVVAPV 744
A+ + +N++E Q + +P + + Q V +V V P
Sbjct: 1143 AEPARENDPTVNIKEPQSQTNTTADTEQPA--KETSSNVEQPVTESTTVNTGNSVVENPE 1200

Query: 745 VEETVAAEPIVQEAPA------PRTELVKVPLPVVAQTAPEQQEENNA 786
+P V + R + VP V T A
Sbjct: 1201 NTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVA 1248



Score = 63.2 bits (153), Expect = 5e-12
Identities = 46/261 (17%), Positives = 83/261 (31%), Gaps = 26/261 (9%)

Query: 551 VAPAPKAAPATPVAPAQPGLLSRFFGALKALFSGGEETKPTEQP-APKAEAKPERQQDRR 609
+ P + S E + E P P A A P
Sbjct: 991 TVDTTNITTPNNIQADVPSVPSN----------NEEIARVDEAPVPPPAPATPSETT--- 1037

Query: 610 KPRQNNRRDRNERRDTRSERTEGSDNREENRRNRRQAQQQTAETRESRQQAEV------T 663
N ++++++ D E +NR A++ + + + Q EV T
Sbjct: 1038 -----ETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSET 1092

Query: 664 EKARTADEQQAPRRERSRRRNDDKRQAQQEAKALNVEEQSVQETEQEERVRPVQPRRKQR 723
++ +T + ++ E+ + + + Q+ K + + QE + + + R
Sbjct: 1093 KETQTTETKETATVEKEEKAKVETEKTQEVPK-VTSQVSPKQEQSETVQPQAEPARENDP 1151

Query: 724 QLNQKVRYEQSVAEEAVVAPVVEETVAAEPIVQEAPAPRTELVKVPLPVVAQTAPEQQEE 783
+N K Q+ P E + E V E+ T V P A Q
Sbjct: 1152 TVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTV 1211

Query: 784 NNADNRDNGGMPRRSRRSPRH 804
N+ + RRS RS H
Sbjct: 1212 NSESSNKPKNRHRRSVRSVPH 1232


77    CMJKDNLE_01161    CMJKDNLE_01171    N        Y        N        Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
CMJKDNLE_01161  -1      14       -2.187470  Rac prophage; predicted protein, C-ter fragment
CMJKDNLE_01162  -2      14       -2.002172  putative membrane protein
CMJKDNLE_01164  -2      10       -1.742079  DLP12 prophage; predicted SAM-dependent
CMJKDNLE_01166  -1      11       -1.040637  putrescine / spermidine ABC transporter -
CMJKDNLE_01167  -1      10       -0.638900  putrescine / spermidine ABC transporter - ATP
CMJKDNLE_01168  -1      11       0.020109   peptidase T
CMJKDNLE_01169  0       13       0.730990   hypothetical protein
CMJKDNLE_01170  -1      14       0.283829   EnvZ sensory histidine kinase
CMJKDNLE_01171  -2      17       0.390216   putative DNA-binding response regulator in
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01161  ENTEROVIROMP  139    3e-44    Enterobacterial virulence outer membrane protein si...
		>ENTEROVIROMP#Enterobacterial virulence outer membrane protein signature.

Length = 171

Score = 139 bits (351), Expect = 3e-44
Identities = 63/200 (31%), Positives = 99/200 (49%), Gaps = 30/200 (15%)

Query: 1 MRKVYAAILSAAICLAVSGTPAWASEHQSTLSAGYLHARTNVPGSDNLNGINVKYRYEFT 60
M+K+ AA+ +GT A ST++ GY + + + G N+KYRYE
Sbjct: 1 MKKIACLSALAAVLAFTAGTSVAA---TSTVTGGYAQSDAQGQMNK-MGGFNLKYRYEED 56

Query: 61 DT-LGLVTSFSYANAKDEQKTHYSDTRWHEDSVRNRWFSVMAGPSVRVNEWFSAYAMAGM 119
++ LG++ SF+Y T S T D +N+++ + AGP+ R+N+W S Y + G+
Sbjct: 57 NSPLGVIGSFTY--------TEKSRTASSGDYNKNQYYGITAGPAYRINDWASIYGVVGV 108

Query: 120 AYSRVSTFSGDYLRVTDNKGKTHDVLTGSDDGRHSNTSLAWGAGVQFNPTESVAVDVAYE 179
Y + T T+ HD S+ ++GAG+QFNP E+VA+D +YE
Sbjct: 109 GYGKFQT--------TEYPTYKHD---------TSDYGFSYGAGLQFNPMENVALDFSYE 151

Query: 180 GSGSGDWRTDGFIVGVGYKF 199
S +I GVGY+F
Sbjct: 152 QSRIRSVDVGTWIAGVGYRF 171


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits         Score  E-value  Comments
CMJKDNLE_01162  IGASERPTASE  45     1e-06    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 45.4 bits (107), Expect = 1e-06
Identities = 54/346 (15%), Positives = 107/346 (30%), Gaps = 43/346 (12%)

Query: 9 LKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDE-AGRYSMDVEYGQYSVILLVEGF 67
+ D TG+P N A + + +L D A +Y + G+Y + +
Sbjct: 928 VADKTGEPNHNELTLFDASKAQRDHLNVSLVGNTVDLGAWKYKLRNVNGRYDL------Y 981

Query: 68 PPSHAGTITVYEDSRPGTLNDFLGAMTEDDARPEALRRFELMV-------------EEVA 114
P + + T N+ + + E + R + E VA
Sbjct: 982 NPEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVA 1041

Query: 115 RNASAVAQNTAAAKKSASDASTSAREAATRATDAAGSARAASTSAGQAASSAQSASSSAG 174
N+ ++ ++ A++ + RE A A + + A + + ++ ++
Sbjct: 1042 ENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETK 1101

Query: 175 TASTKATEASKSAAAAESSKSAAATSAGAAKTSEMNAAASQKSAATSASTATTKASEAAT 234
+T E ++ + TS + K + Q A +
Sbjct: 1102 ETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQ 1161

Query: 235 SARDASASKVAAKSS-------ETSAASSAG-----------SAASSATAAGNSAKAAKT 276
+ A + A ++S S + G A + T S+ K
Sbjct: 1162 TNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKN 1221

Query: 277 SETNADNSAQAAADSQTASANSATAAKKSE-----TNAKNSEAAAK 317
+ S + T S+N + + TNA S+A AK
Sbjct: 1222 RHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTNAVLSDARAK 1267


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits         Score  E-value  Comments
CMJKDNLE_01164  LUXSPROTEIN  31     0.002    Bacterial autoinducer-2 (AI-2) production protein Lu...
		>LUXSPROTEIN#Bacterial autoinducer-2 (AI-2) production protein LuxS signature.

Length = 171

Score = 31.4 bits (71), Expect = 0.002
Identities = 18/66 (27%), Positives = 30/66 (45%), Gaps = 7/66 (10%)

Query: 37 TKEHLLPHFL-EHLGNNHLDI------GVGTGFYLTHVPESSLISLMDLNEASLNAASTR 89
T EHL F+ HL + ++I G TGFY++ + S + D A++
Sbjct: 54 TLEHLYAGFMRNHLNGDSVEIIDISPMGCRTGFYMSLIGTPSEQQVADAWIAAMEDVLKV 113

Query: 90 AGESKI 95
++KI
Sbjct: 114 ENQNKI 119


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
CMJKDNLE_01167  PF05272  30     0.017    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.0 bits (67), Expect = 0.017
Identities = 10/36 (27%), Positives = 19/36 (52%), Gaps = 1/36 (2%)

Query: 46 LTLLGPSGCGKTTVLRLIAGLE-TVDSGRIMLDNED 80
+ L G G GK+T++ + GL+ D+ + +D
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKD 634


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
CMJKDNLE_01170  PF06580  29     0.048    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 28.7 bits (64), Expect = 0.048
Identities = 11/69 (15%), Positives = 22/69 (31%), Gaps = 20/69 (28%)

Query: 389 NACKYCLE------FVEISARQTDEHLYIVVEDDGPGIPLSKREVIFDRGQRVDTLRPGQ 442
N K+ + + + + + + + VE+ G + +E
Sbjct: 266 NGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKE--------------ST 311

Query: 443 GVGLAVARE 451
G GL RE
Sbjct: 312 GTGLQNVRE 320


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
CMJKDNLE_01171  HTHFIS  87     6e-22    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 86.8 bits (215), Expect = 6e-22
Identities = 31/124 (25%), Positives = 62/124 (50%)

Query: 2 RVLVVEDNALLRHHLKVQIQDAGHQVDDAEDAKEADYYLNEHIPDIAIVDLGLPDEDGLS 61
+LV +D+A +R L + AG+ V +A ++ D+ + D+ +PDE+
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 LIRRWRSNDVSLPILVLTARESWQDKVEVLSAGADDYVTKPFHIEEVMARMQALMRRNSG 121
L+ R + LP+LV++A+ ++ ++ GA DY+ KPF + E++ + +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 122 LASQ 125
S+
Sbjct: 125 RPSK 128


78    CMJKDNLE_01217    CMJKDNLE_01221    N        Y        N        Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
CMJKDNLE_01217  0       16       -1.287997  dihydroxyacetone kinase subunit M
CMJKDNLE_01218  -2      14       -1.139349  dihydroxyacetone kinase subunit L
CMJKDNLE_01219  -1      15       -1.445584  dihydroxyacetone kinase subunit K
CMJKDNLE_01220  -1      16       -1.738837  DhaR DNA-binding transcriptional dual regulator
CMJKDNLE_01221  1       16       -0.928220  putative adhesion and penetration protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01217  PHPHTRNFRASE  143    3e-39    Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 143 bits (361), Expect = 3e-39
Identities = 62/206 (30%), Positives = 102/206 (49%), Gaps = 1/206 (0%)

Query: 258 GKAFYYQPVLCTVQAKSTLTVEEEQDRLRQAIDFTLLDLMTLTAKAEASGLDDIAAIFSG 317
KAF + ++ S V E ++L A++ + +L + + EAS D A IF+
Sbjct: 17 AKAFIHLEPNVDIEKTSITDVSTEIEKLTAALEKSKEELRAIKDQTEASMGADKAEIFAA 76

Query: 318 HHTLLDDPELLAAASELLQHEHCTAEYAWQQVLKELSQQYQQLDDEYLQARYIDVDDLLH 377
H +LDDPEL+ +++E AEYA ++V ++ +D+EY++ R D+ D+
Sbjct: 77 HLLVLDDPELVDGIKGKIENEQMNAEYALKEVSDMFVSMFESMDNEYMKERAADIRDVSK 136

Query: 378 RTLVHLT-QTKEELPQFNSPTILLAENIYPSTVLQLDPAVVKGICLSAGSPVSHSALIAR 436
R L HL L T+++AE++ PS QL+ VKG G SHSA+++R
Sbjct: 137 RVLGHLIGVETGSLATIAEETVIIAEDLTPSDTAQLNKQFVKGFATDIGGRTSHSAIMSR 196

Query: 437 ELGIGWICQQGEKLYAIQPEETLTLD 462
L I + E IQ + + +D
Sbjct: 197 SLEIPAVVGTKEVTEKIQHGDMVIVD 222


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits         Score  E-value  Comments
CMJKDNLE_01218  adhesinmafb  28     0.019    Neisseria meningitidis: adhesin MafB signature.
		>adhesinmafb#Neisseria meningitidis: adhesin MafB signature.

Length = 467

Score = 28.5 bits (63), Expect = 0.019
Identities = 10/47 (21%), Positives = 26/47 (55%)

Query: 138 VESLRQSSEQNLSVPVALEAASSIAESAAQSTITMQARKGRASYLGE 184
E++ + ++N + +EA ++A +A + + A+ G+A+ G+
Sbjct: 293 REAVDRWIQENPNAAETVEAVFNVAAAAKVAKLAKAAKPGKAAVSGD 339


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
CMJKDNLE_01220  HTHFIS  244    6e-76    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 244 bits (625), Expect = 6e-76
Identities = 91/363 (25%), Positives = 155/363 (42%), Gaps = 33/363 (9%)

Query: 308 QMRQLMTSQLGKVSHTFAHMPQDDPQTRRLIHFGRQAARSSFPVLLCGEEGVGKALLSQA 367
+ S+L S + + + + ++ +++ GE G GK L+++A
Sbjct: 120 AEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARA 179

Query: 368 IHNESERAAGPYIAVNCELYGDAALAEEFIG---GDRTDNENGRLSRLELAHGGTLFLEK 424
+H+ +R GP++A+N + E G G T + R E A GGTLFL++
Sbjct: 180 LHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDE 239

Query: 425 IEYLAVELQSALLQVIKQGVITRLDARRLIPIDVKVIATTTADLAMLVEQNRFSRQLYYA 484
I + ++ Q+ LL+V++QG T + R I DV+++A T DL + Q F LYY
Sbjct: 240 IGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYR 299

Query: 485 LHAFEITIPPLRMRRGSIPALVNNKLRSLEKRFSTRLKIDDDALARLVSCAWPGNDFELY 544
L+ + +PPLR R IP LV + ++ EK + D +AL + + WPGN EL
Sbjct: 300 LNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLDVKRFDQEALELMKAHPWPGNVRELE 359

Query: 545 SVIENLALSSDNGRIRVSDLPEHLFTEQATDDVSATRLSTS------------------- 585
+++ L I + L +E + +
Sbjct: 360 NLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASF 419

Query: 586 -----------LSFAEVEKEAIINAAQVTGGRIQEMSALLGIGRTTLWRKMKQHGIDAGQ 634
AE+E I+ A T G + + LLG+ R TL +K+++ G+ +
Sbjct: 420 GDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVSVYR 479

Query: 635 FKR 637
R
Sbjct: 480 SSR 482


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01221  PRTACTNFAMLY  214    7e-60    Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 214 bits (546), Expect = 7e-60
Identities = 242/979 (24%), Positives = 400/979 (40%), Gaps = 115/979 (11%)

Query: 14 RLAELKIRSPSIQLIKFGAIGLNAIIFSPLLIAADTGSQYGTNITINDGDRI---TGDTA 70
+ A L+ + ++ L GA ++ I Q+G +I +D + +G T
Sbjct: 10 KAAPLRRTTLAMALGALGAAPAAHADWNNQSIVKTGERQHGIHIQGSDPGGVRTASGTTI 69

Query: 71 DPSGN-LYGVMTPAGNTPGNINLGNDVTVN---VNDASGYAKGIIIQGKNSSLTANRLTV 126
SG G++ N + N + ++D + K L A+ T+
Sbjct: 70 KVSGRQAQGILLE--NPAAELQFRNGSVTSSGQLSDDGIRRFLGTVTVKAGKLVADHATL 127

Query: 127 DVVGQT---SAIGINLIGDYTHADLGTGSTIKSNDDGIIIGHSSTLTATQFTIENSNGIG 183
VG T I + + G+ A + ST++ G+ I + +T + I + G+
Sbjct: 128 ANVGDTWDDDGIALYVAGEQAQASIAD-STLQGAG-GVQIERGANVTVQRSAIVD-GGLH 184

Query: 184 LTINDYGTSVDLGSGSKITTDGS-TGVYIGGLNGNNANGAARFTATDLTID---VQGYSA 239
+ DL + D + T V G + A++LT+D + G A
Sbjct: 185 IGALQSLQPEDLPPSRVVLRDTNVTAVPASGAPA----AVSVLGASELTLDGGHITGGRA 240

Query: 240 MGINVQKNSVVDLGTNSTIKTNGDNAHGLWSFGQVSANAL-------TVDVTGAAANGVE 292
G+ + +VV L +TI+ A G G V A+ GV+
Sbjct: 241 AGVAAMQGAVVHL-QRATIRRGDAPAGGAVPGGAVPGGAVPGGFGPGGFGPVLDGWYGVD 299

Query: 293 VRGGTTTIGADSHISSAQGGGLVTSGSDAIINFTGTAAQRNSIFSGGSYGASAQTATAVV 352
V G + + A S + + + G + G A + +G + +G +T A
Sbjct: 300 VSGSSVEL-AQSIVEAPELGAAIRVGRGARVTVSGGSLS-------APHGNVIETGGARR 351

Query: 353 NM-QNTDITVD-RNGSLALGLWALSGGRITGDSLAITGAAGARGIYAMTNSQIDLTSDLV 410
Q +++ + G+ A G L L +TG A A+G T + +
Sbjct: 352 FAPQAAPLSITLQAGAHAQGKALLYRVLPEPVKLTLTGGADAQGDIVATELPSIPGTSI- 410

Query: 411 IDMSTPDQMAIATQHDDGYATSRINASGRMLINGSVLSKGGLINLDMHPGSVWTGSSLSD 470
P +A+A+ + WTG++
Sbjct: 411 ----GPLDVALAS------------------------------------QARWTGAT--R 428

Query: 471 NVNGGKLDVAMNNSVWNVTSNSNLDTLAL-SHSTVDFASHGSTAGTFATLNVENLSGNST 529
V+ +D N+ W +T NSN+ L L S +VDF + AG F L V L+G+
Sbjct: 429 AVDSLSID----NATWVMTDNSNVGALRLASDGSVDFQQ-PAEAGRFKVLTVNTLAGSGL 483

Query: 530 FIMRADVVGEGNGVNNKGDLLNISGSSAGNHVLAIRNQGSEATTGNEVLTVVKTTDGAAS 589
F M D L + ++G H L +RN GSE + N +L V AA+
Sbjct: 484 FRMNV------FADLGLSDKLVVMQDASGQHRLWVRNSGSEPASANTLLLVQTPLGSAAT 537

Query: 590 FSASS---QVELGGYLYDVRKNG-TNWELYASGTVPEPTPNPEPTPAPAQPPIVNPD-PT 644
F+ ++ +V++G Y Y + NG W L + P P P P+P P P QPP P+ P
Sbjct: 538 FTLANKDGKVDIGTYRYRLAANGNGQWSLVGAKAPPAPKPAPQPGPQPPQPPQPQPEAPA 597

Query: 645 PEPAPTPKPTTTADAGGNYLNVGYL--LNYVENRTLMQRMGDLRNQSKDGNIWLRSYG-- 700
P+P + + A+A N VG L Y E+ L +R+G+LR G W R +
Sbjct: 598 PQPPAGRELSAAANAAVNTGGVGLASTLWYAESNALSKRLGELRLNPDAGGAWGRGFAQR 657

Query: 701 GSLDSFASGKLSGFDMGYSGIQFGGDKRLSDVM-PLYVGLYIGSTHASPDYSG-GDGTAR 758
LD+ A + FD +G + G D ++ ++G G T ++G G G
Sbjct: 658 QQLDNRAGRR---FDQKVAGFELGADHAVAVAGGRWHLGGLAGYTRGDRGFTGDGGGHTD 714

Query: 759 SDYMGMYASYMAQNGFYSDLVIKASRQKNSFHVLDSQNNGVNANGTANGMSISLEAGQRF 818
S ++G YA+Y+A +GFY D ++ASR +N F V S V +G+ SLEAG+RF
Sbjct: 715 SVHVGGYATYIADSGFYLDATLRASRLENDFKVAGSDGYAVKGKYRTHGVGASLEAGRRF 774

Query: 819 NLSPTGYGFYIEPQTQLTYSHQNEMTMKASNGLNIHLNHYESLLGRASMILGYDIT-AGN 877
+ G+++EPQ +L +A+NGL + S+LGR + +G I AG
Sbjct: 775 THAD---GWFLEPQAELAVFRAGGGAYRAANGLRVRDEGGSSVLGRLGLEVGKRIELAGG 831

Query: 878 SQLNVYVKTGAIREFSGDTEYLLNNSREKYSFKGNGWNNGVGVSAQYNKQHTFYLEADYT 937
Q+ Y+K ++EF G N + +G G+G++A + H+ Y +Y+
Sbjct: 832 RQVQPYIKASVLQEFDGAGTVHTNGIAHRTELRGTRAELGLGMAAALGRGHSLYASYEYS 891

Query: 938 QGNLFDQK-QVNGGYRFSF 955
+G + GYR+S+
Sbjct: 892 KGPKLAMPWTFHAGYRYSW 910


79    CMJKDNLE_01245    CMJKDNLE_01249    N        Y        N        Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
CMJKDNLE_01245  -1      11       1.229840   putative invasin
CMJKDNLE_01246  0       18       1.716626   FimZ transcriptional regulator
CMJKDNLE_01247  -1      19       2.055094   Nitrate/nitrite sensor protein NarX
CMJKDNLE_01248  0       25       2.355630   hypothetical protein
CMJKDNLE_01249  0       23       2.067869   nitrate:nitrite antiporter NarK
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
CMJKDNLE_01245  INTIMIN  257    5e-79    Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 257 bits (658), Expect = 5e-79
Identities = 120/378 (31%), Positives = 197/378 (52%), Gaps = 21/378 (5%)

Query: 79 GEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDN 138
G+ AK ALG + Q + +++WL +G A V+++ N F GS + +P D+
Sbjct: 184 GDYAKDTALGIAGN----QASSQLQAWLQHYGTAEVNLQSGNN--FDGSSLDFLLPFYDS 237

Query: 139 DRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWG 198
++ L + Q+G D+ +N+G GQR+ ++GYN F D + R G G E W
Sbjct: 238 EKMLAFGQVGARYIDSRFTANLGAGQRFFLPENMLGYNVFIDQDFSGDNTRLGIGGEYWR 297

Query: 199 EYLRLSANFYQPFAAWHE--QTATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDR 256
+Y + S N Y + WHE ++R A G+D+ +P Y L + EQY+GD
Sbjct: 298 DYFKSSVNGYFRMSGWHESYNKKDYDERPANGFDIRFNGYLPSYPALGAKLMYEQYYGDN 357

Query: 257 VDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKK 316
V LFNS NP A ++G+NYTP+PLVT+ ++ G EN + Y+F P +
Sbjct: 358 VALFNSDKLQSNPGAATVGVNYTPIPLVTMGIDYRHGTGNENDLLYSMQFRYQFDKPWSQ 417

Query: 317 QLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQI 376
Q+ V E ++L GSRYD QRNN LEY+++ L++ + + T ++L +
Sbjct: 418 QIEPQYVNELRTLSGSRYDLVQRNNNIILEYKKQDILSLNI-PHDINGTERSTQKIQLIV 476

Query: 377 RSRYGIRQLIWQGDTQILS-----LTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVV 431
+S+YG+ +++W D+ + S G+Q SA+ + I+P + +G SN ++++
Sbjct: 477 KSKYGLDRIVWD-DSALRSQGGQIQHSGSQ--SAQDYQAILPAYV--QGGSNVYKVTARA 531

Query: 432 EDNQGQRVSSNEITLTLV 449
D G SSN + LT+
Sbjct: 532 YDRNGN--SSNNVLLTIT 547


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
CMJKDNLE_01246HTHFIS742e-17 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 73.7 bits (181), Expect = 2e-17
Identities = 32/117 (27%), Positives = 56/117 (47%), Gaps = 2/117 (1%)

Query: 7 ATILLIDDHPMLRTGVKQLISMAPDITVVGEASNGEQGIELAESLDPDLILLDLNMPGMN 66
ATIL+ DD +RT + Q +S A + SN + D DL++ D+ MP N
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRI--TSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 67 GLETLDKLREKSLSGRIVVFSVSNHEEDVVTALKRGADGYLLKDMEPEDLLKALHQA 123
+ L ++++ ++V S N + A ++GA YL K + +L+ + +A
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
CMJKDNLE_01247  PF06580  53     1e-09    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 53.3 bits (128), Expect = 1e-09
Identities = 36/172 (20%), Positives = 73/172 (42%), Gaps = 23/172 (13%)

Query: 424 PESSRELLSQIRNELNASWAQLRELLTTFRLQLTEPGLRPALEASCEEYSAKFGFPVKLD 483
P +RE+L+ + + S + +LT +++ + S +F ++ +
Sbjct: 190 PTKAREMLTSLSELMRYSLRYSNARQVSLADELT------VVDSYLQLASIQFEDRLQFE 243

Query: 484 YQLPPRL----VPSHQAIHLLQIAREALSNALKH-----SQASEVVVTVAQNDNQVKLTV 534
Q+ P + VP L+Q E N +KH Q ++++ +++ V L V
Sbjct: 244 NQINPAIMDVQVPPM----LVQTLVE---NGIKHGIAQLPQGGKILLKGTKDNGTVTLEV 296

Query: 535 QDNGCGVPENAIRSNHYGMIIMRDRAQSLRG-DCRVRRRESGGTEVVVTFIP 585
++ G +N S G+ +R+R Q L G + +++ E G + IP
Sbjct: 297 ENTGSLALKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01249  ACRIFLAVINRP  31     0.010    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 31.3 bits (71), Expect = 0.010
Identities = 35/166 (21%), Positives = 60/166 (36%), Gaps = 22/166 (13%)

Query: 258 IMSLLYLATFGSFIGFSAGFAMLSKTQFPDVQILQYAFFGPFIGALARSA---GGALSDR 314
I+S + L+ + I A A L K + + FFG F S ++
Sbjct: 474 IVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKI 533

Query: 315 LGGTRVTLVNFILMAIFSGLLFLTLPTD----GQGGSFMAFFAVFLALFLTAGLGSGSTF 370
LG T L+ + L+ +LFL LP+ G F+ L +G+T
Sbjct: 534 LGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTM----------IQLPAGATQ 583

Query: 371 QMISVIFRKLTMDRVKAEGGSDER-----AMREAATDTAAALGFIS 411
+ + ++T +K E + E + A + F+S
Sbjct: 584 ERTQKVLDQVTDYYLKNEKANVESVFTVNGFSFSGQAQNAGMAFVS 629


80    CMJKDNLE_01873    CMJKDNLE_01882    N        Y        N        Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
CMJKDNLE_01873  0       13       1.226756   putative DNA-binding response regulator in
CMJKDNLE_01874  0       11       1.410721   putative response regulator in two-component
CMJKDNLE_01875  -1      12       1.272270   chemotaxis protein methyltransferase
CMJKDNLE_01876  0       12       0.923041   methyl accepting chemotaxis protein - dipeptide
CMJKDNLE_01877  -1      13       0.379041   methyl accepting chemotaxis protein II -
CMJKDNLE_01879  -1      13       -0.147667  chemotaxis signaling complex - aspartate
CMJKDNLE_01880  0       16       -0.715416  Chemotaxis protein CheA
CMJKDNLE_01881  -1      17       -2.140458  MotB protein, enables flagellar motor rotation,
CMJKDNLE_01882  -1      17       -2.705798  MotA protein, proton conductor component of
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
CMJKDNLE_01873  HTHFIS  90     4e-24    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 89.5 bits (222), Expect = 4e-24
Identities = 30/105 (28%), Positives = 51/105 (48%), Gaps = 3/105 (2%)

Query: 7 KFLVVDDFSTMRRIVRNLLKELGFNNVEEAEDGVDALNKLQAGGYGFVISDWNMPNMDGL 66
LV DD + +R ++ L G++ V + + AG V++D MP+ +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYD-VRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 67 ELLKTIRADGAMSALPVLMVTAEAKKENIIAAAQAGASGYVVKPF 111
+LL I+ LPVL+++A+ I A++ GA Y+ KPF
Sbjct: 64 DLLPRIKKARPD--LPVLVMSAQNTFMTAIKASEKGAYDYLPKPF 106


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
CMJKDNLE_01874  HTHFIS  65     8e-14    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 65.2 bits (159), Expect = 8e-14
Identities = 35/188 (18%), Positives = 72/188 (38%), Gaps = 23/188 (12%)

Query: 1 MSKIRVLSVDDSALMRQIMTEIINSHSDMEMVATAPDPLVARDLIKKFNPDVLTLDVEMP 60
M+ +L DD A +R ++ + ++ V + I + D++ DV MP
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAG--YDVRITSNAATLWRWIAAGDGDLVVTDVVMP 58

Query: 61 RMDGLDFLEKLMRLRPMPVVMVSSLTGKGS-EVTLRALELGAIDFVTKPQLGIREGMLAY 119
+ D L ++ + RP V+V ++ + + ++A E GA D++ KP + E +
Sbjct: 59 DENAFDLLPRIKKARPDLPVLV--MSAQNTFMTAIKASEKGAYDYLPKP-FDLTELIGII 115

Query: 120 NEMIAEKVRTAAKASLAAHKPLSAPTTLKAGPLLSSEKLIAIGASTGGTEAIRHVLQPLP 179
+AE R +K + + +G S E R + + +
Sbjct: 116 GRALAEPKRRPSKLEDDSQDGMP-----------------LVGRSAAMQEIYRVLARLMQ 158

Query: 180 LSSPALLI 187
++
Sbjct: 159 TDLTLMIT 166


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
CMJKDNLE_01880  PF06580  43     3e-06    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 42.5 bits (100), Expect = 3e-06
Identities = 23/151 (15%), Positives = 49/151 (32%), Gaps = 52/151 (34%)

Query: 361 ELDKSLIERIIDPLT--HLVRNSLDHGIELPEKRLAAGKNSVGNLILSAEHQGGNICIEV 418
+++ ++++ + P+ LV N + HGI G ++L G + +EV
Sbjct: 245 QINPAIMDVQVPPMLVQTLVENGIKHGIA--------QLPQGGKILLKGTKDNGTVTLEV 296

Query: 419 TDDGAGLNRERILAKAASQGLTVSENMSDDEVAMLIFAPGFSTAEQVTDVSGRGVGMDVV 478
+ G+ + G G+ V
Sbjct: 297 ENTGSLALKNTK--------------------------------------ESTGTGLQNV 318

Query: 479 KRNIQEMGG---HVEIQSKQGTGTTIRILLP 506
+ +Q + G +++ KQG +L+P
Sbjct: 319 RERLQMLYGTEAQIKLSEKQG-KVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
CMJKDNLE_01881  PF05272  30     0.010    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.4 bits (68), Expect = 0.010
Identities = 22/93 (23%), Positives = 35/93 (37%), Gaps = 11/93 (11%)

Query: 46 LISISSPKELIQIAEYFRTPLATAVTGGDRISNSESPIPGGGDDYTQSQGEVNKQPNIEE 105
L +SSP A P + G + ++ PGGGDD GE +++
Sbjct: 384 LADVSSPTAAAGGAGGGEPPKKRDPSAG---AGTDPGGPGGGDD-----GEDPFGEWLDD 435

Query: 106 LKKRM---EQSRLRKLRGDLDQLIESDPKLRAL 135
R+ + L+ R L + + S P L
Sbjct: 436 EVARLRLRGRWLLKPRRAALIEALRSAPALAGC 468


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
CMJKDNLE_01882  PF05844  33     0.001    YopD protein
		>PF05844#YopD protein

Length = 295

Score = 33.1 bits (75), Expect = 0.001
Identities = 12/28 (42%), Positives = 22/28 (78%), Gaps = 2/28 (7%)

Query: 76 MDLLALLYRLMAKSRQMGMFSLERDIEN 103
++LL +L+R+ K+R++G+ L+RD EN
Sbjct: 74 VELLLILFRIAQKARELGV--LQRDNEN 99


81  CMJKDNLE_01912  CMJKDNLE_01937  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
CMJKDNLE_01912  -1      11       -1.789176  flagellar biosynthesis; flagellin, filament
CMJKDNLE_01913  -2      16       0.374323   flagellar cap protein FliD; filament capping
CMJKDNLE_01914  -1      13       0.163600   flagellar biosynthesis protein FliS
CMJKDNLE_01915  -1      12       0.393789   flagellar biosynthesis protein FliT
CMJKDNLE_01916  0       13       -0.438062  alpha-amylase
CMJKDNLE_01917  0       20       -3.615725  hypothetical protein
CMJKDNLE_01918  0       19       -4.088600  putative inner membrane protein
CMJKDNLE_01919  -2      21       -3.408535  hypothetical protein
CMJKDNLE_01920  0       15       -1.739659  hypothetical protein
CMJKDNLE_01921  1       15       -0.977266  hypothetical protein
CMJKDNLE_01922  1       14       0.072238   hypothetical protein
CMJKDNLE_01923  0       16       4.378306   flagellar basal-body protein FliE
CMJKDNLE_01924  1       16       4.228204   flagellar M-ring protein FliF; basal-body
CMJKDNLE_01925  3       18       4.428672   flagellar motor switch protein FliG
CMJKDNLE_01926  2       16       4.158658   flagellar biosynthesis protein FliH
CMJKDNLE_01927  -1      17       3.273808   flagellum-specific ATP synthase FliI
CMJKDNLE_01928  -2      16       3.149904   flagellum-specific ATP synthase FliI
CMJKDNLE_01929  0       16       2.221814   flagellar biosynthesis protein FliJ
CMJKDNLE_01930  -1      16       2.242372   flagellar hook-length control protein FliK
CMJKDNLE_01931  -2      21       1.740401   flagellar biosynthesis
CMJKDNLE_01932  0       16       0.365025   flagellar motor switch protein FliM
CMJKDNLE_01933  1       16       -2.600077  flagellar motor switch protein FliN
CMJKDNLE_01934  1       17       -3.308043  flagellar biosynthesis protein FliO
CMJKDNLE_01935  1       19       -4.222553  flagellar biosynthesis protein FliP
CMJKDNLE_01936  1       20       -4.435319  flagellar biosynthesis protein FliQ
CMJKDNLE_01937  -2      16       -2.903387  flagellar biosynthesis protein FliR
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits       Score  E-value  Comments
CMJKDNLE_01912  FLAGELLIN  163    1e-45    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 163 bits (413), Expect = 1e-45
Identities = 177/418 (42%), Positives = 223/418 (53%), Gaps = 6/418 (1%)

Query: 2 AQVINTNSLSLITQNNINKNQSALSSSIERLSSGLRINSAKDDAAGQAIANRFTSNIKGL 61
AQVINTNSLSL+TQNN+NK+QS+LSS+IERLSSGLRINSAKDDAAGQAIANRFTSNIKGL
Sbjct: 1 AQVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGL 60

Query: 62 TQAARNANDGISVAQTTEGALSEINNNLQRIRELTVQATTGTNSDSDLDSIQDEIKSRLD 121
TQA+RNANDGIS+AQTTEGAL+EINNNLQR+REL+VQAT GTNSDSDL SIQDEI+ RL+
Sbjct: 61 TQASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLE 120

Query: 122 EIDRVSGQTQFNGVNVLSKDGSMKIQVGANDGETITIDLKKIDSDTLNLAGFNVNGKGSV 181
EIDRVS QTQFNGV VLS+D MKIQVGANDGETITIDL+KID +L L GFNVNG
Sbjct: 121 EIDRVSNQTQFNGVKVLSQDNQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKEA 180

Query: 182 ANTAATSDDLKLAGFTKGTTDTNGVTAYTNTISNDKAKASDLLANITDGSVITGGGANAF 241
S + G+ N +++ + D +
Sbjct: 181 TVGDLKSSFKNVTGYDTYAVGANKYRV---DVNSGAVVTDTTAPTVPDKVYVNAANGQLT 237

Query: 242 GVAAKNGYTYDAASKSYSFAADGADSAKTLSIINPNTGDSSQATVTIGGKEQKVNISQDG 301
A+N D + S A A +I GD+ + K +G
Sbjct: 238 TDDAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNG 297

Query: 302 KITAADDNATLYL---DKQGNLTKTNAGNDTAATWDGLISNSDSTGAVPVGVATTITITS 358
K++ + + L D +A ++ + + ++
Sbjct: 298 KVSTTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSD 357

Query: 359 GTASGMSVQSAGAGIQTSTNSQILAGGAFAAKVSIEGGAATDILVASNGNITAADGNA 416
A+ + + + + AG T V++ N AA
Sbjct: 358 LEANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAAKK 415



Score = 92.8 bits (230), Expect = 7e-22
Identities = 91/332 (27%), Positives = 125/332 (37%)

Query: 338 SNSDSTGAVPVGVATTITITSGTASGMSVQSAGAGIQTSTNSQILAGGAFAAKVSIEGGA 397
++T +T A G + V+ G
Sbjct: 176 GPKEATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQ 235

Query: 398 ATDILVASNGNITAADGNALYLDATTGGFTTTAGGNTAASLDNLIANSKDATLTVTSGTG 457
T +N + A T T G
Sbjct: 236 LTTDDAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDG 295

Query: 458 QNTVYSTTGSGAQFTSLAKVDTVNVTNAHVSAEGMANLTKSNFTIDMGGTGTVTYTVSNG 517
V +T ++A + + + N+ S +
Sbjct: 296 NGKVSTTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKL 355

Query: 518 DVKAAANADVYVEDGALSANATKDVTYFEQKNGAITNSTGGTIYETADGKLTTEATTASS 577
A NA ++ ++ A + +A A
Sbjct: 356 SDLEANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAAKK 415

Query: 578 STADPLKALDEAISSIDKFRSSLGAVQNRLDSAVTNLNNTTTNLSEAQSRIQDADYATEV 637
STA+PL ++D A+S +D RSSLGA+QNR DSA+TNL NT TNL+ A+SRI+DADYATEV
Sbjct: 416 STANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATEV 475

Query: 638 SNMSKAQIIQQAGNSVLAKANQVPQQVLSLLQ 669
SNMSKAQI+QQAG SVLA+ANQVPQ VLSLL+
Sbjct: 476 SNMSKAQILQQAGTSVLAQANQVPQNVLSLLR 507


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01913  TYPE3OMBPROT  33     0.003    Type III secretion system outer membrane B protein ...
		>TYPE3OMBPROT#Type III secretion system outer membrane B protein

family signature.
Length = 538

Score = 32.7 bits (74), Expect = 0.003
Identities = 24/72 (33%), Positives = 37/72 (51%), Gaps = 2/72 (2%)

Query: 214 NGMEVSVAAQNAQLTVNNVAIENSSNTISDALENITLNLNDVTTGNQTLTITQDTSKAQT 273
N E +VAA+N + + A+ + +S AL T++L V+T LT T T ++
Sbjct: 236 NSSERAVAARNKAEELVSAALYSRPELLSQALSGKTVDLKIVSTS--LLTPTSLTGGEES 293

Query: 274 AIKDWVNAYNSL 285
+KD VNA L
Sbjct: 294 MLKDQVNALKGL 305


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits       Score  E-value  Comments
CMJKDNLE_01918  RTXTOXIND  30     0.017    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 30.2 bits (68), Expect = 0.017
Identities = 10/57 (17%), Positives = 17/57 (29%), Gaps = 2/57 (3%)

Query: 164 RFTLLPIFRIPVKMQKVSAASPLTQKPDQARRRF--RLGMLVFFGMLGWALLTAMNQ 218
R L R + + + A L + P R R M ++L +
Sbjct: 26 RKQLDTPVREKDENEFLPAHLELIETPVSRRPRLVAYFIMGFLVIAFILSVLGQVEI 82


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
CMJKDNLE_01919  PF01206  93     6e-29    SirA family protein
		>PF01206#SirA family protein

Length = 76

Score = 92.5 bits (230), Expect = 6e-29
Identities = 16/71 (22%), Positives = 37/71 (52%)

Query: 7 DYRLDMVGEPCPYPAVATLEAMPQLKKGEILEVVSDCPQSINNIPLDARNHGYTVLDIQQ 66
D LD G CP P + + + + GE+L V++ P S+ + ++ G+ +L+ ++
Sbjct: 5 DQSLDATGLNCPLPILKAKKTLATMNAGEVLYVMATDPGSVKDFESFSKQTGHELLEQKE 64

Query: 67 DGPTIRYLIQK 77
+ T + +++
Sbjct: 65 EDGTYHFRLKR 75


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits         Score  E-value  Comments
CMJKDNLE_01923  FLGHOOKFLIE  117    5e-38    Flagellar hook-basal body complex protein FliE signa...
		>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE

signature.
Length = 103

Score = 117 bits (294), Expect = 5e-38
Identities = 103/103 (100%), Positives = 103/103 (100%)

Query: 2 SAIQGIEGVISQLQATAMSARAQESLPQPTISFAGQLHAALDRISDTQTAARTQAEKFTL 61
SAIQGIEGVISQLQATAMSARAQESLPQPTISFAGQLHAALDRISDTQTAARTQAEKFTL
Sbjct: 1 SAIQGIEGVISQLQATAMSARAQESLPQPTISFAGQLHAALDRISDTQTAARTQAEKFTL 60

Query: 62 GEPGVALNDVMTDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV 104
GEPGVALNDVMTDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV
Sbjct: 61 GEPGVALNDVMTDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV 103
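
The Query/Sbjct coordinate lines of an alignment block like the one above can be read mechanically to recover the aligned ranges. The following is a minimal sketch, assuming each coordinate line has the shape `Label: start SEQUENCE end` as it appears in this dump; the name `aligned_ranges` is hypothetical and is not part of PredictBias or BLAST.

```python
import re

# Assumed line shape: "Query: 62 GEPGV...MQV 104" or "Sbjct: 61 GEPGV...MQV 103".
# Match lines (the unlabeled middle line of each triple) are skipped.
COORD_RE = re.compile(r"^(Query|Sbjct):\s*(\d+)\s+(\S+)\s+(\d+)\s*$")


def aligned_ranges(lines):
    """Return {'Query': (first, last), 'Sbjct': (first, last)} for one HSP."""
    ranges = {}
    for line in lines:
        match = COORD_RE.match(line.strip())
        if match is None:
            continue  # match line, blank line, or header
        label = match.group(1)
        start, end = int(match.group(2)), int(match.group(4))
        lo, hi = ranges.get(label, (start, end))
        ranges[label] = (min(lo, start), max(hi, end))
    return ranges


# Coordinate lines copied from the FliE alignment above.
FLIE_LINES = [
    "Query: 2 SAIQGIEGVISQLQATAMSARAQESLPQPTISFAGQLHAALDRISDTQTAARTQAEKFTL 61",
    "Sbjct: 1 SAIQGIEGVISQLQATAMSARAQESLPQPTISFAGQLHAALDRISDTQTAARTQAEKFTL 60",
    "Query: 62 GEPGVALNDVMTDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV 104",
    "Sbjct: 61 GEPGVALNDVMTDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV 103",
]
FLIE_RANGES = aligned_ranges(FLIE_LINES)
```

For the FliE hit, the subject range spans positions 1 to 103 of a signature of length 103, so the alignment covers the whole signature. The function expects the lines of a single HSP; a hit with several HSPs (each introduced by its own `Score =` line) would need to be split first.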


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01924  FLGMRINGFLIF  756    0.0      Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 756 bits (1952), Expect = 0.0
Identities = 478/555 (86%), Positives = 514/555 (92%), Gaps = 5/555 (0%)

Query: 3 ATAAQTKSLEWLNRLRANPKIPLIVAGSAAVAVMVALILWAKAPDYRTLFSNLSDQDGGA 62
+TA Q K LEWLNRLRANP+IPLIVAGSAAVA++VA++LWAK PDYRTLFSNLSDQDGGA
Sbjct: 5 STATQPKPLEWLNRLRANPRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQDGGA 64

Query: 63 IVSQLTQMNIPYRFSEASGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQ 122
IV+QLTQMNIPYRF+ SGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQ
Sbjct: 65 IVAQLTQMNIPYRFANGSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQ 124

Query: 123 FSEQVNYQRALEGELSRTIETIGPVKGARVHLAMPKPSLFVREQKSPSASVTVNLLPGRA 182
FSEQVNYQRALEGEL+RTIET+GPVK ARVHLAMPKPSLFVREQKSPSASVTV L PGRA
Sbjct: 125 FSEQVNYQRALEGELARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRA 184

Query: 183 LDEGQISAIVHLVSSAVAGLPPGNVTLVDQGGHLLTQFNTSGRDLNDAQLKYASDVEGRI 242
LDEGQISA+VHLVSSAVAGLPPGNVTLVDQ GHLLTQ NTSGRDLNDAQLK+A+DVE RI
Sbjct: 185 LDEGQISAVVHLVSSAVAGLPPGNVTLVDQSGHLLTQSNTSGRDLNDAQLKFANDVESRI 244

Query: 243 QRRIEAILSPIVGNGNIHAQVTAQLDFASKEQTEEQYRPNGDESHAALRSRQLNESEQSG 302
QRRIEAILSPIVGNGN+HAQVTAQLDFA+KEQTEE Y PNGD S A LRSRQLN SEQ G
Sbjct: 245 QRRIEAILSPIVGNGNVHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNISEQVG 304

Query: 303 SGYPGGVPGALSNQPAPANNAPISTPPANQNNRQQ--QASTTSNS---GPRSTQRNETSN 357
+GYPGGVPGALSNQPAP N API+TPP NQ N Q Q ST++NS GPRSTQRNETSN
Sbjct: 305 AGYPGGVPGALSNQPAPPNEAPIATPPTNQQNAQNTPQTSTSTNSNSAGPRSTQRNETSN 364

Query: 358 YEVDRTIRHTKMNVGDVQRLSVAVVVNYKTLPDGKPLPLSNEQMKQIEDLTREAMGFSEK 417
YEVDRTIRHTKMNVGD++RLSVAVVVNYKTL DGKPLPL+ +QMKQIEDLTREAMGFS+K
Sbjct: 365 YEVDRTIRHTKMNVGDIERLSVAVVVNYKTLADGKPLPLTADQMKQIEDLTREAMGFSDK 424

Query: 418 RGDSLNVVNSPFNSSDESGGELPFWQQQAFIDQLLAAGRWLLVLLVAWLLWRKAVRPQLT 477
RGD+LNVVNSPF++ D +GGELPFWQQQ+FIDQLLAAGRWLLVL+VAW+LWRKAVRPQLT
Sbjct: 425 RGDTLNVVNSPFSAVDNTGGELPFWQQQSFIDQLLAAGRWLLVLVVAWILWRKAVRPQLT 484

Query: 478 RRAEAMKAVQQQAQAREEVEDAVEVRLSKDEQLQQRRANQRLGAEVMSQRIREMSDNDPR 537
RR E KA Q+QAQ R+E E+AVEVRLSKDEQLQQRRANQRLGAEVMSQRIREMSDNDPR
Sbjct: 485 RRVEEAKAAQEQAQVRQETEEAVEVRLSKDEQLQQRRANQRLGAEVMSQRIREMSDNDPR 544

Query: 538 VVALVIRQWINNDHE 552
VVALVIRQW++NDHE
Sbjct: 545 VVALVIRQWMSNDHE 559


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01925  FLGMOTORFLIG  341    e-119    Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 341 bits (876), Expect = e-119
Identities = 117/329 (35%), Positives = 197/329 (59%), Gaps = 2/329 (0%)

Query: 1 MSNLTGTDKSVILLMTIGEDRAAEVFKHLSQREVQTLSAAMANVTQISNKQLTDVLAEFE 60
+S LTG K+ ILL++IG + +++VFK+LSQ E+++L+ +A + I+++ +VL EF+
Sbjct: 12 VSALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITSELKDNVLLEFK 71

Query: 61 QEAEQFAALNINANDYLRSVLVKALGEERAASLLEDILETRDTASGIETLNFMEPQSAAD 120
+ + DY R +L K+LG ++A ++ + L + + E + +P + +
Sbjct: 72 ELMMAQEFIQKGGIDYARELLEKSLGTQKAVDIINN-LGSALQSRPFEFVRRADPANILN 130

Query: 121 LIRDEHPQIIATILVHLKRAQAADILALFDERLRHDVMLRIATFGGVQPAALAELTEVLN 180
I+ EHPQ IA IL +L +A+ IL+ ++ +V RIA P + E+ VL
Sbjct: 131 FIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEVVREVERVLE 190

Query: 181 GLLDGQ-NLKRSKMGGVRTAAEIINLMKTQQEEAVITAVREFDGELAQKIIDEMFLFENL 239
L + + GGV EIIN+ + E+ +I ++ E D ELA++I +MF+FE++
Sbjct: 191 KKLASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEIKKKMFVFEDI 250

Query: 240 VDVDDRSIQRLLQEVDSESLLIALKGAEQPLREKFLRNMSQRAADILRDDLANRGPVRLS 299
V +DDRSIQR+L+E+D + L ALK + P++EK +NMS+RAA +L++D+ GP R
Sbjct: 251 VLLDDRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKEDMEFLGPTRRK 310

Query: 300 QVENEQKAILLIVRRLAETGEMVIGSGED 328
VE Q+ I+ ++R+L E GE+VI G +
Sbjct: 311 DVEESQQKIVSLIRKLEEQGEIVISRGGE 339


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
CMJKDNLE_01926  FLGFLIH  373    e-135    Flagellar assembly protein FliH signature.
		>FLGFLIH#Flagellar assembly protein FliH signature.

Length = 228

Score = 373 bits (959), Expect = e-135
Identities = 226/228 (99%), Positives = 228/228 (100%)

Query: 1 MSDNLPWKTWTPDDLAPPQAEFVPMVEPEETIIEEAEPSLEQQLAQLQMQAHEQGYQAGI 60
MSDNLPWKTWTPDDLAPPQAEFVP+VEPEETIIEEAEPSLEQQLAQLQMQAHEQGYQAGI
Sbjct: 1 MSDNLPWKTWTPDDLAPPQAEFVPIVEPEETIIEEAEPSLEQQLAQLQMQAHEQGYQAGI 60

Query: 61 AEGRQQGHEQGYQEGLAQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTLDALDSVIASRL 120
AEGRQQGH+QGYQEGLAQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTLDALDSVIASRL
Sbjct: 61 AEGRQQGHKQGYQEGLAQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTLDALDSVIASRL 120

Query: 121 MQMALEAARQVIGQTPTVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRVDDMLGAT 180
MQMALEAARQVIGQTPTVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRVDDMLGAT
Sbjct: 121 MQMALEAARQVIGQTPTVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRVDDMLGAT 180

Query: 181 LSLHGWRLRGDPTLHPGGCKVSADEGDLDASVATRWQELCRLAAPGVV 228
LSLHGWRLRGDPTLHPGGCKVSADEGDLDASVATRWQELCRLAAPGVV
Sbjct: 181 LSLHGWRLRGDPTLHPGGCKVSADEGDLDASVATRWQELCRLAAPGVV 228


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
CMJKDNLE_01929  FLGFLIJ  202    2e-70    Flagellar FliJ protein signature.
		>FLGFLIJ#Flagellar FliJ protein signature.

Length = 147

Score = 202 bits (515), Expect = 2e-70
Identities = 146/147 (99%), Positives = 147/147 (100%)

Query: 1 MAEHGALATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAG 60
MAEHGALATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAG
Sbjct: 1 MAEHGALATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAG 60

Query: 61 MTSNRWINYQQFIQTLEKAITQHRQQLNQWTQKVDIALNSWREKKQRLQAWQTLQERQST 120
+TSNRWINYQQFIQTLEKAITQHRQQLNQWTQKVDIALNSWREKKQRLQAWQTLQERQST
Sbjct: 61 ITSNRWINYQQFIQTLEKAITQHRQQLNQWTQKVDIALNSWREKKQRLQAWQTLQERQST 120

Query: 121 AALLAENRLDQKKMDEFAQRAAMRKPE 147
AALLAENRLDQKKMDEFAQRAAMRKPE
Sbjct: 121 AALLAENRLDQKKMDEFAQRAAMRKPE 147


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits         Score  E-value  Comments
CMJKDNLE_01930  FLGHOOKFLIK  461    e-165    Flagellar hook-length control protein signature.
		>FLGHOOKFLIK#Flagellar hook-length control protein signature.

Length = 375

Score = 461 bits (1186), Expect = e-165
Identities = 361/375 (96%), Positives = 366/375 (97%)

Query: 1 MIRLAPLITADVDTTTLPGGKASDAAQDFLALLSEALAGETTTDKAAPQLLVATDKPTTK 60
MIRLAPLITADVDTTTLPGGKASDAAQDFLALLSEALAGETTTDKAAPQLLVATDKPTTK
Sbjct: 1 MIRLAPLITADVDTTTLPGGKASDAAQDFLALLSEALAGETTTDKAAPQLLVATDKPTTK 60

Query: 61 GEPLVSEILADAQQADLLIPVDETPPVINDEQSTSTPLTTAQTMTLAAVAGNNTAKDEKA 120
GEPL+S+I++DAQQA+LLIPVDETPPVINDEQSTSTPLTTAQTM LAAVA NT KDEKA
Sbjct: 61 GEPLISDIVSDAQQANLLIPVDETPPVINDEQSTSTPLTTAQTMALAAVADKNTTKDEKA 120

Query: 121 DDLNEDVTASLSALFAMLPGFDNTPKVTDVPSTVLPAEKPTLFTKLTSAQLTTAQPDDAP 180
DDLNEDVTASLSALFAMLPGFDNTPKVTD PSTVLP EKPTLFTKLTS QLTTAQPDDAP
Sbjct: 121 DDLNEDVTASLSALFAMLPGFDNTPKVTDAPSTVLPTEKPTLFTKLTSEQLTTAQPDDAP 180

Query: 181 GTPAQPLTPLVAEAQSKAEVISTPSPVTAAASPLITPHQTQPLPTVAAPVLSAPLGSHEW 240
GTPAQPLTPLVAEAQSKAEVISTPSPVTAAASPLITPHQTQPLPTVAAPVLSAPLGSHEW
Sbjct: 181 GTPAQPLTPLVAEAQSKAEVISTPSPVTAAASPLITPHQTQPLPTVAAPVLSAPLGSHEW 240

Query: 241 QQSLSQHISLFTRQGQQSAELRLHPQDLGEVQISLKVDDNQAQIQMVSPHQHVRAALEAA 300
QQSLSQHISLFTRQGQQSAELRLHPQDLGEVQISLKVDDNQAQIQMVSPHQHVRAALEAA
Sbjct: 241 QQSLSQHISLFTRQGQQSAELRLHPQDLGEVQISLKVDDNQAQIQMVSPHQHVRAALEAA 300

Query: 301 LPVLRTQLAESGIQLGQSNISGESFSGQQQAASQQQQSQRTANHEPLAGEDDDTLPVPVS 360
LPVLRTQLAESGIQLGQSNISGESFSGQQQAASQQQQSQRTANHEPLAGEDDDTLPVPVS
Sbjct: 301 LPVLRTQLAESGIQLGQSNISGESFSGQQQAASQQQQSQRTANHEPLAGEDDDTLPVPVS 360

Query: 361 LQVRVTGNSSVDIFA 375
LQ RVTGNS VDIFA
Sbjct: 361 LQGRVTGNSGVDIFA 375


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01932  FLGMOTORFLIM  381    e-135    Flagellar motor switch protein FliM signature.
		>FLGMOTORFLIM#Flagellar motor switch protein FliM signature.

Length = 344

Score = 381 bits (979), Expect = e-135
Identities = 85/324 (26%), Positives = 148/324 (45%), Gaps = 10/324 (3%)

Query: 5 ILSQAEIDALLNGDS--EVKDEPTASVSGESDIRPYDPNTQRRVVRERLQALEIINERFA 62
+LSQ EID LL S + E +S I YD + +E+++ L +++E FA
Sbjct: 4 VLSQDEIDQLLTAISSGDASIEDARPISDTRKITLYDFRRPDKFSKEQMRTLSLMHETFA 63

Query: 63 RHFRMGLFNLLRRSPDITVGAIRIQPYHEFARNLPVPTNLNLIHLKPLRGTGLVVFSPSL 122
R L LR + V ++ Y EF R++P P+ L +I + PL+G ++ PS+
Sbjct: 64 RLTTTSLSAQLRSMVHVHVASVDQLTYEEFIRSIPTPSTLAVITMDPLKGNAVLEVDPSI 123

Query: 123 VFIAVDNLFGGDGRFPTKVEGREFTHTEQRVINRMLKLALEGYSDAWKAINPLEVEYVRS 182
F +D LFGG G+ KV+ R+ T E V+ ++ L ++W + L +
Sbjct: 124 TFSIIDRLFGGTGQ-AAKVQ-RDLTDIENSVMEGVIVRILANVRESWTQVIDLRPRLGQI 181

Query: 183 EMQVKFTNITTSPNDIVVNTPFHVEIGNLTGEFNICLPFSMIEPLRELLVNPPLENS--R 240
E +F I P+++VV ++G G N C+P+ IEP+ L + +S R
Sbjct: 182 ETNPQFAQI-VPPSEMVVLVTLETKVGEEEGMMNFCIPYITIEPIISKLSSQFWFSSVRR 240

Query: 241 NEDQNWRDNLVRQVQHSQLELVANFADISLRLSQILKLKPGDVLPIEKP---DRIIAHVD 297
+ + L ++ +++VA + L + IL L+ GD++ + D + +
Sbjct: 241 SSTTQYMGVLRDKLSTVDMDVVAEVGSLRLSVRDILGLRVGDIIRLHDTHVGDPFVLSIG 300

Query: 298 GVPVLTSQYGTLNGQYALRIEHLI 321
Q G + + A +I I
Sbjct: 301 NRKKFLCQPGVVGKKIAAQILERI 324


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01933  FLGMOTORFLIN  212    1e-74    Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 212 bits (542), Expect = 1e-74
Identities = 125/137 (91%), Positives = 134/137 (97%)

Query: 1 MSDMNNPADDNNGAMDDLWAEALSEQKSTSSKSAADAVFQQFGGGDVSGTLQDIDLIMDI 60
MSDMNNP+D+N GA+DDLWA+AL+EQK+T++KSAADAVFQQ GGGDVSG +QDIDLIMDI
Sbjct: 1 MSDMNNPSDENTGALDDLWADALNEQKATTTKSAADAVFQQLGGGDVSGAMQDIDLIMDI 60

Query: 61 PVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYGV 120
PVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYGV
Sbjct: 61 PVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYGV 120

Query: 121 RITDIITPSERMRRLSR 137
RITDIITPSERMRRLSR
Sbjct: 121 RITDIITPSERMRRLSR 137


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01935  FLGBIOSNFLIP  334    e-119    Escherichia coli: Flagellar biosynthetic protein Fl...
		>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP

signature.
Length = 245

Score = 334 bits (857), Expect = e-119
Identities = 244/245 (99%), Positives = 245/245 (100%)

Query: 1 MRRLLSVAPVLLWLVTPLAFAQLPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILLM 60
MRRLLSVAPVLLWL+TPLAFAQLPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILLM
Sbjct: 1 MRRLLSVAPVLLWLITPLAFAQLPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILLM 60

Query: 61 MTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEEK 120
MTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEEK
Sbjct: 61 MTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEEK 120

Query: 121 ISMQEALEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSELK 180
ISMQEALEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSELK
Sbjct: 121 ISMQEALEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSELK 180

Query: 181 TAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLA 240
TAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLA
Sbjct: 181 TAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLA 240

Query: 241 QSFYS 245
QSFYS
Sbjct: 241 QSFYS 245


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01936  TYPE3IMQPROT  67     1e-18    Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein

family signature.
Length = 86

Score = 67.1 bits (164), Expect = 1e-18
Identities = 22/78 (28%), Positives = 42/78 (53%)

Query: 4 ESVMMMGTEAMKVALALAAPLLLVALVTGLIISILQAATQINEMTLSFIPKIIAVFIAII 63
+ ++ G +A+ + L L+ +VA + GL++ + Q TQ+ E TL F K++ V + +
Sbjct: 2 DDLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLF 61

Query: 64 IAGPWMLNLLLDYVRTLF 81
+ W +LL Y R +
Sbjct: 62 LLSGWYGEVLLSYGRQVI 79


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01937  TYPE3IMRPROT  202    6e-67    Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein

family signature.
Length = 261

Score = 202 bits (516), Expect = 6e-67
Identities = 256/261 (98%), Positives = 259/261 (99%)

Query: 1 MMQVTSDQWLSWLSLYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPA 60
M+QVTS+QWLSWL+LYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPA
Sbjct: 1 MLQVTSEQWLSWLNLYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPA 60

Query: 61 NDVPVFSFFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPGSHL 120
NDVPVFSFFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDP SHL
Sbjct: 61 NDVPVFSFFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHL 120

Query: 121 NMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGSEPLNSNAFLALTKAGSLIF 180
NMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIG EPLNSNAFLALTKAGSLIF
Sbjct: 121 NMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLIF 180

Query: 181 LNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFC 240
LNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFC
Sbjct: 181 LNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFC 240

Query: 241 EHLFSEIFNLLADIISELPLI 261
EHLFSEIFNLLADIISELPLI
Sbjct: 241 EHLFSEIFNLLADIISELPLI 261


82  CMJKDNLE_01948  CMJKDNLE_01955  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
CMJKDNLE_01948  -2      17       -4.680993  DNA-cytosine methyltransferase
CMJKDNLE_01949  -1      25       -6.648141  putative phosphohydrolase
CMJKDNLE_01951  0       30       -8.172831  hypothetical protein
CMJKDNLE_01952  -1      26       -6.693202  outer membrane pore protein N, non-specific
CMJKDNLE_01953  0       29       -6.576437  glyoxalase III, Hsp31 molecular chaperone
CMJKDNLE_01954  0       34       -7.955876  putative sensory kinase in two-component
CMJKDNLE_01955  1       29       -6.262509  putative DNA-binding response regulator in
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
CMJKDNLE_01948  PF05272  29     0.045    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.3 bits (65), Expect = 0.045
Identities = 20/62 (32%), Positives = 29/62 (46%), Gaps = 15/62 (24%)

Query: 320 AKYILTPVLWKYLYRYAKKHQARGNGFGYGMVYPNNPQSVTRTLSARYYKDGAEILIDRG 379
A+Y + PVLW Y+ R+ K + G+ VY +R +DG+E RG
Sbjct: 166 ARYQVGPVLWGYVVRFIK---SDGDKLTLPYVY------------SRSQRDGSEAWKWRG 210

Query: 380 WD 381
WD
Sbjct: 211 WD 212


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01949  CARBMTKINASE  34     3e-04    Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 34.4 bits (79), Expect = 3e-04
Identities = 22/92 (23%), Positives = 35/92 (38%), Gaps = 9/92 (9%)

Query: 37 AQKLAADDDVDMLVILTACYFHDIVTLAKNHPQRQRSSILAAEETRRLLREEFVQFPA-- 94
+KLA + + D+ +ILT + L + Q + EE R+ E F A
Sbjct: 219 GEKLAEEVNADIFMILTDV---NGAALYYGTEKEQWLREVKVEELRKYYEEG--HFKAGS 273

Query: 95 --EKIEAVCHAIAAHSFSAQIAPLTTEAKIVQ 124
K+ A I A IA L + ++
Sbjct: 274 MGPKVLAAIRFIEWGGERAIIAHLEKAVEALE 305


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits        Score  E-value  Comments
CMJKDNLE_01952  ECOLIPORIN  540    0.0      E.coli/Salmonella-type porin signature.
		>ECOLIPORIN#E.coli/Salmonella-type porin signature.

Length = 383

Score = 540 bits (1393), Expect = 0.0
Identities = 264/399 (66%), Positives = 307/399 (76%), Gaps = 22/399 (5%)

Query: 1 MKRKVLAMLVPALLVAGAANAAEVYNKDGNKLDLYGKVAGLHYFSDDAGSDGDKSYARIG 60
MKRKVLA+++PALL AGAA+AAE+YNKDGNKLDLYGKV GLHYFSDD+ DGD++Y R+G
Sbjct: 1 MKRKVLALVIPALLAAGAAHAAEIYNKDGNKLDLYGKVDGLHYFSDDSSKDGDQTYMRVG 60

Query: 61 FKGETQIADQFTGYGQWEFNIGANGTESDKGNTATRLAFAGLGFGQNGTFDYGRNYGVVY 120
FKGETQI DQ TGYGQWE+N+ AN TE + N+ TRLAFAGL FG G+FDYGRNYGV+Y
Sbjct: 61 FKGETQINDQLTGYGQWEYNVQANTTEGEGANSWTRLAFAGLKFGDYGSFDYGRNYGVLY 120

Query: 121 DVEAWTDMLPEFGGDTYAGADNFMNGRANGVATYRNNGFFGQVDGLNFALQYQSNNEN-S 179
DVE WTDMLPEFGGD+Y ADN+M GRANGVATYRN FFG VDGLNFALQYQ NE+ S
Sbjct: 121 DVEGWTDMLPEFGGDSYTYADNYMTGRANGVATYRNTDFFGLVDGLNFALQYQGKNESQS 180

Query: 180 GGLFGQEGSGKGKGRDIAQENGDGFGMSTSYDFDFGLSLGAAYSNSDRTDNQVHKGWHNT 239
+ + G DI +NGDGFG+ST+YD G S GAAY+ SDRT+ QV
Sbjct: 181 ADDVNIGTNNRNNGDDIRYDNGDGFGISTTYDIGMGFSAGAAYTTSDRTNEQV------- 233

Query: 240 RDGDRSDTTAGGETAEAWTVGAKYDANNVYLAAMYAETRNMTGYGKVDA-----IANKTQ 294
+ T AGG+ A+AWT G KYDANN+YLA MY+ETRNMT YGK D +ANKTQ
Sbjct: 234 ---NAGGTIAGGDKADAWTAGLKYDANNIYLATMYSETRNMTPYGKTDKGYDGGVANKTQ 290

Query: 295 NFEVVAQYQFDFGLRPSIAYLQSKGKDLGGWAHDGNGDPRYTNKDLVKYVDIGATYYFNK 354
NFEV AQYQFDFGLRP++++L SKGKDL + +KDLVKY D+GATYYFNK
Sbjct: 291 NFEVTAQYQFDFGLRPAVSFLMSKGKDLTY------NNVNGDDKDLVKYADVGATYYFNK 344

Query: 355 NMSTYVDYKINLLDNDDDFYKENGIATDDIVAVGLVYQF 393
N STYVDYKINLLD+DD FYK+ GI+TDDIVA+G+VYQF
Sbjct: 345 NFSTYVDYKINLLDDDDPFYKDAGISTDDIVALGMVYQF 383


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
CMJKDNLE_01954  PF06580  38     7e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 37.5 bits (87), Expect = 7e-05
Identities = 29/147 (19%), Positives = 58/147 (39%), Gaps = 21/147 (14%)

Query: 308 DTLSLNKEVENLLDYL--EYLSDEKEIRFKVECNQQIFADKI---LLQRMLSNLIVNAIR 362
+SL E+ + YL + E ++F+ + N I ++ L+Q ++ N I + I
Sbjct: 214 RQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPMLVQTLVENGIKHGIA 273

Query: 363 YSPEKSRIHITSFLDANGSLNIDIASPGTKINEPEKLFRRFWRGDNSRHSVGQGLGLSLV 422
P+ +I + D NG++ +++ + G+ + K G GL V
Sbjct: 274 QLPQGGKILLKGTKD-NGTVTLEVENTGSLALKNTKE--------------STGTGLQNV 318

Query: 423 KA-IAELHGGSATYHYLSKHNVFRITL 448
+ + L+G A K +
Sbjct: 319 RERLQMLYGTEAQIKLSEKQGKVNAMV 345


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
CMJKDNLE_01955  HTHFIS  84     9e-21    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 84.5 bits (209), Expect = 9e-21
Identities = 30/117 (25%), Positives = 60/117 (51%), Gaps = 1/117 (0%)

Query: 39 KILLIEDNQRTQEWVTQGLSEAGYVIDAVSDGRDGLYLALKDDYALIILDIMLPGMDGWQ 98
IL+ +D+ + + Q LS AGY + S+ D L++ D+++P + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 99 ILQTLRTA-KQTPVICLTARDSVDDRVRGLDSGANDYLVKPFSFSELLARVRAQLRQ 154
+L ++ A PV+ ++A+++ ++ + GA DYL KPF +EL+ + L +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121


83  CMJKDNLE_01969  CMJKDNLE_01979  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
CMJKDNLE_01969  0       24       7.931492   fatty acyl-CoA synthetase
CMJKDNLE_01970  0       23       7.440904   hypothetical protein
CMJKDNLE_01971  0       19       3.910182   hypothetical protein
CMJKDNLE_01972  0       17       2.561882   Thioesterase PikA5
CMJKDNLE_01973  -1      19       -0.227227  enterobactin synthase multienzyme complex
CMJKDNLE_01974  0       20       -2.108911  outer membrane receptor involved in uptake of
CMJKDNLE_01975  0       26       -4.295369  adhesin
CMJKDNLE_01976  -1      28       -5.248562  adhesin
CMJKDNLE_01977  -2      31       -6.434564  adhesin
CMJKDNLE_01978  -1      30       -5.980543  hypothetical protein
CMJKDNLE_01979  -2      27       -4.038100  shikimate:H+ symporter
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01969  ISCHRISMTASE  51     2e-08    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 51.2 bits (122), Expect = 2e-08
Identities = 22/70 (31%), Positives = 44/70 (62%)

Query: 22 QQLRERLIQELNLTPQQLHEESNLIQAGLDSIRLMRWLHWFRKNGYRLTLRELYAAPTLA 81
+ +R+++ + L TP+ + ++ +L+ GLDS+R+M + +R+ G +T EL PT+
Sbjct: 233 ENIRKQIAELLQETPEDITDQEDLLDRGLDSVRIMTLVEQWRREGAEVTFVELAERPTIE 292

Query: 82 AWNQLMLSRS 91
W +L+ +RS
Sbjct: 293 EWQKLLTTRS 302


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_01970  DHBDHDRGNASE  46     1e-06    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 45.8 bits (108), Expect = 1e-06
Identities = 32/156 (20%), Positives = 55/156 (35%), Gaps = 19/156 (12%)

Query: 1561 LVTGAFGGLGRLAVNWLREKGARRIALLAPRVDESWLRDVEGGQTRVCR------CDVGD 1614
+TGA G+G L +GA + A + L V R DV D
Sbjct: 12 FITGAAQGIGEAVARTLASQGAH---IAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 1615 AGQLATVLDDLAAN-GGIAGAIHAAGVLADAPLQELDDHQLAAVFAVKAQAASQLLQTLR 1673
+ + + + G I ++ AGVL + L D + A F+V + +++
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 1674 NH-----DGRYLILYSSAAAT----LGAPGQSAHAL 1700
+ G + + S+ A + A S A
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAA 164


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
CMJKDNLE_01975  INTIMIN  73     9e-18    Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 73.2 bits (179), Expect = 9e-18
Identities = 20/60 (33%), Positives = 29/60 (48%), Gaps = 3/60 (5%)

Query: 86 QQIASTSQLIGSLLAEDMNSEQAANIARGWASSQASGVMTDWLSRFGTARITLGVNEDFS 145
QQ AS + S +N + A + A G A +QAS + WL +GTA + L +F
Sbjct: 168 QQAASLGSQLQS---RSLNGDYAKDTALGIAGNQASSQLQAWLQHYGTAEVNLQSGNNFD 224


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
CMJKDNLE_01976  INTIMIN  57     1e-10    Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 56.6 bits (136), Expect = 1e-10
Identities = 61/263 (23%), Positives = 90/263 (34%), Gaps = 20/263 (7%)

Query: 175 IAVKAHVNDQFGNPVTHQPATFSAAPSSQMIISQNTVSTNTQGVAEVTMTPERNGSYTVK 234
I A V G + P +F+ S ++S N+ +TN G A VT+ ++ G V
Sbjct: 578 ITYTATVKKN-GVAQANVPVSFNIV-SGTAVLSANSANTNGSGKATVTLKSDKPGQVVVS 635

Query: 235 ASLANGASLEKQLEAI---DEKLTLTSSPLIGVNAPKGATLTATLT---SANGTPVEGQV 288
A A S I K ++T A T T PV Q
Sbjct: 636 AKTAEMTSALNANAVIFVDQTKASITEIKADKTTAVANGQDAITYTVKVMKGDKPVSNQE 695

Query: 289 INFSVTPEGATLSGGKVRTNSSGQAPVVLTSNKVGTYTVTASFHNGVTIQTQTTVKVTGN 348
+ F+ T LS +T+++G A V LTS G V+A + V+
Sbjct: 696 VTFTTT--LGKLSNSTEKTDTNGYAKVTLTSTTPGKSLVSARVSDVAVDVKAPEVEFFTT 753

Query: 349 PSTTHVASFIADPSTIAATNSDLSTLKATVEDGSGNL-IEGLTVYFALKSGSTTLTSLTA 407
+ I T ++ G NL G + +S + + S
Sbjct: 754 LTID------DGNIEIVGTGVKGKLPTVWLQYGQVNLKASGGNGKYTWRSANPAIAS--- 804

Query: 408 VTDQNGIATTSVKGEITGSVTVS 430
V +G T KG T SV S
Sbjct: 805 VDASSGQVTLKEKGTTTISVISS 827



Score = 52.8 bits (126), Expect = 3e-09
Identities = 45/169 (26%), Positives = 62/169 (36%), Gaps = 5/169 (2%)

Query: 271 TLTATLTSANGTPVEGQVINFSVTPEGATLSGGKVRTNSSGQAPVVLTSNKVGTYTVTAS 330
T TAT+ NG ++F++ A LS TN SG+A V L S+K G V+A
Sbjct: 579 TYTATVKK-NGVAQANVPVSFNIVSGTAVLSANSANTNGSGKATVTLKSDKPGQVVVSAK 637

Query: 331 FHNGVTIQTQTTVKVTGNPSTTHVASFIADPSTIAATNSDLSTLKATVEDGSGNLIEGLT 390
+ V + AD +T A D T V G +
Sbjct: 638 TAEMTSALNANAVIFVDQTKASI-TEIKADKTTAVANGQDAITYTVKVMKG-DKPVSNQE 695

Query: 391 VYFALKSGSTTLTSLTAVTDQNGIATTSVKGEITGSVTVSAVTSAGGMQ 439
V F G + + T TD NG A ++ G VSA S +
Sbjct: 696 VTFTTTLGKLSNS--TEKTDTNGYAKVTLTSTTPGKSLVSARVSDVAVD 742



Score = 51.2 bits (122), Expect = 8e-09
Identities = 39/139 (28%), Positives = 59/139 (42%), Gaps = 3/139 (2%)

Query: 13 AVTDADGKAKVTLKGTKAGAHTVTASMVGGKS--EQLVVNFTADTLTAQVNLNVTEDNFI 70
A T+ GKA VTLK K G V+A S V F T + + + +
Sbjct: 612 ANTNGSGKATVTLKSDKPGQVVVSAKTAEMTSALNANAVIFVDQTKASITEIKADKTTAV 671

Query: 71 ANNIGMTRLQATVTDGNGNPVEGIKVNFRGTSVTLSSTSVETDDQVFAEILVTSTEVGLK 130
AN V PV +V F T LS+++ +TD +A++ +TST G
Sbjct: 672 ANGQDAITYTVKVMK-GDKPVSNQEVTFTTTLGKLSNSTEKTDTNGYAKVTLTSTTPGKS 730

Query: 131 TVSASLADKPTEVISRLLN 149
VSA ++D +V + +
Sbjct: 731 LVSARVSDVAVDVKAPEVE 749



Score = 39.7 bits (92), Expect = 2e-05
Identities = 35/213 (16%), Positives = 63/213 (29%), Gaps = 18/213 (8%)

Query: 4 NFTLSDGDKAVTDADGKAKVTLKGTKAGAHTVTASMVGGKSE--QLVVNFTADTLTAQVN 61
TD +G AKVTL T G V+A + + V F N
Sbjct: 701 TLGKLSNSTEKTDTNGYAKVTLTSTTPGKSLVSARVSDVAVDVKAPEVEFFTTLTIDDGN 760

Query: 62 LNVTEDNFIANNIGMTRLQATVTDGNGN-PVEGIKVNFRGTSVTLSSTSVETDDQVFAEI 120
+ + + + + G N G + S + SV+
Sbjct: 761 IEI-----VGTGVKGKLPTVWLQYGQVNLKASGGNGKYTWRSANPAIASVDASSGQ---- 811

Query: 121 LVTSTEVGLKTVSASLADKPTEVISRLLNAKVDVNSATITSQEIPEGQVMVAQDIAVKAH 180
VT E G T+S +D T + + + ++ + V ++
Sbjct: 812 -VTLKEKGTTTISVISSDNQT--ATYTIATPNSLIVPNMSKRVTYNDAVNTCKNFGG--- 865

Query: 181 VNDQFGNPVTHQPATFSAAPSSQMIISQNTVST 213
N + + + AA + S T+ +
Sbjct: 866 KLPSSQNELENVFKAWGAANKYEYYKSSQTIIS 898


ORFs having significant similarity with Known Virulence factors
LocusTag          Hits       Score    E-value    Comments
CMJKDNLE_01977    INTIMIN    30       0.004      Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 30.0 bits (67), Expect = 0.004
Identities = 23/129 (17%), Positives = 47/129 (36%), Gaps = 6/129 (4%)

Query: 11 KISAIDYSQNINGDYKATVTGGGEGIATLIPVLNGVHQAGLSTTIEFISAETRPMTGTVS 70
K+S + NG K T+T G + + ++ V + +EF G +
Sbjct: 704 KLSNSTEKTDTNGYAKVTLTSTTPGKSLVSARVSDVAVDVKAPEVEFF-TTLTIDDGNIE 762

Query: 71 VNGANLPTASFPSQGFTGAYYQLNNDNFAPGKTAADYSFSSSASWVGVDATGKVTFKNDG 130
+ G + P+ L + G + ++ A ++G+VT K G
Sbjct: 763 IVGTGV-KGKLPTVWLQYGQVNL---KASGGNGKYTWRSANPAIASVDASSGQVTLKEKG 818

Query: 131 DSNTVIITA 139
+ T+ + +
Sbjct: 819 -TTTISVIS 826


ORFs having significant similarity with Known Virulence factors
LocusTag          Hits       Score    E-value    Comments
CMJKDNLE_01979    TCRTETB    35       5e-04      Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 35.2 bits (81), Expect = 5e-04
Identities = 42/262 (16%), Positives = 99/262 (37%), Gaps = 24/262 (9%)

Query: 79 LGGVIFGHFGDRLGRKRMLMLTVWMMGIATALIGILPSFSTIGWWAPILLVTLRAIQGFA 138
+G ++G D+LG KR+L+ + + + + + SF ++ L+ R IQG
Sbjct: 64 IGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSL-------LIMARFIQGAG 116

Query: 139 VGGEWGGAALLSVESAPENKK----AFYSSGVQVGYGVGLLLSTGLVSLISMMTTDEQFL 194
++ P+ + S V +G GVG + + I
Sbjct: 117 AAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYI---------- 166

Query: 195 SWGWRIPFLFSIVLVLGALWVRNGMEESAEFEQQQHYQAAAKKRIPVIEALLRHPGAFLK 254
W L ++ ++ ++ +++ + + + ++ +L +
Sbjct: 167 --HWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVFFMLFTTSYSIS 224

Query: 255 IIALRLCELLTMYIVTAFALNYSTQNMGLPRELFLNIGLLVGGLSCLTIPCFAWLADRFG 314
+ + + L +++ + + GL + + IG+L GG+ T+ F +
Sbjct: 225 FLIVSVLSFL-IFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMM 283

Query: 315 RRRVYITGALIGTLSAFPFFMA 336
+ ++ A IG++ FP M+
Sbjct: 284 KDVHQLSTAEIGSVIIFPGTMS 305


84    CMJKDNLE_02058    CMJKDNLE_02071    N    Y    N    Pathogenicity Island (unbiased-composition)
LocusTag          DNBias    CDNBias    %GCBias      Product
CMJKDNLE_02058    -1        12         0.782959     actin family protein
CMJKDNLE_02059    -3        12         1.912361     hypothetical protein
CMJKDNLE_02060    -2        15         3.304812     hypothetical protein
CMJKDNLE_02061    -2        16         3.663833     hypothetical protein
CMJKDNLE_02064    -2        16         3.649736     hypothetical protein
CMJKDNLE_02066    -1        16         3.604412     MdtABC-TolC multidrug efflux transport system -
CMJKDNLE_02067    -1        17         3.600406     MdtABC-TolC multidrug efflux transport system -
CMJKDNLE_02068    -1        16         2.921908     MdtABC-TolC multidrug efflux transport system -
CMJKDNLE_02069    -2        11         1.166295     putative transport protein MdtD
CMJKDNLE_02070    -2        8          -0.042711    putative sensory kinase in two-component
CMJKDNLE_02071    -3        11         -1.343041    TorR transcriptional dual regulator
ORFs having significant similarity with Known Virulence factors
LocusTag          Hits            Score    E-value    Comments
CMJKDNLE_02058    SHAPEPROTEIN    49       2e-08      Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 49.0 bits (117), Expect = 2e-08
Identities = 32/129 (24%), Positives = 58/129 (44%), Gaps = 20/129 (15%)

Query: 132 AMMLH-IRQQAQAQLPDAITQAVIGRPINFQGLGGDEANTQAQGILERAAKRAGFKDVVF 190
M+ H I+Q + ++ P+ + + + I E +A+ AG ++V
Sbjct: 89 KMLQHFIKQVHSNSFMRPSPRVLVCVPVGATQV-------ERRAIRE-SAQGAGAREVFL 140

Query: 191 QYEPVAAGLDYEATLQEEKRVLVVDIGGGTTDCSLLLMGPQWRSRLDREASLLGHSGCRI 250
EP+AA + + E +VVDIGGGTT+ +++ + ++ S RI
Sbjct: 141 IEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLN-----------GVVYSSSVRI 189

Query: 251 GGNDLDIAL 259
GG+ D A+
Sbjct: 190 GGDRFDEAI 198


ORFs having significant similarity with Known Virulence factors
LocusTag          Hits         Score    E-value    Comments
CMJKDNLE_02066    RTXTOXIND    49       2e-08      Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 49.4 bits (118), Expect = 2e-08
Identities = 47/369 (12%), Positives = 105/369 (28%), Gaps = 87/369 (23%)

Query: 44 SYKSRWVIVIVVVIAAIAAFWFWQGRNDSQSAAPG-----ATKQAQQSPAGGRRG---MR 95
S + R V ++ IA G+ + + A G + + ++
Sbjct: 54 SRRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVK 113

Query: 96 SG-------PLA---PVQAATAVEQAVPRYLTGLGTIIAANTVTVRSRVDG--QLMALHF 143
G L + A + L ++ ++ +L
Sbjct: 114 EGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDE 173

Query: 144 QEGQQVKAGDLLAEI------------DPSQFKVALAQTQGQLA-------KDKATLANA 184
Q V ++L Q ++ L + + + + +
Sbjct: 174 PYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVE 233

Query: 185 RRDLARYQQLAKTNLVSRQELDAQQALVSETEGTIKADEASVA----------------- 227
+ L + L +++ + Q+ E ++ ++ +
Sbjct: 234 KSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVT 293

Query: 228 --------------------------SAQLQLDWSRITAPVDGRV-GLKQVDVGNQISSG 260
+ + S I APV +V LK G +++
Sbjct: 294 QLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTA 353

Query: 261 DTTGIVVITQTHPIDLVFTLPESDIATIVQAQKAGKPLVVEAWDRTNSKKL-SEGTLLSL 319
+T +V++ + +++ + DI I Q A + VEA+ T L + ++L
Sbjct: 354 ETL-MVIVPEDDTLEVTALVQNKDIGFINVGQNA--IIKVEAFPYTRYGYLVGKVKNINL 410

Query: 320 DNQIDATTG 328
D D G
Sbjct: 411 DAIEDQRLG 419


ORFs having significant similarity with Known Virulence factors
LocusTag          Hits            Score    E-value    Comments
CMJKDNLE_02067    ACRIFLAVINRP    920      0.0        Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 920 bits (2379), Expect = 0.0
Identities = 300/1036 (28%), Positives = 513/1036 (49%), Gaps = 29/1036 (2%)

Query: 13 SRLFIMRPVATTLLMVAILLAGIIGYRALPVSALPEVDYPTIQVVTLYPGASPDVMTSAV 72
+ FI RP+ +L + +++AG + LPV+ P + P + V YPGA + V
Sbjct: 2 ANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTV 61

Query: 73 TAPLERQFGQMSGLKQMSSQS-SGGASVITLQFQLTLPLDVAEQEVQAAINAATNLLPSD 131
T +E+ + L MSS S S G+ ITL FQ D+A+ +VQ + AT LLP +
Sbjct: 62 TQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE 121

Query: 132 LPNPPVYSKVNPADPPIMTLAVTSTAMPMTQVE--DMVETRVAQKISQISGVGLVTLSGG 189
+ + S + +M S TQ + D V + V +S+++GVG V L G
Sbjct: 122 VQQQGI-SVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 190 QRPAVRVKLNAQAIAALGLTSETVRTAITGANVNSAKGSLDGP------SRAVTLSANDQ 243
Q A+R+ L+A + LT V + N A G L G ++ A +
Sbjct: 181 QY-AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239

Query: 244 MQSAEEYRQLII-AYQNGAPIRLGDVATVEQGAENSWLGAWANKEQAIVMNVQRQPGANI 302
++ EE+ ++ + +G+ +RL DVA VE G EN + A N + A + ++ GAN
Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299

Query: 303 ISTADSIRQMLPQLTESLPKSVKVTVLSDRTTNIRASVDDTQFELMMAIALVVMIIYLFL 362
+ TA +I+ L +L P+ +KV D T ++ S+ + L AI LV +++YLFL
Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359

Query: 363 RNIPATIIPGVAVPLSLIGTFAVMVFLDFSINNLTLMALTIATGFVVDDAIVVIENISRY 422
+N+ AT+IP +AVP+ L+GTFA++ +SIN LT+ + +A G +VDDAIVV+EN+ R
Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419

Query: 423 I-EKGEKPLAAALKGAGEIGFTIISLTFSLIAVLIPLLFMGDIVGRLFREFAITLAVAIL 481
+ E P A K +I ++ + L AV IP+ F G G ++R+F+IT+ A+
Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479

Query: 482 ISAVVSLTLTPMMCARML---SQESLRKQNRFSRASEKMFDRIIAAYGRGLAKVLNHPWL 538
+S +V+L LTP +CA +L S E + F FD + Y + K+L
Sbjct: 480 LSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539

Query: 539 TLSVALSTLLLSVLLWVFIPKGFFPVQDNGIIQGTLQAPQSSSFANMAQRQRQVADVILQ 598
L + + V+L++ +P F P +D G+ +Q P ++ + QV D L+
Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599

Query: 599 DPA--VQSLTSFVGVDGTNPSLNSARLQINLKPLDERDDR---VQKVIARLQTAVDKVPG 653
+ V+S+ + G + + N+ ++LKP +ER+ + VI R + + K+
Sbjct: 600 NEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIR- 658

Query: 654 VDLFLQPTQDLTIDTQVSRTQYQFTLQ---ATSLDALSTWVPQLMEKLQQLP-QLSDVSS 709
D F+ P I + T + F L DAL+ QL+ Q P L V
Sbjct: 659 -DGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRP 717

Query: 710 DWQDKGLVAYVNVDRDSASRLGISMADVDNALYNAFGQRLISTIYTQANQYRVVLEHNTE 769
+ + + VD++ A LG+S++D++ + A G ++ + ++ ++ + +
Sbjct: 718 NGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAK 777

Query: 770 NTPGLAALDTIRLTSSDGGVVPLSSIAKIEQRFAPLSINHLDQFPVTTISFNVPDNYSLG 829
+D + + S++G +VP S+ + + + P I S G
Sbjct: 778 FRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSG 837

Query: 830 DAVQAIMDTEKTLNLPVDITTQFQGSTLAFQSALGSTVWLIVAAVVAMYIVLGILYESFI 889
DA A+M+ + LP I + G + + + L+ + V +++ L LYES+
Sbjct: 838 DA-MALMENLAS-KLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWS 895

Query: 890 HPITILSTLPTAGVGALLALLIAGSELDVIAIIGIILLIGIVKKNAIMMIDFALAAEREQ 949
P++++ +P VG LLA + + DV ++G++ IG+ KNAI++++FA ++
Sbjct: 896 IPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKE 955

Query: 950 GMSPREAIYQACLLRFRPILMTTLAALLGALPLMLSTGVGAELRRPLGIGMVGGLIVSQV 1009
G EA A +R RPILMT+LA +LG LPL +S G G+ + +GIG++GG++ + +
Sbjct: 956 GKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATL 1015

Query: 1010 LTLFTTPVIYLLFDRL 1025
L +F PV +++ R
Sbjct: 1016 LAIFFVPVFFVVIRRC 1031


ORFs having significant similarity with Known Virulence factors
LocusTag          Hits            Score    E-value    Comments
CMJKDNLE_02068    ACRIFLAVINRP    923      0.0        Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 923 bits (2386), Expect = 0.0
Identities = 288/1035 (27%), Positives = 508/1035 (49%), Gaps = 36/1035 (3%)

Query: 6 LFIYRPVATILLSVAITLCGILGFRMLPVAPLPQVDFPVIMVSASLPGASPETMASSVAT 65
FI RP+ +L++ + + G L LPVA P + P + VSA+ PGA +T+ +V
Sbjct: 4 FFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQ 63

Query: 66 PLERSLGRIAGVSEMTSSS-SLGSTRIILQFDFDRDINGAARDVQAAINAAQSLLPSGMP 124
+E+++ I + M+S+S S GS I L F D + A VQ + A LLP +
Sbjct: 64 VIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQ 123

Query: 125 SRPTYRKANPSDAPIMILTLTSDT--YSQGELYDFASTQLAPTISQIDGVGDVDVGGSSL 182
+ S + +M+ SD +Q ++ D+ ++ + T+S+++GVGDV + G+
Sbjct: 124 -QQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQY 182

Query: 183 PAVRVGLNPQALFNQGVSLDDVRTAVSNANVRKPQG------ALEDGTHRWQIQTNDELK 236
A+R+ L+ L ++ DV + N + G AL I K
Sbjct: 183 -AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFK 241

Query: 237 TAAEYQPLIIHYN-NGGAVRLGDVATVTDSVQDVRNAGMTNAKPAILLMIRKLPEANIIQ 295
E+ + + N +G VRL DVA V ++ N KPA L I+ AN +
Sbjct: 242 NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALD 301

Query: 296 TVDSIRAKLPELQETIPAAIDLQIAQDRSPTIRASLEEVEQTLIISVALVILVVFLFLRS 355
T +I+AKL ELQ P + + D +P ++ S+ EV +TL ++ LV LV++LFL++
Sbjct: 302 TAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQN 361

Query: 356 GRATIIPAVSVPVSLIGTFAAMYLCGFSLNNLSLMALTIATGFVVDDAIVVLENIARHL- 414
RAT+IP ++VPV L+GTFA + G+S+N L++ + +A G +VDDAIVV+EN+ R +
Sbjct: 362 MRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMM 421

Query: 415 EAGMKPLQAALQGTREVGFTVLSMSLSLVAVFLPLLLMGGLPGRLLREFAVTLSVAIGIS 474
E + P +A + ++ ++ +++ L AVF+P+ GG G + R+F++T+ A+ +S
Sbjct: 422 EDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALS 481

Query: 475 LLVSLTLTPMMCGWMLKASKPREQKRLRGFG----RMLVALQQGYGKSLKWVLNHTRLVG 530
+LV+L LTP +C +LK + GF Y S+ +L T
Sbjct: 482 VLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYL 541

Query: 531 VVLLGTIALNIWLYISIPKTFFPEQDTGVLMGGIQADQSISFQ----AMRGKLQDFMKII 586
++ +A + L++ +P +F PE+D GV + IQ + + + ++K
Sbjct: 542 LIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNE 601

Query: 587 RD-DPAVDNVTGFT-GGSRVNSGMMFITLKPRDERS---ETAQQIIDRLRVKLAKEPGAN 641
+ +V V GF+ G N+GM F++LKP +ER+ +A+ +I R +++L K
Sbjct: 602 KANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDGF 661

Query: 642 LFLMAVQDIRVGGRQSNASYQYTLLSDDLAALREWEPKIRKKLATL-----PELADVNSD 696
+ + I G + ++ L D + + R +L + L V +
Sbjct: 662 VIPFNMPAIVELGTATGFDFE---LIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPN 718

Query: 697 QQDNGAEMNLVYDRDTMARLGIDVQAANSLLNNAFGQRQISTIYQPMNQYKVVMEVDPRY 756
++ A+ L D++ LG+ + N ++ A G ++ K+ ++ D ++
Sbjct: 719 GLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKF 778

Query: 757 TQDISALEKMFVINNEGKAIPLSYFAKWQPANAPLSVNHQGLSAASTISFNLPTGKSLSD 816
++K++V + G+ +P S F + + I G S D
Sbjct: 779 RMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGD 838

Query: 817 ASAAIDRAMTQLGVPSTVRGSFAGTAQVFQETMNSQVILIIAAIATVYIVLGILYESYVH 876
A A ++ ++L P+ + + G + + + N L+ + V++ L LYES+
Sbjct: 839 AMALMENLASKL--PAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSI 896

Query: 877 PLTILSTLPSAGVGALLALELFNAPFSLIALIGIMLLIGIVKKNAIMMVDFALEAQRHGN 936
P++++ +P VG LLA LFN + ++G++ IG+ KNAI++V+FA +
Sbjct: 897 PVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEG 956

Query: 937 LTPQEAIFQACLLRFRPIMMTTLAALFGALPLVLSGGDGSELRHPLGITIVGGLVMSQLL 996
EA A +R RPI+MT+LA + G LPL +S G GS ++ +GI ++GG+V + LL
Sbjct: 957 KGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLL 1016

Query: 997 TLYTTPVVYLFFDRL 1011
++ PV ++ R
Sbjct: 1017 AIFFVPVFFVVIRRC 1031



Score = 79.5 bits (196), Expect = 4e-17
Identities = 77/446 (17%), Positives = 161/446 (36%), Gaps = 26/446 (5%)

Query: 592 VDNVTGFTGGS-RVNSGMMFITLKPRDERSETAQQIIDRLRVKLAKEPGANLFLMAVQDI 650
+DN+ + S S + +T + + Q+ ++L++ P + Q I
Sbjct: 72 IDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE----VQQQGI 127

Query: 651 RVGGRQSNASYQYTLLSDDLAALREW-----EPKIRKKLATLPELADVNSDQQDNGAE-- 703
V S+ +SD+ ++ ++ L+ L + DV GA+
Sbjct: 128 SVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQL----FGAQYA 183

Query: 704 MNLVYDRDTMARLGID----VQAANSLLNNAFGQRQISTIYQPMNQYKVVMEVDPRYTQD 759
M + D D + + + + + + T P Q + R+
Sbjct: 184 MRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKNP 243

Query: 760 ISALEKMFVINNEGKAIPLSYFAK--WQPANAPLSVNHQGLSAASTISFNLPTGKSLSDA 817
+ +N++G + L A+ N + G AA +L D
Sbjct: 244 EEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL-DT 302

Query: 818 SAAIDRAMTQL--GVPSTVRGSFA-GTAQVFQETMNSQVILIIAAIATVYIVLGILYESY 874
+ AI + +L P ++ + T Q +++ V + AI V++V+ + ++
Sbjct: 303 AKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNM 362

Query: 875 VHPLTILSTLPSAGVGALLALELFNAPFSLIALIGIMLLIGIVKKNAIMMVDFALEAQRH 934
L +P +G L F + + + G++L IG++ +AI++V+
Sbjct: 363 RATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMME 422

Query: 935 GNLTPQEAIFQACLLRFRPIMMTTLAALFGALPLVLSGGDGSELRHPLGITIVGGLVMSQ 994
L P+EA ++ ++ + +P+ GG + ITIV + +S
Sbjct: 423 DKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSV 482

Query: 995 LLTLYTTPVVYLFFDRLRLRFSRKPK 1020
L+ L TP + + + K
Sbjct: 483 LVALILTPALCATLLKPVSAEHHENK 508


ORFs having significant similarity with Known Virulence factors
LocusTag          Hits       Score    E-value    Comments
CMJKDNLE_02069    TCRTETB    121      3e-32      Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 121 bits (305), Expect = 3e-32
Identities = 98/435 (22%), Positives = 190/435 (43%), Gaps = 25/435 (5%)

Query: 20 FMQSLDTTIVNTALPSMAQSLGESPLHMHMVIVSYVLTVAVMLPASGWLADKVGVRNIFF 79
F L+ ++N +LP +A + P + V +++LT ++ G L+D++G++ +
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 80 TAIVLFTLGSLFCALSGTLNELL-LARALQGVGGAMMVPVGRLTVMKIVPREQYMAAMTF 138
I++ GS+ + + LL +AR +QG G A + + V + +P+E A
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143

Query: 139 VTLPGQVGPLLGPALGGLLVEYASWHWIFLINIPVGIIGAIATLL-LMPNYTMQTRRFDL 197
+ +G +GPA+GG++ Y HW +L+ IP+ I + L+ L+ FD+
Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHY--IHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDI 201

Query: 198 SGFLLLAVGMAVLTLALDGSKGTGLSPLTIAGLVAVGVVALVLYLLHARNNNRALFSLKL 257
G +L++VG+ L + L V V++ ++++ H R L
Sbjct: 202 KGIILMSVGIVFFMLFTTSYSISFLI---------VSVLSFLIFVKHIRKVTDPFVDPGL 252

Query: 258 FRTRTFSLGLAGSFAGRIGSGMLPFMTPVFLQIGLDFSPFHAG-LMMIPMVLGSMGMKRI 316
+ F +G+ M P ++ S G +++ P + + I
Sbjct: 253 GKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYI 312

Query: 317 VVQVVNRFGYRRVLVATTLGLSLVTLLFMTTALL----GWYYVLPFVLFLQGMVNSTRFS 372
+V+R G VL +G++ +++ F+T + L W+ + V L G+ S +
Sbjct: 313 GGILVDRRGPLYVL---NIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGL--SFTKT 367

Query: 373 SMNTLTLKDLPDNLASSGNSLLSMIMQLSMSIGVTIAGLLLGLFGSQHVSVDSGTTQTVF 432
++T+ L A +G SLL+ LS G+ I G LL + + Q+ +
Sbjct: 368 VISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLLDQRLLPMEVDQSTY 427

Query: 433 MYT--WLSMALIIAL 445
+Y+ L + II +
Sbjct: 428 LYSNLLLLFSGIIVI 442


ORFs having significant similarity with Known Virulence factors
LocusTag          Hits            Score    E-value    Comments
CMJKDNLE_02070    BCTERIALGSPF    31       0.009      Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 31.3 bits (71), Expect = 0.009
Identities = 27/95 (28%), Positives = 35/95 (36%), Gaps = 20/95 (21%)

Query: 164 RQTSWLIVALATLLAALATFLLA------RGLLAPVKRLVDGTHKLAAGDFTTRVTPTSE 217
RQ + L+ A L AL L+A V+ V H LA + P S
Sbjct: 75 RQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSLAD---AMKCFPGSF 131

Query: 218 DEL-----------GKLAQDFNQLASTLEKNQQMR 241
+ L G L N+LA E+ QQMR
Sbjct: 132 ERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMR 166


ORFs having significant similarity with Known Virulence factors
LocusTag          Hits      Score    E-value    Comments
CMJKDNLE_02071    HTHFIS    76       6e-18      FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 75.6 bits (186), Expect = 6e-18
Identities = 28/136 (20%), Positives = 65/136 (47%), Gaps = 1/136 (0%)

Query: 11 PRILIVEDEPKLGQLLIDYLRAASYAPTLISHGDQVLPYVRQTPPDLILLDLMLPGTDGL 70
IL+ +D+ + +L L A Y + S+ + ++ DL++ D+++P +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 71 TLCREIR-RFSDIPIVMVTAKIEEIDRLLGLEIGADDYICKPYSPREVVARVKTILRRCK 129
L I+ D+P+++++A+ + + E GA DY+ KP+ E++ + L K
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 130 PQRELQQQDAESPLII 145
+ + D++ + +
Sbjct: 124 RRPSKLEDDSQDGMPL 139


85    CMJKDNLE_02078    CMJKDNLE_02085    N    Y    N    Pathogenicity Island (unbiased-composition)
LocusTag          DNBias    CDNBias    %GCBias      Product
CMJKDNLE_02078    0         13         -1.385369    Alpha-ketoglutarate permease
CMJKDNLE_02079    0         17         -1.011435    xylulokinase
CMJKDNLE_02080    0         20         -2.543243    D-mannonate oxidoreductase
CMJKDNLE_02081    -1        18         -2.340268    LsrR DNA-binding transcriptional repressor
CMJKDNLE_02082    -1        14         -2.033697    MalI DNA-binding transcriptional repressor
CMJKDNLE_02083    -1        13         -1.337929    3-hydroxy acid dehydrogenase monomer
CMJKDNLE_02084    -1        11         -0.561548    L-ribulokinase monomer
CMJKDNLE_02085    0         10         -0.457372    Alpha-ketoglutarate permease
ORFs having significant similarity with Known Virulence factors
LocusTag          Hits       Score    E-value    Comments
CMJKDNLE_02078    TCRTETB    41       9e-06      Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 40.6 bits (95), Expect = 9e-06
Identities = 32/160 (20%), Positives = 66/160 (41%), Gaps = 5/160 (3%)

Query: 221 LYTNRNILLSSIVRIINTLSLFGFAVIMPMMFVDELGFTTSEWLQVWAVFFFTTIFSNVF 280
L N ++ + I ++ GF ++P M D +T+E + +V F S +
Sbjct: 252 LGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAE---IGSVIIFPGTMSVII 308

Query: 281 WGVLAEKMGWMKVVRWFGCVGMALSSLAFYYMP-QHFGHNFAMALIPAVALGIFVAAFVP 339
+G + + + + +G+ S++F ++ M +I LG
Sbjct: 309 FGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTV 368

Query: 340 MAAVFP-ALEPKHKGAAISVYNLSAGLSNFLAPAIAVVLL 378
++ + +L+ + GA +S+ N ++ LS AI LL
Sbjct: 369 ISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLL 408



Score = 29.1 bits (65), Expect = 0.041
Identities = 26/130 (20%), Positives = 50/130 (38%), Gaps = 9/130 (6%)

Query: 255 ELGFTTSEWLQVWAVFFFTTIFSNVFWGVLAEKMGWMKVVRWFGCVGMALSSLAFYYMPQ 314
+ + V F T +G L++++G +++ + + S + F
Sbjct: 43 DFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGF----- 97

Query: 315 HFGHNFAMALIPA-VALGIFVAAFVPMAAVFPA--LEPKHKGAAISVYNLSAGLSNFLAP 371
GH+F LI A G AAF + V A + +++G A + + + P
Sbjct: 98 -VGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGP 156

Query: 372 AIAVVLLPYF 381
AI ++ Y
Sbjct: 157 AIGGMIAHYI 166


ORFs having significant similarity with Known Virulence factors
LocusTag          Hits            Score    E-value    Comments
CMJKDNLE_02079    SHAPEPROTEIN    29       0.050      Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 29.0 bits (65), Expect = 0.050
Identities = 17/65 (26%), Positives = 25/65 (38%), Gaps = 13/65 (20%)

Query: 383 VLQESGTAIEQCS-------------LVGGGARSPFWAQLLADILDMPVVTHKGGETGGA 429
++ A+EQC L GGGA +LL + +PVV + T A
Sbjct: 267 IVSAVMVALEQCPPELASDISERGMVLTGGGALLRNLDRLLMEETGIPVVVAEDPLTCVA 326

Query: 430 LGAAR 434
G +
Sbjct: 327 RGGGK 331


ORFs having significant similarity with Known Virulence factors
LocusTag          Hits            Score    E-value    Comments
CMJKDNLE_02083    DHBDHDRGNASE    110      2e-31      2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 110 bits (276), Expect = 2e-31
Identities = 72/252 (28%), Positives = 118/252 (46%), Gaps = 17/252 (6%)

Query: 12 LNGKVAAITGAASGIGLQCAKTLLDAGAKVVLIDREGDKLHKIVAELGENAY---ALQLD 68
+ GK+A ITGAA GIG A+TL GA + +D +KL K+V+ L A A D
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 69 LFNNQQVDNMLADIIELAGGLDIFHANAGAYIGGPVAEGDPDVWDRVLNLNINAAFRCVR 128
+ ++ +D + A I G +DI AG G + + W+ ++N F R
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 129 AVLPHMIAQRSGDIIFTSSIAGVVPVIWEPIYTASKFAVQAFVHTTRRQVSQYGVRVGAV 188
+V +M+ +RSG I+ S VP Y +SK A F ++++Y +R V
Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185

Query: 189 LPGPVVTALLDD-WPKAKMEEALANGSLMQ------------PIEVAESVLFMVT-RSKN 234
PG T + W E + GSL P ++A++VLF+V+ ++ +
Sbjct: 186 SPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGH 245

Query: 235 VTVRDLVILPGS 246
+T+ +L + G+
Sbjct: 246 ITMHNLCVDGGA 257


ORFs having significant similarity with Known Virulence factors
LocusTag          Hits       Score    E-value    Comments
CMJKDNLE_02085    TCRTETB    41       6e-06      Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 41.0 bits (96), Expect = 6e-06
Identities = 32/160 (20%), Positives = 65/160 (40%), Gaps = 5/160 (3%)

Query: 221 LYTNRNIFLSSIVRIINTLSLFGFAVIMPMMFVDELGFTTSEWLQVWAAFFFTTIFSNIF 280
L N + + I ++ GF ++P M D +T+E + + F S I
Sbjct: 252 LGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAE---IGSVIIFPGTMSVII 308

Query: 281 WGIVAEKMGWMRVIRWFGCLGMAASSLAFYYMPQY-FGHNYWMAMIPAIALGTFVAAFVP 339
+G + + R + +G+ S++F +++M +I LG
Sbjct: 309 FGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTV 368

Query: 340 MAAVFP-ALEPKHKGAAISVYNLSAGMSNFLAPAIAVVLL 378
++ + +L+ + GA +S+ N ++ +S AI LL
Sbjct: 369 ISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLL 408


86    CMJKDNLE_02113    CMJKDNLE_02117    N    Y    N    Pathogenicity Island (unbiased-composition)
LocusTag          DNBias    CDNBias    %GCBias      Product
CMJKDNLE_02113    0         14         -0.709907    hypothetical protein
CMJKDNLE_02114    -1        15         -2.259120    hypothetical protein
CMJKDNLE_02115    -1        15         -0.215136    hypothetical protein
CMJKDNLE_02116    -1        17         0.687241     putative response regulator in two-component
CMJKDNLE_02117    1         16         2.177316     putative sensory kinase in two-component system
ORFs having significant similarity with Known Virulence factors
LocusTag          Hits       Score    E-value    Comments
CMJKDNLE_02113    PF09025    28       0.043      YopR Core
		>PF09025#YopR Core

Length = 143

Score = 28.4 bits (63), Expect = 0.043
Identities = 28/102 (27%), Positives = 40/102 (39%), Gaps = 6/102 (5%)

Query: 374 WPRSEQENSPAATRRLFSFQAGALAGGQIVSQAAKRSADGELLLATRNRLSSVVPLSPDA 433
+ ++ PAA RRL + GAL + A L + L + +PL
Sbjct: 32 FEQALGGEPPAAGRRLAGLENGALGERLLQRFAQPLQGLEADRLELKAMLRAELPLGRQQ 91

Query: 434 ----WQMLSAPLRQPGIVALREYLRQRPPACIRPLN-QVDNL 470
Q+L A PG L + R+ I PLN +DNL
Sbjct: 92 QTFLLQLLGAVEHAPGGEYLAQLARRELQVLI-PLNGMLDNL 132


ORFs having significant similarity with Known Virulence factors
LocusTag          Hits       Score    E-value    Comments
CMJKDNLE_02114    INTIMIN    27       0.028      Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 27.3 bits (60), Expect = 0.028
Identities = 19/94 (20%), Positives = 31/94 (32%)

Query: 36 LNGTEIAITYVYKGDKVLKQSSETKIQFASIGATTKEDAAKTLEPLSAKYKNIAGVEEKL 95
+ + AITY K K K S ++ F + KT AK + K
Sbjct: 671 VANGQDAITYTVKVMKGDKPVSNQEVTFTTTLGKLSNSTEKTDTNGYAKVTLTSTTPGKS 730

Query: 96 TYTDTYAQENVTIDMEKVDFKALQGISGINVSAE 129
+ + V + +V+F I N+
Sbjct: 731 LVSARVSDVAVDVKAPEVEFFTTLTIDDGNIEIV 764


ORFs having significant similarity with Known Virulence factors
LocusTag          Hits      Score    E-value    Comments
CMJKDNLE_02116    HTHFIS    71       1e-16      FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 71.4 bits (175), Expect = 1e-16
Identities = 41/177 (23%), Positives = 77/177 (43%), Gaps = 12/177 (6%)

Query: 2 IKVLIVDDEPLARENLRVFLQEQSDIEIVGECSNAVEGIGAVHKLRPDVLFLDIQMPRIS 61
+L+ DD+ R L L ++ ++ SNA + D++ D+ MP +
Sbjct: 4 ATILVADDDAAIRTVLNQAL-SRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 62 GLEMVGMLDPEHRPYI--VFLTAFD--EYAIKAFEEHAFDYLLKPIDEARLEKTLARLRQ 117
+++ + + RP + + ++A + AIKA E+ A+DYL KP D L + R
Sbjct: 62 AFDLLPRIK-KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 118 ERSKQDVSLLPENQQALKFIPCTGHSRIYLLQMKDVAFVSSRMSGVYVT--SHEGKE 172
E ++ L ++Q + + G S + +A + + +T S GKE
Sbjct: 121 EPKRRPSKLEDDSQDGMPLV---GRSAAMQEIYRVLARLMQTDLTLMITGESGTGKE 174


ORFs having significant similarity with Known Virulence factors
LocusTag          Hits       Score    E-value    Comments
CMJKDNLE_02117    PF06580    220      4e-69      Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 220 bits (562), Expect = 4e-69
Identities = 63/216 (29%), Positives = 115/216 (53%), Gaps = 3/216 (1%)

Query: 343 LGEGIAQLLSAQILAGQYERQKAMLTQSEIKLLHAQVNPHFLFNALNTIKAVIRRDSEQA 402
L G + + + +M ++++ L AQ+NPHF+FNALN I+A+I D +A
Sbjct: 134 LYFGWHFFKNYKQAEIDQWKMASMAQEAQLMALKAQINPHFMFNALNNIRALILEDPTKA 193

Query: 403 SQLVQYLSTFFRKNLKR-PSEFVTLADEIEHVNAYLQIEKARFQSRLQVNIAIPQELSQQ 461
+++ LS R +L+ + V+LADE+ V++YLQ+ +F+ RLQ I +
Sbjct: 194 REMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDV 253

Query: 462 QLPAFTLQPIVENAIKHGTSQLLDTGRVAISARREGQHLMLEIEDNAGL-YQPVTNASGL 520
Q+P +Q +VEN IKHG +QL G++ + ++ + LE+E+ L + ++G
Sbjct: 254 QVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGT 313

Query: 521 GMNLVDKRLRERFGDDYGISVACEPDSYTRITLRLP 556
G+ V +RL+ +G + I ++ + + +P
Sbjct: 314 GLQNVRERLQMLYGTEAQIKLSEKQGKVN-AMVLIP 348


87    CMJKDNLE_02126    CMJKDNLE_02131    N    Y    N    Pathogenicity Island (unbiased-composition)
LocusTag          DNBias    CDNBias    %GCBias      Product
CMJKDNLE_02126    1         18         2.004377     penicillin-binding protein 7
CMJKDNLE_02127    1         18         2.257172     putative inner membrane protein
CMJKDNLE_02128    2         18         2.205256     hypothetical protein
CMJKDNLE_02129    1         15         2.271586     putative oxidoreductase with NAD(P)-binding
CMJKDNLE_02130    0         13         1.224529     putative channel/filament protein
CMJKDNLE_02131    1         13         -0.111735    tRNA-dihydrouridine synthase C
ORFs having significant similarity with Known Virulence factors
LocusTag          Hits           Score    E-value    Comments
CMJKDNLE_02126    BLACTAMASEA    44       5e-07      Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 43.6 bits (103), Expect = 5e-07
Identities = 42/195 (21%), Positives = 76/195 (38%), Gaps = 18/195 (9%)

Query: 1 MPKFRVSLFSLALMLAVPFAPQAVAKTAAATTASQPEIASGSAMI-VDLNTNKVIYSNHP 59
M R+ + SL + +P A A + S+ +++ MI +DL + + + +
Sbjct: 1 MRYIRLCIISL--LATLPLAVHASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRA 58

Query: 60 DLVRPIASISKLMTAMVVLDARLPLDEKLKVDISQTPEMKGVYSRV---RLNSEISRKDM 116
D P+ S K++ VL DE+L+ I + YS V L ++ ++
Sbjct: 59 DERFPMMSTFKVVLCGAVLARVDAGDEQLERKIHYRQQDLVDYSPVSEKHLADGMTVGEL 118

Query: 117 LLLALMSSENRAAASLAHHYPGGYKAFIKAMNAKAKSLGMNNTRFV--EPTGLS-----V 169
A+ S+N +AA+L GG + A + +G N TR E
Sbjct: 119 CAAAITMSDN-SAANLLLATVGG----PAGLTAFLRQIGDNVTRLDRWETELNEALPGDA 173

Query: 170 HNVSTARDLTKLLIA 184
+ +T + L
Sbjct: 174 RDTTTPASMAATLRK 188


ORFs having significant similarity with Known Virulence factors
LocusTag          Hits            Score    E-value    Comments
CMJKDNLE_02128    BCTERIALGSPF    28       0.019      Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 28.3 bits (63), Expect = 0.019
Identities = 5/33 (15%), Positives = 16/33 (48%), Gaps = 2/33 (6%)

Query: 152 WLHNLDQHLKHW-VWLILVVVL-VVGVRWWLKR 182
L + ++ + W++L ++ + R L++
Sbjct: 215 VLMGMSDAVRTFGPWMLLALLAGFMAFRVMLRQ 247


ORFs having significant similarity with Known Virulence factors
LocusTag          Hits            Score    E-value    Comments
CMJKDNLE_02129    DHBDHDRGNASE    113      1e-32      2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 113 bits (284), Expect = 1e-32
Identities = 71/253 (28%), Positives = 116/253 (45%), Gaps = 12/253 (4%)

Query: 3 QVAIITASDSGIGKECALLLAQQGFDIGITWHSDEEGAKDTAREVVSHGVRAEIVQLDLG 62
++A IT + GIG+ A LA QG I ++ E+ K + AE D+
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKA-EARHAEAFPADVR 67

Query: 63 NLPEGALALEKLIQRLGRIDVLVNNAGAMTKAPFLDMAFDEWRKIFTVDVDGAFLCSQIA 122
+ ++ + +G ID+LVN AG + ++ +EW F+V+ G F S+
Sbjct: 68 DSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127

Query: 123 ARQMVKQGQGGRIINITSVHEHTPLPDASAYTAAKHALGGLTKAMALELVRHKILVNAVA 182
++ M+ + + G I+ + S P +AY ++K A TK + LEL + I N V+
Sbjct: 128 SKYMMDR-RSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVS 186

Query: 183 PGAIATPM-------NGMDDSDVKPDAEP---SIPLRRFGATHEIASLVVWLCSEGANYT 232
PG+ T M + +K E IPL++ +IA V++L S A +
Sbjct: 187 PGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHI 246

Query: 233 TGQSLIVDGGFML 245
T +L VDGG L
Sbjct: 247 TMHNLCVDGGATL 259


ORFs having significant similarity with Known Virulence factors
LocusTag          Hits            Score    E-value    Comments
CMJKDNLE_02131    SHAPEPROTEIN    28       0.044      Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 28.2 bits (63), Expect = 0.044
Identities = 31/127 (24%), Positives = 53/127 (41%), Gaps = 5/127 (3%)

Query: 122 GAKAMREAVPAHLPVSVKVRLGWDSGEK-KFEIADAVQQAGATELVVHGRTKEQGY-RAE 179
G EA+ ++ + +G + E+ K EI A E+ V GR +G R
Sbjct: 190 GGDRFDEAIINYVRRNYGSLIGEATAERIKHEIGSAYPGDEVREIEVRGRNLAEGVPRGF 249

Query: 180 HIDWQAIGD-IRQRLNIPVIANGEIWDWQSAQQCMAISGCDAVMIGRGALNIPNLSRVVK 238
++ I + +++ L V A + + IS V+ G GAL + NL R++
Sbjct: 250 TLNSNEILEALQEPLTGIVSAVMVALEQCPPELASDISERGMVLTGGGAL-LRNLDRLL- 307

Query: 239 YNEPRMP 245
E +P
Sbjct: 308 MEETGIP 314


88    CMJKDNLE_02208    CMJKDNLE_02215    N    Y    N    Pathogenicity Island (unbiased-composition)
LocusTag          DNBias    CDNBias    %GCBias      Product
CMJKDNLE_02208    -1        11         -2.503637    outer membrane porin C
CMJKDNLE_02210    0         12         -2.121102    Phosphotransferase RcsD
CMJKDNLE_02211    0         13         -1.737757    FimZ transcriptional regulator
CMJKDNLE_02213    0         15         -1.119044    EnvZ sensory histidine kinase
CMJKDNLE_02214    1         18         -0.668366    NtrB
CMJKDNLE_02215    0         16         0.301721     HyfR DNA-binding transcriptional activator
ORFs having significant similarity with Known Virulence factors
LocusTag          Hits          Score    E-value    Comments
CMJKDNLE_02208    ECOLIPORIN    529      0.0        E.coli/Salmonella-type porin signature.
		>ECOLIPORIN#E.coli/Salmonella-type porin signature.

Length = 383

Score = 529 bits (1365), Expect = 0.0
Identities = 252/383 (65%), Positives = 295/383 (77%), Gaps = 19/383 (4%)

Query: 1 MKVKVLSLLVPALLVAGAANAAEIYNKDGNKLDLYGKVDGLHYFSDNDSKDGDKTYMRLG 60
MK KVL+L++PALL AGAA+AAEIYNKDGNKLDLYGKVDGLHYFSD+ SKDGD+TYMR+G
Sbjct: 1 MKRKVLALVIPALLAAGAAHAAEIYNKDGNKLDLYGKVDGLHYFSDDSSKDGDQTYMRVG 60

Query: 61 FKGETQVTDQLTGYGQWEYQIQGNEPESDNS-SWTRVAFAGLKFQDVGSFDYGRNYGVVY 119
FKGETQ+ DQLTGYGQWEY +Q N E + + SWTR+AFAGLKF D GSFDYGRNYGV+Y
Sbjct: 61 FKGETQINDQLTGYGQWEYNVQANTTEGEGANSWTRLAFAGLKFGDYGSFDYGRNYGVLY 120

Query: 120 DVTSWTDVLPEFGGDTY-DSDNFMQQRGNGFATYRNTDFFGLVDGLDFAVQYQGKNGSAH 178
DV WTD+LPEFGGD+Y +DN+M R NG ATYRNTDFFGLVDGL+FA+QYQGKN S
Sbjct: 121 DVEGWTDMLPEFGGDSYTYADNYMTGRANGVATYRNTDFFGLVDGLNFALQYQGKNESQS 180

Query: 179 GEGMT-----TNGRDDVFEQNGDGVGGSITYNY-EGFGIGAAVSSSKRTWDQNNT-GLIG 231
+ + N DD+ NGDG G S TY+ GF GAA ++S RT +Q N G I
Sbjct: 181 ADDVNIGTNNRNNGDDIRYDNGDGFGISTTYDIGMGFSAGAAYTTSDRTNEQVNAGGTIA 240

Query: 232 TGDRAETYTGGLKYDANNIYLAAQYTQTYNATRVGSL------GWANKAQNFEAVAQYQF 285
GD+A+ +T GLKYDANNIYLA Y++T N T G G ANK QNFE AQYQF
Sbjct: 241 GGDKADAWTAGLKYDANNIYLATMYSETRNMTPYGKTDKGYDGGVANKTQNFEVTAQYQF 300

Query: 286 DFGLRPSLAYLQSKGKNLGR---GYDDEDILKYVDVGATYYFNKNMSTYVDYKINLLD-D 341
DFGLRP++++L SKGK+L DD+D++KY DVGATYYFNKN STYVDYKINLLD D
Sbjct: 301 DFGLRPAVSFLMSKGKDLTYNNVNGDDKDLVKYADVGATYYFNKNFSTYVDYKINLLDDD 360

Query: 342 NRFTRDAGINTDDIVALGLVYQF 364
+ F +DAGI+TDDIVALG+VYQF
Sbjct: 361 DPFYKDAGISTDDIVALGMVYQF 383


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
CMJKDNLE_02211  HTHFIS  48     9e-09    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 47.9 bits (114), Expect = 9e-09
Identities = 26/145 (17%), Positives = 60/145 (41%), Gaps = 20/145 (13%)

Query: 1 MNNMNVIIADDHPIVLFGIRKSLEQIEWVNVVGEFEDSTALINNLPKLDAHVLITDLSMP 60
M +++ADD + + ++L + + + ++ L + D +++TD+ MP
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRI--TSNAATLWRWIAAGDGDLVVTDVVMP 58

Query: 61 GDKYGDGITLIKYIKRHFPSLSIIVLTMNNNPAILSAVLDLDIEGIVLKQGA------PT 114
+ L+ IK+ P L ++V++ N +A+ ++GA P
Sbjct: 59 D---ENAFDLLPRIKKARPDLPVLVMSAQNTFM--TAIKA-------SEKGAYDYLPKPF 106

Query: 115 DLPKALAALQKGKKFTPESVSRLLE 139
DL + + + + S+L +
Sbjct: 107 DLTELIGIIGRALAEPKRRPSKLED 131


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
CMJKDNLE_02213  HTHFIS  82     3e-18    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 82.2 bits (203), Expect = 3e-18
Identities = 29/106 (27%), Positives = 48/106 (45%)

Query: 827 ILVVDDHPINRRLLADQLGSLGYQCKTANDGVDALNVLSKNHIDIVLSDVNMPNMDGYRL 886
ILV DD R +L L GY + ++ ++ D+V++DV MP+ + + L
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 887 TQRIRQLGLTLPVIGVTANALAEEKQRCLESGMDSCLSKPVTLDVI 932
RI++ LPV+ ++A + E G L KP L +
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTEL 111


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
CMJKDNLE_02215  HTHFIS  563    0.0      FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 563 bits (1452), Expect = 0.0
Identities = 181/484 (37%), Positives = 270/484 (55%), Gaps = 35/484 (7%)

Query: 1 MTAINRILIVDDEDNVRRMLSTAFALQGFETHCANNGRTALHLFADIHPDVVLMDIRMPE 60
MT IL+ DD+ +R +L+ A + G++ +N T A D+V+ D+ MP+
Sbjct: 1 MTGA-TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPD 59

Query: 61 MDGIKALKEMRSHETRTPVILMTAYAEVETAVEALRCGAFDYVIKPFDLDELNLIVQRAL 120
+ L ++ PV++M+A TA++A GA+DY+ KPFDL EL I+ RAL
Sbjct: 60 ENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119

Query: 121 QLQSMKKEIRHLHQALSTSWQWGH-ILTNSPAMMDICKDTAKIALSQASVLISGESGTGK 179
+ L Q G ++ S AM +I + A++ + +++I+GESGTGK
Sbjct: 120 AEP------KRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGK 173

Query: 180 ELIARAIHYNSRRAKGPFIKVNCAALPESLLESELFGHEKGAFTGAQTLRQGLFERANEG 239
EL+ARA+H +R GPF+ +N AA+P L+ESELFGHEKGAFTGAQT G FE+A G
Sbjct: 174 ELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGG 233

Query: 240 TLLLDEIGEMPLVLQAKLLRILQEREFERIGGHQTIKVDIRIIAATNRDLQAMVKEGTFR 299
TL LDEIG+MP+ Q +LLR+LQ+ E+ +GG I+ D+RI+AATN+DL+ + +G FR
Sbjct: 234 TLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFR 293

Query: 300 EDLFYRLNVIHLILPPLRDRREDISLLANHFLQKFSSENQRDIIDIDPMAMSLLTAWSWP 359
EDL+YRLNV+ L LPPLRDR EDI L HF+Q+ E + D A+ L+ A WP
Sbjct: 294 EDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLD-VKRFDQEALELMKAHPWP 352

Query: 360 GNIRELSNVIERAVVMNSGPIIFSEDLPPQIRQPV---------CNAGEVKTAPVGERN- 409
GN+REL N++ R + +I E + ++R + +G + + E N
Sbjct: 353 GNVRELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENM 412

Query: 410 ----------------LKEEIKRVEKRIIMEVLEQQEGNRTRTALMLGISRRALMYKLQE 453
+ +E +I+ L GN+ + A +LG++R L K++E
Sbjct: 413 RQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRE 472

Query: 454 YGID 457
G+
Sbjct: 473 LGVS 476


89  CMJKDNLE_02353  CMJKDNLE_02356  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
CMJKDNLE_02353  1       36       -9.837283  EmrKY putative multidrug efflux transporter -
CMJKDNLE_02354  0       36       -8.959860  EmrKY-TolC multidrug efflux transport system -
CMJKDNLE_02355  1       33       -8.250023  FimZ transcriptional regulator
CMJKDNLE_02356  1       34       -7.881491  putative DNA-binding response regulator in
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
CMJKDNLE_02353  TCRTETB  121    4e-32    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 121 bits (306), Expect = 4e-32
Identities = 92/404 (22%), Positives = 167/404 (41%), Gaps = 17/404 (4%)

Query: 19 VTIALSLATFMQMLDSTISNVAIPTISGFLGASTDEGTWVITSFGVANAIAIPVTGRLAQ 78
+ I L + +F +L+ + NV++P I+ WV T+F + +I V G+L+
Sbjct: 15 ILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSD 74

Query: 79 RIGELRLFLLSVTFFSLSSLMCSLS-TNLDVLIFFRVVQGLMAGPLIPLSQSLLLRNYPP 137
++G RL L + S++ + + +LI R +QG A L ++ R P
Sbjct: 75 QLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPK 134

Query: 138 EKRTFALALWSMTVIIAPICGPILGGYICDNFSWGWIFLINVPMGIIVLTLCLTLLKGRE 197
E R A L V + GP +GG I W +L+ +PM I+ L L +E
Sbjct: 135 ENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWS--YLLLIPMITIITVPFLMKLLKKE 192

Query: 198 TETSPVKMNLPGLTLLVLGVGGLQIMLDKGRDLDWFNSSTIIILTVVSVISLISLVIWES 257
++ G+ L+ +G+ + ML F +S I +VSV+S + V
Sbjct: 193 VRIKG-HFDIKGIILMSVGI--VFFML--------FTTSYSISFLIVSVLSFLIFVKHIR 241

Query: 258 TSENPILDLSLFKSRNFTIGIVSITCAYLFYSGAIVLMPQLLQETMGYNAIWAGLAYAPI 317
+P +D L K+ F IG++ + +G + ++P ++++ + G
Sbjct: 242 KVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFP 301

Query: 318 GIMPLLIS-PLIGRYGNKIDMRLLVTFSFLMYAVCYYWRSVTFMPTIDFTGIILPQFFQG 376
G M ++I + G ++ ++ +V + S T F II+ G
Sbjct: 302 GTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGG 361

Query: 377 FAVACFFLPLTTISFSGLPDNKFANASSMSNFFRTLSGSVGTSL 420
+ ++TI S L + S+ NF LS G ++
Sbjct: 362 LSFTK--TVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAI 403


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits       Score  E-value  Comments
CMJKDNLE_02354  RTXTOXIND  78     6e-18    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 78.3 bits (193), Expect = 6e-18
Identities = 62/412 (15%), Positives = 122/412 (29%), Gaps = 96/412 (23%)

Query: 13 RRKYFSLLAVVLFIAFSGAYAYWSMELEDMISTDDAYVT-GNADPISAQVSGSVTVVNHK 71
RR ++ F+ + + ++E + + + G + I + V + K
Sbjct: 55 RRPRLVAYFIMGFLVIAFILSVLG-QVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVK 113

Query: 72 DTNYVRQGDILVSLDKTDATIALNKA---------------------------------- 97
+ VR+GD+L+ L A K
Sbjct: 114 EGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDE 173

Query: 98 ------------------KNNLANIVRQTNKLYLQDKQYSAEVASARIQ---YQQSLEDY 136
K + Q + L + AE + + Y+
Sbjct: 174 PYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVE 233

Query: 137 NRRV----PLAKQGVISKE----------TLEHTKDTLISSKAALNAAIQAYKANKALVM 182
R+ L + I+K + S + + I + K LV
Sbjct: 234 KSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVT 293

Query: 183 N-------TPLNR-QPQVVEAADATKEAWLALKRTDIKSPVTGYIAQRSVQ-VGETVSPG 233
L + + + + + I++PV+ + Q V G V+
Sbjct: 294 QLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTA 353

Query: 234 QSLMAVVPARQ-MWVNANFKETQLTDVRIGQSVNIISDLYGENVVFHGRVTGINMGTGNA 292
++LM +VP + V A + + + +GQ+ I + F G +G
Sbjct: 354 ETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVE------AFPYTRYGYLVGK--- 404

Query: 293 FSLLPAQNATGNWIKIVQRVPVEVSLDPKELMEH----PLRIGLSMTATIDT 340
+ + +V V +S++ L PL G+++TA I T
Sbjct: 405 VKNINLDAIEDQRLGLVFNVI--ISIEENCLSTGNKNIPLSSGMAVTAEIKT 454


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
CMJKDNLE_02355  HTHFIS  49     3e-09    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 49.1 bits (117), Expect = 3e-09
Identities = 22/148 (14%), Positives = 53/148 (35%), Gaps = 31/148 (20%)

Query: 4 IIIDDHPLAIAAIRNLLIKNDIEILAELTEGGSAVQRVETLKPDIVIIDVDIPGVNGIQV 63
++ DD + L + ++ + + + + D+V+ DV +P N +
Sbjct: 7 LVADDDAAIRTVLNQALSRAGYDVRIT-SNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 64 LETLRKRQYSGIIIIVSAKNDHFYGKHCADAGANGFVSKKEGMNNIIAAIEAAKNGYCYF 123
L ++K + ++++SA+N + AI+A++ G +
Sbjct: 66 LPRIKKARPDLPVLVMSAQNT------------------------FMTAIKASEKGAYDY 101

Query: 124 ---PFSLNRFVGSLTSDQQKLDSLSKQE 148
PF L + + L ++
Sbjct: 102 LPKPFDLTE---LIGIIGRALAEPKRRP 126


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
CMJKDNLE_02356  HTHFIS  80     2e-17    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 79.9 bits (197), Expect = 2e-17
Identities = 30/105 (28%), Positives = 51/105 (48%)

Query: 960 SILIADDHPTNRLLLKRQLNLLGYDVDEATDGVQALHKVSMQHYDLLITDVNMPNMDGFE 1019
+IL+ADD R +L + L+ GYDV ++ ++ DL++TDV MP+ + F+
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 1020 LTRKLREQNSSLPIWGLTANAQANEREKGLSCGMNLCLFKPLTLD 1064
L ++++ LP+ ++A K G L KP L
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLT 109


90  CMJKDNLE_02638  CMJKDNLE_02644  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias   Product
CMJKDNLE_02638  -1      12       2.159100  putative transporter
CMJKDNLE_02639  -1      12       1.801902  L-valine efflux transporter - YgaZ subunit
CMJKDNLE_02640  -1      13       1.254874  L-valine efflux transporter - YgaH subunit
CMJKDNLE_02641  -2      11       0.916400  MprA-CCCP
CMJKDNLE_02642  -1      12       1.331236  EmrAB-TolC multidrug efflux transport system -
CMJKDNLE_02643  -1      13       1.154186  EmrAB-TolC multidrug efflux transport system -
CMJKDNLE_02644  0       15       0.483368  S-ribosylhomocysteine lyase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
CMJKDNLE_02638  TCRTETB  46     2e-07    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 45.6 bits (108), Expect = 2e-07
Identities = 32/165 (19%), Positives = 70/165 (42%), Gaps = 2/165 (1%)

Query: 34 LDTIARNFSLSASSAGFIVTAAQLGYAAGLLFLVPLGDMFERRRLIVSMTLLAACGMLIT 93
L IA +F+ +S ++ TA L ++ G L D +RL++ ++ G +I
Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIG 96

Query: 94 ASSQSLA-MMILGTALTGLFSVVAQILVPLA-ATLASPDKRGKVVGTIMSGLLLGILLAR 151
S ++I+ + G + LV + A + RGK G I S + +G +
Sbjct: 97 FVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGP 156

Query: 152 TVAGLLANLGGWRTVFWVASVLMALMALALWRGLPQMKSETHLNY 196
+ G++A+ W + + + + + + +++ + H +
Sbjct: 157 AIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDI 201


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
CMJKDNLE_02641  PF05272  29     0.010    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.3 bits (65), Expect = 0.010
Identities = 24/99 (24%), Positives = 39/99 (39%), Gaps = 12/99 (12%)

Query: 23 PYQEILLTRLCMHMQSKLLENRNKMLKAQGINETLFMALITLESQENHSIQPSELSCALG 82
P QE+ L + + L R A+G + + T + ++L ALG
Sbjct: 756 PEQELRLVETGVQGRLWALLTREGAPAAEGAAQKGYSVNTTFVTI-------ADLVQALG 808

Query: 83 -----SSRTNATRIADELEKRGWIERRESDNDRRCLYLQ 116
SS ++ D L + GW RE+ RR Y++
Sbjct: 809 ADPGKSSPMLEGQVRDWLNENGWEYLRETSGQRRRGYMR 847


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits       Score  E-value  Comments
CMJKDNLE_02642  RTXTOXIND  78     6e-18    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 78.3 bits (193), Expect = 6e-18
Identities = 64/412 (15%), Positives = 120/412 (29%), Gaps = 97/412 (23%)

Query: 25 LLLTLLFIIIAVAIGIYWFLVLRHFEETDDA----YVAGNQIQIMSQVSGSVTKVWADNT 80
L FI+ + I VL E A +G +I + V ++
Sbjct: 57 PRLVAYFIMGFLVIAFILS-VLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEG 115

Query: 81 DFVKEGDVLVTLDPTDARQAFEKA------------------------------------ 104
+ V++GDVL+ L A K
Sbjct: 116 ESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPY 175

Query: 105 ----------------KTALASSVRQTHQLMINSKQLQANIEVQKIALAKA-------QS 141
K ++ Q +Q +N + +A + + +S
Sbjct: 176 FQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKS 235

Query: 142 DYNRRVPLGNANLIGREELQHARDAVTSAQAQLDVAIQQYNANQAMILGTKLEDQPAVQQ 201
+ L + I + + + A +L V Q ++ IL K E Q Q
Sbjct: 236 RLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQL 295

Query: 202 AATEVRN------------------AWLALERTRIISPMTGYVSRRAVQ-PGAQISPTTP 242
E+ + + + I +P++ V + V G ++
Sbjct: 296 FKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAET 355

Query: 243 LMAVVPA-TNMWVDANFKETQIANMRIGQPVTITTDIYGDDVKY---TGKVVGLDMGTGS 298
LM +VP + V A + I + +GQ I + + +Y GKV + +
Sbjct: 356 LMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAF-PYTRYGYLVGKVKNI-----N 409

Query: 299 AFSLLPAQNATGNWIKVVQRLPVRIELDQKQLEQYPLRIGLSTLVSVNTTNR 350
++ G V+ + + PL G++ + T R
Sbjct: 410 LDAIE--DQRLGLVFNVIISIEENCLST--GNKNIPLSSGMAVTAEIKTGMR 457


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
CMJKDNLE_02643  TCRTETB  132    9e-36    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 132 bits (333), Expect = 9e-36
Identities = 97/405 (23%), Positives = 169/405 (41%), Gaps = 23/405 (5%)

Query: 17 IALSLATFMQVLDSTIANVAIPTIAGNLGSSLSQGTWVITSFGVANAISIPLTGWLAKRV 76
I L + +F VL+ + NV++P IA + + WV T+F + +I + G L+ ++
Sbjct: 17 IWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQL 76

Query: 77 GEVKLFLWSTIAFAIASWACGVS-SSLNMLIFFRVIQGIVAGPLIPLSQSLLLNNYPPAK 135
G +L L+ I S V S ++LI R IQG A L ++ P
Sbjct: 77 GIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKEN 136

Query: 136 RSIALALWSMTVIVAPICGPILGGYISDNYHWGWIFFINVPIGVAVVLMTLQTLRGRETR 195
R A L V + GP +GG I+ HW + + +P+ + + L L +E R
Sbjct: 137 RGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWS--YLLLIPMITIITVPFLMKLLKKEVR 194

Query: 196 TERRRIDAVGLALLVIGIGSLQIMLDRGKELDWFSSQEIIILTVVAVVAICFLIVWELTD 255
+ D G+ L+ +GI + ML F++ I +V+V++ +
Sbjct: 195 I-KGHFDIKGIILMSVGI--VFFML--------FTTSYSISFLIVSVLSFLIFVKHIRKV 243

Query: 256 DNPIVDLSLFKSRNFTIGCLCISLAYMLYFGAIVLLPQLLQEVYGYTATWAGLASAPVGI 315
+P VD L K+ F IG LC + + G + ++P ++++V+ + G G
Sbjct: 244 TDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGT 303

Query: 316 IPVILS-PIIGRFAHKLDMRRLVTFSFIMYAVCFYWRAYTFEPGMDFGASAWPQFIQGF- 373
+ VI+ I G + ++ +V F ++ S + I F
Sbjct: 304 MSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFL-----LETTSWFMTIIIVFV 358

Query: 374 --AVACFFMPLTTITLSGLPPERLAAASSLSNFTRTLAGSIGTSI 416
++ ++TI S L + A SL NFT L+ G +I
Sbjct: 359 LGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAI 403


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits         Score  E-value  Comments
CMJKDNLE_02644  LUXSPROTEIN  292    e-105    Bacterial autoinducer-2 (AI-2) production protein Lu...
		>LUXSPROTEIN#Bacterial autoinducer-2 (AI-2) production protein LuxS signature.

Length = 171

Score = 292 bits (750), Expect = e-105
Identities = 131/170 (77%), Positives = 148/170 (87%)

Query: 2 PLLDSFTVDHTRMEAPAVRVAKTMNTPHGDAITVFDLRFCVPNKEVMPERGIHTLEHLFA 61
PLLDSFTVDHTRM APAVRVAKTM TP GD ITVFDLRF PNK+++ E+GIHTLEHL+A
Sbjct: 1 PLLDSFTVDHTRMNAPAVRVAKTMQTPKGDTITVFDLRFTAPNKDILSEKGIHTLEHLYA 60

Query: 62 GFMRNHLNGNGVEIIDISPMGCRTGFYMSLIGTPDEQRVADAWKAAMEDVLKVQDQNQIP 121
GFMRNHLNG+ VEIIDISPMGCRTGFYMSLIGTP EQ+VADAW AAMEDVLKV++QN+IP
Sbjct: 61 GFMRNHLNGDSVEIIDISPMGCRTGFYMSLIGTPSEQQVADAWIAAMEDVLKVENQNKIP 120

Query: 122 ELNVYQCGTYQMHSLQEAQDIARSILERDVRINSNEELALPKEKLQELHI 171
ELN YQCGT MHSL EA+ IA++ILE V +N N+ELALP+ L+EL I
Sbjct: 121 ELNEYQCGTAAMHSLDEAKQIAKNILEVGVAVNKNDELALPESMLRELRI 170


91  CMJKDNLE_02732  CMJKDNLE_02740  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
CMJKDNLE_02732  1       10       -1.785028  YgcS MFS transporter
CMJKDNLE_02733  1       11       -3.159781  putative FAD-containing dehydrogenase
CMJKDNLE_02734  2       15       -4.905071  putative deoxygluconate dehydrogenase
CMJKDNLE_02735  -1      15       -3.967180  YqcE MFS transporter
CMJKDNLE_02736  0       19       -3.272357  putative kinase
CMJKDNLE_02737  0       22       -3.047881  hypothetical protein
CMJKDNLE_02738  1       27       -3.595549  small protein involved in the cell envelope
CMJKDNLE_02739  2       27       -2.924293  hypothetical protein
CMJKDNLE_02740  2       25       -0.240136  degradosome
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
CMJKDNLE_02732  TCRTETB  35     4e-04    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 35.2 bits (81), Expect = 4e-04
Identities = 45/314 (14%), Positives = 112/314 (35%), Gaps = 36/314 (11%)

Query: 93 LGSLVLGWISDHIGRQKIFTFSFLLITLASFLQFFATTP-EHLIGLRILIGIGLGGDYSV 151
+G+ V G +SD +G +++ F ++ S + F + LI R + G G ++
Sbjct: 64 IGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPAL 123

Query: 152 GHTLLAEFSPRRHRGILLGAFSVVWT----VGYVLASIAGHHFISENPEAWRWLLASAAL 207
++A + P+ +RG G + VG + + H+ W +LL +
Sbjct: 124 VMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYI------HWSYLLLIPMI 177

Query: 208 PALLITLLRWGTPESPRWLLRQGRFAEAHAIVHRYFGPHVLLGDEVVTATHKHIKTLF-- 265
+ + L + R +G F I+ +L + + + L
Sbjct: 178 TIITVPFLMKLLKKEVR---IKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFL 234

Query: 266 -SSRYWRRTA--------FNSVFFVCLVIPWFVIYT----WLPTIAQTIGLEDALTASLM 312
++ R+ ++ F+ V+ +I+ ++ + + L+ + +
Sbjct: 235 IFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEI 294

Query: 313 LNALLIVGALLGLV-------LTHLLAHRKFLLGSFLLLAATLVVMACLPSGSSLTLLLF 365
+ ++ G + ++ L L L+ + + + L +S + +
Sbjct: 295 GSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTII 354

Query: 366 VLFSTTISAVSNLV 379
++F + + V
Sbjct: 355 IVFVLGGLSFTKTV 368


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_02734  DHBDHDRGNASE  102    4e-28    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 102 bits (255), Expect = 4e-28
Identities = 73/257 (28%), Positives = 116/257 (45%), Gaps = 11/257 (4%)

Query: 11 MDFFSLKGKTAIVTGGNSGLGQAFAMALAKAGANIFIPSFVKDNGETKEMIEK-QGVEVD 69
M+ ++GK A +TG G+G+A A LA GA+I + + E K + +
Sbjct: 1 MNAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAE 60

Query: 70 FMQVGITAEGAPQKIIAACCERFGTVDILVNNAGICKLNKVLDFGRADWDPMIDVNLTAA 129
+ A +I A G +DILVN AG+ + + +W+ VN T
Sbjct: 61 AFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGV 120

Query: 130 FELSYEAAKIMIPQKSGKIINICSLFSYLGGQWSPAYSATKHALAGFTKAYCDELGQYNI 189
F S +K M+ ++SG I+ + S + + AY+++K A FTK EL +YNI
Sbjct: 121 FNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNI 180

Query: 190 QVNGIAPGYYATDI--TLATRSNPETNQRVLDH-------IPANRWGDTQDLMGAAVFLA 240
+ N ++PG TD+ +L N Q + IP + D+ A +FL
Sbjct: 181 RCNIVSPGSTETDMQWSLWADENGAE-QVIKGSLETFKTGIPLKKLAKPSDIADAVLFLV 239

Query: 241 SPASNYVNGHLLVVDGG 257
S + ++ H L VDGG
Sbjct: 240 SGQAGHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
CMJKDNLE_02735  TCRTETA  30     0.018    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 30.2 bits (68), Expect = 0.018
Identities = 22/103 (21%), Positives = 45/103 (43%), Gaps = 8/103 (7%)

Query: 48 GLIMSTFGIAAIILYAPSGVIADKFSHRKMITSAMIITGLLGLLMATYPPLWVMLCIQIA 107
G++++ + + G ++D+F R ++ ++ + +MAT P LWV+ +I
Sbjct: 46 GILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIV 105

Query: 108 FAITTILMLWSVSIKAASLLGD---HSEQGKIMGWMEGLRGVG 147
IT + A + + D E+ + G+M G G
Sbjct: 106 AGITG-----ATGAVAGAYIADITDGDERARHFGFMSACFGFG 143


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
CMJKDNLE_02739  cloacin  33     0.001    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 33.1 bits (75), Expect = 0.001
Identities = 15/36 (41%), Positives = 20/36 (55%)

Query: 253 ASGRSYHSDNSGSAGGSDSGGFSGGGGSSGGGGASG 288
+ G + S+N+ GGS SG GGG G GG +G
Sbjct: 34 SDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNG 69



Score = 30.8 bits (69), Expect = 0.006
Identities = 11/34 (32%), Positives = 14/34 (41%)

Query: 254 SGRSYHSDNSGSAGGSDSGGFSGGGGSSGGGGAS 287
SG H G G SGGG +GG ++
Sbjct: 50 SGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSA 83



Score = 30.1 bits (67), Expect = 0.012
Identities = 11/30 (36%), Positives = 11/30 (36%)

Query: 259 HSDNSGSAGGSDSGGFSGGGGSSGGGGASG 288
GGS G G G S GG G G
Sbjct: 50 SGSGIHWGGGSGHGNGGGNGNSGGGSGTGG 79



Score = 29.7 bits (66), Expect = 0.013
Identities = 12/34 (35%), Positives = 17/34 (50%), Gaps = 1/34 (2%)

Query: 255 GRSYHSDNSGSAGGSDSGGFSGGGGSSGGGGASG 288
GR H+ + S G+ +GG +G G G SG
Sbjct: 6 GRG-HNTGAHSTSGNINGGPTGLGVGGGASDGSG 38


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_02740  ANTHRAXTOXNA  29     0.038    Anthrax toxin LF subunit signature.
		>ANTHRAXTOXNA#Anthrax toxin LF subunit signature.

Length = 800

Score = 29.3 bits (65), Expect = 0.038
Identities = 31/132 (23%), Positives = 51/132 (38%), Gaps = 9/132 (6%)

Query: 211 GYAPNLGSNAEALAVIAEAVKAAGYELGKDITLAMDCAASEFYKDGKYVLA-----GEGN 265
P L N + A+ +E K YE+GK I+L + + ++ + +
Sbjct: 147 RETPKLIINIKDYAINSEQSKEVYYEIGKGISLDIISKDKSLDPEFLNLIKSLSDDSDSS 206

Query: 266 KAFTSEEFTHFLEELTKQYPIVSIEDGLDESDW---DGFAYQTKVLG-DKIQLVGDDLFV 321
S++F LE K I I++ L E F+Y ++L D+F
Sbjct: 207 DLLFSQKFKEKLELNNKSIDINFIKENLTEFQHAFSLAFSYYFAPDHRTVLELYAPDMFE 266

Query: 322 TNTKILKEGIEK 333
K+ K G EK
Sbjct: 267 YMNKLEKGGFEK 278


92  CMJKDNLE_02964  CMJKDNLE_02975  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias   Product
CMJKDNLE_02964  0       20       4.766340  putative protein secretion protein for export
CMJKDNLE_02965  0       23       5.474638  putative protein secretion protein for export
CMJKDNLE_02966  -1      19       4.475288  putative protein secretion protein for export
CMJKDNLE_02967  -1      16       3.894035  putative protein secretion protein for export
CMJKDNLE_02968  -1      15       3.374048  putative protein secretion protein for export
CMJKDNLE_02969  -2      13       2.249996  putative protein secretion protein for export
CMJKDNLE_02970  -1      13       1.207498  putative protein secretion protein for export
CMJKDNLE_02971  -1      13       0.489515  putative protein secretion protein for export
CMJKDNLE_02972  -3      12       0.210630  putative secretion pathway protein, C-type
CMJKDNLE_02973  -2      12       0.397690  putative lipoprotein
CMJKDNLE_02974  -3      12       1.076708  prepilin peptidase
CMJKDNLE_02975  -3      13       1.583348  putative lipoprotein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_02964  BCTERIALGSPG  32     8e-04    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 31.8 bits (72), Expect = 8e-04
Identities = 18/48 (37%), Positives = 24/48 (50%), Gaps = 5/48 (10%)

Query: 1 MRRTR--AGFTLLEMLVAIAIFASLA-LMAQQVTNGVTR--VNSAVAD 43
MR T GFTLLE++V I I LA L+ + + AV+D
Sbjct: 1 MRATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSD 48


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_02965  BCTERIALGSPH  33     9e-05    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 33.4 bits (76), Expect = 9e-05
Identities = 13/24 (54%), Positives = 18/24 (75%)

Query: 2 KRGFTLLEVMLALAIFALAATAVL 25
+RGFTLLE+ML L + ++A VL
Sbjct: 3 QRGFTLLEMMLILLLMGVSAGMVL 26


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_02966  BCTERIALGSPH  74     4e-19    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 74.2 bits (182), Expect = 4e-19
Identities = 41/196 (20%), Positives = 69/196 (35%), Gaps = 41/196 (20%)

Query: 1 MPERGFTLLEIMLVIFLIGLASSGVVQTFATDSEPPAKKAAQDFLTRFAQFKDRAVIEGQ 60
M +RGFTLLE+ML++ L+G+++ V+ F + A + F + + R + GQ
Sbjct: 1 MRQRGFTLLEMMLILLLMGVSAGMVLLAFPASRDDSAAQTLARFEAQLRFVQQRGLQTGQ 60

Query: 61 TLGVLIDAPGYQFMQRRQGQWLPVSATRLSAQVTVPKQVQMLLQPGSDIWQKEYALELQR 120
GV + +QF+ + P D W L L+
Sbjct: 61 FFGVSVHPDRWQFLVLEARDGADPA-------------------PADDGWSGYRWLPLRA 101

Query: 121 RRL----TLHDIELEL-----QKEAKKKTPQIRFSPFEPATPFTLRFYSAAQNACWAVKL 171
R+ ++ +L L + P + P TPF L L
Sbjct: 102 GRVATSGSIAGGKLNLAFAQGEAWTPGDNPDVLIFPGGEMTPFRLT-------------L 148

Query: 172 AHDGALSLNQCDERMP 187
++ N E +P
Sbjct: 149 GEAPGIAFNARGESLP 164


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_02967  BCTERIALGSPG  218    2e-76    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 218 bits (556), Expect = 2e-76
Identities = 91/146 (62%), Positives = 109/146 (74%), Gaps = 3/146 (2%)

Query: 6 RTQKPRAGFTLLEVMVVIVILGVLASLVVPNLLGNKEKADRQKAISDIVALENALDMYRL 65
R + GFTLLE+MVVIVI+GVLASLVVPNL+GNKEKAD+QKA+SDIVALENALDMY+L
Sbjct: 2 RATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKL 61

Query: 66 DNGRYPTTEQGLEALIQQPANMADARNYRTGGYIKRLPKDPWGNDYQYLSPGEKGLFDVY 125
DN YPTT QGLE+L++ P A NY GYIKRLP DPWGNDY ++PGE G +D+
Sbjct: 62 DNHHYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDLL 121

Query: 126 TLGADGQENGEGAGADIGNWNLQEFQ 151
+ G DG+ E DI NW L + +
Sbjct: 122 SAGPDGEMGTED---DITNWGLSKKK 144


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_02968  BCTERIALGSPF  455    e-162    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 455 bits (1171), Expect = e-162
Identities = 226/406 (55%), Positives = 302/406 (74%), Gaps = 1/406 (0%)

Query: 1 MALFYYQALERNGRKTKGMIEADSARHARQLLRGKELIPVHI-EARMNTSSGGMLQRRRH 59
MA ++YQAL+ G+K +G EADSAR ARQLLR + L+P+ + E R + G
Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLR 60

Query: 60 AHRRVAAADLALFTRQLATLVQAAMPLETCLQAVSEQSEKLHVKSLGMALRSRIQEGYTL 119
R++ +DLAL TRQLATLV A+MPLE L AV++QSEK H+ L A+RS++ EG++L
Sbjct: 61 RKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSL 120

Query: 120 SDSLREHPRVFDSLFCSMVAAGEKSGHLDVVLNRLADYTEQRQRLKSRLLQAMLYPLVLL 179
+D+++ P F+ L+C+MVAAGE SGHLD VLNRLADYTEQRQ+++SR+ QAM+YP VL
Sbjct: 121 ADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLT 180

Query: 180 VVATGVVTILLTAVVPKIIEQFDHLGHALPVSTRTLIAMSDALQASGVYWLAGLLGLLVL 239
VVA VV+ILL+ VVPK++EQF H+ ALP+STR L+ MSDA++ G + L LL +
Sbjct: 181 VVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMA 240

Query: 240 GQRLLKNPAMRLRWDKTLLRLPVIGRVARGLNTARFSRTLSILTASSVPLLEGIQTAAAV 299
+ +L+ R+ + + LL LP+IGR+ARGLNTAR++RTLSIL AS+VPLL+ ++ + V
Sbjct: 241 FRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDV 300

Query: 300 SANRYVEQQLLLAADRVREGSSLRAALAELRLFPPMMLYMIASGEQSGELETMLEQAAVN 359
+N Y +L LA D VREG SL AL + LFPPMM +MIASGE+SGEL++MLE+AA N
Sbjct: 301 MSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADN 360

Query: 360 QEREFDTQVGLALGLFEPALVVMMAGVVLFIVIAILEPMLQLNNMV 405
Q+REF +Q+ LALGLFEP LVV MA VVLFIV+AIL+P+LQLN ++
Sbjct: 361 QDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLM 406


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_02971  BCTERIALGSPD  574    0.0      Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 574 bits (1482), Expect = 0.0
Identities = 295/668 (44%), Positives = 431/668 (64%), Gaps = 34/668 (5%)

Query: 24 LLPLVLAAALCSSPVWAEEATFTANFKDTDLKSFIETVGANLNKTIIMGPGVQGKVSIRT 83
L L++ AAL P AEE F+A+FK TD++ FI TV NLNKT+I+ P V+G +++R+
Sbjct: 11 SLTLLIFAALLFRPAAAEE--FSASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTITVRS 68

Query: 84 MTPLNERQYYQLFLNLLEAQGYAVVPMENDVLKVVKSSAAKVEPLPLVGEGSDNYAGDEM 143
LNE QYYQ FL++L+ G+AV+ M N VLKVV+S AK +P+ + + GDE+
Sbjct: 69 YDMLNEEQYYQFFLSVLDVYGFAVINMNNGVLKVVRSKDAKTAAVPVASDAAPG-IGDEV 127

Query: 144 VTKVVPVRNVSVRELAPILRQMIDSAGSGNVVNYDPSNVIMLTGRASVVERLTEVIQRVD 203
VT+VVP+ NV+ R+LAP+LRQ+ D+AG G+VV+Y+PSNV+++TGRA+V++RL +++RVD
Sbjct: 128 VTRVVPLTNVAARDLAPLLRQLNDNAGVGSVVHYEPSNVLLMTGRAAVIKRLLTIVERVD 187

Query: 204 HAGNRTEEVIPLDNASASEIARVLESLTKNSGENQ-PATLKSQIVADERTNSVIVSGDPA 262
+AG+R+ +PL ASA+++ +++ L K++ ++ P ++ + +VADERTN+V+VSG+P
Sbjct: 188 NAGDRSVVTVPLSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGEPN 247

Query: 263 TRDKMRRLIRRLDSEMERSGNSQVFYLKYSKAEDLVDVLKQVSGTLTAAKEEAEGTVGSG 322
+R ++ +I++LD + GN++V YLKY+KA DLV+VL +S T+ + K+ A+ +
Sbjct: 248 SRQRIIAMIKQLDRQQATQGNTKVIYLKYAKASDLVEVLTGISSTMQSEKQAAKPV-AAL 306

Query: 323 REVVSIAASKHSNALIVTAPQDIMQSLQSVIEQLDIRRAQVHVEALIVEVAEGSNINFGV 382
+ + I A +NALIVTA D+M L+ VI QLDIRR QV VEA+I EV + +N G+
Sbjct: 307 DKNIIIKAHGQTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLGI 366

Query: 383 QWASKDAGLMQFANGTQIPIGTLGAAISQAKPQKGSTVISENGATTINPDTNGDLST-LA 441
QWA+K+AG+ QF N + +PI T A + +G +S+ LA
Sbjct: 367 QWANKNAGMTQFTN-SGLPISTAIAG-------------------ANQYNKDGTVSSSLA 406

Query: 442 QLLSGFSGTAVGVVKGDWMALVQAVKNDSSSNVLSTPSITTLDNQEAFFMVGQDVPVLTG 501
LS F+G A G +G+W L+ A+ + + +++L+TPSI TLDN EA F VGQ+VPVLTG
Sbjct: 407 SALSSFNGIAAGFYQGNWAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVPVLTG 466

Query: 502 STVGSNNSNPFNTVERKKVGIMLKVTPQINEGNAVQMVIEQEVSKVEGQTS-----LDVV 556
S S N FNTVERK VGI LKV PQINEG++V + IEQEVS V S L
Sbjct: 467 SQTTSG-DNIFNTVERKTVGIKLKVKPQINEGDSVLLEIEQEVSSVADAASSTSSDLGAT 525

Query: 557 FGERKLKTTVLANDGELIVLGGLMDDQAGESVAKVPLLGDIPLIGNLFKSTADKKEKRNL 616
F R + VL GE +V+GGL+D ++ KVPLLGDIP+IG LF+ST+ K KRNL
Sbjct: 526 FNTRTVNNAVLVGSGETVVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRNL 585

Query: 617 MVFIRPTILRDGMAADGVSQRKYNYMRAEQIYR--DEQGLSLMPHTAQPVLPAQNQALPP 674
M+FIRPT++RD S +Y Q + E +++ + P Q+ A
Sbjct: 586 MLFIRPTVIRDRDEYRQASSGQYTAFNDAQSKQRGKENNDAMLNQDLLEIYPRQDTAAFR 645

Query: 675 EVRAFLNA 682
+V A ++A
Sbjct: 646 QVSAAIDA 653


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_02972  BCTERIALGSPC  101    6e-28    Bacterial general secretion pathway protein C signa...
		>BCTERIALGSPC#Bacterial general secretion pathway protein C signature.

Length = 272

Score = 101 bits (254), Expect = 6e-28
Identities = 66/278 (23%), Positives = 108/278 (38%), Gaps = 38/278 (13%)

Query: 1 MLLIISAKMAHSLWRYISFSAEYTA-VSQPVNKPSRVDAKTFDKNDVQLISQQNWFGKYQ 59
++L+ ++A WR A VS P++ + ND L FG
Sbjct: 22 LMLLFCQQLAMIFWR---IGLPDNAPVSSVQITPAQARQQPVTLNDFTL------FGVSP 72

Query: 60 PV--AAQVKQPEPVPVAETRLNVVLRGIAFG---ARPGAVIEEGGKQQVYLQGETLGSHN 114
A + + + + LN+ L G+ G +R A+I + +Q E + +N
Sbjct: 73 EKNKAGALDASQMSNLPPSTLNLSLTGVMAGDDDSRSIAIISKDNEQFSRGVNEEVPGYN 132

Query: 115 AVIEEINRDHVMLRYQGKIERLSLAEEERSTVAVTNKKAVSDEAKQAVAEPAVSVPVEIP 174
A I I D V+L+YQG+ E L L +E S SD A +
Sbjct: 133 AKIVSIRPDRVVLQYQGRYEVLGLYSQEDSG---------SDGVPGAQVNEQLQ------ 177

Query: 175 AAVRQALAKDPQKIFNYIQLTPVRKEG-IVGYAAKPGADRSLFDASGFKEGDIAIALNQQ 233
+ + +Y+ +P+ + + GY PG F G ++ D+A+ALN
Sbjct: 178 -------QRASTTMSDYVSFSPIMNDNKLQGYRLNPGPKSDSFYRVGLQDNDMAVALNGL 230

Query: 234 DFTDPRAMIALMRQLPSMDSIQLTVLRKGARHDISIAL 271
D D M ++ + + LTV R G R DI +
Sbjct: 231 DLRDAEQAKKAMERMADVHNFTLTVERDGQRQDIYMEF 268


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_02974  PREPILNPTASE  282    8e-98    Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 282 bits (723), Expect = 8e-98
Identities = 110/274 (40%), Positives = 150/274 (54%), Gaps = 12/274 (4%)

Query: 1 MLFDVFQQYPTAMPVLATVGGLIIGSFLNVVIWRYPIML-RQQMAEFHGEMSSAQSKI-- 57
+L ++ P L + L+IGSFLNVVI R PIML R+ AE+ + +
Sbjct: 3 LLLELAHGLPWLYFSLVFLFSLMIGSFLNVVIHRLPIMLEREWQAEYRSYFNPDDEGVDE 62

Query: 58 ---SLALPRSHCPHCQQTIRIRDNIPLFSWLMLKGRCRDCQAKISKRYPLVELLTALAFL 114
+L +PRS CPHC I +NIPL SWL L+GRCR CQA IS RYPLVELLTAL +
Sbjct: 63 PPYNLMVPRSCCPHCNHPITALENIPLLSWLWLRGRCRGCQAPISARYPLVELLTALLSV 122

Query: 115 LASLVWPESGWGLAVMILSAWLIAASVIDLDHQWLPDVFTQGVLWTGLIAAWAQQSPLTL 174
++ LA ++L+ L+A + IDLD LPD T +LW GL+ ++L
Sbjct: 123 AVAMTLAPGWGTLAALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNL-LGGFVSL 181

Query: 175 QDAVTGVLVGFITFYSLRWIAGIVLRKEALGMGDVLLFAALGGWVGALSLPNVALIASCC 234
DAV G + G++ +SL W ++ KE +G GD L AALG W+G +LP V L++S
Sbjct: 182 GDAVIGAMAGYLVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSLV 241

Query: 235 GLIYAVI-----TKRGSTTLPFGPCLSLGGIATL 263
G + S +PFGP L++ G L
Sbjct: 242 GAFMGIGLILLRNHHQSKPIPFGPYLAIAGWIAL 275


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
CMJKDNLE_02975  PF03544  49     4e-08    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 49.2 bits (117), Expect = 4e-08
Identities = 24/60 (40%), Positives = 30/60 (50%), Gaps = 3/60 (5%)

Query: 32 SSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEP---IPDPEPTPEPEPEPVP 88
S T + V+P P P EP PEP P PEP E I P+P P+P+P+PV
Sbjct: 50 ISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVK 109



Score = 41.9 bits (98), Expect = 9e-06
Identities = 16/92 (17%), Positives = 27/92 (29%), Gaps = 2/92 (2%)

Query: 33 SDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPVPTKTG 92
+D P + PE +P P PEP PEP + E + V
Sbjct: 58 ADLEPPQAVQ-PPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKR 116

Query: 93 YLTLGGSQRVTGATCNGESSDGFTFKPGEDVT 124
+ S+ + N + + +
Sbjct: 117 DVKPVESRPASPFE-NTAPARPTSSTATAATS 147



Score = 40.7 bits (95), Expect = 2e-05
Identities = 18/59 (30%), Positives = 23/59 (38%), Gaps = 2/59 (3%)

Query: 35 TPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDP--EPTPEPEPEPVPTKT 91
P + +P +P PEP +PEP PEPIP+P E E K
Sbjct: 45 APAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKP 103



Score = 40.7 bits (95), Expect = 2e-05
Identities = 20/96 (20%), Positives = 28/96 (29%), Gaps = 7/96 (7%)

Query: 29 SGSSSDTPPVDSGTGSLPEVKPDPTPNPEPT---PEPTPDPEPTPEPIPDPEPTPEPEPE 85
+ P V PE +P P P E +P P P+P P+P+ E
Sbjct: 65 AVQPPPEPVV----EPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKP 120

Query: 86 PVPTKTGYLTLGGSQRVTGATCNGESSDGFTFKPGE 121
R T +T +S T
Sbjct: 121 VESRPASPFENTAPARPTSSTATAATSKPVTSVASG 156



Score = 35.0 bits (80), Expect = 0.001
Identities = 17/40 (42%), Positives = 17/40 (42%)

Query: 50 PDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPVPT 89
P P T D EP P PEP EPEPEP P
Sbjct: 44 PAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPI 83



Score = 30.7 bits (69), Expect = 0.039
Identities = 11/40 (27%), Positives = 13/40 (32%)

Query: 52 PTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPVPTKT 91
P P + + P P P P EPEP P
Sbjct: 44 PAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPI 83


93  CMJKDNLE_03054  CMJKDNLE_03064  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
CMJKDNLE_03054  4       37       -10.680951  putative fimbrial-like adhesin protein
CMJKDNLE_03055  4       38       -10.781543  putative membrane protein
CMJKDNLE_03056  1       31       -7.873461   putative membrane protein
CMJKDNLE_03057  0       21       -4.733464   putative membrane protein
CMJKDNLE_03058  0       9        -0.744012   protein involved in detoxification of
CMJKDNLE_03059  0       8        1.445969    putative glycogen synthesis protein
CMJKDNLE_03060  1       8        1.586086    putative oxidoreductase
CMJKDNLE_03061  1       9        2.720230    putative membrane protein
CMJKDNLE_03064  -1      13       3.348780    fused heptose 7-phosphate kinase/heptose
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03054  FIMBRIALPAPE  28     0.015    Escherichia coli: P pili tip fibrillum papE protein...
		>FIMBRIALPAPE#Escherichia coli: P pili tip fibrillum papE protein signature.

Length = 173

Score = 28.1 bits (62), Expect = 0.015
Identities = 36/163 (22%), Positives = 66/163 (40%), Gaps = 35/163 (21%)

Query: 14 AMILSNNVFADEGHGIVKFKGEVISAPCSIKPGDEDLTVNLGEVADTVLKSDQKSLAE-- 71
A+++S +V A + + FKG++I C++ ++ VN G++ L + +
Sbjct: 15 AVLMSQHVHAADN---LTFKGKLIIPACTV----QNAEVNWGDIEIQNLVQSGGNQKDFT 67

Query: 72 -----PFTIHLQDCMLSQGGTTYSKAKVTFTTANTMTGQSDLLKNTKETEIGGATGVGVR 126
P+++ ++ G T + V T+ + G L N+ + IG A
Sbjct: 68 VDMNCPYSLGTMKVTITSNGQTGNSILVPNTSTASGDGLLIYLYNSNNSGIGNA------ 121

Query: 127 ILDSQSGEVTLGTPVV---ITFNNTNS----YQELNFKARMES 162
VTLG+ V IT Y +L +K M+S
Sbjct: 122 --------VTLGSQVTPGKITGTAPARKITLYAKLGYKGNMQS 156


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03055  PF00577       201    2e-61    Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 201 bits (513), Expect = 2e-61
Identities = 66/251 (26%), Positives = 111/251 (44%), Gaps = 12/251 (4%)

Query: 22 CSLSVIIIGCA-------SAYAVEFNKDLIEAEDRENVNLSQFETDGQLPVGKYSLSTLI 74
+ + CA S+ + FN + + + +LS+FE +LP G Y + +
Sbjct: 25 GFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPGTYRVDIYL 84

Query: 75 NNKRTPIHLDLQWVLIDN--QTAVCVTPEQLTLLGFTDEFIEKTQQNLIDGCYPIEK-EK 131
NN D+ + D+ C+T QL +G + D C P+
Sbjct: 85 NNGYMA-TRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVPLTSMIH 143

Query: 132 QITTYLDKGKMQLSISAPQAWLKYKDANWTPPELWNHGIAGAFLDYNLYASHYAPHQGDN 191
T LD G+ +L+++ PQA++ + + PPELW+ GI L+YN + G N
Sbjct: 144 DATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQNRIGGN 203

Query: 192 SQNISSYGQAGVNLGAWRLRTDYQYDQSFNNGKS-QATNLDFPRIYLFRPIPAMNAKLTI 250
S Q+G+N+GAWRLR + + + ++ S +L R I + ++LT+
Sbjct: 204 SHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRSRLTL 263

Query: 251 GQYDTESSIFD 261
G T+ IFD
Sbjct: 264 GDGYTQGDIFD 274


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03056  PF00577       422    e-140    Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 422 bits (1086), Expect = e-140
Identities = 152/577 (26%), Positives = 266/577 (46%), Gaps = 51/577 (8%)

Query: 1 MLPPDLRGYAPQITGVAQTNAKVTVSQNNRIIYQENVPPGPFAITNLFNT-LQGQLDVKV 59
MLP RG+AP I G+A+ A+VT+ QN IY VPPGPF I +++ G L V +
Sbjct: 289 MLPDSQRGFAPVIHGIARGTAQVTIKQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTI 348

Query: 60 EEEDGRVTQWQVASNSIPYLTRKGQIRYTTAMGKPTSVGGDSLQQPFFWTGEFSWGWLNN 119
+E DG + V +S+P L R+G RY+ G+ S G ++P F+ G
Sbjct: 349 KEADGSTQIFTVPYSSVPLLQREGHTRYSITAGEYRS-GNAQQEKPRFFQSTLLHGLPAG 407

Query: 120 VSLYGGSVLTNRDYQSLAAGVGFNLNSLGSLSFDVTRSDAQLHNQDKETGYSYRANYSKR 179
++YGG+ L +R Y++ G+G N+ +LG+LS D+T++++ L + + G S R Y+K
Sbjct: 408 WTIYGGTQLADR-YRAFNFGIGKNMGALGALSVDMTQANSTLPDDSQHDGQSVRFLYNKS 466

Query: 180 FESTGSQLTFAGYRFSDKNFVTMNEYIND--------------------TNHYTNYQNEK 219
+G+ + GYR+S + + T++Y N++
Sbjct: 467 LNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKR 526

Query: 220 ESYIVTFNQYLESLRLNTYVSLARNTYWDAS-SNVNYSLSLSRDFDIGPLKNVSTSLTFS 278
+T Q L Y+S + TYW S + + L+ F ++++ +L++S
Sbjct: 527 GKLQLTVTQQLGR-TSTLYLSGSHQTYWGTSNVDEQFQAGLNTAF-----EDINWTLSYS 580

Query: 279 RIN--WEEDNQDQLYLNISIPWGTSR-----------TLSYGMQRNQDNEISHTASWYDS 325
W++ L LN++IP+ + SY M + + +++ A Y +
Sbjct: 581 LTKNAWQKGRDQMLALNVNIPFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGT 640

Query: 326 --SDRNNSWSVSASGDNDEFKDMKASLRASYQHNTENGRLYLSGTSQRDSYYSLNASWNG 383
D N S+SV + ++ A+ + G + + D L +G
Sbjct: 641 LLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYGNANIGYSHSDD-IKQLYYGVSG 699

Query: 384 SFTATRHGAAFHDYSGSADSRFMIDADGTEDIPLNNKRAV-TNRYGIGVIPSVSSYITTS 442
A +G D+ ++ A G +D + N+ V T+ G V+P + Y
Sbjct: 700 GVLAHANGVTLGQPLN--DTVVLVKAPGAKDAKVENQTGVRTDWRGYAVLPYATEYRENR 757

Query: 443 LSVDTRNLPENVDIENSVITTTLTEGAIGYAKLDTRKGYQIIGVIRLADGSHPPLGISVK 502
+++DT L +NVD++N+V T GAI A+ R G +++ + + P G V
Sbjct: 758 VALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKARVGIKLLMTLT-HNNKPLPFGAMVT 816

Query: 503 DETSHKELGLVADGGFVYLNGIQDDNKLALRWGDKSC 539
E+S + G+VAD G VYL+G+ K+ ++WG++
Sbjct: 817 SESS-QSSGIVADNGQVYLSGMPLAGKVQVKWGEEEN 852


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03061  IGASERPTASE   52     7e-09    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 51.6 bits (123), Expect = 7e-09
Identities = 47/287 (16%), Positives = 92/287 (32%), Gaps = 16/287 (5%)

Query: 197 PNNAFDAEGLTKLTQETERRRRERNEVEQDVEVAVREKNRDALSRKLEIEQQEAFMTLEQ 256
N A+ + + E R A + + ++ E +QE+ +
Sbjct: 999 TPNNIQAD-VPSVPSNNEEIARVDEAPVPPPAPATPSETTETVA---ENSKQESKTVEKN 1054

Query: 257 EQQVKTRTAEQNARIAAFEAERRREAE-QTRILAERQIQETEIDREQAVRSRKVEAEREV 315
EQ TA+ R A EA+ +A QT +A+ + E + + VE E +
Sbjct: 1055 EQDATETTAQN--REVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKA 1112

Query: 316 RIKEIEQQQVTEIANQTKSIAIAAKSEQ---QSQAEARANLALAEAVSAQQNVETTRQTA 372
+++ + Q+V ++ +Q +++ Q + E + + E S T Q A
Sbjct: 1113 KVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPA 1172

Query: 373 EADRAKQVALIAAAQDAET------KAVELTVRAKAEKEAAEMQAAAIVELAEATRKKGL 426
+ + + + T T +E + R
Sbjct: 1173 KETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPH 1232

Query: 427 AEAEAQRALNDAINVLSDEQTSLKFKLALLQALPAVIEKSVEPMKSI 473
A + ND V + TS L A ++ K++
Sbjct: 1233 NVEPATTSSNDRSTVALCDLTSTNTNAVLSDARAKAQFVALNVGKAV 1279


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03064  LPSBIOSNTHSS  29     0.028    Lipopolysaccharide core biosynthesis protein signat...
		>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein signature.

Length = 166

Score = 29.0 bits (65), Expect = 0.028
Identities = 10/37 (27%), Positives = 20/37 (54%)

Query: 347 GVFDILHAGHVSYLANARKLGDRLIVAVNSDASTKRL 383
G FD + GH+ + +L D++ VAV + + + +
Sbjct: 7 GSFDPITFGHLDIIERGCRLFDQVYVAVLRNPNKQPM 43


94  CMJKDNLE_03160  CMJKDNLE_03168  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
CMJKDNLE_03160  0       13       -1.138195   putative outer membrane export usher protein
CMJKDNLE_03161  0       11       0.672042    putative fimbrial protein
CMJKDNLE_03162  0       12       2.577130    16S rRNA 2'-O-ribose methyltransferase
CMJKDNLE_03163  0       13       2.607466    outer membrane lipoprotein - activator of PBP1A
CMJKDNLE_03164  0       17       2.277797    hypothetical protein
CMJKDNLE_03165  1       18       2.198701    DnaA initiator-associating factor for
CMJKDNLE_03166  1       19       2.876145    lipoprotein
CMJKDNLE_03167  0       19       3.063194    putative permease
CMJKDNLE_03168  0       20       1.516717    putative nucleoside-diphosphate-sugar epimerase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03160  PF00577       773    0.0      Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 773 bits (1998), Expect = 0.0
Identities = 317/849 (37%), Positives = 469/849 (55%), Gaps = 48/849 (5%)

Query: 31 SGMLCTTANAEEYYFDPIMLETTKSGMQTTDLSRFSKKYAQLPGTYQVDIWLNKKKVSQK 90
+ ++ E YF+P L DLSRF PGTY+VDI+LN ++ +
Sbjct: 35 AFAAQAPLSSAELYFNPRFLAD--DPQAVADLSRFENGQELPPGTYRVDIYLNNGYMATR 92

Query: 91 KITFTAN-AEQLLQPQFTVEQLRELGIKVDEIPALAEKDDDSVINSLEQIIPGTAAEFDF 149
+TF +EQ + P T QL +G+ + + DD+ + L +I A+ D
Sbjct: 93 DVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVP-LTSMIHDATAQLDV 151

Query: 150 NHQQLNLSIPQIALYRDARGYVSPSRWDDGIPTLFTNYSFTGSDNRYRQGNRSQRQYLNM 209
Q+LNL+IPQ + ARGY+ P WD GI NY+F+G+ + R G S YLN+
Sbjct: 152 GQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQNRIGGNSHYAYLNL 211

Query: 210 QNGANFGPWRLRNYSTWTRNDQTSS------WNTISSYLQRDIKALKSQLLLGESATSGS 263
Q+G N G WRLR+ +TW+ N SS W I+++L+RDI L+S+L LG+ T G
Sbjct: 212 QSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRSRLTLGDGYTQGD 271

Query: 264 IFSSYTFTGVQLASDDNMLPNSQRGFAPTVRGIANSSAIVTIRQNGYVIYQSNVSAGAFE 323
IF F G QLASDDNMLP+SQRGFAP + GIA +A VTI+QNGY IY S V G F
Sbjct: 272 IFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDIYNSTVPPGPFT 331

Query: 324 INDLYPSSNSGDLEVTIEESDGTQRRFIQPYSSLPMMQRPGHLKYSATAGRYRADANSDS 383
IND+Y + NSGDL+VTI+E+DG+ + F PYSS+P++QR GH +YS TAG YR+
Sbjct: 332 INDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSITAGEYRSGNAQQE 391

Query: 384 KEPEFAEATAIYGLNNTFTLYGGLLGSEDYYALGIGIGGTLGALGALSMDINRADTQFDN 443
K P F ++T ++GL +T+YGG ++ Y A GIG +GALGALS+D+ +A++ +
Sbjct: 392 K-PRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGALSVDMTQANSTLPD 450

Query: 444 QHSFHGYQWRTQYIKDIPETNTNIAVSYYRYTNDGYFSFNEA------------------ 485
G R Y K + E+ TNI + YRY+ GYF+F +
Sbjct: 451 DSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGYNIETQDGVIQ 510

Query: 486 ----NTRNWDYNSRQKSEIQFNISQTIFDGVSLYASGSQQDYWGNNDKNRNISVGVSGQQ 541
T ++ ++ ++Q ++Q + +LY SGS Q YWG ++ + G++
Sbjct: 511 VKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQTYWGTSNVDEQFQAGLNTAF 570

Query: 542 WGVGYSLNYQYSRYTDQN-NDRALSLNLSIPLERWLPRSR--------VSYQMTSQKDRP 592
+ ++L+Y ++ Q D+ L+LN++IP WL SY M+ +
Sbjct: 571 EDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSDSKSQWRHASASYSMSHDLNGR 630

Query: 593 TQHEMRLDGSLLDDGRLSYSLEQSLDDDNNHNS----SLNASYRSPYGTFSAGYSYGNDS 648
+ + G+LL+D LSYS++ + NS +YR YG + GYS+ +D
Sbjct: 631 MTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYGNANIGYSHSDDI 690

Query: 649 SQYNYGVTGGVVIHPHGVTLSQYLGNAFALIDANGASGVRIQNYPGIATDPFGYAVVPYL 708
Q YGV+GGV+ H +GVTL Q L + L+ A GA +++N G+ TD GYAV+PY
Sbjct: 691 KQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDAKVENQTGVRTDWRGYAVLPYA 750

Query: 709 TTYQENRLSVDTTQLPDNVDLEQTTQFVVPNRGAMVAARFNANIGYRVLVTVSDRNGKPL 768
T Y+ENR+++DT L DNVDL+ VVP RGA+V A F A +G ++L+T+ N KPL
Sbjct: 751 TEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKARVGIKLLMTL-THNNKPL 809

Query: 769 PFGALASNDDTGQQSIVDEGGILYLSGISSKSQSWTVRWGNQADQQCQFAFSTPDSEPTT 828
PFGA+ +++ + IV + G +YLSG+ + V+WG + + C + P
Sbjct: 810 PFGAMVTSESSQSSGIVADNGQVYLSGMPLAGK-VQVKWGEEENAHCVANYQLPPESQQQ 868

Query: 829 SVLQGTAQC 837
+ Q +A+C
Sbjct: 869 LLTQLSAEC 877


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03161  FIMBRIALPAPF  30     0.011    Escherichia coli: P pili tip fibrillum papF protein...
		>FIMBRIALPAPF#Escherichia coli: P pili tip fibrillum papF protein signature.

Length = 167

Score = 29.7 bits (66), Expect = 0.011
Identities = 42/160 (26%), Positives = 67/160 (41%), Gaps = 21/160 (13%)

Query: 208 VKLSIQGNLTAPQSCKINQGDVIKVNFGFINGQKFTTRNAMPDGFTPVDFDITYDCGDTS 267
V+++I+GN+ P C IN G I V+FG IN + V +I+ C S
Sbjct: 21 VQINIRGNVYIP-PCTINNGQNIVVDFGNINPEHVDNSRG------EVTKNISISCPYKS 73

Query: 268 KIKNSLQMRIDGTTGVVDQYNLVARRRSSDNVPDVGIRIENLGGGVANIPFQNG------ 321
SL +++ G T V Q N++A N+ GI + G + NG
Sbjct: 74 ---GSLWIKVTGNTMGVGQNNVLA-----TNITHFGIALYQGKGMSTPLTLGNGSGNGYR 125

Query: 322 ILPVDPSGHGTVNMRAWPVNLVGGELETGKFQGTATITVI 361
+ + T + P G L G F+ TA++++I
Sbjct: 126 VTAGLDTARSTFTFTSVPFRNGSGILNGGDFRTTASMSMI 165


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03163  BINARYTOXINB  30     0.029    Binary toxin B family signature.
		>BINARYTOXINB#Binary toxin B family signature.

Length = 764

Score = 30.4 bits (68), Expect = 0.029
Identities = 11/72 (15%), Positives = 24/72 (33%), Gaps = 4/72 (5%)

Query: 487 AGVNGGSGIALTGSPITLRATTDSGMTTNNPTLQTTPTDDQFTNNGGRVDAVYIVATPGE 546
+ V+G + + + I + + ++ T D + G R A + +
Sbjct: 330 SEVHGNAEVHASFFDIGGSVSAGFSNSNSS----TVAIDHSLSLAGERTWAETMGLNTAD 385

Query: 547 IAFIKPMIAMRN 558
A + I N
Sbjct: 386 TARLNANIRYVN 397


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03165  RTXTOXINA     28     0.036    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 27.6 bits (61), Expect = 0.036
Identities = 26/111 (23%), Positives = 44/111 (39%), Gaps = 22/111 (19%)

Query: 42 NKILCCGNGTSAANAQHFAASMINRFETERPSLPAIALNTDNVVLTAIA-------NDRL 94
K+L GN + A T + IA + V AI+ D+
Sbjct: 277 TKVL--GNVGKGISQYIIAQRAAQGLSTSAAAAGLIA----SAVTLAISPLSFLSIADKF 330

Query: 95 HD----EVYAKQVRALGHAGDVLLAISTRGNSRDIVKAVEAAVTRDMTIVA 141
E Y+++ + LG+ GD LLA + A++A++T T++A
Sbjct: 331 KRANKIEEYSQRFKKLGYDGDSLLAAFHKETG-----AIDASLTTISTVLA 376


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03168  NUCEPIMERASE  29     0.014    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 29.0 bits (65), Expect = 0.014
Identities = 8/22 (36%), Positives = 13/22 (59%)

Query: 4 VLITGATGLVGGHLLRMLINEP 25
L+TGA G +G H+ + L+
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAG 24


95  CMJKDNLE_03258  CMJKDNLE_03265  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
CMJKDNLE_03258  -1      16       -0.668366   serine endoprotease, periplasmic
CMJKDNLE_03259  -2      13       -0.325339   DegS serine endoprotease
CMJKDNLE_03260  -1      12       -0.183594   malate dehydrogenase
CMJKDNLE_03261  -2      12       -0.532749   ArgR-arg
CMJKDNLE_03262  -2      14       0.083318    stress-induced protein
CMJKDNLE_03263  -1      12       0.764252    putative barnase inhibitor
CMJKDNLE_03264  -3      10       1.330689    hydroxylated, aromatic carboxylic acid efflux
CMJKDNLE_03265  -1      9        1.275467    hydroxylated, aromatic carboxylic acid efflux
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03258  V8PROTEASE    72     6e-16    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 72.0 bits (176), Expect = 6e-16
Identities = 32/184 (17%), Positives = 63/184 (34%), Gaps = 38/184 (20%)

Query: 90 GLGSGVIINASKGYVLTNNHVINQAQKISIQL------------NDGREFDAKLIGSDDQ 137
+ SGV++ K +LTN HV++ L +G ++ +
Sbjct: 102 FIASGVVVG--KDTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGE 159

Query: 138 SDIALLQIQN-------PSKLTQIAIADSDKLRVGDFAVAVGNPFGLGQTATSGIVSALG 190
D+A+++ + ++++ + +V G P V+ +
Sbjct: 160 GDLAIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQNITVTGYPGDKP-------VATMW 212

Query: 191 RSGLNLEGLEN-FIQTDASINRGNSGGALLNLNGELIGINTAILAPGGGSVGIGFAIPSN 249
S + L+ +Q D S GNSG + N E+IGI+ G+
Sbjct: 213 ESKGKITYLKGEAMQYDLSTTGGNSGSPVFNEKNEVIGIHWG---------GVPNEFNGA 263

Query: 250 MART 253
+
Sbjct: 264 VFIN 267


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03259  V8PROTEASE    53     8e-10    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 52.7 bits (126), Expect = 8e-10
Identities = 31/160 (19%), Positives = 59/160 (36%), Gaps = 26/160 (16%)

Query: 77 RTLGSGVIMDQRGYIITNKHVINDADQIIVALQ------------DGRVFEALLVGSDSL 124
+ SGV++ + ++TNKHV++ AL+ +G +
Sbjct: 101 TFIASGVVVG-KDTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGE 159

Query: 125 TDLAVLKI-------NATGGLPTIPINARRVPHIGDVVLAIGNPYNLGQTITQGIISATG 177
DLA++K + + ++ + + G P + T + G
Sbjct: 160 GDLAIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQNITVTGYPGD-KPVATMW--ESKG 216

Query: 178 RIGLNPTGRQNFLQTDASINHGNSGGALVNSLGELMGINT 217
+I + +Q D S GNSG + N E++GI+
Sbjct: 217 KI---TYLKGEAMQYDLSTTGGNSGSPVFNEKNEVIGIHW 253


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03260  DHBDHDRGNASE  28     0.045    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 28.1 bits (62), Expect = 0.045
Identities = 37/167 (22%), Positives = 61/167 (36%), Gaps = 27/167 (16%)

Query: 3 VAVLGAAGGIGQALALLLKTQLPSGSELSLYDIAPVTPGVAVDLSHIPTAVKIKGFSGED 62
+ GAA GIG+A+A L G+ ++ D P V S A + F +
Sbjct: 11 AFITGAAQGIGEAVARTL---ASQGAHIAAVDYNP-EKLEKVVSSLKAEARHAEAFPADV 66

Query: 63 ATPA------------LEGADVVLISAGVARK------PGMDRSDLFNVNAGIVKNLVQQ 104
A + D+++ AGV R + F+VN+ V N +
Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRS 126

Query: 105 VAKTCPK----ACIGIITNPVNTT-VAIAAEVLKKAGVYDKNKLFGV 146
V+K + + + +NP ++AA KA K G+
Sbjct: 127 VSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGL 173


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03261  ARGREPRESSOR  169    4e-57    Bacterial arginine repressor signature.
		>ARGREPRESSOR#Bacterial arginine repressor signature.

Length = 149

Score = 169 bits (430), Expect = 4e-57
Identities = 44/141 (31%), Positives = 71/141 (50%), Gaps = 5/141 (3%)

Query: 15 KALLKEEKFSSQGEIVAALQEQGFDNINQSKVSRMLTKFGAVRTRNAKMEMVYCLPAELG 74
+ ++ + +Q E+V L++ G+ N+ Q+ VSR + + V+ Y LPA+
Sbjct: 11 REIITANEIETQDELVDILKKDGY-NVTQATVSRDIKELHLVKVPTNNGSYKYSLPADQR 69

Query: 75 VPTTSSPLKNLV---LDIDYNDAVVVIHTSPGAAQLIARLLDSLGKAEGILGTIAGDDTI 131
S ++L+ + ID ++V+ T PG AQ I L+D+L E I+GTI GDDTI
Sbjct: 70 FNPLSKLKRSLMDAFVKIDSASHLIVLKTMPGNAQAIGALMDNLDWEE-IMGTICGDDTI 128

Query: 132 FTTPANGFTVKDLYEAILELF 152
K + + ILEL
Sbjct: 129 LIICRTHDDTKVVQKKILELL 149


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03265  RTXTOXIND     53     4e-10    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 53.3 bits (128), Expect = 4e-10
Identities = 28/163 (17%), Positives = 59/163 (36%), Gaps = 16/163 (9%)

Query: 6 RKFSRTAITVVLVILAFIAIFNAWVYYTE----SPWTRDARFSADVVAIAPDVSGLITQV 61
SR V I+ F+ I + + S I P + ++ ++
Sbjct: 51 TPVSRRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEI 110

Query: 62 NVHDNQLVKKGQILFTIDQPR-------YQKALEEAQADVAYYQVLAQEKRQEAGRRNRL 114
V + + V+KG +L + Q +L +A+ + YQ+L++ E + L
Sbjct: 111 IVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRS--IELNKLPEL 168

Query: 115 GVQAMSREEIDQANNVL---QTVLHQLAKAQATRDLAKLDLER 154
+ + VL + Q + Q + +L+L++
Sbjct: 169 KLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDK 211



Score = 51.4 bits (123), Expect = 2e-09
Identities = 28/147 (19%), Positives = 54/147 (36%), Gaps = 15/147 (10%)

Query: 100 LAQEKRQEAGRRNRLGVQ-AMSREEIDQANNVLQT-VLHQLAKAQAT-------RDLAKL 150
E R + ++ + ++EE + + +L +L + +
Sbjct: 264 AVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEE 323

Query: 151 DLERTVIRAPADGWVTNLNVYT-GEFITRGSTAVALVKQNSFY-VLAYMEETKLEGVRPG 208
+ +VIRAP V L V+T G +T T + +V ++ V A ++ + + G
Sbjct: 324 RQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVG 383

Query: 209 YRAEIT----PLGSNKVLKGTVDSVAA 231
A I P L G V ++
Sbjct: 384 QNAIIKVEAFPYTRYGYLVGKVKNINL 410


96  CMJKDNLE_03286  CMJKDNLE_03291  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
CMJKDNLE_03286  -1      12       -1.324737   Fis DNA-binding transcriptional dual regulator
CMJKDNLE_03287  -2      11       -1.305804   DNA adenine methyltransferase
CMJKDNLE_03288  -2      13       -0.953609   AcrEF-TolC multidrug efflux transport system -
CMJKDNLE_03289  -1      15       -0.988626   AcrEF-TolC multidrug efflux transport system -
CMJKDNLE_03290  -1      17       -2.021222   hypothetical protein
CMJKDNLE_03291  1       17       -1.767758   putative outer membrane lipoprotein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03286  DNABINDNGFIS  157    3e-54    DNA-binding protein FIS signature.
		>DNABINDNGFIS#DNA-binding protein FIS signature.

Length = 98

Score = 157 bits (399), Expect = 3e-54
Identities = 98/98 (100%), Positives = 98/98 (100%)

Query: 1 MFEQRVNSDVLTVSTVNSQDQVTQKPLRDSVKQALKNYFAQLNGQDVNDLYELVLAEVEQ 60
MFEQRVNSDVLTVSTVNSQDQVTQKPLRDSVKQALKNYFAQLNGQDVNDLYELVLAEVEQ
Sbjct: 1 MFEQRVNSDVLTVSTVNSQDQVTQKPLRDSVKQALKNYFAQLNGQDVNDLYELVLAEVEQ 60

Query: 61 PLLDMVMQYTRGNQTRAALMMGINRGTLRKKLKKYGMN 98
PLLDMVMQYTRGNQTRAALMMGINRGTLRKKLKKYGMN
Sbjct: 61 PLLDMVMQYTRGNQTRAALMMGINRGTLRKKLKKYGMN 98


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03288  RTXTOXIND     41     3e-06    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 41.4 bits (97), Expect = 3e-06
Identities = 38/217 (17%), Positives = 70/217 (32%), Gaps = 38/217 (17%)

Query: 13 ATYQANYDSAKGELAKSEAAAAIAHLTVKRYVPLVGTKYISQQEYDQAIADA-RQADAAV 71
K +L + E+ A + Q + I D RQ +
Sbjct: 262 VEAVNELRVYKSQLEQIESEILSAKEEYQLV----------TQLFKNEILDKLRQTTDNI 311

Query: 72 IAAKATVESARINLAYTKVTAPISGRIGK-STVTEGALVTNGQTTELATVQQLDPIYVDV 130
+ + + AP+S ++ + TEG +VT +T + V + D + V
Sbjct: 312 GLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETL-MVIVPEDDTLEVTA 370

Query: 131 TQSSND--FMRLKQSVEQGNLHKENATSNVELVMENGQTYP-LKGTLQ--FSDVTVDEST 185
+ D F+ + Q+ +++ Y L G ++ D D+
Sbjct: 371 LVQNKDIGFINVGQNAI------------IKVEAFPYTRYGYLVGKVKNINLDAIEDQRL 418

Query: 186 GSIT--LRAV------FPNPQHTLLPGMFVRARIDEG 214
G + + ++ N L GM V A I G
Sbjct: 419 GLVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTG 455


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03289  ACRIFLAVINRP  1406   0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 1406 bits (3642), Expect = 0.0
Identities = 1034/1034 (100%), Positives = 1034/1034 (100%)

Query: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60
MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120
VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180
EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240
QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF
Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240

Query: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300
KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL
Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300

Query: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360
DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ
Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360

Query: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420
NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM
Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420

Query: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480
MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL
Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480

Query: 481 SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540
SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY
Sbjct: 481 SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540

Query: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600
LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN
Sbjct: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600

Query: 601 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDG 660
EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDG
Sbjct: 601 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDG 660

Query: 661 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL 720
FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL
Sbjct: 661 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL 720

Query: 721 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRM 780
EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRM
Sbjct: 721 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRM 780

Query: 781 LPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAM 840
LPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAM
Sbjct: 781 LPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAM 840

Query: 841 ALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSV 900
ALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSV
Sbjct: 841 ALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSV 900

Query: 901 MLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVV 960
MLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVV
Sbjct: 901 MLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVV 960

Query: 961 EATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFF 1020
EATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFF
Sbjct: 961 EATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFF 1020

Query: 1021 VPVFFVVIRRCFKG 1034
VPVFFVVIRRCFKG
Sbjct: 1021 VPVFFVVIRRCFKG 1034


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03291  adhesinb      28     0.004    Adhesin B signature.
		>adhesinb#Adhesin B signature.

Length = 310

Score = 27.5 bits (61), Expect = 0.004
Identities = 14/68 (20%), Positives = 26/68 (38%), Gaps = 10/68 (14%)

Query: 1 MKR---LIPVALLTALLAGCAHDSPCVPVYDDQGRLVHTNTCMKGTTQDNWETAGAIAGG 57
MK+ L+ + L LA C+ + +V TN+ + T++ IAG
Sbjct: 1 MKKCRFLVLLLLAFVGLAACSSQKSSTETGSSKLNVVATNSIIADITKN-------IAGD 53

Query: 58 AAAVAGLT 65
+ +
Sbjct: 54 KINLHSIV 61


97  CMJKDNLE_03345  CMJKDNLE_03361  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
CMJKDNLE_03345  1       21       -1.693260   putative protein secretion protein for export
CMJKDNLE_03346  1       19       -0.770824   putative protein secretion protein for export
CMJKDNLE_03347  2       23       -0.464292   putative protein secretion protein for export
CMJKDNLE_03348  2       24       -1.118496   putative protein secretion protein for export
CMJKDNLE_03349  3       24       -1.789055   putative protein secretion protein for export
CMJKDNLE_03350  4       24       -1.887673   putative protein secretion protein for export
CMJKDNLE_03351  3       24       -2.523336   putative protein secretion protein for export
CMJKDNLE_03352  3       22       -3.072266   putative protein secretion protein for export
CMJKDNLE_03353  2       23       -3.427299   putative protein secretion protein for export
CMJKDNLE_03354  0       20       -2.829255   putative protein secretion protein for export
CMJKDNLE_03355  -1      21       -2.472733   putative protein secretion protein
CMJKDNLE_03356  0       30       -1.888303   leader peptidase, integral membrane protein
CMJKDNLE_03357  1       35       -1.977893   bacterioferritin monomer
CMJKDNLE_03358  2       39       -1.226757   bacterioferritin-associated ferredoxin
CMJKDNLE_03359  2       39       -1.097332   endochitinase
CMJKDNLE_03360  5       51       -0.829549   elongation factor Tu
CMJKDNLE_03361  5       45       -0.660896   elongation factor G
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03345  BCTERIALGSPC  85     2e-21    Bacterial general secretion pathway protein C signa...
		>BCTERIALGSPC#Bacterial general secretion pathway protein C signature.

Length = 272

Score = 84.6 bits (209), Expect = 2e-21
Identities = 53/200 (26%), Positives = 95/200 (47%), Gaps = 15/200 (7%)

Query: 59 DFSLAALWRNENHAGVKDANPVAVNQETPKLSIALNGIVLTSNDETSFVLINEGSEQKRY 118
DF+L + +N AG DA N L+++L G++ +D S +I++ +EQ
Sbjct: 64 DFTLFGVSPEKNKAGALDA-SQMSNLPPSTLNLSLTGVMAGDDDSRSIAIISKDNEQFSR 122

Query: 119 SLNEALESAPGT--FIRKINKTSVVFETHGHYEKVTLH-------PGLP--DIIKQPDSE 167
+NE + PG I I VV + G YE + L+ G+P + +Q
Sbjct: 123 GVNEEV---PGYNAKIVSIRPDRVVLQYQGRYEVLGLYSQEDSGSDGVPGAQVNEQLQQR 179

Query: 168 SQNVLADYIIATPIRDGEQIYGLRLNPRKGLNAFTTSLLQPGDIALRINNLSLTHPDEVS 227
+ ++DY+ +PI + ++ G RLNP ++F LQ D+A+ +N L L ++
Sbjct: 180 ASTTMSDYVSFSPIMNDNKLQGYRLNPGPKSDSFYRVGLQDNDMAVALNGLDLRDAEQAK 239

Query: 228 QALSLLLTQQSAQFTIRRNG 247
+A+ + + T+ R+G
Sbjct: 240 KAMERMADVHNFTLTVERDG 259


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03346  BCTERIALGSPD  719    0.0      Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 719 bits (1856), Expect = 0.0
Identities = 348/630 (55%), Positives = 469/630 (74%), Gaps = 13/630 (2%)

Query: 7 ITCCLLAALLMPCAGHAENEQYGANFNNADIRQFVEIVGQHLGKTILIDPSVQGTISVRS 66
+T + AALL A E++ A+F DI++F+ V ++L KT++IDPSV+GTI+VRS
Sbjct: 12 LTLLIFAALLF---RPAAAEEFSASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTITVRS 68

Query: 67 NDTFSQQEYYQFFLSILDLYGYSVITLDNGFLKVVRSANVKTSPGMIADSSRPGVGDELV 126
D ++++YYQFFLS+LD+YG++VI ++NG LKVVRS + KT+ +A + PG+GDE+V
Sbjct: 69 YDMLNEEQYYQFFLSVLDVYGFAVINMNNGVLKVVRSKDAKTAAVPVASDAAPGIGDEVV 128

Query: 127 TRIVPLENVPARDLAPLLRQMMDAGSVGNVVHYEPSNVLILTGRASTINKLIEVIKRVDV 186
TR+VPL NV ARDLAPLLRQ+ D VG+VVHYEPSNVL++TGRA+ I +L+ +++RVD
Sbjct: 129 TRVVPLTNVAARDLAPLLRQLNDNAGVGSVVHYEPSNVLLMTGRAAVIKRLLTIVERVDN 188

Query: 187 IGTEKQQIIHLEYASAEDLAEILNQLISESHGKSQMPALLSAKIVADKRTNSLIISGPEK 246
G + L +ASA D+ +++ +L ++ KS +P + A +VAD+RTN++++SG
Sbjct: 189 AGDRSVVTVPLSWASAADVVKLVTEL-NKDTSKSALPGSMVANVVADERTNAVLVSGEPN 247

Query: 247 ARQRITSLLKSLDVEESEEGNTRVYYLKYAKATNLVEVLTGVSEKLKDEKGNARKPSSSG 306
+RQRI +++K LD +++ +GNT+V YLKYAKA++LVEVLTG+S ++ EK A+
Sbjct: 248 SRQRIIAMIKQLDRQQATQGNTKVIYLKYAKASDLVEVLTGISSTMQSEKQAAK---PVA 304

Query: 307 AMD-NVAITADEQTNSLVITADQSVQEKLATVIARLDIRRAQVLVEAIIVEVQDGNGLNL 365
A+D N+ I A QTN+L++TA V L VIA+LDIRR QVLVEAII EVQD +GLNL
Sbjct: 305 ALDKNIIIKAHGQTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNL 364

Query: 366 GVQWANKNVGAQQFTNTGLPIFNAAQGVADYKKNGGITSANPAWDMFSAYNGMAAGFFNG 425
G+QWANKN G QFTN+GLPI A G Y K+G ++S+ S++NG+AAGF+ G
Sbjct: 365 GIQWANKNAGMTQFTNSGLPISTAIAGANQYNKDGTVSSSLA--SALSSFNGIAAGFYQG 422

Query: 426 DWGVLLTALASNNKNDILATPSIVTLDNKLASFNVGQDVPVLSGSQTTSGDNVFNTVERK 485
+W +LLTAL+S+ KNDILATPSIVTLDN A+FNVGQ+VPVL+GSQTTSGDN+FNTVERK
Sbjct: 423 NWAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVPVLTGSQTTSGDNIFNTVERK 482

Query: 486 TVGTKLKVTPQVNEGDAVLLEIEQEVSSVD---SSSNSTLGPTFNTRTIQNAVLVKTGET 542
TVG KLKV PQ+NEGD+VLLEIEQEVSSV SS++S LG TFNTRT+ NAVLV +GET
Sbjct: 483 TVGIKLKVKPQINEGDSVLLEIEQEVSSVADAASSTSSDLGATFNTRTVNNAVLVGSGET 542

Query: 543 VVLGGLLDDFSKEQVSKVPLLGDIPLVGQLFRYTSTERAKRNLMVFIRPTIIRDDDVYRS 602
VV+GGLLD + KVPLLGDIP++G LFR TS + +KRNLM+FIRPT+IRD D YR
Sbjct: 543 VVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRNLMLFIRPTVIRDRDEYRQ 602

Query: 603 LSKEKYTRYRQEQQQRIDGKSKALVGSEDL 632
S +YT + Q ++ ++ + ++DL
Sbjct: 603 ASSGQYTAFNDAQSKQRGKENNDAMLNQDL 632


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03348  BCTERIALGSPF  515    0.0      Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 515 bits (1329), Expect = 0.0
Identities = 195/405 (48%), Positives = 282/405 (69%), Gaps = 8/405 (1%)

Query: 2 NYRYRAMTQDGQKLQGIIDANDERQARLRLREEGLFLLDIRPQK-------SSGVKTRRP 54
Y Y+A+ G+K +G +A+ RQAR LRE GL L + + S+G+ RR
Sbjct: 3 QYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLRRK 62

Query: 55 -RISHSELTLFTRQLATLSAAALPLEESLAVIGQQSSNKRLGDVLNQVRSAILEGHPLSD 113
R+S S+L L TRQLATL AA++PLEE+L + +QS L ++ VRS ++EGH L+D
Sbjct: 63 IRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSLAD 122

Query: 114 ALQHFPTLFDSLYRTLVKAGEKSGLLAPVLEKLADYNENRQKIRSKLIQSLIYPCMLTTV 173
A++ FP F+ LY +V AGE SG L VL +LADY E RQ++RS++ Q++IYPC+LT V
Sbjct: 123 AMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLTVV 182

Query: 174 AIGVVIILLTAVVPKITEQFVHMKQQLPLSTRILLGLSDTLQRIGPTLLATVFIVAVGFW 233
AI VV ILL+ VVPK+ EQF+HMKQ LPLSTR+L+G+SD ++ GP +L + + F
Sbjct: 183 AIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMAFR 242

Query: 234 LWLKRGNNRHRFHAMLLRVALIGPLICAINSARYLRTLSILQSSGVPLLDGMNLSTESLN 293
+ L++ R FH LL + LIG + +N+ARY RTLSIL +S VPLL M +S + ++
Sbjct: 243 VMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDVMS 302

Query: 294 NLEIRQRLANAAENVRQGNSIHLSLEQTAIFPPMMLYMVASGEKSGQLGTLMVRAADNQE 353
N R RL+ A + VR+G S+H +LEQTA+FPPMM +M+ASGE+SG+L +++ RAADNQ+
Sbjct: 303 NDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADNQD 362

Query: 354 TLQQNRIALTLSIFEPALIITMALIVLFIVVSVLQPLLQLNSMIN 398
+++ L L +FEP L+++MA +VLFIV+++LQP+LQLN++++
Sbjct: 363 REFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLMS 407


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03349  BCTERIALGSPG  250    3e-89    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 250 bits (639), Expect = 3e-89
Identities = 145/145 (100%), Positives = 145/145 (100%)

Query: 1 MRATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYK 60
MRATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYK
Sbjct: 1 MRATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYK 60

Query: 61 LDNHHYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDL 120
LDNHHYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDL
Sbjct: 61 LDNHHYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDL 120

Query: 121 LSAGPDGEMGTEDDITNWGLSKKKK 145
LSAGPDGEMGTEDDITNWGLSKKKK
Sbjct: 121 LSAGPDGEMGTEDDITNWGLSKKKK 145


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03350  BCTERIALGSPH  146    2e-47    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 146 bits (369), Expect = 2e-47
Identities = 51/156 (32%), Positives = 78/156 (50%), Gaps = 22/156 (14%)

Query: 3 QQRGFTLLEMMLVLALVAITASVVLFTY--GREDVASTRARETAARFTAALELAIDRATL 60
+QRGFTLLEMML+L L+ ++A +VL + R+D A+ +T ARF A L R
Sbjct: 2 RQRGFTLLEMMLILLLMGVSAGMVLLAFPASRDDSAA----QTLARFEAQLRFVQQRGLQ 57

Query: 61 SGQPVGIHFSDSAWRIMV----PGKTP-------SAWRWVPLQEDAADESQNDWDEELSI 109
+GQ G+ W+ +V G P S +RW+PL+ S + +L++
Sbjct: 58 TGQFFGVSVHPDRWQFLVLEARDGADPAPADDGWSGYRWLPLRAGRVATSGSIAGGKLNL 117

Query: 110 HL---QPFKPDDSNQPQVVILADGQITPFSLLMANA 142
+ + P D P V+I G++TPF L + A
Sbjct: 118 AFAQGEAWTPGD--NPDVLIFPGGEMTPFRLTLGEA 151


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03351  BCTERIALGSPG  30     0.001    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 30.2 bits (68), Expect = 0.001
Identities = 17/90 (18%), Positives = 41/90 (45%), Gaps = 8/90 (8%)

Query: 1 MNKQSGMTLLEVLLAMSIFTAVALTLMSSMQGQ--RNAIERMRNETLALWIADNQLQSQD 58
+KQ G TLLE+++ + I +A ++ ++ G + ++ ++ +AL A + + D
Sbjct: 4 TDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYK-LD 62

Query: 59 SFGEENTSSSGKELING-----EEWNWRSD 83
+ T+ + L+ N+ +
Sbjct: 63 NHHYPTTNQGLESLVEAPTLPPLAANYNKE 92


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03352  BCTERIALGSPH  33     3e-04    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 33.4 bits (76), Expect = 3e-04
Identities = 12/47 (25%), Positives = 25/47 (53%), Gaps = 2/47 (4%)

Query: 4 RQQGFTLLEVMAALAIFSMLSVLAFMIFSQASELHQRSQKEIQQFNQ 50
RQ+GFTLLE+M L + + + + + F + + + + + +F
Sbjct: 2 RQRGFTLLEMMLILLLMGVSAGMVLLAFPASRD--DSAAQTLARFEA 46


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03356  PREPILNPTASE  153    5e-48    Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 153 bits (388), Expect = 5e-48
Identities = 74/167 (44%), Positives = 96/167 (57%), Gaps = 18/167 (10%)

Query: 71 PFTPIVTGALFLY-----------------FCFVLTLSVIDFRTQLLPDKLTLPLLWLGL 113
P ++T L + ++ L+ ID LLPD+LTLPLLW GL
Sbjct: 111 PLVELLTALLSVAVAMTLAPGWGTLAALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGL 170

Query: 114 VFNAQYGLIDLHDAVYGAVAGYGVLWCVYWGVWLVCHKEGLGYGDFKLLAAAGAWCGWQT 173
+FN G + L DAV GA+AGY VLW +YW L+ KEG+GYGDFKLLAA GAW GWQ
Sbjct: 171 LFNLLGGFVSLGDAVIGAMAGYLVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQA 230

Query: 174 LPMILLIASLGGIGYAIVSQLLQRRTITT-IAFGPWLALGSMINLGY 219
LP++LL++SL G I LL+ + I FGP+LA+ I L +
Sbjct: 231 LPIVLLLSSLVGAFMGIGLILLRNHHQSKPIPFGPYLAIAGWIALLW 277


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03357  HELNAPAPROT   35     3e-05    Helicobacter neutrophil-activating protein A family ...
		>HELNAPAPROT#Helicobacter neutrophil-activating protein A family signature.

Length = 153

Score = 35.2 bits (81), Expect = 3e-05
Identities = 28/150 (18%), Positives = 59/150 (39%), Gaps = 24/150 (16%)

Query: 5 TKVINYLNKLLGNE---LVAINQYFLHARMFKNWGLKRLNDVEYHESIDEM-----KHAD 56
T V N LN L N ++++ +W +K + HE +E+ + D
Sbjct: 11 TLVENSLNTQLSNWFLLYSKLHRF--------HWYVKGPHFFTLHEKFEELYDHAAETVD 62

Query: 57 RYIERILFLEGLPN--LQDLGKL------NIGEDVEEMLRSDLALELDGAKNLREAIGYA 108
ER+L + G P +++ + EM+++ + + + IG A
Sbjct: 63 TIAERLLAIGGQPVATVKEYTEHASITDGGNETSASEMVQALVNDYKQISSESKFVIGLA 122

Query: 109 DSVHDYVSRDMMIEILRDEEGHIDWLETEL 138
+ D + D+ + ++ + E + L + L
Sbjct: 123 EENQDNATADLFVGLIEEVEKQVWMLSSYL 152


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03359  GPOSANCHOR    32     0.015    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 31.6 bits (71), Expect = 0.015
Identities = 14/60 (23%), Positives = 24/60 (40%)

Query: 181 ATEISETSNPQSCTSAPQPSPDVKPAPDVKPAPDVQPAPADKSNDNYAVVAWKGQEGSST 240
A + E + ++ ++ +PD KP P P K N N A + ++ ST
Sbjct: 449 AKQAEELAKLRAGKASDSQTPDAKPGNKAVPGKGQAPQAGTKPNQNKAPMKETKRQLPST 508


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03361  TCRTETOQM     613    0.0      Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 613 bits (1583), Expect = 0.0
Identities = 178/698 (25%), Positives = 304/698 (43%), Gaps = 81/698 (11%)

Query: 9 RYRNIGISAHIDAGKTTTTERILFYTGVNHKIGEVHDGAATMDWMEQEQERGITITSAAT 68
+ NIG+ AH+DAGKTT TE +L+ +G ++G V G D E++RGITI + T
Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61

Query: 69 TAFWSGMAKQYEPHRINIIDTPGHVDFTIEVERSMRVLDGAVMVYCAVGGVQPQSETVWR 128
+ W ++NIIDTPGH+DF EV RS+ VLDGA+++ A GVQ Q+ ++
Sbjct: 62 SFQWEN-------TKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFH 114

Query: 129 QANKYKVPRIAFVNKMDRMGANFLKVVNQIKTRLGANPVPLQLAIGAEEHFTGVVDLVKM 188
K +P I F+NK+D+ G + V IK +L A V Q V M
Sbjct: 115 ALRKMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQ----------KVELYPNM 164

Query: 189 KAINWNDADQGVTFEYEDIPADMVELANEWHQNLIESAAEASEELMEKYLGGEELTEAEI 248
N+ +++Q ++ E +++L+EKY+ G+ L E+
Sbjct: 165 CVTNFTESEQ------------------------WDTVIEGNDDLLEKYMSGKSLEALEL 200

Query: 249 KGALRQRVLNNEIILVTCGSAFKNKGVQAMLDAVIDYLPSPVDVPAINGILDDGKDTPAE 308
+ R N + V GSA N G+ +++ + + S
Sbjct: 201 EQEESIRFHNCSLFPVYHGSAKNNIGIDNLIEVITNKFYSSTH----------------- 243

Query: 309 RHASDDEPFSALAFKIATDPFVGNLTFFRVYSGVVNSGDTVLNSVKAARERFGRIVQMHA 368
FKI L + R+YSGV++ D+V S K + + +
Sbjct: 244 ---RGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEK-EKIKITEMYTSIN 299

Query: 369 NKREEIKEVRAGDIAAAIG----LKDVTTGDTLCDPDAPIILERMEFPEPVISIAVEPKT 424
+ +I + +G+I L V GDT P ER+E P P++ VEP
Sbjct: 300 GELCKIDKAYSGEIVILQNEFLKLNSV-LGDTKLLPQR----ERIENPLPLLQTTVEPSK 354

Query: 425 KADQEKMGLALGRLAKEDPSFRVWTDEESNQTIIAGMGELHLDIIVDRMKREFNVEANVG 484
+E + AL ++ DP R + D +++ I++ +G++ +++ ++ +++VE +
Sbjct: 355 PQQREMLLDALLEISDSDPLLRYYVDSATHEIILSFLGKVQMEVTCALLQEKYHVEIEIK 414

Query: 485 KPQVAYRETIRQKVTDVEGKHAKQSGGRGQYGHVVIDMYPLEPGSNPKGYEFINDIKGGV 544
+P V Y E +K E + + + + + PL GS G ++ + + G
Sbjct: 415 EPTVIYMERPLKK---AEYTIHIEVPPNPFWASIGLSVSPLPLGS---GMQYESSVSLGY 468

Query: 545 IPGEYIPAVDKGIQEQLKAGPLAGYPVVDMGIRLHFGSYHDVDSSELAFKLAASIAFKEG 604
+ + AV +GI+ + G L G+ V D I +G Y+ S+ F++ A I ++
Sbjct: 469 LNQSFQNAVMEGIRYGCEQG-LYGWNVTDCKICFKYGLYYSPVSTPADFRMLAPIVLEQV 527

Query: 605 FKKAKPVLLEPIMKVEVETPEENTGDVIGDLSRRRGMLKGQESEVTGVKIHAEVPLSEMF 664
KKA LLEP + ++ P+E D + + + + V + E+P +
Sbjct: 528 LKKAGTELLEPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQLKNNEVILSGEIPARCIQ 587

Query: 665 GYATQLRSLTKGRASYTMEFLKYDEAPSNVAQAVIEAR 702
Y + L T GR+ E Y + V + R
Sbjct: 588 EYRSDLTFFTNGRSVCLTELKGYHVT---TGEPVCQPR 622


98  CMJKDNLE_03367  CMJKDNLE_03377  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
CMJKDNLE_03367  1       14       0.074988   putative DNA-binding transcriptional regulator
CMJKDNLE_03368  0       15       1.507270   peptidyl-prolyl cis-trans isomerase; in protein
CMJKDNLE_03369  0       14       2.999835   host factor for lysis of phX174 infection
CMJKDNLE_03370  -1      14       2.942461   FKBP-type peptidyl prolyl cis-trans isomerase
CMJKDNLE_03371  -1      14       2.698527   hypothetical protein
CMJKDNLE_03372  -2      13       2.708780   K+ : H+ antiporter KefB
CMJKDNLE_03373  -1      16       2.412557   protein required for KefB activity
CMJKDNLE_03374  0       17       1.523872   fused predicted transporter subunits of ABC
CMJKDNLE_03375  -2      11       0.558755   putative hydrolase
CMJKDNLE_03376  -1      11       0.707820   hypothetical protein
CMJKDNLE_03377  -1      11       0.905972   putative phosphoribulokinase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03367  ACRIFLAVINRP  29     0.021    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 29.0 bits (65), Expect = 0.021
Identities = 14/62 (22%), Positives = 29/62 (46%), Gaps = 1/62 (1%)

Query: 160 ASSVEDLVTQTLEFTIEEVNADRNV-SNNAKNRQIVLNLYEKGIFDIKDAINQVADRLNI 218
A +V+D VTQ +E + ++ + S + + + L + D A QV ++L +
Sbjct: 54 AQTVQDTVTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQL 113

Query: 219 SK 220
+
Sbjct: 114 AT 115


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03368  INFPOTNTIATR  134    1e-40    Macrophage infectivity potentiator signature.
		>INFPOTNTIATR#Macrophage infectivity potentiator signature.

Length = 233

Score = 134 bits (339), Expect = 1e-40
Identities = 80/226 (35%), Positives = 125/226 (55%), Gaps = 9/226 (3%)

Query: 28 AAKPATAADSKAAFKNDDQKSAYALGASLGRYMENSLKEQEKLGIKLDKDQLIAGVQDAF 87
A A AA + D K +Y++GA LG K + GI ++ D L G+QD
Sbjct: 14 AMSTAMAATDATSLTTDKDKLSYSIGADLG-------KNFKNQGIDINPDVLAKGMQDGM 66

Query: 88 A-DKSKLSDQEIEQTLQAFEARVKSSAQAKMEKDAADNEAKGKEYREKFAKEKGVKTSST 146
+ + L++++++ L F+ + + A+ K A +N+AKG + + G+ +
Sbjct: 67 SGAQLILTEEQMKDVLSKFQKDLMAKRSAEFNKKAEENKAKGDAFLSANKSKPGIVVLPS 126

Query: 147 GLVYQVVEAGKGEAPKDSDTVVVNYKGTLIDGKEFDNSYTRGEPLSFRLDGVIPGWTEGL 206
GL Y++++AG G P SDTV V Y GTLIDG FD++ G+P +F++ VIPGWTE L
Sbjct: 127 GLQYKIIDAGTGAKPGKSDTVTVEYTGTLIDGTVFDSTEKAGKPATFQVSQVIPGWTEAL 186

Query: 207 KNIKKGGKIKLVIPPELAYGKAGVPG-IPPNSTLVFDVELLDVKPA 251
+ + G ++ +P +LAYG V G I PN TL+F + L+ VK A
Sbjct: 187 QLMPAGSTWEVFVPADLAYGPRSVGGPIGPNETLIFKIHLISVKKA 232


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03372  60KDINNERMP   31     0.021    60kDa inner membrane protein signature.
		>60KDINNERMP#60kDa inner membrane protein signature.

Length = 548

Score = 30.7 bits (69), Expect = 0.021
Identities = 13/69 (18%), Positives = 29/69 (42%), Gaps = 6/69 (8%)

Query: 261 TAIDPFKGLLLG---LFFISVGMSLNLGVLYTHL-LWVVISVVVLVAVKILVLYLLARLY 316
A+ P L + L+FIS + L +++ + W +++ V+ ++ L
Sbjct: 318 AAVAPHLDLTVDYGWLWFISQPLFKLLKWIHSFVGNWGFSIIIITFIVRGIMYPLTKA-- 375

Query: 317 GVRSSERMQ 325
S +M+
Sbjct: 376 QYTSMAKMR 384


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03373  ISCHRISMTASE  32     0.001    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 31.9 bits (72), Expect = 0.001
Identities = 32/135 (23%), Positives = 51/135 (37%), Gaps = 16/135 (11%)

Query: 11 YAHPESQDSVANRVLLKPATQLSNVTVHDLYAHYPDFFIDIPREQALLREHEVIVFQH-- 68
Y P + D N+V P + + +HD+ ++ D F L + +
Sbjct: 9 YQMPTASDMPQNKVSWVPDPNRAVLLIHDMQNYFVDAFTAGASPVTELSANIRKLKNQCV 68

Query: 69 ----PLYTYSCPALLKEWLDRVLSRGFASGPGGNQLAGKYWRSVITTGEPESA------Y 118
P+ + P DR L F GPG N +G Y +IT PE +
Sbjct: 69 QLGIPVVYTAQPGSQNP-DDRALLTDFW-GPGLN--SGPYEEKIITELAPEDDDLVLTKW 124

Query: 119 RYDALNRYPMSDVLR 133
RY A R + +++R
Sbjct: 125 RYSAFKRTNLLEMMR 139


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03374  GPOSANCHOR    33     0.005    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 32.7 bits (74), Expect = 0.005
Identities = 28/152 (18%), Positives = 54/152 (35%), Gaps = 22/152 (14%)

Query: 504 KVEPFDGDLEDYQQWLSDVQKQENQTDEAPKENANSAQARKDQKRREAELRAQTQPLRKE 563
+ D + ++ E + + ++ R+ +R R + L E
Sbjct: 272 AMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAE 331

Query: 564 IARLEKEME---------------------KLNAQLAQAEEKLGDSELYDQSRKAELTAC 602
+LE++ + +L A+ + EE+ SE QS + +L A
Sbjct: 332 HQKLEEQNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDAS 391

Query: 603 LQQQASAKSGLEECEMAWLEAQEQLEQMLLEG 634
+ + + LEE L A E+L + L E
Sbjct: 392 REAKKQVEKALEEANSK-LAALEKLNKELEES 422



Score = 32.0 bits (72), Expect = 0.008
Identities = 13/125 (10%), Positives = 39/125 (31%), Gaps = 7/125 (5%)

Query: 513 EDYQQWLSDVQKQENQTDEAPKENANSAQARKDQKRREAELRAQTQPLRKEIARLEKEME 572
+ + ++ + E A A + D ++ + +++
Sbjct: 127 KALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFST-------ADSAKIK 179

Query: 573 KLNAQLAQAEEKLGDSELYDQSRKAELTACLQQQASAKSGLEECEMAWLEAQEQLEQMLL 632
L A+ A E + + E + TA + + ++ + ++ LE +
Sbjct: 180 TLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMN 239

Query: 633 EGQSN 637
++
Sbjct: 240 FSTAD 244


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03377  PF07299       32     0.002    Fibronectin-binding protein (FBP)
		>PF07299#Fibronectin-binding protein (FBP)

Length = 219

Score = 31.8 bits (72), Expect = 0.002
Identities = 10/46 (21%), Positives = 21/46 (45%), Gaps = 2/46 (4%)

Query: 71 PEANDFGLLEQTFIEYGQSGKGKSRKYLHTYDEAVPWNQVPGTFTP 116
P+ + + E ++ KG SRK++ ++ + + GTF
Sbjct: 112 PDMEELDMKELSY--LSWIDKGSSRKFIIAKNDKNKFVGLQGTFQS 155


99  CMJKDNLE_03466  CMJKDNLE_03473  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
CMJKDNLE_03466  -1      20       3.464530   Gamma-glutamyltranspeptidase
CMJKDNLE_03467  -1      23       3.752013   hypothetical protein
CMJKDNLE_03468  -1      23       3.366757   hypothetical protein
CMJKDNLE_03469  -1      25       3.786778   glycerophosphodiester phosphodiesterase,
CMJKDNLE_03470  -1      25       3.601772   glycerol-3-phosphate / glycerol-2-phosphate ABC
CMJKDNLE_03471  -1      26       3.413959   glycerol-3-phosphate / glycerol-2-phosphate ABC
CMJKDNLE_03472  -2      26       3.688668   glycerol-3-phosphate / glycerol-2-phosphate ABC
CMJKDNLE_03473  -2      24       3.387166   glycerol-3-phosphate / glycerol-2-phosphate ABC
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03466  NAFLGMOTY     32     0.007    Sodium-type flagellar protein MotY precursor signature.
		>NAFLGMOTY#Sodium-type flagellar protein MotY precursor signature.

Length = 293

Score = 31.6 bits (71), Expect = 0.007
Identities = 27/82 (32%), Positives = 37/82 (45%), Gaps = 17/82 (20%)

Query: 275 RTPISGDYRGYQVYSMPPPSSGGIHIVQILNI--LENFDMKKYGF-GSADAMQIMAEAEK 331
R P+ G+ R + SMPPP G H +I N+ + FD G+ G A I++E EK
Sbjct: 77 RRPM-GETRNVSLISMPPPWRPGEHADRITNLKFFKQFD----GYVGGQTAWGILSELEK 131

Query: 332 YAYADRSEYLGDPDFVKVPWQA 353
Y P F WQ+
Sbjct: 132 GRY---------PTFSYQDWQS 144


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03469  PF04619       30     0.008    Dr-family adhesin
		>PF04619#Dr-family adhesin

Length = 160

Score = 29.5 bits (66), Expect = 0.008
Identities = 12/65 (18%), Positives = 23/65 (35%), Gaps = 4/65 (6%)

Query: 29 VGAKYGHKMIEFDAKLSKDGEIFLLHDDNLERTSNGWGVAGELNWQD----LLRVDAGSW 84
+G ++ D + G+ FL+ D+N ++ W + D GSW
Sbjct: 70 LGCDARQVALKADTDNFEQGKFFLISDNNRDKLYVNIRPTDNSAWTTDNGVFYKNDVGSW 129

Query: 85 YSKMF 89
+
Sbjct: 130 GGIIG 134


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03470  PF05272       32     0.003    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 32.4 bits (73), Expect = 0.003
Identities = 13/43 (30%), Positives = 20/43 (46%), Gaps = 7/43 (16%)

Query: 33 IVMVGPSGCGKSTLLRMVAGLERVTEGDIWINDQRVTEMEPKD 75
+V+ G G GKSTL+ + GL+ + +D KD
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLD-------FFSDTHFDIGTGKD 634


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03473  MALTOSEBP     39     2e-05    Maltose binding protein signature.
		>MALTOSEBP#Maltose binding protein signature.

Length = 396

Score = 39.3 bits (91), Expect = 2e-05
Identities = 39/160 (24%), Positives = 66/160 (41%), Gaps = 14/160 (8%)

Query: 134 GHLLSQPFNSSTPVLYYNKDAFKKAGLDPEQPPKTWQDLADYAAKLKASGMKCGYASGWQ 193
G L++ P L YNKD PPKTW+++ +LKA G + +
Sbjct: 127 GKLIAYPIAVEALSLIYNKDLLP-------NPPKTWEEIPALDKELKAKGKSALMFNLQE 179

Query: 194 GWIQLENFSAWNGLPFASKNNGFDGTDAVLEF--NKPEQVKHIAMLEEMNKKGDFSYVGR 251
+ +A G F +N +D D ++ K + +++ + D Y
Sbjct: 180 PYFTWPLIAADGGYAFKYENGKYDIKDVGVDNAGAKAGLTFLVDLIKNKHMNADTDY--- 236

Query: 252 KDESTEKFYNGDCAMTTASSGSLANIREYAKFNYGVGMMP 291
+ F G+ AMT + +NI + +K NYGV ++P
Sbjct: 237 -SIAEAAFNKGETAMTINGPWAWSNI-DTSKVNYGVTVLP 274


100  CMJKDNLE_03506  CMJKDNLE_03512  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
CMJKDNLE_03506  1       17       -3.534854  putative transporter subunit: membrane component
CMJKDNLE_03507  -1      17       -3.424363  ribosome-associated ATPase
CMJKDNLE_03508  -1      21       -5.195822  putative HlyD family secretion protein
CMJKDNLE_03509  1       24       -6.862304  hypothetical protein
CMJKDNLE_03510  0       17       -4.577184  hypothetical protein
CMJKDNLE_03511  0       11       -0.169609  inner membrane protein with a role in acid
CMJKDNLE_03512  0       14       1.554590   putative oxidoreductase with FAD/NAD(P)-binding
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03506  ABC2TRNSPORT  50     5e-09    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 49.9 bits (119), Expect = 5e-09
Identities = 41/171 (23%), Positives = 73/171 (42%), Gaps = 7/171 (4%)

Query: 200 REREHGTVEHLLVMPITPFEIMMAKI-WSMGLVVLVVSGLSLVLMVKGVLGVPIEGSIPL 258
R T E +L + +I++ ++ W+ L +G+ +V G + L
Sbjct: 93 RMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGAGIGVVAAALGY----TQWLSLL 148

Query: 259 FMLGV-ALSLFATTSIGIFMGTIARSMPQLGLLVILVLLPLQMLSGGSTPRESMPQMVQD 317
+ L V AL+ A S+G+ + +A S LV+ P+ LSG P + +P + Q
Sbjct: 149 YALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPIVFQT 208

Query: 318 IMLTMPTTHFVSLAQAILYRGAGFEIVWPQFLTLMAIGGAFF-TIALLRFR 367
+P +H + L + I+ ++ + I FF + ALLR R
Sbjct: 209 AARFLPLSHSIDLIRPIMLGHPVVDVCQHVGALCIYIVIPFFLSTALLRRR 259


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03507  PF05272       30     0.045    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.4 bits (68), Expect = 0.045
Identities = 9/26 (34%), Positives = 14/26 (53%)

Query: 37 ARCMVGLIGPDGVGKSSLLSLISGAR 62
V L G G+GKS+L++ + G
Sbjct: 595 FDYSVVLEGTGGIGKSTLINTLVGLD 620


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03508  RTXTOXIND     84     5e-20    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 84.1 bits (208), Expect = 5e-20
Identities = 73/409 (17%), Positives = 142/409 (34%), Gaps = 83/409 (20%)

Query: 6 RHLAWWVVGLLAVAAIVVWWLLRPAGVPEGFAVSNGRI--EATEVDIASKIAGRIDTILV 63
R +A++++G L +A + +L E A +NG++ +I + I+V
Sbjct: 58 RLVAYFIMGFLVIA--FILSVLGQV---EIVATANGKLTHSGRSKEIKPIENSIVKEIIV 112

Query: 64 KEGQFVREGEVLAKMDTRV----------------LQEQRLEAI---------------- 91
KEG+ VR+G+VL K+ L++ R + +
Sbjct: 113 KEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPD 172

Query: 92 --------------------AQIKEAQSAVAAAQALLEQRQSETRAAQSLVNQRQAELDS 131
Q Q+ + L+++++E + +N+ +
Sbjct: 173 EPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRV 232

Query: 132 VAKRHTRSRSLAQRGAISAQQLDDDRAAAESARAALESAKAQVSASKAAIEAARTNIIQ- 190
R SL + AI+ + + A L K+Q+ ++ I +A+
Sbjct: 233 EKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLV 292

Query: 191 ------------AQTRVEAAQATERRIAADID--DSELKAPRDGRV-QYRVAEPGEVLAA 235
QT T + S ++AP +V Q +V G V+
Sbjct: 293 TQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTT 352

Query: 236 GGRVLNMVDLSDVY-MTFFLPTEQAGTLKLGGEARLILDAAPDLRIPATISFVASVAQFT 294
++ +V D +T + + G + +G A + ++A P R V V
Sbjct: 353 AETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYG---YLVGKVKNIN 409

Query: 295 PKTVETSDERLKLMFRVKARIPPELLQQHLEYV--KTGLPGVAWVRVNE 341
+E D+RL L+F V I L + + +G+ A ++
Sbjct: 410 LDAIE--DQRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGM 456


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03512  ALARACEMASE   29     0.033    Alanine racemase signature.
		>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 29.0 bits (65), Expect = 0.033
Identities = 23/98 (23%), Positives = 38/98 (38%), Gaps = 18/98 (18%)

Query: 226 ENLLFTHRGLSGPAVLQISSYWQPGEFVSINLLPDVDLETFL--NEQRNAHPNQSLKNTL 283
E + RG GP +L + ++ + + + L T + N Q A N LK L
Sbjct: 63 EAITLRERGWKGP-ILMLEGFFHAQD---LEIYDQHRLTTCVHSNWQLKALQNARLKAPL 118

Query: 284 AVHL------------PKRLVERLQQLGQIPDVSLKQL 309
++L P R++ QQL + +V L
Sbjct: 119 DIYLKVNSGMNRLGFQPDRVLTVWQQLRAMANVGEMTL 156


101  CMJKDNLE_03572  CMJKDNLE_03577  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
CMJKDNLE_03572  0       11       1.126621   YhjX MFS transporter
CMJKDNLE_03573  1       12       1.340100   putative lipase
CMJKDNLE_03574  1       10       1.368416   3-methyl-adenine DNA glycosylase I,
CMJKDNLE_03575  1       11       1.229021   putative acyltransferase with acyl-CoA
CMJKDNLE_03576  0       11       1.116402   biotin sulfoxide reductase
CMJKDNLE_03577  0       15       -0.863196  putative outer membrane lipoprotein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03572  TCRTETA       43     2e-06    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 42.9 bits (101), Expect = 2e-06
Identities = 48/275 (17%), Positives = 94/275 (34%), Gaps = 32/275 (11%)

Query: 44 PVSQVAFSFGLLSLGLAIS----SSVAGKLQERFGVKRVTMASGILLGLGFFLTAHSDNL 99
+ V +G+L A+ + V G L +RFG + V + S + + + A + L
Sbjct: 37 HSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFL 96

Query: 100 MMLWLS---AGVLVGLADGAGYLL----TLSNCVKWFPERKGLISAFAIGSYGLGSLGFK 152
+L++ AG+ AG + + F + LG
Sbjct: 97 WVLYIGRIVAGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGG---- 152

Query: 153 FIDTQLLETVGLEKTFVIWGAIALLMIVFGATLMKDAPKQEVKTSNGVVEKDYTLAESMR 212
L+ F A+ L + G L+ ++ K E + R
Sbjct: 153 -----LMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWAR 207

Query: 213 --KPQYWMLAVMFLTACMSG----LYVIGVAKDIAQSLAHLDVVSAANAVTVISIAN-LS 265
++AV F+ + L+VI + H D + ++ I + L+
Sbjct: 208 GMTVVAALMAVFFIMQLVGQVPAALWVI-----FGEDRFHWDATTIGISLAAFGILHSLA 262

Query: 266 GRLVLGILSDKIARIRVITIGQVISLVGMAALLFA 300
++ G ++ ++ R + +G + G L FA
Sbjct: 263 QAMITGPVAARLGERRALMLGMIADGTGYILLAFA 297



Score = 36.3 bits (84), Expect = 2e-04
Identities = 37/155 (23%), Positives = 64/155 (41%), Gaps = 9/155 (5%)

Query: 241 AQSLAHLDVVSAANAVTVISIANLSGRLVLGILSDKIARIRVITIGQVISLVGMAALLFA 300
AH ++ A A+ + A + G L SD+ R V+ + + V A + A
Sbjct: 39 NDVTAHYGILLALYALMQFACAPVLGAL-----SDRFGRRPVLLVSLAGAAVDYAIMATA 93

Query: 301 PLNAVTFFAAIACVAFNFGGTITVFPSLVSEFFGLNNLAKNYGVIYLGFGIGSICGSIIA 360
P V + I VA G T V + +++ + A+++G + FG G + G ++
Sbjct: 94 PFLWVLYIGRI--VAGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLG 151

Query: 361 SLFGGF--YVTFYVIFALLILSLALSTTIRQPEQK 393
L GGF + F+ AL L+ + K
Sbjct: 152 GLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHK 186


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03573  ECOLNEIPORIN  27     0.048    E.coli/Neisseria porin superfamily signature.
		>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature.

Length = 331

Score = 27.5 bits (61), Expect = 0.048
Identities = 19/90 (21%), Positives = 37/90 (41%), Gaps = 13/90 (14%)

Query: 119 SMYNEFGDSTTTLTDPLWHASVSTLGWRVDSRLGDLRPWAQISYNQQFGENIWKAQSGLS 178
S+ + D+ + H S + + + R G++ P ++SY F +
Sbjct: 228 SVAVQQQDAKLV-EENYSHNSQTEVAATLAYRFGNVTP--RVSYAHGFKGSF-------- 276

Query: 179 RMTATNQNGNWLDVTVGADMLLNQNIAAYA 208
ATN N ++ V VGA+ ++ +A
Sbjct: 277 --DATNYNNDYDQVVVGAEYDFSKRTSALV 304


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03575  SACTRNSFRASE  36     2e-05    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 36.1 bits (83), Expect = 2e-05
Identities = 17/52 (32%), Positives = 26/52 (50%), Gaps = 5/52 (9%)

Query: 76 VAPKAVRRGIGKALM----QYVQQRHP-HLMLEVYQKNQPAINFYQAQGFHI 122
VA ++G+G AL+ ++ ++ H LMLE N A +FY F I
Sbjct: 97 VAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFII 148


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03577  OMPADOMAIN    113    2e-32    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 113 bits (285), Expect = 2e-32
Identities = 41/122 (33%), Positives = 62/122 (50%), Gaps = 11/122 (9%)

Query: 108 LNMPNNVTFDSSSATLKPAGANTLTGVAMVLKEY--PKTAVNVIGYTDSTGGHDLNMRLS 165
+ ++V F+ + ATLKP G L + L +V V+GYTD G N LS
Sbjct: 215 FTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGLS 274

Query: 166 QQRADSVASALITQGVDASRIRTQGLGPANPIASNSTAEGK---------AQNRRVEITL 216
++RA SV LI++G+ A +I +G+G +NP+ N+ K A +RRVEI +
Sbjct: 275 ERRAQSVVDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIEV 334

Query: 217 SP 218

Sbjct: 335 KG 336


102  CMJKDNLE_03694  CMJKDNLE_03706  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
CMJKDNLE_03694  0       12       0.925583   purine ribonucleoside efflux transporter
CMJKDNLE_03695  -1      12       1.742937   hypothetical protein
CMJKDNLE_03696  -1      12       2.184970   putative membrane protein with possible
CMJKDNLE_03697  -1      14       3.070370   cryptic adenine deaminase monomer
CMJKDNLE_03698  0       17       3.496046   hexose-6-phosphate:phosphate antiporter
CMJKDNLE_03699  1       17       3.923010   glycerol-3-phosphate:phosphate antiporter
CMJKDNLE_03700  1       18       4.416139   Signal transduction histidine-protein
CMJKDNLE_03701  1       18       3.842917   FimZ transcriptional regulator
CMJKDNLE_03702  2       17       2.922644   acetohydroxybutanoate synthase / acetolactate
CMJKDNLE_03703  1       15       3.132186   acetohydroxybutanoate synthase / acetolactate
CMJKDNLE_03705  0       16       1.325753   toxic peptide TisB
CMJKDNLE_03706  -1      17       1.130730   multidrug efflux transporter EmrD
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03694  TCRTETA       38     4e-05    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 38.3 bits (89), Expect = 4e-05
Identities = 33/208 (15%), Positives = 71/208 (34%), Gaps = 13/208 (6%)

Query: 49 IIVEFLPVSLLTP----MAQDLGISEGVA---GQSVTVTAFVAMFASLFITQTIQATDRR 101
+ ++ + + L+ P + +DL S V G + + A + + + RR
Sbjct: 14 VALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRR 73

Query: 102 YVVILFAVLLTLSCLLVSFANSFSLLLIGRACLGLALGGFWAMSASLTMRLVPPRTVPKA 161
V+++ + +++ A +L IGR G+ G A++ + + +
Sbjct: 74 PVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGIT-GATGAVAGAYIADITDGDERARH 132

Query: 162 LSVIFGAVSIALVIAAPLGSFLGELIGWRNVFNAAAVMG----VLCIFWIIKSLPSLPGE 217
+ +V LG +G F AAA + + F + +S
Sbjct: 133 FGFMSACFGFGMVAGPVLGGLMGG-FSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRP 191

Query: 218 PSHQKQNTFRLLQRPGVMAGMIAIFMSF 245
+ N + M + A+ F
Sbjct: 192 LRREALNPLASFRWARGMTVVAALMAVF 219


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03697  UREASE        38     9e-05    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 38.2 bits (89), Expect = 9e-05
Identities = 28/105 (26%), Positives = 41/105 (39%), Gaps = 17/105 (16%)

Query: 22 AVSRGDAVADYIIDNVSILDLINGGEISGPIVIKGRYIAGVG----------AEYTDAPA 71
V+R D +I N ILD + G + I +K IA +G P
Sbjct: 60 QVTREGGAVDTVITNALILD--HWGIVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPG 117

Query: 72 LQRIDARGATAVPGFIDAHLHIESSMMTPVTFETATLPRGLTTVI 116
+ I G G +D+H+H + P E A L GLT ++
Sbjct: 118 TEVIAGEGKIVTAGGMDSHIH----FICPQQIEEA-LMSGLTCML 157


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03698  TCRTETB       34     0.001    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 34.1 bits (78), Expect = 0.001
Identities = 28/168 (16%), Positives = 61/168 (36%), Gaps = 17/168 (10%)

Query: 49 FNIAQNDMISTYGLSMTQLGMIGLGFSITYGVGKTLVSYYADGKNTKQFLPFMLILSAIC 108
N++ D+ + + + F +T+ +G + +D K+ L F +I++ C
Sbjct: 33 LNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIIN--C 90

Query: 109 MLGFSASMGSGSVSLFLMIAFYALSGFFQSTGGSCSYSTI----TKWTPRRKRGTFLGFW 164
+G SL +M + F Q G + + + ++ P+ RG G
Sbjct: 91 FGSVIGFVGHSFFSLLIM------ARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLI 144

Query: 165 NISHNLGGAGAAGVALFGANYLFDGHVIGMFIFPSIIALIVGFIGLRY 212
+G + A+Y+ + + + P + I+ L
Sbjct: 145 GSIVAMGEGVGPAIGGMIAHYIHWSY---LLLIP--MITIITVPFLMK 187


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_03699  TCRTETB       41     9e-06    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 40.6 bits (95), Expect = 9e-06
Identities = 65/408 (15%), Positives = 137/408 (33%), Gaps = 60/408 (14%)

Query: 29 RHILLTIWLGYALFY--FTRKSFNAAVPEILANGVLSRSDIGLLATLFYITYGVSKFVSG 86
RH + IWL F+ N ++P+I + + + T F +T+ + V G
Sbjct: 11 RHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYG 70

Query: 87 IVSDRSNARYFMGIGLIATGIINILFGFSTSLWAFAVLWVLNAFFQGWGS---PVCARLL 143
+SD+ + + G+I +++ S F L ++ F QG G+ P ++
Sbjct: 71 KLSDQLGIKRLLLFGIIINCFGSVIGFVGHS---FFSLLIMARFIQGAGAAAFPALVMVV 127

Query: 144 TAWY-SRTERGGWWALWNTAHNVGGALIPIVMAAAALHYGWRAGMMIAGCMAIVVGIFLC 202
A Y + RG + L + +G + P + A + W ++ M ++ +
Sbjct: 128 VARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHW--SYLLLIPMITIITVPF- 184

Query: 203 WRLRDRPQALGLPAVGEWRHDALEIAQQQEGAGLTRKEILTKYVLLNPYIWLLSFCYVLV 262
+ L +I G L I+ + Y VL
Sbjct: 185 --------LMKLLKKEVRIKGHFDIK----GIILMSVGIVFFMLFTTSYSISFLIVSVLS 232

Query: 263 YVV-----RAAINDWGNLYMSETLGVDLVTANTAVTMFELGGFI-----------GALVA 306
+++ R + + + + + + + + + GF+ A
Sbjct: 233 FLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTA 292

Query: 307 GWGSDKLFNGNRGPMNLIFAAGILL-SVGSLWLMPFASYVMQATCFFTIGFFVFGPQMLI 365
GS +F G + + GIL+ G L+++ + + F T F + +
Sbjct: 293 EIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFL-SVSFLTASFLLETTSWFM 351

Query: 366 ---------GMAAAECS---------HKEAAGAATGFVGLFAYLGASL 395
G++ + ++ AGA + ++L
Sbjct: 352 TIIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGT 399


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
CMJKDNLE_03700  PF06580  40     2e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 39.8 bits (93), Expect = 2e-05
Identities = 28/142 (19%), Positives = 56/142 (39%), Gaps = 11/142 (7%)

Query: 365 LRPRQLDDLTLEQAIRSLMREMELEGRGIVSHLEWRIDESALSENQRVTLFRVCQEGLNN 424
LR ++L + + ++L L++ + + +V + Q + N
Sbjct: 208 LRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPM-LVQTLVEN 266

Query: 425 IVKHA-----DASAVTLQGWQQDERLMLVIEDDGSGLPPGSGQ-QGFGLTGMRERVTALG 478
+KH + L+G + + + L +E+ GS + + G GL +RER+ L
Sbjct: 267 GIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGTGLQNVRERLQMLY 326

Query: 479 G---TLHISCLHG-TRVSVSLP 496
G + +S G V +P
Sbjct: 327 GTEAQIKLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
CMJKDNLE_03701  HTHFIS  61     2e-13    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 61.4 bits (149), Expect = 2e-13
Identities = 29/174 (16%), Positives = 59/174 (33%), Gaps = 20/174 (11%)

Query: 2 ITVALIDDHLIVRSGFAQLLGLEPDLQVVAEFGSGREALAGLPGRGVQVCICDISMPDIS 61
T+ + DD +R+ Q L V + + + + D+ MPD +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRA-GYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 62 GLELLSQLPK---GMATIMLSVHDSPALVEQALNAGARGFLSKRCSPDELIAAVHTVATG 118
+LL ++ K + +++S ++ +A GA +L K ELI +
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR---- 117

Query: 119 GCYLTPDIAIKLASGRQDPLTKRERQVAEKLAQG---MAVKEIAAELGLSPKTV 169
A+ R L + + + + + A L + T+
Sbjct: 118 --------ALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTL 163


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
CMJKDNLE_03706  TCRTETB  60     6e-12    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 59.9 bits (145), Expect = 6e-12
Identities = 41/184 (22%), Positives = 81/184 (44%), Gaps = 1/184 (0%)

Query: 5 RNVNLLLMLVLLVAVGQMAQTIYIPAIADMARDLNVREGAVQSVMGAYLLTYGVSQLFYG 64
R+ +L+ L +L + + + ++ D+A D N + V A++LT+ + YG
Sbjct: 11 RHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYG 70

Query: 65 PISDRVGRRPVILVGMSIFMLATLVA-VTTSSLTVLIAASAMQGMGTGVGGVMARTLPRD 123
+SD++G + ++L G+ I +++ V S ++LI A +QG G + +
Sbjct: 71 KLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVAR 130

Query: 124 LYERTQLRHANSLLNMGILVSPLLAPLIGGLLDTMWNWRACYLFLLVLCAGVTFSMARWM 183
+ A L+ + + + P IGG++ +W L ++ V F M
Sbjct: 131 YIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLK 190

Query: 184 PETR 187
E R
Sbjct: 191 KEVR 194


S.No  Start           End             Bias  Virulence  Insertion elements  Prediction
103   CMJKDNLE_03893  CMJKDNLE_03903  N     Y          N                   Pathogenicity Island (unbiased-composition)

LocusTag        DNBias  CDNBias  %GCBias    Product
CMJKDNLE_03893   0      17        2.358953  GAP-like protein that activates GTPase activity
CMJKDNLE_03894   2      23        2.287602  coproporphyrinogen III dehydrogenase
CMJKDNLE_03896   2      22        1.955256  small predicted membrane protein
CMJKDNLE_03897   2      18        0.673826  HyfR DNA-binding transcriptional activator
CMJKDNLE_03898   1      18       -1.497897  NtrB
CMJKDNLE_03899   3      18       -3.226599  glutamate-putrescine ligase
CMJKDNLE_03900   2      14       -4.841352  protein possibly involved in ribosome structure
CMJKDNLE_03901   0      13       -5.506309  YihL putative transcriptional regulator
CMJKDNLE_03902   0      10       -3.888457  hypothetical protein
CMJKDNLE_03903   1      10       -3.344793  YihN MFS transporter
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits  Score  E-value  Comments
CMJKDNLE_03893  SECA  30     0.004    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 30.2 bits (68), Expect = 0.004
Identities = 11/71 (15%), Positives = 30/71 (42%)

Query: 14 AKARRKTREELDQEARDRKRQKKRRGHAPGSRAAGGNTTSGSKGQNAPKDPRIGSKTPIP 73
+K + + EE+++ + R+ + +R ++ + + + ++G P P
Sbjct: 827 SKVQVRMPEEVEELEQQRRMEAERLAQMQQLSHQDDDSAAAAALAAQTGERKVGRNDPCP 886

Query: 74 LGVTEKVTKQH 84
G +K + H
Sbjct: 887 CGSGKKYKQCH 897


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
CMJKDNLE_03897  HTHFIS  602    0.0      FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 602 bits (1553), Expect = 0.0
Identities = 206/478 (43%), Positives = 300/478 (62%), Gaps = 11/478 (2%)

Query: 1 MQRGIVWVVDDDSSIRWVLERALAGAGLTCTTFENGAEVLEALASKTPDVLLSDIRMPGM 60
M + V DDD++IR VL +AL+ AG N A + +A+ D++++D+ MP
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 DGLALLKQIKQRHPMLPVIIMTAHSDLDAAVSAYQQGAFDYLPKPFDIDEAVALVERAIS 120
+ LL +IK+ P LPV++M+A + A+ A ++GA+DYLPKPFD+ E + ++ RA++
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 121 HYQEQQQPRNVQLNGPTTDIIGEAPAMQDVFRIIGRLSRSSISVLINGESGTGKELVAHA 180
+ + ++G + AMQ+++R++ RL ++ ++++I GESGTGKELVA A
Sbjct: 121 EPKRRPSKLEDDSQDGM-PLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARA 179

Query: 181 LHRHSPRAKAPFIALNMAAIPKDLIESELFGHEKGAFTGANTIRQGRFEQADGGTLFLDE 240
LH + R PF+A+NMAAIP+DLIESELFGHEKGAFTGA T GRFEQA+GGTLFLDE
Sbjct: 180 LHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDE 239

Query: 241 IGDMPLDVQTRLLRVLADGQFYRVGGYAPVKVDVRIIAATHQNLEQRVQEGKFREDLFHR 300
IGDMP+D QTRLLRVL G++ VGG P++ DVRI+AAT+++L+Q + +G FREDL++R
Sbjct: 240 IGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYR 299

Query: 301 LNVIRVHLPPLRERREDIPRLARHFLQVAARELGVEAKLLHPETEAALTRLAWPGNVRQL 360
LNV+ + LPPLR+R EDIP L RHF+Q A +E G++ K E + WPGNVR+L
Sbjct: 300 LNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKE-GLDVKRFDQEALELMKAHPWPGNVREL 358

Query: 361 ENTCRWLTVMAAGQEVLIQDLPGELFESTVAESTSQMQPDSWATLLAQWADRALRS---- 416
EN R LT + + + + EL + S + ++Q + +R
Sbjct: 359 ENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFAS 418

Query: 417 -----GHQNLLSEAQPELERTLLTTALRHTQGHKQEAARLLGWGRNTLTRKLKELGME 469
L E+E L+ AL T+G++ +AA LLG RNTL +K++ELG+
Sbjct: 419 FGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVS 476


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
CMJKDNLE_03898  PF06580  28     0.042    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 28.3 bits (63), Expect = 0.042
Identities = 34/190 (17%), Positives = 72/190 (37%), Gaps = 41/190 (21%)

Query: 171 IIEQADRLRNLVDRL---LGPQLPGTRVTE-SIHKVAERV---VTLVSMELPDNVRLIRD 223
I+E + R ++ L + L + + S+ V + L S++ D ++
Sbjct: 186 ILEDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQ 245

Query: 224 YDPSLPELAHDPDQIEQVLLN-IVRNALQ---ALGPEGGEIILRTRTAFQLTLHGERYRL 279
+P++ ++ Q+ +L+ +V N ++ A P+GG+I+L+
Sbjct: 246 INPAIMDV-----QVPPMLVQTLVENGIKHGIAQLPQGGKILLKGT------KDNGTVT- 293

Query: 280 AARIDVEDNGPGIPPHLQDTLFYPMVSGREGGTGLGLSIARNLIDQHSGK---IEFTSWP 336
++VE+ G + ++ TG GL R + G I+ +
Sbjct: 294 ---LEVENTGSLALKNTKE------------STGTGLQNVRERLQMLYGTEAQIKLSEKQ 338

Query: 337 GHTEFSVYLP 346
G V +P
Sbjct: 339 GKVNAMVLIP 348
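
This hit has the largest `Expect` value in this part of the report (0.042), just under the RPSBlast E-value of 0.05 listed in the input parameters. The following is a minimal Python sketch for parsing a `Score`/`Expect` line and comparing it with that cutoff. The name `parse_score_line` is hypothetical, and treating the cutoff as inclusive (`<=`) is an assumption, since the report does not say how PredictBias applies it.

```python
import re

# Matches lines such as: "Score = 28.3 bits (63), Expect = 0.042"
SCORE = re.compile(r"Score = ([\d.]+) bits \((\d+)\), Expect = (\S+)")

# Taken from the report's input parameters: "E-value (RPSBlast) 0.05".
E_VALUE_CUTOFF = 0.05

def parse_score_line(line):
    """Return (bit_score, raw_score, expect) parsed from a BLAST score line."""
    m = SCORE.search(line)
    if m is None:
        raise ValueError("not a Score/Expect line: " + line)
    return float(m.group(1)), int(m.group(2)), float(m.group(3))

bits, raw, expect = parse_score_line("Score = 28.3 bits (63), Expect = 0.042")
# Assumption: a hit is kept when its E-value does not exceed the cutoff.
passes = expect <= E_VALUE_CUTOFF
```

Values in exponent notation, such as `9e-06`, are parsed by `float` in the same way as plain decimals.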


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits       Score  E-value  Comments
CMJKDNLE_03900  TCRTETOQM  180    4e-51    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 180 bits (458), Expect = 4e-51
Identities = 97/445 (21%), Positives = 170/445 (38%), Gaps = 81/445 (18%)

Query: 4 KLRNIAIIAHVDHGKTTLVDKLLQQSGTFDSRAETQE--RVMDSNDLEKERGITILAKNT 61
K+ NI ++AHVD GKTTL + LL SG + D+ LE++RGITI T
Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61

Query: 62 AIKWNDYRINIVDTPGHADFGGEVERVMSMVDSVLLVVDAFDGPMPQTRFVTKKAFAYGL 121
+ +W + ++NI+DTPGH DF EV R +S++D +L++ A DG QTR + G+
Sbjct: 62 SFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGI 121

Query: 122 KPIVVINKVDRPGARPDWVVDQVFD-------------LFVNLDATDEQLD--------- 159
I INK+D+ G V + + L+ N+ T+
Sbjct: 122 PTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIEG 181

Query: 160 --------------------------------FPIVYASALNGIAGLDHEDMAEDMTPLY 187
FP+ + SA N I G+D+ L
Sbjct: 182 NDDLLEKYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNNI-GIDN---------LI 231

Query: 188 QAIVDHVPAPDVDLDGPFQMQISQLDYNSYVGVIGIGRIKRGKVKPNQQVTIIDSEGKTR 247
+ I + + ++ +++Y+ + R+ G + V I + E
Sbjct: 232 EVITNKFYSSTHRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKEKI-- 289

Query: 248 NAKVGKVLGHLGLERIETDLAEAGDIVAITGLGELNISDTVCDTQNVEALPALSVDEPTV 307
K+ ++ + E + D A +G+IV + L ++ + DT+ + + P +
Sbjct: 290 --KITEMYTSINGELCKIDKAYSGEIVILQNEF-LKLNSVLGDTKLLPQRERIENPLPLL 346

Query: 308 SMFFCVNTSPFCGKEGKFVTSRQILDRLNKELVHNVALRVEETEDADAFRVSGRGELHLS 367
+ + D L LR +S G++ +
Sbjct: 347 QTTVEPSKPQQREMLLDALLEISDSDPL---------LRYYVDSATHEIILSFLGKVQME 397

Query: 368 VLIENMRRE-GFELAVSRPKVIFRE 391
V ++ + E+ + P VI+ E
Sbjct: 398 VTCALLQEKYHVEIEIKEPTVIYME 422



Score = 32.5 bits (74), Expect = 0.005
Identities = 13/75 (17%), Positives = 29/75 (38%), Gaps = 1/75 (1%)

Query: 398 EPYENVTLDVEEQHQGSVMQALGERKGDLKNMNPDGKGRVRLDYVIPSRGLIGFRSEFMT 457
EPY + + +++ + ++ + V L IP+R + +RS+
Sbjct: 537 EPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQLKN-NEVILSGEIPARCIQEYRSDLTF 595

Query: 458 MTSGTGLLYSTFSHY 472
T+G + + Y
Sbjct: 596 FTNGRSVCLTELKGY 610


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
CMJKDNLE_03903  TCRTETB  29     0.028    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 29.5 bits (66), Expect = 0.028
Identities = 31/161 (19%), Positives = 64/161 (39%), Gaps = 15/161 (9%)

Query: 227 NVFFVYAVYCGLTFFIPFLKNIYLLP----------VALVGAYGIINQYCLKMIGGPIGG 276
N+ F+ V CG F + ++P A +G+ I +I G IGG
Sbjct: 255 NIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGG 314

Query: 277 MISDKILKSPSKYLCYTFIISTAALVLLIMLPHESMPVYLGMACTLGFGAIVFTQRAVFF 336
++ D+ + P L + + + L E+ ++ + G + FT+
Sbjct: 315 ILVDR--RGPLYVLNIGVTFLSVSFLTASFLL-ETTSWFMTIIIVFVLGGLSFTK--TVI 369

Query: 337 APIGEAKIAENKTGAAMALGSFIGYAPAMFCFSLYGYILDL 377
+ I + + + + GA M+L +F + ++ G +L +
Sbjct: 370 STIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSI 410


S.No  Start           End             Bias  Virulence  Insertion elements  Prediction
104   CMJKDNLE_04053  CMJKDNLE_04062  N     Y          N                   Pathogenicity Island (unbiased-composition)

LocusTag        DNBias  CDNBias  %GCBias    Product
CMJKDNLE_04053  -1      16        3.544943  NtrB
CMJKDNLE_04054   0      17        2.983896  HyfR DNA-binding transcriptional activator
CMJKDNLE_04055   0      15        1.885930  phosphoribosylamine-glycine ligase
CMJKDNLE_04056   0      14        1.293289  AICAR transformylase / IMP cyclohydrolase
CMJKDNLE_04058  -1      11        0.399631  stress response protein
CMJKDNLE_04059  -2      11       -3.222324  putative acetyltransferase
CMJKDNLE_04060  -2      11       -2.995306  homoserine O-succinyltransferase
CMJKDNLE_04061  -1       9       -0.845128  malate synthase A
CMJKDNLE_04062  -1       8       -0.329034  isocitrate lyase monomer
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
CMJKDNLE_04053  PF06580  38     9e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 37.5 bits (87), Expect = 9e-05
Identities = 49/262 (18%), Positives = 104/262 (39%), Gaps = 43/262 (16%)

Query: 204 ILFALATVLLA-SVLSFFW-YRRYLRSRQLLQDEMKRKEKLVALGHLAAGV-AHEIRNPL 260
I+F + V S+L F W + + + ++ Q +M + L L A + H + N L
Sbjct: 120 IIFNVVVVTFMWSLLYFGWHFFKNYKQAEIDQWKMASMAQEAQLMALKAQINPHFMFNAL 179

Query: 261 SSIKGLAKYFAERAPAGGEAHQLAQVM---AKEADRLNRVVSELLELVKPTHLALQAVDL 317
++I+ L +A L+++M + ++ +++ L +V ++L L ++
Sbjct: 180 NNIRALILEDPTKAREM--LTSLSELMRYSLRYSNARQVSLADELTVVD-SYLQLASIQF 236

Query: 318 NTLINHSLQLVSQDANSREIQLRFTANDTLPEIQADPDRLTQVLL-NLYLNAIQAIGQHG 376
+ Q+ + ++Q+ P L Q L+ N + I + Q G
Sbjct: 237 EDRLQFENQI---NPAIMDVQV--------------PPMLVQTLVENGIKHGIAQLPQGG 279

Query: 377 VISVTASESGAGVKISVTDSGKGIAADQLDAIFTPYFTTKAEGTGLGLAVVHNIVEQHGG 436
I + ++ V + V ++G + E TG GL V ++ G
Sbjct: 280 KILLKGTKDNGTVTLEVENTGSLALKNT------------KESTGTGLQNVRERLQMLYG 327

Query: 437 ---TIQVASQEGKGSTFTLWLP 455
I+++ ++GK + +P
Sbjct: 328 TEAQIKLSEKQGKV-NAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
CMJKDNLE_04054  HTHFIS  524    0.0      FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 524 bits (1351), Expect = 0.0
Identities = 183/468 (39%), Positives = 253/468 (54%), Gaps = 35/468 (7%)

Query: 8 ILVVDDDISHCTILQALLRGWGYNVALANSGRQALEQVREQVFDLVLCDVRMAEMDGIAT 67
ILV DDD + T+L L GY+V + ++ + DLV+ DV M + +
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 68 LKEIKALNPAIPVLIMTAYSSVETAVEALKTGALDYLIKPLDFDNLQATLEKALAHTHSI 127
L IK P +PVL+M+A ++ TA++A + GA DYL KP D L + +ALA
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRR 125

Query: 128 DAETPAVTASQFGMVGKSPAMQHLLSEIALVAPSEATVLIHGDSGTGKELVARAIHASSA 187
++ + +VG+S AMQ + +A + ++ T++I G+SGTGKELVARA+H
Sbjct: 126 PSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGK 185

Query: 188 RSEKPLVTLNCAALNESLLESELFGHEKGAFTGADKRREGRFVEADGGTLFLDEIGDISP 247
R P V +N AA+ L+ESELFGHEKGAFTGA R GRF +A+GGTLFLDEIGD+
Sbjct: 186 RRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPM 245

Query: 248 MMQVRLLRAIQEREVQRVGSNQIISVDVRLIAATHRDLAAEVNAGRFRQDLYYRLNVVAI 307
Q RLLR +Q+ E VG I DVR++AAT++DL +N G FR+DLYYRLNVV +
Sbjct: 246 DAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPL 305

Query: 308 EVPSLRQRREDIPLLAGHFLQRFAERNRKAVKGFTPQAMDLLIHYDWPGNIRELENAVER 367
+P LR R EDIP L HF+Q+ + VK F +A++L+ + WPGN+RELEN V R
Sbjct: 306 RLPPLRDRAEDIPDLVRHFVQQAEKEGLD-VKRFDQEALELMKAHPWPGNVRELENLVRR 364

Query: 368 AVVLLTGEYISERELPLAIASTPIPLGQSQDIQP-------------------------- 401
L + I+ + + S +
Sbjct: 365 LTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALP 424

Query: 402 --------LVEVEKEVILAALEKTGGNKTEAARQLGITRKTLLAKLSR 441
L E+E +ILAAL T GN+ +AA LG+ R TL K+
Sbjct: 425 PSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRE 472


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_04058  SHAPEPROTEIN  31     7e-04    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 31.3 bits (71), Expect = 7e-04
Identities = 23/62 (37%), Positives = 32/62 (51%), Gaps = 9/62 (14%)

Query: 37 IANFFVAEKVLQDLVLQLHPRSTWHSFLPAKRMDIVVSALEMNEGGLSQVEERILHEVVA 96
IA+FFV EK+LQ + Q+H S P+ R+ + V G +QVE R + E
Sbjct: 81 IADFFVTEKMLQHFIKQVHSNSF---MRPSPRVLVCVPV------GATQVERRAIRESAQ 131

Query: 97 GA 98
GA
Sbjct: 132 GA 133


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_04059  SACTRNSFRASE  35     4e-05    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 34.9 bits (80), Expect = 4e-05
Identities = 16/54 (29%), Positives = 21/54 (38%), Gaps = 5/54 (9%)

Query: 78 IDPDVRGCGVGRVLVEHALSMAPE-----LTTNVNEQNEQAVGFYKKVGFKVTG 126
+ D R GVG L+ A+ A E L + N A FY K F +
Sbjct: 97 VAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFIIGA 150


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_04062  BINARYTOXINB  32     0.004    Binary toxin B family signature.
		>BINARYTOXINB#Binary toxin B family signature.

Length = 764

Score = 32.3 bits (73), Expect = 0.004
Identities = 14/58 (24%), Positives = 23/58 (39%)

Query: 289 ETSTPDLELARRFAQAIHAKYPGKLLAYNCSPSFNWQKNLDDKTIASFQQQLSDMGYK 346
ET+ PD+ L A P L Y + N D +T + + QL+++
Sbjct: 544 ETTKPDMTLKEALKIAFGFNEPNGNLQYQGKDITEFDFNFDQQTSQNIKNQLAELNAT 601


S.No  Start           End             Bias  Virulence  Insertion elements  Prediction
105   CMJKDNLE_04080  CMJKDNLE_04089  N     Y          N                   Pathogenicity Island (unbiased-composition)

LocusTag        DNBias  CDNBias  %GCBias    Product
CMJKDNLE_04080  -1      14        1.287071  hypothetical protein
CMJKDNLE_04081  -1      15        0.420004  putative porin
CMJKDNLE_04082   0      18       -0.272218  putative phosphate starvation-inducible protein
CMJKDNLE_04083  -1      18        0.455232  xylose:H+ symporter
CMJKDNLE_04084  -1      21        0.968112  maltose ABC transporter - membrane subunit
CMJKDNLE_04085   0      20        0.809270  maltose ABC transporter - membrane subunit
CMJKDNLE_04087   1      21        0.329447  maltose ABC transporter - periplasmic binding
CMJKDNLE_04088   0      18       -0.446946  hypothetical protein
CMJKDNLE_04089   0      17       -5.078216  maltose ABC transporter - ATP binding subunit
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_04080  CHANLCOLICIN  30     0.007    Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 30.4 bits (68), Expect = 0.007
Identities = 21/95 (22%), Positives = 38/95 (40%), Gaps = 3/95 (3%)

Query: 20 AAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPRLANSWWPGAVISEELATAAALRQQQALL 79
A + + + LT + L D+V + N+ + A AA++ + L
Sbjct: 73 AKAAAEAQAKAKANRDALT--QRLKDIVNEALRHNASRTPSATELAHANNAAMQAEDERL 130

Query: 80 TRLAEQGADSSADDAAAINALRQQIQALKVTGRQK 114
RLA+ + + AA A ++ Q K R+K
Sbjct: 131 -RLAKAEEKARKEAEAAEKAFQEAEQRRKEIEREK 164


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
CMJKDNLE_04083  TCRTETA  36     4e-04    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 35.6 bits (82), Expect = 4e-04
Identities = 20/87 (22%), Positives = 42/87 (48%), Gaps = 3/87 (3%)

Query: 279 VIGVMLSIFQQFVGINVVLYYAPEVFKTLGASTDIALLQTIIVGVINLTFTVLAIMT--- 335
+I ++ ++ VGI +++ P + + L S D+ I++ + L A +
Sbjct: 7 LIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGAL 66

Query: 336 VDKFGRKPLQIIGALGMAIGMFSLGTA 362
D+FGR+P+ ++ G A+ + TA
Sbjct: 67 SDRFGRRPVLLVSLAGAAVDYAIMATA 93


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits        Score  E-value  Comments
CMJKDNLE_04085  FLGHOOKAP1  31     0.011    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 31.1 bits (70), Expect = 0.011
Identities = 22/124 (17%), Positives = 43/124 (34%), Gaps = 21/124 (16%)

Query: 128 GDEWQLALSDGETGKNYLSDAFKFGGEQKLQLKETTAQPEGERANLRVITQNRQALSDIT 187
++WQ+ T DA L+L T + L+ + A+ ++
Sbjct: 367 NNQWQVTRLASNTTFTVTPDANGKVAFDGLELTFTGTPAVNDSFTLKPV---SDAIVNMD 423

Query: 188 AILPDGNKVMMSSLRQFSGTQPLYTLDGDGTLTNNQSGVKYRPNNQ--------IGFYQS 239
++ D K+ M+S GD N Q+ + + N++ Y S
Sbjct: 424 VLITDEAKIAMAS----------EEDAGDSDNRNGQALLDLQSNSKTVGGAKSFNDAYAS 473

Query: 240 ITAD 243
+ +D
Sbjct: 474 LVSD 477


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits       Score  E-value  Comments
CMJKDNLE_04087  MALTOSEBP  756    0.0      Maltose binding protein signature.
		>MALTOSEBP#Maltose binding protein signature.

Length = 396

Score = 756 bits (1953), Expect = 0.0
Identities = 396/396 (100%), Positives = 396/396 (100%)

Query: 1 MKIKTGARILALSALTTMMFSASALAKIEEGKLVIWINGDKGYNGLAEVGKKFEKDTGIK 60
MKIKTGARILALSALTTMMFSASALAKIEEGKLVIWINGDKGYNGLAEVGKKFEKDTGIK
Sbjct: 1 MKIKTGARILALSALTTMMFSASALAKIEEGKLVIWINGDKGYNGLAEVGKKFEKDTGIK 60

Query: 61 VTVEHPDKLEEKFPQVAATGDGPDIIFWAHDRFGGYAQSGLLAEITPDKAFQDKLYPFTW 120
VTVEHPDKLEEKFPQVAATGDGPDIIFWAHDRFGGYAQSGLLAEITPDKAFQDKLYPFTW
Sbjct: 61 VTVEHPDKLEEKFPQVAATGDGPDIIFWAHDRFGGYAQSGLLAEITPDKAFQDKLYPFTW 120

Query: 121 DAVRYNGKLIAYPIAVEALSLIYNKDLLPNPPKTWEEIPALDKELKAKGKSALMFNLQEP 180
DAVRYNGKLIAYPIAVEALSLIYNKDLLPNPPKTWEEIPALDKELKAKGKSALMFNLQEP
Sbjct: 121 DAVRYNGKLIAYPIAVEALSLIYNKDLLPNPPKTWEEIPALDKELKAKGKSALMFNLQEP 180

Query: 181 YFTWPLIAADGGYAFKYENGKYDIKDVGVDNAGAKAGLTFLVDLIKNKHMNADTDYSIAE 240
YFTWPLIAADGGYAFKYENGKYDIKDVGVDNAGAKAGLTFLVDLIKNKHMNADTDYSIAE
Sbjct: 181 YFTWPLIAADGGYAFKYENGKYDIKDVGVDNAGAKAGLTFLVDLIKNKHMNADTDYSIAE 240

Query: 241 AAFNKGETAMTINGPWAWSNIDTSKVNYGVTVLPTFKGQPSKPFVGVLSAGINAASPNKE 300
AAFNKGETAMTINGPWAWSNIDTSKVNYGVTVLPTFKGQPSKPFVGVLSAGINAASPNKE
Sbjct: 241 AAFNKGETAMTINGPWAWSNIDTSKVNYGVTVLPTFKGQPSKPFVGVLSAGINAASPNKE 300

Query: 301 LAKEFLENYLLTDEGLEAVNKDKPLGAVALKSYEEELAKDPRIAATMENAQKGEIMPNIP 360
LAKEFLENYLLTDEGLEAVNKDKPLGAVALKSYEEELAKDPRIAATMENAQKGEIMPNIP
Sbjct: 301 LAKEFLENYLLTDEGLEAVNKDKPLGAVALKSYEEELAKDPRIAATMENAQKGEIMPNIP 360

Query: 361 QMSAFWYAVRTAVINAASGRQTVDEALKDAQTRITK 396
QMSAFWYAVRTAVINAASGRQTVDEALKDAQTRITK
Sbjct: 361 QMSAFWYAVRTAVINAASGRQTVDEALKDAQTRITK 396


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
CMJKDNLE_04089  PF05272  35     6e-04    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 34.7 bits (79), Expect = 6e-04
Identities = 13/35 (37%), Positives = 18/35 (51%)

Query: 32 VVFVGPSGCGKSTLLRMIAGLETITSGDLFIGEKR 66
VV G G GKSTL+ + GL+ + IG +
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGK 633


S.No  Start           End             Bias  Virulence  Insertion elements  Prediction
106   CMJKDNLE_04202  CMJKDNLE_04213  N     Y          Y                   Pathogenicity Island (unbiased-composition)

LocusTag        DNBias  CDNBias  %GCBias    Product
CMJKDNLE_04202   1      12       -4.406260  Transcriptional regulatory protein DcuR
CMJKDNLE_04203  -1      13       -3.696211  NtrB
CMJKDNLE_04204  -1      17       -4.401598  hypothetical protein
CMJKDNLE_04205   0      21       -4.081730  putative acyltransferase with acyl-CoA
CMJKDNLE_04206   1      20       -4.944094  hypothetical protein
CMJKDNLE_04207   1      19       -4.116486  lysyl-tRNA synthetase
CMJKDNLE_04208   1      14       -2.768898  dipeptide:H+ symporter YjdL
CMJKDNLE_04209   1      17       -3.084380  lysine decarboxylase 1
CMJKDNLE_04210   2      17       -1.963701  cadaverine:H+ symporter / lysine:cadaverine
CMJKDNLE_04211   2      17       -2.035496  CadC DNA-binding transcriptional activator
CMJKDNLE_04213   1      19       -0.119387  *putative transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
CMJKDNLE_04202  HTHFIS  70     4e-16    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 70.2 bits (172), Expect = 4e-16
Identities = 31/109 (28%), Positives = 50/109 (45%), Gaps = 4/109 (3%)

Query: 4 VLIIDDDAMVAELNRRYVAQIPGFQCCGTASTLEKAKEIIFNSDTPIDLILLDIYMQKEN 63
+L+ DDDA + + + +++ G+ S I + DL++ D+ M EN
Sbjct: 6 ILVADDDAAIRTVLNQALSRA-GYDVR-ITSNAATLWRWI--AAGDGDLVVTDVVMPDEN 61

Query: 64 GLDLLPVLHNARCKSDVIVISSAADAATIKDSLHYGVVDYLIKPFQASR 112
DLLP + AR V+V+S+ T + G DYL KPF +
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTE 110


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
CMJKDNLE_04203  PF06580  41     7e-06    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 41.0 bits (96), Expect = 7e-06
Identities = 21/99 (21%), Positives = 38/99 (38%), Gaps = 18/99 (18%)

Query: 442 LIENALE-ALGP-EPGGEISVTLHYRHGWLHCEVNDDGPGIAPDKIDHIFDKGVSTKGSE 499
L+EN ++ + GG+I + +G + EV + G +
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKN------------TKES 310

Query: 500 RGVGLALVKQQVENLGG---SIAVESEPGIFTQFFVQIP 535
G GL V+++++ L G I + + G V IP
Sbjct: 311 TGTGLQNVRERLQMLYGTEAQIKLSEKQGKVN-AMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_04205  SACTRNSFRASE  27     0.011    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 26.8 bits (59), Expect = 0.011
Identities = 9/28 (32%), Positives = 16/28 (57%)

Query: 32 LAIIEHTDVDESLKGQGIGKQLVAKVVE 59
A+IE V + + +G+G L+ K +E
Sbjct: 89 YALIEDIAVAKDYRKKGVGTALLHKAIE 116


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
CMJKDNLE_04208  TCRTETA  30     0.023    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 29.8 bits (67), Expect = 0.023
Identities = 36/190 (18%), Positives = 66/190 (34%), Gaps = 14/190 (7%)

Query: 44 NHAISLFSAYA-SLVYVTPILGGWLADRLLGNRTAVIAGALLMTLGHVVLGIDTNSTFSL 102
H L + YA P+LG +DR G R ++ + + ++ + L
Sbjct: 43 AHYGILLALYALMQFACAPVLGAL-SDRF-GRRPVLLVSLAGAAVDYAIMAT-APFLWVL 99

Query: 103 YLALAIIICGYGLFKSNISCLLGELYDEND-HRRDGGFSLLYAAGNIGSIAAPIACGLAA 161
Y+ + G+ + + + D D R F + A G +A P+ GL
Sbjct: 100 YIGRIV----AGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMG 155

Query: 162 QWYGWHVGFALAGGGMFIGLLIFLSGHRHFQSTRSMDKKALTSVKF-ALPVWSWLVVMLC 220
+ H F A + L FL+G + +++ L L + W M
Sbjct: 156 G-FSPHAPFFAAA---ALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTV 211

Query: 221 LAPVFFTLLL 230
+A + +
Sbjct: 212 VAALMAVFFI 221


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_04211  SYCDCHAPRONE  37     7e-05    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 36.8 bits (85), Expect = 7e-05
Identities = 16/97 (16%), Positives = 36/97 (37%), Gaps = 7/97 (7%)

Query: 391 PLDEKQLAALNTEIDNIVTLPELNNLS-----IIYQIKAVSALVKGKTDESYQAINTGID 445
++ A+ + + T+ LN +S +Y + A + GK +++++
Sbjct: 6 TDTQEYQLAMESFLKGGGTIAMLNEISSDTLEQLYSL-AFNQYQSGKYEDAHKVFQALCV 64

Query: 446 LEMSWLNYVL-LGKVYEMKGMNREAADAYLTAFNLRP 481
L+ + L LG + G A +Y +
Sbjct: 65 LDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDI 101


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
CMJKDNLE_04213  HTHTETR  45     5e-08    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 44.6 bits (105), Expect = 5e-08
Identities = 28/188 (14%), Positives = 51/188 (27%), Gaps = 13/188 (6%)

Query: 3 REDVLGEALKLLELQGIANTTLEMVAERVDYPLDELRRFWPDKEAILYDALRYLSQQIDV 62
R+ +L AL+L QG+++T+L +A+ + + DK + + I
Sbjct: 13 RQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGE 72

Query: 63 WRRQLMLDETQTAEQKLLARYQALSECVKNNRYPGCLFIAACTFYPDPGH----PIHQLA 118
+ L + E + F+ + Q
Sbjct: 73 LELEYQAKFPGDPLSVLREILIHVLEST--VTEERRRLLMEIIFHKCEFVGEMAVVQQAQ 130

Query: 119 DQQKSAAYDFTHELLTT-------LEVDDPAMVAKQMELVLEGCLSRMLVNRSQADVDTA 171
+YD + L A M + G + L D+
Sbjct: 131 RNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDLKKE 190

Query: 172 HRLAEDIL 179
R IL
Sbjct: 191 ARDYVAIL 198


S.No  Start           End             Bias  Virulence  Insertion elements  Prediction
107   CMJKDNLE_04465  CMJKDNLE_04471  N     Y          N                   Pathogenicity Island (unbiased-composition)

LocusTag        DNBias  CDNBias  %GCBias    Product
CMJKDNLE_04465  -2      16        0.963311  putative phosphoglycerate mutase 2
CMJKDNLE_04466  -2      13        0.321530  Rob DNA-binding transcriptional activator
CMJKDNLE_04467  -1      15       -0.153823  hypothetical protein
CMJKDNLE_04468  -2      13        0.521391  TorR transcriptional dual regulator
CMJKDNLE_04469  -2      12        0.753464  EnvZ sensory histidine kinase
CMJKDNLE_04470  -2      11        0.534090  putative inner membrane protein
CMJKDNLE_04471  -2      10        1.173693  TorR transcriptional dual regulator
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
CMJKDNLE_04465  VACCYTOTOXIN  29     0.016    Helicobacter pylori vacuolating cytotoxin signature.
		>VACCYTOTOXIN#Helicobacter pylori vacuolating cytotoxin signature.

Length = 1291

Score = 28.8 bits (64), Expect = 0.016
Identities = 14/45 (31%), Positives = 20/45 (44%), Gaps = 4/45 (8%)

Query: 145 PLLVSHGIALGCLVSTILGLPAWAERRLRLRNCSISRVDYQESLW 189
P +V GIA G V T+ GL W ++ N D + +W
Sbjct: 42 PAIVG-GIATGAAVGTVSGLLGWGLKQAEEAN---KTPDKPDKVW 82


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
CMJKDNLE_04468  HTHFIS  88     2e-22    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 88.3 bits (219), Expect = 2e-22
Identities = 34/139 (24%), Positives = 60/139 (43%)

Query: 1 MQRETVWLVEDEQGIADTLVYMLQQEGFAVEVFERGLPVLDKARQQAPDVMILDVGLPDI 60
M T+ + +D+ I L L + G+ V + + D+++ DV +PD
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 SGFELCRQLLALHPALPVLFLTARSEEVDRLLGLEIGADDYVAKPFSPREVCARVRTLLR 120
+ F+L ++ P LPVL ++A++ + + E GA DY+ KPF E+ + L
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 121 RVKKFSSPSPVIRIGHFEL 139
K+ S L
Sbjct: 121 EPKRRPSKLEDDSQDGMPL 139


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
CMJKDNLE_04469  PF06580  33     0.002    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 32.9 bits (75), Expect = 0.002
Identities = 40/182 (21%), Positives = 73/182 (40%), Gaps = 40/182 (21%)

Query: 312 LRQARLENRQEVVLTVVDVAALFR---RVSEARTVQLAE--KNITLHVM--------PTE 358
+R LE+ + + ++ L R R S AR V LA+ + ++ +
Sbjct: 182 IRALILEDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQ 241

Query: 359 VNVAAEPALLEQALGNLL-----DNA----IDFTPESGRITLSAEVDQEHVALKVLDTGS 409
PA+++ + +L +N I P+ G+I L D V L+V +TGS
Sbjct: 242 FENQINPAIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGS 301

Query: 410 GIPDYALSRIFERFYSLPRANGQKSSGLGLAFVSE-VARLFNGEVTLR-NVQEGGVLASL 467
N ++S+G GL V E + L+ E ++ + ++G V A +
Sbjct: 302 LALK----------------NTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMV 345

Query: 468 RL 469
+
Sbjct: 346 LI 347


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
CMJKDNLE_04471  HTHFIS  82     4e-20    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 81.8 bits (202), Expect = 4e-20
Identities = 30/122 (24%), Positives = 60/122 (49%), Gaps = 1/122 (0%)

Query: 1 MQTPHILIVEDELVTRNTLKSIFEAEGYDVFEATDGAEMHQILSEYDINLVIMDINLPGK 60
M IL+ +D+ R L GYDV ++ A + + ++ D +LV+ D+ +P +
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 NGLLLARELRE-QANVALMFLTGRDNEVDKILGLEIGADDYITKPFNPRELTIRARNLLS 119
N L +++ + ++ ++ ++ ++ + I E GA DY+ KPF+ EL L+
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 120 RT 121

Sbjct: 121 EP 122



 
Contact Sachin Pundhir for Bugs/Comments.