PredictBias

Identification of genomic and pathogenicity islands in prokaryotic genomes
 
A) Input parameters
Genome: 1026.gbk
Threshold dinucleotide bias: 2
Threshold codon bias: 4
Threshold %GC bias: 3
E-value (RPSBlast): 0.05
Genome (non-pathogenic):
 
B) Compare a potential GI or PAI in related non-pathogenic sp. (phylogenetic tree)
 
C) Potential GIs and PAIs in NC_008253
S.No  Start  End  Bias  Virulence  Insertion elements  Prediction
1  ECP_0058  ECP_0070  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
ECP_0058  -2  20  3.430294  Dna-J like membrane chaperone protein
ECP_0059  -2  20  4.150485  23S rRNA/tRNA pseudouridine synthase A
ECP_0060  -2  19  4.025131  ATP-dependent helicase HepA
ECP_0061  -2  11  1.786035  DNA polymerase II
ECP_0062  -2  11  -0.653024  L-ribulose-5-phosphate 4-epimerase
ECP_0063  -1  11  -0.736746  L-arabinose isomerase
ECP_0064  2  16  -1.129061  ribulokinase
ECP_0065  1  19  -1.237876  DNA-binding transcriptional regulator AraC
ECP_0066  2  15  -1.338895  hypothetical protein
ECP_0067  0  14  2.262635  hypothetical protein
ECP_0068  2  17  3.600763  DedA family integral membrane protein
ECP_0069  1  17  3.765351  thiamine transporter ATP-binding subunit
ECP_0070  1  19  3.530521  thiamine transporter membrane protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_0058  56KDTSANTIGN  29  0.022  Rickettsia 56kDa type-specific antigen protein sign...
		>56KDTSANTIGN#Rickettsia 56kDa type-specific antigen protein signature.

Length = 533

Score = 29.2 bits (65), Expect = 0.022
Identities = 32/120 (26%), Positives = 51/120 (42%), Gaps = 18/120 (15%)

Query: 157 IAEELGISRAQFD-----QFLRMMQGGAQFGGGYQQQSGGGNWQQAQRGPTLEDACNVLG 211
EEL R FD F+ + QQQ G G QQAQ T ++A
Sbjct: 310 TLEEL---RDSFDGYINNAFVNQIHLNFVMPPQAQQQQGQGQQQQAQ--ATAQEAVAAAA 364

Query: 212 VKPTDDATTIKRAYRKLMS-EHHPDKLVAKGLPPEMMEMAKQKAQEIQ-QAYELIKQQKG 269
V+ + + I + Y+ L+ + H G+ M ++A Q+ ++ + Q KQQ+G
Sbjct: 365 VRLLNGSDQIAQLYKDLVKLQRH------AGIRKAMEKLAAQQEEDAKNQGKGDCKQQQG 418


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_0067  SECYTRNLCASE  26  0.047  Preprotein translocase SecY subunit signature.
		>SECYTRNLCASE#Preprotein translocase SecY subunit signature.

Length = 437

Score = 26.3 bits (58), Expect = 0.047
Identities = 13/29 (44%), Positives = 17/29 (58%), Gaps = 1/29 (3%)

Query: 2 SKYIYILLSF-LVLFFIFFYAYISLMSKE 29
IYI+ F L++FF FFY IS +E
Sbjct: 314 DHPIYIVTYFLLIVFFAFFYVAISFNPEE 342


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_0070  PF06580  31  0.014  Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 30.6 bits (69), Expect = 0.014
Identities = 17/80 (21%), Positives = 27/80 (33%), Gaps = 5/80 (6%)

Query: 4 RRQPLIPGWLIPGVSAATLVVAVALAAFLALWWNAPQGDWVAVWQDS-YLWHVVRFSFWQ 62
R GWL + L V A +W+ A ++W+ ++
Sbjct: 60 RSFIKRQGWLKLNMGQIILRVLPACVVIGMVWFVAN----TSIWRLLAFINTKPVAFTLP 115

Query: 63 AFLSAQLSVVPAIFLARALY 82
LS +VV F+ LY
Sbjct: 116 LALSIIFNVVVVTFMWSLLY 135
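
The "Score ... Expect" and "Identities ..." lines printed under each hit can be read programmatically and compared with the E-value (RPSBlast) cutoff of 0.05 listed under the input parameters. The following is only a minimal Python sketch of that idea: the function names `parse_hit_stats` and `passes_cutoff` are hypothetical, the regular expressions are written against the line layout seen in this report, and the "less than or equal" comparison is an assumption rather than the documented behaviour of PredictBias.

```python
import re

# Patterns for the two statistics lines that follow each hit in this report.
SCORE_RE = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)"
)
FRACTION_RE = re.compile(r"(Identities|Positives|Gaps)\s*=\s*(\d+)/(\d+)")


def parse_hit_stats(score_line, identity_line):
    """Return bit score, raw score, E-value and match fractions for one hit."""
    match = SCORE_RE.search(score_line)
    if match is None:
        raise ValueError("not a Score/Expect line: %r" % score_line)
    expect = match.group(3)
    if expect.startswith("e"):
        # Some BLAST versions print 'e-51' instead of '1e-51'.
        expect = "1" + expect
    stats = {
        "bits": float(match.group(1)),
        "raw_score": int(match.group(2)),
        "evalue": float(expect),
    }
    for name, num, den in FRACTION_RE.findall(identity_line):
        stats[name.lower()] = (int(num), int(den))
    # The Gaps field is omitted when the alignment has no gaps.
    if "identities" in stats:
        stats.setdefault("gaps", (0, stats["identities"][1]))
    return stats


def passes_cutoff(stats, evalue_cutoff=0.05):
    """Assumed rule: keep a hit when its E-value does not exceed the cutoff."""
    return stats["evalue"] <= evalue_cutoff


if __name__ == "__main__":
    hit = parse_hit_stats(
        "Score = 30.6 bits (69), Expect = 0.014",
        "Identities = 17/80 (21%), Positives = 27/80 (33%), Gaps = 5/80 (6%)",
    )
    print(hit, passes_cutoff(hit))
```

Under this assumed rule every hit listed in the report passes, since the largest E-value shown is 0.047.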


2  ECP_0111  ECP_0121  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
ECP_0111  2  15  -3.092554  regulatory protein AmpE
ECP_0112  4  17  -4.319030  aromatic amino acid transporter
ECP_0113  5  29  -8.183765  colicin
ECP_0114  6  26  -8.100554  colicin immunity protein
ECP_0115  4  28  -7.622198  uropathogenic specific protein
ECP_0116  3  27  -1.551462  colicin immunity protein
ECP_0117  4  35  0.485583  uropathogenic specific protein
ECP_0118  5  38  0.982556  colicin immunity protein
ECP_0119  4  32  1.981981  transcriptional regulator PdhR
ECP_0120  3  35  2.308584  hypothetical protein
ECP_0121  3  35  2.505891  pyruvate dehydrogenase subunit E1
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_0113  PYOCINKILLER  181  1e-51  Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 181 bits (459), Expect = 1e-51
Identities = 92/253 (36%), Positives = 125/253 (49%), Gaps = 21/253 (8%)

Query: 343 GEGTPYENVRVANMQWNEQTQRYEFT---PAHDVDGPLITWTPENPEHGNVPGHTGN--D 397
G P + V V +N T YE T + ++TWTP +P P T
Sbjct: 377 GVSVP-KAVPVRMAAYNATTGLYEVTVPSTTAEAPPLILTWTPASPPGNQNPSSTTPVVP 435

Query: 398 RPPLDQPTILVTPIPDGTDTYTTPPFPVPDPKEFNDYILVFPAGSGIKPIYVYLKEDPRK 457
+P +TP+ T +P D I+ FPA SGIKPIYV + DPR
Sbjct: 436 KPVPVYEGATLTPV-----KATPETYPGVITLP-EDLIIGFPADSGIKPIYVMFR-DPRD 488

Query: 458 LPGVVTGHGVPLSPGTRWLDMSVSNNGNGAPIPAHIADKLRGREFKTFDEFREALWLEVS 517
+PG TG G P+ WL ++ G GAPIP+ IADKLRG+ FK + +FRE W+ V+
Sbjct: 489 VPGAATGKGQPV--SGNWLG--AASQGEGAPIPSQIADKLRGKTFKNWRDFREQFWIAVA 544

Query: 518 QDPELIAQFSSGNQTRIKQGLTAKAPIDGWHYGPKDIVKKFQIHHRVAIEYGGSVYDIDN 577
DPEL QF+ G+ ++ G G + K +IHH+V + GG VY++ N
Sbjct: 545 NDPELSKQFNPGSLAVMRDGGAPYVRESE-QAGGR---IKIEIHHKVRVADGGGVYNMGN 600

Query: 578 LRIVTPRLHDEIH 590
L VTP+ H EIH
Sbjct: 601 LVAVTPKRHIEIH 613


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_0115  PYOCINKILLER  53  4e-12  Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 53.3 bits (127), Expect = 4e-12
Identities = 17/64 (26%), Positives = 31/64 (48%), Gaps = 4/64 (6%)

Query: 1 MSQYPELIAQFSSGNQTRIKQGLIAKAPLEGWHYGTKEIVKKFHMYHRVAIEYSGGIYDI 60
++ PEL QF+ G+ ++ G E G + K ++H+V + GG+Y++
Sbjct: 543 VANDPELSKQFNPGSLAVMRDGGAPYVR-ESEQAGGRI---KIEIHHKVRVADGGGVYNM 598

Query: 61 DNLR 64
NL
Sbjct: 599 GNLV 602


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_0116  PF04605  26  0.019  Virulence-associated protein D (VapD)
		>PF04605#Virulence-associated protein D (VapD)

Length = 125

Score = 26.0 bits (57), Expect = 0.019
Identities = 13/34 (38%), Positives = 20/34 (58%)

Query: 1 MYNFKDKIEDYTEREFIELLGEFTNPTGDNAQLK 34
Y+ K+ I+D ++F + L EFT T N +LK
Sbjct: 88 QYSLKETIQDLCAKDFHQKLKEFTEKTPKNQKLK 121


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_0117  PYOCINKILLER  44  3e-09  Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 44.0 bits (103), Expect = 3e-09
Identities = 13/53 (24%), Positives = 24/53 (45%), Gaps = 4/53 (7%)

Query: 4 QFSTGNQTRIKQGLIAKAPLEGWHYGSKEIVKEFHIYHSVAIECGGEIYDIDN 56
QF+ G+ ++ G E G + + I+H V + GG +Y++ N
Sbjct: 552 QFNPGSLAVMRDGGAPYVR-ESEQAGGRI---KIEIHHKVRVADGGGVYNMGN 600


3  ECP_0140  ECP_0163  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
ECP_0140  0  18  -3.016216  hypothetical protein
ECP_0141  2  20  -4.063203  transposase, YhgA-like
ECP_0142  2  23  -4.637455  pantoate--beta-alanine ligase
ECP_0143  5  30  -6.646328  3-methyl-2-oxobutanoate
ECP_0144  3  31  -8.052162  hypothetical protein
ECP_0145  3  30  -7.786979  hypothetical protein
ECP_0146  3  30  -7.523420  protein YadK
ECP_0147  2  29  -6.959092  hypothetical protein
ECP_0148  0  20  -4.311837  hypothetical protein
ECP_0149  -1  17  -2.923976  outer membrane usher protein
ECP_0150  -1  17  -0.426348  chaperone protein EcpD
ECP_0151  0  14  0.395831  fimbrial-like protein YadN
ECP_0152  0  15  2.007474  2-amino-4-hydroxy-6-
ECP_0153  0  14  3.454206  poly(A) polymerase
ECP_0154  -1  15  3.262751  glutamyl-Q tRNA(Asp) synthetase
ECP_0155  0  13  2.500859  DnaK transcriptional regulator DksA
ECP_0156  0  13  2.763058  sugar fermentation stimulation protein A
ECP_0157  0  13  3.134311  2'-5' RNA ligase
ECP_0158  -1  15  4.047648  ATP-dependent RNA helicase HrpB
ECP_0159  -2  16  3.846650  penicillin-binding protein 1b
ECP_0160  -1  14  3.698124  ferrichrome outer membrane transporter
ECP_0161  0  16  4.718021  iron-hydroxamate transporter ATP-binding
ECP_0162  1  15  4.410186  iron-hydroxamate transporter substrate-binding
ECP_0163  0  15  4.452932  iron-hydroxamate transporter permease subunit
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_0143  FLGMRINGFLIF  29  0.017  Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 29.2 bits (65), Expect = 0.017
Identities = 27/100 (27%), Positives = 40/100 (40%), Gaps = 22/100 (22%)

Query: 110 MVKIEGGEWL----VETVQMLTERAVPVCGHLGLTPQSVNIFGGYKVQGRGDEAGDRLL- 164
V +E G L + V L AV GL P +V + D++G LL
Sbjct: 176 TVTLEPGRALDEGQISAVVHLVSSAVA-----GLPPGNVTLV---------DQSG-HLLT 220

Query: 165 -SDALALEAAGAQLLVLECVPVELAKRITEALAIPVIGIG 203
S+ + AQL V + +RI L+ P++G G
Sbjct: 221 QSNTSGRDLNDAQLKFANDVESRIQRRIEAILS-PIVGNG 259


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_0149  PF00577  805  0.0  Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 805 bits (2081), Expect = 0.0
Identities = 260/869 (29%), Positives = 423/869 (48%), Gaps = 40/869 (4%)

Query: 12 IATFCALLYSNSALCAELVEYDHTFLMGKDASNIDLSRYTEGNPTLPGIYDVSVYVNDQP 71
+ CA AE + ++ FL + DLSR+ G PG Y V +Y+N+
Sbjct: 30 LFVACAFAAQAPLSSAE-LYFNPRFLADDPQAVADLSRFENGQELPPGTYRVDIYLNNGY 88

Query: 72 IMSQSIAFAVIEGKKNAQACITQKNLLQFHISSPDKNSEKAILLKRDEDLGDCLNLAEMI 131
+ ++ + F + ++ C+T+ L +++ + C+ L MI
Sbjct: 89 MATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMN------LLADDACVPLTSMI 142

Query: 132 PQSSIRYDVNDQRLDIDVPQAWIMKNYQNYVDPSLWENGINAAMLSYNLNGYHSESP-GR 190
++ + DV QRL++ +PQA++ + Y+ P LW+ GINA +L+YN +G ++ G
Sbjct: 143 HDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQNRIGG 202

Query: 191 TNDSIYAAFNGGINLGAWRLRASGNYNWMTNVHS-----DYDFQNRYLQRDLASLRSQLV 245
+ Y G+N+GAWRLR + +++ ++ S + N +L+RD+ LRS+L
Sbjct: 203 NSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRSRLT 262

Query: 246 IGESYTTGETFDSVSIRGIRLYSDSRMLPPVLASFAPIIHGVANTNAKVTVMQNGYKIYE 305
+G+ YT G+ FD ++ RG +L SD MLP FAP+IHG+A A+VT+ QNGY IY
Sbjct: 263 LGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDIYN 322

Query: 306 TTVPPGAFAIDDLSPSGYGSDLIVTIEEADGTKRTFSQPFSSVVQMLRPGVGRWDISAGQ 365
+TVPPG F I+D+ +G DL VTI+EADG+ + F+ P+SSV + R G R+ I+AG+
Sbjct: 323 STVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSITAGE 382

Query: 366 VLKD-SIQDEPNLFQASYYYGLNNYLTGYTGIQLTDNNYTAGLLGLGMNT-PVGAFSVDV 423
+ Q++P FQ++ +GL T Y G QL D Y A G+G N +GA SVD+
Sbjct: 383 YRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLAD-RYRAFNFGIGKNMGALGALSVDM 441

Query: 424 THSNVSIPDDKTYQGQSYRISWNKLFENTSTSLNIAAYRYSTQHYLGLNDALTLIDEVEH 483
T +N ++PDD + GQS R +NK + T++ + YRYST Y D +
Sbjct: 442 TQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGYN 501

Query: 484 PEQELE--------PKSMRNYSRMKNQVTVSINQPLKFEKKDYGSFYLSGSWSDYWASGQ 535
E + + ++ +++ Q L + YLSGS YW +
Sbjct: 502 IETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQL----GRTSTLYLSGSHQTYWGTSN 557

Query: 536 NSTNYSIGYSNSASWGSYSISAQRSLNE-DGQTDDSIYLSFTIPIENLLGTEHRSS-GFQ 593
+ G + + ++++S + N D + L+ IP + L ++ +S
Sbjct: 558 VDEQFQAGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSDSKSQWRHA 617

Query: 594 SIDTQLNSDFKGNNQLNISSSGYSDT-NRISYSVNTGYMMNKSSDDLSYIGGYASYESPW 652
S ++ D G G N +SYSV TGY + S +Y +
Sbjct: 618 SASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGY 677

Query: 653 GTLSGSASASSDNSRQFSLNTDGGFVLHSGGLTFSNDSFSDSDTLAVIQAPGAKGARINY 712
G + S S D +Q GG + H+ G+T +DT+ +++APGAK A++
Sbjct: 678 GNANIGYSHSDDI-KQLYYGVSGGVLAHANGVTLGQPL---NDTVVLVKAPGAKDAKVEN 733

Query: 713 GNST-VDRWGYGVTSALSPYHENRIALDINDLENDVELKSTSTVAVPRQGAVVFADFETV 771
D GY V + Y ENR+ALD N L ++V+L + VP +GA+V A+F+
Sbjct: 734 QTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKAR 793

Query: 772 QGQSAIMNIVRSDGKNIPFAADIYDEQNNIIGNVGQGGQAFVRGIGQEGNIRITWIEEGK 831
G +M + + K +PF A + E + G V GQ ++ G+ G +++ W EE
Sbjct: 794 VGIKLLMT-LTHNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGKVQVKWGEEEN 852

Query: 832 PVSCFAHYQQNTTSEKIAQSIILNGLRCQ 860
C A+YQ S++ Q + C+
Sbjct: 853 A-HCVANYQLPPESQQ--QLLTQLSAECR 878


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_0162  FERRIBNDNGPP  506  0.0  Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 506 bits (1304), Expect = 0.0
Identities = 290/296 (97%), Positives = 292/296 (98%)

Query: 1 MSGLPLISRRRLLTAMALSPFLWQMNTAHAAVIDPNRIVALEWLPVELLLALGIVPYGVA 60
MSGLPLISRRRLLTAMALSP LWQMNTAHAA IDPNRIVALEWLPVELLLALGIVPYGVA
Sbjct: 1 MSGLPLISRRRLLTAMALSPLLWQMNTAHAAAIDPNRIVALEWLPVELLLALGIVPYGVA 60

Query: 61 DTINYRLWVSEPPLPDSVIDVGLRTEPNLELLTEMKPSFMVWSAGYGPSPEMLARIAPGR 120
DTINYRLWVSEPPLPDSVIDVGLRTEPNLELLTEMKPSFMVWSAGYGPSPEMLARIAPGR
Sbjct: 61 DTINYRLWVSEPPLPDSVIDVGLRTEPNLELLTEMKPSFMVWSAGYGPSPEMLARIAPGR 120

Query: 121 GFNFSDGKQPLAMARKSLTEMADLLNLQSAAETHLTHYEDFIRSMKPRFVKRGARPLLLT 180
GFNFSDGKQPLAMARKSLTEMADLLNLQSAAETHL YEDFIRSMKPRFVKRGARPLLLT
Sbjct: 121 GFNFSDGKQPLAMARKSLTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLT 180

Query: 181 TLIDPRHMLVFGPNSLFQEILDEYGIPNAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDH 240
TLIDPRHMLVFGPNSLFQEILDEYGIPNAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDH
Sbjct: 181 TLIDPRHMLVFGPNSLFQEILDEYGIPNAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDH 240

Query: 241 DNSKDMNALMATPLWQAMPFVRAGRFQRVPAVWFYGATLSAMHFVRILDNAIGGKA 296
DNSKDM+ALMATPLWQAMPFVRAGRFQRVPAVWFYGATLSAMHFVR+LDNAIGGKA
Sbjct: 241 DNSKDMDALMATPLWQAMPFVRAGRFQRVPAVWFYGATLSAMHFVRVLDNAIGGKA 296


4  ECP_0217  ECP_0244  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
ECP_0217  -1  22  -4.143235  membrane-bound lytic murein transglycosylase D
ECP_0218  1  25  -5.463805  hydroxyacylglutathione hydrolase
ECP_0219  1  17  -1.484354  hypothetical protein
ECP_0220  -1  15  1.875832  ribonuclease H
ECP_0221  0  18  2.792019  DNA polymerase III subunit epsilon
ECP_0223  0  19  3.781347  *lipoprotein
ECP_0224  0  23  5.855067  hypothetical protein
ECP_0225  0  23  5.888226  hypothetical protein
ECP_0226  0  24  6.093638  IcmF-like protein
ECP_0227  0  25  6.133497  hypothetical protein
ECP_0228  0  26  6.467769  hypothetical protein
ECP_0229  0  26  5.849913  Clp ATPase
ECP_0230  0  24  4.466733  hypothetical protein
ECP_0231  1  22  4.342040  hypothetical protein
ECP_0232  1  21  3.344633  lipoprotein
ECP_0233  1  18  2.979546  hypothetical protein
ECP_0234  0  17  1.884787  hypothetical protein
ECP_0235  2  23  2.014226  hypothetical protein
ECP_0236  2  20  -1.064382  hypothetical protein
ECP_0237  3  24  -3.308442  hypothetical protein
ECP_0238  5  32  -5.459007  hypothetical protein
ECP_0239  5  35  -6.040908  hemolysin co-regulated protein
ECP_0240  6  40  -7.269479  Vgr-like protein
ECP_0241  10  56  -14.365149  hypothetical protein
ECP_0242  10  57  -14.137702  hypothetical protein
ECP_0243  3  43  -9.732507  hypothetical protein
ECP_0244  2  35  -6.999313  H repeat-containing Rhs element protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_0218  BINARYTOXINB  34  4e-04  Binary toxin B family signature.
		>BINARYTOXINB#Binary toxin B family signature.

Length = 764

Score = 34.3 bits (78), Expect = 4e-04
Identities = 12/55 (21%), Positives = 28/55 (50%), Gaps = 4/55 (7%)

Query: 186 NDYYRKVKELRAKNQITLPVILKNERQINVFLRT----EDIDLINVINEETLLQQ 236
+ ++ EL A N T+ +K ++N+ +R D + I V +E+++++
Sbjct: 589 QNIKNQLAELNATNIYTVLDKIKLNAKMNILIRDKRFHYDRNNIAVGADESVVKE 643


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_0240  ICENUCLEATIN  32  0.012  Ice nucleation protein signature.
		>ICENUCLEATIN#Ice nucleation protein signature.

Length = 1258

Score = 31.6 bits (71), Expect = 0.012
Identities = 30/107 (28%), Positives = 42/107 (39%), Gaps = 10/107 (9%)

Query: 519 TETIGNDQKITVGLG--QTVNVGSKKEGGHDQKVTVANDQHLTIKNDRHKVVNNNQTSKV 576
T+T G D +T G G QT GS G+ T D L + QT+
Sbjct: 359 TQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAG------YGSTQTAGE 412

Query: 577 TGTDTEEVVKKQSIKIGDNYELKVEHGTNIISGDSIELICGQGESGT 623
T T Q+ + G + L +G+ +GD LI G G + T
Sbjct: 413 ESTQTAGYGSTQTAQKGSD--LTAGYGSTGTAGDDSSLIAGYGSTQT 457


5  ECP_0274  ECP_0408  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
ECP_0274  4  29  -0.571276  *phage integrase
ECP_0275  4  28  -0.780820  hypothetical protein
ECP_0276  4  28  -2.387168  transposase for insertion sequence IS100
ECP_0277  4  32  -3.769350  transposase/IS protein
ECP_0278  4  30  -2.834083  hypothetical protein
ECP_0279  2  32  -4.057748  transposase
ECP_0280  2  34  -5.821734  hypothetical protein
ECP_0281  3  34  -5.600631  phospho-2-dehydro-3-deoxyheptonate aldolase
ECP_0282  3  37  -6.808801  hypothetical protein
ECP_0283  3  37  -7.486844  transposase
ECP_0284  5  44  -9.418220  transposase
ECP_0285  6  43  -8.736049  ABC transporter ATP-binding protein
ECP_0286  6  44  -8.780179  hypothetical protein
ECP_0287  6  41  -8.733215  hypothetical protein
ECP_0288  6  37  -6.248655  hypothetical protein
ECP_0289  5  33  -3.699190  hypothetical protein
ECP_0290  6  34  -4.822883  hypothetical protein
ECP_0291  9  35  -2.493936  fimbrial transcription regulator protein FaeA
ECP_0292  9  35  -2.599103  major pilu subunit operon regulatory protein
ECP_0293  10  36  -2.935146  S-fimbrial protein subunit
ECP_0294  10  35  -2.416130  minor F1C fimbrial subunit
ECP_0295  10  33  -2.745168  F1C periplasmic chaperone
ECP_0296  10  33  -3.271935  F1C fimbrial usher
ECP_0297  5  24  -4.579459  F1C minor fimbrial subunit F
ECP_0298  5  21  -3.647682  F1C minor fimbrial subunit protein G presursor
ECP_0299  3  17  -1.425607  F1C fimbrial adhesin
ECP_0300  0  16  1.856678  hypothetical protein
ECP_0301  0  16  2.592000  MarR family transcriptional regulator
ECP_0302  0  15  3.352111  outer membrane receptor FepA
ECP_0303  1  17  4.527572  IroE protein
ECP_0304  2  18  4.427655  esterase
ECP_0305  2  19  4.049121  ABC transporter
ECP_0306  4  21  1.912505  glycosyl transferase family protein
ECP_0307  6  29  0.110426  hypothetical protein
ECP_0308  7  25  -1.321595  transposase InsG for insertion sequence element
ECP_0309  7  26  -2.334958  hypothetical protein
ECP_0311  7  28  -3.532349  *transposase
ECP_0312  7  29  -3.898050  hypothetical protein
ECP_0313  7  30  -4.862487  hypothetical protein
ECP_0314  6  29  -4.225657  TonB-dependent receptor
ECP_0315  5  30  -3.509463  hypothetical protein
ECP_0316  4  27  -1.984138  cobalamin synthesis protein
ECP_0317  5  26  1.521188  hypothetical protein
ECP_0318  6  27  1.194474  transposase for insertion sequence IS100
ECP_0319  3  29  -3.543488  transposase/IS protein
ECP_0320  3  34  -8.286329  IS orf
ECP_0321  5  42  -12.440943  transposase
ECP_0322  6  42  -13.308246  IS orf
ECP_0323  6  46  -13.088696  hypothetical protein
ECP_0324  6  45  -12.654745  chromosome replication initiation inhibitor
ECP_0325  6  44  -12.300464  lysyl-tRNA synthetase
ECP_0326  5  35  -8.685557  AraC family transcriptional regulator
ECP_0327  5  23  -1.517098  lysine decarboxylase
ECP_0328  7  23  2.570952  lysine/cadaverine antiporter
ECP_0329  10  24  5.069268  IS orf
ECP_0330  10  26  5.496477  hypothetical protein
ECP_0331  10  27  6.058660  hypothetical protein
ECP_0332  9  27  5.934738  autotransporter
ECP_0333  7  28  4.498896  hypothetical protein
ECP_0334  7  26  4.038458  anti-restriction protein
ECP_0335  6  27  3.579321  RadC-like DNA repair protein
ECP_0336  7  25  0.779022  hypothetical protein
ECP_0337  3  25  -4.110917  hypothetical protein
ECP_0338  4  25  -5.384655  hypothetical protein
ECP_0339  4  29  -8.373553  hypothetical protein
ECP_0340  7  26  -6.142589  hypothetical protein
ECP_0341  7  27  -6.568841  hypothetical protein
ECP_0342  7  29  -6.884581  phage integrase
ECP_0343  8  26  -5.558586  CP4-like integrase
ECP_0344  7  25  -5.460838  MarR family transcriptional regulator
ECP_0345  7  25  -4.239109  protein SepA
ECP_0346  1  34  -6.378209  hypothetical protein
ECP_0347  1  25  -2.971237  insertion element IS1 1/2/3/5/6 protein InsA
ECP_0348  0  17  -1.476148  InsB protein
ECP_0349  0  19  0.678511  hypothetical protein
ECP_0350  1  19  1.134591  integral membrane protein
ECP_0351  2  21  1.935941  ferredoxin
ECP_0352  2  21  1.031572  hypothetical protein
ECP_0353  2  21  0.725730  hypothetical protein
ECP_0354  2  22  0.626056  hypothetical protein
ECP_0355  4  22  -2.832371  hypothetical protein
ECP_0356  4  25  -6.209328  hypothetical protein
ECP_0357  3  33  -7.385217  hypothetical protein
ECP_0358  2  34  -6.937152  hypothetical protein
ECP_0359  2  31  -5.791498  50S ribosomal protein L36
ECP_0360  1  27  -4.764862  50S ribosomal protein L31
ECP_0361  1  25  -4.197384  NADH-dependent flavin oxidoreductase
ECP_0362  0  18  -1.471808  hypothetical protein
ECP_0363  0  19  -1.648242  LysR family transcriptional regulator
ECP_0364  0  18  -1.631882  LysR family transcriptional regulator
ECP_0365  -1  17  -2.063936  aldo/keto reductase
ECP_0366  -1  19  -2.685658  aldo/keto reductase
ECP_0367  -1  22  -3.441357  adhesin/invasin
ECP_0368  1  29  -6.538195  AraC family transcriptional regulator
ECP_0369  2  27  -5.363252  aldo/keto reductase
ECP_0370  1  25  -3.784540  hypothetical protein
ECP_0371  0  23  -2.942412  hypothetical protein
ECP_0372  0  22  -2.598852  pyridine nucleotide-disulfide oxidoreductase
ECP_0373  0  19  -3.247278  AraC family transcriptional regulator
ECP_0374  1  19  -4.287942  hypothetical protein
ECP_0375  1  21  -4.648812  electron transport protein YkgF
ECP_0376  3  24  -5.190040  hypothetical protein
ECP_0377  4  27  -6.397915  hypothetical protein
ECP_0378  4  28  -6.380176  hypothetical protein
ECP_0379  4  29  -6.198516  autotransporter
ECP_0380  4  34  -7.404682  hypothetical protein
ECP_0381  0  17  -2.135378  hypothetical protein
ECP_0382  0  15  0.733925  phage integrase
ECP_0383  1  19  2.536956  hypothetical protein
ECP_0384  1  20  2.495040  hypothetical protein
ECP_0385  0  22  3.877892  hypothetical protein
ECP_0386  0  19  2.750076  choline dehydrogenase
ECP_0387  0  17  1.324974  betaine aldehyde dehydrogenase
ECP_0388  -1  16  0.119807  transcriptional regulator BetI
ECP_0389  -1  15  -0.988472  hypothetical protein
ECP_0390  0  15  -1.137679  choline transport protein BetT
ECP_0391  2  20  -3.985796  hypothetical protein
ECP_0392  1  18  -1.133466  transcriptional regulator YahB
ECP_0393  1  17  1.263899  hypothetical protein
ECP_0394  1  18  1.759407  hypothetical protein
ECP_0395  1  20  3.053866  ankyrin repeat-containing protein
ECP_0396  1  20  3.604177  hypothetical protein
ECP_0397  1  23  4.330047  YahF/FdrA-like protein
ECP_0398  0  19  3.861247  hypothetical protein
ECP_0399  0  14  1.705124  carbamate kinase
ECP_0400  0  13  1.968104  deaminase
ECP_0401  0  14  1.303746  zinc-type alcohol dehydrogenase-like protein
ECP_0402  1  15  1.862415  hypothetical protein
ECP_0404  0  19  2.953237  *LysE family translocator
ECP_0405  1  23  4.231215  hypothetical protein
ECP_0406  0  23  4.539650  propionate catabolism operon regulatory protein
ECP_0407  0  22  4.072026  2-methylisocitrate lyase
ECP_0408  0  20  3.550823  methylcitrate synthase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_0276  HTHTETR  28  0.044  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 28.1 bits (62), Expect = 0.044
Identities = 12/106 (11%), Positives = 31/106 (29%), Gaps = 14/106 (13%)

Query: 16 QGMSSRAIARELGISRNTVKRYLQAKSEPPKYTPRPAVASLLDEYRDYIRQRIAD----- 70
S IA+ G++R + + + KS+ + + + I + +
Sbjct: 30 SSTSLGEIAKAAGVTRGAIYWHFKDKSD--------LFSEIWELSESNIGELELEYQAKF 81

Query: 71 -AHPYKIPATVIAREIRDQGYRGGMTILRAFIRSLSVPQEQEPAVR 115
P + ++ + +L I + V+
Sbjct: 82 PGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQ 127


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_0289  PHPHTRNFRASE  27  0.029  Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 27.1 bits (60), Expect = 0.029
Identities = 15/42 (35%), Positives = 20/42 (47%), Gaps = 4/42 (9%)

Query: 110 GQCRVERCF--RVTWPDTSEQYVALKTAVQSL--IPLVIATI 147
G R E + R P EQ+ A K VQ + P+VI T+
Sbjct: 294 GLYRTEFLYMDRDQLPTEEEQFEAYKEVVQRMDGKPVVIRTL 335


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_0292  FIMREGULATRY  146  2e-49  Escherichia coli: P pili regulatory PapB protein si...
		>FIMREGULATRY#Escherichia coli: P pili regulatory PapB protein signature.

Length = 104

Score = 146 bits (370), Expect = 2e-49
Identities = 84/102 (82%), Positives = 88/102 (86%)

Query: 1 MAQHEVITRGGDAFLLKLRESALSSGSMSEEQFFLLIGISSIHSDRVILAMKDYLVSGHS 60
MA HEVI+R G+AFLL +RES L GSMSE FFLLIGISSIHSDRVILAMKDYLV GHS
Sbjct: 1 MAHHEVISRSGNAFLLNIRESVLLPGSMSEMHFFLLIGISSIHSDRVILAMKDYLVGGHS 60

Query: 61 RKDVCEKYQMNNGYFSTTLGRLTRLNVLVARLAPYYTDSVSA 102
RK+VCEKYQMNNGYFSTTLGRL RLN L ARLAPYYTD SA
Sbjct: 61 RKEVCEKYQMNNGYFSTTLGRLIRLNALAARLAPYYTDESSA 102


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_0294  FIMBRIALPAPE  29  0.009  Escherichia coli: P pili tip fibrillum papE protein...
		>FIMBRIALPAPE#Escherichia coli: P pili tip fibrillum papE protein signature.

Length = 173

Score = 28.8 bits (64), Expect = 0.009
Identities = 39/160 (24%), Positives = 64/160 (40%), Gaps = 23/160 (14%)

Query: 16 AALAGNHWHVMLPGGNMRFQGKIIAEACSLALSDRQMTVDMGQLSSNRFHAAGEYGDPVG 75
A L H H N+ F+GK+I AC++ + V+ G + +G G+
Sbjct: 15 AVLMSQHVHA---ADNLTFKGKLIIPACTV----QNAEVNWGDIEIQNLVQSG--GNQKD 65

Query: 76 FDIHLQDCSTVVSQRVGISFYGVSDIHEPELLSVEEENDASDGIAIALFNES----GELV 131
F + + ++ + +V I+ G + +L + DG+ I L+N + G V
Sbjct: 66 FTVDMNCPYSLGTMKVTITSNGQTG---NSILVPNTSTASGDGLLIYLYNSNNSGIGNAV 122

Query: 132 KLNQPPENWVHLTRGDMKLHMQARYKATHYPVAGGKANGQ 171
L +T G + AR K T Y G K N Q
Sbjct: 123 TLGSQ------VTPGKITGTAPAR-KITLYAKLGYKGNMQ 155


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_0296  PF00577  960  0.0  Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 960 bits (2483), Expect = 0.0
Identities = 545/861 (63%), Positives = 690/861 (80%), Gaps = 9/861 (1%)

Query: 25 RMRFNILPLAFFIGIIVSPAR------AELYFNPRFLSDDPDAVADLSAFTQGQELPPGV 78
+ + + + + A AELYFNPRFL+DDP AVADLS F GQELPPG
Sbjct: 18 IRKHRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPGT 77

Query: 79 YRVDIYLNDTYISTRDVQFQMSQDGKQLAPCLSPEHMSAMGVNRYAVPGMERLPADTCTS 138
YRVDIYLN+ Y++TRDV F + + PCL+ +++MG+N +V GM L D C
Sbjct: 78 YRVDIYLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVP 137

Query: 139 LNSMIQGATFRFDVGQQRLYLTVPQIYMSNQARGYIAPEYWDNGITAALLNYDFSGNRVR 198
L SMI AT + DVGQQRL LT+PQ +MSN+ARGYI PE WD GI A LLNY+FSGN V+
Sbjct: 138 LTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQ 197

Query: 199 DSYGGTSDYAYLNLKTGLNIGSWRLRDNTSWSYSAGKGYS--QNNWQHINTWLERDIVPL 256
+ GG S YAYLNL++GLNIG+WRLRDNT+WSY++ S +N WQHINTWLERDI+PL
Sbjct: 198 NRIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPL 257

Query: 257 RSRLTMGDSYTRGDIFDGVNFRGIQLASDDNMVPDSQRGYAPTIHGISRGTSRISIRQNG 316
RSRLT+GD YT+GDIFDG+NFRG QLASDDNM+PDSQRG+AP IHGI+RGT++++I+QNG
Sbjct: 258 RSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNG 317

Query: 317 YEIYQSTLPPGPFEINDIYPAGSGGDLQVTLQEADGSVQRFNVPWSSVPVLQREGHLKYA 376
Y+IY ST+PPGPF INDIY AG+ GDLQVT++EADGS Q F VP+SSVP+LQREGH +Y+
Sbjct: 318 YDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYS 377

Query: 377 LSAGEFRSGGHQQDNPRFAEGTLKYGLPAGWTVYGGAWIAERYRAFNLGVGKNMGWLGAV 436
++AGE+RSG QQ+ PRF + TL +GLPAGWT+YGG +A+RYRAFN G+GKNMG LGA+
Sbjct: 378 ITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGAL 437

Query: 437 SLDATRANARLPDESRYDGQSYRFLYNKSLTETGTNIQLIGYRYSTRGYFSFADTAWKKM 496
S+D T+AN+ LPD+S++DGQS RFLYNKSL E+GTNIQL+GYRYST GYF+FADT + +M
Sbjct: 438 SVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRM 497

Query: 497 SGYSVLTQDGVIQIQPKYTDYYNLAYNKRGRVQVSISQQTGESSTLYLSGSHQSYWGTDR 556
+GY++ TQDGVIQ++PK+TDYYNLAYNKRG++Q++++QQ G +STLYLSGSHQ+YWGT
Sbjct: 498 NGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQTYWGTSN 557

Query: 557 TDRQLNAGFNSSVNDISWSLNYSLSRNAWQHETDRILSFDVSIPFSHWMRSDSTSAWRNA 616
D Q AG N++ DI+W+L+YSL++NAWQ D++L+ +V+IPFSHW+RSDS S WR+A
Sbjct: 558 VDEQFQAGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSDSKSQWRHA 617

Query: 617 SARYSQTLEAHGQAASTAGLYGTLLEDNNLGYSIQSGYTRGGYEGSSKTGYASLNYRGGY 676
SA YS + + +G+ + AG+YGTLLEDNNL YS+Q+GY GG S TGYA+LNYRGGY
Sbjct: 618 SASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGY 677

Query: 677 GNASAGYSHSGGYRQLYYGLSGGILAHANGLTLSQPLGDTLILVRAPGASDTRIENQTGV 736
GNA+ GYSHS +QLYYG+SGG+LAHANG+TL QPL DT++LV+APGA D ++ENQTGV
Sbjct: 678 GNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDAKVENQTGV 737

Query: 737 STDWRGYAVLPYATDYRENRVALDTNTLADNVDIENTVVSVVPTHGAVVRADYKTRVGVK 796
TDWRGYAVLPYAT+YRENRVALDTNTLADNVD++N V +VVPT GA+VRA++K RVG+K
Sbjct: 738 RTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKARVGIK 797

Query: 797 VLMTLMRNGKAVPFGSVVTARNGGS-SIAGENGQVYLSGMPLSGQVSVKWGSQTTDQCTA 855
+LMTL N K +PFG++VT+ + S I +NGQVYLSGMPL+G+V VKWG + C A
Sbjct: 798 LLMTLTHNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGKVQVKWGEEENAHCVA 857

Query: 856 DYKLPKESAGQILSHVTVSCR 876
+Y+LP ES Q+L+ ++ CR
Sbjct: 858 NYQLPPESQQQLLTQLSAECR 878


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_0320  CHANLCOLICIN  30  0.004  Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 30.4 bits (68), Expect = 0.004
Identities = 25/91 (27%), Positives = 42/91 (46%), Gaps = 5/91 (5%)

Query: 4 SLAHENARLRALLQTQQDTIRQMAEYNRLLSQRVATYASEINRLKALVAKLQRMQFGKSS 63
+ A A AL Q +D + + +N + A N A+ A+ +R++ K+
Sbjct: 79 AQAKAKANRDALTQRLKDIVNEALRHNASRTPSATELAHANN--AAMQAEDERLRLAKAE 136

Query: 64 EKLR---AKTERQIQEAQERISALQEEMAET 91
EK R E+ QEA++R ++ E AET
Sbjct: 137 EKARKEAEAAEKAFQEAEQRRKEIEREKAET 167


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_0324  HTHFIS  29  0.023  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 29.0 bits (65), Expect = 0.023
Identities = 7/20 (35%), Positives = 17/20 (85%)

Query: 26 RAARILGISQSAISQKIKKL 45
+AA +LG++++ + +KI++L
Sbjct: 454 KAADLLGLNRNTLRKKIREL 473


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_0329  cdtoxina  28  0.008  Cytolethal distending toxin A signature.
		>cdtoxina#Cytolethal distending toxin A signature.

Length = 258

Score = 28.1 bits (62), Expect = 0.008
Identities = 14/60 (23%), Positives = 24/60 (40%), Gaps = 5/60 (8%)

Query: 51 SGVELLPVEITPDEQKVPMTAIAPSLSTSTQTTVCASSCKVEFRHGKMTLENPSPELLTV 110
VE P +PDE +P+ P+L T+ + ++L N +LT+
Sbjct: 38 PQVEGGPTVPSPDEPGLPLPGPGPALPTNGAIPIPEPGTAPA-----VSLMNMDGSVLTM 92


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_0332  PRTACTNFAMLY  43  8e-06  Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 42.7 bits (100), Expect = 8e-06
Identities = 91/438 (20%), Positives = 140/438 (31%), Gaps = 41/438 (9%)

Query: 286 TAGNTTINQNGELKVHAGGEASDVTQNTGGALVTSTAATVTGTNRLGAFSVVEGKADNVV 345
T + I G +H G S ++ + V VT GA + V + +
Sbjct: 172 TVQRSAIVDGG---LHIGALQSLQPEDLPPSRVVLRDTNVTAVPASGAPAAVSVLGASEL 228

Query: 346 LENGGRLDVLSGHTATNTRVDDGGTLDVRNGGTATTVSMGNGGVLLADSGAAVSGTRSDG 405
+GG + G A + R G A G AV G G
Sbjct: 229 TLDGGH--ITGGRAAGVAAMQGAVVHLQRATIRRGDAPAGGAVPGGAVPGGAVPGGFGPG 286

Query: 406 TAFRIGGGQA----DALMLEKGSSFTLNAGDTATDTTVNGGLFTARGGSLAGTTTLNNGA 461
+ G +E S A G T GGSL+ G
Sbjct: 287 GFGPVLDGWYGVDVSGSSVELAQSIVEAPELGAAIRVGRGARVTVSGGSLSAPH----GN 342

Query: 462 ILTLSGKTV---NNDTLTIR-EGDALLQGGSLTGNGSVEKSGSGTLTVSNTTLTQKAVNL 517
++ G L+I + A QG +L E LT++ Q +
Sbjct: 343 VIETGGARRFAPQAAPLSITLQAGAHAQGKALLYRVLPEPV---KLTLTGGADAQGDIVA 399

Query: 518 NEGTLTLNDSTVTTDVIAQRGTALKLTGSTVLNGAIDPTNVTLASGATWNIPDNATVQSV 577
E S DV GA + ATW + DN+ V ++
Sbjct: 400 TELPSIPGTSIGPLDVALASQARWT--------GATRAVDSLSIDNATWVMTDNSNVGAL 451

Query: 578 VDDLSHAGQIHF-TSTRTGKFVPATLKVKNLNGQNGTISLRVRPDMAQNNADRLVIDGGR 636
L+ G + F G+F L V L G +G + V D+ + D+LV+
Sbjct: 452 --RLASDGSVDFQQPAEAGRF--KVLTVNTLAG-SGLFRMNVFADLGLS--DKLVVMQD- 503

Query: 637 ATGKTILNLVNAGNSASGLATSGKGIQVVEAINGATTEEGAFVQGNRLQAGAFNYSLNRD 696
A+G+ L + N+G+ + T + V + A T A ++ G + Y L +
Sbjct: 504 ASGQHRLWVRNSGSEPASANTL---LLVQTPLGSAATFTLANK-DGKVDIGTYRYRLAAN 559

Query: 697 SDESWYLRSENAYRAEVP 714
+ W L A A P
Sbjct: 560 GNGQWSLVGAKAPPAPKP 577


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_0345  IGASERPTASE  616  0.0  IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 616 bits (1590), Expect = 0.0
Identities = 270/887 (30%), Positives = 410/887 (46%), Gaps = 123/887 (13%)

Query: 42 SLALSALLPTVAGASTVGGNNPYQTYRDFAENKGQFQAGATNIPIFNNKGELVGHL--DK 99
+L ++ L A+ V + YQ +RDFAENKG+F GATN+ + + + +G +
Sbjct: 12 ALTVAYALTPYTEAALVRDDVDYQIFRDFAENKGKFSVGATNVLVKDKNNKDLGTALPNG 71

Query: 100 APMVDFSSVNVSSNPGVATLINPQYIASVKH-NKGYQSVSFG------------------ 140
PM+DFS V + +ATLINPQY+ VKH + G + FG
Sbjct: 72 IPMIDFSVV--DVDKRIATLINPQYVVGVKHVSNGVSELHFGNLNGNMNNGNAKAHRDVS 129

Query: 141 DGQNSYHIVDRNEHSSS-----------------DLHTPRLDKLVTEVAPATVTSSST-- 181
+N Y V++NE+ + D + PRLDK VTEVAP +++S+
Sbjct: 130 SEENRYFSVEKNEYPTKLNGKTVTTEDQTQKRREDYYMPRLDKFVTEVAPIEASTASSDA 189

Query: 182 ADILTPSKYSAFYRAGSGSQYIQDSQGKRHWVTGGYGYLTGGILPTSFFYH--------- 232
+KY AF R GSGSQ+I + + + Y
Sbjct: 190 GTYNDQNKYPAFVRLGSGSQFIYKKGDNYSLILNNHEVGGNNLKLVGDAYTYGIAGTPYK 249

Query: 233 --GSDGIQLYMGGNIHDHSI---------LPSFGEAGDSGSPLFGWNTAKGQWELVGVYS 281
+ + G + +HS L ++ GDSGSPLF ++ KG+W +G Y
Sbjct: 250 VNHENNGLIGFGNSKEEHSDPKGILSQDPLTNYAVLGDSGSPLFVYDREKGKWLFLGSYD 309

Query: 282 ---GVGGGTNLIYSLIPQSFLSQIYSEDNDAPVFFNASSGAPLQWKFDSSTGTGSLKQGS 338
G + +++ F + ++D+ + + + + S+ T ++ G
Sbjct: 310 FWAGYNKKSWQEWNIYKSQFTKDVLNKDSAGSLIGSK-----TDYSWSSNGKTSTITGGE 364

Query: 339 DEYAMHGQKGSDL-NAGKNLTFLGHNGQIDLENSVTQGAGSLTFTDDYTVT-TSNGSTWT 396
+ G D N GK++TF G +G + L N++ QGAG L F DY V TS+ +TW
Sbjct: 365 KSLNVDLADGKDKPNHGKSVTFEG-SGTLTLNNNIDQGAGGLFFEGDYEVKGTSDNTTWK 423

Query: 397 GAGIIVDKDASVNWQVNGVKGDNLHKIGEGTLVVQGTGVNEGGLKVGDGTVVLNQQADSS 456
GAG+ V + +V W+V+ + D L KIG+GTL+V+GTG N+G LKVGDGTV+L QQ + S
Sbjct: 424 GAGVSVAEGKTVTWKVHNPQYDRLAKIGKGTLIVEGTGDNKGSLKVGDGTVILKQQTNGS 483

Query: 457 GHVQAFSSVNIASGRPTVVLADNQQVNPDNISWGYRGGVLDVNGNDLTFHKLNAADYGAT 516
G AF+SV I SGR T+VL D++QV+P++I +G+RGG LD+NGN LTF + D GA
Sbjct: 484 GQ-HAFASVGIVSGRSTLVLNDDKQVDPNSIYFGFRGGRLDLNGNSLTFDHIRNIDDGAR 542

Query: 517 LGNS-SDKTANITLD---YQTHPADVKV---------NEWSSSNRGTVGSLYIYNNPYTH 563
L N +NIT+ T P + N ++ G LY+ YT
Sbjct: 543 LVNHNMTNASNITITGESLITDPNTITPYNIDAPDEDNPYAFRRIKDGGQLYLNLENYT- 601

Query: 564 TVDYFILK--TSSYGWFP-TGQVSNEHWEYVGHDQNSAQALLANRINNK------GYLY- 613
Y+ L+ S+ P SNE+W Y+G + A+ + N INN+ GY
Sbjct: 602 ---YYALRKGASTRSELPKNSGESNENWLYMGKTSDEAKRNVMNHINNERMNGFNGYFGE 658

Query: 614 -HGKLLGNINFSNKATPGTTGALVMDGSANMSGTFTQENGRLTIQGHPVIHASTSQSIAN 672
GK GN+N + K ++ G N++G T E G L + G P HA + IA
Sbjct: 659 EEGKNNGNLNVTFKGKSE-QNRFLLTGGTNLNGDLTVEKGTLFLSGRPTPHA---RDIAG 714

Query: 673 TVSSLGDNSVLTQPTSFTQDDWENRTFSFGSLVLK-DTDFGLGRN-ATLNTTIQADNSS- 729
S+ D +DDW NR F ++ + + GRN A + + I A N +
Sbjct: 715 ISSTKKDPHFAENNEVVVEDDWINRNFKATTMNVTGNASLYSGRNVANITSNITASNKAQ 774

Query: 730 ----VTLGDSRVFIDKKDGQGTAFTLEEGTSVATKDADKSVFNGTVNLDNQS--VLNIND 783
GD+ G T T ++ + A + + G VNL + VL +
Sbjct: 775 VHIGYKTGDTVCVRSDYTGYVTC-TTDKLSDKALNSFNPTNLRGNVNLTESANFVLGKAN 833

Query: 784 IFNGGIQANNSTVNISSDS--AILGNS-----TLTSTALNLNKGANA 823
+F NS V ++ +S + GNS L + ++LN N+
Sbjct: 834 LFGTIQSRGNSQVRLTENSHWHLTGNSDVHQLDLANGHIHLNSADNS 880



Score = 52.0 bits (124), Expect = 2e-08
Identities = 64/329 (19%), Positives = 119/329 (36%), Gaps = 54/329 (16%)

Query: 760 KDADKSVFNGTVNLDNQSVLNINDIFNGGIQANNSTVNISSDSAILGNSTLTSTALNLNK 819
K +D++ N +++N+ + N F NN +N++ N L + NLN
Sbjct: 631 KTSDEAKRNVMNHINNERMNGFNGYFGEEEGKNNGNLNVTFKGKSEQNRFLLTGGTNLNG 690

Query: 820 GANALASQSFVSDGPVNISDATLSLNSRPDEVSHTLLPVYDYAGSW----------NLKG 869
F+S P + ++S + W N+ G
Sbjct: 691 DLTVEKGTLFLSGRPTPHARDIAGISSTKKDPHFAENNEVVVEDDWINRNFKATTMNVTG 750

Query: 870 DDARLNVGPYSMLSGNINVQDKGTVTLG--------------GEGELSPDLTLQNQMLYS 915
+ + + + ++ NI +K V +G G + D L ++ L S
Sbjct: 751 NASLYSGRNVANITSNITASNKAQVHIGYKTGDTVCVRSDYTGYVTCTTD-KLSDKALNS 809

Query: 916 LFN-----------------GYRNTWSGSLNAPDATVSMT-DTQWSMNGNSTAGNMKLNR 957
G N + + ++ V +T ++ W + GNS + L
Sbjct: 810 FNPTNLRGNVNLTESANFVLGKANLFGTIQSRGNSQVRLTENSHWHLTGNSDVHQLDLAN 869

Query: 958 TIVGFNGGTSS-----FTTLTTDNLDAVQSAFVMRTDL--NKADKLVINKSATGHDNSIW 1010
+ N +S + TLT ++L +F TDL + DK+V+ KSATG+
Sbjct: 870 GHIHLNSADNSNNVTKYNTLTVNSLSG-NGSFYYLTDLSNKQGDKVVVTKSATGNFTLQV 928

Query: 1011 VNFLKKPSDKDTLDIPLVSAPEATADNLF 1039
+ +P+ ++ L A +A D+L
Sbjct: 929 ADKTGEPNHN---ELTLFDASKAQRDHLN 954
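
Each alignment block in this report carries a statistics line of the form `Identities = a/n (p%), Positives = b/n (q%), Gaps = c/n (r%)`. The following is a minimal sketch, not part of PredictBias, for reading those lines back into numbers. The function names are hypothetical. The percentages shown in this report appear to be truncated rather than rounded (for example, 123/887 is shown as 13%), so the sketch recomputes them with integer division; that is an observation from the data above, not a documented rule.

```python
import re

# Matches each "Name = count/length (pct%)" field on a BLAST statistics line.
STAT = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_hsp_stats(line):
    """Return {field: (count, length, reported_percent)} for one statistics line."""
    return {
        name: (int(count), int(length), int(pct))
        for name, count, length, pct in STAT.findall(line)
    }


def truncated_percent(count, length):
    """Recompute a percentage the way the report seems to: truncated, not rounded."""
    return 100 * count // length


if __name__ == "__main__":
    line = ("Identities = 270/887 (30%), Positives = 410/887 (46%), "
            "Gaps = 123/887 (13%)")
    for name, (count, length, pct) in parse_hsp_stats(line).items():
        print(name, count, length, pct, truncated_percent(count, length))
```

Lines without a `Gaps` field (ungapped alignments) simply yield a dictionary with two entries.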


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_0354  PF00577  63     5e-12    Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 62.6 bits (152), Expect = 5e-12
Identities = 29/247 (11%), Positives = 73/247 (29%), Gaps = 23/247 (9%)

Query: 487 TLNLNSLWSKLGTFSISYNDDRRYNSHYYTADYYQSVYSGTFGSLGLRAGIQRYNNGDSS 546
L + + T +S + Y + +Q+ + F + N
Sbjct: 530 QLTVTQQLGRTSTLYLSG-SHQTYWGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQK 588

Query: 547 ANTGKYIALDLSLPLGNWFSAGMTHQNGYTMANLSARKQFDEGT------------IRTV 594
+ +AL++++P +W + Q + A+ S + +
Sbjct: 589 -GRDQMLALNVNIPFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNL 647

Query: 595 GANLSRAISGDTGDDKTLSGGAYAQFDARYASGTLNVNSAADGYINTNLTANGSVGWQGK 654
++ +G + +G A + Y + + S +D +G V
Sbjct: 648 SYSVQTGYAGGGDGNSGSTGYATLNYRGGYGNANIGY-SHSDDIKQLYYGVSGGVLAHAN 706

Query: 655 NIAASGRTDGNAGVIFDTGLEN---DGQISAKINGRIFPLNGKRNYLPLSPYGRYEVELQ 711
+ + ++ G ++ + Q + + R G + Y V L
Sbjct: 707 GVTLGQPLNDTVVLVKAPGAKDAKVENQTGVRTDWR-----GYAVLPYATEYRENRVALD 761

Query: 712 NSKNSLD 718
+ + +
Sbjct: 762 TNTLADN 768


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_0367  INTIMIN  547    e-177    Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 547 bits (1410), Expect = e-177
Identities = 232/828 (28%), Positives = 350/828 (42%), Gaps = 70/828 (8%)

Query: 41 PVMAARAQHAVQPRLSMENTTVTADNNVEKNVASLAANAGTFLSSQPDS-----DATRNF 95
P++AA +L+ + VT N + + AA L SQ S D ++
Sbjct: 131 PLVAAGGVAGHTNKLTKMSPDVTKSNMTDDKALNYAAQQAASLGSQLQSRSLNGDYAKDT 190

Query: 96 ITGMATAKANQEIQEWLGKYGTARVKLNVDKNFSLKDSSLEMLYPIYDTPTNMLFTQGAI 155
G+A +A+ ++Q WL YGTA V L NF SSL+ L P YD+ + F Q
Sbjct: 191 ALGIAGNQASSQLQAWLQHYGTAEVNLQSGNNFD--GSSLDFLLPFYDSEKMLAFGQVGA 248

Query: 156 HRTDDRTQSNIGFGWRHFSENDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGY 215
D R +N+G G R F + M G N FID D S +TR+G+G EYWRDY K S NGY
Sbjct: 249 RYIDSRFTANLGAGQRFF-LPENMLGYNVFIDQDFSGDNTRLGIGGEYWRDYFKSSVNGY 307

Query: 216 IRASGWKTSPDVEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQ 275
R SGW S + +DY ERPANG+DIR GYLP++P LGA LMYEQYYGD V LF DK Q
Sbjct: 308 FRMSGWHESYNKKDYDERPANGFDIRFNGYLPSYPALGAKLMYEQYYGDNVALFNSDKLQ 367

Query: 276 KDPHAITAEVNYTPVPLLTLSAGHKQGKSGENDTRFGLEVNYRIGEPLEKQLDTDSIRER 335
+P A T VNYTP+PL+T+ ++ G END + ++ Y+ +P +Q++ + E
Sbjct: 368 SNPGAATVGVNYTPIPLVTMGIDYRHGTGNENDLLYSMQFRYQFDKPWSQQIEPQYVNEL 427

Query: 336 RMLAGSRYDLVERNNNIVLEYRKSEVIRIALPERIEGKGGQTVSLGLVVSKATHGLKNVQ 395
R L+GSRYDLV+RNNNI+LEY+K +++ + +P I G T + L+V K+ +GL +
Sbjct: 428 RTLSGSRYDLVQRNNNIILEYKKQDILSLNIPHDINGTERSTQKIQLIV-KSKYGLDRIV 486

Query: 396 WEAPSLLAAGGKITGQG----NQWQVTLPAYQAGKDNYYAISAIAYDNKGNASKRVQTEV 451
W+ +L + GG+I G +Q LPAY G N Y ++A AYD GN+S V +
Sbjct: 487 WDDSALRSQGGQIQHSGSQSAQDYQAILPAYVQGGSNVYKVTARAYDRNGNSSNNVLLTI 546

Query: 452 VISGAGMSADRTALTLDGQSRIQMLANGNEQKPLVLSLRDAEGQPVTGMKDQIKTELTFK 511
+ G D+ +T + A+G E +++ G
Sbjct: 547 TVLSNGQVVDQVGVTDFTADKTSAKADGTEAITYTATVKK-NGVA--------------- 590

Query: 512 PAGNIVTRSLKVTKSQAKPTLGEFTETEAGVYQSVFTTGTQSGEATITVSVDGMSKTVTA 571
A V S + A + +G + G+ ++ M+ + A
Sbjct: 591 QANVPV--SFNIVSGTAVLSANSANTNGSGKATVTLKSDK-PGQVVVSAKTAEMTSALNA 647

Query: 572 ELRATMMDVANSTLSANEPSGDVVADGQQAYTLTLTAVDSEGNPVTGEASRLRLVPQDTN 631
+ S VA+GQ A T T+ V PV+ + T
Sbjct: 648 NAVIFVDQTKASITEIKADKTTAVANGQDAITYTVK-VMKGDKPVSNQEVTF-----TTT 701

Query: 632 GVTVGAIS--EIKPGVYSATVSSTRAGNVVVRAFSEQYQLGTLQQTLKFVAGP------- 682
+ + G T++ST G +V A + ++F
Sbjct: 702 LGKLSNSTEKTDTNGYAKVTLTSTTPGKSLVSARVSDVAVDVKAPEVEFFTTLTIDDGNI 761

Query: 683 ------LDAAHSSITLNPDK---PVVGGTVTAIWTAKDANDNPVTGLNPDAPSLSGAAAA 733
+ ++ L + GG W + + V + +L
Sbjct: 762 EIVGTGVKGKLPTVWLQYGQVNLKASGGNGKYTWRSANPAIASVDA-SSGQVTLKEKGTT 820

Query: 734 GSTASGWTDNGDGTWTAQISLGTTAGELDVMPKLNGQDAAANAAKVTVVADALSSNQSKV 793
+ +DN T+T T L V L S+Q+++
Sbjct: 821 TISVIS-SDNQTATYTIA-----TPNSLIVPNMSKRVTYNDAVNTCKNFGGKLPSSQNEL 874

Query: 794 -------SVAEDHVKAGESTTVTLVAKDAHGNAISGLSLSASLTGTAS 834
A + S T+ + +A SG++ + L
Sbjct: 875 ENVFKAWGAANKYEYYKSSQTIISWVQQTAQDAKSGVASTYDLVKQNP 922



Score = 75.5 bits (185), Expect = 1e-15
Identities = 74/347 (21%), Positives = 115/347 (33%), Gaps = 46/347 (13%)

Query: 905 KTTTELTFTVK----DAYGNPVTGLKPDAPVFSGAASTGSERPSAGNWTEKGNGVYVSTL 960
T TVK PV+ + SG A SA + G+G TL
Sbjct: 575 TEAITYTATVKKNGVAQANVPVSFN-----IVSGTAV-----LSANSANTNGSGKATVTL 624

Query: 961 TLGSAAGQLSVMPRVNGQNAVAQPLVLNVAGDASKAEIRDMTVKVNNQLANGQSANQITL 1020
+ +A+ V+ V D +KA I ++ +ANGQ A IT
Sbjct: 625 KSDKPGQVVVSAKTAEMTSALNANAVIFV--DQTKASITEIKADKTTAVANGQDA--ITY 680

Query: 1021 TV-VDSYGNPLQGQEVTLTLPQGVTSKTGNTVTTNAAGKVDIELMSTVAGEHNISASVNG 1079
TV V P+ QEVT T G S + T T+ G + L ST G+ +SA V+
Sbjct: 681 TVKVMKGDKPVSNQEVTFTTTLGKLSNS--TEKTDTNGYAKVTLTSTTPGKSLVSARVSD 738

Query: 1080 AQ---KTVTVKFNADASTGQANLQVDTAVQKVANGKDAFTLTATVK-DQYGNLLPGAVVV 1135
K V+F + N+++ V G T ++ Q G
Sbjct: 739 VAVDVKAPEVEFFTTLTIDDGNIEI------VGTGVKGKLPTVWLQYGQVNLKASGGNGK 792

Query: 1136 FNLPRGVKPLADGNIMVNADKEGKAELKVVSVTAGTYEITASAGNDQPSNAQSVTFVADK 1195
+ A+ I G+ LK GT I+ + ++Q T+
Sbjct: 793 YTW-----RSANPAIASVDASSGQVTLK----EKGTTTISVISSDNQT-----ATYTIAT 838

Query: 1196 TTATISSIEVIGNRAVADGKTKQTYKVTVTDANNNLLKDSEVTLTAS 1242
+ I + D ++ N L++ A+
Sbjct: 839 PNSLI-VPNMSKRVTYNDAVNTCKNFGGKLPSSQNELENVFKAWGAA 884



Score = 52.0 bits (124), Expect = 1e-08
Identities = 59/368 (16%), Positives = 106/368 (28%), Gaps = 56/368 (15%)

Query: 779 VTVVADALSSNQSKV---SVAEDHVKAGESTTVTLVA------KDAHGNAISGLSLSASL 829
+TV+++ +Q V + + KA + +T A +S +S
Sbjct: 546 ITVLSNGQVVDQVGVTDFTADKTSAKADGTEAITYTATVKKNGVAQANVPVSFNIVSG-- 603

Query: 830 TGTASEGATVSSWTEKGDGSYVAT--LTTGGKTGELRVMPLFNGQPAATEAAQLTVIAGE 887
A +S+ + +GS AT L + + A A + +
Sbjct: 604 ------TAVLSANSANTNGSGKATVTLKSDKPGQVVVSAKTAEMTSALN--ANAVIFVDQ 655

Query: 888 MSSANSTLVADNKTPTVKTTTELTFTVKDAY-GNPVTGLKPDAPVFSGAASTGSERPSAG 946
++ + + AD T +T+TVK PV+ + +T + S
Sbjct: 656 TKASITEIKADKTTAVANGQDAITYTVKVMKGDKPVSNQEV-------TFTTTLGKLSNS 708

Query: 947 NWTEKGNGVYVSTLTLGSAAGQLSVMPRVNGQN-AVAQPLVLNVAG---DASKAEIRDMT 1002
NG TLT + G+ V RV+ V P V D EI
Sbjct: 709 TEKTDTNGYAKVTLT-STTPGKSLVSARVSDVAVDVKAPEVEFFTTLTIDDGNIEI---- 763

Query: 1003 VKVNNQLANGQSANQITLTV-VDSYGNPLQGQEVTLTLPQGVTSKTGNTVTTNAAGKVDI 1061
+ G T+ + G T ++ ++G+V +
Sbjct: 764 ------VGTGVKGKLPTVWLQYGQVNLKASGGNGKYTW---RSANPAIASVDASSGQVTL 814

Query: 1062 ELMSTVAGEHNISASVNGAQKTVTVKFNADASTGQANLQVDTAVQKVANGKDAFTLTATV 1121
+ G IS + Q T + + V + +
Sbjct: 815 K----EKGTTTISVISSDNQ---TATYTIATPNSLIVPNMSKRVT-YNDAVNTCKNFGGK 866

Query: 1122 KDQYGNLL 1129
N L
Sbjct: 867 LPSSQNEL 874



Score = 50.5 bits (120), Expect = 5e-08
Identities = 45/248 (18%), Positives = 79/248 (31%), Gaps = 23/248 (9%)

Query: 1168 TAGTYEITASA----GNDQPSNAQSVTFVADKTTAT---ISSIEVIGNRAVADGKTKQTY 1220
+ Y++TA A GN + ++T +++ ++ A ADG TY
Sbjct: 521 GSNVYKVTARAYDRNGNSSNNVLLTITVLSNGQVVDQVGVTDFTADKTSAKADGTEAITY 580

Query: 1221 KVTVTDANNNLLKDSEVTLTASPENLVLTPNGTATTNEQGQAIFTATTTVAATYTLTAKV 1280
TV S + +A TN G+A T + ++AK
Sbjct: 581 TATVKKNGVAQANVPVSFNIVS--GTAVLSANSANTNGSGKATVTLKSDKPGQVVVSAK- 637

Query: 1281 EQADGQESTKTAESKFVADDKNAELAATSDVHSLVADGVTTATLTVTLFSANNPVGGTMW 1340
A+ + FV K + +D + VA+G T TV + + PV
Sbjct: 638 -TAEMTSALNANAVIFVDQTKASITEIKADKTTAVANGQDAITYTVKVMKGDKPVSNQEV 696

Query: 1341 VDIEA--PEGVTEADYQFLPSKNDHFASGKITRTFSTNKPGTYTFTFNSLTYGGYEMKPV 1398
+ K D +G T ++ PG + ++ ++K
Sbjct: 697 TFTTTLGKLSNSTE-------KTD--TNGYAKVTLTSTTPGKSLVS-ARVSDVAVDVKAP 746

Query: 1399 TVTINAVP 1406
V
Sbjct: 747 EVEFFTTL 754


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_0368  HTHTETR  28     0.025    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 28.4 bits (63), Expect = 0.025
Identities = 12/42 (28%), Positives = 19/42 (45%)

Query: 3 RQKILQQLLEWIECNLEHPISIEDIAQKSGYSRRNIQLLFRN 44
RQ IL L S+ +IA+ +G +R I F++
Sbjct: 13 RQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKD 54


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_0379  PRTACTNFAMLY  120    4e-30    Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 120 bits (302), Expect = 4e-30
Identities = 149/738 (20%), Positives = 245/738 (33%), Gaps = 109/738 (14%)

Query: 104 INATGSTITAQGEGTYVRTAMVIDSTGDVVVNGGNFVTKNEKGSATGISLEGARGNNVTL 163
N T + V A + G + G +G+ + R +
Sbjct: 206 TNVTAVPASGAPAAVSVLGASELTLDGGHITGGRAAGVAAMQGAVVHLQRATIRRGDAPA 265

Query: 164 NGTT--INAQGNKSSSNASTAIFAQKGSLLQGFDGDATDNITLA---------GSNIING 212
G G F G D + + LA G+ I G
Sbjct: 266 GGAVPGGAVPGGAVPGGFGPGGFGPVLDGWYGVDVSGSS-VELAQSIVEAPELGAAIRVG 324

Query: 213 RIEAIVIAGNNTGTHTVNLNIKDGSVI---GAANNKQTIYASASAQGAGSATQNLNLSVA 269
R + ++G + N+ G+ AA T+ A A AQG + L V
Sbjct: 325 RGARVTVSGGSLSAPHGNVIETGGARRFAPQAAPLSITLQAGAHAQGKALLYRVLPEPVK 384

Query: 270 DSTIYSDVLALSSSNSSVGTTTNVNMNVARSYWEGNAYTFNSGDKAGSDLDINLSDSSVW 329
L L+ + G + G G LD+ L+ + W
Sbjct: 385 --------LTLTGGADAQGDIVATELPSI------------PGTSIGP-LDVALASQARW 423

Query: 330 KGKVSGAGDASVSLQNGSVWNVTGSSTVDALAVKDSTVNITKATVNTGTFA-------SQ 382
G S+S+ N W +T +S V AL + + G F +
Sbjct: 424 TGATRAVD--SLSIDNA-TWVMTDNSNVGALRLASDGSVDFQQPAEAGRFKVLTVNTLAG 480

Query: 383 NGTLI----VDASSENTLDISGKASGDLRVY---------SAGSLDLINEQ----TAFIS 425
+G D + L + ASG R++ SA +L L+ F
Sbjct: 481 SGLFRMNVFADLGLSDKLVVMQDASGQHRLWVRNSGSEPASANTLLLVQTPLGSAATFTL 540

Query: 426 TGKDSTLKATGTTEGGLYQYDLTQGADGNFYFVKNTHK---------------------- 463
KD G + G Y+Y L +G + V
Sbjct: 541 ANKD------GKVDIGTYRYRLAANGNGQWSLVGAKAPPAPKPAPQPGPQPPQPPQPQPE 594

Query: 464 ------------ASNASSVIQAMA-AAPANVANLQADTLSARQDAVRLSENDKGGVWIQY 510
++ A++ + + + +++ LS R +RL+ D GG W +
Sbjct: 595 APAPQPPAGRELSAAANAAVNTGGVGLASTLWYAESNALSKRLGELRLNP-DAGGAWGRG 653

Query: 511 FGGKQKHTTAGNASYDLDVNGVMLGGDTRFMTEDGSWLAGVAMSSAKGDMT-TMQSKGDT 569
F +Q+ +D V G LG D G W G +GD T G T
Sbjct: 654 FAQRQQLDNRAGRRFDQKVAGFELGADHAVAVAGGRWHLGGLAGYTRGDRGFTGDGGGHT 713

Query: 570 EGYSFHAYLSRQYNNGIFIDTAAQFGHYSNTADVRLMNGGGTIKADFNTNGFGAMVKGGY 629
+ Y + ++G ++D + N V +G +K + T+G GA ++ G
Sbjct: 714 DSVHVGGYATYIADSGFYLDATLRASRLENDFKVAGSDGY-AVKGKYRTHGVGASLEAGR 772

Query: 630 TWKDGNGLFIQPYAKLSALTLEGVDYQL-NGVDVHSDSYNSVLGEAGTRVGYDFAVGNA- 687
+ +G F++P A+L+ G Y+ NG+ V + +SVLG G VG +
Sbjct: 773 RFTHADGWFLEPQAELAVFRAGGGAYRAANGLRVRDEGGSSVLGRLGLEVGKRIELAGGR 832

Query: 688 TVKPYLNLAALNEFSDGNKVRLGDESVNASIDGAAFRVGAGVQADITKNMGAYASLDYTK 747
V+PY+ + L EF V + + G +G G+ A + + YAS +Y+K
Sbjct: 833 QVQPYIKASVLQEFDGAGTVHTNGIAHRTELRGTRAELGLGMAAALGRGHSLYASYEYSK 892

Query: 748 GDDIENPLQGVVGINVTW 765
G + P G +W
Sbjct: 893 GPKLAMPWTFHAGYRYSW 910


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_0388  HTHTETR  63     2e-14    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 62.7 bits (152), Expect = 2e-14
Identities = 31/172 (18%), Positives = 58/172 (33%), Gaps = 15/172 (8%)

Query: 16 RRRQLIDATLEAINEVGMHDATIAQIARRAGVSTGIISHYFRDKNGLLEATMRDITSQLR 75
R+ ++D L ++ G+ ++ +IA+ AGV+ G I +F+DK+ L S +
Sbjct: 12 TRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIG 71

Query: 76 DAVLNRLHALPQGSAELRLQAIVGGNFDETQVSSAAMKAWLAFWASSMHQP-------ML 128
+ L P L+ I+ + T V+ + +
Sbjct: 72 ELELEYQAKFPGDPLS-VLREILIHVLEST-VTEERRRLLMEIIFHKCEFVGEMAVVQQA 129

Query: 129 YRLQQVSSRRLLSNLVSEFRRE---LPRQQAQEAGYGLAALIDGL---WLRA 174
R + S + + + A + I GL WL A
Sbjct: 130 QRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFA 181


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
ECP_0392  HTHFIS  31     0.009    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 30.6 bits (69), Expect = 0.009
Identities = 15/50 (30%), Positives = 24/50 (48%), Gaps = 2/50 (4%)

Query: 6 TEENLLAFTTAARFGSFSKAAEELGLTTSAISYTIKRMETGLDVVLFTRS 55
E L+ A G+ KAA+ LGL + + I+ + G+ V +RS
Sbjct: 436 MEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIREL--GVSVYRSSRS 483


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_0399  CARBMTKINASE  431    e-155    Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 431 bits (1110), Expect = e-155
Identities = 140/315 (44%), Positives = 201/315 (63%), Gaps = 3/315 (0%)

Query: 1 MKELVVVAIGGNSIIKDNASQSIEHQAEAVKAVADTVLEMLASDYDIVLTHGNGPQVGLD 60
M + VV+A+GGN++ + S E + V+ A + E++A Y++V+THGNGPQVG
Sbjct: 1 MGKRVVIALGGNALQQRGQKGSYEEMMDNVRKTARQIAEIIARGYEVVITHGNGPQVGSL 60

Query: 61 LRRAEIAHEREGLPLTPLANCVADTQGGIGYLIQQALNNRLARHG-EKKAVTVVTQVEVD 119
L + G+P P+ A +QG IGY+IQQAL N L + G EKK VT++TQ VD
Sbjct: 61 LLHMDAGQATYGIPAQPMDVAGAMSQGWIGYMIQQALKNELRKRGMEKKVVTIITQTIVD 120

Query: 120 KNDPGFAHPTKPIGEFFSESQRDELQKANPDWRFVEDAGRGYRRVVASPEPKRIVEAPAI 179
KNDP F +PTKP+G F+ E L + W ED+GRG+RRVV SP+PK VEA I
Sbjct: 121 KNDPAFQNPTKPVGPFYDEETAKRLAR-EKGWIVKEDSGRGWRRVVPSPDPKGHVEAETI 179

Query: 180 KALIQQGFVVIGAGGGGIPVVRTEAGDYQSVDAVIDKDLSTALLAREIHADILVITTGVE 239
K L+++G +VI +GGGG+PV+ E G+ + V+AVIDKDL+ LA E++ADI +I T V
Sbjct: 180 KKLVERGVIVIASGGGGVPVIL-EDGEIKGVEAVIDKDLAGEKLAEEVNADIFMILTDVN 238

Query: 240 KVCIHFGKPQQQALDRVDIATMTRYMQEGHFPPGSMLPKIIASLTFLEQGGKEVIITTPE 299
+++G ++Q L V + + +Y +EGHF GSM PK++A++ F+E GG+ II E
Sbjct: 239 GAALYYGTEKEQWLREVKVEELRKYYEEGHFKAGSMGPKVLAAIRFIEWGGERAIIAHLE 298

Query: 300 CLPAALRGETGTHII 314
AL G+TGT ++
Sbjct: 299 KAVEALEGKTGTQVL 313
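
Some E-values in this report are printed without a leading mantissa (`Expect = e-155`, `Expect = e-177`), which most numeric parsers reject. The sketch below, with a hypothetical function name, shows one way to read a `Score = ... bits (...), Expect = ...` line under the assumption that a bare `e-NNN` means `1e-NNN`.

```python
import re

# Captures bit score, raw score, and the E-value token from a BLAST score line.
SCORE_LINE = re.compile(r"Score = ([\d.]+) bits \((\d+)\), Expect = (\S+)")


def parse_score_line(line):
    """Return (bit_score, raw_score, e_value), or None if the line does not match."""
    match = SCORE_LINE.search(line)
    if match is None:
        return None
    bits, raw, expect = match.groups()
    # Assumption: a bare exponent such as "e-155" stands for "1e-155".
    if expect.startswith("e"):
        expect = "1" + expect
    return float(bits), int(raw), float(expect)


if __name__ == "__main__":
    print(parse_score_line("Score = 431 bits (1110), Expect = e-155"))
    print(parse_score_line("Score = 62.6 bits (152), Expect = 5e-12"))
```

With the values parsed as floats, hits can be compared against the E-value cutoff listed under the input parameters (0.05) using an ordinary numeric comparison.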


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
ECP_0406  HTHFIS  338    e-113    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 338 bits (869), Expect = e-113
Identities = 121/401 (30%), Positives = 200/401 (49%), Gaps = 54/401 (13%)

Query: 164 DLAEEAGMTGIFIYSAATVRQAFSDALDMTRMSLRHNTHDATRNALRTRYVLGDMLGQSP 223
A +A G + Y ++ + + +L ++ ++G+S
Sbjct: 88 MTAIKASEKGAYDYLPKPFDL--TELIGIIGRALAEPKRRPSK-LEDDSQDGMPLVGRSA 144

Query: 224 QMEQVRQTILLYARSSAAVLIEGETGTGKELAAQAIHREYFARHDARQGKKSHPFVAVNC 283
M+++ + + ++ ++I GE+GTGKEL A+A+H + R+ PFVA+N
Sbjct: 145 AMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHD-----YGKRRNG---PFVAINM 196

Query: 284 GAIAESLLEAELFGYEEGAFTGSRRGGRAGLFEIAHGGTLFLDEIGEMPLPLQTRLLRVL 343
AI L+E+ELFG+E+GAFTG++ G FE A GGTLFLDEIG+MP+ QTRLLRVL
Sbjct: 197 AAIPRDLIESELFGHEKGAFTGAQTR-STGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVL 255

Query: 344 EEKEVTRVGGHQPVPVDVRVISATHCNLEEDMQQGQFRRDLFYRLSILRLQLPPLRERVA 403
++ E T VGG P+ DVR+++AT+ +L++ + QG FR DL+YRL+++ L+LPPLR+R
Sbjct: 256 QQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRAE 315

Query: 404 DILPLAESFLKVSLAALSAPFAAALRQGLQASETMLVHYDWPGNIRELRNMMERLALFLS 463
DI L F++ ++ L+ + + WPGN+REL N++ RL
Sbjct: 316 DIPDLVRHFVQ-QAEKEGLDVKRFDQEALEL----MKAHPWPGNVRELENLVRRLTALYP 370

Query: 464 VES-TPDLTPQFLQ-----------------LLLPELARESAKTPIPGLLTA-------- 497
+ T ++ L+ L + + E+ + A
Sbjct: 371 QDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLYD 430

Query: 498 -----------QQALEKFNGDKTAAANYLGISRTTFWRRLK 527
AL G++ AA+ LG++R T ++++
Sbjct: 431 RVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIR 471


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_0408  PHPHTRNFRASE  30     0.023    Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 29.8 bits (67), Expect = 0.023
Identities = 11/33 (33%), Positives = 19/33 (57%), Gaps = 1/33 (3%)

Query: 65 LIHGKLPTRDE-LAAYKTKLKALRGLPANVRTV 96
+ +LPT +E AYK ++ + G P +RT+
Sbjct: 303 MDRDQLPTEEEQFEAYKEVVQRMDGKPVVIRTL 335


6  ECP_0510  ECP_0532  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
ECP_0510  -1      16       3.175258   multidrug transporter membrane\ATP-binding
ECP_0511  -1      16       2.727763   nitrogen regulatory protein P-II 2
ECP_0512  -2      15       2.516117   ammonium transporter
ECP_0513  -1      15       -0.719518  acyl-CoA thioesterase
ECP_0514  -1      16       -1.671134  hypothetical protein
ECP_0515  0       19       -2.513232  hypothetical protein
ECP_0516  1       19       -3.410709  methyltransferase
ECP_0517  1       19       -5.009432  hypothetical protein
ECP_0518  -1      13       -1.675376  diguanylate phosphodiesterase
ECP_0519  2       18       -0.569315  hypothetical protein
ECP_0520  1       17       -0.773441  maltose O-acetyltransferase
ECP_0521  1       16       -0.121367  hemolysin expression-modulating protein
ECP_0522  1       15       0.130086   hypothetical protein
ECP_0523  1       16       1.069309   acriflavine resistance protein B
ECP_0524  2       14       0.534184   acriflavin resistance protein A
ECP_0525  1       16       0.428392   DNA-binding transcriptional repressor AcrR
ECP_0526  3       16       2.423611   potassium efflux protein KefA
ECP_0527  4       16       4.098564   hypothetical protein
ECP_0528  3       17       4.654885   primosomal replication protein N''
ECP_0529  3       23       3.233475   hypothetical protein
ECP_0530  4       27       3.247212   adenine phosphoribosyltransferase
ECP_0531  3       22       3.151458   DNA polymerase III subunits gamma and tau
ECP_0532  2       22       1.619502   hypothetical protein
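
Rows of the per-ORF composition table hold a locus tag, three numeric bias values, and a product name. The sketch below is a hedged illustration of splitting one such row into typed fields, whether its columns are separated by whitespace or run together as in a raw copy of the page. The field widths (a one-digit signed DNBias, a two-digit CDNBias, a %GCBias with six decimals) are assumptions inferred from the rows in this report, not from PredictBias documentation, and `parse_bias_row` is a hypothetical name.

```python
import re

# Assumed layout (inferred from this report, not documented by the tool):
# locus tag, one-digit signed DNBias, two-digit CDNBias,
# %GCBias with exactly six decimals, then the product name.
ROW = re.compile(r"^(ECP_\d{4})\s*(-?\d)\s*(\d{2})\s*(-?\d+\.\d{6})\s*(.*)$")


def parse_bias_row(line):
    """Return (locus_tag, dn_bias, cdn_bias, gc_bias, product) or None."""
    match = ROW.match(line.strip())
    if match is None:
        return None  # header lines and non-table lines fall through here
    tag, dn, cdn, gc, product = match.groups()
    return tag, int(dn), int(cdn), float(gc), product


if __name__ == "__main__":
    for raw in (
        "ECP_0510-1163.175258multidrug transporter membrane\\ATP-binding",
        "ECP_0513  -1  15  -0.719518  acyl-CoA thioesterase",
    ):
        print(parse_bias_row(raw))
```

The six-decimal anchor on %GCBias is what keeps a product name that starts with a digit from being absorbed into the number.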
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_0510  ACRIFLAVINRP  33     0.003    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 33.3 bits (76), Expect = 0.003
Identities = 21/96 (21%), Positives = 42/96 (43%), Gaps = 4/96 (4%)

Query: 92 AAVGVVQQLRTDVMDAA--LRQPLSEFDTQ-PVGQVISRVTNDTEVIRDLYVTVVATVLR 148
A +G+ + +D A ++ L+E P G + + T ++ VV T+
Sbjct: 287 AGLGIKLATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFE 346

Query: 149 SAALVGAMLVAMFSLDWRMALVAIMIFPVVLVVMVI 184
+ LV +++ +F + R L+ + PVVL+
Sbjct: 347 AIMLV-FLVMYLFLQNMRATLIPTIAVPVVLLGTFA 381


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_0518  BCTERIALGSPF  31     0.013    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 31.0 bits (70), Expect = 0.013
Identities = 31/137 (22%), Positives = 54/137 (39%), Gaps = 24/137 (17%)

Query: 245 IWLPLGLVIGLLAAMFVLRILRRIQSPHHRLQDAIENRDICVHYQPIVSLANGKIVGAEA 304
W+ L L+ G +A +LR R+ + + P++ G+I
Sbjct: 228 PWMLLALLAGFMAFRVMLR------QEKRRVS-----FHRRLLHLPLI----GRIARGLN 272

Query: 305 LARWPQTDGSWLSPDSFIPLAQQTGLS-EPLTLLIIRSVFEDMGDWLRQHSQQHISINLE 363
AR+ +T + S +PL Q +S + ++ R D +R+ H + LE
Sbjct: 273 TARYARTLSILNA--SAVPLLQAMRISGDVMSNDYARHRLSLATDAVREGVSLHKA--LE 328

Query: 364 STVLTSEKIPQLLREMI 380
T L P ++R MI
Sbjct: 329 QTAL----FPPMMRHMI 341


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_0523  ACRIFLAVINRP  1367   0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 1367 bits (3539), Expect = 0.0
Identities = 802/1033 (77%), Positives = 916/1033 (88%), Gaps = 1/1033 (0%)

Query: 1 MPNFFIDRPIFAWVIAIIIMLAGGLAILKLPVAQYPTIAPPAVTISASYPGADAKTVQDT 60
M NFFI RPIFAWV+AII+M+AG LAIL+LPVAQYPTIAPPAV++SA+YPGADA+TVQDT
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 61 VTQVIEQNMNGIDNLMYMSSNSDSTGTVQITLTFESGTDADIAQVQVQNKLQLAMPLLPQ 120
VTQVIEQNMNGIDNLMYMSS SDS G+V ITLTF+SGTD DIAQVQVQNKLQLA PLLPQ
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 121 EVQQQGVSVEKSSSSFLMVVGVINTDGTMTQEDISDYVAANMKDAISRTSGVGDVQLFGS 180
EVQQQG+SVEKSSSS+LMV G ++ + TQ+DISDYVA+N+KD +SR +GVGDVQLFG+
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 181 QYAMRIWMNPNELNKFQLTPVDVITAIKAQNAQVAAGQLGGTPPVKGQQLNASIIAQTRL 240
QYAMRIW++ + LNK++LTPVDVI +K QN Q+AAGQLGGTP + GQQLNASIIAQTR
Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240

Query: 241 TSTEEFGKILLKVNQDGSRVLLRDVAKIELGGENYDIIAEFNGQPASGLGIKLATGANAL 300
+ EEFGK+ L+VN DGS V L+DVA++ELGGENY++IA NG+PA+GLGIKLATGANAL
Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300

Query: 301 DTAAAIRAELAKMEPFFPSGLKIVYPYDTTPFVKISIHEVVKTLVEAIILVFLVMYLFLQ 360
DTA AI+A+LA+++PFFP G+K++YPYDTTPFV++SIHEVVKTL EAI+LVFLVMYLFLQ
Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360

Query: 361 NFRATLIPTIAVPVVLLGTFAVLAAFGFSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420
N RATLIPTIAVPVVLLGTFA+LAAFG+SINTLTMFGMVLAIGLLVDDAIVVVENVERVM
Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420

Query: 421 AEEGLPPKEATRKSMGQIQGALVGIAMVLSAVFVPMAFFGGSTGAIYRQFSITIVSAMAL 480
E+ LPPKEAT KSM QIQGALVGIAMVLSAVF+PMAFFGGSTGAIYRQFSITIVSAMAL
Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480

Query: 481 SVLVALILTPALCATMLKPIAKGDHGEGKKGFFGWFNRMFEKSTHHYTDSVGGILRSTGR 540
SVLVALILTPALCAT+LKP++ H E K GFFGWFN F+ S +HYT+SVG IL STGR
Sbjct: 481 SVLVALILTPALCATLLKPVSAE-HHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539

Query: 541 YLVLYLIIVVGMAYLFVRLPSSFLPDEDQGVFMTMVQLPAGATQERTQKVLNEVTNYYLT 600
YL++Y +IV GM LF+RLPSSFLP+EDQGVF+TM+QLPAGATQERTQKVL++VT+YYL
Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599

Query: 601 KEKNNVESVFAVNGFGFAGRGQNTGIAFVSLKNWADRPGEENKVEAITMRATRAFSQIKD 660
EK NVESVF VNGF F+G+ QN G+AFVSLK W +R G+EN EA+ RA +I+D
Sbjct: 600 NEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRD 659

Query: 661 AMVFAFNLPAIVELGTATGFDFELIDQAGLGHEKLTQARNQLLAEAAKHPDMLTSVRPNG 720
V FN+PAIVELGTATGFDFELIDQAGLGH+ LTQARNQLL AA+HP L SVRPNG
Sbjct: 660 GFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNG 719

Query: 721 LEDTPQFKIDIDQEKAQALGVSINDINTTLGAAWGGSYVNDFIDRGRVKKVYVMSEAKYR 780
LEDT QFK+++DQEKAQALGVS++DIN T+ A GG+YVNDFIDRGRVKK+YV ++AK+R
Sbjct: 720 LEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFR 779

Query: 781 MLPDDIGDWYVRAADGQMVPFSAFSSSRWEYGSPRLERYNGLPSMEILGQAAPGKSTGEA 840
MLP+D+ YVR+A+G+MVPFSAF++S W YGSPRLERYNGLPSMEI G+AAPG S+G+A
Sbjct: 780 MLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDA 839

Query: 841 MELMEQLASKLPTGVGYDWTGMSYQERLSGNQAPSLYAISLIVVFLCLAALYESWSIPFS 900
M LME LASKLP G+GYDWTGMSYQERLSGNQAP+L AIS +VVFLCLAALYESWSIP S
Sbjct: 840 MALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVS 899

Query: 901 VMLVVPLGVIGALLAATFRGLTNDVYFQVGLLTTIGLSAKNAILIVEFAKDLMDKEGKGL 960
VMLVVPLG++G LLAAT NDVYF VGLLTTIGLSAKNAILIVEFAKDLM+KEGKG+
Sbjct: 900 VMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGV 959

Query: 961 IEATLDAVRMRLRPILMTSLAFILGVMPLVISTGAGSGAQNAVGTGVMGGMVTATVLAIF 1020
+EATL AVRMRLRPILMTSLAFILGV+PL IS GAGSGAQNAVG GVMGGMV+AT+LAIF
Sbjct: 960 VEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIF 1019

Query: 1021 FVPVFFVVVRRRF 1033
FVPVFFVV+RR F
Sbjct: 1020 FVPVFFVVIRRCF 1032


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
ECP_0524  RTXTOXIND  38     4e-05    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 37.9 bits (88), Expect = 4e-05
Identities = 29/173 (16%), Positives = 58/173 (33%), Gaps = 22/173 (12%)

Query: 17 KQEYDQ-ALADAQQANAAVTAAKAAVETARINLAYTKVTSPISGRIGKSNV-TEGALVQN 74
Q + L +Q + + + + +P+S ++ + V TEG +V
Sbjct: 293 TQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTT 352

Query: 75 GQATALATVQQLDPIYVDVTQSSNDFLRLKQELA----------NGTLKQENGKAKVSLI 124
+ T + V + D + V + D + KV I
Sbjct: 353 AE-TLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLV---GKVKNI 408

Query: 125 TSDGIKFPQDGTLEFSDVTVDQTTGSITLRAIFPNPDHTLLPGMFVRARLEEG 177
D I+ + G + +++++ S + I L GM V A ++ G
Sbjct: 409 NLDAIEDQRLGLVFNVIISIEENCLSTGNKNIP------LSSGMAVTAEIKTG 455


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_0525  HTHTETR  202    2e-68    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 202 bits (514), Expect = 2e-68
Identities = 196/196 (100%), Positives = 196/196 (100%)

Query: 2 ALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELEYQA 61
ALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELEYQA
Sbjct: 20 ALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELEYQA 79

Query: 62 KFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRNLCLESYD 121
KFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRNLCLESYD
Sbjct: 80 KFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRNLCLESYD 139

Query: 122 RIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDLKKEARDYVAILL 181
RIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDLKKEARDYVAILL
Sbjct: 140 RIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDLKKEARDYVAILL 199

Query: 182 EMYLLCPTLRNPATNE 197
EMYLLCPTLRNPATNE
Sbjct: 200 EMYLLCPTLRNPATNE 215


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
ECP_0526  RTXTOXIND  32     0.017    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 31.7 bits (72), Expect = 0.017
Identities = 19/125 (15%), Positives = 40/125 (32%), Gaps = 6/125 (4%)

Query: 28 QNTAFARASSNGDLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDKIDRVKEE 87
N RA L + + L L+ + A L++ ++ E
Sbjct: 207 LNLDKKRAERLTVLARINRYENLSRVEKSR--LDDFSSLLHKQAIAKHAVLEQENKYVEA 264

Query: 88 TVQLRQKVAEAPEKMRQATAALTALSDVDND--EETRKIL--STLSLRQLETRVAQALDD 143
+LR ++ + + +A V E L +T ++ L +A+ +
Sbjct: 265 VNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEER 324

Query: 144 LQNAQ 148
Q +
Sbjct: 325 QQASV 329


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
ECP_0531  IGASERPTASE  39     9e-05    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 38.5 bits (89), Expect = 9e-05
Identities = 40/251 (15%), Positives = 78/251 (31%), Gaps = 31/251 (12%)

Query: 404 PLPETTSQVLAARQQLLRVQGATKAKKSEPAA----ATRARPVNNAALERLASVTDRVQA 459
P E +Q + + + P+ AR + A + A T
Sbjct: 983 PEVEKRNQTVDTTN----ITTPNNIQADVPSVPSNNEEIARV-DEAPVPPPAPATPSETT 1037

Query: 460 RPVPSALEKAPAKKEAYRWKATTPVMQQKE--------VVATPKALKKA---LEHEKTPE 508
V ++ E AT Q +E V A + + A E ++T
Sbjct: 1038 ETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQT 1097

Query: 509 LAAKLAA---------EAIERDPWAAQVSQLSLPKLVEQVALNAWKE-ESDNAVCLHLRS 558
K A E+ +V+ PK + + E +N ++++
Sbjct: 1098 TETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKE 1157

Query: 559 SQRHLNNRGAQQKLAEALST-LKGSTVELTIVEDDNPAVRTPLEWRQAIYEEKLAQARES 617
Q N ++ A+ S+ ++ E T V N V P A + + +
Sbjct: 1158 PQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSN 1217

Query: 618 IIADNNIQTLR 628
+ + +++R
Sbjct: 1218 KPKNRHRRSVR 1228


7  ECP_0558  ECP_0593  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
ECP_0558  1       17       4.727137   thioredoxin YbbN protein
ECP_0559  0       17       4.767346   short chain dehydrogenase
ECP_0560  1       17       4.303694   multifunctional acyl-CoA thioesterase I/protease
ECP_0561  0       17       4.109797   ABC transporter
ECP_0562  -1      16       3.737139   ABC transporter
ECP_0563  -2      13       3.125094   membrane protein YbbP
ECP_0564  -3      15       0.336643   tRNA 2-selenouridine synthase
ECP_0565  -2      16       -0.457776  DNA-binding transcriptional activator AllS
ECP_0566  0       17       -1.500948  ureidoglycolate hydrolase
ECP_0567  -1      16       -1.198897  DNA-binding transcriptional repressor AllR
ECP_0568  1       16       -1.285525  glyoxylate carboligase
ECP_0569  1       16       -1.047513  hydroxypyruvate isomerase
ECP_0570  1       15       -0.867747  2-hydroxy-3-oxopropionate reductase
ECP_0571  2       13       -0.700575  allantoin permease
ECP_0572  2       12       0.284704   allantoinase
ECP_0573  3       16       1.181845   purine permease YbbY
ECP_0574  2       15       2.233252   glycerate kinase
ECP_0575  2       15       2.297456   hypothetical protein
ECP_0576  1       16       3.638881   allantoate amidohydrolase
ECP_0577  0       15       4.477573   malate/L-lactate dehydrogenase
ECP_0578  1       16       5.271834   membrane protein FdrA
ECP_0579  1       16       4.878966   hypothetical protein
ECP_0580  1       15       4.184353   hypothetical protein
ECP_0581  1       15       2.861011   carbamate kinase
ECP_0582  2       20       2.272796   phosphoribosylaminoimidazole carboxylase ATPase
ECP_0583  2       18       1.407763   phosphoribosylaminoimidazole carboxylase
ECP_0584  2       19       0.731471   hypothetical protein
ECP_0585  3       20       1.835256   UDP-2,3-diacylglucosamine hydrolase
ECP_0586  3       22       1.321804   peptidyl-prolyl cis-trans isomerase B
ECP_0587  1       17       1.530112   cysteinyl-tRNA synthetase
ECP_0588  1       18       -0.043457  membrane-bound metal-dependent hydrolase
ECP_0589  1       19       -0.550914  hypothetical protein
ECP_0590  0       20       -3.307074  bifunctional 5,10-methylene-tetrahydrofolate
ECP_0591  5       31       -7.187994  hypothetical protein
ECP_0593  1       24       -5.251160  *DNA integration/recombination/invertion protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_0559  DHBDHDRGNASE  78     4e-19    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 77.8 bits (191), Expect = 4e-19
Identities = 49/212 (23%), Positives = 81/212 (38%), Gaps = 7/212 (3%)

Query: 3 KSVLITGCSSGIGLESALELKRQGFHVLAGCRKPDDVERMNS----MGFT--GVLIDLDS 56
K ITG + GIG A L QG H+ A P+ +E++ S D+
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 57 PESVDRAADEVIALTDNCLYGIFNNAGFGMYGPLSTISRAQMEQQFSANFFGAHQLTMRL 116
++D + + + N AG G + ++S + E FS N G + +
Sbjct: 69 SAAIDEITARIEREMGP-IDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127

Query: 117 LPAMLPHGEGRIVMTSSVMGLISTPGRGAYAASKYALEAWSDALRMELRHSGIKVSLIEP 176
M+ G IV S + AYA+SK A ++ L +EL I+ +++ P
Sbjct: 128 SKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187

Query: 177 GPIRTRFTDNVNQTQSDKPVENPGIAARFTLG 208
G T ++ ++ G F G
Sbjct: 188 GSTETDMQWSLWADENGAEQVIKGSLETFKTG 219


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_0561  PF05272       30     5e-04    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.0 bits (67), Expect = 5e-04
Identities = 14/30 (46%), Positives = 16/30 (53%)

Query: 41 LVGESGSGKSTLLAILAGLDDGSSGEVRLG 70
L G G GKSTL+ L GLD S +G
Sbjct: 601 LEGTGGIGKSTLINTLVGLDFFSDTHFDIG 630


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_0567  PF09025       28     0.020    YopR Core
		>PF09025#YopR Core

Length = 143

Score = 28.1 bits (62), Expect = 0.020
Identities = 17/61 (27%), Positives = 25/61 (40%), Gaps = 8/61 (13%)

Query: 126 EAVLIGQLECKSMVRMCAPLGSR--------LPLHASGAGKALLYPLAEEELMSIILQTG 177
+ + +LE K+M+R PLG + L G L LA EL +I G
Sbjct: 68 QGLEADRLELKAMLRAELPLGRQQQTFLLQLLGAVEHAPGGEYLAQLARRELQVLIPLNG 127

Query: 178 L 178
+
Sbjct: 128 M 128


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_0572  UREASE        56     1e-10    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 56.3 bits (136), Expect = 1e-10
Identities = 39/163 (23%), Positives = 60/163 (36%), Gaps = 32/163 (19%)

Query: 4 DLIIKNGTVILENEARVVDIAVKDGKIAAIG-------QD-----LGDAKDVMDASGLVV 51
D +I N ++ DI +KDG+IAAIG Q +G +V+ G +V
Sbjct: 69 DTVITNALILDHWGIVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAGEGKIV 128

Query: 52 SPGMVDAHTHISEPGRSHWEGYETGTRAAAKGGITTMIEMPLNQLPATVDRAS------- 104
+ G +D+H H P + A G+T M+ PA A+
Sbjct: 129 TAGGMDSHIHFICPQQIE---------EALMSGLTCMLGGGTG--PAHGTLATTCTPGPW 177

Query: 105 -IELKFDAAKGKLTIDAAQLGGLVSYNIDRLHELDEVGVVGFK 146
I +AA ++ A G + L E+ G K
Sbjct: 178 HIARMIEAADA-FPMNLAFAGKGNASLPGALVEMVLGGATSLK 219


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_0581  CARBMTKINASE  383    e-136    Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 383 bits (984), Expect = e-136
Identities = 124/310 (40%), Positives = 176/310 (56%), Gaps = 16/310 (5%)

Query: 2 KTLVVALGGNALLQRGEALTAKNQYRNIASAVPALARL-ARSYRLAIVHGNGPQVGLLAL 60
K +V+ALGGNAL QRG+ + + N+ +A + AR Y + I HGNGPQVG L L
Sbjct: 3 KRVVIALGGNALQQRGQKGSYEEMMDNVRKTARQIAEIIARGYEVVITHGNGPQVGSLLL 62

Query: 61 QNLAWKE---VEPYPLDILVAESQGMIGYMLAQSLSAQPQM----PPVTTVLTRIEVSPD 113
A + + P+D+ A SQG IGYM+ Q+L + + V T++T+ V +
Sbjct: 63 HMDAGQATYGIPAQPMDVAGAMSQGWIGYMIQQALKNELRKRGMEKKVVTIITQTIVDKN 122

Query: 114 DPAFLQPEKFIGPVYQPEEQEALEAAYGWQMKRD-GKYLRRVVASPQPRKILDSEAIELL 172
DPAF P K +GP Y E + L GW +K D G+ RRVV SP P+ +++E I+ L
Sbjct: 123 DPAFQNPTKPVGPFYDEETAKRLAREKGWIVKEDSGRGWRRVVPSPDPKGHVEAETIKKL 182

Query: 173 LKEGHVVICSGGGGVPVAEDG---AGSEAVIDKDLAAALLAEQINADGLVILTDADAVYE 229
++ G +VI SGGGGVPV + G EAVIDKDLA LAE++NAD +ILTD +
Sbjct: 183 VERGVIVIASGGGGVPVILEDGEIKGVEAVIDKDLAGEKLAEEVNADIFMILTDVNGAAL 242

Query: 230 NWGTPQQRAIRHATPDELAPFAKAD----GSMGPKVTAVSGYVRSRGKPAWIGALSRIEE 285
+GT +++ +R +EL + + GSMGPKV A ++ G+ A I L + E
Sbjct: 243 YYGTEKEQWLREVKVEELRKYYEEGHFKAGSMGPKVLAAIRFIEWGGERAIIAHLEKAVE 302

Query: 286 TLAGEAGTCI 295
L G+ GT +
Sbjct: 303 ALEGKTGTQV 312


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_0587  RTXTOXIND     29     0.029    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 29.4 bits (66), Expect = 0.029
Identities = 16/150 (10%), Positives = 44/150 (29%), Gaps = 8/150 (5%)

Query: 299 RSQLNYSEENLKQARAALERLYTALRGTDKTVAPAGGEAFEARFIEAMDDDFNTP----- 353
+ ++ +L QAR R R + P E F +++
Sbjct: 133 EADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIK 192

Query: 354 EAYSVLFDMAREVNRLKAEDMAAANAMASHLRKLSAVLGLLEQEPEAFLQSGAQADDSEV 413
E +S + + + A + + + + + + + + F +
Sbjct: 193 EQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSS---LLHKQAI 249

Query: 414 AEIEALIQQRLDARKAKDWAAADAARDRLN 443
A+ L Q+ + + +++
Sbjct: 250 AKHAVLEQENKYVEAVNELRVYKSQLEQIE 279


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_0593  ANTHRAXTOXNA  27     0.015    Anthrax toxin LF subunit signature.
		>ANTHRAXTOXNA#Anthrax toxin LF subunit signature.

Length = 800

Score = 27.0 bits (59), Expect = 0.015
Identities = 16/65 (24%), Positives = 28/65 (43%), Gaps = 8/65 (12%)

Query: 41 IPSGVPLSVLQEMGG---WESREMVRRYAHLAPNHLTEHARKIDDIFGDNVPLWN-YRRN 96
IP V L + E+GG + ++V H L+E + + G+ VP + +
Sbjct: 89 IPKDV-LEIYSELGGEIYFTDIDLVE---HKELQDLSEEEKNSMNSRGEKVPFASRFVFE 144

Query: 97 KEGVT 101
K+ T
Sbjct: 145 KKRET 149
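
The "Score" and "Identities" lines that precede each alignment can be read programmatically. The following is a minimal sketch, not part of the tool itself: the function names are hypothetical, the cutoff of 0.05 is taken from the E-value (RPSBlast) input parameter, and treating the cutoff as inclusive is an assumption. It also handles the shortened form `e-136`, which stands for 1e-136.

```python
import re

# Patterns for the two summary lines that precede each alignment in the report.
SCORE_RE = re.compile(r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)")
IDENT_RE = re.compile(r"Identities\s*=\s*(\d+)/(\d+)")

E_VALUE_CUTOFF = 0.05  # E-value (RPSBlast) from the input parameters; inclusive use is assumed


def parse_evalue(text):
    """Convert an E-value string to float; the report shortens 1e-136 to 'e-136'."""
    text = text.strip().rstrip(",")
    return float("1" + text) if text.startswith("e") else float(text)


def summarize_hsp(score_line, identities_line):
    """Return bit score, E-value, fractional identity and a cutoff flag (hypothetical helper)."""
    score = SCORE_RE.search(score_line)
    ident = IDENT_RE.search(identities_line)
    if score is None or ident is None:
        raise ValueError("not a Score/Identities line pair")
    matches, length = int(ident.group(1)), int(ident.group(2))
    evalue = parse_evalue(score.group(3))
    return {
        "bits": float(score.group(1)),
        "evalue": evalue,
        "identity": matches / length,
        "passes_cutoff": evalue <= E_VALUE_CUTOFF,
    }


hsp = summarize_hsp(
    "Score = 383 bits (984), Expect = e-136",
    "Identities = 124/310 (40%), Positives = 176/310 (56%), Gaps = 16/310 (5%)",
)
print(hsp)
```

Computing the identity fraction from the raw counts avoids relying on the rounded percentage printed in parentheses.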


8  ECP_0613  ECP_0644  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
ECP_0613  -2      11       3.098930   hypothetical protein
ECP_0614  -2      10       2.078094   phosphopantetheinyltransferase component of
ECP_0615  -3      11       2.502262   outer membrane receptor FepA
ECP_0616  -1      13       2.987570   enterobactin/ferric enterobactin esterase
ECP_0617  0       13       3.612539   MbtH-like protein
ECP_0618  0       14       3.932149   enterobactin synthase subunit F
ECP_0619  1       14       3.476417   ferric enterobactin transport protein FepE
ECP_0620  0       14       5.407218   iron-enterobactin transporter ATP-binding
ECP_0621  1       16       5.599247   iron-enterobactin transporter permease
ECP_0622  0       16       5.309685   iron-enterobactin transporter membrane protein
ECP_0623  -1      17       4.861690   enterobactin exporter EntS
ECP_0624  -2      17       4.703620   iron-enterobactin transporter periplasmic
ECP_0625  -1      22       5.137995   isochorismate synthase
ECP_0626  -1      23       5.206866   enterobactin synthase subunit E
ECP_0627  0       22       4.986967   isochorismatase
ECP_0628  0       20       4.720872   2,3-dihydroxybenzoate-2,3-dehydrogenase
ECP_0629  0       20       3.448961   hypothetical protein
ECP_0630  -1      16       1.597656   carbon starvation protein A
ECP_0631  -1      17       -2.881958  hypothetical protein
ECP_0632  -1      16       -3.247232  hypothetical protein
ECP_0633  -2      16       -4.578713  aminotransferase
ECP_0634  -2      19       -4.564263  hypothetical protein
ECP_0635  -2      20       -4.374241  hypothetical protein
ECP_0636  -1      22       -3.495593  transcriptional regulator YbdO
ECP_0637  0       23       -0.773441  disulfide isomerase/thiol-disulfide oxidase
ECP_0638  0       22       -0.089607  alkyl hydroperoxide reductase
ECP_0639  0       15       0.731296   alkyl hydroperoxide reductase
ECP_0640  0       15       1.421498   universal stress protein
ECP_0641  -1      15       2.684879   nucleoside diphosphate kinase regulator
ECP_0642  -2      17       3.059338   ribonuclease I
ECP_0643  -2      18       3.647971   citrate/succinate antiporter
ECP_0644  -2      20       3.961961   triphosphoribosyl-dephospho-CoA synthase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_0613  HOKGEFTOXIC   64     4e-18    Hok/Gef cell toxic protein family signature.
		>HOKGEFTOXIC#Hok/Gef cell toxic protein family signature.

Length = 52

Score = 63.7 bits (155), Expect = 4e-18
Identities = 18/52 (34%), Positives = 28/52 (53%)

Query: 11 INMLTKYALVAVIVLCLTVLGFTLLVGDSLCEFTVKERNIEFKAVLAYEPKK 62
+ + + V+++CLT+L FT L SLCE ++ E A +AYE K
Sbjct: 1 MKLPRSSLVWCVLIVCLTLLIFTYLTRKSLCEIRYRDGYREVAAFMAYESGK 52


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_0614  ENTSNTHTASED  266    3e-93    Enterobactin synthetase component D signature.
		>ENTSNTHTASED#Enterobactin synthetase component D signature.

Length = 234

Score = 266 bits (681), Expect = 3e-93
Identities = 109/184 (59%), Positives = 132/184 (71%), Gaps = 1/184 (0%)

Query: 4 MKTTHTSLPFAGHTLHFVEFDPASFREQDLLWLPHYAQLQHAGRKRKTEHLAGRIAAIYA 63
M T+H LPFAGH LH V+FD +SFRE DLLWLPH+ +L+ AGRKRK EHLAGRIAA++A
Sbjct: 1 MLTSHFPLPFAGHRLHIVDFDASSFREHDLLWLPHHDRLRSAGRKRKAEHLAGRIAAVHA 60

Query: 64 LREYGYKCVPAIGELRQPVWPAGVYGSISHCGTTALAVVSRQPIGIDIEEIFSAQTAREL 123
LRE G + VP +G+ RQP+WP G++GSISHC TTALAV+SRQ IGIDIE+I S TA EL
Sbjct: 61 LREVGVRTVPGMGDKRQPLWPDGLFGSISHCATTALAVISRQRIGIDIEKIMSQHTATEL 120

Query: 124 TDNIITPAEHKRLADCGLAFPLALTLAFSAKESAFKA-SEIQAAQGFLDYQIISWNKQQI 182
+II E + L L FPLALTLAFSAKES +KA S+ GF ++ S I
Sbjct: 121 APSIIDSDERQILQASLLPFPLALTLAFSAKESVYKAFSDRVTLPGFNSAKVTSLTATHI 180

Query: 183 IIRL 186
+ L
Sbjct: 181 SLHL 184


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_0623  TCRTETA       36     2e-04    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 36.0 bits (83), Expect = 2e-04
Identities = 82/394 (20%), Positives = 146/394 (37%), Gaps = 38/394 (9%)

Query: 27 FISIVSLGLLGVAVPVQIQMMTHSTWQV---GLSVTLTGGAMFVGLMVGGVLADRYERKK 83
+ V +GL+ +P ++ + HS G+ + L F V G L+DR+ R+
Sbjct: 15 ALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRP 74

Query: 84 VILLARGTCGIGFIGLCLNALL--PEPSLLAIYLLGLWDGFFASLGVTALLAATPALVGR 141
V+L + G ++ + P L +Y+ + G + G A A +
Sbjct: 75 VLL-------VSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVA-GAYIADITDG 126

Query: 142 ENLMQAGAITMLTVRLGSVISPMIGGLLLATGGVAWNYGLAAAGTFITLLPLLSLPALPP 201
+ + G V P++GGL+ GG + + AA L L LP
Sbjct: 127 DERARHFGFMSACFGFGMVAGPVLGGLM---GGFSPHAPFFAAAALNGLNFLTGCFLLPE 183

Query: 202 PPQPREHPLK----SLLAGFRFLLASPLVGGIALLGGLLTMAS----AVRVLYPALADNW 253
+ PL+ + LA FR+ +V + + ++ + A+ V++ D +
Sbjct: 184 SHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFG--EDRF 241

Query: 254 QMSAAQIGFLYAAIP-LGAAIGALTSGKLAHSVRPGLLMLLSTLG---AFLAIGLFGLMP 309
A IG AA L + A+ +G +A + ++L + ++ +
Sbjct: 242 HWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGW 301

Query: 310 MWILGVVCLALFGWLSAVSSLLQYTMLQTQTPEAMLGRINGLWTAQNVTGDAIGAALLGG 369
M +V LA G ML Q E G++ G A +G L
Sbjct: 302 MAFPIMVLLASGGIGMPALQ----AMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTA 357

Query: 370 LGAMMTPVASASASGFGLLIIGVLLLLVLVELRR 403
+ A + + +G+ + L LL L LRR
Sbjct: 358 IYA----ASITTWNGWAWIAGAALYLLCLPALRR 387


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_0624  FERRIBNDNGPP  64     7e-14    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 64.2 bits (156), Expect = 7e-14
Identities = 60/280 (21%), Positives = 100/280 (35%), Gaps = 35/280 (12%)

Query: 40 HTLESQPQRIVSTSVTLTGSLLAIDAPVIASGATTPNNRVADDQGFLRQWSKVAKERKLQ 99
H P RIV+ LLA+ VAD + R W E L
Sbjct: 29 HAAAIDPNRIVALEWLPVELLLALGIVPYG---------VADTINY-RLW---VSEPPLP 75

Query: 100 RLYIG-----EPSAEAVATQMPDLILISATGGDSALALYDQLSTIAPTLIINYDDKS--- 151
I EP+ E + P ++ SA G S + L+ IAP N+ D
Sbjct: 76 DSVIDVGLRTEPNLELLTEMKPSFMVWSAGYGPS----PEMLARIAPGRGFNFSDGKQPL 131

Query: 152 --WQSLLTQLGEITGHEKQAAERIAQFDKQLAAAKEQIKLPPQPVTAIVYTAAAHSANLW 209
+ LT++ ++ + A +AQ++ + + K + + ++
Sbjct: 132 AMARKSLTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDPRHMLVF 191

Query: 210 TPESAQGQMLEQLGFTLAKLPAGLNASQSQGKRHDIIQLGGENLAAGLNGESLFLFAGDQ 269
P S ++L++ G NA Q + + + LAA + + L +
Sbjct: 192 GPNSLFQEILDEYGIP--------NAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDHDNS 243

Query: 270 KDADAIYANPLLAHLPAVQNKQVYALGTETFRLDYYSAMQ 309
KD DA+ A PL +P V+ + + F SAM
Sbjct: 244 KDMDALMATPLWQAMPFVRAGRFQRVPAVWFYGATLSAMH 283


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_0627  ISCHRISMTASE  439    e-159    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 439 bits (1130), Expect = e-159
Identities = 145/299 (48%), Positives = 194/299 (64%), Gaps = 18/299 (6%)

Query: 1 MAIPRLQAYALPESHDIPHNKVDWAFEPQRAALLIHDMQDYFVSFWGENCPMMEQVIANI 60
MAIP +Q Y +P + D+P NKV W +P RA LLIHDMQ+YFV + + ++ ANI
Sbjct: 1 MAIPAIQPYQMPTASDMPQNKVSWVPDPNRAVLLIHDMQNYFVDAFTAGASPVTELSANI 60

Query: 61 AALRDYCKQHNIPVYYTAQPKEQSDEDRALLNDMWGPGLTRSPEQQKVVDRLTPDADDTV 120
L++ C Q IPV YTAQP Q+ +DRALL D WGPGL P ++K++ L P+ DD V
Sbjct: 61 RKLKNQCVQLGIPVVYTAQPGSQNPDDRALLTDFWGPGLNSGPYEEKIITELAPEDDDLV 120

Query: 121 LVKWRYSAFHRSPLEQMLKESGRNQLIITGVYAHIGCMTTATDAFMRDIKPFMVADALAD 180
L KWRYSAF R+ L +M+++ GR+QLIITG+YAHIGC+ TA +AFM DIK F V DA+AD
Sbjct: 121 LTKWRYSAFKRTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVAD 180

Query: 181 FSRDEHLMSLKYVAGRSGRVVMTEELL------PAPIPASKA-----------ALREVIL 223
FS ++H M+L+Y AGR VMT+ LL PA + + A +R+ I
Sbjct: 181 FSLEKHQMALEYAAGRCAFTVMTDSLLDQLQNAPADVQKTSANTGKKNVFTCENIRKQIA 240

Query: 224 PLLDESDEPFDDD-NLIDYGLDSVRMMALAARWRKVYGDIDFVMLAKNPTIDAWWKLLS 281
LL E+ E D +L+D GLDSVR+M L +WR+ ++ FV LA+ PTI+ W KLL+
Sbjct: 241 ELLQETPEDITDQEDLLDRGLDSVRIMTLVEQWRREGAEVTFVELAERPTIEEWQKLLT 299


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_0628  DHBDHDRGNASE  364    e-131    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 364 bits (936), Expect = e-131
Identities = 110/258 (42%), Positives = 150/258 (58%), Gaps = 20/258 (7%)

Query: 5 GKNVWVTGAGKGIGYATALAFVEAGAKVTGFD---------------QAFTQEQYPFATE 49
GK ++TGA +GIG A A GA + D +A E +P
Sbjct: 8 GKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFP---- 63

Query: 50 VMDVADAAQVAQVCQRLLAETERLDVLVNAAGILRMGATDQLSKEDWQQTFAVNVGGAFN 109
DV D+A + ++ R+ E +D+LVN AG+LR G LS E+W+ TF+VN G FN
Sbjct: 64 -ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFN 122

Query: 110 LFQQTMNQFRRQRGGAIVTVASDAAHTPRIGMSAYGASKAALKSLALSVGLELAGSGVRC 169
+ +R G+IVTV S+ A PR M+AY +SKAA +GLELA +RC
Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 170 NVVSPGSTDTDMQRTLWVSDDAEEQRIRGFGEQFKLGIPLGKIARPQEIANTILFLASDL 229
N+VSPGST+TDMQ +LW ++ EQ I+G E FK GIPL K+A+P +IA+ +LFL S
Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242

Query: 230 ASHITLQDIVVDGGSTLG 247
A HIT+ ++ VDGG+TLG
Sbjct: 243 AGHITMHNLCVDGGATLG 260


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_0637  BCTLIPOCALIN  29     0.015    Bacterial lipocalin signature.
		>BCTLIPOCALIN#Bacterial lipocalin signature.

Length = 171

Score = 28.8 bits (64), Expect = 0.015
Identities = 18/98 (18%), Positives = 39/98 (39%), Gaps = 13/98 (13%)

Query: 30 QGITIIKTFDAPGGMKGYLGKYQDMGVTIYLTPDGKHAISG--YMYNEKGENLSNTLIEK 87
+ + + F+ YLGK+ ++ + G ++ + N+ G ++ N
Sbjct: 21 ESVKPVSDFEL----NNYLGKWYEVARLDHSFERGLSQVTAEYRVRNDGGISVLN----- 71

Query: 88 EIYAPAGREMWQRMEQSHWLLDGKKDAPVIVYVFADPF 125
Y+ + W+ E + ++G D + V F PF
Sbjct: 72 RGYSEE-KGEWKEAEGKAYFVNGSTDGYLKVSFFG-PF 107


9  ECP_0745  ECP_0750  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
ECP_0745  7       16       -0.326622  hypothetical protein
ECP_0746  2       18       0.233074   hypothetical protein
ECP_0747  2       21       0.276770   acyl-CoA thioester hydrolase
ECP_0748  3       20       0.187675   colicin uptake protein TolQ
ECP_0749  3       19       0.185109   colicin uptake protein TolR
ECP_0750  3       17       -0.266433  cell envelope integrity inner membrane protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_0750  IGASERPTASE   58     4e-11    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 58.2 bits (140), Expect = 4e-11
Identities = 31/194 (15%), Positives = 72/194 (37%), Gaps = 14/194 (7%)

Query: 114 QEQKNQAEEAAKQAELKQKQAEEAAAKAAADAKAKAE----------ADAKAAEEAAK-- 161
E++NQ + QA+ + + + A+ + ++ E A+
Sbjct: 985 VEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENS 1044

Query: 162 KAAADAKKKAEAEAAKAAVEAQKKAEAAAAALKKKAEAAEAA--AAEARKKAATEAAEKA 219
K + +K E +A + + ++ A+ A + +K + E A +E ++ TE E A
Sbjct: 1045 KQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETA 1104

Query: 220 KAEAEKKAAAEKAAADKKAAAEKAAADKKAAEKAAAEKAAADKKAAAEKAAADKKAAAAK 279
E E+KA E + + K+ + +A ++ + ++
Sbjct: 1105 TVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNT 1164

Query: 280 AAAEKAAAAKAAAE 293
A + A + ++
Sbjct: 1165 TADTEQPAKETSSN 1178



Score = 57.4 bits (138), Expect = 8e-11
Identities = 31/236 (13%), Positives = 86/236 (36%), Gaps = 11/236 (4%)

Query: 68 QSQESSAKRSDEQRKMKEQQAAEELREKQAAEQERLKQLEKERLAAQEQKNQAEEAAKQA 127
Q+ S ++E+ ++ +E ++ + +KN E+ A +
Sbjct: 1004 QADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKN--EQDATET 1061

Query: 128 ELKQKQ-AEEAAAKAAAD------AKAKAEADAKAAEEAAKKAAADAKKKAEAEAAKAAV 180
+ ++ A+EA + A+ A++ +E E + A + ++KA+ E K
Sbjct: 1062 TAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQE 1121

Query: 181 EAQKKAEAAAAALKKKAEAAEAAAAEARKKAATEAAEKAKAEAEKKAAAEKAAADKKAAA 240
+ ++ + ++++E + A AR+ T ++ +++ A E+ A + +
Sbjct: 1122 VPKVTSQVSPK--QEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNV 1179

Query: 241 EKAAADKKAAEKAAAEKAAADKKAAAEKAAADKKAAAAKAAAEKAAAAKAAAEADD 296
E+ + + + A ++ K + ++ +
Sbjct: 1180 EQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVE 1235



Score = 55.1 bits (132), Expect = 4e-10
Identities = 33/265 (12%), Positives = 83/265 (31%), Gaps = 14/265 (5%)

Query: 51 DAVMVDSGAVVEQYKRMQSQESSAKRSDEQRKMKEQQAAE-ELREKQAAEQER------L 103
D V A + ++ ++K+ + + EQ A E + ++ A++ +
Sbjct: 1021 DEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANT 1080

Query: 104 KQLEKERLAAQEQKNQAEEAAKQAELKQKQAEEAAAKAAADAKAKAEADAKAAEEAAKKA 163
+ E + E K K+ +K+ KA + + E ++ + K+
Sbjct: 1081 QTNEVAQSG-SETKETQTTETKETATVEKE-----EKAKVETEKTQEVPKVTSQVSPKQE 1134

Query: 164 AADA-KKKAEAEAAKAAVEAQKKAEAAAAALKKKAEAAEAAAAEARKKAATEAAEKAKAE 222
++ + +AE K+ ++ + A+ ++ +
Sbjct: 1135 QSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNS 1194

Query: 223 AEKKAAAEKAAADKKAAAEKAAADKKAAEKAAAEKAAADKKAAAEKAAADKKAAAAKAAA 282
+ A + +++ K + + + + A + A +
Sbjct: 1195 VVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTS 1254

Query: 283 EKAAAAKAAAEADDIFGELSSGKNA 307
A + A A F L+ GK
Sbjct: 1255 TNTNAVLSDARAKAQFVALNVGKAV 1279



Score = 54.7 bits (131), Expect = 4e-10
Identities = 29/229 (12%), Positives = 71/229 (31%), Gaps = 4/229 (1%)

Query: 66 RMQSQESSAKRSDEQRKMKEQQAAEELREKQAAEQERLKQLEKERLAAQEQKNQAEEAAK 125
R ++E+ + + + Q+ E +E Q E + +EKE A E + E
Sbjct: 1066 REVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKV 1125

Query: 126 QAELKQKQAEEAAAKAAADAKAKAEADAKAAEEAAKKAAADAKKKAEAEAAKAAVEAQKK 185
+++ KQ + + A+ + + +E + A + A+ + VE
Sbjct: 1126 TSQVSPKQEQSETVQPQAEPARENDP-TVNIKEPQSQTNTTADTEQPAKETSSNVEQPVT 1184

Query: 186 AEAAAAALKKKAEAAEAAAAEARKKAATEAAEKAKAEAEKKAAAEKAAADKKAAA---EK 242
E E + + +++ + A ++
Sbjct: 1185 ESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDR 1244

Query: 243 AAADKKAAEKAAAEKAAADKKAAAEKAAADKKAAAAKAAAEKAAAAKAA 291
+ +D +A A+ A + A ++ ++ +
Sbjct: 1245 STVALCDLTSTNTNAVLSDARAKAQFVALNVGKAVSQHISQLEMNNEGQ 1293


10  ECP_0778  ECP_0789  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
ECP_0778  2       11       -1.847224  phosphotransferase
ECP_0779  1       11       -1.502270  6-phosphogluconolactonase
ECP_0780  1       12       -1.488657  transcriptional regulator
ECP_0781  2       14       -0.998579  transcriptional regulator
ECP_0782  1       12       -0.529007  hypothetical protein
ECP_0783  0       13       0.479397   membrane transport protein
ECP_0784  -1      12       1.484482   hypothetical protein
ECP_0785  -1      11       2.836423   pectinesterase
ECP_0786  1       12       3.272367   kinase inhibitor protein
ECP_0787  0       12       3.367680   virulence protein GipA
ECP_0788  -2      13       3.594202   adenosylmethionine-8-amino-7-oxononanoate
ECP_0789  -1      13       3.169688   biotin synthase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_0781  HTHFIS        27     0.009    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 26.7 bits (59), Expect = 0.009
Identities = 6/18 (33%), Positives = 12/18 (66%)

Query: 44 AAKLLNITQPALTRRIKK 61
AA LL + + L ++I++
Sbjct: 455 AADLLGLNRNTLRKKIRE 472


11  ECP_0803  ECP_0811  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
ECP_0803  -1      22       3.386703   cardiolipin synthase 2
ECP_0804  -1      22       3.761900   hypothetical protein
ECP_0805  -2      22       3.516680   hypothetical protein
ECP_0806  -1      20       3.796292   membrane protein YbhR
ECP_0807  -1      18       3.877660   inner membrane protein
ECP_0808  -2      16       3.637684   ABC transporter ATP-binding protein
ECP_0809  -1      13       3.558730   hypothetical protein
ECP_0810  -1      12       3.260069   DNA-binding transcriptional regulator
ECP_0811  0       13       3.271611   ATP-dependent RNA helicase RhlE
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_0806  ABC2TRNSPORT  47     3e-08    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 47.2 bits (112), Expect = 3e-08
Identities = 36/146 (24%), Positives = 63/146 (43%), Gaps = 5/146 (3%)

Query: 197 AREREQGTLDQLLVSPLTTWQIFIGKAVPALIVATFQATIVLAIGIWAYQIPFAGSLALF 256
R Q T + +L + L I +G+ A A IG+ A + + L+L
Sbjct: 92 GRMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGA---GIGVVAAALGYTQWLSLL 148

Query: 257 YFTMVI--YGLSLVGFGLLISSLCSTQQQAFIGVFVFMMPAILLSGYVSPVENMPVWLQN 314
Y VI GL+ G+++++L + + + P + LSG V PV+ +P+ Q
Sbjct: 149 YALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPIVFQT 208

Query: 315 LTWINPIRHFTDITKQIYLKDASLDI 340
P+ H D+ + I L +D+
Sbjct: 209 AARFLPLSHSIDLIRPIMLGHPVVDV 234


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_0808  PF05272       31     0.012    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 31.2 bits (70), Expect = 0.012
Identities = 20/90 (22%), Positives = 28/90 (31%), Gaps = 21/90 (23%)

Query: 293 TPRFEDAFIDLLGGAGTSESPLGAILHTVEGTPGETVIEAKELTKKFGDFAATDHVNFAV 352
PR E + +LG P + + + K HV +
Sbjct: 547 VPRLEKWLVHVLGKTPDDYKP-------------RRLRYLQLVGKYI----LMGHVARVM 589

Query: 353 KRGEIFG----LLGPNGAGKSTTFKMMCGL 378
+ G F L G G GKST + GL
Sbjct: 590 EPGCKFDYSVVLEGTGGIGKSTLINTLVGL 619



Score = 29.3 bits (65), Expect = 0.047
Identities = 11/23 (47%), Positives = 13/23 (56%)

Query: 34 YVTGLVGPDGAGKTTLMRMLAGL 56
Y L G G GK+TL+ L GL
Sbjct: 597 YSVVLEGTGGIGKSTLINTLVGL 619


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_0809  RTXTOXIND     62     7e-13    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 62.2 bits (151), Expect = 7e-13
Identities = 42/259 (16%), Positives = 92/259 (35%), Gaps = 25/259 (9%)

Query: 82 ALMQAKAGVSVAQAQYDLMLAGYRDEEIAQAAAAVKQAQAAYDYAQNFYNRQQGLWKSRT 141
Q + + +A+ +LA E + + + + L +
Sbjct: 201 QKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENK 260

Query: 142 ISA--NDLENARSSRDQAQATLKSAQDKLRQYRSGNREQ---DIAQAKASLEQAQAQLAQ 196
N+L +S +Q ++ + SA+++ + + + + Q ++ +LA+
Sbjct: 261 YVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAK 320

Query: 197 AELNLQDSTLVAPSDGTLLTRAV-EPGTVLNEGGTVFTVSLT-RPVWVRAYVDERNLDQA 254
E Q S + AP + V G V+ T+ + + V A V +++
Sbjct: 321 NEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFI 380

Query: 255 QPGRKVLLYTDGRPNKPYH---GQIGFVSPTAEFTPKTVETPDLRTDLVYRLRIVVT--- 308
G+ ++ + P Y G++ ++ A D R LV+ + I +
Sbjct: 381 NVGQNAIIKVEAFPYTRYGYLVGKVKNINLDA--------IEDQRLGLVFNVIISIEENC 432

Query: 309 ----DADDALRQGMPVTVQ 323
+ + L GM VT +
Sbjct: 433 LSTGNKNIPLSSGMAVTAE 451


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_0810  HTHTETR       72     9e-18    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 72.4 bits (177), Expect = 9e-18
Identities = 32/220 (14%), Positives = 74/220 (33%), Gaps = 29/220 (13%)

Query: 13 KGEQAKKQLIAAALAQFGEYGMNATT-REIAAQAGQNIAAITYYFGSKEDLYLACAQWIA 71
+ ++ ++ ++ AL F + G+++T+ EIA AG AI ++F K DL+ +
Sbjct: 8 EAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSE 67

Query: 72 DFIGEQFRPHAEEAERLFAQPQPDRAAIRELILRACRNMIKLLTQDDTVNLSK---FISR 128
IGE E + P + +RE+++ + + + + +
Sbjct: 68 SNIGELEL---EYQAKFPGDP---LSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVG 121

Query: 129 EQLSPTAAYHLVHEQVISPLHSHLTRLIAAW---TGCDASDTRMILHTHALIGEILAFRL 185
E A + + + L I A +I+ I ++
Sbjct: 122 EMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIM--RGYISGLM---- 175

Query: 186 GKETILLRTGWTAFDEEKTELINQTVTCHIDLILQGLSQR 225
W + + + ++ ++L+
Sbjct: 176 --------ENWLFAPQSFD--LKKEARDYVAILLEMYLLC 205


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_0811  SECA          30     0.026    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 29.8 bits (67), Expect = 0.026
Identities = 20/67 (29%), Positives = 34/67 (50%), Gaps = 4/67 (5%)

Query: 246 QQVLVFTRTKHGANHLAEQLNKDGIRSAAIHG-NKSQGARTRALADFKSGDIRVLVATDI 304
Q VLV T + + ++ +L K GI+ ++ + A A A + + V +AT++
Sbjct: 450 QPVLVGTISIEKSELVSNELTKAGIKHNVLNAKFHANEAAIVAQAGYPAA---VTIATNM 506

Query: 305 AARGLDI 311
A RG DI
Sbjct: 507 AGRGTDI 513


12  ECP_0836  ECP_0847  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias    Product
ECP_0836  -1      13       3.491538   formate acetyltransferase 3
ECP_0837  -1      11       3.408933   pyruvate formate-lyase 3 activating enzyme
ECP_0838  -1      12       3.417628   fructose-6-phosphate aldolase
ECP_0839  -1      12       3.127971   molybdopterin biosynthesis protein MoeB
ECP_0840  -1      14       2.889210   molybdopterin biosynthesis protein MoeA
ECP_0841  -1      16       2.291496   L-asparaginase
ECP_0842  0       16       -1.975448  glutathione transporter ATP-binding protein
ECP_0843  1       16       -4.330951  hypothetical protein
ECP_0844  0       12       -3.620048  hypothetical protein
ECP_0845  0       11       -4.390661  ABC transporter substrate-binding protein
ECP_0846  1       10       -4.512050  ABC transporter inner membrane component
ECP_0847  1       9        -4.867467  hypothetical protein
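
Rows of the per-ORF bias table can be split into their five fields with a single regular expression. The sketch below is an illustration under stated assumptions, not the tool's own code: it assumes locus tags of the form `ECP_` plus four digits, a single-digit DNBias (optionally negative), a one- or two-digit CDNBias, and a %GCBias with one integer digit and six decimals, which is what the values in this report suggest. The function name `parse_bias_row` is hypothetical. Whitespace between fields is optional, so the pattern accepts both column-separated rows and rows in which the fields run together.

```python
import re

# LocusTag, DNBias, CDNBias, %GCBias, Product; separators are optional.
ROW_RE = re.compile(
    r"^(ECP_\d{4})\s*(-?\d)\s*(\d{1,2})\s*(-?\d\.\d{6})\s*(.*)$"
)


def parse_bias_row(line):
    """Split one bias-table row into typed fields (assumed field widths, see above)."""
    match = ROW_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognized bias row: {line!r}")
    locus, dn_bias, cdn_bias, gc_bias, product = match.groups()
    return locus, int(dn_bias), int(cdn_bias), float(gc_bias), product


# The same row with and without column separators parses identically.
fused = parse_bias_row("ECP_084719-4.867467hypothetical protein")
spaced = parse_bias_row("ECP_0847  1       9        -4.867467  hypothetical protein")
print(fused)
```

The six-decimal anchor on %GCBias is what makes products that begin with a digit, such as 6-phosphogluconolactonase, separable from the number before them.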
13  ECP_1003  ECP_1036  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
ECP_1003  0       15       4.063539   TrpR binding protein WrbA
ECP_1004  -1      17       4.241441   hypothetical protein
ECP_1005  -1      17       4.348824   transporter
ECP_1006  -1      19       4.950012   flavin:NADH reductase YcdH
ECP_1007  0       15       3.824325   hypothetical protein
ECP_1008  -1      11       4.126466   hypothetical protein
ECP_1009  -1      12       3.303128   hypothetical protein
ECP_1010  -1      13       3.067624   isochorismatase YcdL
ECP_1011  -1      13       2.954447   monooxygenase YcdM
ECP_1012  -1      15       2.318833   TetR family transcriptional regulator
ECP_1013  -1      14       2.556976   trifunctional transcriptional regulator/proline
ECP_1014  -1      13       0.761678   hypothetical protein
ECP_1015  -1      13       0.629549   sodium/proline symporter
ECP_1016  -2      14       -0.886740  hypothetical protein
ECP_1017  -3      18       -3.023333  hypothetical protein
ECP_1018  -2      23       -3.764222  hypothetical protein
ECP_1019  -2      28       -6.259741  hypothetical protein
ECP_1020  -1      29       -6.449860  hypothetical protein
ECP_1021  -1      27       -6.026093  N-glycosyltransferase PgaC
ECP_1022  -2      27       -5.675994  lipoprotein YcdR
ECP_1023  -2      23       -4.587675  outer membrane protein PgaA
ECP_1024  -1      21       -4.272349  hypothetical protein
ECP_1025  -1      17       -1.465258  *2-hydroxyacid dehydrogenase
ECP_1026  -1      17       -2.589930  hydrolase
ECP_1027  -1      20       -3.968991  hypothetical protein
ECP_1028  2       22       -5.838086  hypothetical protein
ECP_1029  1       29       -8.043414  curli production assembly/transport component
ECP_1030  1       30       -8.076976  curli assembly protein CsgF
ECP_1031  0       34       -8.136483  curli assembly protein CsgE
ECP_1032  1       32       -8.061813  DNA-binding transcriptional regulator CsgD
ECP_1033  2       32       -5.678430  hypothetical protein
ECP_1034  -1      24       -3.530521  curlin minor subunit
ECP_1035  -1      19       -4.018267  cryptic curlin major subunit
ECP_1036  -1      16       -3.804680  autoagglutination protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1010  ISCHRISMTASE  75     3e-18    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 75.4 bits (185), Expect = 3e-18
Identities = 44/176 (25%), Positives = 71/176 (40%), Gaps = 23/176 (13%)

Query: 12 TFDPQQSALIVVDMQNAYATPGGYLDLAGFDVSTTRPVIANIQTAVTAARAAGMLIIWFQ 71
DP ++ L++ DMQN + +D S + ANI+ G+ +++
Sbjct: 25 VPDPNRAVLLIHDMQNYF------VDAFTAGASPVTELSANIRKLKNQCVQLGIPVVY-- 76

Query: 72 NGWDEQYVEAGGPGSPNFHKSNALKTMRNQPQLQGKLLAKGSWDYQLVDELMPQPGDIVL 131
PGS N L G L G ++ +++ EL P+ D+VL
Sbjct: 77 ---------TAQPGSQNPDDRALLTDF------WGPGLNSGPYEEKIITELAPEDDDLVL 121

Query: 132 PKPRYSGFFNTPLDSILRSRGIRHLVFTGIATNVCVESTLRDGFFLEYFGVVLEDA 187
K RYS F T L ++R G L+ TGI ++ T + F + + DA
Sbjct: 122 TKWRYSAFKRTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDA 177


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1012  HTHTETR       66     2e-15    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 65.8 bits (160), Expect = 2e-15
Identities = 30/165 (18%), Positives = 62/165 (37%), Gaps = 8/165 (4%)

Query: 10 GKRSRAVSAKKKAILSAALDTFSQFGFHGTRLEQIAELAGVSKTNLLYYFPSKEALYIAV 69
K + ++ IL AL FSQ G T L +IA+ AGV++ + ++F K L+ +
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 70 LRQILDIWLAPLKAFREDF--APLAAIKEYIRLKLEVSRDYPQASRLFCM-----EMLAG 122
++ F PL+ ++E + LE + + L + E +
Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGE 122

Query: 123 APLLMDELTGDLKALIDEKSALIAGWVKSGKL-APIDPQHLIFMI 166
++ D + +++ L A + + ++
Sbjct: 123 MAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIM 167


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1023  ARGDEIMINASE  30     0.047    Bacterial arginine deiminase signature.
		>ARGDEIMINASE#Bacterial arginine deiminase signature.

Length = 409

Score = 29.8 bits (67), Expect = 0.047
Identities = 27/183 (14%), Positives = 61/183 (33%), Gaps = 23/183 (12%)

Query: 450 WPRAAENELKK-AEVIEPRNINLEVEQAWTALTLQEWQQA--AVLTHDVVEREPQDPGVV 506
+ A E + A +++ + +E + + L ++ ++E E + +
Sbjct: 47 YLEVARQEHEVFASILKNNLVEIEYIEDLISEVLVSSVALENKFISQFILEAEIKTDFTI 106

Query: 507 -RLK---RAVDVHNLAELRIAGSTGIDAEGPDSGKHDVDLTTIVYS---PPLKDNWRGFA 559
LK ++ + N+ I+G E + DL P+ + F
Sbjct: 107 NLLKDYFSSLTIDNMISKMISGVVT--EELKNYTSSLDDLVNGANLFIIDPMPNVL--FT 162

Query: 560 GFGYADGQFSEGKGIVRDWLAGVEWRSRNIWLEAEYAERVFNHEHKPGARLSGWYDFNDN 619
D S G G+ + + + R E +AE +F + + W + +
Sbjct: 163 ----RDPFASIGNGVT---INKMFTKVRQ--RETIFAEYIFKYHPVYKENVPIWLNRWEE 213

Query: 620 WRI 622
+
Sbjct: 214 ASL 216


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1024  BINARYTOXINA  30     0.027    Clostridial binary toxin A signature.
		>BINARYTOXINA#Clostridial binary toxin A signature.

Length = 454

Score = 29.6 bits (66), Expect = 0.027
Identities = 22/77 (28%), Positives = 36/77 (46%), Gaps = 6/77 (7%)

Query: 335 DQVIKTVVNIIGKSIRPDDLLA--RVGGEEFGVLLTEIDTECAKALAERIRENVERLTGD 392
D + + N + + P +L+ R G +EFG+ LT + + K E I E+ G
Sbjct: 313 DSKVNNIENALKLTPIPSNLIVYRRSGPQEFGLTLTSPEYDFNK--IENIDAFKEKWEGK 370

Query: 393 NPEYAIPQKVTISIGAV 409
Y P ++ SIG+V
Sbjct: 371 VITY--PNFISTSIGSV 385


14  ECP_1059  ECP_1076  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
ECP_1059  2       16       1.217887   hypothetical protein
ECP_1060  2       16       1.307761   virulence factor MviM
ECP_1061  1       13       0.856192   virulence factor mviN-like protein
ECP_1062  1       17       0.853965   flagellar synthesis protein FlgN
ECP_1063  2       15       0.848318   anti-sigma-28 factor FlgM
ECP_1064  1       16       2.154400   flagellar basal body P-ring biosynthesis protein
ECP_1065  3       16       2.294772   flagellar basal-body rod protein FlgB
ECP_1066  3       14       2.215884   flagellar basal body rod protein FlgC
ECP_1067  3       12       2.380367   flagellar basal body rod modification protein
ECP_1068  1       12       2.369667   flagellar hook protein FlgE
ECP_1069  -1      13       2.453565   flagellar basal body rod protein FlgF
ECP_1070  -1      10       1.297390   flagellar basal body rod protein FlgG
ECP_1071  0       13       2.245209   flagellar basal body L-ring protein
ECP_1072  0       13       1.975842   flagellar basal body P-ring biosynthesis protein
ECP_1073  1       13       1.661371   flagellar rod assembly protein/muramidase FlgJ
ECP_1074  1       13       1.233585   flagellar hook-associated protein FlgK
ECP_1075  3       15       1.140128   flagellar hook-associated protein FlgL
ECP_1076  4       17       1.515561   ribonuclease E
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1068  FLGHOOKAP1    41     6e-06    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 41.1 bits (96), Expect = 6e-06
Identities = 17/49 (34%), Positives = 29/49 (59%)

Query: 353 TLTNGALEASNVDLSKELVNMIVAQRNYQSNAQTIKTQDQILNTLVNLR 401
L+N S V+L +E N+ Q+ Y +NAQ ++T + I + L+N+R
Sbjct: 498 QLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR 546



Score = 34.9 bits (80), Expect = 5e-04
Identities = 21/56 (37%), Positives = 29/56 (51%), Gaps = 4/56 (7%)

Query: 6 AVSGLNVAATNLDVIGNNIANSATYGFKSGTASFAD----MFAGSKVGLGVKVAGI 57
A+SGLN A L+ NNI++ G+ T A + AG VG GV V+G+
Sbjct: 7 AMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVSGV 62


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1070  FLGHOOKAP1    44     4e-07    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 43.8 bits (103), Expect = 4e-07
Identities = 18/81 (22%), Positives = 36/81 (44%), Gaps = 14/81 (17%)

Query: 3 SSLWIAKTGLDAQQTNMDVIANNLANVSTNGFKRQRAVFEDLLYQTIRQPGAQSSEQTTL 62
S + A +GL+A Q ++ +NN+++ + G+ RQ + + +TL
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTI--------------MAQANSTL 47

Query: 63 PSGLQIGTGVRPVATERLHSQ 83
+G +G GV +R +
Sbjct: 48 GAGGWVGNGVYVSGVQREYDA 68



Score = 41.1 bits (96), Expect = 3e-06
Identities = 11/41 (26%), Positives = 21/41 (51%)

Query: 220 ETSNVNVAEELVNMIQVQRAYEINSKAVSTTDQMLQKLTQL 260
S VN+ EE N+ + Q+ Y N++ + T + + L +
Sbjct: 505 SISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1071  FLGLRINGFLGH  350    e-126    Flagellar L-ring protein signature.
		>FLGLRINGFLGH#Flagellar L-ring protein signature.

Length = 232

Score = 350 bits (898), Expect = e-126
Identities = 232/232 (100%), Positives = 232/232 (100%)

Query: 4 MQKNAAHTYAISSLLVLSLTGCAWIPSTPLVQGATSAQPVPGPTPVANGSIFQSAQPINY 63
MQKNAAHTYAISSLLVLSLTGCAWIPSTPLVQGATSAQPVPGPTPVANGSIFQSAQPINY
Sbjct: 1 MQKNAAHTYAISSLLVLSLTGCAWIPSTPLVQGATSAQPVPGPTPVANGSIFQSAQPINY 60

Query: 64 GYQPLFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYLQGLFGNA 123
GYQPLFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYLQGLFGNA
Sbjct: 61 GYQPLFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYLQGLFGNA 120

Query: 124 RADVEASGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRF 183
RADVEASGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRF
Sbjct: 121 RADVEASGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRF 180

Query: 184 SGVVNPRTISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSPM 235
SGVVNPRTISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSPM
Sbjct: 181 SGVVNPRTISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSPM 232


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1072  FLGPRINGFLGI  426    e-151    Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 426 bits (1097), Expect = e-151
Identities = 157/363 (43%), Positives = 213/363 (58%), Gaps = 9/363 (2%)

Query: 4 FLSALILLLVITAAQAERIRDLTSVQGVRQNSLIGYGLVVGLDGTGDQTTQTPFTTQTLN 63
F + L A RI+D+ S+Q R N LIGYGLVVGL GTGD +PFT Q++
Sbjct: 13 FSALPFLSTPPAQADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDSLRSSPFTEQSMR 72

Query: 64 NMLSQLGITVPTGTNMQLKNVAAVMVTASLPPFGRQGQTIDVVVSSMGNAKSLRGGTLLM 123
ML LGIT G + KN+AAVMVTA+LPPF G +DV VSS+G+A SLRGG L+M
Sbjct: 73 AMLQNLGITTQGGQS-NAKNIAAVMVTANLPPFASPGSRVDVTVSSLGDATSLRGGNLIM 131

Query: 124 TPLKGVDSQVYALAQGNILVGGAGASAGGSSVQVNQLNGGRITNGAVIERELPSQFGVGN 183
T L G D Q+YA+AQG ++V G A +++ R+ NGA+IERELPS+F
Sbjct: 132 TSLSGADGQIYAVAQGALIVNGFSAQGDAATLTQGVTTSARVPNGAIIERELPSKFKDSV 191

Query: 184 TLNLQLNDEDFSMAQQIADTINRVR----GYGSATALDARTIQVRVPSGNSSQVRFLADI 239
L LQL + DFS A ++AD +N G A D++ I V+ P + R +A+I
Sbjct: 192 NLVLQLRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQEIAVQKPRV-ADLTRLMAEI 250

Query: 240 QNMQVNVTPQDAKVVINSRTGSVVMNREVTLDSCAVAQGNLSVTVNRQANVSQPDTPFGG 299
+N+ V T AKVVIN RTG++V+ +V + AV+ G L+V V V QP PF
Sbjct: 251 ENLTVE-TDTPAKVVINERTGTIVIGADVRISRVAVSYGTLTVQVTESPQVIQP-APFSR 308

Query: 300 GQTVVTPQTQIDLRQSGGSLQSVRSSASLNNVVRALNALGATPMDLMSILQSMQSAGCLR 359
GQT V PQT I Q G + ++ L +V LN++G +++ILQ ++SAG L+
Sbjct: 309 GQTAVQPQTDIMAMQEGSKV-AIVEGPDLRTLVAGLNSIGLKADGIIAILQGIKSAGALQ 367

Query: 360 AKL 362
A+L
Sbjct: 368 AEL 370


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1073  FLGFLGJ       509    0.0      Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 509 bits (1311), Expect = 0.0
Identities = 309/313 (98%), Positives = 310/313 (99%)

Query: 1 MISDSKLLASAAWDAQSLNELKAKAGEDPAANIRPVARQVEGMFVQMMLKGMRDALPKDG 60
MISDSKLLASAAWDAQSLNELKAKAGEDPAANIRPVARQVEGMFVQMMLK MRDALPKDG
Sbjct: 1 MISDSKLLASAAWDAQSLNELKAKAGEDPAANIRPVARQVEGMFVQMMLKSMRDALPKDG 60

Query: 61 LFSSEHTRLYTSMYDQQIAQQMTAGKGLGLAEMMVKQMTPEQPLPEEPTPAAPMKFPLET 120
LFSSEHTRLYTSMYDQQIAQQMTAGKGLGLAEMMVKQMTPEQPLPEE TPAAPMKFPLET
Sbjct: 61 LFSSEHTRLYTSMYDQQIAQQMTAGKGLGLAEMMVKQMTPEQPLPEESTPAAPMKFPLET 120

Query: 121 VVRYQNQALSQLVQKAVPRNYDDSLPGNSKAFLAQLSLPAQLASQQSGVPHHLILAQAAL 180
VVRYQNQALSQLVQKAVPRNYDDSLPG+SKAFLAQLSLPAQLASQQSGVPHHLILAQAAL
Sbjct: 121 VVRYQNQALSQLVQKAVPRNYDDSLPGDSKAFLAQLSLPAQLASQQSGVPHHLILAQAAL 180

Query: 181 ESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRVYSSYL 240
ESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRVYSSYL
Sbjct: 181 ESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRVYSSYL 240

Query: 241 EALSDYVGLLTRNPRYAAVTTAVSAEQGAQALQDAGYATDPHYARKLTNMIQQMKSISDK 300
EALSDYVGLLTRNPRYAAVTTA SAEQGAQALQDAGYATDPHYARKLTNMIQQMKSISDK
Sbjct: 241 EALSDYVGLLTRNPRYAAVTTAASAEQGAQALQDAGYATDPHYARKLTNMIQQMKSISDK 300

Query: 301 VSKTYSMNIDNLF 313
VSKTYSMNIDNLF
Sbjct: 301 VSKTYSMNIDNLF 313


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1074  FLGHOOKAP1    677    0.0      Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 677 bits (1747), Expect = 0.0
Identities = 540/546 (98%), Positives = 544/546 (99%)

Query: 2 SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVS 61
SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVS
Sbjct: 1 SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVS 60

Query: 62 GVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLV 121
GVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLV
Sbjct: 61 GVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLV 120

Query: 122 SNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLND 181
SNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLND
Sbjct: 121 SNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLND 180

Query: 182 QISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTA 241
QISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTA
Sbjct: 181 QISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTA 240

Query: 242 RQLAAVPSTADPSRTTVAYIDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQL 301
RQLAAVPS+ADPSRTTVAY+DGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQL
Sbjct: 241 RQLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQL 300

Query: 302 ALAFAGAFNTQHKAGFDANGDAGKDFFAIGKPAVLQNTKNNGDVAIGATVTDASAVLATD 361
ALAFA AFNTQHKAGFDANGDAG+DFFAIGKPAVLQNTKN GDVAIGATVTDASAVLATD
Sbjct: 301 ALAFAEAFNTQHKAGFDANGDAGEDFFAIGKPAVLQNTKNKGDVAIGATVTDASAVLATD 360

Query: 362 YKISFDNNQWQVTRLASNTTFTVTPDANGKVAFDGLELTFTGTPAVNDSFTLKPVSDAIV 421
YKISFDNNQWQVTRLASNTTFTVTPDANGKVAFDGLELTFTGTPAVNDSFTLKPVSDAIV
Sbjct: 361 YKISFDNNQWQVTRLASNTTFTVTPDANGKVAFDGLELTFTGTPAVNDSFTLKPVSDAIV 420

Query: 422 NMDVLITDEAKIAMASEEDAGDSDNRNGQALLDLQSNNKTVGGAKSFNDAYASLVSDIGN 481
NMDVLITDEAKIAMASEEDAGDSDNRNGQALLDLQSN+KTVGGAKSFNDAYASLVSDIGN
Sbjct: 421 NMDVLITDEAKIAMASEEDAGDSDNRNGQALLDLQSNSKTVGGAKSFNDAYASLVSDIGN 480

Query: 482 KTATLKTSSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFD 541
KTATLKTSSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFD
Sbjct: 481 KTATLKTSSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFD 540

Query: 542 ALINIR 547
ALINIR
Sbjct: 541 ALINIR 546


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1075  FLAGELLIN     46     8e-08    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 46.2 bits (109), Expect = 8e-08
Identities = 42/226 (18%), Positives = 80/226 (35%), Gaps = 9/226 (3%)

Query: 7 MMYQQNMRGITNSQAEWMKYGEQMSTGKRVVNPSDDPIAASQAVVLSQAQAQNSQYTLAR 66
++ Q N+ +S + + E++S+G R+ + DD + A + +Q +
Sbjct: 11 LLTQNNLNKSQSSLSSAI---ERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQASRNA 67

Query: 67 TFATQKVSLEESVLSQVTTAIQNAQEKIVYASNGTLSDDDRASLATDIQGLRDQLLNLAN 126
E L+++ +Q +E V A+NGT SD D S+ +IQ +++ ++N
Sbjct: 68 NDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEIDRVSN 127

Query: 127 TTDGNGRYIFAGYKTETAPFSEADGDYVGGTESIKQQVDASRSMVIGHTGDKIFDSITSN 186
T NG + + DG E+I + +G G + +
Sbjct: 128 QTQFNGVKVLSQDNQMKIQVGANDG------ETITIDLQKIDVKSLGLDGFNVNGPKEAT 181

Query: 187 AVAEPDGSASETNLFAMLDSAIAALKTPVADSEADKETAAAALDKT 232
+ T A + + TA DK
Sbjct: 182 VGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKV 227


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1076  IGASERPTASE   65     2e-12    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 65.1 bits (158), Expect = 2e-12
Identities = 42/226 (18%), Positives = 79/226 (34%), Gaps = 12/226 (5%)

Query: 590 PAEQSAPKAEAKPERQQDRR-----KPRQNNRRDRNERRDTRSERTEGSDNREENRRNRR 644
P+ S + A+ + N ++++++ D E +NR
Sbjct: 1008 PSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNRE 1067

Query: 645 QAQQQTAETRESRQQAEV------TEKARTTDEQQAPRRERSRRRNDDKRQAQQEVKALN 698
A++ + + + Q EV T++ +TT+ ++ E+ + + + QEV +
Sbjct: 1068 VAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEK-TQEVPKVT 1126

Query: 699 VEEQSVQETEQEERVRPVQPRRKQRQLNQKVRYEQSVAEEAVVAPVVEETVAAEPIVQEA 758
+ QE + + + R +N K Q+ P E + E V E+
Sbjct: 1127 SQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTES 1186

Query: 759 AAPRTELVKVPLPVVAQTAPEQQEENNADNRDNGGMPRRSRRSPRH 804
T V P A Q N+ + RRS RS H
Sbjct: 1187 TTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPH 1232



Score = 62.8 bits (152), Expect = 6e-12
Identities = 48/289 (16%), Positives = 92/289 (31%), Gaps = 38/289 (13%)

Query: 513 PSEEEFAERKRPEQPALATFAMPDVPPAPT-PAEPAAPVVAPAPKAATATPASPAQPGLL 571
P E+ + DVP P+ E A AP P A ATP+ +
Sbjct: 983 PEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTE---- 1038

Query: 572 SRFFGALKALFSGGEETKPAEQSAPKAEAKPERQQDRRKP-RQNNRRDRNERRDTRSER- 629
AE S +++ + +QD + QN + + + ++
Sbjct: 1039 -----------------TVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQ 1081

Query: 630 -TEGSDNREENRRNRRQAQQQTAETRESRQQAEVTEKARTTDEQQAPRRERSRRRNDDKR 688
E + + E + + ++TA + + TEK + + + + + +
Sbjct: 1082 TNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQP 1141

Query: 689 QAQ---QEVKALNVEEQSVQETEQEERVRPVQPRRKQRQLNQKVRYEQSV--AEEAVVAP 743
QA+ + +N++E Q + +P + + Q V +V V P
Sbjct: 1142 QAEPARENDPTVNIKEPQSQTNTTADTEQPA--KETSSNVEQPVTESTTVNTGNSVVENP 1199

Query: 744 VVEETVAAEPIVQEAAA------PRTELVKVPLPVVAQTAPEQQEENNA 786
+P V ++ R + VP V T A
Sbjct: 1200 ENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVA 1248


15  ECP_1131  ECP_1222  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
ECP_1131  2       23       -2.418307   isocitrate dehydrogenase
ECP_1132  3       28       -3.296457   hypothetical protein
ECP_1133  3       28       -2.451979   transposase/IS protein
ECP_1134  2       27       -2.935538   transposase for insertion sequence IS100
ECP_1135  1       27       -3.748428   hypothetical protein
ECP_1136  1       27       -1.327943   excisionase
ECP_1137  3       26       -1.045593   hypothetical protein
ECP_1138  2       24       -0.279666   hypothetical protein
ECP_1139  -1      23       -2.154621   hypothetical protein
ECP_1140  0       24       -6.432163   exonuclease
ECP_1141  1       26       -7.498225   bacteriophage recombination protein
ECP_1142  1       32       -9.349610   host-nuclease inhibitor protein Gam
ECP_1143  1       31       -8.378057   prophage Kil protein
ECP_1144  1       28       -7.819619   phage associated protein
ECP_1145  0       22       -4.930187   hypothetical protein
ECP_1146  -1      22       -2.949605   phage-related repressor protein
ECP_1147  -1      25       -4.277279   hypothetical protein
ECP_1148  0       24       -3.789184   bacteriophage regulatory protein CII
ECP_1149  0       25       -4.071843   replication protein O
ECP_1150  0       23       -3.778596   replication protein 14
ECP_1151  3       29       -7.405842   hypothetical protein
ECP_1152  3       25       -6.136387   hypothetical protein
ECP_1153  2       25       -4.799093   hypothetical protein
ECP_1154  2       25       -5.437484   hypothetical protein
ECP_1155  3       27       -6.045680   endodeoxyribonuclease RUS
ECP_1156  3       28       -6.153689   antitermination protein Q
ECP_1157  5       28       -6.308071   outer membrane porin protein LC
ECP_1158  1       28       -5.086692   hypothetical protein
ECP_1159  3       29       -4.446391   lysis protein S
ECP_1160  0       30       -5.088527   lysozyme
ECP_1161  0       29       -4.036478   endopeptidase
ECP_1162  1       19       -0.836826   Bor protein
ECP_1163  1       21       0.212051    TonB-like membrane protein
ECP_1164  0       21       2.660171    hypothetical protein
ECP_1165  0       22       4.317517    hypothetical protein
ECP_1166  0       24       5.105753    terminase small subunit
ECP_1167  1       24       5.545577    terminase large subunit
ECP_1168  4       26       6.672910    head-to-tail joining protein W
ECP_1169  4       25       6.900712    minor capsid protein B
ECP_1170  3       26       6.741385    minor capsid protein C
ECP_1171  2       25       5.130113    major capsid protein D
ECP_1172  3       23       5.040749    major coat protein
ECP_1173  3       25       4.557668    DNA packaging protein FI
ECP_1174  3       26       4.493831    tail attachment protein minor capsid protein
ECP_1175  3       28       5.309040    minor tail protein Z
ECP_1176  5       29       5.503135    minor tail protein U
ECP_1177  4       29       5.546027    major tail protein V
ECP_1178  5       30       5.763148    minor tail protein G
ECP_1179  5       30       6.430773    minor tail protein T
ECP_1180  4       28       5.615670    minor tail protein H
ECP_1181  2       25       2.920253    minor tail protein M
ECP_1182  2       24       2.312582    minor tail protein L
ECP_1183  2       24       2.113594    tail assembly protein K
ECP_1184  2       22       1.052537    tail assembly protein I
ECP_1185  2       20       0.429793    host specificity protein J
ECP_1186  0       23       -2.270285   tail fiber protein
ECP_1187  1       19       -1.715511   hypothetical protein
ECP_1188  1       19       -1.965683   hypothetical protein
ECP_1189  2       22       -2.832320   iron transport protein, inner membrane
ECP_1190  3       25       -3.484855   iron transport protein, inner membrane
ECP_1191  2       31       -5.655420   iron ABC transporter ATP-binding protein
ECP_1192  4       33       -7.072654   iron transport protein, periplasmic-binding
ECP_1193  4       39       -9.903782   acetyltransferase
ECP_1194  2       39       -8.469940   transcriptional regulator
ECP_1195  1       35       -8.692130   hypothetical protein
ECP_1196  1       34       -8.792640   hypothetical protein
ECP_1197  0       32       -8.389608   hypothetical protein
ECP_1198  1       36       -9.354590   hypothetical protein
ECP_1199  3       36       -9.261206   transcriptional regulator YcgE
ECP_1200  2       35       -10.222600  hypothetical protein
ECP_1201  3       36       -11.151416  hypothetical protein
ECP_1202  4       36       -12.018061  hypothetical protein
ECP_1203  4       39       -12.223196  hypothetical protein
ECP_1204  2       30       -9.502356   hypothetical protein
ECP_1205  0       28       -8.512738   hypothetical protein
ECP_1206  3       33       -7.549818   hypothetical protein
ECP_1207  2       33       -8.370431   hypothetical protein
ECP_1208  1       34       -8.189081   hypothetical protein
ECP_1209  0       28       -5.287590   hypothetical protein
ECP_1210  2       33       -6.759033   hypothetical protein
ECP_1211  -1      34       -7.025870   hypothetical protein
ECP_1212  2       26       -5.997385   hypothetical protein
ECP_1213  0       23       -3.557502   hypothetical protein
ECP_1214  0       21       -3.364535   autotransporter
ECP_1215  0       24       -4.701936   hypothetical protein
ECP_1216  -1      22       -5.030612   cell division topological specificity factor
ECP_1217  0       21       -5.062072   cell division inhibitor MinD
ECP_1218  -1      23       -4.574518   septum formation inhibitor
ECP_1219  -3      19       -5.647801   hypothetical protein
ECP_1220  -2      19       -6.040620   hypothetical protein
ECP_1221  -1      18       -5.063197   protein YcgK
ECP_1222  -2      19       -3.058948   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1134  HTHTETR       28     0.044    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 28.1 bits (62), Expect = 0.044
Identities = 12/106 (11%), Positives = 31/106 (29%), Gaps = 14/106 (13%)

Query: 16 QGMSSRAIARELGISRNTVKRYLQAKSEPPKYTPRPAVASLLDEYRDYIRQRIAD----- 70
S IA+ G++R + + + KS+ + + + I + +
Sbjct: 30 SSTSLGEIAKAAGVTRGAIYWHFKDKSD--------LFSEIWELSESNIGELELEYQAKF 81

Query: 71 -AHPYKIPATVIAREIRDQGYRGGMTILRAFIRSLSVPQEQEPAVR 115
P + ++ + +L I + V+
Sbjct: 82 PGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQ 127


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1157  ECOLIPORIN    507    0.0      E.coli/Salmonella-type porin signature.
		>ECOLIPORIN#E.coli/Salmonella-type porin signature.

Length = 383

Score = 507 bits (1307), Expect = 0.0
Identities = 241/388 (62%), Positives = 280/388 (72%), Gaps = 33/388 (8%)

Query: 1 MKKLTVAISAVAASVLMAMSAQAAEIYNKDSNKLDLYGKVNAKHYFSSNDADDGDTTYAR 60
MK+ +A+ V ++L A +A AAEIYNKD NKLDLYGKV+ HYFS + + DGD TY R
Sbjct: 1 MKRKVLAL--VIPALLAAGAAHAAEIYNKDGNKLDLYGKVDGLHYFSDDSSKDGDQTYMR 58

Query: 61 LGFKGETQINDQLTGFGQWEYEFKGNRAESQGSSKDKTRLAFAGLKFGDYGSIDYGRNYG 120
+GFKGETQINDQLTG+GQWEY + N E +G++ TRLAFAGLKFGDYGS DYGRNYG
Sbjct: 59 VGFKGETQINDQLTGYGQWEYNVQANTTEGEGANS-WTRLAFAGLKFGDYGSFDYGRNYG 117

Query: 121 VAYDIGAWTDVLPEFGGDTWTQTDVFMTGRTTGVATYRNNDFFGLVDGLNFAAQYQGKND 180
V YD+ WTD+LPEFGGD++T D +MTGR GVATYRN DFFGLVDGLNFA QYQGKN+
Sbjct: 118 VLYDVEGWTDMLPEFGGDSYTYADNYMTGRANGVATYRNTDFFGLVDGLNFALQYQGKNE 177

Query: 181 R----------------TDVTEANGDGFGFSTTYEY-EGFGVGATYAKSDRTDGQVAYGK 223
D+ NGDGFG STTY+ GF GA Y SDRT+ QV G
Sbjct: 178 SQSADDVNIGTNNRNNGDDIRYDNGDGFGISTTYDIGMGFSAGAAYTTSDRTNEQVNAGG 237

Query: 224 SKFNASGKNAEVWAAGLKYDANNIYLATTYSETQNMTVFG------NNHIANKAQNFEAV 277
+ A G A+ W AGLKYDANNIYLAT YSET+NMT +G + +ANK QNFE
Sbjct: 238 T--IAGGDKADAWTAGLKYDANNIYLATMYSETRNMTPYGKTDKGYDGGVANKTQNFEVT 295

Query: 278 AQYQFDFGLRPSVAYLQSKGKDLGVH----GDRDLVKYVDVGATYYFNKNMSTFVDYKIN 333
AQYQFDFGLRP+V++L SKGKDL + D+DLVKY DVGATYYFNKN ST+VDYKIN
Sbjct: 296 AQYQFDFGLRPAVSFLMSKGKDLTYNNVNGDDKDLVKYADVGATYYFNKNFSTYVDYKIN 355

Query: 334 LID-DSKFTKTAGIDTDDIVAVGLVYQF 360
L+D D F K AGI TDDIVA+G+VYQF
Sbjct: 356 LLDDDDPFYKDAGISTDDIVALGMVYQF 383


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1162  PF06291       175    5e-61    Lambda prophage Bor protein
		>PF06291#Lambda prophage Bor protein

Length = 102

Score = 175 bits (444), Expect = 5e-61
Identities = 97/97 (100%), Positives = 97/97 (100%)

Query: 1 MKKMLFSAALAMLITGCAQQTFTVGNKPTAVTPKETITHHFFVSGIGQKKTVDAAKICGG 60
MKKMLFSAALAMLITGCAQQTFTVGNKPTAVTPKETITHHFFVSGIGQKKTVDAAKICGG
Sbjct: 6 MKKMLFSAALAMLITGCAQQTFTVGNKPTAVTPKETITHHFFVSGIGQKKTVDAAKICGG 65

Query: 61 AENVVKTETQQTFVNGLLGFITLGIYTPLEARVYCSQ 97
AENVVKTETQQTFVNGLLGFITLGIYTPLEARVYCSQ
Sbjct: 66 AENVVKTETQQTFVNGLLGFITLGIYTPLEARVYCSQ 102


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1163  TONBPROTEIN   68     2e-17    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 68.5 bits (167), Expect = 2e-17
Identities = 33/82 (40%), Positives = 46/82 (56%)

Query: 23 ADEPRQLVTVYPRYPEYAAANYIKGLVEVKFDIGADGTVTRIVFLRSEPHNLFRDEVVKA 82
A PR L P+YP A A I+G V+VKFD+ DG V + L ++P N+F EV A
Sbjct: 150 ASGPRALSRNQPQYPARAQALRIEGQVKVKFDVTPDGRVDNVQILSAKPANMFEREVKNA 209

Query: 83 MAKWRFEKNRPCQGVKRQFIFT 104
M +WR+E +P G+ +F
Sbjct: 210 MRRWRYEPGKPGSGIVVNILFK 231


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1180  GPOSANCHOR    36     6e-04    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 36.2 bits (83), Expect = 6e-04
Identities = 43/240 (17%), Positives = 81/240 (33%), Gaps = 24/240 (10%)

Query: 377 TLQADLEKAREMAAKDWAESEASRLKYTEEAQKAYERLQTPLEKYTARQEELNKALKDGK 436
L A + S A + + L+ + E
Sbjct: 222 ALAARKADLEKALEGAMNFSTADS-AKIKTLEAEKAALEARQAELEKALEGAMNFSTADS 280

Query: 437 ILQTDYNTLMAAAKKDYEATLKKPKQSGVKVSAGERQEDSAHAALLTLQAELRTLEKHAG 496
AA + + + + + R D++ A L+AE + LE+
Sbjct: 281 AKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNK 340

Query: 497 ANEKISQQ-RRDL-------WKAESQFAVLKEAAQRRQLSAQEKS--LLAHKDETLEYKR 546
+E Q RRDL + E++ L+E + + S Q L A ++ + ++
Sbjct: 341 ISEASRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQVEK 400

Query: 547 QLAALGDKVTYQEHLNALAQQADKFAQQQRAKRAAIDAKNRGLTDRQAAREATEQRLKEQ 606
L K+ E LN +++ K ++++A + QA EA + LKE+
Sbjct: 401 ALEEANSKLAALEKLNKELEESKKLTEKEKA-------------ELQAKLEAEAKALKEK 447


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1184  PF06291       28     0.014    Lambda prophage Bor protein
		>PF06291#Lambda prophage Bor protein

Length = 102

Score = 27.7 bits (61), Expect = 0.014
Identities = 13/40 (32%), Positives = 19/40 (47%), Gaps = 5/40 (12%)

Query: 135 MTGILFSLGASMVLGGVAQML-----APKARTPRTQTTDN 169
M +LFS +M++ G AQ P A TP+ T +
Sbjct: 6 MKKMLFSAALAMLITGCAQQTFTVGNKPTAVTPKETITHH 45


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1186  IGASERPTASE   41     2e-05    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 40.8 bits (95), Expect = 2e-05
Identities = 27/132 (20%), Positives = 56/132 (42%), Gaps = 15/132 (11%)

Query: 121 SQSAAAAKKSETAAASSRNA--AKTSETNAGNSAKAAASSKTAAQNAATAAERSETNARA 178
S + A+ E A ++T+ET A NS + + + + Q+A + N
Sbjct: 1012 SNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQ---NREV 1068

Query: 179 SEEASADSEEASRRN--AESAAENAGVATTKAREAAADATKAGQKKDEALSAATRAEKAA 236
++EA ++ + ++ N A+S +E +E TK ++ A EK
Sbjct: 1069 AKEAKSNVKANTQTNEVAQSGSE--------TKETQTTETKETATVEKEEKAKVETEKTQ 1120

Query: 237 DRAEVAAEVTAE 248
+ +V ++V+ +
Sbjct: 1121 EVPKVTSQVSPK 1132


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1192  adhesinb      329    e-115    Adhesin B signature.
		>adhesinb#Adhesin B signature.

Length = 310

Score = 329 bits (846), Expect = e-115
Identities = 90/296 (30%), Positives = 163/296 (55%), Gaps = 7/296 (2%)

Query: 9 MLLGGLALTCSIAFQASATEKFKVITTFTIIADMAKNVAGDAAEVSSITKPGAEIHEYQP 68
+G A + + + + K V+ T +IIAD+ KN+AGD + SI G + HEY+P
Sbjct: 13 AFVGLAACSSQKSSTETGSSKLNVVATNSIIADITKNIAGDKINLHSIVPVGQDPHEYEP 72

Query: 69 TPGDIKRAQGAQLILANGMNLEL----WFQRFYQHLNGVPE---VIVSSGVTPVGITEGP 121
P D+K+ A LI NG+NLE WF + ++ VS GV + +
Sbjct: 73 LPEDVKKTSQADLIFYNGINLETGGNAWFTKLVENAKKKENKDYYAVSEGVDVIYLEGQS 132

Query: 122 YEGKPNPHAWMSPDNALIYVDNIRDALIKYDPANAQTYQRNADTYKAKITQTLAPLRKQI 181
+GK +PHAW++ +N +IY NI L + DPAN +TY++N Y K++ +++
Sbjct: 133 EKGKEDPHAWLNLENGIIYAQNIAKRLSEKDPANKETYEKNLKAYVEKLSALDKEAKEKF 192

Query: 182 TELPENQRWMVTSEGAFSYLARDLGLKELYLWPINADQQGTPQQVRKVVDIVKKNHIPAV 241
+P ++ +VTSEG F Y ++ + Y+W IN +++GTP Q++ +V+ ++K +P++
Sbjct: 193 NNIPGEKKMIVTSEGCFKYFSKAYNVPSAYIWEINTEEEGTPDQIKTLVEKLRKTKVPSL 252

Query: 242 FSESTISDKPARQVARETGAHYGGVLYVDSLSTENGPVPTYIDLLKVTTSTLVQGI 297
F ES++ D+P + V+++T ++ DS++ + +Y ++K + +G+
Sbjct: 253 FVESSVDDRPMKTVSKDTNIPIYAKIFTDSVAEKGEEGDSYYSMMKYNLEKIAEGL 308


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1193  SACTRNSFRASE  49     9e-10    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 49.2 bits (117), Expect = 9e-10
Identities = 17/61 (27%), Positives = 25/61 (40%)

Query: 126 NDYWWIKSFYIAPEHRGMGLADELIKHLIKEAKSEKALELRLYVHGDNGRAIRAYERCGF 185
N Y I+ +A ++R G+ L+ I+ AK L L N A Y + F
Sbjct: 87 NGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHF 146

Query: 186 I 186
I
Sbjct: 147 I 147


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1194  HTHTETR       28     0.021    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 28.1 bits (62), Expect = 0.021
Identities = 9/41 (21%), Positives = 21/41 (51%), Gaps = 2/41 (4%)

Query: 3 KRAKNQIVDSDIARLLLKLRKSRNLTVTELAQRSGVSQAMI 43
+ + I+D A L + + ++ E+A+ +GV++ I
Sbjct: 10 QETRQHILDV--ALRLFSQQGVSSTSLGEIAKAAGVTRGAI 48


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1195  SACTRNSFRASE  30     0.002    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 30.3 bits (68), Expect = 0.002
Identities = 15/59 (25%), Positives = 23/59 (38%), Gaps = 5/59 (8%)

Query: 72 LEALFVDASARGLGVGKHLISHAL--ALHPD---LSVDVNEQNHQAVGFYQHMGFKLSG 125
+E + V R GVG L+ A+ A L ++ + N A FY F +
Sbjct: 92 IEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFIIGA 150


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1214  PRTACTNFAMLY  43     5e-08    Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 42.7 bits (100), Expect = 5e-08
Identities = 27/101 (26%), Positives = 46/101 (45%), Gaps = 1/101 (0%)

Query: 8 TRSIYRELGATLSYNMRLGNGMEIEPWLKAAVRKEFVDDNRVKVNSDGNFVNDLSGRRGI 67
S+ LG + + L G +++P++KA+V +EF V N + +L G R
Sbjct: 811 GSSVLGRLGLEVGKRIELAGGRQVQPYIKASVLQEFDGAGTVHTNGIAH-RTELRGTRAE 869

Query: 68 YQAGIKASFSSTLSGHLGVGYSRGAGVESPWNAVAGVNWSF 108
G+ A+ S + YS+G + PW AG +S+
Sbjct: 870 LGLGMAAALGRGHSLYASYEYSKGPKLAMPWTFHAGYRYSW 910


16  ECP_1297  ECP_1318  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
ECP_1297  -2      21       -4.302167   hypothetical protein
ECP_1298  -2      20       -3.962887   voltage-gated potassium channel
ECP_1299  1       17       -1.767172   hypothetical protein
ECP_1300  0       18       -3.054920   transporter
ECP_1301  -1      22       -5.069496   acyl-CoA thioester hydrolase
ECP_1302  -1      22       -5.236629   intracellular septation protein A
ECP_1303  -2      18       -3.151657   hypothetical protein
ECP_1304  0       14       -1.517098   outer membrane protein W
ECP_1305  2       12       -0.703554   protein YciE
ECP_1306  1       11       1.702700    protein YciF
ECP_1307  1       12       2.699949    hypothetical protein
ECP_1308  0       11       2.826910    tryptophan synthase subunit alpha
ECP_1309  0       12       2.287476    tryptophan synthase subunit beta
ECP_1310  -1      9        0.542543    bifunctional indole-3-glycerol phosphate
ECP_1311  -2      10       0.656230    bifunctional glutamine
ECP_1312  -2      15       -2.523222   anthranilate synthase component I
ECP_1313  -2      22       -4.729089   phosphotransferase domain-containing protein
ECP_1314  -1      24       -5.648474   hypothetical protein
ECP_1315  -2      23       -5.103596   membrane protein YciQ
ECP_1316  -2      27       -5.425237   23S rRNA pseudouridylate synthase B
ECP_1317  -1      25       -5.969241   hypothetical protein
ECP_1318  0       21       -3.020617   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1299  adhesinmafb   30     8e-04    Neisseria meningitidis: adhesin MafB signature.
		>adhesinmafb#Neisseria meningitidis: adhesin MafB signature.

Length = 467

Score = 30.4 bits (68), Expect = 8e-04
Identities = 16/57 (28%), Positives = 20/57 (35%), Gaps = 2/57 (3%)

Query: 41 GPMPAVDSNDPGTAGFTGSTVIAEFESLEAAQAWADADPYVAAGVYEHVSVKPFKKV 97
P+PA G GS E + EA W +P A V +V KV
Sbjct: 268 APLPA--EGKFAVIGGLGSVAGFEKNTREAVDRWIQENPNAAETVEAVFNVAAAAKV 322


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1300  TONBPROTEIN   253    1e-87    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 253 bits (648), Expect = 1e-87
Identities = 234/239 (97%), Positives = 236/239 (98%), Gaps = 1/239 (0%)

Query: 6 MTLDLPRRFPWPTLLSVCIHGAVVAGLLYTSVHQVIELPAPAQPISVTMVAPADLEPPQA 65
MTLDLPRRFPWPTLLSVCIHGAVVAGLLYTSVHQVIELPAPAQPISVTMV PADLEPPQA
Sbjct: 1 MTLDLPRRFPWPTLLSVCIHGAVVAGLLYTSVHQVIELPAPAQPISVTMVTPADLEPPQA 60

Query: 66 VQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQ-PKRDVKPVESR 124
VQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKV++ PKRDVKPVESR
Sbjct: 61 VQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPVESR 120

Query: 125 PASPFENTAPARPTSSTATAATSKPVTSVASGPRALSRNQPQYPARAQALRIEGQVKVKF 184
PASPFENTAPAR TSSTATAATSKPVTSVASGPRALSRNQPQYPARAQALRIEGQVKVKF
Sbjct: 121 PASPFENTAPARLTSSTATAATSKPVTSVASGPRALSRNQPQYPARAQALRIEGQVKVKF 180

Query: 185 DVTPDGRVDNVQILSAKPANMFEREVKNAMRRWRYEPGKPGSGIVVNILFKINGTTEIQ 243
DVTPDGRVDNVQILSAKPANMFEREVKNAMRRWRYEPGKPGSGIVVNILFKINGTTEIQ
Sbjct: 181 DVTPDGRVDNVQILSAKPANMFEREVKNAMRRWRYEPGKPGSGIVVNILFKINGTTEIQ 239


17  ECP_1378  ECP_1397  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
ECP_1378  0       19       -3.107613   mandelate racemase/muconate lactonizing enzyme
ECP_1379  0       23       -4.609306   murein peptide amidase A
ECP_1380  1       20       -4.995081   hypothetical protein
ECP_1381  0       17       -3.977464   hypothetical protein
ECP_1382  -1      16       -3.448814   LysR family transcriptional regulator
ECP_1385  -2      15       -3.714900   hypothetical protein
ECP_1386  -1      15       -3.040580   hypothetical protein
ECP_1387  0       20       -4.718806   universal stress protein UspE
ECP_1388  0       22       -4.659409   fumarate/nitrate reduction transcriptional
ECP_1389  0       25       -4.839855   O-6-alkylguanine-DNA:cysteine-protein
ECP_1390  -1      24       -4.570347   AraC family transcriptional regulator
ECP_1391  -1      23       -4.126663   multidrug resistance protein A
ECP_1392  -1      21       -2.954470   multidrug resistance protein B
ECP_1393  0       16       -0.861827   hypothetical protein
ECP_1394  -1      16       -1.197737   hypothetical protein
ECP_1395  0       17       -1.589457   zinc transporter
ECP_1396  -1      18       -2.253823   hypothetical protein
ECP_1397  -1      16       -3.151945   ATP-dependent RNA helicase DbpA
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1391  RTXTOXIND     114    9e-31    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.
Length = 478

Score = 114 bits (288), Expect = 9e-31
Identities = 65/410 (15%), Positives = 136/410 (33%), Gaps = 97/410 (23%)

Query: 11 VVAIGILLAGVVFFIW-WVSK--------GRFIQTTDDAYIGGNITTVASKVSGYISAIE 61
+VA I+ V+ FI + + G+ G + + + I
Sbjct: 59 LVAYFIMGFLVIAFILSVLGQVEIVATANGKLT-------HSGRSKEIKPIENSIVKEII 111

Query: 62 VRDNQSVKKGDIILRLDDRDYRANVARLEAKIKSSKANLESIQATI-------------- 107
V++ +SV+KGD++L+L A+ + ++ + ++ Q
Sbjct: 112 VKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLP 171

Query: 108 -------------AMQQSIIQSASETWQAVKHEEQKRLRD--------TERYEKLAQSAA 146
S+I+ TWQ K++++ L R + +
Sbjct: 172 DEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSR 231

Query: 147 ISQQIIDNAR-------FDYQQVAAKERK---AANDFLVEKQRLAVLSAQEENVRASI-- 194
+ + +D+ V +E K A N+ V K +L + ++ + +
Sbjct: 232 VEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQL 291

Query: 195 ------EEVLAALTQALL--------------DLEYTLVRAPIDGIVANRSAHT-GSWVE 233
E+L L Q + +++RAP+ V HT G V
Sbjct: 292 VTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVT 351

Query: 234 GGTSLVSLVPVSE-LWVDANYKENQIAGMKPGMKAEIRADILKGEVFH---GHIESLSPA 289
+L+ +VP + L V A + I + G A I+ + + G +++++
Sbjct: 352 TAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLD 411

Query: 290 TGASFSLIPIENATGNFTKIVQRVPVRIAFDDAKELKQLLRPGLSVTVSV 339
+ G ++ + K + L G++VT +
Sbjct: 412 A-------IEDQRLGLVFNVIISIEENCLSTGNKNIP--LSSGMAVTAEI 452


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1392  TCRTETB       106    5e-27    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 106 bits (267), Expect = 5e-27
Identities = 81/418 (19%), Positives = 170/418 (40%), Gaps = 21/418 (5%)

Query: 3 SMRKHIAFASMCIGLFIAQLDIQIVSSSLNEIGGGLSAGKDEMAWLQTSYLIAEIIVIPL 62
++R + +CI F + L+ +++ SL +I + W+ T++++ I +
Sbjct: 9 NLRHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAV 68

Query: 63 SGWLSRVFSTRWLFTLSAGIFTLMSIACGLAWN-IQIMIFFRALQGVAGASMIPLVFTTA 121
G LS + L I S+ + + ++I R +QG A+ LV
Sbjct: 69 YGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVV 128

Query: 122 FIYYQGKELGLAAAVVSALASLSPTLGPTLGGWITDNLDWRWLFYINILPGIYLVLSIPF 181
Y + G A ++ ++ ++ +GP +GG I + W +L I ++ ++++PF
Sbjct: 129 ARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMI----TIITVPF 184

Query: 182 LVNFDKPDLSLLKVADYPSIILLAMTLGCLEYTLEEGARWGWLDDNTILLTSVLALVSFI 241
L+ K ++ + D IIL+++ + +L +++++SF+
Sbjct: 185 LMKLLKKEVRIKGHFDIKGIILMSVGIVFFMLFTTS-YSISFL---------IVSVLSFL 234

Query: 242 LFAARTLKISNPIMDLHAFKDKNFTLGCFFSFSGGVGIFSTVYLIPVFLGQVRGLNAEEI 301
+F K+++P +D K+ F +G + V ++P + V L+ EI
Sbjct: 235 IFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEI 294

Query: 302 GFAVCTTG-IFQLFSVPFYFWLSKKINLRWLLMAGMGGFVFSMYL--FTPITHEWGWQEL 358
G + G + + L + ++L G+ S F T W + +
Sbjct: 295 GSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSW-FMTI 353

Query: 359 LFPQAIRGISQQFAMAHIVTLTLGGIPKERLKLASGVFNLTRNLGGAIGIALCGSILN 416
+ + G+S F I T+ + ++ + N T L GIA+ G +L+
Sbjct: 354 IIVFVLGGLS--FTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLS 409


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1396  PRTACTNFAMLY  31     1e-04    Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 30.8 bits (69), Expect = 1e-04
Identities = 12/31 (38%), Positives = 12/31 (38%)

Query: 10 PVPEPIPGDPVPVPDPIPRPQPMPDPPPDEE 40
P P P PG P P P P PP E
Sbjct: 575 PKPAPQPGPQPPQPPQPQPEAPAPQPPAGRE 605



Score = 26.6 bits (58), Expect = 0.005
Identities = 11/23 (47%), Positives = 11/23 (47%)

Query: 19 PVPVPDPIPRPQPMPDPPPDEEP 41
P P P P P PQP P P E
Sbjct: 573 PAPKPAPQPGPQPPQPPQPQPEA 595


18  ECP_1455  ECP_1461  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
ECP_1455  4  35  -1.363106  glutathione S-transferase
ECP_1456  4  35  -3.666921  glutathione S-transferase
ECP_1457  5  37  -4.096603  hypothetical protein
ECP_1458  5  39  -5.180891  Rhs/Vgr-family protein
ECP_1459  5  48  -14.679491  hypothetical protein
ECP_1460  2  35  -8.571553  hypothetical protein
ECP_1461  0  28  -5.627997  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_1458  ICENUCLEATIN  33  0.006  Ice nucleation protein signature.
		>ICENUCLEATIN#Ice nucleation protein signature.

Length = 1258

Score = 33.2 bits (75), Expect = 0.006
Identities = 25/133 (18%), Positives = 53/133 (39%), Gaps = 8/133 (6%)

Query: 545 GHDQSITVANDRCITVRNDQTLQVTNDRTVSVSNDDGLYVRNDRKVTVEGKQEHKTTGNH 604
G +S + +R + + + Q R+ +S D + + +R + G +T G+
Sbjct: 1091 GP-ESTQITGNRSMLIAGKGSSQTAGYRSTLISGADSVQMAGERGKLIAGADSTQTAGDR 1149

Query: 605 VSLVEGKHSLVVKGDLARKVSGALGIKVDGDIVLESSSRISLKVGGSFVVIHSGGVDIVG 664
L+ G +S + GD ++ +G D +L + R L G + ++ ++G
Sbjct: 1150 SKLLAGNNSYLTAGDRSKLTAG-------NDCILMAGDRSKLTAGINSILTAGCRSKLIG 1202

Query: 665 PKISLNSGGSPGT 677
S + G
Sbjct: 1203 SNGSTLTAGENSV 1215



Score = 30.5 bits (68), Expect = 0.030
Identities = 15/69 (21%), Positives = 35/69 (50%)

Query: 567 QVTNDRTVSVSNDDGLYVRNDRKVTVEGKQEHKTTGNHVSLVEGKHSLVVKGDLARKVSG 626
Q+ + R+ ++ + + +R + + GK +T G +L+ G S+ + G+ + ++G
Sbjct: 1080 QIASHRSSLIAGPESTQITGNRSMLIAGKGSSQTAGYRSTLISGADSVQMAGERGKLIAG 1139

Query: 627 ALGIKVDGD 635
A + GD
Sbjct: 1140 ADSTQTAGD 1148


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_1461  PF07299  28  0.010  Fibronectin-binding protein (FBP)
		>PF07299#Fibronectin-binding protein (FBP)

Length = 219

Score = 28.3 bits (63), Expect = 0.010
Identities = 16/51 (31%), Positives = 23/51 (45%), Gaps = 13/51 (25%)

Query: 61 LNDMYAFIPGDNYYFIKS------SGYKFVND-------KWFTLKSINNIF 98
+ M AFI D Y FIKS +G+ ND K ++ I ++F
Sbjct: 4 VIKMEAFIRSDQYNFIKSQAYILANGHATANDRGVIQALKSLAIEKIIHVF 54


19  ECP_1487  ECP_1498  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias  Product
ECP_1487  -2  11  -3.724305  lipoprotein YddW precursor
ECP_1488  -2  13  -5.265498  amino acid antiporter
ECP_1489  -2  14  -5.860806  glutamate decarboxylase
ECP_1490  -2  19  -7.337508  zinc protease PqqL
ECP_1491  -1  20  -8.387327  hypothetical protein
ECP_1492  -1  23  -7.368778  ABC transporter ATP-binding protein
ECP_1493  -1  24  -7.161626  hypothetical protein
ECP_1494  -1  24  -5.964849  sulfatase YdeN
ECP_1495  -1  24  -4.961609  transcriptional regulator YdeO
ECP_1496  -1  23  -3.809250  hypothetical protein
ECP_1497  -1  22  -3.551482  oxidoreductase
ECP_1498  -1  21  -3.186457  fimbrial-like protein
20  ECP_1511  ECP_1528  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
ECP_1511  0  16  -3.789301  sugar efflux transporter
ECP_1512  0  18  -5.220802  multiple drug resistance protein MarC
ECP_1513  1  20  -6.776221  DNA-binding transcriptional repressor MarR
ECP_1514  0  21  -5.945439  DNA-binding transcriptional activator MarA
ECP_1515  -2  18  -4.651756  hypothetical protein
ECP_1516  -1  19  -5.207986  6-phospho-beta-glucosidase
ECP_1517  -1  16  -3.256079  hypothetical protein
ECP_1518  -1  15  -2.662381  hypothetical protein
ECP_1519  -1  14  -1.942861  O-acetylserine/cysteine export protein
ECP_1520  -1  15  -1.959576  MFS-type transporter YdeE
ECP_1521  -1  17  -2.096952  hypothetical protein
ECP_1522  -1  17  -2.062162  dipeptidyl carboxypeptidase II
ECP_1523  -2  19  -3.257575  3-hydroxy acid dehydrogenase
ECP_1524  -2  18  -3.070415  transcriptional regulator YdfH
ECP_1525  -2  19  -3.063121  hypothetical protein
ECP_1526  -1  18  -3.078235  oxidoreductase YdfI
ECP_1527  -1  14  -3.513787  metabolite transport protein YdfJ
ECP_1528  -3  16  -3.624378  dehydrogenase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_1511  TCRTETB  55  3e-10  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 54.5 bits (131), Expect = 3e-10
Identities = 41/192 (21%), Positives = 84/192 (43%), Gaps = 8/192 (4%)

Query: 36 LSDIAHSFHMQTAQVGIMLTIYAWVVALMSLPFMLMTSQVERRKLLICLFVVFIASHVLS 95
L DIA+ F+ A + T + ++ + + ++ Q+ ++LL+ ++ V+
Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIG 96

Query: 96 FLSWS-FTVLVISRIGVAFAHAIFWSITASLAIRMAPAGKRAQALSLIATGTALAMVLGL 154
F+ S F++L+++R A F ++ + R P R +A LI + A+ +G
Sbjct: 97 FVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGP 156

Query: 155 PLGRIVGQYFGWRMTFFAIGIGALITLLCLIKLLPLLPSEHSGSLKSLPLLFRRPALMSI 214
+G ++ Y W I + +IT+ L+KLL + LMS+
Sbjct: 157 AIGGMIAHYIHWSY-LLLIPMITIITVPFLMKLLK------KEVRIKGHFDIKGIILMSV 209

Query: 215 YLLTVVVVTAHY 226
++ ++ T Y
Sbjct: 210 GIVFFMLFTTSY 221


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_1520  TCRTETA  42  3e-06  Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 41.7 bits (98), Expect = 3e-06
Identities = 42/239 (17%), Positives = 82/239 (34%), Gaps = 18/239 (7%)

Query: 7 RSTSALLASSLLLTIGRGATLPFMTIYLSRQYSLSVDLI---GYAMTIALTIGVVFSLGF 63
R +L++ L +G G +P + L R S D+ G + + + +
Sbjct: 5 RPLIVILSTVALDAVGIGLIMPVLPGLL-RDLVHSNDVTAHYGILLALYALMQFACAPVL 63

Query: 64 GILADKFDKKRYMLMAITAFASGFIAIPLVNNVTLVVLFFALINCAYSVFATVLKAWFAD 123
G L+D+F ++ +L+++ A + + + ++ + + + A A+ AD
Sbjct: 64 GALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAG-AYIAD 122

Query: 124 NLSSTSKTKIFSINYTMLNIGWTIGPPLGTLLVMQSINLPFWLAAICSAFPMLFIQIWVK 183
+ + F G GP LG L+ S + PF+ AA + L +
Sbjct: 123 ITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLP 182

Query: 184 RSEK---------IIATETGSVWSPKVLLQDKALLWFTCSGFLASFVSGAFASCISQYV 233
S K + W + A L F+ V A+ +
Sbjct: 183 ESHKGERRPLRREALNPLASFRW--ARGMTVVAALMAV--FFIMQLVGQVPAALWVIFG 237



Score = 32.5 bits (74), Expect = 0.002
Identities = 21/155 (13%), Positives = 60/155 (38%), Gaps = 2/155 (1%)

Query: 7 RSTSALLASSLLLTIGRGATLPFMTIYLSRQYSLSVDLIGYAMTIALTIGVVF-SLGFGI 65
+AL+A ++ + I+ ++ IG ++ + + ++ G
Sbjct: 210 TVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGP 269

Query: 66 LADKFDKKRYMLMAITAFASGFIAIPLVNNVTLVVLFFALINCAYSVFATVLKAWFADNL 125
+A + ++R +++ + A +G+I + + L+ + L+A + +
Sbjct: 270 VAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASG-GIGMPALQAMLSRQV 328

Query: 126 SSTSKTKIFSINYTMLNIGWTIGPPLGTLLVMQSI 160
+ ++ + ++ +GP L T + SI
Sbjct: 329 DEERQGQLQGSLAALTSLTSIVGPLLFTAIYAASI 363


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_1523  DHBDHDRGNASE  100  2e-27  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 100 bits (249), Expect = 2e-27
Identities = 70/244 (28%), Positives = 114/244 (46%), Gaps = 16/244 (6%)

Query: 2 IVLVTGATAGFGECITRRFIQQGHKVIATGRRQERLQELKDELGDNLYIAQ---LDVRNR 58
I +TGA G GE + R QG + A E+L+++ L A+ DVR+
Sbjct: 10 IAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDS 69

Query: 59 AAIEEMLASLPAEWCNIDILVNNAGLALGMEPAHKASVEDWETMIDTNNKGLVYMTRAVL 118
AAI+E+ A + E IDILVN AG+ L H S E+WE N+ G+ +R+V
Sbjct: 70 AAIDEITARIEREMGPIDILVNVAGV-LRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 119 PGMVERNHGHIINIGSTAGSWPYAGGNVYGATKAFVRQFSLNLRTDLHGTAVRVTDIEPG 178
M++R G I+ +GS P Y ++KA F+ L +L +R + PG
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188

Query: 179 LVGGTEFSNVRFKGDDGKAE------KTYQNTVALT----PEDVSEAV-WWVSTLPAHVN 227
T+ + ++G + +T++ + L P D+++AV + VS H+
Sbjct: 189 ST-ETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHIT 247

Query: 228 INTL 231
++ L
Sbjct: 248 MHNL 251


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_1527  TCRTETB  49  3e-08  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 48.7 bits (116), Expect = 3e-08
Identities = 33/118 (27%), Positives = 55/118 (46%), Gaps = 16/118 (13%)

Query: 72 VGAFIFGKMGDRIGRKKVLFITITMMGICTTLIGVLPTYAQIGVFAPILLVTLRIIQGLG 131
+G ++GK+ D++G K++L I + + + V ++ + + A R IQG G
Sbjct: 64 IGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMA-------RFIQGAG 116

Query: 132 AGAEISGAGTMLAEYAPKGKR----GIISSFVAMGTNCGTLSATAI-----WAFMFFI 180
A A + ++A Y PK R G+I S VAMG G I W+++ I
Sbjct: 117 AAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLI 174


21  ECP_1660  ECP_1667  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
ECP_1660  4  28  1.443005  integration host factor subunit alpha
ECP_1661  3  27  0.493711  phenylalanyl-tRNA synthetase subunit beta
ECP_1662  2  24  -2.278400  phenylalanyl-tRNA synthetase subunit alpha
ECP_1663  2  20  -4.893827  50S ribosomal protein L20
ECP_1664  0  16  -4.635220  50S ribosomal protein L35
ECP_1665  -1  15  -3.311332  translation initiation factor IF-3
ECP_1666  -1  15  -3.537624  threonyl-tRNA synthetase
ECP_1667  -1  23  -3.221171  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_1660  DNABINDINGHU  119  3e-39  Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 119 bits (301), Expect = 3e-39
Identities = 34/89 (38%), Positives = 55/89 (61%)

Query: 4 TKAEMSEYLFDKLGLSKRDAKELVELFFEEIRRALENGEQVKLSGFGNFDLRDKNQRPGR 63
K ++ + + L+K+D+ V+ F + L GE+V+L GFGNF++R++ R GR
Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62

Query: 64 NPKTGEDIPITARRVVTFRPGQKLKSRVE 92
NP+TGE+I I A +V F+ G+ LK V+
Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAVK 91


22  ECP_1677  ECP_1741  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
ECP_1677  -1  15  -3.273296  cell division modulator
ECP_1678  -1  13  -3.336855  hydroperoxidase II
ECP_1679  0  18  -4.878527  hypothetical protein
ECP_1680  0  17  -5.253675  6-phospho-beta-glucosidase
ECP_1681  0  17  -4.612690  DNA-binding transcriptional regulator ChbR
ECP_1682  0  17  -2.618420  PTS system N,N'-diacetylchitobiose-specific
ECP_1683  1  15  -2.386966  PTS system N,N'-diacetylchitobiose-specific
ECP_1684  0  15  -1.837051  PTS system N,N'-diacetylchitobiose-specific
ECP_1685  -1  13  -0.489198  DNA-binding transcriptional activator OsmE
ECP_1686  0  12  0.515901  NAD synthetase
ECP_1687  1  13  2.471059  nucleotide excision repair endonuclease
ECP_1688  1  13  3.195077  hypothetical protein
ECP_1689  -1  12  3.671148  hypothetical protein
ECP_1690  0  12  3.776334  succinylglutamate desuccinylase
ECP_1691  0  13  3.226434  succinylarginine dihydrolase
ECP_1692  0  12  2.630639  succinylglutamic semialdehyde dehydrogenase
ECP_1693  -1  14  0.957767  arginine succinyltransferase
ECP_1694  -1  13  0.851660  bifunctional succinylornithine
ECP_1695  1  15  0.771330  exonuclease III
ECP_1696  2  15  1.782584  hypothetical protein
ECP_1697  1  15  2.413363  hypothetical protein
ECP_1698  1  14  3.001186  hypothetical protein
ECP_1699  1  15  3.437374  hypothetical protein
ECP_1700  1  15  3.244807  ABC transporter substrate-binding protein
ECP_1701  1  16  2.841202  ABC transporter permease
ECP_1702  -1  13  1.820808  ABC transporter ATP-binding protein
ECP_1703  -1  15  -1.393510  thiosulfate sulfurtransferase
ECP_1704  1  25  -4.926884  hypothetical protein
ECP_1705  0  28  -5.887177  pyrimidine (deoxy)nucleoside triphosphate
ECP_1706  -1  22  -4.415886  hypothetical protein
ECP_1707  -2  18  -3.611761  glutamate dehydrogenase
ECP_1708  -2  20  -3.968400  hypothetical protein
ECP_1709  -3  13  -2.229726  hypothetical protein
ECP_1710  -2  9  0.035008  chaperone protein HscC
ECP_1711  -2  9  1.734071  DNA topoisomerase III
ECP_1712  -2  9  -0.149458  selenophosphate synthetase
ECP_1713  -2  12  -1.937402  hypothetical protein
ECP_1714  -2  12  -2.597975  protease 4
ECP_1715  -1  19  -4.249768  asparaginase
ECP_1716  -1  21  -5.113637  nicotinamidase/pyrazinamidase
ECP_1717  0  22  -5.747216  metabolite transport protein YdjE
ECP_1718  -1  22  -5.032525  transcriptional regulator YdjF
ECP_1719  0  20  -3.925483  oxidoreductase YdjG
ECP_1720  0  21  -4.215167  sugar kinase
ECP_1721  0  19  -3.683163  hypothetical protein
ECP_1722  -1  17  -2.770742  zinc-type alcohol dehydrogenase-like protein
ECP_1723  -1  18  -2.323068  metabolite transport protein YdjK
ECP_1724  -1  21  -2.139320  zinc-type alcohol dehydrogenase-like protein
ECP_1725  1  25  -1.852829  hypothetical protein
ECP_1726  2  20  -1.585685  methionine sulfoxide reductase B
ECP_1727  1  18  -1.606624  glyceraldehyde 3-phosphate dehydrogenase A
ECP_1728  -1  11  -3.953745  aldose 1-epimerase
ECP_1729  -1  12  -4.547401  hypothetical protein
ECP_1730  0  12  -4.616466  MltA-interacting protein
ECP_1731  -1  14  -4.564597  hypothetical protein
ECP_1732  -3  18  -5.086155  hypothetical protein
ECP_1733  -2  20  -4.286433  hypothetical protein
ECP_1734  -2  19  -1.329363  hypothetical protein
ECP_1735  1  20  0.815673  hypothetical protein
ECP_1736  -1  19  -0.095529  hypothetical protein
ECP_1737  -1  19  -0.976140  transcriptional regulator YeaM
ECP_1738  -1  18  -1.039145  transporter
ECP_1739  -1  20  -2.818213  hypothetical protein
ECP_1740  -1  20  -3.950726  hypothetical protein
ECP_1741  -2  21  -4.384482  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_1692  DNABINDINGHU  32  0.002  Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 31.6 bits (72), Expect = 0.002
Identities = 14/61 (22%), Positives = 28/61 (45%), Gaps = 5/61 (8%)

Query: 74 SNKAELTAIIARETGKPRWEAATEVTAMINKIAISIKAYHVRTGEQRSEMPDGAASLRHR 133
+NK +L A +A T + ++A V A+ + ++ + GE+ + G +R R
Sbjct: 2 ANKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAK-----GEKVQLIGFGNFEVRER 56

Query: 134 P 134

Sbjct: 57 A 57


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_1710  SHAPEPROTEIN  104  1e-26  Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein

signature.
Length = 347

Score = 104 bits (261), Expect = 1e-26
Identities = 81/373 (21%), Positives = 138/373 (36%), Gaps = 89/373 (23%)

Query: 3 IGIDLGTTNSLAAVWRNGQSELIPNALGKFLTPSVVCVDEDG------MVLTGEAARDLQ 56
+ IDLGT N+L V G ++ N PSVV + +D + G A
Sbjct: 13 LSIDLGTANTLIYVKGQG---IVLN------EPSVVAIRQDRAGSPKSVAAVGHDA---- 59

Query: 57 LIKPQNCASNFKRMMGTS-------KTLKLG--GREFRAEELSSLILRQLKEDAENYLGE 107
K+M+G + + +K G F E++ ++Q+ ++
Sbjct: 60 -----------KQMLGRTPGNIAAIRPMKDGVIADFFVTEKMLQHFIKQVHSNS---FMR 105

Query: 108 EVTEAVISVPAYFGDMQRKATKAAATMAGLNVERLINEPTAAALAYGLHNKDDEHQFLVF 167
++ VP ++R+A + +A AG LI EP AAA+ GL + +V
Sbjct: 106 PSPRVLVCVPVGATQVERRAIRESAQGAGAREVFLIEEPMAAAIGAGLPVSEATGS-MVV 164

Query: 168 DLGGGTFDVSILELFDNIMEVRAS-AGDNFLGGEDIVDILIDAYCSRRDLPENIEWREPT 226
D+GGGT +V+++ L + GD F E I++ + Y E T
Sbjct: 165 DIGGGTTEVAVISLNGVVYSSSVRIGGDRF--DEAIINYVRRNY--------GSLIGEAT 214

Query: 227 FQRHLRIEAERVKRVLS--VRDEATFSVEIEGRRYYWHL-------TTEKFEFL---LQT 274
AER+K + + +E+ GR + + E E L L
Sbjct: 215 --------AERIKHEIGSAYPGDEVREIEVRGRNLAEGVPRGFTLNSNEILEALQEPLTG 266

Query: 275 FFERVHMPLER-------AIRDAKINISQLDQVVLVGGTTRMPLIRKLVTRLFGRIPAMH 327
V + LE+ I + + VL GG + + +L+ G +
Sbjct: 267 IVSAVMVALEQCPPELASDISERGM--------VLTGGGALLRNLDRLLMEETGIPVVVA 318

Query: 328 LNPDEVIAQGAAI 340
+P +A+G
Sbjct: 319 EDPLTCVARGGGK 331


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_1716  ISCHRISMTASE  37  3e-05  Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 36.9 bits (85), Expect = 3e-05
Identities = 35/192 (18%), Positives = 55/192 (28%), Gaps = 58/192 (30%)

Query: 2 PPRALLLV-DLQNDFCAGGALAVPEGDSTVDVANRLIDWCQSRGEAVI-----ASQD--- 52
P RA+LL+ D+QN F +L + C G V+ SQ+
Sbjct: 28 PNRAVLLIHDMQNYFVDAFTAGASPVTELSANIRKLKNQCVQLGIPVVYTAQPGSQNPDD 87

Query: 53 -------WHPANHGSFASQHGVEPYTPGQLDGLPQTFWPDHCVQNSEGAQLHPLLKQKAI 105
W P + + + P D + T W
Sbjct: 88 RALLTDFWGPGLNSGPYEEKIITELAPEDDDLV-LTKW---------------------- 124

Query: 106 AAVFHKGENPLVDSYSAFFDNGRRQKTALDDWLRAHVINELIVMGLATDYCVKFTVLDAL 165
YSAF +T L + +R ++LI+ G+ T +A
Sbjct: 125 -------------RYSAFK------RTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAF 165

Query: 166 QLGYKVNVITDG 177
K + D
Sbjct: 166 MEDIKAFFVGDA 177


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_1717  TCRTETB  39  2e-05  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 39.5 bits (92), Expect = 2e-05
Identities = 30/129 (23%), Positives = 50/129 (38%), Gaps = 1/129 (0%)

Query: 65 ALMFGYFIGSLTGGFIGDYFGRRRAFRINLLIVGIAATGAAFVPDMY-WLIFFRFLMGTG 123
A M + IG+ G + D G +R ++I + + LI RF+ G G
Sbjct: 57 AFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAG 116

Query: 124 MGALIMVGYASFTEFIPATVRGKWSARLSFVGNWSPMLSAAIGVVVIAFFSWRIMFLLGG 183
A + +IP RGK + + + AIG ++ + W + L+
Sbjct: 117 AAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPM 176

Query: 184 IGILLAWFL 192
I I+ FL
Sbjct: 177 ITIITVPFL 185


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_1723  TCRTETB  31  0.011  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 31.0 bits (70), Expect = 0.011
Identities = 33/142 (23%), Positives = 48/142 (33%), Gaps = 23/142 (16%)

Query: 71 MFLGALVGGIIGDKTGRRNAFILYEAIHIASMVVGAFSPNMDF-LIACRFVMGVGLGALL 129
+G V G + D+ G + + I+ V+G + LI RF+ G G A
Sbjct: 62 FSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFP 121

Query: 130 VTLFAGFTEYMPGRNR----GTWSSRVSFIGNWSYPLCSLIAMGLTPLISA----EWNWR 181
+ Y+P NR G S V+ + G+ P I +W
Sbjct: 122 ALVMVVVARYIPKENRGKAFGLIGSIVA------------MGEGVGPAIGGMIAHYIHWS 169

Query: 182 VQLLIPAILSLIATALAWRYFP 203
LLIP I I T
Sbjct: 170 YLLLIPMI--TIITVPFLMKLL 189


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_1728  INVEPROTEIN  29  0.021  Salmonella/Shigella invasion protein E (InvE) signat...
		>INVEPROTEIN#Salmonella/Shigella invasion protein E (InvE)

signature.
Length = 372

Score = 28.9 bits (64), Expect = 0.021
Identities = 18/81 (22%), Positives = 34/81 (41%), Gaps = 13/81 (16%)

Query: 158 ETTSALHTYFNVGDIAKVSVSGLGDRFIDKVNDAKED-----------VLTDGIQTFPDR 206
E ++AL + N D K S S L + F ++V + + V ++ F +
Sbjct: 57 EMSAALAQFRNRRDYEKKS-SNLSNSF-ERVLEDEALPKAKQILKLISVHGGALEDFLRQ 114

Query: 207 TDRVYLNPQDCSVINDEALNR 227
++ +P D ++ E L R
Sbjct: 115 ARSLFPDPSDLVLVLRELLRR 135


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_1736  PRTACTNFAMLY  28  0.022  Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 27.7 bits (61), Expect = 0.022
Identities = 18/61 (29%), Positives = 26/61 (42%)

Query: 49 QGLSIGIIILTIGVMAPIASGTLPPSTLIHSFLNWKSLVAIAVGVIVSWLGGRGVTLMGS 108
Q +I L IG + + LPPS ++ N ++ A VS LG +TL G
Sbjct: 174 QRSAIVDGGLHIGALQSLQPEDLPPSRVVLRDTNVTAVPASGAPAAVSVLGASELTLDGG 233

Query: 109 Q 109

Sbjct: 234 H 234


23  ECP_1863  ECP_2046  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
ECP_1863  -1  20  -4.240410  inner membrane protein
ECP_1864  3  35  -7.542407  hypothetical protein
ECP_1865  4  37  -7.812206  hypothetical protein
ECP_1866  5  38  -8.440629  hypothetical protein
ECP_1867  0  25  -4.274300  outer membrane porin protein LC
ECP_1868  -2  15  -1.737085  transcriptional regulator YbcM
ECP_1869  0  13  1.446573  kinase inhibitor
ECP_1870  -1  15  3.809362  multidrug efflux protein
ECP_1871  1  17  4.610756  flagellar hook-basal body protein FliE
ECP_1872  1  15  4.351223  flagellar MS-ring protein
ECP_1873  1  18  4.450707  flagellar motor switch protein G
ECP_1874  -1  16  3.727657  flagellar assembly protein H
ECP_1875  -2  17  3.436748  flagellum-specific ATP synthase
ECP_1876  -1  16  2.297228  flagellar biosynthesis chaperone
ECP_1877  -1  16  2.460311  flagellar hook-length control protein
ECP_1878  -2  21  2.100825  flagellar basal body protein FliL
ECP_1879  0  17  0.745595  flagellar motor switch protein FliM
ECP_1880  1  16  -2.017411  flagellar motor switch protein FliN
ECP_1881  0  16  -2.845722  flagellar biosynthesis protein FliO
ECP_1882  0  18  -3.772207  flagellar biosynthesis protein FliP
ECP_1883  0  19  -4.123408  flagellar biosynthesis protein FliQ
ECP_1884  -3  15  -2.565244  flagellar biosynthesis protein FliR
ECP_1885  -1  17  -2.052143  colanic acid capsular biosynthesis activation
ECP_1886  -3  15  0.392624  hypothetical protein
ECP_1887  -2  15  0.791886  hypothetical protein
ECP_1888  -2  16  1.380584  mannosyl-3-phosphoglycerate phosphatase
ECP_1889  -1  15  1.291891  hypothetical protein
ECP_1890  2  16  1.738110  hypothetical protein
ECP_1891  1  16  1.803545  hypothetical protein
ECP_1892  2  17  0.960379  hypothetical protein
ECP_1893  -1  14  -1.541288  very short patch repair protein
ECP_1894  -2  15  -2.385976  DNA cytosine methylase
ECP_1895  -2  20  -3.853836  hypothetical protein
ECP_1896  -2  28  -7.757468  hypothetical protein
ECP_1897  -1  30  -8.535845  hypothetical protein
ECP_1898  -1  30  -8.231570  outer membrane pore protein
ECP_1899  0  34  -7.884585  hypothetical protein
ECP_1900  -1  29  -6.729814  chaperone protein HchA
ECP_1901  0  33  -6.982386  sensor-like histidine kinase YedV
ECP_1902  1  28  -6.108630  transcriptional regulatory protein YedW
ECP_1903  1  28  -6.567081  transthyretin-like protein
ECP_1904  0  26  -4.969094  hypothetical protein
ECP_1905  1  37  -10.157325  sulfite oxidase subunit YedY
ECP_1906  2  41  -10.671107  sulfite oxidase subunit YedZ
ECP_1907  3  44  -11.070448  hypothetical protein
ECP_1908  3  46  -11.148677  hypothetical protein
ECP_1909  3  43  -10.283285  phage-related integrase
ECP_1910  4  47  -12.873759  hypothetical protein
ECP_1911  4  43  -10.535154  PilV-like protein
ECP_1912  4  47  -13.813704  type IV pilin protein
ECP_1913  5  47  -14.402670  hypothetical protein
ECP_1914  5  44  -12.906968  hypothetical protein
ECP_1915  7  47  -15.447446  hypothetical protein
ECP_1916  5  46  -14.220229  hypothetical protein
ECP_1917  4  48  -15.497456  hypothetical protein
ECP_1918  5  42  -11.578413  hypothetical protein
ECP_1919  7  42  -12.867894  hypothetical protein
ECP_1920  8  48  -15.448722  hypothetical protein
ECP_1921  8  48  -15.477108  hypothetical protein
ECP_1922  8  47  -14.469808  hypothetical protein
ECP_1923  9  42  -12.205968  hypothetical protein
ECP_1924  8  47  -14.424366  hypothetical protein
ECP_1925  6  45  -13.558565  hypothetical protein
ECP_1926  4  37  -11.700939  hypothetical protein
ECP_1927  4  37  -10.764053  DNA-binding protein
ECP_1928  5  32  -10.632380  hypothetical protein
ECP_1929  4  32  -10.176316  hypothetical protein
ECP_1930  2  25  -6.393747  hypothetical protein
ECP_1931  0  20  -2.852211  hypothetical protein
ECP_1932  -1  19  0.646707  regulatory protein
ECP_1933  0  17  2.607763  hypothetical protein
ECP_1934  0  18  4.481412*  hypothetical protein
ECP_1936  1  21  5.544017*  P4-like integrase
ECP_1937  -1  21  7.083595  salicylate synthase Irp9
ECP_1938  0  23  8.189401  cytoplasmic transmembrane protein
ECP_1939  0  24  8.236408  ABC transporter permease
ECP_1940  0  24  8.310309  inner membrane ABC-transporter (YbtP)
ECP_1941  0  24  8.394026  yersiniabactin transcriptional regulator, YbtA
ECP_1942  -1  24  8.226856  yersiniabactin biosynthetic protein
ECP_1943  -1  23  7.509091  yersiniabactin biosynthetic protein
ECP_1944  -1  21  4.932754  yersiniabactin biosynthetic protein YbtU
ECP_1945  -1  19  2.774937  yersiniabactin biosynthetic protein YbtT
ECP_1946  -1  18  1.683273  yersiniabactin siderophore biosynthetic protein
ECP_1947  -1  21  -2.383664  pesticin receptor
ECP_1948  1  31  -6.247490  hypothetical protein
ECP_1949  -2  27  -4.583593  hypothetical protein
ECP_1950  -2  25  -4.088334  hypothetical protein
ECP_1951  -2  30  -5.804468  hypothetical protein
ECP_1952  -2  29  -6.337703  hypothetical protein
ECP_1953  -2  27  -4.236505  hypothetical protein
ECP_1954  -2  26  -3.925883  shikimate transporter
ECP_1955  -1  27  -4.104342  AMP nucleosidase
ECP_1956  -1  25  -3.771269  hypothetical protein
ECP_1957  0  26  -3.395551  hypothetical protein
ECP_1958  0  25  -3.162452*  nitrogen assimilation transcriptional regulator
ECP_1959  0  23  -2.786055  transcriptional regulator Cbl
ECP_1960  0  20  -1.699007*  hypothetical protein
ECP_1962  0  19  -0.056403*  bacteriophage integrase
ECP_1963  1  21  0.021154  hypothetical protein
ECP_1964  1  21  1.097773  thioesterase
ECP_1965  2  20  2.728611  hypothetical protein
ECP_1966  2  18  3.531805  polyketide synthase
ECP_1967  2  18  4.084294  non-ribosomal peptide synthetase
ECP_1968  2  17  4.373197  drug/sodium antiporter
ECP_1969  2  17  4.622134  amidase
ECP_1970  2  16  4.402026  non-ribosomal peptide synthase
ECP_1971  2  16  4.001159  non-ribosomal peptide synthase
ECP_1972  1  15  2.760812  polyketide synthase
ECP_1973  1  16  2.233933  peptide synthetase
ECP_1974  2  19  1.484617  malonyl-CoA transacylase
ECP_1975  2  20  0.928165  acyl-CoA dehydrogenase
ECP_1976  2  20  0.698575  D-alanyl carrier protein
ECP_1977  2  20  -0.016641  3-hydroxybutyryl-CoA dehydrogenase
ECP_1978  2  19  0.391019  polyketide synthase
ECP_1979  2  20  -0.521879  polyketide synthase
ECP_1980  3  34  -11.839068  hypothetical protein
ECP_1981  5  34  -11.636910  transcriptional regulator
ECP_1982  8  34  -10.530703  4'-phosphopantetheinyl transferase
ECP_1983  3  28  -5.610322  IS1400 transposase A
ECP_1984  2  29  -4.012642  IS1400 transposase B
ECP_1985  2  28  -2.811864  transposase
ECP_1986  2  29  -2.378294  hypothetical protein
ECP_1987  2  28  -1.287306  hypothetical protein
ECP_1988  1  27  -1.448960  hypothetical protein
ECP_1989  1  25  -3.350739  nicotinate-nucleotide--dimethylbenzimidazole
ECP_1990  1  25  -4.845537  cobalamin synthase
ECP_1991  5  28  -5.467437  adenosylcobinamide kinase
ECP_1992  7  26  -4.914740  hypothetical protein
ECP_1993  7  29  -6.173675  hypothetical protein
ECP_1994  4  26  -5.474517  outer membrane receptor for iron compound or
ECP_1995  3  26  -5.135659  hypothetical protein
ECP_1996  3  23  -4.380025  hypothetical protein
ECP_1997  3  24  -4.822554  phosphoadenosine phosphosulfate reductase
ECP_1998  2  26  -5.667800  hypothetical protein
ECP_1999  1  24  -5.315132  hypothetical protein
ECP_2000  0  29  -7.262517  hypothetical protein
ECP_2001  0  30  -7.003064  hypothetical protein
ECP_2002  1  31  -6.919548  hypothetical protein
ECP_2003  1  30  -6.890374  hypothetical protein
ECP_2004  1  31  -7.000814  carbohydrate kinase
ECP_2005  2  29  -5.535670  hypothetical protein
ECP_2006  3  27  -2.758563  hypothetical protein
ECP_2007  4  27  -1.794876  phosphotriesterase
ECP_2008  6  29  -0.096808  hypothetical protein
ECP_2009  8  32  -0.101301  hypothetical protein
ECP_2010  7  30  1.187305  transposase (part)
ECP_2011  6  28  1.518838  transposase
ECP_2012  7  31  2.083725  transposase
ECP_2013  7  31  1.669945  hypothetical protein
ECP_2014  5  31  3.515992  hypothetical protein
ECP_2015  3  28  3.751960  hypothetical protein
ECP_2016  4  27  2.277228  hypothetical protein
ECP_2017  5  27  1.936038  hypothetical protein
ECP_2018  3  25  1.831116  hypothetical protein
ECP_2019  4  23  1.513205  transposase
ECP_2020  6  22  -0.405987  regulatory protein
ECP_2021  6  23  -0.289708  hypothetical protein
ECP_2022  5  27  -0.469689  hypothetical protein
ECP_2023  5  28  -0.493858  hypothetical protein
ECP_2024  4  29  -1.122316  transposase/IS protein
ECP_2025  6  31  -0.712028  transposase for insertion sequence IS100
ECP_2026  7  32  -1.559507  hypothetical protein
ECP_2027  8  33  -2.282404  hypothetical protein
ECP_2028  2  28  -0.168550  hypothetical protein
ECP_2029  0  22  1.519566  hypothetical protein
ECP_2030  -1  20  2.791845  hypothetical protein
ECP_2031  -1  19  2.906825  hypothetical protein
ECP_2032  -2  19  3.396299  hypothetical protein
ECP_2033  -2  17  2.659177  ABC transporter
ECP_2034  -1  16  2.385218  iron ABC transporter
ECP_2035  0  17  2.102812  iron-chelating periplasmic-binding protein
ECP_2036  0  18  1.355563  TonB-dependent receptor
ECP_2037  4  24  0.385722  hypothetical protein
ECP_2038  4  24  0.674836  hypothetical protein
ECP_2039  4  24  1.165158  hypothetical protein
ECP_2040  4  23  1.445017  hypothetical protein
ECP_2041  4  23  0.849958  hypothetical protein
ECP_2042  7  27  2.753172  hypothetical protein
ECP_2043  7  29  4.379215  hypothetical protein
ECP_2044  8  26  3.454493  antirestriction protein
ECP_2045  6  25  2.417739  radC-like protein YeeS
ECP_2046  3  22  0.376841  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_1863  RTXTOXIND  30  0.017  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 30.2 bits (68), Expect = 0.017
Identities = 10/57 (17%), Positives = 17/57 (29%), Gaps = 2/57 (3%)

Query: 164 RFTLLPIFRIPVKMQKVSAASPLTQKPDQARRRF--RLGMLVFFGMLGWALLTAMNQ 218
R L R + + + A L + P R R M ++L +
Sbjct: 26 RKQLDTPVREKDENEFLPAHLELIETPVSRRPRLVAYFIMGFLVIAFILSVLGQVEI 82


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_1864  PF01206  93  6e-29  SirA family protein
		>PF01206#SirA family protein

Length = 76

Score = 92.5 bits (230), Expect = 6e-29
Identities = 16/71 (22%), Positives = 37/71 (52%)

Query: 7 DYRLDMVGEPCPYPAVATLEAMPQLKKGEILEVVSDCPQSINNIPLDARNHGYTVLDIQQ 66
D LD G CP P + + + + GE+L V++ P S+ + ++ G+ +L+ ++
Sbjct: 5 DQSLDATGLNCPLPILKAKKTLATMNAGEVLYVMATDPGSVKDFESFSKQTGHELLEQKE 64

Query: 67 DGPTIRYLIQK 77
+ T + +++
Sbjct: 65 EDGTYHFRLKR 75


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_1867  ECOLIPORIN  510  0.0  E.coli/Salmonella-type porin signature.
		>ECOLIPORIN#E.coli/Salmonella-type porin signature.

Length = 383

Score = 510 bits (1314), Expect = 0.0
Identities = 240/388 (61%), Positives = 282/388 (72%), Gaps = 33/388 (8%)

Query: 1 MKKLTVAISAVAASVLMAMSAQAAEIYNKDSNKLDLYGKVNAKHYFSSNDADDGDTTYVR 60
MK+ +A+ V ++L A +A AAEIYNKD NKLDLYGKV+ HYFS + + DGD TY+R
Sbjct: 1 MKRKVLAL--VIPALLAAGAAHAAEIYNKDGNKLDLYGKVDGLHYFSDDSSKDGDQTYMR 58

Query: 61 LGFKGETQINDQLTGFGQWEYEFKGNRAESQGSSKDKTRLAFAGLKFGDYGSIDYGRNYG 120
+GFKGETQINDQLTG+GQWEY + N E +G++ TRLAFAGLKFGDYGS DYGRNYG
Sbjct: 59 VGFKGETQINDQLTGYGQWEYNVQANTTEGEGANS-WTRLAFAGLKFGDYGSFDYGRNYG 117

Query: 121 VAYDIGAWTDVLPEFGGDTWTQTDVFMTGRTTGVATYRNNDFFGLVDGLNFAAQYQGKND 180
V YD+ WTD+LPEFGGD++T D +MTGR GVATYRN DFFGLVDGLNFA QYQGKN+
Sbjct: 118 VLYDVEGWTDMLPEFGGDSYTYADNYMTGRANGVATYRNTDFFGLVDGLNFALQYQGKNE 177

Query: 181 R----------------TDVTEANGDGFGFSTTYEY-EGFGVGATYAKSDRTNDQVIYGN 223
D+ NGDGFG STTY+ GF GA Y SDRTN+QV G
Sbjct: 178 SQSADDVNIGTNNRNNGDDIRYDNGDGFGISTTYDIGMGFSAGAAYTTSDRTNEQVNAGG 237

Query: 224 NSLNASGQNAEVWAAGLKYDANNIYLATTYSETQNMTVFG------NNHIANKAQNFEVV 277
A G A+ W AGLKYDANNIYLAT YSET+NMT +G + +ANK QNFEV
Sbjct: 238 T--IAGGDKADAWTAGLKYDANNIYLATMYSETRNMTPYGKTDKGYDGGVANKTQNFEVT 295

Query: 278 AQYQFDFGLRPSVAYLQSKGKDLG----AWGDQDLVEYIDVGATYYFNKNMSTFVDYKIN 333
AQYQFDFGLRP+V++L SKGKDL D+DLV+Y DVGATYYFNKN ST+VDYKIN
Sbjct: 296 AQYQFDFGLRPAVSFLMSKGKDLTYNNVNGDDKDLVKYADVGATYYFNKNFSTYVDYKIN 355

Query: 334 LIDKSD-FTKASGVATDDIVAVGLVYQF 360
L+D D F K +G++TDDIVA+G+VYQF
Sbjct: 356 LLDDDDPFYKDAGISTDDIVALGMVYQF 383


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_1871  FLGHOOKFLIE  117  8e-38  Flagellar hook-basal body complex protein FliE signa...
		>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE

signature.
Length = 103

Score = 117 bits (293), Expect = 8e-38
Identities = 102/103 (99%), Positives = 102/103 (99%)

Query: 2 SAIQGIEGVISQLQATAMSARAQESLPQPTISFAGQLHAALDRISDTQTVARTQAEKFTL 61
SAIQGIEGVISQLQATAMSARAQESLPQPTISFAGQLHAALDRISDTQT ARTQAEKFTL
Sbjct: 1 SAIQGIEGVISQLQATAMSARAQESLPQPTISFAGQLHAALDRISDTQTAARTQAEKFTL 60

Query: 62 GEPGVALNDVMTDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV 104
GEPGVALNDVMTDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV
Sbjct: 61 GEPGVALNDVMTDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV 103


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_1872  FLGMRINGFLIF  751  0.0  Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 751 bits (1941), Expect = 0.0
Identities = 476/555 (85%), Positives = 513/555 (92%), Gaps = 5/555 (0%)

Query: 3 ATAAQTKSLEWLNRLRANPKIPLIVAGSAAVAVMVALILWAKAPDYRTLFSNLSDQDGGA 62
+TA Q K LEWLNRLRANP+IPLIVAGSAAVA++VA++LWAK PDYRTLFSNLSDQDGGA
Sbjct: 5 STATQPKPLEWLNRLRANPRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQDGGA 64

Query: 63 IVSQLTQMNIPYRFSEASGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQ 122
IV+QLTQMNIPYRF+ SGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQ
Sbjct: 65 IVAQLTQMNIPYRFANGSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQ 124

Query: 123 FSEQVNYQRALEGELSRTIETIGPVKGARVHLAMPKPSLFVREQKSPSASVTVNLLPGRA 182
FSEQVNYQRALEGEL+RTIET+GPVK ARVHLAMPKPSLFVREQKSPSASVTV L PGRA
Sbjct: 125 FSEQVNYQRALEGELARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRA 184

Query: 183 LDEGQISAIVHLVSSAVAGLPPGNVTLVDQGGHLLTQSNTSGRDLNDAQLKYASDVEGRI 242
LDEGQISA+VHLVSSAVAGLPPGNVTLVDQ GHLLTQSNTSGRDLNDAQLK+A+DVE RI
Sbjct: 185 LDEGQISAVVHLVSSAVAGLPPGNVTLVDQSGHLLTQSNTSGRDLNDAQLKFANDVESRI 244

Query: 243 QRRIEAILSPIVGNGNIHAQVSAQLDFASKEQTEEQYRPNGDESHAALRSRQLNESEQSG 302
QRRIEAILSPIVGNGN+HAQV+AQLDFA+KEQTEE Y PNGD S A LRSRQLN SEQ G
Sbjct: 245 QRRIEAILSPIVGNGNVHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNISEQVG 304

Query: 303 SGYPGGVPGALSNQPAPANNAPISTPPTNQNNRQQ--QASTTSNS---GPRSTQRNETSN 357
+GYPGGVPGALSNQPAP N API+TPPTNQ N Q Q ST++NS GPRSTQRNETSN
Sbjct: 305 AGYPGGVPGALSNQPAPPNEAPIATPPTNQQNAQNTPQTSTSTNSNSAGPRSTQRNETSN 364

Query: 358 YEVDRTIRHTKMNVGDVQRLSVAVVVNYKTLPDGKPLPLSNEQMKQIEALTREAMGFSEK 417
YEVDRTIRHTKMNVGD++RLSVAVVVNYKTL DGKPLPL+ +QMKQIE LTREAMGFS+K
Sbjct: 365 YEVDRTIRHTKMNVGDIERLSVAVVVNYKTLADGKPLPLTADQMKQIEDLTREAMGFSDK 424

Query: 418 RGDSLNVVNSPFNSSDESGGALPFWQQQAFIDQLLAAGRWLLVLLVAWLLWRKAVRPQLT 477
RGD+LNVVNSPF++ D +GG LPFWQQQ+FIDQLLAAGRWLLVL+VAW+LWRKAVRPQLT
Sbjct: 425 RGDTLNVVNSPFSAVDNTGGELPFWQQQSFIDQLLAAGRWLLVLVVAWILWRKAVRPQLT 484

Query: 478 RRAEAVKTVQQQAQAREEVEDAVEVRLSKDEQLQQRRANQRLGAEVMSQRIREMSDNDPR 537
RR E K Q+QAQ R+E E+AVEVRLSKDEQLQQRRANQRLGAEVMSQRIREMSDNDPR
Sbjct: 485 RRVEEAKAAQEQAQVRQETEEAVEVRLSKDEQLQQRRANQRLGAEVMSQRIREMSDNDPR 544

Query: 538 VVALVIRQWINNDHE 552
VVALVIRQW++NDHE
Sbjct: 545 VVALVIRQWMSNDHE 559


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1873  FLGMOTORFLIG  341    e-119    Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 341 bits (876), Expect = e-119
Identities = 117/329 (35%), Positives = 197/329 (59%), Gaps = 2/329 (0%)

Query: 1 MSNLTGTDKSVILLMTIGEDRAAEVFKHLSQREVQTLSAAMANVTQISNKQLTDVLAEFE 60
+S LTG K+ ILL++IG + +++VFK+LSQ E+++L+ +A + I+++ +VL EF+
Sbjct: 12 VSALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITSELKDNVLLEFK 71

Query: 61 QEAEQFAALNINANDYLRSVLVKALGEERAASLLEDILETRDTASGIETLNFMEPQSAAD 120
+ + DY R +L K+LG ++A ++ + L + + E + +P + +
Sbjct: 72 ELMMAQEFIQKGGIDYARELLEKSLGTQKAVDIINN-LGSALQSRPFEFVRRADPANILN 130

Query: 121 LIRDEHPQIIATILVHLKRAQAADILALFDERLRHDVMLRIATFGGVQPAALAELTEVLN 180
I+ EHPQ IA IL +L +A+ IL+ ++ +V RIA P + E+ VL
Sbjct: 131 FIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEVVREVERVLE 190

Query: 181 GLLDGQ-NLKRSKMGGVRTAAEIINLMKTQQEEAVITAVREFDGELAQKIIDEMFLFENL 239
L + + GGV EIIN+ + E+ +I ++ E D ELA++I +MF+FE++
Sbjct: 191 KKLASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEIKKKMFVFEDI 250

Query: 240 VDVDDRSIQRLLQEVDSESLLIALKGAEQPLREKFLRNMSQRAADILRDDLANRGPVRLS 299
V +DDRSIQR+L+E+D + L ALK + P++EK +NMS+RAA +L++D+ GP R
Sbjct: 251 VLLDDRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKEDMEFLGPTRRK 310

Query: 300 QVENEQKAILLIVRRLAETGEMVIGSGED 328
VE Q+ I+ ++R+L E GE+VI G +
Sbjct: 311 DVEESQQKIVSLIRKLEEQGEIVISRGGE 339


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1874  FLGFLIH       370    e-134    Flagellar assembly protein FliH signature.
		>FLGFLIH#Flagellar assembly protein FliH signature.

Length = 228

Score = 370 bits (951), Expect = e-134
Identities = 224/228 (98%), Positives = 227/228 (99%)

Query: 1 MSDNLPWKTWMPDDLAPPQAEFVPMVEPEETIIEEAEPSLEQQLAQLQMQAHEQGYQAGI 60
MSDNLPWKTW PDDLAPPQAEFVP+VEPEETIIEEAEPSLEQQLAQLQMQAHEQGYQAGI
Sbjct: 1 MSDNLPWKTWTPDDLAPPQAEFVPIVEPEETIIEEAEPSLEQQLAQLQMQAHEQGYQAGI 60

Query: 61 AEGRQQGHEQGYQEGLAQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTLDALDSVIASRL 120
AEGRQQGH+QGYQEGLAQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTLDALDSVIASRL
Sbjct: 61 AEGRQQGHKQGYQEGLAQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTLDALDSVIASRL 120

Query: 121 MQMALEAARQVIGQTPTMDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRVDDMLGAT 180
MQMALEAARQVIGQTPT+DNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRVDDMLGAT
Sbjct: 121 MQMALEAARQVIGQTPTVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRVDDMLGAT 180

Query: 181 LSLHGWRLRGDPTLHPGGCKVSADEGDLDASVATRWQELCRLAAPGVV 228
LSLHGWRLRGDPTLHPGGCKVSADEGDLDASVATRWQELCRLAAPGVV
Sbjct: 181 LSLHGWRLRGDPTLHPGGCKVSADEGDLDASVATRWQELCRLAAPGVV 228


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1876  FLGFLIJ       202    2e-70    Flagellar FliJ protein signature.
		>FLGFLIJ#Flagellar FliJ protein signature.

Length = 147

Score = 202 bits (515), Expect = 2e-70
Identities = 146/147 (99%), Positives = 147/147 (100%)

Query: 1 MAEHGALATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAG 60
MAEHGALATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAG
Sbjct: 1 MAEHGALATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAG 60

Query: 61 MTSNRWINYQQFIQTLEKAITQHRQQLNQWTQKVDIALNSWREKKQRLQAWQTLQERQST 120
+TSNRWINYQQFIQTLEKAITQHRQQLNQWTQKVDIALNSWREKKQRLQAWQTLQERQST
Sbjct: 61 ITSNRWINYQQFIQTLEKAITQHRQQLNQWTQKVDIALNSWREKKQRLQAWQTLQERQST 120

Query: 121 AALLAENRLDQKKMDEFAQRAAMRKPE 147
AALLAENRLDQKKMDEFAQRAAMRKPE
Sbjct: 121 AALLAENRLDQKKMDEFAQRAAMRKPE 147


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1877  FLGHOOKFLIK   469    e-168    Flagellar hook-length control protein signature.
		>FLGHOOKFLIK#Flagellar hook-length control protein signature.

Length = 375

Score = 469 bits (1208), Expect = e-168
Identities = 367/375 (97%), Positives = 370/375 (98%)

Query: 1 MIRLAPLITADVDTTTLPGGKASDAAQDFLALLSEALAGETTTDKAAPQLLVATDKPTTK 60
MIRLAPLITADVDTTTLPGGKASDAAQDFLALLSEALAGETTTDKAAPQLLVATDKPTTK
Sbjct: 1 MIRLAPLITADVDTTTLPGGKASDAAQDFLALLSEALAGETTTDKAAPQLLVATDKPTTK 60

Query: 61 GEPLVSDIVSDAQQADLLIPVDETLPVINDEQSTSTPLTTAQTMTLAAVADKNTTKDEKA 120
GEPL+SDIVSDAQQA+LLIPVDET PVINDEQSTSTPLTTAQTM LAAVADKNTTKDEKA
Sbjct: 61 GEPLISDIVSDAQQANLLIPVDETPPVINDEQSTSTPLTTAQTMALAAVADKNTTKDEKA 120

Query: 121 DDLNEDVTASLSALFAMLPGFDNTPKVTDAPSTVLPAEKPTLFTKLTSAQLTTAQPDDAP 180
DDLNEDVTASLSALFAMLPGFDNTPKVTDAPSTVLP EKPTLFTKLTS QLTTAQPDDAP
Sbjct: 121 DDLNEDVTASLSALFAMLPGFDNTPKVTDAPSTVLPTEKPTLFTKLTSEQLTTAQPDDAP 180

Query: 181 GTPAQPLTPLVAEAQSKAEVISTPSPVTADASPLITPHQTQPLPTVAAPVLSAPLGSHEW 240
GTPAQPLTPLVAEAQSKAEVISTPSPVTA ASPLITPHQTQPLPTVAAPVLSAPLGSHEW
Sbjct: 181 GTPAQPLTPLVAEAQSKAEVISTPSPVTAAASPLITPHQTQPLPTVAAPVLSAPLGSHEW 240

Query: 241 QQSLSQHISLFTRQGQQSAELRLHPQDLGEVQISLKVDDNQAQIQMISPHQHVRAALEAA 300
QQSLSQHISLFTRQGQQSAELRLHPQDLGEVQISLKVDDNQAQIQM+SPHQHVRAALEAA
Sbjct: 241 QQSLSQHISLFTRQGQQSAELRLHPQDLGEVQISLKVDDNQAQIQMVSPHQHVRAALEAA 300

Query: 301 LPVLRTQLAESGIQLGQSNISGESFSGQQQAASQQQQSQRTANHEPLAGEDDDTLPVPVS 360
LPVLRTQLAESGIQLGQSNISGESFSGQQQAASQQQQSQRTANHEPLAGEDDDTLPVPVS
Sbjct: 301 LPVLRTQLAESGIQLGQSNISGESFSGQQQAASQQQQSQRTANHEPLAGEDDDTLPVPVS 360

Query: 361 LQGRVTGNSGVDIFA 375
LQGRVTGNSGVDIFA
Sbjct: 361 LQGRVTGNSGVDIFA 375


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1879  FLGMOTORFLIM  385    e-136    Flagellar motor switch protein FliM signature.
		>FLGMOTORFLIM#Flagellar motor switch protein FliM signature.

Length = 344

Score = 385 bits (989), Expect = e-136
Identities = 85/324 (26%), Positives = 148/324 (45%), Gaps = 10/324 (3%)

Query: 20 ILSQAEIDALLNGDS--EVKDEPTASVSGESDIRPYDPNTQRRVVRERLQALEIINERFA 77
+LSQ EID LL S + E +S I YD + +E+++ L +++E FA
Sbjct: 4 VLSQDEIDQLLTAISSGDASIEDARPISDTRKITLYDFRRPDKFSKEQMRTLSLMHETFA 63

Query: 78 RHFRMGLFNLLRRSPDITVGAIRIQPYHEFARNLPVPTNLNLIHLKPLRGTGLVVFSPSL 137
R L LR + V ++ Y EF R++P P+ L +I + PL+G ++ PS+
Sbjct: 64 RLTTTSLSAQLRSMVHVHVASVDQLTYEEFIRSIPTPSTLAVITMDPLKGNAVLEVDPSI 123

Query: 138 VFIAVDNLFGGDGRFPTKVEGREFTHTEQRVINRMLKLALEGYSDAWKAINPLEVEYVRS 197
F +D LFGG G+ KV+ R+ T E V+ ++ L ++W + L +
Sbjct: 124 TFSIIDRLFGGTGQ-AAKVQ-RDLTDIENSVMEGVIVRILANVRESWTQVIDLRPRLGQI 181

Query: 198 EMQVKFTNITTSPNDIVVNTPFHVEIGNLTGEFNICLPFSMIEPLRELLVNPPLENS--R 255
E +F I P+++VV ++G G N C+P+ IEP+ L + +S R
Sbjct: 182 ETNPQFAQI-VPPSEMVVLVTLETKVGEEEGMMNFCIPYITIEPIISKLSSQFWFSSVRR 240

Query: 256 NEDQNWRDNLVRQVQHSQLELVANFADISLRLSQILKLKPGDVLPIEKP---DRIIAHVD 312
+ + L ++ +++VA + L + IL L+ GD++ + D + +
Sbjct: 241 SSTTQYMGVLRDKLSTVDMDVVAEVGSLRLSVRDILGLRVGDIIRLHDTHVGDPFVLSIG 300

Query: 313 GVPVLTSQYGTLNGQYALRIEHLI 336
Q G + + A +I I
Sbjct: 301 NRKKFLCQPGVVGKKIAAQILERI 324


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1880  FLGMOTORFLIN  211    4e-74    Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 211 bits (538), Expect = 4e-74
Identities = 125/137 (91%), Positives = 133/137 (97%)

Query: 1 MSDMNNPADDNNGAMDDLWAEALSEQKSTSGKSAADAVFQQFGGGDVSGTLQDIDLIMDI 60
MSDMNNP+D+N GA+DDLWA+AL+EQK+T+ KSAADAVFQQ GGGDVSG +QDIDLIMDI
Sbjct: 1 MSDMNNPSDENTGALDDLWADALNEQKATTTKSAADAVFQQLGGGDVSGAMQDIDLIMDI 60

Query: 61 PVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYGV 120
PVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYGV
Sbjct: 61 PVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYGV 120

Query: 121 RITDIITPSERMRRLSR 137
RITDIITPSERMRRLSR
Sbjct: 121 RITDIITPSERMRRLSR 137


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1882  FLGBIOSNFLIP  333    e-119    Escherichia coli: Flagellar biosynthetic protein Fl...
		>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature.

Length = 245

Score = 333 bits (856), Expect = e-119
Identities = 244/245 (99%), Positives = 244/245 (99%)

Query: 1 MRRLFSVAPVLLWLITPLAFAQLPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILLM 60
MRRL SVAPVLLWLITPLAFAQLPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILLM
Sbjct: 1 MRRLLSVAPVLLWLITPLAFAQLPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILLM 60

Query: 61 MTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEEK 120
MTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEEK
Sbjct: 61 MTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEEK 120

Query: 121 ISMQEALEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSELK 180
ISMQEALEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSELK
Sbjct: 121 ISMQEALEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSELK 180

Query: 181 TAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLA 240
TAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLA
Sbjct: 181 TAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLA 240

Query: 241 QSFYS 245
QSFYS
Sbjct: 241 QSFYS 245


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1883  TYPE3IMQPROT  67     1e-18    Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature.

Length = 86

Score = 67.1 bits (164), Expect = 1e-18
Identities = 22/78 (28%), Positives = 42/78 (53%)

Query: 4 ESVMMMGTEAMKVALALAAPLLLVALVTGLIISILQAATQINEMTLSFIPKIIAVFIAII 63
+ ++ G +A+ + L L+ +VA + GL++ + Q TQ+ E TL F K++ V + +
Sbjct: 2 DDLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLF 61

Query: 64 IAGPWMLNLLLDYVRTLF 81
+ W +LL Y R +
Sbjct: 62 LLSGWYGEVLLSYGRQVI 79


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1884  TYPE3IMRPROT  202    6e-67    Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature.

Length = 261

Score = 202 bits (516), Expect = 6e-67
Identities = 256/261 (98%), Positives = 259/261 (99%)

Query: 1 MMQVTSDQWLSWLSLYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPA 60
M+QVTS+QWLSWL+LYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPA
Sbjct: 1 MLQVTSEQWLSWLNLYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPA 60

Query: 61 NDVPVFSFFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPGSHL 120
NDVPVFSFFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDP SHL
Sbjct: 61 NDVPVFSFFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHL 120

Query: 121 NMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGSEPLNSNAFLALTKAGSLIF 180
NMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIG EPLNSNAFLALTKAGSLIF
Sbjct: 121 NMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLIF 180

Query: 181 LNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFC 240
LNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFC
Sbjct: 181 LNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFC 240

Query: 241 EHLFSEIFNLLADIISELPLI 261
EHLFSEIFNLLADIISELPLI
Sbjct: 241 EHLFSEIFNLLADIISELPLI 261


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1894  PF05272       29     0.045    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.3 bits (65), Expect = 0.045
Identities = 20/62 (32%), Positives = 29/62 (46%), Gaps = 15/62 (24%)

Query: 320 AKYILTPVLWKYLYRYAKKHQARGNGFGYGMVYPNNPQSVTRTLSARYYKDGAEILIDRG 379
A+Y + PVLW Y+ R+ K + G+ VY +R +DG+E RG
Sbjct: 166 ARYQVGPVLWGYVVRFIK---SDGDKLTLPYVY------------SRSQRDGSEAWKWRG 210

Query: 380 WD 381
WD
Sbjct: 211 WD 212


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1895  CARBMTKINASE  34     2e-04    Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 34.4 bits (79), Expect = 2e-04
Identities = 22/92 (23%), Positives = 36/92 (39%), Gaps = 9/92 (9%)

Query: 37 AQKLAADDDVDMLVILTACYFHDIVSLAKNHPQRQRSSILAAEETRRLLREEFVQFPA-- 94
+KLA + + D+ +ILT + +L + Q + EE R+ E F A
Sbjct: 219 GEKLAEEVNADIFMILTDV---NGAALYYGTEKEQWLREVKVEELRKYYEEG--HFKAGS 273

Query: 95 --EKIEAVCHAIAAHSFSAQIAPLTTEAKIVQ 124
K+ A I A IA L + ++
Sbjct: 274 MGPKVLAAIRFIEWGGERAIIAHLEKAVEALE 305


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1898  ECOLIPORIN    410    e-145    E.coli/Salmonella-type porin signature.
		>ECOLIPORIN#E.coli/Salmonella-type porin signature.

Length = 383

Score = 410 bits (1054), Expect = e-145
Identities = 199/388 (51%), Positives = 246/388 (63%), Gaps = 41/388 (10%)

Query: 1 MKRKVLAMLVPALLVAGAANAAEIYNKNGNKVELYGKMVGERILTDRESGEKGDNSQDTS 60
MKRKVLA+++PALL AGAA+AAEIYNK+GNK++LYGK+ G +D S D +
Sbjct: 1 MKRKVLALVIPALLAAGAAHAAEIYNKDGNKLDLYGKVDGLHYFSDDSS-----KDGDQT 55

Query: 61 YARVGVKGETQINPELTGYGQFELDLEASNRHNPDQ---TRLAYAGLSYKDFGSFDYGRN 117
Y RVG KGETQIN +LTGYGQ+E +++A+ TRLA+AGL + D+GSFDYGRN
Sbjct: 56 YMRVGFKGETQINDQLTGYGQWEYNVQANTTEGEGANSWTRLAFAGLKFGDYGSFDYGRN 115

Query: 118 VGVAYDAEAFTDMFVEWGGDSWAGTDLFMTNRTNGVATYRNTDFFGMVEGLNFALQYQGK 177
GV YD E +TDM E+GGDS+ D +MT R NGVATYRNTDFFG+V+GLNFALQYQGK
Sbjct: 116 YGVLYDVEGWTDMLPEFGGDSYTYADNYMTGRANGVATYRNTDFFGLVDGLNFALQYQGK 175

Query: 178 NEGTGNY----------------KANGDGHGLSATYTID-GFSFAGAYANSDRTDWQSGD 220
NE NGDG G+S TY I GFS AY SDRT+ Q
Sbjct: 176 NESQSADDVNIGTNNRNNGDDIRYDNGDGFGISTTYDIGMGFSAGAAYTTSDRTNEQVNA 235

Query: 221 GK----GERAEVWALSTKYDANNVYAAVMYGESHNM-------NSDDGDVVNKTQNFEAV 269
G G++A+ W KYDANN+Y A MY E+ NM DG V NKTQNFE
Sbjct: 236 GGTIAGGDKADAWTAGLKYDANNIYLATMYSETRNMTPYGKTDKGYDGGVANKTQNFEVT 295

Query: 270 LQYQFDFGLRPSIGYSYSKALDVA----GYKDSDRLNYIEIGTWYYFNKNMNVYTAYQIN 325
QYQFDFGLRP++ + SK D+ D D + Y ++G YYFNKN + Y Y+IN
Sbjct: 296 AQYQFDFGLRPAVSFLMSKGKDLTYNNVNGDDKDLVKYADVGATYYFNKNFSTYVDYKIN 355

Query: 326 LLDKSD-YVLAHGLNTDDQLAVGIVYQF 352
LLD D + G++TDD +A+G+VYQF
Sbjct: 356 LLDDDDPFYKDAGISTDDIVALGMVYQF 383


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1901  PF06580       37     1e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 37.2 bits (86), Expect = 1e-04
Identities = 35/181 (19%), Positives = 64/181 (35%), Gaps = 37/181 (20%)

Query: 290 ENILFLARADKNNVLVKLDALS----------------LNKEVENLLDYL--EYLSDEKE 331
NI L D L +LS L E+ + YL + E
Sbjct: 180 NNIRALILEDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDR 239

Query: 332 IRFKVECNQQIFADKI---LLQRMLSNLIVNAIRYSPEKSRIHITSFLDANGSLNIDIAS 388
++F+ + N I ++ L+Q ++ N I + I P+ +I + D NG++ +++ +
Sbjct: 240 LQFENQINPAIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKD-NGTVTLEVEN 298

Query: 389 PGTKINEPEKLFRRFWRGDNSRHSVGQGLGLSLVKA-IAELHGGSATYHYLSKHNVFRIT 447
G+ + K G GL V+ + L+G A K
Sbjct: 299 TGSLALKNTKE--------------STGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAM 344

Query: 448 L 448
+
Sbjct: 345 V 345


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1902  HTHFIS        84     9e-21    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 84.5 bits (209), Expect = 9e-21
Identities = 30/117 (25%), Positives = 60/117 (51%), Gaps = 1/117 (0%)

Query: 39 KILLIEDNQRTQEWVTQGLSEAGYVIDAVSDGRDGLYLALKDDYALIILDIMLPGMDGWQ 98
IL+ +D+ + + Q LS AGY + S+ D L++ D+++P + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 99 ILQTLRTA-KQTPVICLTARDSVDDRVRGLDSGANDYLVKPFSFSELLARVRAQLRQ 154
+L ++ A PV+ ++A+++ ++ + GA DYL KPF +EL+ + L +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1911  BCTERIALGSPH  33     0.002    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 32.6 bits (74), Expect = 0.002
Identities = 19/86 (22%), Positives = 38/86 (44%), Gaps = 8/86 (9%)

Query: 2 IKKKGFTLLEVTIVL---GIGTLIAFMKFQDMRNDQEAVLADNVGTQIKQLGE--AVNRY 56
++++GFTLLE+ ++L G+ + + F R+D A Q++ + +
Sbjct: 1 MRQRGFTLLEMMLILLLMGVSAGMVLLAFPASRDDSAAQTLARFEAQLRFVQQRGLQTGQ 60

Query: 57 ---ISIRYDKISTLSSSNNQSSDPGP 79
+S+ D+ L +DP P
Sbjct: 61 FFGVSVHPDRWQFLVLEARDGADPAP 86


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1912  PilS_PF08805  73     8e-19    PilS N terminal
		>PilS_PF08805#PilS N terminal

Length = 185

Score = 73.4 bits (180), Expect = 8e-19
Identities = 47/179 (26%), Positives = 84/179 (46%), Gaps = 17/179 (9%)

Query: 7 KRKSKKGFSLLELLLVLGIIAALVVAAFIVYPKVQASQRAQAESNNIATIQAGVKALYTS 66
K++ KG +L+E+LLV+G+I L +A+ +Y VQ++ ++ E NN+ T+ A +K+L
Sbjct: 21 KKEQDKGATLMEVLLVVGVIVVLAASAYKLYSMVQSNIQSSNEQNNVLTVIANMKSLKFQ 80

Query: 67 AS-SFTGLTNTVAVQAKIFPDNMLSGSGTAAKPINAFKGNVTLAATATGPSSATGSSFTI 125
+ + T+ + P +M+ T A N + G+VT+ S+ SF +
Sbjct: 81 GRYTDSNYIKTL-YAQGLLPSDMI-ADTTGASAKNPWGGSVTITT------SSDKYSFNV 132

Query: 126 TYDNVPAAECVKIATAAAGNFYITTVGTKVVKAAGGTLDVAATAAACTNATSNTLVFTS 184
NVP C+ + A + + T +AA + SNTL F++
Sbjct: 133 VEANVPQKNCMAMVNA--------LRSSSAISKINNTSTSTVSAATVCASDSNTLTFST 183


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1914  CHANLCOLICIN  30     0.022    Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 30.0 bits (67), Expect = 0.022
Identities = 23/85 (27%), Positives = 41/85 (48%), Gaps = 1/85 (1%)

Query: 184 ERIDHRSLRTQCADALAQAE-EAFSAEEKAFWLAKATETNRPAMQRVHRAKWNDTESQEQ 242
E + H + RT A LA A A AE++ LAKA E R + +A + +++
Sbjct: 100 EALRHNASRTPSATELAHANNAAMQAEDERLRLAKAEEKARKEAEAAEKAFQEAEQRRKE 159

Query: 243 RAAEQAQRDQQIEEAKKVYTTFSEL 267
E+A+ ++Q++ A+ + L
Sbjct: 160 IEREKAETERQLKLAEAEEKRLAAL 184


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1918  ACRIFLAVINRP  27     0.006    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 27.5 bits (61), Expect = 0.006
Identities = 9/48 (18%), Positives = 17/48 (35%), Gaps = 1/48 (2%)

Query: 29 VLYGTYPGWYAAVVLLLTFGLSTLIGMSTGMAGATISLPIIAVVGFIA 76
L Y W V ++L L ++G+ + +VG +
Sbjct: 886 CLAALYESWSIPVSVMLVVPL-GIVGVLLAATLFNQKNDVYFMVGLLT 932


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1928  TACYTOLYSIN   30     0.001    Bacterial thiol-activated pore-forming cytolysin sig...
		>TACYTOLYSIN#Bacterial thiol-activated pore-forming cytolysin signature.

Length = 574

Score = 30.3 bits (68), Expect = 0.001
Identities = 13/53 (24%), Positives = 22/53 (41%), Gaps = 12/53 (22%)

Query: 67 WFFTWKD------TGIQ-PGTAFVSSVVAGICFGVLMAAYHWWRKVVN--NLP 110
W W T I + ++A C G+ A+ WWRKV++ ++
Sbjct: 502 WDNNWYSKTSPFSTVIPLGANSRNIRIMARECTGL---AWEWWRKVIDERDVK 551


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1929  SHAPEPROTEIN  25     0.034    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 25.1 bits (55), Expect = 0.034
Identities = 8/25 (32%), Positives = 14/25 (56%)

Query: 35 SVVNERREEYYQEIGEKKAHKLKMK 59
+++N R Y IGE A ++K +
Sbjct: 197 AIINYVRRNYGSLIGEATAERIKHE 221


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1942  ISCHRISMTASE  51     2e-08    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 51.2 bits (122), Expect = 2e-08
Identities = 22/70 (31%), Positives = 44/70 (62%)

Query: 22 QQLRERLIQELNLTPQQLHEESNLIQAGLDSIRLMRWLHWFRKNGYRLTLRELYAAPTLA 81
+ +R+++ + L TP+ + ++ +L+ GLDS+R+M + +R+ G +T EL PT+
Sbjct: 233 ENIRKQIAELLQETPEDITDQEDLLDRGLDSVRIMTLVEQWRREGAEVTFVELAERPTIE 292

Query: 82 AWNQLMLSRS 91
W +L+ +RS
Sbjct: 293 EWQKLLTTRS 302


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1943  DHBDHDRGNASE  46     1e-06    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 45.8 bits (108), Expect = 1e-06
Identities = 32/156 (20%), Positives = 55/156 (35%), Gaps = 19/156 (12%)

Query: 1561 LVTGAFGGLGRLAVNWLREKGARRIALLAPRVDESWLRDVEGGQTRVCR------CDVGD 1614
+TGA G+G L +GA + A + L V R DV D
Sbjct: 12 FITGAAQGIGEAVARTLASQGAH---IAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 1615 AGQLATVLDDLAAN-GGIAGAIHAAGVLADAPLQELDDHQLAAVFAVKAQAASQLLQTLR 1673
+ + + + G I ++ AGVL + L D + A F+V + +++
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 1674 NH-----DGRYLILYSSAAAT----LGAPGQSAHAL 1700
+ G + + S+ A + A S A
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAA 164


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1948  INTIMIN       75     2e-17    Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 74.7 bits (183), Expect = 2e-17
Identities = 20/60 (33%), Positives = 29/60 (48%), Gaps = 3/60 (5%)

Query: 162 QQIASTSQLIGSLLAEDMNSEQAANIARGWASSQASGVMTDWLSRFGTARITLGVDEDFS 221
QQ AS + S +N + A + A G A +QAS + WL +GTA + L +F
Sbjct: 168 QQAASLGSQLQS---RSLNGDYAKDTALGIAGNQASSQLQAWLQHYGTAEVNLQSGNNFD 224


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1950  INTIMIN       56     3e-10    Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 55.8 bits (134), Expect = 3e-10
Identities = 62/263 (23%), Positives = 91/263 (34%), Gaps = 20/263 (7%)

Query: 175 IAVKAHVNDQFGNPVTHQPATFSAAPSSQMIISQNTVSTNTQGVAEVTMTPERNGSYTVK 234
I A V G + P +F+ S ++S N+ +TN G A VT+ ++ G V
Sbjct: 578 ITYTATVKKN-GVAQANVPVSFNIV-SGTAVLSANSANTNGSGKATVTLKSDKPGQVVVS 635

Query: 235 ASLANGASLEKQLEAI---DEKLTLTSSPLIGVNAPKGATLTATLT---SANGTPVEGQV 288
A A S I K ++T A T T PV Q
Sbjct: 636 AKTAEMTSALNANAVIFVDQTKASITEIKADKTTAVANGQDAITYTVKVMKGDKPVSNQE 695

Query: 289 INFSVTLEGATLSGGKVRTNSSGQAPVVLTSNKVGTYTVTASFHNGVTIQTQTTVKVTGN 348
+ F+ TL LS +T+++G A V LTS G V+A + V+
Sbjct: 696 VTFTTTL--GKLSNSTEKTDTNGYAKVTLTSTTPGKSLVSARVSDVAVDVKAPEVEFFTT 753

Query: 349 PSTAHVASFIADPSTIAATNSDLSTLKATVEDGSGNL-IEGLTVYFALKSGSTTLTSLTA 407
+ I T ++ G NL G + +S + + S
Sbjct: 754 LTID------DGNIEIVGTGVKGKLPTVWLQYGQVNLKASGGNGKYTWRSANPAIAS--- 804

Query: 408 VTDQNGIATTSVKGEITGSVTVS 430
V +G T KG T SV S
Sbjct: 805 VDASSGQVTLKEKGTTTISVISS 827



Score = 52.4 bits (125), Expect = 3e-09
Identities = 46/170 (27%), Positives = 65/170 (38%), Gaps = 7/170 (4%)

Query: 271 TLTATLTSANGTPVEGQVINFSVTLEGATLSGGKVRTNSSGQAPVVLTSNKVGTYTVTAS 330
T TAT+ NG ++F++ A LS TN SG+A V L S+K G V+A
Sbjct: 579 TYTATVKK-NGVAQANVPVSFNIVSGTAVLSANSANTNGSGKATVTLKSDKPGQVVVSAK 637

Query: 331 FHNGV-TIQTQTTVKVTGNPSTAHVASFIADPSTIAATNSDLSTLKATVEDGSGNLIEGL 389
+ + V + A + AD +T A D T V G +
Sbjct: 638 TAEMTSALNANAVIFVDQ--TKASITEIKADKTTAVANGQDAITYTVKVMKG-DKPVSNQ 694

Query: 390 TVYFALKSGSTTLTSLTAVTDQNGIATTSVKGEITGSVTVSAVTSAGGMQ 439
V F G + + T TD NG A ++ G VSA S +
Sbjct: 695 EVTFTTTLGKLSNS--TEKTDTNGYAKVTLTSTTPGKSLVSARVSDVAVD 742



Score = 51.2 bits (122), Expect = 7e-09
Identities = 51/233 (21%), Positives = 89/233 (38%), Gaps = 16/233 (6%)

Query: 13 AVTDADGKAKVTLKGTKAGAHTVTASMVGGKS--EQLVVNFTADTLTAQVNLNVTEDNFI 70
A T+ GKA VTLK K G V+A S V F T + + + +
Sbjct: 612 ANTNGSGKATVTLKSDKPGQVVVSAKTAEMTSALNANAVIFVDQTKASITEIKADKTTAV 671

Query: 71 ANNIGMTRLQATVTDGNGNPVEGIKVNFRGTSVTLSSTSVETDDQVFAEILVTSTEVGLK 130
AN V PV +V F T LS+++ +TD +A++ +TST G
Sbjct: 672 ANGQDAITYTVKVMK-GDKPVSNQEVTFTTTLGKLSNSTEKTDTNGYAKVTLTSTTPGKS 730

Query: 131 TVSASLADKPTEVISRLLN----AKVDVNSATI----TSQEIPEGQVMVAQDIAVKAHVN 182
VSA ++D +V + + +D + I ++P + Q + N
Sbjct: 731 LVSARVSDVAVDVKAPEVEFFTTLTIDDGNIEIVGTGVKGKLPTVWLQYGQVNLKASGGN 790

Query: 183 DQFGNPVTHQPATFSAAPSSQMIISQNTVSTNTQGVAEVTMTPERNGSYTVKA 235
++ + A S Q+ + + +T + V + + +YT+
Sbjct: 791 GKYTWRSANPAIASVDASSGQVTLKEKGTTTIS-----VISSDNQTATYTIAT 838



Score = 40.1 bits (93), Expect = 2e-05
Identities = 35/213 (16%), Positives = 63/213 (29%), Gaps = 18/213 (8%)

Query: 4 NFTLSDGDKAVTDADGKAKVTLKGTKAGAHTVTASMVGGKSE--QLVVNFTADTLTAQVN 61
TD +G AKVTL T G V+A + + V F N
Sbjct: 701 TLGKLSNSTEKTDTNGYAKVTLTSTTPGKSLVSARVSDVAVDVKAPEVEFFTTLTIDDGN 760

Query: 62 LNVTEDNFIANNIGMTRLQATVTDGNGN-PVEGIKVNFRGTSVTLSSTSVETDDQVFAEI 120
+ + + + + G N G + S + SV+
Sbjct: 761 IEI-----VGTGVKGKLPTVWLQYGQVNLKASGGNGKYTWRSANPAIASVDASSGQ---- 811

Query: 121 LVTSTEVGLKTVSASLADKPTEVISRLLNAKVDVNSATITSQEIPEGQVMVAQDIAVKAH 180
VT E G T+S +D T + + + ++ + V ++
Sbjct: 812 -VTLKEKGTTTISVISSDNQT--ATYTIATPNSLIVPNMSKRVTYNDAVNTCKNFGG--- 865

Query: 181 VNDQFGNPVTHQPATFSAAPSSQMIISQNTVST 213
N + + + AA + S T+ +
Sbjct: 866 KLPSSQNELENVFKAWGAANKYEYYKSSQTIIS 898


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1951  INTIMIN       28     0.022    Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 27.7 bits (61), Expect = 0.022
Identities = 22/129 (17%), Positives = 46/129 (35%), Gaps = 6/129 (4%)

Query: 11 KISAIDYSQNINGDYKATVTGGGEGIATLIPVLNGVHQAGLSTTIEFISAETRPMTGTVS 70
K+S + NG K T+T G + + ++ V + +EF G +
Sbjct: 704 KLSNSTEKTDTNGYAKVTLTSTTPGKSLVSARVSDVAVDVKAPEVEFF-TTLTIDDGNIE 762

Query: 71 VNSANLPTASFPSQGFTGAYYQLNNDNFAPGKTAADYSFSSSASWVGVDATGKVTFKNDG 130
+ + P+ L + G + ++ A ++G+VT K G
Sbjct: 763 IVGTGV-KGKLPTVWLQYGQVNL---KASGGNGKYTWRSANPAIASVDASSGQVTLKEKG 818

Query: 131 DSNTVIITA 139
+ T+ + +
Sbjct: 819 -TTTISVIS 826


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1954  TCRTETB       33     0.002    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 33.3 bits (76), Expect = 0.002
Identities = 39/259 (15%), Positives = 96/259 (37%), Gaps = 18/259 (6%)

Query: 79 LGGVIFGHFGDRLGRKRMLMLTVWMMGIATALIGILPSFSTIGWWAPILLVTLRAIQGFA 138
+G ++G D+LG KR+L+ + + + + + SF ++ I+ ++ A
Sbjct: 64 IGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSL----LIMARFIQGAGAAA 119

Query: 139 VGGEWGGAALLSVESAPKNKK-AFYSSGVQVGYGVGLLLSTGLVSLISMMTTDEQFLSWG 197
+ + K S V +G GVG + + I
Sbjct: 120 FPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYI------------H 167

Query: 198 WRIPFLFSIVLVLGALWVRNGMEESAEFEQQQHNQAAAKKRIPVIEALLRHPGAFLKIIA 257
W L ++ ++ ++ +++ + + + ++ +L + +
Sbjct: 168 WSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLI 227

Query: 258 LRLCELLTMYIVTAFALNYSTQNMGLPRELFLNIGLLVGGLSCLTIPCFAWLADRFGRRR 317
+ + L +++ + + GL + + IG+L GG+ T+ F + +
Sbjct: 228 VSVLSFL-IFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDV 286

Query: 318 VYITGALIGTLSAFPFFMA 336
++ A IG++ FP M+
Sbjct: 287 HQLSTAEIGSVIIFPGTMS 305


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1965  BICOMPNTOXIN  33     0.002    Staphylococcal bi-component toxin signature.
		>BICOMPNTOXIN#Staphylococcal bi-component toxin signature.

Length = 315

Score = 33.3 bits (76), Expect = 0.002
Identities = 6/41 (14%), Positives = 16/41 (39%)

Query: 303 LAADNRILYASGWFIDQNQGPYISHGGQNPNFSSCIALRPD 343
+ +F+ ++ P + G NP+F + ++
Sbjct: 210 VGYKPHSKDPRDYFVPDSELPPLVQSGFNPSFIATVSHEKG 250


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1967  ISCHRISMTASE  42     9e-06    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 42.3 bits (99), Expect = 9e-06
Identities = 20/87 (22%), Positives = 40/87 (45%), Gaps = 3/87 (3%)

Query: 927 DVRQMVATVRNTAPASGSER-LGDAAIRHSVRVCVEGALEQTEFDDNENLYVLGLDSIKS 985
++ A V+ T+ +G + IR + ++ E + D E+L GLDS++
Sbjct: 209 QLQNAPADVQKTSANTGKKNVFTCENIRKQIAELLQETPE--DITDQEDLLDRGLDSVRI 266

Query: 986 IQIAAQLRHHGWTMSAVQVMECGTVNA 1012
+ + Q R G ++ V++ E T+
Sbjct: 267 MTLVEQWRREGAEVTFVELAERPTIEE 293


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1979  DHBDHDRGNASE  51     2e-08    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 51.2 bits (122), Expect = 2e-08
Identities = 32/167 (19%), Positives = 58/167 (34%), Gaps = 7/167 (4%)

Query: 2188 IPGNVLWIIGGEKGIGRMIGEALAQREGVRVVLSSRTGYHHEAVQQDAL------DVIHC 2241
I G + +I G +GIG + LA +G + E V +
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLAS-QGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPA 64

Query: 2242 DVTQAEAVRACLATLLERYGRLDGVIFAADATTTLTLHQLSESALRDTLTVKERGTANVL 2301
DV + A+ A + G +D ++ A +H LS+ T +V G N
Sbjct: 65 DVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNAS 124

Query: 2302 HALAQRNLLDERLLLLFCNSLAAVNAEIGQTGYATASAYLDALAQQL 2348
++++ + ++ S A YA++ A + L
Sbjct: 125 RSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCL 171


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_2000  FLGPRINGFLGI  30     0.010    Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 29.5 bits (66), Expect = 0.010
Identities = 13/39 (33%), Positives = 20/39 (51%)

Query: 48 EKNVQIADQVIIDESAGEVVIGANTRICHGAVIQGPVVI 86
+V+I+E G +VIGA+ RI AV G + +
Sbjct: 254 TVETDTPAKVVINERTGTIVIGADVRISRVAVSYGTLTV 292


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_2033  PF05272       33     0.002    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 32.7 bits (74), Expect = 0.002
Identities = 12/22 (54%), Positives = 13/22 (59%)

Query: 32 VTVLLGPNGCGKSTLLRALAGL 53
VL G G GKSTL+ L GL
Sbjct: 598 SVVLEGTGGIGKSTLINTLVGL 619


24  ECP_2061  ECP_2089  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
ECP_2061  0       22       3.620747    antitoxin YefM
ECP_2062  -1      23       4.026706    ATP phosphoribosyltransferase
ECP_2063  0       23       3.834253    histidinol dehydrogenase
ECP_2064  -1      26       2.960975    histidinol-phosphate aminotransferase
ECP_2065  -2      19       1.022784    imidazole glycerol-phosphate
ECP_2066  -2      17       -1.301182   imidazole glycerol phosphate synthase subunit
ECP_2067  -2      16       -1.878077   1-(5-phosphoribosyl)-5-[(5-
ECP_2068  -2      16       -1.469569   imidazole glycerol phosphate synthase subunit
ECP_2069  -1      15       -4.257902   bifunctional phosphoribosyl-AMP
ECP_2070  0       22       -7.459586   chain length determinant protein
ECP_2071  2       26       -8.802606   UDP-glucose 6-dehydrogenase
ECP_2072  3       32       -11.214068  6-phosphogluconate dehydrogenase
ECP_2073  3       41       -14.108921  phosphomannomutase
ECP_2074  8       57       -18.467135  mannose-1-phosphate guanylyltransferase
ECP_2075  8       59       -20.063407  glycosyltransferase
ECP_2076  7       59       -19.918305  UDP-glucose 4-epimerase
ECP_2077  6       52       -17.830787  glycosyltransferase
ECP_2078  4       41       -14.207416  glycosyltransferase
ECP_2079  2       31       -10.691674  glycosyltransferase
ECP_2080  1       23       -7.253469   antigen polymerase
ECP_2081  -1      15       -2.908761   O antigen flippase
ECP_2082  -1      19       0.876901    UTP-glucose-1-phosphate uridylyltransferase
ECP_2083  -1      22       1.528837    colanic acid biosynthesis protein
ECP_2084  0       24       2.967262    colanic acid biosynthesis glycosyl transferase
ECP_2085  0       24       3.240444    pyruvyl transferase
ECP_2086  0       24       3.397605    colanic acid exporter
ECP_2087  -1      24       3.757164    UDP-glucose lipid carrier transferase
ECP_2088  -1      24       3.816235    phosphomannomutase
ECP_2089  -1      21       3.103878    mannose-1-phosphate guanylyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_2076  NUCEPIMERASE  174    3e-54    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 174 bits (444), Expect = 3e-54
Identities = 74/351 (21%), Positives = 143/351 (40%), Gaps = 40/351 (11%)

Query: 1 MNILVTGGAGYIGSHTAIELLNAGHEIIVLDNFSNASYKCIEK---IKEITRRDFITITG 57
M LVTG AG+IG H + LL AGH+++ +DN N Y K ++ + + F
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNL-NDYYDVSLKQARLELLAQPGFQFHKI 59

Query: 58 DAGCRKTLSAIFEKHAIDIVIHFAGFKSVSESKSEPLKYYQNNVGVTITLLQVMEEYRIK 117
D R+ ++ +F + V +V S P Y +N+ + +L+ +I+
Sbjct: 60 DLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQ 119

Query: 118 KFIFSSSATVYGEPEIIPIPETAKIGGTTNPYGTSKYFVEKILEDVSSTGKLDIICLRYF 177
+++SS++VYG +P + + Y +K E + S L LR+F
Sbjct: 120 HLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFF 179

Query: 178 NPVGAHSSGKIGEAPSGIPNNLVPYLL--DVASGKRDKLFIYGNDYPTNDGTGVRDFIHV 235
G P G P ++ + + GK ++ N G RDF ++
Sbjct: 180 TVYG----------PWGRP-DMALFKFTKAMLEGKSIDVY--------NYGKMKRDFTYI 220

Query: 236 VDLAKGHLAAMNYL---------------SINSGYNIFNLGTGKGYSVLELITTFEKLTN 280
D+A+ + + + + + Y ++N+G +++ I E
Sbjct: 221 DDIAEAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALG 280

Query: 281 IKVNKSFIERRAGDVASCWADADKANSLLDWQAEQTLEQMLLDSWRWKKNY 331
I+ K+ + + GDV AD ++ + E T++ + + W +++
Sbjct: 281 IEAKKNMLPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRDF 331


25  ECP_2111  ECP_2129  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
ECP_2111  -2      17       3.947476    hypothetical protein
ECP_2112  -2      18       4.310149    hypothetical protein
ECP_2113  -2      17       3.976071    hypothetical protein
ECP_2114  -2      17       4.014197    multidrug efflux system subunit MdtA
ECP_2115  -2      18       3.886296    multidrug efflux system subunit MdtB
ECP_2116  -2      14       2.558738    multidrug efflux system subunit MdtC
ECP_2117  -2      13       -2.698916   multidrug efflux system protein MdtE
ECP_2118  0       22       -5.406269   signal transduction histidine-protein kinase
ECP_2119  0       31       -8.922560   DNA-binding transcriptional regulator BaeR
ECP_2120  0       25       -7.649990   hypothetical protein
ECP_2121  0       27       -8.145886   hypothetical protein
ECP_2122  0       26       -7.520723   hypothetical protein
ECP_2123  -2      19       -5.773625   hypothetical protein
ECP_2124  -1      16       -4.408162   hypothetical protein
ECP_2125  2       13       -1.393379   protease YegQ
ECP_2126  3       21       -3.099722   hypothetical protein
ECP_2127  3       21       -2.980202   lipid kinase
ECP_2128  3       21       -3.220189   galactitol utilization operon repressor
ECP_2129  3       21       -2.234152   galactitol-1-phosphate dehydrogenase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
ECP_2114  RTXTOXIND  49     2e-08    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.
Length = 478

Score = 48.7 bits (116), Expect = 2e-08
Identities = 48/369 (13%), Positives = 106/369 (28%), Gaps = 87/369 (23%)

Query: 4 SYKSRWVIVIVVVIAAIAAFWFWQGRNDSQSAAPG-----ATKQAQQSPAGGR------- 51
S + R V ++ IA G+ + + A G + +
Sbjct: 54 SRRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVK 113

Query: 52 --RGMRAG-PLA---PVQAATAVEQAVPRYLTGLGTITAANTVTVRSRVDG--QLMALHF 103
+R G L + A + L T ++ ++ +L
Sbjct: 114 EGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDE 173

Query: 104 QEGQQVKAGDLLAEI------------DPSQFKVALAQAQGQLA-------KDKATLANA 144
Q V ++L Q ++ L + + + + +
Sbjct: 174 PYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVE 233

Query: 145 RRDLARYQQLAKTNLVSRQELDAQQALVSETEGTIKADEASVA----------------- 187
+ L + L +++ + Q+ E ++ ++ +
Sbjct: 234 KSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVT 293

Query: 188 --------------------------SAQLQLDWSRITAPVDGRV-GLKQVDVGNQISSG 220
+ + S I APV +V LK G +++
Sbjct: 294 QLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTA 353

Query: 221 DTTGIVVITQTHPIDLVFTLPESDIATVVQAQKAGKPLVVEAWDRTNSKKL-SEGTLLSL 279
+T +V++ + +++ + DI + Q A + VEA+ T L + ++L
Sbjct: 354 ETL-MVIVPEDDTLEVTALVQNKDIGFINVGQNA--IIKVEAFPYTRYGYLVGKVKNINL 410

Query: 280 DNQIDATTG 288
D D G
Sbjct: 411 DAIEDQRLG 419


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_2115  ACRIFLAVINRP  919    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 919 bits (2376), Expect = 0.0
Identities = 300/1036 (28%), Positives = 513/1036 (49%), Gaps = 29/1036 (2%)

Query: 13 SRLFIMRPVATTLLMVAILLAGIIGYRALPVSALPEVDYPTIQVVTLYPGASPDVMTSAV 72
+ FI RP+ +L + +++AG + LPV+ P + P + V YPGA + V
Sbjct: 2 ANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTV 61

Query: 73 TAPLERQFGQMSGLKQMSSQS-SGGASVITLQFQLTLPLDVAEQEVQAAINAATNLLPSD 131
T +E+ + L MSS S S G+ ITL FQ D+A+ +VQ + AT LLP +
Sbjct: 62 TQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE 121

Query: 132 LPNPPVYSKVNPADPPIMTLAVTSTAMPMTQVE--DMVETRVAQKISQISGVGLVTLSGG 189
+ + S + +M S TQ + D V + V +S+++GVG V L G
Sbjct: 122 VQQQGI-SVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 190 QRPAVRVKLNAQAIAALGLTSESVRTAITGANVNSAKGSLDGP------SRAVTLSANDQ 243
Q A+R+ L+A + LT V + N A G L G ++ A +
Sbjct: 181 QY-AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239

Query: 244 MQSAEEYRQLII-AYQNGAPIRLGDVATVEQGAENSWLGAWANKEQAIVMNVQRQPGANI 302
++ EE+ ++ + +G+ +RL DVA VE G EN + A N + A + ++ GAN
Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299

Query: 303 ISTADSIRQMLPQLTESLPKSVKVTVLSDRTTNIRASVDDTQFELMMAIALVVMIIYLFL 362
+ TA +I+ L +L P+ +KV D T ++ S+ + L AI LV +++YLFL
Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359

Query: 363 RNIPATIIPGVAVPLSLIGTFAVMVFLDFSINNLTLMALTIATGFVVDDAIVVIENISRY 422
+N+ AT+IP +AVP+ L+GTFA++ +SIN LT+ + +A G +VDDAIVV+EN+ R
Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419

Query: 423 I-EKGEKPLAAALKGAGEIGFTIISLTFSLIAVLIPLLFMGDIVGRLFREFAITLAVAIL 481
+ E P A K +I ++ + L AV IP+ F G G ++R+F+IT+ A+
Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479

Query: 482 ISAVVSLTLTPMMCARML---SQESLRKQNRFSRASEKMFDRIIAAYGRGLAKVLNHPWL 538
+S +V+L LTP +CA +L S E + F FD + Y + K+L
Sbjct: 480 LSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539

Query: 539 TLSVALSTLLLSVLLWVFIPKGFFPVQDNGIIQGTLQAPQSSSFANMAQRQRQVADVILQ 598
L + + V+L++ +P F P +D G+ +Q P ++ + QV D L+
Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599

Query: 599 DPA--VQSLTSFVGVDGTNPSLNSARLQINLKPLDERDDR---VQKVIARLQTAVDKVPG 653
+ V+S+ + G + + N+ ++LKP +ER+ + VI R + + K+
Sbjct: 600 NEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIR- 658

Query: 654 VDLFLQPTQDLTIDTQVSRTQYQFTLQ---ATSLDALSTWVPQLMEKLQQLP-QLSDVSS 709
D F+ P I + T + F L DAL+ QL+ Q P L V
Sbjct: 659 -DGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRP 717

Query: 710 DWQDKGLVAYVNVDRDSASRLGISMADVDNALYNAFGQRLISTIYTQANQYRVVLEHNTE 769
+ + + VD++ A LG+S++D++ + A G ++ + ++ ++ + +
Sbjct: 718 NGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAK 777

Query: 770 NTPGLAALDTIRLTSSDGGVVPLSSIAKIEQRFAPLSINHLDQFPVTTISFNVPDNYSLG 829
+D + + S++G +VP S+ + + + P I S G
Sbjct: 778 FRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSG 837

Query: 830 DAVQAIMDTEKTLNLPVDITTQFQGSTLAFQSALGSTVWLIVAAVVAMYIVLGILYESFI 889
DA A+M+ + LP I + G + + + L+ + V +++ L LYES+
Sbjct: 838 DA-MALMENLAS-KLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWS 895

Query: 890 HPITILSTLPTAGVGALLALMIAGSELDVIAIIGIILLIGIVKKNAIMMIDFALAAEREQ 949
P++++ +P VG LLA + + DV ++G++ IG+ KNAI++++FA ++
Sbjct: 896 IPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKE 955

Query: 950 GMSPREAIYQACLLRFRPILMTTLAALLGALPLMLSTGVGAELRRPLGIGMVGGLIVSQV 1009
G EA A +R RPILMT+LA +LG LPL +S G G+ + +GIG++GG++ + +
Sbjct: 956 GKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATL 1015

Query: 1010 LTLFTTPVIYLLFDRL 1025
L +F PV +++ R
Sbjct: 1016 LAIFFVPVFFVVIRRC 1031


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_2116  ACRIFLAVINRP  916    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 916 bits (2368), Expect = 0.0
Identities = 288/1035 (27%), Positives = 503/1035 (48%), Gaps = 36/1035 (3%)

Query: 6 LFIYRPVATILLSVAITLCGILGFRMLPVAPLPQVDFPVIMVSASLPGASPETMASSVAT 65
FI RP+ +L++ + + G L LPVA P + P + VSA+ PGA +T+ +V
Sbjct: 4 FFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQ 63

Query: 66 PLERSLGRIAGVSEMTSSS-SLGSTRIILQFDFDRDINGAARDVQAAINAAQSLLPSGMP 124
+E+++ I + M+S+S S GS I L F D + A VQ + A LLP +
Sbjct: 64 VIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQ 123

Query: 125 SRPTYRKANPSDAPIMILTLTSDT--YSQGELYDFASTQLAPTISQIDGVGDVDVGGSSL 182
+ S + +M+ SD +Q ++ D+ ++ + T+S+++GVGDV + G+
Sbjct: 124 -QQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQY 182

Query: 183 PAVRVGLNPQALFNQGVSLDDVRTAISNANVRKPQG------ALEDDTHRWQIQTNDELK 236
A+R+ L+ L ++ DV + N + G AL I K
Sbjct: 183 -AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFK 241

Query: 237 TAAEYQPLIIHYN-NGGAVRLGDVATVTDSVQDVRNAGMTNAKPAILLMIRKLPEANIIQ 295
E+ + + N +G VRL DVA V ++ N KPA L I+ AN +
Sbjct: 242 NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALD 301

Query: 296 TVDSIRARLPELQSTIPAAIDLQIAQDRSPTIRASLEEVEQTLIISVALVILVVFLFLRS 355
T +I+A+L ELQ P + + D +P ++ S+ EV +TL ++ LV LV++LFL++
Sbjct: 302 TAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQN 361

Query: 356 GRATIIPAVAVPVSLIGTFAAMYLCGFSLNNLSLMALTIATGFVVDDAIVVLENIARHL- 414
RAT+IP +AVPV L+GTFA + G+S+N L++ + +A G +VDDAIVV+EN+ R +
Sbjct: 362 MRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMM 421

Query: 415 EAGMKPLQAALQGTREVGFTVLSMSLSLVAVFLPLLLMGGLPGRLLREFAVTLSVAIGIS 474
E + P +A + ++ ++ +++ L AVF+P+ GG G + R+F++T+ A+ +S
Sbjct: 422 EDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALS 481

Query: 475 LLVSLTLTPMMCGWMLKASKPREQKRLRGFG----RMLVALQQGYGKSLKWVLNHTRLVG 530
+LV+L LTP +C +LK + GF Y S+ +L T
Sbjct: 482 VLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYL 541

Query: 531 AVLLGTIALNIWLYISIPKTFFPEQDTGVLMGGIQADQSISFQ----AMRGKLQDFMKII 586
+ +A + L++ +P +F PE+D GV + IQ + + + ++K
Sbjct: 542 LIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNE 601

Query: 587 RD-DPAVDNVTGFT-GGSRVNSGMMFITLKPRGERS---ETAQQIIDRLRKKLAKEPGAN 641
+ +V V GF+ G N+GM F++LKP ER+ +A+ +I R + +L K
Sbjct: 602 KANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDGF 661

Query: 642 LFLMAVQDIRVGGRQANASYQYTLLSDDLAALREWEPKIRKKLATL-----PELADVNSD 696
+ + I G ++ L D + + R +L + L V +
Sbjct: 662 VIPFNMPAIVELGTATGFDFE---LIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPN 718

Query: 697 QEDNGAEMNLIYDRDTMARLGIDVQAANSLLNNAFGQRQISTIYQPMNQYKVVMEVDPRY 756
++ A+ L D++ LG+ + N ++ A G ++ K+ ++ D ++
Sbjct: 719 GLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKF 778

Query: 757 TQDISALEKMFVINNEGKAIPLSYFAKWQPANAPLSVNHQGLSAASTISFNLPTGKSLSD 816
++K++V + G+ +P S F + + I G S D
Sbjct: 779 RMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGD 838

Query: 817 ASAAIDRAMTQLGVPSTVRGSFAGTAQVFQETMNSQVILIIAAIATVYIVLGILYESYVH 876
A A ++ ++L P+ + + G + + + N L+ + V++ L LYES+
Sbjct: 839 AMALMENLASKL--PAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSI 896

Query: 877 PLTILSTLPSAGVGALLALELFNAPFSLIALIGIMLLIGIVKKNAIMMVDFALEAQRHGN 936
P++++ +P VG LLA LFN + ++G++ IG+ KNAI++V+FA +
Sbjct: 897 PVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEG 956

Query: 937 LTPQEAIFQACLLRFRPIMMTTLAALFGALPLVLSGGDGSELRQPLGITIVGGLVMSQLL 996
EA A +R RPI+MT+LA + G LPL +S G GS + +GI ++GG+V + LL
Sbjct: 957 KGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLL 1016

Query: 997 TLYTTPVVYLFFDRL 1011
++ PV ++ R
Sbjct: 1017 AIFFVPVFFVVIRRC 1031



Score = 80.7 bits (199), Expect = 2e-17
Identities = 76/448 (16%), Positives = 161/448 (35%), Gaps = 26/448 (5%)

Query: 592 VDNVTGFTGGS-RVNSGMMFITLKPRGERSETAQQIIDRLRKKLAKEPGANLFLMAVQDI 650
+DN+ + S S + +T + + Q+ ++L+ P + Q I
Sbjct: 72 IDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE----VQQQGI 127

Query: 651 RVGGRQANASYQYTLLSDDLAALREW-----EPKIRKKLATLPELADVNSDQEDNGAE-- 703
V ++ +SD+ ++ ++ L+ L + DV GA+
Sbjct: 128 SVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQL----FGAQYA 183

Query: 704 MNLIYDRDTMARLGID----VQAANSLLNNAFGQRQISTIYQPMNQYKVVMEVDPRYTQD 759
M + D D + + + + + + T P Q + R+
Sbjct: 184 MRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKNP 243

Query: 760 ISALEKMFVINNEGKAIPLSYFAK--WQPANAPLSVNHQGLSAASTISFNLPTGKSLSDA 817
+ +N++G + L A+ N + G AA +L D
Sbjct: 244 EEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL-DT 302

Query: 818 SAAIDRAMTQL--GVPSTVRGSFA-GTAQVFQETMNSQVILIIAAIATVYIVLGILYESY 874
+ AI + +L P ++ + T Q +++ V + AI V++V+ + ++
Sbjct: 303 AKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNM 362

Query: 875 VHPLTILSTLPSAGVGALLALELFNAPFSLIALIGIMLLIGIVKKNAIMMVDFALEAQRH 934
L +P +G L F + + + G++L IG++ +AI++V+
Sbjct: 363 RATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMME 422

Query: 935 GNLTPQEAIFQACLLRFRPIMMTTLAALFGALPLVLSGGDGSELRQPLGITIVGGLVMSQ 994
L P+EA ++ ++ + +P+ GG + + ITIV + +S
Sbjct: 423 DKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSV 482

Query: 995 LLTLYTTPVVYLFFDRLRLRFSRKPKQA 1022
L+ L TP + + + K
Sbjct: 483 LVALILTPALCATLLKPVSAEHHENKGG 510


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_2117  TCRTETB  123    7e-33    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 123 bits (310), Expect = 7e-33
Identities = 97/435 (22%), Positives = 190/435 (43%), Gaps = 25/435 (5%)

Query: 20 FMQSLDTTIVNTALPSMAQSLGESPLHMHMVIVSYVLTVAVMLPASGWLADKVGVRNIFF 79
F L+ ++N +LP +A + P + V +++LT ++ G L+D++G++ +
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 80 TAIVLFTLGSLFCALSGTLNELL-LARALQGVGGAMMVPVGRLTVMKIVPREQYMAAMTF 138
I++ GS+ + + LL +AR +QG G A + + V + +P+E A
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143

Query: 139 VTLPGQVGPLLGPALGGLLVEYASWHWIFLINIPVGIIGAIATLM-LMPNYTMQTRRFDL 197
+ +G +GPA+GG++ Y HW +L+ IP+ I + LM L+ FD+
Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHY--IHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDI 201

Query: 198 SGFLLLAVGMAVLTLALDGSKGTGLSPLAIAGLVAVGVVALVLYLLHAQNNNRALFSLKL 257
G +L++VG+ L + + V V++ ++++ H + L
Sbjct: 202 KGIILMSVGIVFFMLF---------TTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGL 252

Query: 258 FRTRTFSLGLAGSFAGRIGSGMLPFMTPVFLQIGLGFSPFHAG-LMMIPMVLGSMGMKRI 316
+ F +G+ M P ++ S G +++ P + + I
Sbjct: 253 GKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYI 312

Query: 317 VVQVVNRFGYRRVLVATTLGLSLVTLLFMTTALL----GWYYVLPFVLFLQGMVNSTRFS 372
+V+R G VL +G++ +++ F+T + L W+ + V L G+ S +
Sbjct: 313 GGILVDRRGPLYVL---NIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGL--SFTKT 367

Query: 373 SMNTLTLKDLPDNLASSGNSLLSMIMQLSMSIGVTIAGLLLGLFGSQHVSVDSGTTQTVF 432
++T+ L A +G SLL+ LS G+ I G LL + + Q+ +
Sbjct: 368 VISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLLDQRLLPMEVDQSTY 427

Query: 433 MYT--WLSMASIIAL 445
+Y+ L + II +
Sbjct: 428 LYSNLLLLFSGIIVI 442


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_2118  BCTERIALGSPF  34     0.001    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.
Length = 408

Score = 34.0 bits (78), Expect = 0.001
Identities = 28/95 (29%), Positives = 36/95 (37%), Gaps = 20/95 (21%)

Query: 164 RQTSWLIVALSTLLAALATF------PLARGLLAPVKRLVDGTHKLAAGDFTTRVAPTSE 217
RQ + L+ A L AL P L+A V+ V H LA + P S
Sbjct: 75 RQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSLAD---AMKCFPGSF 131

Query: 218 DEL-----------GRLAEDFNQLASTLEKNQQMR 241
+ L G L N+LA E+ QQMR
Sbjct: 132 ERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMR 166


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
ECP_2119  HTHFIS  76     6e-18    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 75.6 bits (186), Expect = 6e-18
Identities = 28/136 (20%), Positives = 65/136 (47%), Gaps = 1/136 (0%)

Query: 11 PRILIVEDEPKLGQLLIDYLRAASYAPTLISHGDQVLPYVRQTPPDLILLDLMLPGTDGL 70
IL+ +D+ + +L L A Y + S+ + ++ DL++ D+++P +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 71 TLCREIR-RFSDIPIVMVTAKIEEIDRLLGLEIGADDYICKPYSPREVVARVKTILRRCK 129
L I+ D+P+++++A+ + + E GA DY+ KP+ E++ + L K
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 130 PQRELQQQDAESPLII 145
+ + D++ + +
Sbjct: 124 RRPSKLEDDSQDGMPL 139


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_2129  DHBDHDRGNASE  32     0.002    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.
Length = 261

Score = 31.9 bits (72), Expect = 0.002
Identities = 21/92 (22%), Positives = 35/92 (38%), Gaps = 2/92 (2%)

Query: 125 AQGCENKNVIIIGAGT-IGLLAIQCAVALGAKSVTAIDISSEKLALAKSFGAMQTFNSRE 183
A+G E K I GA IG + + GA + A+D + EKL S + ++
Sbjct: 3 AKGIEGKIAFITGAAQGIGEAVARTLASQGAH-IAAVDYNPEKLEKVVSSLKAEARHAEA 61

Query: 184 MSAPQIQGVLRELRFNQLILETAGVPQTVELA 215
A + ++ E + V +A
Sbjct: 62 FPADVRDSAAIDEITARIEREMGPIDILVNVA 93


26  ECP_2141  ECP_2147  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
ECP_2141  2       22       -2.378743   phosphomethylpyrimidine kinase
ECP_2142  2       24       -4.860881   hydroxyethylthiazole kinase
ECP_2143  3       25       -6.989735   hypothetical protein
ECP_2144  3       27       -7.434358   nickel/cobalt efflux protein RcnA
ECP_2145  3       31       -8.678629   hypothetical protein
ECP_2146  2       26       -7.132508   hypothetical protein
ECP_2147  -2      12       -3.984157   outer membrane usher protein YehB
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_2145  TYPE3OMGPROT  28     0.007    Type III secretion system outer membrane G protein ...
		>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature.
Length = 607

Score = 27.9 bits (62), Expect = 0.007
Identities = 13/42 (30%), Positives = 21/42 (50%), Gaps = 1/42 (2%)

Query: 6 KMLLGALLLVTSAAWAAPATAGSTNTSGISKYE-LSSFIADF 46
++L G LLL++S +WA ++K E L + DF
Sbjct: 11 RVLTGTLLLLSSYSWAQELDWLPIPYVYVAKGESLRDLLTDF 52


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_2146  BINARYTOXINB  28     0.045    Binary toxin B family signature.
		>BINARYTOXINB#Binary toxin B family signature.

Length = 764

Score = 28.5 bits (63), Expect = 0.045
Identities = 18/79 (22%), Positives = 34/79 (43%), Gaps = 8/79 (10%)

Query: 93 NITLSNNQ---TSFTSGYSVTVTPAASNAKVNVSAGGGGSVMINGVATLSSA-----SSS 144
NI LS N+ T T + T++ S ++ + S G + + + + S+S
Sbjct: 297 NIILSKNEDQSTQNTDSQTRTISKNTSTSRTHTSEVHGNAEVHASFFDIGGSVSAGFSNS 356

Query: 145 TRGSAAVQFLLCLLGGKSW 163
+ A+ L L G ++W
Sbjct: 357 NSSTVAIDHSLSLAGERTW 375


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_2147  PF00577  725    0.0      Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 725 bits (1873), Expect = 0.0
Identities = 243/843 (28%), Positives = 391/843 (46%), Gaps = 35/843 (4%)

Query: 2 LRMTPLASAI---VALLIGIEAYAAEETFDTHFMIGGMKDQQVSNIRL--EDNQPLPGQY 56
R+ + A +AE F+ F+ Q V+++ + PG Y
Sbjct: 21 HRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADD--PQAVADLSRFENGQELPPGTY 78

Query: 57 DIDIYVNKQWRGKYEIIVKDNPQET----CLSREMIKRLGINTD-----NFASGKQCLTF 107
+DIY+N + ++ E CL+R + +G+NT N + C+
Sbjct: 79 RVDIYLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVPL 138

Query: 108 KQLIQGGSYTWDIGVFRLDFSVPQAWVEELESGYVPPENWERGINAFYTSYYVSQYYSDY 167
+I + D+G RL+ ++PQA++ GY+PPE W+ GINA +Y S
Sbjct: 139 TSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQN 198

Query: 168 KASGNSKSTYVRFNSGLNLLGWQLHSDASFSKTNNNPGG-----WKSNTLYLERGFAQLL 222
+ GNS Y+ SGLN+ W+L + ++S +++ W+ +LER L
Sbjct: 199 RIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLR 258

Query: 223 GTLRVGDMYTSSDIFDSVRFSGVRLFRDMQMLPNSKQNFTPRVQGIAQSNALVTIEQNGF 282
L +GD YT DIFD + F G +L D MLP+S++ F P + GIA+ A VTI+QNG+
Sbjct: 259 SRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGY 318

Query: 283 VVYQKEVPPGPFAITDLQLAGGGADLDVSVKEADGSVTTYLVPYAAVPNMLQPGVSKYDF 342
+Y VPPGPF I D+ AG DL V++KEADGS + VPY++VP + + G ++Y
Sbjct: 319 DIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSI 378

Query: 343 AAGRSHIEGASKQSD-FVQAGHQYGFNNLLTLYGGSMVANNYYAFTLGTGWNT-RIGAIS 400
AG A ++ F Q+ +G T+YGG+ +A+ Y AF G G N +GA+S
Sbjct: 379 TAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGALS 438

Query: 401 VDATKSHSKQDNGDVFDGQSYQIAYNKFVSQTSTRFGLAAWRYSSRDYRTFNDHVWANNK 460
VD T+++S + DGQS + YNK ++++ T L +RYS+ Y F D ++
Sbjct: 439 VDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMN 498

Query: 461 DNYRRDENDIYDI----ADYYQNDFGRKNSFSANMSQSLPEGWGSVSLSTLWRDYWGRSG 516
++ + + DYY + ++ ++Q L ++ LS + YWG S
Sbjct: 499 GYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGR-TSTLYLSGSHQTYWGTSN 557

Query: 517 SSKDYQLSYSNNLRRISYTLAASHAYDENHHE-EKRFNIFISIPFD--WGDDVTTPRRQI 573
+ +Q + I++TL+ S + ++ + ++IPF D + R
Sbjct: 558 VDEQFQAGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSDSKSQWRHA 617

Query: 574 YMSKSTTFDDQGVASNNTGLSGTVGSRDQFNYGVNLSYQYQGN---ETTAGANLTWNAPV 630
S S + D G +N G+ GT+ + +Y V Y G+ +T A L +
Sbjct: 618 SASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGY 677

Query: 631 ATVNGSYSQSSAYRQAGASVSGGIVAWSGGVNLANRLSETFAVMNAPGIKDAYVNGQKYR 690
N YS S +Q VSGG++A + GV L L++T ++ APG KDA V Q
Sbjct: 678 GNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDAKVENQTGV 737

Query: 691 TTNRNGVVVYDGMTPYRENYLMLDVSQSDSEAELRGNRKIAAPYRGAVVLVNFDTDQRKP 750
T+ G V T YREN + LD + +L P RGA+V F +
Sbjct: 738 RTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKA-RVGI 796

Query: 751 WFIKALRADGQPLTFGYEVNDIHGHNIGVVGQGSQLFIRTNEVPPSVNVAIDKQQGLSCT 810
+ L + +PL FG V + G+V Q+++ + V V +++ C
Sbjct: 797 KLLMTLTHNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGKVQVKWGEEENAHCV 856

Query: 811 ITF 813
+
Sbjct: 857 ANY 859


27  ECP_2231  ECP_2246  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
ECP_2231  0       17       3.530770    *hypothetical protein
ECP_2232  -1      18       3.375408    hypothetical protein
ECP_2233  0       20       3.850840    transcriptional regulator NarP
ECP_2234  0       19       4.384814    cytochrome c-type biogenesis protein CcmH
ECP_2235  0       18       4.544508    thiol:disulfide interchange protein DsbE
ECP_2236  0       18       4.104859    cytochrome c-type biogenesis protein CcmF
ECP_2237  -1      15       3.277876    cytochrome c-type biogenesis protein CcmE
ECP_2238  -1      15       3.529850    cytochrome c-type biogenesis protein CcmD
ECP_2241  -1      18       4.209661    cytochrome c-type biogenesis protein CcmB
ECP_2242  -1      20       4.271639    cytochrome c biogenesis protein CcmA
ECP_2243  0       23       4.262171    cytochrome c-type protein NapC
ECP_2244  -1      21       4.628260    citrate reductase cytochrome c-type subunit
ECP_2245  -1      21       4.145746    quinol dehydrogenase membrane component
ECP_2246  0       21       3.683076    quinol dehydrogenase periplasmic component
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
ECP_2232  PERTACTIN  27     0.025    Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 27.0 bits (59), Expect = 0.025
Identities = 15/43 (34%), Positives = 26/43 (60%), Gaps = 2/43 (4%)

Query: 40 VFAVIEKGGLLEV--KATGDFKIFVTDTGASPAAGDNLTLVTT 80
VFA + L V A+G +++V ++G+ PA+G+ + LV T
Sbjct: 484 VFADLGLSDKLVVMRDASGQHRLWVRNSGSEPASGNTMLLVQT 526


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
ECP_2233  HTHFIS  64     2e-14    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 64.5 bits (157), Expect = 2e-14
Identities = 22/113 (19%), Positives = 48/113 (42%), Gaps = 2/113 (1%)

Query: 9 VMIVDDHPLMRRGVRQLLELDSGFEVVAEAGDGASAIDLANRLDIDVILLDLNMKGMSGL 68
+++ DD +R + Q L +G++V + A+ D D+++ D+ M +
Sbjct: 6 ILVADDDAAIRTVLNQALS-RAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 69 DTLNALRRDGVTAQIIILTVSDASSDVFALIDAGADGYLLKDSDPEVLLEAIR 121
D L +++ +++++ + + GA YL K D L+ I
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
ECP_2235  60KDINNERMP  28     0.033    60kDa inner membrane protein signature.
		>60KDINNERMP#60kDa inner membrane protein signature.

Length = 548

Score = 28.0 bits (62), Expect = 0.033
Identities = 7/47 (14%), Positives = 19/47 (40%), Gaps = 2/47 (4%)

Query: 3 RKVLLIPLIIFLAIAAALLWQLARN--AEGDDPTNLESALIGKPVPK 47
++ LL+ ++F++ W+ +N + T + G +
Sbjct: 4 QRNLLVIALLFVSFMIWQAWEQDKNPQPQAQQTTQTTTTAAGSAADQ 50


28  ECP_2300  ECP_2305  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
ECP_2300  2       15       2.372463    4-amino-4-deoxy-L-arabinose transferase
ECP_2301  1       16       3.798477    hypothetical protein
ECP_2302  0       14       3.907739    hypothetical protein
ECP_2303  0       14       4.709193    polymyxin B resistance protein pmrD
ECP_2304  0       13       4.503771    O-succinylbenzoic acid--CoA ligase
ECP_2305  0       12       4.060525    O-succinylbenzoate synthase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_2301  BCTERIALGSPC  28     0.007    Bacterial general secretion pathway protein C signa...
		>BCTERIALGSPC#Bacterial general secretion pathway protein C signature.
Length = 272

Score = 28.0 bits (62), Expect = 0.007
Identities = 12/31 (38%), Positives = 18/31 (58%), Gaps = 1/31 (3%)

Query: 34 KHIVLWLGLALACLGLAMVLWLLVL-QNVPV 63
+ I+ +L + L C LAM+ W + L N PV
Sbjct: 15 RRILFYLLMLLFCQQLAMIFWRIGLPDNAPV 45


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
ECP_2304  ALARACEMASE  30     0.023    Alanine racemase signature.
		>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 29.7 bits (67), Expect = 0.023
Identities = 32/192 (16%), Positives = 58/192 (30%), Gaps = 37/192 (19%)

Query: 268 GYGLTEFASTVCAKEADGLADVGSPL----PGREVKIVNDEVWLRAASMAEGYWRNGQRV 323
G+G+ S + A + L ++ + G + I+ E + A + + R+
Sbjct: 40 GHGIERIWSAIGATDGFALLNLEEAITLRERGWKGPILMLEGFFHAQDLEIY---DQHRL 96

Query: 324 PLVNDEGWYATRDRGEMHNGKLTI-------VGRLDNLFFSGGEGIQPEEVERVIAAHPA 376
W + L I + RL G QP+ V V A
Sbjct: 97 TTCVHSNWQLKALQNARLKAPLDIYLKVNSGMNRL---------GFQPDRVLTVWQQLRA 147

Query: 377 VLQVFIVPVADKEFGHRPVAVVEYDQQTVDLDEWVKDKLARFQQPVRWLTLPPELKNGGI 436
+ V + + H A + + + +AR +Q L L N
Sbjct: 148 MANVGEMTL----MSHFAEA---------EHPDGISGAMARIEQAAEGLECRRSLSNSAA 194

Query: 437 KISRQALK-EWV 447
+ +WV
Sbjct: 195 TLWHPEAHFDWV 206


29  ECP_2314  ECP_2322  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias     Product
ECP_2314  1       27       3.537812    hypothetical protein
ECP_2315  1       30       4.204295    NADH dehydrogenase subunit N
ECP_2316  0       31       3.742857    NADH dehydrogenase subunit M
ECP_2317  0       31       4.206227    NADH dehydrogenase subunit L
ECP_2318  -1      30       4.017786    NADH dehydrogenase subunit K
ECP_2319  -1      30       4.033444    NADH dehydrogenase subunit J
ECP_2320  0       29       4.177981    NADH dehydrogenase subunit I
ECP_2321  0       28       4.017635    NADH dehydrogenase subunit H
ECP_2322  0       27       4.011684    NADH dehydrogenase subunit G
30  ECP_2388  ECP_2399  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
ECP_2388  0       24       -4.002612   sucrose-6-phosphate hydrolase
ECP_2389  -1      28       -5.707724   sucrose operon repressor
ECP_2390  -1      31       -8.587987   hypothetical protein
ECP_2391  -1      32       -8.457944   D-serine dehydratase
ECP_2392  0       35       -9.456287   multidrug resistance protein Y
ECP_2393  -1      34       -8.456009   multidrug resistance protein K
ECP_2394  0       32       -7.800896   DNA-binding transcriptional activator EvgA
ECP_2395  0       32       -7.446321   hybrid sensory histidine kinase in two-component
ECP_2396  2       30       -5.566860   hypothetical protein
ECP_2397  1       31       -5.842718   transporter YfdV
ECP_2398  1       29       -5.631872   oxalyl-CoA decarboxylase
ECP_2399  -2      20       -4.693852   formyl-coenzyme A transferase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_2392  TCRTETB  121    4e-32    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 121 bits (306), Expect = 4e-32
Identities = 92/404 (22%), Positives = 167/404 (41%), Gaps = 17/404 (4%)

Query: 19 VTIALSLATFMQMLDSTISNVAIPTISGFLGASTDEGTWVITSFGVANAIAIPVTGRLAQ 78
+ I L + +F +L+ + NV++P I+ WV T+F + +I V G+L+
Sbjct: 15 ILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSD 74

Query: 79 RIGELRLFLLSVTFFSLSSLMCSLS-TNLDVLIFFRVVQGLMAGPLIPLSQSLLLRNYPP 137
++G RL L + S++ + + +LI R +QG A L ++ R P
Sbjct: 75 QLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPK 134

Query: 138 EKRTFALALWSMTVIIAPICGPILGGYICDNFSWGWIFLINVPMGIIVLTLCLTLLKGRE 197
E R A L V + GP +GG I W +L+ +PM I+ L L +E
Sbjct: 135 ENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWS--YLLLIPMITIITVPFLMKLLKKE 192

Query: 198 TETSPVKMNLPGLTLLVLGVGGLQIMLDKGRDLDWFNSSTIIILTVVSVISLISLVIWES 257
++ G+ L+ +G+ + ML F +S I +VSV+S + V
Sbjct: 193 VRIKG-HFDIKGIILMSVGI--VFFML--------FTTSYSISFLIVSVLSFLIFVKHIR 241

Query: 258 TSENPILDLSLFKSRNFTIGIVSITCAYLFYSGAIVLMPQLLQETMGYNAIWAGLAYAPI 317
+P +D L K+ F IG++ + +G + ++P ++++ + G
Sbjct: 242 KVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFP 301

Query: 318 GIMPLLIS-PLIGRYGNKIDMRLLVTFSFLMYAVCYYWRSVTFMPTIDFTGIILPQFFQG 376
G M ++I + G ++ ++ +V + S T F II+ G
Sbjct: 302 GTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGG 361

Query: 377 FAVACFFLPLTTISFSGLPDNKFANASSMSNFFRTLSGSVGTSL 420
+ ++TI S L + S+ NF LS G ++
Sbjct: 362 LSFTK--TVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAI 403


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
ECP_2393  RTXTOXIND  74     1e-16    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.
Length = 478

Score = 74.1 bits (182), Expect = 1e-16
Identities = 47/277 (16%), Positives = 94/277 (33%), Gaps = 46/277 (16%)

Query: 56 AKNNLANIVRQTNKLYLQDKQYSAEVASARIQ---YQQSLEDYNRRV----PLAKQGVIS 108
K + Q + L + AE + + Y+ R+ L + I+
Sbjct: 191 IKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIA 250

Query: 109 KEALEHTKDTLI----------SSKAALNAAIQAYKANKALVMNTPLNRQPQVIEAADAT 158
K A+ ++ + S + + I + K + T L + + + T
Sbjct: 251 KHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEE--YQLVTQLFKNEILDKLRQTT 308

Query: 159 KE----------AWLALKRTDIKSPVTGYIAQRSVQ-VGETVSPGQSLMAVVPARQ-MWV 206
+ + I++PV+ + Q V G V+ ++LM +VP + V
Sbjct: 309 DNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEV 368

Query: 207 NANFKETQLTDVRIGQSVNIISDLYGENVVFHGRVTGINMGTGNAFSLLPAQNATGNWIK 266
A + + + +GQ+ I + F G +G + +
Sbjct: 369 TALVQNKDIGFINVGQNAIIKVE------AFPYTRYGYLVGK---VKNINLDAIEDQRLG 419

Query: 267 IVQRVPVEVSLDPKELMEH----PLRIGLSMTATIDT 299
+V V +S++ L PL G+++TA I T
Sbjct: 420 LVFNVI--ISIEENCLSTGNKNIPLSSGMAVTAEIKT 454


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
ECP_2394  HTHFIS  49     3e-09    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 49.1 bits (117), Expect = 3e-09
Identities = 22/148 (14%), Positives = 53/148 (35%), Gaps = 31/148 (20%)

Query: 4 IIIDDHPLAIAAIRNLLIKNDIEILAELTEGGSAVQRVETLKPDIVIIDVDIPGVNGIQV 63
++ DD + L + ++ + + + + D+V+ DV +P N +
Sbjct: 7 LVADDDAAIRTVLNQALSRAGYDVRIT-SNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 64 LETLRKRQYSGIIIIVSAKNDHFYGKHCADAGANGFVSKKEGMNNIIAAIEAAKNGYCYF 123
L ++K + ++++SA+N + AI+A++ G +
Sbjct: 66 LPRIKKARPDLPVLVMSAQNT------------------------FMTAIKASEKGAYDY 101

Query: 124 ---PFSLNRFVGSLTSDQQKLDSLSKQE 148
PF L + + L ++
Sbjct: 102 LPKPFDLTE---LIGIIGRALAEPKRRP 126


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
ECP_2395  HTHFIS  79     4e-17    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 79.1 bits (195), Expect = 4e-17
Identities = 30/105 (28%), Positives = 51/105 (48%)

Query: 960 SILIADDHPTNRLLLKRQLNLLGYDVDEATDGVQALHKVSMQHYDLLITDVNMPNMDGFE 1019
+IL+ADD R +L + L+ GYDV ++ ++ DL++TDV MP+ + F+
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 1020 LARKLREQNSSLPIWGLTANAQANEREKGLNCGMNLCLFKPLTLD 1064
L ++++ LP+ ++A K G L KP L
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLT 109


31  ECP_2441  ECP_2447  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
ECP_2441  -2      21       3.058499    glucose-specific PTS system component
ECP_2442  -1      25       4.114090    pyridoxal kinase
ECP_2443  -1      27       4.277835    hypothetical protein
ECP_2444  -2      25       4.514734    cysteine synthase B
ECP_2445  -1      24       4.123091    sulfate/thiosulfate transporter subunit
ECP_2446  -1      22       3.901495    sulfate/thiosulfate transporter permease
ECP_2447  -1      20       3.674718    sulfate/thiosulfate transporter subunit
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_2445  PF05272  34     7e-04    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 34.3 bits (78), Expect = 7e-04
Identities = 11/33 (33%), Positives = 16/33 (48%)

Query: 30 MVALLGPSGSGKTTLLRIIAGLEHQTSGHIRFH 62
V L G G GK+TL+ + GL+ + H
Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLDFFSDTHFDIG 630


32  ECP_2457  ECP_2470  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
ECP_2457  -1      16       3.665601    coproporphyrinogen III oxidase
ECP_2458  -1      18       4.720121    transcriptional regulator EutR
ECP_2459  -2      22       5.202531    ethanolamine utilization protein EutK
ECP_2460  -1      22       5.475026    ethanolamine utilization protein EutL
ECP_2461  0       22       5.795384    ethanolamine ammonia-lyase small subunit
ECP_2462  1       22       5.812463    ethanolamine ammonia-lyase heavy chain
ECP_2463  2       21       6.107351    reactivating factor for ethanolamine ammonia
ECP_2464  2       20       5.680306    ethanolamine utilization protein EutH
ECP_2465  4       19       6.179799    ethanolamine utilization protein EutG
ECP_2466  3       18       5.997975    ethanolamine utilization protein EutJ
ECP_2467  3       19       5.280418    ethanolamine utilization protein EutE
ECP_2468  1       18       4.270908    ethanolamine utilization protein EutN
ECP_2469  2       19       4.001757    ethanolamine utilization protein EutM
ECP_2470  2       18       3.385910    phosphotransacetylase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_2466  SHAPEPROTEIN  51  1e-09  Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 50.9 bits (122), Expect = 1e-09
Identities = 34/116 (29%), Positives = 51/116 (43%), Gaps = 9/116 (7%)

Query: 63 VRDGIVWDFFGAVTIVRRHLD-TLEQQFGRRFSHVATSFPPGTDP---RISINVLESAGL 118
++DG++ DFF +++ + F R V P G R + AG
Sbjct: 76 MKDGVIADFFVTEKMLQHFIKQVHSNSFMRPSPRVLVCVPVGATQVERRAIRESAQGAGA 135

Query: 119 EVSHVLDEPTAVA---DLLQLDNAG--VVDIGGGTTGIAIVKKGKVTYSADEATGG 169
+++EP A A L + G VVDIGGGTT +A++ V YS+ GG
Sbjct: 136 REVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLNGVVYSSSVRIGG 191


33  ECP_2527  ECP_2533  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
ECP_2527  0       17       3.026529    enhanced serine sensitivity protein SseB
ECP_2528  0       20       4.112335    aminopeptidase
ECP_2529  2       23       3.227024    hypothetical protein
ECP_2530  2       27       2.879100    (2Fe-2S) ferredoxin
ECP_2531  2       26       2.937105    chaperone protein HscA
ECP_2532  2       24       1.206214    co-chaperone HscB
ECP_2533  2       28       1.588963    iron-sulfur cluster assembly protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_2527  STREPKINASE  29  0.015  Streptococcus streptokinase protein signature.
		>STREPKINASE#Streptococcus streptokinase protein signature.

Length = 440

Score = 29.3 bits (65), Expect = 0.015
Identities = 27/120 (22%), Positives = 52/120 (43%), Gaps = 21/120 (17%)

Query: 130 GNPLSSQEVLEGGESLILSE-----VAEPPAQMIDSLTTLFKTIKPVKRAFICSIKENEE 184
G+ ++SQE+L +S++ + E + ++ +F+TI P+ + F +K E+
Sbjct: 217 GDTITSQELLAQAQSILNKNHPGYTIYERDSSIVTHDNDIFRTILPMDQEFTYRVKNREQ 276

Query: 185 A-QPNLLIGIEADGDIEEIIQAAGSVATDTLPGDEPIDICQVKKGEKGISHFITEHIAPF 243
A + N G+ + + ++I V +KKGEK F H+ F
Sbjct: 277 AYRINKKSGLNEEINNTDLISEKYYV---------------LKKGEKPYDPFDRSHLKLF 321


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_2531  SHAPEPROTEIN  114  5e-30  Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 114 bits (288), Expect = 5e-30
Identities = 81/371 (21%), Positives = 144/371 (38%), Gaps = 74/371 (19%)

Query: 23 GIDLGTTNSLVATVRSGQAETLADHEGRHLLPSVVHYQQQGHS-------VGYDARTNAA 75
IDLGT N+L+ G + +E PSVV +Q VG+DA+
Sbjct: 14 SIDLGTANTLIYVKGQG----IVLNE-----PSVVAIRQDRAGSPKSVAAVGHDAK-QML 63

Query: 76 LDTANTISSVKRLMGRSLADIQQRYPHLPYQFQASENGLPMIETAAGLLNPVRVSADILK 135
T I++++ + +AD V+ +L+
Sbjct: 64 GRTPGNIAAIRPMKDGVIADF-------------------------------FVTEKMLQ 92

Query: 136 ALAARATEALAGE-LDGVVITVPAYFDDAQRQGTKDAARLAGLHVLRLLNEPTAAAIAYG 194
+ V++ VP +R+ +++A+ AG + L+ EP AAAI G
Sbjct: 93 HFIKQVHSNSFMRPSPRVLVCVPVGATQVERRAIRESAQGAGAREVFLIEEPMAAAIGAG 152

Query: 195 LDSGQEGVIAVYDLGGGTFDISILRLSRGVFEVLATGGDSALGGDDFDHLLADYIREQAG 254
L + V D+GGGT +++++ L+ V +GGD FD + +Y+R G
Sbjct: 153 LPVSEATGSMVVDIGGGTTEVAVISLNGVV-----YSSSVRIGGDRFDEAIINYVRRNYG 207

Query: 255 --IPDRSDNRVQRELLDAAIAAKIALSDADSVTVNVAG---WQG-----EISREQFNELI 304
I + + R++ E+ A + + V G +G ++ + E +
Sbjct: 208 SLIGEATAERIKHEI-------GSAYPGDEVREIEVRGRNLAEGVPRGFTLNSNEILEAL 260

Query: 305 APLVKRTLLACRRALKDAGVE-ADEVLE--VVMVGGSTRVPLVRERVGEFFGRPPLTSID 361
+ + A AL+ E A ++ E +V+ GG + + + E G P + + D
Sbjct: 261 QEPLTGIVSAVMVALEQCPPELASDISERGMVLTGGGALLRNLDRLLMEETGIPVVVAED 320

Query: 362 PDKVVAIGAAI 372
P VA G
Sbjct: 321 PLTCVARGGGK 331


34  ECP_2612  ECP_2629  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
ECP_2612  2       13       -0.491594   cytochrome C-type biogenesis protein
ECP_2613  2       14       -0.632582   hypothetical protein
ECP_2614  2       16       -1.057734   heat shock protein GrpE
ECP_2615  2       15       -0.984926   inorganic polyphosphate/ATP-NAD kinase
ECP_2616  2       19       -1.022624   recombination and repair protein
ECP_2617  1       16       -0.282530   hypothetical protein
ECP_2618  -1      18       1.756469    hypothetical protein
ECP_2619  0       20       2.977632    hypothetical protein
ECP_2620  2       25       4.658695    SsrA-binding protein
ECP_2621  2       21       3.931767    hypothetical protein
ECP_2622  2       21       3.901166    hypothetical protein
ECP_2623  2       20       3.434537    hydroxyglutarate oxidase
ECP_2624  2       19       2.915625    succinate-semialdehyde dehydrogenase I
ECP_2625  1       17       1.985488    4-aminobutyrate aminotransferase
ECP_2626  2       16       -1.116869   gamma-aminobutyrate transporter
ECP_2627  0       16       -1.876810   DNA-binding transcriptional regulator CsiR
ECP_2628  -1      18       -2.980619   LysM domain/BON superfamily protein
ECP_2629  1       23       -3.257025   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_2617  BLACTAMASEA  26  0.032  Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 26.3 bits (58), Expect = 0.032
Identities = 23/87 (26%), Positives = 36/87 (41%), Gaps = 11/87 (12%)

Query: 4 KTLTAAAAVLLMLTAGCSTLERVVYRPDINQGNYLTANDVSKIRV--GMTQQQVAYALGT 61
K + AVL + AG LER ++ Q + + + VS+ + GMT ++ A
Sbjct: 69 KVV-LCGAVLARVDAGDEQLERKIH---YRQQDLVDYSPVSEKHLADGMTVGELCAA--A 122

Query: 62 PLMSDPFGTNTWFYVFRQQPGHEGVTQ 88
MSD N + G G+T
Sbjct: 123 ITMSDNSAANL---LLATVGGPAGLTA 146


35  ECP_2664  ECP_2705  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
ECP_2664  0       14       3.045520    glucitol/sorbitol-specific PTS system component
ECP_2665  0       15       2.707409    sorbitol-6-phosphate dehydrogenase
ECP_2666  0       15       2.788929    DNA-binding transcriptional activator GutM
ECP_2667  0       15       3.828114    DNA-binding transcriptional repressor SrlR
ECP_2668  1       16       4.327721    D-arabinose 5-phosphate isomerase
ECP_2669  1       16       3.782703    anaerobic nitric oxide reductase transcriptional
ECP_2670  0       17       3.003324    anaerobic nitric oxide reductase
ECP_2671  1       16       3.494344    nitric oxide reductase
ECP_2672  0       15       3.069522    hydrogenase maturation protein HypF
ECP_2673  -1      14       0.766963    electron transport protein HydN
ECP_2674  -1      14       0.599816    hypothetical protein
ECP_2675  0       16       1.327938    cryptic asc operon repressor
ECP_2676  -1      16       1.388944    cellobiose/arbutin/salicin-specific PTS system
ECP_2677  -1      16       1.144655    cryptic 6-phospho-beta-glucosidase
ECP_2678  1       20       1.916919    HTH-type transcriptional regulator YgjM
ECP_2679  0       27       4.071210    hypothetical protein
ECP_2680  -1      29       4.729018    hydrogenase 3 maturation protease
ECP_2681  -1      27       5.397021    formate hydrogenlyase maturation protein HYch
ECP_2682  -1      26       5.516110    formate hydrogenlyase-3 component G
ECP_2683  0       26       4.948845    formate hydrogenlyase complex iron-sulfur
ECP_2684  0       24       4.819973    formate hydrogenase-3 component E
ECP_2685  2       22       4.652441    formate hydrogenlyase subunit 4
ECP_2686  2       21       4.167304    formate hydrogenlyase subunit 3
ECP_2687  2       19       2.720649    formate hydrogenase-3 component B
ECP_2688  2       19       3.272736    formate hydrogenlyase regulatory protein HycA
ECP_2689  0       16       2.572935    hydrogenase nickel incorporation protein
ECP_2690  0       14       2.414127    hydrogenase nickel incorporation protein HypB
ECP_2691  -1      11       1.858068    hydrogenase assembly chaperone
ECP_2692  -1      9        1.594532    hydrogenase isoenzymes formation protein HYpd
ECP_2693  -1      10       -0.919488   hydrogenase isoenzymes formation protein HypE
ECP_2694  1       16       -4.727481   formate hydrogenlyase transcriptional activator
ECP_2695  5       35       -10.387424  molybdenum-pterin-binding protein
ECP_2696  6       33       -10.787031  hypothetical protein
ECP_2697  9       40       -12.616158  hypothetical protein
ECP_2698  7       38       -10.976557  hypothetical protein
ECP_2699  5       33       -9.224352   hypothetical protein
ECP_2700  4       29       -8.381296   hypothetical protein
ECP_2701  4       26       -5.963675   transposase
ECP_2702  4       28       -4.490819   hypothetical protein
ECP_2703  4       27       -1.033780   transposase/IS protein
ECP_2704  3       36       -3.818420   transposase
ECP_2705  4       37       -4.274892   transposase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_2665  DHBDHDRGNASE  82  9e-21  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 82.4 bits (203), Expect = 9e-21
Identities = 67/257 (26%), Positives = 120/257 (46%), Gaps = 7/257 (2%)

Query: 3 QVAVVIGGGQTLGAFLCHGLAAEGYRVAVVDIQSDKAANVAQEINAEYGEGTAYGFGADA 62
++A + G Q +G + LA++G +A VD +K V + AE A F AD
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAE--ARHAEAFPADV 66

Query: 63 TSEQSVLALSRGVDEIFGRVDLLVYSAGIAKAAFISDFQLGDFDRSLQVNLVGYFLCARE 122
++ ++ ++ G +D+LV AG+ + I +++ + VN G F +R
Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRS 126

Query: 123 FSRLMIRDGIQGRIIQINSKSGKVGSKHNSGYSAAKFGGVGLTQSLALDLAEYGITVHSL 182
S+ M D G I+ + S V + Y+++K V T+ L L+LAEY I + +
Sbjct: 127 VSKYM-MDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185

Query: 183 MLGNLLKSPMFQSL-LPQYATKLGIKPEQVEQYYIDKVPLKRGCDYQDVLNMLLFYASPK 241
G+ ++ M SL + + IK +E + +PLK+ D+ + +LF S +
Sbjct: 186 SPGS-TETDMQWSLWADENGAEQVIKGS-LETFKTG-IPLKKLAKPSDIADAVLFLVSGQ 242

Query: 242 ASYCTGQSINVTGGQVM 258
A + T ++ V GG +
Sbjct: 243 AGHITMHNLCVDGGATL 259


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_2667  ARGREPRESSOR  28  0.024  Bacterial arginine repressor signature.
		>ARGREPRESSOR#Bacterial arginine repressor signature.

Length = 149

Score = 27.9 bits (62), Expect = 0.024
Identities = 10/45 (22%), Positives = 18/45 (40%), Gaps = 5/45 (11%)

Query: 1 MKPRQRQAAILEYLQKQGKCSVEEL-----AQYFDTTGTTIRKDL 40
M QR I E + + +EL ++ T T+ +D+
Sbjct: 1 MNKGQRHIKIREIITANEIETQDELVDILKKDGYNVTQATVSRDI 45


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_2669  HTHFIS  372  e-127  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 372 bits (956), Expect = e-127
Identities = 125/388 (32%), Positives = 193/388 (49%), Gaps = 33/388 (8%)

Query: 149 IAALAAGALS----------NALLIEQLESQNMLPGDAAPFEAVKQTQMIGLSPGMTQLK 198
I A GA +I + ++ ++ ++G S M ++
Sbjct: 91 IKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIY 150

Query: 199 KEIEIVAASDLNVLISGETGTGKELVAKAIHEASPRAVNPLVYLNCAALPESVAESELFG 258
+ + + +DL ++I+GE+GTGKELVA+A+H+ R P V +N AA+P + ESELFG
Sbjct: 151 RVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFG 210

Query: 259 HVKGAFTGAISNRSGKFEMADNGTLFLDEIGELSLALQAKLLRVLQYGDIQRVGDDRSLR 318
H KGAFTGA + +G+FE A+ GTLFLDEIG++ + Q +LLRVLQ G+ VG +R
Sbjct: 211 HEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIR 270

Query: 319 VDVRVLAATNRDLREEVLAGRFRADLFHRLSVFPLSVPPLRERGDDVILLAGYFCEQCRL 378
DVR++AATN+DL++ + G FR DL++RL+V PL +PPLR+R +D+ L +F +Q
Sbjct: 271 SDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAE- 329

Query: 379 RLGLSRVVLSAGARNLLQHYNFPGNVRELEHAIHRAVVLARATRSGDEVIL-----EAQH 433
+ GL A L++ + +PGNVRELE+ + R L E+I E
Sbjct: 330 KEGLDVKRFDQEALELMKAHPWPGNVRELENLVRRLTALYPQDVITREIIENELRSEIPD 389

Query: 434 FAFPEVTLPPPEAAAVPVVKQNLR-----------------EATEAFQRETIRQALAQNH 476
+ + V++N+R + I AL
Sbjct: 390 SPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATR 449

Query: 477 HNWAACARMLETDVANLHRLAKRLGLKD 504
N A +L + L + + LG+
Sbjct: 450 GNQIKAADLLGLNRNTLRKKIRELGVSV 477


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_2675  HTHTETR  28  0.036  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 28.4 bits (63), Expect = 0.036
Identities = 17/93 (18%), Positives = 29/93 (31%), Gaps = 7/93 (7%)

Query: 3 TTMLEVAKRAGVSKATVSRVLSG-----NGYVSQETKDRVFQAVEESGYRPNLLARNLSA 57
T++ E+AK AGV++ + + + +E P L
Sbjct: 32 TSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELEYQAKFPGDPLSVLRE 91

Query: 58 KSTQTLGLVVTNTLYHGIYFSELLFHAARMAEE 90
L VT + E++FH E
Sbjct: 92 ILIHVLESTVTEERRRLLM--EIIFHKCEFVGE 122


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_2691  TYPE4SSCAGA  27  0.012  Type IV secretion system CagA exotoxin signature.
		>TYPE4SSCAGA#Type IV secretion system CagA exotoxin signature.

Length = 1147

Score = 27.0 bits (59), Expect = 0.012
Identities = 19/75 (25%), Positives = 37/75 (49%), Gaps = 8/75 (10%)

Query: 12 IDGNQAKVD--VCGIQRDVDLTLVGSCDENGQPRVGQWVLVHVGFAMSVINEAEARDTLD 69
I GNQ + D G+ D L ++NG+P G W+ + + F + ++ ++ D +
Sbjct: 171 IIGNQIRTDQKFMGV-FDESLKERQEAEKNGEPTGGDWLDIFLSF---IFDKKQSSDVKE 226

Query: 70 ALQN--MFDVEPDVG 82
A+ + V+PD+
Sbjct: 227 AINQEPVPHVQPDIA 241


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_2694  HTHFIS  389  e-131  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 389 bits (1001), Expect = e-131
Identities = 140/373 (37%), Positives = 204/373 (54%), Gaps = 39/373 (10%)

Query: 350 YQEIHRLKERLVDENLALTEQLNNVDSEFGEIIGRSEAMYSVLKQVEMVAQSDSTVLILG 409
E+ + R + E +L + + ++GRS AM + + + + Q+D T++I G
Sbjct: 108 LTELIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITG 167

Query: 410 ETGTGKELIARAIHNLSGRNNRRMVKMNCAAMPAGLLESDLFGHERGAFTGASAQRIGRF 469
E+GTGKEL+ARA+H+ R N V +N AA+P L+ES+LFGHE+GAFTGA + GRF
Sbjct: 168 ESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRF 227

Query: 470 ELADKSSLFLDEVGDMPLELQPKLLRVLQEQEFERLGSNKIIQTDVRLIAATNRDLKKMV 529
E A+ +LFLDE+GDMP++ Q +LLRVLQ+ E+ +G I++DVR++AATN+DLK+ +
Sbjct: 228 EQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSI 287

Query: 530 ADREFRSDLYYRLNVFPIHLPPLRERPEDIPLLAKAFTFKIARRLGRNIDSIPAETLRIL 589
FR DLYYRLNV P+ LPPLR+R EDIP L + F + A + G ++ E L ++
Sbjct: 288 NQGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFV-QQAEKEGLDVKRFDQEALELM 346

Query: 590 SNMEWPGNVRELENVIERAVLLTRGNVLQLSL---------------------PDIALPE 628
WPGNVRELEN++ R L +V+ + +++ +
Sbjct: 347 KAHPWPGNVRELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQ 406

Query: 629 PETPPAATVVAQEG--------------EDEYQLIVRVLKETNGVVAGPKGAAQRLGLKR 674
A G E EY LI+ L T G AA LGL R
Sbjct: 407 AVEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQI---KAADLLGLNR 463

Query: 675 TTLLSRMKRLGID 687
TL +++ LG+
Sbjct: 464 NTLRKKIRELGVS 476


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_2695  ALARACEMASE  27  0.027  Alanine racemase signature.
		>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 27.0 bits (60), Expect = 0.027
Identities = 16/138 (11%), Positives = 39/138 (28%), Gaps = 28/138 (20%)

Query: 22 NDEVELTLAGGAKLVAIV--------------THSSQQALGLAKGKEAIAL----IKAPW 63
N + A A++ ++V + L +EAI L K P
Sbjct: 17 NLSIVRQAATHARVWSVVKANAYGHGIERIWSAIGATDGFALLNLEEAITLRERGWKGPI 76

Query: 64 VTL--ATEDCGLKFSARNQFAGSVSTI--------TEGAVNATVHIKTDAGFEIVAVVTN 113
+ L L+ +++ V + +++K ++G + +
Sbjct: 77 LMLEGFFHAQDLEIYDQHRLTTCVHSNWQLKALQNARLKAPLDIYLKVNSGMNRLGFQPD 136

Query: 114 ESQDEMKLTTGSRVIALI 131
+ + +
Sbjct: 137 RVLTVWQQLRAMANVGEM 154


36  ECP_2736  ECP_2756  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
ECP_2736  0       16       3.501610    phosphoadenosine phosphosulfate reductase
ECP_2737  0       15       3.175621    sulfite reductase subunit beta
ECP_2738  0       14       2.941625    sulfite reductase subunit alpha
ECP_2739  0       18       2.392367    6-pyruvoyl tetrahydrobiopterin synthase
ECP_2740  2       17       2.745101    electron transfer flavoprotein-quinone
ECP_2741  1       13       1.453012    ferredoxin-like protein YgcO
ECP_2742  1       11       0.298424    anti-terminator regulatory protein YgcP
ECP_2743  1       11       0.194863    electron transfer flavoprotein subunit YgcQ
ECP_2744  0       10       -0.780876   electron transfer flavoprotein subunit YgcR
ECP_2745  -1      11       -1.730685   metabolite transport protein YgcS
ECP_2746  -2      12       -3.029805   flavoprotein YgcU
ECP_2747  0       19       -3.903879   oxidoreductase YgcW
ECP_2748  0       20       -3.529133   hypothetical protein
ECP_2749  -1      24       -3.820199   sugar kinase
ECP_2750  0       26       -3.821446   aminoimidazole riboside kinase
ECP_2751  0       26       -4.267022   sucrose porin
ECP_2752  0       25       -4.086996   PTS system, sucrose-specific IIBC component
ECP_2753  0       24       -4.255752   sucrose-6-phosphate hydrolase
ECP_2754  -1      25       -5.199492   sucrose operon repressor
ECP_2755  0       21       -4.323638   hypothetical protein
ECP_2756  -1      21       -3.487427   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_2737  PF07675  30  0.021  Cleaved Adhesin
		>PF07675#Cleaved Adhesin

Length = 1358

Score = 30.4 bits (68), Expect = 0.021
Identities = 20/92 (21%), Positives = 39/92 (42%), Gaps = 12/92 (13%)

Query: 206 ILGQTYLPRKFKTTVVIP---PQND--IDLHANDMNFVAIAENGKLVGFNLLVGGGLSIE 260
++ +P+ T +P PQN + A+ ++VAI+++G L G + G++
Sbjct: 240 VMPYRAMPKT--NTYTLPASLPQNQASYSIQASAGSYVAISKDGVLYGTGVANASGVATV 297

Query: 261 HGNK-----KTYARTASEFGYLPLEHTLAVAE 287
+ K Y + YLP+ + E
Sbjct: 298 NMTKQITENGNYDVVITRSNYLPVIKQIQAGE 329


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_2745  TCRTETB  36  2e-04  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 36.4 bits (84), Expect = 2e-04
Identities = 45/314 (14%), Positives = 112/314 (35%), Gaps = 36/314 (11%)

Query: 69 LGSLVLGWISDHIGRQKIFTFSFMLITLASFLQFFATTP-EHLIGLRILIGIGLGGDYSV 127
+G+ V G +SD +G +++ F ++ S + F + LI R + G G ++
Sbjct: 64 IGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPAL 123

Query: 128 GHTLLAEFSPRRHRGILLGAFSVVWT----VGYVLASIAGHHFISESPEAWRWLLASAAL 183
++A + P+ +RG G + VG + + H+ W +LL +
Sbjct: 124 VMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYI------HWSYLLLIPMI 177

Query: 184 PALLITLLRWGTPESPRWLLRQGRFAEAHAIVHRYFGPHVLLGDEVATATHKHIKTLF-- 241
+ + L + R +G F I+ +L + + + L
Sbjct: 178 TIITVPFLMKLLKKEVR---IKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFL 234

Query: 242 -SSRYWRRTA--------FNSVFFVCLVIPWFVIYT----WLPTIAQTIGLEDALTASLM 288
++ R+ ++ F+ V+ +I+ ++ + + L+ + +
Sbjct: 235 IFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEI 294

Query: 289 LNALLIVGALLGLV-------LTHLLAHRKFLLGSFLLLAATLVVMACLPSGSSLTLLLF 341
+ ++ G + ++ L L L+ + + + L +S + +
Sbjct: 295 GSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTII 354

Query: 342 VLFSTTISAVSNLV 355
++F + + V
Sbjct: 355 IVFVLGGLSFTKTV 368


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_2747  DHBDHDRGNASE  105  2e-29  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 105 bits (264), Expect = 2e-29
Identities = 73/257 (28%), Positives = 117/257 (45%), Gaps = 11/257 (4%)

Query: 11 MDFFSLKGKTAIVTGGNSGLGQAFAMALAKAGANVFIPSFVKDNGETKEMIEK-QGVEVD 69
M+ ++GK A +TG G+G+A A LA GA++ + + E K + +
Sbjct: 1 MNAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAE 60

Query: 70 FMQVDITAEGAPQKIIAACCERFGTVDILVNNAGICKLNKVLDFGRADWDPMIDVNLTAA 129
D+ A +I A G +DILVN AG+ + + +W+ VN T
Sbjct: 61 AFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGV 120

Query: 130 FELSYEAAKIMIPQKSGKIINICSLFSYLGGQWSPAYSATKHALAGFTKAYCDELGQYNI 189
F S +K M+ ++SG I+ + S + + AY+++K A FTK EL +YNI
Sbjct: 121 FNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNI 180

Query: 190 QVNGIAPGYYATDI--TLATRSNPETNQRVLDH-------IPANRWGDTQDLMGAAVFLA 240
+ N ++PG TD+ +L N Q + IP + D+ A +FL
Sbjct: 181 RCNIVSPGSTETDMQWSLWADENGAE-QVIKGSLETFKTGIPLKKLAKPSDIADAVLFLV 239

Query: 241 SPASNYVNGHLLVVDGG 257
S + ++ H L VDGG
Sbjct: 240 SGQAGHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_2748  TCRTETA  33  0.002  Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 32.9 bits (75), Expect = 0.002
Identities = 21/76 (27%), Positives = 34/76 (44%), Gaps = 1/76 (1%)

Query: 41 GFSNTEIGLIMSTFGIAAIIFYA-PSGVIADKFSHRKMITSAMIITGLLGLLMATYPPLW 99
+ T IG+ ++ FGI + A +G +A + R+ + MI G +L+A W
Sbjct: 242 HWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGW 301

Query: 100 VMLCIQVAFAITTILM 115
+ I V A I M
Sbjct: 302 MAFPIMVLLASGGIGM 317



Score = 30.9 bits (70), Expect = 0.008
Identities = 42/268 (15%), Positives = 94/268 (35%), Gaps = 30/268 (11%)

Query: 48 GLIMSTFGIAAIIFYAPSGVIADKFSHRKMITSAMIITGLLGLLMATYPPLWVMLCIQVA 107
G++++ + + G ++D+F R ++ ++ + +MAT P LWV+ ++
Sbjct: 46 GILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIV 105

Query: 108 FAITTILMLWSVSIKAASLLGD---HSEQGKIMGWMEGLRGVGVMSLAVFTMWVFSRFAP 164
IT + A + + D E+ + G+M G G+++ V
Sbjct: 106 AGITG-----ATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGP-----VLGGLMG 155

Query: 165 DDSASLKTVIIIYSVVYILLGILCWFFV-----SDNNNLRNTNNEEKQSFQLSDILAVLR 219
S + + L + F + + LR SF+ + + V+
Sbjct: 156 GFSPHAP--FFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVA 213

Query: 220 ISTTWYCSMVIFGVF--TIYAILSYST-NYLTEMYGMSLVAASYMGIVINKIFRALCGPL 276
+ M + G ++ I ++ G+SL A + + +A+ +
Sbjct: 214 ALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHS----LAQAM---I 266

Query: 277 GGIITTYSKVKSPTRVVQILSIIGLLAL 304
G + + + I G + L
Sbjct: 267 TGPVAARLGERRALMLGMIADGTGYILL 294


37  ECP_2796  ECP_2826  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
ECP_2796  -2      18       4.244491    murein transglycosylase A
ECP_2800  -1      22       4.469330    ***hypothetical protein
ECP_2801  0       26       6.657076    hypothetical protein
ECP_2802  -1      30       7.430724    hypothetical protein
ECP_2803  -2      26       5.637179    hypothetical protein
ECP_2804  -2      22       4.026943    OmpA family membrane protein
ECP_2805  -2      24       3.120674    hemolysin-coregulated protein Hcp
ECP_2806  -2      21       1.584740    CLPA/B-type chaperone protein
ECP_2807  0       18       -3.404470   VGR-related protein
ECP_2808  0       21       -5.118731   transmembrane protein
ECP_2809  -2      15       0.578774    hypothetical protein
ECP_2810  -2      18       2.919421    VGR-related protein
ECP_2811  0       18       0.509212    hypothetical protein
ECP_2812  0       18       1.170361    hypothetical protein
ECP_2813  -1      17       3.726414    hypothetical protein
ECP_2814  -2      17       3.923673    hypothetical protein
ECP_2815  -1      19       1.674726    hypothetical protein
ECP_2816  0       18       -1.013200   hypothetical protein
ECP_2817  -2      19       2.464579    hypothetical protein
ECP_2818  -2      21       4.612181    hypothetical protein
ECP_2819  -2      19       2.638901    hypothetical protein
ECP_2820  -2      19       0.213981    hypothetical protein
ECP_2821  2       23       -4.656908   hypothetical protein
ECP_2822  2       25       -5.685828   hypothetical protein
ECP_2823  1       27       -7.739010   hypothetical protein
ECP_2824  0       25       -8.278551   2-hydroxyacid dehydrogenase
ECP_2825  -1      19       -6.625826   posphosugar isomerase
ECP_2826  -1      17       -4.034721   aminotransferase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_2804  OMPADOMAIN  81  1e-18  OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 80.7 bits (199), Expect = 1e-18
Identities = 44/142 (30%), Positives = 63/142 (44%), Gaps = 14/142 (9%)

Query: 415 PEQKMEVTASLQVQTVRLDSMSLFDVGQARLKDGSTKVL---VDALVNIRAKPGWLILVA 471
+Q + L S LF+ +A LK L L N+ K G ++V
Sbjct: 200 VAPAPAPAPEVQTKHFTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDG-SVVVL 258

Query: 472 GYTDATGDEKSNQQLSLRRAEAVRNWMLQTSDIPATCFAVQGLGESQPAATNDTPQGR-- 529
GYTD G + NQ LS RRA++V ++ L + IPA + +G+GES P N +
Sbjct: 259 GYTDRIGSDAYNQGLSERRAQSVVDY-LISKGIPADKISARGMGESNPVTGNTCDNVKQR 317

Query: 530 -------AVNRRVEISLVPRSD 544
A +RRVEI + D
Sbjct: 318 AALIDCLAPDRRVEIEVKGIKD 339


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_2806  HTHFIS  35  0.001  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 35.2 bits (81), Expect = 0.001
Identities = 36/189 (19%), Positives = 66/189 (34%), Gaps = 34/189 (17%)

Query: 512 IMTLRQEGTDSTELQQQLRTHQGFAPLLALDVDARAVATVVADWTGI--------PLSSL 563
+ + ++ +L +++ + P+L + + + A G L+ L
Sbjct: 52 VTDVVMPDENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTEL 111

Query: 564 LKDEQSDLLSMEKSLENR---------VVGQSPALCAIAQRL-RAAKTGLTPENGPQGVF 613
+ L ++ +VG+S A+ I + L R +T LT
Sbjct: 112 IGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLT--------L 163

Query: 614 LLTGPSGTGKTETALTLADTLFGGEKSLITINLSEYQEPHTVSQLKGSPPGYVGYGQGGV 673
++TG SGTGK A L D + IN++ S+L G + G
Sbjct: 164 MITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGH--------EKGA 215

Query: 674 LTEAVRKRP 682
T A +
Sbjct: 216 FTGAQTRST 224


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_2808  PF04183  31  0.004  IucA / IucC family
		>PF04183#IucA / IucC family

Length = 580

Score = 31.4 bits (71), Expect = 0.004
Identities = 9/36 (25%), Positives = 19/36 (52%), Gaps = 3/36 (8%)

Query: 114 AEIITALEEYKKQYPHLAKRVEKISGYVDDIDKEVL 149
A ++ +Y K++P +++R S + I + VL
Sbjct: 511 AAVL---SDYMKKHPQMSERFALFSLFRPQIIRVVL 543


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_2814  PF00577  31  0.034  Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 31.0 bits (70), Expect = 0.034
Identities = 16/71 (22%), Positives = 24/71 (33%), Gaps = 6/71 (8%)

Query: 302 LRLAHTLAERGIAHWQSVL---KPLLAGGAFSSLRLRGLMFSPPLAAVPEAAPHAWLPSP 358
+ +T ER I +S L G F + RG + +P +P
Sbjct: 243 WQHINTWLERDIIPLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLP---DSQRGFAP 299

Query: 359 VWAGITGDNAR 369
V GI A+
Sbjct: 300 VIHGIARGTAQ 310


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_2821  ANTHRAXTOXNA  29  0.010  Anthrax toxin LF subunit signature.
		>ANTHRAXTOXNA#Anthrax toxin LF subunit signature.

Length = 800

Score = 29.3 bits (65), Expect = 0.010
Identities = 13/83 (15%), Positives = 35/83 (42%), Gaps = 9/83 (10%)

Query: 33 ESKSVASAVFYKQIKILHLDFFSR---------SALNTDAEDTPLSTMVHVWQLKTREDF 83
+ + V+Y+ K + LD S+ + + + ++D+ S ++ + K + +
Sbjct: 161 INSEQSKEVYYEIGKGISLDIISKDKSLDPEFLNLIKSLSDDSDSSDLLFSQKFKEKLEL 220

Query: 84 DKADYDTLFMQEEKTLEKDVLAK 106
+ D F++E T + +
Sbjct: 221 NNKSIDINFIKENLTEFQHAFSL 243


38  ECP_2959  ECP_3043  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
ECP_2959  -2      17       -3.882395   ornithine decarboxylase
ECP_2960  3       33       -7.703617   membrane protein YqgA
ECP_2962  7       41       -9.735780   *P4-type integrase
ECP_2963  8       48       -11.790274  hypothetical protein
ECP_2964  7       41       -10.181726  transposase-like protein
ECP_2965  5       25       -4.967836   hypothetical protein
ECP_2966  5       24       -2.744996   PixG protein
ECP_2967  5       24       -2.198635   PixF protein
ECP_2968  5       22       -1.078256   PixJ protein
ECP_2969  5       22       -0.268517   PixD protein
ECP_2970  5       20       -0.529444   fimbrial usher protein PixC
ECP_2971  4       24       -2.916589   PixH protein
ECP_2972  3       23       -2.906706   PixA protein
ECP_2973  2       21       -2.770117   hypothetical protein
ECP_2974  2       22       -3.756566   hypothetical protein
ECP_2975  1       24       -4.195807   transport activator
ECP_2976  1       25       -4.773434   regulatory protein
ECP_2977  2       25       -5.965753   regulatory protein
ECP_2978  2       28       -7.971819   transporter protein
ECP_2979  2       30       -8.287931   hypothetical protein
ECP_2980  1       28       -7.217947   hypothetical protein
ECP_2981  1       26       -6.950133   hypothetical protein
ECP_2982  1       26       -6.810555   L-fucose permease
ECP_2983  3       32       -6.988660   ribokinase sugar kinase
ECP_2984  6       26       -2.907532   DeoR family transcriptional regulator
ECP_2985  7       27       -2.486877   hypothetical protein
ECP_2986  7       27       -2.517968   hypothetical protein
ECP_2987  7       26       -1.893634   hypothetical protein
ECP_2988  6       27       -1.684696   hypothetical protein
ECP_2989  5       27       -0.611072   transposase (orfB)
ECP_2990  4       27       -0.460760   transposase
ECP_2991  4       25       0.016038    hypothetical protein
ECP_2992  4       24       -0.579781   hypothetical protein
ECP_2993  5       25       -1.620802   hypothetical protein
ECP_2994  4       25       -3.750775   hypothetical protein
ECP_2995  4       24       -3.321769   transposase for insertion sequence IS100
ECP_2996  5       23       -3.816979   transposase/IS protein
ECP_2997  5       23       -3.372804   hypothetical protein
ECP_2998  5       24       -2.530209   hemolysin expression modulating protein
ECP_2999  6       24       -2.195273   hypothetical protein
ECP_3000  7       28       0.964024    regulatory protein
ECP_3001  4       29       -4.111645   hypothetical protein
ECP_3002  5       42       -8.592972   hypothetical protein
ECP_3003  4       32       -5.683765   hypothetical protein
ECP_3004  6       23       -0.180876   hypothetical protein
ECP_3005  6       23       0.283963    hypothetical protein
ECP_3006  6       22       1.091955    hypothetical protein
ECP_3007  7       23       3.083509    hypothetical protein
ECP_3008  9       26       5.438969    hypothetical protein
ECP_3009  9       26       5.139736    autotransporter
ECP_3010  6       30       4.575890    hypothetical protein
ECP_3011  7       30       4.726232    hypothetical protein
ECP_3012  7       27       3.702571    hypothetical protein
ECP_3013  7       27       3.174591    radC-like protein YeeS
ECP_3014  7       27       0.748647    hypothetical protein
ECP_3015  4       20       -0.269326   hypothetical protein
ECP_3016  3       16       -1.394729   hypothetical protein
ECP_3017  1       12       -1.303542   hypothetical protein
ECP_3018  0       11       -0.851851   hypothetical protein
ECP_3019  -1      9        -0.457621   hypothetical protein
ECP_3020  0       8        -0.592043   polysialic acid capsule expression protein KpsF
ECP_3021  -1      9        -1.029375   capsule polysaccharide export inner-membrane
ECP_3022  -3      16       -3.815378   polysialic acid transport protein KpsD
ECP_3023  0       32       -7.554631   3-deoxy-manno-octulosonate cytidylyltransferase
ECP_3024  3       39       -10.257208  capsule polysaccharide export protein KpsC
ECP_3025  7       49       -13.994527  hypothetical protein
ECP_3026  7       52       -15.028048  hypothetical protein
ECP_3027  8       57       -16.347129  capsule polysaccharide export protein KpsS
ECP_3028  9       58       -17.494225  capsule export protein KpsC
ECP_3029  10      61       -20.053321  hypothetical protein
ECP_3030  8       62       -20.212162  hypothetical protein
ECP_3031  8       63       -19.422092  hypothetical protein
ECP_3032  6       57       -17.085091  hypothetical protein
ECP_3033  3       45       -12.342495  hypothetical protein
ECP_3034  2       33       -6.832184   glycosyltransferase
ECP_3035  0       25       -2.939862   polysialic acid transport ATP-binding protein
ECP_3036  0       21       0.310547    polysialic acid transport protein KpsM
ECP_3037  0       19       4.378767    general secretion pathway protein YghD
ECP_3038  0       18       4.255641    GspL-like protein
ECP_3039  0       20       4.697188    type II secretion protein GspK
ECP_3040  -1      19       5.225905    type II secretion protein GspJ
ECP_3041  -1      19       4.632635    type II secretion protein GspI
ECP_3042  -2      16       3.816531    type II secretion protein GspH
ECP_3043  -3      15       3.115055    type II secretion protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_2967  FIMBRIALPAPF  107  3e-32  Escherichia coli: P pili tip fibrillum papF protein...
		>FIMBRIALPAPF#Escherichia coli: P pili tip fibrillum papF protein signature.

Length = 167

Score = 107 bits (268), Expect = 3e-32
Identities = 67/172 (38%), Positives = 101/172 (58%), Gaps = 8/172 (4%)

Query: 1 MRITVFLLTFLSFLSDLWAVDIPINITGTIIIPPCQINNSNPVDVDFGNIRVSELDTKEH 60
+R+++F+ L+ ++ L D+ INI G + IPPC INN + VDFGNI +D
Sbjct: 2 IRLSLFISLLLTSVAVL--ADVQINIRGNVYIPPCTINNGQNIVVDFGNINPEHVDNSRG 59

Query: 61 IKVVSFPVYCPYHQGEAYVKMTGQSM-TGKDNVLATNIDGLGIELYQGGEGTGNHLILGS 119
+ + CPY G ++K+TG +M G++NVLATNI GI LYQ G+G L LG+
Sbjct: 60 EVTKNISISCPYKSGSLWIKVTGNTMGVGQNNVLATNITHFGIALYQ-GKGMSTPLTLGN 118

Query: 120 GSSGYGYEVINALSEKNVERTTFTFTAKIYKAEGVTINSGEFSASALINIVY 171
G SG GY V L + R+TFTFT+ ++ +N G+F +A ++++Y
Sbjct: 119 G-SGNGYRVTAGL---DTARSTFTFTSVPFRNGSGILNGGDFRTTASMSMIY 166


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_2969  cloacin  29  0.040  Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 28.5 bits (63), Expect = 0.040
Identities = 24/75 (32%), Positives = 32/75 (42%), Gaps = 5/75 (6%)

Query: 3 GGHPGTSGPGTTVAAALSSGEVTLYTPAI----VCISRQKNVKKQRAENMQKMKPALKKT 58
GG GT G + VAA ++ G L TP V IS + A+ M +K K
Sbjct: 72 GGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA-LSAAIADIMAALKGPFKFG 130

Query: 59 LMAVACLSAVPAAQA 73
L VA +P+ A
Sbjct: 131 LWGVALYGVLPSQIA 145


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_2970  PF00577  759  0.0  Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 759 bits (1961), Expect = 0.0
Identities = 248/885 (28%), Positives = 386/885 (43%), Gaps = 65/885 (7%)

Query: 15 LNRLHIMKKNKSTFTINFITYSLMLSLAGVPVYAVDFNTDVLDAADRQNIDFSRFSRAGY 74
LHI K + F + + A + + FN L + D SRF
Sbjct: 13 TQCLHIRKHRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQE 72

Query: 75 IMPGQYQMEIRVNGQDISPSAFQIAFLEPPFSDSDNEKPLPEPCLTPEIVSRMGLTEASQ 134
+ PG Y+++I +N +A + F+ D+E+ + PCLT ++ MGL AS
Sbjct: 73 LPPGTYRVDIYLNNG-------YMATRDVTFNTGDSEQGI-VPCLTRAQLASMGLNTASV 124

Query: 135 EKVTYWNNGQCADFRQL-SGVEIRPNPAEGMLYINMPQAWLEYSDASWLPPSRWDNGIPG 193
+ + C + + + + L + +PQA++ ++PP WD GI
Sbjct: 125 SGMNLLADDACVPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINA 184

Query: 194 LLFDYNINGTVNKPHQGKQSQSLNYNGTAGANFGAWRLRADYQGNLNHTTGSAQGTDSQF 253
L +YN +G + G S N +G N GAWRLR + + N + S+ G+ +++
Sbjct: 185 GLLNYNFSGNSVQNRIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSS-GSKNKW 243

Query: 254 TWSRFYMYRAIPRWRANLTLGENYINSEIFSSWRYTGASLESDDRMLPPKLRGYAPQVSG 313
++ R I R+ LTLG+ Y +IF + GA L SDD MLP RG+AP + G
Sbjct: 244 QHINTWLERDIIPLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHG 303

Query: 314 IADTNARVVISQQGRILYDSTVPAGPFTIQDLD-SSVRGRLDVEVIEQDGRKKTFQVDTA 372
IA A+V I Q G +Y+STVP GPFTI D+ + G L V + E DG + F V +
Sbjct: 304 IARGTAQVTIKQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYS 363

Query: 373 YVPYLTRPGQVRYKLVSGRSRTYEHTMEGPVFAAGEASWGISNTWSLYGGSIVAGDYNAL 432
VP L R G RY + +G R+ E P F G+ W++YGG+ +A Y A
Sbjct: 364 SVPLLQREGHTRYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAF 423

Query: 433 AVGLGRDLSKFGTVSADVTQSVARIPGYDTKQGKSWRLSYSKRFDEVNTDITFAGYRFSE 492
G+G+++ G +S D+TQ+ + +P G+S R Y+K +E T+I GYR+S
Sbjct: 424 NFGIGKNMGALGALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYST 483

Query: 493 RNYMTMDQYLNARYR--------------------NDFTGREKELYTVTLNKNFEDWKAS 532
Y +R + ++ +T+ + ++
Sbjct: 484 SGYFNFADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRT-ST 542

Query: 533 VNLQYSHQTYWDRRTSD-YYTLSVNRYFDAFSFKNIALGISASRSKYLNRD--NDSAFVR 589
+ L SHQTYW D + +N +F++I +S S +K + + +
Sbjct: 543 LYLSGSHQTYWGTSNVDEQFQAGLNT-----AFEDINWTLSYSLTKNAWQKGRDQMLALN 597

Query: 590 LSVPWGT------------GTASYSGSMSND-RYTNTVGYSDTL-NNGLSSYSLNAGVNS 635
+++P+ +ASYS S + R TN G TL + SYS+ G
Sbjct: 598 VNIPFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAG 657

Query: 636 GGGQPSQRQMSAYYNHNGSLTNLSASFSAVENGYSSFGMSASGGATVTMKGAALHAGGMN 695
GG S A N+ G N + +S + SGG G L G
Sbjct: 658 GGDGNSGSTGYATLNYRGGYGNANIGYSH-SDDIKQLYYGVSGGVLAHANGVTL--GQPL 714

Query: 696 GGTRLLVDTDGVGGVPVDGGR-VYTNRWGIGVVTDVSSYYRNTTSVDLNKLPEDMEATRS 754
T +LV G V+ V T+ G V+ + Y N ++D N L ++++ +
Sbjct: 715 NDTVVLVKAPGAKDAKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNA 774

Query: 755 VVESVLTEGAIGYREFEVLKGSRLFAVLRMSDNSYPPFGASVTNAKGRELGMVADSGLAW 814
V V T GAI EF+ G +L L +N PFGA VT+ + G+VAD+G +
Sbjct: 775 VANVVPTRGAIVRAEFKARVGIKLLMTLTH-NNKPLPFGAMVTSESSQSSGIVADNGQVY 833

Query: 815 LSGVNPGETLNVGW--DGRTQCVVDIPAHPDPAQQLL----LPCR 853
LSG+ + V W + CV + P+ QQLL CR
Sbjct: 834 LSGMPLAGKVQVKWGEEENAHCVANYQLPPESQQQLLTQLSAECR 878


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_2971  FIMBRIALPAPE  33     3e-04    Escherichia coli: P pili tip fibrillum papE protein...
		>FIMBRIALPAPE#Escherichia coli: P pili tip fibrillum papE protein signature.

Length = 173

Score = 33.5 bits (76), Expect = 3e-04
Identities = 24/86 (27%), Positives = 39/86 (45%), Gaps = 9/86 (10%)

Query: 29 GMTLPEYWG----EEHVWWDGRASFKGQVIAPACTLSMEDAWQEIDMGTTPLRDLQNSPA 84
G+ LP G +HV +FKG++I PACT+ E++ G +++L S
Sbjct: 6 GLCLPVMLGAVLMSQHVHAADNLTFKGKLIIPACTVQN----AEVNWGDIEIQNLVQS-G 60

Query: 85 GPEKKFRLRLRNCELTGAGKQVYTAT 110
G +K F + + G K T+
Sbjct: 61 GNQKDFTVDMNCPYSLGTMKVTITSN 86


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
ECP_2975  HTHFIS  240    1e-76    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 240 bits (614), Expect = 1e-76
Identities = 112/479 (23%), Positives = 188/479 (39%), Gaps = 83/479 (17%)

Query: 10 SILLIDDDADVLDAYTQLLEQSGYRVFACNNPFEAQAWIQPDWPGIVLSDVCMPGCSGID 69
+IL+ DDDA + Q L ++GY V +N WI +V++DV MP + D
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 70 LMMLFHQDDQQLPILLITGHGDVPMAVDAVKKGAWDFLQKPVDPGKLLSLVEEALRQRQS 129
L+ + LP+L+++ A+ A +KGA+D+L KP D +L+ ++ AL + +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 130 IIARRQYCQQTLQVELIGRSEWINQYRRRLQQLSETDIAVWLYGAPGTGRMTGARYLHQF 189
++ + Q L+GRS + + R L +L +TD+ + + G GTG+ AR LH +
Sbjct: 125 RPSKLEDDSQDGM-PLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDY 183

Query: 190 GRNAQGEFVYRELTPDNAPQLND------------------------FIALAQGGTLVLS 225
G+ G FV N + A+GGTL L
Sbjct: 184 GKRRNGPFV-----AINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLD 238

Query: 226 HPEHLTREQQYHLVQ-LQSQEHRP----------FRLIGIGDTSLVELAASNHIIAELYY 274
+ + Q L++ LQ E+ R++ + L + +LYY
Sbjct: 239 EIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYY 298

Query: 275 CFAMTQIACLPLTQRPDDIEPLFRHYLCKACQRLNHPVPEVGKEMLKEMMRRMWPNNVRE 334
+ + PL R +DI L RH++ +A + V +E L+ M WP NVRE
Sbjct: 299 RLNVVPLRLPPLRDRAEDIPDLVRHFVQQAE-KEGLDVKRFDQEALELMKAHPWPGNVRE 357

Query: 335 LANAAE-----------------LFTVGILPLAETANPLMHVGT---------------- 361
L N +P + G+
Sbjct: 358 LENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFA 417

Query: 362 --------PAPLDRRVEDAERQIITEALNIHQGRINEVAEYLQIPRKKLYLRMKKYGLS 412
DR + + E +I AL +G + A+ L + R L ++++ G+S
Sbjct: 418 SFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVS 476


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_2978  TCRTETA  39     2e-05    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 39.4 bits (92), Expect = 2e-05
Identities = 66/387 (17%), Positives = 131/387 (33%), Gaps = 39/387 (10%)

Query: 52 TPYLKEQLDLSATQI---GVLSSCMLIAYGISKGVMSSLADKASPKVFMACGLVLCAIVN 108
P L L S G+L + + V+ +L+D+ + + L A+
Sbjct: 28 LPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDY 87

Query: 109 VGLGFSTAFWIFAVLVILNGLFQGMGVGPSFITIANWFPRRERGRVGAFWNISHNVGGGI 168
+ + W+ + I+ G+ G IA+ ER R F +S G G+
Sbjct: 88 AIMATAPFLWVLYIGRIVAGITGATGAVAG-AYIADITDGDERAR--HFGFMSACFGFGM 144

Query: 169 VA-PIVGAAFALLGSEHWQSASYIVPACVAIVFAVIVLILGKGSPRQEGLPSLEEMMPEE 227
VA P++G A + A + + + L +PE
Sbjct: 145 VAGPVLGGLMGGFSPH----APFFAAAALNGLNFLTGCFL----------------LPE- 183

Query: 228 KVVLNTRQTVKAPENMSAFQIFCTYVLRNKNAWYVSLVDVFVYMVRFGMISWLPIYLLTV 287
+ + + P A ++ +L+ VF M G + +
Sbjct: 184 -----SHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGE 238

Query: 288 KHFSKEQMSVAFLFFEWA---AIPSTLLAGWLSDKLFKGRRMPLAMICMALIFICLIGYW 344
F + ++ + ++ ++ G ++ +L + R + L MI +I L
Sbjct: 239 DRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFAT 298

Query: 345 KSESLFMVTIFAAIVGCLIYVPQFLASVQTMEIVPSFAVGSAVGLRGFMSYIFGASLGTS 404
+ F + + A G + Q + S Q E GS L ++ I G L T+
Sbjct: 299 RGWMAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTS-LTSIVGPLLFTA 357

Query: 405 LFGIMVDHIGWHGGFYLLGCGIICCII 431
++ + W+G ++ G + +
Sbjct: 358 IYAASITT--WNGWAWIAGAALYLLCL 382


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_2995  HTHTETR  28     0.044    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 28.1 bits (62), Expect = 0.044
Identities = 12/106 (11%), Positives = 31/106 (29%), Gaps = 14/106 (13%)

Query: 16 QGMSSRAIARELGISRNTVKRYLQAKSEPPKYTPRPAVASLLDEYRDYIRQRIAD----- 70
S IA+ G++R + + + KS+ + + + I + +
Sbjct: 30 SSTSLGEIAKAAGVTRGAIYWHFKDKSD--------LFSEIWELSESNIGELELEYQAKF 81

Query: 71 -AHPYKIPATVIAREIRDQGYRGGMTILRAFIRSLSVPQEQEPAVR 115
P + ++ + +L I + V+
Sbjct: 82 PGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQ 127


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
ECP_3006  INVEPROTEIN  29     0.018    Salmonella/Shigella invasion protein E (InvE) signat...
		>INVEPROTEIN#Salmonella/Shigella invasion protein E (InvE) signature.

Length = 372

Score = 28.5 bits (63), Expect = 0.018
Identities = 28/99 (28%), Positives = 42/99 (42%), Gaps = 7/99 (7%)

Query: 57 ITQSDLEQLEATSLESITKTISEL---KSLKTNKNSTQEEILDLEKKRKEMELLVKKASM 113
+ + DLE++ LES+ K + E K+LK N + L + + LL +AS
Sbjct: 133 LRRKDLEEIVRKKLESLLKHVEEQTDPKTLKAGINCALKARLFGKTLSLKPGLL--RASY 190

Query: 114 ALFFREQLNYHERRILSEIKGSETLNHSLSEIKEIKGKL 152
F Q HE I S+ S L + I+G L
Sbjct: 191 RQFI--QSESHEVEIYSDWIASYGYQRRLVVLDFIEGSL 227


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_3040  BCTERIALGSPG  28     0.026    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 27.5 bits (61), Expect = 0.026
Identities = 16/48 (33%), Positives = 22/48 (45%), Gaps = 3/48 (6%)

Query: 1 MRRAS--AGFTLLEMLVAIAIFASLA-LMAQQVTNGVTRVNSAVAGHD 45
MR GFTLLE++V I I LA L+ + + + A D
Sbjct: 1 MRATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSD 48


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_3041  BCTERIALGSPH  34     8e-05    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 33.8 bits (77), Expect = 8e-05
Identities = 13/24 (54%), Positives = 18/24 (75%)

Query: 2 KRGFTLLEVMLALAIFALAATAVL 25
+RGFTLLE+ML L + ++A VL
Sbjct: 3 QRGFTLLEMMLILLLMGVSAGMVL 26


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_3042  BCTERIALGSPH  77     5e-20    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 76.5 bits (188), Expect = 5e-20
Identities = 42/196 (21%), Positives = 70/196 (35%), Gaps = 41/196 (20%)

Query: 1 MPERGFTLLEIMLVIFLIGLASAGVVQTFATDSESPAKKAAQDFLTRFAQFKDRAVIEGQ 60
M +RGFTLLE+ML++ L+G+++ V+ F + A + F + + R + GQ
Sbjct: 1 MRQRGFTLLEMMLILLLMGVSAGMVLLAFPASRDDSAAQTLARFEAQLRFVQQRGLQTGQ 60

Query: 61 TLGVLIDPPGYQFMQRRQGQWLPVSATRLSAQVTVPKQVQMLLQPGSDIWQKEYALELQR 120
GV + P +QF+ + P D W L L+
Sbjct: 61 FFGVSVHPDRWQFLVLEARDGADPA-------------------PADDGWSGYRWLPLRA 101

Query: 121 RRL----TLHDIELEL-----QKEAKKKTPQIRFSPFEPATPFTLRFYSAAQNACWAVKL 171
R+ ++ +L L + P + P TPF L L
Sbjct: 102 GRVATSGSIAGGKLNLAFAQGEAWTPGDNPDVLIFPGGEMTPFRLT-------------L 148

Query: 172 AHDGALSLNQCDERMP 187
++ N E +P
Sbjct: 149 GEAPGIAFNARGESLP 164


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_3043  BCTERIALGSPG  217    3e-76    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 217 bits (554), Expect = 3e-76
Identities = 90/146 (61%), Positives = 109/146 (74%), Gaps = 3/146 (2%)

Query: 6 RTQKPRAGFTLLEVMVVIVILGVLASLVVPNLLGNKEKADRQKAISDIVALENALDMYRL 65
R + GFTLLE+MVVIVI+GVLASLVVPNL+GNKEKAD+QKA+SDIVALENALDMY+L
Sbjct: 2 RATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKL 61

Query: 66 DNGRYPTTEQGLEALIQQPANMADSRNYRTGGYIKRLPKDPWGNDYQYLSPGEKGLFDVY 125
DN YPTT QGLE+L++ P + NY GYIKRLP DPWGNDY ++PGE G +D+
Sbjct: 62 DNHHYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDLL 121

Query: 126 TLGADGQENGEGAGADIGNWNLQEFQ 151
+ G DG+ E DI NW L + +
Sbjct: 122 SAGPDGEMGTED---DITNWGLSKKK 144


39   ECP_3184   ECP_3192   Y   Y   N   Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
ECP_3184  -1      15       -3.778721   hexuronate transporter
ECP_3185  -2      17       -4.891647   minor pilin protein
ECP_3186  -2      16       -4.635736   usher protein
ECP_3187  -1      15       -4.280179   hypothetical protein
ECP_3188  0       14       -3.583941   CS1 type fimbrial major subunit
ECP_3189  -1      15       -2.854163   fimbrial protein
ECP_3190  0       13       0.204665    DNA-binding transcriptional repressor ExuR
ECP_3191  2       17       1.099301    DedA family membrane protein
ECP_3192  2       21       0.824117    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_3184  TCRTETA  41     6e-06    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 41.0 bits (96), Expect = 6e-06
Identities = 59/329 (17%), Positives = 107/329 (32%), Gaps = 37/329 (11%)

Query: 34 PTLMEELNISTQQ---YSYIIAAYSAAYTVMQPVAGYVLDVLGTK----IGYAMFAVLWA 86
P L+ +L S Y ++A Y+ PV G + D G + + A AV +A
Sbjct: 29 PGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYA 88

Query: 87 VFCGATALAGSWGGLAVA--RGAVGAAEAAMIPAGLKASSEWFPAKERSIAVGYFNVGSS 144
+ A L + G VA GA GA A I ++ ER+ G+ +
Sbjct: 89 IMATAPFLWVLYIGRIVAGITGATGAVAGAYI-------ADITDGDERARHFGFMSACFG 141

Query: 145 IGAMIAPPLVVWAIVMHSWQMAFIISGALSFIWAMAWLIFYKHPRDQKHLTDEERDYIIN 204
G M+A P++ + S F + AL+ + + K R +N
Sbjct: 142 FG-MVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPES--HKGERRPLRREALN 198

Query: 205 GQEAQHQVDTAKKMSVGQILRNRQFWGIALPRFLAEPAWGTFNAWIPLFMFKVYGFNLKE 264
+ ++ + F+ + L + W +F + ++
Sbjct: 199 PLASFRWARGMTVVAALMAV----FFIMQLVGQVPAALWV-------IFGEDRFHWDATT 247

Query: 265 IAMFAWMPMLFADLGCILGGYLPPLFQRWFGVNLIVSRKMVV-TLGAVLMIGPGMIGLFT 323
I + F L + + G + M+ G +L+ +
Sbjct: 248 IGI---SLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMA- 303

Query: 324 NPYVAIMLLCIGGFAHQALSGALITLSSD 352
+ ++LL GG AL L +
Sbjct: 304 --FPIMVLLASGGIGMPALQAMLSRQVDE 330


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_3186  PF00577  74     1e-15    Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 74.1 bits (182), Expect = 1e-15
Identities = 72/433 (16%), Positives = 139/433 (32%), Gaps = 30/433 (6%)

Query: 185 NSRVDAYRNEQLLGSFYLNSGSQFIDTSSFPPGSYSVALKVYENNQLTRTELVPFTKTGG 244
++V +N + + + G I+ S + + + E + T+ VP++
Sbjct: 308 TAQVTIKQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPL 367

Query: 245 LT-DGNAQWFLQAGKTTSQVS-DDESSAYQLGVRLPLHPQYELYAGLANADDVSAFELGN 302
L +G+ ++ + AG+ S + ++ +Q + L + +Y G AD AF G
Sbjct: 368 LQREGHTRYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGI 427

Query: 303 NWTADLGGAGNLAISASVFRNDDGGKGDMQQANWSH-PGWPTLGF------YRTNSDGDA 355
GA ++ ++ + D + D Q + + G YR ++ G
Sbjct: 428 GKNMGALGALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYF 487

Query: 356 CTTDNRESYNALSCYES--ISATVSQNFVGWNMMLGYTRTQNNTDDSLRWDKQQSFENNY 413
D S E+ V F + + R + + + + + +
Sbjct: 488 NFADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSG 547

Query: 414 LRQT--SAQSISETVQLSASRAFVMRDWILSTSLGVFHRNDNGGDNDDNGLYLSFS--LS 469
QT ++ E Q + AF ++ +L + D L L+ + S
Sbjct: 548 SHQTYWGTSNVDEQFQAGLNTAFED----INWTLSYSLTKNAWQKGRDQMLALNVNIPFS 603

Query: 470 DTPTMDSNNNSHSTNVSTDYRYSDQDGDQTSWQLSHTFYNDSFSHKEL--GVTVGGLNTD 527
DS + + S + + T D+ + G GG
Sbjct: 604 HWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNS 663

Query: 528 TINSAVNGRWDGQYGNVYATVSDSYDRQNHDHLSAFTGTYSSTLAVSRYGINVGASGSDD 587
+ G YGN S S D + S + G+ +G +D
Sbjct: 664 GSTGYATLNYRGGYGNANIGYSHSDDIKQ------LYYGVSGGVLAHANGVTLGQPLND- 716

Query: 588 LLGAVLVDVKGFS 600
VLV G
Sbjct: 717 --TVVLVKAPGAK 727



Score = 31.0 bits (70), Expect = 0.025
Identities = 40/222 (18%), Positives = 68/222 (30%), Gaps = 35/222 (15%)

Query: 199 SFYLNSGSQFIDTSSF------PPGSYSVALKVYENNQLTRTELVPFTKTGGLTDGNAQW 252
F + D S F PPG+Y V + + NN T V F
Sbjct: 52 RFLADDPQAVADLSRFENGQELPPGTYRVDIYL--NNGYMATRDVTFNTGDSEQG----- 104

Query: 253 FLQAGKTTSQVSDDESSAYQLGVRLPLHPQYELYAGLANADDVSAFELGNNWTADLG-GA 311
+ T +Q++ +G+ L A A S D+G
Sbjct: 105 -IVPCLTRAQLA-------SMGLNTASVSGMNLLADDACVPLTSMIH-DATAQLDVGQQR 155

Query: 312 GNLAISASVFRNDDGGKGDMQQANWSHPGWPTLGFYRTNSDGDACTTDNRESYNALSCYE 371
NL I + N +G + W L Y + + NR N+ Y
Sbjct: 156 LNLTIPQAFMSNRA--RGYIPPELWDPGINAGLLNYNFS----GNSVQNRIGGNSHYAYL 209

Query: 372 SISATVSQNFVGW----NMMLGYTRTQNNTDDSLRWDKQQSF 409
++ + + N W N Y + +++ +W ++
Sbjct: 210 NLQSGL--NIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTW 249


40   ECP_3235   ECP_3256   Y   Y   N   Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
ECP_3235  0       14       3.078316    hypothetical protein
ECP_3236  -1      17       2.566435    hypothetical protein
ECP_3237  0       18       2.488681    DnaA initiator-associating protein DiaA
ECP_3238  0       19       3.185943    hypothetical protein
ECP_3239  -1      20       3.371211    hypothetical protein
ECP_3240  -1      19       1.816235    hypothetical protein
ECP_3241  0       20       2.600549    hypothetical protein
ECP_3242  1       19       3.530521    hypothetical protein
ECP_3243  -1      20       4.156410    GIY-YIG nuclease superfamily protein
ECP_3244  -2      20       3.742504    acetyltransferase YhbS
ECP_3245  -2      19       3.911560    hypothetical protein
ECP_3246  -2      16       3.343108    protease YhbU
ECP_3247  0       23       2.587932    peptidase YhbV
ECP_3248  1       27       2.314703    hypothetical protein
ECP_3249  2       29       1.735590    tryptophan permease
ECP_3250  4       33       1.658588    ATP-dependent RNA helicase DeaD
ECP_3251  4       33       0.949027    lipoprotein NlpI
ECP_3252  5       37       1.420394    polynucleotide phosphorylase
ECP_3253  5       32       1.042684    30S ribosomal protein S15
ECP_3254  4       29       1.022259    tRNA pseudouridine synthase B
ECP_3255  3       25       0.442857    ribosome-binding factor A
ECP_3256  2       21       -0.791871   translation initiation factor IF-2
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
ECP_3237  RTXTOXINA  28     0.031    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 28.0 bits (62), Expect = 0.031
Identities = 26/111 (23%), Positives = 44/111 (39%), Gaps = 22/111 (19%)

Query: 42 NKILCCGNGTSAANAQHFAASMINRFETERPSLPAIALNTDNVVLTAIA-------NDRL 94
K+L GN + A T + IA + V AI+ D+
Sbjct: 277 TKVL--GNVGKGISQYIIAQRAAQGLSTSAAAAGLIA----SAVTLAISPLSFLSIADKF 330

Query: 95 HD----EVYAKQVRALGHAGDVLLAISTRGNSRDIVKAVEAAVTRDMTIVA 141
E Y+++ + LG+ GD LLA + A++A++T T++A
Sbjct: 331 KRANKIEEYSQRFKKLGYDGDSLLAAFHKETG-----AIDASLTTISTVLA 376


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_3240  NUCEPIMERASE  29     0.014    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 29.0 bits (65), Expect = 0.014
Identities = 8/22 (36%), Positives = 13/22 (59%)

Query: 4 VLITGATGLVGGHLLRMLINEP 25
L+TGA G +G H+ + L+
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAG 24


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
ECP_3256  TCRTETOQM  73     2e-15    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 73.4 bits (180), Expect = 2e-15
Identities = 70/313 (22%), Positives = 110/313 (35%), Gaps = 77/313 (24%)

Query: 396 IMGHVDHGKTSLLDYI-----RSTKVASGEAG-------------GITQHIGAYHVETEN 437
++ HVD GKT+L + + T++ S + G GIT G + EN
Sbjct: 8 VLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITSFQWEN 67

Query: 438 GMITFLDTPGHAAFTSMRARGAQATDIVVLVVAADDGVMPQTIEAIQHAKAAGVPVVVAV 497
+ +DTPGH F + R D +L+++A DGV QT + G+P + +
Sbjct: 68 TKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGIPTIFFI 127

Query: 498 NKIDKPEADPDRV----KNELSQYGI-----------------LPEEWG----------- 525
NKID+ D V K +LS + E+W
Sbjct: 128 NKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIEGNDDLLE 187

Query: 526 ---------------GESQFV---------HVSAKAGTGIDELLDAILLQAEVLELKAVR 561
ES H SAK GID L++ I +
Sbjct: 188 KYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNNIGIDNLIEVIT--NKFYSSTHRG 245

Query: 562 KGMASGAVIESFLDKGRGPVATVLVREGTLHKGDIVL-CGFEYGRVRAMRNELGQEVLEA 620
+ G V + + R +A + + G LH D V E ++ M + E+ +
Sbjct: 246 QSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKEKIKITEMYTSINGELCKI 305

Query: 621 GPSIPVEILGLSG 633
+ EI+ L
Sbjct: 306 DKAYSGEIVILQN 318


41   ECP_3299   ECP_3310   Y   Y   N   Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
ECP_3299  -2      15       3.112532    hypothetical protein
ECP_3302  -2      11       3.038842    glutamate synthase subunit beta
ECP_3303  -2      11       2.415081    hypothetical protein
ECP_3304  -2      13       3.360014    hypothetical protein
ECP_3305  -1      14       3.968496    N-acetylmannosamine kinase
ECP_3306  -1      17       3.299049    N-acetylmannosamine-6-phosphate 2-epimerase
ECP_3307  0       18       2.284847    sialic acid transporter
ECP_3308  3       22       1.460135    N-acetylneuraminate lyase
ECP_3309  4       28       1.100261    transcriptional regulator NanR
ECP_3310  2       19       0.560548    ClpXP protease specificity-enhancing factor
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_3307  TCRTETB  60     8e-12    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 59.9 bits (145), Expect = 8e-12
Identities = 64/406 (15%), Positives = 135/406 (33%), Gaps = 32/406 (7%)

Query: 30 LLDGFDFVLIALVLTEVQGEFGLTTVQAASLISAAFISRWFGGLMLGAMGDRYGRRLAMV 89
+ +++ + L ++ +F + +A ++ G + G + D+ G + ++
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 90 TSIVLFSAGTLACGFAPGYITMFI-ARLVIGMGMAGEYGSSATYVIESWPKHLRNKASGF 148
I++ G++ + ++ I AR + G G A V PK R KA G
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143

Query: 149 LISGFSVGAVVAAQVYSLVVPVWGWRALFFIGILPIIFALWLRKNIPEAEDWKEKHGGKA 208
+ S ++G V + ++ W L I ++ II +L K + +
Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVR--------- 194

Query: 209 PVRTMVDILYRGEHRIANIVMTLAAATALWFCFAGNLQNAAIVAVLGLLCAAIFISFMVQ 268
+G I I++ + IV+VL L IF+ + +
Sbjct: 195 ---------IKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFL---IFVKHIRK 242

Query: 269 STGK----RWPTGVMLMVVVLFAFLYSWPIQA---LLPTYLKTDLAYDPHTVANVLFFSG 321
T + M+ VL + + ++P +K + +V+ F G
Sbjct: 243 VTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPG 302

Query: 322 -FGAAVGCCVGGFLGDWLGTRK-AYVCSLLASQLLIIPVFAIGGANVWVLGLLLFFQQML 379
+ +GG L D G + S + F + W + +++ F
Sbjct: 303 TMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLET-TSWFMTIIIVFVLGG 361

Query: 380 GQGIAGILPKLIGGYFDTDQRAAGLGFTYNVGALGGALAPIIGALI 425
++ ++ + AG+ L I +
Sbjct: 362 LSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGL 407


42   ECP_3340   ECP_3354   Y   Y   N   Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
ECP_3340  0       13       -3.296880   acetyl-CoA carboxylase biotin carboxyl carrier
ECP_3341  0       14       -4.120569   acetyl-CoA carboxylase biotin carboxylase
ECP_3342  0       22       -5.231475   hypothetical protein
ECP_3343  0       23       -5.712786   ribokinase sugar kinase
ECP_3344  0       22       -5.633964   sugar uptake ABC transporter permease
ECP_3345  -1      20       -4.425948   sugar uptake ABC transporter ATP-binding
ECP_3346  -1      14       -1.890018   sugar uptake ABC transporter periplasmic
ECP_3347  -2      10       -0.061159   tagatose-bisphosphate aldolase
ECP_3348  -2      11       0.555684    DeoR family regulatory protein
ECP_3349  -2      12       0.932350    ribokinase sugar kinase
ECP_3350  -3      13       0.821526    hypothetical protein
ECP_3351  -3      13       0.220354    sodium/panthothenate symporter
ECP_3352  -2      15       -2.201322   50S ribosomal protein L11 methyltransferase
ECP_3353  -2      18       -4.656742   tRNA-dihydrouridine synthase B
ECP_3354  -2      18       -4.331716   DNA-binding protein Fis
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
ECP_3340  RTXTOXIND  27     0.026    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 27.5 bits (61), Expect = 0.026
Identities = 8/27 (29%), Positives = 16/27 (59%)

Query: 127 IEADKSGTVKAILVESGQPVEFDEPLV 153
I+ ++ VK I+V+ G+ V + L+
Sbjct: 99 IKPIENSIVKEIIVKEGESVRKGDVLL 125


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_3354  DNABINDNGFIS  157    3e-54    DNA-binding protein FIS signature.
		>DNABINDNGFIS#DNA-binding protein FIS signature.

Length = 98

Score = 157 bits (399), Expect = 3e-54
Identities = 98/98 (100%), Positives = 98/98 (100%)

Query: 1 MFEQRVNSDVLTVSTVNSQDQVTQKPLRDSVKQALKNYFAQLNGQDVNDLYELVLAEVEQ 60
MFEQRVNSDVLTVSTVNSQDQVTQKPLRDSVKQALKNYFAQLNGQDVNDLYELVLAEVEQ
Sbjct: 1 MFEQRVNSDVLTVSTVNSQDQVTQKPLRDSVKQALKNYFAQLNGQDVNDLYELVLAEVEQ 60

Query: 61 PLLDMVMQYTRGNQTRAALMMGINRGTLRKKLKKYGMN 98
PLLDMVMQYTRGNQTRAALMMGINRGTLRKKLKKYGMN
Sbjct: 61 PLLDMVMQYTRGNQTRAALMMGINRGTLRKKLKKYGMN 98


43   ECP_3504   ECP_3519   Y   Y   N   Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias      Product
ECP_3504  0       17       3.084204     transcriptional regulator MalT
ECP_3505  1       19       3.291930     RNA 3'-terminal-phosphate cyclase
ECP_3506  0       18       2.491543     protein RtcB
ECP_3507  0       14       1.841111     transcriptional regulatory protein RtcR
ECP_3508  -2      11       -0.018971    DNA-binding transcriptional repressor GlpR
ECP_3509  -1      13       -1.201839    intramembrane serine protease GlpG
ECP_3510  0       18       -4.359245    thiosulfate sulfurtransferase
ECP_3511  0       24       -5.897323    glycerol-3-phosphate dehydrogenase
ECP_3512  3       40       -10.063826   hypothetical protein
ECP_3513  3       43       -10.474465   hypothetical protein
ECP_3514  3       45       -10.650274   hypothetical protein
ECP_3515  3       47       -11.576594   fimbrial adhesin
ECP_3516  3       44       -10.562324   fimbrial chaperone
ECP_3517  1       24       -7.015920    minor fimbrial subunit
ECP_3518  0       19       -4.711591    minor fimbrial subunit
ECP_3519  0       14       -3.353302    outer membrane usher protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
ECP_3507  HTHFIS  220    7e-68    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 220 bits (563), Expect = 7e-68
Identities = 91/294 (30%), Positives = 145/294 (49%), Gaps = 15/294 (5%)

Query: 148 PDSPGAVTIIDLDLSRYNAIASRFAEERQQALDFLKSGIATRNSHFNRMIEQIEKVAIKS 207
D + II L+ S+ ++ Q + + R++ + + ++ ++
Sbjct: 106 FDLTELIGIIGRALAEPKRRPSKLEDDSQDGM-----PLVGRSAAMQEIYRVLARLM-QT 159

Query: 208 RAPILLNGPTGAGKSFLARRIFELKQARHQFSGAFVEVNCATLRGDTAMSTLFGHVKGAF 267
+++ G +G GK +AR + + + R+ G FV +N A + D S LFGH KGAF
Sbjct: 160 DLTLMITGESGTGKELVARALHDYGKRRN---GPFVAINMAAIPRDLIESELFGHEKGAF 216

Query: 268 TGARESREGLLRSANGGMLFLDEIGELGADEQAMLLKAIEEKTFYPFGSDRQVSSDFQLI 327
TGA+ G A GG LFLDEIG++ D Q LL+ +++ + G + SD +++
Sbjct: 217 TGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIV 276

Query: 328 AGTVRDLRQLVAEGKFREDLYARINLWTFTLPGLRQRQEDIEPNLDYEVERHASLTGDSV 387
A T +DL+Q + +G FREDLY R+N+ LP LR R EDI + + V++ D
Sbjct: 277 AATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLDVK 336

Query: 388 RFNTEARRAWLAFATSPQATWRGNFRELSASVTRMATFATSGRITLDTVEDEIN 441
RF+ EA A W GN REL V R+ IT + +E+E+
Sbjct: 337 RFDQEALELMKAHP------WPGNVRELENLVRRLTALYPQDVITREIIENELR 384


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_3508  ARGREPRESSOR  34     2e-04    Bacterial arginine repressor signature.
		>ARGREPRESSOR#Bacterial arginine repressor signature.

Length = 149

Score = 33.7 bits (77), Expect = 2e-04
Identities = 15/48 (31%), Positives = 26/48 (54%), Gaps = 5/48 (10%)

Query: 1 MKQTQRHNGIIELVKQQGYVSTEELV-----EHFSVSPQTIRRDLNEL 43
M + QRH I E++ + +ELV + ++V+ T+ RD+ EL
Sbjct: 1 MNKGQRHIKIREIITANEIETQDELVDILKKDGYNVTQATVSRDIKEL 48


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_3519  PF00577  884    0.0      Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 884 bits (2287), Expect = 0.0
Identities = 398/866 (45%), Positives = 568/866 (65%), Gaps = 28/866 (3%)

Query: 19 KRVVPLLLVIMPACSIA--------GMRFNPAFLSGDTEAVADLSRFEKGMTYLPGSYEV 70
R+ + + AC+ A + FNP FL+ D +AVADLSRFE G PG+Y V
Sbjct: 21 HRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPGTYRV 80

Query: 71 EVWVNDSPLLSRTVTFKADD-ENQLIPCLSLADLLSLGINKNALPEQALASSENSCLDLR 129
++++N+ + +R VTF D E ++PCL+ A L S+G+N ++ L + +++C+ L
Sbjct: 81 DIYLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLA-DDACVPLT 139

Query: 130 IWFPDVHYMPELDAQRLKLTFPQAIIKRDARGYIPPEQWDNGITAFLLNYDFSGN--NDR 187
D ++ QRL LT PQA + ARGYIPPE WD GI A LLNY+FSGN +R
Sbjct: 140 SMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQNR 199

Query: 188 GDYSSNNYYLNLRAGINIGAWRFRDYSTWSR-----GSNSAGKLEHISSTLQRVIIPFRS 242
+S+ YLNL++G+NIGAWR RD +TWS S S K +HI++ L+R IIP RS
Sbjct: 200 IGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRS 259

Query: 243 ELTLGDTWSSSDVFDSVSIRGIKLESDENMLPDSQSGFAPTVRGIAKSRAQVTIKQNGYV 302
LTLGD ++ D+FD ++ RG +L SD+NMLPDSQ GFAP + GIA+ AQVTIKQNGY
Sbjct: 260 RLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYD 319

Query: 303 IYQTYMPPGPFEISDLNPTSSAGDLEVTIKESDNSETVYTVPYAAVPILQREGHLKYSTT 362
IY + +PPGPF I+D+ ++GDL+VTIKE+D S ++TVPY++VP+LQREGH +YS T
Sbjct: 320 IYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSIT 379

Query: 363 VGQYRSNSYNQKSPYVFQGELIWGLPWDITAYGGAQFSEDYRALALGLGLNLGVFGATSF 422
G+YRS + Q+ P FQ L+ GLP T YGG Q ++ YRA G+G N+G GA S
Sbjct: 380 AGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGALSV 439

Query: 423 DVTQANSSLVDGSKHQGQSYRFLYSKSLVQTGTAFHIIGYRYSTQGFYTLSDTTYQQMSG 482
D+TQANS+L D S+H GQS RFLY+KSL ++GT ++GYRYST G++ +DTTY +M+G
Sbjct: 440 DMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNG 499

Query: 483 TVVDPKTLDDKDYVYNWNDFYNLRYSKRGKFQASVSQPFGNYGSMYLSASQQTYWNTDKK 542
++ + + + D+YNL Y+KRGK Q +V+Q G ++YLS S QTYW T
Sbjct: 500 YNIETQDGVIQVKPK-FTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQTYWGTSNV 558

Query: 543 DSLYQVGYNTSIKGIYLNVAWNYSKSPGTN-ADKIVSLNVSLPISNWLSSTNDGRSSSNA 601
D +Q G NT+ + I ++++ +K+ D++++LNV++P S+WL S D +S
Sbjct: 559 DEQFQAGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRS--DSKSQWRH 616

Query: 602 MTATYGYSQDNHGQVNQYTGVSGSLLEQHNLSYNIQHGFANQDNSSSGSVG---VNYRGA 658
+A+Y S D +G++ GV G+LLE +NLSY++Q G+A + +SGS G +NYRG
Sbjct: 617 ASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGG 676

Query: 659 YGSLNSAYSYDNEGNQQINYGISGALVVHENGLTLSQPLGETNVLIKAPGANNVDVQRGT 718
YG+ N YS+ + +Q+ YG+SG ++ H NG+TL QPL +T VL+KAPGA + V+ T
Sbjct: 677 YGNANIGYSHSD-DIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDAKVENQT 735

Query: 719 GISTDWRGYAVVPYATEYRRNNISLDPMSMNMHTELDITSTEVIPGKGALVRAEFAAHIG 778
G+ TDWRGYAV+PYATEYR N ++LD ++ + +LD V+P +GA+VRAEF A +G
Sbjct: 736 GVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKARVG 795

Query: 779 IRGLFTVRYRNKSVPFGATASAQIKNSSQITGIVGDNGQLYLSGLPLEGVINIQWGDGVQ 838
I+ L T+ + NK +PFGA +++ SSQ +GIV DNGQ+YLSG+PL G + ++WG+
Sbjct: 796 IKLLMTLTHNNKPLPFGAMVTSE---SSQSSGIVADNGQVYLSGMPLAGKVQVKWGEEEN 852

Query: 839 QKCQANYKLPETELDNPVSYATLECR 864
C ANY+LP ++ + ECR
Sbjct: 853 AHCVANYQLPPESQQQLLTQLSAECR 878


44   ECP_3532   ECP_3558   Y   Y   N   Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
ECP_3532  0       15       -3.673239   gluconate kinase
ECP_3533  1       18       -5.652900   GalR/LacI family gluconate utilization operon
ECP_3534  2       23       -7.795719   pirin-related protein
ECP_3535  2       19       -5.613878   dehydrogenase
ECP_3536  1       22       -6.201194   acetyltransferase YhhY
ECP_3537  1       19       -5.160029   hypothetical protein
ECP_3538  0       13       0.353542    hypothetical protein
ECP_3539  -1      16       2.515318    hypothetical protein
ECP_3540  -2      19       3.199883    gamma-glutamyltranspeptidase
ECP_3541  -2      22       3.127711    hypothetical protein
ECP_3542  -1      24       3.626271    glycerophosphodiester phosphodiesterase
ECP_3543  -2      25       3.390111    glycerol-3-phosphate transporter ATP-binding
ECP_3544  -1      26       3.361580    glycerol-3-phosphate transporter membrane
ECP_3545  -2      26       3.846106    glycerol-3-phosphate transporter permease
ECP_3546  -2      24       3.504165    glycerol-3-phosphate transporter periplasmic
ECP_3547  -3      23       4.032024    leucine/isoleucine/valine transporter
ECP_3548  -3      23       3.679456    leucine/isoleucine/valine transporter
ECP_3549  -3      24       3.590714    leucine/isoleucine/valine transporter permease
ECP_3550  -3      22       2.794356    branched-chain amino acid transporter permease
ECP_3551  -1      21       2.623029    leucine-specific binding protein
ECP_3552  1       20       2.559770    hypothetical protein
ECP_3553  1       18       2.521996    Leu/Ile/Val-binding protein precursor
ECP_3554  1       16       1.868792    RNA polymerase factor sigma-32
ECP_3555  2       14       1.650018    cell division protein FtsX
ECP_3556  3       13       2.139618    cell division protein FtsE
ECP_3557  2       13       3.509929    cell division protein FtsY
ECP_3558  0       15       3.047391    16S rRNA m(2)G966-methyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_3536  SACTRNSFRASE  36     3e-05    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 35.7 bits (82), Expect = 3e-05
Identities = 21/92 (22%), Positives = 32/92 (34%), Gaps = 16/92 (17%)

Query: 55 VACIDGIVVGHLTIDVQQRPRRSHVADFGICVDSRWKNRGVASALMREMIE------MCD 108
+ ++ +G + I + + D + D R K GV +AL+ + IE C
Sbjct: 69 LYYLENNCIGRIKIR-SNWNGYALIEDIAVAKDYRKK--GVGTALLHKAIEWAKENHFCG 125

Query: 109 NWLRVDRIELTVFVDNAPAIKVYKKFGFEIEG 140
L I N A Y K F I
Sbjct: 126 LMLETQDI-------NISACHFYAKHHFIIGA 150


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
ECP_3540  NAFLGMOTY  32     0.005    Sodium-type flagellar protein MotY precursor signature.
		>NAFLGMOTY#Sodium-type flagellar protein MotY precursor signature.

Length = 293

Score = 32.0 bits (72), Expect = 0.005
Identities = 27/80 (33%), Positives = 36/80 (45%), Gaps = 13/80 (16%)

Query: 272 RTPISGDYRGYQVYSMPPPSSGGIHIVQILNILENFDMQKYGF-GSADAMQIMAEAEKYA 330
R P+ G+ R + SMPPP G H +I N+ F Q G+ G A I++E EK
Sbjct: 77 RRPM-GETRNVSLISMPPPWRPGEHADRITNL--KFFKQFDGYVGGQTAWGILSELEKGR 133

Query: 331 YADRSEYLGDPDFVKVPWQA 350
Y P F WQ+
Sbjct: 134 Y---------PTFSYQDWQS 144


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_3543  PF05272  29     0.041    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.9 bits (64), Expect = 0.041
Identities = 10/29 (34%), Positives = 16/29 (55%)

Query: 33 IVMVGPSGCGKSTLLRMVAGLERVTTGDI 61
+V+ G G GKSTL+ + GL+ +
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHF 627


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
ECP_3546  MALTOSEBP  39     2e-05    Maltose binding protein signature.
		>MALTOSEBP#Maltose binding protein signature.

Length = 396

Score = 39.3 bits (91), Expect = 2e-05
Identities = 39/160 (24%), Positives = 66/160 (41%), Gaps = 14/160 (8%)

Query: 134 GHLLSQPFNSSTPVLYYNKDAFKKAGLDPEQPPKTWQDLADYAAKLKASGMKCGYASGWQ 193
G L++ P L YNKD PPKTW+++ +LKA G + +
Sbjct: 127 GKLIAYPIAVEALSLIYNKDLLP-------NPPKTWEEIPALDKELKAKGKSALMFNLQE 179

Query: 194 GWIQLENFSAWNGLPFASKNNGFDGTDAVLEF--NKPEQVKHIAMLEEMNKKGDFSYVGR 251
+ +A G F +N +D D ++ K + +++ + D Y
Sbjct: 180 PYFTWPLIAADGGYAFKYENGKYDIKDVGVDNAGAKAGLTFLVDLIKNKHMNADTDY--- 236

Query: 252 KDESTEKFYNGDCAMTTASSGSLANIREYAKFNYGVGMMP 291
+ F G+ AMT + +NI + +K NYGV ++P
Sbjct: 237 -SIAEAAFNKGETAMTINGPWAWSNI-DTSKVNYGVTVLP 274


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
ECP_3557  IGASERPTASE  50     2e-08    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 49.7 bits (118), Expect = 2e-08
Identities = 42/208 (20%), Positives = 72/208 (34%), Gaps = 21/208 (10%)

Query: 20 QTPEK-ETEVQNEQPVVEEIVQAQEPVKASEHAVEEQPQAHTEAEAETFAADVVEVTEQV 78
TP + +V + EEI + E T AE + VE EQ
Sbjct: 998 TTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQD 1057

Query: 79 AESEKAQPEAEVVAQPEPVVEETPEPVAIEREELPLPEDVNAEAVSP----EEWQAEAET 134
A AQ EV + + V+ + + + E E +E +A+ ET
Sbjct: 1058 ATETTAQ-NREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVET 1116

Query: 135 VEIVEAAE---EEAAKDEITDE---------EPEAQALAAEAAEEAVMVVSPAEEEQPVE 182
+ E + + + K E ++ E + E + + A+ EQP +
Sbjct: 1117 EKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQT---NTTADTEQPAK 1173

Query: 183 EIAQEQEKPTKEGFFARLKRSLLKTKEN 210
E + E+P E S+++ EN
Sbjct: 1174 ETSSNVEQPVTESTTVNTGNSVVENPEN 1201



Score = 49.3 bits (117), Expect = 3e-08
Identities = 35/178 (19%), Positives = 56/178 (31%), Gaps = 7/178 (3%)

Query: 19 EQTPEKETEVQNEQPVVEEIVQAQEPVKASEHAVEEQPQAHTEAEAETFAADVVEVTEQV 78
TP + TE E E + A+E + + A EV +
Sbjct: 1030 PATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSG 1089

Query: 79 AESEKAQPEAEVVAQPEPVVEETPEPVAIEREELPLPEDVNAEAVSPEEWQAEAETVEIV 138
+E+++ Q E E E +E E+ V ++ VSP++ Q+E +
Sbjct: 1090 SETKETQTT----ETKETATVEKEEKAKVETEKTQEVPKVTSQ-VSPKQEQSETVQPQAE 1144

Query: 139 EAAEEEAAK--DEITDEEPEAQALAAEAAEEAVMVVSPAEEEQPVEEIAQEQEKPTKE 194
A E + E + A E + V P E V E P
Sbjct: 1145 PARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENT 1202



Score = 43.1 bits (101), Expect = 3e-06
Identities = 28/156 (17%), Positives = 45/156 (28%), Gaps = 14/156 (8%)

Query: 17 QKEQTPEKETEVQNEQPVVEEIVQAQEPVKASE------HAVEEQPQAHTEAEAETFAAD 70
Q +T E T + E+ VE + P S+ + QPQA E +
Sbjct: 1096 QTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNI 1155

Query: 71 VVEVTEQVAESEKAQPEAEVVAQPEPVVEETPEPVAIEREELPLPEDVNAEAVSPEEWQA 130
++ ++ QP E + E V E+ V + PE+ P
Sbjct: 1156 KEPQSQTNTTADTEQPAKETSSNVEQPVTES-TTVNTGNSVVENPENTTPATTQPTVNSE 1214

Query: 131 EAETVEI-------VEAAEEEAAKDEITDEEPEAQA 159
+ + E A D A
Sbjct: 1215 SSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALC 1250



Score = 37.4 bits (86), Expect = 1e-04
Identities = 25/182 (13%), Positives = 52/182 (28%), Gaps = 11/182 (6%)

Query: 17 QKEQTPEKETEVQNEQPVVEEIVQAQEPVKASEHAVEEQPQAHTEAE-AETFAADVVEVT 75
+E E ++ V+ E+ Q+ K ++ ++ + E A+ EV
Sbjct: 1065 NREVAKEAKSNVKAN-TQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVP 1123

Query: 76 EQVAE-------SEKAQPEAEVVAQPEPVV--EETPEPVAIEREELPLPEDVNAEAVSPE 126
+ ++ SE QP+AE + +P V +E + ++ ++ P
Sbjct: 1124 KVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPV 1183

Query: 127 EWQAEAETVEIVEAAEEEAAKDEITDEEPEAQALAAEAAEEAVMVVSPAEEEQPVEEIAQ 186
T V E + + + P E
Sbjct: 1184 TESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSND 1243

Query: 187 EQ 188

Sbjct: 1244 RS 1245


45  ECP_3568  ECP_3575  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
ECP_3568  0       17       3.796912    major facilitator superfamily transporter
ECP_3569  -1      20       4.327960    hypothetical protein
ECP_3570  -1      24       5.519536    holo-(acyl carrier protein) synthase 2
ECP_3571  0       24       5.389809    nickel-binding periplasmic protein
ECP_3572  2       26       5.903976    nickel transporter permease NikB
ECP_3573  0       24       5.567922    nickel transporter permease NikC
ECP_3574  0       20       5.385698    nickel transporter ATP-binding protein NikD
ECP_3575  0       19       4.416708    nickel transporter ATP-binding protein NikE
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_3568  TCRTETA  53     8e-10    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 53.3 bits (128), Expect = 8e-10
Identities = 80/398 (20%), Positives = 147/398 (36%), Gaps = 32/398 (8%)

Query: 13 LRLNLRIVSIVMFNFASYLTIGLPLAVLPGYVHDVM--GFSAFWAGLVISLQYFATLLSR 70
++ N ++ I+ + IGL + VLPG + D++ G++++L
Sbjct: 1 MKPNRPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACA 60

Query: 71 PHAGRYADLLGPKKIVVFGLCGCFLSGLGYLTAGLTASLPVISLLLLCLGRVILGI-GQS 129
P G +D G + +++ L G + + Y L V L +GR++ GI G +
Sbjct: 61 PVLGALSDRFGRRPVLLVSLAG---AAVDYAIMATAPFLWV-----LYIGRIVAGITGAT 112

Query: 130 FAGTGSTLWGVGVVGSL--HIGRVISWNGIVTYGAMAMGAPLGVVFYHWGGLQALALIIM 187
A G+ + + H G + + G +G +G H A AL +
Sbjct: 113 GAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGL 172

Query: 188 GVALVAILLAIPRPTVK--ASKGKPLPFRAVLGRVWLYGMALALA-----SAGFGVIATF 240
LL + + P + + +A +A V A
Sbjct: 173 NFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAAL 232

Query: 241 ITLFYDAK-GWDGAAFALTLFSCAFVGT---RLLFPNGINRIGGLNVAMICFSVEIIGLL 296
+F + + WD ++L + + + ++ R+G M+ + G +
Sbjct: 233 WVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYI 292

Query: 297 LVGVATMPWMAKIG-VLLAGAGFSLVFPALGVVAVKAVPQQNQGAALATYTVFMDLSLGV 355
L+ AT WMA VLLA G + PAL + + V ++ QG + L+ +
Sbjct: 293 LLAFATRGWMAFPIMVLLASGGIGM--PALQAMLSRQVDEERQGQLQGSLAALTSLT-SI 349

Query: 356 TGPLAGLVMSWAGVPV----IYLAAAGLVAIALLLTWR 389
GPL + A + ++A A L + L R
Sbjct: 350 VGPLLFTAIYAASITTWNGWAWIAGAALYLLCLPALRR 387


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
ECP_3572  BORPETOXINB  28     0.046    Bordetella pertussis toxin B subunit signature.
		>BORPETOXINB#Bordetella pertussis toxin B subunit signature.

Length = 226

Score = 28.1 bits (62), Expect = 0.046
Identities = 21/77 (27%), Positives = 32/77 (41%), Gaps = 10/77 (12%)

Query: 204 GQRHVTWARLRGLSDKQTERRHILRNASLPMITAVGMHIGELIGGTMIIENIFAWPGVG- 262
R +T A LRG D Q RH+ R S+ + G ++G GG +I++ PG
Sbjct: 53 KTRALTVAELRGSGDLQEYLRHVTRGWSIFALYD-GTYLGGEYGG--VIKD--GTPGGAF 107

Query: 263 ----RYAVSAIFNRDYP 275
+ + N P
Sbjct: 108 DLKTTFCIMTTRNTGQP 124


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
ECP_3575  HTHFIS  30     0.008    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 30.2 bits (68), Expect = 0.008
Identities = 10/34 (29%), Positives = 19/34 (55%)

Query: 25 QAVLNNVSLALKSGETVALLGRSGCGKSTLARLL 58
Q + ++ +++ T+ + G SG GK +AR L
Sbjct: 147 QEIYRVLARLMQTDLTLMITGESGTGKELVARAL 180


46  ECP_3685  ECP_3706  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
ECP_3685  0       12       3.993775    cryptic L-xylulose kinase
ECP_3686  0       11       3.786276    3-keto-L-gulonate-6-phosphate decarboxylase
ECP_3687  0       11       3.158722    L-xylulose 5-phosphate 3-epimerase
ECP_3688  0       11       3.028357    L-ribulose-5-phosphate 4-epimerase
ECP_3691  0       11       2.668805    alcohol dehydrogenase
ECP_3692  -3      13       3.221194    selenocysteinyl-tRNA-specific translation
ECP_3693  -3      14       2.275272    selenocysteine synthase
ECP_3694  -2      16       1.348703    glutathione S-transferase
ECP_3695  -2      17       1.180075    hypothetical protein
ECP_3696  -1      20       0.829708    hypothetical protein
ECP_3697  -1      19       -1.019434   PTS system mannitol-specific transporter subunit
ECP_3698  2       20       -1.517098   mannitol-1-phosphate 5-dehydrogenase
ECP_3699  3       19       -0.800766   mannitol repressor protein
ECP_3700  2       19       -0.437121   hypothetical protein
ECP_3701  1       17       0.169472    hypothetical protein
ECP_3702  1       16       0.372723    hypothetical protein
ECP_3703  1       16       1.397654    hemagglutinin/invasin
ECP_3704  -1      14       3.634417    L-lactate permease
ECP_3705  0       16       3.259838    DNA-binding transcriptional repressor LldR
ECP_3706  -1      17       3.084274    L-lactate dehydrogenase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
ECP_3692  TCRTETOQM  59     3e-11    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 59.1 bits (143), Expect = 3e-11
Identities = 44/147 (29%), Positives = 69/147 (46%), Gaps = 18/147 (12%)

Query: 3 IATAGHVDHGKTTLLQAI---TGV------------NADRLPEEKKRGMTIDLGYAYWPQ 47
I HVD GKTTL +++ +G D E++RG+TI G +
Sbjct: 6 IGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITSFQW 65

Query: 48 PDGRVPGFIDVPGHEKFLSNMLAGVGGIDHALLVVACDDGVMAQTREHLAILQLTGNPML 107
+ +V ID PGH FL+ + + +D A+L+++ DGV AQTR L+ G P +
Sbjct: 66 ENTKV-NIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGIPTI 124

Query: 108 TVALTKADRVDEARVDEVERQVKEMLR 134
+ K D+ + V + +KE L
Sbjct: 125 -FFINKIDQNG-IDLSTVYQDIKEKLS 149


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
ECP_3695  RTXTOXIND  64     2e-13    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 64.1 bits (156), Expect = 2e-13
Identities = 56/314 (17%), Positives = 103/314 (32%), Gaps = 82/314 (26%)

Query: 66 ITPQVTGIVTEVTDKNNQLIQKGEVLFKLDPVR------------YQARVD--RLQA--- 108
I P IV E+ K + ++KG+VL KL + QAR++ R Q
Sbjct: 99 IKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSR 158

Query: 109 ------------------------DLMTATHNIK----TLRAQLTEAQANTTQVSAERDR 140
+++ T IK T + Q + + N + AER
Sbjct: 159 SIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLT 218

Query: 141 LFKNYQRY----------LKGSQAAVNPFS---------ERDIDDARQNF---LAQDALV 178
+ RY L + ++ + E +A +Q +
Sbjct: 219 VLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQI 278

Query: 179 KGSVAE----QAQIQSQLDSMVNGE----QSQIVSLRAQLTEAKYNLEQTVIRAPSNGYV 230
+ + + + + + I L +L + + + +VIRAP + V
Sbjct: 279 ESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKV 338

Query: 231 TQVLIR-PGTYAAALPLRPVMVFIPEQKRQIV-AQFRQNSLLRLKPGDDAEVVFNALPGQ 288
Q+ + G +MV +PE V A + + + G +A + A P
Sbjct: 339 QQLKVHTEGGVVT--TAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYT 396

Query: 289 VFH---GKLTSILP 299
+ GK+ +I
Sbjct: 397 RYGYLVGKVKNINL 410


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_3703  PF03895  65     5e-15    Serum resistance protein DsrA.
		>PF03895#Serum resistance protein DsrA.

Length = 79

Score = 65.2 bits (159), Expect = 5e-15
Identities = 19/79 (24%), Positives = 36/79 (45%), Gaps = 2/79 (2%)

Query: 1547 ESKLSGGIASAMAMTGLPQAYTPGASMASIGGGTYNGESAVALGV-SMVSANGRWVYKLQ 1605
+L G+A+ A++ L Q G + S G Y ++A+A+GV S ++ +
Sbjct: 2 SKELQTGLANQSALSMLVQPNGVGKTSVSAAVGGYRDKTALAIGVGSRITDRFTAKAGVA 61

Query: 1606 GSTNSQGEYSAALGAGIQW 1624
+T + G S G ++
Sbjct: 62 FNTYN-GGMSYGASVGYEF 79


47  ECP_3718  ECP_3727  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
ECP_3718  0       19       -5.754121   2-amino-3-ketobutyrate CoA ligase
ECP_3719  1       26       -8.806758   ADP-L-glycero-D-manno-heptose-6-epimerase
ECP_3720  3       32       -10.768935  ADP-heptose--LPS heptosyltransferase
ECP_3721  3       40       -13.989727  ADP-heptose--LPS heptosyltransferase
ECP_3722  3       41       -16.124498  lipid A-core:surface polymer ligase WaaL
ECP_3723  2       34       -14.032958  beta1,3-glucosyltransferase WaaV
ECP_3724  2       28       -11.078781  UDP-galactose:(galactosyl) LPS
ECP_3725  2       27       -9.266567   lipopolysaccharide core biosynthesis protein
ECP_3726  2       22       -6.142719   UDP-galactose:(glucosyl) LPS
ECP_3727  1       18       -3.763209   UDP-glucose:(glucosyl) LPS
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_3719  NUCEPIMERASE  104    7e-28    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 104 bits (260), Expect = 7e-28
Identities = 77/348 (22%), Positives = 127/348 (36%), Gaps = 67/348 (19%)

Query: 2 IIVTGGAGFIGSNIVKALNDKGITDILVVDNLKD--------------GTKFVNLVDLDI 47
+VTG AGFIG ++ K L + G ++ +DNL D +D+
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAG-HQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 48 ADYMDKEDFLIQIMAGEEFGDVEAIFHEGACSSTTEWDGKYMMDNNYQYSK-------EL 100
AD + + + A F E +F + +Y ++N + Y+ +
Sbjct: 62 ADR----EGMTDLFASGHF---ERVFISPHRLAV-----RYSLENPHAYADSNLTGFLNI 109

Query: 101 LHYCLEREIP-FLYASSAATYGGRTSD-FIESREYEKPLNVYGYSKFLFDEYVRQILPEA 158
L C +I LYASS++ YG F + P+++Y +K +
Sbjct: 110 LEGCRHNKIQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLY 169

Query: 159 NSQIVGFRYFNVYGPREGHKGSMASVAFHLNTQLNNGESPKLFEGSENFKRDFVYVGDVA 218
G R+F VYGP + MA F + G+S ++ KRDF Y+ D+A
Sbjct: 170 GLPATGLRFFTVYGPWG--RPDMA--LFKFTKAMLEGKSIDVY-NYGKMKRDFTYIDDIA 224

Query: 219 DVNL------------WFLENGVSG-------IFNLGTGRAESFQAVADATLAY-HKKGQ 258
+ + W +E G ++N+G A + +
Sbjct: 225 EAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAK 284

Query: 259 IEYIPFPDKLKGRYQAFTQADLTNLRAA-GYDKPFKTVAEGVTEYMAW 305
+P G T AD L G+ P TV +GV ++ W
Sbjct: 285 KNMLPLQ---PGDVL-ETSADTKALYEVIGF-TPETTVKDGVKNFVNW 327


48  ECP_3747  ECP_3864  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
ECP_3747  -2      15       3.505468    DNA-directed RNA polymerase subunit omega
ECP_3748  -2      13       3.296752    bifunctional (p)ppGpp synthetase II/
ECP_3749  -1      14       3.087912    tRNA guanosine-2'-O-methyltransferase
ECP_3750  -1      13       2.855099    ATP-dependent DNA helicase RecG
ECP_3751  -1      12       1.736905    sodium/glutamate symport carrier protein
ECP_3752  -2      11       1.930911    purine permease YicE
ECP_3753  -2      12       1.383926    hypothetical protein
ECP_3754  -1      13       0.405166    hypothetical protein
ECP_3755  -1      14       -0.058212   hypothetical protein
ECP_3756  0       19       -3.373269   hypothetical protein
ECP_3757  -2      15       -2.811964   fructose-bisphosphate aldolase class-II
ECP_3758  -1      14       -3.502054   PTS enzyme-II fructose
ECP_3759  -2      15       -4.523014   PTS system, fructose-like-2 IIB component 1
ECP_3760  -2      18       -5.452810   phosphotransferase system (PTS),
ECP_3761  -1      20       -6.086508   transcriptional antiterminator
ECP_3762  1       23       -7.043675   alpha-xylosidase
ECP_3763  4       33       -9.374002   transporter
ECP_3765  7       43       -11.282806  *phage integrase
ECP_3766  9       49       -12.899537  type II 5-cytosoine methyltransferase
ECP_3767  7       38       -9.727350   type II 5-cytosoine methyltransferase
ECP_3768  5       34       -6.972191   hypothetical protein
ECP_3769  6       28       0.039322    hypothetical protein
ECP_3770  6       27       0.655521    hypothetical protein
ECP_3771  5       27       -1.450803   hypothetical protein
ECP_3772  5       27       -1.236874   transposase
ECP_3773  4       27       -1.970819   transposase/IS protein
ECP_3774  4       27       -2.191255   transposase
ECP_3775  6       31       -3.054796   hypothetical protein
ECP_3776  6       36       -4.771201   MarR family transcriptional regulator
ECP_3777  9       38       -4.894004   hypothetical protein
ECP_3778  8       39       -7.123159   hypothetical protein
ECP_3779  8       41       -7.741528   hypothetical protein
ECP_3780  8       43       -8.324610   hypothetical protein
ECP_3781  8       45       -9.703264   hypothetical protein
ECP_3782  8       45       -9.890732   F17-like fimbril adhesin subunit
ECP_3783  8       43       -8.066897   F17-like fimbrial usher
ECP_3784  6       37       -7.134926   F17-like fimbrial chaperone
ECP_3785  5       35       -5.699952   F17 fimbrial protein
ECP_3786  6       33       -4.391614   hypothetical protein
ECP_3787  7       34       -2.721580   hypothetical protein
ECP_3788  8       34       -2.323795   FMN-dependent dehydrogenase
ECP_3789  8       34       -3.044322   hypothetical protein
ECP_3790  9       39       -1.841936   transcription regulator protein
ECP_3791  10      38       -5.490375   hypothetical protein
ECP_3792  9       41       -7.250147   hypothetical protein
ECP_3793  7       35       -7.472864   hypothetical protein
ECP_3794  5       30       -5.643572   hypothetical protein
ECP_3795  5       28       -5.371431   transposase
ECP_3796  6       28       -5.456492   hypothetical protein
ECP_3797  3       26       -2.831652   hypothetical protein
ECP_3798  3       23       -1.047614   hypothetical protein
ECP_3799  4       25       -0.284356   hypothetical protein
ECP_3800  4       30       -2.709877   regulatory protein
ECP_3801  4       29       -1.586639   hypothetical protein
ECP_3802  3       28       -2.408015   hypothetical protein
ECP_3803  2       28       -2.607848   hypothetical protein
ECP_3804  4       29       -2.820636   transposase
ECP_3805  4       37       -8.597448   hypothetical protein
ECP_3806  4       36       -8.557914   transposase
ECP_3807  6       36       -10.831873  transposase
ECP_3808  9       40       -11.735454  hypothetical protein
ECP_3809  10      43       -9.807696   hypothetical protein
ECP_3810  9       44       -10.137170  CS12 fimbrial-like upstream regulatory protein
ECP_3811  9       44       -8.972742   hypothetical protein
ECP_3812  9       47       -9.977335   CS12 fimbria chaperone FasB-like protein
ECP_3813  8       42       -7.426544   CS12 fimbria chaperone protein
ECP_3814  7       40       -6.525228   CS12 fimbria outer membrane usher protein
ECP_3815  4       37       -5.549557   CS12 fimbria chaperone protein
ECP_3816  4       36       -4.437784   CS12 fimbria minor subunit protein
ECP_3817  2       29       -2.437496   FasG-like protein
ECP_3818  4       20       2.788797    transposase
ECP_3819  6       19       0.490038    hypothetical protein
ECP_3820  6       18       -0.531514   hypothetical protein
ECP_3821  5       21       -2.489203   hypothetical protein
ECP_3822  6       28       -6.110518   ABC transporter ATP-binding protein
ECP_3823  6       32       -8.081579   ABC-transporter membrane protein
ECP_3824  6       37       -10.150759  periplasmic binding protein
ECP_3825  6       41       -10.819283  hypothetical protein
ECP_3826  6       38       -9.423150   hemolysin-activating lysine-acyltransferase
ECP_3827  6       36       -8.517628   hemolysin A
ECP_3828  6       34       -7.646229   alpha-hemolysin translocation ATP-binding
ECP_3829  5       30       -4.733825   hemolysin D
ECP_3830  7       32       -4.308276   urea transporter
ECP_3831  8       30       -3.058831   phosphoadenosine phosphosulfate reductase
ECP_3832  6       31       -4.465577   ParB-like nuclease
ECP_3833  4       31       -4.824185   transposase
ECP_3834  4       32       -4.540232   transposase
ECP_3835  4       29       -3.243482   AraC family transcriptional regulator
ECP_3836  4       27       -0.145527   hypothetical protein
ECP_3837  4       26       -0.292027   transposase
ECP_3838  4       28       1.844619    reverse transcriptase
ECP_3839  4       27       3.489320    hypothetical protein
ECP_3840  4       27       3.365473    transposase
ECP_3841  4       29       2.592674    transposase
ECP_3842  3       26       2.410498    transposase/IS protein
ECP_3843  2       27       3.399888    transposase
ECP_3844  2       31       -1.267410   IS orf
ECP_3845  3       27       -1.133697   transposase
ECP_3846  4       27       -1.378044   hypothetical protein
ECP_3847  3       25       -1.176962   hypothetical protein
ECP_3848  3       25       -0.695842   hypothetical protein
ECP_3849  3       26       0.445268    transposase
ECP_3850  6       26       1.106359    hypothetical protein
ECP_3851  6       27       1.816235    hypothetical protein
ECP_3852  5       29       2.537851    hypothetical protein
ECP_3853  6       32       4.409254    hypothetical protein
ECP_3854  8       30       4.414489    hypothetical protein
ECP_3855  7       26       3.817066    antirestriction protein
ECP_3856  6       27       3.376715    radC-like protein
ECP_3857  6       24       0.593356    hypothetical protein
ECP_3858  5       24       -0.747867   hypothetical protein
ECP_3859  2       21       -4.219268   hypothetical protein
ECP_3860  0       20       -4.467811   hypothetical protein
ECP_3861  0       19       -5.387972   hypothetical protein
ECP_3862  -1      19       -5.892458   hypothetical protein
ECP_3863  0       20       -4.174862   hypothetical protein
ECP_3864  0       20       -3.050088   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_3750  SECA  38     1e-04    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 38.3 bits (89), Expect = 1e-04
Identities = 25/67 (37%), Positives = 34/67 (50%), Gaps = 7/67 (10%)

Query: 291 MRLVQGDV-----GSGKTLVAALAA-LRAIAHGKQVALMAPTELLAEQHANNFRNWFAPL 344
M L + + G GKTL A L A L A+ GK V ++ + LA++ A N R F L
Sbjct: 92 MVLNERCIAEMRTGEGKTLTATLPAYLNALT-GKGVHVVTVNDYLAQRDAENNRPLFEFL 150

Query: 345 GIEVGWL 351
G+ VG
Sbjct: 151 GLTVGIN 157


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_3761  PF08280  34     0.001    M protein trans-acting positive regulator
		>PF08280#M protein trans-acting positive regulator

Length = 530

Score = 34.1 bits (78), Expect = 0.001
Identities = 79/491 (16%), Positives = 168/491 (34%), Gaps = 73/491 (14%)

Query: 7 RQNRLLRFLLPRREYTTIVTIAGYLNVSEKTIQRDLRLLEQWL-GQWRINVEKRAGAGVM 65
+ +L+ + I +A ++ + L + + ++KR M
Sbjct: 45 SKCQLVVLFF-KTSSLPITEVAEKTGLTFLQLNHYCEELNAFFPDSLSMTIQKR-----M 98

Query: 66 LSAENIADLLHLDHLLVAECEEIDGVMNNARRVKIASQLLSETPNETSISKLSERYFISG 125
+ H ++ + + ++ +++ + L+ + ++ + +F+S
Sbjct: 99 I-------SCQFTHP--SKETYLYQLYASSNVLQLLAFLIKNGSHSRPLTDFARSHFLSN 149

Query: 126 ASIVNDLRVIESWLAPLGLSLIRSPSGTHIEGSEGQVRQAMALLINGIINHNEPQGVVYS 185
+S + L L L S I G E ++R +ALL G+
Sbjct: 150 SSAYRMREALIPLLRNFELKL----SKNKIVGEEYRIRYLIALL-------YSKFGIKVY 198

Query: 186 RLDPGSYKALVHYFGEEEVLFVQSLLLDMENELSWSLGEPYYVNIFTHILIMMYRNTHGN 245
L K ++H F L S L LS E + F IL+ + H
Sbjct: 199 DLTQQD-KNIIHSF-----LSHSSTHLKTSPWLS----ESFS---FYDILLALSWKRHQF 245

Query: 246 ALSREEDQTRQYDENIF---NVASQMIHKIEQRIAHTLPDDEVWFIYQ-YIISSGVAIDG 301
+++ + + Q + +F ++ IE ++ ++Y YI ++
Sbjct: 246 SVTIPQTRIFQQLKKLFVYDSLKKSSRDIIETYCQLNFSAGDLDYLYLIYITANNSFASL 305

Query: 302 Q---KDVSIISHMQASNEA-RLITWRLITVFSDIVD---------CDFSEDSALYDGLLV 348
Q + + + N+ RL+ +IT+ ++ + FS+ S L++ L
Sbjct: 306 QWTPEHIRQCCQLFEENDTFRLLLNPIITLLPNLKEQKASLVKALMFFSK-SFLFN--LQ 362

Query: 349 HIKPLINRLNYRIHIRNPLLEDIKAELADVWRLTQYVVNQVFKTWGENAVSEDEVGYLTV 408
H P N + N L + + W + K G+ ++
Sbjct: 363 HFIPETNLFVSPYYKGNQKLYTSLKLIVEEW---------MAKLPGKRYLNHKHFHLFCH 413

Query: 409 HFQAAMERQIARKRVLLVCSTGIGTSHLLKSRILRAFPEWTI---VDVISAANLSQVLPD 465
+ + + V+ V S I +HLL R F + +I + N+ Q+
Sbjct: 414 YVEQILRNIQPPLVVVFVASNFI-NAHLLTDSFPRYFSDKSIDFHSYYLLQDNVYQIPDL 472

Query: 466 NIELIISTINL 476
+L+I+ L
Sbjct: 473 KPDLVITHSQL 483


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_3774  HTHTETR  28     0.042    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 28.1 bits (62), Expect = 0.042
Identities = 12/106 (11%), Positives = 31/106 (29%), Gaps = 14/106 (13%)

Query: 16 QGMSSRAIARELGISRNTVKRYLQAKSEPPKYTPRPAVASLLDEYRDYIRQRIAD----- 70
S IA+ G++R + + + KS+ + + + I + +
Sbjct: 30 SSTSLGEIAKAAGVTRGAIYWHFKDKSD--------LFSEIWELSESNIGELELEYQAKF 81

Query: 71 -AHPYKIPATVIAREIRDQGYRGGMTILRAFIRSLSVPQEQEPAVR 115
P + ++ + +L I + V+
Sbjct: 82 PGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQ 127


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
ECP_3780  OMADHESIN  28     0.011    Yersinia outer membrane adhesin signature.
		>OMADHESIN#Yersinia outer membrane adhesin signature.

Length = 455

Score = 27.9 bits (61), Expect = 0.011
Identities = 20/52 (38%), Positives = 30/52 (57%), Gaps = 2/52 (3%)

Query: 21 GMALSSWSASDATGAVTVGVVAK--GTHQNSMAQGEFSCTTRENEVYIGYDS 70
G+A+ S +DA +V +G + H S+A G+ S T REN V IG++S
Sbjct: 140 GVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDRENSVSIGHES 191


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_3783  PF00577  740    0.0      Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 740 bits (1912), Expect = 0.0
Identities = 259/872 (29%), Positives = 420/872 (48%), Gaps = 47/872 (5%)

Query: 5 NLSCLIYCRCSLLLFAALGLTVTNHSF----AAEEAEFDSEFLHLDKGINAIDIRRFSHG 60
N CL + L F + ++ E F+ FL D D+ RF +G
Sbjct: 12 NTQCLHIRKHRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADD-PQAVADLSRFENG 70

Query: 61 NPVPEGRYYSDIYVNNVWKGKADLQYLRTANTGAPTLCLTPELLS-----LIDLVKDTMS 115
+P G Y DIY+NN + D+ + + CLT L+ + +
Sbjct: 71 QELPPGTYRVDIYLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLL 130

Query: 116 GNTSCFPASTGLSSASINFDLSTLRLNIEIPQALLNTRPRGYISPSQWQSGVPAAFINYD 175
+ +C P ++ + A+ D+ RLN+ IPQA ++ R RGYI P W G+ A +NY+
Sbjct: 131 ADDACVPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYN 190

Query: 176 ANYYQY-SSSGTSNEQTYLGLKAGFNLWGWALRHRGSESWNNSYPAG-----YQNIETSI 229
+ + G ++ YL L++G N+ W LR + S+N+S + +Q+I T +
Sbjct: 191 FSGNSVQNRIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWL 250

Query: 230 MHDLAPLRAQFTLGDFYTNGELMDSLSLRGVRLASDERMLPGSLRGYAPAVRGIANSNAK 289
D+ PLR++ TLGD YT G++ D ++ RG +LASD+ MLP S RG+AP + GIA A+
Sbjct: 251 ERDIIPLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQ 310

Query: 290 VTIYQNAHILYETTVPAGPFVINDLYPSGYAGDLIVKITESNGQTRMFTVPFAAVAQLIR 349
VTI QN + +Y +TVP GPF IND+Y +G +GDL V I E++G T++FTVP+++V L R
Sbjct: 311 VTIKQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQR 370

Query: 350 PGFSRWQMSVGKYR-YANKTYNDLIAQGTYQYGLTNDITLNSGLTTASGYTAGLAGLAFN 408
G +R+ ++ G+YR + Q T +GL T+ G A Y A G+ N
Sbjct: 371 EGHTRYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKN 430

Query: 409 T-PLGAIASDITLSRTAFRYSGVTRKGYSLHSSYSINIPASNTNITLAAYRYSSKDFYHL 467
LGA++ D+T + + G S+ Y+ ++ S TNI L YRYS+ +++
Sbjct: 431 MGALGALSVDMTQANSTLP-DDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNF 489

Query: 468 KDALSANHNAF-------IDDVSVKSTAFY----RPRNQFQISINQELGEKWGGMYLTGT 516
D + N + + V K T +Y R + Q+++ Q+LG +YL+G+
Sbjct: 490 ADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGR-TSTLYLSGS 548

Query: 517 TYNYWGHKGSRNEYQMGYSNFWKQLGYQIGLSQSRDNEQQRRDDRFYINFTLPLGE---- 572
YWG ++Q G + ++ + + + S +++ Q+ RD +N +P
Sbjct: 549 HQTYWGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRS 608

Query: 573 ----SVQSPVFSTVLNYSKEEKNSIQTSISGTGGEDNQFSYGLS-----GNSQENGPSGY 623
+ S +++ + + + GT EDN SY + G +G +GY
Sbjct: 609 DSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGY 668

Query: 624 AMNGGYRSPYVNITTTVGHDTQNNNQRSFGASGAVVAHPYGVTLSNDLSDTFAIIHAEGA 683
A YR Y N H + Q +G SG V+AH GVTL L+DT ++ A GA
Sbjct: 669 A-TLNYRGGYGNANIGYSHS-DDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGA 726

Query: 684 QGAAINNASGSRLDFWGNGIVPYVTPYEKNQISIDPSNLDLNVELSATEQEIIPRANSAT 743
+ A + N +G R D+ G ++PY T Y +N++++D + L NV+L ++P +
Sbjct: 727 KDAKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIV 786

Query: 744 LVKFDTKTGRSLLFDIRMSTGNPPPMASEVLDEHGQLAGYVAQAGKVFTRGLPEKGHLSV 803
+F + G LL + + P P + V E Q +G VA G+V+ G+P G + V
Sbjct: 787 RAEFKARVGIKLLMTLTHN-NKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGKVQV 845

Query: 804 VWGPDNKDRCSFVYHVAHNKDDMQSQLVPVLC 835
WG + C Y + + C
Sbjct: 846 KWGEEENAHCVANYQLPPESQQQLLTQLSAEC 877


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_3788  PHAGEIV  30     0.021    Gene IV protein signature.
		>PHAGEIV#Gene IV protein signature.

Length = 426

Score = 29.9 bits (67), Expect = 0.021
Identities = 18/88 (20%), Positives = 35/88 (39%), Gaps = 10/88 (11%)

Query: 253 FNQKVELTPADI-EFVK---KITGLPVIVKGILRGEDAVVAIDAGADAI------QVSNH 302
F Q +E+ + + +FV K TG VIV ++G V + D + + + +
Sbjct: 20 FAQVIEMNNSSLRDFVTWYSKQTGESVIVSPDVKGTVTVYSSDVKPENLRDFFISVLRAN 79

Query: 303 GGRQIDGVPSAISQLQEVAARVGHKVPV 330
+ +PS I + ++P
Sbjct: 80 NFDMVGSIPSIIQKYNPNNQDYIDELPS 107


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_3790  HTHTETR  79     1e-19    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 78.9 bits (194), Expect = 1e-19
Identities = 38/162 (23%), Positives = 59/162 (36%), Gaps = 11/162 (6%)

Query: 77 ARKTRSCSPEKTARTRQQIARAALEEFSAQGFARASISNISKRAGVAKGTVYNYFPTKEL 136
ARKT+ ++ TRQ I AL FS QG + S+ I+K AGV +G +Y +F K
Sbjct: 2 ARKTK----QEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSD 57

Query: 137 LFEAVLKE----FIATVRTELESSPRRNGETVKAYLLRVMLPAVRKIDDASTGRARIAHL 192
LF + + P ++ L+ +L + + I H
Sbjct: 58 LFSEIWELSESNIGELELEYQAKFPGDPLSVLREILIH-VLESTVTEERRRLLMEIIFHK 116

Query: 193 VMTEGSRFPVIAQAYLREIHQPLQQAMTQLIQEAASAGELKA 234
G V R + + Q ++ A L A
Sbjct: 117 CEFVGEMAVVQQAQ--RNLCLESYDRIEQTLKHCIEAKMLPA 156


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_3814  PF00577  658    0.0      Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 658 bits (1698), Expect = 0.0
Identities = 242/854 (28%), Positives = 419/854 (49%), Gaps = 59/854 (6%)

Query: 20 AEDYFDPSLLATDIIGEGNIDLSAFSRPGGGMEGEQEVAIYVNDEFY-SRNTLFFKNTLD 78
AE YF+P LA D + DLS F G V IY+N+ + +R+ F +
Sbjct: 45 AELYFNPRFLADD--PQAVADLSRFENGQELPPGTYRVDIYLNNGYMATRDVTFNTGDSE 102

Query: 79 KGLLPEFTP------GFFDELLSGDFLVSEEDKTISSSDFLKKVPYSDINFNQGMSRVNV 132
+G++P T G +SG L++++ + + + G R+N+
Sbjct: 103 QGIVPCLTRAQLASMGLNTASVSGMNLLADDACV----PLTSMIHDATAQLDVGQQRLNL 158

Query: 133 SIPQAYLGDGAKLISSPDTWEYGGPAFLLDYNISGNRNDS-GNYDSRSLYISSQMGVNLM 191
+IPQA++ + A+ P+ W+ G A LL+YN SGN + +S Y++ Q G+N+
Sbjct: 159 TIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQNRIGGNSHYAYLNLQSGLNIG 218

Query: 192 KWRLRTSSSYSNYKTNSVWGGARSEQNSFYNTYAERDISSLRAILRLGEVSTAGLILDSV 251
WRLR ++++S ++S G Q+ NT+ ERDI LR+ L LG+ T G I D +
Sbjct: 219 AWRLRDNTTWSYNSSDSSSGSKNKWQHI--NTWLERDIIPLRSRLTLGDGYTQGDIFDGI 276

Query: 252 PFRGMKLSSSDDMLGMRLRNYTPTVRGMASSQAVVTITQNGRQVYQTNVPAGPFELNDFY 311
FRG +L+S D+ML R + P + G+A A VTI QNG +Y + VP GPF +ND Y
Sbjct: 277 NFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDIYNSTVPPGPFTINDIY 336

Query: 312 LSGYSGDMLVTVREADGSEHSFLQPYSTLPEMKREGVSGFEVSVGRYDNNGAEHYYDAES 371
+G SGD+ VT++EADGS F PYS++P ++REG + + ++ G Y + +
Sbjct: 337 AAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSITAGEYRSGN--AQQEKPR 394

Query: 372 FVYGNWSRGFARGVTFFAETLQAEKYQSLGGGSTLSLGRLGAASADISLSRADKYGDIR- 430
F G G T + T A++Y++ G ++G LGA S D++ + + D +
Sbjct: 395 FFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGALSVDMTQANSTLPDDSQH 454

Query: 431 IGQSYGFKYSKSQIETGTTVTLATYRYSTENFYTFRDFV------------------SKT 472
GQS F Y+KS E+GT + L YRYST ++ F D
Sbjct: 455 DGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGYNIETQDGVIQVKPK 514

Query: 473 DTARYIWENKLKSRMTFSLSQSLGEYGYLSANASQQDYWNSREVSRNYSLTHSFSWNDIY 532
T Y + ++ +++Q LG L + S Q YW + V + + ++ DI
Sbjct: 515 FTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQTYWGTSNVDEQFQAGLNTAFEDIN 574

Query: 533 FSTTLSMDDQRGRETGHLSNKQAGIYASVPLSKLLPRTDPTS---SSLTWSTSHADH-KV 588
++ + S+ ++ ++ + ++P S L + +S ++S SH + ++
Sbjct: 575 WTLSYSLTKNAWQKG---RDQMLALNVNIPFSHWLRSDSKSQWRHASASYSMSHDLNGRM 631

Query: 589 RNSVTLDGKVPESD-VRYRVGGSW---GNGTTEGSRMASVSWTGDHASTSLGYTRVGKYR 644
N + G + E + + Y V + G+G + + A++++ G + + ++GY+ +
Sbjct: 632 TNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYGNANIGYSHSDDIK 691

Query: 645 TLDYSMSGAAVMYPWGIAVGNSSVTGDGAIVVETPGAKGVR--TSTGYKTSWLGTALISS 702
L Y +SG + + G+ +G D ++V+ PGAK + TG +T W G A++
Sbjct: 692 QLYYGVSGGVLAHANGVTLGQP--LNDTVVLVKAPGAKDAKVENQTGVRTDWRGYAVLPY 749

Query: 703 PQKYTENRINLYPDGLPSDTVLGETSKTAVPAKGAVVVLDYTVFRGSQVVFTLRQTDGNP 762
+Y ENR+ L + L + L VP +GA+V ++ G +++ TL + P
Sbjct: 750 ATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKARVGIKLLMTLTH-NNKP 808

Query: 763 LPFGTVITLDGVSRGKENSGIVGEEGRVYMAGIPEKGTLTASWGL--NKTCSIPFRINQH 820
LPFG ++T + ++SGIV + G+VY++G+P G + WG N C +++
Sbjct: 809 LPFGAMVTSE----SSQSSGIVADNGQVYLSGMPLAGKVQVKWGEEENAHCVANYQLPPE 864

Query: 821 KAEAVIREVQGVCR 834
+ ++ ++ CR
Sbjct: 865 SQQQLLTQLSAECR 878


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits      Score  E-value  Comments
ECP_3824  adhesinb  237    3e-79    Adhesin B signature.
		>adhesinb#Adhesin B signature.

Length = 310

Score = 237 bits (605), Expect = 3e-79
Identities = 87/294 (29%), Positives = 157/294 (53%), Gaps = 7/294 (2%)

Query: 5 ILVVALSSLLVSPLVIAKELNVVASFSVLGDMVSQIGGPYVHVTDLVQPDGDPHEFEPSP 64
+ + A SS S + +LNVVA+ S++ D+ I G +++ +V DPHE+EP P
Sbjct: 15 VGLAACSSQKSSTETGSSKLNVVATNSIIADITKNIAGDKINLHSIVPVGQDPHEYEPLP 74

Query: 65 KDSKTLAQADVVFVNGLGLE----GWLDRLMKASGYRGE--VITASNGIDTLKMKEDGTT 118
+D K +QAD++F NG+ LE W +L++ + + S G+D + ++
Sbjct: 75 EDVKKTSQADLIFYNGINLETGGNAWFTKLVENAKKKENKDYYAVSEGVDVIYLEGQSEK 134

Query: 119 IT-DPHAWNSMKNGIVYAHNIVNGLSKADPEHASDYRKQGDSYIQQLQQLDNYATQTFAA 177
DPHAW +++NGI+YA NI LS+ DP + Y K +Y+++L LD A + F
Sbjct: 135 GKEDPHAWLNLENGIIYAQNIAKRLSEKDPANKETYEKNLKAYVEKLSALDKEAKEKFNN 194

Query: 178 IPREKRKVLTSHDAFGYFAAAYGVRFLSPVGYSTESEASSKNVAKLINQIKREHVKLYFI 237
IP EK+ ++TS F YF+ AY V +TE E + + L+ ++++ V F+
Sbjct: 195 IPGEKKMIVTSEGCFKYFSKAYNVPSAYIWEINTEEEGTPDQIKTLVEKLRKTKVPSLFV 254

Query: 238 ENQTDPRLVKQIANASGAQAGGELYPEALTDSSGLAATYTAAFKHNVDTLAAGM 291
E+ D R +K ++ + +++ +++ + +Y + K+N++ +A G+
Sbjct: 255 ESSVDDRPMKTVSKDTNIPIYAKIFTDSVAEKGEEGDSYYSMMKYNLEKIAEGL 308


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
ECP_3826  RTXTOXINC  316    e-114    Gram-negative bacterial RTX toxin-activating protein C...
		>RTXTOXINC#Gram-negative bacterial RTX toxin-activating protein C signature.

Length = 170

Score = 316 bits (811), Expect = e-114
Identities = 163/170 (95%), Positives = 166/170 (97%)

Query: 1 MNRNNPLEVLGHVSWLWASSPLHRNWPVSLFAINVLPAIRANQYALLTRDNYPVAYCSWA 60
MN N PLE+LGHVSWLWASSPLHRNWPVSLFAINVLPAI+ANQY LLTRD+YPVAYCSWA
Sbjct: 1 MNINKPLEILGHVSWLWASSPLHRNWPVSLFAINVLPAIQANQYVLLTRDDYPVAYCSWA 60

Query: 61 NLSLENEIKYLNDVTSLVAEDWTSGDRKWFIDWIAPFGDNGALYKYMRKKFPDELFRAIR 120
NLSLENEIKYLNDVTSLVAEDWTSGDRKWFIDWIAPFGDNGALYKYMRKKFPDELFRAIR
Sbjct: 61 NLSLENEIKYLNDVTSLVAEDWTSGDRKWFIDWIAPFGDNGALYKYMRKKFPDELFRAIR 120

Query: 121 VDPKTHVGKVSEFHGGKIDKQLANKIFKQYHHELITEVKNKSDFNFSLTG 170
VDPKTHVGKVSEFHGGKIDKQLANKIFKQYHHELITEVK KSDFNFSLTG
Sbjct: 121 VDPKTHVGKVSEFHGGKIDKQLANKIFKQYHHELITEVKRKSDFNFSLTG 170


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
ECP_3827  RTXTOXINA  1477   0.0      Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 1477 bits (3824), Expect = 0.0
Identities = 978/1024 (95%), Positives = 992/1024 (96%)

Query: 1 MPTITTAQIKSTLQSAKQSAANKLHSAGQSTKDALKKAAEQTRNAGNRLILLIPKDYKGQ 60
M TITTAQIKSTLQSAKQSAANKLHSAGQSTKDALKKAAEQTRNAGNRLILLIPKDYKGQ
Sbjct: 1 MTTITTAQIKSTLQSAKQSAANKLHSAGQSTKDALKKAAEQTRNAGNRLILLIPKDYKGQ 60

Query: 61 GSSLNDLVRTADELGIEVQYDEKNGTAITKQVFGTAEKLIGLTERGVTIFAPQLDKLLQK 120
GSSLNDLVRTADELGIEVQYDEKNGTAITKQVFGTAEKLIGLTERGVTIFAPQLDKLLQK
Sbjct: 61 GSSLNDLVRTADELGIEVQYDEKNGTAITKQVFGTAEKLIGLTERGVTIFAPQLDKLLQK 120

Query: 121 YQKAGNKLGGSAENIGDNLGKAGSVLSTFQNFLGTALSSMKIDELIKKQKSGSNVSSSEL 180
YQKAGN LGG AENIGDNLGKAG +LSTFQNFLGTALSSMKIDELIKKQKSG NVSSSEL
Sbjct: 121 YQKAGNILGGGAENIGDNLGKAGGILSTFQNFLGTALSSMKIDELIKKQKSGGNVSSSEL 180

Query: 181 AKASIELINQLVDTAASINNNVNSFSQQLNKLGSVLSNTKHLNGVGNKLQNLPNLDNIGA 240
AKASIELINQLVDT AS+NNNVNSFSQQLN LGSVLSNTKHLNGVGNKLQNLPNLDNIGA
Sbjct: 181 AKASIELINQLVDTVASLNNNVNSFSQQLNTLGSVLSNTKHLNGVGNKLQNLPNLDNIGA 240

Query: 241 GLDTVSGILSVISASFILSNADADTGTKAAAGVELTTKVLGNVGKGISQYIIAQRAAQGL 300
GLDTVSGILS ISASFILSNADADT TKAAAGVELTTKVLGNVGKGISQYIIAQRAAQGL
Sbjct: 241 GLDTVSGILSAISASFILSNADADTRTKAAAGVELTTKVLGNVGKGISQYIIAQRAAQGL 300

Query: 301 STSAAAAGLIASAVTLAISPLSFLSIADKFKRANKIEEYSQRFKKLGYDGDSLLAAFHKE 360
STSAAAAGLIASAVTLAISPLSFLSIADKFKRANKIEEYSQRFKKLGYDGDSLLAAFHKE
Sbjct: 301 STSAAAAGLIASAVTLAISPLSFLSIADKFKRANKIEEYSQRFKKLGYDGDSLLAAFHKE 360

Query: 361 TGAIDASLTTISTVLASVSSGISAAATTSLVGAPVSALVGAVTGIISGILEASKQAMFEH 420
TGAIDASLTTISTVLASVSSGISAAATTSLVGAPVSALVGAVTGIISGILEASKQAMFEH
Sbjct: 361 TGAIDASLTTISTVLASVSSGISAAATTSLVGAPVSALVGAVTGIISGILEASKQAMFEH 420

Query: 421 VASKMADVIAEWEKKHGKNYFENGYDARHAAFLEDNFKILSQYNKEYSVERSVLITQQHW 480
VASKMADVIAEWEKKHGKNYFENGYDARHAAFLEDNFKILSQYNKEYSVERSVLITQQHW
Sbjct: 421 VASKMADVIAEWEKKHGKNYFENGYDARHAAFLEDNFKILSQYNKEYSVERSVLITQQHW 480

Query: 481 DTLIGELAGVTRNGDKTLSGKSYIDYYEEGKRLEKKPDEFQKQVFDPLKGNIDLSDSKSS 540
DTLIGELAGVTRNGDKTLSGKSYIDYYEEGKRLEKK DEFQKQVFDPLKGNIDLSDSKSS
Sbjct: 481 DTLIGELAGVTRNGDKTLSGKSYIDYYEEGKRLEKKXDEFQKQVFDPLKGNIDLSDSKSS 540

Query: 541 TLLKFVTPLLTPGEEIRERRQSGKYEYITELLVKGVDKWTVKGVQDKGSVYDYSNLIQHA 600
TLLKFVTPLLTPGEEIRERRQSGKYEYITELLVKGVDKWTVKGVQDKG+VYDYSNLIQHA
Sbjct: 541 TLLKFVTPLLTPGEEIRERRQSGKYEYITELLVKGVDKWTVKGVQDKGAVYDYSNLIQHA 600

Query: 601 SVGNNQYREIRIESHLGDGDDKVFLAAGSANIYAGKGHDVVYYDKTDTGYLTIDGTKATE 660
SVGNNQYREIRIESHLGDGDDKVFL+AGSANIYAGKGHDVVYYDKTDTGYLTIDGTKATE
Sbjct: 601 SVGNNQYREIRIESHLGDGDDKVFLSAGSANIYAGKGHDVVYYDKTDTGYLTIDGTKATE 660

Query: 661 AGNYTVTRVLGGDVKVLQEVVKEQEVSVGKRTEKTQYRSYEFTHINGTDLTETDNLYSVE 720
AGNYTVTRVLGGDVKVLQEVVKEQEVSVGKRTEKTQYRSYEFTHING +LTETDNLYSVE
Sbjct: 661 AGNYTVTRVLGGDVKVLQEVVKEQEVSVGKRTEKTQYRSYEFTHINGKNLTETDNLYSVE 720

Query: 721 ELIGTNRADKFFGSKFTDIFHGADGDDHIEGNDGNDRLYGDKGNDTLRGGNGDDQLYGGD 780
ELIGT RADKFFGSKFTDIFHGADGDD IEGNDGNDRLYGDKGNDTL GGNGDDQLYGGD
Sbjct: 721 ELIGTTRADKFFGSKFTDIFHGADGDDLIEGNDGNDRLYGDKGNDTLSGGNGDDQLYGGD 780

Query: 781 GNDKLTGGVGNNYLNGGDGDDELQVQGNSLAKNVLSGGKGNDKLYGSEGADLLDGGEGND 840
GNDKL G GNNYLNGGDGDDE QVQGNSLAKNVL GGKGNDKLYGSEGADLLDGGEG+D
Sbjct: 781 GNDKLIGVAGNNYLNGGDGDDEFQVQGNSLAKNVLFGGKGNDKLYGSEGADLLDGGEGDD 840

Query: 841 LLKGGYGNDIYRYLSGYGHHIIDDDGGKDDKLSLADIDFRDVAFKREGNDLIMYKAEGNV 900
LLKGGYGNDIYRYLSGYGHHIIDDDGGK+DKLSLADIDFRDVAFKREGNDLIMYK EGNV
Sbjct: 841 LLKGGYGNDIYRYLSGYGHHIIDDDGGKEDKLSLADIDFRDVAFKREGNDLIMYKGEGNV 900

Query: 901 LSIGHKNGITFRNWFEKESGDISNHQIEQIFDKDGRVITPDSLKKAFEYQQSNNQANYVY 960
LSIGHKNGITFRNWFEKESGDISNH+IEQIFDK GR+ITPDSLKKA EYQQ NN+A+YVY
Sbjct: 901 LSIGHKNGITFRNWFEKESGDISNHEIEQIFDKSGRIITPDSLKKALEYQQRNNKASYVY 960

Query: 961 GEYASTYADLDNLNPLINEISKIISAAGNFDVKEERSAASLLQLSGNASDFSYGRNSITL 1020
G A Y +LNPLINEISKIISAAG+FDVKEER+AASLLQLSGNASDFSYGRNSITL
Sbjct: 961 GNDALAYGSQGDLNPLINEISKIISAAGSFDVKEERTAASLLQLSGNASDFSYGRNSITL 1020

Query: 1021 TASA 1024
T SA
Sbjct: 1021 TTSA 1024


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
ECP_3829  RTXTOXIND  601    0.0      Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 601 bits (1551), Expect = 0.0
Identities = 462/478 (96%), Positives = 468/478 (97%)

Query: 1 MKTWLMGFSEFLLRYKLVWSETWKIRKQLDTPVREKDENEFLPAHLELIETPVSRRPRLV 60
MKTWLMGFSEFLLRYKLVWSETWKIRKQLDTPVREKDENEFLPAHLELIETPVSRRPRLV
Sbjct: 1 MKTWLMGFSEFLLRYKLVWSETWKIRKQLDTPVREKDENEFLPAHLELIETPVSRRPRLV 60

Query: 61 AYFIMGFLVIAFILSVLGQVEIVATANGKLTLSGRSKEIKPIENSIVKEIIVKEGESVRK 120
AYFIMGFLVIAFILSVLGQVEIVATANGKLT SGRSKEIKPIENSIVKEIIVKEGESVRK
Sbjct: 61 AYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRK 120

Query: 121 GDVLLKLTALGAEADTLKTQSSLLQTRLEQIRYQILSRSIELNKLPELKLPDEPYFQNVS 180
GDVLLKLTALGAEADTLKTQSSLLQ RLEQ RYQILSRSIELNKLPELKLPDEPYFQNVS
Sbjct: 121 GDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVS 180

Query: 181 EEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTILARINRYENLSRVEKSRLDDF 240
EEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLT+LARINRYENLSRVEKSRLDDF
Sbjct: 181 EEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDF 240

Query: 241 RSLLHKQAIAKHAVLEQENKYVEAANELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEI 300
SLLHKQAIAKHAVLEQENKYVEA NELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEI
Sbjct: 241 SSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEI 300

Query: 301 LDKLRQTTDSIELLTLELEKNEERQQASVIRAPVSGKVQQLKVHTEGGVVTTAETLMVIV 360
LDKLRQTTD+I LLTLEL KNEERQQASVIRAPVS KVQQLKVHTEGGVVTTAETLMVIV
Sbjct: 301 LDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIV 360

Query: 361 PEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQKLGL 420
PEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQ+LGL
Sbjct: 361 PEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGL 420

Query: 421 VFNVIVSVEENDLSTGNKHIPLSSGMAVTAEIKTGMRSVISYLLSPLEESVTESLHER 478
VFNVI+S+EEN LSTGNK+IPLSSGMAVTAEIKTGMRSVISYLLSPLEESVTESL ER
Sbjct: 421 VFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGMRSVISYLLSPLEESVTESLRER 478


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_3841  HTHTETR  28     0.044    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 28.1 bits (62), Expect = 0.044
Identities = 12/106 (11%), Positives = 31/106 (29%), Gaps = 14/106 (13%)

Query: 16 QGMSSRAIARELGISRNTVKRYLQAKSEPPKYTPRPAVASLLDEYRDYIRQRIAD----- 70
S IA+ G++R + + + KS+ + + + I + +
Sbjct: 30 SSTSLGEIAKAAGVTRGAIYWHFKDKSD--------LFSEIWELSESNIGELELEYQAKF 81

Query: 71 -AHPYKIPATVIAREIRDQGYRGGMTILRAFIRSLSVPQEQEPAVR 115
P + ++ + +L I + V+
Sbjct: 82 PGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQ 127


49  ECP_3915  ECP_3934  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
ECP_3915  1       12       -4.155642   6-phosphogluconate phosphatase
ECP_3916  1       14       -3.722778   inner membrane protein
ECP_3917  0       14       -3.952440   6-phosphogluconolactonase
ECP_3918  0       12       -3.682452   hypothetical protein
ECP_3919  -2      12       -3.482768   outer membrane protein YieC
ECP_3920  -2      11       -1.347659   6-phospho-beta-glucosidase
ECP_3921  -2      14       -0.122018   beta-glucoside-specific PTS system components
ECP_3922  -3      21       0.407084    transcriptional antiterminator BglG
ECP_3923  -1      30       1.957805    transcriptional regulator PhoU
ECP_3924  -2      28       2.031947    phosphate transporter subunit
ECP_3925  -2      27       2.317234    phosphate transporter permease subunit PtsA
ECP_3926  1       31       2.130297    phosphate transporter permease subunit PstC
ECP_3927  2       32       1.963869    phosphate ABC transporter substrate-binding
ECP_3928  3       36       2.180608    glucosamine--fructose-6-phosphate
ECP_3929  4       35       2.130269    bifunctional N-acetylglucosamine-1-phosphate
ECP_3930  5       40       2.203654    ATP synthase F0F1 subunit epsilon
ECP_3931  4       42       2.098317    ATP synthase F0F1 subunit beta
ECP_3932  3       35       1.068530    ATP synthase F0F1 subunit gamma
ECP_3933  4       35       0.785204    ATP synthase F0F1 subunit alpha
ECP_3934  2       21       -0.477462   ATP synthase F0F1 subunit delta
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
ECP_3929  RTXTOXINA  29     0.048    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 29.2 bits (65), Expect = 0.048
Identities = 23/80 (28%), Positives = 31/80 (38%), Gaps = 10/80 (12%)

Query: 367 LGDAEIGDNVNIGAGTITCNYDGANKFKTIIGDDVFVGSDTQLVAPVTVGKGATIAAGTT 426
LGD + D V + AG+ N G DV T G AT A T
Sbjct: 616 LGDGD--DKVFLSAGSA--NIYAGK------GHDVVYYDKTDTGYLTIDGTKATEAGNYT 665

Query: 427 VTRNVGENALAISRVPQTQK 446
VTR +G + + V + Q+
Sbjct: 666 VTRVLGGDVKVLQEVVKEQE 685


50  ECP_3960  ECP_3965  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
ECP_3960  -2      16       3.064321    hypothetical protein
ECP_3961  -2      15       3.416611    ATP-dependent protease
ECP_3962  -2      23       3.837939    acetolactate synthase 2 catalytic subunit
ECP_3963  0       26       3.958791    branched-chain amino acid aminotransferase
ECP_3964  1       22       3.766266    dihydroxy-acid dehydratase
ECP_3965  1       20       3.128571    threonine dehydratase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
ECP_3961  HTHFIS  35     9e-04    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 34.8 bits (80), Expect = 9e-04
Identities = 40/196 (20%), Positives = 62/196 (31%), Gaps = 51/196 (26%)

Query: 170 KHALEHPKPTNAVSRALQHDLSDVVGQEQG----KRGLEITAAGGHNLLLIGPPGTGKTM 225
AL PK + D +VG+ R L L++ G GTGK +
Sbjct: 116 GRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKEL 175

Query: 226 LASRINGLLPDLSNEEALESAAILSLVNAESVQKQWRQRPFRSPHHSA--------SLTA 277
+A ++ R PF + + +A L
Sbjct: 176 VARALHDYGK-------------------------RRNGPFVAINMAAIPRDLIESELFG 210

Query: 278 MVGG---GAIP-GPGEISLAHNGVLFLDEL----PEFERRTLDALREPIESGQIHLSRTR 329
G GA G A G LFLDE+ + + R L L++ G+
Sbjct: 211 HEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQ----GEYT--TVG 264

Query: 330 AKITYPARFQLVAAMN 345
+ + ++VAA N
Sbjct: 265 GRTPIRSDVRIVAATN 280


51  ECP_4001  ECP_4011  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias     Product
ECP_4001  0       19       3.210935    hypothetical protein
ECP_4002  -1      21       4.757839    hypothetical protein
ECP_4003  -2      17       3.641441    diaminopimelate epimerase
ECP_4004  -2      17       2.570339    hypothetical protein
ECP_4005  -2      17       2.297786    site-specific tyrosine recombinase XerC
ECP_4006  -2      14       0.475565    flavin mononucleotide phosphatase
ECP_4007  -1      11       -2.683683   DNA-dependent helicase II
ECP_4008  -1      13       -6.304003   hypothetical protein
ECP_4009  -1      13       -5.994710   hypothetical protein
ECP_4010  -1      12       -5.018313   magnesium/nickel/cobalt transporter CorA
ECP_4011  -1      13       -3.607368   hypothetical protein
52  ECP_4056  ECP_4088  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
ECP_4056  -1      17       3.021087    3-octaprenyl-4-hydroxybenzoate carboxy-lyase
ECP_4057  -2      18       2.947906    FMN reductase
ECP_4058  -2      19       2.974822    3-ketoacyl-CoA thiolase
ECP_4059  -2      18       2.023256    multifunctional fatty acid oxidation complex
ECP_4060  -3      13       1.036346    proline dipeptidase
ECP_4061  -1      13       0.312476    hypothetical protein
ECP_4062  0       14       -0.955042   potassium transporter
ECP_4063  -1      14       -2.479181   protoporphyrinogen oxidase
ECP_4066  -1      20       -5.171771   **molybdopterin-guanine dinucleotide biosynthesis
ECP_4067  -2      19       -5.683765   molybdopterin-guanine dinucleotide biosynthesis
ECP_4068  -1      20       -6.907265   hypothetical protein
ECP_4069  -2      14       -3.955596   serine/threonine protein kinase
ECP_4070  -1      12       -3.529907   protein disulfide isomerase I
ECP_4071  -1      12       -3.139517   hypothetical protein
ECP_4072  0       13       -0.671233   acyltransferase
ECP_4073  1       14       0.262837    hypothetical protein
ECP_4074  0       15       1.813288    DNA polymerase I
ECP_4075  -1      17       2.909521    GTP-binding protein EngB
ECP_4076  -1      18       2.668750    hypothetical protein
ECP_4077  2       25       2.585145    coproporphyrinogen III oxidase
ECP_4078  2       24       2.196837    hypothetical protein
ECP_4079  1       19       0.811632    nitrogen regulation protein NR(I)
ECP_4080  1       18       -1.171889   nitrogen regulation protein NR(II)
ECP_4081  1       18       -1.953998   glutamine synthetase
ECP_4082  0       14       -2.552967   GTP-binding protein
ECP_4083  -1      21       -3.590247   transcriptional regulator YihL
ECP_4084  -1      21       -2.973238   hypothetical protein
ECP_4085  0       19       -1.185510   membrane protein YihN
ECP_4086  1       22       -0.330255   transcriptional regulator YihW
ECP_4087  1       22       -1.168747   sugar kinase
ECP_4088  2       22       -1.331051   oxidoreductase YihU
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_4076  SECA  29     0.007    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 29.5 bits (66), Expect = 0.007
Identities = 11/71 (15%), Positives = 29/71 (40%)

Query: 14 AKARRKTREELDQEARDRKRLKKRRGHAPGSRAAGGNTTSGSKGQNAPKDPRIGSKTPIP 73
+K + + EE+++ + R+ +R ++ + + + ++G P P
Sbjct: 827 SKVQVRMPEEVEELEQQRRMEAERLAQMQQLSHQDDDSAAAAALAAQTGERKVGRNDPCP 886

Query: 74 LGVAEKVTKQH 84
G +K + H
Sbjct: 887 CGSGKKYKQCH 897


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
ECP_4079  HTHFIS  601    0.0      FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 601 bits (1552), Expect = 0.0
Identities = 205/478 (42%), Positives = 299/478 (62%), Gaps = 11/478 (2%)

Query: 1 MQRGIVWVVDDDSSIRWVLERALAGAGLTCTTFENGAEVLEALASKTPDVLLSDIRMPGM 60
M + V DDD++IR VL +AL+ AG N A + +A+ D++++D+ MP
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 DGLALLKQIKQRHPMLPVIIMTAHSDLDAAVSAYQQGAFDYLPKPFDIDEAVALVERAIS 120
+ LL +IK+ P LPV++M+A + A+ A ++GA+DYLPKPFD+ E + ++ RA++
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 121 HYQEQQQPRNVQLNGPTTDIIGEAPAMQDVFRIIGRLSRSSISVLINGESGTGKELVAHA 180
+ + ++G + AMQ+++R++ RL ++ ++++I GESGTGKELVA A
Sbjct: 121 EPKRRPSKLEDDSQDGM-PLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARA 179

Query: 181 LHRHSPRAKAPFIALNMAAIPKDLIESELFGHEKGAFTGANTIRQGRFEQADGGTLFLDE 240
LH + R PF+A+NMAAIP+DLIESELFGHEKGAFTGA T GRFEQA+GGTLFLDE
Sbjct: 180 LHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDE 239

Query: 241 IGDMPLDVQTRLLRVLADGQFYRVGGYAPVKVDVRIIAATHQNLEQRVQEGKFREDLFHR 300
IGDMP+D QTRLLRVL G++ VGG P++ DVRI+AAT+++L+Q + +G FREDL++R
Sbjct: 240 IGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYR 299

Query: 301 LNVIRVHLPPLRERREDIPRLARHFLQVAARDLGVEAKLLHPETEAALTRLAWPGNVRQL 360
LNV+ + LPPLR+R EDIP L RHF+Q A ++ G++ K E + WPGNVR+L
Sbjct: 300 LNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKE-GLDVKRFDQEALELMKAHPWPGNVREL 358

Query: 361 ENTCRWLTVMAAGQEVLIQDLPGELFESNVPESTSHMQPDSWATLLAQWADRALRS---- 416
EN R LT + + + + EL S + ++Q + +R
Sbjct: 359 ENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFAS 418

Query: 417 -----GHQNLLSEAQPELERTLLTTALRHTQGHKQEAARLLGWGRNTLTRKLKELGME 469
L E+E L+ AL T+G++ +AA LLG RNTL +K++ELG+
Sbjct: 419 FGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVS 476


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
ECP_4082  TCRTETOQM  180    4e-51    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 180 bits (458), Expect = 4e-51
Identities = 97/445 (21%), Positives = 170/445 (38%), Gaps = 81/445 (18%)

Query: 4 KLRNIAIIAHVDHGKTTLVDKLLQQSGTFDSRAETQE--RVMDSNDLEKERGITILAKNT 61
K+ NI ++AHVD GKTTL + LL SG + D+ LE++RGITI T
Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61

Query: 62 AIKWNDYRINIVDTPGHADFGGEVERVMSMVDSVLLVVDAFDGPMPQTRFVTKKAFAYGL 121
+ +W + ++NI+DTPGH DF EV R +S++D +L++ A DG QTR + G+
Sbjct: 62 SFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGI 121

Query: 122 KPIVVINKVDRPGARPDWVVDQVFD-------------LFVNLDATDEQLD--------- 159
I INK+D+ G V + + L+ N+ T+
Sbjct: 122 PTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIEG 181

Query: 160 --------------------------------FPIVYASALNGIAGLDHEDMAEDMTPLY 187
FP+ + SA N I G+D+ L
Sbjct: 182 NDDLLEKYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNNI-GIDN---------LI 231

Query: 188 QAIVDHVPAPDVDLDGPFQMQISQLDYNSYVGVIGIGRIKRGKVKPNQQVTIIDSEGKTR 247
+ I + + ++ +++Y+ + R+ G + V I + E
Sbjct: 232 EVITNKFYSSTHRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKEKI-- 289

Query: 248 NAKVGKVLGHLGLERIETDLAEAGDIVAITGLGELNISDTVCDTQNVEALPALSVDEPTV 307
K+ ++ + E + D A +G+IV + L ++ + DT+ + + P +
Sbjct: 290 --KITEMYTSINGELCKIDKAYSGEIVILQNEF-LKLNSVLGDTKLLPQRERIENPLPLL 346

Query: 308 SMFFCVNTSPFCGKEGKFVTSRQILDRLNKELVHNVALRVEETEDADAFRVSGRGELHLS 367
+ + D L LR +S G++ +
Sbjct: 347 QTTVEPSKPQQREMLLDALLEISDSDPL---------LRYYVDSATHEIILSFLGKVQME 397

Query: 368 VLIENMRRE-GFELAVSRPKVIFRE 391
V ++ + E+ + P VI+ E
Sbjct: 398 VTCALLQEKYHVEIEIKEPTVIYME 422



Score = 32.5 bits (74), Expect = 0.005
Identities = 13/75 (17%), Positives = 29/75 (38%), Gaps = 1/75 (1%)

Query: 398 EPYENVTLDVEEQHQGSVMQALGERKGDLKNMNPDGKGRVRLDYVIPSRGLIGFRSEFMT 457
EPY + + +++ + ++ + V L IP+R + +RS+
Sbjct: 537 EPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQLKN-NEVILSGEIPARCIQEYRSDLTF 595

Query: 458 MTSGTGLLYSTFSHY 472
T+G + + Y
Sbjct: 596 FTNGRSVCLTELKGY 610


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_4085  TCRTETB  29     0.025    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 29.5 bits (66), Expect = 0.025
Identities = 31/161 (19%), Positives = 64/161 (39%), Gaps = 15/161 (9%)

Query: 227 NVFFVYAVYCGLTFFIPFLKNIYLLP----------VALVGAYGIINQYCLKMIGGPIGG 276
N+ F+ V CG F + ++P A +G+ I +I G IGG
Sbjct: 255 NIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGG 314

Query: 277 MISDKILKSPSKYLCYTFIISTAALVLLIMLPHESMPVYLGMACTLGFGAIVFTQRAVFF 336
++ D+ + P L + + + L E+ ++ + G + FT+
Sbjct: 315 ILVDR--RGPLYVLNIGVTFLSVSFLTASFLL-ETTSWFMTIIIVFVLGGLSFTK--TVI 369

Query: 337 APIGEAKIAENKTGAAMALGSFIGYAPAMFCFSLYGYILDL 377
+ I + + + + GA M+L +F + ++ G +L +
Sbjct: 370 STIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSI 410


53  ECP_4139  ECP_4157  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
ECP_4139  0       12       3.221253    1,4-dihydroxy-2-naphthoate
ECP_4140  1       15       3.218567    ATP-dependent protease ATP-binding subunit HslU
ECP_4141  3       14       3.520822    ATP-dependent protease peptidase subunit
ECP_4142  2       14       3.562714    cell division protein FtsN
ECP_4143  1       14       3.097451    DNA-binding transcriptional regulator CytR
ECP_4144  1       18       4.822149    primosome assembly protein PriA
ECP_4145  -2      15       2.169412    50S ribosomal protein L31
ECP_4146  -3      10       0.106090    peptidoglycan peptidase
ECP_4147  -2      10       -2.014420   transcriptional repressor protein MetJ
ECP_4148  -2      12       -2.413155   cystathionine gamma-synthase
ECP_4149  -1      11       -3.397213   bifunctional aspartate kinase II/homoserine
ECP_4150  -1      18       -6.157098   nucleoside-specific channel-forming protein tsx
ECP_4151  -2      11       -3.131086   5'-nucleotidase
ECP_4152  -2      11       -1.698389   5'-nucleotidase
ECP_4153  -2      12       0.424054    hypothetical protein
ECP_4154  -2      13       1.624979    5'-nucleotidase
ECP_4155  -2      16       2.954826    5,10-methylenetetrahydrofolate reductase
ECP_4156  -1      18       3.346086    peroxidase/catalase HPI
ECP_4157  0       15       3.015255    transporter
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
ECP_4140  HTHFIS  30     0.018    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 30.2 bits (68), Expect = 0.018
Identities = 11/36 (30%), Positives = 18/36 (50%), Gaps = 3/36 (8%)

Query: 49 TPKNILMIGPTGVGKTEIAR---RLAKLANAPFIKV 81
T +++ G +G GK +AR K N PF+ +
Sbjct: 159 TDLTLMITGESGTGKELVARALHDYGKRRNGPFVAI 194


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
ECP_4142  IGASERPTASE  42     4e-06    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 41.6 bits (97), Expect = 4e-06
Identities = 32/155 (20%), Positives = 63/155 (40%), Gaps = 5/155 (3%)

Query: 114 LTPEQRQLLEQMQADMRQQPTQLVEVPWNEQTPEQRQQTLQRQRQAQQLAEQQRLVQQSR 173
+ +QAD+ P+ E+ ++ P + +AE + Q+S+
Sbjct: 992 VDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSK--QESK 1049

Query: 174 TTEQSWQQQT-RTSQAAPVQAQPRQSKPAYTQQPYQDLLQTPAHTTAQSKPQQAAPVARA 232
T E++ Q T T+Q V + + + A TQ + T ++ ++ A V +
Sbjct: 1050 TVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKE 1109

Query: 233 ADAPKPTAEKKDERRWMVQCGSFRGAEQAETVRAQ 267
A T + ++ + Q + EQ+ETV+ Q
Sbjct: 1110 EKAKVETEKTQEVPKVTSQVSPKQ--EQSETVQPQ 1142


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
ECP_4150  CHANNELTSX  341    e-121    Nucleoside-specific channel-forming protein Tsx signa...
		>CHANNELTSX#Nucleoside-specific channel-forming protein Tsx signature.

Length = 294

Score = 341 bits (875), Expect = e-121
Identities = 167/257 (64%), Positives = 200/257 (77%), Gaps = 6/257 (2%)

Query: 1 MNVIGRTDSRFGPRLTNDLYPEYTVAGRKDWFDFYGYVDLPKFFGVGSHYDVGIWDEGSP 60
+NV+G +RFGP++ ND Y EY +KDWFDFYGY+D P FFG G+ GIW++GSP
Sbjct: 39 VNVVGSYHTRFGPQIRNDTYLEYEAFAKKDWFDFYGYIDAPVFFG-GNSTAKGIWNKGSP 97

Query: 61 LFTEIEPRFSIDKLTGLNLAFGPFKEWFIANNYVYDMGDNQSSRQSTWYMGLGTDIDTGL 120
LF EIEPRFSIDKLT +L+FGPFKEW+ ANNY+YDMG N S QSTWYMGLGTDIDTGL
Sbjct: 98 LFMEIEPRFSIDKLTNTDLSFGPFKEWYFANNYIYDMGRNDSQEQSTWYMGLGTDIDTGL 157

Query: 121 PIKLSANIYAKYQWQNYGAANENEWDGYRFKIKYSIPLTNLFGGRLVYNSFTNFDFGSDL 180
P+ LS N+YAKYQWQNYGA+NENEWDGYRFK+KY +PLT+L+GG L Y FTNFD+GSDL
Sbjct: 158 PMSLSLNVYAKYQWQNYGASNENEWDGYRFKVKYFVPLTDLWGGSLSYIGFTNFDWGSDL 217

Query: 181 ADKSHNN-----KRTSNAIASSHILSLLYEHWKFAFTLRYFHNGGQWNAGEKVNFGDGPF 235
D + + RTSN+IASSHIL+L Y HW ++ RYFHNGGQW K+NFGDGPF
Sbjct: 218 GDDNFYDLNGKHARTSNSIASSHILALNYAHWHYSIVARYFHNGGQWADDAKLNFGDGPF 277

Query: 236 ELKNTGWGTYTTIGYQF 252
+++TGWG Y +GY F
Sbjct: 278 SVRSTGWGGYFVVGYNF 294


54  ECP_4296  ECP_4305  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
ECP_4296  -2      19       3.484748    hypothetical protein
ECP_4297  -2      20       3.144974    murein hydrolase exporter
ECP_4298  -2      20       2.827597    LrgB family murein hydrolase regulator
ECP_4299  -2      22       2.939317    acetate permease
ECP_4300  -1      21       3.183757    hypothetical protein
ECP_4301  -2      22       3.923317    acetyl-CoA synthetase
ECP_4302  -1      17       4.002028    hypothetical protein
ECP_4303  -1      17       4.226383    cytochrome c552
ECP_4304  -1      19       4.645019    cytochrome c nitrite reductase pentaheme
ECP_4305  -1      20       4.223543    NrfC protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
ECP_4300  RTXTOXIND  27     0.019    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 26.7 bits (59), Expect = 0.019
Identities = 5/33 (15%), Positives = 13/33 (39%), Gaps = 1/33 (3%)

Query: 17 ELVEKR-QRFATILSIIMLAVYIGFILLIAFAP 48
EL+E R +++ ++ + +L
Sbjct: 47 ELIETPVSRRPRLVAYFIMGFLVIAFILSVLGQ 79


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_4305  VACJLIPOPROT  30     0.007    VacJ lipoprotein signature.
		>VACJLIPOPROT#VacJ lipoprotein signature.

Length = 251

Score = 29.9 bits (67), Expect = 0.007
Identities = 6/21 (28%), Positives = 12/21 (57%)

Query: 179 FGNLDDPSSEISQLLRQKPTY 199
GNL++P+ ++ L+ P
Sbjct: 75 TGNLEEPAVMVNYFLQGDPYQ 95


55  ECP_4333  ECP_4346  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
ECP_4333  3       28       6.291569    ribose-5-phosphate isomerase B
ECP_4334  2       32       7.380338    hypothetical protein
ECP_4335  1       34       8.057862    carbon-phosphorus lyase complex accessory
ECP_4336  0       38       7.993772    aminoalkylphosphonic acid N-acetyltransferase
ECP_4337  1       37       8.550282    ribose 1,5-bisphosphokinase
ECP_4338  1       39       8.861376    HisM-like integral membrane protein PhnM
ECP_4339  0       37       8.952257    phosphonates transport ATP-binding protein PhnL
ECP_4340  0       38       9.398554    phosphonate C-P lyase system protein PhnK
ECP_4341  0       40       9.290983    PhnJ protein
ECP_4342  3       39       8.340230    PhnI protein
ECP_4343  1       40       7.816235    carbon-phosphorus lyase complex subunit
ECP_4344  1       37       6.739020    PhnG protein
ECP_4345  2       37       5.423273    phosphonate metabolism transcriptional regulator
ECP_4346  2       32       3.302684    phosphonates transport system permease PhnE
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_4336  SACTRNSFRASE  33     3e-04    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 32.6 bits (74), Expect = 3e-04
Identities = 20/83 (24%), Positives = 32/83 (38%), Gaps = 5/83 (6%)

Query: 51 LALLDGEVVGMIGLHLQFHLHHVNWIGEIQELVVMPQARGLNVGSKLLAWAEEEARQAGA 110
L L+ +G I + + N I+++ V R VG+ LL A E A++
Sbjct: 69 LYYLENNCIGRIKIRSNW-----NGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHF 123

Query: 111 EMTELSTNVKRHDAHRFYLREGY 133
L T A FY + +
Sbjct: 124 CGLMLETQDINISACHFYAKHHF 146


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_4339  PF05272  29     0.021    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.9 bits (64), Expect = 0.021
Identities = 17/70 (24%), Positives = 26/70 (37%), Gaps = 8/70 (11%)

Query: 36 CVVLHGHSGSGKSTLLRSLYANYLPDEGQIQIKHGEEWVDLVTAPARKVVEI------RK 89
VVL G G GKSTL+ +L + I G++ + + E+ R+
Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQIAGIV--AYELSEMTAFRR 655

Query: 90 TTIGWVSQFL 99
V F
Sbjct: 656 ADAEAVKAFF 665


56  ECP_4366  ECP_4378  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
ECP_4366  1       16       -3.932557   hypothetical protein
ECP_4367  1       17       -4.417609   DNA-binding transcriptional activator DcuR
ECP_4368  2       13       -4.753462   sensory histidine kinase DcuS
ECP_4369  -1      15       -4.339168   hypothetical protein
ECP_4370  -1      17       -4.260793   hypothetical protein
ECP_4371  0       22       -3.609450   hypothetical protein
ECP_4372  0       18       -4.233691   hypothetical protein
ECP_4373  0       18       -3.969331   lysyl-tRNA synthetase
ECP_4374  0       18       -3.462115   POT family di-/tripeptide transport protein
ECP_4375  0       17       -2.867730   lysine decarboxylase, inducible
ECP_4376  1       16       -2.340792   lysine/cadaverine antiporter
ECP_4377  0       15       -1.684558   DNA-binding transcriptional activator CadC
ECP_4378  2       21       0.197136    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
ECP_4367  HTHFIS  70     4e-16    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 70.2 bits (172), Expect = 4e-16
Identities = 31/109 (28%), Positives = 50/109 (45%), Gaps = 4/109 (3%)

Query: 4 VLIIDDDAMVAELNRRYVAQIPGFQCCGTASTLEKAKEIIFNSDTPIDLILLDIYMQKEN 63
+L+ DDDA + + + +++ G+ S I + DL++ D+ M EN
Sbjct: 6 ILVADDDAAIRTVLNQALSRA-GYDVR-ITSNAATLWRWI--AAGDGDLVVTDVVMPDEN 61

Query: 64 GLDLLPVLHNARCKSDVIVISSAADAATIKDSLHYGVVDYLIKPFQASR 112
DLLP + AR V+V+S+ T + G DYL KPF +
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTE 110


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_4368  PF06580  41     8e-06    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 41.0 bits (96), Expect = 8e-06
Identities = 21/99 (21%), Positives = 38/99 (38%), Gaps = 18/99 (18%)

Query: 442 LIENALE-ALGP-EPGGEISVTLHYRHGWLHCEVNDDGPGIAPDKIDHIFDKGVSTKGSE 499
L+EN ++ + GG+I + +G + EV + G +
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKN------------TKES 310

Query: 500 RGVGLALVKQQVENLGG---SIAVESEPGIFTQFFVQIP 535
G GL V+++++ L G I + + G V IP
Sbjct: 311 TGTGLQNVRERLQMLYGTEAQIKLSEKQGKVN-AMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_4370  SACTRNSFRASE  26     0.012    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 26.4 bits (58), Expect = 0.012
Identities = 9/28 (32%), Positives = 16/28 (57%)

Query: 32 LAIIEHTDVDESLKGQGIGKQLVAKVVE 59
A+IE V + + +G+G L+ K +E
Sbjct: 89 YALIEDIAVAKDYRKKGVGTALLHKAIE 116


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_4374  TCRTETA  30     0.028    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 29.8 bits (67), Expect = 0.028
Identities = 36/190 (18%), Positives = 66/190 (34%), Gaps = 14/190 (7%)

Query: 44 NHAISLFSAYA-SLVYVTPILGGWLADRLLGNRTAVIAGALLMTLGHVVLGIDTNSTFSL 102
H L + YA P+LG +DR G R ++ + + ++ + L
Sbjct: 43 AHYGILLALYALMQFACAPVLGAL-SDRF-GRRPVLLVSLAGAAVDYAIMAT-APFLWVL 99

Query: 103 YLALAIIICGYGLFKSNISCLLGELYDEND-HRRDGGFSLLYAAGNIGSIAAPIACGLAA 161
Y+ + G+ + + + D D R F + A G +A P+ GL
Sbjct: 100 YIGRIV----AGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMG 155

Query: 162 QWYGWHVGFALAGGGMFIGLLIFLSGHRHFQSTRSMDKKALTSVKF-ALPVWSWLVVMLC 220
+ H F A + L FL+G + +++ L L + W M
Sbjct: 156 G-FSPHAPFFAAA---ALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTV 211

Query: 221 LAPVFFTLLL 230
+A + +
Sbjct: 212 VAALMAVFFI 221


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_4377  SYCDCHAPRONE  37     8e-05    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 36.8 bits (85), Expect = 8e-05
Identities = 16/97 (16%), Positives = 36/97 (37%), Gaps = 7/97 (7%)

Query: 391 PLDEKQLAALNTEIDNIVTLPELNNLS-----IIYQIKAVSALVKGKTDESYQAINTGID 445
++ A+ + + T+ LN +S +Y + A + GK +++++
Sbjct: 6 TDTQEYQLAMESFLKGGGTIAMLNEISSDTLEQLYSL-AFNQYQSGKYEDAHKVFQALCV 64

Query: 446 LEMSWLNYVL-LGKVYEMKGMNREAADAYLTAFNLRP 481
L+ + L LG + G A +Y +
Sbjct: 65 LDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDI 101


57  ECP_4407  ECP_4425  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
ECP_4407  -1      13       3.306650    hypothetical protein
ECP_4411  -1      12       3.347730    ***electron transport protein YjeS
ECP_4412  -1      12       3.408553    hypothetical protein
ECP_4413  -1      14       2.611194    ATPase
ECP_4414  0       13       3.155866    N-acetylmuramoyl-L-alanine amidase
ECP_4415  1       14       2.811126    DNA mismatch repair protein
ECP_4416  2       19       1.920152    tRNA delta(2)-isopentenylpyrophosphate
ECP_4417  4       25       2.070346    RNA-binding protein Hfq
ECP_4418  4       23       1.966171    GTPase HflX
ECP_4419  4       22       2.374305    FtsH protease regulator HflK
ECP_4420  4       22       2.088277    FtsH protease regulator HflC
ECP_4421  2       18       1.201578    membrane protein YjeT
ECP_4422  3       17       1.099569    adenylosuccinate synthetase
ECP_4423  4       13       0.171020    transcriptional repressor NsrR
ECP_4424  4       13       -0.054781   exoribonuclease R
ECP_4425  2       17       -2.720723   23S rRNA (guanosine-2'-O-)-methyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_4418  SECA  32     0.005    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 31.8 bits (72), Expect = 0.005
Identities = 26/144 (18%), Positives = 54/144 (37%), Gaps = 6/144 (4%)

Query: 282 HVIDAADVRVQENIEAVNTVLEEIDAHEIPTLLVMNKIDMLDDFEPRIDRDEENK-PIRV 340
++D +DV N + IDA+ P L ++ + + R+ D + PI
Sbjct: 665 ELLDVSDVSETINSIREDVFKATIDAYIPPQSL--EEMWDIPGLQERLKNDFDLDLPIAE 722

Query: 341 WLSAQTGAGIPQLFQALTERLSGEVAQHTLRLPPQEGRLRSRFYQLQAIEKEWMEEDGSV 400
WL + L + + + + + + R + LQ ++ W E ++
Sbjct: 723 WLDKEPELHEETLRERILAQSIEVYQRKEEVVGAEMMRHFEKGVMLQTLDSLWKEHLAAM 782

Query: 401 SLQVRMPIVDWRRLCKQEPALIDY 424
+R I R +++P +Y
Sbjct: 783 D-YLRQGIH-LRGYAQKDP-KQEY 803


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_4419  cloacin  32     0.006    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 31.6 bits (71), Expect = 0.006
Identities = 25/81 (30%), Positives = 30/81 (37%), Gaps = 10/81 (12%)

Query: 17 GSSKPGGNSEGNGNKGGRDQGPPDLDDIFRKLSKKLGGLGGGKGTGSGGGSSSQGP---- 72
S G +SE N GG G G GGG GTG G S+ P
Sbjct: 33 ASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTG-GNLSAVAAPVAFG 91

Query: 73 -----RPQLGGRVVTIAAAAI 88
P GG V+I+A A+
Sbjct: 92 FPALSTPGAGGLAVSISAGAL 112


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
ECP_4424  RTXTOXIND  31     0.028    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 30.6 bits (69), Expect = 0.028
Identities = 12/55 (21%), Positives = 24/55 (43%), Gaps = 1/55 (1%)

Query: 165 VVPDDSRLSFDILIPPDQIMGARMGFVVVVELTQRPTRRTKAV-GKIVEVLGDNM 218
+VP+D L L+ I +G ++++ P R + GK+ + D +
Sbjct: 359 IVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAI 413


58  ECP_4440  ECP_4449  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias     %GCBias  Product
ECP_4440      -2       23    3.019939  L-ascorbate-specific enzyme IIA component of
ECP_4441      -2       22    2.851834  3-keto-L-gulonate-6-phosphate decarboxylase
ECP_4442      -1       23    2.363667  L-xylulose 5-phosphate 3-epimerase
ECP_4443       2       26    0.472292  L-ribulose-5-phosphate 4-epimerase
ECP_4444       3       28   -2.391322  hypothetical protein
ECP_4445       3       29   -3.201361  30S ribosomal protein S6
ECP_4446       2       30   -4.401290  primosomal replication protein N
ECP_4447       3       37   -5.839528  30S ribosomal protein S18
ECP_4448       2       31   -5.994710  50S ribosomal protein L9
ECP_4449      -1       26   -3.911679  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_4441  ECOLNEIPORIN  27  0.037  E.coli/Neisseria porin superfamily signature.
		>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature.

Length = 331

Score = 27.5 bits (61), Expect = 0.037
Identities = 6/19 (31%), Positives = 7/19 (36%), Gaps = 2/19 (10%)

Query: 105 FNGDVQI--ELTGYWTWEQ 121
F G + L W EQ
Sbjct: 62 FKGQEDLGNGLKAIWQVEQ 80


59  ECP_4515  ECP_4591  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias     %GCBias  Product
ECP_4515       0       22   -3.718116  gluconate 5-dehydrogenase
ECP_4516       0       22   -3.721868  L-idonate 5-dehydrogenase
ECP_4519       1       22   -3.684728  zinc-type alcohol dehydrogenase-like protein
ECP_4521       2       26   -5.208461  *prophage P4 integrase
ECP_4522       1       23   -5.258906  hypothetical protein
ECP_4523       1       24   -5.079144  superfamily I DNA helicases
ECP_4524       5       29   -5.350913  hypothetical protein
ECP_4525       4       37   -7.381801  OmpA family protein
ECP_4526       5       43  -10.958507  hypothetical protein
ECP_4527       8       43   -8.726093  hypothetical protein
ECP_4528       8       41   -9.626932  hypothetical protein
ECP_4529       9       40   -9.337180  hypothetical protein
ECP_4530       8       42  -10.000732  hypothetical protein
ECP_4531       7       38   -8.229014  PapX protein
ECP_4532       7       39   -5.317218  hypothetical protein
ECP_4533       6       39   -5.459331  protein PapG
ECP_4534       6       34   -1.367789  PapF protein
ECP_4535       5       33   -0.965830  PapE protein
ECP_4536       5       31   -0.684212  PapK fimbrial adapter
ECP_4537       5       33   -1.377199  protein PapJ
ECP_4538       5       31   -2.584714  chaperone protein PapD
ECP_4539       5       30   -2.330371  outer membrane usher protein PapC
ECP_4540       4       40   -6.502853  protein PapH
ECP_4541       3       42   -8.130855  protein PapA
ECP_4542       2       41   -8.065825  major pilu subunit operon regulatory protein
ECP_4543       2       41   -8.493043  hypothetical protein
ECP_4544       2       39   -6.080590  hypothetical protein
ECP_4545       3       28   -3.207919  hypothetical protein
ECP_4546       1       26   -2.363122  hypothetical protein
ECP_4547       1       25   -0.828738  hypothetical protein
ECP_4548       2       30   -5.507326  hypothetical protein
ECP_4549       3       33   -7.527011  transposase
ECP_4550       4       36   -8.622487  transposase
ECP_4551       5       39   -9.634940  transposase
ECP_4552       6       40  -11.012764  transposase
ECP_4553       6       42  -11.464293  hemolysin D
ECP_4554       6       42  -11.435873  alpha-hemolysin translocation ATP-binding
ECP_4555       7       40  -11.132863  hemolysin A
ECP_4556       7       37  -12.006608  hemolysin-activating lysine-acyltransferase
ECP_4557       6       34   -9.697609  hypothetical protein
ECP_4558       6       34   -7.453773  hypothetical protein
ECP_4559       7       33   -6.826716  hypothetical protein
ECP_4560       6       31   -6.260688  hypothetical protein
ECP_4561       5       31   -5.492633  hypothetical protein
ECP_4562       5       33   -4.643143  DNA-binding response regulator
ECP_4563       5       36   -4.078170  hypothetical protein
ECP_4564       4       36   -5.160509  histidine kinase-like ATPases
ECP_4565       4       46  -12.843798  transposase-like protein
ECP_4566       8       54  -16.150592  hypothetical protein
ECP_4567      10       55  -16.080279  transposase
ECP_4568      10       58  -16.488849  hypothetical protein
ECP_4569      10       56  -16.141053  hypothetical protein
ECP_4570       8       53  -14.724078  hypothetical protein
ECP_4571       7       46  -12.502802  hypothetical protein
ECP_4572       3       32   -4.736146  modification methylase
ECP_4573       3       24    2.163884  hypothetical protein
ECP_4574       1       23   -0.384412  hypothetical protein
ECP_4575       5       19    3.185980  hypothetical protein
ECP_4576       6       20    3.359719  hypothetical protein
ECP_4577       6       20    3.256059  hypothetical protein
ECP_4578       6       21    3.301227  hypothetical protein
ECP_4579       6       20    3.088063  hypothetical protein
ECP_4580       6       21    3.481376  hemagglutinin-related protein
ECP_4581       8       26    1.924636  hemolysin activator HlyB
ECP_4582       5       26    0.126633  hypothetical protein
ECP_4583       4       23   -0.026044  hypothetical protein
ECP_4584       1       18   -0.167718  hypothetical protein
ECP_4585       1       18   -0.943105  hypothetical protein
ECP_4587       1       16   -1.334159  hypothetical protein
ECP_4588       1       17   -1.373502  hypothetical protein
ECP_4589       2       19   -2.792818  hypothetical protein
ECP_4590       2       18   -2.997396  D-serine dehydratase
ECP_4591       1       18   -3.352415  permease DsdX
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_4515  DHBDHDRGNASE  144  1e-44  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 144 bits (365), Expect = 1e-44
Identities = 86/256 (33%), Positives = 133/256 (51%), Gaps = 8/256 (3%)

Query: 7 LAGKNILITGSAQGIGFLLATGLGKYGAQIIINDITAERAELAVKKLHQEGIQAVAAPFN 66
+ GK ITG+AQGIG +A L GA I D E+ E V L E A A P +
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 67 VTHKHEIDAAVEHIEKDIGPIDVLVNNAGIQRRHPFTEFPEQEWNDVIAVNQTAVFLVSQ 126
V ID IE+++GPID+LVN AG+ R ++EW +VN T VF S+
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 127 AVTRHMVERKAGKVINICSMQSELGRDTITPYAASKGAVKMLTRGMCVELARHNIQVNGI 186
+V+++M++R++G ++ + S + + R ++ YA+SK A M T+ + +ELA +NI+ N +
Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185

Query: 187 APGYFKTEMTKALVEDE--------AFTAWLCKRTPAARWGDPQELIGAAVFLSSKASDF 238
+PG +T+M +L DE P + P ++ A +FL S +
Sbjct: 186 SPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGH 245

Query: 239 VNGHLLFVDGGMLVAV 254
+ H L VDGG + V
Sbjct: 246 ITMHNLCVDGGATLGV 261


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_4525  OMPADOMAIN  41  2e-06  OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 41.1 bits (96), Expect = 2e-06
Identities = 50/246 (20%), Positives = 79/246 (32%), Gaps = 55/246 (22%)

Query: 1 MNKVFVVSVVAAACVFAANAGAKEGKSGFYLTGKAGASVVSLSDQRFLSGDEEETSKYKG 60
M K + VA A FA A A + +Y K G S D F++
Sbjct: 1 MKKTAIAIAVALA-GFATVAQAAPKDNTWYTGAKLGWS--QYHDTGFIN---------NN 48

Query: 61 GDDHDTVFSGGIAAGYDFYPQFSIPVRTELEFYARGKADSKYNVDKDSWSGGYWRDDLKN 120
G H+ G GY P E+ + G+ K +V+ +
Sbjct: 49 GPTHENQLGAGAFGGYQVNPYVGF----EMGYDWLGRMPYKGSVE-----------NGAY 93

Query: 121 EVSVNTLMLNAYYDFRNDSAFTPWVSAGIGYARIHQKTTGISTWDYEYGSSGRESLSRSG 180
+ L Y +D Y R+ G W + S+ +G
Sbjct: 94 KAQGVQLTAKLGYPITDDLDI---------YTRL-----GGMVWRADTKSNVYGKNHDTG 139

Query: 181 SADNFAWSLGAGVRYDVTPDIALDLSYRYLDAGDSSVSYKDEWGDKYKSEVDVKSHDIML 240
+ F GV Y +TP+IA L Y++ + GD + + + L
Sbjct: 140 VSPVF----AGGVEYAITPEIATRLEYQWT----------NNIGDAHTIGTRPDNGMLSL 185

Query: 241 GMTYNF 246
G++Y F
Sbjct: 186 GVSYRF 191


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_4533  PF03627  549  0.0  PapG
		>PF03627#PapG

Length = 336

Score = 549 bits (1416), Expect = 0.0
Identities = 192/339 (56%), Positives = 232/339 (68%), Gaps = 7/339 (2%)

Query: 1 MKKWFPAFLF-LSLSGCNDALAANQSTIFYSFNDNIYHPQLSVKVTDIVQFIVDINSASS 59
MKKWFPA LF L +SG + A + +FYS + + +V +T QFI +
Sbjct: 1 MKKWFPALLFSLCVSGESSAW---NNIVFYSLGNVNSYQGGNVVITQRPQFITSWRPGIA 57

Query: 60 TATLSYVACNGFTWTHGLYWSEYFAWLVVPKHV-SYNGYNIYLELQSKGGFSLD-AEDND 117
T T + GF Y+ EY AW+V PK V + NGY +++E+ +KG +S + DND
Sbjct: 58 TVTWNQCNGPGFADGSWAYYREYIAWVVFPKKVMTKNGYPLFIEVHNKGSWSEENTGDND 117

Query: 118 NYYLTKGFAWDE-VNSSGRVCFDIGEKRSLAWSFGGVTLNARLPVDLPKGDYTFPVKFLR 176
+Y+ KG+ WDE +G +C GE L F + LP DLP GDY+ + +
Sbjct: 118 SYFFLKGYKWDERAFDAGNLCQKPGETTRLTEKFDDIIFKVALPADLPLGDYSVTIPYTS 177

Query: 177 GIQRNNYDYIGGRYKIPSSLMKTFPFNGTLNFSIKNTGGCRPSAQSLEINHGDLSINSAN 236
G+QR+ Y+G R+KIP ++ KT P + F KN GGCRPSAQSLEI HGDLSINSAN
Sbjct: 178 GMQRHFASYLGARFKIPYNVAKTLPRENEMLFLFKNIGGCRPSAQSLEIKHGDLSINSAN 237

Query: 237 NHYAAQTLSVSCDVPTNIRFFLLSNTTPAYSHGQQFSVGLGHGWDSIVSINGVDTGETTM 296
NHYAAQTLSVSCDVP NIRF LL NTTP YSHG++FSVGLGHGWDSIVS+NGVDTGETTM
Sbjct: 238 NHYAAQTLSVSCDVPANIRFMLLRNTTPTYSHGKKFSVGLGHGWDSIVSVNGVDTGETTM 297

Query: 297 RWYRAGTQNLTIGSRLYGESSKIQPGVLSGSATLLMILP 335
RWY+AGTQNLTIGSRLYGESSKIQPGVLSGSATLLMILP
Sbjct: 298 RWYKAGTQNLTIGSRLYGESSKIQPGVLSGSATLLMILP 336


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_4534  FIMBRIALPAPF  292  e-105  Escherichia coli: P pili tip fibrillum papF protein...
		>FIMBRIALPAPF#Escherichia coli: P pili tip fibrillum papF protein

signature.
Length = 167

Score = 292 bits (749), Expect = e-105
Identities = 165/167 (98%), Positives = 165/167 (98%)

Query: 1 MIRLSLFISLLLTSVAVLADVQINIRGNVYIPPCTINNGQNIVVDFGNINPEHVDNSRGE 60
MIRLSLFISLLLTSVAVLADVQINIRGNVYIPPCTINNGQNIVVDFGNINPEHVDNSRGE
Sbjct: 1 MIRLSLFISLLLTSVAVLADVQINIRGNVYIPPCTINNGQNIVVDFGNINPEHVDNSRGE 60

Query: 61 VTKTISISCPYKSGSLWIKVTGNTMGGGQNNVLATNITHFGIALYQGKGMSTPLTLGNGS 120
VTK ISISCPYKSGSLWIKVTGNTMG GQNNVLATNITHFGIALYQGKGMSTPLTLGNGS
Sbjct: 61 VTKNISISCPYKSGSLWIKVTGNTMGVGQNNVLATNITHFGIALYQGKGMSTPLTLGNGS 120

Query: 121 GNGYRVTAGLDTARSTFTFTSVPFRNGSGILNGGDFRTTASMSMIYN 167
GNGYRVTAGLDTARSTFTFTSVPFRNGSGILNGGDFRTTASMSMIYN
Sbjct: 121 GNGYRVTAGLDTARSTFTFTSVPFRNGSGILNGGDFRTTASMSMIYN 167


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_4535  FIMBRIALPAPE  276  9e-99  Escherichia coli: P pili tip fibrillum papE protein...
		>FIMBRIALPAPE#Escherichia coli: P pili tip fibrillum papE protein

signature.
Length = 173

Score = 276 bits (706), Expect = 9e-99
Identities = 153/158 (96%), Positives = 156/158 (98%)

Query: 1 MLMSQHAHAADNLTFKGKLIIPACTVQNAEVDWGDIEIQNLVQNGGNQKDFTVDMNCPYS 60
+LMSQH HAADNLTFKGKLIIPACTVQNAEV+WGDIEIQNLVQ+GGNQKDFTVDMNCPYS
Sbjct: 16 VLMSQHVHAADNLTFKGKLIIPACTVQNAEVNWGDIEIQNLVQSGGNQKDFTVDMNCPYS 75

Query: 61 LGTMKVTITSNGQTGNSILVPNTSTASGDGLLIYLYNSNNSGIGNAVTLGSQFTPGKITG 120
LGTMKVTITSNGQTGNSILVPNTSTASGDGLLIYLYNSNNSGIGNAVTLGSQ TPGKITG
Sbjct: 76 LGTMKVTITSNGQTGNSILVPNTSTASGDGLLIYLYNSNNSGIGNAVTLGSQVTPGKITG 135

Query: 121 TAPARKITLYAKLGYKGNMQSLQAGTFSATATLVASYS 158
TAPARKITLYAKLGYKGNMQSLQAGTFSATATLVASYS
Sbjct: 136 TAPARKITLYAKLGYKGNMQSLQAGTFSATATLVASYS 173


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_4539  PF00577  744  0.0  Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 744 bits (1922), Expect = 0.0
Identities = 244/882 (27%), Positives = 364/882 (41%), Gaps = 67/882 (7%)

Query: 2 MRVMKDRI-PFAVNNITCVILLSLFCNAASAVEFNTDVLDAADKKNIDFTRFSEAGYVLP 60
+ + K R+ F V + +++ + FN L + D +RF + P
Sbjct: 16 LHIRKHRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPP 75

Query: 61 GQYLLDVIVNGQSISPASLQISFVEPQSSGDKAEKKLPQACLTSDMVRLMGLTAESLDKV 120
G Y +D+ +N + A+ ++F S CLT + MGL S+ +
Sbjct: 76 GTYRVDIYLNNGYM--ATRDVTFNTGDSEQGI------VPCLTRAQLASMGLNTASVSGM 127

Query: 121 VYWHDGQCADF-HGLPGVDIRPDTGAGVLRINMPQAWLEYSDATWLPPSRWDDGIPGLML 179
D C + + D G L + +PQA++ ++PP WD GI +L
Sbjct: 128 NLLADDACVPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLL 187

Query: 180 DYNLNGTVSRNYQGGDSHQFSYNGTVGGNLGPWRLRADYQGSQEQSRYNGEKTTNRNFTW 239
+YN +G +N GG+SH N G N+G WRLR + S S + + +
Sbjct: 188 NYNFSGNSVQNRIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSS--DSSSGSKNKWQH 245

Query: 240 SRFYLFRAIPRWRANLTLGENNINSDIFRSWSYTGASLESDDRMLPPRLRGYAPQITGIA 299
+L R I R+ LTLG+ DIF ++ GA L SDD MLP RG+AP I GIA
Sbjct: 246 INTWLERDIIPLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIA 305

Query: 300 ETNARVVVSQQGRVLYDSMVPAGPFSIQDLD-SSVRGRLDVEVIEQNGRKKTFQVDTASV 358
A+V + Q G +Y+S VP GPF+I D+ + G L V + E +G + F V +SV
Sbjct: 306 RGTAQVTIKQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSV 365

Query: 359 PYLTRPGQVRYKLVSGRSRGYGHETEGPVFATGEASWGLSNQWSLYGGAVLAGDYNALAA 418
P L R G RY + +G R + E P F GL W++YGG LA Y A
Sbjct: 366 PLLQREGHTRYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNF 425

Query: 419 GAGWDLGVPGTLSADITQSVARIEGERTFQGKSWRLSYSKRFDNADADITFAGYRFSERN 478
G G ++G G LS D+TQ+ + + + G+S R Y+K + + +I GYR+S
Sbjct: 426 GIGKNMGALGALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSG 485

Query: 479 YMTMEQYLNARYR--------------------NDYSSREKEMYTVTLNKNVADWNTSFN 518
Y +R + + ++ +T+ + + + +
Sbjct: 486 YFNFADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTS-TLY 544

Query: 519 LQYSRQTYWDIRKTD-YYTVSVNRYFNVFGLQGVAVGLSASRSKYLGRD--NDSAYLRIS 575
L S QTYW D + +N F + LS S +K + + L ++
Sbjct: 545 LSGSHQTYWGTSNVDEQFQAGLNTAFE-----DINWTLSYSLTKNAWQKGRDQMLALNVN 599

Query: 576 VPLGT------------GTASYSGSMSND-RYVNMAGYTDT-FNDGLDSYSLNAGLNSGG 621
+P +ASYS S + R N+AG T D SYS+ G GG
Sbjct: 600 IPFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGG 659

Query: 622 GLTSQRQINAYYSHRSPLANLSANIASLQKGYTSFGVSASGGATITGKGAALHAGGMSGG 681
S A ++R N + S SGG G L G
Sbjct: 660 DGNSGSTGYATLNYRGGYGNANIG-YSHSDDIKQLYYGVSGGVLAHANGVTL--GQPLND 716

Query: 682 TRLLVDTDGVGGVPVDGGQVV-TNRWGTGVVTDISSYYRNTTSVDLKRLPDDVEATRSVV 740
T +LV G V+ V T+ G V+ + Y N ++D L D+V+ +V
Sbjct: 717 TVVLVKAPGAKDAKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVA 776

Query: 741 ESALTEGAIGYRKFSVLKGKRLFAILRLADGSQPPFGASVTSEKGRELGMVADEGLAWLS 800
T GAI +F G +L L + PFGA VTSE + G+VAD G +LS
Sbjct: 777 NVVPTRGAIVRAEFKARVGIKLLMTLT-HNNKPLPFGAMVTSESSQSSGIVADNGQVYLS 835

Query: 801 GVTPGETLSVNW--DGKIQCQVNVPETAISDQQLL----LPC 836
G+ + V W + C N S QQLL C
Sbjct: 836 GMPLAGKVQVKWGEEENAHCVANYQLPPESQQQLLTQLSAEC 877


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_4540  FIMBRIALPAPE  32  0.001  Escherichia coli: P pili tip fibrillum papE protein...
		>FIMBRIALPAPE#Escherichia coli: P pili tip fibrillum papE protein

signature.
Length = 173

Score = 31.5 bits (71), Expect = 0.001
Identities = 41/173 (23%), Positives = 75/173 (43%), Gaps = 29/173 (16%)

Query: 29 GMSLPEYWG----EEHVWWDGRAAFHGEVVRPACTLAMEDAWQIIDMGETPVRDL-QNGF 83
G+ LP G +HV F G+++ PACT+ + ++ G+ +++L Q+G
Sbjct: 6 GLCLPVMLGAVLMSQHVHAADNLTFKGKLIIPACTVQNAE----VNWGDIEIQNLVQSG- 60

Query: 84 SGPERKFSLRLRNCEFNSQGGNLFSDSRIRVTFDGVRGET---PDKFNLSGQAKGINLQI 140
G ++ F++ + NC ++ ++ +T +G G + P+ SG I L
Sbjct: 61 -GNQKDFTVDM-NCPYS------LGTMKVTITSNGQTGNSILVPNTSTASGDGLLIYLYN 112

Query: 141 ADARGNIARAGKV-MPAIP--LTGNEEALDYTLRIVR----NGKKLEAGNYFA 186
++ I A + P +TG A TL N + L+AG + A
Sbjct: 113 SNN-SGIGNAVTLGSQVTPGKITGTAPARKITLYAKLGYKGNMQSLQAGTFSA 164


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_4542  FIMREGULATRY  168  3e-58  Escherichia coli: P pili regulatory PapB protein si...
		>FIMREGULATRY#Escherichia coli: P pili regulatory PapB protein

signature.
Length = 104

Score = 168 bits (428), Expect = 3e-58
Identities = 100/104 (96%), Positives = 104/104 (100%)

Query: 1 MAHHEIISRAGNAFLLNIRESVLLPGSMSEMHFFLLIGISSIHSDRVILAMKDYLVGGHT 60
MAHHE+ISR+GNAFLLNIRESVLLPGSMSEMHFFLLIGISSIHSDRVILAMKDYLVGGH+
Sbjct: 1 MAHHEVISRSGNAFLLNIRESVLLPGSMSEMHFFLLIGISSIHSDRVILAMKDYLVGGHS 60

Query: 61 RKEVCEKHQMNNGYFSTTLGRLIRLNALAARLAPYYTDESSAFD 104
RKEVCEK+QMNNGYFSTTLGRLIRLNALAARLAPYYTDESSAFD
Sbjct: 61 RKEVCEKYQMNNGYFSTTLGRLIRLNALAARLAPYYTDESSAFD 104


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_4553  RTXTOXIND  599  0.0  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 599 bits (1546), Expect = 0.0
Identities = 462/478 (96%), Positives = 468/478 (97%)

Query: 1 MKTWLMGFSEFLLRYKLVWSETWKIRKQLDTPVREKDENEFLPAHLELIETPVSRRPRLV 60
MKTWLMGFSEFLLRYKLVWSETWKIRKQLDTPVREKDENEFLPAHLELIETPVSRRPRLV
Sbjct: 1 MKTWLMGFSEFLLRYKLVWSETWKIRKQLDTPVREKDENEFLPAHLELIETPVSRRPRLV 60

Query: 61 AYFIMGFLVIAFILSVLGQVEIVATANGKLTLSGRSKEIKPIENSIVKEIIVKEGESVRK 120
AYFIMGFLVIAFILSVLGQVEIVATANGKLT SGRSKEIKPIENSIVKEIIVKEGESVRK
Sbjct: 61 AYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRK 120

Query: 121 GDVLLKLTALGAEADTLKTQSSLLQARLEQIRYQILSRSIELNKLPELKLPDESYFQNVS 180
GDVLLKLTALGAEADTLKTQSSLLQARLEQ RYQILSRSIELNKLPELKLPDE YFQNVS
Sbjct: 121 GDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVS 180

Query: 181 EEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTILARINRYENLSRVEKSRLDDF 240
EEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLT+LARINRYENLSRVEKSRLDDF
Sbjct: 181 EEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDF 240

Query: 241 RSLLHKQAIAKHAVLEQENKYVEAANELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEI 300
SLLHKQAIAKHAVLEQENKYVEA NELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEI
Sbjct: 241 SSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEI 300

Query: 301 LDKLRQTTDSIELLTLELEKNEERQQASVIRAPVSGKVQQLKVHTEGGVVTTAETLMVIV 360
LDKLRQTTD+I LLTLEL KNEERQQASVIRAPVS KVQQLKVHTEGGVVTTAETLMVIV
Sbjct: 301 LDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIV 360

Query: 361 PEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQKLGL 420
PEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQ+LGL
Sbjct: 361 PEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGL 420

Query: 421 VFNVIVSVEENDLSTGNKHIPLSSGMAVTAEIKTGMRSVISYLLSPLEESVTESLHER 478
VFNVI+S+EEN LSTGNK+IPLSSGMAVTAEIKTGMRSVISYLLSPLEESVTESL ER
Sbjct: 421 VFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGMRSVISYLLSPLEESVTESLRER 478


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_4555  RTXTOXINA  1474  0.0  Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family

signature.
Length = 1024

Score = 1474 bits (3816), Expect = 0.0
Identities = 975/1024 (95%), Positives = 991/1024 (96%)

Query: 1 MPTITTAQIKSTLQSAKQSAANKLHSAGQSTKDALKKAAEQTRNAGNRLILLIPKDYKGQ 60
M TITTAQIKSTLQSAKQSAANKLHSAGQSTKDALKKAAEQTRNAGNRLILLIPKDYKGQ
Sbjct: 1 MTTITTAQIKSTLQSAKQSAANKLHSAGQSTKDALKKAAEQTRNAGNRLILLIPKDYKGQ 60

Query: 61 GSSLNDLVRTADELGIEVQYDEKNGTAITKQVFGTAEKLIGLTERGVTIFAPQLDKLLQK 120
GSSLNDLVRTADELGIEVQYDEKNGTAITKQVFGTAEKLIGLTERGVTIFAPQLDKLLQK
Sbjct: 61 GSSLNDLVRTADELGIEVQYDEKNGTAITKQVFGTAEKLIGLTERGVTIFAPQLDKLLQK 120

Query: 121 YQKAGNKLGGSAENIGDNLGKAGSVLSTFQNFLGTALSSMKIDELIKKQKSGGNVSSSEL 180
YQKAGN LGG AENIGDNLGKAG +LSTFQNFLGTALSSMKIDELIKKQKSGGNVSSSEL
Sbjct: 121 YQKAGNILGGGAENIGDNLGKAGGILSTFQNFLGTALSSMKIDELIKKQKSGGNVSSSEL 180

Query: 181 AKASIELINQLVDTAASLNNNVNSFSQQLNKLGSVLSNTKHLNGVGNKLQNLPNLDNIGA 240
AKASIELINQLVDT ASLNNNVNSFSQQLN LGSVLSNTKHLNGVGNKLQNLPNLDNIGA
Sbjct: 181 AKASIELINQLVDTVASLNNNVNSFSQQLNTLGSVLSNTKHLNGVGNKLQNLPNLDNIGA 240

Query: 241 GLDTVSGILSAISASFILSNADADTGTKAAAGVELTTKVLGNVGKGISQYIIAQRAAQGL 300
GLDTVSGILSAISASFILSNADADT TKAAAGVELTTKVLGNVGKGISQYIIAQRAAQGL
Sbjct: 241 GLDTVSGILSAISASFILSNADADTRTKAAAGVELTTKVLGNVGKGISQYIIAQRAAQGL 300

Query: 301 STSAAAAGLIASVVTLAISPLSFLSIADKFKRANKIEEYSQRFKKLGYDGDSLLAAFHKE 360
STSAAAAGLIAS VTLAISPLSFLSIADKFKRANKIEEYSQRFKKLGYDGDSLLAAFHKE
Sbjct: 301 STSAAAAGLIASAVTLAISPLSFLSIADKFKRANKIEEYSQRFKKLGYDGDSLLAAFHKE 360

Query: 361 TGAIDASLTTISTVLASVSSGISAAATTSLVGAPVSALVGAVTGIISGILEASKQAMFEH 420
TGAIDASLTTISTVLASVSSGISAAATTSLVGAPVSALVGAVTGIISGILEASKQAMFEH
Sbjct: 361 TGAIDASLTTISTVLASVSSGISAAATTSLVGAPVSALVGAVTGIISGILEASKQAMFEH 420

Query: 421 VASKMADVIAEWEKKHGKNYFENGYDARHAAFLEDNFEILSQYNKEYSVERSVLITQQHW 480
VASKMADVIAEWEKKHGKNYFENGYDARHAAFLEDNF+ILSQYNKEYSVERSVLITQQHW
Sbjct: 421 VASKMADVIAEWEKKHGKNYFENGYDARHAAFLEDNFKILSQYNKEYSVERSVLITQQHW 480

Query: 481 DTLIGELAGVTRNGDKTLSGKSYIDYYEEGKRLEKEPDEFQKQVFDPLKGNIDLSVIKSS 540
DTLIGELAGVTRNGDKTLSGKSYIDYYEEGKRLEK+ DEFQKQVFDPLKGNIDLS KSS
Sbjct: 481 DTLIGELAGVTRNGDKTLSGKSYIDYYEEGKRLEKKXDEFQKQVFDPLKGNIDLSDSKSS 540

Query: 541 TLLKFITPLLTPGKEIRERRQSGKYEYITELLVKGVDKWTVKGVQDKGSVYDYSNLIQHA 600
TLLKF+TPLLTPG+EIRERRQSGKYEYITELLVKGVDKWTVKGVQDKG+VYDYSNLIQHA
Sbjct: 541 TLLKFVTPLLTPGEEIRERRQSGKYEYITELLVKGVDKWTVKGVQDKGAVYDYSNLIQHA 600

Query: 601 SVGNNQYREIRIESHLGDGDDKVFLSAGSANIYAGKGHDVVYYDKTDTGYLTIDGTKATE 660
SVGNNQYREIRIESHLGDGDDKVFLSAGSANIYAGKGHDVVYYDKTDTGYLTIDGTKATE
Sbjct: 601 SVGNNQYREIRIESHLGDGDDKVFLSAGSANIYAGKGHDVVYYDKTDTGYLTIDGTKATE 660

Query: 661 AGNYTVTRVLGGDVKVLQEVVKEQEVSVGKRTEKTQYRSYEFTHINGTDLTETDNLYSVE 720
AGNYTVTRVLGGDVKVLQEVVKEQEVSVGKRTEKTQYRSYEFTHING +LTETDNLYSVE
Sbjct: 661 AGNYTVTRVLGGDVKVLQEVVKEQEVSVGKRTEKTQYRSYEFTHINGKNLTETDNLYSVE 720

Query: 721 ELIGTNRADKFFGSKFTDIFHGADGDDHIEGNDGNDRLYGDKGNDTLRGGNGDDQLYGGD 780
ELIGT RADKFFGSKFTDIFHGADGDD IEGNDGNDRLYGDKGNDTL GGNGDDQLYGGD
Sbjct: 721 ELIGTTRADKFFGSKFTDIFHGADGDDLIEGNDGNDRLYGDKGNDTLSGGNGDDQLYGGD 780

Query: 781 GNDKLTGGVGNNYLNGGDGDDELQVQGNSLAKNVLSGGKGNDKLYGSEGADLLDGGEGND 840
GNDKL G GNNYLNGGDGDDE QVQGNSLAKNVL GGKGNDKLYGSEGADLLDGGEG+D
Sbjct: 781 GNDKLIGVAGNNYLNGGDGDDEFQVQGNSLAKNVLFGGKGNDKLYGSEGADLLDGGEGDD 840

Query: 841 LLKGGYGNDIYRYLSGYGHHIIDDDGGKDDKLSLADIDFRDVAFKREGNDLIMYKAEGNV 900
LLKGGYGNDIYRYLSGYGHHIIDDDGGK+DKLSLADIDFRDVAFKREGNDLIMYK EGNV
Sbjct: 841 LLKGGYGNDIYRYLSGYGHHIIDDDGGKEDKLSLADIDFRDVAFKREGNDLIMYKGEGNV 900

Query: 901 LSIGHKNGITFRNWFEKESGDISNHQIEQIFDKDGRVITPDSLKKAFEYQQSNNQANYVY 960
LSIGHKNGITFRNWFEKESGDISNH+IEQIFDK GR+ITPDSLKKA EYQQ NN+A+YVY
Sbjct: 901 LSIGHKNGITFRNWFEKESGDISNHEIEQIFDKSGRIITPDSLKKALEYQQRNNKASYVY 960

Query: 961 GEYASTYADLDNLNPLINEISKIISAAGNFDVKEERSAASLLQLSGNASDFSYGRNSITL 1020
G A Y +LNPLINEISKIISAAG+FDVKEER+AASLLQLSGNASDFSYGRNSITL
Sbjct: 961 GNDALAYGSQGDLNPLINEISKIISAAGSFDVKEERTAASLLQLSGNASDFSYGRNSITL 1020

Query: 1021 TASA 1024
T SA
Sbjct: 1021 TTSA 1024


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_4556  RTXTOXINC  316  e-114  Gram-negative bacterial RTX toxin-activating protein C...
		>RTXTOXINC#Gram-negative bacterial RTX toxin-activating protein C

signature.
Length = 170

Score = 316 bits (810), Expect = e-114
Identities = 163/170 (95%), Positives = 167/170 (98%)

Query: 1 MNMNNPLEVLGHVSWLWASSPLHRNWPVSLFAINVLPAIRANQYALLTRDNYPVAYCSWA 60
MN+N PLE+LGHVSWLWASSPLHRNWPVSLFAINVLPAI+ANQY LLTRD+YPVAYCSWA
Sbjct: 1 MNINKPLEILGHVSWLWASSPLHRNWPVSLFAINVLPAIQANQYVLLTRDDYPVAYCSWA 60

Query: 61 NLSLENEIKYLNDVTSLVAEDWTSGDRKWFIDWIAPFGDNGALYKYMRKKFPDELFRAIR 120
NLSLENEIKYLNDVTSLVAEDWTSGDRKWFIDWIAPFGDNGALYKYMRKKFPDELFRAIR
Sbjct: 61 NLSLENEIKYLNDVTSLVAEDWTSGDRKWFIDWIAPFGDNGALYKYMRKKFPDELFRAIR 120

Query: 121 VDPKTHVGKVSEFHGGKIDKQLANKIFKQYHHELITEVKNKSDFNFSLTG 170
VDPKTHVGKVSEFHGGKIDKQLANKIFKQYHHELITEVK KSDFNFSLTG
Sbjct: 121 VDPKTHVGKVSEFHGGKIDKQLANKIFKQYHHELITEVKRKSDFNFSLTG 170


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_4562  HTHFIS  91  4e-23  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 90.7 bits (225), Expect = 4e-23
Identities = 35/128 (27%), Positives = 60/128 (46%)

Query: 6 KILLMEDDYDIAALLRLNLQDEGYQIVHEADGARARLLLDKQTWDAVILDLMLPNVNGLE 65
IL+ +DD I +L L GY + ++ A + D V+ D+++P+ N +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 66 ICRYIRQMTSYLPVIIISARTSETHRVLGLEMGADDYLPKPFSIPELIARIKALFRRQEA 125
+ I++ LPV+++SA+ + + E GA DYLPKPF + ELI I +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 126 MGQNILLA 133
+
Sbjct: 125 RPSKLEDD 132


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_4564  PF06580  44  1e-07  Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 43.7 bits (103), Expect = 1e-07
Identities = 24/137 (17%), Positives = 54/137 (39%), Gaps = 24/137 (17%)

Query: 65 LSIETRRLQLRIMMSHSLPLIRADISMIERVITNLLDNAVRH----TPPEGSIRLKVWQE 120
L + + + + R+ + + D+ + ++ L++N ++H P G I LK ++
Sbjct: 229 LQLASIQFEDRLQFENQINPAIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKD 288

Query: 121 DNRLHVEVADSGPGLTEDMRTHLFRRASVLCHEPSEEPRGGLGLLIVRRMLVLHGGD--- 177
+ + +EV ++G + + + G GL VR L + G
Sbjct: 289 NGTVTLEVENTGSLALK-----------------NTKESTGTGLQNVRERLQMLYGTEAQ 331

Query: 178 IRLTDSTTGACFRFFLP 194
I+L++ +P
Sbjct: 332 IKLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_4580  PF05860  76  5e-18  haemagglutination activity domain.
		>PF05860#haemagglutination activity domain.

Length = 117

Score = 75.6 bits (186), Expect = 5e-18
Identities = 25/139 (17%), Positives = 45/139 (32%), Gaps = 26/139 (18%)

Query: 32 AVITPQNGA---GMDKAANGVPVVNIATPNGAGISHNRFTDYNVGKEGLILNNATGKLNP 88
A ITP ++ T G+ + H+ F +++V G N
Sbjct: 1 AQITPDTTLPINSNITTEGNTRIIERGTQAGSNLFHS-FQEFSVPTSGTAFFNNPT---- 55

Query: 89 TQLGGLIQNNPNLKAGGEAKGIINEVTGGNRSLLQGYTEVAGKAANVMVANPYGITCDGC 148
+ II+ VTGG+ S + G A N+ + NP GI
Sbjct: 56 -----------------NIQNIISRVTGGSVSNIDGLIRANATA-NLFLINPNGIIFGQN 97

Query: 149 GFINTPHATLTTGRPVMNA 167
++ + + + +
Sbjct: 98 ARLDIGGSFVGSTANRLKF 116


60  ECP_4604  ECP_4652  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias     %GCBias  Product
ECP_4604      -1       18   -3.088645  hypothetical protein
ECP_4605      -1       18   -3.209519  hypothetical protein
ECP_4606       1       17   -1.848025  hypothetical protein
ECP_4607       0       17   -1.562265  hypothetical protein
ECP_4608       0       17   -1.636359  hypothetical protein
ECP_4609       0       22   -4.424075  hypothetical protein
ECP_4610       2       27   -5.035128  transcription regulator protein
ECP_4611       1       27   -4.926189  transposase OrfB protein of insertion sequence
ECP_4612       2       28   -4.956901  transposase OrfA protein of insertion sequence
ECP_4613       3       28   -5.322552  hypothetical protein
ECP_4614       3       28   -5.076072  aminotransferase
ECP_4615       3       27   -2.132558  Na+/H+ antiporter
ECP_4616       4       26   -0.976849  transposase/IS protein
ECP_4617       4       25   -0.955608  transposase
ECP_4618       4       24   -2.420392  Na+/H+ antiporter
ECP_4619       4       26   -3.052429  transposase
ECP_4620       3       33   -7.962254  hypothetical protein
ECP_4621       3       29   -6.960258  hypothetical protein
ECP_4622       4       30   -6.651139  hypothetical protein
ECP_4623       3       30   -6.822966  hypothetical protein
ECP_4624       3       25   -4.573914  hypothetical protein
ECP_4625       4       25   -3.460853  hypothetical protein
ECP_4626       6       25    1.967862  hypothetical protein
ECP_4627       6       27    1.927152  hypothetical protein
ECP_4628       6       29    3.235632  hypothetical protein
ECP_4629       7       32    4.518339  hypothetical protein
ECP_4630       8       28    3.708708  hypothetical protein
ECP_4631       8       27    3.191000  DNA repair protein
ECP_4632       9       25    1.805161  hypothetical protein
ECP_4633       8       28    0.263317  hypothetical protein
ECP_4634       4       40  -10.917422  hypothetical protein
ECP_4635       4       42  -11.962303  hypothetical protein
ECP_4636       2       40  -11.283714  hypothetical protein
ECP_4637       1       37  -10.128209  hypothetical protein
ECP_4638       1       35   -9.761590  hypothetical protein
ECP_4639       1       34   -9.990096  hypothetical protein
ECP_4640       1       26   -5.517581  hypothetical protein
ECP_4641       1       26   -5.946923  hypothetical protein
ECP_4642       1       29   -6.465744  hypothetical protein
ECP_4643       1       30   -8.266174  hypothetical protein
ECP_4644       0       32   -7.342833  hypothetical protein
ECP_4645       0       31   -5.523341  hypothetical protein
ECP_4646       0       31   -5.518938  tyrosine recombinase
ECP_4647       1       27   -4.000573  tyrosine recombinase
ECP_4648       1       26   -3.821554  hypothetical protein
ECP_4649       1       24   -2.668086  type-1 fimbrial major subunit
ECP_4650       3       24   -2.656554  fimbrin-like protein fimI
ECP_4651       2       24   -2.949390  chaperone protein FimC
ECP_4652       3       23   -2.429919  outer membrane usher protein FimD
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_4610  HTHTETR  55  7e-12  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 55.4 bits (133), Expect = 7e-12
Identities = 25/108 (23%), Positives = 50/108 (46%)

Query: 20 YQQLLESAAMIAGRDGIAALSLNAVAREAGVSKGGLLHHFPNKQALIYALFARLLAIMEE 79
Q +L+ A + + G+++ SL +A+ AGV++G + HF +K L ++ + + E
Sbjct: 13 RQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGE 72

Query: 80 AIAALMQKDNISYGRFTRAYVNYLSALTDTQESRQLMVLSLAMPDEPV 127
K R + ++ T T+E R+L++ + E V
Sbjct: 73 LELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_4617  HTHTETR  28  0.044  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 28.1 bits (62), Expect = 0.044
Identities = 12/106 (11%), Positives = 31/106 (29%), Gaps = 14/106 (13%)

Query: 16 QGMSSRAIARELGISRNTVKRYLQAKSEPPKYTPRPAVASLLDEYRDYIRQRIAD----- 70
S IA+ G++R + + + KS+ + + + I + +
Sbjct: 30 SSTSLGEIAKAAGVTRGAIYWHFKDKSD--------LFSEIWELSESNIGELELEYQAKF 81

Query: 71 -AHPYKIPATVIAREIRDQGYRGGMTILRAFIRSLSVPQEQEPAVR 115
P + ++ + +L I + V+
Sbjct: 82 PGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQ 127


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_4652  PF00577  1097  0.0  Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 1097 bits (2838), Expect = 0.0
Identities = 877/878 (99%), Positives = 878/878 (100%)

Query: 1 MSYLNLRLYQRNTQCLHIRKHRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQA 60
MSYLNLRLYQRNTQCLHIRKHRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQA
Sbjct: 1 MSYLNLRLYQRNTQCLHIRKHRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQA 60

Query: 61 VADLSRFENGQELPPGTYRVDIYLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLN 120
VADLSRFENGQELPPGTYRVDIYLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLN
Sbjct: 61 VADLSRFENGQELPPGTYRVDIYLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLN 120

Query: 121 TASVSGMNLLADDACVPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDP 180
TASVSGMNLLADDACVPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDP
Sbjct: 121 TASVSGMNLLADDACVPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDP 180

Query: 181 GINAGLLNYNFSGNSVQNRIGGNSHYAYLNLQSGLNIGAWRLRDNTSWSYNSSDSSSGSK 240
GINAGLLNYNFSGNSVQNRIGGNSHYAYLNLQSGLNIGAWRLRDNT+WSYNSSDSSSGSK
Sbjct: 181 GINAGLLNYNFSGNSVQNRIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSK 240

Query: 241 NKWQHINTWLERDIIPLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPV 300
NKWQHINTWLERDIIPLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPV
Sbjct: 241 NKWQHINTWLERDIIPLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPV 300

Query: 301 IHGIARGTAQVTIKQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTV 360
IHGIARGTAQVTIKQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTV
Sbjct: 301 IHGIARGTAQVTIKQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTV 360

Query: 361 PYSSVPLLQREGHTRYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRY 420
PYSSVPLLQREGHTRYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRY
Sbjct: 361 PYSSVPLLQREGHTRYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRY 420

Query: 421 RAFNFGIGKNMGALGALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYR 480
RAFNFGIGKNMGALGALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYR
Sbjct: 421 RAFNFGIGKNMGALGALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYR 480

Query: 481 YSTSGYFNFADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRT 540
YSTSGYFNFADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRT
Sbjct: 481 YSTSGYFNFADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRT 540

Query: 541 STLYLSGSHQTYWGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNI 600
STLYLSGSHQTYWGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNI
Sbjct: 541 STLYLSGSHQTYWGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNI 600

Query: 601 PFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGD 660
PFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGD
Sbjct: 601 PFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGD 660

Query: 661 GNSGSTGYATLNYRGGYGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVL 720
GNSGSTGYATLNYRGGYGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVL
Sbjct: 661 GNSGSTGYATLNYRGGYGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVL 720

Query: 721 VKAPGAKDAKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVP 780
VKAPGAKDAKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVP
Sbjct: 721 VKAPGAKDAKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVP 780

Query: 781 TRGAIVRAEFKARVGIKLLMTLTHNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLA 840
TRGAIVRAEFKARVGIKLLMTLTHNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLA
Sbjct: 781 TRGAIVRAEFKARVGIKLLMTLTHNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLA 840

Query: 841 GKVQVKWGEEENAHCVANYQLPPESQQQLLTQLSAECR 878
GKVQVKWGEEENAHCVANYQLPPESQQQLLTQLSAECR
Sbjct: 841 GKVQVKWGEEENAHCVANYQLPPESQQQLLTQLSAECR 878


61  ECP_4669  ECP_4684  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias     %GCBias  Product
ECP_4669       1       14   -3.045961  hypothetical protein
ECP_4670       1       14   -1.862133  R-phenyllactate dehydratase activator
ECP_4671       0       17   -4.100924  hypothetical protein
ECP_4672       0       16   -4.450884  hypothetical protein
ECP_4673      -1       12   -4.551891  hypothetical protein
ECP_4674       0       18   -6.916874  hypothetical protein
ECP_4675       0       14   -5.657596  endoribonuclease SymE
ECP_4676       0       14   -5.949722  type I restriction enzyme EcoEI specificity
ECP_4677      -2       13   -2.819883  type I restriction enzyme EcoEI M protein
ECP_4678      -2       12   -1.975004  type I restriction enzyme EcoAI R protein
ECP_4679      -3       12   -1.782854  hypothetical protein
ECP_4680      -3       20    2.268414  GTP-binding protein YjiA
ECP_4681      -2       16    0.798129  hypothetical protein
ECP_4682      -2       11   -0.128209  carbon starvation protein
ECP_4683      -1       14   -2.141829  methyl-accepting chemotaxis protein I
ECP_4684       1       17   -3.344341  C4-dicarboxylate transporter large subunit
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_4669  ADHESNFAMILY  29     0.026    Adhesin family signature.
		>ADHESNFAMILY#Adhesin family signature.

Length = 309

Score = 29.1 bits (65), Expect = 0.026
Identities = 10/45 (22%), Positives = 17/45 (37%)

Query: 53 LFVIVAVCTFFVQSCARKSNHAASFQNYHATIDGKEIAGITNNIS 97
+++ + + +CA S Q IA IT NI+
Sbjct: 6 TLLVLFLSAIILVACASGKKDTTSGQKLKVVATNSIIADITKNIA 50
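
The statistics lines that precede each alignment (Score, Expect, Identities, Positives, Gaps) can be read back programmatically. The following is a minimal sketch, assuming the two-line layout shown in this report; `parse_hsp` and the dictionary keys are hypothetical names chosen for illustration and are not part of PredictBias or BLAST.

```python
import re

# Patterns for the two statistics lines printed above each alignment.
STATS = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s*bits\s*\((?P<raw>\d+)\),"
    r"\s*Expect\s*=\s*(?P<evalue>\S+)"
)
IDENT = re.compile(
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<length>\d+)\s*\(\d+%\)"
    r",\s*Positives\s*=\s*(?P<pos>\d+)/\d+\s*\(\d+%\)"
    r"(?:,\s*Gaps\s*=\s*(?P<gaps>\d+)/\d+\s*\(\d+%\))?"  # Gaps is optional
)


def parse_hsp(text):
    """Return the statistics of the first alignment found in `text`."""
    s = STATS.search(text)
    i = IDENT.search(text)
    if s is None or i is None:
        raise ValueError("no alignment statistics found")
    length = int(i["length"])
    return {
        "bits": float(s["bits"]),
        "raw_score": int(s["raw"]),
        "evalue": float(s["evalue"]),
        # Percentages are recomputed from the counts; the report appears
        # to truncate them (17/45 is 37.8% but is printed as 37%).
        "identity_pct": 100.0 * int(i["ident"]) / length,
        "positive_pct": 100.0 * int(i["pos"]) / length,
        "gap_pct": 100.0 * int(i["gaps"] or 0) / length,
    }


sample = (
    "Score = 29.1 bits (65), Expect = 0.026\n"
    "Identities = 10/45 (22%), Positives = 17/45 (37%)\n"
)
hsp = parse_hsp(sample)
```

Applied to the ECP_4669 hit above, `hsp["evalue"]` is 0.026 and the recomputed identity is about 22.2%, consistent with the printed 22%.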


62  ECP_0113  ECP_0122  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
ECP_0113  5       29       -8.183765  colicin
ECP_0114  6       26       -8.100554  colicin immunity protein
ECP_0115  4       28       -7.622198  uropathogenic specific protein
ECP_0116  3       27       -1.551462  colicin immunity protein
ECP_0117  4       35       0.485583   uropathogenic specific protein
ECP_0118  5       38       0.982556   colicin immunity protein
ECP_0119  4       32       1.981981   transcriptional regulator PdhR
ECP_0120  3       35       2.308584   hypothetical protein
ECP_0121  3       35       2.505891   pyruvate dehydrogenase subunit E1
ECP_0122  1       27       2.060801   dihydrolipoamide acetyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_0113  PYOCINKILLER  181    1e-51    Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 181 bits (459), Expect = 1e-51
Identities = 92/253 (36%), Positives = 125/253 (49%), Gaps = 21/253 (8%)

Query: 343 GEGTPYENVRVANMQWNEQTQRYEFT---PAHDVDGPLITWTPENPEHGNVPGHTGN--D 397
G P + V V +N T YE T + ++TWTP +P P T
Sbjct: 377 GVSVP-KAVPVRMAAYNATTGLYEVTVPSTTAEAPPLILTWTPASPPGNQNPSSTTPVVP 435

Query: 398 RPPLDQPTILVTPIPDGTDTYTTPPFPVPDPKEFNDYILVFPAGSGIKPIYVYLKEDPRK 457
+P +TP+ T +P D I+ FPA SGIKPIYV + DPR
Sbjct: 436 KPVPVYEGATLTPV-----KATPETYPGVITLP-EDLIIGFPADSGIKPIYVMFR-DPRD 488

Query: 458 LPGVVTGHGVPLSPGTRWLDMSVSNNGNGAPIPAHIADKLRGREFKTFDEFREALWLEVS 517
+PG TG G P+ WL ++ G GAPIP+ IADKLRG+ FK + +FRE W+ V+
Sbjct: 489 VPGAATGKGQPV--SGNWLG--AASQGEGAPIPSQIADKLRGKTFKNWRDFREQFWIAVA 544

Query: 518 QDPELIAQFSSGNQTRIKQGLTAKAPIDGWHYGPKDIVKKFQIHHRVAIEYGGSVYDIDN 577
DPEL QF+ G+ ++ G G + K +IHH+V + GG VY++ N
Sbjct: 545 NDPELSKQFNPGSLAVMRDGGAPYVRESE-QAGGR---IKIEIHHKVRVADGGGVYNMGN 600

Query: 578 LRIVTPRLHDEIH 590
L VTP+ H EIH
Sbjct: 601 LVAVTPKRHIEIH 613


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_0115  PYOCINKILLER  53     4e-12    Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 53.3 bits (127), Expect = 4e-12
Identities = 17/64 (26%), Positives = 31/64 (48%), Gaps = 4/64 (6%)

Query: 1 MSQYPELIAQFSSGNQTRIKQGLIAKAPLEGWHYGTKEIVKKFHMYHRVAIEYSGGIYDI 60
++ PEL QF+ G+ ++ G E G + K ++H+V + GG+Y++
Sbjct: 543 VANDPELSKQFNPGSLAVMRDGGAPYVR-ESEQAGGRI---KIEIHHKVRVADGGGVYNM 598

Query: 61 DNLR 64
NL
Sbjct: 599 GNLV 602


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_0116  PF04605       26     0.019    Virulence-associated protein D (VapD)
		>PF04605#Virulence-associated protein D (VapD)

Length = 125

Score = 26.0 bits (57), Expect = 0.019
Identities = 13/34 (38%), Positives = 20/34 (58%)

Query: 1 MYNFKDKIEDYTEREFIELLGEFTNPTGDNAQLK 34
Y+ K+ I+D ++F + L EFT T N +LK
Sbjct: 88 QYSLKETIQDLCAKDFHQKLKEFTEKTPKNQKLK 121


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_0117  PYOCINKILLER  44     3e-09    Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 44.0 bits (103), Expect = 3e-09
Identities = 13/53 (24%), Positives = 24/53 (45%), Gaps = 4/53 (7%)

Query: 4 QFSTGNQTRIKQGLIAKAPLEGWHYGSKEIVKEFHIYHSVAIECGGEIYDIDN 56
QF+ G+ ++ G E G + + I+H V + GG +Y++ N
Sbjct: 552 QFNPGSLAVMRDGGAPYVR-ESEQAGGRI---KIEIHHKVRVADGGGVYNMGN 600


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_0122  RTXTOXIND     32     0.008    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 31.7 bits (72), Expect = 0.008
Identities = 16/63 (25%), Positives = 27/63 (42%), Gaps = 2/63 (3%)

Query: 26 DKVEAEQSLITVEGDKASMEVPSPQAGIVKEIKVSVGDKTQTGALIMIFDSADGAADAAP 85
+ V +T G S E+ + IVKEI V G+ + G +++ + AD
Sbjct: 81 EIVATANGKLTHSGR--SKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLK 138

Query: 86 AQA 88
Q+
Sbjct: 139 TQS 141



Score = 31.7 bits (72), Expect = 0.009
Identities = 14/60 (23%), Positives = 29/60 (48%), Gaps = 2/60 (3%)

Query: 119 EVTEILVKVGDKV-EAEQSLITVEGDKASMEVPAPFAGTVKEIKVN-VGDKVSTGSLIMI 176
E+ + L + D + L E + + + AP + V+++KV+ G V+T +M+
Sbjct: 299 EILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMV 358



Score = 29.8 bits (67), Expect = 0.035
Identities = 20/95 (21%), Positives = 35/95 (36%), Gaps = 3/95 (3%)

Query: 230 DKVAAEQSLITVEGDKASMEVPAPFAGVVKELKVNVGDKVKTGSLIMIFEVEGAAPAAAP 289
+ VA +T G S E+ +VKE+ V G+ V+ G +++ GA A
Sbjct: 81 EIVATANGKLTHSGR--SKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAE-ADTL 137

Query: 290 AKQEAAAPAPAAKAEAPAAAPAAKAEGKSEFAEND 324
Q + A + + + + E D
Sbjct: 138 KTQSSLLQARLEQTRYQILSRSIELNKLPELKLPD 172


63  ECP_0289  ECP_0296  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
ECP_0289  5       33       -3.699190  hypothetical protein
ECP_0290  6       34       -4.822883  hypothetical protein
ECP_0291  9       35       -2.493936  fimbrial transcription regulator protein FaeA
ECP_0292  9       35       -2.599103  major pilu subunit operon regulatory protein
ECP_0293  10      36       -2.935146  S-fimbrial protein subunit
ECP_0294  10      35       -2.416130  minor F1C fimbrial subunit
ECP_0295  10      33       -2.745168  F1C periplasmic chaperone
ECP_0296  10      33       -3.271935  F1C fimbrial usher
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_0289  PHPHTRNFRASE  27     0.029    Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 27.1 bits (60), Expect = 0.029
Identities = 15/42 (35%), Positives = 20/42 (47%), Gaps = 4/42 (9%)

Query: 110 GQCRVERCF--RVTWPDTSEQYVALKTAVQSL--IPLVIATI 147
G R E + R P EQ+ A K VQ + P+VI T+
Sbjct: 294 GLYRTEFLYMDRDQLPTEEEQFEAYKEVVQRMDGKPVVIRTL 335


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_0292  FIMREGULATRY  146    2e-49    Escherichia coli: P pili regulatory PapB protein si...
		>FIMREGULATRY#Escherichia coli: P pili regulatory PapB protein signature.

Length = 104

Score = 146 bits (370), Expect = 2e-49
Identities = 84/102 (82%), Positives = 88/102 (86%)

Query: 1 MAQHEVITRGGDAFLLKLRESALSSGSMSEEQFFLLIGISSIHSDRVILAMKDYLVSGHS 60
MA HEVI+R G+AFLL +RES L GSMSE FFLLIGISSIHSDRVILAMKDYLV GHS
Sbjct: 1 MAHHEVISRSGNAFLLNIRESVLLPGSMSEMHFFLLIGISSIHSDRVILAMKDYLVGGHS 60

Query: 61 RKDVCEKYQMNNGYFSTTLGRLTRLNVLVARLAPYYTDSVSA 102
RK+VCEKYQMNNGYFSTTLGRL RLN L ARLAPYYTD SA
Sbjct: 61 RKEVCEKYQMNNGYFSTTLGRLIRLNALAARLAPYYTDESSA 102


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_0294  FIMBRIALPAPE  29     0.009    Escherichia coli: P pili tip fibrillum papE protein...
		>FIMBRIALPAPE#Escherichia coli: P pili tip fibrillum papE protein signature.

Length = 173

Score = 28.8 bits (64), Expect = 0.009
Identities = 39/160 (24%), Positives = 64/160 (40%), Gaps = 23/160 (14%)

Query: 16 AALAGNHWHVMLPGGNMRFQGKIIAEACSLALSDRQMTVDMGQLSSNRFHAAGEYGDPVG 75
A L H H N+ F+GK+I AC++ + V+ G + +G G+
Sbjct: 15 AVLMSQHVHA---ADNLTFKGKLIIPACTV----QNAEVNWGDIEIQNLVQSG--GNQKD 65

Query: 76 FDIHLQDCSTVVSQRVGISFYGVSDIHEPELLSVEEENDASDGIAIALFNES----GELV 131
F + + ++ + +V I+ G + +L + DG+ I L+N + G V
Sbjct: 66 FTVDMNCPYSLGTMKVTITSNGQTG---NSILVPNTSTASGDGLLIYLYNSNNSGIGNAV 122

Query: 132 KLNQPPENWVHLTRGDMKLHMQARYKATHYPVAGGKANGQ 171
L +T G + AR K T Y G K N Q
Sbjct: 123 TLGSQ------VTPGKITGTAPAR-KITLYAKLGYKGNMQ 155


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_0296  PF00577       960    0.0      Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 960 bits (2483), Expect = 0.0
Identities = 545/861 (63%), Positives = 690/861 (80%), Gaps = 9/861 (1%)

Query: 25 RMRFNILPLAFFIGIIVSPAR------AELYFNPRFLSDDPDAVADLSAFTQGQELPPGV 78
+ + + + + A AELYFNPRFL+DDP AVADLS F GQELPPG
Sbjct: 18 IRKHRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPGT 77

Query: 79 YRVDIYLNDTYISTRDVQFQMSQDGKQLAPCLSPEHMSAMGVNRYAVPGMERLPADTCTS 138
YRVDIYLN+ Y++TRDV F + + PCL+ +++MG+N +V GM L D C
Sbjct: 78 YRVDIYLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVP 137

Query: 139 LNSMIQGATFRFDVGQQRLYLTVPQIYMSNQARGYIAPEYWDNGITAALLNYDFSGNRVR 198
L SMI AT + DVGQQRL LT+PQ +MSN+ARGYI PE WD GI A LLNY+FSGN V+
Sbjct: 138 LTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQ 197

Query: 199 DSYGGTSDYAYLNLKTGLNIGSWRLRDNTSWSYSAGKGYS--QNNWQHINTWLERDIVPL 256
+ GG S YAYLNL++GLNIG+WRLRDNT+WSY++ S +N WQHINTWLERDI+PL
Sbjct: 198 NRIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPL 257

Query: 257 RSRLTMGDSYTRGDIFDGVNFRGIQLASDDNMVPDSQRGYAPTIHGISRGTSRISIRQNG 316
RSRLT+GD YT+GDIFDG+NFRG QLASDDNM+PDSQRG+AP IHGI+RGT++++I+QNG
Sbjct: 258 RSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNG 317

Query: 317 YEIYQSTLPPGPFEINDIYPAGSGGDLQVTLQEADGSVQRFNVPWSSVPVLQREGHLKYA 376
Y+IY ST+PPGPF INDIY AG+ GDLQVT++EADGS Q F VP+SSVP+LQREGH +Y+
Sbjct: 318 YDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYS 377

Query: 377 LSAGEFRSGGHQQDNPRFAEGTLKYGLPAGWTVYGGAWIAERYRAFNLGVGKNMGWLGAV 436
++AGE+RSG QQ+ PRF + TL +GLPAGWT+YGG +A+RYRAFN G+GKNMG LGA+
Sbjct: 378 ITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGAL 437

Query: 437 SLDATRANARLPDESRYDGQSYRFLYNKSLTETGTNIQLIGYRYSTRGYFSFADTAWKKM 496
S+D T+AN+ LPD+S++DGQS RFLYNKSL E+GTNIQL+GYRYST GYF+FADT + +M
Sbjct: 438 SVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRM 497

Query: 497 SGYSVLTQDGVIQIQPKYTDYYNLAYNKRGRVQVSISQQTGESSTLYLSGSHQSYWGTDR 556
+GY++ TQDGVIQ++PK+TDYYNLAYNKRG++Q++++QQ G +STLYLSGSHQ+YWGT
Sbjct: 498 NGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQTYWGTSN 557

Query: 557 TDRQLNAGFNSSVNDISWSLNYSLSRNAWQHETDRILSFDVSIPFSHWMRSDSTSAWRNA 616
D Q AG N++ DI+W+L+YSL++NAWQ D++L+ +V+IPFSHW+RSDS S WR+A
Sbjct: 558 VDEQFQAGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSDSKSQWRHA 617

Query: 617 SARYSQTLEAHGQAASTAGLYGTLLEDNNLGYSIQSGYTRGGYEGSSKTGYASLNYRGGY 676
SA YS + + +G+ + AG+YGTLLEDNNL YS+Q+GY GG S TGYA+LNYRGGY
Sbjct: 618 SASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGY 677

Query: 677 GNASAGYSHSGGYRQLYYGLSGGILAHANGLTLSQPLGDTLILVRAPGASDTRIENQTGV 736
GNA+ GYSHS +QLYYG+SGG+LAHANG+TL QPL DT++LV+APGA D ++ENQTGV
Sbjct: 678 GNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDAKVENQTGV 737

Query: 737 STDWRGYAVLPYATDYRENRVALDTNTLADNVDIENTVVSVVPTHGAVVRADYKTRVGVK 796
TDWRGYAVLPYAT+YRENRVALDTNTLADNVD++N V +VVPT GA+VRA++K RVG+K
Sbjct: 738 RTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKARVGIK 797

Query: 797 VLMTLMRNGKAVPFGSVVTARNGGS-SIAGENGQVYLSGMPLSGQVSVKWGSQTTDQCTA 855
+LMTL N K +PFG++VT+ + S I +NGQVYLSGMPL+G+V VKWG + C A
Sbjct: 798 LLMTLTHNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGKVQVKWGEEENAHCVA 857

Query: 856 DYKLPKESAGQILSHVTVSCR 876
+Y+LP ES Q+L+ ++ CR
Sbjct: 858 NYQLPPESQQQLLTQLSAECR 878


64  ECP_0454  ECP_0459  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
ECP_0454  1       14       1.750299   fructokinase
ECP_0455  1       11       1.874636   MFS transport protein AraJ
ECP_0456  0       12       2.183551   exonuclease SbcC
ECP_0457  -1      12       2.287460   exonuclease SbcD
ECP_0458  -2      13       2.330136   phosphate regulon transcriptional regulatory
ECP_0459  -1      14       2.255855   phosphate regulon sensor protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_0454  ACETATEKNASE  30     0.015    Acetate kinase family signature.
		>ACETATEKNASE#Acetate kinase family signature.

Length = 400

Score = 29.8 bits (67), Expect = 0.015
Identities = 17/69 (24%), Positives = 29/69 (42%), Gaps = 10/69 (14%)

Query: 187 FISGTGFATDYRRLSGHALKGSEIIRLVEESDPVAELALRRYELRLAKSLAHVVNILDP- 245
+G ++D+R L A + D A+LAL + R+ K++ +
Sbjct: 273 VYGISGISSDFRDLEDAAF---------KNGDKRAQLALNVFAYRVKKTIGSYAAAMGGV 323

Query: 246 DVIVLGGGM 254
DVIV G+
Sbjct: 324 DVIVFTAGI 332


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_0455  TCRTETA       51     4e-09    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 51.0 bits (122), Expect = 4e-09
Identities = 74/356 (20%), Positives = 126/356 (35%), Gaps = 35/356 (9%)

Query: 5 ILSLALGTFGLGMAEFGIMGVLTELAHNVGISIPAAGH---MISYYALGVVVGAPIIALF 61
+ ++AL G+G+ IM VL L ++ S H +++ YAL AP++
Sbjct: 11 LSTVALDAVGIGL----IMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGAL 66

Query: 62 SSRYSLKHILLFLVALCVIGNAMFTLSSSYLMLAIGRLVSGFPHGAFFGVGAIVLSKIIK 121
S R+ + +LL +A + A+ + +L IGR+V+G GA + I
Sbjct: 67 SDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIAD--IT 124

Query: 122 PGKVTAAVAGMVSGMTVANLLGIPLGTYLSQEFSWRYTFLLIAVFNIAVMASVYFWVPDI 181
G A G +S ++ P+ L FS F A N + F +P+
Sbjct: 125 DGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPES 184

Query: 182 RDEAKGKLREQ----------FHFLRSPAPWLI--FAATMFGNAGVFAWFSYVKPYMMFI 229
+ LR + + A + F + G W +
Sbjct: 185 HKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFG------E 238

Query: 230 SGFSETAMTFIMMLVGLGM---VLGNMLSGRISGRYSPLRIAAVTDFIIVLALLMLFFFG 286
F A T + L G+ + M++G ++ R R + ++L F
Sbjct: 239 DRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFAT 298

Query: 287 GMKTTSLIFAFICCAGLFALSAPLQILLLQNAKGGELLGAAGGQIAF--NLGSAVG 340
I + G+ LQ +L + E G G +A +L S VG
Sbjct: 299 RGWMAFPIMVLLASGGIG--MPALQAMLSRQV-DEERQGQLQGSLAALTSLTSIVG 351


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_0456  IGASERPTASE   40     7e-05    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 39.7 bits (92), Expect = 7e-05
Identities = 40/264 (15%), Positives = 81/264 (30%), Gaps = 11/264 (4%)

Query: 162 LNAKPKERAELLEELTGTEIYGQISAMVFEQHKSARTELEKLQAQASGVALLTPEQVQSL 221
A P E E + E + Q S V + + A + + A Q+
Sbjct: 1029 APATPSETTETVAENSK-----QESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTN 1083

Query: 222 TASLQVLTDEEKQLITAQQQEQQSLNWLTRLD-ELQQEASRRQQALQQALAEEEKAQPQL 280
+ +E Q ++ +++ E QE + + + E QPQ
Sbjct: 1084 EVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQA 1143

Query: 281 AALSLAQPARNLRPHWE---RIAEHSTALAHTRQQIEEVNTRLQSTMALRASIRHHAAKQ 337
P N++ A+ T +E+ T + + + +
Sbjct: 1144 EPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTT 1203

Query: 338 SAELQQQQQSLNAWLQEHDRFRQWNNELAGWRAQFSQQTSDREHLRQWQQQLTHAEQKLN 397
A Q S ++ ++ R + + ++DR + T+ L+
Sbjct: 1204 PATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPA-TTSSNDRSTVALCDLTSTNTNAVLS 1262

Query: 398 ALAAITLTLTADEVATALAQHAEQ 421
A + + V A++QH Q
Sbjct: 1263 DARAKAQFVALN-VGKAVSQHISQ 1285



Score = 33.9 bits (77), Expect = 0.005
Identities = 27/139 (19%), Positives = 54/139 (38%), Gaps = 13/139 (9%)

Query: 738 QQDVLAAQSLQKAQAQFDTALQASVFDDQQAFLAALMDEQTLTQLEQLKQNLENQRRQAQ 797
Q DV + S + A+ D A A E T T E KQ + + Q
Sbjct: 1004 QADVPSVPSNNEEIARVDEAPVPPP-------APATPSETTETVAENSKQESKTVEKNEQ 1056

Query: 798 TLVTQTAETLTQHQQHRPGGLSLTVTVEQIQQELAQTHQKLRENTTSQGEIRQQLKQDAD 857
TA+ ++ + V E+AQ+ + +E T++ + ++++
Sbjct: 1057 DATETTAQNREVAKEAKS-----NVKANTQTNEVAQSGSETKETQTTETKETATVEKEEK 1111

Query: 858 NRQQQQTLMQQIAQMTQQV 876
+ + + Q++ ++T QV
Sbjct: 1112 AKVETEK-TQEVPKVTSQV 1129


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_0457  FRAGILYSIN    30     0.021    Fragilysin metallopeptidase (M10C) enterotoxin signat...
		>FRAGILYSIN#Fragilysin metallopeptidase (M10C) enterotoxin signature.

Length = 405

Score = 29.7 bits (66), Expect = 0.021
Identities = 13/70 (18%), Positives = 23/70 (32%), Gaps = 4/70 (5%)

Query: 157 KQQHLLAAITDYYQQHYADACKLRGDQPLPIIATGHLTTVGASKSDAVRDIYIGTLDAFP 216
K+ ++ I ++Y + + + I T D + + I A
Sbjct: 135 KEAQMMNEIAEFYAAPFKKTRAINEKEAFECI-YDSRTRSA--GKD-IVSVKINIDKAKK 190

Query: 217 AQNFPPADYI 226
N P DYI
Sbjct: 191 ILNLPECDYI 200


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_0458  HTHFIS        97     2e-25    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 97.2 bits (242), Expect = 2e-25
Identities = 33/149 (22%), Positives = 62/149 (41%), Gaps = 9/149 (6%)

Query: 4 RILVVEDEAPIREMVCFVLEQNGFQPVEAEDYDSAVNQLNEPWPDLILLDWMLPGGSGIQ 63
ILV +D+A IR ++ L + G+ + + + DL++ D ++P +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 64 FIKHLKRESMTRDIPVVMLTARGEEEDRVRGLETGADDYITKPFSPKELVARIKAVMRRI 123
+ +K+ D+PV++++A+ ++ E GA DY+ KPF EL+ I +
Sbjct: 65 LLPRIKKARP--DLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA-- 120

Query: 124 SPMAVEEVIEMQGLSLDPTSHRVMAGEEP 152
E L D + G
Sbjct: 121 -----EPKRRPSKLEDDSQDGMPLVGRSA 144


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_0459  PF06580       34     0.001    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 34.1 bits (78), Expect = 0.001
Identities = 19/105 (18%), Positives = 33/105 (31%), Gaps = 26/105 (24%)

Query: 325 LVYNAVNH----TPEGTHITVRWQRVPHGAEFSVEDNGPGIAPEHIPHLTERFYRVDKAR 380
LV N + H P+G I ++ + VE+ G
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKN---------------- 306

Query: 381 SRQTGGSGLGLAIVKHAVNH---HESRLNIESTVGKGTRFSFVIP 422
+G GL V+ + E+++ + GK +IP
Sbjct: 307 --TKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAM-VLIP 348


65  ECP_0493  ECP_0507  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
ECP_0493  0       20       -0.219954  muropeptide transporter
ECP_0494  3       28       -0.855903  hypothetical protein
ECP_0495  4       26       -0.130959  transcriptional regulator BolA
ECP_0496  4       28       -0.062401  hypothetical protein
ECP_0497  3       28       0.236604   trigger factor
ECP_0498  1       21       0.498852   ATP-dependent Clp protease proteolytic subunit
ECP_0499  1       22       0.331359   ATP-dependent protease ATP-binding subunit ClpX
ECP_0500  0       19       0.317995   DNA-binding ATP-dependent protease La
ECP_0501  -1      13       0.393310   transcriptional regulator HU subunit beta
ECP_0502  -2      12       0.317686   peptidyl-prolyl cis-trans isomerase
ECP_0503  -2      17       -0.148445  hypothetical protein
ECP_0504  -2      15       0.329134   hypothetical protein
ECP_0505  0       14       1.627556   queuosine biosynthesis protein QueC
ECP_0506  -1      14       1.652372   extracellular solute-binding protein
ECP_0507  0       13       2.219720   haloacid dehalogenase-like hydrolase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_0493  TCRTETA       38     7e-05    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 37.9 bits (88), Expect = 7e-05
Identities = 70/347 (20%), Positives = 134/347 (38%), Gaps = 20/347 (5%)

Query: 62 KFLWSPLMDRYTPPFFGRRRGWLLATQILLLVAIAAMGFLEPGTQLRWMAALAVVIAFCS 121
+F +P++ + F RR LL + V A M W+ + ++A +
Sbjct: 56 QFACAPVLGALSDRF--GRRPVLLVSLAGAAVDYAIMAT----APFLWVLYIGRIVAGIT 109

Query: 122 ASQDIVFDAWKTDVLPAEERGAGAAISVLGYRLGMLVSGGLALWLADKWLGWQGMYWLMA 181
+ V A+ D+ +ER + GM+ L + ++ A
Sbjct: 110 GATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGG--FSPHAPFFAAA 167

Query: 182 VL-LIPCIIATLLAPEP--TDTIPVPKTLEQAVVAPLRDFFGRNNAWLILLLIVLYKLGD 238
L + + L PE + P+ + + + A L+ + ++ +G
Sbjct: 168 ALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQ 227

Query: 239 AFAMSLTTTFLIRGVGFDAGEVGVVNKTLGLLATIVGALYGGILMQRLSLFRALLIFGIL 298
A +L F +DA +G+ G+L ++ A+ G + RL RAL+ G++
Sbjct: 228 VPA-ALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALM-LGMI 285

Query: 299 QGASNAGYWLLSITDKHLYSMGAAVFFENLCGGMGTSAFVALLMTLCNKSFSATQFALLS 358
A GY LL+ + + V GG+G A A+L ++ L+
Sbjct: 286 --ADGTGYILLAFATRGWMAFPIMVLL--ASGGIGMPALQAMLSRQVDEERQGQLQGSLA 341

Query: 359 ALSAVGRVYVGPVAGWFVEAHGWSTF--YLFSVAAAVPGLILLLVCR 403
AL+++ + VGP+ + A +T+ + + AA+ L L + R
Sbjct: 342 ALTSLTSI-VGPLLFTAIYAASITTWNGWAWIAGAALYLLCLPALRR 387


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_0494  PF06291       29     0.006    Lambda prophage Bor protein
		>PF06291#Lambda prophage Bor protein

Length = 102

Score = 28.9 bits (64), Expect = 0.006
Identities = 12/37 (32%), Positives = 19/37 (51%)

Query: 34 NMFKKILFPLVALFMLAGCAKPPTTIEVSPTITLPQQ 70
N KK+LF ++ GCA+ T+ PT P++
Sbjct: 4 NKMKKMLFSAALAMLITGCAQQTFTVGNKPTAVTPKE 40


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_0499  HTHFIS        29     0.043    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 29.0 bits (65), Expect = 0.043
Identities = 16/73 (21%), Positives = 29/73 (39%), Gaps = 13/73 (17%)

Query: 60 ERSALPTPHEIRNHLDDYVIGQEQAKKVLAVAVYNHYKRLRNGDTSNGVELGKSNILLIG 119
E P+ E + ++G+ A + +Y RL D +++ G
Sbjct: 121 EPKRRPSKLEDDSQDGMPLVGRSAAMQ----EIYRVLARLMQTD---------LTLMITG 167

Query: 120 PTGSGKTLLAETL 132
+G+GK L+A L
Sbjct: 168 ESGTGKELVARAL 180


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_0500  GPOSANCHOR    34     0.002    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 34.3 bits (78), Expect = 0.002
Identities = 34/133 (25%), Positives = 68/133 (51%), Gaps = 15/133 (11%)

Query: 191 ERLEYLMAMMESEIDLLQVEKRIRNRVKKQMEKSQREYYLNEQMKAIQKELGEMDDAPD- 249
LE A +E + +L R +++ ++ S+ +Q++A ++L E + +
Sbjct: 291 AALEAEKADLEHQSQVLNAN---RQSLRRDLDASREAK---KQLEAEHQKLEEQNKISEA 344

Query: 250 ENEALKRKIDAAKMPKEAKEKAEAELQKLKMMSPMS-AEATVVRGYIDWMVQVPWNARSK 308
++L+R +DA++ EAK++ EAE QKL+ + +S A +R +D + A+ +
Sbjct: 345 SRQSLRRDLDASR---EAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASRE----AKKQ 397

Query: 309 VKKDLRQAQEILD 321
V+K L +A L
Sbjct: 398 VEKALEEANSKLA 410


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_0501  DNABINDINGHU  117    3e-38    Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 117 bits (294), Expect = 3e-38
Identities = 49/88 (55%), Positives = 67/88 (76%)

Query: 2 NKSQLIDKIAAGADISKAAAGRALDAIIASVTESLKEGDDVALVGFGTFAVKERAARTGR 61
NK LI K+A +++K + A+DA+ ++V+ L +G+ V L+GFG F V+ERAAR GR
Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62

Query: 62 NPQTGKEITIAAAKVPSFRAGKALKDAV 89
NPQTG+EI I A+KVP+F+AGKALKDAV
Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAV 90


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_0504  PF08280       27     0.021    M protein trans-acting positive regulator
		>PF08280#M protein trans-acting positive regulator

Length = 530

Score = 27.1 bits (60), Expect = 0.021
Identities = 24/138 (17%), Positives = 41/138 (29%), Gaps = 20/138 (14%)

Query: 1 MQTQIKVRGYHLDVYQHVNNARYL-------EFLEEARWHGLENSDSFHWMTAH------ 47
+Q I + Y N Y E++ + N FH +
Sbjct: 361 LQHFIPETNLFVSPYYKGNQKLYTSLKLIVEEWMAKLPGKRYLNHKHFHLFCHYVEQILR 420

Query: 48 ------NIAFVVVN-ININYRRPAVLSDLLTITSQLQQLNGKSGILSQVITLEPEGQVVA 100
+ FV N IN + + + + Q+ L+P+ +
Sbjct: 421 NIQPPLVVVFVASNFINAHLLTDSFPRYFSDKSIDFHSYYLLQDNVYQIPDLKPDLVITH 480

Query: 101 DALITFVCIDLKTQKALA 118
LI FV +L A+A
Sbjct: 481 SQLIPFVHHELTKGIAVA 498


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_0507  HTHFIS        29     0.019    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 29.0 bits (65), Expect = 0.019
Identities = 12/64 (18%), Positives = 24/64 (37%), Gaps = 10/64 (15%)

Query: 193 LTVLTQHLGLSLRDCMAFGDAMNDREMLGSVGSGFIMGN----------AMPQLRAELPH 242
TVL Q L + D +A + + ++ + +P+++ P
Sbjct: 16 RTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDLLPRIKKARPD 75

Query: 243 LPVI 246
LPV+
Sbjct: 76 LPVL 79


66  ECP_0518  ECP_0531  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
ECP_0518  -1      13       -1.675376  diguanylate phosphodiesterase
ECP_0519  2       18       -0.569315  hypothetical protein
ECP_0520  1       17       -0.773441  maltose O-acetyltransferase
ECP_0521  1       16       -0.121367  hemolysin expression-modulating protein
ECP_0522  1       15       0.130086   hypothetical protein
ECP_0523  1       16       1.069309   acriflavine resistance protein B
ECP_0524  2       14       0.534184   acriflavin resistance protein A
ECP_0525  1       16       0.428392   DNA-binding transcriptional repressor AcrR
ECP_0526  3       16       2.423611   potassium efflux protein KefA
ECP_0527  4       16       4.098564   hypothetical protein
ECP_0528  3       17       4.654885   primosomal replication protein N''
ECP_0529  3       23       3.233475   hypothetical protein
ECP_0530  4       27       3.247212   adenine phosphoribosyltransferase
ECP_0531  3       22       3.151458   DNA polymerase III subunits gamma and tau
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_0518  BCTERIALGSPF  31     0.013    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 31.0 bits (70), Expect = 0.013
Identities = 31/137 (22%), Positives = 54/137 (39%), Gaps = 24/137 (17%)

Query: 245 IWLPLGLVIGLLAAMFVLRILRRIQSPHHRLQDAIENRDICVHYQPIVSLANGKIVGAEA 304
W+ L L+ G +A +LR R+ + + P++ G+I
Sbjct: 228 PWMLLALLAGFMAFRVMLR------QEKRRVS-----FHRRLLHLPLI----GRIARGLN 272

Query: 305 LARWPQTDGSWLSPDSFIPLAQQTGLS-EPLTLLIIRSVFEDMGDWLRQHSQQHISINLE 363
AR+ +T + S +PL Q +S + ++ R D +R+ H + LE
Sbjct: 273 TARYARTLSILNA--SAVPLLQAMRISGDVMSNDYARHRLSLATDAVREGVSLHKA--LE 328

Query: 364 STVLTSEKIPQLLREMI 380
T L P ++R MI
Sbjct: 329 QTAL----FPPMMRHMI 341


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_0523  ACRIFLAVINRP  1367   0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 1367 bits (3539), Expect = 0.0
Identities = 802/1033 (77%), Positives = 916/1033 (88%), Gaps = 1/1033 (0%)

Query: 1 MPNFFIDRPIFAWVIAIIIMLAGGLAILKLPVAQYPTIAPPAVTISASYPGADAKTVQDT 60
M NFFI RPIFAWV+AII+M+AG LAIL+LPVAQYPTIAPPAV++SA+YPGADA+TVQDT
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 61 VTQVIEQNMNGIDNLMYMSSNSDSTGTVQITLTFESGTDADIAQVQVQNKLQLAMPLLPQ 120
VTQVIEQNMNGIDNLMYMSS SDS G+V ITLTF+SGTD DIAQVQVQNKLQLA PLLPQ
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 121 EVQQQGVSVEKSSSSFLMVVGVINTDGTMTQEDISDYVAANMKDAISRTSGVGDVQLFGS 180
EVQQQG+SVEKSSSS+LMV G ++ + TQ+DISDYVA+N+KD +SR +GVGDVQLFG+
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 181 QYAMRIWMNPNELNKFQLTPVDVITAIKAQNAQVAAGQLGGTPPVKGQQLNASIIAQTRL 240
QYAMRIW++ + LNK++LTPVDVI +K QN Q+AAGQLGGTP + GQQLNASIIAQTR
Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240

Query: 241 TSTEEFGKILLKVNQDGSRVLLRDVAKIELGGENYDIIAEFNGQPASGLGIKLATGANAL 300
+ EEFGK+ L+VN DGS V L+DVA++ELGGENY++IA NG+PA+GLGIKLATGANAL
Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300

Query: 301 DTAAAIRAELAKMEPFFPSGLKIVYPYDTTPFVKISIHEVVKTLVEAIILVFLVMYLFLQ 360
DTA AI+A+LA+++PFFP G+K++YPYDTTPFV++SIHEVVKTL EAI+LVFLVMYLFLQ
Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360

Query: 361 NFRATLIPTIAVPVVLLGTFAVLAAFGFSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420
N RATLIPTIAVPVVLLGTFA+LAAFG+SINTLTMFGMVLAIGLLVDDAIVVVENVERVM
Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420

Query: 421 AEEGLPPKEATRKSMGQIQGALVGIAMVLSAVFVPMAFFGGSTGAIYRQFSITIVSAMAL 480
E+ LPPKEAT KSM QIQGALVGIAMVLSAVF+PMAFFGGSTGAIYRQFSITIVSAMAL
Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480

Query: 481 SVLVALILTPALCATMLKPIAKGDHGEGKKGFFGWFNRMFEKSTHHYTDSVGGILRSTGR 540
SVLVALILTPALCAT+LKP++ H E K GFFGWFN F+ S +HYT+SVG IL STGR
Sbjct: 481 SVLVALILTPALCATLLKPVSAE-HHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539

Query: 541 YLVLYLIIVVGMAYLFVRLPSSFLPDEDQGVFMTMVQLPAGATQERTQKVLNEVTNYYLT 600
YL++Y +IV GM LF+RLPSSFLP+EDQGVF+TM+QLPAGATQERTQKVL++VT+YYL
Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599

Query: 601 KEKNNVESVFAVNGFGFAGRGQNTGIAFVSLKNWADRPGEENKVEAITMRATRAFSQIKD 660
EK NVESVF VNGF F+G+ QN G+AFVSLK W +R G+EN EA+ RA +I+D
Sbjct: 600 NEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRD 659

Query: 661 AMVFAFNLPAIVELGTATGFDFELIDQAGLGHEKLTQARNQLLAEAAKHPDMLTSVRPNG 720
V FN+PAIVELGTATGFDFELIDQAGLGH+ LTQARNQLL AA+HP L SVRPNG
Sbjct: 660 GFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNG 719

Query: 721 LEDTPQFKIDIDQEKAQALGVSINDINTTLGAAWGGSYVNDFIDRGRVKKVYVMSEAKYR 780
LEDT QFK+++DQEKAQALGVS++DIN T+ A GG+YVNDFIDRGRVKK+YV ++AK+R
Sbjct: 720 LEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFR 779

Query: 781 MLPDDIGDWYVRAADGQMVPFSAFSSSRWEYGSPRLERYNGLPSMEILGQAAPGKSTGEA 840
MLP+D+ YVR+A+G+MVPFSAF++S W YGSPRLERYNGLPSMEI G+AAPG S+G+A
Sbjct: 780 MLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDA 839

Query: 841 MELMEQLASKLPTGVGYDWTGMSYQERLSGNQAPSLYAISLIVVFLCLAALYESWSIPFS 900
M LME LASKLP G+GYDWTGMSYQERLSGNQAP+L AIS +VVFLCLAALYESWSIP S
Sbjct: 840 MALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVS 899

Query: 901 VMLVVPLGVIGALLAATFRGLTNDVYFQVGLLTTIGLSAKNAILIVEFAKDLMDKEGKGL 960
VMLVVPLG++G LLAAT NDVYF VGLLTTIGLSAKNAILIVEFAKDLM+KEGKG+
Sbjct: 900 VMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGV 959

Query: 961 IEATLDAVRMRLRPILMTSLAFILGVMPLVISTGAGSGAQNAVGTGVMGGMVTATVLAIF 1020
+EATL AVRMRLRPILMTSLAFILGV+PL IS GAGSGAQNAVG GVMGGMV+AT+LAIF
Sbjct: 960 VEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIF 1019

Query: 1021 FVPVFFVVVRRRF 1033
FVPVFFVV+RR F
Sbjct: 1020 FVPVFFVVIRRCF 1032


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_0524  RTXTOXIND     38     4e-05    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 37.9 bits (88), Expect = 4e-05
Identities = 29/173 (16%), Positives = 58/173 (33%), Gaps = 22/173 (12%)

Query: 17 KQEYDQ-ALADAQQANAAVTAAKAAVETARINLAYTKVTSPISGRIGKSNV-TEGALVQN 74
Q + L +Q + + + + +P+S ++ + V TEG +V
Sbjct: 293 TQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTT 352

Query: 75 GQATALATVQQLDPIYVDVTQSSNDFLRLKQELA----------NGTLKQENGKAKVSLI 124
+ T + V + D + V + D + KV I
Sbjct: 353 AE-TLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLV---GKVKNI 408

Query: 125 TSDGIKFPQDGTLEFSDVTVDQTTGSITLRAIFPNPDHTLLPGMFVRARLEEG 177
D I+ + G + +++++ S + I L GM V A ++ G
Sbjct: 409 NLDAIEDQRLGLVFNVIISIEENCLSTGNKNIP------LSSGMAVTAEIKTG 455


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_0525  HTHTETR       202    2e-68    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 202 bits (514), Expect = 2e-68
Identities = 196/196 (100%), Positives = 196/196 (100%)

Query: 2 ALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELEYQA 61
ALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELEYQA
Sbjct: 20 ALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELEYQA 79

Query: 62 KFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRNLCLESYD 121
KFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRNLCLESYD
Sbjct: 80 KFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRNLCLESYD 139

Query: 122 RIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDLKKEARDYVAILL 181
RIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDLKKEARDYVAILL
Sbjct: 140 RIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDLKKEARDYVAILL 199

Query: 182 EMYLLCPTLRNPATNE 197
EMYLLCPTLRNPATNE
Sbjct: 200 EMYLLCPTLRNPATNE 215


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_0526  RTXTOXIND     32     0.017    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 31.7 bits (72), Expect = 0.017
Identities = 19/125 (15%), Positives = 40/125 (32%), Gaps = 6/125 (4%)

Query: 28 QNTAFARASSNGDLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDKIDRVKEE 87
N RA L + + L L+ + A L++ ++ E
Sbjct: 207 LNLDKKRAERLTVLARINRYENLSRVEKSR--LDDFSSLLHKQAIAKHAVLEQENKYVEA 264

Query: 88 TVQLRQKVAEAPEKMRQATAALTALSDVDND--EETRKIL--STLSLRQLETRVAQALDD 143
+LR ++ + + +A V E L +T ++ L +A+ +
Sbjct: 265 VNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEER 324

Query: 144 LQNAQ 148
Q +
Sbjct: 325 QQASV 329


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
ECP_0531  IGASERPTASE  39     9e-05    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 38.5 bits (89), Expect = 9e-05
Identities = 40/251 (15%), Positives = 78/251 (31%), Gaps = 31/251 (12%)

Query: 404 PLPETTSQVLAARQQLLRVQGATKAKKSEPAA----ATRARPVNNAALERLASVTDRVQA 459
P E +Q + + + P+ AR + A + A T
Sbjct: 983 PEVEKRNQTVDTTN----ITTPNNIQADVPSVPSNNEEIARV-DEAPVPPPAPATPSETT 1037

Query: 460 RPVPSALEKAPAKKEAYRWKATTPVMQQKE--------VVATPKALKKA---LEHEKTPE 508
V ++ E AT Q +E V A + + A E ++T
Sbjct: 1038 ETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQT 1097

Query: 509 LAAKLAA---------EAIERDPWAAQVSQLSLPKLVEQVALNAWKE-ESDNAVCLHLRS 558
K A E+ +V+ PK + + E +N ++++
Sbjct: 1098 TETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKE 1157

Query: 559 SQRHLNNRGAQQKLAEALST-LKGSTVELTIVEDDNPAVRTPLEWRQAIYEEKLAQARES 617
Q N ++ A+ S+ ++ E T V N V P A + + +
Sbjct: 1158 PQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSN 1217

Query: 618 IIADNNIQTLR 628
+ + +++R
Sbjct: 1218 KPKNRHRRSVR 1228


67  ECP_0596  ECP_0606  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
ECP_0596  -2      11       -0.108186  hypothetical protein
ECP_0597  -1      10       0.818740   outer membrane protease
ECP_0598  0       10       1.583193   hypothetical protein
ECP_0599  -1      11       1.365606   bacteriophage N4 receptor, outer membrane
ECP_0600  0       13       0.940123   bacteriophage N4 adsorption protein B
ECP_0601  0       20       2.203521   sensor kinase CusS
ECP_0602  -1      20       2.329056   DNA-binding transcriptional activator CusR
ECP_0603  -1      19       1.426103   copper/silver efflux system outer membrane
ECP_0604  -2      19       1.550771   copper-binding protein
ECP_0605  -1      18       1.686834   copper/silver efflux system membrane fusion
ECP_0606  -2      18       1.116717   cation efflux system protein CusA
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
ECP_0596  LUXSPROTEIN  31     0.002    Bacterial autoinducer-2 (AI-2) production protein Lu...
		>LUXSPROTEIN#Bacterial autoinducer-2 (AI-2) production protein LuxS

signature.
Length = 171

Score = 31.4 bits (71), Expect = 0.002
Identities = 18/66 (27%), Positives = 30/66 (45%), Gaps = 7/66 (10%)

Query: 37 TKEHLLPHFL-EHLGNNHLDI------GVGTGFYLTHVPESSLISLMDLNEASLNAASTR 89
T EHL F+ HL + ++I G TGFY++ + S + D A++
Sbjct: 54 TLEHLYAGFMRNHLNGDSVEIIDISPMGCRTGFYMSLIGTPSEQQVADAWIAAMEDVLKV 113

Query: 90 AGESKI 95
++KI
Sbjct: 114 ENQNKI 119


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
ECP_0597  OMPTIN  528    0.0      Omptin serine protease signature.
		>OMPTIN#Omptin serine protease signature.

Length = 317

Score = 528 bits (1360), Expect = 0.0
Identities = 314/317 (99%), Positives = 317/317 (100%)

Query: 1 MRAKLLGIVLTTPIAISSFASTETLSFTPDNINADISLGTLSGKTKERVYLAEEGGRKVS 60
MRAKLLGIVLTTPIAISSFASTETLSFTPDNINADISLGTLSGKTKERVYLAEEGGRKVS
Sbjct: 1 MRAKLLGIVLTTPIAISSFASTETLSFTPDNINADISLGTLSGKTKERVYLAEEGGRKVS 60

Query: 61 QLDWKFNNAAIIKGAINWDLMPQISIGAAGWTTLGSRGGNMVDRDWMDSSNPGTWTDESR 120
QLDWKFNNAAIIKGAINWDLMPQISIGAAGWTTLGSRGGNMVD+DWMDSSNPGTWTDESR
Sbjct: 61 QLDWKFNNAAIIKGAINWDLMPQISIGAAGWTTLGSRGGNMVDQDWMDSSNPGTWTDESR 120

Query: 121 HPDTQLNYANEFDLNIKGWLLNEPNYRLGLMAGYQESRYSFTARGGSYIYSSEEGFRDDI 180
HPDTQLNYANEFDLNIKGWLLNEPNYRLGLMAGYQESRYSFTARGGSYIYSSEEGFRDDI
Sbjct: 121 HPDTQLNYANEFDLNIKGWLLNEPNYRLGLMAGYQESRYSFTARGGSYIYSSEEGFRDDI 180

Query: 181 GSFPNGERAIGYKQRFKMPYIGLTGSYRYEDFELGGTFKYSGWVEASDNDEHYDPGKRIT 240
GSFPNGERAIGYKQRFKMPYIGLTGSYRYEDFELGGTFKYSGWVE+SDNDEHYDPGKRIT
Sbjct: 181 GSFPNGERAIGYKQRFKMPYIGLTGSYRYEDFELGGTFKYSGWVESSDNDEHYDPGKRIT 240

Query: 241 YRSKVKDQNYYSVAVNAGYYVTPNAKVYVEGAWNRVTNKKGNTSLYDHNDNTSDYSKNGA 300
YRSKVKDQNYYSVAVNAGYYVTPNAKVYVEGAWNRVTNKKGNTSLYDHN+NTSDYSKNGA
Sbjct: 241 YRSKVKDQNYYSVAVNAGYYVTPNAKVYVEGAWNRVTNKKGNTSLYDHNNNTSDYSKNGA 300

Query: 301 GIENYNFITTAGLKYTF 317
GIENYNFITTAGLKYTF
Sbjct: 301 GIENYNFITTAGLKYTF 317


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_0601  PF06580  31     0.012    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 30.6 bits (69), Expect = 0.012
Identities = 29/184 (15%), Positives = 67/184 (36%), Gaps = 34/184 (18%)

Query: 306 EELTRMAKMVSDML-FLAQADNNQLIPEKKMLNLADEVGKVFDFFEALAEDR-GVELRFV 363
+ M +S+++ + + N + + LADE+ V + + LA + L+F
Sbjct: 191 TKAREMLTSLSELMRYSLRYSNARQVS------LADELTVVDSYLQ-LASIQFEDRLQFE 243

Query: 364 GDECQVAGDPLMLRRALSNLLSNALRY----TPTGETIVVRCQTVDHLVQVTVENPGTPI 419
D + + L+ N +++ P G I+++ + V + VEN G+
Sbjct: 244 NQINPAIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLA 303

Query: 420 APEHLPRLFDRFYRVDPSRQRKGEGSGIGLAIVK---SIVVAHKGTVAVTSDVRGTKFVI 476
E +G GL V+ ++ + + ++ ++
Sbjct: 304 LKN------------------TKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMV 345

Query: 477 ILPA 480
++P
Sbjct: 346 LIPG 349


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
ECP_0602  HTHFIS  86     2e-21    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 85.7 bits (212), Expect = 2e-21
Identities = 35/117 (29%), Positives = 62/117 (52%)

Query: 2 KLLIVEDEKKTGEYLTKGLTEAGFVVDLADNGLNGYHLAMTGDYDLIILDIMLPDVNGWD 61
+L+ +D+ L + L+ AG+ V + N + GD DL++ D+++PD N +D
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 IVRMLRSANKGMPILLLTALGTIEHRVKGLELGADDYLVKPFAFAELLARVRTLLRR 118
++ ++ A +P+L+++A T +K E GA DYL KPF EL+ + L
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
ECP_0603  RTXTOXIND  39     3e-05    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 39.0 bits (91), Expect = 3e-05
Identities = 25/189 (13%), Positives = 60/189 (31%), Gaps = 13/189 (6%)

Query: 254 QAQTVNSDSLQSVKLPA-GLPSQILLQRPDIMEAEHALM-----AANANIGAARAAFFPS 307
+ +S + +K + +I+++ + + L+ A A+ ++
Sbjct: 87 NGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQS----- 141

Query: 308 ISLTSGISTASSDLSSLFNASSGMWNFIPKIEIPIFNAGRNQANLDIAEIRQQQSVVNYE 367
SL + + + + P F + L + + ++Q
Sbjct: 142 -SLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQN 200

Query: 368 QKIQNAFKEVADALALRQSLNDQISAQQRYLASLQITLQRARTLYQHGAVSYLEVLDAER 427
QK Q + A R ++ +I+ + + L +L A++ VL+ E
Sbjct: 201 QKYQ-KELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQEN 259

Query: 428 SLFATRQTL 436
L
Sbjct: 260 KYVEAVNEL 268


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_0606  ACRIFLAVINRP  697    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 697 bits (1801), Expect = 0.0
Identities = 214/1059 (20%), Positives = 441/1059 (41%), Gaps = 54/1059 (5%)

Query: 1 MIEWIIRRSVANRFLVLMGALFLSIWGTWTIINTPVDALPDLSDVQVIIKTSYPGQAPQI 60
M + IRR + A+ L + G I+ PV P ++ V + +YPG Q
Sbjct: 1 MANFFIRR----PIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQT 56

Query: 61 VENQVTYPLTTTMLSVPGAKTVRGFSQ-FGDSYVYVIFEDGTDPYWARSRVLEYLNQVQG 119
V++ VT + M + + S G + + F+ GTDP A+ +V L
Sbjct: 57 VQDTVTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATP 116

Query: 120 KLPAGVSAELGP-DATGVGWIYEYALVDRSGKHDLADLRSLQDWFLKYELKTIPDVAEVA 178
LP V + + + ++ V + D+ +K L + V +V
Sbjct: 117 LLPQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQ 176

Query: 179 SVGGVVKEYQVVIDPQRLAQYGISLAEVKSALDASNQEAGGSSIELA------EAEYMVR 232
G ++ +D L +Y ++ +V + L N + + + +
Sbjct: 177 LFGAQ-YAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASII 235

Query: 233 ASGYLQTLDDFNHIVLKASENGVPVYLRDVAKVQVGPEMRRGIAELNGEGEVAGGVVILR 292
A + ++F + L+ + +G V L+DVA+V++G E IA +NG+ AG + L
Sbjct: 236 AQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGK-PAAGLGIKLA 294

Query: 293 SGKNAREVIAAVKDKLETLKSSLPEGVEIVTTYDRSQLIDRAIDNLSGKLLEEFIVVAVV 352
+G NA + A+K KL L+ P+G++++ YD + + +I + L E ++V +V
Sbjct: 295 TGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLV 354

Query: 353 CALFLWHVRSALVAIISLPLGLCIAFIVMHFQGLNANIMSLGGIAIAVGAMVDAAIVMIE 412
LFL ++R+ L+ I++P+ L F ++ G + N +++ G+ +A+G +VD AIV++E
Sbjct: 355 MYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVE 414

Query: 413 NAHKRLEEWQHQHPDATLDNKTRWQVITDASVEVGPALFISLLIITLSFIPIFTLEGQEG 472
N + + E D + + ++ AL ++++ FIP+ G G
Sbjct: 415 NVERVMME----------DKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTG 464

Query: 473 RLFGPLAFTKTYAMAGAALLAIVVIPILMGYWIRGKIPPESSNPLNRF----------LI 522
++ + T AMA + L+A+++ P L ++ + E F +
Sbjct: 465 AIYRQFSITIVSAMALSVLVALILTPALCATLLKP-VSAEHHENKGGFFGWFNTTFDHSV 523

Query: 523 RVYHPLLLKVLHWPKTTLLVAALSVLTVLWPLNKVGGEFLPQINEGDLLYMPSTLPGISA 582
Y + K+L LL+ AL V ++ ++ FLP+ ++G L M G +
Sbjct: 524 NHYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQ 583

Query: 583 AEAASMLQKTDKLIM--SVPEVARVFGKTGKAETATDSAPLEMVETTIQLKPQDQW-RPG 639
+L + + V VF G + + + LKP ++
Sbjct: 584 ERTQKVLDQVTDYYLKNEKANVESVFTVNGFSFSGQAQN---AGMAFVSLKPWEERNGDE 640

Query: 640 MTMDKIIEELDNTVRLPGLANLWVPPIRNRIDMLSTGIKSPIGIKVSGTVLADI-DTMAE 698
+ + +I + + + +++ + I +G + +
Sbjct: 641 NSAEAVIHRAKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQ 700

Query: 699 QIEEVARTVPGVASALAERLEGGRYINVEINREKAARYGMTVADVQLFVTSAVGGAMVGE 758
+ A+ + S LE +E+++EKA G++++D+ +++A+GG V +
Sbjct: 701 LLGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVND 760

Query: 759 TVEGIARYPINLRYPQSWRDSPQALRQLPILTPMKQQITLADVADVKVSTGPSMLKTENA 818
++ + ++ +R P+ + +L + + + + + G L+ N
Sbjct: 761 FIDRGRVKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNG 820

Query: 819 RPTSWIYIDARDRDMVSVVHDLQKAIAEKVQLKPGTSVAFSGQFELLERANHKLKLMVPM 878
P+ I +A L + +A K L G ++G + ++ +V +
Sbjct: 821 LPSMEIQGEAAPGTSSGDAMALMENLASK--LPAGIGYDWTGMSYQERLSGNQAPALVAI 878

Query: 879 TLMIIFVLLYLAFRRVSEALLIISSVPFALVGGIWLLWWMGFHLSVATGTGFIALAGVAA 938
+ +++F+ L + S + ++ VP +VG + V G + G++A
Sbjct: 879 SFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSA 938

Query: 939 EFGVVMLMYLRHAIEAEPSLNNPQTFSEQKLDEALYHGAVLRVRPKAMTVAVIIAGLLPI 998
+ ++++ + + +E E + + EA +R+RP MT I G+LP+
Sbjct: 939 KNAILIVEFAKDLMEKE----------GKGVVEATLMAVRMRLRPILMTSLAFILGVLPL 988

Query: 999 LWGTGAGSEVMSRIAAPMIGGMITAPLLSLFIIPAAYKL 1037
GAGS + + ++GGM++A LL++F +P + +
Sbjct: 989 AISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVV 1027


68  ECP_0623  ECP_0628  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias   Product
ECP_0623  -1      17       4.861690  enterobactin exporter EntS
ECP_0624  -2      17       4.703620  iron-enterobactin transporter periplasmic
ECP_0625  -1      22       5.137995  isochorismate synthase
ECP_0626  -1      23       5.206866  enterobactin synthase subunit E
ECP_0627  0       22       4.986967  isochorismatase
ECP_0628  0       20       4.720872  2,3-dihydroxybenzoate-2,3-dehydrogenase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_0623  TCRTETA  36     2e-04    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 36.0 bits (83), Expect = 2e-04
Identities = 82/394 (20%), Positives = 146/394 (37%), Gaps = 38/394 (9%)

Query: 27 FISIVSLGLLGVAVPVQIQMMTHSTWQV---GLSVTLTGGAMFVGLMVGGVLADRYERKK 83
+ V +GL+ +P ++ + HS G+ + L F V G L+DR+ R+
Sbjct: 15 ALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRP 74

Query: 84 VILLARGTCGIGFIGLCLNALL--PEPSLLAIYLLGLWDGFFASLGVTALLAATPALVGR 141
V+L + G ++ + P L +Y+ + G + G A A +
Sbjct: 75 VLL-------VSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVA-GAYIADITDG 126

Query: 142 ENLMQAGAITMLTVRLGSVISPMIGGLLLATGGVAWNYGLAAAGTFITLLPLLSLPALPP 201
+ + G V P++GGL+ GG + + AA L L LP
Sbjct: 127 DERARHFGFMSACFGFGMVAGPVLGGLM---GGFSPHAPFFAAAALNGLNFLTGCFLLPE 183

Query: 202 PPQPREHPLK----SLLAGFRFLLASPLVGGIALLGGLLTMAS----AVRVLYPALADNW 253
+ PL+ + LA FR+ +V + + ++ + A+ V++ D +
Sbjct: 184 SHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFG--EDRF 241

Query: 254 QMSAAQIGFLYAAIP-LGAAIGALTSGKLAHSVRPGLLMLLSTLG---AFLAIGLFGLMP 309
A IG AA L + A+ +G +A + ++L + ++ +
Sbjct: 242 HWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGW 301

Query: 310 MWILGVVCLALFGWLSAVSSLLQYTMLQTQTPEAMLGRINGLWTAQNVTGDAIGAALLGG 369
M +V LA G ML Q E G++ G A +G L
Sbjct: 302 MAFPIMVLLASGGIGMPALQ----AMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTA 357

Query: 370 LGAMMTPVASASASGFGLLIIGVLLLLVLVELRR 403
+ A + + +G+ + L LL L LRR
Sbjct: 358 IYA----ASITTWNGWAWIAGAALYLLCLPALRR 387


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_0624  FERRIBNDNGPP  64     7e-14    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 64.2 bits (156), Expect = 7e-14
Identities = 60/280 (21%), Positives = 100/280 (35%), Gaps = 35/280 (12%)

Query: 40 HTLESQPQRIVSTSVTLTGSLLAIDAPVIASGATTPNNRVADDQGFLRQWSKVAKERKLQ 99
H P RIV+ LLA+ VAD + R W E L
Sbjct: 29 HAAAIDPNRIVALEWLPVELLLALGIVPYG---------VADTINY-RLW---VSEPPLP 75

Query: 100 RLYIG-----EPSAEAVATQMPDLILISATGGDSALALYDQLSTIAPTLIINYDDKS--- 151
I EP+ E + P ++ SA G S + L+ IAP N+ D
Sbjct: 76 DSVIDVGLRTEPNLELLTEMKPSFMVWSAGYGPS----PEMLARIAPGRGFNFSDGKQPL 131

Query: 152 --WQSLLTQLGEITGHEKQAAERIAQFDKQLAAAKEQIKLPPQPVTAIVYTAAAHSANLW 209
+ LT++ ++ + A +AQ++ + + K + + ++
Sbjct: 132 AMARKSLTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDPRHMLVF 191

Query: 210 TPESAQGQMLEQLGFTLAKLPAGLNASQSQGKRHDIIQLGGENLAAGLNGESLFLFAGDQ 269
P S ++L++ G NA Q + + + LAA + + L +
Sbjct: 192 GPNSLFQEILDEYGIP--------NAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDHDNS 243

Query: 270 KDADAIYANPLLAHLPAVQNKQVYALGTETFRLDYYSAMQ 309
KD DA+ A PL +P V+ + + F SAM
Sbjct: 244 KDMDALMATPLWQAMPFVRAGRFQRVPAVWFYGATLSAMH 283


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_0627  ISCHRISMTASE  439    e-159    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 439 bits (1130), Expect = e-159
Identities = 145/299 (48%), Positives = 194/299 (64%), Gaps = 18/299 (6%)

Query: 1 MAIPRLQAYALPESHDIPHNKVDWAFEPQRAALLIHDMQDYFVSFWGENCPMMEQVIANI 60
MAIP +Q Y +P + D+P NKV W +P RA LLIHDMQ+YFV + + ++ ANI
Sbjct: 1 MAIPAIQPYQMPTASDMPQNKVSWVPDPNRAVLLIHDMQNYFVDAFTAGASPVTELSANI 60

Query: 61 AALRDYCKQHNIPVYYTAQPKEQSDEDRALLNDMWGPGLTRSPEQQKVVDRLTPDADDTV 120
L++ C Q IPV YTAQP Q+ +DRALL D WGPGL P ++K++ L P+ DD V
Sbjct: 61 RKLKNQCVQLGIPVVYTAQPGSQNPDDRALLTDFWGPGLNSGPYEEKIITELAPEDDDLV 120

Query: 121 LVKWRYSAFHRSPLEQMLKESGRNQLIITGVYAHIGCMTTATDAFMRDIKPFMVADALAD 180
L KWRYSAF R+ L +M+++ GR+QLIITG+YAHIGC+ TA +AFM DIK F V DA+AD
Sbjct: 121 LTKWRYSAFKRTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVAD 180

Query: 181 FSRDEHLMSLKYVAGRSGRVVMTEELL------PAPIPASKA-----------ALREVIL 223
FS ++H M+L+Y AGR VMT+ LL PA + + A +R+ I
Sbjct: 181 FSLEKHQMALEYAAGRCAFTVMTDSLLDQLQNAPADVQKTSANTGKKNVFTCENIRKQIA 240

Query: 224 PLLDESDEPFDDD-NLIDYGLDSVRMMALAARWRKVYGDIDFVMLAKNPTIDAWWKLLS 281
LL E+ E D +L+D GLDSVR+M L +WR+ ++ FV LA+ PTI+ W KLL+
Sbjct: 241 ELLQETPEDITDQEDLLDRGLDSVRIMTLVEQWRREGAEVTFVELAERPTIEEWQKLLT 299


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_0628  DHBDHDRGNASE  364    e-131    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 364 bits (936), Expect = e-131
Identities = 110/258 (42%), Positives = 150/258 (58%), Gaps = 20/258 (7%)

Query: 5 GKNVWVTGAGKGIGYATALAFVEAGAKVTGFD---------------QAFTQEQYPFATE 49
GK ++TGA +GIG A A GA + D +A E +P
Sbjct: 8 GKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFP---- 63

Query: 50 VMDVADAAQVAQVCQRLLAETERLDVLVNAAGILRMGATDQLSKEDWQQTFAVNVGGAFN 109
DV D+A + ++ R+ E +D+LVN AG+LR G LS E+W+ TF+VN G FN
Sbjct: 64 -ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFN 122

Query: 110 LFQQTMNQFRRQRGGAIVTVASDAAHTPRIGMSAYGASKAALKSLALSVGLELAGSGVRC 169
+ +R G+IVTV S+ A PR M+AY +SKAA +GLELA +RC
Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 170 NVVSPGSTDTDMQRTLWVSDDAEEQRIRGFGEQFKLGIPLGKIARPQEIANTILFLASDL 229
N+VSPGST+TDMQ +LW ++ EQ I+G E FK GIPL K+A+P +IA+ +LFL S
Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242

Query: 230 ASHITLQDIVVDGGSTLG 247
A HIT+ ++ VDGG+TLG
Sbjct: 243 AGHITMHNLCVDGGATLG 260


69  ECP_0806  ECP_0811  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias   Product
ECP_0806  -1      20       3.796292  membrane protein YbhR
ECP_0807  -1      18       3.877660  inner membrane protein
ECP_0808  -2      16       3.637684  ABC transporter ATP-binding protein
ECP_0809  -1      13       3.558730  hypothetical protein
ECP_0810  -1      12       3.260069  DNA-binding transcriptional regulator
ECP_0811  0       13       3.271611  ATP-dependent RNA helicase RhlE
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_0806  ABC2TRNSPORT  47     3e-08    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein

signature.
Length = 262

Score = 47.2 bits (112), Expect = 3e-08
Identities = 36/146 (24%), Positives = 63/146 (43%), Gaps = 5/146 (3%)

Query: 197 AREREQGTLDQLLVSPLTTWQIFIGKAVPALIVATFQATIVLAIGIWAYQIPFAGSLALF 256
R Q T + +L + L I +G+ A A IG+ A + + L+L
Sbjct: 92 GRMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGA---GIGVVAAALGYTQWLSLL 148

Query: 257 YFTMVI--YGLSLVGFGLLISSLCSTQQQAFIGVFVFMMPAILLSGYVSPVENMPVWLQN 314
Y VI GL+ G+++++L + + + P + LSG V PV+ +P+ Q
Sbjct: 149 YALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPIVFQT 208

Query: 315 LTWINPIRHFTDITKQIYLKDASLDI 340
P+ H D+ + I L +D+
Sbjct: 209 AARFLPLSHSIDLIRPIMLGHPVVDV 234


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_0808  PF05272  31     0.012    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 31.2 bits (70), Expect = 0.012
Identities = 20/90 (22%), Positives = 28/90 (31%), Gaps = 21/90 (23%)

Query: 293 TPRFEDAFIDLLGGAGTSESPLGAILHTVEGTPGETVIEAKELTKKFGDFAATDHVNFAV 352
PR E + +LG P + + + K HV +
Sbjct: 547 VPRLEKWLVHVLGKTPDDYKP-------------RRLRYLQLVGKYI----LMGHVARVM 589

Query: 353 KRGEIFG----LLGPNGAGKSTTFKMMCGL 378
+ G F L G G GKST + GL
Sbjct: 590 EPGCKFDYSVVLEGTGGIGKSTLINTLVGL 619



Score = 29.3 bits (65), Expect = 0.047
Identities = 11/23 (47%), Positives = 13/23 (56%)

Query: 34 YVTGLVGPDGAGKTTLMRMLAGL 56
Y L G G GK+TL+ L GL
Sbjct: 597 YSVVLEGTGGIGKSTLINTLVGL 619


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
ECP_0809  RTXTOXIND  62     7e-13    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 62.2 bits (151), Expect = 7e-13
Identities = 42/259 (16%), Positives = 92/259 (35%), Gaps = 25/259 (9%)

Query: 82 ALMQAKAGVSVAQAQYDLMLAGYRDEEIAQAAAAVKQAQAAYDYAQNFYNRQQGLWKSRT 141
Q + + +A+ +LA E + + + + L +
Sbjct: 201 QKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENK 260

Query: 142 ISA--NDLENARSSRDQAQATLKSAQDKLRQYRSGNREQ---DIAQAKASLEQAQAQLAQ 196
N+L +S +Q ++ + SA+++ + + + + Q ++ +LA+
Sbjct: 261 YVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAK 320

Query: 197 AELNLQDSTLVAPSDGTLLTRAV-EPGTVLNEGGTVFTVSLT-RPVWVRAYVDERNLDQA 254
E Q S + AP + V G V+ T+ + + V A V +++
Sbjct: 321 NEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFI 380

Query: 255 QPGRKVLLYTDGRPNKPYH---GQIGFVSPTAEFTPKTVETPDLRTDLVYRLRIVVT--- 308
G+ ++ + P Y G++ ++ A D R LV+ + I +
Sbjct: 381 NVGQNAIIKVEAFPYTRYGYLVGKVKNINLDA--------IEDQRLGLVFNVIISIEENC 432

Query: 309 ----DADDALRQGMPVTVQ 323
+ + L GM VT +
Sbjct: 433 LSTGNKNIPLSSGMAVTAE 451


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_0810  HTHTETR  72     9e-18    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 72.4 bits (177), Expect = 9e-18
Identities = 32/220 (14%), Positives = 74/220 (33%), Gaps = 29/220 (13%)

Query: 13 KGEQAKKQLIAAALAQFGEYGMNATT-REIAAQAGQNIAAITYYFGSKEDLYLACAQWIA 71
+ ++ ++ ++ AL F + G+++T+ EIA AG AI ++F K DL+ +
Sbjct: 8 EAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSE 67

Query: 72 DFIGEQFRPHAEEAERLFAQPQPDRAAIRELILRACRNMIKLLTQDDTVNLSK---FISR 128
IGE E + P + +RE+++ + + + + +
Sbjct: 68 SNIGELEL---EYQAKFPGDP---LSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVG 121

Query: 129 EQLSPTAAYHLVHEQVISPLHSHLTRLIAAW---TGCDASDTRMILHTHALIGEILAFRL 185
E A + + + L I A +I+ I ++
Sbjct: 122 EMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIM--RGYISGLM---- 175

Query: 186 GKETILLRTGWTAFDEEKTELINQTVTCHIDLILQGLSQR 225
W + + + ++ ++L+
Sbjct: 176 --------ENWLFAPQSFD--LKKEARDYVAILLEMYLLC 205


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_0811  SECA  30     0.026    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 29.8 bits (67), Expect = 0.026
Identities = 20/67 (29%), Positives = 34/67 (50%), Gaps = 4/67 (5%)

Query: 246 QQVLVFTRTKHGANHLAEQLNKDGIRSAAIHG-NKSQGARTRALADFKSGDIRVLVATDI 304
Q VLV T + + ++ +L K GI+ ++ + A A A + + V +AT++
Sbjct: 450 QPVLVGTISIEKSELVSNELTKAGIKHNVLNAKFHANEAAIVAQAGYPAA---VTIATNM 506

Query: 305 AARGLDI 311
A RG DI
Sbjct: 507 AGRGTDI 513


70  ECP_0853  ECP_0861  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
ECP_0853  0       14       -0.913456  D-alanyl-D-alanine carboxypeptidase
ECP_0854  1       15       -0.158014  DNA-binding transcriptional repressor DeoR
ECP_0855  0       14       0.120667   undecaprenyl pyrophosphate phosphatase
ECP_0856  0       12       0.062763   multidrug translocase MmdfA
ECP_0857  -1      14       -0.365212  hypothetical protein
ECP_0858  0       15       -0.779997  HAD family hydrolase
ECP_0859  -1      14       -0.001946  membrane protein YbjJ
ECP_0860  -1      12       -0.716064  transcriptional regulator YbjK
ECP_0861  0       11       0.277047   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
ECP_0853  BLACTAMASEA  43     8e-07    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 43.2 bits (102), Expect = 8e-07
Identities = 41/201 (20%), Positives = 64/201 (31%), Gaps = 34/201 (16%)

Query: 16 AFLFLFAPTAFAAEQTVEAPSVDARAW----------ILMDYASGKVLAEGNADEKLDPA 65
+ L A A P + I MD ASG+ L ADE+
Sbjct: 7 CIISLLATLPLAV-HASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRADERFPMM 65

Query: 66 SLTKIMTSYVVGQALKADKIKLTDMVTVGKDAWATGNPALRGSSVMFLKPGDQVSVADLN 125
S K++ V + A +L + + +P V D ++V +L
Sbjct: 66 STFKVVLCGAVLARVDAGDEQLERKIHYRQQDLVDYSP------VSEKHLADGMTVGELC 119

Query: 126 KGVIIQSGNDACIALADYVAGSQESFIGLMNGYAKKLGLTNTT---FQTVHGLDAPGQF- 181
I S N A L V G + + +++G T ++T PG
Sbjct: 120 AAAITMSDNSAANLLLATVGGPAG-----LTAFLRQIGDNVTRLDRWETELNEALPGDAR 174

Query: 182 --STARDMA------LLGKAL 194
+T MA L + L
Sbjct: 175 DTTTPASMAATLRKLLTSQRL 195


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_0856  TCRTETA  40     1e-05    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 39.8 bits (93), Expect = 1e-05
Identities = 58/269 (21%), Positives = 106/269 (39%), Gaps = 23/269 (8%)

Query: 71 LLGPLSDRIGRRPVMLAGVVWFIVTCLAILLAQNIEQFTLLRFLQGISLCFIGAVGYAAI 130
+LG LSDR GRRPV+L + V + A + + R + GI+ GAV A I
Sbjct: 62 VLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGA-TGAVAGAYI 120

Query: 131 QESFEEAVCIKITALMANVALIAPLLGPLVG---AAWIHVLPWEGMFVLFAALAAISFFG 187
+ + + M+ + GP++G + P F AAL ++F
Sbjct: 121 ADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAP----FFAAAALNGLNFLT 176

Query: 188 LQRAMPETATRIGEKLSLKELGRDYKLVLKNG-RFVAGALALGFVSLPLLAWIAQSP--I 244
+PE+ L + L G VA +A+ F ++ + Q P +
Sbjct: 177 GCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFF----IMQLVGQVPAAL 232

Query: 245 IIITGEQLSSYEYGLLQVPIFGALIAGNL----LLARLTSRRTVRSLIIMGGWPIMIGLL 300
+I GE ++ + + + I +L + + +R R +++G G +
Sbjct: 233 WVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYI 292

Query: 301 VAAAATVISSHAYLWMTAGLSIYAFGIGL 329
+ A AT ++ + + + GIG+
Sbjct: 293 LLAFAT----RGWMAFPIMVLLASGGIGM 317


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_0859  TCRTETB  34     0.001    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 33.7 bits (77), Expect = 0.001
Identities = 34/150 (22%), Positives = 65/150 (43%), Gaps = 6/150 (4%)

Query: 218 LLIGVVVLAMAFAEGSANDWL-PLLMVDGHGFSP-TSGSLIYAGFTLGMTVGRFTGGWFI 275
+IGV+ + F + + P +M D H S GS+I T+ + + + GG +
Sbjct: 258 FMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGILV 317

Query: 276 DRYSRVAVVR-ASALM--GALGIGLIIFVDSAWVA-GVSVVLWGLGASLGFPLTISAASD 331
DR + V+ + L ++ S ++ + VL GL + TI ++S
Sbjct: 318 DRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVISTIVSSSL 377

Query: 332 TGPDAPTRVSVVATTGYLAFLVGPPLLGYL 361
+A +S++ T +L+ G ++G L
Sbjct: 378 KQQEAGAGMSLLNFTSFLSEGTGIAIVGGL 407


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_0860  HTHTETR  52     9e-11    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 51.9 bits (124), Expect = 9e-11
Identities = 14/84 (16%), Positives = 32/84 (38%)

Query: 2 RRANDPQRREKIIQATLEAVKLYGIHAVTHRKIAALAGVPLGSMTYYFSGIDELLLEAFS 61
+ + R+ I+ L G+ + + +IA AGV G++ ++F +L E +
Sbjct: 5 TKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWE 64

Query: 62 RFTEIMSRQYQAFFSDVSDAPGAR 85
+ + + P +
Sbjct: 65 LSESNIGELELEYQAKFPGDPLSV 88


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_0861  TCRTETA  32     0.006    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 32.1 bits (73), Expect = 0.006
Identities = 21/106 (19%), Positives = 34/106 (32%), Gaps = 6/106 (5%)

Query: 394 LMIGMITFQFSTFSFGMGNAAGLLFAGIML-GFMRANHPTFG-YIPQ--GALSMVKEFGL 449
L++ + +L+ G ++ G A G YI + FG
Sbjct: 76 LLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADITDGDERARHFGF 135

Query: 450 MVFMAGVGLSAGSGINNGLGAIGGQM--LIAGLIVSLVPVVICFLF 493
M G G+ AG + +G A + L + CFL
Sbjct: 136 MSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLL 181


71  ECP_0879  ECP_0884  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias   Product
ECP_0879  -1      13       2.836518  arginine transporter ATP-binding subunit
ECP_0880  -1      13       3.255894  lipoprotein
ECP_0881  -1      13       2.974014  hypothetical protein
ECP_0882  -1      13       3.095870  N-acetylmuramoyl-L-alanine amidase
ECP_0883  -3      15       2.892411  nucleotide di-P-sugar epimerase or dehydratase
ECP_0884  -2      13       2.261896  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_0879  PF05272  30     0.010    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.0 bits (67), Expect = 0.010
Identities = 9/18 (50%), Positives = 12/18 (66%)

Query: 31 LVLLGPSGAGKSSLLRVL 48
+VL G G GKS+L+ L
Sbjct: 599 VVLEGTGGIGKSTLINTL 616


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
ECP_0882  ECOLIPORIN  29     0.026    E.coli/Salmonella-type porin signature.
		>ECOLIPORIN#E.coli/Salmonella-type porin signature.

Length = 383

Score = 28.7 bits (64), Expect = 0.026
Identities = 20/54 (37%), Positives = 26/54 (48%), Gaps = 9/54 (16%)

Query: 2 RRVFWLVAAALLLAGCTGEKGIVEKEGYQLDTRRQAQAAYPRIKVLVIHYTADD 55
R+V LV ALL AG I K+G +LD Y ++ L HY +DD
Sbjct: 3 RKVLALVIPALLAAGAAHAAEIYNKDGNKLDL-------YGKVDGL--HYFSDD 47


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_0883  NUCEPIMERASE  75     2e-17    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 75.2 bits (185), Expect = 2e-17
Identities = 70/363 (19%), Positives = 123/363 (33%), Gaps = 65/363 (17%)

Query: 13 MKVLVTGATSGLGRNAVEFLCQKGISVRA---------TGRNEAMGKLLEKMGAEFVPAD 63
MK LVTGA +G + + L + G V +A +LL + G +F D
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 64 LTELVSSQAKVMLAGIDTLWHCS-------SFTSPWGTQQAFDLANVRATRRLGEWAVAW 116
L + + ++ S +P A+ +N+ + E
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPH----AYADSNLTGFLNILEGCRHN 116

Query: 117 GVRNFIHISSPSLYFDYHHHRDIKEDFRPHRFANEFARSKAASEEVINMLSQANPQTRFT 176
+++ ++ SS S+Y + D + +A +K A+E + + S T
Sbjct: 117 KIQHLLYASSSSVYGL-NRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSH-LYGLPAT 174

Query: 177 ILRPQSLFGPHDK--VFIPRLAHMMHHYGSILLPHGGSALVDMTYYENAVHAMWLASQEA 234
LR +++GP + + + + M SI + + G D TY ++ A+
Sbjct: 175 GLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVI 234

Query: 235 CDKLPS--------------GRVYNITNGEHRTLRSIVQKLIDELNIDCRIRSVPYPMLD 280
RVYNI N L +Q L D L I+ + +P D
Sbjct: 235 PHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNMLPLQPGD 294

Query: 281 MIARSMERLGRKSAKEPPLTHYGVSKLNFDFTLDITRAQEELGYQPVLTLDEGIEKTAAW 340
+ T D E +G+ P T+ +G++ W
Sbjct: 295 V----------------LETS-----------ADTKALYEVIGFTPETTVKDGVKNFVNW 327

Query: 341 LRD 343
RD
Sbjct: 328 YRD 330


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_0884  NUCEPIMERASE  56     1e-10    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 55.6 bits (134), Expect = 1e-10
Identities = 29/125 (23%), Positives = 52/125 (41%), Gaps = 17/125 (13%)

Query: 4 RILVLGASGYIGQHLVRTLSQQGHQILA---------AARHVDRLAKLQLANVSCHKVDL 54
+ LV GA+G+IG H+ + L + GHQ++ + RL L HK+DL
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 55 NWPDNLPALLQN--IDTVYFLVH------SMGEGGDFIAQERQVALNVRDALREVPVKQL 106
+ + L + + V+ H S+ + LN+ + R ++ L
Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQHL 121

Query: 107 IFLSS 111
++ SS
Sbjct: 122 LYASS 126


72  ECP_1068  ECP_1076  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias   Product
ECP_1068  1       12       2.369667  flagellar hook protein FlgE
ECP_1069  -1      13       2.453565  flagellar basal body rod protein FlgF
ECP_1070  -1      10       1.297390  flagellar basal body rod protein FlgG
ECP_1071  0       13       2.245209  flagellar basal body L-ring protein
ECP_1072  0       13       1.975842  flagellar basal body P-ring biosynthesis protein
ECP_1073  1       13       1.661371  flagellar rod assembly protein/muramidase FlgJ
ECP_1074  1       13       1.233585  flagellar hook-associated protein FlgK
ECP_1075  3       15       1.140128  flagellar hook-associated protein FlgL
ECP_1076  4       17       1.515561  ribonuclease E
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
ECP_1068  FLGHOOKAP1  41     6e-06    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 41.1 bits (96), Expect = 6e-06
Identities = 17/49 (34%), Positives = 29/49 (59%)

Query: 353 TLTNGALEASNVDLSKELVNMIVAQRNYQSNAQTIKTQDQILNTLVNLR 401
L+N S V+L +E N+ Q+ Y +NAQ ++T + I + L+N+R
Sbjct: 498 QLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR 546



Score = 34.9 bits (80), Expect = 5e-04
Identities = 21/56 (37%), Positives = 29/56 (51%), Gaps = 4/56 (7%)

Query: 6 AVSGLNVAATNLDVIGNNIANSATYGFKSGTASFAD----MFAGSKVGLGVKVAGI 57
A+SGLN A L+ NNI++ G+ T A + AG VG GV V+G+
Sbjct: 7 AMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVSGV 62


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
ECP_1070  FLGHOOKAP1  44     4e-07    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 43.8 bits (103), Expect = 4e-07
Identities = 18/81 (22%), Positives = 36/81 (44%), Gaps = 14/81 (17%)

Query: 3 SSLWIAKTGLDAQQTNMDVIANNLANVSTNGFKRQRAVFEDLLYQTIRQPGAQSSEQTTL 62
S + A +GL+A Q ++ +NN+++ + G+ RQ + + +TL
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTI--------------MAQANSTL 47

Query: 63 PSGLQIGTGVRPVATERLHSQ 83
+G +G GV +R +
Sbjct: 48 GAGGWVGNGVYVSGVQREYDA 68



Score = 41.1 bits (96), Expect = 3e-06
Identities = 11/41 (26%), Positives = 21/41 (51%)

Query: 220 ETSNVNVAEELVNMIQVQRAYEINSKAVSTTDQMLQKLTQL 260
S VN+ EE N+ + Q+ Y N++ + T + + L +
Sbjct: 505 SISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1071  FLGLRINGFLGH  350    e-126    Flagellar L-ring protein signature.
		>FLGLRINGFLGH#Flagellar L-ring protein signature.

Length = 232

Score = 350 bits (898), Expect = e-126
Identities = 232/232 (100%), Positives = 232/232 (100%)

Query: 4 MQKNAAHTYAISSLLVLSLTGCAWIPSTPLVQGATSAQPVPGPTPVANGSIFQSAQPINY 63
MQKNAAHTYAISSLLVLSLTGCAWIPSTPLVQGATSAQPVPGPTPVANGSIFQSAQPINY
Sbjct: 1 MQKNAAHTYAISSLLVLSLTGCAWIPSTPLVQGATSAQPVPGPTPVANGSIFQSAQPINY 60

Query: 64 GYQPLFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYLQGLFGNA 123
GYQPLFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYLQGLFGNA
Sbjct: 61 GYQPLFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYLQGLFGNA 120

Query: 124 RADVEASGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRF 183
RADVEASGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRF
Sbjct: 121 RADVEASGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRF 180

Query: 184 SGVVNPRTISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSPM 235
SGVVNPRTISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSPM
Sbjct: 181 SGVVNPRTISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSPM 232


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1072  FLGPRINGFLGI  426    e-151    Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 426 bits (1097), Expect = e-151
Identities = 157/363 (43%), Positives = 213/363 (58%), Gaps = 9/363 (2%)

Query: 4 FLSALILLLVITAAQAERIRDLTSVQGVRQNSLIGYGLVVGLDGTGDQTTQTPFTTQTLN 63
F + L A RI+D+ S+Q R N LIGYGLVVGL GTGD +PFT Q++
Sbjct: 13 FSALPFLSTPPAQADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDSLRSSPFTEQSMR 72

Query: 64 NMLSQLGITVPTGTNMQLKNVAAVMVTASLPPFGRQGQTIDVVVSSMGNAKSLRGGTLLM 123
ML LGIT G + KN+AAVMVTA+LPPF G +DV VSS+G+A SLRGG L+M
Sbjct: 73 AMLQNLGITTQGGQS-NAKNIAAVMVTANLPPFASPGSRVDVTVSSLGDATSLRGGNLIM 131

Query: 124 TPLKGVDSQVYALAQGNILVGGAGASAGGSSVQVNQLNGGRITNGAVIERELPSQFGVGN 183
T L G D Q+YA+AQG ++V G A +++ R+ NGA+IERELPS+F
Sbjct: 132 TSLSGADGQIYAVAQGALIVNGFSAQGDAATLTQGVTTSARVPNGAIIERELPSKFKDSV 191

Query: 184 TLNLQLNDEDFSMAQQIADTINRVR----GYGSATALDARTIQVRVPSGNSSQVRFLADI 239
L LQL + DFS A ++AD +N G A D++ I V+ P + R +A+I
Sbjct: 192 NLVLQLRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQEIAVQKPRV-ADLTRLMAEI 250

Query: 240 QNMQVNVTPQDAKVVINSRTGSVVMNREVTLDSCAVAQGNLSVTVNRQANVSQPDTPFGG 299
+N+ V T AKVVIN RTG++V+ +V + AV+ G L+V V V QP PF
Sbjct: 251 ENLTVE-TDTPAKVVINERTGTIVIGADVRISRVAVSYGTLTVQVTESPQVIQP-APFSR 308

Query: 300 GQTVVTPQTQIDLRQSGGSLQSVRSSASLNNVVRALNALGATPMDLMSILQSMQSAGCLR 359
GQT V PQT I Q G + ++ L +V LN++G +++ILQ ++SAG L+
Sbjct: 309 GQTAVQPQTDIMAMQEGSKV-AIVEGPDLRTLVAGLNSIGLKADGIIAILQGIKSAGALQ 367

Query: 360 AKL 362
A+L
Sbjct: 368 AEL 370


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_1073  FLGFLGJ  509    0.0      Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 509 bits (1311), Expect = 0.0
Identities = 309/313 (98%), Positives = 310/313 (99%)

Query: 1 MISDSKLLASAAWDAQSLNELKAKAGEDPAANIRPVARQVEGMFVQMMLKGMRDALPKDG 60
MISDSKLLASAAWDAQSLNELKAKAGEDPAANIRPVARQVEGMFVQMMLK MRDALPKDG
Sbjct: 1 MISDSKLLASAAWDAQSLNELKAKAGEDPAANIRPVARQVEGMFVQMMLKSMRDALPKDG 60

Query: 61 LFSSEHTRLYTSMYDQQIAQQMTAGKGLGLAEMMVKQMTPEQPLPEEPTPAAPMKFPLET 120
LFSSEHTRLYTSMYDQQIAQQMTAGKGLGLAEMMVKQMTPEQPLPEE TPAAPMKFPLET
Sbjct: 61 LFSSEHTRLYTSMYDQQIAQQMTAGKGLGLAEMMVKQMTPEQPLPEESTPAAPMKFPLET 120

Query: 121 VVRYQNQALSQLVQKAVPRNYDDSLPGNSKAFLAQLSLPAQLASQQSGVPHHLILAQAAL 180
VVRYQNQALSQLVQKAVPRNYDDSLPG+SKAFLAQLSLPAQLASQQSGVPHHLILAQAAL
Sbjct: 121 VVRYQNQALSQLVQKAVPRNYDDSLPGDSKAFLAQLSLPAQLASQQSGVPHHLILAQAAL 180

Query: 181 ESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRVYSSYL 240
ESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRVYSSYL
Sbjct: 181 ESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRVYSSYL 240

Query: 241 EALSDYVGLLTRNPRYAAVTTAVSAEQGAQALQDAGYATDPHYARKLTNMIQQMKSISDK 300
EALSDYVGLLTRNPRYAAVTTA SAEQGAQALQDAGYATDPHYARKLTNMIQQMKSISDK
Sbjct: 241 EALSDYVGLLTRNPRYAAVTTAASAEQGAQALQDAGYATDPHYARKLTNMIQQMKSISDK 300

Query: 301 VSKTYSMNIDNLF 313
VSKTYSMNIDNLF
Sbjct: 301 VSKTYSMNIDNLF 313


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
ECP_1074  FLGHOOKAP1  677    0.0      Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 677 bits (1747), Expect = 0.0
Identities = 540/546 (98%), Positives = 544/546 (99%)

Query: 2 SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVS 61
SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVS
Sbjct: 1 SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVS 60

Query: 62 GVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLV 121
GVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLV
Sbjct: 61 GVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLV 120

Query: 122 SNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLND 181
SNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLND
Sbjct: 121 SNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLND 180

Query: 182 QISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTA 241
QISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTA
Sbjct: 181 QISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTA 240

Query: 242 RQLAAVPSTADPSRTTVAYIDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQL 301
RQLAAVPS+ADPSRTTVAY+DGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQL
Sbjct: 241 RQLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQL 300

Query: 302 ALAFAGAFNTQHKAGFDANGDAGKDFFAIGKPAVLQNTKNNGDVAIGATVTDASAVLATD 361
ALAFA AFNTQHKAGFDANGDAG+DFFAIGKPAVLQNTKN GDVAIGATVTDASAVLATD
Sbjct: 301 ALAFAEAFNTQHKAGFDANGDAGEDFFAIGKPAVLQNTKNKGDVAIGATVTDASAVLATD 360

Query: 362 YKISFDNNQWQVTRLASNTTFTVTPDANGKVAFDGLELTFTGTPAVNDSFTLKPVSDAIV 421
YKISFDNNQWQVTRLASNTTFTVTPDANGKVAFDGLELTFTGTPAVNDSFTLKPVSDAIV
Sbjct: 361 YKISFDNNQWQVTRLASNTTFTVTPDANGKVAFDGLELTFTGTPAVNDSFTLKPVSDAIV 420

Query: 422 NMDVLITDEAKIAMASEEDAGDSDNRNGQALLDLQSNNKTVGGAKSFNDAYASLVSDIGN 481
NMDVLITDEAKIAMASEEDAGDSDNRNGQALLDLQSN+KTVGGAKSFNDAYASLVSDIGN
Sbjct: 421 NMDVLITDEAKIAMASEEDAGDSDNRNGQALLDLQSNSKTVGGAKSFNDAYASLVSDIGN 480

Query: 482 KTATLKTSSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFD 541
KTATLKTSSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFD
Sbjct: 481 KTATLKTSSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFD 540

Query: 542 ALINIR 547
ALINIR
Sbjct: 541 ALINIR 546


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
ECP_1075  FLAGELLIN  46     8e-08    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 46.2 bits (109), Expect = 8e-08
Identities = 42/226 (18%), Positives = 80/226 (35%), Gaps = 9/226 (3%)

Query: 7 MMYQQNMRGITNSQAEWMKYGEQMSTGKRVVNPSDDPIAASQAVVLSQAQAQNSQYTLAR 66
++ Q N+ +S + + E++S+G R+ + DD + A + +Q +
Sbjct: 11 LLTQNNLNKSQSSLSSAI---ERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQASRNA 67

Query: 67 TFATQKVSLEESVLSQVTTAIQNAQEKIVYASNGTLSDDDRASLATDIQGLRDQLLNLAN 126
E L+++ +Q +E V A+NGT SD D S+ +IQ +++ ++N
Sbjct: 68 NDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEIDRVSN 127

Query: 127 TTDGNGRYIFAGYKTETAPFSEADGDYVGGTESIKQQVDASRSMVIGHTGDKIFDSITSN 186
T NG + + DG E+I + +G G + +
Sbjct: 128 QTQFNGVKVLSQDNQMKIQVGANDG------ETITIDLQKIDVKSLGLDGFNVNGPKEAT 181

Query: 187 AVAEPDGSASETNLFAMLDSAIAALKTPVADSEADKETAAAALDKT 232
+ T A + + TA DK
Sbjct: 182 VGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKV 227


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
ECP_1076  IGASERPTASE  65     2e-12    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 65.1 bits (158), Expect = 2e-12
Identities = 42/226 (18%), Positives = 79/226 (34%), Gaps = 12/226 (5%)

Query: 590 PAEQSAPKAEAKPERQQDRR-----KPRQNNRRDRNERRDTRSERTEGSDNREENRRNRR 644
P+ S + A+ + N ++++++ D E +NR
Sbjct: 1008 PSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNRE 1067

Query: 645 QAQQQTAETRESRQQAEV------TEKARTTDEQQAPRRERSRRRNDDKRQAQQEVKALN 698
A++ + + + Q EV T++ +TT+ ++ E+ + + + QEV +
Sbjct: 1068 VAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEK-TQEVPKVT 1126

Query: 699 VEEQSVQETEQEERVRPVQPRRKQRQLNQKVRYEQSVAEEAVVAPVVEETVAAEPIVQEA 758
+ QE + + + R +N K Q+ P E + E V E+
Sbjct: 1127 SQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTES 1186

Query: 759 AAPRTELVKVPLPVVAQTAPEQQEENNADNRDNGGMPRRSRRSPRH 804
T V P A Q N+ + RRS RS H
Sbjct: 1187 TTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPH 1232



Score = 62.8 bits (152), Expect = 6e-12
Identities = 48/289 (16%), Positives = 92/289 (31%), Gaps = 38/289 (13%)

Query: 513 PSEEEFAERKRPEQPALATFAMPDVPPAPT-PAEPAAPVVAPAPKAATATPASPAQPGLL 571
P E+ + DVP P+ E A AP P A ATP+ +
Sbjct: 983 PEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTE---- 1038

Query: 572 SRFFGALKALFSGGEETKPAEQSAPKAEAKPERQQDRRKP-RQNNRRDRNERRDTRSER- 629
AE S +++ + +QD + QN + + + ++
Sbjct: 1039 -----------------TVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQ 1081

Query: 630 -TEGSDNREENRRNRRQAQQQTAETRESRQQAEVTEKARTTDEQQAPRRERSRRRNDDKR 688
E + + E + + ++TA + + TEK + + + + + +
Sbjct: 1082 TNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQP 1141

Query: 689 QAQ---QEVKALNVEEQSVQETEQEERVRPVQPRRKQRQLNQKVRYEQSV--AEEAVVAP 743
QA+ + +N++E Q + +P + + Q V +V V P
Sbjct: 1142 QAEPARENDPTVNIKEPQSQTNTTADTEQPA--KETSSNVEQPVTESTTVNTGNSVVENP 1199

Query: 744 VVEETVAAEPIVQEAAA------PRTELVKVPLPVVAQTAPEQQEENNA 786
+P V ++ R + VP V T A
Sbjct: 1200 ENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVA 1248


73  ECP_1192  ECP_1195  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
ECP_1192  4       33       -7.072654  iron transport protein, periplasmic-binding
ECP_1193  4       39       -9.903782  acetyltransferase
ECP_1194  2       39       -8.469940  transcriptional regulator
ECP_1195  1       35       -8.692130  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits      Score  E-value  Comments
ECP_1192  adhesinb  329    e-115    Adhesin B signature.
		>adhesinb#Adhesin B signature.

Length = 310

Score = 329 bits (846), Expect = e-115
Identities = 90/296 (30%), Positives = 163/296 (55%), Gaps = 7/296 (2%)

Query: 9 MLLGGLALTCSIAFQASATEKFKVITTFTIIADMAKNVAGDAAEVSSITKPGAEIHEYQP 68
+G A + + + + K V+ T +IIAD+ KN+AGD + SI G + HEY+P
Sbjct: 13 AFVGLAACSSQKSSTETGSSKLNVVATNSIIADITKNIAGDKINLHSIVPVGQDPHEYEP 72

Query: 69 TPGDIKRAQGAQLILANGMNLEL----WFQRFYQHLNGVPE---VIVSSGVTPVGITEGP 121
P D+K+ A LI NG+NLE WF + ++ VS GV + +
Sbjct: 73 LPEDVKKTSQADLIFYNGINLETGGNAWFTKLVENAKKKENKDYYAVSEGVDVIYLEGQS 132

Query: 122 YEGKPNPHAWMSPDNALIYVDNIRDALIKYDPANAQTYQRNADTYKAKITQTLAPLRKQI 181
+GK +PHAW++ +N +IY NI L + DPAN +TY++N Y K++ +++
Sbjct: 133 EKGKEDPHAWLNLENGIIYAQNIAKRLSEKDPANKETYEKNLKAYVEKLSALDKEAKEKF 192

Query: 182 TELPENQRWMVTSEGAFSYLARDLGLKELYLWPINADQQGTPQQVRKVVDIVKKNHIPAV 241
+P ++ +VTSEG F Y ++ + Y+W IN +++GTP Q++ +V+ ++K +P++
Sbjct: 193 NNIPGEKKMIVTSEGCFKYFSKAYNVPSAYIWEINTEEEGTPDQIKTLVEKLRKTKVPSL 252

Query: 242 FSESTISDKPARQVARETGAHYGGVLYVDSLSTENGPVPTYIDLLKVTTSTLVQGI 297
F ES++ D+P + V+++T ++ DS++ + +Y ++K + +G+
Sbjct: 253 FVESSVDDRPMKTVSKDTNIPIYAKIFTDSVAEKGEEGDSYYSMMKYNLEKIAEGL 308


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1193  SACTRNSFRASE  49     9e-10    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 49.2 bits (117), Expect = 9e-10
Identities = 17/61 (27%), Positives = 25/61 (40%)

Query: 126 NDYWWIKSFYIAPEHRGMGLADELIKHLIKEAKSEKALELRLYVHGDNGRAIRAYERCGF 185
N Y I+ +A ++R G+ L+ I+ AK L L N A Y + F
Sbjct: 87 NGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHF 146

Query: 186 I 186
I
Sbjct: 147 I 147


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_1194  HTHTETR  28     0.021    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 28.1 bits (62), Expect = 0.021
Identities = 9/41 (21%), Positives = 21/41 (51%), Gaps = 2/41 (4%)

Query: 3 KRAKNQIVDSDIARLLLKLRKSRNLTVTELAQRSGVSQAMI 43
+ + I+D A L + + ++ E+A+ +GV++ I
Sbjct: 10 QETRQHILDV--ALRLFSQQGVSSTSLGEIAKAAGVTRGAI 48


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1195  SACTRNSFRASE  30     0.002    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 30.3 bits (68), Expect = 0.002
Identities = 15/59 (25%), Positives = 23/59 (38%), Gaps = 5/59 (8%)

Query: 72 LEALFVDASARGLGVGKHLISHAL--ALHPD---LSVDVNEQNHQAVGFYQHMGFKLSG 125
+E + V R GVG L+ A+ A L ++ + N A FY F +
Sbjct: 92 IEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFIIGA 150


74  ECP_1269  ECP_1273  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
ECP_1269  -2      14       1.690121   hypothetical protein
ECP_1270  -1      20       2.143788   transcriptional regulator NarL
ECP_1271  -2      20       2.439507   nitrate/nitrite sensor protein NarX
ECP_1272  -1      26       2.629950   hypothetical protein
ECP_1273  -1      22       2.099051   nitrite extrusion protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_1269  INTIMIN  256    1e-78    Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 256 bits (656), Expect = 1e-78
Identities = 119/378 (31%), Positives = 195/378 (51%), Gaps = 21/378 (5%)

Query: 79 GEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDN 138
G+ AK ALG + Q + +++WL +G A V+++ N F GS + +P D+
Sbjct: 184 GDYAKDTALGIAGN----QASSQLQAWLQHYGTAEVNLQSGNN--FDGSSLDFLLPFYDS 237

Query: 139 DRYLTWSQLGLTQQDDGLVSNVGVGQRWARGSWLVGYNTFYDNLLDENLQRAGFGAEAWG 198
++ L + Q+G D +N+G GQR+ ++GYN F D + R G G E W
Sbjct: 238 EKMLAFGQVGARYIDSRFTANLGAGQRFFLPENMLGYNVFIDQDFSGDNTRLGIGGEYWR 297

Query: 199 EYLRLSANFYQPFAAWHE--QTATQEQRMARGYDLTARMRMPFYQHLNTSVSVEQYFGDR 256
+Y + S N Y + WHE ++R A G+D+ +P Y L + EQY+GD
Sbjct: 298 DYFKSSVNGYFRMSGWHESYNKKDYDERPANGFDIRFNGYLPSYPALGAKLMYEQYYGDN 357

Query: 257 VDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKK 316
V LFNS NP A ++G+NYTP+PLVT+ ++ G EN + Y+F P +
Sbjct: 358 VALFNSDKLQSNPGAATVGVNYTPIPLVTMGIDYRHGTGNENDLLYSMQFRYQFDKPWSQ 417

Query: 317 QLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQI 376
Q+ V E ++L GSRYD QRNN LEY+++ L++ + + T ++L +
Sbjct: 418 QIEPQYVNELRTLSGSRYDLVQRNNNIILEYKKQDILSLNI-PHDINGTERSTQKIQLIV 476

Query: 377 RSRYGIRQLIWQGDTQILS-----LTPGAQANSVEGWTLIMPDWQNGEGASNHWRLSVVV 431
+S+YG+ +++W D+ + S G+Q S + + I+P + +G SN ++++
Sbjct: 477 KSKYGLDRIVWD-DSALRSQGGQIQHSGSQ--SAQDYQAILPAYV--QGGSNVYKVTARA 531

Query: 432 EDNQGQRVSSNEITLTLV 449
D G SSN + LT+
Sbjct: 532 YDRNGN--SSNNVLLTIT 547


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
ECP_1270  HTHFIS  74     2e-17    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 73.7 bits (181), Expect = 2e-17
Identities = 32/117 (27%), Positives = 56/117 (47%), Gaps = 2/117 (1%)

Query: 7 ATILLIDDHPMLRTGVKQLISMAPDITVVGEASNGEQGIELAESLDPDLILLDLNMPGMN 66
ATIL+ DD +RT + Q +S A + SN + D DL++ D+ MP N
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRI--TSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 67 GLETLDKLREKSLSGRIVVFSVSNHEEDVVTALKRGADGYLLKDMEPEDLLKALHQA 123
+ L ++++ ++V S N + A ++GA YL K + +L+ + +A
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_1271  PF06580  53     1e-09    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 53.3 bits (128), Expect = 1e-09
Identities = 36/172 (20%), Positives = 73/172 (42%), Gaps = 23/172 (13%)

Query: 424 PESSRELLSQIRNELNASWAQLRELLTTFRLQLTEPGLRPALEASCEEYSAKFGFPVKLD 483
P +RE+L+ + + S + +LT +++ + S +F ++ +
Sbjct: 190 PTKAREMLTSLSELMRYSLRYSNARQVSLADELT------VVDSYLQLASIQFEDRLQFE 243

Query: 484 YQLPPRL----VPSHQAIHLLQIAREALSNALKH-----SQASEVVVTVAQNDNQVKLTV 534
Q+ P + VP L+Q E N +KH Q ++++ +++ V L V
Sbjct: 244 NQINPAIMDVQVPPM----LVQTLVE---NGIKHGIAQLPQGGKILLKGTKDNGTVTLEV 296

Query: 535 QDNGCGVPENAIRSNHYGMIIMRDRAQSLRG-DCRVRRRESGGTEVVVTFIP 585
++ G +N S G+ +R+R Q L G + +++ E G + IP
Sbjct: 297 ENTGSLALKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1273  ACRIFLAVINRP  31     0.011    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 31.0 bits (70), Expect = 0.011
Identities = 35/166 (21%), Positives = 60/166 (36%), Gaps = 22/166 (13%)

Query: 258 IMSLLYLATFGSFIGFSAGFAMLSKTQFPDVQILQYAFFGPFIGALARSA---GGALSDR 314
I+S + L+ + I A A L K + + FFG F S ++
Sbjct: 474 IVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKI 533

Query: 315 LGGTRVTLVNFILMAIFSGLLFLTLPTD----GQGGSFMAFFAVFLALFLTAGLGSGSTF 370
LG T L+ + L+ +LFL LP+ G F+ L +G+T
Sbjct: 534 LGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTM----------IQLPAGATQ 583

Query: 371 QMISVIFRKLTMDRVKAEGGSDER-----AMREAATDTAAALGFIS 411
+ + ++T +K E + E + A + F+S
Sbjct: 584 ERTQKVLDQVTDYYLKNEKANVESVFTVNGFSFSGQAQNAGMAFVS 629


75  ECP_1339  ECP_1349  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
ECP_1339  -1      13       -0.646897  RNase II stability modulator
ECP_1340  -2      12       0.292360   exoribonuclease II
ECP_1341  -1      11       0.337501   hypothetical protein
ECP_1342  -1      12       0.083099   enoyl-ACP reductase
ECP_1343  -2      14       -0.080105  TetR family transcriptional regulator
ECP_1344  -1      12       0.163574   multidrug-efflux transport protein
ECP_1345  -1      12       0.839865   multidrug efflux protein
ECP_1346  -2      13       0.815428   outer membrane efflux lipoprotein
ECP_1347  -1      14       0.857225   drug transport transmembrane protein
ECP_1348  -2      13       1.250164   peptide transport system ATP-binding protein
ECP_1349  -1      13       1.418104   peptide transport system ATP-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_1339  PF08280  30     0.043    M protein trans-acting positive regulator
		>PF08280#M protein trans-acting positive regulator

Length = 530

Score = 29.8 bits (67), Expect = 0.043
Identities = 21/105 (20%), Positives = 36/105 (34%), Gaps = 2/105 (1%)

Query: 526 PIDVELTESCLIENDELALSVIQQFSRLGAQVHLDDFGTGYSSLSQLARFPIDAIKLDQV 585
P+ V S I L S + FS + + ++ Q+ D +
Sbjct: 425 PLVVVFVASNFINAHLLTDSFPRYFS--DKSIDFHSYYLLQDNVYQIPDLKPDLVITHSQ 482

Query: 586 FVRDIHKQPVSQSLVRAIVAVAQALNLQVIAEGVESAKEDAFLTK 630
+ +H + V I L++Q + V+ K A LTK
Sbjct: 483 LIPFVHHELTKGIAVAEISFDESILSIQELMYQVKEEKFQADLTK 527


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1342  DHBDHDRGNASE  49     4e-09    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 49.3 bits (117), Expect = 4e-09
Identities = 50/260 (19%), Positives = 97/260 (37%), Gaps = 22/260 (8%)

Query: 4 LSGKRILVTGVASKLSIAYGIAQAMHREGAEL-AFTYQNDKLKGRVEEFAAQLGSDIVLQ 62
+ GK +TG A I +A+ + +GA + A Y +KL+ V A+
Sbjct: 6 IEGKIAFITGAAQ--GIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFP 63

Query: 63 CDVAEDTSIDTMFAELGKVWPKFDGFVHSIGF---APGDQLDGDYVNAVTREGFKIAHDI 119
DV + +ID + A + + D V+ G L + A F +
Sbjct: 64 ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEAT----FSVN--- 116

Query: 120 SSYSFVAMAKACRSMLNP-GSALLTLSYLGAERAIPNYNVMGLAKASLEANVRYMANAMG 178
S+ F A + M++ +++T+ A + +KA+ + + +
Sbjct: 117 STGVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELA 176

Query: 179 PEGVRVNAISAGPIRTLAASGI--------KDFRKMLAHCEAVTPIRRTVTIEDVGNSAA 230
+R N +S G T + + + L + P+++ D+ ++
Sbjct: 177 EYNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVL 236

Query: 231 FLCSDLSAGISGEVVHVDGG 250
FL S + I+ + VDGG
Sbjct: 237 FLVSGQAGHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_1343  HTHTETR  55     1e-11    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 55.0 bits (132), Expect = 1e-11
Identities = 17/65 (26%), Positives = 32/65 (49%)

Query: 1 MTSKLEIRHKQRQDEIINAARRCFRLCGFHAASMSQIASEAQLSVGQIYRYFANKDAIIE 60
M K + ++ + I++ A R F G + S+ +IA A ++ G IY +F +K +
Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60

Query: 61 EMVRR 65
E+
Sbjct: 61 EIWEL 65


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
ECP_1344  RTXTOXIND  48     2e-08    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 48.3 bits (115), Expect = 2e-08
Identities = 28/133 (21%), Positives = 55/133 (41%), Gaps = 10/133 (7%)

Query: 41 PVSVVSELTGR-TSAALSAEVRPQVGGIIQKRLFKEGDLVKAGQPLYQIDAASYQAAWNE 99
V +V+ G+ T + S E++P I+++ + KEG+ V+ G L ++ A +A +
Sbjct: 79 QVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLK 138

Query: 100 ARAALQQAQALVKADCQKAQRYARLVKENGVSQQDADDAQSTCAQDKASV--------EA 151
+++L QA+ Q R L K + D Q+ ++ +
Sbjct: 139 TQSSLLQARLEQ-TRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFST 197

Query: 152 KKAALETARINLD 164
+ +NLD
Sbjct: 198 WQNQKYQKELNLD 210



Score = 31.7 bits (72), Expect = 0.005
Identities = 15/116 (12%), Positives = 32/116 (27%), Gaps = 9/116 (7%)

Query: 83 QPLYQIDAASYQAAWN--EARAALQQAQALVKADCQKAQRYARLVKENGVSQQDADDAQS 140
L A + A + K+ ++ + KE Q ++
Sbjct: 241 SSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEE--YQLVTQLFKN 298

Query: 141 TCAQDKASVEAKKAALET----ARINLDWTTVTAPISGRI-GISSVTPGALVTASQ 191
L + + AP+S ++ + T G +VT ++
Sbjct: 299 EILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAE 354


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1345  ACRIFLAVINRP  1161   0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 1161 bits (3005), Expect = 0.0
Identities = 583/1033 (56%), Positives = 760/1033 (73%), Gaps = 6/1033 (0%)

Query: 3 SRFFVRRPVFAWVIAILIMLAGILAIRTLPVAQYPDVAPPTIKISATYTGASAETLENSV 62
+ FF+RRP+FAWV+AI++M+AG LAI LPVAQYP +APP + +SA Y GA A+T++++V
Sbjct: 2 ANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTV 61

Query: 63 TQVIEQQLTGLDNLLYFSSTSSSDGSVSINVTFEQGTDPDTAQVQVQNKIQQAESRLPSE 122
TQVIEQ + G+DNL+Y SSTS S GSV+I +TF+ GTDPD AQVQVQNK+Q A LP E
Sbjct: 62 TQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE 121

Query: 123 VQQTGVTVEKSQSNFLLIAAVYDTTDKASSSDIADWLVSNVQDPLARVEGVGSLQVFGAE 182
VQQ G++VEKS S++L++A + DI+D++ SNV+D L+R+ GVG +Q+FGA+
Sbjct: 122 VQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQ 181

Query: 183 YAMRIWLDPAKLASYSLMPSDVQSAIEAQNVQVTAGKIGALPSPNTQQLTATVRAQSRLQ 242
YAMRIWLD L Y L P DV + ++ QN Q+ AG++G P+ QQL A++ AQ+R +
Sbjct: 182 YAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFK 241

Query: 243 TVDQFKNIIVKSQSDGAVVRIKDVARVEMGSEDYTAIGKLNGHPSAGVAVMLSPGANALN 302
++F + ++ SDG+VVR+KDVARVE+G E+Y I ++NG P+AG+ + L+ GANAL+
Sbjct: 242 NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALD 301

Query: 303 TATLVKDKIAEFQRNMPQGYDIAYPKDSTEFIKISVEDVIQTLFEAIVLVVCVMYLFLQN 362
TA +K K+AE Q PQG + YP D+T F+++S+ +V++TLFEAI+LV VMYLFLQN
Sbjct: 302 TAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQN 361

Query: 363 LRATLIPALAVPVVLLGTFGVLALFGYSINTLTLFAMVLAIGLLVDDAIVVVENVERIMR 422
+RATLIP +AVPVVLLGTF +LA FGYSINTLT+F MVLAIGLLVDDAIVVVENVER+M
Sbjct: 362 MRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMM 421

Query: 423 DEGLPAREATEKSMGEISGALVAIALVLSAVFLPMAFFGGSTGVIYRQFSITIISAMLLS 482
++ LP +EATEKSM +I GALV IA+VLSAVF+PMAFFGGSTG IYRQFSITI+SAM LS
Sbjct: 422 EDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALS 481

Query: 483 VVVALTLTPALCGSVL----QHVPPHKKGFFGAFNRFYRRTEDKYQRGVIYVLRRAARTM 538
V+VAL LTPALC ++L +K GFFG FN + + + Y V +L R +
Sbjct: 482 VLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYL 541

Query: 539 GLYVVLGGGMALMMWKLPGSFLPTEDQGEIMVQYTLPAGATAARTAEVNRQIVDWFLINE 598
+Y ++ GM ++ +LP SFLP EDQG + LPAGAT RT +V Q+ D++L NE
Sbjct: 542 LIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNE 601

Query: 599 KANTDVIFTVDGFSFSGSGQNTGMAFVSLKNWSQRKGAENTAQAIALRATKELGTIRDAT 658
KAN + +FTV+GFSFSG QN GMAFVSLK W +R G EN+A+A+ RA ELG IRD
Sbjct: 602 KANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDGF 661

Query: 659 VFAMTPPAVDGLGQSNGFTFELLANGGTDRETLLQMRNQLIEKANQSP-ELHSVRANDLP 717
V PA+ LG + GF FEL+ G + L Q RNQL+ A Q P L SVR N L
Sbjct: 662 VIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGLE 721

Query: 718 QMPQLQVDIDSNKAVSLGLSLNDVTDTLSSAWGGTYVNDFIDRGRVKKVYIQGDSEFRSA 777
Q ++++D KA +LG+SL+D+ T+S+A GGTYVNDFIDRGRVKK+Y+Q D++FR
Sbjct: 722 DTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRML 781

Query: 778 PSDLGKWFVRGSDNAMTPFSAFATTRWLYGPERLVRYNGSAAYEIQGENATGFSSGDAMT 837
P D+ K +VR ++ M PFSAF T+ W+YG RL RYNG + EIQGE A G SSGDAM
Sbjct: 782 PEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAMA 841

Query: 838 KMEELANSLPAGTTWAWSGLSLQEKLASGQALSLYAVSILVVFLCLAALYESWSVPFSVI 897
ME LA+ LPAG + W+G+S QE+L+ QA +L A+S +VVFLCLAALYESWS+P SV+
Sbjct: 842 LMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSVM 901

Query: 898 LVIPLGLLGAALAAWMRDLNNDVYFQVALLTTIGLSSKNAILIVEFA-EAAVAEGYSLSR 956
LV+PLG++G LAA + + NDVYF V LLTTIGLS+KNAILIVEFA + EG +
Sbjct: 902 LVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVVE 961

Query: 957 AALRAAQTRLRPIIMTSLAFIAGVMPLAIATGAGANSRIAIGTGIIGGTLTATLLAIFFV 1016
A L A + RLRPI+MTSLAFI GV+PLAI+ GAG+ ++ A+G G++GG ++ATLLAIFFV
Sbjct: 962 ATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFV 1021

Query: 1017 PLFFVLVKRLFAG 1029
P+FFV+++R F G
Sbjct: 1022 PVFFVVIRRCFKG 1034



Score = 75.3 bits (185), Expect = 1e-15
Identities = 53/330 (16%), Positives = 117/330 (35%), Gaps = 19/330 (5%)

Query: 721 QLQVDIDSNKAVSLGLSLNDVTDTLSSA----WGGTYVNDFIDRGRVKKVYIQGDSEFRS 776
+++ +D++ L+ DV + L G G+ I + F++
Sbjct: 183 AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKN 242

Query: 777 APSDLGKWFVRGSDN-AMTPFSAFATTRWLYGPER--LVRYNGSAA-----YEIQGENAT 828
P + GK +R + + ++ A L G + R NG A G NA
Sbjct: 243 -PEEFGKVTLRVNSDGSVVRLKDVARVE-LGGENYNVIARINGKPAAGLGIKLATGANAL 300

Query: 829 GFSSGDAMTKMEELANSLPAG--TTWAWSGLSLQEKLASGQALSLYAVSILVVFLCLAAL 886
+ K+ EL P G + + + +L+ +LV + +
Sbjct: 301 DTAKA-IKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLV-MYLF 358

Query: 887 YESWSVPFSVILVIPLGLLGAALAAWMRDLNNDVYFQVALLTTIGLSSKNAILIVEFAEA 946
++ + +P+ LLG + + ++ IGL +AI++VE E
Sbjct: 359 LQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVER 418

Query: 947 AVAEGYSLSRAALRAAQTRLR-PIIMTSLAFIAGVMPLAIATGAGANSRIAIGTGIIGGT 1005
+ E + A + ++++ ++ ++ A +P+A G+ I+
Sbjct: 419 VMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAM 478

Query: 1006 LTATLLAIFFVPLFFVLVKRLFAGKPRRQE 1035
+ L+A+ P + + + + +
Sbjct: 479 ALSVLVALILTPALCATLLKPVSAEHHENK 508


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
ECP_1346  RTXTOXIND  29     0.048    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 29.0 bits (65), Expect = 0.048
Identities = 24/166 (14%), Positives = 49/166 (29%), Gaps = 11/166 (6%)

Query: 70 DVQKAIADIDSARALYGQTNASLFPTVNAALSSTRSRSLANGTGTTAEADGTVSSYTLDL 129
A AD ++ Q +RS L + + + +
Sbjct: 128 TALGAEADTLKTQSSLLQARL----EQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEE 183

Query: 130 FGRNQSLSRAARETWLASEFTAQNTRLTLIAEISTAWLTLAADNSNLALAKETMASAENS 189
R SL + TW ++ + AE T + + + ++
Sbjct: 184 VLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTV-------LARINRYENLSRVEKSR 236

Query: 190 LKIIQRQQQVGTAAATDVSEAMSVYQQARASVASYQTQVMQDKNAL 235
L A V E + Y +A + Y++Q+ Q ++ +
Sbjct: 237 LDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEI 282


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_1347  TCRTETA  68     1e-14    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 67.9 bits (166), Expect = 1e-14
Identities = 68/312 (21%), Positives = 114/312 (36%), Gaps = 18/312 (5%)

Query: 5 SLSWALILGLLAGIGPMCTDLYLPALPEMSEQLAATTTITQLTLTASLIGLGVGQLLFGP 64
L L L +G L +P LP + L + +T L + Q P
Sbjct: 6 PLIVILSTVALDAVG---IGLIMPVLPGLLRDLVHSNDVTA-HYGILLALYALMQFACAP 61

Query: 65 ----LSDKIGRKRPLILSLLLFIVSSILCATTNNIYWLVVWRFIQGIAGAGGSVLSRSIA 120
LSD+ GR+ L++SL V + AT ++ L + R + GI GA G+V IA
Sbjct: 62 VLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIA 121

Query: 121 RDKYQGVTLTQFFALLMTVNGLAPVLSPVLGGYIVSTFDWRTLFWVMAEISTVLLLGCLL 180
D G + F + G V PVLGG + F F+ A ++ + L
Sbjct: 122 -DITDGDERARHFGFMSACFGFGMVAGPVLGGLM-GGFSPHAPFFAAAALNGLNFLTGCF 179

Query: 181 FINETLPENKRGSSL----LLTGRSVVQNRRFMRFCLIQSFMLAGLFAYIGSSSFVL--Q 234
+ E+ +R L + + + F + L + ++ +V+ +
Sbjct: 180 LLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFF-IMQLVGQVPAALWVIFGE 238

Query: 235 KEFGFSPMQFSLVFGLNGI-GLIIASWIFSRLARRINAMTLLRGGLIAAILCALLTVLCA 293
F + + GI + + I +A R+ L G+IA +L
Sbjct: 239 DRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFAT 298

Query: 294 WVQLPIPALVAL 305
+ P +V L
Sbjct: 299 RGWMAFPIMVLL 310


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
ECP_1349  HTHFIS  31     0.007    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 31.0 bits (70), Expect = 0.007
Identities = 9/16 (56%), Positives = 14/16 (87%)

Query: 38 LVGESGSGKSLIAKAI 53
+ GESG+GK L+A+A+
Sbjct: 165 ITGESGTGKELVARAL 180


76  ECP_1825  ECP_1834  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
ECP_1825  0       12       1.757177   flagellar biosynthesis protein FlhB
ECP_1826  0       12       1.182433   chemotaxis regulator CheZ
ECP_1827  0       9        1.175090   chemotaxis regulatory protein CheY
ECP_1828  0       9        1.376770   chemotaxis-specific methylesterase
ECP_1829  -1      9        1.063405   chemotaxis methyltransferase CheR
ECP_1830  -1      11       0.561442   methyl-accepting chemotaxis protein II
ECP_1831  0       10       0.191811   purine-binding chemotaxis protein
ECP_1832  0       12       -0.357932  chemotaxis protein CheA
ECP_1833  0       14       -1.549714  flagellar motor protein MotB
ECP_1834  0       13       -1.804074  flagellar motor protein MotA
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1825  TYPE3IMSPROT  424    e-151    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 424 bits (1093), Expect = e-151
Identities = 95/346 (27%), Positives = 178/346 (51%), Gaps = 2/346 (0%)

Query: 5 SDDKTEAPTPHRLEKAREEGQIPRSRELTSLLILLVGVSVIWFGGVSLARRLSGMLSAGL 64
S +KTE PTP ++ AR++GQ+ +S+E+ S +++ +++ S ++ +
Sbjct: 2 SGEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLML--I 59

Query: 65 HFDHSIINDPNLILGQIILLIREAMLALLPLISGVVLVAIISPVMLGGLVFSGKSLQPKF 124
+ S + + + ++ E PL++ L+AI S V+ G + SG++++P
Sbjct: 60 PAEQSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDI 119

Query: 125 SKLNPLPGIKRMFSAQTGAELLKAILKTILVGSVTGFFLWHHWPQMMRLMAESPITAMGN 184
K+NP+ G KR+FS ++ E LK+ILK +L+ + + + +++L
Sbjct: 120 KKINPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPL 179

Query: 185 AMDLVGLCALLVVLGVIPMVGFDVFFQIFSHLKKLRMSRQDIRDEFKQSEGDPHVKGRIR 244
++ ++ +G + + D F+ + ++K+L+MS+ +I+ E+K+ EG P +K + R
Sbjct: 180 LGQILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRR 239

Query: 245 QMQRAAARRRMMADVPKADVIVNNPTHYSVALQYDENKMSAPKVVAKGAGLVALRIREIG 304
Q + R M +V ++ V+V NPTH ++ + Y + P V K +R+I
Sbjct: 240 QFHQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIA 299

Query: 305 AENNVPTLEAPPLARALYRHAEIGQQIPGQLYAAVAEVLAWVWQLK 350
E VP L+ PLARALY A + IP + A AEVL W+ +
Sbjct: 300 EEEGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQN 345


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
ECP_1827  HTHFIS  88     9e-24    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 88.3 bits (219), Expect = 9e-24
Identities = 30/105 (28%), Positives = 51/105 (48%), Gaps = 3/105 (2%)

Query: 7 KFLVVDDFSTMRRIVRNLLKELGFNNVEEAEDGLDALNKLQAGGYGFVISDWNMPNMDGL 66
LV DD + +R ++ L G++ V + + AG V++D MP+ +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYD-VRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 67 ELLKTIRADGAMSALPVLMVTAEAKKENIIAAAQAGASGYVVKPF 111
+LL I+ LPVL+++A+ I A++ GA Y+ KPF
Sbjct: 64 DLLPRIKKARPD--LPVLVMSAQNTFMTAIKASEKGAYDYLPKPF 106


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
ECP_1828  HTHFIS  66     3e-14    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 66.4 bits (162), Expect = 3e-14
Identities = 35/188 (18%), Positives = 73/188 (38%), Gaps = 23/188 (12%)

Query: 1 MSKIRVLSVDDSALMRQIMTEIINSHSDMEMVATAPDPLVARDLIKKFNPDVLTLDVEMP 60
M+ +L DD A +R ++ + ++ V + I + D++ DV MP
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAG--YDVRITSNAATLWRWIAAGDGDLVVTDVVMP 58

Query: 61 RMDGLDFLEKLMRLRPMPVVMVSSLTGKGS-EVTLRALELGAIDFVTKPQLGIREGMLAY 119
+ D L ++ + RP V+V ++ + + ++A E GA D++ KP + E +
Sbjct: 59 DENAFDLLPRIKKARPDLPVLV--MSAQNTFMTAIKASEKGAYDYLPKP-FDLTELIGII 115

Query: 120 SEMIAEKVRTAAKASLAAHKPLSVPTTLKAGPLLSSEKLIAIGASTGGTEAIRHVLQPLP 179
+AE R +K + + + +G S E R + + +
Sbjct: 116 GRALAEPKRRPSKLEDDSQDGMPL-----------------VGRSAAMQEIYRVLARLMQ 158

Query: 180 LSSPALLI 187
++
Sbjct: 159 TDLTLMIT 166


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_1832  PF06580  43     3e-06    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 42.9 bits (101), Expect = 3e-06
Identities = 22/151 (14%), Positives = 49/151 (32%), Gaps = 52/151 (34%)

Query: 359 ELDKSLIERIIDPLT--HLVRNSLDHGIELPEKRLAAGKNSVGNLILSAEHQGGNICIEV 416
+++ ++++ + P+ LV N + HGI G ++L G + +EV
Sbjct: 245 QINPAIMDVQVPPMLVQTLVENGIKHGIA--------QLPQGGKILLKGTKDNGTVTLEV 296

Query: 417 TDDGAGLNRERILAKAASQGLTVSENMSDDEVAMLIFAPGFSTAEQVTDVSGRGVGMDVV 476
+ G+ + G G+ V
Sbjct: 297 ENTGSLALKNTK--------------------------------------ESTGTGLQNV 318

Query: 477 KRNIQEM---DGHVEIQSKQGTGTTIRILLP 504
+ +Q + + +++ KQG +L+P
Sbjct: 319 RERLQMLYGTEAQIKLSEKQG-KVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_1833  PF05272  31     0.009    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.8 bits (69), Expect = 0.009
Identities = 22/93 (23%), Positives = 35/93 (37%), Gaps = 11/93 (11%)

Query: 46 LISISSPKELIQIAEYFRTPLATAVTGGDRISNSESPIPGGGDDYTQSQGEVNKQPNIEE 105
L +SSP A P + G + ++ PGGGDD GE +++
Sbjct: 384 LADVSSPTAAAGGAGGGEPPKKRDPSAG---AGTDPGGPGGGDD-----GEDPFGEWLDD 435

Query: 106 LKKRM---EQSRLRKLRGDLDQLIESDPKLRAL 135
R+ + L+ R L + + S P L
Sbjct: 436 EVARLRLRGRWLLKPRRAALIEALRSAPALAGC 468


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_1834  PF05844  33     0.001    YopD protein
		>PF05844#YopD protein

Length = 295

Score = 33.1 bits (75), Expect = 0.001
Identities = 12/28 (42%), Positives = 22/28 (78%), Gaps = 2/28 (7%)

Query: 76 MDLLALLYRLMAKSRQMGMFSLERDIEN 103
++LL +L+R+ K+R++G+ L+RD EN
Sbjct: 74 VELLLILFRIAQKARELGV--LQRDNEN 99


77  ECP_1857  ECP_1864  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
ECP_1857  -1      13       -1.256410   flagellin
ECP_1858  -2      16       0.448133    flagellar capping protein
ECP_1859  -2      14       0.267316    flagellar protein FliS
ECP_1860  -1      13       0.619068    flagellar biosynthesis protein FliT
ECP_1861  0       13       0.067367    alpha-amylase
ECP_1862  0       18       -1.253385   hypothetical protein
ECP_1863  -1      20       -4.240410   inner membrane protein
ECP_1864  3       35       -7.542407   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
ECP_1857  FLAGELLIN  224    1e-68    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 224 bits (571), Expect = 1e-68
Identities = 244/553 (44%), Positives = 298/553 (53%), Gaps = 46/553 (8%)

Query: 2 AQVINTNSLSLITQNNINKNQSALSSSIERLSSGLRINSAKDDAAGQAIANRFTSNIKGL 61
AQVINTNSLSL+TQNN+NK+QS+LSS+IERLSSGLRINSAKDDAAGQAIANRFTSNIKGL
Sbjct: 1 AQVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGL 60

Query: 62 TQAARNANDGISVAQTTEGALSEINNNLQRIRELTVQASTGTNSDSDLDSIQDEIKSRLD 121
TQA+RNANDGIS+AQTTEGAL+EINNNLQR+REL+VQA+ GTNSDSDL SIQDEI+ RL+
Sbjct: 61 TQASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLE 120

Query: 122 EIDRVSGQTQFNGVNVLAKDGSMKIQVGANDGQTITIDLKKIDSDTLGLSGFNVNGKGAV 181
EIDRVS QTQFNGV VL++D MKIQVGANDG+TITIDL+KID +LGL GFNVNG
Sbjct: 121 EIDRVSNQTQFNGVKVLSQDNQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKEA 180

Query: 182 ANTAATKDDLVAASVSAAVGNEYTVSAGLSKSTAADVIASLTDGATVTAAGVSNGFAAGA 241
++ + T V +
Sbjct: 181 TVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDD 240

Query: 242 TGNAYKFNQANNTFTYNTTSTAAELQSYLTPKAGDTATFSVEIGSTKQDVVLASDGKITA 301
N + T + T+ A A G + D T
Sbjct: 241 AENNTAVDLFKTTKSTAGTAEA-------------KAIAGAIKGGKEGDTFDYKGVTFTI 287

Query: 302 KDGSKLYIDTTGNLTQNGGGTLEEATLNGLAFNHSGPAAAVQSTITTADGTSIVLAGSGD 361
+K D G ++ G T T A+ + L S +
Sbjct: 288 --DTKTGNDGNGKVSTTINGEKVTLT-------------VADITAGAANVDAATLQSSKN 332

Query: 362 FGTTKTAGAINVTGAVISADALLSASKATGFTSGAYTVGTDGVVKSGGNDVYNKADGTGL 421
T+ G + A LS +A G + +G +
Sbjct: 333 VYTSVVNGQFTFDDKTKNESAKLSDLEANNAVKGESKITVNGAEYTANAAGDKVTLAGKT 392

Query: 422 TTDNTTKYYLQDDGSVTNGSGKAVYVDATGKLTTDAETKAATTADPLKALDEAISSIDKF 481
+ + DA +TA+PL ++D A+S +D
Sbjct: 393 MF------------------IDKTASGVSTLINEDAAAAKKSTANPLASIDSALSKVDAV 434

Query: 482 RSSLGAVQNRLDSAVTNLNNTTTNLSEAQSRIQDADYATEVSNMSKAQIIQQAGNSVLAK 541
RSSLGA+QNR DSA+TNL NT TNL+ A+SRI+DADYATEVSNMSKAQI+QQAG SVLA+
Sbjct: 435 RSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSNMSKAQILQQAGTSVLAQ 494

Query: 542 ANQVPQQVLSLLQ 554
ANQVPQ VLSLL+
Sbjct: 495 ANQVPQNVLSLLR 507


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1858  TYPE3OMBPROT  33     0.003    Type III secretion system outer membrane B protein ...
		>TYPE3OMBPROT#Type III secretion system outer membrane B protein family signature.
Length = 538

Score = 32.7 bits (74), Expect = 0.003
Identities = 24/72 (33%), Positives = 37/72 (51%), Gaps = 2/72 (2%)

Query: 214 NGMEVSVAAQNAQLTVNNVAIENSSNTISDALENITLNLNDVTTGNQTLTITQDTSKAQT 273
N E +VAA+N + + A+ + +S AL T++L V+T LT T T ++
Sbjct: 236 NSSERAVAARNKAEELVSAALYSRPELLSQALSGKTVDLKIVSTS--LLTPTSLTGGEES 293

Query: 274 AIKDWVNAYNSL 285
+KD VNA L
Sbjct: 294 MLKDQVNALKGL 305


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
ECP_1863  RTXTOXIND  30     0.017    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.
Length = 478

Score = 30.2 bits (68), Expect = 0.017
Identities = 10/57 (17%), Positives = 17/57 (29%), Gaps = 2/57 (3%)

Query: 164 RFTLLPIFRIPVKMQKVSAASPLTQKPDQARRRF--RLGMLVFFGMLGWALLTAMNQ 218
R L R + + + A L + P R R M ++L +
Sbjct: 26 RKQLDTPVREKDENEFLPAHLELIETPVSRRPRLVAYFIMGFLVIAFILSVLGQVEI 82


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_1864  PF01206  93     6e-29    SirA family protein
		>PF01206#SirA family protein

Length = 76

Score = 92.5 bits (230), Expect = 6e-29
Identities = 16/71 (22%), Positives = 37/71 (52%)

Query: 7 DYRLDMVGEPCPYPAVATLEAMPQLKKGEILEVVSDCPQSINNIPLDARNHGYTVLDIQQ 66
D LD G CP P + + + + GE+L V++ P S+ + ++ G+ +L+ ++
Sbjct: 5 DQSLDATGLNCPLPILKAKKTLATMNAGEVLYVMATDPGSVKDFESFSKQTGHELLEQKE 64

Query: 67 DGPTIRYLIQK 77
+ T + +++
Sbjct: 65 EDGTYHFRLKR 75


78  ECP_1867  ECP_1884  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
ECP_1867  0       25       -4.274300   outer membrane porin protein LC
ECP_1868  -2      15       -1.737085   transcriptional regulator YbcM
ECP_1869  0       13       1.446573    kinase inhibitor
ECP_1870  -1      15       3.809362    multidrug efflux protein
ECP_1871  1       17       4.610756    flagellar hook-basal body protein FliE
ECP_1872  1       15       4.351223    flagellar MS-ring protein
ECP_1873  1       18       4.450707    flagellar motor switch protein G
ECP_1874  -1      16       3.727657    flagellar assembly protein H
ECP_1875  -2      17       3.436748    flagellum-specific ATP synthase
ECP_1876  -1      16       2.297228    flagellar biosynthesis chaperone
ECP_1877  -1      16       2.460311    flagellar hook-length control protein
ECP_1878  -2      21       2.100825    flagellar basal body protein FliL
ECP_1879  0       17       0.745595    flagellar motor switch protein FliM
ECP_1880  1       16       -2.017411   flagellar motor switch protein FliN
ECP_1881  0       16       -2.845722   flagellar biosynthesis protein FliO
ECP_1882  0       18       -3.772207   flagellar biosynthesis protein FliP
ECP_1883  0       19       -4.123408   flagellar biosynthesis protein FliQ
ECP_1884  -3      15       -2.565244   flagellar biosynthesis protein FliR
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
ECP_1867  ECOLIPORIN  510    0.0      E.coli/Salmonella-type porin signature.
		>ECOLIPORIN#E.coli/Salmonella-type porin signature.

Length = 383

Score = 510 bits (1314), Expect = 0.0
Identities = 240/388 (61%), Positives = 282/388 (72%), Gaps = 33/388 (8%)

Query: 1 MKKLTVAISAVAASVLMAMSAQAAEIYNKDSNKLDLYGKVNAKHYFSSNDADDGDTTYVR 60
MK+ +A+ V ++L A +A AAEIYNKD NKLDLYGKV+ HYFS + + DGD TY+R
Sbjct: 1 MKRKVLAL--VIPALLAAGAAHAAEIYNKDGNKLDLYGKVDGLHYFSDDSSKDGDQTYMR 58

Query: 61 LGFKGETQINDQLTGFGQWEYEFKGNRAESQGSSKDKTRLAFAGLKFGDYGSIDYGRNYG 120
+GFKGETQINDQLTG+GQWEY + N E +G++ TRLAFAGLKFGDYGS DYGRNYG
Sbjct: 59 VGFKGETQINDQLTGYGQWEYNVQANTTEGEGANS-WTRLAFAGLKFGDYGSFDYGRNYG 117

Query: 121 VAYDIGAWTDVLPEFGGDTWTQTDVFMTGRTTGVATYRNNDFFGLVDGLNFAAQYQGKND 180
V YD+ WTD+LPEFGGD++T D +MTGR GVATYRN DFFGLVDGLNFA QYQGKN+
Sbjct: 118 VLYDVEGWTDMLPEFGGDSYTYADNYMTGRANGVATYRNTDFFGLVDGLNFALQYQGKNE 177

Query: 181 R----------------TDVTEANGDGFGFSTTYEY-EGFGVGATYAKSDRTNDQVIYGN 223
D+ NGDGFG STTY+ GF GA Y SDRTN+QV G
Sbjct: 178 SQSADDVNIGTNNRNNGDDIRYDNGDGFGISTTYDIGMGFSAGAAYTTSDRTNEQVNAGG 237

Query: 224 NSLNASGQNAEVWAAGLKYDANNIYLATTYSETQNMTVFG------NNHIANKAQNFEVV 277
A G A+ W AGLKYDANNIYLAT YSET+NMT +G + +ANK QNFEV
Sbjct: 238 T--IAGGDKADAWTAGLKYDANNIYLATMYSETRNMTPYGKTDKGYDGGVANKTQNFEVT 295

Query: 278 AQYQFDFGLRPSVAYLQSKGKDLG----AWGDQDLVEYIDVGATYYFNKNMSTFVDYKIN 333
AQYQFDFGLRP+V++L SKGKDL D+DLV+Y DVGATYYFNKN ST+VDYKIN
Sbjct: 296 AQYQFDFGLRPAVSFLMSKGKDLTYNNVNGDDKDLVKYADVGATYYFNKNFSTYVDYKIN 355

Query: 334 LIDKSD-FTKASGVATDDIVAVGLVYQF 360
L+D D F K +G++TDDIVA+G+VYQF
Sbjct: 356 LLDDDDPFYKDAGISTDDIVALGMVYQF 383


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
ECP_1871  FLGHOOKFLIE  117    8e-38    Flagellar hook-basal body complex protein FliE signa...
		>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE signature.
Length = 103

Score = 117 bits (293), Expect = 8e-38
Identities = 102/103 (99%), Positives = 102/103 (99%)

Query: 2 SAIQGIEGVISQLQATAMSARAQESLPQPTISFAGQLHAALDRISDTQTVARTQAEKFTL 61
SAIQGIEGVISQLQATAMSARAQESLPQPTISFAGQLHAALDRISDTQT ARTQAEKFTL
Sbjct: 1 SAIQGIEGVISQLQATAMSARAQESLPQPTISFAGQLHAALDRISDTQTAARTQAEKFTL 60

Query: 62 GEPGVALNDVMTDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV 104
GEPGVALNDVMTDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV
Sbjct: 61 GEPGVALNDVMTDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV 103


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1872  FLGMRINGFLIF  751    0.0      Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 751 bits (1941), Expect = 0.0
Identities = 476/555 (85%), Positives = 513/555 (92%), Gaps = 5/555 (0%)

Query: 3 ATAAQTKSLEWLNRLRANPKIPLIVAGSAAVAVMVALILWAKAPDYRTLFSNLSDQDGGA 62
+TA Q K LEWLNRLRANP+IPLIVAGSAAVA++VA++LWAK PDYRTLFSNLSDQDGGA
Sbjct: 5 STATQPKPLEWLNRLRANPRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQDGGA 64

Query: 63 IVSQLTQMNIPYRFSEASGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQ 122
IV+QLTQMNIPYRF+ SGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQ
Sbjct: 65 IVAQLTQMNIPYRFANGSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQ 124

Query: 123 FSEQVNYQRALEGELSRTIETIGPVKGARVHLAMPKPSLFVREQKSPSASVTVNLLPGRA 182
FSEQVNYQRALEGEL+RTIET+GPVK ARVHLAMPKPSLFVREQKSPSASVTV L PGRA
Sbjct: 125 FSEQVNYQRALEGELARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRA 184

Query: 183 LDEGQISAIVHLVSSAVAGLPPGNVTLVDQGGHLLTQSNTSGRDLNDAQLKYASDVEGRI 242
LDEGQISA+VHLVSSAVAGLPPGNVTLVDQ GHLLTQSNTSGRDLNDAQLK+A+DVE RI
Sbjct: 185 LDEGQISAVVHLVSSAVAGLPPGNVTLVDQSGHLLTQSNTSGRDLNDAQLKFANDVESRI 244

Query: 243 QRRIEAILSPIVGNGNIHAQVSAQLDFASKEQTEEQYRPNGDESHAALRSRQLNESEQSG 302
QRRIEAILSPIVGNGN+HAQV+AQLDFA+KEQTEE Y PNGD S A LRSRQLN SEQ G
Sbjct: 245 QRRIEAILSPIVGNGNVHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNISEQVG 304

Query: 303 SGYPGGVPGALSNQPAPANNAPISTPPTNQNNRQQ--QASTTSNS---GPRSTQRNETSN 357
+GYPGGVPGALSNQPAP N API+TPPTNQ N Q Q ST++NS GPRSTQRNETSN
Sbjct: 305 AGYPGGVPGALSNQPAPPNEAPIATPPTNQQNAQNTPQTSTSTNSNSAGPRSTQRNETSN 364

Query: 358 YEVDRTIRHTKMNVGDVQRLSVAVVVNYKTLPDGKPLPLSNEQMKQIEALTREAMGFSEK 417
YEVDRTIRHTKMNVGD++RLSVAVVVNYKTL DGKPLPL+ +QMKQIE LTREAMGFS+K
Sbjct: 365 YEVDRTIRHTKMNVGDIERLSVAVVVNYKTLADGKPLPLTADQMKQIEDLTREAMGFSDK 424

Query: 418 RGDSLNVVNSPFNSSDESGGALPFWQQQAFIDQLLAAGRWLLVLLVAWLLWRKAVRPQLT 477
RGD+LNVVNSPF++ D +GG LPFWQQQ+FIDQLLAAGRWLLVL+VAW+LWRKAVRPQLT
Sbjct: 425 RGDTLNVVNSPFSAVDNTGGELPFWQQQSFIDQLLAAGRWLLVLVVAWILWRKAVRPQLT 484

Query: 478 RRAEAVKTVQQQAQAREEVEDAVEVRLSKDEQLQQRRANQRLGAEVMSQRIREMSDNDPR 537
RR E K Q+QAQ R+E E+AVEVRLSKDEQLQQRRANQRLGAEVMSQRIREMSDNDPR
Sbjct: 485 RRVEEAKAAQEQAQVRQETEEAVEVRLSKDEQLQQRRANQRLGAEVMSQRIREMSDNDPR 544

Query: 538 VVALVIRQWINNDHE 552
VVALVIRQW++NDHE
Sbjct: 545 VVALVIRQWMSNDHE 559


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1873  FLGMOTORFLIG  341    e-119    Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 341 bits (876), Expect = e-119
Identities = 117/329 (35%), Positives = 197/329 (59%), Gaps = 2/329 (0%)

Query: 1 MSNLTGTDKSVILLMTIGEDRAAEVFKHLSQREVQTLSAAMANVTQISNKQLTDVLAEFE 60
+S LTG K+ ILL++IG + +++VFK+LSQ E+++L+ +A + I+++ +VL EF+
Sbjct: 12 VSALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITSELKDNVLLEFK 71

Query: 61 QEAEQFAALNINANDYLRSVLVKALGEERAASLLEDILETRDTASGIETLNFMEPQSAAD 120
+ + DY R +L K+LG ++A ++ + L + + E + +P + +
Sbjct: 72 ELMMAQEFIQKGGIDYARELLEKSLGTQKAVDIINN-LGSALQSRPFEFVRRADPANILN 130

Query: 121 LIRDEHPQIIATILVHLKRAQAADILALFDERLRHDVMLRIATFGGVQPAALAELTEVLN 180
I+ EHPQ IA IL +L +A+ IL+ ++ +V RIA P + E+ VL
Sbjct: 131 FIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEVVREVERVLE 190

Query: 181 GLLDGQ-NLKRSKMGGVRTAAEIINLMKTQQEEAVITAVREFDGELAQKIIDEMFLFENL 239
L + + GGV EIIN+ + E+ +I ++ E D ELA++I +MF+FE++
Sbjct: 191 KKLASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEIKKKMFVFEDI 250

Query: 240 VDVDDRSIQRLLQEVDSESLLIALKGAEQPLREKFLRNMSQRAADILRDDLANRGPVRLS 299
V +DDRSIQR+L+E+D + L ALK + P++EK +NMS+RAA +L++D+ GP R
Sbjct: 251 VLLDDRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKEDMEFLGPTRRK 310

Query: 300 QVENEQKAILLIVRRLAETGEMVIGSGED 328
VE Q+ I+ ++R+L E GE+VI G +
Sbjct: 311 DVEESQQKIVSLIRKLEEQGEIVISRGGE 339


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_1874  FLGFLIH  370    e-134    Flagellar assembly protein FliH signature.
		>FLGFLIH#Flagellar assembly protein FliH signature.

Length = 228

Score = 370 bits (951), Expect = e-134
Identities = 224/228 (98%), Positives = 227/228 (99%)

Query: 1 MSDNLPWKTWMPDDLAPPQAEFVPMVEPEETIIEEAEPSLEQQLAQLQMQAHEQGYQAGI 60
MSDNLPWKTW PDDLAPPQAEFVP+VEPEETIIEEAEPSLEQQLAQLQMQAHEQGYQAGI
Sbjct: 1 MSDNLPWKTWTPDDLAPPQAEFVPIVEPEETIIEEAEPSLEQQLAQLQMQAHEQGYQAGI 60

Query: 61 AEGRQQGHEQGYQEGLAQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTLDALDSVIASRL 120
AEGRQQGH+QGYQEGLAQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTLDALDSVIASRL
Sbjct: 61 AEGRQQGHKQGYQEGLAQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTLDALDSVIASRL 120

Query: 121 MQMALEAARQVIGQTPTMDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRVDDMLGAT 180
MQMALEAARQVIGQTPT+DNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRVDDMLGAT
Sbjct: 121 MQMALEAARQVIGQTPTVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRVDDMLGAT 180

Query: 181 LSLHGWRLRGDPTLHPGGCKVSADEGDLDASVATRWQELCRLAAPGVV 228
LSLHGWRLRGDPTLHPGGCKVSADEGDLDASVATRWQELCRLAAPGVV
Sbjct: 181 LSLHGWRLRGDPTLHPGGCKVSADEGDLDASVATRWQELCRLAAPGVV 228


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_1876  FLGFLIJ  202    2e-70    Flagellar FliJ protein signature.
		>FLGFLIJ#Flagellar FliJ protein signature.

Length = 147

Score = 202 bits (515), Expect = 2e-70
Identities = 146/147 (99%), Positives = 147/147 (100%)

Query: 1 MAEHGALATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAG 60
MAEHGALATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAG
Sbjct: 1 MAEHGALATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAG 60

Query: 61 MTSNRWINYQQFIQTLEKAITQHRQQLNQWTQKVDIALNSWREKKQRLQAWQTLQERQST 120
+TSNRWINYQQFIQTLEKAITQHRQQLNQWTQKVDIALNSWREKKQRLQAWQTLQERQST
Sbjct: 61 ITSNRWINYQQFIQTLEKAITQHRQQLNQWTQKVDIALNSWREKKQRLQAWQTLQERQST 120

Query: 121 AALLAENRLDQKKMDEFAQRAAMRKPE 147
AALLAENRLDQKKMDEFAQRAAMRKPE
Sbjct: 121 AALLAENRLDQKKMDEFAQRAAMRKPE 147


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
ECP_1877  FLGHOOKFLIK  469    e-168    Flagellar hook-length control protein signature.
		>FLGHOOKFLIK#Flagellar hook-length control protein signature.

Length = 375

Score = 469 bits (1208), Expect = e-168
Identities = 367/375 (97%), Positives = 370/375 (98%)

Query: 1 MIRLAPLITADVDTTTLPGGKASDAAQDFLALLSEALAGETTTDKAAPQLLVATDKPTTK 60
MIRLAPLITADVDTTTLPGGKASDAAQDFLALLSEALAGETTTDKAAPQLLVATDKPTTK
Sbjct: 1 MIRLAPLITADVDTTTLPGGKASDAAQDFLALLSEALAGETTTDKAAPQLLVATDKPTTK 60

Query: 61 GEPLVSDIVSDAQQADLLIPVDETLPVINDEQSTSTPLTTAQTMTLAAVADKNTTKDEKA 120
GEPL+SDIVSDAQQA+LLIPVDET PVINDEQSTSTPLTTAQTM LAAVADKNTTKDEKA
Sbjct: 61 GEPLISDIVSDAQQANLLIPVDETPPVINDEQSTSTPLTTAQTMALAAVADKNTTKDEKA 120

Query: 121 DDLNEDVTASLSALFAMLPGFDNTPKVTDAPSTVLPAEKPTLFTKLTSAQLTTAQPDDAP 180
DDLNEDVTASLSALFAMLPGFDNTPKVTDAPSTVLP EKPTLFTKLTS QLTTAQPDDAP
Sbjct: 121 DDLNEDVTASLSALFAMLPGFDNTPKVTDAPSTVLPTEKPTLFTKLTSEQLTTAQPDDAP 180

Query: 181 GTPAQPLTPLVAEAQSKAEVISTPSPVTADASPLITPHQTQPLPTVAAPVLSAPLGSHEW 240
GTPAQPLTPLVAEAQSKAEVISTPSPVTA ASPLITPHQTQPLPTVAAPVLSAPLGSHEW
Sbjct: 181 GTPAQPLTPLVAEAQSKAEVISTPSPVTAAASPLITPHQTQPLPTVAAPVLSAPLGSHEW 240

Query: 241 QQSLSQHISLFTRQGQQSAELRLHPQDLGEVQISLKVDDNQAQIQMISPHQHVRAALEAA 300
QQSLSQHISLFTRQGQQSAELRLHPQDLGEVQISLKVDDNQAQIQM+SPHQHVRAALEAA
Sbjct: 241 QQSLSQHISLFTRQGQQSAELRLHPQDLGEVQISLKVDDNQAQIQMVSPHQHVRAALEAA 300

Query: 301 LPVLRTQLAESGIQLGQSNISGESFSGQQQAASQQQQSQRTANHEPLAGEDDDTLPVPVS 360
LPVLRTQLAESGIQLGQSNISGESFSGQQQAASQQQQSQRTANHEPLAGEDDDTLPVPVS
Sbjct: 301 LPVLRTQLAESGIQLGQSNISGESFSGQQQAASQQQQSQRTANHEPLAGEDDDTLPVPVS 360

Query: 361 LQGRVTGNSGVDIFA 375
LQGRVTGNSGVDIFA
Sbjct: 361 LQGRVTGNSGVDIFA 375


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1879  FLGMOTORFLIM  385    e-136    Flagellar motor switch protein FliM signature.
		>FLGMOTORFLIM#Flagellar motor switch protein FliM signature.

Length = 344

Score = 385 bits (989), Expect = e-136
Identities = 85/324 (26%), Positives = 148/324 (45%), Gaps = 10/324 (3%)

Query: 20 ILSQAEIDALLNGDS--EVKDEPTASVSGESDIRPYDPNTQRRVVRERLQALEIINERFA 77
+LSQ EID LL S + E +S I YD + +E+++ L +++E FA
Sbjct: 4 VLSQDEIDQLLTAISSGDASIEDARPISDTRKITLYDFRRPDKFSKEQMRTLSLMHETFA 63

Query: 78 RHFRMGLFNLLRRSPDITVGAIRIQPYHEFARNLPVPTNLNLIHLKPLRGTGLVVFSPSL 137
R L LR + V ++ Y EF R++P P+ L +I + PL+G ++ PS+
Sbjct: 64 RLTTTSLSAQLRSMVHVHVASVDQLTYEEFIRSIPTPSTLAVITMDPLKGNAVLEVDPSI 123

Query: 138 VFIAVDNLFGGDGRFPTKVEGREFTHTEQRVINRMLKLALEGYSDAWKAINPLEVEYVRS 197
F +D LFGG G+ KV+ R+ T E V+ ++ L ++W + L +
Sbjct: 124 TFSIIDRLFGGTGQ-AAKVQ-RDLTDIENSVMEGVIVRILANVRESWTQVIDLRPRLGQI 181

Query: 198 EMQVKFTNITTSPNDIVVNTPFHVEIGNLTGEFNICLPFSMIEPLRELLVNPPLENS--R 255
E +F I P+++VV ++G G N C+P+ IEP+ L + +S R
Sbjct: 182 ETNPQFAQI-VPPSEMVVLVTLETKVGEEEGMMNFCIPYITIEPIISKLSSQFWFSSVRR 240

Query: 256 NEDQNWRDNLVRQVQHSQLELVANFADISLRLSQILKLKPGDVLPIEKP---DRIIAHVD 312
+ + L ++ +++VA + L + IL L+ GD++ + D + +
Sbjct: 241 SSTTQYMGVLRDKLSTVDMDVVAEVGSLRLSVRDILGLRVGDIIRLHDTHVGDPFVLSIG 300

Query: 313 GVPVLTSQYGTLNGQYALRIEHLI 336
Q G + + A +I I
Sbjct: 301 NRKKFLCQPGVVGKKIAAQILERI 324


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1880  FLGMOTORFLIN  211    4e-74    Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 211 bits (538), Expect = 4e-74
Identities = 125/137 (91%), Positives = 133/137 (97%)

Query: 1 MSDMNNPADDNNGAMDDLWAEALSEQKSTSGKSAADAVFQQFGGGDVSGTLQDIDLIMDI 60
MSDMNNP+D+N GA+DDLWA+AL+EQK+T+ KSAADAVFQQ GGGDVSG +QDIDLIMDI
Sbjct: 1 MSDMNNPSDENTGALDDLWADALNEQKATTTKSAADAVFQQLGGGDVSGAMQDIDLIMDI 60

Query: 61 PVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYGV 120
PVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYGV
Sbjct: 61 PVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYGV 120

Query: 121 RITDIITPSERMRRLSR 137
RITDIITPSERMRRLSR
Sbjct: 121 RITDIITPSERMRRLSR 137


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1882  FLGBIOSNFLIP  333    e-119    Escherichia coli: Flagellar biosynthetic protein Fl...
		>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature.
Length = 245

Score = 333 bits (856), Expect = e-119
Identities = 244/245 (99%), Positives = 244/245 (99%)

Query: 1 MRRLFSVAPVLLWLITPLAFAQLPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILLM 60
MRRL SVAPVLLWLITPLAFAQLPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILLM
Sbjct: 1 MRRLLSVAPVLLWLITPLAFAQLPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILLM 60

Query: 61 MTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEEK 120
MTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEEK
Sbjct: 61 MTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEEK 120

Query: 121 ISMQEALEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSELK 180
ISMQEALEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSELK
Sbjct: 121 ISMQEALEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSELK 180

Query: 181 TAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLA 240
TAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLA
Sbjct: 181 TAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLA 240

Query: 241 QSFYS 245
QSFYS
Sbjct: 241 QSFYS 245


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1883  TYPE3IMQPROT  67     1e-18    Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature.
Length = 86

Score = 67.1 bits (164), Expect = 1e-18
Identities = 22/78 (28%), Positives = 42/78 (53%)

Query: 4 ESVMMMGTEAMKVALALAAPLLLVALVTGLIISILQAATQINEMTLSFIPKIIAVFIAII 63
+ ++ G +A+ + L L+ +VA + GL++ + Q TQ+ E TL F K++ V + +
Sbjct: 2 DDLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLF 61

Query: 64 IAGPWMLNLLLDYVRTLF 81
+ W +LL Y R +
Sbjct: 62 LLSGWYGEVLLSYGRQVI 79


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1884  TYPE3IMRPROT  202    6e-67    Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature.
Length = 261

Score = 202 bits (516), Expect = 6e-67
Identities = 256/261 (98%), Positives = 259/261 (99%)

Query: 1 MMQVTSDQWLSWLSLYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPA 60
M+QVTS+QWLSWL+LYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPA
Sbjct: 1 MLQVTSEQWLSWLNLYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPA 60

Query: 61 NDVPVFSFFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPGSHL 120
NDVPVFSFFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDP SHL
Sbjct: 61 NDVPVFSFFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHL 120

Query: 121 NMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGSEPLNSNAFLALTKAGSLIF 180
NMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIG EPLNSNAFLALTKAGSLIF
Sbjct: 121 NMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLIF 180

Query: 181 LNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFC 240
LNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFC
Sbjct: 181 LNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFC 240

Query: 241 EHLFSEIFNLLADIISELPLI 261
EHLFSEIFNLLADIISELPLI
Sbjct: 241 EHLFSEIFNLLADIISELPLI 261


79  ECP_1894  ECP_1902  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
ECP_1894  -2      15       -2.385976   DNA cytosine methylase
ECP_1895  -2      20       -3.853836   hypothetical protein
ECP_1896  -2      28       -7.757468   hypothetical protein
ECP_1897  -1      30       -8.535845   hypothetical protein
ECP_1898  -1      30       -8.231570   outer membrane pore protein
ECP_1899  0       34       -7.884585   hypothetical protein
ECP_1900  -1      29       -6.729814   chaperone protein HchA
ECP_1901  0       33       -6.982386   sensor-like histidine kinase YedV
ECP_1902  1       28       -6.108630   transcriptional regulatory protein YedW
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_1894  PF05272  29     0.045    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.3 bits (65), Expect = 0.045
Identities = 20/62 (32%), Positives = 29/62 (46%), Gaps = 15/62 (24%)

Query: 320 AKYILTPVLWKYLYRYAKKHQARGNGFGYGMVYPNNPQSVTRTLSARYYKDGAEILIDRG 379
A+Y + PVLW Y+ R+ K + G+ VY +R +DG+E RG
Sbjct: 166 ARYQVGPVLWGYVVRFIK---SDGDKLTLPYVY------------SRSQRDGSEAWKWRG 210

Query: 380 WD 381
WD
Sbjct: 211 WD 212


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1895  CARBMTKINASE  34     2e-04    Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 34.4 bits (79), Expect = 2e-04
Identities = 22/92 (23%), Positives = 36/92 (39%), Gaps = 9/92 (9%)

Query: 37 AQKLAADDDVDMLVILTACYFHDIVSLAKNHPQRQRSSILAAEETRRLLREEFVQFPA-- 94
+KLA + + D+ +ILT + +L + Q + EE R+ E F A
Sbjct: 219 GEKLAEEVNADIFMILTDV---NGAALYYGTEKEQWLREVKVEELRKYYEEG--HFKAGS 273

Query: 95 --EKIEAVCHAIAAHSFSAQIAPLTTEAKIVQ 124
K+ A I A IA L + ++
Sbjct: 274 MGPKVLAAIRFIEWGGERAIIAHLEKAVEALE 305


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
ECP_1898  ECOLIPORIN  410    e-145    E.coli/Salmonella-type porin signature.
		>ECOLIPORIN#E.coli/Salmonella-type porin signature.

Length = 383

Score = 410 bits (1054), Expect = e-145
Identities = 199/388 (51%), Positives = 246/388 (63%), Gaps = 41/388 (10%)

Query: 1 MKRKVLAMLVPALLVAGAANAAEIYNKNGNKVELYGKMVGERILTDRESGEKGDNSQDTS 60
MKRKVLA+++PALL AGAA+AAEIYNK+GNK++LYGK+ G +D S D +
Sbjct: 1 MKRKVLALVIPALLAAGAAHAAEIYNKDGNKLDLYGKVDGLHYFSDDSS-----KDGDQT 55

Query: 61 YARVGVKGETQINPELTGYGQFELDLEASNRHNPDQ---TRLAYAGLSYKDFGSFDYGRN 117
Y RVG KGETQIN +LTGYGQ+E +++A+ TRLA+AGL + D+GSFDYGRN
Sbjct: 56 YMRVGFKGETQINDQLTGYGQWEYNVQANTTEGEGANSWTRLAFAGLKFGDYGSFDYGRN 115

Query: 118 VGVAYDAEAFTDMFVEWGGDSWAGTDLFMTNRTNGVATYRNTDFFGMVEGLNFALQYQGK 177
GV YD E +TDM E+GGDS+ D +MT R NGVATYRNTDFFG+V+GLNFALQYQGK
Sbjct: 116 YGVLYDVEGWTDMLPEFGGDSYTYADNYMTGRANGVATYRNTDFFGLVDGLNFALQYQGK 175

Query: 178 NEGTGNY----------------KANGDGHGLSATYTID-GFSFAGAYANSDRTDWQSGD 220
NE NGDG G+S TY I GFS AY SDRT+ Q
Sbjct: 176 NESQSADDVNIGTNNRNNGDDIRYDNGDGFGISTTYDIGMGFSAGAAYTTSDRTNEQVNA 235

Query: 221 GK----GERAEVWALSTKYDANNVYAAVMYGESHNM-------NSDDGDVVNKTQNFEAV 269
G G++A+ W KYDANN+Y A MY E+ NM DG V NKTQNFE
Sbjct: 236 GGTIAGGDKADAWTAGLKYDANNIYLATMYSETRNMTPYGKTDKGYDGGVANKTQNFEVT 295

Query: 270 LQYQFDFGLRPSIGYSYSKALDVA----GYKDSDRLNYIEIGTWYYFNKNMNVYTAYQIN 325
QYQFDFGLRP++ + SK D+ D D + Y ++G YYFNKN + Y Y+IN
Sbjct: 296 AQYQFDFGLRPAVSFLMSKGKDLTYNNVNGDDKDLVKYADVGATYYFNKNFSTYVDYKIN 355

Query: 326 LLDKSD-YVLAHGLNTDDQLAVGIVYQF 352
LLD D + G++TDD +A+G+VYQF
Sbjct: 356 LLDDDDPFYKDAGISTDDIVALGMVYQF 383


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_1901  PF06580  37     1e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 37.2 bits (86), Expect = 1e-04
Identities = 35/181 (19%), Positives = 64/181 (35%), Gaps = 37/181 (20%)

Query: 290 ENILFLARADKNNVLVKLDALS----------------LNKEVENLLDYL--EYLSDEKE 331
NI L D L +LS L E+ + YL + E
Sbjct: 180 NNIRALILEDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDR 239

Query: 332 IRFKVECNQQIFADKI---LLQRMLSNLIVNAIRYSPEKSRIHITSFLDANGSLNIDIAS 388
++F+ + N I ++ L+Q ++ N I + I P+ +I + D NG++ +++ +
Sbjct: 240 LQFENQINPAIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKD-NGTVTLEVEN 298

Query: 389 PGTKINEPEKLFRRFWRGDNSRHSVGQGLGLSLVKA-IAELHGGSATYHYLSKHNVFRIT 447
G+ + K G GL V+ + L+G A K
Sbjct: 299 TGSLALKNTKE--------------STGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAM 344

Query: 448 L 448
+
Sbjct: 345 V 345


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
ECP_1902  HTHFIS  84     9e-21    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 84.5 bits (209), Expect = 9e-21
Identities = 30/117 (25%), Positives = 60/117 (51%), Gaps = 1/117 (0%)

Query: 39 KILLIEDNQRTQEWVTQGLSEAGYVIDAVSDGRDGLYLALKDDYALIILDIMLPGMDGWQ 98
IL+ +D+ + + Q LS AGY + S+ D L++ D+++P + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 99 ILQTLRTA-KQTPVICLTARDSVDDRVRGLDSGANDYLVKPFSFSELLARVRAQLRQ 154
+L ++ A PV+ ++A+++ ++ + GA DYL KPF +EL+ + L +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121


80  ECP_1911  ECP_1918  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
ECP_1911  4       43       -10.535154  PilV-like protein
ECP_1912  4       47       -13.813704  type IV pilin protein
ECP_1913  5       47       -14.402670  hypothetical protein
ECP_1914  5       44       -12.906968  hypothetical protein
ECP_1915  7       47       -15.447446  hypothetical protein
ECP_1916  5       46       -14.220229  hypothetical protein
ECP_1917  4       48       -15.497456  hypothetical protein
ECP_1918  5       42       -11.578413  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1911  BCTERIALGSPH  33     0.002    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.
Length = 170

Score = 32.6 bits (74), Expect = 0.002
Identities = 19/86 (22%), Positives = 38/86 (44%), Gaps = 8/86 (9%)

Query: 2 IKKKGFTLLEVTIVL---GIGTLIAFMKFQDMRNDQEAVLADNVGTQIKQLGE--AVNRY 56
++++GFTLLE+ ++L G+ + + F R+D A Q++ + +
Sbjct: 1 MRQRGFTLLEMMLILLLMGVSAGMVLLAFPASRDDSAAQTLARFEAQLRFVQQRGLQTGQ 60

Query: 57 ---ISIRYDKISTLSSSNNQSSDPGP 79
+S+ D+ L +DP P
Sbjct: 61 FFGVSVHPDRWQFLVLEARDGADPAP 86


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1912  PilS_PF08805  73     8e-19    PilS N terminal
		>PilS_PF08805#PilS N terminal

Length = 185

Score = 73.4 bits (180), Expect = 8e-19
Identities = 47/179 (26%), Positives = 84/179 (46%), Gaps = 17/179 (9%)

Query: 7 KRKSKKGFSLLELLLVLGIIAALVVAAFIVYPKVQASQRAQAESNNIATIQAGVKALYTS 66
K++ KG +L+E+LLV+G+I L +A+ +Y VQ++ ++ E NN+ T+ A +K+L
Sbjct: 21 KKEQDKGATLMEVLLVVGVIVVLAASAYKLYSMVQSNIQSSNEQNNVLTVIANMKSLKFQ 80

Query: 67 AS-SFTGLTNTVAVQAKIFPDNMLSGSGTAAKPINAFKGNVTLAATATGPSSATGSSFTI 125
+ + T+ + P +M+ T A N + G+VT+ S+ SF +
Sbjct: 81 GRYTDSNYIKTL-YAQGLLPSDMI-ADTTGASAKNPWGGSVTITT------SSDKYSFNV 132

Query: 126 TYDNVPAAECVKIATAAAGNFYITTVGTKVVKAAGGTLDVAATAAACTNATSNTLVFTS 184
NVP C+ + A + + T +AA + SNTL F++
Sbjct: 133 VEANVPQKNCMAMVNA--------LRSSSAISKINNTSTSTVSAATVCASDSNTLTFST 183


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1914  CHANLCOLICIN  30     0.022    Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 30.0 bits (67), Expect = 0.022
Identities = 23/85 (27%), Positives = 41/85 (48%), Gaps = 1/85 (1%)

Query: 184 ERIDHRSLRTQCADALAQAE-EAFSAEEKAFWLAKATETNRPAMQRVHRAKWNDTESQEQ 242
E + H + RT A LA A A AE++ LAKA E R + +A + +++
Sbjct: 100 EALRHNASRTPSATELAHANNAAMQAEDERLRLAKAEEKARKEAEAAEKAFQEAEQRRKE 159

Query: 243 RAAEQAQRDQQIEEAKKVYTTFSEL 267
E+A+ ++Q++ A+ + L
Sbjct: 160 IEREKAETERQLKLAEAEEKRLAAL 184


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_1918  ACRIFLAVINRP  27     0.006    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 27.5 bits (61), Expect = 0.006
Identities = 9/48 (18%), Positives = 17/48 (35%), Gaps = 1/48 (2%)

Query: 29 VLYGTYPGWYAAVVLLLTFGLSTLIGMSTGMAGATISLPIIAVVGFIA 76
L Y W V ++L L ++G+ + +VG +
Sbjct: 886 CLAALYESWSIPVSVMLVVPL-GIVGVLLAATLFNQKNDVYFMVGLLT 932


81  ECP_1948  ECP_1954  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
ECP_1948   1      31       -6.247490  hypothetical protein
ECP_1949  -2      27       -4.583593  hypothetical protein
ECP_1950  -2      25       -4.088334  hypothetical protein
ECP_1951  -2      30       -5.804468  hypothetical protein
ECP_1952  -2      29       -6.337703  hypothetical protein
ECP_1953  -2      27       -4.236505  hypothetical protein
ECP_1954  -2      26       -3.925883  shikimate transporter
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_1948  INTIMIN  75     2e-17    Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 74.7 bits (183), Expect = 2e-17
Identities = 20/60 (33%), Positives = 29/60 (48%), Gaps = 3/60 (5%)

Query: 162 QQIASTSQLIGSLLAEDMNSEQAANIARGWASSQASGVMTDWLSRFGTARITLGVDEDFS 221
QQ AS + S +N + A + A G A +QAS + WL +GTA + L +F
Sbjct: 168 QQAASLGSQLQS---RSLNGDYAKDTALGIAGNQASSQLQAWLQHYGTAEVNLQSGNNFD 224


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_1950  INTIMIN  56     3e-10    Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 55.8 bits (134), Expect = 3e-10
Identities = 62/263 (23%), Positives = 91/263 (34%), Gaps = 20/263 (7%)

Query: 175 IAVKAHVNDQFGNPVTHQPATFSAAPSSQMIISQNTVSTNTQGVAEVTMTPERNGSYTVK 234
I A V G + P +F+ S ++S N+ +TN G A VT+ ++ G V
Sbjct: 578 ITYTATVKKN-GVAQANVPVSFNIV-SGTAVLSANSANTNGSGKATVTLKSDKPGQVVVS 635

Query: 235 ASLANGASLEKQLEAI---DEKLTLTSSPLIGVNAPKGATLTATLT---SANGTPVEGQV 288
A A S I K ++T A T T PV Q
Sbjct: 636 AKTAEMTSALNANAVIFVDQTKASITEIKADKTTAVANGQDAITYTVKVMKGDKPVSNQE 695

Query: 289 INFSVTLEGATLSGGKVRTNSSGQAPVVLTSNKVGTYTVTASFHNGVTIQTQTTVKVTGN 348
+ F+ TL LS +T+++G A V LTS G V+A + V+
Sbjct: 696 VTFTTTL--GKLSNSTEKTDTNGYAKVTLTSTTPGKSLVSARVSDVAVDVKAPEVEFFTT 753

Query: 349 PSTAHVASFIADPSTIAATNSDLSTLKATVEDGSGNL-IEGLTVYFALKSGSTTLTSLTA 407
+ I T ++ G NL G + +S + + S
Sbjct: 754 LTID------DGNIEIVGTGVKGKLPTVWLQYGQVNLKASGGNGKYTWRSANPAIAS--- 804

Query: 408 VTDQNGIATTSVKGEITGSVTVS 430
V +G T KG T SV S
Sbjct: 805 VDASSGQVTLKEKGTTTISVISS 827



Score = 52.4 bits (125), Expect = 3e-09
Identities = 46/170 (27%), Positives = 65/170 (38%), Gaps = 7/170 (4%)

Query: 271 TLTATLTSANGTPVEGQVINFSVTLEGATLSGGKVRTNSSGQAPVVLTSNKVGTYTVTAS 330
T TAT+ NG ++F++ A LS TN SG+A V L S+K G V+A
Sbjct: 579 TYTATVKK-NGVAQANVPVSFNIVSGTAVLSANSANTNGSGKATVTLKSDKPGQVVVSAK 637

Query: 331 FHNGV-TIQTQTTVKVTGNPSTAHVASFIADPSTIAATNSDLSTLKATVEDGSGNLIEGL 389
+ + V + A + AD +T A D T V G +
Sbjct: 638 TAEMTSALNANAVIFVDQ--TKASITEIKADKTTAVANGQDAITYTVKVMKG-DKPVSNQ 694

Query: 390 TVYFALKSGSTTLTSLTAVTDQNGIATTSVKGEITGSVTVSAVTSAGGMQ 439
V F G + + T TD NG A ++ G VSA S +
Sbjct: 695 EVTFTTTLGKLSNS--TEKTDTNGYAKVTLTSTTPGKSLVSARVSDVAVD 742



Score = 51.2 bits (122), Expect = 7e-09
Identities = 51/233 (21%), Positives = 89/233 (38%), Gaps = 16/233 (6%)

Query: 13 AVTDADGKAKVTLKGTKAGAHTVTASMVGGKS--EQLVVNFTADTLTAQVNLNVTEDNFI 70
A T+ GKA VTLK K G V+A S V F T + + + +
Sbjct: 612 ANTNGSGKATVTLKSDKPGQVVVSAKTAEMTSALNANAVIFVDQTKASITEIKADKTTAV 671

Query: 71 ANNIGMTRLQATVTDGNGNPVEGIKVNFRGTSVTLSSTSVETDDQVFAEILVTSTEVGLK 130
AN V PV +V F T LS+++ +TD +A++ +TST G
Sbjct: 672 ANGQDAITYTVKVMK-GDKPVSNQEVTFTTTLGKLSNSTEKTDTNGYAKVTLTSTTPGKS 730

Query: 131 TVSASLADKPTEVISRLLN----AKVDVNSATI----TSQEIPEGQVMVAQDIAVKAHVN 182
VSA ++D +V + + +D + I ++P + Q + N
Sbjct: 731 LVSARVSDVAVDVKAPEVEFFTTLTIDDGNIEIVGTGVKGKLPTVWLQYGQVNLKASGGN 790

Query: 183 DQFGNPVTHQPATFSAAPSSQMIISQNTVSTNTQGVAEVTMTPERNGSYTVKA 235
++ + A S Q+ + + +T + V + + +YT+
Sbjct: 791 GKYTWRSANPAIASVDASSGQVTLKEKGTTTIS-----VISSDNQTATYTIAT 838



Score = 40.1 bits (93), Expect = 2e-05
Identities = 35/213 (16%), Positives = 63/213 (29%), Gaps = 18/213 (8%)

Query: 4 NFTLSDGDKAVTDADGKAKVTLKGTKAGAHTVTASMVGGKSE--QLVVNFTADTLTAQVN 61
TD +G AKVTL T G V+A + + V F N
Sbjct: 701 TLGKLSNSTEKTDTNGYAKVTLTSTTPGKSLVSARVSDVAVDVKAPEVEFFTTLTIDDGN 760

Query: 62 LNVTEDNFIANNIGMTRLQATVTDGNGN-PVEGIKVNFRGTSVTLSSTSVETDDQVFAEI 120
+ + + + + G N G + S + SV+
Sbjct: 761 IEI-----VGTGVKGKLPTVWLQYGQVNLKASGGNGKYTWRSANPAIASVDASSGQ---- 811

Query: 121 LVTSTEVGLKTVSASLADKPTEVISRLLNAKVDVNSATITSQEIPEGQVMVAQDIAVKAH 180
VT E G T+S +D T + + + ++ + V ++
Sbjct: 812 -VTLKEKGTTTISVISSDNQT--ATYTIATPNSLIVPNMSKRVTYNDAVNTCKNFGG--- 865

Query: 181 VNDQFGNPVTHQPATFSAAPSSQMIISQNTVST 213
N + + + AA + S T+ +
Sbjct: 866 KLPSSQNELENVFKAWGAANKYEYYKSSQTIIS 898


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_1951  INTIMIN  28     0.022    Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 27.7 bits (61), Expect = 0.022
Identities = 22/129 (17%), Positives = 46/129 (35%), Gaps = 6/129 (4%)

Query: 11 KISAIDYSQNINGDYKATVTGGGEGIATLIPVLNGVHQAGLSTTIEFISAETRPMTGTVS 70
K+S + NG K T+T G + + ++ V + +EF G +
Sbjct: 704 KLSNSTEKTDTNGYAKVTLTSTTPGKSLVSARVSDVAVDVKAPEVEFF-TTLTIDDGNIE 762

Query: 71 VNSANLPTASFPSQGFTGAYYQLNNDNFAPGKTAADYSFSSSASWVGVDATGKVTFKNDG 130
+ + P+ L + G + ++ A ++G+VT K G
Sbjct: 763 IVGTGV-KGKLPTVWLQYGQVNL---KASGGNGKYTWRSANPAIASVDASSGQVTLKEKG 818

Query: 131 DSNTVIITA 139
+ T+ + +
Sbjct: 819 -TTTISVIS 826


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_1954  TCRTETB  33     0.002    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 33.3 bits (76), Expect = 0.002
Identities = 39/259 (15%), Positives = 96/259 (37%), Gaps = 18/259 (6%)

Query: 79 LGGVIFGHFGDRLGRKRMLMLTVWMMGIATALIGILPSFSTIGWWAPILLVTLRAIQGFA 138
+G ++G D+LG KR+L+ + + + + + SF ++ I+ ++ A
Sbjct: 64 IGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSL----LIMARFIQGAGAAA 119

Query: 139 VGGEWGGAALLSVESAPKNKK-AFYSSGVQVGYGVGLLLSTGLVSLISMMTTDEQFLSWG 197
+ + K S V +G GVG + + I
Sbjct: 120 FPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYI------------H 167

Query: 198 WRIPFLFSIVLVLGALWVRNGMEESAEFEQQQHNQAAAKKRIPVIEALLRHPGAFLKIIA 257
W L ++ ++ ++ +++ + + + ++ +L + +
Sbjct: 168 WSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLI 227

Query: 258 LRLCELLTMYIVTAFALNYSTQNMGLPRELFLNIGLLVGGLSCLTIPCFAWLADRFGRRR 317
+ + L +++ + + GL + + IG+L GG+ T+ F + +
Sbjct: 228 VSVLSFL-IFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDV 286

Query: 318 VYITGALIGTLSAFPFFMA 336
++ A IG++ FP M+
Sbjct: 287 HQLSTAEIGSVIIFPGTMS 305


82  ECP_2109  ECP_2119  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
ECP_2109  -1      12        1.914774  chaperone
ECP_2110  -3      12        2.727939  hypothetical protein
ECP_2111  -2      17        3.947476  hypothetical protein
ECP_2112  -2      18        4.310149  hypothetical protein
ECP_2113  -2      17        3.976071  hypothetical protein
ECP_2114  -2      17        4.014197  multidrug efflux system subunit MdtA
ECP_2115  -2      18        3.886296  multidrug efflux system subunit MdtB
ECP_2116  -2      14        2.558738  multidrug efflux system subunit MdtC
ECP_2117  -2      13       -2.698916  multidrug efflux system protein MdtE
ECP_2118   0      22       -5.406269  signal transduction histidine-protein kinase
ECP_2119   0      31       -8.922560  DNA-binding transcriptional regulator BaeR
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_2109  SHAPEPROTEIN  49     2e-08    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein

signature.
Length = 347

Score = 49.0 bits (117), Expect = 2e-08
Identities = 32/129 (24%), Positives = 58/129 (44%), Gaps = 20/129 (15%)

Query: 132 AMMLH-IRQQAQAQLPEAITQAVIGRPINFQGLGGDEANTQAQGILERAAKRAGFKDVVF 190
M+ H I+Q + ++ P+ + + + I E +A+ AG ++V
Sbjct: 89 KMLQHFIKQVHSNSFMRPSPRVLVCVPVGATQV-------ERRAIRE-SAQGAGAREVFL 140

Query: 191 QYEPVAAGLDYEATLQEEKRVLVVDIGGGTTDCSLLLMGPQWRARLDREASLLGHSGCRI 250
EP+AA + + E +VVDIGGGTT+ +++ + ++ S RI
Sbjct: 141 IEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLN-----------GVVYSSSVRI 189

Query: 251 GGNDLDIAL 259
GG+ D A+
Sbjct: 190 GGDRFDEAI 198



Score = 34.7 bits (80), Expect = 6e-04
Identities = 32/137 (23%), Positives = 56/137 (40%), Gaps = 23/137 (16%)

Query: 332 RLSYRLV---RSAEESKIALSSV--AETRASLPFISDELAT------LISQQGLESALSQ 380
R +Y + +AE K + S + + LA ++ + AL +
Sbjct: 203 RRNYGSLIGEATAERIKHEIGSAYPGDEVREIEVRGRNLAEGVPRGFTLNSNEILEALQE 262

Query: 381 PLARILEQVQLALDNAQEKPDV--------IYLTGGSARSPLIKKALAEQLPGIPIAGGD 432
PL I+ V +AL+ Q P++ + LTGG A + + L E+ GIP+ +
Sbjct: 263 PLTGIVSAVMVALE--QCPPELASDISERGMVLTGGGALLRNLDRLLMEET-GIPVVVAE 319

Query: 433 D-FGSVTAGLARWAEVV 448
D V G + E++
Sbjct: 320 DPLTCVARGGGKALEMI 336


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
ECP_2114  RTXTOXIND  49     2e-08    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 48.7 bits (116), Expect = 2e-08
Identities = 48/369 (13%), Positives = 106/369 (28%), Gaps = 87/369 (23%)

Query: 4 SYKSRWVIVIVVVIAAIAAFWFWQGRNDSQSAAPG-----ATKQAQQSPAGGR------- 51
S + R V ++ IA G+ + + A G + +
Sbjct: 54 SRRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVK 113

Query: 52 --RGMRAG-PLA---PVQAATAVEQAVPRYLTGLGTITAANTVTVRSRVDG--QLMALHF 103
+R G L + A + L T ++ ++ +L
Sbjct: 114 EGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDE 173

Query: 104 QEGQQVKAGDLLAEI------------DPSQFKVALAQAQGQLA-------KDKATLANA 144
Q V ++L Q ++ L + + + + +
Sbjct: 174 PYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVE 233

Query: 145 RRDLARYQQLAKTNLVSRQELDAQQALVSETEGTIKADEASVA----------------- 187
+ L + L +++ + Q+ E ++ ++ +
Sbjct: 234 KSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVT 293

Query: 188 --------------------------SAQLQLDWSRITAPVDGRV-GLKQVDVGNQISSG 220
+ + S I APV +V LK G +++
Sbjct: 294 QLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTA 353

Query: 221 DTTGIVVITQTHPIDLVFTLPESDIATVVQAQKAGKPLVVEAWDRTNSKKL-SEGTLLSL 279
+T +V++ + +++ + DI + Q A + VEA+ T L + ++L
Sbjct: 354 ETL-MVIVPEDDTLEVTALVQNKDIGFINVGQNA--IIKVEAFPYTRYGYLVGKVKNINL 410

Query: 280 DNQIDATTG 288
D D G
Sbjct: 411 DAIEDQRLG 419


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_2115  ACRIFLAVINRP  919    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 919 bits (2376), Expect = 0.0
Identities = 300/1036 (28%), Positives = 513/1036 (49%), Gaps = 29/1036 (2%)

Query: 13 SRLFIMRPVATTLLMVAILLAGIIGYRALPVSALPEVDYPTIQVVTLYPGASPDVMTSAV 72
+ FI RP+ +L + +++AG + LPV+ P + P + V YPGA + V
Sbjct: 2 ANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTV 61

Query: 73 TAPLERQFGQMSGLKQMSSQS-SGGASVITLQFQLTLPLDVAEQEVQAAINAATNLLPSD 131
T +E+ + L MSS S S G+ ITL FQ D+A+ +VQ + AT LLP +
Sbjct: 62 TQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE 121

Query: 132 LPNPPVYSKVNPADPPIMTLAVTSTAMPMTQVE--DMVETRVAQKISQISGVGLVTLSGG 189
+ + S + +M S TQ + D V + V +S+++GVG V L G
Sbjct: 122 VQQQGI-SVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 190 QRPAVRVKLNAQAIAALGLTSESVRTAITGANVNSAKGSLDGP------SRAVTLSANDQ 243
Q A+R+ L+A + LT V + N A G L G ++ A +
Sbjct: 181 QY-AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239

Query: 244 MQSAEEYRQLII-AYQNGAPIRLGDVATVEQGAENSWLGAWANKEQAIVMNVQRQPGANI 302
++ EE+ ++ + +G+ +RL DVA VE G EN + A N + A + ++ GAN
Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299

Query: 303 ISTADSIRQMLPQLTESLPKSVKVTVLSDRTTNIRASVDDTQFELMMAIALVVMIIYLFL 362
+ TA +I+ L +L P+ +KV D T ++ S+ + L AI LV +++YLFL
Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359

Query: 363 RNIPATIIPGVAVPLSLIGTFAVMVFLDFSINNLTLMALTIATGFVVDDAIVVIENISRY 422
+N+ AT+IP +AVP+ L+GTFA++ +SIN LT+ + +A G +VDDAIVV+EN+ R
Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419

Query: 423 I-EKGEKPLAAALKGAGEIGFTIISLTFSLIAVLIPLLFMGDIVGRLFREFAITLAVAIL 481
+ E P A K +I ++ + L AV IP+ F G G ++R+F+IT+ A+
Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479

Query: 482 ISAVVSLTLTPMMCARML---SQESLRKQNRFSRASEKMFDRIIAAYGRGLAKVLNHPWL 538
+S +V+L LTP +CA +L S E + F FD + Y + K+L
Sbjct: 480 LSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539

Query: 539 TLSVALSTLLLSVLLWVFIPKGFFPVQDNGIIQGTLQAPQSSSFANMAQRQRQVADVILQ 598
L + + V+L++ +P F P +D G+ +Q P ++ + QV D L+
Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599

Query: 599 DPA--VQSLTSFVGVDGTNPSLNSARLQINLKPLDERDDR---VQKVIARLQTAVDKVPG 653
+ V+S+ + G + + N+ ++LKP +ER+ + VI R + + K+
Sbjct: 600 NEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIR- 658

Query: 654 VDLFLQPTQDLTIDTQVSRTQYQFTLQ---ATSLDALSTWVPQLMEKLQQLP-QLSDVSS 709
D F+ P I + T + F L DAL+ QL+ Q P L V
Sbjct: 659 -DGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRP 717

Query: 710 DWQDKGLVAYVNVDRDSASRLGISMADVDNALYNAFGQRLISTIYTQANQYRVVLEHNTE 769
+ + + VD++ A LG+S++D++ + A G ++ + ++ ++ + +
Sbjct: 718 NGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAK 777

Query: 770 NTPGLAALDTIRLTSSDGGVVPLSSIAKIEQRFAPLSINHLDQFPVTTISFNVPDNYSLG 829
+D + + S++G +VP S+ + + + P I S G
Sbjct: 778 FRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSG 837

Query: 830 DAVQAIMDTEKTLNLPVDITTQFQGSTLAFQSALGSTVWLIVAAVVAMYIVLGILYESFI 889
DA A+M+ + LP I + G + + + L+ + V +++ L LYES+
Sbjct: 838 DA-MALMENLAS-KLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWS 895

Query: 890 HPITILSTLPTAGVGALLALMIAGSELDVIAIIGIILLIGIVKKNAIMMIDFALAAEREQ 949
P++++ +P VG LLA + + DV ++G++ IG+ KNAI++++FA ++
Sbjct: 896 IPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKE 955

Query: 950 GMSPREAIYQACLLRFRPILMTTLAALLGALPLMLSTGVGAELRRPLGIGMVGGLIVSQV 1009
G EA A +R RPILMT+LA +LG LPL +S G G+ + +GIG++GG++ + +
Sbjct: 956 GKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATL 1015

Query: 1010 LTLFTTPVIYLLFDRL 1025
L +F PV +++ R
Sbjct: 1016 LAIFFVPVFFVVIRRC 1031


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_2116  ACRIFLAVINRP  916    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 916 bits (2368), Expect = 0.0
Identities = 288/1035 (27%), Positives = 503/1035 (48%), Gaps = 36/1035 (3%)

Query: 6 LFIYRPVATILLSVAITLCGILGFRMLPVAPLPQVDFPVIMVSASLPGASPETMASSVAT 65
FI RP+ +L++ + + G L LPVA P + P + VSA+ PGA +T+ +V
Sbjct: 4 FFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQ 63

Query: 66 PLERSLGRIAGVSEMTSSS-SLGSTRIILQFDFDRDINGAARDVQAAINAAQSLLPSGMP 124
+E+++ I + M+S+S S GS I L F D + A VQ + A LLP +
Sbjct: 64 VIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQ 123

Query: 125 SRPTYRKANPSDAPIMILTLTSDT--YSQGELYDFASTQLAPTISQIDGVGDVDVGGSSL 182
+ S + +M+ SD +Q ++ D+ ++ + T+S+++GVGDV + G+
Sbjct: 124 -QQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQY 182

Query: 183 PAVRVGLNPQALFNQGVSLDDVRTAISNANVRKPQG------ALEDDTHRWQIQTNDELK 236
A+R+ L+ L ++ DV + N + G AL I K
Sbjct: 183 -AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFK 241

Query: 237 TAAEYQPLIIHYN-NGGAVRLGDVATVTDSVQDVRNAGMTNAKPAILLMIRKLPEANIIQ 295
E+ + + N +G VRL DVA V ++ N KPA L I+ AN +
Sbjct: 242 NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALD 301

Query: 296 TVDSIRARLPELQSTIPAAIDLQIAQDRSPTIRASLEEVEQTLIISVALVILVVFLFLRS 355
T +I+A+L ELQ P + + D +P ++ S+ EV +TL ++ LV LV++LFL++
Sbjct: 302 TAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQN 361

Query: 356 GRATIIPAVAVPVSLIGTFAAMYLCGFSLNNLSLMALTIATGFVVDDAIVVLENIARHL- 414
RAT+IP +AVPV L+GTFA + G+S+N L++ + +A G +VDDAIVV+EN+ R +
Sbjct: 362 MRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMM 421

Query: 415 EAGMKPLQAALQGTREVGFTVLSMSLSLVAVFLPLLLMGGLPGRLLREFAVTLSVAIGIS 474
E + P +A + ++ ++ +++ L AVF+P+ GG G + R+F++T+ A+ +S
Sbjct: 422 EDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALS 481

Query: 475 LLVSLTLTPMMCGWMLKASKPREQKRLRGFG----RMLVALQQGYGKSLKWVLNHTRLVG 530
+LV+L LTP +C +LK + GF Y S+ +L T
Sbjct: 482 VLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYL 541

Query: 531 AVLLGTIALNIWLYISIPKTFFPEQDTGVLMGGIQADQSISFQ----AMRGKLQDFMKII 586
+ +A + L++ +P +F PE+D GV + IQ + + + ++K
Sbjct: 542 LIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNE 601

Query: 587 RD-DPAVDNVTGFT-GGSRVNSGMMFITLKPRGERS---ETAQQIIDRLRKKLAKEPGAN 641
+ +V V GF+ G N+GM F++LKP ER+ +A+ +I R + +L K
Sbjct: 602 KANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDGF 661

Query: 642 LFLMAVQDIRVGGRQANASYQYTLLSDDLAALREWEPKIRKKLATL-----PELADVNSD 696
+ + I G ++ L D + + R +L + L V +
Sbjct: 662 VIPFNMPAIVELGTATGFDFE---LIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPN 718

Query: 697 QEDNGAEMNLIYDRDTMARLGIDVQAANSLLNNAFGQRQISTIYQPMNQYKVVMEVDPRY 756
++ A+ L D++ LG+ + N ++ A G ++ K+ ++ D ++
Sbjct: 719 GLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKF 778

Query: 757 TQDISALEKMFVINNEGKAIPLSYFAKWQPANAPLSVNHQGLSAASTISFNLPTGKSLSD 816
++K++V + G+ +P S F + + I G S D
Sbjct: 779 RMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGD 838

Query: 817 ASAAIDRAMTQLGVPSTVRGSFAGTAQVFQETMNSQVILIIAAIATVYIVLGILYESYVH 876
A A ++ ++L P+ + + G + + + N L+ + V++ L LYES+
Sbjct: 839 AMALMENLASKL--PAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSI 896

Query: 877 PLTILSTLPSAGVGALLALELFNAPFSLIALIGIMLLIGIVKKNAIMMVDFALEAQRHGN 936
P++++ +P VG LLA LFN + ++G++ IG+ KNAI++V+FA +
Sbjct: 897 PVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEG 956

Query: 937 LTPQEAIFQACLLRFRPIMMTTLAALFGALPLVLSGGDGSELRQPLGITIVGGLVMSQLL 996
EA A +R RPI+MT+LA + G LPL +S G GS + +GI ++GG+V + LL
Sbjct: 957 KGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLL 1016

Query: 997 TLYTTPVVYLFFDRL 1011
++ PV ++ R
Sbjct: 1017 AIFFVPVFFVVIRRC 1031



Score = 80.7 bits (199), Expect = 2e-17
Identities = 76/448 (16%), Positives = 161/448 (35%), Gaps = 26/448 (5%)

Query: 592 VDNVTGFTGGS-RVNSGMMFITLKPRGERSETAQQIIDRLRKKLAKEPGANLFLMAVQDI 650
+DN+ + S S + +T + + Q+ ++L+ P + Q I
Sbjct: 72 IDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE----VQQQGI 127

Query: 651 RVGGRQANASYQYTLLSDDLAALREW-----EPKIRKKLATLPELADVNSDQEDNGAE-- 703
V ++ +SD+ ++ ++ L+ L + DV GA+
Sbjct: 128 SVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQL----FGAQYA 183

Query: 704 MNLIYDRDTMARLGID----VQAANSLLNNAFGQRQISTIYQPMNQYKVVMEVDPRYTQD 759
M + D D + + + + + + T P Q + R+
Sbjct: 184 MRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKNP 243

Query: 760 ISALEKMFVINNEGKAIPLSYFAK--WQPANAPLSVNHQGLSAASTISFNLPTGKSLSDA 817
+ +N++G + L A+ N + G AA +L D
Sbjct: 244 EEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL-DT 302

Query: 818 SAAIDRAMTQL--GVPSTVRGSFA-GTAQVFQETMNSQVILIIAAIATVYIVLGILYESY 874
+ AI + +L P ++ + T Q +++ V + AI V++V+ + ++
Sbjct: 303 AKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNM 362

Query: 875 VHPLTILSTLPSAGVGALLALELFNAPFSLIALIGIMLLIGIVKKNAIMMVDFALEAQRH 934
L +P +G L F + + + G++L IG++ +AI++V+
Sbjct: 363 RATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMME 422

Query: 935 GNLTPQEAIFQACLLRFRPIMMTTLAALFGALPLVLSGGDGSELRQPLGITIVGGLVMSQ 994
L P+EA ++ ++ + +P+ GG + + ITIV + +S
Sbjct: 423 DKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSV 482

Query: 995 LLTLYTTPVVYLFFDRLRLRFSRKPKQA 1022
L+ L TP + + + K
Sbjct: 483 LVALILTPALCATLLKPVSAEHHENKGG 510


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_2117  TCRTETB  123    7e-33    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 123 bits (310), Expect = 7e-33
Identities = 97/435 (22%), Positives = 190/435 (43%), Gaps = 25/435 (5%)

Query: 20 FMQSLDTTIVNTALPSMAQSLGESPLHMHMVIVSYVLTVAVMLPASGWLADKVGVRNIFF 79
F L+ ++N +LP +A + P + V +++LT ++ G L+D++G++ +
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 80 TAIVLFTLGSLFCALSGTLNELL-LARALQGVGGAMMVPVGRLTVMKIVPREQYMAAMTF 138
I++ GS+ + + LL +AR +QG G A + + V + +P+E A
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143

Query: 139 VTLPGQVGPLLGPALGGLLVEYASWHWIFLINIPVGIIGAIATLM-LMPNYTMQTRRFDL 197
+ +G +GPA+GG++ Y HW +L+ IP+ I + LM L+ FD+
Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHY--IHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDI 201

Query: 198 SGFLLLAVGMAVLTLALDGSKGTGLSPLAIAGLVAVGVVALVLYLLHAQNNNRALFSLKL 257
G +L++VG+ L + + V V++ ++++ H + L
Sbjct: 202 KGIILMSVGIVFFMLF---------TTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGL 252

Query: 258 FRTRTFSLGLAGSFAGRIGSGMLPFMTPVFLQIGLGFSPFHAG-LMMIPMVLGSMGMKRI 316
+ F +G+ M P ++ S G +++ P + + I
Sbjct: 253 GKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYI 312

Query: 317 VVQVVNRFGYRRVLVATTLGLSLVTLLFMTTALL----GWYYVLPFVLFLQGMVNSTRFS 372
+V+R G VL +G++ +++ F+T + L W+ + V L G+ S +
Sbjct: 313 GGILVDRRGPLYVL---NIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGL--SFTKT 367

Query: 373 SMNTLTLKDLPDNLASSGNSLLSMIMQLSMSIGVTIAGLLLGLFGSQHVSVDSGTTQTVF 432
++T+ L A +G SLL+ LS G+ I G LL + + Q+ +
Sbjct: 368 VISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLLDQRLLPMEVDQSTY 427

Query: 433 MYT--WLSMASIIAL 445
+Y+ L + II +
Sbjct: 428 LYSNLLLLFSGIIVI 442


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_2118  BCTERIALGSPF  34     0.001    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F

signature.
Length = 408

Score = 34.0 bits (78), Expect = 0.001
Identities = 28/95 (29%), Positives = 36/95 (37%), Gaps = 20/95 (21%)

Query: 164 RQTSWLIVALSTLLAALATF------PLARGLLAPVKRLVDGTHKLAAGDFTTRVAPTSE 217
RQ + L+ A L AL P L+A V+ V H LA + P S
Sbjct: 75 RQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSLAD---AMKCFPGSF 131

Query: 218 DEL-----------GRLAEDFNQLASTLEKNQQMR 241
+ L G L N+LA E+ QQMR
Sbjct: 132 ERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMR 166


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
ECP_2119  HTHFIS  76     6e-18    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 75.6 bits (186), Expect = 6e-18
Identities = 28/136 (20%), Positives = 65/136 (47%), Gaps = 1/136 (0%)

Query: 11 PRILIVEDEPKLGQLLIDYLRAASYAPTLISHGDQVLPYVRQTPPDLILLDLMLPGTDGL 70
IL+ +D+ + +L L A Y + S+ + ++ DL++ D+++P +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 71 TLCREIR-RFSDIPIVMVTAKIEEIDRLLGLEIGADDYICKPYSPREVVARVKTILRRCK 129
L I+ D+P+++++A+ + + E GA DY+ KP+ E++ + L K
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 130 PQRELQQQDAESPLII 145
+ + D++ + +
Sbjct: 124 RRPSKLEDDSQDGMPL 139


83  ECP_2173  ECP_2179  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias   Product
ECP_2173  1       17       2.364181  D-alanyl-D-alanine endopeptidase
ECP_2174  0       19       2.888284  hypothetical protein
ECP_2175  1       18       2.431759  DedA family membrane protein
ECP_2176  1       18       2.340927  acetoin dehydrogenase
ECP_2177  0       15       2.378588  multidrug resistance outer membrane protein
ECP_2178  0       13       1.163555  hypothetical protein
ECP_2179  0       12       0.191974  tRNA-dihydrouridine synthase C
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
ECP_2173  BLACTAMASEA  44     4e-07    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 43.6 bits (103), Expect = 4e-07
Identities = 42/195 (21%), Positives = 77/195 (39%), Gaps = 18/195 (9%)

Query: 4 MPKFRVSLFSLALMLAVPFAPQAVAKTVAATTASQPEIASGSAMI-VDLNTNKVIYSNHP 62
M R+ + SL + +P A A + + S+ +++ MI +DL + + + +
Sbjct: 1 MRYIRLCIISL--LATLPLAVHASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRA 58

Query: 63 DLVRPIASISKLMTAMVVLDARLPLDEKLKVDISQTPEMKGVYSRV---RLNSEISRKDM 119
D P+ S K++ VL DE+L+ I + YS V L ++ ++
Sbjct: 59 DERFPMMSTFKVVLCGAVLARVDAGDEQLERKIHYRQQDLVDYSPVSEKHLADGMTVGEL 118

Query: 120 LLLALMSSENRAAASLAHHYPGGYKAFIKAMNAKAKSLGMNNTRFV--EPTGLS-----V 172
A+ S+N +AA+L GG + A + +G N TR E
Sbjct: 119 CAAAITMSDN-SAANLLLATVGG----PAGLTAFLRQIGDNVTRLDRWETELNEALPGDA 173

Query: 173 HNVSTARDLTKLLIA 187
+ +T + L
Sbjct: 174 RDTTTPASMAATLRK 188


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_2175  BCTERIALGSPF  28     0.018    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F

signature.
Length = 408

Score = 28.3 bits (63), Expect = 0.018
Identities = 5/33 (15%), Positives = 16/33 (48%), Gaps = 2/33 (6%)

Query: 152 WLHNLDQHLKHW-VWLILVVVL-VVGVRWWLKR 182
L + ++ + W++L ++ + R L++
Sbjct: 215 VLMGMSDAVRTFGPWMLLALLAGFMAFRVMLRQ 247


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_2176  DHBDHDRGNASE  115    2e-33    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 115 bits (290), Expect = 2e-33
Identities = 72/253 (28%), Positives = 119/253 (47%), Gaps = 12/253 (4%)

Query: 3 QVAIITASDSGIGKECALLLAQQGFDIGITWHSDEEGAKDTAREVVSHGVRAEIVQLDLG 62
++A IT + GIG+ A LA QG I ++ E+ K + AE D+
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKA-EARHAEAFPADVR 67

Query: 63 NLPEGAQALEKLIQRLGRIDVLVNNAGAMTKAPFLDMAFDEWRKIFTVDVDGAFLCSQIA 122
+ + ++ + +G ID+LVN AG + ++ +EW F+V+ G F S+
Sbjct: 68 DSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127

Query: 123 ARQMVKQGQGGRIINITSVHEHTPLPDASAYTAAKHALGGLTKAMALELVRHKILVNAVA 182
++ M+ + + G I+ + S P +AY ++K A TK + LEL + I N V+
Sbjct: 128 SKYMMDR-RSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVS 186

Query: 183 PGAIATPMN-----GMDGGD--VKPDAEP---SIPLRRFGTTHEIASLVVWLCSEGANYT 232
PG+ T M +G + +K E IPL++ +IA V++L S A +
Sbjct: 187 PGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHI 246

Query: 233 TGQSLIVDGGFML 245
T +L VDGG L
Sbjct: 247 TMHNLCVDGGATL 259


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_2179  SHAPEPROTEIN  29     0.018    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein

signature.
Length = 347

Score = 29.3 bits (66), Expect = 0.018
Identities = 32/127 (25%), Positives = 53/127 (41%), Gaps = 5/127 (3%)

Query: 122 GAKAMREAVPAHLPVSVKVRLGWDSGEK-KFEIADAVQQAGATELVVHGRTKEQGY-RAE 179
G EA+ ++ + +G + E+ K EI A E+ V GR +G R
Sbjct: 190 GGDRFDEAIINYVRRNYGSLIGEATAERIKHEIGSAYPGDEVREIEVRGRNLAEGVPRGF 249

Query: 180 HIDWQAIGE-IRQRLNIPVIANGEIWDWQSAQQCMAISGCDAVMIGRGALNIPNLSRVVK 238
++ I E +++ L V A + + IS V+ G GAL + NL R++
Sbjct: 250 TLNSNEILEALQEPLTGIVSAVMVALEQCPPELASDISERGMVLTGGGAL-LRNLDRLL- 307

Query: 239 YNEPRMP 245
E +P
Sbjct: 308 MEETGIP 314


84  ECP_2229  ECP_2235  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias   Product
ECP_2229   0      13       1.576036  sulfatase
ECP_2231   0      17       3.530770  *hypothetical protein
ECP_2232  -1      18       3.375408  hypothetical protein
ECP_2233   0      20       3.850840  transcriptional regulator NarP
ECP_2234   0      19       4.384814  cytochrome c-type biogenesis protein CcmH
ECP_2235   0      18       4.544508  thiol:disulfide interchange protein DsbE
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
ECP_2229  IGASERPTASE  30     0.027    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 30.4 bits (68), Expect = 0.027
Identities = 19/70 (27%), Positives = 28/70 (40%), Gaps = 6/70 (8%)

Query: 503 LHVSTPASEYSQGQ-DLF---NPQRRHYWVTAADNDTLAITTPKKTLVLNNNGKYRTYNL 558
L V+ E + + LF QR H V+ +T+ + K L N NG+Y YN
Sbjct: 926 LQVADKTGEPNHNELTLFDASKAQRDHLNVSLV-GNTVDLGAWKYKLR-NVNGRYDLYNP 983

Query: 559 RGERVKDEKP 568
E+
Sbjct: 984 EVEKRNQTVD 993


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
ECP_2232  PERTACTIN  27     0.025    Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 27.0 bits (59), Expect = 0.025
Identities = 15/43 (34%), Positives = 26/43 (60%), Gaps = 2/43 (4%)

Query: 40 VFAVIEKGGLLEV--KATGDFKIFVTDTGASPAAGDNLTLVTT 80
VFA + L V A+G +++V ++G+ PA+G+ + LV T
Sbjct: 484 VFADLGLSDKLVVMRDASGQHRLWVRNSGSEPASGNTMLLVQT 526


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
ECP_2233  HTHFIS  64     2e-14    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 64.5 bits (157), Expect = 2e-14
Identities = 22/113 (19%), Positives = 48/113 (42%), Gaps = 2/113 (1%)

Query: 9 VMIVDDHPLMRRGVRQLLELDSGFEVVAEAGDGASAIDLANRLDIDVILLDLNMKGMSGL 68
+++ DD +R + Q L +G++V + A+ D D+++ D+ M +
Sbjct: 6 ILVADDDAAIRTVLNQALS-RAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 69 DTLNALRRDGVTAQIIILTVSDASSDVFALIDAGADGYLLKDSDPEVLLEAIR 121
D L +++ +++++ + + GA YL K D L+ I
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
ECP_2235  60KDINNERMP  28     0.033    60kDa inner membrane protein signature.
		>60KDINNERMP#60kDa inner membrane protein signature.

Length = 548

Score = 28.0 bits (62), Expect = 0.033
Identities = 7/47 (14%), Positives = 19/47 (40%), Gaps = 2/47 (4%)

Query: 3 RKVLLIPLIIFLAIAAALLWQLARN--AEGDDPTNLESALIGKPVPK 47
++ LL+ ++F++ W+ +N + T + G +
Sbjct: 4 QRNLLVIALLFVSFMIWQAWEQDKNPQPQAQQTTQTTTTAAGSAADQ 50


85  ECP_2258  ECP_2263  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
ECP_2258  -1      11       -1.835690  outer membrane porin protein C
ECP_2259  -1      11       -1.766152  phosphotransfer intermediate protein in
ECP_2260  -1      13       -1.263875  transcriptional regulator RcsB
ECP_2261  -2      13       -0.703825  hybrid sensory kinase in two-component
ECP_2262  -1      16       -0.474994  sensory histidine kinase AtoS
ECP_2263  -2      13        0.749110  acetoacetate metabolism regulatory protein AtoC
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
ECP_2258  ECOLIPORIN  535    0.0      E.coli/Salmonella-type porin signature.
		>ECOLIPORIN#E.coli/Salmonella-type porin signature.

Length = 383

Score = 535 bits (1380), Expect = 0.0
Identities = 256/386 (66%), Positives = 294/386 (76%), Gaps = 14/386 (3%)

Query: 1 MKVKVLSLLVPALLVAGAANAAEVYNKDGNKLDLYGKVDGLHYFSDDKSVDGDQTYMRLG 60
MK KVL+L++PALL AGAA+AAE+YNKDGNKLDLYGKVDGLHYFSDD S DGDQTYMR+G
Sbjct: 1 MKRKVLALVIPALLAAGAAHAAEIYNKDGNKLDLYGKVDGLHYFSDDSSKDGDQTYMRVG 60

Query: 61 FKGETQVTDQLTGYGQWEYQIQGNAPESE-NNSWTRVAFAGLKFQDIGSFDYGRNYGVVY 119
FKGETQ+ DQLTGYGQWEY +Q N E E NSWTR+AFAGLKF D GSFDYGRNYGV+Y
Sbjct: 61 FKGETQINDQLTGYGQWEYNVQANTTEGEGANSWTRLAFAGLKFGDYGSFDYGRNYGVLY 120

Query: 120 DVTSWTDVLPEFGGDTYG-SDNFMQQRGNGFATYRNTDFFGLVDGLNFAVQYQGQNGSVS 178
DV WTD+LPEFGGD+Y +DN+M R NG ATYRNTDFFGLVDGLNFA+QYQG+N S S
Sbjct: 121 DVEGWTDMLPEFGGDSYTYADNYMTGRANGVATYRNTDFFGLVDGLNFALQYQGKNESQS 180

Query: 179 GENDPDFTGHGITNNGRKALRQNGDGVGGSITYDY-EGFGVGAAVSSSKRTDAQN-TAAY 236
++ G NNG NGDG G S TYD GF GAA ++S RT+ Q
Sbjct: 181 ADDVN--IGTNNRNNGDDIRYDNGDGFGISTTYDIGMGFSAGAAYTTSDRTNEQVNAGGT 238

Query: 237 IGNGDRAETYTGGLKYDANNIYLAAQYTQTYNATRVGSL------GWANKAQNFEAVAQY 290
I GD+A+ +T GLKYDANNIYLA Y++T N T G G ANK QNFE AQY
Sbjct: 239 IAGGDKADAWTAGLKYDANNIYLATMYSETRNMTPYGKTDKGYDGGVANKTQNFEVTAQY 298

Query: 291 QFDFGLRPSVAYLQSKGKNLGTIGTRNYDDEDILKYVDVGATYYFNKNMSTYVDYKINLL 350
QFDFGLRP+V++L SKGK+L N DD+D++KY DVGATYYFNKN STYVDYKINLL
Sbjct: 299 QFDFGLRPAVSFLMSKGKDLTY-NNVNGDDKDLVKYADVGATYYFNKNFSTYVDYKINLL 357

Query: 351 D-DNQFTRDAGINTDNIVALGLVYQF 375
D D+ F +DAGI+TD+IVALG+VYQF
Sbjct: 358 DDDDPFYKDAGISTDDIVALGMVYQF 383


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
ECP_2260  HTHFIS  48     9e-09    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 47.9 bits (114), Expect = 9e-09
Identities = 26/145 (17%), Positives = 60/145 (41%), Gaps = 20/145 (13%)

Query: 1 MNNMNVIIADDHPIVLFGIRKSLEQIEWVNVVGEFEDSTALINNLPKLDAHVLITDLSMP 60
M +++ADD + + ++L + + + ++ L + D +++TD+ MP
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRI--TSNAATLWRWIAAGDGDLVVTDVVMP 58

Query: 61 GDKYGDGITLIKYIKRHFPSLSIIVLTMNNNPAILSAVLDLDIEGIVLKQGA------PT 114
+ L+ IK+ P L ++V++ N +A+ ++GA P
Sbjct: 59 D---ENAFDLLPRIKKARPDLPVLVMSAQNTFM--TAIKA-------SEKGAYDYLPKPF 106

Query: 115 DLPKALAALQKGKKFTPESVSRLLE 139
DL + + + + S+L +
Sbjct: 107 DLTELIGIIGRALAEPKRRPSKLED 131


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
ECP_2261  HTHFIS  81     6e-18    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 81.0 bits (200), Expect = 6e-18
Identities = 29/106 (27%), Positives = 47/106 (44%)

Query: 827 ILVVDDHPINRRLLADQLGSLGYQCKTANDGVDALNVLNKNHIDIVLSDVNMPNMDGYRL 886
ILV DD R +L L GY + ++ + D+V++DV MP+ + + L
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 887 TQRIRQLGLTLPVIGVTANALAEEKQRCLESGMDSCLSKPVTLDVI 932
RI++ LPV+ ++A + E G L KP L +
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTEL 111


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
ECP_2263  HTHFIS  562    0.0      FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 562 bits (1449), Expect = 0.0
Identities = 181/484 (37%), Positives = 269/484 (55%), Gaps = 35/484 (7%)

Query: 1 MTAINRILIVDDEDNVRRMLSTAFALQGFETHCANNGRTALHLFADIHPDVVLMDIRMPE 60
MT IL+ DD+ +R +L+ A + G++ +N T A D+V+ D+ MP+
Sbjct: 1 MTGA-TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPD 59

Query: 61 MDGIKALKEMRSHETRTPVILMTAYAEVETAVEALRCGAFDYVIKPFDLDELNLIVQRAL 120
+ L ++ PV++M+A TA++A GA+DY+ KPFDL EL I+ RAL
Sbjct: 60 ENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119

Query: 121 QLQSMKKEIRHLHQALSTSWQWGH-ILTNSPAMMDICKDTAKIALSQASVLISGESGTGK 179
+ L Q G ++ S AM +I + A++ + +++I+GESGTGK
Sbjct: 120 AEP------KRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGK 173

Query: 180 ELIARAIHYNSRRAKGPFIKVNCAALPESLLESELFGHEKGAFTGAQTLRQGLFERANEG 239
EL+ARA+H +R GPF+ +N AA+P L+ESELFGHEKGAFTGAQT G FE+A G
Sbjct: 174 ELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGG 233

Query: 240 TLLLDEIGEMPLVLQAKLLRILQEREFERIGGHQTIKVDIRIIAATNRDLQAMVKEGTFR 299
TL LDEIG+MP+ Q +LLR+LQ+ E+ +GG I+ D+RI+AATN+DL+ + +G FR
Sbjct: 234 TLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFR 293

Query: 300 EDLFYRLNVIHLILPPLRDRREDISLLANHFLQKFSSENQRDIIDIDPMAMSLLTAWSWP 359
EDL+YRLNV+ L LPPLRDR EDI L HF+Q+ E + D A+ L+ A WP
Sbjct: 294 EDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLD-VKRFDQEALELMKAHPWP 352

Query: 360 GNIRELSNVIERAVVMNSGPIIFSEDLPPQIRQPV---------CNAGEAKTAPVGERN- 409
GN+REL N++ R + +I E + ++R + +G + E N
Sbjct: 353 GNVRELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENM 412

Query: 410 ----------------LKEEIKRVEKRIIMEVLEQQEGNRTRTALMLGISRRALMYKLQE 453
+ +E +I+ L GN+ + A +LG++R L K++E
Sbjct: 413 RQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRE 472

Query: 454 YGID 457
G+
Sbjct: 473 LGVS 476


86  ECP_2392  ECP_2395  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
ECP_2392  0       35       -9.456287  multidrug resistance protein Y
ECP_2393  -1      34       -8.456009  multidrug resistance protein K
ECP_2394  0       32       -7.800896  DNA-binding transcriptional activator EvgA
ECP_2395  0       32       -7.446321  hybrid sensory histidine kinase in two-component
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_2392  TCRTETB  121    4e-32    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 121 bits (306), Expect = 4e-32
Identities = 92/404 (22%), Positives = 167/404 (41%), Gaps = 17/404 (4%)

Query: 19 VTIALSLATFMQMLDSTISNVAIPTISGFLGASTDEGTWVITSFGVANAIAIPVTGRLAQ 78
+ I L + +F +L+ + NV++P I+ WV T+F + +I V G+L+
Sbjct: 15 ILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSD 74

Query: 79 RIGELRLFLLSVTFFSLSSLMCSLS-TNLDVLIFFRVVQGLMAGPLIPLSQSLLLRNYPP 137
++G RL L + S++ + + +LI R +QG A L ++ R P
Sbjct: 75 QLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPK 134

Query: 138 EKRTFALALWSMTVIIAPICGPILGGYICDNFSWGWIFLINVPMGIIVLTLCLTLLKGRE 197
E R A L V + GP +GG I W +L+ +PM I+ L L +E
Sbjct: 135 ENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWS--YLLLIPMITIITVPFLMKLLKKE 192

Query: 198 TETSPVKMNLPGLTLLVLGVGGLQIMLDKGRDLDWFNSSTIIILTVVSVISLISLVIWES 257
++ G+ L+ +G+ + ML F +S I +VSV+S + V
Sbjct: 193 VRIKG-HFDIKGIILMSVGI--VFFML--------FTTSYSISFLIVSVLSFLIFVKHIR 241

Query: 258 TSENPILDLSLFKSRNFTIGIVSITCAYLFYSGAIVLMPQLLQETMGYNAIWAGLAYAPI 317
+P +D L K+ F IG++ + +G + ++P ++++ + G
Sbjct: 242 KVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFP 301

Query: 318 GIMPLLIS-PLIGRYGNKIDMRLLVTFSFLMYAVCYYWRSVTFMPTIDFTGIILPQFFQG 376
G M ++I + G ++ ++ +V + S T F II+ G
Sbjct: 302 GTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGG 361

Query: 377 FAVACFFLPLTTISFSGLPDNKFANASSMSNFFRTLSGSVGTSL 420
+ ++TI S L + S+ NF LS G ++
Sbjct: 362 LSFTK--TVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAI 403


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
ECP_2393  RTXTOXIND  74     1e-16    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 74.1 bits (182), Expect = 1e-16
Identities = 47/277 (16%), Positives = 94/277 (33%), Gaps = 46/277 (16%)

Query: 56 AKNNLANIVRQTNKLYLQDKQYSAEVASARIQ---YQQSLEDYNRRV----PLAKQGVIS 108
K + Q + L + AE + + Y+ R+ L + I+
Sbjct: 191 IKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIA 250

Query: 109 KEALEHTKDTLI----------SSKAALNAAIQAYKANKALVMNTPLNRQPQVIEAADAT 158
K A+ ++ + S + + I + K + T L + + + T
Sbjct: 251 KHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEE--YQLVTQLFKNEILDKLRQTT 308

Query: 159 KE----------AWLALKRTDIKSPVTGYIAQRSVQ-VGETVSPGQSLMAVVPARQ-MWV 206
+ + I++PV+ + Q V G V+ ++LM +VP + V
Sbjct: 309 DNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEV 368

Query: 207 NANFKETQLTDVRIGQSVNIISDLYGENVVFHGRVTGINMGTGNAFSLLPAQNATGNWIK 266
A + + + +GQ+ I + F G +G + +
Sbjct: 369 TALVQNKDIGFINVGQNAIIKVE------AFPYTRYGYLVGK---VKNINLDAIEDQRLG 419

Query: 267 IVQRVPVEVSLDPKELMEH----PLRIGLSMTATIDT 299
+V V +S++ L PL G+++TA I T
Sbjct: 420 LVFNVI--ISIEENCLSTGNKNIPLSSGMAVTAEIKT 454


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
ECP_2394  HTHFIS  49     3e-09    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 49.1 bits (117), Expect = 3e-09
Identities = 22/148 (14%), Positives = 53/148 (35%), Gaps = 31/148 (20%)

Query: 4 IIIDDHPLAIAAIRNLLIKNDIEILAELTEGGSAVQRVETLKPDIVIIDVDIPGVNGIQV 63
++ DD + L + ++ + + + + D+V+ DV +P N +
Sbjct: 7 LVADDDAAIRTVLNQALSRAGYDVRIT-SNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 64 LETLRKRQYSGIIIIVSAKNDHFYGKHCADAGANGFVSKKEGMNNIIAAIEAAKNGYCYF 123
L ++K + ++++SA+N + AI+A++ G +
Sbjct: 66 LPRIKKARPDLPVLVMSAQNT------------------------FMTAIKASEKGAYDY 101

Query: 124 ---PFSLNRFVGSLTSDQQKLDSLSKQE 148
PF L + + L ++
Sbjct: 102 LPKPFDLTE---LIGIIGRALAEPKRRP 126


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
ECP_2395  HTHFIS  79     4e-17    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 79.1 bits (195), Expect = 4e-17
Identities = 30/105 (28%), Positives = 51/105 (48%)

Query: 960 SILIADDHPTNRLLLKRQLNLLGYDVDEATDGVQALHKVSMQHYDLLITDVNMPNMDGFE 1019
+IL+ADD R +L + L+ GYDV ++ ++ DL++TDV MP+ + F+
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 1020 LARKLREQNSSLPIWGLTANAQANEREKGLNCGMNLCLFKPLTLD 1064
L ++++ LP+ ++A K G L KP L
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLT 109


87  ECP_2646  ECP_2652  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias   Product
ECP_2646  -1      12       2.499623  transmembrane transport protein
ECP_2647  -2      13       2.196951  hypothetical protein
ECP_2648  -2      15       1.703001  hypothetical protein
ECP_2649  -3      12       1.311449  transcriptional repressor MprA
ECP_2650  -1      13       1.743584  multidrug resistance protein A
ECP_2651  -2      14       1.590954  multidrug resistance protein B
ECP_2652  0       16       1.082290  S-ribosylhomocysteinase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_2646  TCRTETB  44     8e-07    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 43.7 bits (103), Expect = 8e-07
Identities = 32/165 (19%), Positives = 70/165 (42%), Gaps = 2/165 (1%)

Query: 34 LDTIARNFSLSASSAGFIVTAAQLGYAAGLLFLVPLGDMFERRRLIVSMTLLAAGGMLIT 93
L IA +F+ +S ++ TA L ++ G L D +RL++ ++ G +I
Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIG 96

Query: 94 ASSQSLA-MMILGTALTGLFSVVAQILVPLA-ATLASPDKRGKVVGTIMSGLLLGILLAR 151
S ++I+ + G + LV + A + RGK G I S + +G +
Sbjct: 97 FVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGP 156

Query: 152 TVAGLLANLGGWRTVFWVASMLMALMALALWRGLPQMKSETHLNY 196
+ G++A+ W + + + + + + +++ + H +
Sbjct: 157 AIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDI 201


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_2649  PF05272  28     0.018    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.5 bits (63), Expect = 0.018
Identities = 23/94 (24%), Positives = 36/94 (38%), Gaps = 12/94 (12%)

Query: 23 PYQEILLTRLCMHMQSKLLENRNKMLKAQGINETLFMALITLESQENHSIQPSELSCALG 82
P QE+ L + + L R A+G + + T + ++L ALG
Sbjct: 756 PEQELRLVETGVQGRLWALLTREGAPAAEGAAQKGYSVNTTFVTI-------ADLVQALG 808

Query: 83 -----SSRTNATRIADELEKRGWIERRESDNDRR 111
SS ++ D L + GW RE+ RR
Sbjct: 809 ADPGKSSPMLEGQVRDWLNENGWEYLRETSGQRR 842


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
ECP_2650  RTXTOXIND  79     5e-18    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 78.7 bits (194), Expect = 5e-18
Identities = 64/412 (15%), Positives = 120/412 (29%), Gaps = 97/412 (23%)

Query: 25 LLLTLLFIIIAVAIGIYWFLVLRHFEETDDA----YVAGNQIQIMSQVSGSVTKVWADNT 80
L FI+ + I VL E A +G +I + V ++
Sbjct: 57 PRLVAYFIMGFLVIAFILS-VLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEG 115

Query: 81 DFVKEGDVLVTLDPTDARQAFEKA------------------------------------ 104
+ V++GDVL+ L A K
Sbjct: 116 ESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPY 175

Query: 105 ----------------KTALASSVRQTHQLMINSKQLQANIEVQKIALAKA-------QS 141
K ++ Q +Q +N + +A + + +S
Sbjct: 176 FQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKS 235

Query: 142 DYNRRVPLGNANLIGREELQHARDAVTSAQAQLDVAIQQYNANQAMILGTKLEDQPAVQQ 201
+ L + I + + + A +L V Q ++ IL K E Q Q
Sbjct: 236 RLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQL 295

Query: 202 AATEVRN------------------AWLALERTRIVSPMTGYVSRRAVQ-PGAQISPTTP 242
E+ + + + I +P++ V + V G ++
Sbjct: 296 FKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAET 355

Query: 243 LMAVVPA-TNMWVDANFKETQIANMRIGQPVTITTDIYGDDVKY---TGKVVGLDMGTGS 298
LM +VP + V A + I + +GQ I + + +Y GKV + +
Sbjct: 356 LMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAF-PYTRYGYLVGKVKNI-----N 409

Query: 299 AFSLLPAQNATGNWIKVVQRLPVRIELDQKQLEQYPLRIGLSTLVSVNTTNR 350
++ G V+ + + PL G++ + T R
Sbjct: 410 LDAIE--DQRLGLVFNVIISIEENCLST--GNKNIPLSSGMAVTAEIKTGMR 457


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_2651  TCRTETB  132    9e-36    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 132 bits (333), Expect = 9e-36
Identities = 97/405 (23%), Positives = 169/405 (41%), Gaps = 23/405 (5%)

Query: 17 IALSLATFMQVLDSTIANVAIPTIAGNLGSSLSQGTWVITSFGVANAISIPLTGWLAKRV 76
I L + +F VL+ + NV++P IA + + WV T+F + +I + G L+ ++
Sbjct: 17 IWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQL 76

Query: 77 GEVKLFLWSTIAFAIASWACGVS-SSLNMLIFFRVIQGIVAGPLIPLSQSLLLNNYPPAK 135
G +L L+ I S V S ++LI R IQG A L ++ P
Sbjct: 77 GIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKEN 136

Query: 136 RSIALALWSMTVIVAPICGPILGGYISDNYHWGWIFFINVPIGVAVVLMTLQTLRGRETR 195
R A L V + GP +GG I+ HW + + +P+ + + L L +E R
Sbjct: 137 RGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWS--YLLLIPMITIITVPFLMKLLKKEVR 194

Query: 196 TERRRIDAVGLALLVIGIGSLQIMLDRGKELDWFSSQEIIILTVVAVVAICFLIVWELTD 255
+ D G+ L+ +GI + ML F++ I +V+V++ +
Sbjct: 195 I-KGHFDIKGIILMSVGI--VFFML--------FTTSYSISFLIVSVLSFLIFVKHIRKV 243

Query: 256 DNPIVDLSLFKSRNFTIGCLCISLAYMLYFGAIVLLPQLLQEVYGYTATWAGLASAPVGI 315
+P VD L K+ F IG LC + + G + ++P ++++V+ + G G
Sbjct: 244 TDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGT 303

Query: 316 IPVILS-PIIGRFAHKLDMRRLVTFSFIMYAVCFYWRAYTFEPGMDFGASAWPQFIQGF- 373
+ VI+ I G + ++ +V F ++ S + I F
Sbjct: 304 MSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFL-----LETTSWFMTIIIVFV 358

Query: 374 --AVACFFMPLTTITLSGLPPERLAAASSLSNFTRTLAGSIGTSI 416
++ ++TI S L + A SL NFT L+ G +I
Sbjct: 359 LGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAI 403


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
ECP_2652  LUXSPROTEIN  293    e-105    Bacterial autoinducer-2 (AI-2) production protein Lu...
		>LUXSPROTEIN#Bacterial autoinducer-2 (AI-2) production protein LuxS signature.

Length = 171

Score = 293 bits (751), Expect = e-105
Identities = 132/170 (77%), Positives = 148/170 (87%)

Query: 2 PLLDSFTVDHTRMEAPAVRVAKTMNTPHGDAITVFDLRFCVPNKEVMPERGIHTLEHLFA 61
PLLDSFTVDHTRM APAVRVAKTM TP GD ITVFDLRF PNK+++ E+GIHTLEHL+A
Sbjct: 1 PLLDSFTVDHTRMNAPAVRVAKTMQTPKGDTITVFDLRFTAPNKDILSEKGIHTLEHLYA 60

Query: 62 GFMRNHLNGNGVEIIDISPMGCRTGFYMSLIGTPDEQRVADAWKAAMEDVLKVQDQNQIP 121
GFMRNHLNG+ VEIIDISPMGCRTGFYMSLIGTP EQ+VADAW AAMEDVLKV++QN+IP
Sbjct: 61 GFMRNHLNGDSVEIIDISPMGCRTGFYMSLIGTPSEQQVADAWIAAMEDVLKVENQNKIP 120

Query: 122 ELNVYQCGTYQMHSLQEAQDIARNILERDVRINSNEELALPKEKLQELHI 171
ELN YQCGT MHSL EA+ IA+NILE V +N N+ELALP+ L+EL I
Sbjct: 121 ELNEYQCGTAAMHSLDEAKQIAKNILEVGVAVNKNDELALPESMLRELRI 170


88  ECP_2835  ECP_2842  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias   Product
ECP_2835  1       14       1.528035  hypothetical protein
ECP_2836  -2      10       1.705541  hypothetical protein
ECP_2837  -3      10       1.330561  hypothetical protein
ECP_2838  -3      9        1.245825  hypothetical protein
ECP_2840  -2      9        0.917551  thymidylate synthase
ECP_2841  -2      10       0.728496  prolipoprotein diacylglyceryl transferase
ECP_2842  -2      11       1.092237  fused phosphoenolpyruvate-protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_2835  BCTERIALGSPH  29     0.002    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 29.1 bits (65), Expect = 0.002
Identities = 27/114 (23%), Positives = 43/114 (37%), Gaps = 29/114 (25%)

Query: 8 QQGFSLPEVMLAMVLMVMIVTA----------------LSGFQRTLMNSLASRNQYQQLW 51
Q+GF+L E+ML ++LM + L+ F+ L Q Q +
Sbjct: 3 QRGFTLLEMMLILLLMGVSAGMVLLAFPASRDDSAAQTLARFEAQLRFVQQRGLQTGQFF 62

Query: 52 -----RHGWQ--QTQLRAISPPA----NWQVNRMQTSQAGCVSISVTLVSPGGR 94
WQ + R + PA W R +AG V+ S ++ GG+
Sbjct: 63 GVSVHPDRWQFLVLEARDGADPAPADDGWSGYRWLPLRAGRVATSGSI--AGGK 114


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_2837  PilS_PF08805  29     0.015    PilS N terminal
		>PilS_PF08805#PilS N terminal

Length = 185

Score = 28.7 bits (64), Expect = 0.015
Identities = 14/51 (27%), Positives = 28/51 (54%), Gaps = 3/51 (5%)

Query: 72 ALSARRNRRMPVKEQGFSLLEVLIAMAISSVLLLGAARFLPALQRESLTNT 122
+LSARR + +++G +L+EVL+ + + VL A + +Q ++
Sbjct: 15 SLSARRKKE---QDKGATLMEVLLVVGVIVVLAASAYKLYSMVQSNIQSSN 62


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_2838  BCTERIALGSPG  29     0.003    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 29.5 bits (66), Expect = 0.003
Identities = 9/24 (37%), Positives = 18/24 (75%)

Query: 1 MKTQRGYTLIETLVAMLILVMLSA 24
QRG+TL+E +V ++I+ +L++
Sbjct: 4 TDKQRGFTLLEIMVVIVIIGVLAS 27


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_2842  PHPHTRNFRASE  611    0.0      Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 611 bits (1578), Expect = 0.0
Identities = 189/571 (33%), Positives = 314/571 (54%), Gaps = 7/571 (1%)

Query: 168 QTRIRALPAAPGVAIAEGWQDATLPLMEQVYQASTLDPALERERLTGALEEAANEFRRYS 227
+I + A+ GVAIA+ + + + + S D + E E+LT ALE++ E R
Sbjct: 2 HHKITGIAASSGVAIAKAFIHLEPNV--DIEKTSITDVSTEIEKLTAALEKSKEELRAIK 59

Query: 228 KRFAAGAQKETAAIFDLYSHLLSDTRLRRELFAEVDKGSV-AEWAVKTVIEKFAEQFAAL 286
+ A + A IF + +L D L + +++ + AE+A+K V + F F ++
Sbjct: 60 DQTEASMGADKAEIFAAHLLVLDDPELVDGIKGKIENEQMNAEYALKEVSDMFVSMFESM 119

Query: 287 SDNYLKERAGDLRALGQRLLFHLDDANQGPNAW-PERFILVADELSATTLAELPQDRLVG 345
+ Y+KERA D+R + +R+L HL G A E +++A++L+ + A+L + + G
Sbjct: 120 DNEYMKERAADIRDVSKRVLGHLIGVETGSLATIAEETVIIAEDLTPSDTAQLNKQFVKG 179

Query: 346 VVVRDGAANSHAAIMVRALGIPTVMGA-DIQPSVLHRRTLIVDGYRGELLVDPEPVLLQE 404
G SH+AIM R+L IP V+G ++ + H +IVDG G ++V+P ++
Sbjct: 180 FATDIGGRTSHSAIMSRSLEIPAVVGTKEVTEKIQHGDMVIVDGIEGIVIVNPTEEEVKA 239

Query: 405 YQRLISEEIELSRLAEDDVNLPAQLKSGERIKVMLNAGLSPEHEEKLGSRIDGIGLYRTE 464
Y+ + + + V P+ K G +++ N G + + L + +GIGLYRTE
Sbjct: 240 YEEKRAAFEKQKQEWAKLVGEPSTTKDGAHVELAANIGTPKDVDGVLANGGEGIGLYRTE 299

Query: 465 IPFMLQSGFPSEEEQVAQYQGMLQMFNDKPVTLRTLDVGADKQLPYMPISEE-NPCLGWR 523
+M + P+EEEQ Y+ ++Q + KPV +RTLD+G DK+L Y+ + +E NP LG+R
Sbjct: 300 FLYMDRDQLPTEEEQFEAYKEVVQRMDGKPVVIRTLDIGGDKELSYLQLPKELNPFLGFR 359

Query: 524 GIRITLDQPEIFLIQVRAMLRANAATGNLNILLPMVTSLDEVDEARRLIERAGREVEEMI 583
IR+ L++ +IF Q+RA+LRA + GNL ++ PM+ +L+E+ +A+ +++ ++
Sbjct: 360 AIRLCLEKQDIFRTQLRALLRA-STYGNLKVMFPMIATLEELRQAKAIMQEEKDKLLSEG 418

Query: 584 GYEIPKPRIGIMLEVPSMVFMLPHLAKRVDFISVGTNDLTQYILAVDRNNTRVANIYDSL 643
+GIM+E+PS AK VDF S+GTNDL QY +A DR N RV+ +Y
Sbjct: 419 VDVSDSIEVGIMVEIPSTAVAANLFAKEVDFFSIGTNDLIQYTMAADRMNERVSYLYQPY 478

Query: 644 HPAMLRALAMIAREAEIHGIDLRLCGEMAGDPMCVAILIGLGYRHLSMNGRSVARVKYLL 703
HPA+LR + M+ + A G + +CGEMAGD + + +L+GLG SM+ S+ + L
Sbjct: 479 HPAILRLVDMVIKAAHSEGKWVGMCGEMAGDEVAIPLLLGLGLDEFSMSATSILPARSQL 538

Query: 704 RRIDFAEAENLAQRSLEAQLATEVRHQVAAF 734
++ E + AQ++L A EV V
Sbjct: 539 LKLSKEELKPFAQKALMLDTAEEVEQLVKKT 569


89  ECP_2967  ECP_2975  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
ECP_2967  5       24       -2.198635  PixF protein
ECP_2968  5       22       -1.078256  PixJ protein
ECP_2969  5       22       -0.268517  PixD protein
ECP_2970  5       20       -0.529444  fimbrial usher protein PixC
ECP_2971  4       24       -2.916589  PixH protein
ECP_2972  3       23       -2.906706  PixA protein
ECP_2973  2       21       -2.770117  hypothetical protein
ECP_2974  2       22       -3.756566  hypothetical protein
ECP_2975  1       24       -4.195807  transport activator
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_2967  FIMBRIALPAPF  107    3e-32    Escherichia coli: P pili tip fibrillum papF protein...
		>FIMBRIALPAPF#Escherichia coli: P pili tip fibrillum papF protein signature.

Length = 167

Score = 107 bits (268), Expect = 3e-32
Identities = 67/172 (38%), Positives = 101/172 (58%), Gaps = 8/172 (4%)

Query: 1 MRITVFLLTFLSFLSDLWAVDIPINITGTIIIPPCQINNSNPVDVDFGNIRVSELDTKEH 60
+R+++F+ L+ ++ L D+ INI G + IPPC INN + VDFGNI +D
Sbjct: 2 IRLSLFISLLLTSVAVL--ADVQINIRGNVYIPPCTINNGQNIVVDFGNINPEHVDNSRG 59

Query: 61 IKVVSFPVYCPYHQGEAYVKMTGQSM-TGKDNVLATNIDGLGIELYQGGEGTGNHLILGS 119
+ + CPY G ++K+TG +M G++NVLATNI GI LYQ G+G L LG+
Sbjct: 60 EVTKNISISCPYKSGSLWIKVTGNTMGVGQNNVLATNITHFGIALYQ-GKGMSTPLTLGN 118

Query: 120 GSSGYGYEVINALSEKNVERTTFTFTAKIYKAEGVTINSGEFSASALINIVY 171
G SG GY V L + R+TFTFT+ ++ +N G+F +A ++++Y
Sbjct: 119 G-SGNGYRVTAGL---DTARSTFTFTSVPFRNGSGILNGGDFRTTASMSMIY 166


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_2969  cloacin  29     0.040    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 28.5 bits (63), Expect = 0.040
Identities = 24/75 (32%), Positives = 32/75 (42%), Gaps = 5/75 (6%)

Query: 3 GGHPGTSGPGTTVAAALSSGEVTLYTPAI----VCISRQKNVKKQRAENMQKMKPALKKT 58
GG GT G + VAA ++ G L TP V IS + A+ M +K K
Sbjct: 72 GGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA-LSAAIADIMAALKGPFKFG 130

Query: 59 LMAVACLSAVPAAQA 73
L VA +P+ A
Sbjct: 131 LWGVALYGVLPSQIA 145


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_2970  PF00577  759    0.0      Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 759 bits (1961), Expect = 0.0
Identities = 248/885 (28%), Positives = 386/885 (43%), Gaps = 65/885 (7%)

Query: 15 LNRLHIMKKNKSTFTINFITYSLMLSLAGVPVYAVDFNTDVLDAADRQNIDFSRFSRAGY 74
LHI K + F + + A + + FN L + D SRF
Sbjct: 13 TQCLHIRKHRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQE 72

Query: 75 IMPGQYQMEIRVNGQDISPSAFQIAFLEPPFSDSDNEKPLPEPCLTPEIVSRMGLTEASQ 134
+ PG Y+++I +N +A + F+ D+E+ + PCLT ++ MGL AS
Sbjct: 73 LPPGTYRVDIYLNNG-------YMATRDVTFNTGDSEQGI-VPCLTRAQLASMGLNTASV 124

Query: 135 EKVTYWNNGQCADFRQL-SGVEIRPNPAEGMLYINMPQAWLEYSDASWLPPSRWDNGIPG 193
+ + C + + + + L + +PQA++ ++PP WD GI
Sbjct: 125 SGMNLLADDACVPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINA 184

Query: 194 LLFDYNINGTVNKPHQGKQSQSLNYNGTAGANFGAWRLRADYQGNLNHTTGSAQGTDSQF 253
L +YN +G + G S N +G N GAWRLR + + N + S+ G+ +++
Sbjct: 185 GLLNYNFSGNSVQNRIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSS-GSKNKW 243

Query: 254 TWSRFYMYRAIPRWRANLTLGENYINSEIFSSWRYTGASLESDDRMLPPKLRGYAPQVSG 313
++ R I R+ LTLG+ Y +IF + GA L SDD MLP RG+AP + G
Sbjct: 244 QHINTWLERDIIPLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHG 303

Query: 314 IADTNARVVISQQGRILYDSTVPAGPFTIQDLD-SSVRGRLDVEVIEQDGRKKTFQVDTA 372
IA A+V I Q G +Y+STVP GPFTI D+ + G L V + E DG + F V +
Sbjct: 304 IARGTAQVTIKQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYS 363

Query: 373 YVPYLTRPGQVRYKLVSGRSRTYEHTMEGPVFAAGEASWGISNTWSLYGGSIVAGDYNAL 432
VP L R G RY + +G R+ E P F G+ W++YGG+ +A Y A
Sbjct: 364 SVPLLQREGHTRYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAF 423

Query: 433 AVGLGRDLSKFGTVSADVTQSVARIPGYDTKQGKSWRLSYSKRFDEVNTDITFAGYRFSE 492
G+G+++ G +S D+TQ+ + +P G+S R Y+K +E T+I GYR+S
Sbjct: 424 NFGIGKNMGALGALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYST 483

Query: 493 RNYMTMDQYLNARYR--------------------NDFTGREKELYTVTLNKNFEDWKAS 532
Y +R + ++ +T+ + ++
Sbjct: 484 SGYFNFADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRT-ST 542

Query: 533 VNLQYSHQTYWDRRTSD-YYTLSVNRYFDAFSFKNIALGISASRSKYLNRD--NDSAFVR 589
+ L SHQTYW D + +N +F++I +S S +K + + +
Sbjct: 543 LYLSGSHQTYWGTSNVDEQFQAGLNT-----AFEDINWTLSYSLTKNAWQKGRDQMLALN 597

Query: 590 LSVPWGT------------GTASYSGSMSND-RYTNTVGYSDTL-NNGLSSYSLNAGVNS 635
+++P+ +ASYS S + R TN G TL + SYS+ G
Sbjct: 598 VNIPFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAG 657

Query: 636 GGGQPSQRQMSAYYNHNGSLTNLSASFSAVENGYSSFGMSASGGATVTMKGAALHAGGMN 695
GG S A N+ G N + +S + SGG G L G
Sbjct: 658 GGDGNSGSTGYATLNYRGGYGNANIGYSH-SDDIKQLYYGVSGGVLAHANGVTL--GQPL 714

Query: 696 GGTRLLVDTDGVGGVPVDGGR-VYTNRWGIGVVTDVSSYYRNTTSVDLNKLPEDMEATRS 754
T +LV G V+ V T+ G V+ + Y N ++D N L ++++ +
Sbjct: 715 NDTVVLVKAPGAKDAKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNA 774

Query: 755 VVESVLTEGAIGYREFEVLKGSRLFAVLRMSDNSYPPFGASVTNAKGRELGMVADSGLAW 814
V V T GAI EF+ G +L L +N PFGA VT+ + G+VAD+G +
Sbjct: 775 VANVVPTRGAIVRAEFKARVGIKLLMTLTH-NNKPLPFGAMVTSESSQSSGIVADNGQVY 833

Query: 815 LSGVNPGETLNVGW--DGRTQCVVDIPAHPDPAQQLL----LPCR 853
LSG+ + V W + CV + P+ QQLL CR
Sbjct: 834 LSGMPLAGKVQVKWGEEENAHCVANYQLPPESQQQLLTQLSAECR 878


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_2971  FIMBRIALPAPE  33     3e-04    Escherichia coli: P pili tip fibrillum papE protein...
		>FIMBRIALPAPE#Escherichia coli: P pili tip fibrillum papE protein signature.

Length = 173

Score = 33.5 bits (76), Expect = 3e-04
Identities = 24/86 (27%), Positives = 39/86 (45%), Gaps = 9/86 (10%)

Query: 29 GMTLPEYWG----EEHVWWDGRASFKGQVIAPACTLSMEDAWQEIDMGTTPLRDLQNSPA 84
G+ LP G +HV +FKG++I PACT+ E++ G +++L S
Sbjct: 6 GLCLPVMLGAVLMSQHVHAADNLTFKGKLIIPACTVQN----AEVNWGDIEIQNLVQS-G 60

Query: 85 GPEKKFRLRLRNCELTGAGKQVYTAT 110
G +K F + + G K T+
Sbjct: 61 GNQKDFTVDMNCPYSLGTMKVTITSN 86


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
ECP_2975  HTHFIS  240    1e-76    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 240 bits (614), Expect = 1e-76
Identities = 112/479 (23%), Positives = 188/479 (39%), Gaps = 83/479 (17%)

Query: 10 SILLIDDDADVLDAYTQLLEQSGYRVFACNNPFEAQAWIQPDWPGIVLSDVCMPGCSGID 69
+IL+ DDDA + Q L ++GY V +N WI +V++DV MP + D
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 70 LMMLFHQDDQQLPILLITGHGDVPMAVDAVKKGAWDFLQKPVDPGKLLSLVEEALRQRQS 129
L+ + LP+L+++ A+ A +KGA+D+L KP D +L+ ++ AL + +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 130 IIARRQYCQQTLQVELIGRSEWINQYRRRLQQLSETDIAVWLYGAPGTGRMTGARYLHQF 189
++ + Q L+GRS + + R L +L +TD+ + + G GTG+ AR LH +
Sbjct: 125 RPSKLEDDSQDGM-PLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDY 183

Query: 190 GRNAQGEFVYRELTPDNAPQLND------------------------FIALAQGGTLVLS 225
G+ G FV N + A+GGTL L
Sbjct: 184 GKRRNGPFV-----AINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLD 238

Query: 226 HPEHLTREQQYHLVQ-LQSQEHRP----------FRLIGIGDTSLVELAASNHIIAELYY 274
+ + Q L++ LQ E+ R++ + L + +LYY
Sbjct: 239 EIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYY 298

Query: 275 CFAMTQIACLPLTQRPDDIEPLFRHYLCKACQRLNHPVPEVGKEMLKEMMRRMWPNNVRE 334
+ + PL R +DI L RH++ +A + V +E L+ M WP NVRE
Sbjct: 299 RLNVVPLRLPPLRDRAEDIPDLVRHFVQQAE-KEGLDVKRFDQEALELMKAHPWPGNVRE 357

Query: 335 LANAAE-----------------LFTVGILPLAETANPLMHVGT---------------- 361
L N +P + G+
Sbjct: 358 LENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFA 417

Query: 362 --------PAPLDRRVEDAERQIITEALNIHQGRINEVAEYLQIPRKKLYLRMKKYGLS 412
DR + + E +I AL +G + A+ L + R L ++++ G+S
Sbjct: 418 SFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVS 476


90  ECP_3040  ECP_3050  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
ECP_3040  -1      19       5.225905   type II secretion protein GspJ
ECP_3041  -1      19       4.632635   type II secretion protein GspI
ECP_3042  -2      16       3.816531   type II secretion protein GspH
ECP_3043  -3      15       3.115055   type II secretion protein
ECP_3044  -2      14       2.908189   type II secretion protein GspF
ECP_3045  -1      12       1.249355   type II secretion protein GspE
ECP_3046  -1      11       -0.046347  type II secretion protein GspD
ECP_3047  -2      12       -0.445380  type II secretion protein GspC
ECP_3048  -3      12       0.093183   lipoprotein
ECP_3049  -2      13       0.561788   prepilin peptidase
ECP_3050  -2      14       1.082648   lipoprotein AcfD
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_3040  BCTERIALGSPG  28     0.026    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 27.5 bits (61), Expect = 0.026
Identities = 16/48 (33%), Positives = 22/48 (45%), Gaps = 3/48 (6%)

Query: 1 MRRAS--AGFTLLEMLVAIAIFASLA-LMAQQVTNGVTRVNSAVAGHD 45
MR GFTLLE++V I I LA L+ + + + A D
Sbjct: 1 MRATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSD 48


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_3041  BCTERIALGSPH  34     8e-05    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 33.8 bits (77), Expect = 8e-05
Identities = 13/24 (54%), Positives = 18/24 (75%)

Query: 2 KRGFTLLEVMLALAIFALAATAVL 25
+RGFTLLE+ML L + ++A VL
Sbjct: 3 QRGFTLLEMMLILLLMGVSAGMVL 26


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_3042  BCTERIALGSPH  77     5e-20    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 76.5 bits (188), Expect = 5e-20
Identities = 42/196 (21%), Positives = 70/196 (35%), Gaps = 41/196 (20%)

Query: 1 MPERGFTLLEIMLVIFLIGLASAGVVQTFATDSESPAKKAAQDFLTRFAQFKDRAVIEGQ 60
M +RGFTLLE+ML++ L+G+++ V+ F + A + F + + R + GQ
Sbjct: 1 MRQRGFTLLEMMLILLLMGVSAGMVLLAFPASRDDSAAQTLARFEAQLRFVQQRGLQTGQ 60

Query: 61 TLGVLIDPPGYQFMQRRQGQWLPVSATRLSAQVTVPKQVQMLLQPGSDIWQKEYALELQR 120
GV + P +QF+ + P D W L L+
Sbjct: 61 FFGVSVHPDRWQFLVLEARDGADPA-------------------PADDGWSGYRWLPLRA 101

Query: 121 RRL----TLHDIELEL-----QKEAKKKTPQIRFSPFEPATPFTLRFYSAAQNACWAVKL 171
R+ ++ +L L + P + P TPF L L
Sbjct: 102 GRVATSGSIAGGKLNLAFAQGEAWTPGDNPDVLIFPGGEMTPFRLT-------------L 148

Query: 172 AHDGALSLNQCDERMP 187
++ N E +P
Sbjct: 149 GEAPGIAFNARGESLP 164


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_3043  BCTERIALGSPG  217    3e-76    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 217 bits (554), Expect = 3e-76
Identities = 90/146 (61%), Positives = 109/146 (74%), Gaps = 3/146 (2%)

Query: 6 RTQKPRAGFTLLEVMVVIVILGVLASLVVPNLLGNKEKADRQKAISDIVALENALDMYRL 65
R + GFTLLE+MVVIVI+GVLASLVVPNL+GNKEKAD+QKA+SDIVALENALDMY+L
Sbjct: 2 RATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKL 61

Query: 66 DNGRYPTTEQGLEALIQQPANMADSRNYRTGGYIKRLPKDPWGNDYQYLSPGEKGLFDVY 125
DN YPTT QGLE+L++ P + NY GYIKRLP DPWGNDY ++PGE G +D+
Sbjct: 62 DNHHYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDLL 121

Query: 126 TLGADGQENGEGAGADIGNWNLQEFQ 151
+ G DG+ E DI NW L + +
Sbjct: 122 SAGPDGEMGTED---DITNWGLSKKK 144


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_3044  BCTERIALGSPF  452    e-161    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 452 bits (1165), Expect = e-161
Identities = 224/406 (55%), Positives = 300/406 (73%), Gaps = 1/406 (0%)

Query: 1 MALFYYQALERNGRKTKGMIEADSARHARQLLRGKDLIPVHI-EARLNASAGGMLQRRRH 59
MA ++YQAL+ G+K +G EADSAR ARQLLR + L+P+ + E R + G
Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLR 60

Query: 60 AHRRVAAADLALFTRQLATLVQAAMPLETCLQAVSEQSEKLHVKSLGMALRSRIQEGYTL 119
R++ +DLAL TRQLATLV A+MPLE L AV++QSEK H+ L A+RS++ EG++L
Sbjct: 61 RKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSL 120

Query: 120 SDSLREHPRVFDSLFCSMVAAGEKSGHLDVVLNRLADYTEQRQRLKSRLLQAMLYPLVLL 179
+D+++ P F+ L+C+MVAAGE SGHLD VLNRLADYTEQRQ+++SR+ QAM+YP VL
Sbjct: 121 ADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLT 180

Query: 180 VVATGVVTILLTAVVPEIIEQFDHLGHALPASTRMLIAMSDTLQTSGVYWLAGLLGLLVL 239
VVA VV+ILL+ VVP+++EQF H+ ALP STR+L+ MSD ++T G + L LL +
Sbjct: 181 VVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMA 240

Query: 240 GQRLLKNPAMRLRWDKTLLRLPVTGRVARGLNTARFSRTLSILTASSVPLLEGIQTAAAV 299
+ +L+ R+ + + LL LP+ GR+ARGLNTAR++RTLSIL AS+VPLL+ ++ + V
Sbjct: 241 FRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDV 300

Query: 300 SANRYVEQQLLLAADRVREGSSLRAALADLRLFPPMMLYMIASGEQSGELETMLEQAAVN 359
+N Y +L LA D VREG SL AL LFPPMM +MIASGE+SGEL++MLE+AA N
Sbjct: 301 MSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADN 360

Query: 360 QEREFDTQVGLALGLFEPALVVMMAGVVLFIVIAILEPMLQLNNMV 405
Q+REF +Q+ LALGLFEP LVV MA VVLFIV+AIL+P+LQLN ++
Sbjct: 361 QDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLM 406


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_3046  BCTERIALGSPD  574    0.0      Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 574 bits (1482), Expect = 0.0
Identities = 295/668 (44%), Positives = 431/668 (64%), Gaps = 34/668 (5%)

Query: 24 LLPLVLAAALCSSPVWAEEATFTANFKDTDLKSFIETVGANLNKTIIMGPGVQGKVSIRT 83
L L++ AAL P AEE F+A+FK TD++ FI TV NLNKT+I+ P V+G +++R+
Sbjct: 11 SLTLLIFAALLFRPAAAEE--FSASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTITVRS 68

Query: 84 MTPLNERQYYQLFLNLLEAQGYAVVPMENDVLKVVKSSAAKVEPLPLVGEGSDNYAGDEM 143
LNE QYYQ FL++L+ G+AV+ M N VLKVV+S AK +P+ + + GDE+
Sbjct: 69 YDMLNEEQYYQFFLSVLDVYGFAVINMNNGVLKVVRSKDAKTAAVPVASDAAPG-IGDEV 127

Query: 144 VTKVVPVRNVSVRELAPILRQMIDSAGSGNVVNYDPSNVIMLTGRASVVERLTEVIQRVD 203
VT+VVP+ NV+ R+LAP+LRQ+ D+AG G+VV+Y+PSNV+++TGRA+V++RL +++RVD
Sbjct: 128 VTRVVPLTNVAARDLAPLLRQLNDNAGVGSVVHYEPSNVLLMTGRAAVIKRLLTIVERVD 187

Query: 204 HAGNRTEEVIPLDNASASEIARVLESLTKNSGENQ-PATLKSQIVADERTNSVIVSGDPA 262
+AG+R+ +PL ASA+++ +++ L K++ ++ P ++ + +VADERTN+V+VSG+P
Sbjct: 188 NAGDRSVVTVPLSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGEPN 247

Query: 263 TRDKMRRLIRRLDSEMERSGNSQVFYLKYSKAEDLVDVLKQVSGTLTAAKEEAEGTVGSG 322
+R ++ +I++LD + GN++V YLKY+KA DLV+VL +S T+ + K+ A+ +
Sbjct: 248 SRQRIIAMIKQLDRQQATQGNTKVIYLKYAKASDLVEVLTGISSTMQSEKQAAKPV-AAL 306

Query: 323 REVVSIAASKHSNALIVTAPQDIMQSLQSVIEQLDIRRAQVHVEALIVEVAEGSNINFGV 382
+ + I A +NALIVTA D+M L+ VI QLDIRR QV VEA+I EV + +N G+
Sbjct: 307 DKNIIIKAHGQTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLGI 366

Query: 383 QWASKDAGLMQFANGTQIPIGTLGAAISQAKPQKGSTVISENGATTINPDTNGDLST-LA 441
QWA+K+AG+ QF N + +PI T A + +G +S+ LA
Sbjct: 367 QWANKNAGMTQFTN-SGLPISTAIAG-------------------ANQYNKDGTVSSSLA 406

Query: 442 QLLSGFSGTAVGVVKGDWMALVQAVKNDSSSNVLSTPSITTLDNQEAFFMVGQDVPVLTG 501
LS F+G A G +G+W L+ A+ + + +++L+TPSI TLDN EA F VGQ+VPVLTG
Sbjct: 407 SALSSFNGIAAGFYQGNWAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVPVLTG 466

Query: 502 STVGSNNSNPFNTVERKKVGIMLKVTPQINEGNAVQMVIEQEVSKVEGQTS-----LDVV 556
S S N FNTVERK VGI LKV PQINEG++V + IEQEVS V S L
Sbjct: 467 SQTTSG-DNIFNTVERKTVGIKLKVKPQINEGDSVLLEIEQEVSSVADAASSTSSDLGAT 525

Query: 557 FGERKLKTTVLANDGELIVLGGLMDDQAGESVAKVPLLGDIPLIGNLFKSTADKKEKRNL 616
F R + VL GE +V+GGL+D ++ KVPLLGDIP+IG LF+ST+ K KRNL
Sbjct: 526 FNTRTVNNAVLVGSGETVVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRNL 585

Query: 617 MVFIRPTILRDGMAADGVSQRKYNYMRAEQIYR--DEQGLSLMPHTAQPVLPAQNQALPP 674
M+FIRPT++RD S +Y Q + E +++ + P Q+ A
Sbjct: 586 MLFIRPTVIRDRDEYRQASSGQYTAFNDAQSKQRGKENNDAMLNQDLLEIYPRQDTAAFR 645

Query: 675 EVRAFLNA 682
+V A ++A
Sbjct: 646 QVSAAIDA 653


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_3047  BCTERIALGSPC  118  9e-34  Bacterial general secretion pathway protein C signa...
		>BCTERIALGSPC#Bacterial general secretion pathway protein C signature.

Length = 272

Score = 118 bits (298), Expect = 9e-34
Identities = 72/287 (25%), Positives = 117/287 (40%), Gaps = 40/287 (13%)

Query: 40 IARGMFWLMLLIISAKVAHSLWRYFSFSAEYTA-VSPSANKPLRADAKAFDKNDVQLISQ 98
I R +F+L++L+ ++A WR A VS P +A + ND L
Sbjct: 14 IRRILFYLLMLLFCQQLAMIFWR---IGLPDNAPVSSVQITPAQARQQPVTLNDFTL--- 67

Query: 99 QNWFGKYQPV--ATPVKQPEPAPVAETRLNVVLRGIAFG---ARPGAVIEEGGKQQVYLQ 153
FG A + + + + + LN+ L G+ G +R A+I + +Q
Sbjct: 68 ---FGVSPEKNKAGALDASQMSNLPPSTLNLSLTGVMAGDDDSRSIAIISKDNEQFSRGV 124

Query: 154 GETLGSHNAVIEEINRDHVMLRYQGKIERLSLAEEERSTVAVTNKKAVSDEAKQAVAEPA 213
E + +NA I I D V+L+YQG+ E L L +E S
Sbjct: 125 NEEVPGYNAKIVSIRPDRVVLQYQGRYEVLGLYSQEDS------------------GSDG 166

Query: 214 VSAPVEIPAAVRQAL-AKDPQKIFNYIQLTPVRKEG-IVGYAVKPGADRSLFDASGFKEG 271
V A V + L + + +Y+ +P+ + + GY + PG F G ++
Sbjct: 167 VPG-----AQVNEQLQQRASTTMSDYVSFSPIMNDNKLQGYRLNPGPKSDSFYRVGLQDN 221

Query: 272 DIAIALNQQDFTDPRAMIALMRQLPSMDSIQLTVLRKGARYDISIAL 318
D+A+ALN D D M ++ + + LTV R G R DI +
Sbjct: 222 DMAVALNGLDLRDAEQAKKAMERMADVHNFTLTVERDGQRQDIYMEF 268


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_3049  PREPILNPTASE  278  2e-96  Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 278 bits (714), Expect = 2e-96
Identities = 110/274 (40%), Positives = 149/274 (54%), Gaps = 12/274 (4%)

Query: 1 MFFDVFQQYPAAMPVLATVGGLIIGSFLNVVIWRYPIML-RQQMAEFHGEMPSTQSKI-- 57
+ ++ P L + L+IGSFLNVVI R PIML R+ AE+ +
Sbjct: 3 LLLELAHGLPWLYFSLVFLFSLMIGSFLNVVIHRLPIMLEREWQAEYRSYFNPDDEGVDE 62

Query: 58 ---SLALPRSHCPHCQQTIRVRDNIPLLSWLMLKGRCRDCQAKISKRYPLVELLTALAFL 114
+L +PRS CPHC I +NIPLLSWL L+GRCR CQA IS RYPLVELLTAL +
Sbjct: 63 PPYNLMVPRSCCPHCNHPITALENIPLLSWLWLRGRCRGCQAPISARYPLVELLTALLSV 122

Query: 115 LASLVWPESGWGLAVMILSAWLIAASVIDLDNQWLPDVFTQGVLWTGLIAAWAQQSPLTL 174
++ LA ++L+ L+A + IDLD LPD T +LW GL+ ++L
Sbjct: 123 AVAMTLAPGWGTLAALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNL-LGGFVSL 181

Query: 175 QDAVTGVLVGFITFYSLRWIAGIVLRKEALGMGDVLLFAALGGWVGALSLPNVALIASCC 234
DAV G + G++ +SL W ++ KE +G GD L AALG W+G +LP V L++S
Sbjct: 182 GDAVIGAMAGYLVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSLV 241

Query: 235 GLIYAVI-----TKRGSTTLPFGPCLSLGGIATL 263
G + S +PFGP L++ G L
Sbjct: 242 GAFMGIGLILLRNHHQSKPIPFGPYLAIAGWIAL 275


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_3050  PF03544  49  5e-08  Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 48.8 bits (116), Expect = 5e-08
Identities = 23/60 (38%), Positives = 29/60 (48%), Gaps = 3/60 (5%)

Query: 32 SSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPTPD---PEPTPEPEPEPVP 88
S T + V+P P P EP PEP P PEP E P+P P+P+P+PV
Sbjct: 50 ISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVK 109



Score = 43.0 bits (101), Expect = 4e-06
Identities = 17/88 (19%), Positives = 25/88 (28%), Gaps = 2/88 (2%)

Query: 33 SDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPTPDPEPTPEPEPEPVPTKTG 92
+D P + PE +P P PEP PEP + E + V
Sbjct: 58 ADLEPPQAVQ-PPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKR 116

Query: 93 YLTLGGSQRITGATCNGESSDGFTFTPG 120
+ R N + + T
Sbjct: 117 DVK-PVESRPASPFENTAPARPTSSTAT 143



Score = 41.9 bits (98), Expect = 1e-05
Identities = 21/116 (18%), Positives = 37/116 (31%), Gaps = 5/116 (4%)

Query: 35 TPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPTPDP--EPTPEPEPEPVPTKTG 92
P + +P +P PEP +PEP PEP P+P E E K
Sbjct: 45 APAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPK 104

Query: 93 YLTLGGSQRITGATCNGESSDGFTFTPGDKVTCVAGNNTTIATFDTQSEAARSLRA 148
+ ++ ES +P + ++T ++ + +
Sbjct: 105 PKPVKKVEQPKRDVKPVESRPA---SPFENTAPARPTSSTATAATSKPVTSVASGP 157



Score = 39.2 bits (91), Expect = 7e-05
Identities = 20/97 (20%), Positives = 30/97 (30%), Gaps = 9/97 (9%)

Query: 29 SGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPE-PTPDPEPTPEPTPDPEPTPEPEPEPV 87
+ P V PE +P P P E P P+P P+P + +P+ +
Sbjct: 65 AVQPPPEPVV----EPEPEPEPIPEPPKEAPVVIEKPKPKPKPKP-KPVKKVEQPKRDVK 119

Query: 88 P---TKTGYLTLGGSQRITGATCNGESSDGFTFTPGD 121
P R T +T +S T
Sbjct: 120 PVESRPASPFENTAPARPTSSTATAATSKPVTSVASG 156



Score = 35.7 bits (82), Expect = 8e-04
Identities = 17/40 (42%), Positives = 17/40 (42%)

Query: 50 PDPTPNPEPTPEPTPDPEPTPEPTPDPEPTPEPEPEPVPT 89
P P T D EP P PEP EPEPEP P
Sbjct: 44 PAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPI 83



Score = 30.7 bits (69), Expect = 0.034
Identities = 11/40 (27%), Positives = 13/40 (32%)

Query: 52 PTPNPEPTPEPTPDPEPTPEPTPDPEPTPEPEPEPVPTKT 91
P P + + P P P P EPEP P
Sbjct: 44 PAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPI 83


91  ECP_3317  ECP_3325  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
ECP_3317  -2      16       -0.396650  serine endoprotease
ECP_3318  -1      20       -1.595492  serine endoprotease
ECP_3319  -1      14       -1.602694  malate dehydrogenase
ECP_3320  -1      13       -1.235442  hypothetical protein
ECP_3321  -1      13       -0.970052  arginine repressor ArgR
ECP_3322  -1      14       -0.401612  hypothetical protein
ECP_3323  -1      13       0.498891   hypothetical protein
ECP_3324  -2      10       1.305570   p-hydroxybenzoic acid efflux subunit AaeB
ECP_3325  -1      10       1.499751   p-hydroxybenzoic acid efflux subunit AaeA
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_3317  V8PROTEASE  70  2e-15  V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 70.0 bits (171), Expect = 2e-15
Identities = 29/184 (15%), Positives = 62/184 (33%), Gaps = 38/184 (20%)

Query: 90 GLGSGVIINANKGYVLTKNHVINQAQKISIQL------------NDGREFDAKLIGSDDQ 137
+ SGV++ + +LT HV++ L +G ++ +
Sbjct: 102 FIASGVVVGKDT--LLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGE 159

Query: 138 SDIALLQIQN-------PSKLTQIAIADSDKLRVGDFAVAVGNPFGLGQTATSGIISALG 190
D+A+++ + ++++ + +V G P ++ +
Sbjct: 160 GDLAIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQNITVTGYPGDKP-------VATMW 212

Query: 191 RSGLNLEGLEN-FIQTDASINRGNSGGALLNLNGELIGINTAILAPGGGSVGIGFAIPSN 249
S + L+ +Q D S GNSG + N E+IGI+ G+
Sbjct: 213 ESKGKITYLKGEAMQYDLSTTGGNSGSPVFNEKNEVIGIHWG---------GVPNEFNGA 263

Query: 250 MART 253
+
Sbjct: 264 VFIN 267


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_3318  V8PROTEASE  53  6e-10  V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 53.1 bits (127), Expect = 6e-10
Identities = 33/160 (20%), Positives = 61/160 (38%), Gaps = 26/160 (16%)

Query: 77 RTLGSGVIMDQRGYIITNKHVINDADQIIVALQ------------DGRVFEALLVGSDSL 124
+ SGV++ + ++TNKHV++ AL+ +G +
Sbjct: 101 TFIASGVVVG-KDTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGE 159

Query: 125 TDLAVLKINATGGLPTIP--INPRRVPH-----IGDVVLAIGNPYNLGQTITQGIISATG 177
DLA++K + I + P + + + + G P + T + G
Sbjct: 160 GDLAIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQNITVTGYPGD-KPVATMW--ESKG 216

Query: 178 RIGLNPTGRQNFLQTDASINHGNSGGALVNSLGELMGINT 217
+I + +Q D S GNSG + N E++GI+
Sbjct: 217 KI---TYLKGEAMQYDLSTTGGNSGSPVFNEKNEVIGIHW 253


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_3319  DHBDHDRGNASE  28  0.043  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 28.1 bits (62), Expect = 0.043
Identities = 37/167 (22%), Positives = 61/167 (36%), Gaps = 27/167 (16%)

Query: 3 VAVLGAAGGIGQALALLLKTQLPSGSELSLYDIAPVTPGVAVDLSHIPTAVKIKGFSGED 62
+ GAA GIG+A+A L G+ ++ D P V S A + F +
Sbjct: 11 AFITGAAQGIGEAVARTL---ASQGAHIAAVDYNP-EKLEKVVSSLKAEARHAEAFPADV 66

Query: 63 ATPA------------LEGADVVLISAGVARK------PGMDRSDLFNVNAGIVKNLVQQ 104
A + D+++ AGV R + F+VN+ V N +
Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRS 126

Query: 105 VAKTCPK----ACIGIITNPVNTT-VAIAAEVLKKAGVYDKNKLFGV 146
V+K + + + +NP ++AA KA K G+
Sbjct: 127 VSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGL 173


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_3321  ARGREPRESSOR  168  9e-57  Bacterial arginine repressor signature.
		>ARGREPRESSOR#Bacterial arginine repressor signature.

Length = 149

Score = 168 bits (428), Expect = 9e-57
Identities = 44/141 (31%), Positives = 71/141 (50%), Gaps = 5/141 (3%)

Query: 15 KALLKEEKFSSQGEIVAALQEQGFDNINQSKVSRMLTKFGAVRTRNAKMEMVYCLPAELG 74
+ ++ + +Q E+V L++ G+ N+ Q+ VSR + + V+ Y LPA+
Sbjct: 11 REIITANEIETQDELVDILKKDGY-NVTQATVSRDIKELHLVKVPTNNGSYKYSLPADQR 69

Query: 75 VPTTSSPLKNLV---LDIDYNDAVVVIHTSPGAAQLIARLLDSLGKAEGILGTIAGDDTI 131
S ++L+ + ID ++V+ T PG AQ I L+D+L E I+GTI GDDTI
Sbjct: 70 FNPLSKLKRSLMDAFVKIDSASHLIVLKTMPGNAQAIGALMDNLDWEE-IMGTICGDDTI 128

Query: 132 FTTPANGFTVKELYEAILELF 152
K + + ILEL
Sbjct: 129 LIICRTHDDTKVVQKKILELL 149


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_3325  RTXTOXIND  54  2e-10  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 54.4 bits (131), Expect = 2e-10
Identities = 29/163 (17%), Positives = 59/163 (36%), Gaps = 16/163 (9%)

Query: 6 RKFSRTAITVVLVILAFIAIFNAWVYYTE----SPWTRDARFSADVVAIAPDVSGLITQV 61
SR V I+ F+ I + + S I P + ++ ++
Sbjct: 51 TPVSRRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEI 110

Query: 62 NVHDNQLVKKGQVLFTIDQPR-------YQKALEEAQADVAYYQVLAQEKRQEAGRRNRL 114
V + + V+KG VL + Q +L +A+ + YQ+L++ E + L
Sbjct: 111 IVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRS--IELNKLPEL 168

Query: 115 GVQAMSREEIDQANNVL---QTVLHQLAKAQATRDLAKLDLER 154
+ + VL + Q + Q + +L+L++
Sbjct: 169 KLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDK 211



Score = 51.4 bits (123), Expect = 2e-09
Identities = 28/147 (19%), Positives = 54/147 (36%), Gaps = 15/147 (10%)

Query: 100 LAQEKRQEAGRRNRLGVQ-AMSREEIDQANNVLQT-VLHQLAKAQAT-------RDLAKL 150
E R + ++ + ++EE + + +L +L + +
Sbjct: 264 AVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEE 323

Query: 151 DLERTVIRAPADGWVTNLNVYT-GEFITRGSTAVALVKQNSFY-VLAYMEETKLEGVRPG 208
+ +VIRAP V L V+T G +T T + +V ++ V A ++ + + G
Sbjct: 324 RQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVG 383

Query: 209 YRAEIT----PLGSNKVLKGTVDSVAA 231
A I P L G V ++
Sbjct: 384 QNAIIKVEAFPYTRYGYLVGKVKNINL 410


92  ECP_3333  ECP_3340  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
ECP_3333  -3      12       1.177623   rod shape-determining protein MreC
ECP_3334  -3      14       0.235525   rod shape-determining protein MreB
ECP_3335  -2      15       -0.416228  hypothetical protein
ECP_3336  -2      13       0.190027   regulatory protein CsrD
ECP_3337  -2      15       0.531788   zinc-binding dehydrogenase
ECP_3338  -2      15       -1.048461  hypothetical protein
ECP_3339  0       14       -2.459737  hypothetical protein
ECP_3340  0       13       -3.296880  acetyl-CoA carboxylase biotin carboxyl carrier
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_3333  PF03544  28  0.043  Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 28.4 bits (63), Expect = 0.043
Identities = 12/72 (16%), Positives = 20/72 (27%), Gaps = 3/72 (4%)

Query: 296 MMPQVLPSPDAMGPKLPEPATGITQPTPQQPATGNAVTAPAAPTQPAANRSPQRATPPQS 355
P+ +P P P + E +P P+ V P +P +R
Sbjct: 78 PEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVK---KVEQPKRDVKPVESRPASPFENTAP 134

Query: 356 GAQPPARAPGGQ 367
+ A
Sbjct: 135 ARPTSSTATAAT 146


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_3334  SHAPEPROTEIN  576  0.0  Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 576 bits (1487), Expect = 0.0
Identities = 347/347 (100%), Positives = 347/347 (100%)

Query: 1 MLKKFRGMFSNDLSIDLGTANTLIYVKGQGIVLNEPSVVAIRQDRAGSPKSVAAVGHDAK 60
MLKKFRGMFSNDLSIDLGTANTLIYVKGQGIVLNEPSVVAIRQDRAGSPKSVAAVGHDAK
Sbjct: 1 MLKKFRGMFSNDLSIDLGTANTLIYVKGQGIVLNEPSVVAIRQDRAGSPKSVAAVGHDAK 60

Query: 61 QMLGRTPGNIAAIRPMKDGVIADFFVTEKMLQHFIKQVHSNSFMRPSPRVLVCVPVGATQ 120
QMLGRTPGNIAAIRPMKDGVIADFFVTEKMLQHFIKQVHSNSFMRPSPRVLVCVPVGATQ
Sbjct: 61 QMLGRTPGNIAAIRPMKDGVIADFFVTEKMLQHFIKQVHSNSFMRPSPRVLVCVPVGATQ 120

Query: 121 VERRAIRESAQGAGAREVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLNG 180
VERRAIRESAQGAGAREVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLNG
Sbjct: 121 VERRAIRESAQGAGAREVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLNG 180

Query: 181 VVYSSSVRIGGDRFDEAIINYVRRNYGSLIGEATAERIKHEIGSAYPGDEVREIEVRGRN 240
VVYSSSVRIGGDRFDEAIINYVRRNYGSLIGEATAERIKHEIGSAYPGDEVREIEVRGRN
Sbjct: 181 VVYSSSVRIGGDRFDEAIINYVRRNYGSLIGEATAERIKHEIGSAYPGDEVREIEVRGRN 240

Query: 241 LAEGVPRGFTLNSNEILEALQEPLTGIVSAVMVALEQCPPELASDISERGMVLTGGGALL 300
LAEGVPRGFTLNSNEILEALQEPLTGIVSAVMVALEQCPPELASDISERGMVLTGGGALL
Sbjct: 241 LAEGVPRGFTLNSNEILEALQEPLTGIVSAVMVALEQCPPELASDISERGMVLTGGGALL 300

Query: 301 RNLDRLLMEETGIPVVVAEDPLTCVARGGGKALEMIDMHGGDLFSEE 347
RNLDRLLMEETGIPVVVAEDPLTCVARGGGKALEMIDMHGGDLFSEE
Sbjct: 301 RNLDRLLMEETGIPVVVAEDPLTCVARGGGKALEMIDMHGGDLFSEE 347


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_3337  NUCEPIMERASE  29  0.026  Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 29.0 bits (65), Expect = 0.026
Identities = 11/28 (39%), Positives = 17/28 (60%)

Query: 150 VVVTGASGGVGSTAVALLHKLGYQVVAV 177
+VTGA+G +G L + G+QVV +
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAGHQVVGI 30


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_3340  RTXTOXIND  27  0.026  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 27.5 bits (61), Expect = 0.026
Identities = 8/27 (29%), Positives = 16/27 (59%)

Query: 127 IEADKSGTVKAILVESGQPVEFDEPLV 153
I+ ++ VK I+V+ G+ V + L+
Sbjct: 99 IKPIENSIVKEIIVKEGESVRKGDVLL 125


93  ECP_3354  ECP_3360  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
ECP_3354  -2      18       -4.331716  DNA-binding protein Fis
ECP_3355  -2      12       -2.395989  methyltransferase
ECP_3356  -3      13       -2.040753  hypothetical protein
ECP_3357  -3      13       -1.869402  DNA-binding transcriptional regulator EnvR
ECP_3358  -2      13       -1.161045  hypothetical protein
ECP_3359  -2      14       -0.716516  acriflavine resistance protein E
ECP_3360  -3      15       -0.680830  acriflavine resistance protein F
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_3354  DNABINDNGFIS  157  3e-54  DNA-binding protein FIS signature.
		>DNABINDNGFIS#DNA-binding protein FIS signature.

Length = 98

Score = 157 bits (399), Expect = 3e-54
Identities = 98/98 (100%), Positives = 98/98 (100%)

Query: 1 MFEQRVNSDVLTVSTVNSQDQVTQKPLRDSVKQALKNYFAQLNGQDVNDLYELVLAEVEQ 60
MFEQRVNSDVLTVSTVNSQDQVTQKPLRDSVKQALKNYFAQLNGQDVNDLYELVLAEVEQ
Sbjct: 1 MFEQRVNSDVLTVSTVNSQDQVTQKPLRDSVKQALKNYFAQLNGQDVNDLYELVLAEVEQ 60

Query: 61 PLLDMVMQYTRGNQTRAALMMGINRGTLRKKLKKYGMN 98
PLLDMVMQYTRGNQTRAALMMGINRGTLRKKLKKYGMN
Sbjct: 61 PLLDMVMQYTRGNQTRAALMMGINRGTLRKKLKKYGMN 98


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_3357  HTHTETR  128  5e-39  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 128 bits (322), Expect = 5e-39
Identities = 77/209 (36%), Positives = 122/209 (58%), Gaps = 3/209 (1%)

Query: 1 MAKRTKAEALKTRQELIETAIAQFAQHGVSKTTLNDIADAANVTRGAIYWHFENKTQLFN 60
MA++TK EA +TRQ +++ A+ F+Q GVS T+L +IA AA VTRGAIYWHF++K+ LF+
Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60

Query: 61 EMW-LQQPSLRELIQDYLTAGLEHDPFQQLREKLIVGLQYIAKIPRQQALLKILYHKCEF 119
E+W L + ++ EL + A DP LRE LI L+ R++ L++I++HKCEF
Sbjct: 61 EIWELSESNIGELELE-YQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEF 119

Query: 120 NDEM-LAEGVIREKMGFNPQTLREVLQACQQQGCVANNLDLDVVMIIIDGAFSGIVQNWL 178
EM + + R + + + L+ C + + +L II+ G SG+++NWL
Sbjct: 120 VGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWL 179

Query: 179 MNMACYDLYKQAPALVDNVLRMFMPDENI 207
+DL K+A V +L M++ +
Sbjct: 180 FAPQSFDLKKEARDYVAILLEMYLLCPTL 208


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_3359  RTXTOXIND  44  8e-07  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 43.7 bits (103), Expect = 8e-07
Identities = 39/217 (17%), Positives = 70/217 (32%), Gaps = 38/217 (17%)

Query: 98 ATYQASYDSAKGELAKSEAAAAIAHLTVKRYVPLVGTKYISQQEYDQAIADA-RQADATV 156
K +L + E+ A + Q + I D RQ +
Sbjct: 262 VEAVNELRVYKSQLEQIESEILSAKEEYQLV----------TQLFKNEILDKLRQTTDNI 311

Query: 157 IAAKATVESARINLAYTKVTAPISGRIGK-STVTEGALVTNGQTTELATVQQLDPIYVDV 215
+ + + AP+S ++ + TEG +VT +T + V + D + V
Sbjct: 312 GLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETL-MVIVPEDDTLEVTA 370

Query: 216 TQSSND--FMRLKQSVEQGNLHKENATSNVELVMENGQTYP-LKGTLQ--FSDVTVDEST 270
+ D F+ + Q+ +++ Y L G ++ D D+
Sbjct: 371 LVQNKDIGFINVGQNAI------------IKVEAFPYTRYGYLVGKVKNINLDAIEDQRL 418

Query: 271 GSIT--LRAI------FPNPQHTLLPGMFVRARIDEG 299
G + + +I N L GM V A I G
Sbjct: 419 GLVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTG 455



Score = 34.0 bits (78), Expect = 0.001
Identities = 22/127 (17%), Positives = 43/127 (33%), Gaps = 13/127 (10%)

Query: 46 TAPLEVKTELPGR-TNAYRIAEVRPQVSGIVLNRNFTEGSDVQAGQSLYQIDPATYQASY 104
+E+ G+ T++ R E++P + IV EG V+ G L ++ +A
Sbjct: 77 LGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEA-- 134

Query: 105 DSAKGELAKSEAAAAIAHLTVKRYVPLVGTKYISQQEYDQAIADARQADATVIAAKATVE 164
+ K++++ A L RY L E ++ +
Sbjct: 135 -----DTLKTQSSLLQARLEQTRYQIL-----SRSIELNKLPELKLPDEPYFQNVSEEEV 184

Query: 165 SARINLA 171
+L
Sbjct: 185 LRLTSLI 191



Score = 29.0 bits (65), Expect = 0.030
Identities = 11/34 (32%), Positives = 15/34 (44%), Gaps = 1/34 (2%)

Query: 65 AEVRPQVSGIVLNRN-FTEGSDVQAGQSLYQIDP 97
+ +R VS V TEG V ++L I P
Sbjct: 328 SVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVP 361


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_3360  ACRIFLAVINRP  1402  0.0  Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 1402 bits (3631), Expect = 0.0
Identities = 1027/1034 (99%), Positives = 1030/1034 (99%)

Query: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60
MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120
VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 121 EVQQQGISVEKSSSSYLMVPGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180
EVQQQGISVEKSSSSYLMV GFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRL 240
QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR
Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240

Query: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300
KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL
Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300

Query: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIQEVVKTLFEAIMLVFLVMYLFLQ 360
DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSI EVVKTLFEAIMLVFLVMYLFLQ
Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360

Query: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420
NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM
Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420

Query: 421 MEDKLPPREATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480
MEDKLPP+EATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL
Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480

Query: 481 SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540
SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY
Sbjct: 481 SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540

Query: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600
LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN
Sbjct: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600

Query: 601 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERSGDENSAEAVIHRAKMELGKIRDG 660
EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEER+GDENSAEAVIHRAKMELGKIRDG
Sbjct: 601 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDG 660

Query: 661 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL 720
FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL
Sbjct: 661 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL 720

Query: 721 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKVYVQADAKFRM 780
EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKK+YVQADAKFRM
Sbjct: 721 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRM 780

Query: 781 LPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPRTSSGDAM 840
LPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAP TSSGDAM
Sbjct: 781 LPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAM 840

Query: 841 ALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSV 900
ALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSV
Sbjct: 841 ALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSV 900

Query: 901 MLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVV 960
MLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVV
Sbjct: 901 MLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVV 960

Query: 961 EATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFF 1020
EATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFF
Sbjct: 961 EATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFF 1020

Query: 1021 VPVFFVVIRRCFKG 1034
VPVFFVVIRRCFKG
Sbjct: 1021 VPVFFVVIRRCFKG 1034


94  ECP_3412  ECP_3430  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
ECP_3412  0       20       -1.595125  general secretion pathway protein C
ECP_3413  -1      18       -0.579650  general secretion pathway protein D
ECP_3414  0       21       -0.205270  general secretion pathway protein E
ECP_3415  0       23       -0.842724  general secretion pathway protein F
ECP_3416  2       24       -1.529237  general secretion pathway protein G
ECP_3417  2       22       -1.780508  general secretion pathway protein H
ECP_3418  2       20       -1.741026  general secretion pathway protein I
ECP_3419  2       21       -1.556828  general secretion pathway protein J
ECP_3420  1       22       -1.536614  general secretion pathway protein K
ECP_3421  2       22       -2.221018  general secretion pathway protein L
ECP_3422  2       22       -2.037931  general secretion pathway protein M
ECP_3423  -1      19       -1.602481  transposase for insertion sequence IS100
ECP_3424  0       24       -1.417397  transposase/IS protein
ECP_3425  1       33       -1.224700  type 4 prepilin-like proteins leader peptide
ECP_3426  2       39       -1.265209  bacterioferritin
ECP_3427  3       43       -0.608523  bacterioferritin-associated ferredoxin
ECP_3428  3       43       -0.529288  bifunctional chitinase/lysozyme
ECP_3429  6       54       0.005639   elongation factor Tu
ECP_3430  4       44       -0.290872  elongation factor G
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_3412  BCTERIALGSPC  84  4e-21  Bacterial general secretion pathway protein C signa...
		>BCTERIALGSPC#Bacterial general secretion pathway protein C signature.

Length = 272

Score = 83.9 bits (207), Expect = 4e-21
Identities = 53/200 (26%), Positives = 94/200 (47%), Gaps = 15/200 (7%)

Query: 59 EFSLAALWRNENHAGVKDANPVAVNQETPKLSIALNGIVLTSNDETSFVLINEGNEQKRY 118
+F+L + +N AG DA N L+++L G++ +D S +I++ NEQ
Sbjct: 64 DFTLFGVSPEKNKAGALDA-SQMSNLPPSTLNLSLTGVMAGDDDSRSIAIISKDNEQFSR 122

Query: 119 SLNEALESAPGT--FIRKINKTSVVFETHGHYEKVTLH-------PGLP--DIIKQPDSE 167
+NE + PG I I VV + G YE + L+ G+P + +Q
Sbjct: 123 GVNEEV---PGYNAKIVSIRPDRVVLQYQGRYEVLGLYSQEDSGSDGVPGAQVNEQLQQR 179

Query: 168 NQNVLADYIIATPIRDGEQIYGLRLNPRKGLNAFTTSLLQPGDIALRINNLSLTHPDEVS 227
++DY+ +PI + ++ G RLNP ++F LQ D+A+ +N L L ++
Sbjct: 180 ASTTMSDYVSFSPIMNDNKLQGYRLNPGPKSDSFYRVGLQDNDMAVALNGLDLRDAEQAK 239

Query: 228 QALSLLLTQQSAQFTIRRNG 247
+A+ + + T+ R+G
Sbjct: 240 KAMERMADVHNFTLTVERDG 259


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_3413  BCTERIALGSPD  716  0.0  Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 716 bits (1850), Expect = 0.0
Identities = 344/629 (54%), Positives = 466/629 (74%), Gaps = 11/629 (1%)

Query: 11 ITCCLLAALLMPCAGHAENEQYGANFNNADIRQFVEIVGQHLGKTILIDPSVQGTISVRS 70
+T + AALL A E++ A+F DI++F+ V ++L KT++IDPSV+GTI+VRS
Sbjct: 12 LTLLIFAALLF---RPAAAEEFSASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTITVRS 68

Query: 71 NDTFSQQEYYQFFLSILDLYGYSVITLDNGFLKVVRSANVKTSPGMIADSSRPGVGDELV 130
D ++++YYQFFLS+LD+YG++VI ++NG LKVVRS + KT+ +A + PG+GDE+V
Sbjct: 69 YDMLNEEQYYQFFLSVLDVYGFAVINMNNGVLKVVRSKDAKTAAVPVASDAAPGIGDEVV 128

Query: 131 TRIVPLENVPARDLAPLLRQMMDAGSVGNVVHYEPSNVLILTGRASTINKLIEVIKRVDV 190
TR+VPL NV ARDLAPLLRQ+ D VG+VVHYEPSNVL++TGRA+ I +L+ +++RVD
Sbjct: 129 TRVVPLTNVAARDLAPLLRQLNDNAGVGSVVHYEPSNVLLMTGRAAVIKRLLTIVERVDN 188

Query: 191 IGTEKQQIIHLEYASAEDLAEILNQLISESHGKSQMPALLSAKIVADKRTNSLIISGPEK 250
G + L +ASA D+ +++ +L ++ KS +P + A +VAD+RTN++++SG
Sbjct: 189 AGDRSVVTVPLSWASAADVVKLVTEL-NKDTSKSALPGSMVANVVADERTNAVLVSGEPN 247

Query: 251 ARQRITSLLKSLDVEESEEGNTRVYYLKYAKATNLVEVLTGVSEKLKDEKGNSRKPSSTS 310
+RQRI +++K LD +++ +GNT+V YLKYAKA++LVEVLTG+S ++ EK ++ +
Sbjct: 248 SRQRIIAMIKQLDRQQATQGNTKVIYLKYAKASDLVEVLTGISSTMQSEKQAAK--PVAA 305

Query: 311 AMDNVAITADEQTNSLVITADQSVQEKLATVIARLDIRRAQVLVEAIIVEVQDGNGLNLG 370
N+ I A QTN+L++TA V L VIA+LDIRR QVLVEAII EVQD +GLNLG
Sbjct: 306 LDKNIIIKAHGQTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLG 365

Query: 371 VQWANKNVGAQQFTNTGLPVFNAAQGVADYKKNGGITSANPAWDMFSAYNGMAAGFFNGD 430
+QWANKN G QFTN+GLP+ A G Y K+G ++S+ S++NG+AAGF+ G+
Sbjct: 366 IQWANKNAGMTQFTNSGLPISTAIAGANQYNKDGTVSSSLA--SALSSFNGIAAGFYQGN 423

Query: 431 WGVLLTALASNNKNDILATPSIVTLDNKLASFNVGQDVPVLSGSQTTSGDNVFNTVERKT 490
W +LLTAL+S+ KNDILATPSIVTLDN A+FNVGQ+VPVL+GSQTTSGDN+FNTVERKT
Sbjct: 424 WAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVPVLTGSQTTSGDNIFNTVERKT 483

Query: 491 VGTKLKVTPQVNEGDAVLLEIEQEVSSVD---SSSNSTLGPTFNTRTIQNAVLVKTGETV 547
VG KLKV PQ+NEGD+VLLEIEQEVSSV SS++S LG TFNTRT+ NAVLV +GETV
Sbjct: 484 VGIKLKVKPQINEGDSVLLEIEQEVSSVADAASSTSSDLGATFNTRTVNNAVLVGSGETV 543

Query: 548 VLGGLLDDFSKEQVSKVPLLGDIPLVGQLFRYTSTERAKRNLMVFIRPTIIRDDDVYRSL 607
V+GGLLD + KVPLLGDIP++G LFR TS + +KRNLM+FIRPT+IRD D YR
Sbjct: 544 VVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRNLMLFIRPTVIRDRDEYRQA 603

Query: 608 SKEKYTRYRQEQQLRIDGKSKALVGSEDL 636
S +YT + Q + ++ + ++DL
Sbjct: 604 SSGQYTAFNDAQSKQRGKENNDAMLNQDL 632


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_3415  BCTERIALGSPF  512  0.0  Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 512 bits (1321), Expect = 0.0
Identities = 195/405 (48%), Positives = 283/405 (69%), Gaps = 8/405 (1%)

Query: 2 NYRYRAMTQDGQKLQGIIDANDERQARLRLREEGLFLLDIRPQK-------SSGVKTRRP 54
Y Y+A+ G+K +G +A+ RQAR LRE GL L + + S+G+ RR
Sbjct: 3 QYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLRRK 62

Query: 55 -RISHSELTLFTRQLATLSAAALPLEESLAVIGQQSSNNRLADVLNQVRSAILEGHPLSD 113
R+S S+L L TRQLATL AA++PLEE+L + +QS L+ ++ VRS ++EGH L+D
Sbjct: 63 IRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSLAD 122

Query: 114 ALQHFPTLFDSLYRTLVKAGEKSGLLAPVLEKLADYNENRQKIRSKLIQSLIYPCMLTTV 173
A++ FP F+ LY +V AGE SG L VL +LADY E RQ++RS++ Q++IYPC+LT V
Sbjct: 123 AMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLTVV 182

Query: 174 AIVVVIILLTAVVPKITEQFVHMKQQLPLSTRILLGLSDTLQRTGPTLLATVFIVAVGFW 233
AI VV ILL+ VVPK+ EQF+HMKQ LPLSTR+L+G+SD ++ GP +L + + F
Sbjct: 183 AIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMAFR 242

Query: 234 LWLKRGNNRHRFHAMLLRVALIGPLICAINSARYLRTLSILQSSGVPLLDGMNLSTESLN 293
+ L++ R FH LL + LIG + +N+ARY RTLSIL +S VPLL M +S + ++
Sbjct: 243 VMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDVMS 302

Query: 294 NLEIRQRLANAAENVRQGNSIHLSLEQTAIFPPMMLYMVASGEKSGQLGTLMVRAADNQE 353
N R RL+ A + VR+G S+H +LEQTA+FPPMM +M+ASGE+SG+L +++ RAADNQ+
Sbjct: 303 NDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADNQD 362

Query: 354 TLQQNRIALTLSIFEPALIITMALIVLFIVVSVLQPLLQLNSMIN 398
+++ L L +FEP L+++MA +VLFIV+++LQP+LQLN++++
Sbjct: 363 REFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLMS 407


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_3416  BCTERIALGSPG  249  1e-88  Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 249 bits (636), Expect = 1e-88
Identities = 144/145 (99%), Positives = 144/145 (99%)

Query: 1 MRATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYK 60
MRATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYK
Sbjct: 1 MRATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYK 60

Query: 61 LDNHRYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDL 120
LDNH YPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDL
Sbjct: 61 LDNHHYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDL 120

Query: 121 LSAGPDGEMGTEDDITNWGLSKKKK 145
LSAGPDGEMGTEDDITNWGLSKKKK
Sbjct: 121 LSAGPDGEMGTEDDITNWGLSKKKK 145


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_3417  BCTERIALGSPH  141  2e-45  Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 141 bits (357), Expect = 2e-45
Identities = 50/154 (32%), Positives = 76/154 (49%), Gaps = 18/154 (11%)

Query: 3 QQRGFTLLEMMLVLALVAITASVVLFTYGREDAASTRARETAARFTAALELAIDRATLSG 62
+QRGFTLLEMML+L L+ ++A +VL + + A +T ARF A L R +G
Sbjct: 2 RQRGFTLLEMMLILLLMGVSAGMVLLAFP--ASRDDSAAQTLARFEAQLRFVQQRGLQTG 59

Query: 63 QPVGIHFSDSAWRIMV----PGKTP-------SAWRWVPLQEDAADESKNDWGEELSIQL 111
Q G+ W+ +V G P S +RW+PL+ S + G +L++
Sbjct: 60 QFFGVSVHPDRWQFLVLEARDGADPAPADDGWSGYRWLPLRAGRVATSGSIAGGKLNLAF 119

Query: 112 ---QPFKPDDSNQPQVVILADGQITPFSLLMANA 142
+ + P D P V+I G++TPF L + A
Sbjct: 120 AQGEAWTPGD--NPDVLIFPGGEMTPFRLTLGEA 151


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
ECP_3418  BCTERIALGSPG  31  9e-04  Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 31.0 bits (70), Expect = 9e-04
Identities = 18/91 (19%), Positives = 42/91 (46%), Gaps = 4/91 (4%)

Query: 14 MNKQSGMTLLEVLLAMSIFTAVALTLMSSMQGQ--RTAIERMRNETLALWIADNQLQSQD 71
+KQ G TLLE+++ + I +A ++ ++ G + ++ ++ +AL A + + D
Sbjct: 4 TDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYK-LD 62

Query: 72 SFDEENTSSSGKELINGEELINGEEWNWRSD 102
+ T+ + L+ L N+ +
Sbjct: 63 NHHYPTTNQGLESLVEAPTL-PPLAANYNKE 92


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_3419  BCTERIALGSPH  34     2e-04    Bacterial general secretion pathway protein H signa...
>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 33.8 bits (77), Expect = 2e-04
Identities = 12/47 (25%), Positives = 25/47 (53%), Gaps = 2/47 (4%)

Query: 4 RQQGFTLLEVMAALAIFSMLSVLAFMIFSQASELHQRSQKEIQQFNQ 50
RQ+GFTLLE+M L + + + + + F + + + + + +F
Sbjct: 2 RQRGFTLLEMMLILLLMGVSAGMVLLAFPASRD--DSAAQTLARFEA 46


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_3423  HTHTETR       28     0.044    TetR bacterial regulatory protein HTH signature.
>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 28.1 bits (62), Expect = 0.044
Identities = 12/106 (11%), Positives = 31/106 (29%), Gaps = 14/106 (13%)

Query: 16 QGMSSRAIARELGISRNTVKRYLQAKSEPPKYTPRPAVASLLDEYRDYIRQRIAD----- 70
S IA+ G++R + + + KS+ + + + I + +
Sbjct: 30 SSTSLGEIAKAAGVTRGAIYWHFKDKSD--------LFSEIWELSESNIGELELEYQAKF 81

Query: 71 -AHPYKIPATVIAREIRDQGYRGGMTILRAFIRSLSVPQEQEPAVR 115
P + ++ + +L I + V+
Sbjct: 82 PGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQ 127


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_3425  PREPILNPTASE  156    3e-49    Type IV prepilin cysteine protease (C20) family sig...
>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 156 bits (397), Expect = 3e-49
Identities = 76/166 (45%), Positives = 98/166 (59%), Gaps = 2/166 (1%)

Query: 55 VPLILCVAAAIACALAPFTPIVTGALFLYFCFALTLSVIDFRTQLLPDKLTLPLLWLGLV 114
V L+ + + AL L + + L+ ID LLPD+LTLPLLW GL+
Sbjct: 113 VELLTALLSVAVAMTLAPGWGTLAALLLTWVL-VALTFIDLDKMLLPDQLTLPLLWGGLL 171

Query: 115 FNAQSGLIDLHDAVYGAVAGYGVLWCVYWGVWLVCHKEGLGYGDFKLLAAAGAWCGWQTL 174
FN G + L DAV GA+AGY VLW +YW L+ KEG+GYGDFKLLAA GAW GWQ L
Sbjct: 172 FNLLGGFVSLGDAVIGAMAGYLVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQAL 231

Query: 175 PMILLIASLGGIGYAIVSQLLQRRTITT-IAFGPWLALGSMINLGY 219
P++LL++SL G I LL+ + I FGP+LA+ I L +
Sbjct: 232 PIVLLLSSLVGAFMGIGLILLRNHHQSKPIPFGPYLAIAGWIALLW 277


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_3426  HELNAPAPROT   35     3e-05    Helicobacter neutrophil-activating protein A family ...
>HELNAPAPROT#Helicobacter neutrophil-activating protein A family signature.

Length = 153

Score = 35.2 bits (81), Expect = 3e-05
Identities = 28/150 (18%), Positives = 59/150 (39%), Gaps = 24/150 (16%)

Query: 5 TKVINYLNKLLGNE---LVAINQYFLHARMFKNWGLKRLNDVEYHESIDEM-----KHAD 56
T V N LN L N ++++ +W +K + HE +E+ + D
Sbjct: 11 TLVENSLNTQLSNWFLLYSKLHRF--------HWYVKGPHFFTLHEKFEELYDHAAETVD 62

Query: 57 RYIERILFLEGLPN--LQDLGKL------NIGEDVEEMLRSDLALELDGAKNLREAIGYA 108
ER+L + G P +++ + EM+++ + + + IG A
Sbjct: 63 TIAERLLAIGGQPVATVKEYTEHASITDGGNETSASEMVQALVNDYKQISSESKFVIGLA 122

Query: 109 DSVHDYVSRDMMIEILRDEEGHIDWLETEL 138
+ D + D+ + ++ + E + L + L
Sbjct: 123 EENQDNATADLFVGLIEEVEKQVWMLSSYL 152


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_3429  TCRTETOQM     80     3e-18    Tetracycline resistance protein TetO/TetQ/TetM family ...
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 79.5 bits (196), Expect = 3e-18
Identities = 57/198 (28%), Positives = 87/198 (43%), Gaps = 13/198 (6%)

Query: 13 VNVGTIGHVDHGKTTLTAAI------TTVLAKTYGGAARAFDQIDNAPEEKARGITINTS 66
+N+G + HVD GKTTLT ++ T L G R DN E+ RGITI T
Sbjct: 4 INIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRT----DNTLLERQRGITIQTG 59

Query: 67 HVEYDTPTRHYAHVDCPGHADYVKNMITGAAQMDGAILVVAATDGPMPQTREHILLGRQV 126
+ +D PGH D++ + + +DGAIL+++A DG QTR R++
Sbjct: 60 ITSFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKM 119

Query: 127 GVPYIIVFLNKCDMVDDEELLELVEMEVRELLSQYDFPGDDTPIVRGSALKALEGDAEWE 186
G+P I F+NK D + L V +++E LS + + +W+
Sbjct: 120 GIP-TIFFINKIDQNGID--LSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWD 176

Query: 187 AKILELAGFLDSYIPEPE 204
I L+ Y+
Sbjct: 177 TVIEGNDDLLEKYMSGKS 194


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_3430  TCRTETOQM     613    0.0      Tetracycline resistance protein TetO/TetQ/TetM family ...
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 613 bits (1583), Expect = 0.0
Identities = 178/698 (25%), Positives = 304/698 (43%), Gaps = 81/698 (11%)

Query: 9 RYRNIGISAHIDAGKTTTTERILFYTGVNHKIGEVHDGAATMDWMEQEQERGITITSAAT 68
+ NIG+ AH+DAGKTT TE +L+ +G ++G V G D E++RGITI + T
Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61

Query: 69 TAFWSGMAKQYEPHRINIIDTPGHVDFTIEVERSMRVLDGAVMVYCAVGGVQPQSETVWR 128
+ W ++NIIDTPGH+DF EV RS+ VLDGA+++ A GVQ Q+ ++
Sbjct: 62 SFQWEN-------TKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFH 114

Query: 129 QANKYKVPRIAFVNKMDRMGANFLKVVNQIKTRLGANPVPLQLAIGAEEHFTGVVDLVKM 188
K +P I F+NK+D+ G + V IK +L A V Q V M
Sbjct: 115 ALRKMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQ----------KVELYPNM 164

Query: 189 KAINWNDADQGVTFEYEDIPADMVELANEWHQNLIESAAEASEELMEKYLGGEELTEAEI 248
N+ +++Q ++ E +++L+EKY+ G+ L E+
Sbjct: 165 CVTNFTESEQ------------------------WDTVIEGNDDLLEKYMSGKSLEALEL 200

Query: 249 KGALRQRVLNNEIILVTCGSAFKNKGVQAMLDAVIDYLPSPVDVPAINGILDDGKDTPAE 308
+ R N + V GSA N G+ +++ + + S
Sbjct: 201 EQEESIRFHNCSLFPVYHGSAKNNIGIDNLIEVITNKFYSSTH----------------- 243

Query: 309 RHASDDEPFSALAFKIATDPFVGNLTFFRVYSGVVNSGDTVLNSVKAARERFGRIVQMHA 368
FKI L + R+YSGV++ D+V S K + + +
Sbjct: 244 ---RGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEK-EKIKITEMYTSIN 299

Query: 369 NKREEIKEVRAGDIAAAIG----LKDVTTGDTLCDPDAPIILERMEFPEPVISIAVEPKT 424
+ +I + +G+I L V GDT P ER+E P P++ VEP
Sbjct: 300 GELCKIDKAYSGEIVILQNEFLKLNSV-LGDTKLLPQR----ERIENPLPLLQTTVEPSK 354

Query: 425 KADQEKMGLALGRLAKEDPSFRVWTDEESNQTIIAGMGELHLDIIVDRMKREFNVEANVG 484
+E + AL ++ DP R + D +++ I++ +G++ +++ ++ +++VE +
Sbjct: 355 PQQREMLLDALLEISDSDPLLRYYVDSATHEIILSFLGKVQMEVTCALLQEKYHVEIEIK 414

Query: 485 KPQVAYRETIRQKVTDVEGKHAKQSGGRGQYGHVVIDMYPLEPGSNPKGYEFINDIKGGV 544
+P V Y E +K E + + + + + PL GS G ++ + + G
Sbjct: 415 EPTVIYMERPLKK---AEYTIHIEVPPNPFWASIGLSVSPLPLGS---GMQYESSVSLGY 468

Query: 545 IPGEYIPAVDKGIQEQLKAGPLAGYPVVDMGIRLHFGSYHDVDSSELAFKLAASIAFKEG 604
+ + AV +GI+ + G L G+ V D I +G Y+ S+ F++ A I ++
Sbjct: 469 LNQSFQNAVMEGIRYGCEQG-LYGWNVTDCKICFKYGLYYSPVSTPADFRMLAPIVLEQV 527

Query: 605 FKKAKPVLLEPIMKVEVETPEENTGDVIGDLSRRRGMLKGQESEVTGVKIHAEVPLSEMF 664
KKA LLEP + ++ P+E D + + + + V + E+P +
Sbjct: 528 LKKAGTELLEPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQLKNNEVILSGEIPARCIQ 587

Query: 665 GYATQLRSLTKGRASYTMEFLKYDEAPSNVAQAVIEAR 702
Y + L T GR+ E Y + V + R
Sbjct: 588 EYRSDLTFFTNGRSVCLTELKGYHVT---TGEPVCQPR 622


95  ECP_3436  ECP_3446  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
ECP_3436  2       19       1.278063    hypothetical protein
ECP_3437  1       18       2.291304    FKBP-type peptidylprolyl isomerase
ECP_3438  -1      20       3.105124    FKBP-type peptidylprolyl isomerase
ECP_3439  -1      19       2.824815    hypothetical protein
ECP_3442  -1      18       2.928015    glutathione-regulated potassium-efflux system
ECP_3443  -1      19       2.136141    ABC transporter ATP-binding protein
ECP_3444  -2      12       1.106447    hydrolase
ECP_3445  -2      12       1.213183    hypothetical protein
ECP_3446  -2      13       1.283732    phosphoribulokinase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_3436  ACRIFLAVINRP  29     0.021    Acriflavin resistance protein family signature.
>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 29.0 bits (65), Expect = 0.021
Identities = 14/62 (22%), Positives = 29/62 (46%), Gaps = 1/62 (1%)

Query: 160 ASSVEDLVTQTLEFTIEEVNADRNV-SNNAKNRQIVLNLYEKGIFDIKDAINQVADRLNI 218
A +V+D VTQ +E + ++ + S + + + L + D A QV ++L +
Sbjct: 54 AQTVQDTVTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQL 113

Query: 219 SK 220
+
Sbjct: 114 AT 115


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_3437  INFPOTNTIATR  134    9e-41    Macrophage infectivity potentiator signature.
>INFPOTNTIATR#Macrophage infectivity potentiator signature.

Length = 233

Score = 134 bits (339), Expect = 9e-41
Identities = 85/240 (35%), Positives = 131/240 (54%), Gaps = 12/240 (5%)

Query: 14 MAVVLHAPITFAAEAAKPATTADSKAAFKNDDQKSAYALGASLGRYMENSLKEQEKLGIK 73
M +V A + A A AT A S D K +Y++GA LG K + GI
Sbjct: 3 MKLVTAAIMGLAMSTAMAATDATS---LTTDKDKLSYSIGADLG-------KNFKNQGID 52

Query: 74 LDKDQLIAGVQDAFA-DKSKLSDQEIEQTLQAFEARVKSSAQAKMEKDAADNEAKGKEYR 132
++ D L G+QD + + L++++++ L F+ + + A+ K A +N+AKG +
Sbjct: 53 INPDVLAKGMQDGMSGAQLILTEEQMKDVLSKFQKDLMAKRSAEFNKKAEENKAKGDAFL 112

Query: 133 EKFAKEKGVKTSSTGLVYQVVEAGKGEAPKDSDTVVVNYKGTLIDGKEFDNSYTRGEPLS 192
+ G+ +GL Y++++AG G P SDTV V Y GTLIDG FD++ G+P +
Sbjct: 113 SANKSKPGIVVLPSGLQYKIIDAGTGAKPGKSDTVTVEYTGTLIDGTVFDSTEKAGKPAT 172

Query: 193 FRLDGVIPGWTEGLKNIKKGGKIKLVIPPELAYGKAGVPG-IPPNSTLVFDVELLDVKPA 251
F++ VIPGWTE L+ + G ++ +P +LAYG V G I PN TL+F + L+ VK A
Sbjct: 173 FQVSQVIPGWTEALQLMPAGSTWEVFVPADLAYGPRSVGGPIGPNETLIFKIHLISVKKA 232


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_3442  ISCHRISMTASE  30     0.006    Isochorismatase signature.
>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 29.6 bits (66), Expect = 0.006
Identities = 32/135 (23%), Positives = 50/135 (37%), Gaps = 16/135 (11%)

Query: 11 YAHPESQDSVANWVLLKPATQLSNVTVHDLYAHYPDFFIDIPREQALLREHEVIVFQH-- 68
Y P + D N V P + + +HD+ ++ D F L + +
Sbjct: 9 YQMPTASDMPQNKVSWVPDPNRAVLLIHDMQNYFVDAFTAGASPVTELSANIRKLKNQCV 68

Query: 69 ----PLYTYSCPALLKEWLDRVLSRGFASGPGGNQLAGKYWRSVITTGEPESA------Y 118
P+ + P DR L F GPG N +G Y +IT PE +
Sbjct: 69 QLGIPVVYTAQPGSQNP-DDRALLTDFW-GPGLN--SGPYEEKIITELAPEDDDLVLTKW 124

Query: 119 RYDALNRYPMSDVLR 133
RY A R + +++R
Sbjct: 125 RYSAFKRTNLLEMMR 139


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_3443  GPOSANCHOR    33     0.005    Gram-positive coccus surface protein anchor signature.
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 32.7 bits (74), Expect = 0.005
Identities = 28/152 (18%), Positives = 54/152 (35%), Gaps = 22/152 (14%)

Query: 504 KVEPFDGDLEDYQQWLSDVQKQENQTDEAPKENANSAQARKDQKRREAELRAQTQPLRKE 563
+ D + ++ E + + ++ R+ +R R + L E
Sbjct: 272 AMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAE 331

Query: 564 IARLEKEME---------------------KLNAQLAQAEEKLGDSELYDQSRKAELTAC 602
+LE++ + +L A+ + EE+ SE QS + +L A
Sbjct: 332 HQKLEEQNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDAS 391

Query: 603 LQQQASAKSGLEECEMAWLEAQEQLEQMLLEG 634
+ + + LEE L A E+L + L E
Sbjct: 392 REAKKQVEKALEEANSK-LAALEKLNKELEES 422



Score = 32.0 bits (72), Expect = 0.008
Identities = 13/125 (10%), Positives = 39/125 (31%), Gaps = 7/125 (5%)

Query: 513 EDYQQWLSDVQKQENQTDEAPKENANSAQARKDQKRREAELRAQTQPLRKEIARLEKEME 572
+ + ++ + E A A + D ++ + +++
Sbjct: 127 KALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFST-------ADSAKIK 179

Query: 573 KLNAQLAQAEEKLGDSELYDQSRKAELTACLQQQASAKSGLEECEMAWLEAQEQLEQMLL 632
L A+ A E + + E + TA + + ++ + ++ LE +
Sbjct: 180 TLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMN 239

Query: 633 EGQSN 637
++
Sbjct: 240 FSTAD 244


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_3446  PF07299       32     0.002    Fibronectin-binding protein (FBP)
>PF07299#Fibronectin-binding protein (FBP)

Length = 219

Score = 31.8 bits (72), Expect = 0.002
Identities = 10/46 (21%), Positives = 21/46 (45%), Gaps = 2/46 (4%)

Query: 71 PEANDFGLLEQTFIEYGQSGKGKSRKYLHTYDEAVPWNQVPGTFTP 116
P+ + + E ++ KG SRK++ ++ + + GTF
Sbjct: 112 PDMEELDMKELSY--LSWIDKGSSRKFIIAKNDKNKFVGLQGTFQS 155


96  ECP_3572  ECP_3582  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
ECP_3572  2       26       5.903976    nickel transporter permease NikB
ECP_3573  0       24       5.567922    nickel transporter permease NikC
ECP_3574  0       20       5.385698    nickel transporter ATP-binding protein NikD
ECP_3575  0       19       4.416708    nickel transporter ATP-binding protein NikE
ECP_3576  -1      12       2.207678    nickel responsive regulator
ECP_3577  -2      11       2.336349    hypothetical protein
ECP_3578  -3      9        2.195199    ABC transporter ATP-binding protein
ECP_3579  -1      11       0.293279    hypothetical protein
ECP_3580  0       11       -1.358368   hypothetical protein
ECP_3581  0       10       -0.581894   hypothetical protein
ECP_3582  -1      14       1.886043    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_3572  BORPETOXINB   28     0.046    Bordetella pertussis toxin B subunit signature.
>BORPETOXINB#Bordetella pertussis toxin B subunit signature.

Length = 226

Score = 28.1 bits (62), Expect = 0.046
Identities = 21/77 (27%), Positives = 32/77 (41%), Gaps = 10/77 (12%)

Query: 204 GQRHVTWARLRGLSDKQTERRHILRNASLPMITAVGMHIGELIGGTMIIENIFAWPGVG- 262
R +T A LRG D Q RH+ R S+ + G ++G GG +I++ PG
Sbjct: 53 KTRALTVAELRGSGDLQEYLRHVTRGWSIFALYD-GTYLGGEYGG--VIKD--GTPGGAF 107

Query: 263 ----RYAVSAIFNRDYP 275
+ + N P
Sbjct: 108 DLKTTFCIMTTRNTGQP 124


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_3575  HTHFIS        30     0.008    FIS bacterial regulatory protein HTH signature.
>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 30.2 bits (68), Expect = 0.008
Identities = 10/34 (29%), Positives = 19/34 (55%)

Query: 25 QAVLNNVSLALKSGETVALLGRSGCGKSTLARLL 58
Q + ++ +++ T+ + G SG GK +AR L
Sbjct: 147 QEIYRVLARLMQTDLTLMITGESGTGKELVARAL 180


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_3577  ABC2TRNSPORT  48     2e-08    ABC-2 type transport system membrane protein signat...
>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 48.0 bits (114), Expect = 2e-08
Identities = 38/171 (22%), Positives = 70/171 (40%), Gaps = 6/171 (3%)

Query: 200 REREHGTVEHLLVMPITPFEIMMAKV-WSMGLVVLVVSGLSLVLMVKGVLGVPIEGSIPL 258
R T E +L + +I++ ++ W+ L +G+ +V G + L
Sbjct: 93 RMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGAGIGVVAAALGY----TQWLSLL 148

Query: 259 FMLGV-ALSLFATTSIGIFMGTIARSMPQLGLLVILVLLPLQMLSGGSTPRESMPQMVQD 317
+ L V AL+ A S+G+ + +A S LV+ P+ LSG P + +P + Q
Sbjct: 149 YALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPIVFQT 208

Query: 318 IMLTMPTTHFVSLAQAILYRGAGFEIVWPQFLTLMAIGGAFFTIAQLRFRK 368
+P +H + L + I+ ++ + I FF L R+
Sbjct: 209 AARFLPLSHSIDLIRPIMLGHPVVDVCQHVGALCIYIVIPFFLSTALLRRR 259


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_3578  PF05272       30     0.045    Virulence-associated E family protein
>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.4 bits (68), Expect = 0.045
Identities = 9/26 (34%), Positives = 14/26 (53%)

Query: 37 ARCMVGLIGPDGVGKSSLLSLISGAR 62
V L G G+GKS+L++ + G
Sbjct: 595 FDYSVVLEGTGGIGKSTLINTLVGLD 620


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_3579  RTXTOXIND     82     3e-19    Gram-negative bacterial RTX secretion protein D signat...
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 81.8 bits (202), Expect = 3e-19
Identities = 72/408 (17%), Positives = 139/408 (34%), Gaps = 81/408 (19%)

Query: 6 RHLAWWGVGALAVAAVVAWLLLRPAGVP-EGFAVSNGRIEATEVDIASKIAGRIDTILVK 64
R +A++ +G L +A +++ L G +GR + I + I+VK
Sbjct: 58 RLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKE----IKPIENSIVKEIIVK 113

Query: 65 EGQFVREGEVLAKMDTRV----------------LQEQRLEAI----------------- 91
EG+ VR+G+VL K+ L++ R + +
Sbjct: 114 EGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDE 173

Query: 92 -------------------AQIKEAQSAVAAAQALLEQRQSETRAAQSLVNQRQAELDSV 132
Q Q+ + L+++++E + +N+ +
Sbjct: 174 PYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVE 233

Query: 133 AKRHTRSRSLAQRGAISAQQLDDDRAAAESARAALESAKAQVSASKAAIEAARTNIIQ-- 190
R SL + AI+ + + A L K+Q+ ++ I +A+
Sbjct: 234 KSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVT 293

Query: 191 -----------AQTRVEAAQATERRIAADID--DSELKAPRDGRV-QYRVAEPGEVLAAG 236
QT T + S ++AP +V Q +V G V+
Sbjct: 294 QLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTA 353

Query: 237 GRVLNMVDLSDVY-MTFFLPTEQAGTLKLGGEARLILDAAPDLRIPATISFVASVAQFTP 295
++ +V D +T + + G + +G A + ++A P R V V
Sbjct: 354 ETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYG---YLVGKVKNINL 410

Query: 296 KTVETSDERLKLMFRVKARIPPELLQQHLEYV--KTGLPGVAWVRVNE 341
+E D+RL L+F V I L + + +G+ A ++
Sbjct: 411 DAIE--DQRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGM 456


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_3582  ALARACEMASE   29     0.033    Alanine racemase signature.
>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 29.0 bits (65), Expect = 0.033
Identities = 23/98 (23%), Positives = 38/98 (38%), Gaps = 18/98 (18%)

Query: 226 ENLLFTHRGLSGPAVLQISSYWQPGEFVSINLLPDVDLETFL--NEQRNAHPNQSLKNTL 283
E + RG GP +L + ++ + + + L T + N Q A N LK L
Sbjct: 63 EAITLRERGWKGP-ILMLEGFFHAQD---LEIYDQHRLTTCVHSNWQLKALQNARLKAPL 118

Query: 284 AVHL------------PKRLVERLQQLGQIPDVSLKQL 309
++L P R++ QQL + +V L
Sbjct: 119 DIYLKVNSGMNRLGFQPDRVLTVWQQLRAMANVGEMTL 156


97  ECP_3824  ECP_3829  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
ECP_3824  6       37       -10.150759  periplasmic binding protein
ECP_3825  6       41       -10.819283  hypothetical protein
ECP_3826  6       38       -9.423150   hemolysin-activating lysine-acyltransferase
ECP_3827  6       36       -8.517628   hemolysin A
ECP_3828  6       34       -7.646229   alpha-hemolysin translocation ATP-binding
ECP_3829  5       30       -4.733825   hemolysin D
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_3824  adhesinb      237    3e-79    Adhesin B signature.
>adhesinb#Adhesin B signature.

Length = 310

Score = 237 bits (605), Expect = 3e-79
Identities = 87/294 (29%), Positives = 157/294 (53%), Gaps = 7/294 (2%)

Query: 5 ILVVALSSLLVSPLVIAKELNVVASFSVLGDMVSQIGGPYVHVTDLVQPDGDPHEFEPSP 64
+ + A SS S + +LNVVA+ S++ D+ I G +++ +V DPHE+EP P
Sbjct: 15 VGLAACSSQKSSTETGSSKLNVVATNSIIADITKNIAGDKINLHSIVPVGQDPHEYEPLP 74

Query: 65 KDSKTLAQADVVFVNGLGLE----GWLDRLMKASGYRGE--VITASNGIDTLKMKEDGTT 118
+D K +QAD++F NG+ LE W +L++ + + S G+D + ++
Sbjct: 75 EDVKKTSQADLIFYNGINLETGGNAWFTKLVENAKKKENKDYYAVSEGVDVIYLEGQSEK 134

Query: 119 IT-DPHAWNSMKNGIVYAHNIVNGLSKADPEHASDYRKQGDSYIQQLQQLDNYATQTFAA 177
DPHAW +++NGI+YA NI LS+ DP + Y K +Y+++L LD A + F
Sbjct: 135 GKEDPHAWLNLENGIIYAQNIAKRLSEKDPANKETYEKNLKAYVEKLSALDKEAKEKFNN 194

Query: 178 IPREKRKVLTSHDAFGYFAAAYGVRFLSPVGYSTESEASSKNVAKLINQIKREHVKLYFI 237
IP EK+ ++TS F YF+ AY V +TE E + + L+ ++++ V F+
Sbjct: 195 IPGEKKMIVTSEGCFKYFSKAYNVPSAYIWEINTEEEGTPDQIKTLVEKLRKTKVPSLFV 254

Query: 238 ENQTDPRLVKQIANASGAQAGGELYPEALTDSSGLAATYTAAFKHNVDTLAAGM 291
E+ D R +K ++ + +++ +++ + +Y + K+N++ +A G+
Sbjct: 255 ESSVDDRPMKTVSKDTNIPIYAKIFTDSVAEKGEEGDSYYSMMKYNLEKIAEGL 308


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_3826  RTXTOXINC     316    e-114    Gram-negative bacterial RTX toxin-activating protein C...
>RTXTOXINC#Gram-negative bacterial RTX toxin-activating protein C signature.

Length = 170

Score = 316 bits (811), Expect = e-114
Identities = 163/170 (95%), Positives = 166/170 (97%)

Query: 1 MNRNNPLEVLGHVSWLWASSPLHRNWPVSLFAINVLPAIRANQYALLTRDNYPVAYCSWA 60
MN N PLE+LGHVSWLWASSPLHRNWPVSLFAINVLPAI+ANQY LLTRD+YPVAYCSWA
Sbjct: 1 MNINKPLEILGHVSWLWASSPLHRNWPVSLFAINVLPAIQANQYVLLTRDDYPVAYCSWA 60

Query: 61 NLSLENEIKYLNDVTSLVAEDWTSGDRKWFIDWIAPFGDNGALYKYMRKKFPDELFRAIR 120
NLSLENEIKYLNDVTSLVAEDWTSGDRKWFIDWIAPFGDNGALYKYMRKKFPDELFRAIR
Sbjct: 61 NLSLENEIKYLNDVTSLVAEDWTSGDRKWFIDWIAPFGDNGALYKYMRKKFPDELFRAIR 120

Query: 121 VDPKTHVGKVSEFHGGKIDKQLANKIFKQYHHELITEVKNKSDFNFSLTG 170
VDPKTHVGKVSEFHGGKIDKQLANKIFKQYHHELITEVK KSDFNFSLTG
Sbjct: 121 VDPKTHVGKVSEFHGGKIDKQLANKIFKQYHHELITEVKRKSDFNFSLTG 170
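
The statistics line of each hit (for example "Score = 316 bits (811), Expect = e-114" above) can be read programmatically and compared with the E-value (RPSBlast) threshold of 0.05 listed under the input parameters. The following is a minimal Python sketch, not part of PredictBias itself; the names `parse_stat_line` and `passes_threshold` are hypothetical. It assumes the line layout shown on this page and treats an abbreviated value such as "e-114" as 1e-114.

```python
import re

# Matches a per-hit statistics line as printed on this page,
# e.g. "Score = 316 bits (811), Expect = e-114".
STAT_LINE = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)"
)


def parse_stat_line(line):
    """Return (bit_score, raw_score, e_value), or None if the line does not match."""
    match = STAT_LINE.search(line)
    if match is None:
        return None
    bits, raw, expect = match.groups()
    # Very small E-values are printed without a mantissa ("e-114");
    # restore the leading 1 so float() can read the value.
    if expect.startswith("e"):
        expect = "1" + expect
    return float(bits), int(raw), float(expect)


def passes_threshold(line, max_e_value=0.05):
    """True if the line parses and its E-value is at or below the threshold."""
    parsed = parse_stat_line(line)
    return parsed is not None and parsed[2] <= max_e_value
```

Calling `passes_threshold` on each "Score = ..." line of a saved report would keep only the hits at or below the chosen cutoff; the default of 0.05 mirrors the input parameter shown at the top of this page.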


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_3827  RTXTOXINA     1477   0.0      Gram-negative bacterial RTX toxin determinant A family...
>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 1477 bits (3824), Expect = 0.0
Identities = 978/1024 (95%), Positives = 992/1024 (96%)

Query: 1 MPTITTAQIKSTLQSAKQSAANKLHSAGQSTKDALKKAAEQTRNAGNRLILLIPKDYKGQ 60
M TITTAQIKSTLQSAKQSAANKLHSAGQSTKDALKKAAEQTRNAGNRLILLIPKDYKGQ
Sbjct: 1 MTTITTAQIKSTLQSAKQSAANKLHSAGQSTKDALKKAAEQTRNAGNRLILLIPKDYKGQ 60

Query: 61 GSSLNDLVRTADELGIEVQYDEKNGTAITKQVFGTAEKLIGLTERGVTIFAPQLDKLLQK 120
GSSLNDLVRTADELGIEVQYDEKNGTAITKQVFGTAEKLIGLTERGVTIFAPQLDKLLQK
Sbjct: 61 GSSLNDLVRTADELGIEVQYDEKNGTAITKQVFGTAEKLIGLTERGVTIFAPQLDKLLQK 120

Query: 121 YQKAGNKLGGSAENIGDNLGKAGSVLSTFQNFLGTALSSMKIDELIKKQKSGSNVSSSEL 180
YQKAGN LGG AENIGDNLGKAG +LSTFQNFLGTALSSMKIDELIKKQKSG NVSSSEL
Sbjct: 121 YQKAGNILGGGAENIGDNLGKAGGILSTFQNFLGTALSSMKIDELIKKQKSGGNVSSSEL 180

Query: 181 AKASIELINQLVDTAASINNNVNSFSQQLNKLGSVLSNTKHLNGVGNKLQNLPNLDNIGA 240
AKASIELINQLVDT AS+NNNVNSFSQQLN LGSVLSNTKHLNGVGNKLQNLPNLDNIGA
Sbjct: 181 AKASIELINQLVDTVASLNNNVNSFSQQLNTLGSVLSNTKHLNGVGNKLQNLPNLDNIGA 240

Query: 241 GLDTVSGILSVISASFILSNADADTGTKAAAGVELTTKVLGNVGKGISQYIIAQRAAQGL 300
GLDTVSGILS ISASFILSNADADT TKAAAGVELTTKVLGNVGKGISQYIIAQRAAQGL
Sbjct: 241 GLDTVSGILSAISASFILSNADADTRTKAAAGVELTTKVLGNVGKGISQYIIAQRAAQGL 300

Query: 301 STSAAAAGLIASAVTLAISPLSFLSIADKFKRANKIEEYSQRFKKLGYDGDSLLAAFHKE 360
STSAAAAGLIASAVTLAISPLSFLSIADKFKRANKIEEYSQRFKKLGYDGDSLLAAFHKE
Sbjct: 301 STSAAAAGLIASAVTLAISPLSFLSIADKFKRANKIEEYSQRFKKLGYDGDSLLAAFHKE 360

Query: 361 TGAIDASLTTISTVLASVSSGISAAATTSLVGAPVSALVGAVTGIISGILEASKQAMFEH 420
TGAIDASLTTISTVLASVSSGISAAATTSLVGAPVSALVGAVTGIISGILEASKQAMFEH
Sbjct: 361 TGAIDASLTTISTVLASVSSGISAAATTSLVGAPVSALVGAVTGIISGILEASKQAMFEH 420

Query: 421 VASKMADVIAEWEKKHGKNYFENGYDARHAAFLEDNFKILSQYNKEYSVERSVLITQQHW 480
VASKMADVIAEWEKKHGKNYFENGYDARHAAFLEDNFKILSQYNKEYSVERSVLITQQHW
Sbjct: 421 VASKMADVIAEWEKKHGKNYFENGYDARHAAFLEDNFKILSQYNKEYSVERSVLITQQHW 480

Query: 481 DTLIGELAGVTRNGDKTLSGKSYIDYYEEGKRLEKKPDEFQKQVFDPLKGNIDLSDSKSS 540
DTLIGELAGVTRNGDKTLSGKSYIDYYEEGKRLEKK DEFQKQVFDPLKGNIDLSDSKSS
Sbjct: 481 DTLIGELAGVTRNGDKTLSGKSYIDYYEEGKRLEKKXDEFQKQVFDPLKGNIDLSDSKSS 540

Query: 541 TLLKFVTPLLTPGEEIRERRQSGKYEYITELLVKGVDKWTVKGVQDKGSVYDYSNLIQHA 600
TLLKFVTPLLTPGEEIRERRQSGKYEYITELLVKGVDKWTVKGVQDKG+VYDYSNLIQHA
Sbjct: 541 TLLKFVTPLLTPGEEIRERRQSGKYEYITELLVKGVDKWTVKGVQDKGAVYDYSNLIQHA 600

Query: 601 SVGNNQYREIRIESHLGDGDDKVFLAAGSANIYAGKGHDVVYYDKTDTGYLTIDGTKATE 660
SVGNNQYREIRIESHLGDGDDKVFL+AGSANIYAGKGHDVVYYDKTDTGYLTIDGTKATE
Sbjct: 601 SVGNNQYREIRIESHLGDGDDKVFLSAGSANIYAGKGHDVVYYDKTDTGYLTIDGTKATE 660

Query: 661 AGNYTVTRVLGGDVKVLQEVVKEQEVSVGKRTEKTQYRSYEFTHINGTDLTETDNLYSVE 720
AGNYTVTRVLGGDVKVLQEVVKEQEVSVGKRTEKTQYRSYEFTHING +LTETDNLYSVE
Sbjct: 661 AGNYTVTRVLGGDVKVLQEVVKEQEVSVGKRTEKTQYRSYEFTHINGKNLTETDNLYSVE 720

Query: 721 ELIGTNRADKFFGSKFTDIFHGADGDDHIEGNDGNDRLYGDKGNDTLRGGNGDDQLYGGD 780
ELIGT RADKFFGSKFTDIFHGADGDD IEGNDGNDRLYGDKGNDTL GGNGDDQLYGGD
Sbjct: 721 ELIGTTRADKFFGSKFTDIFHGADGDDLIEGNDGNDRLYGDKGNDTLSGGNGDDQLYGGD 780

Query: 781 GNDKLTGGVGNNYLNGGDGDDELQVQGNSLAKNVLSGGKGNDKLYGSEGADLLDGGEGND 840
GNDKL G GNNYLNGGDGDDE QVQGNSLAKNVL GGKGNDKLYGSEGADLLDGGEG+D
Sbjct: 781 GNDKLIGVAGNNYLNGGDGDDEFQVQGNSLAKNVLFGGKGNDKLYGSEGADLLDGGEGDD 840

Query: 841 LLKGGYGNDIYRYLSGYGHHIIDDDGGKDDKLSLADIDFRDVAFKREGNDLIMYKAEGNV 900
LLKGGYGNDIYRYLSGYGHHIIDDDGGK+DKLSLADIDFRDVAFKREGNDLIMYK EGNV
Sbjct: 841 LLKGGYGNDIYRYLSGYGHHIIDDDGGKEDKLSLADIDFRDVAFKREGNDLIMYKGEGNV 900

Query: 901 LSIGHKNGITFRNWFEKESGDISNHQIEQIFDKDGRVITPDSLKKAFEYQQSNNQANYVY 960
LSIGHKNGITFRNWFEKESGDISNH+IEQIFDK GR+ITPDSLKKA EYQQ NN+A+YVY
Sbjct: 901 LSIGHKNGITFRNWFEKESGDISNHEIEQIFDKSGRIITPDSLKKALEYQQRNNKASYVY 960

Query: 961 GEYASTYADLDNLNPLINEISKIISAAGNFDVKEERSAASLLQLSGNASDFSYGRNSITL 1020
G A Y +LNPLINEISKIISAAG+FDVKEER+AASLLQLSGNASDFSYGRNSITL
Sbjct: 961 GNDALAYGSQGDLNPLINEISKIISAAGSFDVKEERTAASLLQLSGNASDFSYGRNSITL 1020

Query: 1021 TASA 1024
T SA
Sbjct: 1021 TTSA 1024


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_3829  RTXTOXIND     601    0.0      Gram-negative bacterial RTX secretion protein D signat...
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 601 bits (1551), Expect = 0.0
Identities = 462/478 (96%), Positives = 468/478 (97%)

Query: 1 MKTWLMGFSEFLLRYKLVWSETWKIRKQLDTPVREKDENEFLPAHLELIETPVSRRPRLV 60
MKTWLMGFSEFLLRYKLVWSETWKIRKQLDTPVREKDENEFLPAHLELIETPVSRRPRLV
Sbjct: 1 MKTWLMGFSEFLLRYKLVWSETWKIRKQLDTPVREKDENEFLPAHLELIETPVSRRPRLV 60

Query: 61 AYFIMGFLVIAFILSVLGQVEIVATANGKLTLSGRSKEIKPIENSIVKEIIVKEGESVRK 120
AYFIMGFLVIAFILSVLGQVEIVATANGKLT SGRSKEIKPIENSIVKEIIVKEGESVRK
Sbjct: 61 AYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRK 120

Query: 121 GDVLLKLTALGAEADTLKTQSSLLQTRLEQIRYQILSRSIELNKLPELKLPDEPYFQNVS 180
GDVLLKLTALGAEADTLKTQSSLLQ RLEQ RYQILSRSIELNKLPELKLPDEPYFQNVS
Sbjct: 121 GDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVS 180

Query: 181 EEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTILARINRYENLSRVEKSRLDDF 240
EEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLT+LARINRYENLSRVEKSRLDDF
Sbjct: 181 EEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDF 240

Query: 241 RSLLHKQAIAKHAVLEQENKYVEAANELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEI 300
SLLHKQAIAKHAVLEQENKYVEA NELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEI
Sbjct: 241 SSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEI 300

Query: 301 LDKLRQTTDSIELLTLELEKNEERQQASVIRAPVSGKVQQLKVHTEGGVVTTAETLMVIV 360
LDKLRQTTD+I LLTLEL KNEERQQASVIRAPVS KVQQLKVHTEGGVVTTAETLMVIV
Sbjct: 301 LDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIV 360

Query: 361 PEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQKLGL 420
PEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQ+LGL
Sbjct: 361 PEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGL 420

Query: 421 VFNVIVSVEENDLSTGNKHIPLSSGMAVTAEIKTGMRSVISYLLSPLEESVTESLHER 478
VFNVI+S+EEN LSTGNK+IPLSSGMAVTAEIKTGMRSVISYLLSPLEESVTESL ER
Sbjct: 421 VFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGMRSVISYLLSPLEESVTESLRER 478


98  ECP_3868  ECP_3879  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
ECP_3868  -1      11       1.081077    ribonucleoside transporter
ECP_3869  -2      13       1.778140    hypothetical protein
ECP_3870  -1      13       2.226954    xanthine/uracil permease
ECP_3871  -1      11       0.831628    cryptic adenine deaminase
ECP_3872  -1      13       -0.015448   sugar phosphate antiporter
ECP_3873  0       13       0.960486    regulatory protein UhpC
ECP_3874  0       14       1.180397    sensory histidine kinase UhpB
ECP_3875  0       14       -0.071569   DNA-binding transcriptional activator UhpA
ECP_3876  0       13       -0.966547   hypothetical protein
ECP_3877  1       16       2.852271    acetolactate synthase 1 regulatory subunit
ECP_3878  0       15       2.390398    acetolactate synthase catalytic subunit
ECP_3879  -1      17       1.428228    multidrug resistance protein D
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_3868  TCRTETA       39     3e-05    Tetracycline resistance protein signature.
>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 38.7 bits (90), Expect = 3e-05
Identities = 34/208 (16%), Positives = 72/208 (34%), Gaps = 13/208 (6%)

Query: 33 IIVEFLPVSLLTP----MAQDLGISEGVA---GQSVTVTAFVAMFASLFITQTIQATDRR 85
+ ++ + + L+ P + +DL S V G + + A + + + RR
Sbjct: 14 VALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRR 73

Query: 86 YVVILFAVLLTLSCLLVSFANSFSLLLIGRACLGVALGGFWAISASLTMRLVPPRTVPKA 145
V+++ + +++ A +L IGR G+ G A++ + + +
Sbjct: 74 PVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGIT-GATGAVAGAYIADITDGDERARH 132

Query: 146 LSVIFGAVSIALVIAAPLGSFLGELIGWRNVFNAAAAMG----VLCIFWIIKSLPSLPGE 201
+ +V LG +G F AAAA+ + F + +S
Sbjct: 133 FGFMSACFGFGMVAGPVLGGLMGG-FSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRP 191

Query: 202 PSHQKQNTFRLLQRPGVMAGMIAIFMSF 229
+ N + M + A+ F
Sbjct: 192 LRREALNPLASFRWARGMTVVAALMAVF 219


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_3871  UREASE        38     1e-04    Urea amidohydrolase (urease) protein signature.
>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 37.8 bits (88), Expect = 1e-04
Identities = 30/105 (28%), Positives = 43/105 (40%), Gaps = 17/105 (16%)

Query: 22 AVSRGDAVADYIIDNVSILDLINGGEISGPIVIKGRYIAGVG-AEYAD---------APA 71
V+R D +I N ILD + G + I +K IA +G A D P
Sbjct: 60 QVTREGGAVDTVITNALILD--HWGIVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPG 117

Query: 72 LQRIDARGATAVPGFIDAHLHIESSMMTPVTFETATLPRGLTTVI 116
+ I G G +D+H+H + P E A L GLT ++
Sbjct: 118 TEVIAGEGKIVTAGGMDSHIH----FICPQQIEEA-LMSGLTCML 157


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_3872  TCRTETB       34     0.001    Tetracycline resistance protein TetB signature.
>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 34.1 bits (78), Expect = 0.001
Identities = 28/168 (16%), Positives = 61/168 (36%), Gaps = 17/168 (10%)

Query: 49 FNIAQNDMISTYGLSMTQLGMIGLGFSITYGVGKTLVSYYADGKNTKQFLPFMLILSAIC 108
N++ D+ + + + F +T+ +G + +D K+ L F +I++ C
Sbjct: 33 LNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIIN--C 90

Query: 109 MLGFSASMGSGSVSLFLMIAFYALSGFFQSTGGSCSYSTI----TKWTPRRKRGTFLGFW 164
+G SL +M + F Q G + + + ++ P+ RG G
Sbjct: 91 FGSVIGFVGHSFFSLLIM------ARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLI 144

Query: 165 NISHNLGGAGAAGVALFGANYLFDGHVIGMFIFPSIIALIVGFIGLRY 212
+G + A+Y+ + + + P + I+ L
Sbjct: 145 GSIVAMGEGVGPAIGGMIAHYIHWSY---LLLIP--MITIITVPFLMK 187


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_3873  TCRTETB       40     1e-05    Tetracycline resistance protein TetB signature.
>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 40.2 bits (94), Expect = 1e-05
Identities = 65/408 (15%), Positives = 137/408 (33%), Gaps = 60/408 (14%)

Query: 29 RHILLTIWLGYALFY--FTRKSFNAAVPEILANGVLSRSDIGLLATLFYITYGVSKFVSG 86
RH + IWL F+ N ++P+I + + + T F +T+ + V G
Sbjct: 11 RHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYG 70

Query: 87 IVSDRSNARYFMGIGLIATGIINILFGFSTSLWAFAVLWVLNAFFQGWGS---PVCARLL 143
+SD+ + + G+I +++ S F L ++ F QG G+ P ++
Sbjct: 71 KLSDQLGIKRLLLFGIIINCFGSVIGFVGHS---FFSLLIMARFIQGAGAAAFPALVMVV 127

Query: 144 TAWY-SRTERGGWWALWNTAHNVGGALIPIVMAASALHYGWRAGMMIAGCMAIVVGIFLC 202
A Y + RG + L + +G + P + A + W ++ M ++ +
Sbjct: 128 VARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHW--SYLLLIPMITIITVPF- 184

Query: 203 WRLRDRPQALGLPAVGEWRHDALEIAQQQEGAGLTRKEILTKYVLLNPYIWLLSFCYVLV 262
+ L +I G L I+ + Y VL
Sbjct: 185 --------LMKLLKKEVRIKGHFDIK----GIILMSVGIVFFMLFTTSYSISFLIVSVLS 232

Query: 263 YVV-----RAAINDWGNLYMSETLGVDLVTANTAVTMFELGGFI-----------GALVA 306
+++ R + + + + + + + + + GF+ A
Sbjct: 233 FLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTA 292

Query: 307 GWGSDKLFNGNRGPMNLIFAAGILL-SVGSLWLMPFASYVMQATCFFTIGFFVFGPQMLI 365
GS +F G + + GIL+ G L+++ + + F T F + +
Sbjct: 293 EIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFL-SVSFLTASFLLETTSWFM 351

Query: 366 ---------GMAAAECS---------HKEAAGAATGFVGLFAYLGASL 395
G++ + ++ AGA + ++L
Sbjct: 352 TIIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGT 399


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_3874  PF06580       39     2e-05    Sensor histidine kinase
>PF06580#Sensor histidine kinase

Length = 349

Score = 39.5 bits (92), Expect = 2e-05
Identities = 28/142 (19%), Positives = 57/142 (40%), Gaps = 11/142 (7%)

Query: 365 LRPRQLDDLTLEQAIRSLMREMELEGRGIVSHLEWRIDESALSENQRVTLFRVCQEGLNN 424
LR ++L + + ++L L++ + + +V + Q + N
Sbjct: 208 LRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPM-LVQTLVEN 266

Query: 425 IVKHA-----DASAVTLQGWQHDERLMLVIEDDGSGLPPDSGQ-HGFGLTGMRERVTALG 478
+KH + L+G + + + L +E+ GS ++ + G GL +RER+ L
Sbjct: 267 GIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGTGLQNVRERLQMLY 326

Query: 479 G---TLTISCLHG-TRVSVSLP 496
G + +S G V +P
Sbjct: 327 GTEAQIKLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score   E-value   Comments
ECP_3875   HTHFIS   61      2e-13     FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 61.4 bits (149), Expect = 2e-13
Identities = 29/174 (16%), Positives = 59/174 (33%), Gaps = 20/174 (11%)

Query: 2 ITVALIDDHLIVRSGFAQLLGLEPDLQVVAEFGSGREALAGLPGRGVQVCICDISMPDIS 61
T+ + DD +R+ Q L V + + + + D+ MPD +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRA-GYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 62 GLELLSQLPK---GMATIMLSVHDSPALVEQALNAGARGFLSKRCSPDELIAAVHTVATG 118
+LL ++ K + +++S ++ +A GA +L K ELI +
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR---- 117

Query: 119 GCYLTPDIAIKLASGRQDPLTKRERQVAEKLAQG---MAVKEIAAELGLSPKTV 169
A+ R L + + + + + A L + T+
Sbjct: 118 --------ALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTL 163


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits      Score   E-value   Comments
ECP_3879   TCRTETB   59      1e-11     Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 58.7 bits (142), Expect = 1e-11
Identities = 41/184 (22%), Positives = 80/184 (43%), Gaps = 1/184 (0%)

Query: 7 RNVNLLLMLVLLVAVGQMAQTIYIPAIADMARDLNVREGAVQSVMGAYLLTYGVSQLFYG 66
R+ +L+ L +L + + + ++ D+A D N + V A++LT+ + YG
Sbjct: 11 RHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYG 70

Query: 67 PISDRVGRRPVILVGMSIFMLATLVA-VTTSSLTVLIAASAMQGMGTGVGGVMARTLPRD 125
+SD++G + ++L G+ I +++ V S ++LI A +QG G + +
Sbjct: 71 KLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVAR 130

Query: 126 LYERTQLRHANSLLNMGILVSPLLAPLIGGLLDTMWNWRACYLFLLALCAGVTFSMARWM 185
+ A L+ + + + P IGG++ +W L + V F M
Sbjct: 131 YIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLK 190

Query: 186 PETR 189
E R
Sbjct: 191 KEVR 194


99   ECP_4213   ECP_4228   N   Y   Y   Pathogenicity Island (unbiased-composition)
LocusTag   DNBias   CDNBias   %GCBias     Product
ECP_4213   1        16        4.101091    transcriptional regulator HU subunit alpha
ECP_4214   1        18        4.500375    hypothetical protein
ECP_4215   1        17        3.970611    zinc resistance protein
ECP_4216   1        19        4.074853    sensor protein ZraS
ECP_4217   1        19        3.496316    transcriptional regulatory protein ZraR
ECP_4218   1        17        2.136748    phosphoribosylamine--glycine ligase
ECP_4219   0        16        1.368593    bifunctional
ECP_4221   -1       12        0.564279    *hypothetical protein
ECP_4222   -1       13        1.344855    hypothetical protein
ECP_4223   -1       17        2.419051    homoserine O-succinyltransferase
ECP_4224   -1       17        2.086363    malate synthase
ECP_4225   0        16        1.588007    isocitrate lyase
ECP_4226   0        14        1.672916    bifunctional isocitrate dehydrogenase
ECP_4227   0        15        1.902884    IclR family transcriptional regulator
ECP_4228   -1       15        1.723989    B12-dependent methionine synthase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits           Score   E-value   Comments
ECP_4213   DNABINDINGHU   120     2e-39     Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 120 bits (302), Expect = 2e-39
Identities = 50/89 (56%), Positives = 66/89 (74%)

Query: 2 NKTQLIDVIAEKAELSKTQAKAALESTLAAITESLKEGDAVQLVGFGTFKVNHRAERTGR 61
NK LI +AE EL+K + AA+++ +A++ L +G+ VQL+GFG F+V RA R GR
Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62

Query: 62 NPQTGKEIKIAAANVPAFVSGKALKDAVK 90
NPQTG+EIKI A+ VPAF +GKALKDAVK
Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAVK 91


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits      Score   E-value   Comments
ECP_4216   PF06580   39      3e-05     Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 38.7 bits (90), Expect = 3e-05
Identities = 49/262 (18%), Positives = 105/262 (40%), Gaps = 43/262 (16%)

Query: 197 ILFALATVLLA-SVLSFFW-YRRYLRSRQLLQDEMKRKEKLVALGHLAAGV-AHEIRNPL 253
I+F + V S+L F W + + + ++ Q +M + L L A + H + N L
Sbjct: 120 IIFNVVVVTFMWSLLYFGWHFFKNYKQAEIDQWKMASMAQEAQLMALKAQINPHFMFNAL 179

Query: 254 SSIKGLAKYFAERASAGGEAHQLAQVM---AKEADRLNRVVSELLELVKPTHLALQAVEL 310
++I+ L +A L+++M + ++ +++ L +V ++L L +++
Sbjct: 180 NNIRALILEDPTKAREM--LTSLSELMRYSLRYSNARQVSLADELTVVD-SYLQLASIQF 236

Query: 311 NTLINHSLQLVSQDANSREIQLRFTANDTLPEIQADPDRLTQVLL-NLYLNAIQAIGQHG 369
+ Q+ + ++Q+ P L Q L+ N + I + Q G
Sbjct: 237 EDRLQFENQI---NPAIMDVQV--------------PPMLVQTLVENGIKHGIAQLPQGG 279

Query: 370 VISVTASESGAGVKISVTDSGKGIAADQLEAIFTPYFTTKAEGTGLGLAVVHNIVEQHGG 429
I + ++ V + V ++G + E TG GL V ++ G
Sbjct: 280 KILLKGTKDNGTVTLEVENTGSLALKNT------------KESTGTGLQNVRERLQMLYG 327

Query: 430 ---TIQVASQEGKGATFTLWLP 448
I+++ ++GK + +P
Sbjct: 328 TEAQIKLSEKQGKV-NAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score   E-value   Comments
ECP_4217   HTHFIS   527     0.0       FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 527 bits (1359), Expect = 0.0
Identities = 184/468 (39%), Positives = 253/468 (54%), Gaps = 35/468 (7%)

Query: 8 ILVVDDDISHCTILQALLRGWGYNVALANSGRQALEQVREQVFDLVLCDVRMAEMDGIAT 67
ILV DDD + T+L L GY+V + ++ + DLV+ DV M + +
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 68 LKEIKTLNPAIPVLIMTAYSSVETAVEALKTGALDYLIKPLDFDNLQATLEKALAHTHSV 127
L IK P +PVL+M+A ++ TA++A + GA DYL KP D L + +ALA
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRR 125

Query: 128 DAETPAVSASQFGMVGKSPAMQHLLSEIALVAPSEATVLIHGDSGTGKELVARAIHASSA 187
++ S +VG+S AMQ + +A + ++ T++I G+SGTGKELVARA+H
Sbjct: 126 PSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGK 185

Query: 188 RSEKPLVTLNCAALNESLLESELFGHEKGAFTGADKRREGRFVEADGGTLFLDEIGDISP 247
R P V +N AA+ L+ESELFGHEKGAFTGA R GRF +A+GGTLFLDEIGD+
Sbjct: 186 RRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPM 245

Query: 248 MMQVRLLRAIQEREVQRVGSNQTISVDVRLIAATHRDLAAEVNAGRFRQDLYYRLNVVAI 307
Q RLLR +Q+ E VG I DVR++AAT++DL +N G FR+DLYYRLNVV +
Sbjct: 246 DAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPL 305

Query: 308 EVPSLRQRREDIPLLANHFLQRFAERNRKAVKGFTPQAMDLLIHYDWPGNIRELENAVER 367
+P LR R EDIP L HF+Q+ + VK F +A++L+ + WPGN+RELEN V R
Sbjct: 306 RLPPLRDRAEDIPDLVRHFVQQAEKEGLD-VKRFDQEALELMKAHPWPGNVRELENLVRR 364

Query: 368 AVVLLTGEYISERELPLAIASTPIPLVQSQDIQP-------------------------- 401
L + I+ + + S +
Sbjct: 365 LTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALP 424

Query: 402 --------LVEVEKEVILAALEKTGGNKTEAARQLGITRKTLLAKLSR 441
L E+E +ILAAL T GN+ +AA LG+ R TL K+
Sbjct: 425 PSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRE 472


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits           Score   E-value   Comments
ECP_4221   SHAPEPROTEIN   32      6e-04     Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 31.7 bits (72), Expect = 6e-04
Identities = 23/62 (37%), Positives = 32/62 (51%), Gaps = 9/62 (14%)

Query: 37 IANFFVAEKVLQDLVLQLHPRSTWHSFLPAKRMDIVVSALEMNEGGLSQVEERILHEVVA 96
IA+FFV EK+LQ + Q+H S P+ R+ + V G +QVE R + E
Sbjct: 81 IADFFVTEKMLQHFIKQVHSNSF---MRPSPRVLVCVPV------GATQVERRAIRESAQ 131

Query: 97 GA 98
GA
Sbjct: 132 GA 133


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits           Score   E-value   Comments
ECP_4222   SACTRNSFRASE   34      1e-04     Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 33.8 bits (77), Expect = 1e-04
Identities = 16/54 (29%), Positives = 21/54 (38%), Gaps = 5/54 (9%)

Query: 78 IDPDVRGCGVGRMLVKHALSMAPE-----LTTNVNEQNEQAVGFYKKVGFKVTG 126
+ D R GVG L+ A+ A E L + N A FY K F +
Sbjct: 97 VAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFIIGA 150


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits           Score   E-value   Comments
ECP_4225   BINARYTOXINB   32      0.004     Binary toxin B family signature.
		>BINARYTOXINB#Binary toxin B family signature.

Length = 764

Score = 32.3 bits (73), Expect = 0.004
Identities = 14/58 (24%), Positives = 23/58 (39%)

Query: 289 ETSTPDLELARRFAQAIHAKYPGKLLAYNCSPSFNWQKNLDDKTIASFQQQLSDMGYK 346
ET+ PD+ L A P L Y + N D +T + + QL+++
Sbjct: 544 ETTKPDMTLKEALKIAFGFNEPNGNLQYQGKDITEFDFNFDQQTSQNIKNQLAELNAT 601


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits           Score   E-value   Comments
ECP_4228   BCTERIALGSPD   32      0.019     Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 31.8 bits (72), Expect = 0.019
Identities = 19/87 (21%), Positives = 37/87 (42%), Gaps = 17/87 (19%)

Query: 348 SGLEPLNIGEDSLFVNVGERTN---VTGSA----KFKRLIKEEKYSEALDVARQQVENGA 400
+P+ + ++ + +TN VT + +R+I + LD+ R QV A
Sbjct: 298 QAAKPVAALDKNIIIKAHGQTNALIVTAAPDVMNDLERVIAQ------LDIRRPQVLVEA 351

Query: 401 QIIDINMDEGMLDAEAAMVRFLNLIAG 427
I ++ D L+ +++ N AG
Sbjct: 352 IIAEVQ-DADGLNLG---IQWANKNAG 374


100   ECP_4348   ECP_4356   N   Y   N   Pathogenicity Island (unbiased-composition)
LocusTag   DNBias   CDNBias   %GCBias     Product
ECP_4348   0        17        1.141389    phosphonate/organophosphate ester transporter
ECP_4349   -2       16        -0.392901   hypothetical protein
ECP_4350   -3       16        0.130375    hypothetical protein
ECP_4351   -3       17        0.525423    hypothetical protein
ECP_4352   -3       14        0.198040    hypothetical protein
ECP_4353   -1       12        0.698136    hypothetical protein
ECP_4354   -1       13        -0.977172   proline/glycine betaine transporter
ECP_4355   -1       18        -0.300085   sensor protein BasS/PmrB
ECP_4356   -1       17        -0.828393   DNA-binding transcriptional regulator BasR
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits      Score   E-value   Comments
ECP_4348   PF05272   29      0.020     Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.3 bits (65), Expect = 0.020
Identities = 12/22 (54%), Positives = 13/22 (59%)

Query: 32 MVALLGPSGSGKSTLLRHLSGL 53
V L G G GKSTL+ L GL
Sbjct: 598 SVVLEGTGGIGKSTLINTLVGL 619


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits           Score   E-value   Comments
ECP_4349   FLGLRINGFLGH   26      0.045     Flagellar L-ring protein signature.
		>FLGLRINGFLGH#Flagellar L-ring protein signature.

Length = 232

Score = 26.5 bits (58), Expect = 0.045
Identities = 11/42 (26%), Positives = 23/42 (54%)

Query: 66 IAGSDIMMSDAIPSGKASYSGFTLVLDSQQVEEGKRWFDNLA 107
I+GS+ + S + + Y G + ++Q + +R+F NL+
Sbjct: 189 ISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLS 230


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits      Score   E-value   Comments
ECP_4354   TCRTETA   43      2e-06     Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 43.3 bits (102), Expect = 2e-06
Identities = 57/290 (19%), Positives = 105/290 (36%), Gaps = 55/290 (18%)

Query: 85 FFGMLGDKYGRQKILAITIVIMSISTFCIGLIPSYDTIGIWAPILLLICKMAQGFSVGGE 144
G L D++GR+ +L +++ ++ + P +W +L I ++ G + G
Sbjct: 62 VLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPF-----LW---VLYIGRIVAGIT-GAT 112

Query: 145 YTGASIFVAEYSPDRKR----GFMGSWLDFGSIAGFVLGAGVVVLISTIVGEANFLDWGW 200
A ++A+ + +R GFM + FG +AG VLG G++ S
Sbjct: 113 GAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLG-GLMGGFSP------------ 159

Query: 201 RIPFFIALPLGIIGLYLRHALEETPAFQQHVDKLEQGDREGLQDGPKVSFKEIATKYWRS 260
PFF A L + L K E+ P SF+ W
Sbjct: 160 HAPFFAAAALNGLNFLTGCFLLPESH------KGERRPLRREALNPLASFR------WAR 207

Query: 261 LLTCIGLVIATNVTYYML----LTYMPSYLSHNLHYS-EDHGVLIIIAIMIGMLFVQPVM 315
+T + ++A ++ + H+ G+ + ++ L +
Sbjct: 208 GMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMIT 267

Query: 316 GLLSDRFGRRPFVLLG----SVALFVLA--------IPAFILINSNVIGL 353
G ++ R G R ++LG +LA P +L+ S IG+
Sbjct: 268 GPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASGGIGM 317



Score = 39.0 bits (91), Expect = 4e-05
Identities = 39/164 (23%), Positives = 73/164 (44%), Gaps = 16/164 (9%)

Query: 286 LSHNLHYSEDHGVLI-IIAIMIGMLFVQPVMGLLSDRFGRRPFVLLGSVALFVLAIPAFI 344
L H+ + +G+L+ + A+M PV+G LSDRFGRRP +L+ L A+ I
Sbjct: 35 LVHSNDVTAHYGILLALYALM--QFACAPVLGALSDRFGRRPVLLVS---LAGAAVDYAI 89

Query: 345 LINSNVIGLIFAGLLMLAVILNCFTGVMASTLPAMFPTHIR---YSALAAAFNISVLVAG 401
+ + + +++ G ++A I V + + + R + ++A F +VAG
Sbjct: 90 MATAPFLWVLYIG-RIVAGITGATGAVAGAYIADITDGDERARHFGFMSACFGFG-MVAG 147

Query: 402 LTPTLAAWLVESSQNLMMPAYYLMVVAVVGLITG-VTMKETANR 444
P L + S + P + + + +TG + E+
Sbjct: 148 --PVLGGLMGGFSPH--APFFAAAALNGLNFLTGCFLLPESHKG 187


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits      Score   E-value   Comments
ECP_4355   PF06580   37      1e-04     Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 36.8 bits (85), Expect = 1e-04
Identities = 40/182 (21%), Positives = 80/182 (43%), Gaps = 34/182 (18%)

Query: 181 ARLDQMMESVSQLLQLARAGQSFSSGNYQHVKLLEDV-ILPSYDELSTML--DQRQQTLL 237
+ +M+ S+S+L++ S N + V L +++ ++ SY +L+++ D+ Q
Sbjct: 191 TKAREMLTSLSELMR-----YSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQ 245

Query: 238 LPESAADITVQGDATLLRMLLRNLVENAHRY----SPQGSNIMIKLQEDGGAV-MAVEDE 292
+ + D+ V ML++ LVEN ++ PQG I++K +D G V + VE+
Sbjct: 246 INPAIMDVQV------PPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENT 299

Query: 293 GPGIDESKCGELSKAFVRMDSRYGGIGLGLSIV-SRITQLHHGQFFLQNRQETSGTRAWV 351
G + + G GL V R+ L+ + ++ ++ A V
Sbjct: 300 GSLA--------------LKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMV 345

Query: 352 RL 353
+
Sbjct: 346 LI 347


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score   E-value   Comments
ECP_4356   HTHFIS   91      2e-23     FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 90.7 bits (225), Expect = 2e-23
Identities = 41/121 (33%), Positives = 60/121 (49%)

Query: 2 KILIVEDDTLLLQGLILAAQTEGYACDGVTTARMAEQSLEAGHYSLVVLDLGLPDEDGLH 61
IL+ +DD + L A GY + A + + AG LVV D+ +PDE+
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 FLARIRQKKYTLPVLILTARDTLTDKIAGLDVGADDYLVKPFALEELHARIRALLRRHNN 121
L RI++ + LPVL+++A++T I + GA DYL KPF L EL I L
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 122 Q 122
+
Sbjct: 125 R 125


101   ECP_4367   ECP_4374   N   Y   N   Pathogenicity Island (unbiased-composition)
LocusTag   DNBias   CDNBias   %GCBias     Product
ECP_4367   1        17        -4.417609   DNA-binding transcriptional activator DcuR
ECP_4368   2        13        -4.753462   sensory histidine kinase DcuS
ECP_4369   -1       15        -4.339168   hypothetical protein
ECP_4370   -1       17        -4.260793   hypothetical protein
ECP_4371   0        22        -3.609450   hypothetical protein
ECP_4372   0        18        -4.233691   hypothetical protein
ECP_4373   0        18        -3.969331   lysyl-tRNA synthetase
ECP_4374   0        18        -3.462115   POT family di-/tripeptide transport protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score   E-value   Comments
ECP_4367   HTHFIS   70      4e-16     FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 70.2 bits (172), Expect = 4e-16
Identities = 31/109 (28%), Positives = 50/109 (45%), Gaps = 4/109 (3%)

Query: 4 VLIIDDDAMVAELNRRYVAQIPGFQCCGTASTLEKAKEIIFNSDTPIDLILLDIYMQKEN 63
+L+ DDDA + + + +++ G+ S I + DL++ D+ M EN
Sbjct: 6 ILVADDDAAIRTVLNQALSRA-GYDVR-ITSNAATLWRWI--AAGDGDLVVTDVVMPDEN 61

Query: 64 GLDLLPVLHNARCKSDVIVISSAADAATIKDSLHYGVVDYLIKPFQASR 112
DLLP + AR V+V+S+ T + G DYL KPF +
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTE 110


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits      Score   E-value   Comments
ECP_4368   PF06580   41      8e-06     Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 41.0 bits (96), Expect = 8e-06
Identities = 21/99 (21%), Positives = 38/99 (38%), Gaps = 18/99 (18%)

Query: 442 LIENALE-ALGP-EPGGEISVTLHYRHGWLHCEVNDDGPGIAPDKIDHIFDKGVSTKGSE 499
L+EN ++ + GG+I + +G + EV + G +
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKN------------TKES 310

Query: 500 RGVGLALVKQQVENLGG---SIAVESEPGIFTQFFVQIP 535
G GL V+++++ L G I + + G V IP
Sbjct: 311 TGTGLQNVRERLQMLYGTEAQIKLSEKQGKVN-AMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits           Score   E-value   Comments
ECP_4370   SACTRNSFRASE   26      0.012     Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 26.4 bits (58), Expect = 0.012
Identities = 9/28 (32%), Positives = 16/28 (57%)

Query: 32 LAIIEHTDVDESLKGQGIGKQLVAKVVE 59
A+IE V + + +G+G L+ K +E
Sbjct: 89 YALIEDIAVAKDYRKKGVGTALLHKAIE 116


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits      Score   E-value   Comments
ECP_4374   TCRTETA   30      0.028     Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 29.8 bits (67), Expect = 0.028
Identities = 36/190 (18%), Positives = 66/190 (34%), Gaps = 14/190 (7%)

Query: 44 NHAISLFSAYA-SLVYVTPILGGWLADRLLGNRTAVIAGALLMTLGHVVLGIDTNSTFSL 102
H L + YA P+LG +DR G R ++ + + ++ + L
Sbjct: 43 AHYGILLALYALMQFACAPVLGAL-SDRF-GRRPVLLVSLAGAAVDYAIMAT-APFLWVL 99

Query: 103 YLALAIIICGYGLFKSNISCLLGELYDEND-HRRDGGFSLLYAAGNIGSIAAPIACGLAA 161
Y+ + G+ + + + D D R F + A G +A P+ GL
Sbjct: 100 YIGRIV----AGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMG 155

Query: 162 QWYGWHVGFALAGGGMFIGLLIFLSGHRHFQSTRSMDKKALTSVKF-ALPVWSWLVVMLC 220
+ H F A + L FL+G + +++ L L + W M
Sbjct: 156 G-FSPHAPFFAAA---ALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTV 211

Query: 221 LAPVFFTLLL 230
+A + +
Sbjct: 212 VAALMAVFFI 221


102   ECP_4496   ECP_4505   N   Y   N   Pathogenicity Island (unbiased-composition)
LocusTag   DNBias   CDNBias   %GCBias     Product
ECP_4496   0        20        -2.363630   arginine repressor
ECP_4497   0        18        -1.895257   C4-dicarboxylate anaerobic carrier
ECP_4498   -1       20        -2.213847   ornithine carbamoyltransferase
ECP_4499   0        22        -3.949416   carbamate kinase
ECP_4500   -2       16        -1.539692   arginine deiminase
ECP_4501   -2       19        -0.834737   hypothetical protein
ECP_4504   -1       19        0.498507    hypothetical protein
ECP_4505   -2       19        1.106176    acetyltransferase YjgM
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits           Score   E-value   Comments
ECP_4496   ARGREPRESSOR   93      5e-27     Bacterial arginine repressor signature.
		>ARGREPRESSOR#Bacterial arginine repressor signature.

Length = 149

Score = 93.0 bits (231), Expect = 5e-27
Identities = 37/138 (26%), Positives = 70/138 (50%), Gaps = 3/138 (2%)

Query: 21 LITEKSYLSQEEIRRDLQNHGFDSISQSTVSRLLKLLGVIKIRNTKGQKIYSVNPQLL-- 78
+IT +Q+E+ L+ G++ ++Q+TVSR +K L ++K+ G YS+
Sbjct: 13 IITANEIETQDELVDILKKDGYN-VTQATVSRDIKELHLVKVPTNNGSYKYSLPADQRFN 71

Query: 79 PTPDAGRSVAEMVLSVEHNGEFILIHTVAGYGRAVARILDFHALPEILGVIAGSNIVWVA 138
P RS+ + + ++ I++ T+ G +A+ ++D EI+G I G + + +
Sbjct: 72 PLSKLKRSLMDAFVKIDSASHLIVLKTMPGNAQAIGALMDNLDWEEIMGTICGDDTILII 131

Query: 139 PRVVKRTALVHKQINYLL 156
R T +V K+I LL
Sbjct: 132 CRTHDDTKVVQKKILELL 149


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits           Score   E-value   Comments
ECP_4499   CARBMTKINASE   371     e-132     Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 371 bits (954), Expect = e-132
Identities = 136/309 (44%), Positives = 180/309 (58%), Gaps = 13/309 (4%)

Query: 6 TLVIALGGNALLKRGEPLEAEIQRKNIDLAAKTIAQL-TQHWRVVLVHGNGPQVGLLALQ 64
+VIALGGNAL +RG+ E N+ A+ IA++ + + VV+ HGNGPQVG L L
Sbjct: 4 RVVIALGGNALQQRGQKGSYEEMMDNVRKTARQIAEIIARGYEVVITHGNGPQVGSLLLH 63

Query: 65 NSA---YAHVAPYPLDILGAESQGMIGYMLQQALKNQLPQREISV----LLTQVEVDAND 117
A + P+D+ GA SQG IGYM+QQALKN+L +R + ++TQ VD ND
Sbjct: 64 MDAGQATYGIPAQPMDVAGAMSQGWIGYMIQQALKNELRKRGMEKKVVTIITQTIVDKND 123

Query: 118 PAFSNPTKYIGPIYDHAQTQVLQAEKGWVFKAD-GHSFRRVVPSPQPKRIVERDAIQTLI 176
PAF NPTK +GP YD + L EKGW+ K D G +RRVVPSP PK VE + I+ L+
Sbjct: 124 PAFQNPTKPVGPFYDEETAKRLAREKGWIVKEDSGRGWRRVVPSPDPKGHVEAETIKKLV 183

Query: 177 AHDHLVICNGAGGVPVVEKADGYHGIEAVIDKDLSAALLASQIHADALLILTDADAVYLD 236
+VI +G GGVPV+ + G+EAVIDKDL+ LA +++AD +ILTD + L
Sbjct: 184 ERGVIVIASGGGGVPVILEDGEIKGVEAVIDKDLAGEKLAEEVNADIFMILTDVNGAALY 243

Query: 237 WGKPTQRPLAQVTPE----LLNEMQFDAGSMGPKVTACAKFVSQCRGIAGIGSLADGPEI 292
+G ++ L +V E E F AGSMGPKV A +F+ A I L E
Sbjct: 244 YGTEKEQWLREVKVEELRKYYEEGHFKAGSMGPKVLAAIRFIEWGGERAIIAHLEKAVEA 303

Query: 293 LAGDKGTLI 301
L G GT +
Sbjct: 304 LEGKTGTQV 312


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits           Score   E-value   Comments
ECP_4500   ARGDEIMINASE   418     e-147     Bacterial arginine deiminase signature.
		>ARGDEIMINASE#Bacterial arginine deiminase signature.

Length = 409

Score = 418 bits (1075), Expect = e-147
Identities = 140/407 (34%), Positives = 226/407 (55%), Gaps = 13/407 (3%)

Query: 6 VGSEIGQLCSVMLHRPNLSLKRLTPSNCQELLFDDVLSVERAGEEHDIFANTLRQQGIEV 65
+ SEIG+L V+LHRP L+ LTP + LFDD+ +E A +EH++FA+ L+ +E+
Sbjct: 10 IFSEIGRLKKVLLHRPGEELENLTPFIMKNFLFDDIPYLEVARQEHEVFASILKNNLVEI 69

Query: 66 LLLTDLLTQTLDIPEA-KSWLLETQISDYRLGPTFATD-VRTWLAEMSHRDLARHLSGGL 123
+ DL+++ L A ++ + I + + F + ++ + + ++ ++ + G+
Sbjct: 70 EYIEDLISEVLVSSVALENKFISQFILEAEIKTDFTINLLKDYFSSLTIDNMISKMISGV 129

Query: 124 TYSEIPASIKNMVVDTHDINDFIMKPLPNHLFTRDTSCWIYNGVSINPMAKPARQRETNN 183
E+ ++ + N FI+ P+PN LFTRD I NGV+IN M RQRET
Sbjct: 130 VTEELKNYTSSLDDLVNGANLFIIDPMPNVLFTRDPFASIGNGVTINKMFTKVRQRETIF 189

Query: 184 LRAIYRWHPQFAGGEFIKYFGDENINYDHATLEGGDVLVIGRGAVLIGMSERTTPQGVEF 243
I+++HP + + + A+LEGGD LV+ +G ++IG+SERT + VE
Sbjct: 190 AEYIFKYHPVY-KENVPIWLNRW----EEASLEGGDELVLNKGLLVIGISERTEAKSVEK 244

Query: 244 LAQALFKHRQA-ERVIAVELPKHRYCMHLDTVMTHIDIDTFSVYPEVVRPDVNCWTLTPD 302
LA +LFK++ + + ++A ++PK+R MHLDTV T ID F+ + + + LT +
Sbjct: 245 LAISLFKNKTSFDTILAFQIPKNRSYMHLDTVFTQIDYSVFTSFTSDDM-YFSIYVLTYN 303

Query: 303 GHGG--LKRTQESTLLHAIEKALGIDQVRLI-TTGGDAFEAEREQWNDANNVLTLRPGVV 359
+ +++ + + LG ++ +I GGD REQWND NVL + PG +
Sbjct: 304 PSSSKIHIKKEKARIKDVLSFYLG-RKIDIIKCAGGDLIHGAREQWNDGANVLAIAPGEI 362

Query: 360 VGYERNIWTNEKYDKAGITVLPIPGDELGRGRGGARCMSCPLHRDGI 406
+ Y RN TN+ +++ GI V IP EL RGRGG RCMS PL R+ I
Sbjct: 363 IAYSRNHVTNKLFEENGIKVHRIPSSELSRGRGGPRCMSMPLIREDI 409


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits           Score   E-value   Comments
ECP_4505   SACTRNSFRASE   32      5e-04     Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 32.2 bits (73), Expect = 5e-04
Identities = 15/48 (31%), Positives = 18/48 (37%)

Query: 97 PAIRGKGLAKKLALMAMEQAREMGFKRCYLETTAFLKEAIALYEHLGF 144
R KG+ L A+E A+E F LET A Y F
Sbjct: 99 KDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHF 146


103   ECP_4533   ECP_4542   N   Y   N   Pathogenicity Island (unbiased-composition)
LocusTag   DNBias   CDNBias   %GCBias     Product
ECP_4533   6        39        -5.459331   protein PapG
ECP_4534   6        34        -1.367789   PapF protein
ECP_4535   5        33        -0.965830   PapE protein
ECP_4536   5        31        -0.684212   PapK fimbrial adapter
ECP_4537   5        33        -1.377199   protein PapJ
ECP_4538   5        31        -2.584714   chaperone protein PapD
ECP_4539   5        30        -2.330371   outer membrane usher protein PapC
ECP_4540   4        40        -6.502853   protein PapH
ECP_4541   3        42        -8.130855   protein PapA
ECP_4542   2        41        -8.065825   major pilu subunit operon regulatory protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits      Score   E-value   Comments
ECP_4533   PF03627   549     0.0       PapG
		>PF03627#PapG

Length = 336

Score = 549 bits (1416), Expect = 0.0
Identities = 192/339 (56%), Positives = 232/339 (68%), Gaps = 7/339 (2%)

Query: 1 MKKWFPAFLF-LSLSGCNDALAANQSTIFYSFNDNIYHPQLSVKVTDIVQFIVDINSASS 59
MKKWFPA LF L +SG + A + +FYS + + +V +T QFI +
Sbjct: 1 MKKWFPALLFSLCVSGESSAW---NNIVFYSLGNVNSYQGGNVVITQRPQFITSWRPGIA 57

Query: 60 TATLSYVACNGFTWTHGLYWSEYFAWLVVPKHV-SYNGYNIYLELQSKGGFSLD-AEDND 117
T T + GF Y+ EY AW+V PK V + NGY +++E+ +KG +S + DND
Sbjct: 58 TVTWNQCNGPGFADGSWAYYREYIAWVVFPKKVMTKNGYPLFIEVHNKGSWSEENTGDND 117

Query: 118 NYYLTKGFAWDE-VNSSGRVCFDIGEKRSLAWSFGGVTLNARLPVDLPKGDYTFPVKFLR 176
+Y+ KG+ WDE +G +C GE L F + LP DLP GDY+ + +
Sbjct: 118 SYFFLKGYKWDERAFDAGNLCQKPGETTRLTEKFDDIIFKVALPADLPLGDYSVTIPYTS 177

Query: 177 GIQRNNYDYIGGRYKIPSSLMKTFPFNGTLNFSIKNTGGCRPSAQSLEINHGDLSINSAN 236
G+QR+ Y+G R+KIP ++ KT P + F KN GGCRPSAQSLEI HGDLSINSAN
Sbjct: 178 GMQRHFASYLGARFKIPYNVAKTLPRENEMLFLFKNIGGCRPSAQSLEIKHGDLSINSAN 237

Query: 237 NHYAAQTLSVSCDVPTNIRFFLLSNTTPAYSHGQQFSVGLGHGWDSIVSINGVDTGETTM 296
NHYAAQTLSVSCDVP NIRF LL NTTP YSHG++FSVGLGHGWDSIVS+NGVDTGETTM
Sbjct: 238 NHYAAQTLSVSCDVPANIRFMLLRNTTPTYSHGKKFSVGLGHGWDSIVSVNGVDTGETTM 297

Query: 297 RWYRAGTQNLTIGSRLYGESSKIQPGVLSGSATLLMILP 335
RWY+AGTQNLTIGSRLYGESSKIQPGVLSGSATLLMILP
Sbjct: 298 RWYKAGTQNLTIGSRLYGESSKIQPGVLSGSATLLMILP 336


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits           Score   E-value   Comments
ECP_4534   FIMBRIALPAPF   292     e-105     Escherichia coli: P pili tip fibrillum papF protein...
		>FIMBRIALPAPF#Escherichia coli: P pili tip fibrillum papF protein signature.

Length = 167

Score = 292 bits (749), Expect = e-105
Identities = 165/167 (98%), Positives = 165/167 (98%)

Query: 1 MIRLSLFISLLLTSVAVLADVQINIRGNVYIPPCTINNGQNIVVDFGNINPEHVDNSRGE 60
MIRLSLFISLLLTSVAVLADVQINIRGNVYIPPCTINNGQNIVVDFGNINPEHVDNSRGE
Sbjct: 1 MIRLSLFISLLLTSVAVLADVQINIRGNVYIPPCTINNGQNIVVDFGNINPEHVDNSRGE 60

Query: 61 VTKTISISCPYKSGSLWIKVTGNTMGGGQNNVLATNITHFGIALYQGKGMSTPLTLGNGS 120
VTK ISISCPYKSGSLWIKVTGNTMG GQNNVLATNITHFGIALYQGKGMSTPLTLGNGS
Sbjct: 61 VTKNISISCPYKSGSLWIKVTGNTMGVGQNNVLATNITHFGIALYQGKGMSTPLTLGNGS 120

Query: 121 GNGYRVTAGLDTARSTFTFTSVPFRNGSGILNGGDFRTTASMSMIYN 167
GNGYRVTAGLDTARSTFTFTSVPFRNGSGILNGGDFRTTASMSMIYN
Sbjct: 121 GNGYRVTAGLDTARSTFTFTSVPFRNGSGILNGGDFRTTASMSMIYN 167


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits           Score   E-value   Comments
ECP_4535   FIMBRIALPAPE   276     9e-99     Escherichia coli: P pili tip fibrillum papE protein...
		>FIMBRIALPAPE#Escherichia coli: P pili tip fibrillum papE protein signature.

Length = 173

Score = 276 bits (706), Expect = 9e-99
Identities = 153/158 (96%), Positives = 156/158 (98%)

Query: 1 MLMSQHAHAADNLTFKGKLIIPACTVQNAEVDWGDIEIQNLVQNGGNQKDFTVDMNCPYS 60
+LMSQH HAADNLTFKGKLIIPACTVQNAEV+WGDIEIQNLVQ+GGNQKDFTVDMNCPYS
Sbjct: 16 VLMSQHVHAADNLTFKGKLIIPACTVQNAEVNWGDIEIQNLVQSGGNQKDFTVDMNCPYS 75

Query: 61 LGTMKVTITSNGQTGNSILVPNTSTASGDGLLIYLYNSNNSGIGNAVTLGSQFTPGKITG 120
LGTMKVTITSNGQTGNSILVPNTSTASGDGLLIYLYNSNNSGIGNAVTLGSQ TPGKITG
Sbjct: 76 LGTMKVTITSNGQTGNSILVPNTSTASGDGLLIYLYNSNNSGIGNAVTLGSQVTPGKITG 135

Query: 121 TAPARKITLYAKLGYKGNMQSLQAGTFSATATLVASYS 158
TAPARKITLYAKLGYKGNMQSLQAGTFSATATLVASYS
Sbjct: 136 TAPARKITLYAKLGYKGNMQSLQAGTFSATATLVASYS 173


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits      Score   E-value   Comments
ECP_4539   PF00577   744     0.0       Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 744 bits (1922), Expect = 0.0
Identities = 244/882 (27%), Positives = 364/882 (41%), Gaps = 67/882 (7%)

Query: 2 MRVMKDRI-PFAVNNITCVILLSLFCNAASAVEFNTDVLDAADKKNIDFTRFSEAGYVLP 60
+ + K R+ F V + +++ + FN L + D +RF + P
Sbjct: 16 LHIRKHRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPP 75

Query: 61 GQYLLDVIVNGQSISPASLQISFVEPQSSGDKAEKKLPQACLTSDMVRLMGLTAESLDKV 120
G Y +D+ +N + A+ ++F S CLT + MGL S+ +
Sbjct: 76 GTYRVDIYLNNGYM--ATRDVTFNTGDSEQGI------VPCLTRAQLASMGLNTASVSGM 127

Query: 121 VYWHDGQCADF-HGLPGVDIRPDTGAGVLRINMPQAWLEYSDATWLPPSRWDDGIPGLML 179
D C + + D G L + +PQA++ ++PP WD GI +L
Sbjct: 128 NLLADDACVPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLL 187

Query: 180 DYNLNGTVSRNYQGGDSHQFSYNGTVGGNLGPWRLRADYQGSQEQSRYNGEKTTNRNFTW 239
+YN +G +N GG+SH N G N+G WRLR + S S + + +
Sbjct: 188 NYNFSGNSVQNRIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSS--DSSSGSKNKWQH 245

Query: 240 SRFYLFRAIPRWRANLTLGENNINSDIFRSWSYTGASLESDDRMLPPRLRGYAPQITGIA 299
+L R I R+ LTLG+ DIF ++ GA L SDD MLP RG+AP I GIA
Sbjct: 246 INTWLERDIIPLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIA 305

Query: 300 ETNARVVVSQQGRVLYDSMVPAGPFSIQDLD-SSVRGRLDVEVIEQNGRKKTFQVDTASV 358
A+V + Q G +Y+S VP GPF+I D+ + G L V + E +G + F V +SV
Sbjct: 306 RGTAQVTIKQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSV 365

Query: 359 PYLTRPGQVRYKLVSGRSRGYGHETEGPVFATGEASWGLSNQWSLYGGAVLAGDYNALAA 418
P L R G RY + +G R + E P F GL W++YGG LA Y A
Sbjct: 366 PLLQREGHTRYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNF 425

Query: 419 GAGWDLGVPGTLSADITQSVARIEGERTFQGKSWRLSYSKRFDNADADITFAGYRFSERN 478
G G ++G G LS D+TQ+ + + + G+S R Y+K + + +I GYR+S
Sbjct: 426 GIGKNMGALGALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSG 485

Query: 479 YMTMEQYLNARYR--------------------NDYSSREKEMYTVTLNKNVADWNTSFN 518
Y +R + + ++ +T+ + + + +
Sbjct: 486 YFNFADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTS-TLY 544

Query: 519 LQYSRQTYWDIRKTD-YYTVSVNRYFNVFGLQGVAVGLSASRSKYLGRD--NDSAYLRIS 575
L S QTYW D + +N F + LS S +K + + L ++
Sbjct: 545 LSGSHQTYWGTSNVDEQFQAGLNTAFE-----DINWTLSYSLTKNAWQKGRDQMLALNVN 599

Query: 576 VPLGT------------GTASYSGSMSND-RYVNMAGYTDT-FNDGLDSYSLNAGLNSGG 621
+P +ASYS S + R N+AG T D SYS+ G GG
Sbjct: 600 IPFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGG 659

Query: 622 GLTSQRQINAYYSHRSPLANLSANIASLQKGYTSFGVSASGGATITGKGAALHAGGMSGG 681
S A ++R N + S SGG G L G
Sbjct: 660 DGNSGSTGYATLNYRGGYGNANIG-YSHSDDIKQLYYGVSGGVLAHANGVTL--GQPLND 716

Query: 682 TRLLVDTDGVGGVPVDGGQVV-TNRWGTGVVTDISSYYRNTTSVDLKRLPDDVEATRSVV 740
T +LV G V+ V T+ G V+ + Y N ++D L D+V+ +V
Sbjct: 717 TVVLVKAPGAKDAKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVA 776

Query: 741 ESALTEGAIGYRKFSVLKGKRLFAILRLADGSQPPFGASVTSEKGRELGMVADEGLAWLS 800
T GAI +F G +L L + PFGA VTSE + G+VAD G +LS
Sbjct: 777 NVVPTRGAIVRAEFKARVGIKLLMTLT-HNNKPLPFGAMVTSESSQSSGIVADNGQVYLS 835

Query: 801 GVTPGETLSVNW--DGKIQCQVNVPETAISDQQLL----LPC 836
G+ + V W + C N S QQLL C
Sbjct: 836 GMPLAGKVQVKWGEEENAHCVANYQLPPESQQQLLTQLSAEC 877


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits           Score   E-value   Comments
ECP_4540   FIMBRIALPAPE   32      0.001     Escherichia coli: P pili tip fibrillum papE protein...
		>FIMBRIALPAPE#Escherichia coli: P pili tip fibrillum papE protein signature.

Length = 173

Score = 31.5 bits (71), Expect = 0.001
Identities = 41/173 (23%), Positives = 75/173 (43%), Gaps = 29/173 (16%)

Query: 29 GMSLPEYWG----EEHVWWDGRAAFHGEVVRPACTLAMEDAWQIIDMGETPVRDL-QNGF 83
G+ LP G +HV F G+++ PACT+ + ++ G+ +++L Q+G
Sbjct: 6 GLCLPVMLGAVLMSQHVHAADNLTFKGKLIIPACTVQNAE----VNWGDIEIQNLVQSG- 60

Query: 84 SGPERKFSLRLRNCEFNSQGGNLFSDSRIRVTFDGVRGET---PDKFNLSGQAKGINLQI 140
G ++ F++ + NC ++ ++ +T +G G + P+ SG I L
Sbjct: 61 -GNQKDFTVDM-NCPYS------LGTMKVTITSNGQTGNSILVPNTSTASGDGLLIYLYN 112

Query: 141 ADARGNIARAGKV-MPAIP--LTGNEEALDYTLRIVR----NGKKLEAGNYFA 186
++ I A + P +TG A TL N + L+AG + A
Sbjct: 113 SNN-SGIGNAVTLGSQVTPGKITGTAPARKITLYAKLGYKGNMQSLQAGTFSA 164


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits           Score   E-value   Comments
ECP_4542   FIMREGULATRY   168     3e-58     Escherichia coli: P pili regulatory PapB protein si...
		>FIMREGULATRY#Escherichia coli: P pili regulatory PapB protein signature.

Length = 104

Score = 168 bits (428), Expect = 3e-58
Identities = 100/104 (96%), Positives = 104/104 (100%)

Query: 1 MAHHEIISRAGNAFLLNIRESVLLPGSMSEMHFFLLIGISSIHSDRVILAMKDYLVGGHT 60
MAHHE+ISR+GNAFLLNIRESVLLPGSMSEMHFFLLIGISSIHSDRVILAMKDYLVGGH+
Sbjct: 1 MAHHEVISRSGNAFLLNIRESVLLPGSMSEMHFFLLIGISSIHSDRVILAMKDYLVGGHS 60

Query: 61 RKEVCEKHQMNNGYFSTTLGRLIRLNALAARLAPYYTDESSAFD 104
RKEVCEK+QMNNGYFSTTLGRLIRLNALAARLAPYYTDESSAFD
Sbjct: 61 RKEVCEKYQMNNGYFSTTLGRLIRLNALAARLAPYYTDESSAFD 104


104   ECP_4662   ECP_4669   N   Y   N   Pathogenicity Island (unbiased-composition)
LocusTag   DNBias   CDNBias   %GCBias     Product
ECP_4662   -2       19        1.202768    hypothetical protein
ECP_4663   0        18        1.676515    DNA-binding transcriptional regulator
ECP_4664   0        17        0.055980    isoaspartyl dipeptidase
ECP_4665   0        18        -0.271701   hypothetical protein
ECP_4666   0        16        -0.000233   hypothetical protein
ECP_4667   1        16        0.091981    RNA 2'-phosphotransferase-like protein
ECP_4668   1        16        -0.432568   hypothetical protein
ECP_4669   1        14        -3.045961   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits           Score   E-value   Comments
ECP_4662   PHPHTRNFRASE   151     1e-46     Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 151 bits (383), Expect = 1e-46
Identities = 46/110 (41%), Positives = 65/110 (59%)

Query: 1 MIEVPAAIMIAEKLASEVDFFSIGTNDLTQYIMAADRGNSTVAKLVDYCNDAVINAIAMV 60
M+E+P+ + A A EVDFFSIGTNDL QY MAADR N V+ L + A++ + MV
Sbjct: 430 MVEIPSTAVAANLFAKEVDFFSIGTNDLIQYTMAADRMNERVSYLYQPYHPAILRLVDMV 489

Query: 61 CQAGRNNEIPVSMCGEMAGDIQQTARLLTMGIDKLSASPSRLPALKAAIR 110
+A + V MCGEMAGD LL +G+D+ S S + + ++ +
Sbjct: 490 IKAAHSEGKWVGMCGEMAGDEVAIPLLLGLGLDEFSMSATSILPARSQLL 539


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score   E-value   Comments
ECP_4664   UREASE   35      4e-04     Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 35.1 bits (81), Expect = 4e-04
Identities = 30/129 (23%), Positives = 48/129 (37%), Gaps = 33/129 (25%)

Query: 26 CDVLIANGKIIAVASNIPSDIVPDCT--------VVDLSGQILCPGFIDQHVHLIGG--- 74
D+ + +G+I A+ D+ P T V+ G+I+ G +D H+H I
Sbjct: 86 ADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAGEGKIVTAGGMDSHIHFICPQQI 145

Query: 75 ------------GGEAGP------TTRTP-EVALSRLTEA--GITSVVGLLGTDSISRHP 113
GG GP TT TP ++R+ EA + G + S P
Sbjct: 146 EEALMSGLTCMLGGGTGPAHGTLATTCTPGPWHIARMIEAADAFPMNLAFAGKGNAS-LP 204

Query: 114 ESLLAKTRA 122
+L+
Sbjct: 205 GALVEMVLG 213


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_4668  TCRTETA  30     0.018    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 29.8 bits (67), Expect = 0.018
Identities = 65/317 (20%), Positives = 113/317 (35%), Gaps = 26/317 (8%)

Query: 82 RPFLLASALASGLLILAMAWLLPFILVLLIRVLAGV-----ASAGMLIFGSTLIMQHTRH 136
RP LL S + + MA ++ + R++AG+ A AG I T + RH
Sbjct: 73 RPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADITDGDERARH 132

Query: 137 PFVLAALFSGVGIGIALGNEYVLAGLHFDLSSQTLWQGAGALSGMMLIALTLLMP-SKKH 195
++A F G G+ G VL GL S + A AL+G+ + L+P S K
Sbjct: 133 FGFMSACF---GFGMVAGP--VLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKG 187

Query: 196 AITPMPLAKTEQQIMSWW---------LLAILYGLAGFGYIIVATYLPLMAKDAGSPLLT 246
P+ W L+A+ + + G + A ++ T
Sbjct: 188 ERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATT 247

Query: 247 AHLWTLVGLSIVPGCFGWLWA---AKRWGALPCLTANLLVQAI-CVLLTLASDSPLLLII 302
+ +L I+ + A R G L ++ +LL A+ + I
Sbjct: 248 IGI-SLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPI 306

Query: 303 SSLGFGGTFMGTTSLVMTIARQLSVPGNLNLLGFVTLIYGIGQILGPALTSMLGNGTSAL 362
L +G +L ++RQ+ L G + + + I+GP L + + +
Sbjct: 307 MVL-LASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAASITT 365

Query: 363 ASATLCGAAALFIAALI 379
+ A A +
Sbjct: 366 WNGWAWIAGAALYLLCL 382


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_4669  ADHESNFAMILY  29     0.026    Adhesin family signature.
		>ADHESNFAMILY#Adhesin family signature.

Length = 309

Score = 29.1 bits (65), Expect = 0.026
Identities = 10/45 (22%), Positives = 17/45 (37%)

Query: 53 LFVIVAVCTFFVQSCARKSNHAASFQNYHATIDGKEIAGITNNIS 97
+++ + + +CA S Q IA IT NI+
Sbjct: 6 TLLVLFLSAIILVACASGKKDTTSGQKLKVVATNSIIADITKNIA 50
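
Each hit in this report is introduced by a "Score = ..." line and an "Identities = ..." line. As a minimal sketch of how those two lines could be read programmatically, the Python below extracts the bit score, raw score, E-value, and the match fractions with regular expressions. The function name `parse_hit_stats` and the dictionary keys are hypothetical choices for illustration and are not part of PredictBias or BLAST; the sample text is copied from the ECP_4662 hit above.

```python
import re

# Patterns for the two summary lines that precede each alignment in this report.
SCORE_RE = re.compile(r"Score = ([\d.]+) bits \((\d+)\), Expect = (\S+)")
FRACTION_RE = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_hit_stats(text):
    """Return bit score, raw score, E-value and match fractions as a dict.

    Hypothetical helper: the key names are assumptions made for this sketch.
    """
    m = SCORE_RE.search(text)
    if m is None:
        raise ValueError("no 'Score = ...' line found")
    stats = {
        "bits": float(m.group(1)),
        "raw": int(m.group(2)),
        "evalue": float(m.group(3)),
    }
    # Each fraction is stored as (matches, alignment length, reported percent).
    for name, num, den, pct in FRACTION_RE.findall(text):
        stats[name.lower()] = (int(num), int(den), int(pct))
    return stats


# Summary lines of the ECP_4662 / PHPHTRNFRASE hit, as they appear in the report.
sample = """Score = 151 bits (383), Expect = 1e-46
Identities = 46/110 (41%), Positives = 65/110 (59%)"""

stats = parse_hit_stats(sample)
print(stats)
```

Hits without gaps, such as this one, simply have no "gaps" entry in the returned dictionary.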


105  ECP_4781  ECP_4787  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
ECP_4781  -2      15       0.882169   phosphoglycerate mutase
ECP_4782  -1      12       0.336763   right origin-binding protein
ECP_4783                              hypothetical protein
ECP_4784                              DNA-binding response regulator CreB
ECP_4785                              sensory histidine kinase CreC
ECP_4786                              hypothetical protein
ECP_4787                              two-component response regulator
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
ECP_4781  VACCYTOTOXIN  29     0.017    Helicobacter pylori vacuolating cytotoxin signature.
		>VACCYTOTOXIN#Helicobacter pylori vacuolating cytotoxin signature.

Length = 1291

Score = 28.8 bits (64), Expect = 0.017
Identities = 14/45 (31%), Positives = 20/45 (44%), Gaps = 4/45 (8%)

Query: 145 PLLVSHGIALGCLVSTILGLPAWAERRLRLRNCSISRVDYQESLW 189
P +V GIA G V T+ GL W ++ N D + +W
Sbjct: 42 PAIVG-GIATGAAVGTVSGLLGWGLKQAEEAN---KTPDKPDKVW 82


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
ECP_4784  HTHFIS  90     9e-23    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 89.5 bits (222), Expect = 9e-23
Identities = 34/139 (24%), Positives = 61/139 (43%)

Query: 1 MQRETVWLVEDEQGIADTLVYMLQQEGFDVEVFERGLPVLDKARQQVPDVMILDVGLPDI 60
M T+ + +D+ I L L + G+DV + + D+++ DV +PD
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 SGFELCRQLLALHPALPVLFLTARSEEVDRLLGLEIGADDYVAKPFSPREVCARVRTLLR 120
+ F+L ++ P LPVL ++A++ + + E GA DY+ KPF E+ + L
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 121 RVKKFSTPSPVIRIGHFEL 139
K+ + L
Sbjct: 121 EPKRRPSKLEDDSQDGMPL 139


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
ECP_4785  PF06580  33     0.003    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 32.9 bits (75), Expect = 0.003
Identities = 43/182 (23%), Positives = 73/182 (40%), Gaps = 40/182 (21%)

Query: 312 LRQARLENRQEVVLTAVDVAALFR---RVSEARTVQLAE--KNITLHV----------MP 356
+R LE+ + ++ L R R S AR V LA+ + ++ +
Sbjct: 182 IRALILEDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQ 241

Query: 357 TEINVAAE------PALLEQAL-GNLLDNAIDFTPESGRITLSAEVDQEYVTLKVLDTGS 409
E + P +L Q L N + + I P+ G+I L D VTL+V +TGS
Sbjct: 242 FENQINPAIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGS 301

Query: 410 GIPDYALSRIFERFYSLPRANGQKSSGLGLAFVSE-VARLFNGEVTLR-NVQEGGVLASL 467
N ++S+G GL V E + L+ E ++ + ++G V A +
Sbjct: 302 LALK----------------NTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMV 345

Query: 468 RL 469
+
Sbjct: 346 LI 347


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
ECP_4787  HTHFIS  82     4e-20    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 81.8 bits (202), Expect = 4e-20
Identities = 30/122 (24%), Positives = 60/122 (49%), Gaps = 1/122 (0%)

Query: 1 MQTPHILIVEDELVTRNTLKSIFEAEGYDVFEATDGAEMHQILSEYDINLVIMDINLPGK 60
M IL+ +D+ R L GYDV ++ A + + ++ D +LV+ D+ +P +
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 NGLLLARELRE-QANVALMFLTGRDNEVDKILGLEIGADDYITKPFNPRELTIRARNLLS 119
N L +++ + ++ ++ ++ ++ + I E GA DY+ KPF+ EL L+
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 120 RT 121

Sbjct: 121 EP 122


Database: VIFASCDB
Posted date: Jun 1, 2014 9:04 PM
Number of letters in database: 79,683
Number of sequences in database: 213

Lambda K H
0.322 0.138 0.400

Gapped
Lambda K H
0.267 0.0533 0.140


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 213
Number of Hits to DB: 207,414,809
Number of extensions: 9395514
Number of successful extensions: 36166
Number of sequences better than 5.0e-02: 1164
Number of HSP's gapped: 33567
Number of HSP's successfully gapped: 2291
Length of database: 79,683
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
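
The X1, X2 and X3 lines above give each drop-off value twice, once in raw score units and once in bits. A small worked check, offered as a sketch, shows how the two are related through the standard conversion bits = raw * Lambda / ln 2. Pairing X1 with the first (ungapped) Lambda row and X2 and X3 with the "Gapped" row is an inference from the fact that the printed bit values are then reproduced; the report itself does not state it. The function name `raw_to_bits` is hypothetical.

```python
import math


def raw_to_bits(raw, lam):
    """Convert a raw score quantity to bits: raw * lambda / ln 2."""
    return raw * lam / math.log(2)


UNGAPPED_LAMBDA = 0.322  # first "Lambda K H" row of the report
GAPPED_LAMBDA = 0.267    # "Gapped" row of the report

# Assumption: X1 is scaled with the ungapped Lambda, X2 and X3 with the gapped one.
x1 = raw_to_bits(16, UNGAPPED_LAMBDA)
x2 = raw_to_bits(38, GAPPED_LAMBDA)
x3 = raw_to_bits(64, GAPPED_LAMBDA)

print(f"X1: {x1:.1f} bits, X2: {x2:.1f} bits, X3: {x3:.1f} bits")
# prints "X1: 7.4 bits, X2: 14.6 bits, X3: 24.7 bits"
```

The rounded results agree with the values printed in parentheses in the report.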

 
Contact Sachin Pundhir for Bugs/Comments.