PredictBias

identification of genomic and pathogenicity islands in prokaryotic genome
Home | Help | Analyzed genomes
 
A) Input parameters
GenomeBM012A.gbkThreshold dinucleotide bias2
Threshold codon bias4Threshold %GC bias3
E-value (RPSBlast)0.05Genome (non-pathogenic)
 
B) Compare a potential GI or PAI in related non-pathogenic sp. (phylogenetic tree)
Potential GI or PAI start    end  
Select Organism     
 
C) Potential GIs and PAIs in CP006888 (download)
S.NoStartEndBiasVirulenceInsertion elementsPrediction
1U063_0086U063_0098Y        NNGenomic Island
LocusTagDNBiasCDNBias%GCBiasProduct
U063_0086-112-4.181988putative endonuclease G
U063_0087-114-4.658061Type III restriction-modification system
U063_0088-214-3.640431Type III restriction-modification system DNA
U063_0089-116-2.821597Biotin synthase
U063_0090018-3.726037Inner membrane protein YihY
U063_0091318-4.128773Phosphatidate cytidylyltransferase
U063_0093215-3.458415Hypothetical protein
U063_0094316-2.997756hypothetical protein
U063_0095417-3.158720hypothetical protein
U063_0096721-3.353705hypothetical protein
U063_0097519-3.131840OrfA in transposon IS607
U063_0098218-2.276046OrfB in transposon IS607
2U063_0200U063_0233Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
U063_02002160.739615SSU ribosomal protein S16p
U063_0201115-1.042092KH domain RNA binding protein YlqC
U063_0202113-2.24144216S rRNA processing protein RimM
U063_0203215-2.053803tRNA (Guanine37-N1) -methyltransferase
U063_0204622-3.433605LSU ribosomal protein L19p
U063_0205822-5.219949hypothetical protein
U063_02071125-6.696795hypothetical protein
U063_02081026-5.859158hypothetical protein
U063_02091127-5.006157hypothetical protein
U063_0210925-5.438550DNA topoisomerase I
U063_0211726-5.730757OrfB in transposon IS607
U063_0212727-5.808033OrfA in transposon IS607
U063_0213728-6.157124hypothetical protein
U063_0214629-7.084216hypothetical protein
U063_0215730-8.053179hypothetical protein
U063_02161029-8.001063hypothetical protein
U063_02171230-7.675258hypothetical protein
U063_02181330-7.741896hypothetical protein
U063_02191230-8.102243hypothetical protein
U063_02201126-8.205784integrase/recombinase XerD
U063_0221631-7.014199hypothetical protein
U063_0222531-7.559076hypothetical protein
U063_0223526-7.194975hypothetical protein
U063_0224625-6.500547hypothetical protein
U063_0225525-4.950119hypothetical protein
U063_0226725-4.484781DNA topoisomerase I
U063_0227725-4.603612hypothetical protein
U063_0228823-4.679076OrfB in transposon IS607
U063_0229923-4.978418OrfA in transposon IS607
U063_0230822-5.096575putative lipoprotein
U063_0231822-5.662425hypothetical protein
U063_0232518-4.202692transposase A
U063_0233214-3.367260transposase OrfB
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_0214IGASERPTASE300.010 IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 30.0 bits (67), Expect = 0.010
Identities = 20/128 (15%), Positives = 44/128 (34%), Gaps = 9/128 (7%)

Query: 92 NSTATQQENTKQNQAIEQNGTTQAKEPQSKQEPKKTLHPDE-------PWLDYDPKAHKC 144
N ++ T I QA P ++ DE P +
Sbjct: 982 NPEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVA 1041

Query: 145 LQERQKEEIQEKAQSNNSDEPWIEHGKRMQEKAKAHYQACLEREKAKELAKEQNNAQKEV 204
+Q+ + EK + + ++ + + ++AK++ +A + + + E Q
Sbjct: 1042 ENSKQESKTVEKNEQDATET--TAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTE 1099

Query: 205 KKEMPTID 212
KE T++
Sbjct: 1100 TKETATVE 1107



Score = 28.1 bits (62), Expect = 0.037
Identities = 27/134 (20%), Positives = 48/134 (35%), Gaps = 7/134 (5%)

Query: 41 RQQAKTL-KNLDSATQSVGVNAIK-EQNKANKNSEQPKNSQNEPRQETTNAQTNSTATQQ 98
+Q++KT+ KN AT++ N ++ K+N + N + ET QT T
Sbjct: 1045 KQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETA 1104

Query: 99 ENTKQNQAIEQNGTTQAKEPQSKQEPKKTLHPDEPWLDYDPKAHKCLQERQKEEIQEKAQ 158
K+ +A + TQ + Q K + +P E ++ Q
Sbjct: 1105 TVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPAR-----ENDPTVNIKEPQ 1159

Query: 159 SNNSDEPWIEHGKR 172
S + E +
Sbjct: 1160 SQTNTTADTEQPAK 1173


3U063_0346U063_0399Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
U063_03462141.540212NADH ubiquinone oxidoreductase chain A
U063_03472130.853116NAD-dependent protein deacetylase
U063_03482120.047195Putative integral membrane protein
U063_03490110.237300Orotate phosphoribosyltransferase
U063_0350-1100.297491Ribosome recycling factor
U063_03510100.308466Preprotein translocase subunit SecG
U063_0352010-0.046555biotin synthesis protein
U063_035328-0.288202Tryptophanyl-tRNA synthetase
U063_035429-0.798989periplasmic oligopeptide-binding protein OppA
U063_0355110-0.905806Oligopeptide transport system permease protein
U063_0356210-0.967494hypothetical protein
U063_0357111-1.029228Shikimate 5-dehydrogenase I alpha
U063_0358-2110.7564483'-to-5' exoribonuclease RNase R
U063_03590112.403149hypothetical protein
U063_03600123.363509SSU ribosomal protein S6p
U063_0361-1113.787411Single-stranded DNA-binding protein
U063_0362-1103.558860SSU ribosomal protein S18p
U063_0363-1103.533864Outer membrane protein HopT (BabB)
U063_0364-1112.838716Alanyl-tRNA synthetase
U063_03650110.101199Septum formation protein Maf
U063_0366013-0.895000Formamidase
U063_0367217-2.433746Prephenate and/or arogenate dehydrogenase
U063_0368421-4.429380Adenine-specific methyltransferase
U063_0369621-4.400488DNA-cytosine methyltransferase
U063_0372521-5.439841OrfB in transposon IS607
U063_0373523-5.090737OrfA in transposon IS607
U063_0374423-5.348596hypothetical protein
U063_0375622-7.261003hypothetical protein
U063_0376619-6.156283hypothetical protein
U063_0377618-6.484679conjugal transfer protein TraG
U063_0378717-6.070883hypothetical protein
U063_0379415-4.047967hypothetical protein
U063_0380416-4.209797hypothetical protein
U063_0381415-3.718306hypothetical protein
U063_0382415-3.731577hypothetical protein
U063_0383415-3.778681Plasmid partitioning protein ParA
U063_0384516-4.136458hypothetical protein
U063_0385724-6.340561hypothetical protein
U063_0386925-5.691629OrfB in transposon IS607
U063_0387925-5.691629OrfA in transposon IS607
U063_0388825-5.548149hypothetical protein
U063_0389724-5.368468hypothetical protein
U063_0390314-2.017242hypothetical protein
U063_039107-0.501551OrfA in transposon IS607
U063_0392-18-0.162293OrfB in transposon IS607
U063_0393-190.430252hypothetical protein
U063_03940100.383387adenine/cytosine DNA methyltransferase
U063_03951100.345539Proline/sodium symporter PutP
U063_0396111-0.068406Proline dehydrogenase
U063_0397416-0.825169hypothetical protein
U063_0398314-0.896844hypothetical protein
U063_03992140.049541hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_0351SECGEXPORT495e-10 Protein-export SecG membrane protein signature.
		>SECGEXPORT#Protein-export SecG membrane protein signature.

Length = 110

Score = 48.8 bits (116), Expect = 5e-10
Identities = 25/84 (29%), Positives = 47/84 (55%), Gaps = 3/84 (3%)

Query: 1 MTSALLGLQIVLAVLIVVVVLLQ--KSSSIGLGAYSGSNESLFGAKGPASFMAKLTMFLG 58
M ALL + +++A+ +V +++LQ K + +G +G++ +LFG+ G +FM ++T L
Sbjct: 1 MYEALLVVFLIVAIGLVGLIMLQQGKGADMGASFGAGASATLFGSSGSGNFMTRMTALLA 60

Query: 59 LLFVINTIALGYFYNKEYGKSILD 82
LF I ++ LG N +
Sbjct: 61 TLFFIISLVLGNI-NSNKTNKGSE 83


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_0356IGASERPTASE290.010 IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 29.3 bits (65), Expect = 0.010
Identities = 12/78 (15%), Positives = 26/78 (33%), Gaps = 3/78 (3%)

Query: 48 EMERQNRALSPEQEEANTTTTI---AEENPTKDPPLPLETVVQEKENKQENKQEQEKETK 104
E+E++N+ + + + ++ E V ++ +
Sbjct: 984 EVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAEN 1043

Query: 105 PKQNSASPTQNHQKALTT 122
KQ S + +N Q A T
Sbjct: 1044 SKQESKTVEKNEQDATET 1061



Score = 27.7 bits (61), Expect = 0.038
Identities = 16/69 (23%), Positives = 24/69 (34%), Gaps = 11/69 (15%)

Query: 51 RQNRALSPEQEEANTTTTIAEENPTKDPPLPLETVVQEKENKQENKQEQEKETKPKQNSA 110
+ N E T TT +E T EKE K + + E+ +E +
Sbjct: 1081 QTNEVAQSGSETKETQTTETKETAT-----------VEKEEKAKVETEKTQEVPKVTSQV 1129

Query: 111 SPTQNHQKA 119
SP Q +
Sbjct: 1130 SPKQEQSET 1138


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_0367SHIGARICIN290.016 Ribosome inactivating protein family signature.
		>SHIGARICIN#Ribosome inactivating protein family signature.

Length = 289

Score = 29.0 bits (65), Expect = 0.016
Identities = 11/74 (14%), Positives = 19/74 (25%), Gaps = 5/74 (6%)

Query: 36 TPVKKSATIIDLGGAKAQIIRNIPKSIRKNFIAAHPMCGTEFYGPKASVKGLYENALVIL 95
P + L GA + ++RK + Y L
Sbjct: 18 APAVEGDVSFRLSGATSSSYGVFISNLRKALPYERKLYDIPLLRSTLPGSQRY-----AL 72

Query: 96 CDLEDSGTEQVEIA 109
L + E + +A
Sbjct: 73 IHLTNYADETISVA 86


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_0374cloacin300.013 Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 29.7 bits (66), Expect = 0.013
Identities = 28/79 (35%), Positives = 35/79 (44%), Gaps = 8/79 (10%)

Query: 159 GYDNNPNSPSNNAINGKDGANGSNGYGINGNDGINGSSGSNGNNSNHNAVGSGIDTDGVL 218
G++ +S S N ING G G G+ G +GS S+ NN GSGI G
Sbjct: 8 GHNTGAHSTSGN-ING-----GPTGLGVGGG-ASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 219 GV-DGVNGSNSSSGVSVGG 236
G +G NS G GG
Sbjct: 61 GHGNGGGNGNSGGGSGTGG 79



Score = 28.5 bits (63), Expect = 0.024
Identities = 19/75 (25%), Positives = 28/75 (37%), Gaps = 1/75 (1%)

Query: 159 GYDNNPNSPSNNAINGKDGANGSNGYGINGNDGINGSSGSNGNNSNHNAVGSGIDTDGVL 218
+NNP + + G +G G NGN G GS ++ V G
Sbjct: 40 SSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGG-GSGTGGNLSAVAAPVAFGFPALSTP 98

Query: 219 GVDGVNGSNSSSGVS 233
G G+ S S+ +S
Sbjct: 99 GAGGLAVSISAGALS 113


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_0375BINARYTOXINA270.048 Clostridial binary toxin A signature.
		>BINARYTOXINA#Clostridial binary toxin A signature.

Length = 454

Score = 26.9 bits (59), Expect = 0.048
Identities = 27/91 (29%), Positives = 45/91 (49%), Gaps = 13/91 (14%)

Query: 1 MFYTYHTEQPPLNEEQHKILQNAIANNEVDKRVLGVWKFN-----------SENQNNLTQ 49
FY Y E P E+++K L+NAI+ N++DK + + + +ENQN ++
Sbjct: 99 YFYDYQIESNP-REKEYKNLRNAISKNKIDKPINVYYFESPEKFAFNKEIRTENQNEISL 157

Query: 50 NNTNEMVKNTTNDDLIKDNNIKQNNIIDNNN 80
NE +K T D L K + K ++ + N
Sbjct: 158 EKFNE-LKETIQDKLFKQDGFKDVSLYEPGN 187


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_0380RTXTOXIND300.039 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 29.8 bits (67), Expect = 0.039
Identities = 22/172 (12%), Positives = 52/172 (30%), Gaps = 20/172 (11%)

Query: 249 ETELDALEKQARNNKSFRHESYFYKVL-GSATSQIESLKKRENALSDHLDSLKSLLEKTH 307
+ ++E E YF V +K++ + + + L+K
Sbjct: 154 QILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKR 213

Query: 308 WEKEKFTPPTNEKE-----LNQQLKEIKWLNKESLTPKNTYKKIQKLAVCKSPLIKDYLY 362
E+ N E +L + L + K+ + + Y+
Sbjct: 214 AERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENK----------YVE 263

Query: 363 TTKKLFATQKKIIALEKDYKDLK----VLKEEFSKDLEADLSHSKKRFELYT 410
+L + ++ +E + K ++ + F ++ L + L T
Sbjct: 264 AVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLT 315


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_0384FbpA_PF05833320.033 Fibronectin-binding protein
		>FbpA_PF05833#Fibronectin-binding protein

Length = 577

Score = 32.1 bits (73), Expect = 0.033
Identities = 42/236 (17%), Positives = 88/236 (37%), Gaps = 29/236 (12%)

Query: 1238 EQDYKIIKDFMDKVGKNNINLSEQTLNEYFIHHPENILGHLSLEKTRYSFETNGEQI--- 1294
++ ++ KD ++ N + T N F+ L K + ++++ + +
Sbjct: 229 KEIVEVCKDLFKEIQSNKFEFNCYTKNNSFVGFYCLNLMSKEDYK-KIQYDSSSKLLENF 287

Query: 1295 --YKYELQALEDKNLDLSQALNQAIEKLPKGVYQYHKTTLKTDALIIDANNERYQEVQKL 1352
K + L+ K+ DL + + I + K + T K + + + ++ +L
Sbjct: 288 YYAKDKSDRLKSKSSDLQKIVMNNINRCTKKDKILNNTLKKCE------DKDIFKLYGEL 341

Query: 1353 IK----NLERG-ELVKWDDLYFQLEQNNEMGIFLKPTKINSKVQDSRLKAYFKIKDALND 1407
+ L++G ++ + Y E + + I L K S+ S K Y K+K +
Sbjct: 342 LTANIYALKKGLSHIELANYY--SENYDTVKITLDENKTPSQNVQSYYKKYNKLKKSEEA 399

Query: 1408 L------TSAELNPLSS---DLELESKRVRLNLVYDEFVKKFGYLNENKNRKDIKQ 1454
ELN L S ++ + + E ++ GY+ K K K
Sbjct: 400 ANEQLLQNEEELNYLYSVLTNINNADNYDEIEEIKKELIET-GYIKFKKIYKSKKS 454


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_0396ANTHRAXTOXNA310.036 Anthrax toxin LF subunit signature.
		>ANTHRAXTOXNA#Anthrax toxin LF subunit signature.

Length = 800

Score = 30.9 bits (69), Expect = 0.036
Identities = 36/173 (20%), Positives = 71/173 (41%), Gaps = 19/173 (10%)

Query: 121 QEESQLKERILKRKNEKIILNVNFIGEEVLGEEEANARFEKY---SQALKSNYIQYISIK 177
Q+ S+ ++ + + EK+ F+ E+ + + Y S+ K Y +
Sbjct: 118 QDLSEEEKNSMNSRGEKVPFASRFVFEKKRETPKLIINIKDYAINSEQSKEVYYEIGKGI 177

Query: 178 ITTIFSQINILDFEY-----SKKEIVKRLDALYALALEEEKKQGMPKFINLDMEEFRDLE 232
I S+ LD E+ S + D L++ +E K + K I+++ ++
Sbjct: 178 SLDIISKDKSLDPEFLNLIKSLSDDSDSSDLLFSQKFKE-KLELNNKSIDINF-----IK 231

Query: 233 LTVESFMESIAK-----FDLNAGIVLQAYIPDSYEYLKKLHAFSKERVLKGLK 280
+ F + + F + VL+ Y PD +EY+ KL E++ + LK
Sbjct: 232 ENLTEFQHAFSLAFSYYFAPDHRTVLELYAPDMFEYMNKLEKGGFEKISESLK 284


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_0398GPOSANCHOR421e-06 Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 42.4 bits (99), Expect = 1e-06
Identities = 17/147 (11%), Positives = 45/147 (30%)

Query: 3 KAQEANTKLNRERNDLAREKENLTKANTELKTERDNLNNQLNASQKQVKELEQSQQVLKN 62
N L ++L E N + + +++ + + +LE++ + N
Sbjct: 75 DLSFNNKALKDHNDELTEELSNAKEKLRKNDKSLSEKASKIQELEARKADLEKALEGAMN 134

Query: 63 EKAELTKDKENLTKANAELKTENQKLTQEKTELTEKNKALTTEKEKLNTDLSNAKSQVIQ 122
+ + L A L L + + A + + + L + + +++ +
Sbjct: 135 FSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAE 194

Query: 123 LNQEKDKLEQKYAPYKKLEKLYEVFSE 149
L + + K E
Sbjct: 195 LEKALEGAMNFSTADSAKIKTLEAEKA 221



Score = 40.0 bits (93), Expect = 6e-06
Identities = 29/144 (20%), Positives = 54/144 (37%)

Query: 2 LKAQEANTKLNRERNDLAREKENLTKANTELKTERDNLNNQLNASQKQVKELEQSQQVLK 61
A+E K ++ ++ A + + L +L+ + N A ++K LE + L
Sbjct: 95 SNAKEKLRKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALA 154

Query: 62 NEKAELTKDKENLTKANAELKTENQKLTQEKTELTEKNKALTTEKEKLNTDLSNAKSQVI 121
KA+L K E + + + L EK L + L E + +++
Sbjct: 155 ARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIK 214

Query: 122 QLNQEKDKLEQKYAPYKKLEKLYE 145
L EK L + A +K +
Sbjct: 215 TLEAEKAALAARKADLEKALEGAM 238



Score = 38.1 bits (88), Expect = 3e-05
Identities = 28/133 (21%), Positives = 53/133 (39%)

Query: 3 KAQEANTKLNRERNDLAREKENLTKANTELKTERDNLNNQLNASQKQVKELEQSQQVLKN 62
A +T + + L EK L +L+ + N A ++K LE + L+
Sbjct: 201 GAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEA 260

Query: 63 EKAELTKDKENLTKANAELKTENQKLTQEKTELTEKNKALTTEKEKLNTDLSNAKSQVIQ 122
+AEL K E + + + L EK L + L + + LN + + + +
Sbjct: 261 RQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDA 320

Query: 123 LNQEKDKLEQKYA 135
+ K +LE ++
Sbjct: 321 SREAKKQLEAEHQ 333



Score = 34.7 bits (79), Expect = 4e-04
Identities = 26/129 (20%), Positives = 51/129 (39%)

Query: 3 KAQEANTKLNRERNDLAREKENLTKANTELKTERDNLNNQLNASQKQVKELEQSQQVLKN 62
L E+ L + L KA + ++ + + LE + L++
Sbjct: 243 ADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEH 302

Query: 63 EKAELTKDKENLTKANAELKTENQKLTQEKTELTEKNKALTTEKEKLNTDLSNAKSQVIQ 122
+ L ++++L + + ++L E +L E+NK ++ L DL ++ Q
Sbjct: 303 QSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQ 362

Query: 123 LNQEKDKLE 131
L E KLE
Sbjct: 363 LEAEHQKLE 371



Score = 30.4 bits (68), Expect = 0.007
Identities = 34/145 (23%), Positives = 62/145 (42%)

Query: 1 MLKAQEANTKLNRERNDLAREKENLTKANTELKTERDNLNNQLNASQKQVKELEQSQQVL 60
L E+ L EK +L + L R +L L+AS++ K+LE Q L
Sbjct: 276 STADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKL 335

Query: 61 KNEKAELTKDKENLTKANAELKTENQKLTQEKTELTEKNKALTTEKEKLNTDLSNAKSQV 120
+ + +++L + + ++L E +L E+NK ++ L DL ++
Sbjct: 336 EEQNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAK 395

Query: 121 IQLNQEKDKLEQKYAPYKKLEKLYE 145
Q+ + ++ K A +KL K E
Sbjct: 396 KQVEKALEEANSKLAALEKLNKELE 420


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_0399GPOSANCHOR391e-05 Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 38.5 bits (89), Expect = 1e-05
Identities = 35/208 (16%), Positives = 67/208 (32%), Gaps = 3/208 (1%)

Query: 15 RKELEARIGELEDENTELLREREYLAAETSELKDDNDQLRQKNDKLFITKDKLTKENAAL 74
+ELEAR +LE + +A+ L+ + L + L + + A
Sbjct: 115 IQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTAD 174

Query: 75 TTENDRLNHQVIALTKEQDSLKQERAQLQDAHGFLEKLCADLEKDNQHLTDKLKKLESTQ 134
+ + L + AL Q L++ + LE + L + LE
Sbjct: 175 SAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKAL 234

Query: 135 KNLENSNDQLLQVKEKIAEEKTELEREMVRLKSLEATGKSDLDLHNR---RLASANQDLK 191
+ N + + + EK LE L+ + + L + L+
Sbjct: 235 EGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALE 294

Query: 192 RQNRKLEEENIALKERVDGLKEQLSKQQ 219
+ LE ++ L L+ L +
Sbjct: 295 AEKADLEHQSQVLNANRQSLRRDLDASR 322



Score = 37.0 bits (85), Expect = 4e-05
Identities = 34/210 (16%), Positives = 69/210 (32%), Gaps = 3/210 (1%)

Query: 13 QVRKELEARIGELEDENTELLREREYLAAETSELKDDNDQLRQKNDKLFITKDKLTKENA 72
+L L+D N EL E + + + K +L K L K
Sbjct: 71 LKNSDLSFNNKALKDHNDELTEELSNAKEKLRKNDKSLSEKASKIQELEARKADLEKALE 130

Query: 73 ALTTENDRLNHQVIALTKEQDSLKQERAQLQDAHGFLEKLCADLEKDNQHLTDKLKKLES 132
+ + ++ L E+ +L +A L+ A + L + LE+
Sbjct: 131 GAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEA 190

Query: 133 TQKNLENSNDQLLQVKEKIAEEKTELEREMVRLKSLEAT---GKSDLDLHNRRLASANQD 189
Q LE + + + + + LE E L + +A + ++ +
Sbjct: 191 RQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKT 250

Query: 190 LKRQNRKLEEENIALKERVDGLKEQLSKQQ 219
L+ + LE L++ ++G +
Sbjct: 251 LEAEKAALEARQAELEKALEGAMNFSTADS 280



Score = 33.9 bits (77), Expect = 4e-04
Identities = 28/206 (13%), Positives = 63/206 (30%)

Query: 16 KELEARIGELEDENTELLREREYLAAETSELKDDNDQLRQKNDKLFITKDKLTKENAALT 75
K L+ EL +E + + SE +L + L + + A +
Sbjct: 81 KALKDHNDELTEELSNAKEKLRKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTADS 140

Query: 76 TENDRLNHQVIALTKEQDSLKQERAQLQDAHGFLEKLCADLEKDNQHLTDKLKKLESTQK 135
+ L + AL + L++ + LE + L + +LE +
Sbjct: 141 AKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALE 200

Query: 136 NLENSNDQLLQVKEKIAEEKTELEREMVRLKSLEATGKSDLDLHNRRLASANQDLKRQNR 195
N + + + EK L L+ + + ++ + +
Sbjct: 201 GAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEA 260

Query: 196 KLEEENIALKERVDGLKEQLSKQQNH 221
+ E AL+ ++ +K +
Sbjct: 261 RQAELEKALEGAMNFSTADSAKIKTL 286



Score = 32.3 bits (73), Expect = 0.001
Identities = 41/205 (20%), Positives = 75/205 (36%), Gaps = 3/205 (1%)

Query: 16 KELEARIGELEDENTELLREREYLAAETSELKDDNDQLRQKNDKLFITKDKLTKENAALT 75
LEAR ELE + +A+ L+ + L + L + + A +
Sbjct: 186 AALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADS 245

Query: 76 TENDRLNHQVIALTKEQDSLKQERAQLQDAHGFLEKLCADLEKDNQHLTDKLKKLESTQK 135
+ L + AL Q L++ + LE + L + LE +
Sbjct: 246 AKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQ 305

Query: 136 NLENSNDQLLQVKEKIAEEKTELEREMVRLKSLEATGKSDLDLHNRRLAS---ANQDLKR 192
L + L + + E K +LE E +L+ ++ R L + A + L+
Sbjct: 306 VLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQLEA 365

Query: 193 QNRKLEEENIALKERVDGLKEQLSK 217
+++KLEE+N + L+ L
Sbjct: 366 EHQKLEEQNKISEASRQSLRRDLDA 390



Score = 28.9 bits (64), Expect = 0.016
Identities = 26/128 (20%), Positives = 47/128 (36%)

Query: 15 RKELEARIGELEDENTELLREREYLAAETSELKDDNDQLRQKNDKLFITKDKLTKENAAL 74
K LEA L +L + E ++ L + L + +L K
Sbjct: 213 IKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGA 272

Query: 75 TTENDRLNHQVIALTKEQDSLKQERAQLQDAHGFLEKLCADLEKDNQHLTDKLKKLESTQ 134
+ + ++ L E+ +L+ E+A L+ L L +D + K+LE+
Sbjct: 273 MNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEH 332

Query: 135 KNLENSND 142
+ LE N
Sbjct: 333 QKLEEQNK 340


4U063_0410U063_0423Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
U063_04103203.588528Urease accessory protein UreD
U063_04114223.322091Urease accessory protein UreG
U063_04123202.484069Urease accessory protein UreF
U063_04132162.278006Urease accessory protein UreE
U063_04142182.272219Urea channel protein UreI
U063_04151162.234418Urease alpha subunit
U063_0416-3101.361877Urease beta subunit
U063_04180132.167900*Lipoprotein signal peptidase
U063_04191142.665852Phosphoglucosamine mutase
U063_04202141.950448SSU ribosomal protein S20p
U063_04212132.118333Peptide chain release factor 1
U063_04223151.996257hypothetical protein
U063_04233141.853954Outer membrane protein HorA
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_0415UREASE10410.0 Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 1041 bits (2694), Expect = 0.0
Identities = 352/569 (61%), Positives = 441/569 (77%), Gaps = 4/569 (0%)

Query: 3 KISRKEYASMYGPTTGDKVRLGDTDLIAEVEHDYTIYGEELKFGGGKTLREGMSQSN-NP 61
++SR YA+M+GPT GDKVRL DT+L EVE D+T +GEE+KFGGGK +R+GM QS
Sbjct: 4 RMSRAAYANMFGPTVGDKVRLADTELFIEVEKDFTTHGEEVKFGGGKVIRDGMGQSQVTR 63

Query: 62 SKEELDLIITNALIVDYTGIYKADIGIKDGKIAGIGKGGNKDMQDGVKNNLSVGPATEAL 121
+D +ITNALI+D+ GI KADIG+KDG+IA IGK GN DMQ GV + VGP TE +
Sbjct: 64 EGGAVDTVITNALILDHWGIVKADIGLKDGRIAAIGKAGNPDMQPGV--TIIVGPGTEVI 121

Query: 122 AGEGLIVTAGGIDTHIHFISPQQIPTAFASGVTTMIGGGTGPADGTNATTITPGRRNLKF 181
AGEG IVTAGG+D+HIHFI PQQI A SG+T M+GGGTGPA GT ATT TPG ++
Sbjct: 122 AGEGKIVTAGGMDSHIHFICPQQIEEALMSGLTCMLGGGTGPAHGTLATTCTPGPWHIAR 181

Query: 182 MLRAAEEYSMNFGFLAKGNVSNDASLADQIEAGAIGFKIHEDWGTTPSAINHALDVADKY 241
M+ AA+ + MN F KGN S +L + + GA K+HEDWGTTP+AI+ L VAD+Y
Sbjct: 182 MIEAADAFPMNLAFAGKGNASLPGALVEMVLGGATSLKLHEDWGTTPAAIDCCLSVADEY 241

Query: 242 DVQVAIHTDTLNEAGCVEDTMAAIAGRTMHTFHTEGAGGGHAPDIIKVAGEHNILPASTN 301
DVQV IHTDTLNE+G VEDT+AAI GRT+H +HTEGAGGGHAPDII++ G+ N++P+STN
Sbjct: 242 DVQVMIHTDTLNESGFVEDTIAAIKGRTIHAYHTEGAGGGHAPDIIRICGQPNVIPSSTN 301

Query: 302 PTIPFTVNTEAEHMDMLMVCHHLDKSIKEDVQFADSRIRPQTIAAEDTLHDMGIFSITSS 361
PT P+TVNT AEH+DMLMVCHHL +I ED+ FA+SRIR +TIAAED LHD+G FSI SS
Sbjct: 302 PTRPYTVNTLAEHLDMLMVCHHLSPTIPEDIAFAESRIRKETIAAEDILHDIGAFSIISS 361

Query: 362 DSQAMGRVGEVITRTWQTADKNKKEFGRLKEEKGDNDNFRIKRYLSKYTINPAIAHGISE 421
DSQAMGRVGEV RTWQTADK K++ GRLKEE GDNDNFR+KRY++KYTINPAIAHG+S
Sbjct: 362 DSQAMGRVGEVAIRTWQTADKMKRQRGRLKEETGDNDNFRVKRYIAKYTINPAIAHGLSH 421

Query: 422 YVGSVEVGKVADLVLWSPAFFGVKPNMIIKGGFIALSQMGDANASIPTPQPVYYREMFAH 481
+GS+EVGK ADLVLW+PAFFGVKP+M++ GG IA + MGD NASIPTPQPV+YR MF
Sbjct: 422 EIGSLEVGKRADLVLWNPAFFGVKPDMVLLGGTIAAAPMGDPNASIPTPQPVHYRPMFGA 481

Query: 482 HGKAKYDANITFVSQAAYDKGIKEELGLERQVLPVKNCR-NITKKDMQFNDTTAHIEVNS 540
+G+++ ++++TFVSQA+ D G+ LG+ ++++ V+N R I K M N T HIEV+
Sbjct: 482 YGRSRTNSSVTFVSQASLDAGLAGRLGVAKELVAVQNTRGGIGKASMIHNSLTPHIEVDP 541

Query: 541 ETYHVFVDGKEVTSKPANKVSLAQLFSIF 569
ETY V DG+ +T +PA + +AQ + +F
Sbjct: 542 ETYEVRADGELLTCEPATVLPMAQRYFLF 570


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_0423IGASERPTASE394e-05 IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 39.3 bits (91), Expect = 4e-05
Identities = 59/325 (18%), Positives = 113/325 (34%), Gaps = 22/325 (6%)

Query: 65 KVAQNTASNDSQEATTLENTASTDNITATTDETYTKSTDTTVAGTAQKVETDNTAVQSAE 124
+V + + D+ TT N + + +E + + V A ++ T +
Sbjct: 984 EVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAEN 1043

Query: 125 QTLKTDVAKVQ------ADASAKDFDETTFQADQAVEQTAETNLQKAENQLTKDQNTLET 178
++ + A ++ + +A QT E +E + T+ T ET
Sbjct: 1044 SKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKET 1103

Query: 179 ALKD-QTPSTPTTPPTKEEPKHTASSGTPTPSTPPAKEEPKHTASTPTPSTIGVASQLVK 237
A + + + T T+E PK T+ P +E+ + P+ + +K
Sbjct: 1104 ATVEKEEKAKVETEKTQEVPKVTS-------QVSPKQEQSETVQPQAEPARENDPTVNIK 1156

Query: 238 DTTTVNNLKSVSVSGMNTTLSGVKTMSQQSATIGNLLNSSTDLSSVIPNAQGLSSAFSAL 297
+ + N + + T S V+ +S T+ N NS + A + S
Sbjct: 1157 EPQSQTNTTADTEQPAKETSSNVEQPVTESTTV-NTGNSVVENPENTTPATTQPTVNSES 1215

Query: 298 ES-AQNTLKGYLDSSSATIGQLTNGSN--AVVGALDKAINQVDMALSDLRATDTQKTQAV 354
+ +N + + S + T SN + V D + LSD RA K Q V
Sbjct: 1216 SNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTNAVLSDARA----KAQFV 1271

Query: 355 TLATTGSSATTTDAINFLNALKNNL 379
L + + + N + N+
Sbjct: 1272 ALNVGKAVSQHISQLEMNNEGQYNV 1296



Score = 38.1 bits (88), Expect = 1e-04
Identities = 49/268 (18%), Positives = 90/268 (33%), Gaps = 18/268 (6%)

Query: 52 KLTSDSPTQQQDQKVAQNTASNDSQEATTLENTASTDNITATTDETYTKSTDTTVAGTAQ 111
K D+ + A ++ + T A + + T T T TK T T
Sbjct: 1053 KNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKA 1112

Query: 112 KVETDNTAVQSAEQTLKTDVAKVQADASAKDFDETTFQADQAVEQTAETNLQKAENQLTK 171
KVET+ T +V KV + S K T Q + + + E Q
Sbjct: 1113 KVETEKTQ----------EVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQT 1162

Query: 172 DQNTLETALKDQTPSTPTTPPTKEEPKHTASSGTPTP--STPPAKEEPKHTASTPTPSTI 229
+ +T S P T+ +T +S P +TP + ++ S+ P
Sbjct: 1163 NTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNR 1222

Query: 230 ------GVASQLVKDTTTVNNLKSVSVSGMNTTLSGVKTMSQQSATIGNLLNSSTDLSSV 283
V + TT+ N+ +V++ + +T + ++ LN +S
Sbjct: 1223 HRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTNAVLSDARAKAQFVALNVGKAVSQH 1282

Query: 284 IPNAQGLSSAFSALESAQNTLKGYLDSS 311
I + + + + ++ SS
Sbjct: 1283 ISQLEMNNEGQYNVWVSNTSMNKNYSSS 1310


5U063_0444U063_0451Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
U063_04442121.488193hypothetical protein
U063_04451132.775620Beta-1,3-galactosyltransferase
U063_04462133.095630methyl-accepting chemotaxis protein
U063_04471113.5622042',3'-cyclic-nucleotide 2'-phosphodiesterase
U063_0448-2114.675700S-ribosylhomocysteine lyase
U063_0449-1123.816779Cystathionine gamma-lyase
U063_04500152.676024Cystathionine beta-synthase
U063_04512161.759927hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_0446OMS28PORIN300.022 OMS28 porin signature.
		>OMS28PORIN#OMS28 porin signature.

Length = 257

Score = 30.1 bits (67), Expect = 0.022
Identities = 35/163 (21%), Positives = 77/163 (47%), Gaps = 11/163 (6%)

Query: 375 HTEEELSSKVEQLSRNADDVKSILDIINDIADQTNLLALNAAIEAARAGEHGRGFAVVAD 434
H++++ + K++Q D V LD IN + + +++ +E R + A
Sbjct: 41 HSDQKDNKKLDQ----KDQVNQALDTINKVTED-----VSSKLEGVRESSLELVESNDAG 91

Query: 435 EVRNLAGRTQKSLAEINSTIMVIVQEINAVSSQMNLNSQKMERLSDMSK-SVQETYEKMS 493
V+ G + ++++ +V QE V+ + ++ ++ +MSK +VQET + +S
Sbjct: 92 VVKKFVG-SMSLMSDVAKGTVVASQEATIVAKCSGMVAEGANKVVEMSKKAVQETQKAVS 150

Query: 494 SNLSSVVSDSNQSMDDYAKSGHQIEVMVSDFAEVEKVASKTLA 536
+ Q M + + + ++E+ +FA+VE+V +A
Sbjct: 151 VAGEATFLIEKQIMLNKSPNNKELELTKEEFAKVEQVKETLMA 193


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_0447PF05704290.039 Capsular polysaccharide synthesis protein
		>PF05704#Capsular polysaccharide synthesis protein

Length = 307

Score = 29.5 bits (66), Expect = 0.039
Identities = 25/189 (13%), Positives = 55/189 (29%), Gaps = 11/189 (5%)

Query: 382 RKDVAYIYKFANTLIGVHITGENLLKYMEWSYRFYNQLQPGDLTI-SFNENIRGYNFDMF 440
++ VA + K + + I G N ++++ + Q G + F++ +R + +
Sbjct: 87 QQCVASVKKNSGDFKVIIIDGNNYKEWVDIPDFLIKRWQEGKMLDAWFSDILRLFLLCKY 146

Query: 441 SGV--KYQVDVTKPAGQRIINPTINNIPIDPKAIYKLAINNYRFGTLSTTLNLVTDTDR- 497
G+ V + I+ I+N+ S +
Sbjct: 147 GGLWIDATVYMFDKVPNYIVESNRFMFQSSFLESETTHISNWLIFVKSKNDPFLVGLKNS 206

Query: 498 ---YYDSYDALQDSGQIRDLIIKYITEEKGGK--VTPELEGNWE--IINYDFKNPLLEKL 550
Y + D D + ++ K N ++ Y P +
Sbjct: 207 MVTYLKKKEKPADYYIFHDFVSVMAVSKEYSKYWKEIPYVNNVNPHMLQYLGNLPYDNSM 266

Query: 551 REKLKEGSI 559
+K S
Sbjct: 267 FNYIKSTSP 275


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_0448LUXSPROTEIN2263e-79 Bacterial autoinducer-2 (AI-2) production protein Lu...
		>LUXSPROTEIN#Bacterial autoinducer-2 (AI-2) production protein LuxS

signature.
Length = 171

Score = 226 bits (577), Expect = 3e-79
Identities = 59/145 (40%), Positives = 90/145 (62%), Gaps = 7/145 (4%)

Query: 8 VESFNLDHTKVKAPYVRIADRKKGVNGDVIVKYDVRFKQPNKDHMDMPSLHSLEHLVAEI 67
++SF +DHT++ AP VR+A + GD I +D+RF PNKD + +H+LEHL A
Sbjct: 3 LDSFTVDHTRMNAPAVRVAKTMQTPKGDTITVFDLRFTAPNKDILSEKGIHTLEHLYAGF 62

Query: 68 IRNHA----SYVVDWSPMGCQTGFYLTVLNHDNYTEVLEVLEKTMQDVLKA---TEVPAS 120
+RNH ++D SPMGC+TGFY++++ + +V + M+DVLK ++P
Sbjct: 63 MRNHLNGDSVEIIDISPMGCRTGFYMSLIGTPSEQQVADAWIAAMEDVLKVENQNKIPEL 122

Query: 121 NEKQCGWAANHTLEGAKNLARAFLD 145
NE QCG AA H+L+ AK +A+ L+
Sbjct: 123 NEYQCGTAAMHSLDEAKQIAKNILE 147


6U063_0481U063_0486Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
U063_04812121.710120A/G-specific adenine glycosylase
U063_04824132.2736662-oxoglutarate/malate translocator
U063_04833141.005927Cytochrome c oxidase subunit CcoN
U063_0484216-0.362091Cytochrome c oxidase subunit CcoO
U063_0485316-1.533452Cytochrome c oxidase subunit CcoQ
U063_0486215-0.901043Cytochrome c oxidase subunit CcoP
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_0484PF07201290.021 Hypersensitivity response secretion protein HrpJ
		>PF07201#Hypersensitivity response secretion protein HrpJ

Length = 293

Score = 28.7 bits (64), Expect = 0.021
Identities = 13/77 (16%), Positives = 29/77 (37%), Gaps = 6/77 (7%)

Query: 146 FDTAYAEALTQKKVFGVPYDTENGVKLGSVEEAKKAYLEEAKKITADMKDKRVLDAIQRG 205
F +L ++K+ +++ ++ VEE YL + ++ +L +
Sbjct: 60 FSERKELSLDKRKL------SDSQARVSDVEEQVNQYLSKVPELEQKQNVSELLSLLSNS 113

Query: 206 EVLEIVALIAYLNSLGN 222
+ + L AYL
Sbjct: 114 PNISLSQLKAYLEGKSE 130


7U063_0636U063_0649Y        NNGenomic Island
LocusTagDNBiasCDNBias%GCBiasProduct
U063_06362163.335950LSU ribosomal protein L21p
U063_06372163.370321LSU ribosomal protein L27p
U063_06381143.459018Dipeptide-binding ABC transporter, periplasmic
U063_06390144.054930Dipeptide transport system permease protein
U063_0640-1133.151170Dipeptide transport system permease protein
U063_0641-2133.046438Dipeptide transport ATP-binding protein DppD
U063_0642-2132.741000Dipeptide transport ATP-binding protein DppF
U063_0643-2122.243046GTP-binding protein Obg
U063_0644-1131.726441hypothetical protein
U063_06450172.242680putative periplasmic protein
U063_06460161.320812Glutamate-1-semialdehyde aminotransferase
U063_0647017-1.092582membrane protein
U063_0648117-0.933268hypothetical protein
U063_06492150.096875putative N-carbamoyl-D-amino acid
8U063_0666U063_0690Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
U063_0666315-1.840913Flagellar L-ring protein FlgH
U063_0667314-1.555094CMP-N-acetylneuraminic acid synthetase
U063_0668213-0.778446CMP-N-acetylneuraminic acid synthetase
U063_0669213-0.357504flagellar protein G
U063_06701130.911236Tetraacyldisaccharide 4'-kinase
U063_06710151.623973NAD synthetase
U063_0673-1161.823836*Ketol-acid reductoisomerase
U063_0674-2181.206616Septum site-determining protein MinD
U063_0675-115-0.555805Cell division topological specificity factor
U063_0676117-1.233486Rossmann fold nucleotide-binding protein Smf
U063_0677128-3.200522Putative Holliday junction resolvase YggF
U063_0678332-5.793155hypothetical protein
U063_0679022-3.621427hypothetical protein
U063_0680021-3.083036hypothetical protein
U063_0683121-2.674806hypothetical protein
U063_0684121-3.176323hypothetical protein
U063_0685121-2.785804hypothetical protein
U063_0686222-3.468734Small-conductance mechanosensitive channel
U063_0687225-4.599883hypothetical protein
U063_0688122-3.705108hypothetical protein
U063_06891180.357239hypothetical protein
U063_06903142.179320hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_0666FLGLRINGFLGH1934e-64 Flagellar L-ring protein signature.
		>FLGLRINGFLGH#Flagellar L-ring protein signature.

Length = 232

Score = 193 bits (491), Expect = 4e-64
Identities = 52/172 (30%), Positives = 84/172 (48%), Gaps = 18/172 (10%)

Query: 56 GERPLFADRRAMKPNDLITIIVSEKASANYSSS----KDYKSASGGNSTPPRLTYNGLDE 111
G +PLF DRR D +TI++ E SA+ SSS +D K+ G ++ P L GL
Sbjct: 61 GYQPLFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYL--QGLFG 118

Query: 112 RKKQEAEYLDDKNNYNFTKSSNNTNFKGGGSQKKSEDLEIVLSARIIKVLENGNYFIYGN 171
+ + E S F G G S L+ + +VL NGN + G
Sbjct: 119 NARADVEA------------SGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGE 166

Query: 172 KEVLVDGEKQILKVSGVIRPYDIERNNTIQSKFLADAKIEYTNLGHLSDSNK 223
K++ ++ + ++ SGV+ P I +NT+ S +ADA+IEY G+++++
Sbjct: 167 KQIAINQGTEFIRFSGVVNPRTISGSNTVPSTQVADARIEYVGNGYINEAQN 218


9U063_0776U063_0796Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
U063_0776-1193.318598hypothetical protein
U063_0777-1182.895337[NiFe] hydrogenase nickel
U063_0778-1172.563627[NiFe] hydrogenase metallocenter assembly
U063_07790152.794267[NiFe] hydrogenase metallocenter assembly
U063_07801132.633764hypothetical protein
U063_07831153.856681**Outer membrane protein HopS (BabA)
U063_07840182.024497DNA-damage-inducible protein J
U063_0785-1161.976013hypothetical protein
U063_07861131.339237Acyl-CoA hydrolase
U063_07872100.243449Dehydrogenase
U063_0788211-0.132672Vitamin B12 ABC transporter, permease component
U063_078938-1.625040Zinc ABC transporter, ATP-binding protein ZnuC
U063_0792410-1.859382Cysteinyl-tRNA synthetase
U063_0793310-1.686239putative peptidoglycan lipid II flippase MurJ
U063_0794410-1.418032hypothetical protein
U063_0795412-1.556125Holliday junction DNA helicase RuvA
U063_0796212-1.175077hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_0784SECA290.004 SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 29.5 bits (66), Expect = 0.004
Identities = 13/69 (18%), Positives = 34/69 (49%), Gaps = 12/69 (17%)

Query: 4 SSTKKDYTKYSEKQLVNLIHQLERKIKKMQNDRVSFKEKMAKELEKRDQNFKDKIDALNE 63
S + + +++VN+I+ +E +++K+ ++ EL+ + F+ +++
Sbjct: 12 SRNDRTLRRM--RKVVNIINAMEPEMEKLSDE----------ELKGKTAEFRARLEKGEV 59

Query: 64 LLQKISQAF 72
L I +AF
Sbjct: 60 LENLIPEAF 68


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_0787DHBDHDRGNASE872e-22 2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 87.4 bits (216), Expect = 2e-22
Identities = 51/199 (25%), Positives = 91/199 (45%), Gaps = 6/199 (3%)

Query: 11 KVAIITGASSGIGLECALMLLDQGYKVYALSRHATLCVALNHALC------ESVDVDVSD 64
K+A ITGA+ GIG A L QG + A+ + + +L E+ DV D
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 65 SNALKEVFSNISAKEDHCDVLINSAGYGVFGSVEDTPIEEVKKQFSVNFFALCEVVQFCL 124
S A+ E+ + I + D+L+N AG G + EE + FSVN + +
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 125 PLLKNKPYSKIFNLSSIAGRVSMLFLGHYSASKHALEAYSDALRLELKPFNVQVCLIEPG 184
+ ++ I + S V + Y++SK A ++ L LEL +N++ ++ PG
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188

Query: 185 PVKSNWEKTAFSVENFESE 203
+++ + + ++ EN +
Sbjct: 189 STETDMQWSLWADENGAEQ 207


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_0792OMS28PORIN300.015 OMS28 porin signature.
		>OMS28PORIN#OMS28 porin signature.

Length = 257

Score = 30.1 bits (67), Expect = 0.015
Identities = 17/51 (33%), Positives = 32/51 (62%), Gaps = 4/51 (7%)

Query: 309 EEDLLVSKKRLDKIYRLKQRVLGTLGGINPNFKKEILECMQDDLNVSKALS 359
+E L+ S++ LD+ + Q+VL + G+NP+ K ++L +V+KA+S
Sbjct: 188 KETLMASERALDETVQEAQKVLNMVNGLNPSNKDQVLA----KKDVAKAIS 234


10U063_0826U063_0853Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
U063_0826212-2.274340putative periplasmic protein
U063_0827517-2.340758hypothetical protein
U063_0828819-2.364621cag pathogenicity island protein Cag Zeta Cag1
U063_0829818-2.542561cag pathogenicity island protein Cag2
U063_0830917-2.175004cag pathogenicity island protein Cag Delta
U063_0831918-2.735182cag pathogenicity island protein Cag Gamma
U063_0832820-2.770699cag pathogenicity island protein Cag Beta
U063_0833921-3.085825cag pathogenicity island protein Cag Alpha
U063_0834822-3.326847cag pathogenicity island protein CagZ
U063_0835922-3.138680cag pathogenicity island protein CagY
U063_08361028-4.434965cag pathogenicity island protein CagX
U063_08371031-4.459502cag pathogenicity island protein CagW
U063_08381331-5.406273cag pathogenicity island protein CagV
U063_08391131-5.489905cag pathogenicity island protein CagU
U063_08401226-5.315275cag pathogenicity island protein CagT
U063_08411024-5.750493cag pathogenicity island protein CagS
U063_0842621-4.323471cag pathogenicity island protein CagQ
U063_0843719-3.021143cag pathogenicity island protein CagP
U063_0844618-2.887136cag pathogenicity island protein CagM
U063_0845620-3.321025cag pathogenicity island protein CagN
U063_0846520-3.031263cag pathogenicity island protein CagL
U063_0847521-3.444696cag pathogenicity island protein CagI
U063_0848620-3.596332cag pathogenicity island protein CagH
U063_0849621-4.590775cag pathogenicity island protein CagG
U063_0850621-3.489021cag pathogenicity island protein CagF
U063_0851419-2.589466cag pathogenicity island protein CagE
U063_0852317-1.237545cag pathogenicity island protein CagD
U063_0853317-0.423463cag pathogenicity island protein CagC
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_0828TYPE3IMSPROT280.008 Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein

family signature.
Length = 354

Score = 27.8 bits (62), Expect = 0.008
Identities = 9/51 (17%), Positives = 19/51 (37%), Gaps = 1/51 (1%)

Query: 35 ALGLIGAGVLCCVLSGAMGIVGIIFVAIGIFLSFSNINLVKLIEKLFKKQS 85
L+ L + S + G + I IN ++ +++F +S
Sbjct: 87 CFPLLTVAALMAIASHVV-QYGFLISGEAIKPDIKKINPIEGAKRIFSIKS 136


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_0835IGASERPTASE384e-04 IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 38.1 bits (88), Expect = 4e-04
Identities = 55/293 (18%), Positives = 92/293 (31%), Gaps = 34/293 (11%)

Query: 41 KESSDHHLDNPTETQTHFDEDKLEETQTQMDSGGDETSESSNGSLADKLFKKARKLVDNK 100
+ + L NP E + T + D S SN + VD
Sbjct: 973 NVNGRYDLYNP-EVEKRNQTVDTTNITTPNNIQADVPSVPSNN--------EEIARVDEA 1023

Query: 101 RPFTQQKDLDEETQELNEEDDQENNGYQEETQIDLIDDETSKKTQQHSPQDLSNEETTEA 160
ET E E N QE ++ + + ++ T Q+ + +A
Sbjct: 1024 PVPPPAPATPSETTETVAE-----NSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKA 1078

Query: 161 NHFEDSSKESKESSDQHLDNPAETQTQETKTHFDEYKLEETQTQMDSEGNETSESSNGSL 220
N + +S + ETQT ETK K E+ + + + +S S
Sbjct: 1079 NTQTNEVAQSGSET-------KETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSP 1131

Query: 221 ADKLFKKARKLVDNKRPFTQQKDLDEETQELNEEDDQENNGYQEETQIDLIDDETSKKTQ 280
+ + + + R ++ E + N D E +ET ++ T T
Sbjct: 1132 KQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQP--AKETSSNVEQPVTESTTV 1189

Query: 281 QHSPQDLSN-----EETTEANHFEDSSKESKE------SSDQHLDNPAETQTN 322
+ N TT+ +SS + K S H PA T +N
Sbjct: 1190 NTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSN 1242



Score = 34.7 bits (79), Expect = 0.005
Identities = 28/158 (17%), Positives = 60/158 (37%), Gaps = 2/158 (1%)

Query: 665 KNAKTDEERKKCLKDLPKDLQSDILAKESVKAYKDCVSQAKTEAEKKECEKLLTPEAKKL 724
+ ++ E + + P + E+ + + Q EK E + T +
Sbjct: 1008 PSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNRE 1067

Query: 725 LEEEAKESVKAYLDC--VSQAKNEAEKKECEKLLTPEAKKKLEEAKKSVKAYLDCVSRAR 782
+ +EAK +VKA V+Q+ +E ++ + + +K E+AK + +
Sbjct: 1068 VAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTS 1127

Query: 783 NEKEKQECEKLLTPEAKKLLEQQALDCLKNAKTDEERK 820
KQE + + P+A+ E +K ++
Sbjct: 1128 QVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTT 1165



Score = 32.7 bits (74), Expect = 0.017
Identities = 32/278 (11%), Positives = 69/278 (24%), Gaps = 14/278 (5%)

Query: 1102 RARNEKEKKECEKLLTPEAKKLLEEAKESLKAYKDCLSQARNEEERRACEKLLTPEARKL 1161
+ + + TP + + S AR +E TP
Sbjct: 984 EVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEI-----ARVDEAPVPPPAPATPSETTE 1038

Query: 1162 LEQEVKKSVKAYLDCVSKARNEKEKQECEKLLTPEARKLLEQQALDCLKNAKTEADKKRC 1221
E K ++ + E Q E ++ Q + ++ + +
Sbjct: 1039 TVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTT 1098

Query: 1222 VKDLPKDLQKKVLAKESVKAYLDCVSRARNEKEKKECEKLLTPEAKKLLEEAKESLKAYK 1281
++K+ AK + + K+E + + P+A+ E
Sbjct: 1099 ETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEP 1158

Query: 1282 DCLSQARNEEE---RRACEKLLTPEARKLLEQEVKKSVKAYLDCVSRARNEKEKQECEKL 1338
+ + E + + P V+ + E
Sbjct: 1159 QSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNK 1218

Query: 1339 LTPEARKFLAKELQQKDKAIKDCLKNADPNDRAAIMKC 1376
R+ + + A NDR+ + C
Sbjct: 1219 PKNRHRRSVRSVPHNVEPA------TTSSNDRSTVALC 1250



Score = 31.6 bits (71), Expect = 0.041
Identities = 28/214 (13%), Positives = 62/214 (28%), Gaps = 11/214 (5%)

Query: 956 RARNEKEKKECEKLLTPEAKKLLEEAKESLKAYKDCLSQARNEEERRACEKLLTPEARKL 1015
+ + + TP + + S AR +E TP
Sbjct: 984 EVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEI-----ARVDEAPVPPPAPATPSETTE 1038

Query: 1016 LEQEVKKSVKAYLDCVSRARNEKEKQECEKLLTPEARKLLEQQALDCLKNAKTEADKKRC 1075
E K ++ + E Q E ++ Q + ++ + +
Sbjct: 1039 TVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTT 1098

Query: 1076 VKDLPKDLQKKVLAKESVKAYLDCVSRARNEKEKKECEKLLTPEAKKLLEEAKESLKAYK 1135
++K+ AK + + K+E + + P+A+ E
Sbjct: 1099 ETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVN---- 1154

Query: 1136 DCLSQARNEEERRACEKLLTPEARKLLEQEVKKS 1169
+ + +++ A + E +EQ V +S
Sbjct: 1155 --IKEPQSQTNTTADTEQPAKETSSNVEQPVTES 1186



Score = 31.6 bits (71), Expect = 0.044
Identities = 29/174 (16%), Positives = 69/174 (39%), Gaps = 22/174 (12%)

Query: 811 KNAKTDEERKKCLKDLP-KDLQKKVLAKESVKAYLDCVSQAKTEAEKKECEKLLTPEAKK 869
+ ++ E + + P ++ + + ++KT + ++ T + ++
Sbjct: 1008 PSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNRE 1067

Query: 870 LLEEAKESVKAYKDCVSRARNEKEKKECEKLLTPEARKLLEQQALDCLKNAKTEADKKRC 929
+ +EAK +VKA A++ E KE + T E + +++ AK E +K +
Sbjct: 1068 VAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEE------KAKVETEKTQE 1121

Query: 930 VKDLPKDLQKKVLAKESVKAYLDCVSRARNEKEKKECEKLLTPEAKKLLEEAKE 983
V KV ++ S K + ++E + + E + ++E +
Sbjct: 1122 V--------PKVTSQVSPK-------QEQSETVQPQAEPARENDPTVNIKEPQS 1160


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_0836TYPE4SSCAGX8710.0 Type IV secretion system CagX conjugation protein si...
		>TYPE4SSCAGX#Type IV secretion system CagX conjugation protein

signature.
Length = 522

Score = 871 bits (2252), Expect = 0.0
Identities = 513/522 (98%), Positives = 517/522 (99%)

Query: 1 MGQAFFKKIVGCFCLGYLFLSSVIEAAAPDIKNFNRGRVKVVNKKIAYLGDEKPITIWTS 60
MGQAFFKKIVGCFCLGYLFLSS IEA A DIKNFNRGRVKVVNKKIAYLGDEKPITIWTS
Sbjct: 1 MGQAFFKKIVGCFCLGYLFLSSAIEAVALDIKNFNRGRVKVVNKKIAYLGDEKPITIWTS 60

Query: 61 LDNVTVIQLEKDETISYITTGFNKGWSIVPNSNHIFIQPKSVKSNLMFEKEAVNFALMTR 120
LDNVTVIQLEKDETISYITTGFNKGWSIVPNSNHIFIQPKSVKSNLMFEKEAVNFALMTR
Sbjct: 61 LDNVTVIQLEKDETISYITTGFNKGWSIVPNSNHIFIQPKSVKSNLMFEKEAVNFALMTR 120

Query: 121 DYQEFLKTKKLIVDAPDPKELEEQKKALEKEKEAKEQAQKAQKDKREKRKEERAKNRANL 180
DYQEFLKTKKLIVDAPDPKELEEQKKALEKEKEAKEQAQKAQKDKREKRKEERAKNRANL
Sbjct: 121 DYQEFLKTKKLIVDAPDPKELEEQKKALEKEKEAKEQAQKAQKDKREKRKEERAKNRANL 180

Query: 181 ENLTNAMSNPQNLSNNKNLSELIKQQRENELDQMERLEDMQEQAQANALKQIEELNKKQA 240
ENLTNAMSNPQNLSNNKNLSELIKQQRENELDQMERLEDMQEQAQANALKQIEELNKKQA
Sbjct: 181 ENLTNAMSNPQNLSNNKNLSELIKQQRENELDQMERLEDMQEQAQANALKQIEELNKKQA 240

Query: 241 EETIKQRAKDKINIKTDKPQKSPEDNSIELSPSDSAWRTNLVVRTNKALYQFILRIAQKD 300
EE ++QRAKDKI+IKTDK QKSPEDNSIELSPSDSAWRTNLVVRTNKALYQFILRIAQKD
Sbjct: 241 EEAVRQRAKDKISIKTDKSQKSPEDNSIELSPSDSAWRTNLVVRTNKALYQFILRIAQKD 300

Query: 301 NFASAYLTVKLEYPQRHEVSSVIEEELKKREEAKRQKELIKQENLNTTAYINRVMMASNE 360
NFASAYLTVKLEYPQRHEVSSVIEEELKKREEAKRQ+ELIKQENLNTTAYINRVMMASNE
Sbjct: 301 NFASAYLTVKLEYPQRHEVSSVIEEELKKREEAKRQRELIKQENLNTTAYINRVMMASNE 360

Query: 361 QIINKEKIREEKQKIILDQAKALETQYVHNALKRNPVPRNYNYYQAPEKRSKHIMPSEIF 420
QIINKEKIREEKQKIILDQAKALETQYVHNALKRNPVPRNYNYYQAPEKRSKHIMPSEIF
Sbjct: 361 QIINKEKIREEKQKIILDQAKALETQYVHNALKRNPVPRNYNYYQAPEKRSKHIMPSEIF 420

Query: 421 DDGTFTYFGFKNITLQPAIFVVQPDGKLSMTDAAIDPNMTNSGLRWYRVNEIAEKFKLIK 480
DDGTFTYFGFKNITLQPAIFVVQPDGKLSMTDAAIDPNMTNSGLRWYRVNEIAEKFKLIK
Sbjct: 421 DDGTFTYFGFKNITLQPAIFVVQPDGKLSMTDAAIDPNMTNSGLRWYRVNEIAEKFKLIK 480

Query: 481 DKALVTVINKGYGKNPLTKNYNIKNYGELERVIKKLPLVRDK 522
DKALVTVINKGYGKNPLTKNYNIKNYGELERVIKKLPLVRDK
Sbjct: 481 DKALVTVINKGYGKNPLTKNYNIKNYGELERVIKKLPLVRDK 522


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_0838PF043351194e-35 VirB8 type IV secretion protein
		>PF04335#VirB8 type IV secretion protein

Length = 227

Score = 119 bits (300), Expect = 4e-35
Identities = 43/205 (20%), Positives = 73/205 (35%), Gaps = 10/205 (4%)

Query: 27 KLNKANRTFKRAFYL---SMVLNVAAVTSIVMMMPLKKTDIFVYGIDRYTGEFKIVKRSD 83
KL A R+ K A+ + + L A V ++ + PLK + +V +DR TGE I +
Sbjct: 24 KLAAAERSKKLAWVVAGVAGALATAGVVAVAALTPLKTVEPYVITVDRNTGEASIAAKLH 83

Query: 84 A-RQIVNSEAVVDSATSKFVSLLFGYSKNSLRDRKDQLMQYCDVSFQTQAMRMFNENIRQ 142
I EAV + +V G+ + + D +M Q + R + + Q
Sbjct: 84 GDATITYDEAVRKYFLATYVRYREGWIAAAREEYFDAVMVMSARPEQDRWSRFYKTDNPQ 143

Query: 143 FVDKVRA-EAIISSNIQREKVKNSPLTRLTFFITIKITPDTMENYEYITKKQVTIYYDFA 201
+ A + I + +F +T T TI Y
Sbjct: 144 SPQNILANRTDVFVEI-KRVSFLGGNVAQVYFTKESVTGSNS----TKTDAVATIKYKVD 198

Query: 202 RGNSSQENLIINPFGFKVFDIQITD 226
S + + NP G++V +
Sbjct: 199 GTPSKEVDRFKNPLGYQVESYRADV 223


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_0845TYPE4SSCAGX310.007 Type IV secretion system CagX conjugation protein si...
		>TYPE4SSCAGX#Type IV secretion system CagX conjugation protein

signature.
Length = 522

Score = 30.9 bits (69), Expect = 0.007
Identities = 30/119 (25%), Positives = 56/119 (47%), Gaps = 16/119 (13%)

Query: 24 AINTALLPSEYKKLVALGFKKIKTLHQRHDDKEVTEEEKKFATNALREKLRNDRARAEQI 83
A+N AL+ +Y++ + K K + D KE+ E++K EK + + +A++
Sbjct: 112 AVNFALMTRDYQEFL----KTKKLIVDAPDPKELEEQKKAL------EKEKEAKEQAQKA 161

Query: 84 QKNIEAFEKKNNSSIQKKAAKHKGLQELNEINANPLNDNPNSNSSTETKSNKDDNFDEM 142
QK+ K +++A L+ L +NP N + N N S K +++ D+M
Sbjct: 162 QKD------KREKRKEERAKNRANLENLTNAMSNPQNLSNNKNLSELIKQQRENELDQM 214


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_0851ACRIFLAVINRP330.008 Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 32.9 bits (75), Expect = 0.008
Identities = 20/88 (22%), Positives = 32/88 (36%), Gaps = 18/88 (20%)

Query: 19 EVQKRQFQKIEELKADMQKGVNPFFKVLFDGGNRLFGFPETFIYSSI-------FILFVT 71
+ K K+ EL+ +G+ +D F+ SI F +
Sbjct: 301 DTAKAIKAKLAELQPFFPQGMK--VLYPYD--------TTPFVQLSIHEVVKTLFEAIML 350

Query: 72 IVLSVILF-QAYEPVLIVAIVIVLVALG 98
+ L + LF Q LI I + +V LG
Sbjct: 351 VFLVMYLFLQNMRATLIPTIAVPVVLLG 378


11U063_0974U063_0999Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
U063_0974214-2.198199hypothetical protein
U063_0975012-2.761639hypothetical protein
U063_0976013-2.885925Outer membrane protein HorF
U063_0977114-3.817693Chorismate mutase I
U063_0978115-4.752309hypothetical protein
U063_0979-110-0.828042hypothetical protein
U063_0980-19-0.026245integrase/recombinase XerD
U063_09811100.834273Methylated-DNA--protein-cysteine
U063_09822110.967612Integral membrane protein
U063_09832141.730163lipopolysaccharide biosynthesis protein WbpB
U063_09842132.268443Ribonucleotide reductase, alpha subunit
U063_09854192.345705hypothetical protein
U063_09865161.258755hypothetical protein
U063_09873110.717574hypothetical protein
U063_09881110.322251hypothetical protein
U063_09891100.971787N-acetylglucosamine-1-phosphate
U063_09901100.952306Flagellar biosynthesis protein FliP
U063_09911111.505654Iron(III) dicitrate transport protein FecA
U063_0992-1112.120933Ferrous iron transport protein B
U063_09931122.491338Polysaccharide biosynthesis protein WlaX
U063_09941124.1169953-ketoacyl-CoA thiolase
U063_09952133.991109Succinyl-CoA:3-ketoacid-coenzyme A transferase
U063_09963143.906171Succinyl-CoA:3-ketoacid-coenzyme A transferase
U063_09973143.464073Short chain fatty acids transporter
U063_09982142.718880putative outer membrane protein
U063_09992142.949649Acetone carboxylase, beta subunit
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_0986PF07132348e-05 Harpin protein (HrpN)
		>PF07132#Harpin protein (HrpN)

Length = 356

Score = 33.9 bits (77), Expect = 8e-05
Identities = 19/45 (42%), Positives = 31/45 (68%)

Query: 42 IGEGVGAGMGGAMGGMIGALGGPWGTVFGAGIGGGIGAYSGAEIG 86
+G +G G+GG +GG+ +LGG G + G G+GGG+G+ G+ +G
Sbjct: 61 MGSMMGGGLGGGLGGLGSSLGGLGGGLLGGGLGGGLGSSLGSGLG 105



Score = 30.8 bits (69), Expect = 8e-04
Identities = 17/50 (34%), Positives = 27/50 (54%)

Query: 38 LGRDIGEGVGAGMGGAMGGMIGALGGPWGTVFGAGIGGGIGAYSGAEIGD 87
+G +G G+G G+GG + G GG G G G+G +G+ G+ +G
Sbjct: 61 MGSMMGGGLGGGLGGLGSSLGGLGGGLLGGGLGGGLGSSLGSGLGSALGG 110


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_0990FLGBIOSNFLIP2763e-96 Escherichia coli: Flagellar biosynthetic protein Fl...
		>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP

signature.
Length = 245

Score = 276 bits (707), Expect = 3e-96
Identities = 113/245 (46%), Positives = 162/245 (66%), Gaps = 2/245 (0%)

Query: 1 MRFFIFLILICPLIYPLMSADSALPSVNLSLNAPNDPKQLVTTLNVIALLTLLVLAPSLI 60
MR + + + L A + LP + S P + + + +T L P+++
Sbjct: 1 MRRLLSVAPVL-LWLITPLAFAQLPGIT-SQPLPGGGQSWSLPVQTLVFITSLTFIPAIL 58

Query: 61 LVMTSFTRLIVVFSFLRTALGTQQTPPTQILVSLSLILTFFIMEPSLKKAYDTGIKPYMD 120
L+MTSFTR+I+VF LR ALGT PP Q+L+ L+L LTFFIM P + K Y +P+ +
Sbjct: 59 LMMTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSE 118

Query: 121 KKISYTEAFEKSALPFKEFMLKNTREKDLALFFRIRNLPNPKTPDEVSLSVLIPAFMISE 180
+KIS EA EK A P +EFML+ TRE DL LF R+ N + P+ V + +L+PA++ SE
Sbjct: 119 EKISMQEALEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSE 178

Query: 181 LKTAFQIGFLLYLPFLVIDMVISSILMAMGMMMLPPVMISLPFKILVFILVDGFNLLTEN 240
LKTAFQIGF +++PFL+ID+VI+S+LMA+GMMM+PP I+LPFK+++F+LVDG+ LL +
Sbjct: 179 LKTAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGS 238

Query: 241 LVASF 245
L SF
Sbjct: 239 LAQSF 243


12U063_1172U063_1194Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
U063_11720133.119017[NiFe] hydrogenase nickel incorporation protein
U063_11731133.085253Flagellar hook protein FlgE
U063_11741142.499763CDP-diacylglycerol pyrophosphatase
U063_11751132.548425Alkylphosphonate utilization operon protein
U063_11761132.501951hypothetical protein
U063_11772132.599905hypothetical protein
U063_11782142.572351Catalase
U063_11790132.253120iron-regulated outer membrane protein FrbP_1
U063_1180416-2.244671Crossover junction endodeoxyribonuclease RuvC
U063_1181614-1.831352hypothetical protein
U063_11829250.783047hypothetical protein
U063_11838220.487971hypothetical protein
U063_11847210.463638hypothetical protein
U063_11858180.233282hypothetical protein
U063_1186816-0.186588Outer membrane protein HofD
U063_11874130.634467Outer membrane protein HofC
U063_1188012-1.539226Catalase
U063_1189-112-2.880716DNA-cytosine methyltransferase
U063_1190-212-2.893380hypothetical protein
U063_1191-110-2.473913DNA adenine methylase
U063_1192-18-2.037913GTP-binding protein TypA/BipA
U063_1193-111-3.524358adenine specific DNA methyltransferase
U063_1194-112-3.391302adenine specific DNA methyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_1173FLGHOOKAP1427e-06 Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 42.3 bits (99), Expect = 7e-06
Identities = 18/75 (24%), Positives = 36/75 (48%), Gaps = 2/75 (2%)

Query: 645 GNVFSQTGNSGQALIGAANTGR--RGSISGSKLESSNVDLSRSLTNLIVVQRGFQANSKA 702
++ S GN L ++ T +S + S V+L NL Q+ + AN++
Sbjct: 472 ASLVSDIGNKTATLKTSSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQV 531

Query: 703 VTTSDQILNTLLNLK 717
+ T++ I + L+N++
Sbjct: 532 LQTANAIFDALINIR 546



Score = 39.2 bits (91), Expect = 5e-05
Identities = 11/35 (31%), Positives = 20/35 (57%)

Query: 4 SLWSGVNGMQAHQIALDIESNNIANVNTTGFKYSR 38
+ + ++G+ A Q AL+ SNNI++ N G+
Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAGYTRQT 37


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_1184TYPE4SSCAGX250.030 Type IV secretion system CagX conjugation protein si...
		>TYPE4SSCAGX#Type IV secretion system CagX conjugation protein

signature.
Length = 522

Score = 25.1 bits (54), Expect = 0.030
Identities = 16/43 (37%), Positives = 25/43 (58%)

Query: 33 LDENKSDLAEMNEINEQLPQAQKNSKNSQTKKPLCSRLNLENL 75
L+E K L + E EQ +AQK+ + + ++ +R NLENL
Sbjct: 141 LEEQKKALEKEKEAKEQAQKAQKDKREKRKEERAKNRANLENL 183


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_1192TCRTETOQM1981e-57 Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family

signature.
Length = 639

Score = 198 bits (505), Expect = 1e-57
Identities = 115/461 (24%), Positives = 190/461 (41%), Gaps = 67/461 (14%)

Query: 3 NIRNIAVIAHVDHGKTTLVDGLLSQSGTFSEREKVDE--RVMDSNDLERERGITILSKNT 60
I NI V+AHVD GKTTL + LL SG +E VD+ D+ LER+RGITI + T
Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61

Query: 61 AIYYKDTKINIIDTPGHADFGGEVERVLKMVDGVLLLVDAQEGVMPQTKFVVKKALSFGI 120
+ +++TK+NIIDTPGH DF EV R L ++DG +LL+ A++GV QT+ + GI
Sbjct: 62 SFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGI 121

Query: 121 CPIVVVNKIDKPAAEPDRVVDEVFDLF---------VAMGASDKQLDFPV-----VYAAA 166
I +NKID+ + V ++ + V + + +F
Sbjct: 122 PTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIEG 181

Query: 167 RDGYAMKSLDDE----------------------------KKNL--EPLFETILEHVPSP 196
D K + + K N+ + L E I S
Sbjct: 182 NDDLLEKYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNNIGIDNLIEVITNKFYSS 241

Query: 197 SGSVDEPLQMQIFTLDYDNYVGKIGIARVFNGSVKKNESVLLMKSDGSKENGRITKLIGF 256
+ L ++F ++Y ++ R+++G + +SV + KE +IT++
Sbjct: 242 THRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRI----SEKEKIKITEMYTS 297

Query: 257 LGLARTEIENAYAGDIVAIAG--FNAMDV-GDSVVDPNNPMPLDPMHLEEPTMSVYFAVN 313
+ +I+ AY+G+IV + V GD+ + P +P P + +
Sbjct: 298 INGELCKIDKAYSGEIVILQNEFLKLNSVLGDTKLLPQRERIENP----LPLLQTTVEPS 353

Query: 314 DSPLAGLEGKHVTANKLKDRLLKEMQTNIAMKCEEMGEGKFKVSGRGELQITILAENLRR 373
+ + D LL+ + + +S G++Q+ + L+
Sbjct: 354 KPQQREMLLDALLEISDSDPLLRYYVDSAT--------HEIILSFLGKVQMEVTCALLQE 405

Query: 374 E-GFEFSISRPEVIIKEENGVKCEPFEHLVIDTPQDFSGAI 413
+ E I P VI E K E H+ + P F +I
Sbjct: 406 KYHVEIEIKEPTVIYMERPLKKAEYTIHIEVP-PNPFWASI 445



Score = 41.8 bits (98), Expect = 8e-06
Identities = 20/80 (25%), Positives = 30/80 (37%), Gaps = 1/80 (1%)

Query: 396 EPFEHLVIDTPQDFSGAIIERLGKRKAEMKAMNPMSDGYTRLEFEIPARGLIGYRSEFLT 455
EP+ I PQ++ K A + + + L EIPAR + YRS+
Sbjct: 537 EPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQ-LKNNEVILSGEIPARCIQEYRSDLTF 595

Query: 456 DTKGEGVMNHSFLEFRPFSG 475
T G V + +G
Sbjct: 596 FTNGRSVCLTELKGYHVTTG 615


13U063_1204U063_1209Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
U063_1204-113-3.371706hypothetical protein
U063_1205-114-3.568797integral membrane protein
U063_1206-215-3.814512hypothetical protein
U063_1207-213-3.426359hypothetical protein
U063_1208-116-4.153548Type I restriction-modification system,
U063_1209-213-3.521279OrfB in transposon IS607
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_1205CHANLCOLICIN280.006 Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 28.1 bits (62), Expect = 0.006
Identities = 16/48 (33%), Positives = 27/48 (56%)

Query: 46 SFQDPEKREEYIERLKKNHERKMILQDKQKEEQMRLYQAKKERESRQK 93
+FQ+ E+R + IER K ER++ L + +++ L + K E QK
Sbjct: 149 AFQEAEQRRKEIEREKAETERQLKLAEAEEKRLAALSEEAKAVEIAQK 196


14U063_1309U063_1336Y        NYGenomic Island
LocusTagDNBiasCDNBias%GCBiasProduct
U063_1309218-2.817517Lysozyme
U063_1310315-4.109260hypothetical protein
U063_1311518-4.757445hypothetical protein
U063_1312420-4.816985hypothetical protein
U063_1313320-5.172635hypothetical protein
U063_1314321-5.351905hypothetical protein
U063_1315326-6.322148hypothetical protein
U063_1316125-6.514227hypothetical protein
U063_1317422-6.278591protein phosphatase 2C
U063_1318521-6.469931protein phosphatase 2C
U063_1319522-6.771253hypothetical protein
U063_1322625-5.866820hypothetical protein
U063_1323523-6.163181integrase/recombinase
U063_1324525-5.898715OrfA in transposon IS607
U063_1325426-6.174194OrfB in transposon IS607
U063_1326625-6.970396hypothetical protein
U063_1327626-6.972931hypothetical protein
U063_1328625-7.927728hypothetical protein
U063_1329725-7.215546site-specific recombinase
U063_1330624-7.329550DNA topoisomerase I
U063_1331625-7.682923VirB4-like ATPase of type IV secretion complex
U063_1332527-7.358873hypothetical protein
U063_1333417-5.850679hypothetical protein
U063_1334416-5.723996hypothetical protein
U063_1335216-5.198025hypothetical protein
U063_1336116-3.921378hypothetical protein
15U063_1393U063_1403Y        NNGenomic Island
LocusTagDNBiasCDNBias%GCBiasProduct
U063_13932120.667091Outer membrane protein HomC
U063_1394014-0.243057Deoxycytidine triphosphate deaminase
U063_1395114-0.881462Biotin carboxyl carrier protein of acetyl-CoA
U063_1396114-2.072607Biotin carboxylase of acetyl-CoA carboxylase
U063_1397417-4.934747hypothetical protein
U063_1398012-3.616834hypothetical protein
U063_1399113-3.030915type II DNA methyltransferase
U063_1400013-1.506569hypothetical protein
U063_1401012-1.462148hypothetical protein
U063_1402212-0.333944hypothetical protein
U063_14032110.076776C4 aminotransferase
16U063_1418U063_1442Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
U063_14182111.2794576-phosphogluconolactonase
U063_14192101.419656Glucokinase
U063_14203110.686032Alcohol dehydrogenase
U063_14211111.415809lipopolysaccharide biosynthesis protein
U063_14222132.375388hypothetical protein
U063_14230142.878565Outer membrane protein HorH
U063_1424-1112.621043Pyruvate:ferredoxin oxidoreductase gamma
U063_1425-1102.135543Pyruvate:ferredoxin oxidoreductase delta
U063_1426-2101.492449Pyruvate:ferredoxin oxidoreductase alpha
U063_1427010-0.287316Pyruvate:ferredoxin oxidoreductase beta subunit
U063_1428211-0.826962Adenylosuccinate lyase
U063_1429012-0.223132Outer membrane protein HorI
U063_1430111-0.176436Excinuclease ABC subunit B
U063_1431213-0.447592hypothetical protein
U063_1432113-0.553830hypothetical protein
U063_14330160.240306hypothetical protein
U063_14340150.119161Gamma-glutamyltranspeptidase
U063_1435-113-1.035010Flagellar hook-associated protein FlgK
U063_1436015-1.365275hypothetical protein
U063_1437117-0.991651DNA-cytosine methyltransferase
U063_1438313-0.842221FlgM protein
U063_1439312-1.709627hypothetical protein
U063_1440413-1.574612peptidyl-prolyl cis-trans isomerase SlyD
U063_1441414-2.264452Putative periplasmic protein
U063_1442414-2.091746Outer membrane lipoprotein PalA
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_1424YERSSTKINASE290.011 Yersinia serine/threonine protein kinase signature.
		>YERSSTKINASE#Yersinia serine/threonine protein kinase signature.

Length = 732

Score = 29.3 bits (65), Expect = 0.011
Identities = 18/63 (28%), Positives = 33/63 (52%), Gaps = 9/63 (14%)

Query: 50 YNRVDDEPILNHERFMQPDYVLVIDPGLVFIENIFANEKEDTTYIITSYLNKEELFEKKP 109
++R ++P E F P+ + + N+ A+EK D ++++ L+ E FEK P
Sbjct: 293 HSRSGEQPKGFTESFKAPE---------LGVGNLGASEKSDVFLVVSTLLHCIEGFEKNP 343

Query: 110 ELK 112
E+K
Sbjct: 344 EIK 346


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_1432CHANLCOLICIN320.018 Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 31.6 bits (71), Expect = 0.018
Identities = 54/310 (17%), Positives = 111/310 (35%), Gaps = 36/310 (11%)

Query: 44 AKFSNNTLKIIEELNNGVKSASEEIKAKAHDFSNEKLTNEQIKDLLNNAEIPTSGRDAII 103
AK+S LK + A+ E +AKA +N +++KD++N A + R
Sbjct: 55 AKWSTAQLKKTQAEQAARAKAAAEAQAKAK--ANRDALTQRLKDIVNEALRHNASR---- 108

Query: 104 FGVNNLNPEIVEFMHQNNKKMIIE-------KASNKELELLKDAN--FRHPENIRASLDH 154
P E H NN M E KA K + + A F+ E R ++
Sbjct: 109 ------TPSATELAHANNAAMQAEDERLRLAKAEEKARKEAEAAEKAFQEAEQRRKEIER 162

Query: 155 DAISHILKRHGVNSVNVRNGEIPITNEDIANYRYIVNNADAILRTLDKYDKEAITAFKQ- 213
+ + + R + + + + ++ A + + +D K +
Sbjct: 163 EKAETERQLKLAEAEEKRLAALSEEAKAVEIAQKKLSAAQSEVVKMDGEIKTLNSRLSSS 222

Query: 214 INGYAVVVEQAINKKNELALKTMFKSKGDYKNNEVYKEFSSTSLDADAKVRHRLSSYSGA 273
I+ ++ K+NELA + K E+ + S A+ +++R
Sbjct: 223 IHARDAEMKTLAGKRNELAQASA-------KYKELDELVKKLSPRANDPLQNRPFF---- 271

Query: 274 TENSTQKPLTDQEDLLKTQENLNASTQEPNHLSPLEQANAEKLAKLESEKLESEQEFLKA 333
+T++ + + + Q+ + AS N ++ + +K S + +
Sbjct: 272 --EATRRRVGAGKIREEKQKQVTASETRINRINA-DITQIQKAISQVSNNRNAGIARVHE 328

Query: 334 KEQENKRKEA 343
E+ K+ +
Sbjct: 329 AEENLKKAQN 338


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_1435FLGHOOKAP15650.0 Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 565 bits (1457), Expect = 0.0
Identities = 131/610 (21%), Positives = 232/610 (38%), Gaps = 75/610 (12%)

Query: 6 SSLNTSYTGLQAHQSMVDVTGNNISNASDEFYSRQRVIAKPQAAYMYGTKNVNMGVDVEA 65
S +N + +GL A Q+ ++ NNIS+ + Y+RQ I + + V GV V
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVSG 61

Query: 66 IERVHDEFVFSRYTKANYENTYYDTEFSHLKEASAYFPDIDEASLFTDLQDYFNSWKELS 125
++R +D F+ ++ A +++ + + + +SL T +QD+F S + L
Sbjct: 62 VQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNML-STSTSSLATQMQDFFTSLQTLV 120

Query: 126 KNAKDSAQKQALAQKTEALTHNIKDTRERLTTLQHKASEELKSVIKEVNSLGSQIAEINK 185
NA+D A +QAL K+E L + K T + L + + + + + ++N+ QIA +N
Sbjct: 121 SNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLND 180

Query: 186 RIKEVENNKSLKHANELRDKRDELEFHLRELLGGNVFKSSIKTNSLTDKDSADFDESYNL 245
+I + + N L D+RD+L L +++G V S +YN+
Sbjct: 181 QISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEV--------------SVQDGGTYNI 226

Query: 246 NIGHGFNIIDGSIFHPLVVKESENKGGLNQVYFQSDDFKVINITDK-LNQGKVGALLDVY 304
+ +G++++ GS L S V + I I +K LN G +G +L
Sbjct: 227 TMANGYSLVQGSTARQLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFR 286

Query: 305 NDGSNGTLKGKLQDYIDLLDSFARGLIESTNAIYAQSASHYIEGEPVEFNSDEAFKDTNY 364
+ L + L A E+ N + +A D N
Sbjct: 287 SQ--------DLDQTRNTLGQLALAFAEAFNTQH------------------KAGFDANG 320

Query: 365 NIKNGSFDL----IAYNTDGKEIARKTIAITPITTMNDIIQAINANTDDNQ-----DNNT 415
+ F + + NT K +T + + I+ + + Q N T
Sbjct: 321 DAGEDFFAIGKPAVLQNTKNKGDVAIGATVTDASAVLATDYKISFDNNQWQVTRLASNTT 380

Query: 416 ENDFDDYFTASFNNETKKFVIQPKNASQGLFVSMKDNGTNFMGALKLNPFFQGDDASNIS 475
D + + + + + M L D + I+
Sbjct: 381 FTVTPDANGKVAFDGLELTFTGTPAVNDSFTLKPVSDAIVNMDVLI-------TDEAKIA 433

Query: 476 LNKEYKKEPTTIRPWLAPINGNFDVANMMQQLQYDSVDFYNDKFDIKPMKISEFYQFLTG 535
+ E E G+ D N L S N K ++ Y L
Sbjct: 434 MASE---EDA----------GDSDNRNGQALLDLQS----NSKTVGGAKSFNDAYASLVS 476

Query: 536 KINTDAEKSGRILDTKKSMLETIKKEQLSISQVSVDEEMLNLIKFQSGYAANAKVISTID 595
I T+ +++ + +Q SIS V++DEE NL +FQ Y ANA+V+ T +
Sbjct: 477 DIGNKTATLKTSSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTAN 536

Query: 596 RMIDTLLGIK 605
+ D L+ I+
Sbjct: 537 AIFDALINIR 546


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_1442OMPADOMAIN1463e-45 OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 146 bits (369), Expect = 3e-45
Identities = 45/169 (26%), Positives = 71/169 (42%), Gaps = 24/169 (14%)

Query: 22 NMDKETVAGDVSAKTVQTAPV-TTEPAPEKEEPKQEPAPVVEEKPAVESGTIIASIYFDF 80
D ++ VS + Q PAP PAP V+ K T+ + + F+F
Sbjct: 177 RPDNGMLSLGVSYRFGQGEAAPVVAPAPA-------PAPEVQTK----HFTLKSDVLFNF 225

Query: 81 DKYEIKESDQETLDEIVQKAKE---NHMQVLLEGNTDEFGSSEYNQALGVKRTLSVKNAL 137
+K +K Q LD++ + V++ G TD GS YNQ L +R SV + L
Sbjct: 226 NKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGLSERRAQSVVDYL 285

Query: 138 VIKGVEKDMIKTISFGETKPKCAQKT---------KECYKENRRVDVKL 177
+ KG+ D I GE+ P +C +RRV++++
Sbjct: 286 ISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIEV 334


17U063_1453U063_1475Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
U063_1453019-3.472821ATP synthase B' chain
U063_1454118-2.965283Chromosome (plasmid) partitioning protein ParB
U063_1455017-2.879460Chromosome (plasmid) partitioning protein ParA
U063_1456017-2.741629Biotin-protein ligase
U063_1457117-2.816098Methionyl-tRNA formyltransferase
U063_1458115-2.933357hypothetical protein
U063_1459314-0.067932hypothetical protein
U063_1461-115-0.365728hypothetical protein
U063_1462015-0.745765Peptidyl-prolyl cis-trans isomerase
U063_1463117-1.451933Carbon storage regulator
U063_1464117-1.8249234-diphosphocytidyl-2-C-methyl-D-erythritol
U063_1465120-1.574182tmRNA-binding protein SmpB
U063_1466217-0.082548Ferric siderophore transport system
U063_1467318-0.607102Biopolymer transport protein ExbD/TolR
U063_1468218-0.384729LSU ribosomal protein L34p
U063_1469117-0.741611Ribonuclease P protein component
U063_14700160.496674Protein YidD
U063_14710140.157058Inner membrane protein translocase component
U063_14720110.248823RNA-binding protein Jag
U063_14730100.337940RNA-binding protein Jag
U063_1474090.680355GTPase and tRNA-U34 5-formylation enzyme TrmE
U063_14752111.160343Outer membrane protein HomD
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_1455PF07675310.004 Cleaved Adhesin
		>PF07675#Cleaved Adhesin

Length = 1358

Score = 31.2 bits (70), Expect = 0.004
Identities = 30/105 (28%), Positives = 40/105 (38%), Gaps = 7/105 (6%)

Query: 70 QISQVILKTQMPFLDLVPSNLGLAGFEKTFYDSQDENKRGELMLKNALESVV---GLYDY 126
VI T F SNL A FE + D + ++ VV G+YDY
Sbjct: 414 TFGSVIPATGPLFTGTASSNLYSANFEYLTPANADPVVTTQNIIVTGQGEVVIPGGVYDY 473

Query: 127 IIIDSPPALGPLTINSLSAAHSVIIPIQCEFFALEGTKLLLNTIR 171
I + PA G + I A P + + FA E K T+R
Sbjct: 474 CITNPEPASGKMWI----AGDGGNQPARYDDFAFEAGKKYTFTMR 514


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_1457FERRIBNDNGPP290.018 Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 29.1 bits (65), Expect = 0.018
Identities = 11/33 (33%), Positives = 19/33 (57%)

Query: 70 EPEVQILKDLKPDFIVVVAYGKILSKEVLEIAP 102
EP +++L ++KP F+V A + + IAP
Sbjct: 86 EPNLELLTEMKPSFMVWSAGYGPSPEMLARIAP 118


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_1459RTXTOXIND433e-06 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 42.5 bits (100), Expect = 3e-06
Identities = 22/170 (12%), Positives = 59/170 (34%), Gaps = 18/170 (10%)

Query: 51 RAQYQSHFKALEQKEEALKEREREQKAQFDDAVKQASALALQDERAKIIEEARKNAFLEQ 110
+ Q+ + QKE L ++ E+ + + ++ R + +
Sbjct: 192 KEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAK 251

Query: 111 QKGLELLQKELDEKSKQVQQLHQKEAEIERLKRENNEAESRLKAENEKKLNEKLDLEREK 170
LE + + E +L ++++E+++ E A+ + + + E
Sbjct: 252 HAVLEQ-ENKYVEAV---NELRVYKSQLEQIESEILSAKEEYQLVTQ-------LFKNEI 300

Query: 171 IEKALHEKNELKFKQQEEQLEMLRNELKNAQRKAELSSQQFQGEVQELAI 220
++K + +L + + +A +S +VQ+L +
Sbjct: 301 LDK--LRQTTDNIGLLTLELAKNEERQQASVIRAPVS-----VKVQQLKV 343


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_147160KDINNERMP425e-146 60kDa inner membrane protein signature.
		>60KDINNERMP#60kDa inner membrane protein signature.

Length = 548

Score = 425 bits (1094), Expect = e-146
Identities = 133/399 (33%), Positives = 223/399 (55%), Gaps = 32/399 (8%)

Query: 175 DLNTLTIIKTLTFYDDLHYDLKIAFKSPN--------NIIPSYVITNGYRPVADLDS--- 223
D T KT Y + + + N + + P D S
Sbjct: 155 DAAGNTFTKTFVLKRG-DYAVNVNYNVQNAGEKPLEISSFGQLKQSITLPPHLDTGSSNF 213

Query: 224 --YTFSGVLLENNDKKIEKIE---DKDAKEIKRFSNTLFLSSVDRYFTTLLFTKDSQGFE 278
+TF G D+K EK + D + + S +++ + +YF T + G
Sbjct: 214 ALHTFRGAAYSTPDEKYEKYKFDTIADNENLNISSKGGWVAMLQQYFATAWIPHN-DGTN 272

Query: 279 ALIDSEIGTKNPLGFISLKNEA-----------NLHGYIGPKDYRSLKAISPMLTDVIEY 327
+ +G N + I K++ N ++GP+ + A++P L ++Y
Sbjct: 273 NFYTANLG--NGIAAIGYKSQPVLVQPGQTGAMNSTLWVGPEIQDKMAAVAPHLDLTVDY 330

Query: 328 GLITFFAKGVFVLLDYLYQFVGNWGWAIILLTIIVRLILYPLSYKGMVSMQKLKELAPKM 387
G + F ++ +F LL +++ FVGNWG++II++T IVR I+YPL+ SM K++ L PK+
Sbjct: 331 GWLWFISQPLFKLLKWIHSFVGNWGFSIIIITFIVRGIMYPLTKAQYTSMAKMRMLQPKI 390

Query: 388 KELQEKYKGEPQKLQAHMMQLYKKHGANPLGGCLPLILQIPVFFAIYRVLYNAVELKSSE 447
+ ++E+ + Q++ MM LYK NPLGGC PL++Q+P+F A+Y +L +VEL+ +
Sbjct: 391 QAMRERLGDDKQRISQEMMALYKAEKVNPLGGCFPLLIQMPIFLALYYMLMGSVELRQAP 450

Query: 448 WILWIHDLSIMDPYFILPLLMGASMYWHQSVTPNTMTDPMQAKIFKLLPLLFTIFLITFP 507
+ LWIHDLS DPY+ILP+LMG +M++ Q ++P T+TDPMQ KI +P++FT+F + FP
Sbjct: 451 FALWIHDLSAQDPYYILPILMGVTMFFIQKMSPTTVTDPMQQKIMTFMPVIFTVFFLWFP 510

Query: 508 AGLVLYWTTNNILSVLQQLIINKILENKKRMHAQNKKES 546
+GLVLY+ +N+++++QQ +I + LE K+ +H++ KK+S
Sbjct: 511 SGLVLYYIVSNLVTIIQQQLIYRGLE-KRGLHSREKKKS 548


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_1472IGASERPTASE270.019 IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 26.6 bits (58), Expect = 0.019
Identities = 16/41 (39%), Positives = 20/41 (48%), Gaps = 4/41 (9%)

Query: 54 AGVKESVKEVKEEGVKETSTKEIHQNAEEKKQLETETPQEE 94
A KE + KET+T E EEK ++ETE QE
Sbjct: 1086 AQSGSETKETQTTETKETATVE----KEEKAKVETEKTQEV 1122


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_1474TCRTETOQM310.013 Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family

signature.
Length = 639

Score = 31.0 bits (70), Expect = 0.013
Identities = 32/134 (23%), Positives = 53/134 (39%), Gaps = 25/134 (18%)

Query: 216 LSIVGKPNAGKSSLLNAMLLEERA---LVSDIKGTTR-DTIEE-------------VIEL 258
+ ++ +AGK++L ++L A L S KGTTR D +
Sbjct: 6 IGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITSFQW 65

Query: 259 KGHKVRLIDTAGIRESADKIERLGIEKSLKSLENCDIILGVFDLSKPLEKEDFNLIETLN 318
+ KV +IDT G + ++ R SL L D + + ++ + L L
Sbjct: 66 ENTKVNIIDTPGHMDFLAEVYR-----SLSVL---DGAILLISAKDGVQAQTRILFHALR 117

Query: 319 RTKKPCIVVLNKND 332
+ P I +NK D
Sbjct: 118 KMGIPTIFFINKID 131


18U063_1494U063_1499Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
U063_14943190.124217restriction enzyme BcgI alpha chain-like
U063_14952151.014479hypothetical protein
U063_14963140.561319Thymidylate kinase
U063_14972130.897414Phosphopantetheine adenylyltransferase
U063_14982130.8702173-polyprenyl-4-hydroxybenzoate carboxy-lyase
U063_14992130.360429Flagellar basal-body P-ring formation protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_1497LPSBIOSNTHSS2213e-77 Lipopolysaccharide core biosynthesis protein signat...
		>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein

signature.
Length = 166

Score = 221 bits (564), Expect = 3e-77
Identities = 63/147 (42%), Positives = 95/147 (64%)

Query: 4 IGIYPGTFDPVTNGHIDIIHRSSELFEKLIVAVAHSSAKNPMFSLKERLKMMQLATKNFK 63
IYPG+FDP+T GH+DII R LF+++ VAV + K PMFS++ERL+ + A +
Sbjct: 2 NAIYPGSFDPITFGHLDIIERGCRLFDQVYVAVLRNPNKQPMFSVQERLEQIAKAIAHLP 61

Query: 64 NVECVAFEGLLANLAKEYHCKVLVRGLRVVSDFEYELQMGYANKSLNHELETLYFMPTLQ 123
N + +FEGL N A++ ++RGLRV+SDFE ELQM NK+L +LET++ + +
Sbjct: 62 NAQVDSFEGLTVNYARQRQAGAILRGLRVLSDFELELQMANTNKTLASDLETVFLTTSTE 121

Query: 124 NAFISSSIVRSIIAHKGDASHLVPKEI 150
+F+SSS+V+ + G+ H VP +
Sbjct: 122 YSFLSSSLVKEVARFGGNVEHFVPSHV 148


19U063_1536U063_1550Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
U063_153609-3.187191putative type IIS restriction/modification
U063_1537012-3.406541Type III restriction-modification system
U063_1538012-3.045755Type III restriction-modification system
U063_1539011-2.613310Type III restriction-modification system
U063_1540012-1.380509ATP-dependent DNA helicase RecG
U063_1541015-0.890370hypothetical protein
U063_1542-114-0.746990Outer membrane protein HorM
U063_15430120.021606Exodeoxyribonuclease III
U063_15451120.284908*periplasmic competence protein ComH
U063_1546216-0.024320Chromosomal replication initiator protein DnaA
U063_1547217-0.919801purine nucleoside phosphorylase PunB
U063_1548115-1.711592hypothetical protein
U063_1549215-2.433070isomerizing glucosamine--fructose-6-phosphate
U063_1550019-4.687241Thymidylate synthase ThyX
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_1546HTHFIS355e-04 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 35.2 bits (81), Expect = 5e-04
Identities = 9/51 (17%), Positives = 24/51 (47%), Gaps = 4/51 (7%)

Query: 125 TVYEIAKKVAQSDTPPYNPVLFYGGTGLGKTHILNAIGNHALEKHKKVVLV 175
+Y + ++ Q+D ++ G +G GK + A+ ++ ++ V +
Sbjct: 148 EIYRVLARLMQTDLT----LMITGESGTGKELVARALHDYGKRRNGPFVAI 194


20U063_0040U063_0045N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
U063_0040-3120.465108DNA transformation competancy protein ComB8
U063_0041-212-0.379616ComB9 competence protein
U063_0042-2110.486286VirB10-like protein ComB10
U063_0043-290.540352Mannose-6-phosphate isomerase
U063_0044-2100.709199GDP-mannose 4,6-dehydratase
U063_0045-1100.537334GDP-fucose synthetase
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_0040PF043351345e-41 VirB8 type IV secretion protein
		>PF04335#VirB8 type IV secretion protein

Length = 227

Score = 134 bits (339), Expect = 5e-41
Identities = 38/202 (18%), Positives = 73/202 (36%), Gaps = 4/202 (1%)

Query: 40 QSVFRLERNRLKIAYKLLGLMSFIALVLAIVLISVLPLQKTEHHF--VDFLNQDKHYAII 97
+ K+A+ + G+ +A + + ++ PL+ E + VD + A
Sbjct: 22 RDKLAAAERSKKLAWVVAGVAGALATAGVVAVAALTPLKTVEPYVITVDRNTGEASIAAK 81

Query: 98 QRADKSISSNEALARSLIGAYVLNRESINRIDDKSRYELVRLQSSSKVWQRFEDLIKTQN 157
D +I+ +EA+ + + YV RE + ++ V + S+ R+ KT N
Sbjct: 82 LHGDATITYDEAVRKYFLATYVRYREGWIAAAREEYFDAVMVMSARPEQDRWSRFYKTDN 141

Query: 158 SIYAQSHLEREVHI-VNIAIYQQDNNPIASVSIAAKLLNENKLVYEKRYKIVLSYLFDTP 216
Q+ L + V I +A V + + + + + Y D
Sbjct: 142 PQSPQNILANRTDVFVEIKRVSFLGGNVAQVYFTKESVTGSNST-KTDAVATIKYKVDGT 200

Query: 217 DFDYASMPKNPTGFKVTRYSIT 238
KNP G++V Y
Sbjct: 201 PSKEVDRFKNPLGYQVESYRAD 222


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_0041TYPE4SSCAGX300.012 Type IV secretion system CagX conjugation protein si...
		>TYPE4SSCAGX#Type IV secretion system CagX conjugation protein

signature.
Length = 522

Score = 30.1 bits (67), Expect = 0.012
Identities = 26/70 (37%), Positives = 36/70 (51%), Gaps = 8/70 (11%)

Query: 192 KEKEEETIIIGDNTNAMKIVKKDIQKGYRALKSSQ--RKWYCLGICSKKSKLSLMPKEIF 249
K +EE+ II D A+ + Q + ALK + R + K+SK +MP EIF
Sbjct: 367 KIREEKQKIILDQAKAL-----ETQYVHNALKRNPVPRNYNYYQAPEKRSK-HIMPSEIF 420

Query: 250 NDKQFTYFKF 259
+D FTYF F
Sbjct: 421 DDGTFTYFGF 430


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_0044NUCEPIMERASE881e-21 Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 87.9 bits (218), Expect = 1e-21
Identities = 46/180 (25%), Positives = 72/180 (40%), Gaps = 19/180 (10%)

Query: 7 LITGVTGQDGSYLAEYLLNLGYEVHGLKRRSSSINTSRIDHLYEDLHSDHKRRFFLHYGD 66
L+TG G G ++++ LL G++V G+ + + S E L F H D
Sbjct: 4 LVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQP---GFQFHKID 60

Query: 67 MTDSSNLIHLIATTKPTEIYNLAAQSHVKVSFETPEYTANADGIGTLRILEAMRILGLEK 126
+ D + L A+ ++ + V+ S E P A+++ G L ILE R ++
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQ- 119

Query: 127 KTRFYQASTSELYGEVLETPQNENTPF-------NPRSPYAVAKMYAFYITKNYREAYNL 179
AS+S +YG N PF +P S YA K + Y Y L
Sbjct: 120 --HLLYASSSSVYGL------NRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGL 171


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_0045NUCEPIMERASE504e-09 Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 49.8 bits (119), Expect = 4e-09
Identities = 50/346 (14%), Positives = 105/346 (30%), Gaps = 54/346 (15%)

Query: 5 ILITGAYGMVGQNTALYFKKNKPDV-----------TLLTPKKSELC-----------LL 42
L+TGA G +G + + + V L + EL L
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDLA 62

Query: 43 DKDNIQAYLKEYKPTGIIHCAGRVGGIVANMNDLSTYMVENLLMGLYLFSSALDLGVKKA 102
D++ + + R + ++ + Y NL L + ++
Sbjct: 63 DREGMTDLFASGHFERVFISPHR-LAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQHL 121

Query: 103 INLASSCAYPKFAPNPLKESDLLNGSLEPTNEGYALAKLSVMKYCEYVSAEKGVFYKTLV 162
+ +SS Y P D ++ + YA K + S G+ L
Sbjct: 122 LYASSSSVYGLNRKMPFSTDDSVDHPVSL----YAATKKANELMAHTYSHLYGLPATGLR 177

Query: 163 PCNLYGEFDKFEEKIAHMIPGLIARMHAAKLKNEKEFVMWGDGTARREYLNAKDLARFIA 222
+YG + + P + + K ++ G +R++ D+A I
Sbjct: 178 FFTVYGPWGR---------PDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAII 228

Query: 223 LAYENIASIPS-----------------VMNVGSGVDYSIEEYYEKVAQVLDYKGAFVKD 265
+ I + V N+G+ + +Y + + L +
Sbjct: 229 RLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNML 288

Query: 266 LSKPVGMQQKLMDISK-QKALKWELEIPLEQGIKEAYEYYLKLLEV 310
+P + + D + + + E ++ G+K +Y +V
Sbjct: 289 PLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRDFYKV 334


21U063_0446U063_0452N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
U063_04462133.095630methyl-accepting chemotaxis protein
U063_04471113.5622042',3'-cyclic-nucleotide 2'-phosphodiesterase
U063_0448-2114.675700S-ribosylhomocysteine lyase
U063_0449-1123.816779Cystathionine gamma-lyase
U063_04500152.676024Cystathionine beta-synthase
U063_04512161.759927hypothetical protein
U063_04520120.489917Chaperone protein DnaK
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_0446OMS28PORIN300.022 OMS28 porin signature.
		>OMS28PORIN#OMS28 porin signature.

Length = 257

Score = 30.1 bits (67), Expect = 0.022
Identities = 35/163 (21%), Positives = 77/163 (47%), Gaps = 11/163 (6%)

Query: 375 HTEEELSSKVEQLSRNADDVKSILDIINDIADQTNLLALNAAIEAARAGEHGRGFAVVAD 434
H++++ + K++Q D V LD IN + + +++ +E R + A
Sbjct: 41 HSDQKDNKKLDQ----KDQVNQALDTINKVTED-----VSSKLEGVRESSLELVESNDAG 91

Query: 435 EVRNLAGRTQKSLAEINSTIMVIVQEINAVSSQMNLNSQKMERLSDMSK-SVQETYEKMS 493
V+ G + ++++ +V QE V+ + ++ ++ +MSK +VQET + +S
Sbjct: 92 VVKKFVG-SMSLMSDVAKGTVVASQEATIVAKCSGMVAEGANKVVEMSKKAVQETQKAVS 150

Query: 494 SNLSSVVSDSNQSMDDYAKSGHQIEVMVSDFAEVEKVASKTLA 536
+ Q M + + + ++E+ +FA+VE+V +A
Sbjct: 151 VAGEATFLIEKQIMLNKSPNNKELELTKEEFAKVEQVKETLMA 193


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_0447PF05704290.039 Capsular polysaccharide synthesis protein
		>PF05704#Capsular polysaccharide synthesis protein

Length = 307

Score = 29.5 bits (66), Expect = 0.039
Identities = 25/189 (13%), Positives = 55/189 (29%), Gaps = 11/189 (5%)

Query: 382 RKDVAYIYKFANTLIGVHITGENLLKYMEWSYRFYNQLQPGDLTI-SFNENIRGYNFDMF 440
++ VA + K + + I G N ++++ + Q G + F++ +R + +
Sbjct: 87 QQCVASVKKNSGDFKVIIIDGNNYKEWVDIPDFLIKRWQEGKMLDAWFSDILRLFLLCKY 146

Query: 441 SGV--KYQVDVTKPAGQRIINPTINNIPIDPKAIYKLAINNYRFGTLSTTLNLVTDTDR- 497
G+ V + I+ I+N+ S +
Sbjct: 147 GGLWIDATVYMFDKVPNYIVESNRFMFQSSFLESETTHISNWLIFVKSKNDPFLVGLKNS 206

Query: 498 ---YYDSYDALQDSGQIRDLIIKYITEEKGGK--VTPELEGNWE--IINYDFKNPLLEKL 550
Y + D D + ++ K N ++ Y P +
Sbjct: 207 MVTYLKKKEKPADYYIFHDFVSVMAVSKEYSKYWKEIPYVNNVNPHMLQYLGNLPYDNSM 266

Query: 551 REKLKEGSI 559
+K S
Sbjct: 267 FNYIKSTSP 275


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_0448LUXSPROTEIN2263e-79 Bacterial autoinducer-2 (AI-2) production protein Lu...
		>LUXSPROTEIN#Bacterial autoinducer-2 (AI-2) production protein LuxS

signature.
Length = 171

Score = 226 bits (577), Expect = 3e-79
Identities = 59/145 (40%), Positives = 90/145 (62%), Gaps = 7/145 (4%)

Query: 8 VESFNLDHTKVKAPYVRIADRKKGVNGDVIVKYDVRFKQPNKDHMDMPSLHSLEHLVAEI 67
++SF +DHT++ AP VR+A + GD I +D+RF PNKD + +H+LEHL A
Sbjct: 3 LDSFTVDHTRMNAPAVRVAKTMQTPKGDTITVFDLRFTAPNKDILSEKGIHTLEHLYAGF 62

Query: 68 IRNHA----SYVVDWSPMGCQTGFYLTVLNHDNYTEVLEVLEKTMQDVLKA---TEVPAS 120
+RNH ++D SPMGC+TGFY++++ + +V + M+DVLK ++P
Sbjct: 63 MRNHLNGDSVEIIDISPMGCRTGFYMSLIGTPSEQQVADAWIAAMEDVLKVENQNKIPEL 122

Query: 121 NEKQCGWAANHTLEGAKNLARAFLD 145
NE QCG AA H+L+ AK +A+ L+
Sbjct: 123 NEYQCGTAAMHSLDEAKQIAKNILE 147


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_0452SHAPEPROTEIN1523e-43 Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein

signature.
Length = 347

Score = 152 bits (385), Expect = 3e-43
Identities = 78/384 (20%), Positives = 141/384 (36%), Gaps = 86/384 (22%)

Query: 5 IGIDLGTTNSAMAVYEG----NEAKIIA-NKEGKNTTPSIVAFTDKGEILVGESAKRQAV 59
+ IDLGT N+ + V NE ++A ++ + S+ A VG AK+
Sbjct: 13 LSIDLGTANTLIYVKGQGIVLNEPSVVAIRQDRAGSPKSVAA--------VGHDAKQMLG 64

Query: 60 TNPEKTIYSIKRIMGLMFNEDKAKEAEKRLPYKIVDRNGACAIEISGKIYTPQEISAKIL 119
P I +I+ + ++G A + +++ +
Sbjct: 65 RTPGN-IAAIRPM-----------------------KDGVIA-----DFFVTEKMLQHFI 95

Query: 120 MKLKEDAESYLGESVTEAVITVPAYFNDSQRKATKEAGTIAGLNVLRIINEPTSAALAYG 179
++ ++ ++ VP +R+A +E+ AG + +I EP +AA+ G
Sbjct: 96 KQVHSNS---FMRPSPRVLVCVPVGATQVERRAIRESAQGAGAREVFLIEEPMAAAIGAG 152

Query: 180 LDKKESEKIMVYDLGGGTFDVTVLETGDNVVEVLATGGDAFLGGDDFDNRVIDFLASEFK 239
L E+ MV D+GGGT +V V+ V +GGD FD +I+++ +
Sbjct: 153 LPVSEATGSMVVDIGGGTTEVAVISLNGVV-----YSSSVRIGGDRFDEAIINYVRRNYG 207

Query: 240 SETGIEIKNDVMALQRLKEAAENAKKELSSAM----ETEINLPFITADATGPKHLVKKLT 295
S G + AE K E+ SA EI + P+ +
Sbjct: 208 SLIG-------------EATAERIKHEIGSAYPGDEVREIEVRGRNLAEGVPRGFTLN-S 253

Query: 296 RAKFESLTEDL----------VEETISKIESVIKDAGLTKNEISEVVMVGGSTRIPKVQE 345
E+L E L +E+ ++ S I + G +V+ GG + +
Sbjct: 254 NEILEALQEPLTGIVSAVMVALEQCPPELASDISERG--------MVLTGGGALLRNLDR 305

Query: 346 RVKAFIHKELNKSVNPDEVVAVGA 369
+ + + +P VA G
Sbjct: 306 LLMEETGIPVVVAEDPLTCVARGG 329


22U063_0585U063_0592N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
U063_0585-3131.071817Non-specific DNA-binding protein Dps
U063_0586-3121.006112Flagellar sensory histidine kinase FlgS
U063_0587-3111.666164hypothetical protein
U063_0588-2112.169039Flagellar P-ring protein FlgI
U063_0589-2112.350061Cold-shock DEAD-box protein A
U063_0590-291.988208putative membrane protease
U063_0591-292.125375hypothetical protein
U063_0592-3102.073151Oligopeptide transport ATP-binding protein OppD
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_0585HELNAPAPROT1502e-49 Helicobacter neutrophil-activating protein A family ...
		>HELNAPAPROT#Helicobacter neutrophil-activating protein A family

signature.
Length = 153

Score = 150 bits (379), Expect = 2e-49
Identities = 39/140 (27%), Positives = 75/140 (53%), Gaps = 1/140 (0%)

Query: 5 EILKHLQADAIVLFMKVHNFHWNVKGTDFFNVHKATEEIYEEFADMFDDLAERIVQLGHH 64
L ++ +L+ K+H FHW VKG FF +H+ EE+Y+ A+ D +AER++ +G
Sbjct: 15 NSLNTQLSNWFLLYSKLHRFHWYVKGPHFFTLHEKFEELYDHAAETVDTIAERLLAIGGQ 74

Query: 65 PLVTLSEAIKLTRVKEETKTSFHSKDIFKEILEDYKHLEKEFKELSNTAEKEGDKVTVTY 124
P+ T+ E + + + + + ++ + ++ DYK + E K + AE+ D T
Sbjct: 75 PVATVKEYTEHASITDGGNET-SASEMVQALVNDYKQISSESKFVIGLAEENQDNATADL 133

Query: 125 ADDQLAKLQKSIWMLQAHLA 144
+ +++K +WML ++L
Sbjct: 134 FVGLIEEVEKQVWMLSSYLG 153


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_0586PF06580300.014 Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 30.2 bits (68), Expect = 0.014
Identities = 10/71 (14%), Positives = 25/71 (35%), Gaps = 13/71 (18%)

Query: 281 IVLQNFLYNAIDAIEALEESEQ-GQVKIEAFIQNEFIVFTIIDNGKEVENKSALFEPFET 339
+++Q + N I + + Q G++ ++ N + + + G +
Sbjct: 258 MLVQTLVENGI--KHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTK------- 308

Query: 340 TKLKGNGLGLA 350
+ G GL
Sbjct: 309 ---ESTGTGLQ 316


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_0588FLGPRINGFLGI363e-127 Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 363 bits (934), Expect = e-127
Identities = 118/345 (34%), Positives = 191/345 (55%), Gaps = 26/345 (7%)

Query: 19 AEKIGDIASVVGVRDNQLIGYGLVIGLNGTGDK-SGSKFTMQSISNMLESVNVKISADDI 77
+I DIAS+ RDNQLIGYGLV+GL GTGD S FT QS+ ML+++ +
Sbjct: 28 TSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDSLRSSPFTEQSMRAMLQNLGITTQGGQS 87

Query: 78 KSKNVAAVMITASLPPFARQGDKIDIHISSIGDAKSIQGGTLVMTPLNAVDGNIYALAQG 137
+KN+AAVM+TA+LPPFA G ++D+ +SS+GDA S++GG L+MT L+ DG IYA+AQG
Sbjct: 88 NAKNIAAVMVTANLPPFASPGSRVDVTVSSLGDATSLRGGNLIMTSLSGADGQIYAVAQG 147

Query: 138 AITSGNSS-----------NLLSANIINGATIERGVSYDLFHKNAMVLSLKNPNFKNAIQ 186
A+ S SA + NGA IER + +VL L+NP+F A++
Sbjct: 148 ALIVNGFSAQGDAATLTQGVTTSARVPNGAIIERELPSKFKDSVNLVLQLRNPDFSTAVR 207

Query: 187 VQNTLNKI----FGNKVAIALDPKTIQITRPERFSMVEFLALVQEIPINYSAKNKIIVDE 242
V + +N +G+ +A D + I + +P + +A ++ + + K++++E
Sbjct: 208 VADVVNAFARARYGDPIAEPRDSQEIAVQKPRVADLTRLMAEIENLTVETDTPAKVVINE 267

Query: 243 KSGTIVSGVDIMVHPIVVTSQDITLKITKDP--------LNDFKNTQDLDNNMSLDTAHN 294
++GTIV G D+ + + V+ +T+++T+ P Q + M++
Sbjct: 268 RTGTIVIGADVRISRVAVSYGTLTVQVTESPQVIQPAPFSRGQTAVQPQTDIMAMQEGSK 327

Query: 295 TLSSNGKNITIAGVVKALQKIGVSAKGMVSILQALKKSGAISAEM 339
G ++ +V L IG+ A G+++ILQ +K +GA+ AE+
Sbjct: 328 VAIVEGPDLR--TLVAGLNSIGLKADGIIAILQGIKSAGALQAEL 370


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_0592HTHFIS320.004 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 32.5 bits (74), Expect = 0.004
Identities = 16/50 (32%), Positives = 21/50 (42%), Gaps = 7/50 (14%)

Query: 30 VAIVGESGSGKSSIANLIMRLNPR----FKPHNGEVLFETTNLLKESEAF 75
+ I GESG+GK +A + R F N + L ESE F
Sbjct: 163 LMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRD---LIESELF 209


23U063_0767U063_0774N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
U063_0767-1122.458083Flagellar hook protein FlgE
U063_0768-1122.235129Flagellar basal-body rod modification protein
U063_0769-1132.349464Flagellar hook-length control protein FliK
U063_0770-2131.787867Phosphate acetyltransferase
U063_0771-1191.589574Phosphate acetyltransferase
U063_07720190.783970Acetate kinase
U063_0773-1170.969368Acetate kinase
U063_07740191.712770Acetate kinase
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_0767FLGHOOKAP1357e-04 Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 35.3 bits (81), Expect = 7e-04
Identities = 12/33 (36%), Positives = 20/33 (60%)

Query: 2 NDTLLNAYSGIKTHQFGIDSLSNNIANVNTLGY 34
+ + NA SG+ Q +++ SNNI++ N GY
Sbjct: 1 SSLINNAMSGLNAAQAALNTASNNISSYNVAGY 33



Score = 33.0 bits (75), Expect = 0.004
Identities = 10/48 (20%), Positives = 20/48 (41%)

Query: 557 IRHKYLETSNVNAGNALTNLILMQRGYSMNARAFGAGDDMIKEAISLK 604
+ ++ S VN NL Q+ Y NA+ + + I+++
Sbjct: 499 LSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR 546


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_0769IGASERPTASE517e-09 IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 51.2 bits (122), Expect = 7e-09
Identities = 47/230 (20%), Positives = 79/230 (34%), Gaps = 9/230 (3%)

Query: 285 KRDKTLSKKKSEKTPTKAQTTAPSATPENAPKIPLKTPPLMPLIGANPPNDNSPTPLEKE 344
KR++T+ TP Q PS N + P+ P A TP E
Sbjct: 987 KRNQTVDTTNIT-TPNNIQADVPSVPSNNEEIARVDEAPVPPPAPA--------TPSETT 1037

Query: 345 ETTKEASDNKEKTKETNNSAQNAQNAQASDKTSENKSVTPKETIKHFTQQLKQEIQEYKP 404
ET E S + KT E N AQ + E KS T + Q E +E +
Sbjct: 1038 ETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQT 1097

Query: 405 PMSRISMDLFPKELGKVEVVIQKVGKNLKVSVISHHNSLQTFLDNQQDLKNSLNALGFEG 464
++ + + +E KVE + + V +T + + + + +
Sbjct: 1098 TETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKE 1157

Query: 465 VDLSFSQDSSKEQPKEQLRELFKEQESSPLKENALKSYQENTDNENKETS 514
+ + EQP ++ ++ + N S EN +N T+
Sbjct: 1158 PQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATT 1207



Score = 35.0 bits (80), Expect = 0.001
Identities = 51/273 (18%), Positives = 90/273 (32%), Gaps = 22/273 (8%)

Query: 6 NPIHTNASANANALNSGAKNEDTKNTPKSASKDFSKILNQKISKDKTAPKENPSALKATP 65
NP + + N N + P S N++I++ AP P+ ATP
Sbjct: 982 NPEVEKRNQTVDTTNITTPNNIQADVPSVPSN------NEEIARVDEAPVPPPA--PATP 1033

Query: 66 KNAKEGAKEDAKALEKTPTLQPQHAQNPAKDQQAPTLKDWLNHPKTHPTA-LHKTQHENH 124
E E++K KT Q A + + N T + ++ E
Sbjct: 1034 SETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETK 1093

Query: 125 ETNPKNPNETLNKNEKKPNGVTSN------SHQANLPNKNPLTPTNHANNAIKTPTTPTH 178
ET ET +++ V + + + K + T PT
Sbjct: 1094 ETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTV 1153

Query: 179 NAKESKTLKDIQ-TLSQKHDLNASNI------QATTTPENKTPLNASDQLALKTTQTPIN 231
N KE ++ + Q +SN+ T N N + T T +
Sbjct: 1154 NIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNS 1213

Query: 232 NTLAKNDARNTANLSSVLQSLEKKESHNKERTT 264
+ K R+ ++ SV ++E + + +R+T
Sbjct: 1214 ESSNKPKNRHRRSVRSVPHNVEPATTSSNDRST 1246



Score = 29.6 bits (66), Expect = 0.036
Identities = 38/225 (16%), Positives = 70/225 (31%), Gaps = 23/225 (10%)

Query: 198 LNASNIQATTTPENKTPLNASDQLALKTTQTPINNTLAKNDARNTANLSSVLQSLEKKES 257
N + P N + D+ + P T ++ N +++EK E
Sbjct: 1000 PNNIQADVPSVPSNNEEIARVDEAPV---PPPAPATPSETTETVAENSKQESKTVEKNEQ 1056

Query: 258 HNKERTTPPNNEKKTPPLKEALPMNAIKRDKTLSKKKSEKTPTKAQTTAPSATPE----- 312
E T N + K + N + S ++++T T + E
Sbjct: 1057 DATETT--AQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKV 1114

Query: 313 ------NAPKIPLKTPP-------LMPLIGANPPNDNSPTPLEKEETTKEASDNKEKTKE 359
PK+ + P + P ND + E + T +D ++ KE
Sbjct: 1115 ETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKE 1174

Query: 360 TNNSAQNAQNAQASDKTSENKSVTPKETIKHFTQQLKQEIQEYKP 404
T+++ + + T + P+ T TQ KP
Sbjct: 1175 TSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKP 1219


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_0772ACETATEKNASE943e-26 Acetate kinase family signature.
		>ACETATEKNASE#Acetate kinase family signature.

Length = 400

Score = 93.7 bits (233), Expect = 3e-26
Identities = 42/99 (42%), Positives = 61/99 (61%), Gaps = 6/99 (6%)

Query: 1 MEILVLNLGSSSIKFKLFGMKENKPLASGLAEKIGEEIGQLKIRSHLHHNEQELKEKLVI 60
M+ILV+N GSSS+K++L K+ LA GLAE+IG L N +++K K +
Sbjct: 1 MKILVINCGSSSLKYQLIESKDGNVLAKGLAERIGINDSLLTHN----ANGEKIKIKKDM 56

Query: 61 KDHASGLLMIRENLT--KMGIIKDFNQIDAIGHRVVQGG 97
KDH + ++ + L G+IKD ++IDA+GHRVV GG
Sbjct: 57 KDHKDAIKLVLDALVNSDYGVIKDMSEIDAVGHRVVHGG 95


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_0773ACETATEKNASE2742e-94 Acetate kinase family signature.
		>ACETATEKNASE#Acetate kinase family signature.

Length = 400

Score = 274 bits (703), Expect = 2e-94
Identities = 103/185 (55%), Positives = 132/185 (71%)

Query: 5 GGDKFHAPVLINEKVMQEIGKLSILAPLHNPANLAGIEFVQKAHPHIPQIAVFDTAFHAT 64
GG+ F + VLI + V++ I LAPLHNPAN+ GI+ + P +P +AVFDTAFH T
Sbjct: 94 GGEYFTSSVLITDDVLKAITDCIELAPLHNPANIEGIKACTQIMPDVPMVAVFDTAFHQT 153

Query: 65 MPSYAYMYALPYELYEKYQIRRYGFHGTSHHYVAKEAAKFLNTAYEEFNAISLHLGNGSS 124
MP YAY+Y +PYE Y KY+IR+YGFHGTSH YV++ AA+ LN E I+ HLGNGSS
Sbjct: 154 MPDYAYLYPIPYEYYTKYKIRKYGFHGTSHKYVSQRAAEILNKPIESLKIITCHLGNGSS 213

Query: 125 AVAIQKGKSVDTSMGLTPLEGLIMGTRCGDIDPTVVEYTAQCANKSLEEVMKMLNHESGL 184
A++ GKS+DTSMG TPLEGL MGTR G IDP+++ Y + N S EEV+ +LN +SG+
Sbjct: 214 IAAVKNGKSIDTSMGFTPLEGLAMGTRSGSIDPSIISYLMEKENISAEEVVNILNKKSGV 273

Query: 185 KGICG 189
GI G
Sbjct: 274 YGISG 278


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_0774ACETATEKNASE934e-26 Acetate kinase family signature.
		>ACETATEKNASE#Acetate kinase family signature.

Length = 400

Score = 92.9 bits (231), Expect = 4e-26
Identities = 36/77 (46%), Positives = 49/77 (63%)

Query: 7 IEARKEKGDKKAKLAFEMCAYRIKKYIGAYIAVLKKVDAILFTGGLGENYSALRESVCEG 66
+A + GDK+A+LA + AYR+KK IG+Y A + VD I+FT G+GEN +RE + +G
Sbjct: 287 EDAAFKNGDKRAQLALNVFAYRVKKTIGSYAAAMGGVDVIVFTAGIGENGPEIREFILDG 346

Query: 67 LENLGIALCKPTNDNPG 83
LE LG L K N G
Sbjct: 347 LEFLGFKLDKEKNKVRG 363


24U063_0821U063_0828N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
U063_0821011-1.758790hypothetical protein
U063_0822-38-0.845768LSU ribosomal protein L9p
U063_0823-210-1.220317ATP-dependent protease HslV
U063_0824-211-2.171835ATP-dependent protease ATP-binding subunit HslU
U063_0825112-2.122743GTP-binding protein Era
U063_0826212-2.274340putative periplasmic protein
U063_0827517-2.340758hypothetical protein
U063_0828819-2.364621cag pathogenicity island protein Cag Zeta Cag1
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_0821SECA300.030 SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 30.2 bits (68), Expect = 0.030
Identities = 18/82 (21%), Positives = 37/82 (45%), Gaps = 1/82 (1%)

Query: 67 IYTTNYNELLIIDG-QQRLTTITLLFIALMNYLNDEDELLEKFSRQKIQNRYLINSDEKG 125
I + E+ I G Q+RL L + + +L+ E EL E+ R++I + + K
Sbjct: 692 IPPQSLEEMWDIPGLQERLKNDFDLDLPIAEWLDKEPELHEETLRERILAQSIEVYQRKE 751

Query: 126 DKKFRLILSEPDRDTLLSLIDK 147
+ ++ ++ +L +D
Sbjct: 752 EVVGAEMMRHFEKGVMLQTLDS 773


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_0824HTHFIS290.042 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 29.0 bits (65), Expect = 0.042
Identities = 11/36 (30%), Positives = 18/36 (50%), Gaps = 3/36 (8%)

Query: 51 TPKNILMIGSTGVGKTEIARRI---AKIMELPFVKV 83
T +++ G +G GK +AR + K PFV +
Sbjct: 159 TDLTLMITGESGTGKELVARALHDYGKRRNGPFVAI 194


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_0825PF03944330.002 delta endotoxin
		>PF03944#delta endotoxin

Length = 633

Score = 32.7 bits (74), Expect = 0.002
Identities = 26/94 (27%), Positives = 47/94 (50%), Gaps = 3/94 (3%)

Query: 68 LHHQEKLLNQCMLSQALKAMGDAELCVFLASVHDDLKGYEEFLNLCQKPHILALSKIDMA 127
L E+ LNQ + + + A +AEL A+V + + + FLN + L+++
Sbjct: 94 LRETERFLNQRLNTDTV-ARVNAELTGLQANVEEFNRQVDNFLNPNRNAVPLSITSSVNT 152

Query: 128 THKQVLQKLQEYQQYDSQFLALVPLSAKKSQNLN 161
+ L +L ++Q Q L L+PL A+ + NL+
Sbjct: 153 MQQLFLNRLPQFQMQGYQLL-LLPLFAQAA-NLH 184


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_0828TYPE3IMSPROT280.008 Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein

family signature.
Length = 354

Score = 27.8 bits (62), Expect = 0.008
Identities = 9/51 (17%), Positives = 19/51 (37%), Gaps = 1/51 (1%)

Query: 35 ALGLIGAGVLCCVLSGAMGIVGIIFVAIGIFLSFSNINLVKLIEKLFKKQS 85
L+ L + S + G + I IN ++ +++F +S
Sbjct: 87 CFPLLTVAALMAIASHVV-QYGFLISGEAIKPDIKKINPIEGAKRIFSIKS 136


25U063_0884U063_0890N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
U063_0884212-0.995492hypothetical protein
U063_0885013-0.290926neuraminidase (sialidase)
U063_0886015-0.109260Dihydroorotase
U063_0887016-2.203201Ferric siderophore transport system, periplasmic
U063_0888-214-2.248033hypothetical protein
U063_0889-114-1.395130Flagellar motor switch protein FliN
U063_0890-113-0.290798Endonuclease III
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_0884TYPE3IMSPROT310.004 Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein

family signature.
Length = 354

Score = 30.5 bits (69), Expect = 0.004
Identities = 19/64 (29%), Positives = 30/64 (46%), Gaps = 4/64 (6%)

Query: 87 LQSYSVMLFFNLLLLTDILGFLPFSIYHHFMASLIFSALFCSSLFLSSPLLGVIALVALS 146
L Y F L+L+ +LPFS S + + +L PLL V AL+A++
Sbjct: 45 LSDYYFEHFSKLMLIPAEQSYLPFSQ----ALSYVVDNVLLEFFYLCFPLLTVAALMAIA 100

Query: 147 SSLL 150
S ++
Sbjct: 101 SHVV 104


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_0887TONBPROTEIN512e-09 Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 50.8 bits (121), Expect = 2e-09
Identities = 22/52 (42%), Positives = 27/52 (51%), Gaps = 1/52 (1%)

Query: 91 PQKPPTPPTPPTPPTPP-KPIEKPKPEPKPKPKPKPEPKKPDHKHKALKKVE 141
P P P P P P P+ KP+PKPKPKPKP K + + +K VE
Sbjct: 67 PVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPVE 118



Score = 43.8 bits (103), Expect = 3e-07
Identities = 39/234 (16%), Positives = 72/234 (30%), Gaps = 59/234 (25%)

Query: 84 PKPTLAGPQKPPTPPTPPTPPTPPKPIEKPKPEPKPKPKPKPEPKKPDHKHKALKKVEKV 143
P + P +P P P P P P E P KPKPKPKP+PK V+KV
Sbjct: 57 PPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPK----------PVKKV 106

Query: 144 EEKKIVEEKKEEKKIVEQKVEQKVEHKKVEEKKPVKKEFDPNQLSFLPKEVAPPRQENNK 203
+E+ K++ K + + ++ + + +
Sbjct: 107 QEQP----KRDVKPVESRPASPFENTAPARLTSSTATAATSKPVTSVASGPRALSRNQPQ 162

Query: 204 GLDNQTRRDIDELYGEEFGDLGTAEKDFIRNNLRDIGRITQKYLEYPQVAAYLGQDGTNA 263
YP A L +G
Sbjct: 163 ---------------------------------------------YPARAQALRIEGQVK 177

Query: 264 VEFYLHPNGDITDLKIIIGSEYKMLDDNTLKTIQIAYKDYPRPKTKTLIRIRVR 317
V+F + P+G + +++I+ M + ++ + +P + ++ I +
Sbjct: 178 VKFDVTPDGRVDNVQILSAKPANMFEREVKNAMRRWRYEPGKPGSGIVVNILFK 231



Score = 43.4 bits (102), Expect = 4e-07
Identities = 21/68 (30%), Positives = 27/68 (39%), Gaps = 1/68 (1%)

Query: 83 APKPTLAGPQKPPTPPTPPTPPTPPKPIEKPKPEPKPKPKPKPEPKKPDHKHKALKKVEK 142
A Q PP P P P P P PK P KPKP+PK K +++ K
Sbjct: 53 ADLEPPQAVQPPPEPVVEPEPEPEPIPEP-PKEAPVVIEKPKPKPKPKPKPVKKVQEQPK 111

Query: 143 VEEKKIVE 150
+ K +
Sbjct: 112 RDVKPVES 119



Score = 40.7 bits (95), Expect = 3e-06
Identities = 18/54 (33%), Positives = 25/54 (46%)

Query: 76 PSKNTQGAPKPTLAGPQKPPTPPTPPTPPTPPKPIEKPKPEPKPKPKPKPEPKK 129
P + Q P+P + +P P PP KPKP+PKPKP K + +
Sbjct: 57 PPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQP 110



Score = 37.3 bits (86), Expect = 4e-05
Identities = 17/60 (28%), Positives = 23/60 (38%), Gaps = 2/60 (3%)

Query: 74 QDPSKNTQGAPKPTLAGPQKPPTPPTPPTPPTPPKPIEKPKPEPKPKP--KPKPEPKKPD 131
Q + +P P P P PKP KPKP+P K +PK + K +
Sbjct: 59 QAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPVE 118



Score = 36.5 bits (84), Expect = 8e-05
Identities = 24/67 (35%), Positives = 34/67 (50%), Gaps = 1/67 (1%)

Query: 95 PTPPTPPTPPTPP-KPIEKPKPEPKPKPKPKPEPKKPDHKHKALKKVEKVEEKKIVEEKK 153
P PP PP +P+ +P+PEP+P P+P E K K K + KK+ E+ K
Sbjct: 52 PADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPK 111

Query: 154 EEKKIVE 160
+ K VE
Sbjct: 112 RDVKPVE 118



Score = 35.0 bits (80), Expect = 2e-04
Identities = 16/58 (27%), Positives = 24/58 (41%)

Query: 74 QDPSKNTQGAPKPTLAGPQKPPTPPTPPTPPTPPKPIEKPKPEPKPKPKPKPEPKKPD 131
+P + P+P P++ P P P PKP K + +PK KP +P
Sbjct: 65 PEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPVESRPA 122



Score = 31.1 bits (70), Expect = 0.005
Identities = 13/53 (24%), Positives = 17/53 (32%)

Query: 75 DPSKNTQGAPKPTLAGPQKPPTPPTPPTPPTPPKPIEKPKPEPKPKPKPKPEP 127
+P P + P P P P K E+PK + KP P
Sbjct: 72 EPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPVESRPASP 124


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_0889FLGMOTORFLIN992e-30 Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 99 bits (249), Expect = 2e-30
Identities = 25/77 (32%), Positives = 47/77 (61%)

Query: 34 LICDYKNLLDMEIVFSAELGSTQIPLLQILRFEKGSVIDLQKPAGESVDTFVNGRVIGKG 93
+ D ++D+ + + ELG T++ + ++LR +GSV+ L AGE +D +NG +I +G
Sbjct: 50 AMQDIDLIMDIPVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQG 109

Query: 94 EVMVFERNLAIRLNEIL 110
EV+V +R+ +I+
Sbjct: 110 EVVVVADKYGVRITDII 126


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_0890OMS28PORIN280.029 OMS28 porin signature.
		>OMS28PORIN#OMS28 porin signature.

Length = 257

Score = 27.8 bits (61), Expect = 0.029
Identities = 28/112 (25%), Positives = 53/112 (47%), Gaps = 11/112 (9%)

Query: 22 NQTTELHHKNPYELLVATILSAQCTDARVNQITPKLFEKYPSVSDLAL-----ASLEEVK 76
N+ E+ K E A ++ + T QI + K P+ +L L A +E+VK
Sbjct: 132 NKVVEMSKKAVQETQKAVSVAGEATFLIEKQI---MLNKSPNNKELELTKEEFAKVEQVK 188

Query: 77 EIIQSVSYFNNKSKHLISMAQKVVRDFKGVIPSTQKELMSLDGVGQKTANVV 128
E + + +++ + AQKV+ G+ PS + ++++ V + +NVV
Sbjct: 189 ETLMASERALDET---VQEAQKVLNMVNGLNPSNKDQVLAKKDVAKAISNVV 237


26U063_0912U063_0920N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
U063_091229-0.248698membrane fusion protein MtrC
U063_091319-0.313266Acriflavin resistance protein
U063_0914111-0.982839hypothetical protein
U063_0915011-0.979804vacuolating cytotoxin-like protein
U063_0916-214-1.551696ABC transporter, permease
U063_0917-212-0.362121ABC transporter, ATP-binding protein
U063_0918-110-0.120145hypothetical protein
U063_0919-211-0.085930DNA ligase
U063_0920-210-0.565087Chemotaxis protein CheV
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_0912RTXTOXIND502e-09 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 50.2 bits (120), Expect = 2e-09
Identities = 22/69 (31%), Positives = 34/69 (49%)

Query: 40 STGIVDSIKVTEGSVVKKGDVLLLLYNQDKQAQSDSTEQQLIFAKKQYQRYSKIGGAVDK 99
IV I V EG V+KGDVLL L +A + T+ L+ A+ + RY + +++
Sbjct: 103 ENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIEL 162

Query: 100 NTLEGYEFT 108
N L +
Sbjct: 163 NKLPELKLP 171



Score = 31.0 bits (70), Expect = 0.005
Identities = 23/152 (15%), Positives = 50/152 (32%), Gaps = 25/152 (16%)

Query: 70 QAQSDSTEQQLIFAKKQYQR--YSKIGGAVDKNTLEGYEFTYRRLESDYAYSIAVLNKTI 127
+++ S +++ + ++ K+ D L L + A + ++
Sbjct: 279 ESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGL---------LTLELAKNEERQQASV 329

Query: 128 LRAPFDGVIASKNIQVGEGVSANNTVLLRLVSHARKLVIE--FDSKYINAVKVG------ 179
+RAP + + GV L+ +V L + +K I + VG
Sbjct: 330 IRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIK 389

Query: 180 -DTYTYSIDGDSNQHEAKITKIYP--TVDENT 208
+ + Y+ G K+ I D+
Sbjct: 390 VEAFPYTRYGYL---VGKVKNINLDAIEDQRL 418


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_0913ACRIFLAVINRP8970.0 Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 897 bits (2320), Expect = 0.0
Identities = 285/1038 (27%), Positives = 517/1038 (49%), Gaps = 40/1038 (3%)

Query: 1 MYKTAINRPITTLMFALAIVFFGTMGFKKLSVALFPKIDLPTVVVTTTYPGASAEIIESK 60
M I RPI + A+ ++ G + +L VA +P I P V V+ YPGA A+ ++
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 61 VTDKIEEAVMGIDGIKKVTSTSSKNVSIVV-IEFELEKPNEEALNDVVNKISSVR-FDDS 118
VT IE+ + GID + ++STS S+ + + F+ + A V NK+
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 119 NIKKPSINKFDTDSQAIISLFVSSSSVPAT--TLNDYAKNTIKPMLQKINGVGGVQLNGF 176
+++ I+ + S ++ S + T ++DY + +K L ++NGVG VQL G
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFG- 179

Query: 177 RERQIRIYADPTLMNKYNLTYADLFSTLKAENVEIDGGRIVNS------QRELSILINAN 230
+ +RI+ D L+NKY LT D+ + LK +N +I G++ + Q SI+
Sbjct: 180 AQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239

Query: 231 SYSVADVEKIQV-----GNHVRLGDIAKIEIGLEEDNTFASFKDKPGVILEIQKIAGANE 285
+ + K+ + G+ VRL D+A++E+G E N A KP L I+ GAN
Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299

Query: 286 IEIVDRVYEALKHIQAISP-SYEIRPFLDTTGYIRTSIEDVKFDLILGAILAVLVVFAFL 344
++ + L +Q P ++ DTT +++ SI +V L +L LV++ FL
Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359

Query: 345 RNGTITLVSAISIPISIMGTFALIQWMGFSLNMLTMVALTLAIGIIIDDAIVVIENIHK- 403
+N TL+ I++P+ ++GTFA++ G+S+N LTM + LAIG+++DDAIVV+EN+ +
Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419

Query: 404 KLEMGMNKRKASYEGVREIGFALVAISAMLLSVFVPIGNMKGIIGRFFQSFGITVALAIA 463
+E + ++A+ + + +I ALV I+ +L +VF+P+ G G ++ F IT+ A+A
Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479

Query: 464 LSYVVVVTIIPMVSSVVVNPRHS-------RFYVWSEPFFKALESRYTKLLQWVLNHKLI 516
LS +V + + P + + ++ P + F+ W F + YT + +L
Sbjct: 480 LSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539

Query: 517 ISIAVVLVFVGSLFVASKLGMEFMLKEDRGRFLVWLKAKPGVSIDY----MTQKSKIFQK 572
+ L+ G + + +L F+ +ED+G FL ++ G + + + Q + + K
Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599

Query: 573 AIEKHAEVEFTTLQVGY-GTTQNPFKAKIFVQLKPLKERKKEGELGQFELMSALRKELKS 631
+ + E FT + G QN FV LKP +ER + ++ + EL
Sbjct: 600 NEKANVESVFTVNGFSFSGQAQNA--GMAFVSLKPWEERNGDENS-AEAVIHRAKMELGK 656

Query: 632 MPEAKGLDTINLSEVTLLGGGGDSSPFQTFVFSHSQEAVDKSVANLRKFLLESPELKGKI 691
+ + + N+ + L G ++ F + + D + L + + +
Sbjct: 657 IRDGFVI-PFNMPAIVEL---GTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASL 712

Query: 692 EGYHTSTSESQPQLQLKILRQNANKYGVSAQTIGSVVSSAFSGTSQASVFKEDGKEYDMI 751
+ E Q +L++ ++ A GVS I +S+A G + + F + G+ +
Sbjct: 713 VSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGG-TYVNDFIDRGRVKKLY 771

Query: 752 IRVPDNKRVSVEDIKRLQVRNKYDKLMFLDALVEITETKSPSSISRYNRQRSVTVLAQPK 811
++ R+ ED+ +L VR+ +++ A + RYN S+ + +
Sbjct: 772 VQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAA 831

Query: 812 AGISLGEILTQVSKNTKEWLVEGANYRFTGEADNAKETNGEFLVALATAFVLIYMILAAL 871
G S G+ + + +N L G Y +TG + + + + +A +FV++++ LAAL
Sbjct: 832 PGTSSGDAMALM-ENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAAL 890

Query: 872 YESILEPFIIMVTMPLSFSGAFFALGLVHQPLSMFSMIGLILLIGMVGKNATLLIDVANE 931
YES P +M+ +PL G A L +Q ++ M+GL+ IG+ KNA L+++ A +
Sbjct: 891 YESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKD 950

Query: 932 -ERKKGLNIQEAILFAGKTRLRPILMTTIAMVCGMLPLALASGDGAAMKSPIGIAMSGGL 990
K+G + EA L A + RLRPILMT++A + G+LPLA+++G G+ ++ +GI + GG+
Sbjct: 951 LMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGM 1010

Query: 991 MISMVLSLLIVPVFYRLL 1008
+ + +L++ VPVF+ ++
Sbjct: 1011 VSATLLAIFFVPVFFVVI 1028


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_0915VACCYTOTOXIN2772e-77 Helicobacter pylori vacuolating cytotoxin signature.
		>VACCYTOTOXIN#Helicobacter pylori vacuolating cytotoxin signature.

Length = 1291

Score = 277 bits (709), Expect = 2e-77
Identities = 104/397 (26%), Positives = 182/397 (45%), Gaps = 14/397 (3%)

Query: 2803 AGNNSLMWLNALFVAKGGNPLFAPYYLQDTPTKHIVTLMKDITSALGMLSKPNLKNNSTD 2862
+G L L + + +A + T I + T+ L ++ K +
Sbjct: 904 SGAQGRDLLQTLLI-DSHDAGYARTMIDATSANEITKQLNTATTTLNNIASLEHKTSGLQ 962

Query: 2863 VLQLNTYTQQMGRLAKLSNFASFDSTDFSERLSSLKNQKFADAIPNAMDVILKYSQRDKL 2922
L L+ RL LS + F++RL +LK+Q+FA + +A +V+ +++ + +
Sbjct: 963 TLSLSNAMILNSRLVNLSRRHTNHIDSFAKRLQALKDQRFAS-LESAAEVLYQFAPKYEK 1021

Query: 2923 KNNLWATGVGGVSFVENGTGTLYGVNVGYDRFIKG---VIVGGYAAYGYSGFYER--ITS 2977
N+WA +GG S G +LYG + G D ++ G IVGG+ +YGYS F + +
Sbjct: 1022 PTNVWANAIGGTSLNSGGNASLYGTSAGVDAYLNGEVEAIVGGFGSYGYSSFSNQANSLN 1081

Query: 2978 SKSDNVDVGLYARTFIKKSELTFSVNETWGANKTQISSADTLLSMINQSYKYNTWTTNAR 3037
S ++N + G+Y+R F + E F G++++ ++ LL +NQSY Y ++ R
Sbjct: 1082 SGANNTNFGVYSRIFANQHEFDFEAQGALGSDQSSLNFKSALLRDLNQSYNYLAYSAATR 1141

Query: 3038 VNYGYDFMFKNKSIILKPQIGLRYYYIGMTGLEGVMHNALYNQFKANADPSKKSVLTIDF 3097
+YGYDF F +++LKP +G+ Y ++G T + + S + +
Sbjct: 1142 ASYGYDFAFFRNALVLKPSVGVSYNHLGSTNFKS----NSNQKVALKNGASSQHLFNASA 1197

Query: 3098 AFENRHYFNTNSYFYAIGGIGRDLLVRSMGDKLVRFIGDNTLSYREGELYNTFASITTGG 3157
E R+Y+ SYFY G+ ++ + V + R NT A + GG
Sbjct: 1198 NVEARYYYGDTSYFYMNAGVLQEFANFGSSNA-VSLNTFKVNATRNP--LNTHARVMMGG 1254

Query: 3158 EVRLFKSFYANAGVGARFGLDYKMINITGNIGMRLAF 3194
E++L K + N G L + + N+GMR +F
Sbjct: 1255 ELKLAKEVFLNLGFVYLHNLISNIGHFASNLGMRYSF 1291



Score = 35.8 bits (82), Expect = 0.004
Identities = 14/100 (14%), Positives = 30/100 (30%), Gaps = 5/100 (5%)

Query: 702 SYTFDGANNAFNENKFNGGSFSFNHAEQTNTFNNNSFNGGSFSFNAKRVDFNHNSFNGGV 761
SY+ + E FN + ++A Q +N + G+ + N + G
Sbjct: 272 SYSTINTSKVTGEVNFNHLTVGDHNAAQAGIIASNKTHIGTLDLW-QSAGLNIIAPPEGG 330

Query: 762 FNF---NNTPKVSFTDDTFNVNNQFKING-AQTTFTFNKG 797
+ + + + + + N Q N
Sbjct: 331 YKDKPNDKPSNTTQNNAKNDKQESSQNNSNTQVINPPNSA 370


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_0918LCRVANTIGEN315e-04 Low calcium response V antigen signature.
		>LCRVANTIGEN#Low calcium response V antigen signature.

Length = 326

Score = 31.2 bits (70), Expect = 5e-04
Identities = 16/33 (48%), Positives = 20/33 (60%)

Query: 16 KRKKLLTELAELEAEIKVSSERRSSFNVSLSPS 48
R KL ELAEL AE+K+ S ++ N LS S
Sbjct: 149 ARSKLREELAELTAELKIYSVIQAEINKHLSSS 181


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_0920HTHFIS542e-10 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 54.5 bits (131), Expect = 2e-10
Identities = 24/110 (21%), Positives = 44/110 (40%), Gaps = 6/110 (5%)

Query: 194 ILIAEDSLSALKTLEKIVQTLELRYLAFPNGRELLDYLYEKEHYQQVGVVITDLEMPVIS 253
IL+A+D + L + + N L ++ +V+TD+ MP +
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAA----GDGDLVVTDVVMPDEN 61

Query: 254 GFEVLKTIKADSRTEHLPVIINSSMSSDSNRQLAQSLEADGFVVKSNILE 303
F++L IK LPV++ S+ ++ A A ++ K L
Sbjct: 62 AFDLLPRIK--KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLT 109


27U063_1295U063_1303N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
U063_1295-38-0.191909dicarboxylic acid transporter PcaT
U063_1296-38-0.712878Flagellar basal-body rod protein FlgG
U063_1297-38-0.703784hypothetical protein
U063_1298-3111.066460hypothetical protein
U063_1299-3101.339557Translation elongation factor LepA
U063_1300-2111.2766951-deoxy-D-xylulose 5-phosphate synthase
U063_1301-2100.358740Flagellar assembly protein FliH
U063_1302-290.786246Flagellar motor switch protein FliG
U063_1303-190.821520Flagellar M-ring protein FliF
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_1295TCRTETB401e-05 Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 40.2 bits (94), Expect = 1e-05
Identities = 58/315 (18%), Positives = 104/315 (33%), Gaps = 67/315 (21%)

Query: 35 APYFAKEFTHTNDPTLALISAFLVFMLGFFMRPLGSLFFGKLGDKKGRKTSMVYSIILMA 94
P A +F T + +AF++ G+ +GKL D+ G K +++ II+
Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSI------GTAVYGKLSDQLGIKRLLLFGIIINC 90

Query: 95 LGSFLLALLPTKEIVGEWAFLFLLLARLLQGFSVGGE------YGVVATYLSELGKNGKK 148
GS + VG F L++AR +QG G VVA Y+ + +
Sbjct: 91 FGSVIGF-------VGHSFFSLLIMARFIQG--AGAAAFPALVMVVVARYIPKENRGKAF 141

Query: 149 GFYGSFQYVTLVGGQLLAIFSLFIVENIYTHEQISAFAWRYLFALGGILALLSLFLRNIM 208
G GS + +G + I I+ W YL + I + FL ++
Sbjct: 142 GLIGS---IVAMGEGVGPAIGGMIAHYIH---------WSYLLLIPMITIITVPFLMKLL 189

Query: 209 EETMDSQTTSKTTIKEETQRGSLKELLNHKKALM-------IVFGLTMGGSLCFYTFTVY 261
+ + +K + K ++ + T +
Sbjct: 190 K-----------------KEVRIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLS 232

Query: 262 LKIFLTNSSSFSPK-------ESSFIMLLALSYFIFLQPLCG---MLADKIKRTQMLMVF 311
IF+ + + ++ M+ L I + G M+ +K L
Sbjct: 233 FLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTA 292

Query: 312 AIAGLIVTPVVFYGI 326
I +I+ P I
Sbjct: 293 EIGSVIIFPGTMSVI 307


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_1296FLGHOOKAP1300.008 Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 30.3 bits (68), Expect = 0.008
Identities = 9/40 (22%), Positives = 16/40 (40%)

Query: 3 NGYYAATGAMATQFNRLDLTSNNLANLNTNGFKRDDAITG 42
+ A + L+ SNN+++ N G+ R I
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMA 41


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_1299TCRTETOQM1147e-29 Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family

signature.
Length = 639

Score = 114 bits (288), Expect = 7e-29
Identities = 54/162 (33%), Positives = 89/162 (54%), Gaps = 7/162 (4%)

Query: 3 NIRNFSIIAHIDHGKSTLADCLISECNAIS---NREMKSQVMDTMDIEKERGITIKAQSV 59
I N ++AH+D GK+TL + L+ AI+ + + + D +E++RGITI+
Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61

Query: 60 RLNYTFKGEDYVLNLIDTPGHVDFSYEVSRSLCSCEGALLVVDATQGVEAQTIANTYIAL 119
+F+ E+ +N+IDTPGH+DF EV RSL +GA+L++ A GV+AQT +
Sbjct: 62 ----SFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALR 117

Query: 120 DNNLEILPVINKIDLPNANVLEVKQDIEDTIGIDCSNANEVS 161
+ + INKID ++ V QDI++ + + +V
Sbjct: 118 KMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVE 159



Score = 81.8 bits (202), Expect = 2e-18
Identities = 50/215 (23%), Positives = 90/215 (41%), Gaps = 17/215 (7%)

Query: 161 SAKAKLGIKDLLEKIITTIPAPSGDFNAPLKALIYDSWFDNYLGALALVRIMDGSINTEQ 220
SAK +GI +L+E I + + + L ++ + LA +R+ G ++
Sbjct: 220 SAKNNIGIDNLIEVITNKFYSSTHRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRD 279

Query: 221 EILVMGTGKKHGVLGLYYPNPLKKIPTKSLECGEIGIV---SLGLKSVTDIAVGDTLTDA 277
+ + K + +Y + GEI I+ L L SV +GDT
Sbjct: 280 SVRISEKEKI-KITEMYTSINGELCKIDKAYSGEIVILQNEFLKLNSV----LGDTKLL- 333

Query: 278 KNPTFKPIEGFMPAKPFVFAGLYPIETDRFEDLREALLKLQLNDCALNFEPESSVALGFG 337
P + IE P + + P + + E L +ALL++ +D L + +S+
Sbjct: 334 --PQRERIEN---PLPLLQTTVEPSKPQQREMLLDALLEISDSDPLLRYYVDSATH---E 385

Query: 338 FRVGFLGLLHMEVIKERLEREFSLNLIATAPTVVY 372
+ FLG + MEV L+ ++ + + PTV+Y
Sbjct: 386 IILSFLGKVQMEVTCALLQEKYHVEIEIKEPTVIY 420



Score = 31.0 bits (70), Expect = 0.014
Identities = 15/75 (20%), Positives = 28/75 (37%), Gaps = 2/75 (2%)

Query: 399 IKEPFVRATIITPSEFLGNLMQLLNNKRGIQEKMEYLNQSRVMLTYSLPSNEIVMDFYDK 458
+ EP++ I P E+L + L + V+L+ +P+ I ++
Sbjct: 535 LLEPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQ-LKNNEVILSGEIPARCI-QEYRSD 592

Query: 459 LKSCTKGYASFDYEP 473
L T G + E
Sbjct: 593 LTFFTNGRSVCLTEL 607


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_1302FLGMOTORFLIG351e-123 Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 351 bits (902), Expect = e-123
Identities = 122/338 (36%), Positives = 209/338 (61%), Gaps = 4/338 (1%)

Query: 8 KQKAQLDELSMSEKIAILLIQVGEDTTGEILRHLDIDSITEISKQIVQLNGTDKQIGAAV 67
K+ + L+ +K AILL+ +G + + ++ ++L + I ++ +I +L ++ V
Sbjct: 7 KEILDVSALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITSELKDNV 66

Query: 68 LEEFFAIFQSNQYINTGGLEYARELLTRTLGSEEAKKVMDKLTKSLQTQKNFAYLGKIKP 127
L EF + + ++I GG++YARELL ++LG+++A +++ L +LQ+ + F ++ + P
Sbjct: 67 LLEFKELMMAQEFIQKGGIDYARELLEKSLGTQKAVDIINNLGSALQS-RPFEFVRRADP 125

Query: 128 QQLADFIINEHPQTIALILAHMEAPNAAETLSYFPDEMKAEISIRMANLGEISPQVVKRV 187
+ +FI EHPQTIALIL++++ A+ LS P E++ ++ R+A + SP+VV+ V
Sbjct: 126 ANILNFIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEVVREV 185

Query: 188 STVLENKLESLTSYK-IEVGGLRAVAEIFNRLGQKSAKTTLARIESVDNKLAGAIKEMMF 246
VLE KL SL+S GG+ V EI N +K+ K + +E D +LA IK+ MF
Sbjct: 186 ERVLEKKLASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEIKKKMF 245

Query: 247 TFEDIVKLDNFAIREILKVADKKDLSLALKTSTKDLTDKFLNNMSSRAAEQFVEEMQYLG 306
FEDIV LD+ +I+ +L+ D ++L+ ALK+ + +K NMS RAA E+M++LG
Sbjct: 246 VFEDIVLLDDRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKEDMEFLG 305

Query: 307 AVKIKDVDVAQRKIIEIVQSLQEKG--VIQTGEEEDVI 342
+ KDV+ +Q+KI+ +++ L+E+G VI G EEDV+
Sbjct: 306 PTRRKDVEESQQKIVSLIRKLEEQGEIVISRGGEEDVL 343



Score = 30.2 bits (68), Expect = 0.010
Identities = 20/102 (19%), Positives = 41/102 (40%), Gaps = 3/102 (2%)

Query: 4 KLTPKQKAQLDELSMSEKIAILLIQVGEDTTGEILRHLDIDSITEISKQIVQLNGTDKQI 63
+ P + + IA++L + IL L + T ++++I ++ T ++
Sbjct: 122 RADPANILNFIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEV 181

Query: 64 GAA---VLEEFFAIFQSNQYINTGGLEYARELLTRTLGSEEA 102
VLE+ A S Y + GG++ E++ E
Sbjct: 182 VREVERVLEKKLASLSSEDYTSAGGVDNVVEIINMADRKTEK 223


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_1303FLGMRINGFLIF5540.0 Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 554 bits (1429), Expect = 0.0
Identities = 177/583 (30%), Positives = 294/583 (50%), Gaps = 66/583 (11%)

Query: 11 VDFFIKLNKKQKIALIAAGVLITALLVFLLLYPFKEKDYTQGGYGVLFEGLDSSDNALIL 70
+++ +L +I LI AG A++V ++L+ K DY LF L D I+
Sbjct: 13 LEWLNRLRANPRIPLIVAGSAAVAIVVAMVLWA-KTPDYR-----TLFSNLSDQDGGAIV 66

Query: 71 QHLQQNQIPYKVSRDD-TILIPKDKVYEERITLASQGIPKTSKVGFEIFDTKDFGATDFD 129
L Q IPY+ + I +P DKV+E R+ LA QG+PK VGFE+ D + FG + F
Sbjct: 67 AQLTQMNIPYRFANGSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQFS 126

Query: 130 QNIKLIRAIEGELSRTIESLNPILKANVHIAIPKDSVFVAKEVPPSASVMLKLKPDMKLS 189
+ + RA+EGEL+RTIE+L P+ A VH+A+PK S+FV ++ PSASV + L+P L
Sbjct: 127 EQVNYQRALEGELARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRALD 186

Query: 190 PTQILGIKNLIAAAVPKLTIENVKIVNENGESIGEGDILENSKELALEQLHYKQNFENIL 249
QI + +L+++AV L NV +V+++G + + + + ++L QL + + E+ +
Sbjct: 187 EGQISAVVHLVSSAVAGLPPGNVTLVDQSGHLLTQSNT--SGRDLNDAQLKFANDVESRI 244

Query: 250 ENKIVNILAPIVGGKNKVVARVNAEFDFSQKKSTKETFDPNN-----VVRSEQNLEEKKE 304
+ +I IL+PIVG N V A+V A+ DF+ K+ T+E + PN +RS Q ++
Sbjct: 245 QRRIEAILSPIVGNGN-VHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNISEQV 303

Query: 305 GAPKKQVGGVPGVVSN-IGPVQGLKDNKEQEKYEKSQN---------------------- 341
GA GGVPG +SN P + +QN
Sbjct: 304 GAGYP--GGVPGALSNQPAPPNEAPIATPPTNQQNAQNTPQTSTSTNSNSAGPRSTQRNE 361

Query: 342 TTNYEVGKTISEIKGEFGTLVRLNAAVVVDGRYKIALKDGANALEYEPLSDESLKKINAL 401
T+NYEV +TI K G + RL+ AVVV+ + L DG + PL+ + +K+I L
Sbjct: 362 TSNYEVDRTIRHTKMNVGDIERLSVAVVVNYK---TLADG----KPLPLTADQMKQIEDL 414

Query: 402 VKQAIGYNQNRGDDVAVSNFEFNPITPMLDNATLSEKIMHKTQKILGSFTPLIKYVLVFI 461
++A+G++ RGD + V N F+ + T E + Q + +++LV +
Sbjct: 415 TREAMGFSDKRGDTLNVVNSPFSAVDN-----TGGELPFWQQQSFIDQLLAAGRWLLVLV 469

Query: 462 VLFIFYKKVIVPFSERMLEVVPDEDKEVKSMFEEMDEEEDELNKLGDLRKKVEDQLGLNA 521
V +I ++K + P R +E ++ + E + E L+K L+++ +Q
Sbjct: 470 VAWILWRKAVRPQLTRRVEEAKAAQEQAQVRQETEEAVEVRLSKDEQLQQRRANQ----- 524

Query: 522 TFSEEEVRYEIVLEKIRGTLKERPDEIAMLFKLLIKDEISSDN 564
+ E++ ++IR E D + L+I+ +S+D+
Sbjct: 525 -----RLGAEVMSQRIR----EMSDNDPRVVALVIRQWMSNDH 558


28U063_1368U063_1374N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
U063_1368-113-0.568835hypothetical protein
U063_1369-1110.913303D-3-phosphoglycerate dehydrogenase
U063_1370-2130.508107decarboxylase
U063_1371-1130.669451hypothetical protein
U063_1372-1130.858101hypothetical protein
U063_13730131.188071Chemotaxis protein CheV
U063_1374-2120.772578Signal transduction histidine kinase CheA
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_1368V8PROTEASE300.003 V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 30.4 bits (68), Expect = 0.003
Identities = 14/48 (29%), Positives = 23/48 (47%)

Query: 46 TNEGLSQTDAKSHEINLEESPNNPNTPNDEKAPHNEENRNNALSQNLD 93
N+ N ++PNNP+ PN+ P+N +N +N + N D
Sbjct: 284 ANDDQPNNPDNPDNPNNPDNPNNPDEPNNPDNPNNPDNPDNGDNNNSD 331



Score = 26.9 bits (59), Expect = 0.049
Identities = 12/30 (40%), Positives = 18/30 (60%)

Query: 65 SPNNPNTPNDEKAPHNEENRNNALSQNLDA 94
+P+ PN P++ P N +N +N S N DA
Sbjct: 306 NPDEPNNPDNPNNPDNPDNGDNNNSDNPDA 335


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_1371ALARACEMASE320.002 Alanine racemase signature.
		>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 31.7 bits (72), Expect = 0.002
Identities = 9/43 (20%), Positives = 15/43 (34%), Gaps = 1/43 (2%)

Query: 136 GVMPEETLETYSQISETCKRLKLKGLMCIGAHADDEKKIEKSF 178
G P+ L + Q+ + LM A A+ I +
Sbjct: 132 GFQPDRVLTVWQQL-RAMANVGEMTLMSHFAEAEHPDGISGAM 173


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_1373HTHFIS603e-12 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 59.8 bits (145), Expect = 3e-12
Identities = 29/129 (22%), Positives = 50/129 (38%), Gaps = 13/129 (10%)

Query: 181 GEVLFLDDSKTARKTLKNHLSKLGFSITEAVDGEDGLNKLEMLFKKYGDDLRKHLKFIIS 240
+L DD R L LS+ G+ + + + +++
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIA----------AGDGDLVVT 53

Query: 241 DVEMPKMDGYHFLFKLQKDPRFAYIPVIFNSSICDNYSAERAKEMGAVAYLVK-FDAEKF 299
DV MP + + L +++K +PV+ S+ +A +A E GA YL K FD +
Sbjct: 54 DVVMPDENAFDLLPRIKK--ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTEL 111

Query: 300 TEEISKILD 308
I + L
Sbjct: 112 IGIIGRALA 120


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_1374HTHFIS541e-09 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 54.1 bits (130), Expect = 1e-09
Identities = 24/121 (19%), Positives = 54/121 (44%), Gaps = 4/121 (3%)

Query: 683 VLAIDDSSTDRAIIRKCLKPLGITLLEATNGLEGLEMLKNGDKIPDAILVDIEMPKMDGY 742
+L DD + R ++ + L G + +N + GD D ++ D+ MP + +
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGD--GDLVVTDVVMPDENAF 63

Query: 743 TFASEVRKYNKFKNLPLIAVTSRVTKTDRMCGVESGMTEYITKPYSGEYLTTVVKRSIKL 802
++K +LP++ ++++ T + E G +Y+ KP+ L ++ R++
Sbjct: 64 DLLPRIKKAR--PDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 803 E 803

Sbjct: 122 P 122


29U063_1471U063_1478N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
U063_14710140.157058Inner membrane protein translocase component
U063_14720110.248823RNA-binding protein Jag
U063_14730100.337940RNA-binding protein Jag
U063_1474090.680355GTPase and tRNA-U34 5-formylation enzyme TrmE
U063_14752111.160343Outer membrane protein HomD
U063_1476-1120.634360hypothetical protein
U063_1477-2131.958592hypothetical protein
U063_1478-2122.083550membrane-associated lipoprotein Lpp20
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_147160KDINNERMP425e-146 60kDa inner membrane protein signature.
		>60KDINNERMP#60kDa inner membrane protein signature.

Length = 548

Score = 425 bits (1094), Expect = e-146
Identities = 133/399 (33%), Positives = 223/399 (55%), Gaps = 32/399 (8%)

Query: 175 DLNTLTIIKTLTFYDDLHYDLKIAFKSPN--------NIIPSYVITNGYRPVADLDS--- 223
D T KT Y + + + N + + P D S
Sbjct: 155 DAAGNTFTKTFVLKRG-DYAVNVNYNVQNAGEKPLEISSFGQLKQSITLPPHLDTGSSNF 213

Query: 224 --YTFSGVLLENNDKKIEKIE---DKDAKEIKRFSNTLFLSSVDRYFTTLLFTKDSQGFE 278
+TF G D+K EK + D + + S +++ + +YF T + G
Sbjct: 214 ALHTFRGAAYSTPDEKYEKYKFDTIADNENLNISSKGGWVAMLQQYFATAWIPHN-DGTN 272

Query: 279 ALIDSEIGTKNPLGFISLKNEA-----------NLHGYIGPKDYRSLKAISPMLTDVIEY 327
+ +G N + I K++ N ++GP+ + A++P L ++Y
Sbjct: 273 NFYTANLG--NGIAAIGYKSQPVLVQPGQTGAMNSTLWVGPEIQDKMAAVAPHLDLTVDY 330

Query: 328 GLITFFAKGVFVLLDYLYQFVGNWGWAIILLTIIVRLILYPLSYKGMVSMQKLKELAPKM 387
G + F ++ +F LL +++ FVGNWG++II++T IVR I+YPL+ SM K++ L PK+
Sbjct: 331 GWLWFISQPLFKLLKWIHSFVGNWGFSIIIITFIVRGIMYPLTKAQYTSMAKMRMLQPKI 390

Query: 388 KELQEKYKGEPQKLQAHMMQLYKKHGANPLGGCLPLILQIPVFFAIYRVLYNAVELKSSE 447
+ ++E+ + Q++ MM LYK NPLGGC PL++Q+P+F A+Y +L +VEL+ +
Sbjct: 391 QAMRERLGDDKQRISQEMMALYKAEKVNPLGGCFPLLIQMPIFLALYYMLMGSVELRQAP 450

Query: 448 WILWIHDLSIMDPYFILPLLMGASMYWHQSVTPNTMTDPMQAKIFKLLPLLFTIFLITFP 507
+ LWIHDLS DPY+ILP+LMG +M++ Q ++P T+TDPMQ KI +P++FT+F + FP
Sbjct: 451 FALWIHDLSAQDPYYILPILMGVTMFFIQKMSPTTVTDPMQQKIMTFMPVIFTVFFLWFP 510

Query: 508 AGLVLYWTTNNILSVLQQLIINKILENKKRMHAQNKKES 546
+GLVLY+ +N+++++QQ +I + LE K+ +H++ KK+S
Sbjct: 511 SGLVLYYIVSNLVTIIQQQLIYRGLE-KRGLHSREKKKS 548


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_1472IGASERPTASE270.019 IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 26.6 bits (58), Expect = 0.019
Identities = 16/41 (39%), Positives = 20/41 (48%), Gaps = 4/41 (9%)

Query: 54 AGVKESVKEVKEEGVKETSTKEIHQNAEEKKQLETETPQEE 94
A KE + KET+T E EEK ++ETE QE
Sbjct: 1086 AQSGSETKETQTTETKETATVE----KEEKAKVETEKTQEV 1122


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_1474TCRTETOQM310.013 Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family

signature.
Length = 639

Score = 31.0 bits (70), Expect = 0.013
Identities = 32/134 (23%), Positives = 53/134 (39%), Gaps = 25/134 (18%)

Query: 216 LSIVGKPNAGKSSLLNAMLLEERA---LVSDIKGTTR-DTIEE-------------VIEL 258
+ ++ +AGK++L ++L A L S KGTTR D +
Sbjct: 6 IGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITSFQW 65

Query: 259 KGHKVRLIDTAGIRESADKIERLGIEKSLKSLENCDIILGVFDLSKPLEKEDFNLIETLN 318
+ KV +IDT G + ++ R SL L D + + ++ + L L
Sbjct: 66 ENTKVNIIDTPGHMDFLAEVYR-----SLSVL---DGAILLISAKDGVQAQTRILFHALR 117

Query: 319 RTKKPCIVVLNKND 332
+ P I +NK D
Sbjct: 118 KMGIPTIFFINKID 131


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_1476BINARYTOXINB300.010 Binary toxin B family signature.
		>BINARYTOXINB#Binary toxin B family signature.

Length = 764

Score = 30.4 bits (68), Expect = 0.010
Identities = 14/60 (23%), Positives = 22/60 (36%)

Query: 155 SKSMGDLLAKAMPIERILKAYSVPVGSLENYEKIYYQNAFKPKVQITFDNNSDAEIKNAL 214
+ + D L P + +A + G E + YQ + FD + IKN L
Sbjct: 536 AVNPSDPLETTKPDMTLKEALKIAFGFNEPNGNLQYQGKDITEFDFNFDQQTSQNIKNQL 595


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_1478LIPOLPP20294e-105 LPP20 lipoprotein precursor signature.
		>LIPOLPP20#LPP20 lipoprotein precursor signature.

Length = 175

Score = 294 bits (753), Expect = e-105
Identities = 173/175 (98%), Positives = 174/175 (99%)

Query: 1 MKNQVKKILGMSVIATMVIVGCSHAPKSGISKSNKAYKEATKGAPDWVVGDLEKVAKYEK 60
MKNQVKKILGMSV+A MVIVGCSHAPKSGISKSNKAYKEATKGAPDWVVGDLEKVAKYEK
Sbjct: 1 MKNQVKKILGMSVVAAMVIVGCSHAPKSGISKSNKAYKEATKGAPDWVVGDLEKVAKYEK 60

Query: 61 YSGVFLGRAEDLITNNDVDYSTNQATAKARANLAANLKSTLQKDLENEKTRTVDASGKRS 120
YSGVFLGRAEDLITNNDVDYSTNQATAKARANLAANLKSTLQKDLENEKTRTVDASGKRS
Sbjct: 61 YSGVFLGRAEDLITNNDVDYSTNQATAKARANLAANLKSTLQKDLENEKTRTVDASGKRS 120

Query: 121 ISGTDTEKISQLVDKELIASKMLARYVGKDRVFVLVGLDKQIVDKVREELGMVKK 175
ISGTDTEKISQLVDKELIASKMLARYVGKDRVFVLVGLDKQIVDKVREELGMVKK
Sbjct: 121 ISGTDTEKISQLVDKELIASKMLARYVGKDRVFVLVGLDKQIVDKVREELGMVKK 175


30U063_1596U063_1604N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
U063_1596-1152.158860Flagellar hook-basal body complex protein FliE
U063_1597-1141.999753Flagellar basal-body rod protein FlgC
U063_1598-1141.575843Flagellar basal-body rod protein FlgB
U063_15990131.590346Cell division protein FtsW
U063_1600-1130.196396iron(III) ABC transporter
U063_1601013-0.067637iron(III) ABC transporter, periplasmic
U063_16021140.182859Alkyl hydroperoxide reductase subunit C-like
U063_1603012-0.437350Methionine ABC transporter substrate-binding
U063_1604012-0.668458Cell division protein FtsI
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_1596FLGHOOKFLIE776e-22 Flagellar hook-basal body complex protein FliE signa...
		>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE

signature.
Length = 103

Score = 77.0 bits (189), Expect = 6e-22
Identities = 19/77 (24%), Positives = 40/77 (51%), Gaps = 1/77 (1%)

Query: 34 EQKGGEFSKLLKQSINELNNTQEQSDKALADMATGQIK-DLHQAAIAIGKAETSMKLMLE 92
Q F+ L +++ +++TQ + G+ L+ + KA SM++ ++
Sbjct: 27 PQPTISFAGQLHAALDRISDTQTAARTQAEKFTLGEPGVALNDVMTDMQKASVSMQMGIQ 86

Query: 93 VRNKAISAYKELLRTQI 109
VRNK ++AY+E++ Q+
Sbjct: 87 VRNKLVAAYQEVMSMQV 103


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_1597FLGHOOKAP1280.015 Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 28.4 bits (63), Expect = 0.015
Identities = 10/38 (26%), Positives = 15/38 (39%)

Query: 121 NVNAVVEMADLVEATRAYQANVAAFQSAKNMAQNAIGM 158
VN E +L + Y AN Q+A + I +
Sbjct: 508 GVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_1600FERRIBNDNGPP383e-05 Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 38.4 bits (89), Expect = 3e-05
Identities = 29/184 (15%), Positives = 79/184 (42%), Gaps = 12/184 (6%)

Query: 106 NVELLKKLSPDLVVTFVG-NPKAVEHAKKFGISFLSFQETT--IAEAMQAMQ--AQAKAL 160
N+ELL ++ P +V G P A+ +F + +A A +++ A L
Sbjct: 88 NLELLTEMKPSFMVWSAGYGPSPEMLARIAPGRGFNFSDGKQPLAMARKSLTEMADLLNL 147

Query: 161 EVDASKKLAKMQKTLDFIAERLKGVKKKKGVELFHKAN----KISGHQAISSDILEKGGI 216
+ A LA+ + + + R + + + L + + G ++ +IL++ GI
Sbjct: 148 QSAAETHLAQYEDFIRSMKPRFVK-RGARPLLLTTLIDPRHMLVFGPNSLFQEILDEYGI 206

Query: 217 DN-FGLKYVKFGRADISVEKIVK-ENPEIIFIWWISPLTPEDVLNNPKFSTIKAIKNKQV 274
N + + +G +S++++ ++ +++ + + ++ P + + ++ +
Sbjct: 207 PNAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDHDNSKDMDALMATPLWQAMPFVRAGRF 266

Query: 275 YKLP 278
++P
Sbjct: 267 QRVP 270


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_1601FERRIBNDNGPP352e-04 Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 35.3 bits (81), Expect = 2e-04
Identities = 31/184 (16%), Positives = 76/184 (41%), Gaps = 12/184 (6%)

Query: 104 NVELLKKLSPDLVVTFVGNPKAVEHAKKF--GILFLSFQEKTIAEVMEDID---AQAKAL 158
N+ELL ++ P +V G + E + G F K + A L
Sbjct: 88 NLELLTEMKPSFMVWSAGYGPSPEMLARIAPGRGFNFSDGKQPLAMARKSLTEMADLLNL 147

Query: 159 EIDASKKLAKMQETLDFIKERLKGVKKKKGVELFHKAN----KISGHQALDSDILEKGGI 214
+ A LA+ ++ + +K R + + + L + + G +L +IL++ GI
Sbjct: 148 QSAAETHLAQYEDFIRSMKPRFVK-RGARPLLLTTLIDPRHMLVFGPNSLFQEILDEYGI 206

Query: 215 DN-FGLKYVKFGRADVSVEKIVK-ENPEIIFIWWISPLTPEDVLNNPKFSTIKAIKNKQV 272
N + + +G VS++++ ++ +++ + + ++ P + + ++ +
Sbjct: 207 PNAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDHDNSKDMDALMATPLWQAMPFVRAGRF 266

Query: 273 YKLP 276
++P
Sbjct: 267 QRVP 270


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
U063_1604TYPE3IMPPROT290.028 Type III secretion system inner membrane P protein ...
		>TYPE3IMPPROT#Type III secretion system inner membrane P protein

family signature.
Length = 224

Score = 29.4 bits (66), Expect = 0.028
Identities = 9/23 (39%), Positives = 12/23 (52%)

Query: 4 LRYKLLLFVFIGFWGLLVLNLFI 26
KL+LFV + W LL L +
Sbjct: 195 TPIKLVLFVALDGWTLLSKGLIL 217



 
Contact Sachin Pundhir for Bugs/Comments.
For best view 1024 x 768 resolution & IE 6.0 or above recommended.