PredictBias

identification of genomic and pathogenicity islands in prokaryotic genome
 
A) Input parameters
Genome: sequence.gb
Threshold dinucleotide bias: 2
Threshold codon bias: 4
Threshold %GC bias: 3
E-value (RPSBlast): 0.05
Genome (non-pathogenic):
 
 
C) Potential GIs and PAIs in CP019701
S.No  Start        End          Bias  Virulence  Insertion elements  Prediction
1     B0909_11140  B0909_11060  Y     Y          Y                   Pathogenicity Island (biased-composition)

LocusTag     DNBias  CDNBias  %GCBias     Product
B0909_11140  1       22       -3.650034   3-hydroxyacyl-CoA dehydrogenase
B0909_11135  3       41       -7.162151   hypothetical protein
B0909_24555  4       39       -7.949767   *hypothetical protein
B0909_11120  2       23       -4.484479   restriction endonuclease subunit S
B0909_11110  3       14       0.942318    cation:proton antiporter
B0909_11105  3       12       0.904206    K+/H+ antiporter subunit F
B0909_11100  1       11       1.469367    Na+/H+ antiporter subunit E
B0909_11095  1       12       1.614398    monovalent cation/H+ antiporter subunit D
B0909_11090  -1      12       1.104893    Na+/H+ antiporter subunit C
B0909_11085  -1      11       0.975438    monovalent cation/H+ antiporter subunit A
B0909_11080  1       13       0.225951    hypothetical protein
B0909_11075  1       12       0.882839    globin-coupled sensor protein
B0909_11070  1       14       0.233089    STAS domain-containing protein
B0909_11065  2       15       0.366719    response regulator
B0909_11060  2       15       0.799904    chemotaxis protein CheA
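
As an illustration of how the bias values in a table like the one above relate to the thresholds listed under the input parameters (dinucleotide bias 2, codon bias 4, %GC bias 3), the following sketch compares a few ORFs of island 1 against them. Comparing absolute values with >= is an assumption made here for illustration only; this page does not state the rule PredictBias applies, nor how per-ORF values are combined into an island call. The names THRESHOLDS, ROWS and exceeded are hypothetical.

```python
# Thresholds as listed under "Input parameters" on this page.
THRESHOLDS = {"dn": 2, "cdn": 4, "gc": 3}

# (locus tag, DNBias, CDNBias, %GCBias) for a few ORFs of island 1.
ROWS = [
    ("B0909_11140", 1, 22, -3.650034),
    ("B0909_11135", 3, 41, -7.162151),
    ("B0909_11110", 3, 14, 0.942318),
    ("B0909_11090", -1, 12, 1.104893),
]


def exceeded(dn, cdn, gc, thresholds=THRESHOLDS):
    """Return the names of the measures whose absolute value reaches its threshold.

    ASSUMPTION: absolute values compared with >=; the server's real rule
    is not documented on this page.
    """
    values = {"dn": dn, "cdn": cdn, "gc": gc}
    return [name for name, value in values.items() if abs(value) >= thresholds[name]]


for tag, dn, cdn, gc in ROWS:
    print(tag, exceeded(dn, cdn, gc))
```

Under this assumed rule, an ORF such as B0909_11090 reaches only the codon-bias threshold, which suggests that island membership is not decided ORF by ORF on all three measures at once.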
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_11095  PERTACTIN     32     0.009    Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 31.6 bits (71), Expect = 0.009
Identities = 28/131 (21%), Positives = 45/131 (34%), Gaps = 9/131 (6%)

Query: 294 GGIATIIFGAIGVLASQALGRLAGFSVLV-SSGTLLAAMGTGNPTVAAGALYYMVSSTLT 352
GG ++ G GV S + LA V G + A TV+ G+L + +
Sbjct: 281 GGFGPLLDGWYGVDVSDSTVDLAQSIVEAPQLGAAIRAGRGARVTVSGGSLSAPHGNVIE 340

Query: 353 ISAFFMLIELVERGQDAGANVLAVTMEAYGEGDEEEEEEEVGVTMPATMAVLGACFAACG 412
R A+ L++T++A V P + + G
Sbjct: 341 TG-------GGARRFPPPASPLSITLQAGARAQGRALLYRVL-PEPVKLTLAGGAQGQGD 392

Query: 413 ILLAGLPPLSG 423
I+ LPP+ G
Sbjct: 393 IVATELPPIPG 403
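
The statistics lines of each alignment (Score, Expect, Identities, Positives, Gaps) follow the usual BLAST text layout, so they can be extracted with a short script. The sketch below is a minimal, hypothetical parser; its function names are not part of PredictBias or BLAST. It also handles the shorthand "Expect = e-105" that appears further down this page, which Python's float() does not accept as written.

```python
import re

# Matches the "Score = ... bits (...), Expect = ..." line of a BLAST text report.
SCORE_RE = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)"
)
# Matches each "Name = a/b" count on the "Identities = ..." line.
COUNT_RE = re.compile(r"(Identities|Positives|Gaps)\s*=\s*(\d+)/(\d+)")


def parse_expect(text):
    """Convert a BLAST Expect string to float; 'e-105' is read as 1e-105."""
    text = text.rstrip(",")
    if text.startswith("e"):
        text = "1" + text
    return float(text)


def parse_hsp_stats(block):
    """Return bit score, raw score, E-value and count fractions for one alignment."""
    match = SCORE_RE.search(block)
    if match is None:
        raise ValueError("no Score line found")
    stats = {
        "bits": float(match.group(1)),
        "raw_score": int(match.group(2)),
        "expect": parse_expect(match.group(3)),
    }
    # The Gaps field is absent for ungapped alignments, so collect what is there.
    for name, num, den in COUNT_RE.findall(block):
        stats[name.lower()] = (int(num), int(den))
    return stats


example = """Score = 31.6 bits (71), Expect = 0.009
Identities = 28/131 (21%), Positives = 45/131 (34%), Gaps = 9/131 (6%)"""

stats = parse_hsp_stats(example)
identical, aligned = stats["identities"]
print(stats["bits"], stats["expect"], round(100 * identical / aligned, 1))
```

Keeping the counts as (numerator, denominator) pairs rather than the rounded percentages lets the exact fraction be recomputed when alignments are compared.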


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_11065  HTHFIS        90     2e-24    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 89.9 bits (223), Expect = 2e-24
Identities = 30/115 (26%), Positives = 57/115 (49%), Gaps = 2/115 (1%)

Query: 4 KVLTVDDSRTIRNMLLVTLNNAGFETIQAEDGIEGLEVLEQSNPDVIVTDINMPRLDGFG 63
+L DD IR +L L+ AG++ + + + D++VTD+ MP + F
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 64 FIEGVRRNEKYRAIPILVLTTESDAEKKNRARQAGATGWIVKPFDPAKLIDAIER 118
+ ++ + +P+LV++ ++ +A + GA ++ KPFD +LI I R
Sbjct: 65 LLPRIK--KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_11060  PF06580       41     1e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 41.4 bits (97), Expect = 1e-05
Identities = 20/133 (15%), Positives = 38/133 (28%), Gaps = 51/133 (38%)

Query: 470 IRNAVDHGIETPEKREAAGKNPEGTIKLSAKHRSGRILIELQDDGAGINRERVRQKAIDN 529
+ N + HGI G I L +G + +E+++ G+ +
Sbjct: 264 VENGIKHGIA--------QLPQGGKILLKGTKDNGTVTLEVENTGSLALKN--------- 306

Query: 530 DLIAADANLTDEEIDNLIFAPGFSTADKISDISGRGVGMDVVKRSIQALGG---RISISS 586
G G+ V+ +Q L G +I +S
Sbjct: 307 ------------------------------TKESTGTGLQNVRERLQMLYGTEAQIKLSE 336

Query: 587 RPGHGSTFTMSLP 599
+ G + +P
Sbjct: 337 KQG-KVNAMVLIP 348


2     B0909_10940  B0909_10825  Y     Y          Y                   Pathogenicity Island (biased-composition)

LocusTag     DNBias  CDNBias  %GCBias     Product
B0909_10940  4       29       -1.942881   UDP-N-acetylglucosamine
B0909_10935  5       30       -3.418202   hypothetical protein
B0909_10925  5       30       -3.349079   *flagellin
B0909_10920  5       28       -2.334737   flagellin
B0909_10915  3       20       -2.023771   flagellin
B0909_10910  4       19       0.027107    flagellar biosynthetic protein FliP
B0909_10900  3       16       0.802233    flagellar basal body protein FliL
B0909_10890  2       18       0.923010    flagellar L-ring protein FlgH
B0909_10885  3       18       0.855935    flagellar protein
B0909_10880  3       18       0.756595    flagellar basal body P-ring protein FlgI
B0909_10875  0       16       -0.967994   flagella basal body P-ring formation protein
B0909_10870  -1      17       -1.156057   flagellar basal-body rod protein FlgG
B0909_10865  -2      14       -0.234310   flagellar hook-basal body complex protein FliE
B0909_10860  -1      13       -0.229456   flagellar basal body rod protein FlgC
B0909_10855  -3      9        -0.832032   flagellar basal body rod protein FlgB
B0909_10850  0       11       -1.127867   LysE family translocator
B0909_10845  2       12       -0.961276   hypothetical protein
B0909_10840  4       12       -0.819077   flagellar protein export ATPase FliI
B0909_10835  3       13       -1.652431   flagellar basal-body rod protein FlgF
B0909_10830  2       10       -2.412463   DUF1217 domain-containing protein
B0909_10825  3       12       -2.454052   flagellar motor stator protein MotA
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_10925  FLAGELLIN     125    1e-33    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 125 bits (314), Expect = 1e-33
Identities = 61/386 (15%), Positives = 119/386 (30%), Gaps = 12/386 (3%)

Query: 4 ILTNISAMAALQTLRSIDAKMETTQSRVSSGLRVGTASDNAAYWSIATTMRSDNMALSAV 63
I TN ++ L + + + R+SSGLR+ +A D+AA +IA S+ L+
Sbjct: 4 INTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQA 63

Query: 64 QDALGLGAAKVDTAYSAM---ESAVEVVKQVKAKMVAATEEGVDRSKIQEEISQLQEQLR 120
G + T A+ + ++ V+++ + T D IQ+EI Q E++
Sbjct: 64 SRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEID 123

Query: 121 SISSSASFSGENWLQADLTSATGNAVTKNVVGSFIRTADGSVSVKKID-YQLDSTTVLFD 179
+S+ F+G L G I + VK + +
Sbjct: 124 RVSNQTQFNGVKVLSQ---DNQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKEA 180

Query: 180 EGGQLGIIDRVFNVTPASTTLKINTSGTISEHAVLTNSVDSLIKSGATFEGNYANVTTAV 239
G L + ++ AV+T++ + Y N
Sbjct: 181 TVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKV-----YVNAANGQ 235

Query: 240 AGGAAAGDYVKVNGVWVKAVAAASNPGQEIAATSNAGANQWVVDVTPIPAGTVVTAAASL 299
A + V+ A + + IA G D +
Sbjct: 236 LTTDDAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDG 295

Query: 300 DTVDIRTLSNEELDVMVRATDAALEAITSATADLGSISMRIAIQEDFVSKLTDSIDKGIG 359
+ T++ E++ + V A + +AT + F +
Sbjct: 296 NGKVSTTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKL 355

Query: 360 RLVDADMNEESTKLKALQTQQQLAIQ 385
++A+ + + + A
Sbjct: 356 SDLEANNAVKGESKITVNGAEYTANA 381



Score = 78.2 bits (192), Expect = 1e-17
Identities = 32/181 (17%), Positives = 58/181 (32%)

Query: 222 IKSGATFEGNYANVTTAVAGGAAAGDYVKVNGVWVKAVAAASNPGQEIAATSNAGANQWV 281
++S + N + AV S A + A V
Sbjct: 327 LQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEANNAVKGESKITVNGAEYTANAAGDKV 386

Query: 282 VDVTPIPAGTVVTAAASLDTVDIRTLSNEELDVMVRATDAALEAITSATADLGSISMRIA 341
+ S + + + + + D+AL + + + LG+I R
Sbjct: 387 TLAGKTMFIDKTASGVSTLINEDAAAAKKSTANPLASIDSALSKVDAVRSSLGAIQNRFD 446

Query: 342 IQEDFVSKLTDSIDKGIGRLVDADMNEESTKLKALQTQQQLAIQSLSIANTSSENILSLF 401
+ +++ R+ DAD E + + Q QQ L+ AN +N+LSL
Sbjct: 447 SAITNLGNTVTNLNSARSRIEDADYATEVSNMSKAQILQQAGTSVLAQANQVPQNVLSLL 506

Query: 402 R 402
R
Sbjct: 507 R 507


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_10920  FLAGELLIN     115    2e-30    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 115 bits (289), Expect = 2e-30
Identities = 61/386 (15%), Positives = 118/386 (30%), Gaps = 11/386 (2%)

Query: 4 ILTNIAAMSALQTLRSIGQDMEATQGRVSSGQRVGTASDNAAYWSIATTMRSDNMALSAV 63
I TN ++ L + + R+SSG R+ +A D+AA +IA S+ L+
Sbjct: 4 INTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQA 63

Query: 64 QDALGLGAAKVDTAYAGM---ESAIEVVKEIKAKLVAATEDGVDKNKVQEEITQLQEQLR 120
G + T + + ++ V+E+ + T D +Q+EI Q E++
Sbjct: 64 SRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEID 123

Query: 121 GISEAASFSGENWLQADLSVAGGTVTKDVVGSFVRDANGIVSVKKIDYTLDTDSVLFDTR 180
+S F+G L D + G + + VK + LD +V
Sbjct: 124 RVSNQTQFNGVKVLSQDNQM--KIQVGANDGETITIDLQKIDVKSLG--LDGFNVNGPKE 179

Query: 181 ATGTKTGILDKVYTVAEDGVTLSINTGGVTSEVTVKSFSIDSLIKSGAAFQGNYASVTTA 240
AT K T + + + V + + + +TT
Sbjct: 180 ATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTD 239

Query: 241 AAGGAGVGDYVKVEGTWVLAANATTAPTQEIAATTTTPAAASWIVATANAPTTTPAVTSL 300
A D K + A A + T + T
Sbjct: 240 DAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTID----TKTGNDG 295

Query: 301 DGINIIGLSGTQINQMMKAVDAALKDMTSAAADLGSISMRIGLQEDFVSKLTDSIDSGVG 360
+G ++G ++ + + A ++ +A + F +S
Sbjct: 296 NGKVSTTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKL 355

Query: 361 RLVDAEMNEESTKLKALQTQQQLAIQ 386
++A + + + A
Sbjct: 356 SDLEANNAVKGESKITVNGAEYTANA 381



Score = 86.6 bits (214), Expect = 1e-20
Identities = 51/348 (14%), Positives = 105/348 (30%), Gaps = 12/348 (3%)

Query: 65 DALGLGAAKVDTAYAGMESAIEVVKEIKAKLVAATEDGVDKNKVQEEITQLQEQLRGISE 124
D LG + + ++ K D + + + +
Sbjct: 163 DVKSLGLDGFNVNGPKEATVGDLKSSFKN---VTGYDTYAVGANKYRVDVNSGAVVTDTT 219

Query: 125 AASFSGENWLQADLSVAGGTVTKDVVGSFVRD----ANGIVSVKKIDYTLDTDSVLFDTR 180
A + + ++ A ++ + G K I +
Sbjct: 220 APTVPDKVYVNAANGQLTTDDAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFD 279

Query: 181 ATGTKTGILDKVYTVAEDGVTLSINTGGVT-SEVTVKSFSIDSLIKSGAAFQGNYASVTT 239
G I K V+ +IN VT + + + + + + + + Y SV
Sbjct: 280 YKGVTFTIDTKTGNDGNGKVSTTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVN 339

Query: 240 AAAGGAGVGDYVKVEGTWVLAANATTAPTQEIAATTTTPAAASWIVATANAPTTTPAVTS 299
+ + + A NA ++ A A+ T T T+
Sbjct: 340 GQFTFDDKTKNESAKLSDLEANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTA 399

Query: 300 LD----GINIIGLSGTQINQMMKAVDAALKDMTSAAADLGSISMRIGLQEDFVSKLTDSI 355
+ + ++D+AL + + + LG+I R + ++
Sbjct: 400 SGVSTLINEDAAAAKKSTANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNL 459

Query: 356 DSGVGRLVDAEMNEESTKLKALQTQQQLAIQSLSIANSSSESILSLFR 403
+S R+ DA+ E + + Q QQ L+ AN +++LSL R
Sbjct: 460 NSARSRIEDADYATEVSNMSKAQILQQAGTSVLAQANQVPQNVLSLLR 507


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_10910  FLAGELLIN     116    1e-30    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 116 bits (291), Expect = 1e-30
Identities = 56/379 (14%), Positives = 111/379 (29%), Gaps = 6/379 (1%)

Query: 4 ILTNINAMSALQTLRSISSNMEDTQSRISSGLKVGSASDNAAYWSIATTMRSDNEALGAV 63
I TN ++ L S++ R+SSGL++ SA D+AA +IA S+ + L
Sbjct: 4 INTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQA 63

Query: 64 QDALGLGAAKVDTAYAGM---ESVIDVVKQIKNKLVTAQESSADKTKIQGEITQLQDQLK 120
G + T + + + V+++ + S +D IQ EI Q +++
Sbjct: 64 SRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEID 123

Query: 121 GIVESASFSGENWLKADLSAAATTKSVVGSFVREGGTVSVKTIDYLMDASKVLVDTRATG 180
+ F+G L D + G + + +D V AT
Sbjct: 124 RVSNQTQFNGVKVLSQDNQMKIQVGANDGETITIDLQ-KIDVKSLGLDGFNVNGPKEATV 182

Query: 181 TKTGILDKVQDVGVDTVTLTINDGGTLSEHTVQAYSLDTLTTAGAEFQGNFAKTATDNYV 240
K ++ V + + TD+
Sbjct: 183 GDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDDAE 242

Query: 241 KVEGSWVKAV--ASAGTQEIASTTTAAGTITAGTWMVDTTNAGAGTVAASGSVLSMNISS 298
+ ++AGT E + A G ++
Sbjct: 243 NNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVSTT 302

Query: 299 LTGTQLSALVKAVDKSLTELTSAGAQLGSISSRISLQEDFASKLKGSIDKGVGRLVDADM 358
+ G +++ V + + +A Q + F K + ++A+
Sbjct: 303 INGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEANN 362

Query: 359 NEESTRLKALQTQQQLAIQ 377
+ + + A
Sbjct: 363 AVKGESKITVNGAEYTANA 381



Score = 79.3 bits (195), Expect = 3e-18
Identities = 50/334 (14%), Positives = 98/334 (29%), Gaps = 2/334 (0%)

Query: 63 VQDALGLGAAKVDTAYAGMESVIDVVKQIKNKLVTAQESSADKTKIQGEITQLQDQLKGI 122
V + +++ + V + +
Sbjct: 174 VNGPKEATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAAN 233

Query: 123 VESASFSGENWLKADLSAAATTKSVVGSFVREGGTVSVKTIDYLMDASKVLVDTRATGTK 182
+ + EN DL + + G + D V
Sbjct: 234 GQLTTDDAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGN 293

Query: 183 TGILDKVQDVGVDTVTLTINDGGTLSEHTVQAYSLDTLTTAGAEFQGNFAKTATDNYVKV 242
G + + VTLT+ D + + A + + G F
Sbjct: 294 DGNGKVSTTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESA 353

Query: 243 EGSWVKAVASAGTQEIASTTTAAGTITAGTWMV--DTTNAGAGTVAASGSVLSMNISSLT 300
+ S ++A + + + A T A V A+ S L ++
Sbjct: 354 KLSDLEANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAA 413

Query: 301 GTQLSALVKAVDKSLTELTSAGAQLGSISSRISLQEDFASKLKGSIDKGVGRLVDADMNE 360
+ + ++D +L+++ + + LG+I +R +++ R+ DAD
Sbjct: 414 KKSTANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYAT 473

Query: 361 ESTRLKALQTQQQLAIQSLSIANSDSQNILSLFR 394
E + + Q QQ L+ AN QN+LSL R
Sbjct: 474 EVSNMSKAQILQQAGTSVLAQANQVPQNVLSLLR 507


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_10900  FLGBIOSNFLIP  299    e-105    Escherichia coli: Flagellar biosynthetic protein Fl...
		>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature.

Length = 245

Score = 299 bits (766), Expect = e-105
Identities = 105/246 (42%), Positives = 158/246 (64%), Gaps = 5/246 (2%)

Query: 1 MIRFLVTIAVLLALPGLANAQQFPSDLFNTQIDGSVAAWI--IRTFGLLTVLSVAPGILI 58
M R L VLL L Q P + + + G +W ++T +T L+ P IL+
Sbjct: 1 MRRLLSVAPVLLWLITPLAFAQLPG-ITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILL 59

Query: 59 MVTSFPRFVIAFSILRSGMGLASTPSNMILLSMAMFMTFYVMSPTFDKAWTDGVQPLLQN 118
M+TSF R +I F +LR+ +G S P N +LL +A+F+TF++MSP DK + D QP +
Sbjct: 60 MMTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEE 119

Query: 119 QINEQQAIQRIAEPFRTFMNANTRDKDLKLFVDIARERGQGVMTDNVVDYRVLVPAFMLS 178
+I+ Q+A+++ A+P R FM TR+ DL LF +A + V R+L+PA++ S
Sbjct: 120 KISMQEALEKGAQPLREFMLRQTREADLGLFARLANT--GPLQGPEAVPMRILLPAYVTS 177

Query: 179 EIRRGFEIGFLIILPFLVIDLIVATITMAMGMMMLPPTSISLPFKILFFVLIDGWNLLVG 238
E++ F+IGF I +PFL+IDL++A++ MA+GMMM+PP +I+LPFK++ FVL+DGW LLVG
Sbjct: 178 ELKTAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVG 237

Query: 239 SLVRSF 244
SL +SF
Sbjct: 238 SLAQSF 243


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_10890  FLGLRINGFLGH  261    1e-90    Flagellar L-ring protein signature.
		>FLGLRINGFLGH#Flagellar L-ring protein signature.

Length = 232

Score = 261 bits (668), Expect = 1e-90
Identities = 58/243 (23%), Positives = 98/243 (40%), Gaps = 30/243 (12%)

Query: 7 PALLLPLALLAGC---------QNNQTLKEIGNAPAMSPIGSGLQFSQTPQMGMYPKQPK 57
L + L GC Q + + + P +P+ +G F Q+ Q Y QP
Sbjct: 10 AISSLLVLSLTGCAWIPSTPLVQGATSAQPV---PGPTPVANGSIF-QSAQPINYGYQP- 64

Query: 58 HMASGYSLWSDSQGALFKDLRALNIGDILTVNIQINDKADFDNETERNRTNSSGLNWKAK 117
LF+D R NIGD LT+ +Q N A + +R + +
Sbjct: 65 ---------------LFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTV 109

Query: 118 AQIL-GWTPDADSNIKYGSDTDTQAKGKTKRSEKLTLLVAAVVTGILENGNLIISGSQEV 176
+ L G +A ++++ KG S + + V +L NGNL + G +++
Sbjct: 110 PRYLQGLFGNARADVEASGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQI 169

Query: 177 RVNHEIRILNVGGIVRPQDVDAQNMISYERIAEARISYGGRGRLTEVQQPPVGQQVVDLF 236
+N + G+V P+ + N + ++A+ARI Y G G + E Q Q+
Sbjct: 170 AINQGTEFIRFSGVVNPRTISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNL 229

Query: 237 SPL 239
SP+
Sbjct: 230 SPM 232


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_10880  FLGPRINGFLGI  506    0.0      Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 506 bits (1304), Expect = 0.0
Identities = 363/373 (97%), Positives = 366/373 (98%)

Query: 1 MRLLRIIAAAILFSAQPFLSVSAAHADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDS 60
MR+LRIIAAA++FSA PFLS A ADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDS
Sbjct: 1 MRVLRIIAAALVFSALPFLSTPPAQADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDS 60

Query: 61 LRSSPFTEQSMRAMLQNLGITTQGGQSNAKNIAAVMVTANLPPFASPGSRVDVTVSSLGD 120
LRSSPFTEQSMRAMLQNLGITTQGGQSNAKNIAAVMVTANLPPFASPGSRVDVTVSSLGD
Sbjct: 61 LRSSPFTEQSMRAMLQNLGITTQGGQSNAKNIAAVMVTANLPPFASPGSRVDVTVSSLGD 120

Query: 121 ATSLRGGTLIMTSLSGADGQIYAVAQGALIVNGFSAQGDAATLTQGVTTSARVPNGAIIE 180
ATSLRGG LIMTSLSGADGQIYAVAQGALIVNGFSAQGDAATLTQGVTTSARVPNGAIIE
Sbjct: 121 ATSLRGGNLIMTSLSGADGQIYAVAQGALIVNGFSAQGDAATLTQGVTTSARVPNGAIIE 180

Query: 181 RELPSKFKDSVNLVLQLRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQEIAVQKPRT 240
RELPSKFKDSVNLVLQLRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQEIAVQKPR
Sbjct: 181 RELPSKFKDSVNLVLQLRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQEIAVQKPRV 240

Query: 241 ADLTRLMAEIENLTVETDTPAKVVINERTGTIVIGADVRISRVAVSYGTLTVQVTESPQV 300
ADLTRLMAEIENLTVETDTPAKVVINERTGTIVIGADVRISRVAVSYGTLTVQVTESPQV
Sbjct: 241 ADLTRLMAEIENLTVETDTPAKVVINERTGTIVIGADVRISRVAVSYGTLTVQVTESPQV 300

Query: 301 IQPAPFSRGQTAVQPQTDIMAMQEGSKVAIVEGPDLRTLVAGLNSIGLKADGIIAILQGI 360
IQPAPFSRGQTAVQPQTDIMAMQEGSKVAIVEGPDLRTLVAGLNSIGLKADGIIAILQGI
Sbjct: 301 IQPAPFSRGQTAVQPQTDIMAMQEGSKVAIVEGPDLRTLVAGLNSIGLKADGIIAILQGI 360

Query: 361 KSAGALQAELVLQ 373
KSAGALQAELVLQ
Sbjct: 361 KSAGALQAELVLQ 373


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_10870  FLGHOOKAP1    41     3e-06    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 41.1 bits (96), Expect = 3e-06
Identities = 19/80 (23%), Positives = 31/80 (38%), Gaps = 15/80 (18%)

Query: 4 LAIAATGMDAQQTNLEVIANNIANINTTGYKRARAEFTDLLYQTERMQGVPNRANQAIVP 63
+ A +G++A Q L +NNI++ N GY R T + +
Sbjct: 4 INNAMSGLNAAQAALNTASNNISSYNVAGYTRQT---TIMAQANSTLGA----------- 49

Query: 64 EGANIGLGVQTSAVRNIHTQ 83
G +G GV S V+ +
Sbjct: 50 -GGWVGNGVYVSGVQREYDA 68



Score = 39.2 bits (91), Expect = 1e-05
Identities = 10/40 (25%), Positives = 20/40 (50%)

Query: 214 IKQSYLEGSNVDAVKEITDLITAQRAYEMNSKVITTADEM 253
+ S V+ +E +L Q+ Y N++V+ TA+ +
Sbjct: 499 LSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAI 538


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_10865  FLGHOOKFLIE   33     6e-05    Flagellar hook-basal body complex protein FliE signa...
		>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE signature.

Length = 103

Score = 33.1 bits (75), Expect = 6e-05
Identities = 19/99 (19%), Positives = 42/99 (42%), Gaps = 6/99 (6%)

Query: 19 GISSLTESVFGSEQTTPAQQTGASFASVLGNMSVDAMNSLKKAEVAS------FEGIQGK 72
GI + + + + AQ++ A++ + + A+ F +
Sbjct: 5 GIEGVISQLQATAMSARAQESLPQPTISFAGQLHAALDRISDTQTAARTQAEKFTLGEPG 64

Query: 73 ANTREVVDAVLSAEQSLQTAIALRDKIVSAYLDITKMQI 111
+V+ + A S+Q I +R+K+V+AY ++ MQ+
Sbjct: 65 VALNDVMTDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV 103


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_10860  FLGHOOKAP1    27     0.023    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 27.2 bits (60), Expect = 0.023
Identities = 9/38 (23%), Positives = 20/38 (52%)

Query: 97 NVNILIEMADMREANRSYDANLQVIRQTRDLVASTIDL 134
VN+ E +++ + Y AN QV++ + + I++
Sbjct: 508 GVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545



Score = 26.5 bits (58), Expect = 0.038
Identities = 18/72 (25%), Positives = 23/72 (31%), Gaps = 19/72 (26%)

Query: 5 SAASKIAGSGLEVQSTRLRIVSENIANARSTGDTPGADPYRRKTVTFGSELDR------- 57
S+ A SGL L S NI++ G Y R+T
Sbjct: 1 SSLINNAMSGLNAAQAALNTASNNISSYNVAG-------YTRQTTIMAQANSTLGAGGWV 53

Query: 58 -----VSGVERV 64
VSGV+R
Sbjct: 54 GNGVYVSGVQRE 65


3     B0909_10565  B0909_10440  Y     Y          Y                   Pathogenicity Island (biased-composition)

LocusTag     DNBias  CDNBias  %GCBias     Product
B0909_10565  -1      16       -3.047776   ABC transporter permease subunit
B0909_10560  -2      18       -3.100809   putrescine ABC transporter permease PotI
B0909_10555  -1      20       -3.447890   iron ABC transporter permease
B0909_24560  0       23       -4.112735   hypothetical protein
B0909_10550  -1      23       -3.564731   PAS domain S-box protein
B0909_10540  3       38       -6.798683   *hypothetical protein
B0909_10535  0       25       -0.525142   hypothetical protein
B0909_10530  0       25       -0.721997   hypothetical protein
B0909_10525  0       26       -0.521337   hypothetical protein
B0909_10520  1       27       -0.315268   hypothetical protein
B0909_10515  3       23       -0.858215   hypothetical protein
B0909_10510  3       23       -1.401861   hypothetical protein
B0909_10505  3       27       -1.930448   hypothetical protein
B0909_10500  4       25       -1.548483   hypothetical protein
B0909_10495  3       26       -1.827566   hypothetical protein
B0909_10490  3       27       -2.233296   DNA cytosine methyltransferase
B0909_10485  5       32       -2.600945   hypothetical protein
B0909_10480  6       33       -1.920264   hypothetical protein
B0909_10475  5       44       -10.960311  hypothetical protein
B0909_10470  5       44       -10.890127  hypothetical protein
B0909_10465  6       40       -9.195739   hypothetical protein
B0909_10460  5       37       -7.870834   hypothetical protein
B0909_10455  5       36       -7.176668   hypothetical protein
B0909_10450  4       30       -6.714539   hypothetical protein
B0909_10445  3       25       -2.216427   hypothetical protein
B0909_10440  4       26       -2.721614   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_10540  cloacin       32     0.010    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 32.4 bits (73), Expect = 0.010
Identities = 23/79 (29%), Positives = 30/79 (37%), Gaps = 13/79 (16%)

Query: 739 HPVMLGSQDRLQLLQKKVRQAQEDITNNKGDVKAKQRILDIADHVKEKI------AEESR 792
H M G Q+ K ++AQ D V KQ D A K A ESR
Sbjct: 380 HDPMAGGHRMWQMAGLKAQRAQTD-------VNNKQAAFDAAAKEKSDADAALSSAMESR 432

Query: 793 LEDEEHSDGGETDSNDERK 811
+ E+ E + NDE+
Sbjct: 433 KKKEDKKRSAENNLNDEKN 451


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_10520  CHANLCOLICIN  31     0.009    Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 30.8 bits (69), Expect = 0.009
Identities = 34/122 (27%), Positives = 55/122 (45%), Gaps = 6/122 (4%)

Query: 256 IDRTKAE--ATAAKAEAAQVKASADRLAGIGGAVRAVVDGIQESRIRDQIRKEFARDLAK 313
+ +T+AE A A A AQ KA A+R A + + D + E+ + R A +LA
Sbjct: 62 LKKTQAEQAARAKAAAEAQAKAKANRDA----LTQRLKDIVNEALRHNASRTPSATELAH 117

Query: 314 AKAAMEKAQAEAAKAKNSQRNAEKQAADQRHAFRQQAQRLVSAQQEIRNLSAALAAALDE 373
A A +A+ E + ++ A K+A AF++ QR ++E L A E
Sbjct: 118 ANNAAMQAEDERLRLAKAEEKARKEAEAAEKAFQEAEQRRKEIEREKAETERQLKLAEAE 177

Query: 374 PQ 375
+
Sbjct: 178 EK 179


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_10510  CHANLCOLICIN  32     0.007    Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 32.4 bits (73), Expect = 0.007
Identities = 24/100 (24%), Positives = 37/100 (37%), Gaps = 20/100 (20%)

Query: 273 DRAMTAAAKEEAKTVATASRDALVRDAREQGFLTTGMYWSTLAQVSEITTAMTNERPEDT 332
++A A A EA+ A A+RDAL + ++ +I T
Sbjct: 68 EQAARAKAAAEAQAKAKANRDALTQ------------------RLKDIVNEALRHNASRT 109

Query: 333 PPRIDGDFGE--AIQKAFDALRLQVSGEAERQALSANEFA 370
P + A+Q + LRL + E R+ A E A
Sbjct: 110 PSATELAHANNAAMQAEDERLRLAKAEEKARKEAEAAEKA 149


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_10490  PF05272       30     0.029    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.7 bits (66), Expect = 0.029
Identities = 20/78 (25%), Positives = 27/78 (34%), Gaps = 4/78 (5%)

Query: 114 VQAARSQSWAEARAMLKRAGYGISECRVDFSYYNTPEARRRLIVIGRLGERDGFIEGAIE 173
+Q R Q +AEA + AG D Y PE RL+ G G +
Sbjct: 722 LQKFRGQLFAEALHLY-LAGERYFPSPEDEEIYFRPEQELRLVETGVQGRLWALLTREGA 780

Query: 174 DAATDIPK---TVRHAFT 188
AA + +V F
Sbjct: 781 PAAEGAAQKGYSVNTTFV 798


4     B0909_10170  B0909_10050  Y     Y          N                   Pathogenicity Island (biased-composition)

LocusTag     DNBias  CDNBias  %GCBias     Product
B0909_10170  2       14       1.964270    porin family protein
B0909_10165  0       14       1.633123    L,D-transpeptidase
B0909_10155  3       17       1.097861    DNA-3-methyladenine glycosylase I
B0909_10150  3       16       1.345599    hypothetical protein
B0909_10145  4       16       0.939486    putative sulfate exporter family transporter
B0909_10140  2       17       -0.046664   LysR family transcriptional regulator
B0909_10135  2       16       -0.635432   DUF305 domain-containing protein
B0909_10130  1       16       -0.477192   hypothetical protein
B0909_10125  0       16       0.643196    hypothetical protein
B0909_10120  0       17       0.835125    type II toxin-antitoxin system RelE/ParE family
B0909_10115  0       17       1.342918    ribbon-helix-helix protein, CopG family
B0909_10110  0       16       1.519716    histidine--tRNA ligase
B0909_10105  2       14       0.998778    glyoxalase
B0909_10100  2       16       1.289412    ATP phosphoribosyltransferase regulatory
B0909_10095  2       24       -0.457036   ATP phosphoribosyltransferase
B0909_10090  4       32       -1.018377   DoxX family protein
B0909_10085  3       25       -0.393545   hypothetical protein
B0909_10080  4       21       0.214633    hypothetical protein
B0909_24570  2       21       1.103026    chaperonin GroEL
B0909_10075  2       23       1.193694    co-chaperone GroES
B0909_10070  0       17       1.882947    TIGR01459 family HAD-type hydrolase
B0909_10065  0       14       2.881113    bifunctional riboflavin kinase/FAD synthetase
B0909_10055  -1      14       2.705660    isoleucine--tRNA ligase
B0909_10050  -1      10       3.028724    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_10165  OMPADOMAIN    33     0.001    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 32.6 bits (74), Expect = 0.001
Identities = 24/104 (23%), Positives = 40/104 (38%), Gaps = 12/104 (11%)

Query: 117 TLNALYRFPL-ESLPITPYVGAGVGINVPHVEVNRPSGKTFEYQFGGATLQAQAGLSYRI 175
L A +P+ + L I +G V V + T G + G+ Y I
Sbjct: 99 QLTAKLGYPITDDLDIYTRLGGMVWRADTKSNVYGKNHDT------GVSPVFAGGVEYAI 152

Query: 176 TDNWSTFVEYKGTYSFIDVDIDNGASLKTDIITNAVNFGVAYKF 219
T +T +EY+ T + D + D ++ GV+Y+F
Sbjct: 153 TPEIATRLEYQWTNNIGDAHTIG---TRPDNGM--LSLGVSYRF 191


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_10145  SURFACELAYER  32     0.002    Lactobacillus surface layer protein signature.
		>SURFACELAYER#Lactobacillus surface layer protein signature.

Length = 439

Score = 32.3 bits (73), Expect = 0.002
Identities = 20/68 (29%), Positives = 30/68 (44%), Gaps = 2/68 (2%)

Query: 122 IVSYAAGRMLGLPPKLATLIACGNSICGNSAIAAAAPAIGAKPEDVAASIAFTAVLGVAA 181
IVS AA +L + P AT + + N+ A A DV + + +A+ VA
Sbjct: 7 IVSAAAAALLAVAPIAATAMPVNAATTINADSAINANT--NAKYDVDVTPSISAIAAVAK 64

Query: 182 VLLMPFLP 189
MP +P
Sbjct: 65 SDTMPAIP 72


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_10140  PF05043       29     0.033    Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 28.8 bits (64), Expect = 0.033
Identities = 9/44 (20%), Positives = 20/44 (45%)

Query: 17 HLTRAAEAIGLTASAVSSAIKNLEAFYNVELFHRVGRNIELTES 60
H + AE + T AV + ++++ + +FH I + +
Sbjct: 27 HRSELAELLNCTERAVKDDLSHVKSAFPDLIFHSSTNGIRIINT 70


5     B0909_09920  B0909_09850  Y     Y          N                   Pathogenicity Island (biased-composition)

LocusTag     DNBias  CDNBias  %GCBias     Product
B0909_09920  3       19       0.307545    cell wall hydrolase
B0909_09915  3       18       -0.797718   F0F1 ATP synthase assembly protein
B0909_09910  3       19       -1.203936   F0F1 ATP synthase subunit A
B0909_09905  1       21       -0.078061   F0F1 ATP synthase subunit C
B0909_09900  1       22       0.220188    F0F1 ATP synthase subunit B
B0909_09895  -1      15       0.287337    F0F1 ATP synthase subunit B
B0909_09890  -1      19       0.941213    aminotransferase class IV
B0909_09885  1       15       2.082039    MerR family transcriptional regulator
B0909_09880  2       15       3.212010    ribonuclease HII
B0909_09875  2       14       2.380633    hypothetical protein
B0909_09870  1       12       1.909011    PA0069 family radical SAM protein
B0909_09865  -1      13       1.923893    glycosyl transferase
B0909_09860  0       14       1.708760    FAD-binding oxidoreductase
B0909_09855  1       11       1.279259    threonylcarbamoyl-AMP synthase
B0909_09850  2       10       0.136203    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_09895  IGASERPTASE   33     4e-04    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 33.1 bits (75), Expect = 4e-04
Identities = 26/120 (21%), Positives = 51/120 (42%), Gaps = 6/120 (5%)

Query: 30 AKSLDERAQNIQDELAEAKRLREEAQ-HLLAEYQRK---RKEAEAEAAGIVAAAEREAAA 85
+K++++ Q+ + A+ + + +EA+ ++ A Q + +E + E
Sbjct: 1048 SKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVE 1107

Query: 86 LTEEAKQKTEEF--VARRTALSEQKIKQAEEDAIGAVRAAAVDIAIAASEKLIAEKTTAA 143
E+AK +TE+ V + T+ K +Q+E A A D + E TTA
Sbjct: 1108 KEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTAD 1167


6     B0909_09485  B0909_09450  Y     Y          N                   Pathogenicity Island (biased-composition)

LocusTag     DNBias  CDNBias  %GCBias     Product
B0909_09485  3       13       2.729758    HAD family phosphatase
B0909_09480  2       12       3.093247    A/G-specific adenine glycosylase
B0909_09475  1       12       2.971531    hypothetical protein
B0909_09470  1       13       3.434334    DUF721 domain-containing protein
B0909_09465  1       12       3.185018    DsbA family protein
B0909_09460  0       12       3.552180    chromosome segregation protein SMC
B0909_09455  -1      12       3.035363    VOC family protein
B0909_09450  -1      12       3.276458    pyruvate, phosphate dikinase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_09460  RTXTOXIND     48     1e-07    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 48.3 bits (115), Expect = 1e-07
Identities = 28/205 (13%), Positives = 66/205 (32%), Gaps = 5/205 (2%)

Query: 625 DAPGAAALRLSQKNRLAEIETELDEARSILDEAEDQLAAKTEDIRSSELRLSDVRDRSRL 684
A GA A L ++ L + L++ R + +L E E +V + L
Sbjct: 128 TALGAEADTLKTQSSLLQ--ARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVL 185

Query: 685 ATRHLAEAREALTAAERASG--DLLRRRDIVSEALNQIGAQIDEIAVQEENARIEMEDAP 742
L + + + ++ +L ++R L +I + V++
Sbjct: 186 RLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLH 245

Query: 743 DLSVLDLRLRESQLEVATDRGLLAEARARHEGVSRE-AESRQRRIQAIGQERSTWQSRAA 801
++ + E + + L +++ E + E +++ ++ +
Sbjct: 246 KQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLR 305

Query: 802 SAADHIATLREREEEAREEIAELDI 826
D+I L + E I
Sbjct: 306 QTTDNIGLLTLELAKNEERQQASVI 330



Score = 38.3 bits (89), Expect = 2e-04
Identities = 31/187 (16%), Positives = 68/187 (36%), Gaps = 9/187 (4%)

Query: 722 AQIDEIAVQEENARIEMEDAPDLSVLDLRLRE--SQLEVATDRGLLAEARARHEGVSREA 779
A++++ Q + IE+ P+L + D + S+ EV L+ E + + +
Sbjct: 146 ARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQ---NQK 202

Query: 780 ESRQRRIQAIGQERSTWQSRAASAADHIATLREREEEAREEIAELDIAPEEFDEKRRNLL 839
++ + ER T +R + + R ++ + + IA E+ +
Sbjct: 203 YQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYV 262

Query: 840 ---NELQKTEDARREAADRLAEAE-NLQRAADRVAATALSELAEAREKRGRAEERLVSAR 895
NEL+ + + + A+ Q L +L + + G L
Sbjct: 263 EAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNE 322

Query: 896 EKRQETE 902
E++Q +
Sbjct: 323 ERQQASV 329



Score = 31.0 bits (70), Expect = 0.031
Identities = 23/179 (12%), Positives = 60/179 (33%), Gaps = 15/179 (8%)

Query: 180 AELRLRAAETNLERLEDVTAQLESQIESLKRQARQANRFKMLSADIRAREATLLHIRWVE 239
+ R + ++E + +L + + R L + + + + +
Sbjct: 149 EQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELN 208

Query: 240 AKEAEGEAESALNQATNIVAEKAQGQMEAAKQQGIASLK------LPELREDEARVAAAL 293
+ E + L + I + ++E ++ +SL + E E + A+
Sbjct: 209 LDKKRAERLTVLAR---INRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAV 265

Query: 294 QRLQIARTQLDDEANRLLRRRDELARRLSQLGEDIIREERLVSDNAQILARLDEEEAEL 352
L++ ++QL+ + +L ++E +I+ Q + EL
Sbjct: 266 NELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEIL------DKLRQTTDNIGLLTLEL 318


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_09450  PHPHTRNFRASE  309    1e-96    Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 309 bits (792), Expect = 1e-96
Identities = 110/473 (23%), Positives = 178/473 (37%), Gaps = 95/473 (20%)

Query: 434 GRRVILVRIETSPEDIHGMHA--AEGILTTRGGMTSHAAVVARGMGIPCVTGAGSMRVDM 491
+++ + +P D ++ +G T GG TSH+A+++R + IP V G + +
Sbjct: 154 AEETVIIAEDLTPSDTAQLNKQFVKGFATDIGGRTSHSAIMSRSLEIPAVVGTKEVTEKI 213

Query: 492 RNKVLIGVGCMLKRGDVITIDGSSGRVLKG----EVPMTQPELSGDFGKLMEWAD----- 542
++ GD++ +DG G V+ EV + + + + EWA
Sbjct: 214 QH------------GDMVIVDGIEGIVIVNPTEEEVKAYEEKRAAFEKQKQEWAKLVGEP 261

Query: 543 ----SLRRMTVRTNADTPADARAARAFGAEGIGLCRTEHMFFEGDRIHVMREMILAESEK 598
+ + N TP D A G EGIGL RTE ++ + D++ E
Sbjct: 262 STTKDGAHVELAANIGTPKDVDGVLANGGEGIGLYRTEFLYMDRDQLPTEEE-------- 313

Query: 599 GRRAALDELLPMQRSDFTELFQIMHGLPVTIRLLDPPLHEFLPKTDGEIVEVAAAMGMPQ 658
Q + E+ Q M G PV IR LD + L
Sbjct: 314 ------------QFEAYKEVVQRMDGKPVVIRTLDIGGDKELSY---------------- 345

Query: 659 TVFRQRLDALHEFNPMLGHRGCRLAISYPEIAEMQARAIFEAAVAAARITGAPVVPEIMV 718
L E NP LG R RL + +I Q RA+ A+ ++M
Sbjct: 346 ------LQLPKELNPFLGFRAIRLCLEKQDIFRTQLRALLRASTYGN--------LKVMF 391

Query: 719 PLVGLRSELDYVTEVIDGVAAAVAQETGMEIEYL--TGTMIELPRAALRAHVIAEAAEFF 776
P++ EL ++ + E G+++ G M+E+P A+ A++ A+ +FF
Sbjct: 392 PMIATLEELRQAKAIMQEEKDKLLSE-GVDVSDSIEVGIMVEIPSTAVAANLFAKEVDFF 450

Query: 777 SFGTNDLTQTTFGISRDDAARFINTYQRKGIIERDPFISLDFDGVGELIRIAAERGRQTR 836
S GTNDL Q T R + + P+ V +I+ A G+
Sbjct: 451 SIGTNDLIQYTMAADRMNE---------RVSYLYQPYHPAILRLVDMVIKAAHSEGKWV- 500

Query: 837 PELKLGICGEHGGDPASIHFCEDADLDYVSCSPFRVPIARLAAAQATLAAKRD 889
G+CGE GD +I LD S S + AR + + +
Sbjct: 501 -----GMCGEMAGDEVAIPLLLGLGLDEFSMSATSILPARSQLLKLSKEELKP 548


7  B0909_09105  B0909_08960  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
B0909_09105  2       16       2.401702    trans-aconitate 2-methyltransferase
B0909_09100  0       13       1.797740    branched chain amino acid aminotransferase
B0909_09095  -1      14       1.097842    LysR family transcriptional regulator
B0909_09085  -1      13       -0.015522   N-acetyltransferase
B0909_09080  1       15       -0.022040   hypothetical protein
B0909_09075  0       14       0.328215    superoxide dismutase
B0909_09070  2       17       0.621103    phosphatase
B0909_09065  3       16       1.848789    YafY family transcriptional regulator
B0909_09060  4       16       2.171600    LysE family translocator
B0909_09055  4       17       2.478435    cytochrome C556
B0909_09050  3       16       2.601324    diacylglycerol kinase
B0909_09045  5       17       2.475073    cobalt transporter
B0909_09040  4       15       2.513718    cobalt transporter
B0909_09035  2       14       2.515242    hypothetical protein
B0909_09030  3       15       2.465997    glutamine amidotransferase
B0909_09025  4       15       2.294817    GNAT family N-acetyltransferase
B0909_09020  4       15       2.030473    prolyl aminopeptidase
B0909_09015  3       15       1.362015    MerR family transcriptional regulator
B0909_09010  4       15       1.371337    hypothetical protein
B0909_09005  4       15       1.229214    metal/formaldehyde-sensitive transcriptional
B0909_09000  2       15       0.692817    cation transporter
B0909_08995  1       17       -0.388394   hypothetical protein
B0909_08990  0       15       -0.366107   glycine betaine/L-proline ABC transporter
B0909_08985  0       15       -0.613293   proline/glycine betaine ABC transporter
B0909_24580  0       12       0.103333    ABC transporter substrate-binding protein
B0909_08980  0       12       0.483225    hypothetical protein
B0909_08975  0       12       1.570623    S9 family peptidase
B0909_08970  0       15       2.274567    DUF930 domain-containing protein
B0909_08965  2       15       3.159193    dihydrodipicolinate synthase family protein
B0909_08960  2       14       3.147766    polysaccharide deacetylase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_09065  ARGREPRESSOR  28     0.019    Bacterial arginine repressor signature.
		>ARGREPRESSOR#Bacterial arginine repressor signature.

Length = 149

Score = 27.9 bits (62), Expect = 0.019
Identities = 16/63 (25%), Positives = 26/63 (41%), Gaps = 8/63 (12%)

Query: 1 MRKASRLFEIIQILRLARRPVTAQT-IAEKLE-----VTARSVYRDIAALQTMRVPVEGE 54
M K R +I +I+ + Q + + L+ VT +V RDI L ++VP
Sbjct: 1 MNKGQRHIKIREIIT--ANEIETQDELVDILKKDGYNVTQATVSRDIKELHLVKVPTNNG 58

Query: 55 RGV 57

Sbjct: 59 SYK 61


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_09025  SACTRNSFRASE  41     6e-07    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 40.7 bits (95), Expect = 6e-07
Identities = 16/83 (19%), Positives = 28/83 (33%), Gaps = 5/83 (6%)

Query: 65 VAEADGALIGYALLLPLYRAQEGRRGLELHHLFVRDGHRGHGTGQHLVSRARDIAKRLGC 124
+ + IG + + + V +R G G L+ +A + AK
Sbjct: 69 LYYLENNCIGRIKI-----RSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHF 123

Query: 125 DYLSVSATTGNVAAHRFYENMDF 147
L + N++A FY F
Sbjct: 124 CGLMLETQDINISACHFYAKHHF 146


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_08975  SECYTRNLCASE  25     0.048    Preprotein translocase SecY subunit signature.
		>SECYTRNLCASE#Preprotein translocase SecY subunit signature.

Length = 437

Score = 24.7 bits (54), Expect = 0.048
Identities = 9/36 (25%), Positives = 14/36 (38%)

Query: 25 RIMPVIDIFMRLFAALLAAIPQRVLRIFFLPGFFGF 60
++ I L+ L+A +P L F F F
Sbjct: 369 YVLNRITWPGSLYLGLIALVPTMALVGFGASQNFPF 404


8  B0909_08910  B0909_08800  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
B0909_08910  3       15       2.540839    peptide-methionine (R)-S-oxide reductase
B0909_08905  3       16       2.192854    putative monovalent cation/H+ antiporter subunit
B0909_08900  2       15       1.821789    Na+/H+ antiporter subunit B
B0909_08895  1       14       0.932305    Na+/H+ antiporter subunit C
B0909_08890  0       14       0.526483    Na+/H+ antiporter subunit D
B0909_08885  1       12       0.843550    Na+/H+ antiporter subunit E
B0909_08880  -1      22       1.042066    cation:proton antiporter
B0909_08875  -1      21       1.426744    Na+/H+ antiporter subunit G
B0909_08870  -1      19       1.875940    transcriptional regulator
B0909_08865  -2      14       2.427718    DNA replication protein
B0909_08855  2       13       3.061904    hypothetical protein
B0909_08850  1       12       2.561830    cysteine desulfuration protein SufE
B0909_08845  1       12       2.342869    hypothetical protein
B0909_08840  1       10       2.635425    sensor histidine kinase
B0909_08835  2       12       2.136332    peptidoglycan-binding protein
B0909_08830  2       13       1.607056    DUF1491 family protein
B0909_08825  1       15       3.009485    DUF2336 domain-containing protein
B0909_08820  1       17       2.734521    hypothetical protein
B0909_08815  1       17       2.813512    hypothetical protein
B0909_08810  2       16       3.100818    DUF1254 domain-containing protein
B0909_08805  1       15       3.321199    DUF1214 domain-containing protein
B0909_08800  1       16       3.045462    M20 peptidase family dipeptidase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_08840  PF06580       44     8e-07    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 44.1 bits (104), Expect = 8e-07
Identities = 28/154 (18%), Positives = 53/154 (34%), Gaps = 28/154 (18%)

Query: 316 PIAETIESCEAMLGLQAKEKGLTLTSRIQRGIGEISADQRAIRQVLINLAGNAIKF---- 371
+A+ + ++ L L + + L Q I D + ++ L N IK
Sbjct: 217 SLADELTVVDSYLQLASIQFEDRLQFENQ--INPAIMDVQVPPMLVQTLVENGIKHGIAQ 274

Query: 372 TDAGGVVSIDAAREGRLLKLTVSDTGIGIASDKIDLLGQPFMQVQNEYTRCYEGTGLGLS 431
GG + + ++ + L V +TG + E TG GL
Sbjct: 275 LPQGGKILLKGTKDNGTVTLEVENTGSLALKNTK------------------ESTGTGLQ 316

Query: 432 LV-KGLVALHGG--TFAIASQPGEGTVVTIMLPA 462
V + L L+G ++ + G +++P
Sbjct: 317 NVRERLQMLYGTEAQIKLSEKQG-KVNAMVLIPG 349


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_08825  TYPE3OMBPROT  29     0.026    Type III secretion system outer membrane B protein ...
		>TYPE3OMBPROT#Type III secretion system outer membrane B protein family signature.

Length = 538

Score = 29.3 bits (65), Expect = 0.026
Identities = 16/51 (31%), Positives = 28/51 (54%), Gaps = 2/51 (3%)

Query: 236 QLATTLSGLGMEFSEAAFVLQCFYPHLSAAEGDMSRAEALLDRLDIVECEE 286
Q T+L+G + F+ + FY + A GD S+ +A+++RLD + E
Sbjct: 57 QNQTSLTGKSLLFARDR--AEVFYEAIKLAGGDTSKIKAMMERLDTYKLGE 105


9  B0909_24585  B0909_08555  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
B0909_24585  2       25       3.701086    hypothetical protein
B0909_08665  3       21       4.277552    hypothetical protein
B0909_08660  5       21       4.520084    DNA-packaging protein
B0909_08655  3       21       4.496764    hypothetical protein
B0909_08650  1       15       3.699188    Dabb family protein
B0909_24590  1       17       3.853742    phage portal protein
B0909_08645  1       17       4.075429    hypothetical protein
B0909_08640  1       17       3.628936    HK97 family phage prohead protease
B0909_08635  1       17       3.731310    phage major capsid protein
B0909_08630  0       17       3.196248    hypothetical protein
B0909_08625  2       19       3.503422    head-tail adaptor protein
B0909_08620  3       18       3.850370    DUF3168 domain-containing protein
B0909_08615  3       17       3.004584    MFS transporter
B0909_08610  2       15       2.888206    phage major tail protein, TP901-1 family
B0909_08605  1       19       2.894425    gene transfer agent family protein
B0909_08600  2       19       3.239023    phage tail assembly chaperone
B0909_08595  2       23       3.067134    DUF2442 domain-containing protein
B0909_08585  2       20       4.270775    DUF4160 domain-containing protein
B0909_08580  3       20       4.478740    phage tail tape measure protein
B0909_08575  2       18       4.314118    TIGR02217 family protein
B0909_08570  0       17       3.843829    DUF2163 domain-containing protein
B0909_08565  -1      16       3.600869    peptidase P60
B0909_08560  -1      14       3.657796    hypothetical protein
B0909_08555  0       11       3.060525    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_08610  TCRTETA       29     0.039    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 28.6 bits (64), Expect = 0.039
Identities = 19/64 (29%), Positives = 27/64 (42%)

Query: 33 LCGPLLGRTLVRHGAAPVLVAGSLLFAAGFAVLAFAGGVASYLLGWAVIGFAATCGLTTA 92
C P+LG R G PVL+ A +A++A A + +G V G G
Sbjct: 58 ACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAG 117

Query: 93 AHAA 96
A+ A
Sbjct: 118 AYIA 121


10  B0909_08340  B0909_08280  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
B0909_08340  2       18       -0.470870   hypothetical protein
B0909_08335  2       17       -0.398485   DUF2147 domain-containing protein
B0909_08330  1       15       0.233089    hypothetical protein
B0909_24600  1       13       1.137260    sel1 repeat family protein
B0909_08325  0       14       2.092807    pyridoxal phosphate-dependent aminotransferase
B0909_08320  0       13       1.894191    hypothetical protein
B0909_08315  0       13       0.732626    pyridine nucleotide-disulfide oxidoreductase
B0909_08310  0       16       0.470970    2-keto-3-deoxy-L-rhamnonate aldolase
B0909_08305  4       30       -0.660140   hypothetical protein
B0909_08300  4       23       1.037648    *hypothetical protein
B0909_08290  5       23       0.527619    hypothetical protein
B0909_08285  4       21       0.783581    porin
B0909_08280  4       21       1.257018    porin
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_08330  56KDTSANTIGN  27     0.015    Rickettsia 56kDa type-specific antigen protein sign...
		>56KDTSANTIGN#Rickettsia 56kDa type-specific antigen protein signature.

Length = 533

Score = 27.2 bits (60), Expect = 0.015
Identities = 13/25 (52%), Positives = 19/25 (76%), Gaps = 1/25 (4%)

Query: 1 MKRVMMIAAALLMGSNLAFAAEAIE 25
MK++M+IA+A+ S L F+A AIE
Sbjct: 1 MKKIMLIASAMSALS-LPFSASAIE 24


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_08305  PHAGEIV       31     0.004    Gene IV protein signature.
		>PHAGEIV#Gene IV protein signature.

Length = 426

Score = 31.1 bits (70), Expect = 0.004
Identities = 35/151 (23%), Positives = 52/151 (34%), Gaps = 21/151 (13%)

Query: 74 VPVGESWMIKQLLDAGARTLLVPMVDSADQARDLVSAMHYPPRGIRGMGAAVARASAFNT 133
V S + K + +P S +Q D S P G + F
Sbjct: 84 VGSIPSIIQKYNPNNQDYIDELP--SSDNQEYDDNS----APSGGFFVPQNDNVTQTFKI 137

Query: 134 ITDYADSASDSVCLLVQAETRAAINDLDNILAVEGVDG-VFIGPAD-LAADMGYLGRIDE 191
A V L V++ T + N+L+V+G + V P D L +L +D
Sbjct: 138 NNVRAKDLIRVVELFVKSNTSKS----SNVLSVDGSNLLVVSAPKDILDNLPQFLSTVDL 193

Query: 192 PEVQAVIEAAIVKI---------VAAGKAAG 213
P Q +IE I ++ AAG G
Sbjct: 194 PTDQILIEGLIFEVQQGDALDFSFAAGSQRG 224


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_08300  ACRIFLAVINRP  27     0.019    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 27.1 bits (60), Expect = 0.019
Identities = 17/49 (34%), Positives = 22/49 (44%), Gaps = 6/49 (12%)

Query: 61 RAQGQNVVEAVKE------NPGTATSLLAIVGALGFAIGYAVGAGTQQS 103
+G+ VVEA P TSL I+G L AI G+G Q +
Sbjct: 953 EKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNA 1001


11  B0909_07120  B0909_07010  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
B0909_07120  2       13       -0.859932   GGDEF domain-containing protein
B0909_07115  3       17       -1.253473   ATP-dependent Clp endopeptidase proteolytic
B0909_07110  4       20       -0.442805   ATP-dependent Clp protease ATP-binding subunit
B0909_07105  3       21       -0.570969   glyoxalase/bleomycin resistance/dioxygenase
B0909_07100  4       22       -0.301407   endopeptidase La
B0909_07095  4       23       -0.463641   DNA-binding protein HRL18
B0909_07090  3       24       -0.647436   esterase-like activity of phytase family
B0909_07085  3       24       -0.562294   ***NADH-quinone oxidoreductase subunit A
B0909_07065  4       21       -1.428878   NADH-quinone oxidoreductase subunit B
B0909_07060  2       20       -0.541431   NADH-quinone oxidoreductase subunit C
B0909_07055  0       20       -0.957045   GFA family protein
B0909_07050  0       21       0.159953    NADH-quinone oxidoreductase subunit D
B0909_07045  -1      22       0.025338    hypothetical protein
B0909_07040  -1      20       -0.059568   NADH-quinone oxidoreductase subunit E
B0909_07035  -1      20       -0.033340   NADH-quinone oxidoreductase subunit NuoF
B0909_07030  -1      20       -0.430782   NADH-quinone oxidoreductase subunit G
B0909_07025  1       23       -0.535721   NADH-quinone oxidoreductase subunit NuoH
B0909_07020  3       23       -1.803446   NADH-quinone oxidoreductase subunit NuoI
B0909_07015  2       23       -1.564374   NADH-quinone oxidoreductase subunit J
B0909_07010  2       20       -1.168262   NADH-quinone oxidoreductase subunit NuoK
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_07120  ACRIFLAVINRP  31     0.007    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 31.3 bits (71), Expect = 0.007
Identities = 11/73 (15%), Positives = 28/73 (38%)

Query: 132 LAGVVLANRSRASLSRKVLAATFILQAFFGAVVATVVIPGDITSPNAVALTSPLAFSGML 191
L +S + + ++ +L A F + G I ++ + S +A S ++
Sbjct: 425 LPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLV 484

Query: 192 GFTVIIMISAKII 204
+ + A ++
Sbjct: 485 ALILTPALCATLL 497


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_07095  PF05272       33     0.006    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 33.1 bits (75), Expect = 0.006
Identities = 13/81 (16%), Positives = 31/81 (38%), Gaps = 6/81 (7%)

Query: 298 DWLLGLPWGKKSKIKTDLNSAETILDQDHFGLDKVKERIVEYLAVQARASKIRGP----- 352
DW+ W + +++ L D+ ++V + +++ P
Sbjct: 537 DWVKAQQWDEVPRLEKWLVHVLGKTPDDYKPRRLRYLQLVGKYILMGHVARVMEPGCKFD 596

Query: 353 -ILCLVGPPGVGKTSLAKSIA 372
+ L G G+GK++L ++
Sbjct: 597 YSVVLEGTGGIGKSTLINTLV 617


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_07090  DNABINDINGHU  116    4e-38    Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 116 bits (293), Expect = 4e-38
Identities = 50/88 (56%), Positives = 59/88 (67%)

Query: 2 NKNELVSAVAEKAGLTKADAASAVDAVFETVQSELKNGGDIRLAGFGSFSVSRREASKGR 61
NK +L++ VAE LTK D+A+AVDAVF V S L G ++L GFG+F V R A KGR
Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62

Query: 62 NPSTGAEVDIPARNVPKFSAGKGLKDAV 89
NP TG E+ I A VP F AGK LKDAV
Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAV 90


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_07035  PF03544       29     0.023    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 29.2 bits (65), Expect = 0.023
Identities = 14/73 (19%), Positives = 25/73 (34%), Gaps = 3/73 (4%)

Query: 201 TEEIKPDRSNLDKPAEAPADAAPVPPSNAAKPKT--DAPETDPKLKTPATAPKAAEANVK 258
E +P+ + P EAP P KPK + +K + P + N
Sbjct: 75 EPEPEPEPIP-EPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVESRPASPFENTA 133

Query: 259 TAEKEAAGSAKPS 271
A ++ + +
Sbjct: 134 PARPTSSTATAAT 146


12  B0909_06730  B0909_06625  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
B0909_06730  2       12       1.489374    aminotransferase class I/II-fold pyridoxal
B0909_06725  3       14       1.302532    ArsR family transcriptional regulator
B0909_06720  2       13       0.856940    NIPSNAP family protein
B0909_06715  1       13       0.841938    antibiotic biosynthesis monooxygenase
B0909_06710  0       13       0.707944    SDR family oxidoreductase
B0909_06705  0       14       0.713018    ribonuclease E/G
B0909_06700  -1      16       0.164448    N-acetylmuramoyl-L-alanine amidase
B0909_06695  -1      13       0.135080    penicillin-binding protein 1A
B0909_06690  0       15       0.241765    peptide chain release factor 2
B0909_06685  2       14       2.573267    cysteine hydrolase family protein
B0909_06680  2       14       3.018861    hypothetical protein
B0909_06675  3       15       2.936773    NAD kinase
B0909_06670  3       15       2.968368    PhzF family phenazine biosynthesis protein
B0909_06665  3       15       3.159926    hypothetical protein
B0909_06660  3       16       3.090730    Hpt domain-containing protein
B0909_06655  1       11       0.632356    (2Fe-2S)-binding protein
B0909_06650  1       13       0.175661    DUF922 domain-containing protein
B0909_06645  1       13       1.566564    dihydropteroate synthase
B0909_06640  0       13       2.410783    dihydroneopterin aldolase
B0909_06635  -1      14       2.346215    2-amino-4-hydroxy-6-
B0909_06630  0       14       2.192784    hypothetical protein
B0909_06625  1       12       3.099781    TIGR01620 family protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_06710  DHBDHDRGNASE  90     1e-23    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 89.7 bits (222), Expect = 1e-23
Identities = 71/253 (28%), Positives = 109/253 (43%), Gaps = 23/253 (9%)

Query: 5 KVAIVTAGGSGMGAAVAKRLAADGYKLAILSSSGKGEELAKELGGIGVTGSNQSNDDLSR 64
K+A +T G+G AVA+ LA+ G +A + + + E + D+
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 65 LADLT------LEKFGRIDVLVNSAGHGPRASILDITDEQWHTGLDVYLMNVIRPTRIVA 118
A + + G ID+LVN AG I ++DE+W V V +R V+
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 119 PVMVKQKAGAIVNISTAWAFEPSSMFPTSAVFRAGLASYTKIFADTYAADNVRMNNVLPG 178
M+ +++G+IV + + A P + A +A +TK A N+R N V PG
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188

Query: 179 ----------WIDSLPAT-------EERRESVPMQRYGKSEEIAATVAFLASEGAGYITG 221
W D A E + +P+++ K +IA V FL S AG+IT
Sbjct: 189 STETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHITM 248

Query: 222 QNIRVDGGLTRSV 234
N+ VDGG T V
Sbjct: 249 HNLCVDGGATLGV 261


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_06705  IGASERPTASE   50     5e-08    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 50.1 bits (119), Expect = 5e-08
Identities = 43/191 (22%), Positives = 69/191 (36%), Gaps = 11/191 (5%)

Query: 767 DDDSAEADGDADENGVNEAANSDEDGKRKRRRRGKRGGRRNRDEALDAAEAEAGDDSETE 826
A A V E + + K + +NR+ A +A + E
Sbjct: 1025 VPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNE 1084

Query: 827 GEEVSVAAPVAEEPAATEVVEDAVADVVAEEAPKKPRRTRKAKAKTEAEEAPKAETVETV 886
VA +E A V EE K + K ++ +PK E ETV
Sbjct: 1085 -----VAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETV 1139

Query: 887 EPVIIEPAVVEATVVDVAIEEVGEASADLAPETEAPATEEKTRANRGSNVSSSEPVVTSS 946
+P EPA V++ + ++ + +TE PA ++T +N V+ S V T +
Sbjct: 1140 QPQ-AEPARENDPTVNI---KEPQSQTNTTADTEQPA--KETSSNVEQPVTESTTVNTGN 1193

Query: 947 GSSANPDGDEP 957
NP+ P
Sbjct: 1194 SVVENPENTTP 1204


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_06660  PF03544       36     0.001    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 35.7 bits (82), Expect = 0.001
Identities = 9/88 (10%), Positives = 23/88 (26%)

Query: 1881 HQMEVSEPVARAIAATQPAAERRVETRPQPAAVAPVQQRPAQPQPAPAPAALRGSLPLEN 1940
+ +P + +P E E + V + +P+P P + ++
Sbjct: 61 EPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKP 120

Query: 1941 RQTENRQAPAPAQPVAANPAGRREEGGG 1968
++ P +
Sbjct: 121 VESRPASPFENTAPARPTSSTATAATSK 148



Score = 31.5 bits (71), Expect = 0.032
Identities = 22/115 (19%), Positives = 31/115 (26%), Gaps = 11/115 (9%)

Query: 1906 TRPQPAAVAPVQ--QRPAQPQPAPAPAALRGSLPLENRQTEN---RQAPAPAQPVAAN-P 1959
T PA + P Q Q P +P P P P + + P P
Sbjct: 53 TMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVE 112

Query: 1960 AGRREEGGGWISDLLRGASRDGQEASAASAPRTSAEQQPTRAAD-TRNPRHMVES 2013
+R+ R AS A A T+ PR + +
Sbjct: 113 QPKRDV----KPVESRPASPFENTAPARPTSSTATAATSKPVTSVASGPRALSRN 163


13  B0909_06330  B0909_06295  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
B0909_06330  2       10       0.839636    sensor histidine kinase
B0909_06325  2       12       0.513741    nitrogen regulation protein NR(I)
B0909_06320  2       12       0.642656    PAS domain-containing sensor histidine kinase
B0909_06315  2       11       0.360445    sigma-54-dependent Fis family transcriptional
B0909_06310  1       12       -0.383029   Trk system potassium transporter TrkA
B0909_06305  0       11       -1.249981   RNA-binding protein Hfq
B0909_06300  2       11       -0.232394   GTPase HflX
B0909_06295  2       11       0.329078    nucleoside triphosphate pyrophosphohydrolase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_06320  HTHFIS        638    0.0      FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 638 bits (1646), Expect = 0.0
Identities = 441/480 (91%), Positives = 464/480 (96%)

Query: 3 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWVSAGEGDLVVTDVVMPDENAF 62
ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRW++AG+GDLVVTDVVMPDENAF
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 63 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIAIIGRALSEPK 122
DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELI IIGRAL+EPK
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 123 RKPAKLDDDMQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDY 182
R+P+KL+DD QDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDY
Sbjct: 124 RRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDY 183

Query: 183 GKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQNRSTGRFEQAEGGTLFLDEIGDM 242
GKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQ RSTGRFEQAEGGTLFLDEIGDM
Sbjct: 184 GKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDM 243

Query: 243 PMDAQTRLLRVLQQGEYTTVGGRTPIRTDVRIVAATNKDLKQSINQGLFREDLYYRLNVV 302
PMDAQTRLLRVLQQGEYTTVGGRTPIR+DVRIVAATNKDLKQSINQGLFREDLYYRLNVV
Sbjct: 244 PMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVV 303

Query: 303 PLRLPPLRDRAEDIPDLVRHFIQQGEKEGLEGKRFESEALEVMKAYAWPGNVRELENLIR 362
PLRLPPLRDRAEDIPDLVRHF+QQ EKEGL+ KRF+ EALE+MKA+ WPGNVRELENL+R
Sbjct: 304 PLRLPPLRDRAEDIPDLVRHFVQQAEKEGLDVKRFDQEALELMKAHPWPGNVRELENLVR 363

Query: 363 RLMALYPQEVITREIIEQELQSDVPDSPLDKMAVRTGSLTISQAVEENMRDYFASFGDGL 422
RL ALYPQ+VITREIIE EL+S++PDSP++K A R+GSL+ISQAVEENMR YFASFGD L
Sbjct: 364 RLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDAL 423

Query: 423 PPPGLYDRVLRELEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVSVYRSSRP 482
PP GLYDRVL E+EYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVSVYRSSR
Sbjct: 424 PPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVSVYRSSRS 483


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_06310  HTHFIS        405    e-140    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 405 bits (1043), Expect = e-140
Identities = 158/478 (33%), Positives = 254/478 (53%), Gaps = 30/478 (6%)

Query: 2 ASDILVVDDEADIREIVAGILSDEGHETRMAFDSDSALAAISERVPRLIFLDIWMQGSKL 61
+ ILV DD+A IR ++ LS G++ R+ ++ + I+ L+ D+ M
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPD--E 60

Query: 62 DGLALLDEIKSRHPEIPVVMISGHGNIETAVNAIKRGAFDFIEKPFKADRLILIAERALE 121
+ LL IK P++PV+++S TA+ A ++GA+D++ KPF LI I RAL
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 122 NSKLKREVQELKKRTGDAVELVGASLAVSQLRQTIDRVAPTNSRIMILGPSGSGKELVAR 181
K + E + G LVG S A+ ++ + + R+ T+ +MI G SG+GKELVAR
Sbjct: 121 EPKRRPSKLEDDSQDGM--PLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVAR 178

Query: 182 MVHKKSSRASGPFVALNAATITPDRMEIALFGTEG---LPGQPRKVGALEEAHRGVLYLD 238
+H R +GPFVA+N A I D +E LFG E Q R G E+A G L+LD
Sbjct: 179 ALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLD 238

Query: 239 EVGEMPRETQNKILRVLVDQQFERVGGGKRVKVDVRIISSTAHHLESLIAEGQFREDLYH 298
E+G+MP + Q ++LRVL ++ VGG ++ DVRI+++T L+ I +G FREDLY+
Sbjct: 239 EIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYY 298

Query: 299 RLAVVPVKVPALSERREDIPFLVDMFMRQISEQAGIRPRKIGDDAMAVLQTHDWPGNIRQ 358
RL VVP+++P L +R EDIP LV F++Q +E+ G+ ++ +A+ +++ H WPGN+R+
Sbjct: 299 RLNVVPLRLPPLRDRAEDIPDLVRHFVQQ-AEKEGLDVKRFDQEALELMKAHPWPGNVRE 357

Query: 359 LRNNIERLMILAR---------------------PEGGEAPISADMLPSDIGDMLPK-IS 396
L N + RL L E A + + + + + + +
Sbjct: 358 LENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFA 417

Query: 397 AQGDQHIMTLPLREAREMFERDYLVAQINRFGGNISRTAEFVGMERSALHRKLKSLGV 454
+ GD + E ++A + GN + A+ +G+ R+ L +K++ LGV
Sbjct: 418 SFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGV 475


14  B0909_05890  B0909_05855  Y  N  N  Genomic Island
LocusTag     DNBias  CDNBias  %GCBias     Product
B0909_05890  2       15       0.966901    cadmium-translocating P-type ATPase
B0909_05885  2       15       0.604058    cation transporter
B0909_05880  1       14       -0.507668   cytochrome c oxidase accessory protein CcoG
B0909_05875  0       16       -0.295914   SRPBCC family protein
B0909_05870  3       19       -0.273607   glyoxalase/bleomycin resistance/extradiol
B0909_05865  3       13       -0.694781   VOC family protein
B0909_05860  2       11       -0.805775   cytochrome-c oxidase, cbb3-type subunit III
B0909_05855  2       13       -0.697992   CcoQ/FixQ family Cbb3-type cytochrome c oxidase
15  B0909_05790  B0909_05355  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
B0909_05790  2       29       -3.302577   hypothetical protein
B0909_05785  3       31       -3.967984   hypothetical protein
B0909_05780  3       33       -3.733912   dehydrogenase
B0909_05770  3       35       -5.914640   hypothetical protein
B0909_05765  2       33       -5.934946   SOS response-associated peptidase
B0909_05760  1       29       -5.307846   hypothetical protein
B0909_05755  1       28       -5.740277   hypothetical protein
B0909_05750  0       27       -5.260116   hypothetical protein
B0909_05745  2       31       -5.021610   hypothetical protein
B0909_05740  1       28       -4.096447   hypothetical protein
B0909_05735  2       26       -3.041988   secretion activator protein
B0909_05730  3       29       -3.778114   class I SAM-dependent methyltransferase
B0909_05725  4       31       -3.413457   hypothetical protein
B0909_05720  5       33       -3.599915   hypothetical protein
B0909_05715  5       31       -3.086646   hypothetical protein
B0909_05710  5       31       -3.347680   GNAT family N-acetyltransferase
B0909_05705  4       37       -5.144213   hypothetical protein
B0909_05700  4       35       -4.984179   hypothetical protein
B0909_05695  3       33       -4.779656   hypothetical protein
B0909_05690  3       34       -5.114674   hypothetical protein
B0909_05685  1       36       -6.830874   hypothetical protein
B0909_05680  1       33       -5.544778   type II toxin-antitoxin system RelE/ParE family
B0909_05675  -1      24       -3.400303   XRE family transcriptional regulator
B0909_05670  0       24       -2.931794   hypothetical protein
B0909_05665  0       23       -3.161659   hypothetical protein
B0909_05660  2       20       -2.556173   hypothetical protein
B0909_05655  4       22       -2.972015   hypothetical protein
B0909_05650  6       27       -3.005717   hypothetical protein
B0909_05645  7       33       -4.274693   hypothetical protein
B0909_05640  7       32       -4.122847   hypothetical protein
B0909_05635  8       37       -5.483980   HK97 gp10 family phage protein
B0909_05630  8       39       -5.350983   head morphogenesis protein
B0909_05625  8       39       -5.326691   hypothetical protein
B0909_05620  9       40       -6.799767   hypothetical protein
B0909_05615  10      31       -4.402630   hypothetical protein
B0909_05610  10      30       -3.568755   hypothetical protein
B0909_05605  8       26       -1.178687   hypothetical protein
B0909_05600  6       25       -3.011192   hypothetical protein
B0909_05595  5       23       -2.892320   DUF2184 domain-containing protein
B0909_05590  5       22       -3.255850   hypothetical protein
B0909_05585  4       24       -3.919760   DUF2213 domain-containing protein
B0909_05580  4       26       -4.887490   hypothetical protein
B0909_05575  5       30       -6.377820   PBSX family phage terminase large subunit
B0909_05565  4       32       -7.065845   hypothetical protein
B0909_05560  4       41       -8.037711   hypothetical protein
B0909_05555  3       42       -8.079377   hypothetical protein
B0909_05550  2       39       -7.467882   DUF2806 domain-containing protein
B0909_05545  2       39       -7.095609   *hypothetical protein
B0909_05535  3       36       -5.646356   HNH endonuclease
B0909_05530  4       33       -5.751467   hypothetical protein
B0909_05525  4       30       -4.803937   hypothetical protein
B0909_05520  4       29       -4.531772   hypothetical protein
B0909_05515  4       27       -3.529413   hypothetical protein
B0909_05510  4       26       -3.352827   hypothetical protein
B0909_05505  2       26       -3.896997   hypothetical protein
B0909_24635  2       24       -3.508278   helix-turn-helix transcriptional regulator
B0909_05500  2       25       -3.970871   hypothetical protein
B0909_05495  2       27       -3.675136   hypothetical protein
B0909_05490  0       30       -4.579041   methyltransferase
B0909_05485  -1      31       -4.433208   endonuclease
B0909_05480  0       32       -3.288778   hypothetical protein
B0909_05475  2       38       -5.860219   hypothetical protein
B0909_05470  2       41       -7.498208   hypothetical protein
B0909_05465  4       48       -10.477888  hypothetical protein
B0909_24640  5       45       -10.328218  hypothetical protein
B0909_05455  5       43       -9.996055   hypothetical protein
B0909_05450  5       43       -10.158734  hypothetical protein
B0909_05445  6       41       -9.607719   helix-turn-helix domain-containing protein
B0909_05440  6       42       -9.187980   hypothetical protein
B0909_05435  5       32       -5.007581   hypothetical protein
B0909_05430  5       33       -4.300475   hypothetical protein
B0909_05420  2       20       -2.635910   hypothetical protein
B0909_05415  2       20       -2.009129   hypothetical protein
B0909_05410  2       20       -1.708289   hypothetical protein
B0909_05405  2       19       -1.653545   hypothetical protein
B0909_05400  2       22       -1.549836   hypothetical protein
B0909_05395  4       26       -1.604819   hypothetical protein
B0909_05390  6       31       -2.319045   hypothetical protein
B0909_05385  5       31       -4.674933   hypothetical protein
B0909_24645  3       28       -2.924061   hypothetical protein
B0909_05380  -1      21       -1.970711   hypothetical protein
B0909_05375  0       16       -1.701259   integrase
B0909_05370  -1      14       -1.606790   *lipoyl(octanoyl) transferase LipB
B0909_05365  0       11       -0.584628   NAD(P)-dependent alcohol dehydrogenase
B0909_05355  2       11       1.121857    DMT family transporter
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_05715  PF03944       30     4e-04    delta endotoxin
		>PF03944#delta endotoxin

Length = 633

Score = 30.0 bits (67), Expect = 4e-04
Identities = 17/51 (33%), Positives = 29/51 (56%), Gaps = 2/51 (3%)

Query: 19 NAREEFFRQRNLFLAQSYASATSEIARLNARVDGLEADLRLARGEVDDTLN 69
N ++ R+ FL Q + T +AR+NA + GL+A++ +VD+ LN
Sbjct: 88 NLMQDILRETERFLNQRLNTDT--VARVNAELTGLQANVEEFNRQVDNFLN 136


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_05685  ISCHRISMTASE  25     0.050    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 24.6 bits (53), Expect = 0.050
Identities = 15/71 (21%), Positives = 23/71 (32%), Gaps = 14/71 (19%)

Query: 12 VAFAAIAQAQVREAPDIPTAKDKESVRLERCSASISDF-----HGWQ---------LGKA 57
+A AI Q+ A D+P K R I D +
Sbjct: 1 MAIPAIQPYQMPTASDMPQNKVSWVPDPNRAVLLIHDMQNYFVDAFTAGASPVTELSANI 60

Query: 58 KRLRDWCRQNG 68
++L++ C Q G
Sbjct: 61 RKLKNQCVQLG 71


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_05520  FIMREGULATRY  32     5e-04    Escherichia coli: P pili regulatory PapB protein si...
		>FIMREGULATRY#Escherichia coli: P pili regulatory PapB protein signature.

Length = 104

Score = 32.2 bits (73), Expect = 5e-04
Identities = 9/35 (25%), Positives = 21/35 (60%)

Query: 213 VIDAIEDEYLSGKTGRDICRRYHISQGHMRDMVRR 247
VI A++D + G + +++C +Y ++ G+ + R
Sbjct: 47 VILAMKDYLVGGHSRKEVCEKYQMNNGYFSTTLGR 81


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_24635  TYPE3IMSPROT  24     0.047    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 24.0 bits (52), Expect = 0.047
Identities = 10/22 (45%), Positives = 15/22 (68%)

Query: 39 RLDDARRKANVHKSTALVAKAL 60
++ DAR+K V KS +V+ AL
Sbjct: 13 KIRDARKKGQVAKSKEVVSTAL 34


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_05420  PF04183       26     0.015    IucA / IucC family
		>PF04183#IucA / IucC family

Length = 580

Score = 26.4 bits (58), Expect = 0.015
Identities = 9/46 (19%), Positives = 20/46 (43%)

Query: 4 SLKVLKDIFPKNEPAFEAPFRVETDKDGEVHIRDAQGDILATFARY 49
++++K+ FP+ + + V + + I D Q T R+
Sbjct: 446 DMRLVKEEFPEMDSLPQEVRDVTSRLSADYLIHDLQTGHFVTVLRF 491


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_05395  IGASERPTASE   32     0.007    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 31.6 bits (71), Expect = 0.007
Identities = 30/123 (24%), Positives = 46/123 (37%), Gaps = 22/123 (17%)

Query: 191 ADVTKGMQVREEVEDYQHVGPDNARDITPAQPSVMARLRAAQEAPQQPEEEREGFDAAFV 250
A V K + + E E Q V P V +++ QE Q E + + A
Sbjct: 1104 ATVEKEEKAKVETEKTQEV------------PKVTSQVSPKQE---QSETVQPQAEPARE 1148

Query: 251 HSETETALTGEILPNTNSDDESPAQSSDNAGMTPVDEAGAD-------EVPASDAPASTD 303
+ T + NT +D E PA+ + + PV E+ E P + PA+T
Sbjct: 1149 NDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQ 1208

Query: 304 PER 306
P
Sbjct: 1209 PTV 1211


16  B0909_05295  B0909_05085  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
B0909_05295  3       27       -4.384628   serine O-acetyltransferase
B0909_05290  3       31       -4.629871   alpha/beta hydrolase
B0909_05285  8       41       -7.769723   endodeoxyribonuclease
B0909_05280  8       54       -11.280232  hypothetical protein
B0909_05275  7       58       -10.859521  hypothetical protein
B0909_05270  7       56       -10.422650  hypothetical protein
B0909_05265  7       55       -11.062736  hypothetical protein
B0909_24650  7       55       -11.218092  hypothetical protein
B0909_05260  7       55       -11.533818  hypothetical protein
B0909_05255  8       55       -12.400695  tail fiber domain-containing protein
B0909_05250  9       52       -13.197797  hypothetical protein
B0909_05245  8       55       -14.108703  hypothetical protein
B0909_05240  6       55       -13.145061  hypothetical protein
B0909_05235  6       56       -13.739821  hypothetical protein
B0909_05225  5       57       -13.543828  head protein
B0909_05220  5       57       -13.566158  hypothetical protein
B0909_05210  4       58       -13.613997  portal protein
B0909_05205  4       60       -15.051086  hypothetical protein
B0909_05200  4       53       -13.696768  hypothetical protein
B0909_05195  5       54       -13.722418  hypothetical protein
B0909_05190  6       54       -13.617661  hypothetical protein
B0909_05185  8       54       -13.831913  hypothetical protein
B0909_05180  7       53       -13.253534  hypothetical protein
B0909_05175  7       53       -13.374975  hypothetical protein
B0909_05170  7       55       -14.855173  site-specific DNA-methyltransferase
B0909_05165  6       52       -14.769868  hypothetical protein
B0909_05160  6       52       -14.848194  hypothetical protein
B0909_05155  6       49       -13.757252  resolvase
B0909_05150  7       49       -13.917961  hypothetical protein
B0909_05145  7       49       -14.322535  hypothetical protein
B0909_05140  9       52       -12.854191  hypothetical protein
B0909_05135  9       50       -11.928000  hypothetical protein
B0909_05130  5       38       -7.556459   hypothetical protein
B0909_05125  3       30       -6.290944   lysozyme
B0909_05120  3       22       -4.968190   hypothetical protein
B0909_05115  1       16       -3.651232   recombinase family protein
B0909_05110  0       12       -0.377778   zinc-finger domain-containing protein
B0909_05105  0       13       -0.625089   salicylate hydroxylase
B0909_05100  2       13       -1.466501   cystathionine beta-lyase
B0909_05095  2       13       0.586279    amino acid ABC transporter substrate-binding
B0909_05090  2       14       0.695019    amino acid ABC transporter permease
B0909_05085  2       14       0.751587    amino acid ABC transporter permease
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
B0909_05210  GPOSANCHOR  33     0.005    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 32.7 bits (74), Expect = 0.005
Identities = 16/114 (14%), Positives = 34/114 (29%), Gaps = 6/114 (5%)

Query: 647 NKSLAEAQVLKAQLEAQADEKEREFDAYKLRVEDDFRRDELAQKLVIEQAEIAAKYGAQI 706
A+ + + + + R +K + + A+I
Sbjct: 224 AARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKI 283

Query: 707 DIARLEVEQSRQRQDVEWAIERERLEAKRKKQIDEQRQMD--EAAKAQMMAELQ 758
+E + + E A + + + +R +D AK Q+ AE Q
Sbjct: 284 K----TLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQ 333


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
B0909_05175  TYPE4SSCAGX  30     0.041    Type IV secretion system CagX conjugation protein si...
		>TYPE4SSCAGX#Type IV secretion system CagX conjugation protein signature.

Length = 522

Score = 30.1 bits (67), Expect = 0.041
Identities = 14/30 (46%), Positives = 21/30 (70%)

Query: 662 ISELVKAEVEAEIDMARRLETVKEQASLSA 691
+SEL+K + E E+D RLE ++EQA +A
Sbjct: 199 LSELIKQQRENELDQMERLEDMQEQAQANA 228


17  B0909_01890  B0909_01785  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
B0909_01890  2       16       -0.167961   ABC transporter ATP-binding protein
B0909_01885  1       12       0.135166    phosphodiesterase
B0909_01875  -1      14       0.617904    nucleotidyltransferase family protein
B0909_01870  -2      16       0.639368    tRNA (cytidine(34)-2'-O)-methyltransferase
B0909_01865  -1      19       1.080709    Na/Pi cotransporter family protein
B0909_01860  1       13       1.185245    hypothetical protein
B0909_01855  2       16       0.733991    oxygen-dependent coproporphyrinogen oxidase
B0909_01850  2       16       0.526483    hypothetical protein
B0909_01845  1       15       1.391795    DUF1059 domain-containing protein
B0909_01840  1       14       2.196028    CCA tRNA nucleotidyltransferase
B0909_01835  0       13       3.215841    CoA pyrophosphatase
B0909_01825  -2      13       3.386895    DUF1285 domain-containing protein
B0909_01820  -1      14       3.371416    MoxR family ATPase
B0909_01815  -2      13       2.816231    DUF58 domain-containing protein
B0909_01810  -1      10       2.559099    DUF4159 domain-containing protein
B0909_01805  -1      10       1.623844    hypothetical protein
B0909_01800  0       11       0.632029    GNAT family N-acetyltransferase
B0909_01795  0       11       0.788032    LysR family transcriptional regulator
B0909_01790  2       11       0.482341    MBL fold metallo-hydrolase
B0909_01785  3       12       0.960665    MFS transporter
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
B0909_01820  HTHFIS  35     4e-04    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 34.8 bits (80), Expect = 4e-04
Identities = 15/55 (27%), Positives = 23/55 (41%), Gaps = 1/55 (1%)

Query: 121 GQLLMADEINRASPRTQSALLQAMQEYHITMAGQTYELPKPFHVLATQN-PLEQE 174
G L DEI Q+ LL+ +Q+ T G + ++A N L+Q
Sbjct: 232 GGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQS 286


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
B0909_01805  TONBPROTEIN  38     6e-05    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 38.0 bits (88), Expect = 6e-05
Identities = 32/180 (17%), Positives = 66/180 (36%), Gaps = 7/180 (3%)

Query: 49 AIANPLLTQEEREPLSTIVPVIVDRSQSQDVQDRPQMTDTALETLKDRLSRFPRIEPRIV 108
A+ P E EP P+ ++ V ++P+ ++ P+ + + V
Sbjct: 60 AVQPPPEPVVEPEP--EPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPV 117

Query: 109 EVRDDGESDSPSTQLFSALSSAVADVSPSRVGGAIFLSDGQIHDIPNALPNAEQALGFRA 168
E R ++ + + L+S+ A + S+ ++ + P QAL
Sbjct: 118 ESRPASPFENTAP---ARLTSSTATAATSKPVTSVASGPRALSRNQPQYPARAQALRIEG 174

Query: 169 PVHGLITGKADEFDRRIEIVRAPRFGIVNEEQQLTLRV--FDDGRPAGGGSAEVTVKMNG 226
V D ++I+ A + E + +R ++ G+P G + K+NG
Sbjct: 175 QVKVKFDVTPDGRVDNVQILSAKPANMFEREVKNAMRRWRYEPGKPGSGIVVNILFKING 234


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_01800  SACTRNSFRASE  38     2e-06    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 38.4 bits (89), Expect = 2e-06
Identities = 18/65 (27%), Positives = 32/65 (49%), Gaps = 3/65 (4%)

Query: 59 GWLYVQLLFVPETMRGKGTAAKLLAMAEEEARKRGCTGAYIDT--MNPDALRTYERYGFT 116
G+ ++ + V + R KG LL A E A++ G ++T +N A Y ++ F
Sbjct: 88 GYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFI 147

Query: 117 RIGSL 121
IG++
Sbjct: 148 -IGAV 151


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
B0909_01785  TCRTETA  45     3e-07    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 45.2 bits (107), Expect = 3e-07
Identities = 41/158 (25%), Positives = 66/158 (41%), Gaps = 11/158 (6%)

Query: 44 AASIAATFGLTVPQALQTGTAFFFGMLLGAAGFGRLADRYGRRRVLIVTVACDALFGVLS 103
+ + A +G+ + + A G L+DR+GRR VL+V++A A+ +
Sbjct: 38 SNDVTAHYGILL-------ALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIM 90

Query: 104 IFSPDFTILLILRFLTGAAVGGTLPVDYAMMAEFLPAKNRGRWLVFLEGFWAVGTLIVAL 163
+P +L I R + G G T V A +A+ R R F+ G +VA
Sbjct: 91 ATAPFLWVLYIGRIVAGIT-GATGAVAGAYIADITDGDERARHFGFMSA--CFGFGMVAG 147

Query: 164 AAWGASLAGVADAWRYIFAVTAFPAVLGLGLRFLVPES 201
G + G + + A A + L FL+PES
Sbjct: 148 PVLGGLMGGFSPHAPFFAA-AALNGLNFLTGCFLLPES 184


18  B0909_01500  B0909_01365  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
B0909_01500  3       14       0.036467    hypothetical protein
B0909_01495  2       13       -0.248846   cysteine desulfurase
B0909_01490  3       14       -0.444359   guanine deaminase
B0909_01485  2       12       -0.222755   sulfite exporter TauE/SafE family protein
B0909_01480  2       13       0.721252    alpha-hydroxy-acid oxidizing protein
B0909_01475  2       12       0.775239    QacE family quaternary ammonium compound efflux
B0909_01470  2       13       0.365832    TetR/AcrR family transcriptional regulator
B0909_01465  3       15       0.154947    peptidase M15
B0909_01460  3       14       -0.208747   metallophosphoesterase
B0909_01455  3       14       -0.173245   hydroxyisourate hydrolase
B0909_01450  1       20       -0.853328   ureidoglycolate lyase
B0909_01445  -1      17       -1.723006   DUF86 domain-containing protein
B0909_01440  0       18       -0.919483   nucleotidyltransferase
B0909_01435  0       17       0.892418    2-oxo-4-hydroxy-4-carboxy-5-ureidoimidazoline
B0909_01430  2       15       1.628505    allantoinase PuuE
B0909_01425  -1      13       1.070474    DUF1045 domain-containing protein
B0909_01420  -1      11       1.398465    FAD-binding oxidoreductase
B0909_01410  -2      11       1.893122    5-methylaminomethyl-2-thiouridine
B0909_01405  -1      12       1.427902    response regulator
B0909_01400  -1      12       1.598241    DEAD/DEAH box helicase
B0909_01395  0       14       1.877993    EamA family transporter
B0909_01390  1       14       1.156694    DoxX family protein
B0909_01385  2       15       0.855214    L,D-transpeptidase
B0909_01380  2       15       0.978643    DMT family transporter
B0909_01375  1       16       0.962851    circularly permuted type 2 ATP-grasp protein
B0909_01370  1       17       0.294121    hypothetical protein
B0909_01365  2       19       0.295372    transglutaminase family protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
B0909_01460  HTHTETR  60     2e-13    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 59.6 bits (144), Expect = 2e-13
Identities = 26/105 (24%), Positives = 47/105 (44%)

Query: 4 AHGRKKQPEIVRRSLLDCAAKLAADQGVAALSIQAVADAAGVTKGGLFHHFPSKQALLEA 63
A K++ + R+ +LD A +L + QGV++ S+ +A AAGVT+G ++ HF K L
Sbjct: 2 ARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61

Query: 64 VMADLLSALDAEIDALISQDCEAYGCFTRAYVKAVFDDRDRDSGR 108
+ S + ++ R + V + + R
Sbjct: 62 IWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERR 106


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_01415  ACETATEKNASE  28     0.026    Acetate kinase family signature.
		>ACETATEKNASE#Acetate kinase family signature.

Length = 400

Score = 28.2 bits (63), Expect = 0.026
Identities = 8/14 (57%), Positives = 9/14 (64%)

Query: 48 YHTALPRRYGFHGT 61
Y R+YGFHGT
Sbjct: 168 YTKYKIRKYGFHGT 181


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
B0909_01400  HTHFIS  61     2e-11    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 61.0 bits (148), Expect = 2e-11
Identities = 26/110 (23%), Positives = 43/110 (39%), Gaps = 6/110 (5%)

Query: 1092 VLVAEDNDVNQIVFTQILQQAGLRFLIVGNGKKAVQAWEENNPAIILMDVSMPVMNGHQA 1151
+LVA+D+ + V Q L +AG I N + + +++ DV MP N
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 1152 TLAIRAAEQAAADGRHVPIIGVTAHTQEADRELCLQAGMDDYLSKPISPE 1201
I+ A +P++ ++A + G DYL KP
Sbjct: 66 LPRIKKARP------DLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLT 109



Score = 52.5 bits (126), Expect = 7e-09
Identities = 18/86 (20%), Positives = 36/86 (41%), Gaps = 7/86 (8%)

Query: 917 IRVLVIDDNDVNRRILIEQLRTWGIDGHAVEDGPSGIAVLQEAASLGFSIDAIILDYHMP 976
+LV DD+ R +L + L G D + + + D ++ D MP
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDG-----DLVVTDVVMP 58

Query: 977 VMNGLDVVERIRADERFDDIAIVFLT 1002
N D++ RI+ + D+ ++ ++
Sbjct: 59 DENAFDLLPRIK--KARPDLPVLVMS 82


19  B0909_01305  B0909_01150  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
B0909_01305  -1      24       -3.447491   hypothetical protein
B0909_01300  -1      23       -4.006315   MarR family transcriptional regulator
B0909_01295  0       24       -4.229712   *site-specific integrase
B0909_01285  1       27       -5.523015   recombinase family protein
B0909_01280  2       34       -5.918650   hypothetical protein
B0909_01275  3       41       -6.829996   flagellin
B0909_01270  3       31       -4.259805   hypothetical protein
B0909_01265  3       38       -4.131187   DUF1851 domain-containing protein
B0909_01260  1       35       -3.792354   hypothetical protein
B0909_01255  0       31       -2.629736   DUF4123 domain-containing protein
B0909_01250  -1      27       -2.956550   type VI secretion system tip protein VgrG
B0909_01245  -1      26       -2.354401   hypothetical protein
B0909_01235  0       25       -5.295541   DUF2958 domain-containing protein
B0909_01230  0       25       -5.909509   hypothetical protein
B0909_01225  0       25       -5.973816   hypothetical protein
B0909_01220  0       26       -5.382050   ABC transporter substrate-binding protein
B0909_01215  1       29       -6.563443   hypothetical protein
B0909_01210  2       34       -7.253744   hypothetical protein
B0909_01205  3       37       -8.190060   site-specific integrase
B0909_01200  6       45       -9.613683   ABC transporter permease subunit
B0909_01195  5       44       -9.459884   SAM-dependent methyltransferase
B0909_01190  6       41       -9.414907   restriction endonuclease subunit S
B0909_01185  4       38       -8.198700   ABC transporter
B0909_01180  0       21       -4.839463   hypothetical protein
B0909_01175  0       14       -0.962903   XRE family transcriptional regulator
B0909_01170  1       14       1.655403    N-acetyltransferase
B0909_01165  1       13       2.056274    EamA-like transporter family protein
B0909_01160  2       15       2.370627    methyl-accepting chemotaxis protein
B0909_01155  3       15       2.578486    MFS transporter
B0909_01150  3       15       1.812745    NADPH-dependent oxidoreductase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
B0909_01270  FLAGELLIN  153    1e-42    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 153 bits (387), Expect = 1e-42
Identities = 85/565 (15%), Positives = 169/565 (29%), Gaps = 66/565 (11%)

Query: 4 ILTNMAAMAALQTLRTIGANMADTQRQVSSGLRVQAAADNAAYWSISTTMRSDNMALSAV 63
I TN ++ L ++++ ++SSGLR+ +A D+AA +I+ S+ L+
Sbjct: 4 INTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQA 63

Query: 64 SDALGLGAAKVDVAYAGMESTADVLS---EFRAKLVAAKEDGLDKGKIQTELDQLKDQLL 120
S G + + + L E + D IQ E+ Q +++
Sbjct: 64 SRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEID 123

Query: 121 SIATSASFNGVNWLNTNAPENLWELSSLPTSITSSFIRSADGSVRVGTTDIDVADVSLFN 180
++ FNGV L+ + + + ++ + IDV + L
Sbjct: 124 RVSNQTQFNGVKVLSQD------------NQMKIQVGANDGETITIDLQKIDVKSLGLDG 171

Query: 181 VGGGGALQKDIRSLGDIGGFREGEFTGIGNIGFQQFVFTGPFTFGAADHISFDLLLDASD 240
G + + L G T + A
Sbjct: 172 FNVNGPKEATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDT---------TAPT 222

Query: 241 LSAGVSYTVTL-DKTLVDAALGRPDGVINDALDFSFVLGTAFDAAGIPAGSVARSFSGAY 299
+ V T DA + + A I G +F
Sbjct: 223 VPDKVYVNAANGQLTTDDAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKG 282

Query: 300 SQYTIGSREG-TGEPGASVAVSNVISSFAPGNHGAGLENAPYYSLENDYPQWSFGFSGPF 358
+TI ++ G G S ++ + + AG N +L+
Sbjct: 283 VTFTIDTKTGNDGNGKVSTTINGEKVTLTVADITAGAANVDAATLQ-------------- 328

Query: 359 TVYRDVEFSFDIQVGNDAPVSISVTRDMVDTALGTSDGKINSAADMATVLDLALEGKGLN 418
S + + + + SD + N+A + + + N
Sbjct: 329 --------SSKNVYTSVVNGQFTFDDKTKNESAKLSDLEANNAVKGESKITVNGAEYTAN 380

Query: 419 VSASGAAVVFDIDKTLYPLAARRSFMQISNVSDNLGPAPDFDILDVDITDPANDLDHYLS 478
+ + + D I + + L+
Sbjct: 381 AAGDKVTLA-----------------GKTMFIDKTASGVSTLINEDAAAAKK-STANPLA 422

Query: 479 GVDTMLQKVISGAAALGTVKTRIDMQESFMRTLMDSIDKGIGRLVDADMNESSTRLKALQ 538
+D+ L KV + ++LG ++ R D + + + +++ R+ DAD + + Q
Sbjct: 423 SIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSNMSKAQ 482

Query: 539 TQEQLAIQSLQIANSNAENMMMLFR 563
+Q L AN +N++ L R
Sbjct: 483 ILQQAGTSVLAQANQVPQNVLSLLR 507


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
B0909_01255  PF07132  31     0.004    Harpin protein (HrpN)
		>PF07132#Harpin protein (HrpN)

Length = 356

Score = 31.2 bits (70), Expect = 0.004
Identities = 24/64 (37%), Positives = 33/64 (51%), Gaps = 1/64 (1%)

Query: 2 SATTTVAGAGGAIAGGWLGRVIGGLAGSLAGPGGTAVGAWIGGQVGAMAGRAAASAIASY 61
TT+ G + GG LG +GGL SL G GG +G +GG +G+ G SA+
Sbjct: 53 DIMTTMMFMGSMMGGG-LGGGLGGLGSSLGGLGGGLLGGGLGGGLGSSLGSGLGSALGGG 111

Query: 62 MEGA 65
+ GA
Sbjct: 112 LGGA 115


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_01210  BONTOXILYSIN  29     0.032    Bontoxilysin signature.
		>BONTOXILYSIN#Bontoxilysin signature.

Length = 1196

Score = 29.5 bits (66), Expect = 0.032
Identities = 10/44 (22%), Positives = 21/44 (47%)

Query: 378 SSDLDNDSIMILMNHNAEAYVRVFQQVNRIFDNPEEYENDIPVI 421
+S +DN +++I+ + + F+ I+ PE Y + I
Sbjct: 10 NSPVDNKNVVIVRARKTNTFFKAFKVAPNIWVAPERYYGEPLDI 53


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
B0909_01205  TONBPROTEIN  30     0.042    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 29.6 bits (66), Expect = 0.042
Identities = 24/165 (14%), Positives = 47/165 (28%), Gaps = 39/165 (23%)

Query: 503 DPTQICEPFVPTTVPTPPAPQTPPPG-----------------SKGSTPSGNSGGSSHSG 545
+P P P P P P + + ++
Sbjct: 72 EPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPVESRPASPFENTAPA 131

Query: 546 TTKTGSASASTATSATAAANHRPRNFALIEKNRRAKTTFPA---------------TPSS 590
+ +A+A+T+ T+ A+ R + +PA +
Sbjct: 132 RLTSSTATAATSKPVTSVASG-------PRALSRNQPQYPARAQALRIEGQVKVKFDVTP 184

Query: 591 SGRAFGAFITSCLPDDAEHRADRKFDFEWQLEWWRRMTDIPVHVI 635
GR I S P + R + W+ E + + I V+++
Sbjct: 185 DGRVDNVQILSAKPANMFEREVKNAMRRWRYEPGKPGSGIVVNIL 229


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_01165  SACTRNSFRASE  40     1e-06    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 39.5 bits (92), Expect = 1e-06
Identities = 14/59 (23%), Positives = 26/59 (44%), Gaps = 1/59 (1%)

Query: 79 RHTVEHSVYVHMDHRGKGVAEALMQALIGRARAIGKHVMIAGIESRNVASIRLHEKLGF 137
+E + V D+R KGV AL+ I A+ ++ + N+++ + K F
Sbjct: 89 YALIED-IAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHF 146


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
B0909_01150  TCRTETA  85     2e-20    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 85.3 bits (211), Expect = 2e-20
Identities = 81/348 (23%), Positives = 138/348 (39%), Gaps = 11/348 (3%)

Query: 10 ITLLFVATLTIMAGTTVAPSLPAIEQSFLSTPHVGLLSRMVLTLPSVFVALCAPVAGMLA 69
I +L L + + P LP + + + + V ++L L ++ CAPV G L+
Sbjct: 8 IVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALS 67

Query: 70 DRFGRKRLLLGAILLYSLSGMSGLFADSLAGLLVGRAFLGLAIGGIMTIGTALVGDYFES 129
DRFGR+ +LL ++ ++ A L L +GR G+ G + A + D +
Sbjct: 68 DRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGIT-GATGAVAGAYIADITDG 126

Query: 130 PARERYLGLQQAFTQLGGVLFVVAGGLLADIHWRAPFAVYAV-ALLILPAALLFLREPER 188
R R+ G A G V V GGL+ APF A L L E +
Sbjct: 127 DERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHK 186

Query: 189 GDGRARDGGPAGAVNWPVVAVLALTVFLVNALFYTI--PSQLPFFLRELGV-----FSGS 241
G+ R + A V + A+F+ + Q+P L + + +
Sbjct: 187 GERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDAT 246

Query: 242 IAGYAIGIFNLAGALT-ALNFARLRRHMGVAIILAVGLMLMATGFALLAMAKGLASMLSA 300
G ++ F + +L A+ + +G L +G++ TG+ LLA A
Sbjct: 247 TIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPI 306

Query: 301 VAVTGLGLGVVMPAIMSTTIMLAPLRLRGRIAGIVTASMFLGHFISPL 348
+ + G G+ MPA+ + +G++ G + A L + PL
Sbjct: 307 MVLLASG-GIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPL 353


20  B0909_01105  B0909_00920  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
B0909_01105  2       11       1.049475    endo-1,4-beta-xylanase
B0909_01100  3       13       1.339362    glycosyltransferase
B0909_01095  0       13       1.766497    glycosyltransferase family 2 protein
B0909_01090  -1      14       1.697719    hypothetical protein
B0909_01085  0       12       2.069987    glycosyltransferase
B0909_01080  0       14       2.351152    cell shape-determining protein
B0909_24730  0       15       2.707155    hypothetical protein
B0909_01070  0       16       2.949207    phosphomannomutase
B0909_24735  2       15       2.411404    globin
B0909_01065  1       15       1.844751    DUF423 domain-containing protein
B0909_01060  2       20       1.870888    hypothetical protein
B0909_01055  2       18       2.062165    DUF2325 domain-containing protein
B0909_01050  0       18       1.784555    TetR family transcriptional regulator
B0909_01045  0       18       1.380158    Zn-dependent hydrolase
B0909_01040  2       18       1.720471    dihydropyrimidinase
B0909_01035  3       22       2.547902    NUDIX domain-containing protein
B0909_01030  2       22       2.607554    ABC transporter ATP-binding protein
B0909_01025  1       21       1.854986    ABC transporter permease
B0909_01020  0       19       1.923017    ABC transporter permease
B0909_24740  1       18       2.071811    ABC transporter substrate-binding protein
B0909_01015  0       17       1.854028    efflux RND transporter periplasmic adaptor
B0909_01010  -1      13       1.624085    AcrB/AcrD/AcrF family protein
B0909_01005  0       12       1.157350    Crp/Fnr family transcriptional regulator
B0909_01000  0       12       1.958327    MBL fold metallo-hydrolase
B0909_00995  2       13       1.653661    urease accessory protein UreG
B0909_00990  2       14       0.945852    urease accessory protein UreF
B0909_00985  2       20       0.809264    urease accessory protein UreE
B0909_00975  1       17       0.944115    peroxiredoxin
B0909_00970  0       16       0.258081    TIGR02117 family protein
B0909_00965  -1      16       0.287948    urease subunit alpha
B0909_00960  1       18       0.571188    Urease operon accessory protein
B0909_00955  1       17       1.018405    lysozyme inhibitor LprI family protein
B0909_00950  2       21       0.963928    GFA family protein
B0909_00945  5       19       0.393037    urease subunit beta
B0909_00940  4       16       1.008898    DUF1272 domain-containing protein
B0909_00935  3       15       1.211543    urease subunit gamma
B0909_00930  2       15       0.531203    urease accessory protein UreD
B0909_00925  1       14       0.134083    polyhydroxybutyrate depolymerase
B0909_00920  2       14       0.047380    urea ABC transporter ATP-binding subunit UrtE
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
B0909_01040  HTHTETR  67     9e-16    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 66.6 bits (162), Expect = 9e-16
Identities = 29/162 (17%), Positives = 70/162 (43%), Gaps = 9/162 (5%)

Query: 11 KRTRIQ-EEKQEAILEAALSVFSINGFRGSTIDQIAEAAGMSKPNVLYYFRTKEAMHRAL 69
++T+ + +E ++ IL+ AL +FS G +++ +IA+AAG+++ + ++F+ K + +
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 70 IERVLDTWLDPLRAFDAE--GNPESEIRSYIRRKLEMSRDFPRESRLFA-----NEILQG 122
E + + A+ G+P S +R + LE + R L E +
Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGE 122

Query: 123 APHIEDELKGPLKQLVDEKAEVIRAWVKAGRIAK-CDPYHLI 163
++ + + D + ++ ++A +
Sbjct: 123 MAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAA 164


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
B0909_01035  PF05272  29     0.041    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.9 bits (64), Expect = 0.041
Identities = 12/38 (31%), Positives = 21/38 (55%), Gaps = 1/38 (2%)

Query: 321 VIEAIGHFDPVTFDPVLVGRVRSAAERLGYSHMDIISG 358
+++A+G DP P+L G+VR G+ ++ SG
Sbjct: 803 LVQALG-ADPGKSSPMLEGQVRDWLNENGWEYLRETSG 839


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
B0909_01030  UREASE  50     1e-08    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 50.1 bits (120), Expect = 1e-08
Identities = 30/95 (31%), Positives = 40/95 (42%), Gaps = 16/95 (16%)

Query: 4 TIIKNGTIVTADLTYKADVKIEGGRITEIG----PDLTGGT---------VLDATGCFVM 50
T+I N I+ KAD+ ++ GRI IG PD+ G V+ G V
Sbjct: 70 TVITNALILDHWGIVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAGEGKIVT 129

Query: 51 PGGIDPHVHLEMPFMGTYSADDFESGTRAALAGGT 85
GG+D H+H P + SG L GGT
Sbjct: 130 AGGMDSHIHFICP---QQIEEALMSGLTCMLGGGT 161


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
B0909_01000  RTXTOXIND  49     2e-08    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 49.1 bits (117), Expect = 2e-08
Identities = 47/257 (18%), Positives = 93/257 (36%), Gaps = 25/257 (9%)

Query: 102 DSLVLQKSQLEANKAKAEASVTQSKAQVLEAQANLNDAVRQRDRAARLGQSGSGSVSETE 161
+ ++KS+L+ + +K VLE + +A + Q E+E
Sbjct: 228 NLSRVEKSRLDDFSSLLHKQAI-AKHAVLEQENKYVEA--VNELRVYKSQLEQ---IESE 281

Query: 162 KTEAAAQVAQARLDAAKQAVSA---GEADIKVVDAQIDDIDLKLTRTGVKTPVAGIVSAK 218
A + + + +I ++ ++ + + + ++ PV+ V
Sbjct: 282 ILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQL 341

Query: 219 NAKVGAIASGAGNPLFTVI-KDGAIELVADLSETDIQKVKAGQKAYLTV-AGGATK---I 273
L ++ +D +E+ A + DI + GQ A + V A T+ +
Sbjct: 342 KVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYL 401

Query: 274 EGKVRLVSPTVDPTTRLGSVH---VVLPENS--------PARSGMYASAEVIVEETNALA 322
GKV+ ++ RLG V + + EN P SGM +AE+ + ++
Sbjct: 402 VGKVKNINLDAIEDQRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGMRSVIS 461

Query: 323 LPLSAVTSGRDGSTTRR 339
LS + S R
Sbjct: 462 YLLSPLEESVTESLRER 478



Score = 38.3 bits (89), Expect = 4e-05
Identities = 19/151 (12%), Positives = 51/151 (33%), Gaps = 9/151 (5%)

Query: 56 IIATGTIRPVDEIY-VQPLVDGLSIDTLNADIGDRVEANAVLAVLSSDSLVLQKSQLEAN 114
A G + ++P+ + + + + + G+ V VL L++ EA+
Sbjct: 84 ATANGKLTHSGRSKEIKPIENSIVKEIIVKE-GESVRKGDVLLKLTA-------LGAEAD 135

Query: 115 KAKAEASVTQSKAQVLEAQANLNDAVRQRDRAARLGQSGSGSVSETEKTEAAAQVAQARL 174
K ++S+ Q++ + Q + +L E+ + + +
Sbjct: 136 TLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQF 195

Query: 175 DAAKQAVSAGEADIKVVDAQIDDIDLKLTRT 205
+ E ++ A+ + ++ R
Sbjct: 196 STWQNQKYQKELNLDKKRAERLTVLARINRY 226


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_00995  ACRIFLAVINRP  624    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 624 bits (1612), Expect = 0.0
Identities = 260/1064 (24%), Positives = 471/1064 (44%), Gaps = 71/1064 (6%)

Query: 4 SAWSIRNPIAPLLGFALLMILGMQAFKTLPITRFPNIDVPVVAVTVTQSGASPSELEMQV 63
+ + IR PI + +LM+ G A LP+ ++P I P V+V+ GA ++ V
Sbjct: 2 ANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTV 61

Query: 64 TKEIEDAVAAISGVDEIQSTVV-DGQSTTTVVFRIEKRTEEAVQDTKDAIDKIRSDLPAD 122
T+ IE + I + + ST G T T+ F+ + A ++ + LP +
Sbjct: 62 TQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE 121

Query: 123 IEEPIVSKIDVEGQAIQTFAVSS--PNMTLEELSWFVDDTIKRSLQGQSGIGKVDRYGGA 180
+++ +S + S P T +++S +V +K +L +G+G V +G
Sbjct: 122 VQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA- 180

Query: 181 DREVRVSLSPEKLDAYGITASEVNTQLRGTNIDLGSGRGQIGGN--------EQTIRTLG 232
+R+ L + L+ Y +T +V QL+ N + +G Q+GG +I
Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAG--QLGGTPALPGQQLNASIIAQT 238

Query: 233 DTRDVSQLANTTIALS-NGRFVKLSELGTVTDTYQEQKSFSRFNGNPAVTFAVFRSKGAS 291
++ + T+ ++ +G V+L ++ V + +R NG PA + + GA+
Sbjct: 239 RFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGAN 298

Query: 292 EVSVAETVAESLAKVRKDHP-DVSIEMVDDAVYFTYGNYKAALDTLIEGAILAVIVVLLF 350
+ A+ + LA+++ P + + D F + + TL E +L +V+ LF
Sbjct: 299 ALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLF 358

Query: 351 LRNWRATLIAAVALPLSAIPTFWIMDIMGFSLNLVSFLALTLATGILVDDAIVEIENIAR 410
L+N RATLI +A+P+ + TF I+ G+S+N ++ + LA G+LVDDAIV +EN+ R
Sbjct: 359 LQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVER 418

Query: 411 HIKMGKT-PYRAALEAADEIGLAVIATSFTIIAVFVPVSFMPGIPGQYFIQFGLTVAFSV 469
+ K P A ++ +I A++ + + AVF+P++F G G + QF +T+ ++
Sbjct: 419 VMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAM 478

Query: 470 FFSLAVARLITPLMAAYLMRAEDAMDDHHDNDSRLMKAYTRMVSATTR------KWWA-- 521
S+ VA ++TP + A L++ A +HH+N + +
Sbjct: 479 ALSVLVALILTPALCATLLKPVSA--EHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGS 536

Query: 522 RYLTLIGAIGFLVASVILLAGVPGSFLPPDDASRVTLSVELPPNATLDETDRTTTEI--Y 579
L+ + V+L +P SFLP +D ++LP AT + T + ++ Y
Sbjct: 537 TGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDY 596

Query: 580 HAIRDINGVESVFILGGASPKGDLELRRATVNVILQHIDHSLLKTLVNKGLGSIPLIGQY 639
+ + VESVF + G S G N G+ + L
Sbjct: 597 YLKNEKANVESVFTVNGFSFSGQ----------------------AQNAGMAFVSLK--- 631

Query: 640 LPKVEEKGRLRPQWDVERDIFAKVRGIPDVRIIKLNDRAEREL-SFNFLSSNEKD----- 693
P E G V ++ I D +I N A EL + D
Sbjct: 632 -PWEERNGDENSAEAVIHRAKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLG 690

Query: 694 ---LNDAVGILESRLRASPI-LANVSSEGALPRPELQIRPRKDEIARLGITPQQISQTVR 749
L A L P L +V G + ++ +++ LG++ I+QT+
Sbjct: 691 HDALTQARNQLLGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTIS 750

Query: 750 VATIGDIDAQLTKISLDDRQIPIRVQASLDTRRDLATIRALKIKTASGSLVPLYSVADID 809
A G + R + VQA R + L +++A+G +VP +
Sbjct: 751 TALGG---TYVNDFIDRGRVKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSH 807

Query: 810 YAEGPSSIKRNDRNRVVSIGSDVPFGTALDTSTAEFKRIVEETKLPASVRLAESGDAKVQ 869
+ G ++R + + I + GT+ + A + + +KLPA + +G + +
Sbjct: 808 WVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAMALMENL--ASKLPAGIGYDWTGMSYQE 865

Query: 870 GEMQQGFVNAMLLGLMLVLVVLILLFKDVIQPFTILFSLPLAIGGVAVALIITQNALSMP 929
+ + ++V + L L++ P +++ +PL I GV +A + +
Sbjct: 866 RLSGNQAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVY 925

Query: 930 VLIGILMLMGIVTKNAILLVDFAIE-MRRHGMERVHAMIEAGRKRARPIIMTSIAMSAGM 988
++G+L +G+ KNAIL+V+FA + M + G V A + A R R RPI+MTS+A G+
Sbjct: 926 FMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGV 985

Query: 989 LPSALGVGEGGSFRAPMAIAVIGGIIVSTVLSLIVVPAFFLIMD 1032
LP A+ G G + + I V+GG++ +T+L++ VP FF+++
Sbjct: 986 LPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVIR 1029


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
B0909_00955  UREASE  1102   0.0      Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 1102 bits (2853), Expect = 0.0
Identities = 503/570 (88%), Positives = 538/570 (94%), Gaps = 1/570 (0%)

Query: 1 MPYKISRAAYAGMFGPTVGDKVRLADTELFIEIEKDHTTYGEEVKFGGGKVIRDGMGQSQ 60
M Y++SRAAYA MFGPTVGDKVRLADTELFIE+EKD TT+GEEVKFGGGKVIRDGMGQSQ
Sbjct: 1 MSYRMSRAAYANMFGPTVGDKVRLADTELFIEVEKDFTTHGEEVKFGGGKVIRDGMGQSQ 60

Query: 61 ATRAEGAVDTVITNVVIVDHSGIYKADVGLKNGRIHAIGKAGNPDTQPGVTIIVGPSTEA 120
TR GAVDTVITN +I+DH GI KAD+GLK+GRI AIGKAGNPD QPGVTIIVGP TE
Sbjct: 61 VTREGGAVDTVITNALILDHWGIVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEV 120

Query: 121 IAGEGRILTAGGMDAHIHYICPQQIEEALMSGVTCMLGGGSGPAHGTLATTCT-GAWHIE 179
IAGEG+I+TAGGMD+HIH+ICPQQIEEALMSG+TCMLGGG+GPAHGTLATTCT G WHI
Sbjct: 121 IAGEGKIVTAGGMDSHIHFICPQQIEEALMSGLTCMLGGGTGPAHGTLATTCTPGPWHIA 180

Query: 180 RMIESFDAFPMNLALAGKGNASLPAPLEEMILAGASSLKLHEDWGTTPAAIDNCLTVADE 239
RMIE+ DAFPMNLA AGKGNASLP L EM+L GA+SLKLHEDWGTTPAAID CL+VADE
Sbjct: 181 RMIEAADAFPMNLAFAGKGNASLPGALVEMVLGGATSLKLHEDWGTTPAAIDCCLSVADE 240

Query: 240 YDVQVMIHTDTLNESGFVEDTVAAIRGRTIHAFHTEGAGGGHAPDIIKVCGNPNVIPSST 299
YDVQVMIHTDTLNESGFVEDT+AAI+GRTIHA+HTEGAGGGHAPDII++CG PNVIPSST
Sbjct: 241 YDVQVMIHTDTLNESGFVEDTIAAIKGRTIHAYHTEGAGGGHAPDIIRICGQPNVIPSST 300

Query: 300 NPTRPYTVNTLAEHLDMLMVCHHLSPSIPEDIAFAESRIRKETIAAEDILHDIGAFSIIS 359
NPTRPYTVNTLAEHLDMLMVCHHLSP+IPEDIAFAESRIRKETIAAEDILHDIGAFSIIS
Sbjct: 301 NPTRPYTVNTLAEHLDMLMVCHHLSPTIPEDIAFAESRIRKETIAAEDILHDIGAFSIIS 360

Query: 360 SDSQAMGRVGEVAIRTWQTADKMKRQRGRLKEEVGENDNFRVRRYIAKYTINPAIAQGVS 419
SDSQAMGRVGEVAIRTWQTADKMKRQRGRLKEE G+NDNFRV+RYIAKYTINPAIA G+S
Sbjct: 361 SDSQAMGRVGEVAIRTWQTADKMKRQRGRLKEETGDNDNFRVKRYIAKYTINPAIAHGLS 420

Query: 420 HEIGSVEVGKRADLVLWNPAFFGVKPEMVLLGGSIAAAPMGDPNASIPTPQPMHYRPMFA 479
HEIGS+EVGKRADLVLWNPAFFGVKP+MVLLGG+IAAAPMGDPNASIPTPQP+HYRPMF
Sbjct: 421 HEIGSLEVGKRADLVLWNPAFFGVKPDMVLLGGTIAAAPMGDPNASIPTPQPVHYRPMFG 480

Query: 480 AYGKLRTNSSVTFVSQASLDVGLAQRLGVAKKLLAVKNVRGGISKASMIHNSLTPHIEVD 539
AYG+ RTNSSVTFVSQASLD GLA RLGVAK+L+AV+N RGGI KASMIHNSLTPHIEVD
Sbjct: 481 AYGRSRTNSSVTFVSQASLDAGLAGRLGVAKELVAVQNTRGGIGKASMIHNSLTPHIEVD 540

Query: 540 PETYEVRADGELLTCEPATVLPMAQRYFLF 569
PETYEVRADGELLTCEPATVLPMAQRYFLF
Sbjct: 541 PETYEVRADGELLTCEPATVLPMAQRYFLF 570


21  B0909_00815  B0909_00750  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
B0909_00815  4       17       3.071286    DUF2793 domain-containing protein
B0909_00805  -1      13       0.835449    DUF1153 domain-containing protein
B0909_00800  0       13       -1.501509   N-acetyltransferase
B0909_00795  1       12       -0.989642   DUF1059 domain-containing protein
B0909_00790  1       14       -0.477771   flagellar export protein FliJ
B0909_00785  0       13       -0.579383   DNA-binding response regulator
B0909_00780  0       13       -0.160251   N-acetyltransferase
B0909_00775  2       12       1.666738    alpha/beta fold hydrolase
B0909_00770  2       14       2.040927    N-acetyltransferase
B0909_00765  2       16       2.996072    histidine phosphotransferase
B0909_00760  1       15       2.906226    DUF1134 domain-containing protein
B0909_00755  0       15       3.901570    hypothetical protein
B0909_00750  0       15       3.243755    lytic murein transglycosylase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_00805  PRTACTNFAMLY  29     0.042    Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 28.9 bits (64), Expect = 0.042
Identities = 31/111 (27%), Positives = 40/111 (36%), Gaps = 7/111 (6%)

Query: 218 AIAGLPAGTTKPANGSAAGFSLLFIAEGGFALGDAVGGGGRE--FVVPARGIYLVTLSLA 275
A+ L T + + A G GG G AV GG F G Y V +S +
Sbjct: 249 AVVHLQRATIRRGDAPAGG-----AVPGGAVPGGAVPGGFGPGGFGPVLDGWYGVDVSGS 303

Query: 276 VVSSSGHRVTLFVNGAAASFGIAGNASTAGSSQSATSLLALETGDRLRLQH 326
V + V GAA G + +G S SA +ETG R
Sbjct: 304 SVELAQSIVEAPELGAAIRVGRGARVTVSGGSLSAPHGNVIETGGARRFAP 354


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_00795  SACTRNSFRASE  27     0.012    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 26.8 bits (59), Expect = 0.012
Identities = 11/31 (35%), Positives = 16/31 (51%)

Query: 34 LIIIDHTGVPDALRGRGVGQALAAHAIDEAR 64
+I+ V R +GVG AL AI+ A+
Sbjct: 89 YALIEDIAVAKDYRKKGVGTALLHKAIEWAK 119


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
B0909_00780  HTHFIS  79     2e-19    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 79.5 bits (196), Expect = 2e-19
Identities = 27/124 (21%), Positives = 58/124 (46%), Gaps = 1/124 (0%)

Query: 2 RVLLIEDDSATAQSIELMLKSESFNVYTTDLGEEGVDLGKLYDYDIILLDLNLPDMSGYE 61
+L+ +DD+A + L ++V T D D+++ D+ +PD + ++
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 VLRTLRLSKVKTPILILSGMAGIEDKVRGLGFGADDYMTKPFHKDELVARI-HAIVRRSK 120
+L ++ ++ P+L++S ++ GA DY+ KPF EL+ I A+ +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 121 GHAQ 124
++
Sbjct: 125 RPSK 128


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
B0909_00750  IGASERPTASE  38  8e-05  IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 37.7 bits (87), Expect = 8e-05
Identities = 43/247 (17%), Positives = 91/247 (36%), Gaps = 19/247 (7%)

Query: 169 AATRDLEAEQAKQSATRFRKEGETASRELQDVAARNKEAMSAVARESRKVARLEEQVATL 228
+ T + AE +KQ + K E ++ + A+N+E A+ + K +VA
Sbjct: 1034 SETTETVAENSKQES----KTVEKNEQDATETTAQNREVAKE-AKSNVKANTQTNEVAQS 1088

Query: 229 MAKNADLDTMLTLRNQEAARLKEQLAAAGGRENDMIIRLPNPAVPAEKAAPPPQ-QPAPA 287
++ + T T + A KE+ A + + ++ + P ++ + Q Q PA
Sbjct: 1089 GSETKETQT--TETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPA 1146

Query: 288 PEAAPAIAADAGQAATE---------EESPALPVVEEPTAPQPNGLEQEIEDIRNQGTAL 338
E P + Q+ T +E+ + + N +E+ N A
Sbjct: 1147 RENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPAT 1206

Query: 339 TERLLNVRGTGNDEPIRREIARIAAEMIALTAAREGEKSPIP--ELLAKASGSSGRESLA 396
T+ +N + + R R + ++S + +L + + + ++ A
Sbjct: 1207 TQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTNAVLSDARA 1266

Query: 397 KRAKVVM 403
K V +
Sbjct: 1267 KAQFVAL 1273


22  B0909_00065  B0909_24815  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
B0909_00065       0       20  -3.104063  2-hydroxy-3-oxopropionate reductase
B0909_00060       1       22  -3.602531  beta-galactosidase
B0909_00055       1       23  -3.264529  hypothetical protein
B0909_00050       1       23  -2.692736  hypothetical protein
B0909_00045       0       20  -1.673405  NUDIX domain-containing protein
B0909_00040       1       13  -0.154748  hypothetical protein
B0909_00035       1       12  -0.273473  LacI family transcriptional regulator
B0909_00030       1       11  -0.205031  FAD-dependent oxidoreductase
B0909_00025       2       13  -0.802932  sugar ABC transporter substrate-binding protein
B0909_00020       1       18  -2.003098  sugar ABC transporter permease
B0909_00015       1       19  -3.356367  carbohydrate ABC transporter permease
B0909_00010       1       21  -3.592187  sn-glycerol-3-phosphate ABC transporter
B0909_00005      -1       23  -3.384969  ***phosphoribosylaminoimidazolesuccinocarboxamide
B0909_24755       0       20  -2.622671  TetR/AcrR family transcriptional regulator
B0909_24760       1       16  -2.046166  efflux RND transporter periplasmic adaptor
B0909_24770       1       15  -1.135809  hydrophobe/amphiphile efflux-1 family RND
B0909_24775       2       15  -0.827144  efflux transporter outer membrane subunit
B0909_24780       0       14  -0.517389  elongation factor P
B0909_24785       0       14  -0.247976  EF-P lysine aminoacylase GenX
B0909_24790       0       13  -0.358274  lysine-2,3-aminomutase-like protein
B0909_24795      -1       14  -0.243197  hypothetical protein
B0909_24800      -1       11  -0.258943  Lrp/AsnC family transcriptional regulator
B0909_24805      -1       12   0.313211  LysR family transcriptional regulator
B0909_24810      -1       11   0.140784  FAD-binding oxidoreductase
B0909_24815       2       12   0.425766  molybdate ABC transporter substrate-binding
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
B0909_00005  PF05272  35  4e-04  Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 35.0 bits (80), Expect = 4e-04
Identities = 15/56 (26%), Positives = 20/56 (35%), Gaps = 9/56 (16%)

Query: 32 VVLVGPSGCGKSTLLRMIAGLETVTSGDISISNRVVNEIEPKDRDIAMVFQNYALY 87
VVL G G GKSTL+ + GL+ + I +D Y
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDI---------GTGKDSYEQIAGIVAY 645


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
B0909_24780  HTHTETR  71  2e-17  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 71.2 bits (174), Expect = 2e-17
Identities = 40/207 (19%), Positives = 83/207 (40%), Gaps = 19/207 (9%)

Query: 21 RRPRRSAEETRRDILAKAEELFRERGFNAVAIADIAAALGMSPANVFKNFSSKNALVDAI 80
R+ ++ A+ETR+ IL A LF ++G ++ ++ +IA A G++ ++ +F K+ L I
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 81 A---FEQIGAFERDI---RPLDKNHAPLSRLRHLARTLMEQHHQDL------NDNPYIFE 128
IG E + P D L H+ + + + + L + ++ E
Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGE 122

Query: 129 MILMTAKQDMKCGDYYKAVIASLLADIISDGIDAG-IYAPADIPPLAETVLHALTSVIHP 187
M ++ Q C + Y + + I+A + A A + ++ ++
Sbjct: 123 MAVVQQAQRNLCLESY-----DRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMEN 177

Query: 188 VLIAREDIGNLATRCDQLVDLIDAGLR 214
L A + +L V ++
Sbjct: 178 WLFAPQSF-DLKKEARDYVAILLEMYL 203


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
B0909_24785  RTXTOXIND  49  2e-08  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 49.1 bits (117), Expect = 2e-08
Identities = 29/132 (21%), Positives = 58/132 (43%), Gaps = 20/132 (15%)

Query: 69 EIRPRVTGIIREIPFKEGSEVKQGDILYQIEDNTYLAEVAQAKANVAKAEASIPSAQANL 128
EI+P I++EI KEG V++GD+L ++ A+A+ K ++S+ A+
Sbjct: 98 EIKPIENSIVKEIIVKEGESVRKGDVLLKLTA-------LGAEADTLKTQSSLLQARLEQ 150

Query: 129 ARYERLVNS----GATQIEYENAKVTLLQAEADVAQTKAAL---------ETAEINLDLT 175
RY+ L S +++ + +E +V + + + + + L+L
Sbjct: 151 TRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLD 210

Query: 176 KVRAPFDGITSA 187
K RA + +
Sbjct: 211 KKRAERLTVLAR 222



Score = 43.3 bits (102), Expect = 1e-06
Identities = 20/98 (20%), Positives = 33/98 (33%), Gaps = 9/98 (9%)

Query: 105 AEVAQAKANVAKAEASIPSAQANLARYERLVNSGATQIEYENAKVTLLQAEADVAQTKAA 164
E+ K+ + + E+ I SA+ TQ+ L Q ++
Sbjct: 266 NELRVYKSQLEQIESEILSAKEEYQL--------VTQLFKNEILDKLRQTTDNIGLLTLE 317

Query: 165 LETAEINLDLTKVRAPFDGITSA-TAFSIGNVVTANQT 201
L E + +RAP + G VVT +T
Sbjct: 318 LAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAET 355


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
B0909_24790  ACRIFLAVINRP  1118  0.0  Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 1118 bits (2893), Expect = 0.0
Identities = 582/1031 (56%), Positives = 742/1031 (71%), Gaps = 5/1031 (0%)

Query: 1 MAHFFIRRPVFAWVIAIVIMLGGALAIATLSISQYPDIAPTTVRVSATYNGASAETVEKS 60
MA+FFIRRP+FAWV+AI++M+ GALAI L ++QYP IAP V VSA Y GA A+TV+ +
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 61 VTTIIEDGMTGLDDLTYMTSSS-STGSAEVTLTFGNSIMPDIAQVQVQNKLQLVQSQLPD 119
VT +IE M G+D+L YM+S+S S GS +TLTF + PDIAQVQVQNKLQL LP
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 120 TVQQQGIQVSRSTSSILMVGALISTDGKRNSADLGDVFSSRVEDQIKRLEGVGSINVFGS 179
VQQQGI V +S+SS LMV +S + D+ D +S V+D + RL GVG + +FG+
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 180 EYAMRIWLDPFKLNKYQLTTADVTSAIESQNTQVSVGSLGAVPAVKGQQLNVTVTAQSQL 239
+YAMRIWLD LNKY+LT DV + ++ QN Q++ G LG PA+ GQQLN ++ AQ++
Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240

Query: 240 TTVADFESVILKVEKDGATVRLSDVARIEIGQETYGGDSRSNGRPSAGFAVNLATGANAL 299
+F V L+V DG+ VRL DVAR+E+G E Y +R NG+P+AG + LATGANAL
Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300

Query: 300 DTAARVKAALANVEGSLPEGVAIEYPYDTTPFVKLSIEKVVHTLIEAIILVFVVLLVFLQ 359
DTA +KA LA ++ P+G+ + YPYDTTPFV+LSI +VV TL EAI+LVF+V+ +FLQ
Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360

Query: 360 NLRATFIPMIAVPVVLLGTFGILALTGYSINTLTMFAMVLAIGLLVDDAIVVVENVERIM 419
N+RAT IP IAVPVVLLGTF ILA GYSINTLTMF MVLAIGLLVDDAIVVVENVER+M
Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420

Query: 420 SEEGLSPVEATEKSMGEITGAIIGIALVLTAVFIPMAFFGGSTGIIYRQFSVTIVSAMLL 479
E+ L P EATEKSM +I GA++GIA+VL+AVFIPMAFFGGSTG IYRQFS+TIVSAM L
Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480

Query: 480 SAVVAIVLTPALCATMLKPI--DHHKKQRGPGAWFNRGFGKTTDGYVSSIGYLLKRPLRV 537
S +VA++LTPALCAT+LKP+ +HH+ + G WFN F + + Y +S+G +L R
Sbjct: 481 SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540

Query: 538 MIIFAIVIGGCAWFFSKLPSSFLPQEDQGVLLTIIQTPTGSNIERTNEVVKQVESYFREK 597
++I+A+++ G F +LPSSFLP+EDQGV LT+IQ P G+ ERT +V+ QV Y+ +
Sbjct: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600

Query: 598 EAANVESVFGVLGFSFSGSGQNNAIVFTKLKDFSERTAPDQHAGAIVQRAMGTFFGFRDA 657
E ANVESVF V GFSFSG QN + F LK + ER + A A++ RA RD
Sbjct: 601 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDG 660

Query: 658 QVFPLLPPAIQGMGTSSGFSMYLVDSGRNGTDALTASSKELIALATGNP-KISSLRSDSQ 716
V P PAI +GT++GF L+D G DALT + +L+ +A +P + S+R +
Sbjct: 661 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL 720

Query: 717 DNETQMKIILDQEKMGAMGVDLSSVNLMLSTIFAGRDVNDFTLNGELKPVYVQGDAPYRM 776
++ Q K+ +DQEK A+GV LS +N +ST G VNDF G +K +YVQ DA +RM
Sbjct: 721 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRM 780

Query: 777 QPDDLKFWYARNTNGEMVPFSSFSEVKWINAPPSLARFNGTGAISLEGTAGAGVASGEAM 836
P+D+ Y R+ NGEMVPFS+F+ W+ P L R+NG ++ ++G A G +SG+AM
Sbjct: 781 LPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAM 840

Query: 837 DEMERLTASLPGGYTVAWQGISYQERLSGSQAPMLYALSVLIVFLCLAALYESWSIPFSV 896
ME L + LP G W G+SYQERLSG+QAP L A+S ++VFLCLAALYESWSIP SV
Sbjct: 841 ALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSV 900

Query: 897 ILAVPVGVLGALTAAHFFGQTNDVYFKVGLLTTIGLAAKNAILIVEFAKERQEH-GLSLV 955
+L VP+G++G L AA F Q NDVYF VGLLTTIGL+AKNAILIVEFAK+ E G +V
Sbjct: 901 MLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVV 960

Query: 956 EAALEAAKLRLRPIIMTSLAFILGVVPLAIATGAGSAAQNAIGIGVLGGMLSATLLGIFF 1015
EA L A ++RLRPI+MTSLAFILGV+PLAI+ GAGS AQNA+GIGV+GGM+SATLL IFF
Sbjct: 961 EATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFF 1020

Query: 1016 VPSFFVIIRRL 1026
VP FFV+IRR
Sbjct: 1021 VPVFFVVIRRC 1031


23  B0909_25015  B0909_25130  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
B0909_25015       0        9  -3.165985  ABC transporter substrate-binding protein
B0909_25020       0       10  -3.176256  carbohydrate ABC transporter permease
B0909_25025       0        9  -3.793828  sugar ABC transporter permease
B0909_25030      -1       10  -3.330186  gfo/Idh/MocA family oxidoreductase
B0909_25035       0       12  -2.795990  hypothetical protein
B0909_25040      -1       11  -1.676582  GntR family transcriptional regulator
B0909_25045       0       13  -0.721458  SIS domain-containing protein
B0909_25050       1       13  -0.067193  N-acetylglucosamine-6-phosphate deacetylase
B0909_25055       1       13   0.266290  ROK family protein
B0909_25060       2       13   0.321707  FAD-dependent oxidoreductase
B0909_25070       0       11  -0.101484  phosphatase PAP2 family protein
B0909_25080      -2       11   0.591718  1-deoxy-D-xylulose-5-phosphate reductoisomerase
B0909_25090      -3       10  -0.015647  5-aminolevulinate synthase
B0909_25095      -2       11   0.207198  glycine zipper 2TM domain-containing protein
B0909_25100      -3       11  -0.050876  outer membrane protein assembly factor
B0909_25105      -2       11   0.027437  translocation/assembly module TamB
B0909_25110      -1       18  -1.725614  chemotaxis protein CheW
B0909_25115       2       21  -1.226495  PAS domain S-box protein
B0909_25120       6       28  -1.024111  formate dehydrogenase accessory
B0909_25125       7       30  -1.250912  F0F1 ATP synthase subunit epsilon
B0909_25130       4       20   0.234834  F0F1 ATP synthase subunit beta
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
B0909_25030  MALTOSEBP  54  4e-10  Maltose binding protein signature.
		>MALTOSEBP#Maltose binding protein signature.

Length = 396

Score = 54.0 bits (129), Expect = 4e-10
Identities = 87/357 (24%), Positives = 135/357 (37%), Gaps = 45/357 (12%)

Query: 70 EQCQDKATTLAAAGTPVAMAYVGSRTLKQFAQNDLIVPVPMTEDEKKSYYPNIVDTVTFE 129
++ ++K +AA G + + +AQ+ L+ + + + YP D V +
Sbjct: 67 DKLEEKFPQVAATGDGPDIIFWAHDRFGGYAQSGLLAEITPDKAFQDKLYPFTWDAVRYN 126

Query: 130 DTQWGVPVAFSTKALYWNKDLFKQAGLDPEVPPKTWAEEIAFAKQIKEKTGIAGYGLPAK 189
P+A +L +NKDL PPKTW E A K++K K G A
Sbjct: 127 GKLIAYPIAVEALSLIYNKDLLPN-------PPKTWEEIPALDKELKAK------GKSAL 173

Query: 190 TFDNTMHQFMHWVYT----------NNGKVIDGDKITVDSPQVVAALTAYKD-ITPYSVE 238
F N + W NGK D + VD+ A LT D I +
Sbjct: 174 MF-NLQEPYFTWPLIAADGGYAFKYENGKY-DIKDVGVDNAGAKAGLTFLVDLIKNKHMN 231

Query: 239 GPTAYEQNEIRAIFLDGKVGMIQAGSGAATRLQETKINWGIATLPL---GPEAKGPGTLL 295
T Y E A F G+ M G A + + +K+N+G+ LP P G L
Sbjct: 232 ADTDYSIAE--AAFNKGETAMTINGPWAWSNIDTSKVNYGVTVLPTFKGQPSKPFVGVL- 288

Query: 296 ITDSLAIFKGTGVEEKATEFAK--FITSPGPQGEYELQGGAGLTPLRPSA--KVDEFVAK 351
S I + +E A EF + +T G L+ PL A +E +AK
Sbjct: 289 ---SAGINAASPNKELAKEFLENYLLTDEG------LEAVNKDKPLGAVALKSYEEELAK 339

Query: 352 DPFWKPLIDGIAYGGPEPLFTDYKGFQDVMIEMVQSVVTGKATPEDAAKKASSALEQ 408
DP ++ G P F + V + +G+ T ++A K A + + +
Sbjct: 340 DPRIAATMENAQKGEIMPNIPQMSAFWYAVRTAVINAASGRQTVDEALKDAQTRITK 396


24  B0909_25245  B0909_25335  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
B0909_25245       2       14  -2.083642  succinate dehydrogenase, hydrophobic membrane
B0909_25250       3       16   0.027227  succinate dehydrogenase, cytochrome b556
B0909_25255       2       17   1.424219  hypothetical protein
B0909_25260       1       15   1.895463  methyltransferase
B0909_25265       1       14   1.962689  EVE domain-containing protein
B0909_25270       1       11   2.426829  hypothetical protein
B0909_25275       2       11   2.564009  NAD(P)-dependent glycerol-3-phosphate
B0909_25280       3       12   1.830419  tRNA
B0909_25285       3       15   1.166756  hydroxymethylbilane synthase
B0909_25290       1       13   0.957518  uroporphyrinogen-III synthase
B0909_25295       1       16   0.287800  hypothetical protein
B0909_25300      -1       13  -0.724244  heme biosynthesis protein HemY
B0909_25305      -2       12  -1.726214  hypothetical protein
B0909_25310      -1        9  -0.867051  glutamine amidotransferase
B0909_25315       0       11  -1.040596  MFS transporter
B0909_25320       2       12  -2.014167  DUF167 domain-containing protein
B0909_25325       2       12  -1.713521  inorganic diphosphatase
B0909_25330       1       10  -0.950179  GNAT family N-acetyltransferase
B0909_25335       2       11  -1.125545  translational GTPase TypA
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
B0909_25250  PF06580  29  0.003  Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 29.4 bits (66), Expect = 0.003
Identities = 18/97 (18%), Positives = 33/97 (34%), Gaps = 7/97 (7%)

Query: 29 HRITGGALYFGTLLVAWWLIAIASGPAYYDWVNWAMGTIIGRLI----LIGYTWALVHHM 84
I A+ L++ + W+ MG II R++ +IG W + +
Sbjct: 41 SMIFNIAISLMGLVLTHAYRSF---IKRQGWLKLNMGQIILRVLPACVVIGMVWFVANTS 97

Query: 85 LGGLRHFMWDLGYGFEKHFTTKLAKASWVVSICLTAL 121
+ L F+ F + VV+ + L
Sbjct: 98 IWRLLAFINTKPVAFTLPLALSIIFNVVVVTFMWSLL 134


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
B0909_25295  PF03544  33  0.002  Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 33.0 bits (75), Expect = 0.002
Identities = 27/139 (19%), Positives = 43/139 (30%), Gaps = 11/139 (7%)

Query: 43 DDKTPASPVKPGAPETKPEAVATGNPAPVWDQPKKTPIEPEPNVGKADSAAAGPKAESIG 102
P S + +P P PV EPEP A E
Sbjct: 45 APAQPISVTMVAPADLEPPQAVQPPPEPV------VEPEPEPEPIPEPPKEAPVVIE--- 95

Query: 103 DKQKEPVSSVPVPPKADAKPTADTAGA-SAAATAAAASKPAFGATASAGTSGNASAAKPS 161
K K P P K +P D S A+ + PA +++A + + +
Sbjct: 96 -KPKPKPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVA 154

Query: 162 ATTPSSGPAKPSTPPQAER 180
+ + +P P +A+
Sbjct: 155 SGPRALSRNQPQYPARAQA 173


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
B0909_25300  PF03544  34  0.001  Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 33.8 bits (77), Expect = 0.001
Identities = 30/105 (28%), Positives = 33/105 (31%), Gaps = 8/105 (7%)

Query: 431 EGPVEDLTIENAIAAAPARAEPVAKTIVVEAAPEPRTEPRPAPATPIEVAPTVSVPKEKK 490
E P I + A P A E EP EP P P P E AP V + K
Sbjct: 42 ELPAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKE-APVVIEKPKPK 100

Query: 491 PVT-------IEAPVETEAADEKAEAVPFFGGAPDDPGVKKPGAE 528
P +E P E A PF AP P A
Sbjct: 101 PKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAA 145


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
B0909_25315  TCRTETA  48  4e-08  Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 47.5 bits (113), Expect = 4e-08
Identities = 59/314 (18%), Positives = 128/314 (40%), Gaps = 15/314 (4%)

Query: 53 LGLIGLFQFLPSLLLILVTGTVADRHNRRRIMAICLLIAALCAVALLGLTLAHRFSPWPV 112
L L L QF + +L G ++DR RR ++ + L AA V + A W +
Sbjct: 49 LALYALMQFACAPVL----GALSDRFGRRPVLLVSLAGAA---VDYAIMATAPFL--WVL 99

Query: 113 FAILVVFGIERAFMGPAVQSLAPNLVPVEDLPNAIAWNSSSWQMASILGPVAGGLLYGLG 172
+ +V GI A G + ++ ++ + S+ + + GPV GGL+ G
Sbjct: 100 YIGRIVAGITGA-TGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFS 158

Query: 173 ASVAYSVAFVLFIVSAILAVTIRKPEQRGPAKAISLETM--LAGFKFISQEKIVLGAISL 230
+ A L ++ + + +G + + E + LA F++ +V +++
Sbjct: 159 PHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAV 218

Query: 231 DLFAVLLGGA-VALMPIFAKEVLTLGPWGLGLLRAAPGI-GAITVAVILAFKPIRHHAGL 288
L+G AL IF ++ +G+ AA GI ++ A+I R
Sbjct: 219 FFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERR 278

Query: 289 LMFVGVGLFGISTVVFGLSETAWLSIAALVVMGASDMVSVYVRETLIALWTPDEVRGRVN 348
+ +G+ G ++ + W++ +V++ + + + + +++ +E +G++
Sbjct: 279 ALMLGMIADGTGYILLAFATRGWMAFPIMVLLASGGIGMPAL-QAMLSRQVDEERQGQLQ 337

Query: 349 AVNMVFVGASNELG 362
++ +G
Sbjct: 338 GSLAALTSLTSIVG 351



Score = 35.6 bits (82), Expect = 3e-04
Identities = 40/175 (22%), Positives = 68/175 (38%), Gaps = 12/175 (6%)

Query: 6 AENRFGAFRHSSYRRFFSARFFSAFAIQIVSVSVG--WQMYEVTGNAFYLGLIGL----F 59
A N +FR + +A F +Q+V W ++ + IG+ F
Sbjct: 196 ALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAF 255

Query: 60 QFLPSLLLILVTGTVADRHNRRRIMAICLLIAALCAVALLGLTLAHRFSPWPVFAILVVF 119
L SL ++TG VA R RR + + + IA LL + +P+ +L
Sbjct: 256 GILHSLAQAMITGPVAARLGERRALMLGM-IADGTGYILL-AFATRGWMAFPIMVLLASG 313

Query: 120 GIERAFMGPAVQSLAPNLVPVEDLPNAIAWNSSSWQMASILGPVAGGLLYGLGAS 174
GI PA+Q++ V E ++ + SI+GP+ +Y +
Sbjct: 314 GIGM----PALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAASIT 364


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
B0909_25335  TCRTETOQM  179  9e-51  Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 179 bits (455), Expect = 9e-51
Identities = 102/449 (22%), Positives = 188/449 (41%), Gaps = 92/449 (20%)

Query: 1 MALRNIAIIAHVDHGKTTLVDELLKQSGSFRENQRVME--RVMDSNDIEKERGITILAKA 58
M + NI ++AHVD GKTTL + LL SG+ E V + D+ +E++RGITI
Sbjct: 1 MKIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGI 60

Query: 59 TSIEWKDTRINIVDTPGHADFGGEVERILSMVDGAIVLVDAAEGPMPQTKFVVGKALKVG 118
TS +W++T++NI+DTPGH DF EV R LS++DGAI+L+ A +G QT+ + K+G
Sbjct: 61 TSFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMG 120

Query: 119 LRPIVAINKIDRPDARHEEVVNEVFDLFAA----------------------------LD 150
+ I INKID+ V ++ + +A ++
Sbjct: 121 IPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIE 180

Query: 151 ATDEQLD--------------------------FPILYGSGRSGWMNVNPEGPTDEGLAP 184
D+ L+ FP+ +GS ++ G+
Sbjct: 181 GNDDLLEKYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNN-----------IGIDN 229

Query: 185 LLDLVVKHVPEPKVEEGPFRMIGTI--LEANNFLGRIITGRIASGSIKPNQAVKVLGQDG 242
L++++ G + G + +E + R+ R+ SG + +V++ ++
Sbjct: 230 LIEVITNKFYSS-THRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKEK 288

Query: 243 KLIENGRISKILAFRGIERTSIEEAHAGDIVAIAGLS---KGTVADTFCDPAVMEPMQAQ 299
+I+++ E I++A++G+IV + + DT P
Sbjct: 289 I-----KITEMYTSINGELCKIDKAYSGEIVILQNEFLKLNSVLGDTKLLPQRERIENPL 343

Query: 300 PIDPPTVTMSFIVNDSPLAGTEGDKVTSRVIRDRLFKEAEGNVALKIEESADKDSFFVSG 359
P+ TV P + + + D L + ++ + L+ + +S
Sbjct: 344 PLLQTTV--------EPSKPQQREML-----LDALLEISDSDPLLRYYVDSATHEIILSF 390

Query: 360 RGELQLAVLIENMRRE-GFELAVSRPRVV 387
G++Q+ V ++ + E+ + P V+
Sbjct: 391 LGKVQMEVTCALLQEKYHVEIEIKEPTVI 419



Score = 38.3 bits (89), Expect = 9e-05
Identities = 21/84 (25%), Positives = 33/84 (39%), Gaps = 1/84 (1%)

Query: 395 QLMEPVEEVVIDVDEEHSGVVVQKMSERKAEMVELRPSGGNRVRLVFFAPTRGLIGYQSE 454
+L+EP I +E+ + A +V+ + N V L P R + Y+S+
Sbjct: 534 ELLEPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQ-LKNNEVILSGEIPARCIQEYRSD 592

Query: 455 LLTDTRGTAVMNRLFHDYQPYKGE 478
L T G +V Y GE
Sbjct: 593 LTFFTNGRSVCLTELKGYHVTTGE 616


25  B0909_25965  B0909_26055  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
B0909_25965       2       13  -1.410511  50S ribosomal protein L27
B0909_25970       2       12  -2.037142  50S ribosomal protein L21
B0909_25975       1       15  -1.038260  metallopeptidase family protein
B0909_25980       0       12  -1.232194  DUF1737 domain-containing protein
B0909_25985       1       13  -0.395844  CoA ester lyase
B0909_25990       1       12  -0.324594  hypothetical protein
B0909_25995       1       13   0.239451  3-isopropylmalate dehydratase small subunit
B0909_26000       2       13   1.051479  3-isopropylmalate dehydrogenase
B0909_26005       2       19   1.087296  cytochrome-c peroxidase
B0909_26010       2       21   1.830070  cobyrinate a,c-diamide synthase
B0909_26015       1       19   1.990165  uroporphyrinogen-III C-methyltransferase
B0909_26020       1       18   2.317972  cobalt-precorrin-5B (C(1))-methyltransferase
B0909_26025       1       19   1.839614  precorrin-4 C(11)-methyltransferase
B0909_26030       1       19   1.387810  cobalamin biosynthesis protein
B0909_26035       2       21   1.176378  bifunctional cobalt-precorrin-7
B0909_26040       2       20   1.706386  cobalt-precorrin-6A reductase
B0909_26045       2       20   1.278600  precorrin-3B C(17)-methyltransferase
B0909_26050       2       20   1.263557  precorrin-2 C(20)-methyltransferase
B0909_26055       2       18   1.314362  precorrin-8X methylmutase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
B0909_25965  SECGEXPORT  26  0.011  Protein-export SecG membrane protein signature.
		>SECGEXPORT#Protein-export SecG membrane protein signature.

Length = 110

Score = 26.4 bits (58), Expect = 0.011
Identities = 13/40 (32%), Positives = 22/40 (55%), Gaps = 1/40 (2%)

Query: 36 IILRQRGTQWHPGANVGIGKDHTIFALTAGNVNFRTKANG 75
+I+ Q+G GA+ G G T+F ++G+ NF T+
Sbjct: 19 LIMLQQGKGADMGASFGAGASATLFG-SSGSGNFMTRMTA 57


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
B0909_25985  PHPHTRNFRASE  28  0.040  Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 28.2 bits (63), Expect = 0.040
Identities = 16/76 (21%), Positives = 27/76 (35%), Gaps = 16/76 (21%)

Query: 90 VLLPKVEQPADINDLADLLAEA---------DAPGRLKVWAMIETPFGVLNAASIADAAH 140
V+ P + ++ ++ E D ++V M+E P + A A
Sbjct: 389 VMFPMIATLEELRQAKAIMQEEKDKLLSEGVDVSDSIEVGIMVEIPSTAVAANLFAK--- 445

Query: 141 TPDARLAAFVIGLNDL 156
+ F IG NDL
Sbjct: 446 ----EVDFFSIGTNDL 457


26  B0909_26165  B0909_26250  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
B0909_26165       0       27  -3.536124  MFS transporter
B0909_26170      -1       32  -5.334768  zinc metalloprotease HtpX
B0909_26175      -1       35  -5.467208  DUF1674 domain-containing protein
B0909_26180       0       31  -5.762593  ParB/RepB/Spo0J family partition protein
B0909_26185      -1       31  -6.359276  ParA family protein
B0909_26190       0       28  -5.325838  16S rRNA (guanine(527)-N(7))-methyltransferase
B0909_26195       0       28  -5.195793  tRNA uridine-5-carboxymethylaminomethyl(34)
B0909_26200       0       20  -4.276217  tRNA uridine-5-carboxymethylaminomethyl(34)
B0909_26205      -1       18  -4.083607  transcription termination factor Rho
B0909_26210       2       22  -3.288276  protoporphyrinogen oxidase HemJ
B0909_26215       1       24  -2.865202  uroporphyrinogen decarboxylase
B0909_26220       0       23  -3.362406  hypothetical protein
B0909_26225       0       20  -2.803577  kinase/pyrophosphorylase
B0909_26230       1       21  -2.627383  Maf-like protein
B0909_26235       2       18  -2.758315  shikimate dehydrogenase
B0909_26240       3       21  -2.507517  dephospho-CoA kinase
B0909_26245       2       19  -2.048043  DNA polymerase III subunit epsilon
B0909_26250       2       19  -1.969491  protein-export chaperone SecB
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
B0909_26175  CHANLCOLICIN  25  0.027  Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 24.7 bits (53), Expect = 0.027
Identities = 12/28 (42%), Positives = 20/28 (71%)

Query: 11 EKPRRLSPAAERALKEAEERRRQKADLQ 38
EK R+ + AAE+A +EAE+RR++ +
Sbjct: 137 EKARKEAEAAEKAFQEAEQRRKEIEREK 164


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
B0909_26250  SECBCHAPRONE  144  7e-47  Bacterial protein-transport SecB chaperone protein ...
		>SECBCHAPRONE#Bacterial protein-transport SecB chaperone protein signature.

Length = 170

Score = 144 bits (364), Expect = 7e-47
Identities = 43/150 (28%), Positives = 75/150 (50%), Gaps = 3/150 (2%)

Query: 7 AQGAVSPSLNILAQYIKDLSFENPGAPRSLQARDNAPSININVNVNANPISGSDFDVVLT 66
Q P L I Y+KD+SFE P P Q D P ++ +++ A + ++V L
Sbjct: 12 TQATQQPVLQIQRIYVKDVSFEAPNLPHIFQQ-DWEPKLSFDLSTEAKQVGDDLYEVCLN 70

Query: 67 LNAEA--KDGDKVLFAAELVYGGVFRIAGFPQEHMLPVLFIECPRLLFPFARQIIADVTR 124
++ E + V F E+ GVF I+G + M L +CP +LFP+AR++++ +
Sbjct: 71 ISVETTMESSGDVAFICEVKQAGVFTISGLEEMQMAHCLTSQCPNMLFPYARELVSSLVN 130

Query: 125 NGGFPPLMIDPIDFAQMFAQRVAEEQAKAK 154
G FP L + P++F +F + ++ +
Sbjct: 131 RGTFPALNLSPVNFDALFMDYLQRQEQAEQ 160


27  B0909_13095  B0909_13065  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
B0909_13095       2       20   0.054511  class I SAM-dependent methyltransferase
B0909_13090       1       20  -0.132061  polyribonucleotide nucleotidyltransferase
B0909_13085       5       28   0.864994  30S ribosomal protein S15
B0909_13080       4       27   1.122216  tRNA pseudouridine(55) synthase TruB
B0909_13075       3       22   1.520950  30S ribosome-binding factor RbfA
B0909_13070       3       20   1.405677  translation initiation factor IF-2
B0909_13065       2       19   1.338699  RNA-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
B0909_13085  PF06580  28  0.043  Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 28.3 bits (63), Expect = 0.043
Identities = 14/52 (26%), Positives = 28/52 (53%), Gaps = 1/52 (1%)

Query: 89 ENRVAEALSRVKAGGLIVAAGSKEDGILTLRKTLTKLGIETESTPKYHGVAL 140
EN + ++++ GG I+ G+K++G +TL T + ++T + G L
Sbjct: 265 ENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTG-SLALKNTKESTGTGL 315


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
B0909_13060  TCRTETOQM  75  8e-16  Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 74.5 bits (183), Expect = 8e-16
Identities = 40/137 (29%), Positives = 61/137 (44%), Gaps = 18/137 (13%)

Query: 415 IMGHVDHGKTSLLDAIRQANVVAGEAG------------------GITQHIGAYQVEKNG 456
++ HVD GKT+L +++ + E G GIT G +
Sbjct: 8 VLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITSFQWEN 67

Query: 457 QKITFIDTPGHAAFTAMRARGAQATDIAVLVVAADDSVMPQTIESINHAKAAGVPIVVAI 516
K+ IDTPGH F A R D A+L+++A D V QT + + G+P + I
Sbjct: 68 TKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGIPTIFFI 127

Query: 517 NKIDKHEANPEKVRQQL 533
NKID++ + V Q +
Sbjct: 128 NKIDQNGIDLSTVYQDI 144


28  B0909_12405  B0909_12350  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
B0909_12405       2       24  -4.665045  type II and III secretion system protein family
B0909_12400       3       26  -5.761477  Flp pilus assembly protein CpaB
B0909_12395       3       34  -6.835647  peptidase
B0909_12385       3       31  -5.427479  Flp family type IVb pilin
B0909_12380       3       31  -5.648816  hypothetical protein
B0909_12375       1       19  -1.587864  *SelT/SelW/SelH family protein
B0909_12365       1       17  -0.087699  DoxX family protein
B0909_12360       2       19   0.651207  type 1 glutamine amidotransferase
B0909_12355       3       19   1.373568  DUF1508 domain-containing protein
B0909_12350       2       17   1.415834  glycosyl hydrolase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
B0909_12400  BCTERIALGSPD  123  8e-32  Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 123 bits (309), Expect = 8e-32
Identities = 58/279 (20%), Positives = 106/279 (37%), Gaps = 29/279 (10%)

Query: 212 RQVSQIVNMLTIEGEDQVTLKVTVAEVSRQVLKQLGFN---------------GSISSST 256
+ +++ L I QV ++ +AEV LG IS++
Sbjct: 331 NDLERVIAQLDIR-RPQVLVEAIIAEVQDADGLNLGIQWANKNAGMTQFTNSGLPISTAI 389

Query: 257 SNNGFEFANPSNLGNAISGASRIASGAIGSGSLNFATYLNAMEQAGVVRTLAEPSLTAIS 316
+ + + + S S A G N+A L A+ + LA PS+ +
Sbjct: 390 AGANQYNKDGTVSSSLASALSSFNGIAAGFYQGNWAMLLTALSSSTKNDILATPSIVTLD 449

Query: 317 GEQAKFYVGGDFRLPAEQEVTIDKDTGQQTITRTTDTVDYGITLNFRPVVLSPGRISLKI 376
+A F VG + + T D T+ R GI L +P + + L+I
Sbjct: 450 NMEATFNVGQEVPVLTGS-QTTSGDNIFNTVER----KTVGIKLKVKPQINEGDSVLLEI 504

Query: 377 ETNVSEPTYEGNVVTGNAGRNIPGSTYMSIRKRETSTTVELPSGGSIVIAGLVQDNIRQA 436
E VS + + + G + R + V + SG ++V+ GL+ ++
Sbjct: 505 EQEVSSVADAASSTSSDLG--------ATFNTRTVNNAVLVGSGETVVVGGLLDKSVSDT 556

Query: 437 MSGLPGISKVPIFGTLFRSKDFIRNETELVIIATPYLVR 475
+P + +P+ G LFRS ++ L++ P ++R
Sbjct: 557 ADKVPLLGDIPVIGALFRSTSKKVSKRNLMLFIRPTVIR 595


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
B0909_12390  PREPILNPTASE  44  9e-08  Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 43.6 bits (103), Expect = 9e-08
Identities = 45/164 (27%), Positives = 71/164 (43%), Gaps = 27/164 (16%)

Query: 4 AAIFLTLPLCVAFAALNDLFSMTIPNRIPLILLLSFVVVAPLTGMDWQTFAMSIAAATAV 63
L L + DL M +P+++ L LL ++ L G + + ++ A A
Sbjct: 134 TLAALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNLLGG--FVSLGDAVIGAMAG 191

Query: 64 FLVCF-------ALFAANAMGGGDAKLLTAATVWYGFNISLVEFLLAVTFLGGVLTIGIL 116
+LV + L MG GD KLL A W G+ + LL+ +G + IG++
Sbjct: 192 YLVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSS-LVGAFMGIGLI 250

Query: 117 LLRSRSQEIMAAGIPIPDSLLVAKKIPYGIGIAVAGL--LTYGD 158
LLR+ Q +K IP+G +A+AG L +GD
Sbjct: 251 LLRNHHQ---------------SKPIPFGPYLAIAGWIALLWGD 279


29  B0909_11065  B0909_10995  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
B0909_11065       2       15   0.366719  response regulator
B0909_11060       2       15   0.799904  chemotaxis protein CheA
B0909_11055      -1       10   0.827325  chemotaxis protein
B0909_11050       0       11   0.596541  chemotaxis response regulator protein-glutamate
B0909_11045       1       10  -0.400689  response regulator
B0909_11040       0       10  -0.205994  chemoreceptor glutamine deamidase CheD
B0909_11035      -1        9   0.192399  hypothetical protein
B0909_11030      -1        8  -0.158858  flagellar M-ring protein FliF
B0909_11025      -1        8  -0.401098  LuxR family transcriptional regulator
B0909_11020      -1        9   0.099426  helix-turn-helix transcriptional regulator
B0909_11015      -2        8   0.052344  methyl-accepting chemotaxis protein
B0909_11010      -2       10   0.015122  response regulator
B0909_11005      -2       11  -0.681918  large conductance mechanosensitive channel
B0909_11000      -1       12  -0.443406  pyridoxal phosphate-dependent aminotransferase
B0909_10995      -1       15  -0.333826  UDP-glucose 4-epimerase GalE
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
B0909_11065  HTHFIS  90  2e-24  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 89.9 bits (223), Expect = 2e-24
Identities = 30/115 (26%), Positives = 57/115 (49%), Gaps = 2/115 (1%)

Query: 4 KVLTVDDSRTIRNMLLVTLNNAGFETIQAEDGIEGLEVLEQSNPDVIVTDINMPRLDGFG 63
+L DD IR +L L+ AG++ + + + D++VTD+ MP + F
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 64 FIEGVRRNEKYRAIPILVLTTESDAEKKNRARQAGATGWIVKPFDPAKLIDAIER 118
+ ++ + +P+LV++ ++ +A + GA ++ KPFD +LI I R
Sbjct: 65 LLPRIK--KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
B0909_11060  PF06580  41  1e-05  Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 41.4 bits (97), Expect = 1e-05
Identities = 20/133 (15%), Positives = 38/133 (28%), Gaps = 51/133 (38%)

Query: 470 IRNAVDHGIETPEKREAAGKNPEGTIKLSAKHRSGRILIELQDDGAGINRERVRQKAIDN 529
+ N + HGI G I L +G + +E+++ G+ +
Sbjct: 264 VENGIKHGIA--------QLPQGGKILLKGTKDNGTVTLEVENTGSLALKN--------- 306

Query: 530 DLIAADANLTDEEIDNLIFAPGFSTADKISDISGRGVGMDVVKRSIQALGG---RISISS 586
G G+ V+ +Q L G +I +S
Sbjct: 307 ------------------------------TKESTGTGLQNVRERLQMLYGTEAQIKLSE 336

Query: 587 RPGHGSTFTMSLP 599
+ G + +P
Sbjct: 337 KQG-KVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
B0909_11050  HTHFIS  73  2e-16  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 73.3 bits (180), Expect = 2e-16
Identities = 38/153 (24%), Positives = 68/153 (44%), Gaps = 16/153 (10%)

Query: 1 MSALARVLVVDDSPTMRGLISAVL-KADPEVEVVGQAGNAMEARAAIKQLNPDVVTLDIE 59
M+ A +LV DD +R +++ L +A +V + NA I + D+V D+
Sbjct: 1 MTG-ATILVADDDAAIRTVLNQALSRAGYDVRITS---NAATLWRWIAAGDGDLVVTDVV 56

Query: 60 MPEMNGLEFLEKIMRLRP-MPVIMVSSLTHRGADASLAALEIGAFDCVGKPAPGDARPFG 118
MP+ N + L +I + RP +PV+++S+ ++ A E GA+D + KP
Sbjct: 57 MPDENAFDLLPRIKKARPDLPVLVMSA--QNTFMTAIKASEKGAYDYLPKPF-------- 106

Query: 119 DLADKVKAAARSQHAAYRAARPESAAAVQPTPV 151
DL + + R+ R + P+
Sbjct: 107 DLTELIGIIGRALAEPKRRPSKLEDDSQDGMPL 139


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
B0909_11045  HTHFIS  90     3e-24    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 89.9 bits (223), Expect = 3e-24
Identities = 26/118 (22%), Positives = 52/118 (44%), Gaps = 5/118 (4%)

Query: 6 KIKVLIVDDQVTSRLLLSDALTQLGFKQITAAGDGEQGLKIMEQQPHHLVISDFNMPKMD 65
+L+ DD R +L+ AL++ G+ + + + + LV++D MP +
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGY-DVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 66 GLGFLHAVRA-NPTTKKAAFIILTAQGDRALVQKAAQLGANNVLAKPFTIDKMRAAIE 122
L ++ P ++++AQ KA++ GA + L KPF + ++ I
Sbjct: 62 AFDLLPRIKKARPDLP---VLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_11030  FLGMRINGFLIF  385    e-130    Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 385 bits (991), Expect = e-130
Identities = 139/576 (24%), Positives = 242/576 (42%), Gaps = 61/576 (10%)

Query: 14 PQVLKNVAALGQTRLLMLGGVGVLSMALILAAALYVNRPAYETLYVGLEKSDLNKISIAL 73
P+ L+ + L + L G ++A+++A L+ P Y TL+ L D I L
Sbjct: 10 PKPLEWLNRLRANPRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQDGGAIVAQL 69

Query: 74 AESGLDFQVGTDGASLQVPVGLTSKARLLLAERGLPDSANAGYELFDNVGSLGLTSFMQE 133
+ + ++ +++VP + RL LA++GLP G+EL D G++ F ++
Sbjct: 70 TQMNIPYRFANGSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQ-EKFGISQFSEQ 128

Query: 134 VTRVRALEGEISRSIQQIDGVAAARVHIVMPDVGNFRRGEQKPTASVMIR--ASATAGRK 191
V RALEGE++R+I+ + V +ARVH+ MP F R ++ P+ASV +
Sbjct: 129 VNYQRALEGELARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRALDEG 188

Query: 192 ASASIRHLVASAVPGLEVDDVTLLDSTGQLLASGDDVTNAAMNRSLTLAQNVQQEITTNI 251
+++ HLV+SAV GL +VTL+D +G LL + + L A +V+ I I
Sbjct: 189 QISAVVHLVSSAVAGLPPGNVTLVDQSGHLLTQSNTSGRDLNDAQLKFANDVESRIQRRI 248

Query: 252 DKALAPFLGMDNFRSSVTAQLNTDSRQIQETVYDPESRVERSVRTVKEDQK--------- 302
+ L+P +G N + VTAQL+ +++ E Y P ++ ++
Sbjct: 249 EAILSPIVGNGNVHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNISEQVGAGYP 308

Query: 303 -------SHETQPDTAATV---------EQNVPQAAPQGGGSGPQSSDESAKKEEQTNYE 346
S++ P A + QN PQ + + + S ++ E +NYE
Sbjct: 309 GGVPGALSNQPAPPNEAPIATPPTNQQNAQNTPQTSTSTNSN--SAGPRSTQRNETSNYE 366

Query: 347 INSKTVATVKNGYTVEKISVAVVVNKGRIAKMVGEPVDQAKIDAYLAEMQKIVTSAAGIS 406
++ T N +E++SVAVVVN +A P+ + +++ + A G S
Sbjct: 367 VDRTIRHTKMNVGDIERLSVAVVVNYKTLADGKPLPLT----ADQMKQIEDLTREAMGFS 422

Query: 407 SDRGDVVTLTAMDFLETQLLDEAATGPGI-MEVLSRNSAGIINSLAFVAVAFLVIWLGVR 465
RGD + + F + + + P + L + VA+++ VR
Sbjct: 423 DKRGDTLNVVNSPF--SAVDNTGGELPFWQQQSFIDQLLAAGRWLLVLVVAWILWRKAVR 480

Query: 466 PLVRTVTGNGAAAGQLSQETAGLELPDFSPGMDAGAGGLMEGFGADFGFDSTDDLLAGGD 525
P + AA + A E A S D+ L
Sbjct: 481 PQLTRRVEEAKAAQE-------------------QAQVRQETEEAVEVRLSKDEQLQQRR 521

Query: 526 SEGTFNRRVR-EGPERRLSRMVEISEERAAKILRKW 560
+ N+R+ E +R+ M + A ++R+W
Sbjct: 522 A----NQRLGAEVMSQRIREMSDNDPRVVALVIRQW 553


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
B0909_11010  HTHFIS  51     3e-08    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 50.6 bits (121), Expect = 3e-08
Identities = 26/119 (21%), Positives = 46/119 (38%), Gaps = 5/119 (4%)

Query: 1045 RVLCIDNEPKILEGMTLLLTGWGCEVLPTGSVATLEEPFLSLAAAPDVIIADYHLDDGDG 1104
+L D++ I + L+ G +V T + ATL A D+++ D + D +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWR--WIAAGDGDLVVTDVVMPDENA 62

Query: 1105 ISAIRLIRTFHGKAIPALLVTADRSPEVRSDAEKYGISVQHKPVKPAALRAYINQISSA 1163
+ I+ +P L+++A + A + G KP L I I A
Sbjct: 63 FDLLPRIKK-ARPDLPVLVMSAQNTFMTAIKASEKGAYDYL--PKPFDLTELIGIIGRA 118


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
B0909_11005  MECHCHANNEL  126    1e-40    Bacterial mechano-sensitive ion channel signature.
		>MECHCHANNEL#Bacterial mechano-sensitive ion channel signature.

Length = 136

Score = 126 bits (318), Expect = 1e-40
Identities = 64/140 (45%), Positives = 85/140 (60%), Gaps = 12/140 (8%)

Query: 1 MLNEFKTFIARGNVMDLAVGVIIGAAFSKIVDSVVNDLIMPIVGAIFGGFDFSNYFLPLS 60
++ EF+ F RGNV+DLAVGVIIGAAF KIV S+V D+IMP +G + GG DF + + L
Sbjct: 3 IIKEFREFAMRGNVVDLAVGVIIGAAFGKIVSSLVADIIMPPLGLLIGGIDFKQFAVTL- 61

Query: 61 SSVTATSLAAARDQGAVFAYGSFLTVLINFLILAWIIFLMVKGVNKLRDSVDRKKVEEKP 120
A V YG F+ + +FLI+A+ IF+ +K +NKL +K EE
Sbjct: 62 ------RDAQGDIPAVVMHYGVFIQNVFDFLIVAFAIFMAIKLINKL-----NRKKEEPA 110

Query: 121 EAAPPPEDVKLLTEIRDLLK 140
A P ++ LLTEIRDLLK
Sbjct: 111 AAPAPTKEEVLLTEIRDLLK 130


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_10995  NUCEPIMERASE  139    1e-40    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 139 bits (351), Expect = 1e-40
Identities = 78/349 (22%), Positives = 140/349 (40%), Gaps = 51/349 (14%)

Query: 1 MAVLVTGGAGYIGSHMVWALLDAGEEVVVVDRLSTG-------SRWAVA--PAARFYLGD 51
M LVTG AG+IG H+ LL+AG +VV +D L+ +R + P +F+ D
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 52 AADRAMLDQIFEENQIETIFHFAGSVSVPESISQPLEYYENNTGTTRALVAAAVAHGIRN 111
ADR + +F E +F ++V S+ P Y ++N ++ + I++
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120

Query: 112 FIFSSTAAVYG---NQPF--EGPVPETAILSPENPYGLSKLASEIMLRDVVQAHDFNYVA 166
+++S+++VYG PF + V P + Y +K A+E+M +
Sbjct: 121 LLYASSSSVYGLNRKMPFSTDDSVDH-----PVSLYAATKKANELMAHTYSHLYGLPATG 175

Query: 167 LRYFNVAGADPQGRAGPSPTGVANLIKVACEAATGRRDRVEVYGTDYPTADGTGVRDYIH 226
LR+F V G P GR + + G+ ++VY G RD+ +
Sbjct: 176 LRFFTVYG--PWGRPDMALFKFTKAML------EGK--SIDVYN------YGKMKRDFTY 219

Query: 227 VSDLIDAHVLAMAHLRAGGGT---------------RTLNCGYGVGYSVLDVLHAVQRES 271
+ D+ +A + + R N G ++D + A++
Sbjct: 220 IDDIAEAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDAL 279

Query: 272 EHEFPIIHCPRRAGDIATMVADSARIQSELGWRPRFNDLSTIVRTALQW 320
E P + GD+ AD+ + +G+ P + V+ + W
Sbjct: 280 GIEAKKNMLPLQPGDVLETSADTKALYEVIGFTPE-TTVKDGVKNFVNW 327


30  B0909_10925  B0909_10860  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
B0909_10925   5      30       -3.349079*  flagellin
B0909_10920   5      28       -2.334737   flagellin
B0909_10915   3      20       -2.023771   flagellin
B0909_10910   4      19        0.027107   flagellar biosynthetic protein FliP
B0909_10900   3      16        0.802233   flagellar basal body protein FliL
B0909_10890   2      18        0.923010   flagellar L-ring protein FlgH
B0909_10885   3      18        0.855935   flagellar protein
B0909_10880   3      18        0.756595   flagellar basal body P-ring protein FlgI
B0909_10875   0      16       -0.967994   flagella basal body P-ring formation protein
B0909_10870  -1      17       -1.156057   flagellar basal-body rod protein FlgG
B0909_10865  -2      14       -0.234310   flagellar hook-basal body complex protein FliE
B0909_10860  -1      13       -0.229456   flagellar basal body rod protein FlgC
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
B0909_10925  FLAGELLIN  125    1e-33    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 125 bits (314), Expect = 1e-33
Identities = 61/386 (15%), Positives = 119/386 (30%), Gaps = 12/386 (3%)

Query: 4 ILTNISAMAALQTLRSIDAKMETTQSRVSSGLRVGTASDNAAYWSIATTMRSDNMALSAV 63
I TN ++ L + + + R+SSGLR+ +A D+AA +IA S+ L+
Sbjct: 4 INTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQA 63

Query: 64 QDALGLGAAKVDTAYSAM---ESAVEVVKQVKAKMVAATEEGVDRSKIQEEISQLQEQLR 120
G + T A+ + ++ V+++ + T D IQ+EI Q E++
Sbjct: 64 SRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEID 123

Query: 121 SISSSASFSGENWLQADLTSATGNAVTKNVVGSFIRTADGSVSVKKID-YQLDSTTVLFD 179
+S+ F+G L G I + VK + +
Sbjct: 124 RVSNQTQFNGVKVLSQ---DNQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKEA 180

Query: 180 EGGQLGIIDRVFNVTPASTTLKINTSGTISEHAVLTNSVDSLIKSGATFEGNYANVTTAV 239
G L + ++ AV+T++ + Y N
Sbjct: 181 TVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKV-----YVNAANGQ 235

Query: 240 AGGAAAGDYVKVNGVWVKAVAAASNPGQEIAATSNAGANQWVVDVTPIPAGTVVTAAASL 299
A + V+ A + + IA G D +
Sbjct: 236 LTTDDAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDG 295

Query: 300 DTVDIRTLSNEELDVMVRATDAALEAITSATADLGSISMRIAIQEDFVSKLTDSIDKGIG 359
+ T++ E++ + V A + +AT + F +
Sbjct: 296 NGKVSTTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKL 355

Query: 360 RLVDADMNEESTKLKALQTQQQLAIQ 385
++A+ + + + A
Sbjct: 356 SDLEANNAVKGESKITVNGAEYTANA 381



Score = 78.2 bits (192), Expect = 1e-17
Identities = 32/181 (17%), Positives = 58/181 (32%)

Query: 222 IKSGATFEGNYANVTTAVAGGAAAGDYVKVNGVWVKAVAAASNPGQEIAATSNAGANQWV 281
++S + N + AV S A + A V
Sbjct: 327 LQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEANNAVKGESKITVNGAEYTANAAGDKV 386

Query: 282 VDVTPIPAGTVVTAAASLDTVDIRTLSNEELDVMVRATDAALEAITSATADLGSISMRIA 341
+ S + + + + + D+AL + + + LG+I R
Sbjct: 387 TLAGKTMFIDKTASGVSTLINEDAAAAKKSTANPLASIDSALSKVDAVRSSLGAIQNRFD 446

Query: 342 IQEDFVSKLTDSIDKGIGRLVDADMNEESTKLKALQTQQQLAIQSLSIANTSSENILSLF 401
+ +++ R+ DAD E + + Q QQ L+ AN +N+LSL
Sbjct: 447 SAITNLGNTVTNLNSARSRIEDADYATEVSNMSKAQILQQAGTSVLAQANQVPQNVLSLL 506

Query: 402 R 402
R
Sbjct: 507 R 507


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
B0909_10920  FLAGELLIN  115    2e-30    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 115 bits (289), Expect = 2e-30
Identities = 61/386 (15%), Positives = 118/386 (30%), Gaps = 11/386 (2%)

Query: 4 ILTNIAAMSALQTLRSIGQDMEATQGRVSSGQRVGTASDNAAYWSIATTMRSDNMALSAV 63
I TN ++ L + + R+SSG R+ +A D+AA +IA S+ L+
Sbjct: 4 INTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQA 63

Query: 64 QDALGLGAAKVDTAYAGM---ESAIEVVKEIKAKLVAATEDGVDKNKVQEEITQLQEQLR 120
G + T + + ++ V+E+ + T D +Q+EI Q E++
Sbjct: 64 SRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEID 123

Query: 121 GISEAASFSGENWLQADLSVAGGTVTKDVVGSFVRDANGIVSVKKIDYTLDTDSVLFDTR 180
+S F+G L D + G + + VK + LD +V
Sbjct: 124 RVSNQTQFNGVKVLSQDNQM--KIQVGANDGETITIDLQKIDVKSLG--LDGFNVNGPKE 179

Query: 181 ATGTKTGILDKVYTVAEDGVTLSINTGGVTSEVTVKSFSIDSLIKSGAAFQGNYASVTTA 240
AT K T + + + V + + + +TT
Sbjct: 180 ATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTD 239

Query: 241 AAGGAGVGDYVKVEGTWVLAANATTAPTQEIAATTTTPAAASWIVATANAPTTTPAVTSL 300
A D K + A A + T + T
Sbjct: 240 DAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTID----TKTGNDG 295

Query: 301 DGINIIGLSGTQINQMMKAVDAALKDMTSAAADLGSISMRIGLQEDFVSKLTDSIDSGVG 360
+G ++G ++ + + A ++ +A + F +S
Sbjct: 296 NGKVSTTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKL 355

Query: 361 RLVDAEMNEESTKLKALQTQQQLAIQ 386
++A + + + A
Sbjct: 356 SDLEANNAVKGESKITVNGAEYTANA 381



Score = 86.6 bits (214), Expect = 1e-20
Identities = 51/348 (14%), Positives = 105/348 (30%), Gaps = 12/348 (3%)

Query: 65 DALGLGAAKVDTAYAGMESAIEVVKEIKAKLVAATEDGVDKNKVQEEITQLQEQLRGISE 124
D LG + + ++ K D + + + +
Sbjct: 163 DVKSLGLDGFNVNGPKEATVGDLKSSFKN---VTGYDTYAVGANKYRVDVNSGAVVTDTT 219

Query: 125 AASFSGENWLQADLSVAGGTVTKDVVGSFVRD----ANGIVSVKKIDYTLDTDSVLFDTR 180
A + + ++ A ++ + G K I +
Sbjct: 220 APTVPDKVYVNAANGQLTTDDAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFD 279

Query: 181 ATGTKTGILDKVYTVAEDGVTLSINTGGVT-SEVTVKSFSIDSLIKSGAAFQGNYASVTT 239
G I K V+ +IN VT + + + + + + + + Y SV
Sbjct: 280 YKGVTFTIDTKTGNDGNGKVSTTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVN 339

Query: 240 AAAGGAGVGDYVKVEGTWVLAANATTAPTQEIAATTTTPAAASWIVATANAPTTTPAVTS 299
+ + + A NA ++ A A+ T T T+
Sbjct: 340 GQFTFDDKTKNESAKLSDLEANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTA 399

Query: 300 LD----GINIIGLSGTQINQMMKAVDAALKDMTSAAADLGSISMRIGLQEDFVSKLTDSI 355
+ + ++D+AL + + + LG+I R + ++
Sbjct: 400 SGVSTLINEDAAAAKKSTANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNL 459

Query: 356 DSGVGRLVDAEMNEESTKLKALQTQQQLAIQSLSIANSSSESILSLFR 403
+S R+ DA+ E + + Q QQ L+ AN +++LSL R
Sbjct: 460 NSARSRIEDADYATEVSNMSKAQILQQAGTSVLAQANQVPQNVLSLLR 507


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
B0909_10910  FLAGELLIN  116    1e-30    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 116 bits (291), Expect = 1e-30
Identities = 56/379 (14%), Positives = 111/379 (29%), Gaps = 6/379 (1%)

Query: 4 ILTNINAMSALQTLRSISSNMEDTQSRISSGLKVGSASDNAAYWSIATTMRSDNEALGAV 63
I TN ++ L S++ R+SSGL++ SA D+AA +IA S+ + L
Sbjct: 4 INTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQA 63

Query: 64 QDALGLGAAKVDTAYAGM---ESVIDVVKQIKNKLVTAQESSADKTKIQGEITQLQDQLK 120
G + T + + + V+++ + S +D IQ EI Q +++
Sbjct: 64 SRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEID 123

Query: 121 GIVESASFSGENWLKADLSAAATTKSVVGSFVREGGTVSVKTIDYLMDASKVLVDTRATG 180
+ F+G L D + G + + +D V AT
Sbjct: 124 RVSNQTQFNGVKVLSQDNQMKIQVGANDGETITIDLQ-KIDVKSLGLDGFNVNGPKEATV 182

Query: 181 TKTGILDKVQDVGVDTVTLTINDGGTLSEHTVQAYSLDTLTTAGAEFQGNFAKTATDNYV 240
K ++ V + + TD+
Sbjct: 183 GDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDDAE 242

Query: 241 KVEGSWVKAV--ASAGTQEIASTTTAAGTITAGTWMVDTTNAGAGTVAASGSVLSMNISS 298
+ ++AGT E + A G ++
Sbjct: 243 NNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVSTT 302

Query: 299 LTGTQLSALVKAVDKSLTELTSAGAQLGSISSRISLQEDFASKLKGSIDKGVGRLVDADM 358
+ G +++ V + + +A Q + F K + ++A+
Sbjct: 303 INGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEANN 362

Query: 359 NEESTRLKALQTQQQLAIQ 377
+ + + A
Sbjct: 363 AVKGESKITVNGAEYTANA 381



Score = 79.3 bits (195), Expect = 3e-18
Identities = 50/334 (14%), Positives = 98/334 (29%), Gaps = 2/334 (0%)

Query: 63 VQDALGLGAAKVDTAYAGMESVIDVVKQIKNKLVTAQESSADKTKIQGEITQLQDQLKGI 122
V + +++ + V + +
Sbjct: 174 VNGPKEATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAAN 233

Query: 123 VESASFSGENWLKADLSAAATTKSVVGSFVREGGTVSVKTIDYLMDASKVLVDTRATGTK 182
+ + EN DL + + G + D V
Sbjct: 234 GQLTTDDAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGN 293

Query: 183 TGILDKVQDVGVDTVTLTINDGGTLSEHTVQAYSLDTLTTAGAEFQGNFAKTATDNYVKV 242
G + + VTLT+ D + + A + + G F
Sbjct: 294 DGNGKVSTTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESA 353

Query: 243 EGSWVKAVASAGTQEIASTTTAAGTITAGTWMV--DTTNAGAGTVAASGSVLSMNISSLT 300
+ S ++A + + + A T A V A+ S L ++
Sbjct: 354 KLSDLEANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAA 413

Query: 301 GTQLSALVKAVDKSLTELTSAGAQLGSISSRISLQEDFASKLKGSIDKGVGRLVDADMNE 360
+ + ++D +L+++ + + LG+I +R +++ R+ DAD
Sbjct: 414 KKSTANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYAT 473

Query: 361 ESTRLKALQTQQQLAIQSLSIANSDSQNILSLFR 394
E + + Q QQ L+ AN QN+LSL R
Sbjct: 474 EVSNMSKAQILQQAGTSVLAQANQVPQNVLSLLR 507


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_10900  FLGBIOSNFLIP  299    e-105    Escherichia coli: Flagellar biosynthetic protein Fl...
		>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature.

Length = 245

Score = 299 bits (766), Expect = e-105
Identities = 105/246 (42%), Positives = 158/246 (64%), Gaps = 5/246 (2%)

Query: 1 MIRFLVTIAVLLALPGLANAQQFPSDLFNTQIDGSVAAWI--IRTFGLLTVLSVAPGILI 58
M R L VLL L Q P + + + G +W ++T +T L+ P IL+
Sbjct: 1 MRRLLSVAPVLLWLITPLAFAQLPG-ITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILL 59

Query: 59 MVTSFPRFVIAFSILRSGMGLASTPSNMILLSMAMFMTFYVMSPTFDKAWTDGVQPLLQN 118
M+TSF R +I F +LR+ +G S P N +LL +A+F+TF++MSP DK + D QP +
Sbjct: 60 MMTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEE 119

Query: 119 QINEQQAIQRIAEPFRTFMNANTRDKDLKLFVDIARERGQGVMTDNVVDYRVLVPAFMLS 178
+I+ Q+A+++ A+P R FM TR+ DL LF +A + V R+L+PA++ S
Sbjct: 120 KISMQEALEKGAQPLREFMLRQTREADLGLFARLANT--GPLQGPEAVPMRILLPAYVTS 177

Query: 179 EIRRGFEIGFLIILPFLVIDLIVATITMAMGMMMLPPTSISLPFKILFFVLIDGWNLLVG 238
E++ F+IGF I +PFL+IDL++A++ MA+GMMM+PP +I+LPFK++ FVL+DGW LLVG
Sbjct: 178 ELKTAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVG 237

Query: 239 SLVRSF 244
SL +SF
Sbjct: 238 SLAQSF 243


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_10890  FLGLRINGFLGH  261    1e-90    Flagellar L-ring protein signature.
		>FLGLRINGFLGH#Flagellar L-ring protein signature.

Length = 232

Score = 261 bits (668), Expect = 1e-90
Identities = 58/243 (23%), Positives = 98/243 (40%), Gaps = 30/243 (12%)

Query: 7 PALLLPLALLAGC---------QNNQTLKEIGNAPAMSPIGSGLQFSQTPQMGMYPKQPK 57
L + L GC Q + + + P +P+ +G F Q+ Q Y QP
Sbjct: 10 AISSLLVLSLTGCAWIPSTPLVQGATSAQPV---PGPTPVANGSIF-QSAQPINYGYQP- 64

Query: 58 HMASGYSLWSDSQGALFKDLRALNIGDILTVNIQINDKADFDNETERNRTNSSGLNWKAK 117
LF+D R NIGD LT+ +Q N A + +R + +
Sbjct: 65 ---------------LFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTV 109

Query: 118 AQIL-GWTPDADSNIKYGSDTDTQAKGKTKRSEKLTLLVAAVVTGILENGNLIISGSQEV 176
+ L G +A ++++ KG S + + V +L NGNL + G +++
Sbjct: 110 PRYLQGLFGNARADVEASGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQI 169

Query: 177 RVNHEIRILNVGGIVRPQDVDAQNMISYERIAEARISYGGRGRLTEVQQPPVGQQVVDLF 236
+N + G+V P+ + N + ++A+ARI Y G G + E Q Q+
Sbjct: 170 AINQGTEFIRFSGVVNPRTISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNL 229

Query: 237 SPL 239
SP+
Sbjct: 230 SPM 232


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_10880  FLGPRINGFLGI  506    0.0      Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 506 bits (1304), Expect = 0.0
Identities = 363/373 (97%), Positives = 366/373 (98%)

Query: 1 MRLLRIIAAAILFSAQPFLSVSAAHADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDS 60
MR+LRIIAAA++FSA PFLS A ADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDS
Sbjct: 1 MRVLRIIAAALVFSALPFLSTPPAQADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDS 60

Query: 61 LRSSPFTEQSMRAMLQNLGITTQGGQSNAKNIAAVMVTANLPPFASPGSRVDVTVSSLGD 120
LRSSPFTEQSMRAMLQNLGITTQGGQSNAKNIAAVMVTANLPPFASPGSRVDVTVSSLGD
Sbjct: 61 LRSSPFTEQSMRAMLQNLGITTQGGQSNAKNIAAVMVTANLPPFASPGSRVDVTVSSLGD 120

Query: 121 ATSLRGGTLIMTSLSGADGQIYAVAQGALIVNGFSAQGDAATLTQGVTTSARVPNGAIIE 180
ATSLRGG LIMTSLSGADGQIYAVAQGALIVNGFSAQGDAATLTQGVTTSARVPNGAIIE
Sbjct: 121 ATSLRGGNLIMTSLSGADGQIYAVAQGALIVNGFSAQGDAATLTQGVTTSARVPNGAIIE 180

Query: 181 RELPSKFKDSVNLVLQLRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQEIAVQKPRT 240
RELPSKFKDSVNLVLQLRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQEIAVQKPR
Sbjct: 181 RELPSKFKDSVNLVLQLRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQEIAVQKPRV 240

Query: 241 ADLTRLMAEIENLTVETDTPAKVVINERTGTIVIGADVRISRVAVSYGTLTVQVTESPQV 300
ADLTRLMAEIENLTVETDTPAKVVINERTGTIVIGADVRISRVAVSYGTLTVQVTESPQV
Sbjct: 241 ADLTRLMAEIENLTVETDTPAKVVINERTGTIVIGADVRISRVAVSYGTLTVQVTESPQV 300

Query: 301 IQPAPFSRGQTAVQPQTDIMAMQEGSKVAIVEGPDLRTLVAGLNSIGLKADGIIAILQGI 360
IQPAPFSRGQTAVQPQTDIMAMQEGSKVAIVEGPDLRTLVAGLNSIGLKADGIIAILQGI
Sbjct: 301 IQPAPFSRGQTAVQPQTDIMAMQEGSKVAIVEGPDLRTLVAGLNSIGLKADGIIAILQGI 360

Query: 361 KSAGALQAELVLQ 373
KSAGALQAELVLQ
Sbjct: 361 KSAGALQAELVLQ 373


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
B0909_10870  FLGHOOKAP1  41     3e-06    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 41.1 bits (96), Expect = 3e-06
Identities = 19/80 (23%), Positives = 31/80 (38%), Gaps = 15/80 (18%)

Query: 4 LAIAATGMDAQQTNLEVIANNIANINTTGYKRARAEFTDLLYQTERMQGVPNRANQAIVP 63
+ A +G++A Q L +NNI++ N GY R T + +
Sbjct: 4 INNAMSGLNAAQAALNTASNNISSYNVAGYTRQT---TIMAQANSTLGA----------- 49

Query: 64 EGANIGLGVQTSAVRNIHTQ 83
G +G GV S V+ +
Sbjct: 50 -GGWVGNGVYVSGVQREYDA 68



Score = 39.2 bits (91), Expect = 1e-05
Identities = 10/40 (25%), Positives = 20/40 (50%)

Query: 214 IKQSYLEGSNVDAVKEITDLITAQRAYEMNSKVITTADEM 253
+ S V+ +E +L Q+ Y N++V+ TA+ +
Sbjct: 499 LSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAI 538


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
B0909_10865  FLGHOOKFLIE  33     6e-05    Flagellar hook-basal body complex protein FliE signa...
		>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE signature.

Length = 103

Score = 33.1 bits (75), Expect = 6e-05
Identities = 19/99 (19%), Positives = 42/99 (42%), Gaps = 6/99 (6%)

Query: 19 GISSLTESVFGSEQTTPAQQTGASFASVLGNMSVDAMNSLKKAEVAS------FEGIQGK 72
GI + + + + AQ++ A++ + + A+ F +
Sbjct: 5 GIEGVISQLQATAMSARAQESLPQPTISFAGQLHAALDRISDTQTAARTQAEKFTLGEPG 64

Query: 73 ANTREVVDAVLSAEQSLQTAIALRDKIVSAYLDITKMQI 111
+V+ + A S+Q I +R+K+V+AY ++ MQ+
Sbjct: 65 VALNDVMTDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV 103


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
B0909_10860  FLGHOOKAP1  27     0.023    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 27.2 bits (60), Expect = 0.023
Identities = 9/38 (23%), Positives = 20/38 (52%)

Query: 97 NVNILIEMADMREANRSYDANLQVIRQTRDLVASTIDL 134
VN+ E +++ + Y AN QV++ + + I++
Sbjct: 508 GVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545



Score = 26.5 bits (58), Expect = 0.038
Identities = 18/72 (25%), Positives = 23/72 (31%), Gaps = 19/72 (26%)

Query: 5 SAASKIAGSGLEVQSTRLRIVSENIANARSTGDTPGADPYRRKTVTFGSELDR------- 57
S+ A SGL L S NI++ G Y R+T
Sbjct: 1 SSLINNAMSGLNAAQAALNTASNNISSYNVAG-------YTRQTTIMAQANSTLGAGGWV 53

Query: 58 -----VSGVERV 64
VSGV+R
Sbjct: 54 GNGVYVSGVQRE 65


31  B0909_10820  B0909_10725  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
B0909_10820   1      11       -2.578092  flagellar motor switch protein FliM
B0909_10815   1      10       -2.891493  flagellar motor switch protein FliN
B0909_10810   1      11       -2.829269  flagellar motor switch protein FliG
B0909_10805   1      11       -2.121864  flagellar biosynthesis protein FlhB
B0909_10800  -1      14       -0.592300  hypothetical protein
B0909_10795  -2      15        0.077797  hypothetical protein
B0909_10790  -2      14        0.027137  flagellar protein FlaD
B0909_10785  -1      15        0.514848  hypothetical protein
B0909_10780  -3      13       -0.065760  MotB family protein
B0909_10775  -1      15       -0.945154  chemotaxis protein
B0909_10770   1      14       -2.012079  flagellar hook-length control protein FliK
B0909_10765   2      15       -3.159623  lytic transglycosylase domain-containing
B0909_10760   2      15       -3.289454  DNA-binding response regulator
B0909_10755   3      15       -3.383462  flagellar hook protein FlgE
B0909_10750   2      12       -3.455604  flagellar hook-associated protein FlgK
B0909_10745   2      13       -2.679866  flagellar hook-associated family protein
B0909_10740   1      14       -2.302700  flagellar biosynthesis regulator FlaF
B0909_10735   1      14       -2.071600  flagellar biosynthesis repressor FlbT
B0909_10730   0      10       -1.975356  flagellar hook assembly protein FlgD
B0909_10725  -1      11       -2.364212  flagellar biosynthetic protein FliQ
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_10820  FLGMOTORFLIM  57     3e-11    Flagellar motor switch protein FliM signature.
		>FLGMOTORFLIM#Flagellar motor switch protein FliM signature.

Length = 344

Score = 56.8 bits (137), Expect = 3e-11
Identities = 36/224 (16%), Positives = 81/224 (36%), Gaps = 18/224 (8%)

Query: 95 MMAVGNGFVIALMERMLGAAPDTIGEPDERSLSHIELDLAAMVLGRIAGVLRSGVNAPGG 154
++ V ++++R+ G +R L+ IE + V+ RI +R
Sbjct: 116 VLEVDPSITFSIIDRLFGGTGQAAK--VQRDLTDIENSVMEGVIVRILANVRESWTQVID 173

Query: 155 FEATID-----PPFNANGRSAFDEMIAGVYGVTIRMKIDIGRVSSEFALIVPQ---RPLL 206
+ P F + EM+ V + ++ +G +P P++
Sbjct: 174 LRPRLGQIETNPQFAQIVPPS--EMV-----VLVTLETKVGEEEGMMNFCIPYITIEPII 226

Query: 207 KTSIVAPKASAQALKKQEEWMEMISQQVKRSQVTLEARIKLETLTLRTISRLVAGDVIPF 266
S+ ++M ++ ++ + + A + L++R I L GD+I
Sbjct: 227 SKLSSQFWFSSVRRSSTTQYMGVLRDKLSTVDMDVVAEVGSLRLSVRDILGLRVGDIIRL 286

Query: 267 QDLKQDDIGVEVSANGSKLYNCEFGKSGERYMVRVKNNVSTDDE 310
D D +S K + C+ G G++ ++ + + +
Sbjct: 287 HDTHVGD-PFVLSIGNRKKFLCQPGVVGKKIAAQILERIESTSQ 329


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_10815  FLGMOTORFLIN  100    9e-30    Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 100 bits (250), Expect = 9e-30
Identities = 32/83 (38%), Positives = 60/83 (72%), Gaps = 3/83 (3%)

Query: 93 SGLSENMELIMDIPIDVQIVLGTSRMLVSGLMGLEEGATIALDRKIGEPVEIMVNGRRIA 152
SG ++++LIMDIP+ + + LG +RM + L+ L +G+ +ALD GEP++I++NG IA
Sbjct: 48 SGAMQDIDLIMDIPVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIA 107

Query: 153 RGEITVLEDDDTRFGVKLIEVMS 175
+GE+ V+ D ++GV++ ++++
Sbjct: 108 QGEVVVVAD---KYGVRITDIIT 127


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_10810  FLGMOTORFLIG  295    e-101    Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 295 bits (758), Expect = e-101
Identities = 73/332 (21%), Positives = 163/332 (49%), Gaps = 6/332 (1%)

Query: 14 KPLSQADKAAAVLLAMGKGVAGKLLKFFTQHELQMIISSAQTLRVIPPDELAQIVAEFED 73
L+ KAA +L+++G ++ K+ K+ +Q E++ + L I + ++ EF++
Sbjct: 13 SALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITSELKDNVLLEFKE 72

Query: 74 LFTEGTGLMDNAK-AIESILEEGLTPEEVDSLLGRRTAFQAYEASIWDRLQEAEPEFVGK 132
L + +LE+ L ++ ++ + A ++ ++ ++ A+P +
Sbjct: 73 LMMAQEFIQKGGIDYARELLEKSLGTQKAVDIINNLGS--ALQSRPFEFVRRADPANILN 130

Query: 133 FLLREHPQTIAYILSMLPSSFGAKVLLTIPEEQRADIMNRTVNMKEVSPTAAQIIEKRVV 192
F+ +EHPQTIA ILS L + +L ++P E + ++ R M SP + +E+ +
Sbjct: 131 FIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEVVREVERVLE 190

Query: 193 NLINEIEAE--RNAGGSTKVADLMNELEKPQVDTLLSSLETLSKEAANKVKPKIFLFDDL 250
+ + +E +AGG V +++N ++ ++ SLE E A ++K K+F+F+D+
Sbjct: 191 KKLASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEIKKKMFVFEDI 250

Query: 251 MFMPQRSRVMLLNDVSADVLTMALRGATMEIKECVLSSISPRQRRMIESDLAVPQASINT 310
+ + RS +L ++ L AL+ + ++E + ++S R M++ D+
Sbjct: 251 VLLDDRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKEDMEF-LGPTRR 309

Query: 311 REVAIARRAVAQEAIRLANSGQIQLKEAGADE 342
++V +++ + +L G+I + G ++
Sbjct: 310 KDVEESQQKIVSLIRKLEEQGEIVISRGGEED 341


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_10805  TYPE3IMSPROT  368    e-129    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 368 bits (947), Expect = e-129
Identities = 99/349 (28%), Positives = 183/349 (52%), Gaps = 9/349 (2%)

Query: 8 DSKTEAPTEKKLRDAAEKGNLPFSREVPIFASSLAFYCYLVF---FLPDGAGRIGETLKD 64
KTE PT KK+RDA +KG + S+EV A +A L+ + + ++ +
Sbjct: 3 GEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIPAE 62

Query: 65 LFGQPEQWNLGTRPDALSLLYFLGTSMAYLLMPAMIMFIVFGLASSFFQNLPSPVLERVR 124
P L D + L +F L P + + + +AS Q E ++
Sbjct: 63 QSYLPFSQALSYVVDNVLLEFFY------LCFPLLTVAALMAIASHVVQYGFLISGEAIK 116

Query: 125 PQWSRVSPAKGFTRIYSKQGFVEFGKSLFKILIVSTIMFFSLRGDFYSLIDLMFSDPQVI 184
P +++P +G RI+S + VEF KS+ K++++S +++ ++G+ +L+ L + I
Sbjct: 117 PDIKKINPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECI 176

Query: 185 FVRVVEIVKKMMVVILLSTALLAAVDLLWTRHHWFTQLKMTKHEVKEEYKQSQGDPVVKS 244
+ +I++++MV+ + +++ D + + + +LKM+K E+K EYK+ +G P +KS
Sbjct: 177 TPLLGQILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKS 236

Query: 245 RQRSIARDRARRRMINNVPRATLVIANPTHFAVALRYVREESDAPIVVAKGQDLIALKIR 304
++R ++ R M NV R+++V+ANPTH A+ + Y R E+ P+V K D +R
Sbjct: 237 KRRQFHQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVR 296

Query: 305 EIAEENNIPVFEDPPLARSMFAQVSIDSVIPPAFYKAVAELVHRVYAMK 353
+IAEE +P+ + PLAR+++ +D IP +A AE++ +
Sbjct: 297 KIAEEEGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQN 345


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
B0909_10790  FLAGELLIN  152    4e-43    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 152 bits (384), Expect = 4e-43
Identities = 75/504 (14%), Positives = 145/504 (28%), Gaps = 69/504 (13%)

Query: 4 IMTNAAAMAALQTLRMIDKSLETTQARVSSGYRVETAADNAAYWSISTTMRSDNTALSAV 63
I TN+ ++ L SL + R+SSG R+ +A D+AA +I+ S+ L+
Sbjct: 4 INTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQA 63

Query: 64 QDALGLG---AAKVDTAFEAIESAIETVESLKAKLVAAYGVGSNRSKIQEEIKQLQDQLK 120
G A + A I + ++ V L + S+ IQ+EI+Q +++
Sbjct: 64 SRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEID 123

Query: 121 SISESASFSGENWLQAKIG----------------------DGKTPAAEEPTIKKIVASF 158
+S F+G L K
Sbjct: 124 RVSNQTQFNGVKVLSQDNQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKEATVG 183

Query: 159 TRTAAGAVGVTTVDYSLDSSTVLFDLSGGKFGILDTEARFLRKNETIVTMRTTDTPALPA 218
++ Y++ ++ D++ G T K T
Sbjct: 184 DLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDDAEN 243

Query: 219 VPTVTDKDYVVTTLKDSEVAALSGFTVKAT------------------------GIYTNA 254
V +T +E A++G + T
Sbjct: 244 NTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVSTTI 303

Query: 255 ANTEGYLKISDDVWVKLTSTDPADAAAPTTDSTTPVVTTTSPNTNWYYDVSSAIDPDDRK 314
+ L ++D ++ ++ T + + +
Sbjct: 304 NGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEANNA 363

Query: 315 LGISVSTLDINKLTDLAQKMGTMTGEQY--------------------TEADVLDAMMSF 354
+ +T ++
Sbjct: 364 VKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAAKKSTANPLAS 423

Query: 355 VDGQLEAMTSAASSLGSLQSRIDMQENFVSSLMDVIDKGIGRLVDADMNEESTRLKALQT 414
+D L + + SSLG++Q+R D + + + ++ R+ DAD E + + Q
Sbjct: 424 IDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSNMSKAQI 483

Query: 415 QQQLGIQSLSIANANAENILQLFK 438
QQ G L+ AN +N+L L +
Sbjct: 484 LQQAGTSVLAQANQVPQNVLSLLR 507


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
B0909_10780  IGASERPTASE  36     3e-04    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 36.2 bits (83), Expect = 3e-04
Identities = 24/114 (21%), Positives = 35/114 (30%), Gaps = 6/114 (5%)

Query: 187 TQQVKITRADQQQEPVEPQQAAESKQEDAAENAQAIALKSSETPVDKAEDGKSMEVAAVV 246
T + Q P P E + D A SET AE+ K
Sbjct: 994 TTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEK 1053

Query: 247 PQQRP-----DASEQAALAQPKPDAQRQASDLQQKASDADREKADTLREEIEKQ 295
+Q E A A+ A Q +++ Q S+ +E T +E
Sbjct: 1054 NEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSET-KETQTTETKETATV 1106



Score = 33.5 bits (76), Expect = 0.002
Identities = 18/105 (17%), Positives = 33/105 (31%), Gaps = 5/105 (4%)

Query: 188 QQVKITRADQQQEPVEPQQAAESKQEDAAENAQAIALKSSETPVD----KAEDGKSMEVA 243
Q ++ + + Q ++ + Q K + T K E K+ EV
Sbjct: 1064 QNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVP 1123

Query: 244 AVVPQQRP-DASEQAALAQPKPDAQRQASDLQQKASDADREKADT 287
V Q P + Q +P + + ++ ADT
Sbjct: 1124 KVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADT 1168


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
B0909_10760  HTHFIS  34     3e-04    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 34.0 bits (78), Expect = 3e-04
Identities = 24/123 (19%), Positives = 46/123 (37%), Gaps = 6/123 (4%)

Query: 2 IVVVDERELVKDGYTSLFGREGIPSTGFDPAEFGEWVSTAAESDLAAVEAFLIGQGDRSY 61
I+V D+ ++ R G A A + DL + ++ + ++
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTD--VVMPDENAF 63

Query: 62 SLPKAIRDR-TTAPVIAVSDQPSLESTLALFDSGVDDVVRKPVHPREILA---RAAAIRR 117
L I+ PV+ +S Q + + + + G D + KP E++ RA A +
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 118 RLQ 120
R
Sbjct: 124 RRP 126


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
B0909_10755  FLGHOOKAP1  44     9e-07    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 43.8 bits (103), Expect = 9e-07
Identities = 14/48 (29%), Positives = 26/48 (54%)

Query: 378 IQKGALEGSNVDIASELTDMIESQRIYTANSKVFQTGSDLMDVLINLK 425
+ S V++ E ++ Q+ Y AN++V QT + + D LIN++
Sbjct: 499 LSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR 546



Score = 39.6 bits (92), Expect = 2e-05
Identities = 17/77 (22%), Positives = 31/77 (40%), Gaps = 1/77 (1%)

Query: 9 TGVSGMNAQANKLGTVGDNIANASTTGYKRASTSFS-SLVLPSSSGSYASGGVQSNVRYS 67
+SG+NA L T +NI++ + GY R +T + + + G +G S V+
Sbjct: 6 NAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVSGVQRE 65

Query: 68 ISEQGNLSYTTSSTDLA 84
+ T +
Sbjct: 66 YDAFITNQLRAAQTQSS 82


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
B0909_10750  FLGHOOKAP1  72     2e-15    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 71.5 bits (175), Expect = 2e-15
Identities = 57/313 (18%), Positives = 124/313 (39%), Gaps = 15/313 (4%)

Query: 4 TSAMNTAQSIFNNTGKQTDVTAKNIANVGNANYVKRTAILGT---TMAGATIVTNG---- 56
+S +N A S N + + NI++ A Y ++T I+ T+ V NG
Sbjct: 1 SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVS 60

Query: 57 ---RAQNESLLRQTISSASLATGQNTVLTGLEEVRSIFGSNNYESAPSTYMEELLKSLSS 113
R + + Q ++ + ++G + ++ ++ ++ S+ +T M++ SL +
Sbjct: 61 GVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTS--TSSLATQMQDFFTSLQT 118

Query: 114 YAAKPSNAALAATAVTSASDVASSLNKASAELQAMRLRADKEMSLQVDKLNGLLAKFEEA 173
+ + A + + + + L+ + + + VD++N +
Sbjct: 119 LVSNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASL 178

Query: 174 NNEVKAQTAIGG--DPSDALDQRETLLKDISSIIGINVVNRPNNDVALYTTEGATLFEIV 231
N+++ T +G P++ LDQR+ L+ +++ I+G+ V + + G +L +
Sbjct: 179 NDQISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGS 238

Query: 232 PRKVTFKAQPGYDATTTGNAIYVDGVALKAGSGSNTTAEGSLQGLMQIRDDLAPTMQSQL 291
+ A P + YVDG A GSL G++ R ++ L
Sbjct: 239 TAR-QLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTL 297

Query: 292 DEIARGLISMFAE 304
++A F
Sbjct: 298 GQLALAFAEAFNT 310



Score = 34.9 bits (80), Expect = 7e-04
Identities = 18/64 (28%), Positives = 31/64 (48%), Gaps = 5/64 (7%)

Query: 432 RSGATSANEAKSALYERSATSYSNNTAVSLDEELSLLMDIEQSYKAATKLVTTVDEMLKS 491
S AT N + L + S S V+LDEE L +Q Y A +++ T + + +
Sbjct: 487 TSSATQGNV-VTQLSNQQQ-SISG---VNLDEEYGNLQRFQQYYLANAQVLQTANAIFDA 541

Query: 492 LMDM 495
L+++
Sbjct: 542 LINI 545


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_10725  TYPE3IMQPROT  52     1e-12    Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature.

Length = 86

Score = 51.7 bits (124), Expect = 1e-12
Identities = 20/77 (25%), Positives = 40/77 (51%)

Query: 5 DALDIMQAAVWTVLVAAGPAVLAAMIVGVAIAFIQALTQVQEMTLTFVPKIVTIMIVLGV 64
D + A++ VL+ +G + A I+G+ + Q +TQ+QE TL F K++ + + L +
Sbjct: 3 DLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLFL 62

Query: 65 AAPFVGAQIALFSNLVF 81
+ + G + + V
Sbjct: 63 LSGWYGEVLLSYGRQVI 79


32  B0909_08530  B0909_08480  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias   Product
B0909_08530  -2      11       2.180506  c-type cytochrome biogenesis protein CcmI
B0909_08525  -2      13       1.586818  cytochrome c maturation protein CcmE
B0909_08520  -3      11       1.633559  heme lyase CcmF/NrfE family subunit
B0909_08515  -3      13       1.536584  cytochrome c-type biogenesis protein CcmH
B0909_08510  -2      13       2.046951  Do family serine endopeptidase
B0909_08505  0       13       2.008091  DNA-binding response regulator
B0909_08500  0       13       2.238195  HAMP domain-containing protein
B0909_08495  -1      11       1.795129  thiol-disulfide oxidoreductase DCC family
B0909_08490  -1      12       1.616424  MipA/OmpV family protein
B0909_08485  -1      11       1.450947  bifunctional [glutamine synthetase]
B0909_08480  -2      10       0.851622  PAS domain-containing sensor histidine kinase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_08530  SYCDCHAPRONE  31     0.006    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 30.7 bits (69), Expect = 0.006
Identities = 10/41 (24%), Positives = 18/41 (43%)

Query: 208 EAARDAFAEAEKLKPGNPRARFYLALSLEQAGKADEARAAF 248
+ A +++ + PR F+ A L Q G+ EA +
Sbjct: 87 DLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGL 127


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
B0909_08510  V8PROTEASE  65     1e-13    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 65.0 bits (158), Expect = 1e-13
Identities = 32/157 (20%), Positives = 55/157 (35%), Gaps = 25/157 (15%)

Query: 137 GSGFFISEDGYIVTNNHVVD----DGSAYTIVM--------NDGTELDAKLVGRDPRTDL 184
SG + ++TN HVVD D A +G ++ DL
Sbjct: 104 ASGVVVG-KDTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGEGDL 162

Query: 185 ALLKV-------DVNRKFTYVQFADDSKIRVGDWVVAVGNPFGLGGTVTSGIISARGRDI 237
A++K + +++++ +V + G P ++G+
Sbjct: 163 AIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQNITVTGYPGDKPVATMW---ESKGKIT 219

Query: 238 GSGPYDDYLQIDAAVNRGNSGGPAFNLNGEVVGINTA 274
+Q D + GNSG P FN EV+GI+
Sbjct: 220 YLKGEA--MQYDLSTTGGNSGSPVFNEKNEVIGIHWG 254


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
B0909_08505  HTHFIS  81     1e-19    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 81.0 bits (200), Expect = 1e-19
Identities = 31/124 (25%), Positives = 55/124 (44%), Gaps = 1/124 (0%)

Query: 29 KILVIEDDLEAAAYMTKAFREAGIVADHASDGESGLFMGCENAYDVMVIDRMLPRRDGLS 88
ILV +DD + +A AG S+ + D++V D ++P +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 89 VISELRRRGIETPVLILSALGQVDDRVTGLRAGGDDYLPKPYAFSELLARIE-VLGRRKG 147
++ +++ + PVL++SA + G DYLPKP+ +EL+ I L K
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 148 KPEQ 151
+P +
Sbjct: 125 RPSK 128


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
B0909_08500  PF06580  34     0.001    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 33.7 bits (77), Expect = 0.001
Identities = 15/72 (20%), Positives = 26/72 (36%), Gaps = 17/72 (23%)

Query: 362 LIDNAMKYACE-SEGDKKLAVRLVKEPEAIVLSVADNGPGIPADKRGEVLKRFVRLDESR 420
L++N +K+ K+ ++ K+ + L V + G L
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGS----------------LALKN 306

Query: 421 SKPGTGLGLSLV 432
+K TG GL V
Sbjct: 307 TKESTGTGLQNV 318


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
B0909_08480  PF06580  34     0.003    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 33.7 bits (77), Expect = 0.003
Identities = 20/105 (19%), Positives = 39/105 (37%), Gaps = 26/105 (24%)

Query: 671 LLSNAVKF----TGDGGRIALRTHVHEAAMILTIADTGIGIPTQALEKIGQPFEQVQSQY 726
L+ N +K GG+I L+ + L + +TG
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTG------------------SLAL 304

Query: 727 AKSQGGSGLGLA-ISRSLVKLHGG--TMKIRSCEGRGTVVTIIIP 768
++ +G GL + L L+G +K+ +G+ + ++IP
Sbjct: 305 KNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAM-VLIP 348
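
Note on reading the hit summaries: each alignment block above carries a "Score = ... bits (...), Expect = ..." line followed by an "Identities = a/b (p%)" line. The following is a minimal Python sketch, not part of PredictBias itself, of how those two lines could be parsed and compared against the RPSBlast E-value threshold of 0.05 listed under the input parameters. The function names `parse_hsp` and `passes_threshold` are hypothetical, and treating the threshold as inclusive is an assumption. The percentages in this report are consistent with integer truncation (for example 112/374 is printed as 29%), so the sketch truncates as well.

```python
import re

# "Score = 33.7 bits (77), Expect = 0.003"
SCORE_RE = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)"
)
# "Identities = 20/105 (19%)"
IDENT_RE = re.compile(r"Identities\s*=\s*(\d+)/(\d+)\s*\((\d+)%\)")


def parse_hsp(score_line, ident_line):
    """Parse one BLAST-style summary line pair into a dict (hypothetical helper)."""
    s = SCORE_RE.search(score_line)
    i = IDENT_RE.search(ident_line)
    if s is None or i is None:
        raise ValueError("not a BLAST summary line pair")
    evalue_text = s.group(3)
    # BLAST can print very small E-values as "e-100"; make that parseable.
    if evalue_text.startswith("e"):
        evalue_text = "1" + evalue_text
    matches, length = int(i.group(1)), int(i.group(2))
    return {
        "bits": float(s.group(1)),
        "raw_score": int(s.group(2)),
        "evalue": float(evalue_text),
        "identities": matches,
        "align_len": length,
        # Truncated percentage, matching how the report prints it.
        "pct_identity": 100 * matches // length,
    }


def passes_threshold(hsp, max_evalue=0.05):
    """True if the hit is at or below the E-value cutoff (inclusive: an assumption)."""
    return hsp["evalue"] <= max_evalue


hsp = parse_hsp(
    "Score = 33.7 bits (77), Expect = 0.003",
    "Identities = 20/105 (19%), Positives = 39/105 (37%), Gaps = 26/105 (24%)",
)
print(hsp["evalue"], hsp["pct_identity"], passes_threshold(hsp))
```

The example input is the B0909_08480 / PF06580 hit shown directly above; any other alignment block in this report can be substituted.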


33  B0909_08205  B0909_08180  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias   Product
B0909_08205  -1      12       1.194269  GTPase Era
B0909_08200  -1      12       0.902423  Crp/Fnr family transcriptional regulator
B0909_08195  -1      14       1.240921  response regulator
B0909_08190  0       15       1.912643  DNA repair protein RecO
B0909_08180  0       11       2.049569  MFS transporter
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
B0909_08200  TCRTETOQM  31     0.005    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 31.4 bits (71), Expect = 0.005
Identities = 36/128 (28%), Positives = 56/128 (43%), Gaps = 25/128 (19%)

Query: 30 NAGKSTLVNRLV---GAKVSIVSHKVQTTRAV------MRGIAIH--------DNAQIVF 72
+AGK+TL L+ GA + S TTR RGI I +N ++
Sbjct: 13 DAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITSFQWENTKVNI 72

Query: 73 MDTPGIFKPRRRLDRAMVTSAWGGAKDADLILLLIDSERGLKGDAETILEGLKDVPQKKI 132
+DTPG + R++ S GA +LLI ++ G++ + L+ + I
Sbjct: 73 IDTPGHMDFLAEVYRSL--SVLDGA------ILLISAKDGVQAQTRILFHALRKMGIPTI 124

Query: 133 LCLNKIDQ 140
+NKIDQ
Sbjct: 125 FFINKIDQ 132


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
B0909_08190  HTHFIS  51     1e-10    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 51.0 bits (122), Expect = 1e-10
Identities = 20/114 (17%), Positives = 40/114 (35%), Gaps = 5/114 (4%)

Query: 1 MVPQRVIIVEDEYLVALDVEAVLQSMGVETITIATTLAQAREAAGQDVADCVLLDVSLSD 60
M +++ +D+ + + L G + + A D V+ DV + D
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPD 59

Query: 61 GKSYDFARELRAA--GIPFGFVSGYGDTTGFPEDLMHAPL--LGKPFGENEIVG 110
++D ++ A +P +S + L KPF E++G
Sbjct: 60 ENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIG 113


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_08185  BONTOXILYSIN  35     3e-04    Bontoxilysin signature.
		>BONTOXILYSIN#Bontoxilysin signature.

Length = 1196

Score = 34.9 bits (80), Expect = 3e-04
Identities = 10/32 (31%), Positives = 20/32 (62%)

Query: 218 VYDPRGLSENAARDGFVQAALKALQRRASPPA 249
+YD LS+++ R+ F+QA + L+R + +
Sbjct: 63 IYDSNFLSQDSERENFLQAIIILLKRINNTIS 94


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
B0909_08180  TCRTETA  51     3e-09    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 51.4 bits (123), Expect = 3e-09
Identities = 62/341 (18%), Positives = 122/341 (35%), Gaps = 11/341 (3%)

Query: 47 STFGLAMALQNLFWGLGQPFFGAIADKYGTGRVLVLSGFLYAAGLICMSFGTSPFWLHFG 106
+ +G+ +AL L P GA++D++G VL++S A M+ W+ +
Sbjct: 43 AHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPF-LWVLYI 101

Query: 107 GGVLVGLGIAAGSFSVILSAFARHVTPQQRSLAFGIGTAAGSAGMFLFAPISQGLISAYG 166
G ++ G I + +V + A +R+ FG +A GM P+ GL+ +
Sbjct: 102 GRIVAG--ITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVA-GPVLGGLMGGFS 158

Query: 167 WSDSLVWLAVMMMLVPLLA-FPMRGNSSSGSQSQTQFQQTAGEALREALGHKSYLLLTTG 225
A + L L F + + + + + R A G L
Sbjct: 159 PHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAV 218

Query: 226 FFVCGFQVAFITAHFPAYLGD-IGIEPRYAVIAMALIGFFNIIG-SLAAGVIAQRYSKPY 283
FF+ A + + D + I++A G + + ++ G +A R +
Sbjct: 219 FFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERR 278

Query: 284 MLAYIYIARSIAVTAFLLLPQSPLSVILFAAVMGILWLSTVPPTNALVAIMFGTRHLGML 343
L IA + ++ + V+ +P A+++ G L
Sbjct: 279 ALMLGMIADGTGYILLAFATRGWMAFPIM--VLLASGGIGMPALQAMLSRQVDEERQGQL 336

Query: 344 GGVVFLSHQIGSFLGVWLGGFLYD-RLGSYD-LVWWLGVGM 382
G + + S +G L +Y + +++ W G +
Sbjct: 337 QGSLAALTSLTSIVGPLLFTAIYAASITTWNGWAWIAGAAL 377


34  B0909_07965  B0909_07925  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias   Product
B0909_07965  -2      11       1.499915  alanine racemase
B0909_07960  -2      13       1.226252  AraC family transcriptional regulator
B0909_07955  -1      11       0.531769  branched-chain amino acid ABC transporter
B0909_07950  -1      12       0.578965  AzlD family protein
B0909_07945  -1      11       0.322930  ABC transporter ATP-binding protein
B0909_07940  0       16       1.036393  replicative DNA helicase
B0909_07935  -1      11       0.824631  MarR family transcriptional regulator
B0909_07930  0       11       0.891308  MFS transporter
B0909_07925  0       19       1.875940  HlyD family secretion protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
B0909_07965  ALARACEMASE  232    2e-75    Alanine racemase signature.
		>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 232 bits (593), Expect = 2e-75
Identities = 112/374 (29%), Positives = 173/374 (46%), Gaps = 28/374 (7%)

Query: 21 PLRLTVDLGALADNWRDMKKRSGKARTAAVVKADAYGLGIEDCGATLYHAGARDFFVATV 80
P++ ++DL AL N +++ + AR +VVKA+AYG GIE + + F + +
Sbjct: 4 PIQASLDLQALKQNLSIVRQAATHARVWSVVKANAYGHGIERIWSAIGATDG--FALLNL 61

Query: 81 AEGATLRSYAPEARIFVLSGIWQGQE-RQVFDNDLVPVLASEEQLSFWMATVAERGDHPC 139
E TLR + I +L G + Q+ + L + S QL A R P
Sbjct: 62 EEAITLRERGWKGPILMLEGFFHAQDLEIYDQHRLTTCVHSNWQLK---ALQNARLKAPL 118

Query: 140 ALH--VDTGFNRLGLPLDDALFLADDVTRPASFDPVLVLSHLACADTPSSPMNRAQLESF 197
++ V++G NRLG D L + + A+ + ++SH A A+ P +
Sbjct: 119 DIYLKVNSGMNRLGFQPDRVLTVWQQLRAMANVGEMTLMSHFAEAEHPDGISG--AMARI 176

Query: 198 RKVSAAFEGIESSLSASAGIFLGPDYHFDLTRPGIALYGGEAVNDVANP----MRPVAKA 253
+ + E SLS SA P+ HFD RPGI LYG + +RPV
Sbjct: 177 EQAAEGLEC-RRSLSNSAATLWHPEAHFDWVRPGIILYGASPSGQWRDIANTGLRPVMTL 235

Query: 254 EARIIQIREAGEGQTVSYGGSFLLKRASRLAIASVGYADGYQRSLSGSGIPLREMGHGGA 313
+ II ++ G+ V YGG + + R+ I + GYADGY R +G P+
Sbjct: 236 SSEIIGVQTLKAGERVGYGGRYTARDEQRIGIVAAGYADGYPRHAP-TGTPV-------- 286

Query: 314 YGVVNDHRVPVAGRVTMDLTIFDVTDVPANAIRNGDYIELFGPNVPVDETARAAGTIGYE 373
+V+ R G V+MD+ D+T P I G +EL+G + +D+ A AAGT+GYE
Sbjct: 287 --LVDGVRTMTVGTVSMDMLAVDLTPCPQAGI--GTPVELWGKEIKIDDVAAAAGTVGYE 342

Query: 374 MLTGLGLRYERQYL 387
++ L LR +
Sbjct: 343 LMCALALRVPVVTV 356


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
B0909_07960  PF05860  31     0.002    haemagglutination activity domain.
		>PF05860#haemagglutination activity domain.

Length = 117

Score = 30.9 bits (70), Expect = 0.002
Identities = 10/27 (37%), Positives = 14/27 (51%)

Query: 48 GSQISTIQGTTEQTGPGHLYLINPDEI 74
G +S I G +L+LINP+ I
Sbjct: 66 GGSVSNIDGLIRANATANLFLINPNGI 92


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_07950  ACRIFLAVINRP  26     0.031    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 26.0 bits (57), Expect = 0.031
Identities = 19/50 (38%), Positives = 24/50 (48%), Gaps = 2/50 (4%)

Query: 48 AVLMAVVTPTALATGPAETI--ACAITAVAALRLSLLPAATVGVAAVALL 95
VL AV P A G I +IT V+A+ LS+L A + A A L
Sbjct: 447 MVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATL 496


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
B0909_07930  TCRTETB  40     2e-05    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 40.2 bits (94), Expect = 2e-05
Identities = 35/198 (17%), Positives = 73/198 (36%), Gaps = 14/198 (7%)

Query: 60 EFSATVAEVAWLSAAYMAPYATFSIALFKIRAQYGLRPFAELSIIAFVVASCLNLFVTDL 119
+F+ A W++ A+M ++ + K+ Q G++ II S + FV
Sbjct: 43 DFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIG-FVGHS 101

Query: 120 HSAIVI--RFVSGMAAAPISSLGFLYILEAFPPARKFSLGFSIALTGTL--LSAPIARIV 175
+++I RF+ G AA +L + + P + G + L G++ + + +
Sbjct: 102 FFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENR---GKAFGLIGSIVAMGEGVGPAI 158

Query: 176 SPSLLEIDGWNALYSMEVGFALISFAMIYVLKVTPPPRAKVIERMDILSYLLFAGGLGCL 235
+ W+ L +I+ + L ++ DI +L + G+
Sbjct: 159 GGMIAHYIHWSYLLL----IPMITIITVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVFF 214

Query: 236 AVMLTLGRFYWWFETWWL 253
ML + F +
Sbjct: 215 --MLFTTSYSISFLIVSV 230


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
B0909_07925  RTXTOXIND  103    1e-26    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 103 bits (259), Expect = 1e-26
Identities = 59/408 (14%), Positives = 122/408 (29%), Gaps = 84/408 (20%)

Query: 7 TPVSILVLLGGLAGVALVLYAWRLPPFLSTVEMTDNAFVRGYVTTMSPQVSGYVVDVPVK 66
P + + G +A +L G + P + V ++ VK
Sbjct: 56 RPRLVAYFIMGFLVIAFILSVLG--QVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVK 113

Query: 67 DYQEVKQGTLLAKIDDRIYRQKQAQAAATLDTQ--------------------------- 99
+ + V++G +L K+ + ++L
Sbjct: 114 EGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDE 173

Query: 100 ------------------KAALDNSRQQENAANANILSSQAAVDSAEASL--KQAQLASD 139
K + Q+ N+ +A + A + + +
Sbjct: 174 PYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVE 233

Query: 140 RQ-----DNLIKSG-VGTSSLQE-------------EAHAALEKARASLSQAKAALEVSR 180
+ +L+ + ++ E + LE+ + + AK ++
Sbjct: 234 KSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVT 293

Query: 181 QDLQT-IIVNRSSLQAAVANAEAAVELTKIDLANTEIHAPVDGRLGEVGVRT-GQYVTAG 238
Q + I+ + + + + I APV ++ ++ V T G VT
Sbjct: 294 QLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTA 353

Query: 239 TQLMAVVPHD--VWVIANFKETQLAGMQVGQPVTISVDALHRRK---LTGHVERFSPATG 293
LM +VP D + V A + + + VGQ I V+A + L G V+ +
Sbjct: 354 ETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDA- 412

Query: 294 SEFAVIKPDNATGNFVKIAQRLGVRIAIDADQPLAADLSPGMSVVVHV 341
D G + + ++ + LS GM+V +
Sbjct: 413 ------IEDQRLGLVFNVIISIEENCLSTGNKNIP--LSSGMAVTAEI 452


35  B0909_06800  B0909_06770  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
B0909_06800  0       14       -1.500101  GNAT family N-acetyltransferase
B0909_06795  0       13       -0.982722  hypothetical protein
B0909_06790  -2      13       -0.309547  GNAT family N-acetyltransferase
B0909_06785  -1      15       -0.711141  DUF1294 domain-containing protein
B0909_06780  -2      15       -0.482847  Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase
B0909_06775  -3      11       -0.388394  GNAT family N-acetyltransferase
B0909_06770  -2      11       0.499902   GNAT family N-acetyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_06800  SACTRNSFRASE  40     1e-06    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 39.5 bits (92), Expect = 1e-06
Identities = 14/57 (24%), Positives = 27/57 (47%)

Query: 73 ISDLWVDPNWQGKGIGKALILHFLDRMRAQGLPFATIDTHAGNQTAIGLYERCGFQI 129
I D+ V +++ KG+G AL+ ++ + ++T N +A Y + F I
Sbjct: 92 IEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFII 148


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_06790  SACTRNSFRASE  38     6e-06    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 37.6 bits (87), Expect = 6e-06
Identities = 17/55 (30%), Positives = 26/55 (47%)

Query: 77 ISDFWVDPDFQRRRVGTLLLADMERLIRDKGFDAIRLETHAQNEPAVAFFRHHDY 131
I D V D++++ VGT LL ++ F + LET N A F+ H +
Sbjct: 92 IEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHF 146


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_06775  SACTRNSFRASE  31     0.002    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 30.7 bits (69), Expect = 0.002
Identities = 15/58 (25%), Positives = 25/58 (43%), Gaps = 4/58 (6%)

Query: 85 ALLHQLYVAPEFQRQGVGRDLFAELETCFP---DAEIMRLEVEPKNTVAIAFYEGVGF 139
AL+ + VA +++++GVG L + + LE + N A FY F
Sbjct: 90 ALIEDIAVAKDYRKKGVGTALL-HKAIEWAKENHFCGLMLETQDINISACHFYAKHHF 146


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_06770  SACTRNSFRASE  39     2e-06    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 39.2 bits (91), Expect = 2e-06
Identities = 16/86 (18%), Positives = 33/86 (38%), Gaps = 6/86 (6%)

Query: 60 LYVAELDGEVVGTFQTAILTKLVGRGAKSMVIEAVQTRADMRGRGIGAVMINHCLEEARR 119
++ L+ +G K+ +IE + D R +G+G +++ +E A+
Sbjct: 67 AFLYYLENNCIGRI------KIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKE 120

Query: 120 QGLNAAQLTSNMARLDAHRFYERLGF 145
L + + A FY + F
Sbjct: 121 NHFCGLMLETQDINISACHFYAKHHF 146


36  B0909_24670  B0909_04340  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
B0909_24670  -1      19       -5.022944  divalent metal cation transporter
B0909_04365  -1      15       1.328729   VOC family protein
B0909_04360  -1      15       1.183927   hypothetical protein
B0909_04355  -2      15       0.995386   ABC transporter ATP-binding protein
B0909_04350  -2      12       2.010938   AI-2E family transporter
B0909_04345  -2      12       1.915372   N-acetyltransferase
B0909_04340  -2      12       2.453006   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_04360  TYPE3IMSPROT  29     0.040    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 29.0 bits (65), Expect = 0.040
Identities = 28/193 (14%), Positives = 55/193 (28%), Gaps = 29/193 (15%)

Query: 258 FATLDSTIALMFALLINASIL-----ILAAATFNKTGQTNVAELGEAHNLLAPLLGLAIA 312
+ +AL L+ + L ++ L + +
Sbjct: 29 VVSTALIVALSAMLMGLSDYYFEHFSKLMLIPAEQSYLPFSQALSYVVDNVLLEFFYLCF 88

Query: 313 PTLFGVALLCCGINSTV---TATLAGQIVMEGFLKMRLAPWLRRLITRAIAIVPAAAVTV 369
P L AL+ I S V ++G+ + K+ +R+
Sbjct: 89 PLLTVAALM--AIASHVVQYGFLISGEAIKPDIKKINPIEGAKRI--------------- 131

Query: 370 FYGDSGTAELL--ILTQVVLSLQLSFAVFPLVMFTADKAKMGALKA-PLWLSAFAWLIAV 426
+ E L IL V+LS+ + + ++ G PL L+ +
Sbjct: 132 -FSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPLLGQILRQLMVI 190

Query: 427 VIAVLNVKLLLDF 439
V + D+
Sbjct: 191 CTVGFVVISIADY 203


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_04340  BCTERIALGSPF  34     0.001    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 34.0 bits (78), Expect = 0.001
Identities = 16/76 (21%), Positives = 32/76 (42%), Gaps = 8/76 (10%)

Query: 23 RIALIPPISAARWLLVLVALAGIYFFHGFLVPVLAALVIGF-ASWPVYTRLLRQVGGNTT 81
+ A+I P +L +VA+A + +VP + I + P+ TR+L +G +
Sbjct: 170 QQAMIYPC-----VLTVVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVL--MGMSDA 222

Query: 82 LGATIAIILILAFLVV 97
+ +L+
Sbjct: 223 VRTFGPWMLLALLAGF 238


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_04335  SACTRNSFRASE  46     7e-09    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 45.7 bits (108), Expect = 7e-09
Identities = 18/91 (19%), Positives = 36/91 (39%)

Query: 39 DIGALEQDNVSFFVARHDGRVVGCGALVEAGDGTAEVKRMFVDPDARGLRIGKLIMDTLV 98
D+ +E++ + F+ + +G + +G A ++ + V D R +G ++ +
Sbjct: 56 DVSYVEEEGKAAFLYYLENNCIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAI 115

Query: 99 ARGAELGLSAIRLETGISQPEAIGLYRKAGF 129
E + LET A Y K F
Sbjct: 116 EWAKENHFCGLMLETQDINISACHFYAKHHF 146


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_04330  BACYPHPHTASE  26     0.047    Salmonella/Yersinia modular tyrosine phosphatase si...
		>BACYPHPHTASE#Salmonella/Yersinia modular tyrosine phosphatase signature.

Length = 468

Score = 26.3 bits (57), Expect = 0.047
Identities = 13/57 (22%), Positives = 25/57 (43%)

Query: 62 PGEQAPQTAGAQPPATAPATTPQATAAVTPKEEESLTKQAISRVSRILPSTEGIKHL 118
P + P T+G A AT P + P+ L+ + + + + P+T ++L
Sbjct: 162 PPRERPHTSGHHGAGEARATAPSTVSPYGPEARAELSSRLTTLRNTLAPATNDPRYL 218


37  B0909_03530  B0909_03495  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
B0909_03530  0       12       1.690911   sugar ABC transporter substrate-binding protein
B0909_03525  0       14       1.627732   ABC transporter permease
B0909_03520  -1      12       1.940625   transketolase
B0909_03515  -2      13       1.222748   transketolase family protein
B0909_03510  -2      13       0.943293   glycerol kinase
B0909_03505  -1      10       0.751511   sugar-binding transcriptional regulator
B0909_03500  -2      11       -0.683527  SDR family oxidoreductase
B0909_03495  0       14       -1.172863  ABC transporter permease
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
B0909_03525  HTHFIS  28     0.049    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 28.3 bits (63), Expect = 0.049
Identities = 11/50 (22%), Positives = 20/50 (40%), Gaps = 3/50 (6%)

Query: 210 VSIVFAQADGLALGAAQAIKVANPSQK-IVVGGFDGDTAALEALKNGVFD 258
+V + L IK A P +V+ + A++A + G +D
Sbjct: 53 TDVVMPDENAFDL--LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYD 100


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_03500  ARGREPRESSOR  29     0.009    Bacterial arginine repressor signature.
		>ARGREPRESSOR#Bacterial arginine repressor signature.

Length = 149

Score = 29.4 bits (66), Expect = 0.009
Identities = 23/117 (19%), Positives = 39/117 (33%), Gaps = 9/117 (7%)

Query: 20 AQHLNLSQATVSRMLKRAEAEGIVRTSIIPPPGTYSDLEAQLRERFDLPEAIVVDCSEDR 79
N++QATVSR +K +V+ YS Q + ++D
Sbjct: 31 KDGYNVTQATVSRDIKEL---HLVKVPTNNGSYKYSLPADQRFNPLSKLKRSLMDAFVKI 87

Query: 80 DGA---IMARIGEAAAHFLEVTLSQN---EIIGVSSWSQTIFKMVENIHPLKGAKAR 130
D A I+ + A + + EI+G TI + K + +
Sbjct: 88 DSASHLIVLKTMPGNAQAIGALMDNLDWEEIMGTICGDDTILIICRTHDDTKVVQKK 144


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_03495  DHBDHDRGNASE  128    5e-38    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 128 bits (322), Expect = 5e-38
Identities = 73/254 (28%), Positives = 112/254 (44%), Gaps = 5/254 (1%)

Query: 5 EGQSVFVTGGNKGIGYGIARRFAEEGAKVALASVDRDTHDAARKLADETGAVTHGVILDV 64
EG+ F+TG +GIG +AR A +GA +A + + + DV
Sbjct: 7 EGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADV 66

Query: 65 RDAAAVREAYGAAEEAIGALSISVQNAGVITISKIEELTQEQWDFNLDVNTKGAFLCCQE 124
RD+AA+ E E +G + I V AGV+ I L+ E+W+ VN+ G F +
Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRS 126

Query: 125 AIRRFRESGTKGRLVNTASGQARQGFIYTPHYAASKFGVIGLTQSLAKELAPEGITVNAI 184
+ + G +V S A YA+SK + T+ L ELA I N +
Sbjct: 127 VSKYMMDR-RSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185

Query: 185 CPGIIHTEMWDYNDRVWGQMLGDYKPGELMAEWVR-NIPMRRAGTPAEVAALVAFLASED 243
PG T+M +W G + + E + IP+++ P+++A V FL S
Sbjct: 186 SPGSTETDM---QWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242

Query: 244 AAYITGQTINVDGG 257
A +IT + VDGG
Sbjct: 243 AGHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_03490  ABC2TRNSPORT  76     2e-18    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 75.7 bits (186), Expect = 2e-18
Identities = 51/226 (22%), Positives = 97/226 (42%), Gaps = 3/226 (1%)

Query: 18 RRTLLQSVVSPVISTSLYFIVFGTAIGSRIQEVGGVSYGAFITPGLIMLTLLTQCIGNGS 77
++ L S++ + +Y G +G + VGGVSY AF+ G++ + +T
Sbjct: 28 KKAALASLLGHLAEPLIYLFGLGAGLGVMVGRVGGVSYTAFLAAGMVATSAMTAATFETI 87

Query: 78 FGIYFPKF-TGTVYEILSAPVAMTEILAGYVGAAATKGLMIGTIILITASFFVDITIAHP 136
+ + T +L + + +I+ G + AATK + G I + A+
Sbjct: 88 YAAFGRMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGAGIGVVAAALGYTQWLSL 147

Query: 137 FMMILFFVLTAVSFSLFGFIIGIWATNFEQLNLIPMLVVPPLTFLGGSFYSIDMLPPFWQ 196
+ LT ++F+ G ++ A +++ LV+ P+ FL G+ + +D LP +Q
Sbjct: 148 LYALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPIVFQ 207

Query: 197 TVSHFNPVLYLISGFRWSFYE--IADVNPVISLAMITLFLAFCLGV 240
T + F P+ + I R + DV + I + + F L
Sbjct: 208 TAARFLPLSHSIDLIRPIMLGHPVVDVCQHVGALCIYIVIPFFLST 253


38  B0909_02670  B0909_02620  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
B0909_02670  -1      14       -0.078299  GNAT family N-acetyltransferase
B0909_02665  -2      12       0.694470   aminoglycoside phosphotransferase
B0909_02660  -1      11       0.653826   NAD-dependent DNA ligase LigA
B0909_02655  -1      12       1.418552   DNA repair protein RecN
B0909_02650  -1      13       0.862218   outer membrane protein assembly factor BamD
B0909_02645  0       14       1.081348   UDP-3-O-acyl-N-acetylglucosamine deacetylase
B0909_02640  -1      11       0.961429   cell division protein FtsZ
B0909_02635  -2      11       0.123983   cell division protein FtsA
B0909_02630  -2      11       -0.173841  cell division protein FtsQ/DivIB
B0909_02625  -2      11       0.107340   D-alanine--D-alanine ligase
B0909_02620  -2      12       0.352182   MFS transporter
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_02670  SACTRNSFRASE  41     4e-07    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 41.5 bits (97), Expect = 4e-07
Identities = 20/90 (22%), Positives = 32/90 (35%), Gaps = 9/90 (10%)

Query: 70 ALVAILDGKIVGDIGLTRHTNRRAHAGSIGMGVHDAYTGRGIGSAMIGEILAIADRWL-- 127
A + L+ +G I + + N A I V Y +G+G+A++ A W
Sbjct: 67 AFLYYLENNCIGRIKIRSNWNGYALIEDIA--VAKDYRKKGVGTALLH----KAIEWAKE 120

Query: 128 -DLKRIELTVITDNEPALALYRKFGFEVEG 156
+ L N A Y K F +
Sbjct: 121 NHFCGLMLETQDINISACHFYAKHHFIIGA 150


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
B0909_02655  GPOSANCHOR  30     0.021    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 30.4 bits (68), Expect = 0.021
Identities = 39/251 (15%), Positives = 79/251 (31%), Gaps = 15/251 (5%)

Query: 139 TDAHRTLLDAFAGLSDEARAVQGFYRTWKDAERALKTHRAKVEAAAREADYLRSSVEELE 198
L A G + + A +T + + AL+ +A++E A A ++
Sbjct: 154 AARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKI 213

Query: 199 TLSPRDGE--EDELAERRAVMQKSERIAGDIAEASEFLNGNASPVPVIASMMRRLERKSH 256
+ A+ ++ + + + + L + + + LE+
Sbjct: 214 KTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKA---ALEARQAELEKALE 270

Query: 257 EAPGLLEDTVQLLDAALDSLSNAQMEVEAALRRTEFDPRELERVEERLFALRAAGRKYNV 316
A + + + E +++ + + L A R A ++
Sbjct: 271 GAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEA 330

Query: 317 AVPDLPAL---AEKMVADL-ADLDAGEEKLGKLEANLGVVKADF---DRAAQSLS---DK 366
L +E L DLDA E +LEA ++ + + QSL D
Sbjct: 331 EHQKLEEQNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDA 390

Query: 367 RRHAADALSGA 377
R A + A
Sbjct: 391 SREAKKQVEKA 401


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
B0909_02640  IGASERPTASE  41     1e-05    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 41.2 bits (96), Expect = 1e-05
Identities = 33/204 (16%), Positives = 63/204 (30%), Gaps = 10/204 (4%)

Query: 331 ADMRAAAAVAKPLIRPSAAVASAPAAVQPAPAVSQAQKAVDPIAQTIRSAEAEMERELG- 389
AD+ + + + + R A PA P+ + ++T+ E +
Sbjct: 1005 ADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQ 1064

Query: 390 ---FAAHQQPAQDFRPQSKLFASSPAEA----PAALRPAQPVQQVAPAPVAPAPVYHAPE 442
A + Q+ A S +E + V++ A V P+
Sbjct: 1065 NREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPK 1124

Query: 443 QVAAPAPRMQQPQAPIYQEPAPVARQPEPVRMPKVEDFPPVVKAEMEHRTHAAPAAQEER 502
+ +P+ +Q + Q A AR+ +P K A+ E + E+
Sbjct: 1125 VTSQVSPKQEQSET--VQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQP 1182

Query: 503 GPMGLLKRITNSLGRREEEEVPSD 526
NS+ E P+
Sbjct: 1183 VTESTTVNTGNSVVENPENTTPAT 1206


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_02635  SHAPEPROTEIN  46     2e-07    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 45.5 bits (108), Expect = 2e-07
Identities = 49/222 (22%), Positives = 86/222 (38%), Gaps = 26/222 (11%)

Query: 178 LLGVDMHVVTVERTALKNLELCVNRAHLSVEGMVATPYASGLAALVDDEVELGCAAIDMG 237
L+ V + VER A++ A ++ P A+ + A + G +D+G
Sbjct: 111 LVCVPVGATQVERRAIRESAQ---GAGAREVFLIEEPMAAAIGAGLPVSEATGSMVVDIG 167

Query: 238 GGTTTISVFAEGRLIHTDAIGLGG----HHVTTDLAR--GLSTRIEDAERLKVVHGSALL 291
GGTT ++V + ++++ ++ +GG + + R G AER+K GSA
Sbjct: 168 GGTTEVAVISLNGVVYSSSVRIGGDRFDEAIINYVRRNYGSLIGEATAERIKHEIGSAYP 227

Query: 292 NGADERDMISIPPIGEDDRDQPSQVSRALV--TRIVRARIEETLELIRDRIQKS------ 343
DE I + R+ V R + + ++E L I + +
Sbjct: 228 --GDEVREIEVR-----GRNLAEGVPRGFTLNSNEILEALQEPLTGIVSAVMVALEQCPP 280

Query: 344 -GFSPIVGKRVVLTGGASQLTGLPETARRILARNVRIG-RPM 383
S I + +VLTGG + L L V + P+
Sbjct: 281 ELASDISERGMVLTGGGALLRNLDRLLMEETGIPVVVAEDPL 322


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
B0909_02620  TCRTETA  30     0.015    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 30.2 bits (68), Expect = 0.015
Identities = 21/99 (21%), Positives = 39/99 (39%), Gaps = 16/99 (16%)

Query: 302 IFIPIAGKLAEKFGRREILILTTVLIGLFSFLLPTLMTGSEGSIFVFAALAMMLMGMTYG 361
P+ G L+++FGRR +L L+ L + + + ++V + ++ G+T
Sbjct: 58 ACAPVLGALSDRFGRRPVL-----LVSLAGAAVDYAIMATAPFLWVL-YIGRIVAGITGA 111

Query: 362 LIGTALAA-----PFPTRVRYTGSSITFNMAGIFGASLA 395
A A R R+ G M+ FG +
Sbjct: 112 TGAVAGAYIADITDGDERARHFGF-----MSACFGFGMV 145


39  B0909_01820  B0909_01780  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
B0909_01820  -1      14       3.371416   MoxR family ATPase
B0909_01815  -2      13       2.816231   DUF58 domain-containing protein
B0909_01810  -1      10       2.559099   DUF4159 domain-containing protein
B0909_01805  -1      10       1.623844   hypothetical protein
B0909_01800  0       11       0.632029   GNAT family N-acetyltransferase
B0909_01795  0       11       0.788032   LysR family transcriptional regulator
B0909_01790  2       11       0.482341   MBL fold metallo-hydrolase
B0909_01785  3       12       0.960665   MFS transporter
B0909_01780  0       13       -0.339730  N-acetyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
B0909_01820  HTHFIS  35     4e-04    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 34.8 bits (80), Expect = 4e-04
Identities = 15/55 (27%), Positives = 23/55 (41%), Gaps = 1/55 (1%)

Query: 121 GQLLMADEINRASPRTQSALLQAMQEYHITMAGQTYELPKPFHVLATQN-PLEQE 174
G L DEI Q+ LL+ +Q+ T G + ++A N L+Q
Sbjct: 232 GGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQS 286


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
B0909_01805  TONBPROTEIN  38     6e-05    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 38.0 bits (88), Expect = 6e-05
Identities = 32/180 (17%), Positives = 66/180 (36%), Gaps = 7/180 (3%)

Query: 49 AIANPLLTQEEREPLSTIVPVIVDRSQSQDVQDRPQMTDTALETLKDRLSRFPRIEPRIV 108
A+ P E EP P+ ++ V ++P+ ++ P+ + + V
Sbjct: 60 AVQPPPEPVVEPEP--EPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPV 117

Query: 109 EVRDDGESDSPSTQLFSALSSAVADVSPSRVGGAIFLSDGQIHDIPNALPNAEQALGFRA 168
E R ++ + + L+S+ A + S+ ++ + P QAL
Sbjct: 118 ESRPASPFENTAP---ARLTSSTATAATSKPVTSVASGPRALSRNQPQYPARAQALRIEG 174

Query: 169 PVHGLITGKADEFDRRIEIVRAPRFGIVNEEQQLTLRV--FDDGRPAGGGSAEVTVKMNG 226
V D ++I+ A + E + +R ++ G+P G + K+NG
Sbjct: 175 QVKVKFDVTPDGRVDNVQILSAKPANMFEREVKNAMRRWRYEPGKPGSGIVVNILFKING 234


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_01800  SACTRNSFRASE  38     2e-06    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 38.4 bits (89), Expect = 2e-06
Identities = 18/65 (27%), Positives = 32/65 (49%), Gaps = 3/65 (4%)

Query: 59 GWLYVQLLFVPETMRGKGTAAKLLAMAEEEARKRGCTGAYIDT--MNPDALRTYERYGFT 116
G+ ++ + V + R KG LL A E A++ G ++T +N A Y ++ F
Sbjct: 88 GYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFI 147

Query: 117 RIGSL 121
IG++
Sbjct: 148 -IGAV 151


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
B0909_01785  TCRTETA  45     3e-07    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 45.2 bits (107), Expect = 3e-07
Identities = 41/158 (25%), Positives = 66/158 (41%), Gaps = 11/158 (6%)

Query: 44 AASIAATFGLTVPQALQTGTAFFFGMLLGAAGFGRLADRYGRRRVLIVTVACDALFGVLS 103
+ + A +G+ + + A G L+DR+GRR VL+V++A A+ +
Sbjct: 38 SNDVTAHYGILL-------ALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIM 90

Query: 104 IFSPDFTILLILRFLTGAAVGGTLPVDYAMMAEFLPAKNRGRWLVFLEGFWAVGTLIVAL 163
+P +L I R + G G T V A +A+ R R F+ G +VA
Sbjct: 91 ATAPFLWVLYIGRIVAGIT-GATGAVAGAYIADITDGDERARHFGFMSA--CFGFGMVAG 147

Query: 164 AAWGASLAGVADAWRYIFAVTAFPAVLGLGLRFLVPES 201
G + G + + A A + L FL+PES
Sbjct: 148 PVLGGLMGGFSPHAPFFAA-AALNGLNFLTGCFLLPES 184


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_01780  SACTRNSFRASE  31     0.002    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 30.7 bits (69), Expect = 0.002
Identities = 10/34 (29%), Positives = 18/34 (52%)

Query: 81 LAVRPSHKNKGIGRELVRIAIAAARRKGSEAVIL 114
+AV ++ KG+G L+ AI A+ ++L
Sbjct: 95 IAVAKDYRKKGVGTALLHKAIEWAKENHFCGLML 128


40  B0909_01660  B0909_01610  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
B0909_01660  -1      16       -0.810258  DUF2333 family protein
B0909_01655  -1      14       -0.334628  LuxR family transcriptional regulator
B0909_01650  0       13       -0.415283  TIGR00645 family protein
B0909_01645  -1      13       -0.590494  TonB-dependent
B0909_01640  -1      13       -0.829637  extensin
B0909_01635  -2      11       -0.354449  hypothetical protein
B0909_01630  -2      11       -0.110198  anthranilate synthase
B0909_01625  0       10       -0.379194  GNAT family N-acetyltransferase
B0909_01620  -1      10       -0.338012  HD-GYP domain-containing protein
B0909_01615  -1      12       0.058334   adenine deaminase
B0909_01610  -2      10       0.215491   GNAT family N-acetyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_01650  TRNSINTIMINR  30     0.011    Translocated intimin receptor (Tir) signature.
		>TRNSINTIMINR#Translocated intimin receptor (Tir) signature.

Length = 549

Score = 30.5 bits (68), Expect = 0.011
Identities = 15/47 (31%), Positives = 24/47 (51%), Gaps = 3/47 (6%)

Query: 88 VPPGQELPAATPAAAAGATGAAANTSAPVCQR---SAIVDAAADLTD 131
+PP LP+ T AA G TG +++ + R S + ++ AD D
Sbjct: 16 IPPAPPLPSQTDGAARGGTGHLISSTGALGSRSLFSPLRNSMADSVD 62


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
B0909_01630  IGASERPTASE  41     6e-06    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 40.8 bits (95), Expect = 6e-06
Identities = 30/187 (16%), Positives = 53/187 (28%), Gaps = 20/187 (10%)

Query: 24 VPVPQPKPGEA-QSSTPSEKPREEKLQNPAEAPKPEPKPQVPEASKPDGEKTAADKEPGD 82
V Q+ PS E++ EAP P P P P + + + +
Sbjct: 992 VDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTV 1051

Query: 83 AKKEGEGKGSEGQQKPADGVKKAEEKPREPVKPEDPAELEACLGALKEIGAQFK-KLEPI 141
K E + + Q + E K + E + Q E
Sbjct: 1052 EKNEQDATETTAQNREVA----KEAKSN---VKANTQTNEVAQSGSETKETQTTETKETA 1104

Query: 142 RDEEQGCGIEAPIELSVVLPGIKLEPSGTMRCETALALSRWTKEMMLPAAALALPEKKVT 201
E++ +A +E + P T + ++ + E + P A A
Sbjct: 1105 TVEKEE---KAKVETE----KTQEVPKVT----SQVSPKQEQSETVQPQAEPARENDPTV 1153

Query: 202 AIANAST 208
I +
Sbjct: 1154 NIKEPQS 1160



Score = 29.6 bits (66), Expect = 0.017
Identities = 17/96 (17%), Positives = 27/96 (28%), Gaps = 6/96 (6%)

Query: 33 EAQSSTPSEKPREEKLQN--PAEAPKPEPKPQVPEASKPDGEKTAADKEPGDAKKEGEGK 90
E T P++E+ + P P E P V T AD E + +
Sbjct: 1121 EVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVE 1180

Query: 91 GSEGQQKPADGVKKAEEKPRE----PVKPEDPAELE 122
+ + E P +P +E
Sbjct: 1181 QPVTESTTVNTGNSVVENPENTTPATTQPTVNSESS 1216


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
B0909_01620  HTHFIS  34     0.002    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 34.4 bits (79), Expect = 0.002
Identities = 12/52 (23%), Positives = 17/52 (32%), Gaps = 1/52 (1%)

Query: 527 TGVKILLVDHEDSFVHTLANYFRQTGATVSTVRSPVAA-DVFDRFQPDLVVL 577
TG IL+ D + + L + G V + DLVV
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVT 53


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_01615  SACTRNSFRASE  41     5e-07    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 41.1 bits (96), Expect = 5e-07
Identities = 23/98 (23%), Positives = 36/98 (36%), Gaps = 13/98 (13%)

Query: 49 VHDMIENGHPQFVAIVDDEVIGWCDIR-----RVSRETRAHCGTLGMGILPAYRDKGLGA 103
V + E G F+ +++ IG IR E A YR KG+G
Sbjct: 57 VSYVEEEGKAAFLYYLENNCIGRIKIRSNWNGYALIEDIAVA--------KDYRKKGVGT 108

Query: 104 RLMRRTLDAARQCGLHRIELSVHGDNARAIALYEKIGF 141
L+ + ++ A++ + L N A Y K F
Sbjct: 109 ALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHF 146


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
B0909_01605  UREASE  30     0.034    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 29.7 bits (67), Expect = 0.034
Identities = 26/89 (29%), Positives = 38/89 (42%), Gaps = 12/89 (13%)

Query: 13 TGRQPADMVLKGGRFFDLVTGELVASDIAICGDTIVGTCENYEGRREIDITGKIVVPGFI 72
G AD+ LK GR + G+ D+ IVG G I GKIV G +
Sbjct: 81 WGIVKADIGLKDGRIAAI--GKAGNPDMQPGVTIIVGP-----GTEVIAGEGKIVTAGGM 133

Query: 73 DTHLHIESSLVTPHEFDRCVLPYGVTTAI 101
D+H+H + P + + L G+T +
Sbjct: 134 DSHIH----FICPQQIEE-ALMSGLTCML 157


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_01600  SACTRNSFRASE  30     0.003    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 29.9 bits (67), Expect = 0.003
Identities = 14/57 (24%), Positives = 26/57 (45%)

Query: 78 AFLEGVYVEEAFRRQGVAAALVAEVTRWAVGQGVSELASDADIANVDSHRMHAALGF 134
A +E + V + +R++GV AL+ + WA L + N+ + +A F
Sbjct: 90 ALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHF 146


41  B0909_00900  B0909_00870  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
B0909_00900  1       16       -0.505577  urea ABC transporter permease subunit UrtB
B0909_00895  0       16       -0.002487  urea ABC transporter substrate-binding protein
B0909_00890  -1      14       0.614794   DUF2735 domain-containing protein
B0909_00885  -2      16       1.003299   glutamine synthetase
B0909_00880  -1      17       1.586185   SDR family oxidoreductase
B0909_00875  -1      15       1.144268   hybrid sensor histidine kinase/response
B0909_00870  -1      14       1.373638   DNA-binding response regulator
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_00890  SECFTRNLCASE  29     0.040    Bacterial translocase SecF protein signature.
		>SECFTRNLCASE#Bacterial translocase SecF protein signature.

Length = 333

Score = 29.0 bits (65), Expect = 0.040
Identities = 24/148 (16%), Positives = 61/148 (41%), Gaps = 9/148 (6%)

Query: 146 LKAPSAENLELLEAAISKETDKEVR---TRMEEARAVSLLSSDRPLDQKKDAIATIKSLG 202
++ +A ++ + AA+ +V R R ++ R Q+ A +
Sbjct: 57 TESTTAIDVGVYRAALEPLELGDVIISEVRDPSFREDQHVAMIRIQMQEDGQGAEGQGAQ 116

Query: 203 GRDAIGILMAASSSVDQSLKPDIDEAISSIESSLAFWDYVQNVWYGLSLGSVLLLAAIGL 262
G++ + + A ++VD +LK E++ S + V + L +V+++ I +
Sbjct: 117 GQELVNKVETALTAVDPALKITSFESVGPKVSG----ELVWTAVWSLLAATVVIMFYIWV 172

Query: 263 AI--TFGVMGIINMAHGEMVMIGAYSTF 288
F + ++ + H ++ +G ++
Sbjct: 173 RFEWQFALGAVVALVHDVLLTVGLFAVL 200


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_00870  DHBDHDRGNASE  124    9e-37    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 124 bits (313), Expect = 9e-37
Identities = 72/254 (28%), Positives = 114/254 (44%), Gaps = 8/254 (3%)

Query: 9 LENRTAIITGSGRGLGFEIASAFAEAGAHVWLTGRNAEMLEQAVETLRKAGGKADYAAFD 68
+E + A ITG+ +G+G +A A GAH+ N E LE+ V +L+ A+ D
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 69 IADTAAGSALVRRIMAEFGHLDILVNNVGARDRRPLAEFTDEDVLELIRTDLTSSISLSR 128
+ D+AA + RI E G +DILVN G + +DE+ + T + SR
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 129 DAAEAMNTNGYGRIITITSILGHIVRPGDAIYPVAKQGLTGLMRAIAVEYGARGITSNAI 188
++ M G I+T+ S + R A Y +K + + +E I N +
Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185

Query: 189 APGMFATETNAAL-----AENPDMVAFA---KLRVPLERWGRPDEIAGAALFLASDAASF 240
+PG T+ +L + K +PL++ +P +IA A LFL S A
Sbjct: 186 SPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGH 245

Query: 241 VNGHVLTVDGGMSV 254
+ H L VDGG ++
Sbjct: 246 ITMHNLCVDGGATL 259


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
B0909_00865  HTHFIS  74     1e-15    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 74.1 bits (182), Expect = 1e-15
Identities = 28/118 (23%), Positives = 52/118 (44%), Gaps = 6/118 (5%)

Query: 920 PRRTIVVVDDNEDHRELMRQVLSPLDFVVLTAQSGPECLTLIEGVKPDLFLIDISMPGMS 979
TI+V DD+ R ++ Q LS + V + I DL + D+ MP +
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 980 GWQLVTKLREAGQTAPLIMLSANIGDGTVAGAGEDNHNDA---IAKPVDIRHLCDRLA 1034
+ L+ ++++A P++++SA T A + + A + KP D+ L +
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQ---NTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
B0909_00860  HTHFIS  90     3e-22    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 89.9 bits (223), Expect = 3e-22
Identities = 33/132 (25%), Positives = 55/132 (41%), Gaps = 2/132 (1%)

Query: 9 PRDIVLLVDDSPEALGFLTDALEQSGFSALIATSGQAALNIAERITPDIILLDAVMPTMD 68
+L+ DD L AL ++G+ I ++ D+++ D VMP +
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 69 GFETCRRLKANAAVAQVPVIFMTGLTETEHVVRALESGGVDYLTKPINIDELRARIRVHL 128
F+ R+K A +PV+ M+ ++A E G DYL KP ++ EL I L
Sbjct: 62 AFDLLPRIK--KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119

Query: 129 SNARSAQSARVA 140
+ + S
Sbjct: 120 AEPKRRPSKLED 131


42  B0909_00285  B0909_00235  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
B0909_00285  -1      13       -0.120648  flavin reductase
B0909_00280  -1      15       -0.948090  MarR family transcriptional regulator
B0909_00275  0       12       -0.328882  hypothetical protein
B0909_00270  0       13       0.060438   porin family protein
B0909_00265  0       14       0.018757   sensor histidine kinase
B0909_00260  -1      15       0.284336   DNA-binding response regulator
B0909_00255  -2      14       0.416634   TetR/AcrR family transcriptional regulator
B0909_00250  -2      15       0.584620   FAD-binding protein
B0909_00245  -2      18       0.944643   two-component sensor histidine kinase
B0909_00240  -2      17       0.287337   DNA-binding response regulator
B0909_00235  -2      17       0.485156   protein-disulfide reductase DsbD
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits  Score  E-value  Comments
B0909_00285  SECA  32     0.001    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 31.8 bits (72), Expect = 0.001
Identities = 12/56 (21%), Positives = 20/56 (35%), Gaps = 3/56 (5%)

Query: 56 SMEPPSLLVCLNNRTLLHELLLCRPDFIVNVLTQDQAALSDAFSGKVSPEERFRDG 111
S L+ + H L D+IV + + D +G+ R+ DG
Sbjct: 300 SPANIMLMHHVTAALRAHALFTRDVDYIV---KDGEVIIVDEHTGRTMQGRRWSDG 352


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
B0909_00270  OMPADOMAIN  36     9e-05    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 35.7 bits (82), Expect = 9e-05
Identities = 48/210 (22%), Positives = 73/210 (34%), Gaps = 35/210 (16%)

Query: 12 TAAVMAATSAYSADLANYNAQQFDYANPPAFSWGGAYIGAHGGVASPRFNPLAGGRG--- 68
TA +A A A +A A P +W Y GA G + G
Sbjct: 4 TAIAIAVALAGFATVAQ--------AAPKDNTW---YTGAKLGWSQYHDTGFINNNGPTH 52

Query: 69 ---LTGGVQAGYNFQFGSGVVGAELEGSYLGNDARVPNGRLRERFRGAAK-----LKAGV 120
L G GY VG E+ +LG R+P E A+ K G
Sbjct: 53 ENQLGAGAFGGYQVNPY---VGFEMGYDWLG---RMPYKGSVENGAYKAQGVQLTAKLGY 106

Query: 121 ALDRTL-LYGTAGLTTTKFKDGNGVTG-PDDWKRGYLVGAGVEQSFGGGLSAKFEYNYVN 178
+ L +Y G + + V G D + GVE + ++ + EY + N
Sbjct: 107 PITDDLDIYTRLGGMVWRADTKSNVYGKNHDTGVSPVFAGGVEYAITPEIATRLEYQWTN 166

Query: 179 TGNVSTTTSSGRSKTDVSDHVIKAGLNYRF 208
N+ + G T + ++ G++YRF
Sbjct: 167 --NIGDAHTIG---TRPDNGMLSLGVSYRF 191


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
B0909_00260  HTHFIS  89     1e-22    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 88.7 bits (220), Expect = 1e-22
Identities = 32/118 (27%), Positives = 56/118 (47%), Gaps = 1/118 (0%)

Query: 2 RILLLEDEPEMARALLEALRRRDVLADHVSTISDADALARDGSYDVLVLDRRLPDGEGLS 61
IL+ +D+ + L +AL R S + G D++V D +PD
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 LVSSLRRRKHSVPILVLTALGSVDHRVDGLDAGADDYLAKPFAIEELLARL-RALHRR 118
L+ +++ + +P+LV++A + + + GA DYL KPF + EL+ + RAL
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
B0909_00255  HTHTETR  50     1e-09    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 49.6 bits (118), Expect = 1e-09
Identities = 20/126 (15%), Positives = 45/126 (35%), Gaps = 1/126 (0%)

Query: 14 LEAAYEALLESGVDSVKILPLAKKLNLSRTSFYWFFKDREELLAALLSRWREKNTGSILR 73
L+ A + GV S + +AK ++R + YW FKD+ +L + + L
Sbjct: 17 LDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELE 76

Query: 74 QSEAYAETLAEAMLNVFDCWLDSSLFDSKFEFAVRSWALQSDDILAELRKADQMRMEALS 133
+ + + L+S++ + + + + + E+ Q +
Sbjct: 77 YQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEII-FHKCEFVGEMAVVQQAQRNLCL 135

Query: 134 RMFMRF 139
+ R
Sbjct: 136 ESYDRI 141


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
B0909_00240  HTHFIS  94     2e-24    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 94.1 bits (234), Expect = 2e-24
Identities = 36/128 (28%), Positives = 57/128 (44%)

Query: 2 RLLVVEDDDVLLDGLRVGLQLAGFTVDAVMTLGDAKIALENCRFDAVVLDVMLPDGSGLD 61
+LV +DD + L L AG+ V + D VV DV++PD + D
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 LLRHTRNERNRVPILLLTAKDATTDKICGLDAGADDYLGKPFDLDEVAARLRAIIRRGEG 121
LL + R +P+L+++A++ I + GA DYL KPFDL E+ + + +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 122 RSEGTLSA 129
R
Sbjct: 125 RPSKLEDD 132


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
B0909_00235  PF03544  29     0.037    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 29.2 bits (65), Expect = 0.037
Identities = 18/94 (19%), Positives = 26/94 (27%), Gaps = 7/94 (7%)

Query: 134 AEAPFGIRPQEPPAFQSFALIAPSAEPEK----AEQSVPSPVPPAEPGMVEGFLARGGTA 189
EAP I +P + +P++ E SP P A T+
Sbjct: 88 KEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATS 147

Query: 190 LLLASFMGLGVLLAFTPCVFPMYPILAATLGRQG 223
+ P YP A L +G
Sbjct: 148 ---KPVTSVASGPRALSRNQPQYPARAQALRIEG 178


43  B0909_00010  B0909_24770  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
B0909_00010  1       21       -3.592187  sn-glycerol-3-phosphate ABC transporter
B0909_00005  -1      23       -3.384969  ***phosphoribosylaminoimidazolesuccinocarboxamide
B0909_24755  0       20       -2.622671  TetR/AcrR family transcriptional regulator
B0909_24760  1       16       -2.046166  efflux RND transporter periplasmic adaptor
B0909_24770  1       15       -1.135809  hydrophobe/amphiphile efflux-1 family RND
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
B0909_00005  PF05272  35     4e-04    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 35.0 bits (80), Expect = 4e-04
Identities = 15/56 (26%), Positives = 20/56 (35%), Gaps = 9/56 (16%)

Query: 32 VVLVGPSGCGKSTLLRMIAGLETVTSGDISISNRVVNEIEPKDRDIAMVFQNYALY 87
VVL G G GKSTL+ + GL+ + I +D Y
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDI---------GTGKDSYEQIAGIVAY 645


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
B0909_24780  HTHTETR  71     2e-17    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 71.2 bits (174), Expect = 2e-17
Identities = 40/207 (19%), Positives = 83/207 (40%), Gaps = 19/207 (9%)

Query: 21 RRPRRSAEETRRDILAKAEELFRERGFNAVAIADIAAALGMSPANVFKNFSSKNALVDAI 80
R+ ++ A+ETR+ IL A LF ++G ++ ++ +IA A G++ ++ +F K+ L I
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 81 A---FEQIGAFERDI---RPLDKNHAPLSRLRHLARTLMEQHHQDL------NDNPYIFE 128
IG E + P D L H+ + + + + L + ++ E
Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGE 122

Query: 129 MILMTAKQDMKCGDYYKAVIASLLADIISDGIDAG-IYAPADIPPLAETVLHALTSVIHP 187
M ++ Q C + Y + + I+A + A A + ++ ++
Sbjct: 123 MAVVQQAQRNLCLESY-----DRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMEN 177

Query: 188 VLIAREDIGNLATRCDQLVDLIDAGLR 214
L A + +L V ++
Sbjct: 178 WLFAPQSF-DLKKEARDYVAILLEMYL 203


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
B0909_24785  RTXTOXIND  49     2e-08    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 49.1 bits (117), Expect = 2e-08
Identities = 29/132 (21%), Positives = 58/132 (43%), Gaps = 20/132 (15%)

Query: 69 EIRPRVTGIIREIPFKEGSEVKQGDILYQIEDNTYLAEVAQAKANVAKAEASIPSAQANL 128
EI+P I++EI KEG V++GD+L ++ A+A+ K ++S+ A+
Sbjct: 98 EIKPIENSIVKEIIVKEGESVRKGDVLLKLTA-------LGAEADTLKTQSSLLQARLEQ 150

Query: 129 ARYERLVNS----GATQIEYENAKVTLLQAEADVAQTKAAL---------ETAEINLDLT 175
RY+ L S +++ + +E +V + + + + + L+L
Sbjct: 151 TRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLD 210

Query: 176 KVRAPFDGITSA 187
K RA + +
Sbjct: 211 KKRAERLTVLAR 222



Score = 43.3 bits (102), Expect = 1e-06
Identities = 20/98 (20%), Positives = 33/98 (33%), Gaps = 9/98 (9%)

Query: 105 AEVAQAKANVAKAEASIPSAQANLARYERLVNSGATQIEYENAKVTLLQAEADVAQTKAA 164
E+ K+ + + E+ I SA+ TQ+ L Q ++
Sbjct: 266 NELRVYKSQLEQIESEILSAKEEYQL--------VTQLFKNEILDKLRQTTDNIGLLTLE 317

Query: 165 LETAEINLDLTKVRAPFDGITSA-TAFSIGNVVTANQT 201
L E + +RAP + G VVT +T
Sbjct: 318 LAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAET 355


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_24790  ACRIFLAVINRP  1118   0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 1118 bits (2893), Expect = 0.0
Identities = 582/1031 (56%), Positives = 742/1031 (71%), Gaps = 5/1031 (0%)

Query: 1 MAHFFIRRPVFAWVIAIVIMLGGALAIATLSISQYPDIAPTTVRVSATYNGASAETVEKS 60
MA+FFIRRP+FAWV+AI++M+ GALAI L ++QYP IAP V VSA Y GA A+TV+ +
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 61 VTTIIEDGMTGLDDLTYMTSSS-STGSAEVTLTFGNSIMPDIAQVQVQNKLQLVQSQLPD 119
VT +IE M G+D+L YM+S+S S GS +TLTF + PDIAQVQVQNKLQL LP
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 120 TVQQQGIQVSRSTSSILMVGALISTDGKRNSADLGDVFSSRVEDQIKRLEGVGSINVFGS 179
VQQQGI V +S+SS LMV +S + D+ D +S V+D + RL GVG + +FG+
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 180 EYAMRIWLDPFKLNKYQLTTADVTSAIESQNTQVSVGSLGAVPAVKGQQLNVTVTAQSQL 239
+YAMRIWLD LNKY+LT DV + ++ QN Q++ G LG PA+ GQQLN ++ AQ++
Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240

Query: 240 TTVADFESVILKVEKDGATVRLSDVARIEIGQETYGGDSRSNGRPSAGFAVNLATGANAL 299
+F V L+V DG+ VRL DVAR+E+G E Y +R NG+P+AG + LATGANAL
Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300

Query: 300 DTAARVKAALANVEGSLPEGVAIEYPYDTTPFVKLSIEKVVHTLIEAIILVFVVLLVFLQ 359
DTA +KA LA ++ P+G+ + YPYDTTPFV+LSI +VV TL EAI+LVF+V+ +FLQ
Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360

Query: 360 NLRATFIPMIAVPVVLLGTFGILALTGYSINTLTMFAMVLAIGLLVDDAIVVVENVERIM 419
N+RAT IP IAVPVVLLGTF ILA GYSINTLTMF MVLAIGLLVDDAIVVVENVER+M
Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420

Query: 420 SEEGLSPVEATEKSMGEITGAIIGIALVLTAVFIPMAFFGGSTGIIYRQFSVTIVSAMLL 479
E+ L P EATEKSM +I GA++GIA+VL+AVFIPMAFFGGSTG IYRQFS+TIVSAM L
Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480

Query: 480 SAVVAIVLTPALCATMLKPI--DHHKKQRGPGAWFNRGFGKTTDGYVSSIGYLLKRPLRV 537
S +VA++LTPALCAT+LKP+ +HH+ + G WFN F + + Y +S+G +L R
Sbjct: 481 SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540

Query: 538 MIIFAIVIGGCAWFFSKLPSSFLPQEDQGVLLTIIQTPTGSNIERTNEVVKQVESYFREK 597
++I+A+++ G F +LPSSFLP+EDQGV LT+IQ P G+ ERT +V+ QV Y+ +
Sbjct: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600

Query: 598 EAANVESVFGVLGFSFSGSGQNNAIVFTKLKDFSERTAPDQHAGAIVQRAMGTFFGFRDA 657
E ANVESVF V GFSFSG QN + F LK + ER + A A++ RA RD
Sbjct: 601 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDG 660

Query: 658 QVFPLLPPAIQGMGTSSGFSMYLVDSGRNGTDALTASSKELIALATGNP-KISSLRSDSQ 716
V P PAI +GT++GF L+D G DALT + +L+ +A +P + S+R +
Sbjct: 661 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL 720

Query: 717 DNETQMKIILDQEKMGAMGVDLSSVNLMLSTIFAGRDVNDFTLNGELKPVYVQGDAPYRM 776
++ Q K+ +DQEK A+GV LS +N +ST G VNDF G +K +YVQ DA +RM
Sbjct: 721 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRM 780

Query: 777 QPDDLKFWYARNTNGEMVPFSSFSEVKWINAPPSLARFNGTGAISLEGTAGAGVASGEAM 836
P+D+ Y R+ NGEMVPFS+F+ W+ P L R+NG ++ ++G A G +SG+AM
Sbjct: 781 LPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAM 840

Query: 837 DEMERLTASLPGGYTVAWQGISYQERLSGSQAPMLYALSVLIVFLCLAALYESWSIPFSV 896
ME L + LP G W G+SYQERLSG+QAP L A+S ++VFLCLAALYESWSIP SV
Sbjct: 841 ALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSV 900

Query: 897 ILAVPVGVLGALTAAHFFGQTNDVYFKVGLLTTIGLAAKNAILIVEFAKERQEH-GLSLV 955
+L VP+G++G L AA F Q NDVYF VGLLTTIGL+AKNAILIVEFAK+ E G +V
Sbjct: 901 MLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVV 960

Query: 956 EAALEAAKLRLRPIIMTSLAFILGVVPLAIATGAGSAAQNAIGIGVLGGMLSATLLGIFF 1015
EA L A ++RLRPI+MTSLAFILGV+PLAI+ GAGS AQNA+GIGV+GGM+SATLL IFF
Sbjct: 961 EATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFF 1020

Query: 1016 VPSFFVIIRRL 1026
VP FFV+IRR
Sbjct: 1021 VPVFFVVIRRC 1031


44  B0909_24980  B0909_25015  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
B0909_24980  0       15       0.001047   MFS transporter
B0909_24985  0       17       0.250087   M81 family peptidase
B0909_24990  0       18       -0.164363  beta-N-acetylhexosaminidase
B0909_24995  -1      17       -0.701168  SDR family oxidoreductase
B0909_25000  -2      15       -0.751295  MurR/RpiR family transcriptional regulator
B0909_25005  -2      12       -0.965938  hypothetical protein
B0909_25010  -1      11       -1.918711  sn-glycerol-3-phosphate ABC transporter
B0909_25015  0       9        -3.165985  ABC transporter substrate-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
B0909_24995  TCRTETA  34     8e-04    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 34.0 bits (78), Expect = 8e-04
Identities = 67/340 (19%), Positives = 130/340 (38%), Gaps = 23/340 (6%)

Query: 40 IREIGLSNTV---YSLLMLFAAIINVSASVLMGIVADRLGEYRKPMLFIALFGVS-GYML 95
+R++ SN V Y +L+ A++ + + ++G ++DR G R+P+L ++L G + Y +
Sbjct: 32 LRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFG--RRPVLLVSLAGAAVDYAI 89

Query: 96 VYLAGNATAFVSAKLILLPIFGAMNSLIFAHVRADARNLSTGDMIAVN-SIMRATISLSW 154
+ ATA L + I + A A +++ GD A + M A
Sbjct: 90 M-----ATAPFLWVLYIGRIVAGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGM 144

Query: 155 VLVPGLVGIFLISSGNMLTAFLFSGLCALVCFLLVAFCLPRAATPAVAN-TEARFGLRAS 213
V P L G+ S + F + + FL F LP + AS
Sbjct: 145 VAGPVLGGLMGGFSPHA--PFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLAS 202

Query: 214 LAEIGSRRVLLRIIAIALICSML-HLNDSIRSLIITGQAKGTVADIGIVAGIVAALEIVF 272
V+ ++A+ I ++ + ++ + + IGI L +
Sbjct: 203 FRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLA 262

Query: 273 -ILLWGWIEKKVPQTLTLAAGAVLYAVYLILQGLATAPWHIYAQTVISALGAAAIISIPI 331
++ G + ++ + L G + IL AT W + V+ A G + ++
Sbjct: 263 QAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASGGIGMPALQ- 321

Query: 332 TYLQELIADRP-----GLGSSLIAVNIFLGAGLGALIFAL 366
L + + G ++L ++ +G L I+A
Sbjct: 322 AMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAA 361


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_25010  DHBDHDRGNASE  104    4e-29    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 104 bits (261), Expect = 4e-29
Identities = 75/252 (29%), Positives = 118/252 (46%), Gaps = 12/252 (4%)

Query: 7 QLAVVTGAAGDIGRAIAAALSESHAKVVLVDIDADALSNALSQLSGPGFVAKT--CDVTD 64
++A +TGAA IG A+A L+ A + VD + + L +S L A+ DV D
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 65 PQDLARLAAEV-SELGDVATLVNNAGAARAVSLHDTTADIWRRDNALNLEAPFLCFRAFE 123
+ + A + E+G + LVN AG R +H + + W ++N F R+
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 124 DALKRTR-GSVVNITS-VNGMAVFGHPAYSAAKAGLIHLTKLIAVEYGKFGIRSNAVAPG 181
+ R GS+V + S G+ AY+++KA + TK + +E ++ IR N V+PG
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188

Query: 182 TVRT----QAWEARAASNPQV---FEEAKHWYPLQRIVRPEDVASAVAFLAGPQAQAISG 234
+ T W + + E K PL+++ +P D+A AV FL QA I+
Sbjct: 189 STETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHITM 248

Query: 235 VCLPVDCGLTAG 246
L VD G T G
Sbjct: 249 HNLCVDGGATLG 260


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
B0909_25025  PF05272  31     0.011    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.8 bits (69), Expect = 0.011
Identities = 11/22 (50%), Positives = 14/22 (63%)

Query: 32 VVFVGPSGCGKSTLLRMIAGLE 53
VV G G GKSTL+ + GL+
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLD 620


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
B0909_25030  MALTOSEBP  54     4e-10    Maltose binding protein signature.
		>MALTOSEBP#Maltose binding protein signature.

Length = 396

Score = 54.0 bits (129), Expect = 4e-10
Identities = 87/357 (24%), Positives = 135/357 (37%), Gaps = 45/357 (12%)

Query: 70 EQCQDKATTLAAAGTPVAMAYVGSRTLKQFAQNDLIVPVPMTEDEKKSYYPNIVDTVTFE 129
++ ++K +AA G + + +AQ+ L+ + + + YP D V +
Sbjct: 67 DKLEEKFPQVAATGDGPDIIFWAHDRFGGYAQSGLLAEITPDKAFQDKLYPFTWDAVRYN 126

Query: 130 DTQWGVPVAFSTKALYWNKDLFKQAGLDPEVPPKTWAEEIAFAKQIKEKTGIAGYGLPAK 189
P+A +L +NKDL PPKTW E A K++K K G A
Sbjct: 127 GKLIAYPIAVEALSLIYNKDLLPN-------PPKTWEEIPALDKELKAK------GKSAL 173

Query: 190 TFDNTMHQFMHWVYT----------NNGKVIDGDKITVDSPQVVAALTAYKD-ITPYSVE 238
F N + W NGK D + VD+ A LT D I +
Sbjct: 174 MF-NLQEPYFTWPLIAADGGYAFKYENGKY-DIKDVGVDNAGAKAGLTFLVDLIKNKHMN 231

Query: 239 GPTAYEQNEIRAIFLDGKVGMIQAGSGAATRLQETKINWGIATLPL---GPEAKGPGTLL 295
T Y E A F G+ M G A + + +K+N+G+ LP P G L
Sbjct: 232 ADTDYSIAE--AAFNKGETAMTINGPWAWSNIDTSKVNYGVTVLPTFKGQPSKPFVGVL- 288

Query: 296 ITDSLAIFKGTGVEEKATEFAK--FITSPGPQGEYELQGGAGLTPLRPSA--KVDEFVAK 351
S I + +E A EF + +T G L+ PL A +E +AK
Sbjct: 289 ---SAGINAASPNKELAKEFLENYLLTDEG------LEAVNKDKPLGAVALKSYEEELAK 339

Query: 352 DPFWKPLIDGIAYGGPEPLFTDYKGFQDVMIEMVQSVVTGKATPEDAAKKASSALEQ 408
DP ++ G P F + V + +G+ T ++A K A + + +
Sbjct: 340 DPRIAATMENAQKGEIMPNIPQMSAFWYAVRTAVINAASGRQTVDEALKDAQTRITK 396


45  B0909_25650  B0909_25690  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
B0909_25650  -2      11       0.697516   OmpA family protein
B0909_25655  -1      12       0.111153   flavodoxin-dependent
B0909_25660  -2      9        0.054017   MFS transporter
B0909_25665  -2      9        -0.111573  DeoR/GlpR transcriptional regulator
B0909_25670  0       10       0.162132   pyruvate carboxylase
B0909_25675  0       10       0.205626   LuxR family transcriptional regulator
B0909_25680  0       11       0.269663   glucan ABC transporter ATP-binding protein/
B0909_25685  -1      11       0.605350   hypothetical protein
B0909_25690  -1      10       0.702763   protein ndvB
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
B0909_25655  OMPADOMAIN  109    1e-30    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 109 bits (273), Expect = 1e-30
Identities = 45/173 (26%), Positives = 75/173 (43%), Gaps = 22/173 (12%)

Query: 58 RRNAALIGAGIGALAGGAIGNYMDGQEAELRAQLQGTGVSVSRRGDSIVLNMPSNITFAT 117
R + ++ G+ G + ++Q + + S++ F
Sbjct: 177 RPDNGMLSLGVSYRFGQGEAAPVVAPAPAPAPEVQ-----------TKHFTLKSDVLFNF 225

Query: 118 DQDQVIPPFYQTLDSVGIVLNKFNRTL--IDINGHTDSTGSPGYNQGLSERRAASVANYL 175
++ + P LD + L+ + + + G+TD GS YNQGLSERRA SV +YL
Sbjct: 226 NKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGLSERRAQSVVDYL 285

Query: 176 GARGVDQRRISTLGFGASQPIASN---------ATPDGRAQNRRVEVLISPLK 219
++G+ +IS G G S P+ N A D A +RRVE+ + +K
Sbjct: 286 ISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIEVKGIK 338


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
B0909_25665  TCRTETA  35     4e-04    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 35.2 bits (81), Expect = 4e-04
Identities = 56/293 (19%), Positives = 99/293 (33%), Gaps = 22/293 (7%)

Query: 40 PKIPEFAARLSLSESA---LGLIILVFGIGSLVFMPIAGSQIARFGSRTVSLVTAAIFLP 96
P +P L S G+++ ++ + P+ G+ RFG R V LV+ A
Sbjct: 26 PVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAV 85

Query: 97 TLLFISWAGTIWAGVIAVFLFG--GLTGAMDVAMNANAVAVERDMRR-AIMSSCHAFWSL 153
++ A +W I + G G TGA+ A A+ + R MS+C +
Sbjct: 86 DYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADITDGDERARHFGFMSAC---FGF 142

Query: 154 GGLIGAGLGGYLITAIGVQG---HAVALTVIALVLLVIAWPRVLADRPHPEAERPKGGL- 209
G + G LGG + A AL + + P P L
Sbjct: 143 GMVAGPVLGGLM-GGFSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLA 201

Query: 210 ------PLTPLPWIIGLMALFSMIPEGAILDWGALYLRNELGASVSQSGFAFAAFSMTMA 263
+T + ++ + + ++ + W ++ + + G + AAF + +
Sbjct: 202 SFRWARGMTVVAALMAVFFIMQLVGQVPAALW-VIFGEDRFHWDATTIGISLAAFGILHS 260

Query: 264 -AMRFAGDLVRDRFGAVNTLRFCSVMSITGLLIAGLAGNSTFAIIGFAIAGIG 315
A V R G L + TG ++ A A + G
Sbjct: 261 LAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASG 313


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_25670  ARGREPRESSOR  28     0.026    Bacterial arginine repressor signature.
		>ARGREPRESSOR#Bacterial arginine repressor signature.

Length = 149

Score = 27.9 bits (62), Expect = 0.026
Identities = 10/48 (20%), Positives = 20/48 (41%), Gaps = 7/48 (14%)

Query: 8 ERQALILAQLRQSGRVLAQD------LAQNFGVSEDTVRRDLREMAAR 49
+R I + + + QD + V++ TV RD++E+
Sbjct: 5 QRHIKIREIITAN-EIETQDELVDILKKDGYNVTQATVSRDIKELHLV 51


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
B0909_25675  RTXTOXIND  37     4e-04    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 37.1 bits (86), Expect = 4e-04
Identities = 12/48 (25%), Positives = 22/48 (45%)

Query: 1081 GNASHIGAPMPGVISRVFVNQGQEVKAGDVLLSIEAMKMETALHAERD 1128
G + I ++ + V +G+ V+ GDVLL + A+ E +
Sbjct: 94 GRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQS 141


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
B0909_25695  PF07520  33     0.018    Virulence protein SrfB
		>PF07520#Virulence protein SrfB

Length = 1041

Score = 33.4 bits (76), Expect = 0.018
Identities = 19/109 (17%), Positives = 35/109 (32%), Gaps = 2/109 (1%)

Query: 1757 RGRSLREAAAFDTGATLSSTDGFTLDPILS-LRRTVRVPAGKKVSVIFWTIAAPSREEVD 1815
G + R A DT + + P + + + W + +E
Sbjct: 142 TGHTHRVQIALDTALSDQDQSAHYVAPERADSEKPREFRLVSDPGAMSWFLQRLEADEDG 201

Query: 1816 KAIDRYRHPDAFAHELVHAWTRTQVQMRHVGVTSQQAAAFQHLGRYLTY 1864
A+D + E+ + R + R + F+H RYL+Y
Sbjct: 202 NAVDLQLWVSDWLKEMFLDFKRAERPGRSISE-ENLPHMFEHWARYLSY 249


46  B0909_12435  B0909_12395  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
B0909_12435-211-1.336529tetratricopeptide repeat protein
B0909_12430-113-1.447937type II secretion system F family protein
B0909_12425-214-2.372481pilus assembly protein
B0909_12420-115-2.229346CpaF family protein
B0909_12415-216-2.385322CtpF protein
B0909_12410-117-2.913612pilus assembly protein CpaD
B0909_12405224-4.665045type II and III secretion system protein family
B0909_12400326-5.761477Flp pilus assembly protein CpaB
B0909_12395334-6.835647peptidase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_12430  SYCDCHAPRONE  38     2e-05    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 37.6 bits (87), Expect = 2e-05
Identities = 19/110 (17%), Positives = 36/110 (32%)

Query: 98 AVMQQIAIANPADRDVLAAYGKAQAAAGQLEQALSTIQRAQTPDRPDWRLYSAEGAVLDQ 157
+ + + + L + Q +G+ E A Q D D R + GA
Sbjct: 23 GTIAMLNEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQA 82

Query: 158 LGRSNDARSRYRQALDLKPNDPSVLSNLGMSYVLSSDPRTAETYLQSAIA 207
+G+ + A Y + +P + + + AE+ L A
Sbjct: 83 MGQYDLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQE 132


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
B0909_12420  cloacin  30     0.014    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 30.1 bits (67), Expect = 0.014
Identities = 14/50 (28%), Positives = 24/50 (48%)

Query: 39 KRVQAAETDHANIKAARDRVQELSKRRKSMQDNLREMEKKQSEKAKKAKN 88
+ Q A+TD N +AA D + + + E KK+ +K + A+N
Sbjct: 395 LKAQRAQTDVNNKQAAFDAAAKEKSDADAALSSAMESRKKKEDKKRSAEN 444


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
B0909_12410  HTHFIS  37     2e-04    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 36.7 bits (85), Expect = 2e-04
Identities = 19/101 (18%), Positives = 36/101 (35%), Gaps = 1/101 (0%)

Query: 79 ASPTPNLIILETASAPGSLLDDLAPLAEVCDPSTRVIIVGRHNDITLYRDLIRNGISEYL 138
A+ +L++ + + D L + + P V+++ N G +YL
Sbjct: 44 AAGDGDLVVTDVVMPDENAFDLLPRIKKA-RPDLPVLVMSAQNTFMTAIKASEKGAYDYL 102

Query: 139 VAPVSMADLLGSIATIFVDPEAEPLGRNIAFIGAKGGVGSS 179
P + +L+G I +P+ P VG S
Sbjct: 103 PKPFDLTELIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRS 143


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_12400  BCTERIALGSPD  123    8e-32    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 123 bits (309), Expect = 8e-32
Identities = 58/279 (20%), Positives = 106/279 (37%), Gaps = 29/279 (10%)

Query: 212 RQVSQIVNMLTIEGEDQVTLKVTVAEVSRQVLKQLGFN---------------GSISSST 256
+ +++ L I QV ++ +AEV LG IS++
Sbjct: 331 NDLERVIAQLDIR-RPQVLVEAIIAEVQDADGLNLGIQWANKNAGMTQFTNSGLPISTAI 389

Query: 257 SNNGFEFANPSNLGNAISGASRIASGAIGSGSLNFATYLNAMEQAGVVRTLAEPSLTAIS 316
+ + + + S S A G N+A L A+ + LA PS+ +
Sbjct: 390 AGANQYNKDGTVSSSLASALSSFNGIAAGFYQGNWAMLLTALSSSTKNDILATPSIVTLD 449

Query: 317 GEQAKFYVGGDFRLPAEQEVTIDKDTGQQTITRTTDTVDYGITLNFRPVVLSPGRISLKI 376
+A F VG + + T D T+ R GI L +P + + L+I
Sbjct: 450 NMEATFNVGQEVPVLTGS-QTTSGDNIFNTVER----KTVGIKLKVKPQINEGDSVLLEI 504

Query: 377 ETNVSEPTYEGNVVTGNAGRNIPGSTYMSIRKRETSTTVELPSGGSIVIAGLVQDNIRQA 436
E VS + + + G + R + V + SG ++V+ GL+ ++
Sbjct: 505 EQEVSSVADAASSTSSDLG--------ATFNTRTVNNAVLVGSGETVVVGGLLDKSVSDT 556

Query: 437 MSGLPGISKVPIFGTLFRSKDFIRNETELVIIATPYLVR 475
+P + +P+ G LFRS ++ L++ P ++R
Sbjct: 557 ADKVPLLGDIPVIGALFRSTSKKVSKRNLMLFIRPTVIR 595


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
B0909_12390  PREPILNPTASE  44     9e-08    Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 43.6 bits (103), Expect = 9e-08
Identities = 45/164 (27%), Positives = 71/164 (43%), Gaps = 27/164 (16%)

Query: 4 AAIFLTLPLCVAFAALNDLFSMTIPNRIPLILLLSFVVVAPLTGMDWQTFAMSIAAATAV 63
L L + DL M +P+++ L LL ++ L G + + ++ A A
Sbjct: 134 TLAALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNLLGG--FVSLGDAVIGAMAG 191

Query: 64 FLVCF-------ALFAANAMGGGDAKLLTAATVWYGFNISLVEFLLAVTFLGGVLTIGIL 116
+LV + L MG GD KLL A W G+ + LL+ +G + IG++
Sbjct: 192 YLVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSS-LVGAFMGIGLI 250

Query: 117 LLRSRSQEIMAAGIPIPDSLLVAKKIPYGIGIAVAGL--LTYGD 158
LLR+ Q +K IP+G +A+AG L +GD
Sbjct: 251 LLRNHHQ---------------SKPIPFGPYLAIAGWIALLWGD 279


47  B0909_11950  B0909_11920  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
B0909_11950-313-1.494618extracellular solute-binding protein
B0909_11945-215-1.650218ABC transporter permease subunit
B0909_11940-217-1.992274sn-glycerol-3-phosphate ABC transporter permease
B0909_11935-111-0.776672sn-glycerol-3-phosphate import ATP-binding
B0909_11930-210-0.050754cupin domain-containing protein
B0909_11925-1100.778210glutathione synthase
B0909_119200100.770032ATP-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
B0909_11940  MALTOSEBP  30     0.017    Maltose binding protein signature.
		>MALTOSEBP#Maltose binding protein signature.

Length = 396

Score = 30.1 bits (67), Expect = 0.017
Identities = 84/366 (22%), Positives = 142/366 (38%), Gaps = 52/366 (14%)

Query: 5 ISMAAVAVSISATSSMAATN----ITWWHGMGGRNGEVINEVSQKFNEAQKECALTPVSK 60
++++A+ + + S++A + W +G G NG + EV +KF E +T
Sbjct: 10 LALSALTTMMFSASALAKIEEGKLVIWINGDKGYNG--LAEVGKKF-EKDTGIKVTVEHP 66

Query: 61 GSYEEALASGIAAFRSGEQPNIL-QVFDAGAATIINAKGAVIPAEDLINKAGYKFDREAF 119
EE +AA +G+ P+I+ D + A I + Y F +A
Sbjct: 67 DKLEEKFPQ-VAA--TGDGPDIIFWAHDRFGGYAQSGLLAEITPDKAFQDKLYPFTWDA- 122

Query: 120 INGVRYFYAAADGKFVGMPFNSSAPIMYINDEALKKAGVEAPKTWEEFEQVAPKLKEAGY 179
VRY +GK + P A + N + L PKTWEE + +LK G
Sbjct: 123 ---VRY-----NGKLIAYPIAVEALSLIYNKDLLPNP----PKTWEEIPALDKELKAKG- 169

Query: 180 IPLVQSQLTWQFTENFFS------RNNIQFATNNNGYDSVSDTKLKV--TDPNLVMMFDK 231
+S L + E +F+ F N YD + D + L + D
Sbjct: 170 ----KSALMFNLQEPYFTWPLIAADGGYAFKYENGKYD-IKDVGVDNAGAKAGLTFLVDL 224

Query: 232 LKDWKDKGLFAYYGAGWNDNQKPFEEGKVALWIGSSGSFGGLQKTATMPFSATFLPYWGS 291
+K+ Y A + F +G+ A+ I ++ + + + + T LP +
Sbjct: 225 IKNKHMNADTDYSIA-----EAAFNKGETAMTINGPWAWSNIDTS-KVNYGVTVLP---T 275

Query: 292 IKGAGTNSFIG--GAALFAMSGKSEAENKCVADFFQFLTSPEIQ-VFYHKATGYVAITKA 348
KG + F+G A + A S E + + ++ LT ++ V K G VA+
Sbjct: 276 FKGQPSKPFVGVLSAGINAASPNKELAKEFLENY--LLTDEGLEAVNKDKPLGAVALKSY 333

Query: 349 AYEKAK 354
E AK
Sbjct: 334 EEELAK 339


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
B0909_11935  PF06580  28     0.050    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 27.9 bits (62), Expect = 0.050
Identities = 22/145 (15%), Positives = 55/145 (37%), Gaps = 11/145 (7%)

Query: 38 SFYLEDPFGFGASFVGLANYSDAIFNPEYLNIAKFTVVFTVLVTFFSLALGLLLAVKADA 97
++ G+G + ++ +P+ + ++F + ++ +GL+L +
Sbjct: 11 YYWYCQGIGWGVYTLTGFGFASLYGSPKLHS-----MIFNIAISL----MGLVLTHAYRS 61

Query: 98 VIRGSSAYKTLLISVYAIAPPVAGLIGMMFFDQHIGPFVKMAAFLGWDMKVGLNYFDTAF 157
I+ K + + P +IGM++F + + +A + L +
Sbjct: 62 FIKRQGWLKLNMGQIILRVLPACVVIGMVWFVANTSIWRLLAFINTKPVAFTLPLALSII 121

Query: 158 AMVV--VAVWKQIPYNFIFFLSGLQ 180
VV +W + + + FF + Q
Sbjct: 122 FNVVVVTFMWSLLYFGWHFFKNYKQ 146


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
B0909_11925  PF05272  35     5e-04    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 35.0 bits (80), Expect = 5e-04
Identities = 14/33 (42%), Positives = 19/33 (57%)

Query: 33 IVLVGPSGCGKSTLLRMVAGLEDISEGTVKIGD 65
+VL G G GKSTL+ + GL+ S+ IG
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGT 631


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
B0909_11910  HTHFIS  39     5e-05    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 38.7 bits (90), Expect = 5e-05
Identities = 46/225 (20%), Positives = 71/225 (31%), Gaps = 49/225 (21%)

Query: 139 AESGPEAAWAGAGIDILAPRSLIALANHFRGTQLLSRPTPAIRANPVNLPDLADIKGQES 198
+ +A+ GA + P L L + L+ P + D + G+ +
Sbjct: 87 FMTAIKASEKGAYDYLPKPFDLTELIGIIG--RALAEPKRRPSKLEDDSQDGMPLVGRSA 144

Query: 199 A----RRALEVAAAGGHNLLMVGPPGSGKSMLAARLPSILPPLEAAELLEVSMVHSIAGQ 254
A R L L++ G G+GK ++A L H
Sbjct: 145 AMQEIYRVLARLMQTDLTLMITGESGTGKELVARAL------------------H----- 181

Query: 255 LSGGKLSDRR--PFRTPHHSATMAALIG------------GGLRAKPGEASLAHHGVLFL 300
RR PF + +A LI G G A G LFL
Sbjct: 182 ----DYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFL 237

Query: 301 DEFPEFSPQVLDALRQPLETGECIIARANHRVSYPAEIQLVAAMN 345
DE + L + L+ GE R +++++VAA N
Sbjct: 238 DEIGDMPMDAQTRLLRVLQQGE--YTTVGGRTPIRSDVRIVAATN 280



 
Contact Sachin Pundhir for Bugs/Comments.