PredictBias

identification of genomic and pathogenicity islands in prokaryotic genome
 
A) Input parameters
Genome: 454.gbk
Threshold dinucleotide bias: 2
Threshold codon bias: 4
Threshold %GC bias: 3
E-value (RPSBlast): 0.05
Genome (non-pathogenic): (none given)
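
The E-value parameter above can be applied to the hit tables further down this page. The sketch below assumes, as is conventional but not stated on this page, that the parameter is an upper cutoff on reported hits. The names `parse_evalue` and `E_VALUE_CUTOFF` are hypothetical helpers for illustration, not part of PredictBias. It also handles the abbreviated form `e-135` that appears in one hit table.

```python
def parse_evalue(text):
    """Convert an E-value string from the hit tables to a float.

    Legacy BLAST-style output abbreviates 1e-135 as "e-135", which float()
    rejects, so a leading "1" is restored first.
    """
    text = text.strip()
    if text.startswith("e"):
        text = "1" + text
    return float(text)


# Assumption: the "E-value (RPSBlast)" input parameter acts as an upper cutoff.
E_VALUE_CUTOFF = 0.05

# (locus tag, E-value string) pairs copied from hit tables on this page.
hits = [
    ("Bcenmc03_3254", "7e-23"),
    ("Bcenmc03_3308", "e-135"),
    ("Bcenmc03_3780", "0.021"),
]

# Keep only hits whose E-value does not exceed the cutoff.
kept = [(tag, parse_evalue(e)) for tag, e in hits
        if parse_evalue(e) <= E_VALUE_CUTOFF]
print(kept)
```

All three sample hits pass the assumed cutoff, which is consistent with their being listed in the results.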
 
C) Potential GIs and PAIs in NC_010515
S.No | Start | End | Bias | Virulence | Insertion elements | Prediction
1 | Bcenmc03_3253 | Bcenmc03_3267 | Y | Y | N | Pathogenicity Island (biased-composition)
LocusTag | DNBias | CDNBias | %GCBias | Product
Bcenmc03_3253 | 2 | 15 | 1.447963 | cupin 2 domain-containing protein
Bcenmc03_3254 | 1 | 14 | 1.774836 | porin
Bcenmc03_3255 | -1 | 9 | 3.296837 | squalene/phytoene synthase
Bcenmc03_3256 | -1 | 10 | 3.405162 | glycerophosphoryl diester phosphodiesterase
Bcenmc03_3257 | -1 | 9 | 3.816413 | rhodanese domain-containing protein
Bcenmc03_3258 | -1 | 10 | 3.017506 | cysteine dioxygenase type I
Bcenmc03_3259 | -1 | 11 | 3.229912 | LysR family transcriptional regulator
Bcenmc03_3260 | -2 | 11 | 3.309124 | acyl-CoA dehydrogenase domain-containing
Bcenmc03_3261 | -1 | 11 | 3.535369 | alkanesulfonate monooxygenase
Bcenmc03_3262 | -1 | 12 | 3.797034 | hypothetical protein
Bcenmc03_3263 | -2 | 12 | 3.525247 | NMT1/THI5-like domain-containing protein
Bcenmc03_3264 | -2 | 11 | 4.134743 | binding-protein-dependent transport systems
Bcenmc03_3265 | -1 | 11 | 3.801537 | ABC transporter-like protein
Bcenmc03_3266 | -2 | 10 | 3.495052 | cytochrome c class I
Bcenmc03_3267 | -1 | 11 | 3.068535 | aliphatic sulfonate ABC transporter periplasmic
ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Bcenmc03_3254 | ECOLNEIPORIN | 92 | 7e-23 | E.coli/Neisseria porin superfamily signature.
		>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature.

Length = 331

Score = 91.8 bits (228), Expect = 7e-23
Identities = 75/283 (26%), Positives = 115/283 (40%), Gaps = 36/283 (12%)

Query: 1 MKKRVAFAMTAVGLAAATAAHAQSSVTLYGIVDNGIAWQNNSSAVGATSGGHSKVQMSTG 60
MKK + A+ LAA A A + VTLYG + G+ + + GA + + V+ TG
Sbjct: 1 MKKSL----IALTLAALPVA-AMADVTLYGTIKAGVETSRSVAHNGAQA---ASVETGTG 52

Query: 61 VW-AGSRFGLKGSEDLGGGTKAIFQLEAGVNTANGSSQWTNGIFTRQAWVGLTNATYGTL 119
+ GS+ G KG EDLG G KAI+Q+E + A S W RQ+++GL +G L
Sbjct: 53 IVDLGSKIGFKGQEDLGNGLKAIWQVEQKASIAGTDSGW----GNRQSFIGL-KGGFGKL 107

Query: 120 TAGRQYTAYY--TLLSPYSPT-TWLTGYYGAHPGDIDSLDTSYRANNSLVYMSPKFYGFT 176
GR + ++P+ +L A P S+ Y SP+F G +
Sbjct: 108 RVGRLNSVLKDTGDINPWDSKSDYLGVNKIAEPEARLI---------SVRYDSPEFAGLS 158

Query: 177 VGGSYSFGGVAGATNRGSTWSAAIQYLNGPAGIAVGYQKVNNSTLGGGVWGANSTVQ--- 233
Y+ AG N ++ A Y NG + G + + V +
Sbjct: 159 GSVQYALNDNAGRHN-SESYHAGFNYKNGGFFVQYGGAYKRHHQVQENVNIEKYQIHRLV 217

Query: 234 -----NGLTAAGDGNQPAVSSINNGYATAQSQQRIAVTAGYQF 271
+ L A+ Q + Y+ Q +A T Y+F
Sbjct: 218 SGYDNDALYASVAVQQQDAKLVEENYSHNS-QTEVAATLAYRF 259
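
The statistics line that precedes each alignment on this page can be checked mechanically. The following is a minimal sketch, assuming the `Identities = n/m (p%)` layout shown above; `parse_stats` and `floor_percent` are hypothetical helper names. The reported percentages appear to be truncated rather than rounded (115/283 is about 40.6%, shown as 40%), so the sketch uses integer division.

```python
import re

# Matches each field of a statistics line such as
# "Identities = 75/283 (26%), Positives = 115/283 (40%), Gaps = 36/283 (12%)".
STAT = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_stats(line):
    """Return {name: (count, aligned_length, reported_percent)} for one line."""
    return {name: (int(n), int(total), int(pct))
            for name, n, total, pct in STAT.findall(line)}


def floor_percent(count, total):
    """Integer percentage, truncated (not rounded)."""
    return count * 100 // total


line = "Identities = 75/283 (26%), Positives = 115/283 (40%), Gaps = 36/283 (12%)"
stats = parse_stats(line)
for name, (count, total, reported) in stats.items():
    # Print the reported percentage next to the recomputed one.
    print(name, count, total, reported, floor_percent(count, total))
```

Lines without a `Gaps` field (for example `Identities = 9/33 (27%), Positives = 17/33 (51%)`) parse the same way and simply yield two entries.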


2 | Bcenmc03_3298 | Bcenmc03_3315 | Y | Y | N | Pathogenicity Island (biased-composition)
LocusTag | DNBias | CDNBias | %GCBias | Product
Bcenmc03_3298 | 2 | 9 | 1.144942 | phosphotransferase domain-containing protein
Bcenmc03_3299 | 2 | 10 | 0.763480 | TraR/DksA family transcriptional regulator
Bcenmc03_3300 | 2 | 10 | 0.759127 | undecaprenyl-phosphate glucose
Bcenmc03_3301 | 2 | 11 | 0.092318 | CRP/FNR family transcriptional regulator
Bcenmc03_3302 | -1 | 12 | 1.976583 | cyclic nucleotide-binding protein
Bcenmc03_3303 | -1 | 9 | 1.747341 | acyl carrier protein
Bcenmc03_3304 | -1 | 9 | 2.054315 | acyl-CoA dehydrogenase type 2
Bcenmc03_3305 | 0 | 9 | 1.702495 | hypothetical protein
Bcenmc03_3306 | -2 | 7 | 0.633879 | hypothetical protein
Bcenmc03_3307 | -2 | 10 | -0.446292 | Beta-mannosidase
Bcenmc03_3308 | -2 | 14 | -2.717244 | sigma-54 dependent transcriptional regulator
Bcenmc03_3309 | -3 | 18 | -3.217942 | acyltransferase-like protein
Bcenmc03_3310 | -3 | 19 | -3.564639 | mannose-1-phosphate
Bcenmc03_3311 | -2 | 23 | -4.697447 | group 1 glycosyl transferase
Bcenmc03_3312 | -3 | 21 | -4.146722 | group 1 glycosyl transferase
Bcenmc03_3313 | -3 | 21 | -3.083617 | lipopolysaccharide biosynthesis protein
Bcenmc03_3314 | -2 | 21 | -2.953791 | hypothetical protein
Bcenmc03_3315 | 0 | 21 | -3.700413 | major facilitator superfamily permease
ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Bcenmc03_3308 | HTHFIS | 392 | e-135 | FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 392 bits (1008), Expect = e-135
Identities = 148/469 (31%), Positives = 223/469 (47%), Gaps = 42/469 (8%)

Query: 33 LRKELSRRDWKVSVVAHANELRD--TSGEITCGILDLSGGHADAIGSIASTCASMRDVVW 90
L + LSR + V + ++A L +G+ + D+ +A +
Sbjct: 19 LNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDLLPR--IKKARPDL 76

Query: 91 VALVDVGQTASPNVRALLRDYCFDYVTLPASHQRIADTVGHAYGMECLFARDREQLESEE 150
LV Q +DY+ P + +G A E +
Sbjct: 77 PVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRRPSKLEDDSQDG 136

Query: 151 KGIVGTCSAMLRLFDTVRRFARTDAPVFVFGETGTGKELTAVAIHRHSERRNGPFVAVNC 210
+VG +AM ++ + R +TD + + GE+GTGKEL A A+H + +RRNGPFVA+N
Sbjct: 137 MPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINM 196

Query: 211 GAIPPHLLQSELFGYERGAFTGANARKIGYVEAANGGTLLLDEIGDLPHESQASLLRFLQ 270
AIP L++SELFG+E+GAFTGA R G E A GGTL LDEIGD+P ++Q LLR LQ
Sbjct: 197 AAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQ 256

Query: 271 ERSIHRLGGSDPVPVDVRIVSATHVDLREAMEEGRFRADLFHRLCVMRIDQPPLRARGKD 330
+ +GG P+ DVRIV+AT+ DL++++ +G FR DL++RL V+ + PPLR R +D
Sbjct: 257 QGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRAED 316

Query: 331 IELLAHHMLERFRGDARHRVRGFSTDAITALYKHDWPGNVRELINRVRRAVVMTEGRLIT 390
I L H +++ + V+ F +A+ + H WPGNVREL N VRR + +IT
Sbjct: 317 IPDLVRHFVQQAEKEGL-DVKRFDQEALELMKAHPWPGNVRELENLVRRLTALYPQDVIT 375

Query: 391 AQDLELEYCLDAASPSVA-------------------------------------DIRKS 413
+ +E E + + +
Sbjct: 376 REIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLYDRVLAE 435

Query: 414 IEREAIEIALLRTRGRVAASARELGVSRATLYRWMEAYGIERPRGTGSS 462
+E I AL TRG +A LG++R TL + + G+ R + S+
Sbjct: 436 MEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVSVYRSSRSA 484


3 | Bcenmc03_3360 | Bcenmc03_3375 | Y | Y | N | Pathogenicity Island (biased-composition)
LocusTag | DNBias | CDNBias | %GCBias | Product
Bcenmc03_3360 | -2 | 13 | 3.222575 | choline/carnitine/betaine transporter
Bcenmc03_3361 | -1 | 13 | 3.811226 | hypothetical protein
Bcenmc03_3362 | -1 | 13 | 3.590902 | type 11 methyltransferase
Bcenmc03_3363 | -2 | 13 | 3.030948 | metallophosphoesterase
Bcenmc03_3364 | 0 | 13 | 3.529783 | di-heme cytochrome c peroxidase
Bcenmc03_3365 | 2 | 13 | 1.910303 | hypothetical protein
Bcenmc03_3367 | 1 | 13 | 1.699973 | hypothetical protein
Bcenmc03_3368 | 0 | 10 | 1.580239 | hypothetical protein
Bcenmc03_3369 | 0 | 9 | 1.249093 | ArsR family transcriptional regulator
Bcenmc03_3370 | -1 | 9 | 1.481158 | major facilitator transporter
Bcenmc03_3371 | -2 | 9 | 2.065674 | salicylate hydroxylase
Bcenmc03_3372 | -3 | 10 | 2.757891 | maleylacetoacetate isomerase
Bcenmc03_3373 | -2 | 10 | 3.554668 | fumarylacetoacetate (FAA) hydrolase
Bcenmc03_3374 | 0 | 11 | 3.868175 | gentisate 1,2-dioxygenase
Bcenmc03_3375 | -2 | 10 | 3.620656 | LysR family transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Bcenmc03_3362 | PF05211 | 35 | 2e-04 | Neuraminyllactose-binding hemagglutinin
		>PF05211#Neuraminyllactose-binding hemagglutinin

Length = 260

Score = 35.4 bits (81), Expect = 2e-04
Identities = 17/92 (18%), Positives = 34/92 (36%), Gaps = 19/92 (20%)

Query: 188 FAFGDRQRIASVLSDSGWADVAIEPVDRMCV--------LPESALDDYIGRLGPVGFALL 239
F+F ++ ++ +G + + P + + L + LD G L P GF +
Sbjct: 109 FSFAQKKEGYLAVAMNGE--IVLRPDPKRTIQKKSEPGLLFSTGLDKMEGVLIPAGFVKV 166

Query: 240 EVDEATRRRVIDT---------IRAAFERYVH 262
+ E +D+ I+ F + H
Sbjct: 167 TILEPMSGESLDSFTMDLSELDIQEKFLKTTH 198


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Bcenmc03_3370 | TCRTETB | 45 | 4e-07 | Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 45.3 bits (107), Expect = 4e-07
Identities = 35/159 (22%), Positives = 60/159 (37%), Gaps = 2/159 (1%)

Query: 47 APVIRAEWGLSPAQLAPVFGAGLAGLMAGALVFGPFGDRFGRKRLLLVCVACFGIAS-AA 105
P I ++ PA V A + G V+G D+ G KRLLL + S
Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIG 96

Query: 106 SASAGGLTELIVWRFVTGLGLGGAMPNAITLTSEYCPARRRSLLVTTMFCGFTIGSALGG 165
+ LI+ RF+ G G + + + Y P R + +G +G
Sbjct: 97 FVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGP 156

Query: 166 LAAASLIEHHGWRAVLVVGGVAPLLLLPLLAWRLPESVR 204
+ + W +L++ + ++ +P L L + VR
Sbjct: 157 AIGGMIAHYIHWSYLLLI-PMITIITVPFLMKLLKKEVR 194


4 | Bcenmc03_3476 | Bcenmc03_3484 | Y | Y | N | Pathogenicity Island (biased-composition)
LocusTag | DNBias | CDNBias | %GCBias | Product
Bcenmc03_3476 | -1 | 10 | 3.045735 | LysR family transcriptional regulator
Bcenmc03_3477 | 1 | 11 | 3.174663 | major facilitator transporter
Bcenmc03_3478 | 0 | 9 | 2.894512 | methylmalonate-semialdehyde dehydrogenase
Bcenmc03_3479 | 2 | 8 | 3.255793 | 3-hydroxyisobutyrate dehydrogenase
Bcenmc03_3480 | 2 | 9 | 3.329719 | hypothetical protein
Bcenmc03_3481 | 1 | 8 | 3.168320 | gluconate 2-dehydrogenase (acceptor)
Bcenmc03_3482 | 1 | 14 | 1.739180 | 2Fe-2S iron-sulfur cluster binding
Bcenmc03_3483 | 3 | 12 | 2.505677 | monooxygenase FAD-binding
Bcenmc03_3484 | 3 | 12 | 2.596471 | Asp/Glu/hydantoin racemase
ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Bcenmc03_3477 | TCRTETB | 36 | 2e-04 | Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 36.4 bits (84), Expect = 2e-04
Identities = 28/101 (27%), Positives = 47/101 (46%), Gaps = 15/101 (14%)

Query: 84 MGGIVFGHFGDRIGRKSMLMITLLLMGVPSMIIGLIPSYDSIGYWAAALLIAMRFLQGMA 143
+G V+G D++G K +L+ +++ S+I +G+ +LLI RF+QG
Sbjct: 64 IGTAVYGKLSDQLGIKRLLLFGIIINCFGSVI-------GFVGHSFFSLLIMARFIQG-- 114

Query: 144 VGGEWGGAVLMAV------EHAPKGRKGLFGSLPQTGVGLG 178
G A++M V + GL GS+ G G+G
Sbjct: 115 AGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVG 155


5 | Bcenmc03_3509 | Bcenmc03_3533 | Y | Y | N | Pathogenicity Island (biased-composition)
LocusTag | DNBias | CDNBias | %GCBias | Product
Bcenmc03_3509 | 2 | 14 | 0.639496 | hypothetical protein
Bcenmc03_3510 | -1 | 14 | 0.281668 | aliphatic sulfonate ABC transporter periplasmic
Bcenmc03_3511 | 0 | 12 | 0.532707 | hypothetical protein
Bcenmc03_3512 | -1 | 12 | 0.809795 | LysR family transcriptional regulator
Bcenmc03_3513 | 0 | 12 | 0.720369 | putrescine transporter
Bcenmc03_3514 | 0 | 10 | 0.887536 | response regulator receiver protein
Bcenmc03_3515 | -2 | 10 | 1.695081 | ornithine decarboxylase
Bcenmc03_3516 | 2 | 11 | 2.526025 | major facilitator transporter
Bcenmc03_3517 | 0 | 10 | 1.492814 | alcohol dehydrogenase
Bcenmc03_3518 | 0 | 12 | 0.436127 | antibiotic biosynthesis monooxygenase
Bcenmc03_3519 | 0 | 13 | 0.461797 | short-chain dehydrogenase/reductase SDR
Bcenmc03_3520 | 0 | 14 | -0.090823 | LysR family transcriptional regulator
Bcenmc03_3521 | 1 | 15 | -2.039536 | xanthine permease
Bcenmc03_3522 | 4 | 31 | -4.818571 | NUDIX hydrolase
Bcenmc03_3523 | 1 | 23 | -4.376474 | hypothetical protein
Bcenmc03_3524 | 0 | 18 | -3.133618 | hypothetical protein
Bcenmc03_3525 | -1 | 15 | -3.500146 | short chain dehydrogenase
Bcenmc03_3526 | -1 | 14 | -3.952730 | hypothetical protein
Bcenmc03_3527 | -2 | 17 | -4.061964 | HxlR family transcriptional regulator
Bcenmc03_3528 | -3 | 17 | -3.872792 | amino acid permease-associated protein
Bcenmc03_3529 | -3 | 21 | -3.943381 | Nitrilase/cyanide hydratase and apolipoprotein
Bcenmc03_3530 | -2 | 25 | -4.457751 | LuxR family transcriptional regulator
Bcenmc03_3532 | 0 | 28 | -4.344996 | LacI family transcription regulator
Bcenmc03_3533 | 0 | 26 | -3.572647 | AraC family transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Bcenmc03_3516 | TCRTETB | 40 | 1e-05 | Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 40.2 bits (94), Expect = 1e-05
Identities = 31/154 (20%), Positives = 59/154 (38%), Gaps = 3/154 (1%)

Query: 26 LPAISAGLHVSIAAAGQLTTIFSAVFALAALVAASFVARVERRTALLAALGAFAAANLCA 85
LP I+ + A+ + T F F++ V ++ + LL + ++
Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIG 96

Query: 86 AASPGYAS-LFAARVLMAASCATLILVATRFAAELAPVSQRGRAIGIVFMGISASLVLGV 144
+ S L AR + A A + A P RG+A G++ ++ +G
Sbjct: 97 FVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGP 156

Query: 145 PIGMRIAEWAGWRAVFV--SIAVAALPLGIWLAR 176
IG IA + W + + I + +P + L +
Sbjct: 157 AIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLK 190


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Bcenmc03_3519 | DHBDHDRGNASE | 40 | 2e-06 | 2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 40.4 bits (94), Expect = 2e-06
Identities = 32/188 (17%), Positives = 65/188 (34%), Gaps = 7/188 (3%)

Query: 14 ILLVAASRGLGLAMAEAFLNKGWHVTGTVREGSGRTKLHDLADRFDGRLEIGTLDICEPA 73
+ A++G+G A+A ++G H+ K+ E D+ + A
Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDSA 70

Query: 74 QLAALRARLSGR--RFDMLFVNAGTTN-DPNETIGEVTIDEFVRVMITNALAPMRVIETL 130
+ + AR+ D+L AG ++ + + V T R
Sbjct: 71 AIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR--SVS 128

Query: 131 QDLVTDDGLIGAMSSGQGSVANNVTGMREVYRGSKAALNQFMRSFAARQADTRRALALMA 190
+ ++ G++ + + A Y SKAA F + A+ +++
Sbjct: 129 KYMMDRRS--GSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVS 186

Query: 191 PGWVRTEL 198
PG T++
Sbjct: 187 PGSTETDM 194


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Bcenmc03_3525 | DHBDHDRGNASE | 92 | 3e-24 | 2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 91.7 bits (227), Expect = 3e-24
Identities = 68/263 (25%), Positives = 117/263 (44%), Gaps = 25/263 (9%)

Query: 7 LQGKRVLVTGGTMGVGKAVVGLFRELGAKVLTTARTPPADTPADIFVAA----------N 56
++GK +TG G+G+AV GA + P + A +
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 57 LATVEGCEAVAEAVMANFGGVDVIVHVVGGSRSPAGGFAALSEDAWQDELNLNLLPAVRL 116
+ + + + G +D++V+V G R G +LS++ W+ ++N
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLR--PGLIHSLSDEEWEATFSVNSTGVFNA 123

Query: 117 DRALLPGMLAQRAGVVIHVTSIQHALPLPESTTAYAAAKAALSTYSKSLSKEVSPKGIRV 176
R++ M+ +R+G ++ V S +P S AYA++KAA ++K L E++ IR
Sbjct: 124 SRSVSKYMMDRRSGSIVTVGSNPAGVP-RTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 177 IRVAPGWIETEAAVAFAERLAAQAGTDYEGGKQIIMDSLG----GIPLGRPSTPGEVANL 232
V+PG ET+ + D G +Q+I SL GIPL + + P ++A+
Sbjct: 183 NIVSPGSTETD--------MQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADA 234

Query: 233 IAFLASPRAASITGVEYVIDGGT 255
+ FL S +A IT +DGG
Sbjct: 235 VLFLVSGQAGHITMHNLCVDGGA 257


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Bcenmc03_3529 | BCTERIALGSPD | 31 | 0.007 | Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 30.7 bits (69), Expect = 0.007
Identities = 9/33 (27%), Positives = 17/33 (51%)

Query: 9 SLVDGDVAHNTRKVIDTIERVDVAGGTKLIVFP 41
L+ A ++++ +ERVD AG ++ P
Sbjct: 166 VLLMTGRAAVIKRLLTIVERVDNAGDRSVVTVP 198


6 | Bcenmc03_3589 | Bcenmc03_3600 | Y | Y | N | Pathogenicity Island (biased-composition)
LocusTag | DNBias | CDNBias | %GCBias | Product
Bcenmc03_3589 | 1 | 26 | -4.660997 | type II citrate synthase
Bcenmc03_3590 | 1 | 18 | -3.380011 | hypothetical protein
Bcenmc03_3591 | 2 | 22 | -3.055372 | succinate dehydrogenase iron-sulfur subunit
Bcenmc03_3592 | 0 | 19 | -1.450864 | succinate dehydrogenase flavoprotein subunit
Bcenmc03_3593 | 1 | 17 | -1.040065 | succinate dehydrogenase, hydrophobic membrane
Bcenmc03_3594 | 0 | 14 | -0.440569 | succinate dehydrogenase, cytochrome b556
Bcenmc03_3595 | 2 | 20 | -0.825099 | GntR family transcriptional regulator
Bcenmc03_3596 | 2 | 21 | -0.960676 | malate dehydrogenase
Bcenmc03_3597 | 2 | 20 | -0.712787 | HpcH/HpaI aldolase
Bcenmc03_3598 | 3 | 19 | -1.552346 | hypothetical protein
Bcenmc03_3599 | 2 | 16 | -1.327580 | 2-methylcitrate dehydratase
Bcenmc03_3600 | 2 | 16 | -0.510482 | aconitate hydratase
ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Bcenmc03_3597 | PHPHTRNFRASE | 39 | 1e-05 | Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 39.4 bits (92), Expect = 1e-05
Identities = 30/143 (20%), Positives = 53/143 (37%), Gaps = 13/143 (9%)

Query: 84 GVRIHDFSHPH-WRDDVRIVLRASR-APAYITLPKIANAADAAEMTAFIEGTRREL---- 137
+R+ +R +R +LRAS + P IA + + A ++ + +L
Sbjct: 360 AIRL-CLEKQDIFRTQLRALLRASTYGNLKVMFPMIATLEELRQAKAIMQEEKDKLLSEG 418

Query: 138 -GIAQPIPVDVLIETHGALAQAAALAALPTVGTLSFGLMDFVSAHHGAIPDSAMRSPGQF 196
++ I V +++E A A V S G D + A + S +
Sbjct: 419 VDVSDSIEVGIMVEIPSTAVAANLFA--KEVDFFSIGTNDLIQYTMAADRMNERVSY-LY 475

Query: 197 D--HPLVRRAKLEIAAACHAHGK 217
HP + R + A H+ GK
Sbjct: 476 QPYHPAILRLVDMVIKAAHSEGK 498


7 | Bcenmc03_3665 | Bcenmc03_3677 | Y | N | N | Genomic Island
LocusTag | DNBias | CDNBias | %GCBias | Product
Bcenmc03_3665 | 0 | 9 | 3.299021 | PHB depolymerase family esterase
Bcenmc03_3666 | 2 | 12 | 4.259982 | PA-phosphatase-like protein
Bcenmc03_3667 | 1 | 12 | 3.452499 | molybdenum ABC transporter periplasmic
Bcenmc03_3668 | 1 | 13 | 4.110000 | molybdate ABC transporter inner membrane
Bcenmc03_3669 | 0 | 13 | 4.227976 | ABC transporter-like protein
Bcenmc03_3670 | 0 | 13 | 4.546024 | ModE family transcriptional regulator
Bcenmc03_3671 | 0 | 12 | 3.508420 | CHAD domain-containing protein
Bcenmc03_3672 | -1 | 12 | 3.245532 | 4-oxalocrotonate tautomerase
Bcenmc03_3673 | 0 | 10 | 3.905387 | LysR family transcriptional regulator
Bcenmc03_3674 | 0 | 10 | 3.452200 | hypothetical protein
Bcenmc03_3675 | -1 | 10 | 3.072974 | histidine kinase
Bcenmc03_3676 | -1 | 10 | 2.429924 | hypothetical protein
Bcenmc03_3677 | 0 | 11 | 3.619053 | thiamine monophosphate synthase

8 | Bcenmc03_3728 | Bcenmc03_3737 | Y | Y | N | Pathogenicity Island (biased-composition)
LocusTag | DNBias | CDNBias | %GCBias | Product
Bcenmc03_3728 | 2 | 8 | 1.214516 | GCN5-related N-acetyltransferase
Bcenmc03_3729 | 3 | 9 | 2.271805 | riboflavin synthase subunit alpha
Bcenmc03_3730 | 2 | 11 | 3.587017 | hypothetical protein
Bcenmc03_3731 | 3 | 13 | 4.167540 | chloride channel core protein
Bcenmc03_3732 | 4 | 14 | 3.526223 | chemotaxis-specific methylesterase
Bcenmc03_3733 | 4 | 14 | 3.136859 | CheA signal transduction histidine kinase
Bcenmc03_3734 | 2 | 16 | 2.843297 | CheW protein
Bcenmc03_3735 | 2 | 16 | 2.343039 | TPR repeat-containing CheR-type MCP
Bcenmc03_3736 | 2 | 18 | 0.720719 | CheW protein
Bcenmc03_3737 | 2 | 17 | 0.236470 | methyl-accepting chemotaxis sensory transducer
ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Bcenmc03_3732 | HTHFIS | 49 | 2e-08 | FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 48.7 bits (116), Expect = 2e-08
Identities = 37/174 (21%), Positives = 58/174 (33%), Gaps = 20/174 (11%)

Query: 5 IVNDLPLAVEALRRVIALRADHRVLWVATDGDEAVDFCVAHPPDVVLMDLVMPKVDGVAA 64
+ +D L + RA + V + ++ + A D+V+ D+VMP +
Sbjct: 8 VADDDAAIRTVLNQ-ALSRAGYDVR-ITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 65 TRRIMARAP-CAILIVTASVSANTSSVYEAMGAGALDAVDTPTLALGLSTDASPQALLAK 123
RI P +L+++A + T+ +A GA D P L
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAI--KASEKGAYD------------YLPKPFDLTEL 111

Query: 124 IDQIGRLLESRTTALVPPGPAPTRGQPTLVAIGASAGGPTALTALLRALPADFP 177
I IGR L G P +G SA L R + D
Sbjct: 112 IGIIGRALAEPKRRPSKLEDDSQDGMPL---VGRSAAMQEIYRVLARLMQTDLT 162


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Bcenmc03_3733 | HTHFIS | 75 | 3e-16 | FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 74.9 bits (184), Expect = 3e-16
Identities = 33/119 (27%), Positives = 54/119 (45%), Gaps = 2/119 (1%)

Query: 645 RRRVLVVDDSLTVRELERKLLEKRGYDVTVAVDGMEGWNAVRSDAFDLVVTDVDMPRMDG 704
+LV DD +R + + L + GYDV + + W + + DLVVTDV MP +
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 705 IELVTLIKGDPMLKRVPVMIVSYKDRDEDRRRGLDAGADYYLAKSSFHDEALLDAVHDL 763
+L+ IK +PV+++S ++ + + GA YL K E + L
Sbjct: 63 FDLLPRIKK--ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119


9 | Bcenmc03_3768 | Bcenmc03_3791 | Y | Y | Y | Pathogenicity Island (biased-composition)
LocusTag | DNBias | CDNBias | %GCBias | Product
Bcenmc03_3768 | 1 | 16 | -3.850939 | secretion protein HlyD family protein
Bcenmc03_3769 | 6 | 25 | -7.002287 | EmrB/QacA family drug resistance transporter
Bcenmc03_3770 | 8 | 34 | -7.998620 | RND efflux system outer membrane lipoprotein
Bcenmc03_3772 | 9 | 33 | -8.786613 | transposase mutator type
Bcenmc03_3773 | 10 | 39 | -9.615720 | hypothetical protein
Bcenmc03_3774 | 10 | 36 | -9.261444 | hypothetical protein
Bcenmc03_3775 | 7 | 27 | -7.495082 | hypothetical protein
Bcenmc03_3776 | 3 | 17 | -4.606153 | hypothetical protein
Bcenmc03_3777 | 2 | 19 | -4.509788 | type VI secretion system Vgr family protein
Bcenmc03_3778 | 0 | 19 | -3.097557 | hypothetical protein
Bcenmc03_3779 | 5 | 38 | -8.467996 | hypothetical protein
Bcenmc03_3780 | 4 | 37 | -7.718995 | LysR family transcriptional regulator
Bcenmc03_3781 | 4 | 42 | -8.279951 | major facilitator transporter
Bcenmc03_3782 | 5 | 47 | -8.785238 | amidohydrolase
Bcenmc03_3783 | 3 | 38 | -8.300965 | transposase, mutator type
Bcenmc03_3784 | 1 | 35 | -7.940451 | hypothetical protein
Bcenmc03_3785 | -2 | 18 | -2.493609 | methyl-accepting chemotaxis sensory transducer
Bcenmc03_3786 | -2 | 12 | -1.309290 | OmpA/MotB domain-containing protein
Bcenmc03_3787 | -2 | 11 | -0.845953 | MotA/TolQ/ExbB proton channel
Bcenmc03_3788 | -2 | 8 | -1.046519 | cyclic nucleotide-binding protein
Bcenmc03_3789 | -2 | 11 | -1.383930 | glutathione S-transferase domain-containing
Bcenmc03_3790 | 0 | 12 | -1.658096 | polyhydroxyalkanoate depolymerase,
Bcenmc03_3791 | 2 | 14 | -2.458758 | hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Bcenmc03_3768 | RTXTOXIND | 81 | 5e-19 | Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 81.4 bits (201), Expect = 5e-19
Identities = 50/373 (13%), Positives = 104/373 (27%), Gaps = 84/373 (22%)

Query: 62 VGGDVTVLAPKVNGFVDKILVTDNQRVKAGDV------------LVQLDARDYDAKLAQA 109
G + P N V +I+V + + V+ GDV ++ + A+L Q
Sbjct: 92 HSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQT 151

Query: 110 S-----------------------------AEVDSARSAVTELEAKQQLQFAVIGQHAAD 140
EV S + E + Q Q +
Sbjct: 152 RYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDK 211

Query: 141 KNASSAELTRAASDRVR-----------YRELVKSDAVSNQIVERADADYSKAGAAVE-- 187
K A + + + L+ A++ V + Y +A +
Sbjct: 212 KRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVY 271

Query: 188 --------------RSDAALLASKRQLDVLGAQLADARARVNTALAAQRVAALNVEYTTI 233
+ + L+ + ++L +L + + + I
Sbjct: 272 KSQLEQIESEILSAKEEYQLVTQLFKNEILD-KLRQTTDNIGLLTLELAKNEERQQASVI 330

Query: 234 RSPVDGYVGN-RTGRVGMLANVGVPLLTVVPAS-GLWIDANFKEDQLRKMRAGDRVDVAL 291
R+PV V + G + L+ +VP L + A + + + G + +
Sbjct: 331 RAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKV 390

Query: 292 DA----SSTRLHGRVDSLAPATGATFSVLPPENATGNFTKIVQRVPVRVHLDPQPGIKHV 347
+A L G+V ++ + G ++ + I
Sbjct: 391 EAFPYTRYGYLVGKVKNINLDA-------IEDQRLGLVFNVIISIEENCLSTGNKNIP-- 441

Query: 348 LRPGLSAVVTVHT 360
L G++ + T
Sbjct: 442 LSSGMAVTAEIKT 454


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Bcenmc03_3769 | TCRTETB | 95 | 5e-23 | Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 95.3 bits (237), Expect = 5e-23
Identities = 90/411 (21%), Positives = 153/411 (37%), Gaps = 27/411 (6%)

Query: 24 MCVGFFMATLDIQIVASSLRDIGGGLSASQDELSWVQTAYLIAEIIVIPMSGWLSKVMST 83
+C+ F + L+ ++ SL DI + +WV TA+++ I + G LS +
Sbjct: 19 LCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGI 78

Query: 84 RWLFAASAVGFTITSMLCGLAWDINSMILF-RGLQGALGAAMIPTVFTTAFVLFPGRQRL 142
+ L + S++ + S+++ R +QGA AA V P R
Sbjct: 79 KRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRG 138

Query: 143 IASTTIGALASLAPAIGPVIGGWITDQWSWHWLFYLNLVPGIAVAALVPRYVHIDAPDLS 202
A IG++ ++ +GP IGG I W +L L+P I + VP + + ++
Sbjct: 139 KAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLL---LIPMITIIT-VPFLMKLLKKEVR 194

Query: 203 LIKRGDYLGIALMSGFLGCLEYVLEEGPRKNWFGDDAIVICAWISGICGFLFIVHALTAK 262
+ D GI LMS + F + +S + +F+ H
Sbjct: 195 IKGHFDIKGIILMSVGIVFFML----------FTTSYSISFLIVSVLSFLIFVKHIRKVT 244

Query: 263 DPIVDLRALAVRNFGIGSLLSFVTGIGIFVTVFLTPLFLAQVRAFSSLQIG-----IALL 317
DP VD F IG L + + V + P + V S+ +IG +
Sbjct: 245 DPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTM 304

Query: 318 SVGAFQLIA-LCAYAFAARYVSMRALLVFGLICFGLGCYLYTPITHDWGWQELLLPQALR 376
SV F I + YV + + + T W + +++ L
Sbjct: 305 SVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTAS---FLLETTSW-FMTIIIVFVLG 360

Query: 377 GIGQQFSVPPIVTMALGSLPQSRLKSASGLFNLMRNLGGAIGIAVSATMLN 427
G+ F+ I T+ SL Q + L N L GIA+ +L+
Sbjct: 361 GLS--FTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLS 409


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Bcenmc03_3775 | RTXTOXINA | 32 | 0.010 | Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 32.2 bits (73), Expect = 0.010
Identities = 35/190 (18%), Positives = 63/190 (33%), Gaps = 51/190 (26%)

Query: 658 MTKGEPGFAGIGAIFSAINLRFAREELKKANRFNQTEAGIKFDNAV-GGLVAGVAQYGSS 716
+ G + I SAI+ F A+ + AG++ V G + G++QY
Sbjct: 235 LDNIGAGLDTVSGILSAISASFILSN-ADADTRTKAAAGVELTTKVLGNVGKGISQY--- 290

Query: 717 ALEQMEKAGVKLSESTAKVGRFLGIVGRFGGAIVGLVSAAI---------DAYHAYDELE 767
+ Q G L G I V+ AI D + +++E
Sbjct: 291 IIAQRAAQG-------------LSTSAAAAGLIASAVTLAISPLSFLSIADKFKRANKIE 337

Query: 768 -----------HGNILMFALYTTSALIGGAL-------------VVAAFLGSVLTLPLLL 803
G+ L+ A + + I +L + AA S++ P+
Sbjct: 338 EYSQRFKKLGYDGDSLLAAFHKETGAIDASLTTISTVLASVSSGISAAATTSLVGAPVSA 397

Query: 804 LAALIGVLIN 813
L + +I+
Sbjct: 398 LVGAVTGIIS 407


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Bcenmc03_3780 | PF08280 | 29 | 0.021 | M protein trans-acting positive regulator
		>PF08280#M protein trans-acting positive regulator

Length = 530

Score = 29.4 bits (66), Expect = 0.021
Identities = 16/68 (23%), Positives = 22/68 (32%), Gaps = 15/68 (22%)

Query: 9 LQCLVAF-EAAVRHASFTKAAAELHLTQSAISRQIQQLEEFLGRSLFVREHRSLRL---- 63
LQ L + T A L+ S+ R + L L R+ L
Sbjct: 122 LQLLAFLIKNGSHSRPLTDFARSHFLSNSSAYRMREALIPLL---------RNFELKLSK 172

Query: 64 -TIAGEQY 70
I GE+Y
Sbjct: 173 NKIVGEEY 180


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Bcenmc03_3781 | TCRTETA | 51 | 4e-09 | Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 51.0 bits (122), Expect = 4e-09
Identities = 48/286 (16%), Positives = 99/286 (34%), Gaps = 30/286 (10%)

Query: 82 VLGVYADKVGRKAALSLTILLMAAGTALIGIAPTYEQAGIAAPLMIVVARLLQGFSAGGE 141
VLG +D+ GR+ L +++ A A++ AP ++ + R++ G + G
Sbjct: 62 VLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLW--------VLYIGRIVAGIT-GAT 112

Query: 142 MSGATAFLTEYAPPEKRAYYSSWIQSSIGFAVLLGAATGTFVTTSLDTQALHSWGWRLPF 201
+ A A++ + ++RA + ++ + GF ++ G G + + PF
Sbjct: 113 GAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGL---------MGGFSPHAPF 163

Query: 202 ----LLGIIVGPVGYFI--RSHIDETPAFSAVESQAKESSPLKEVLHTYPRETFASFSMV 255
L + G F+ SH E S + F M
Sbjct: 164 FAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQ 223

Query: 256 ILWTVCTYVLLFYMPTYSVRTLHL-PQSTGFTAGMVGGLMIMCCSPIVGRLADAWGRRVF 314
++ V + + + H + G + G L + + I G +A G R
Sbjct: 224 LVGQVPAALWVI----FGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRA 279

Query: 315 LSGSALAILVLAWPMFSWINHAPGFASLIVFQAVFGVLIATYTGPI 360
L +A + + ++ ++V A G+ + +
Sbjct: 280 LMLGMIAD-GTGYILLAFATRGWMAFPIMVLLASGGIGMPALQAML 324



Score = 29.0 bits (65), Expect = 0.038
Identities = 16/45 (35%), Positives = 27/45 (60%), Gaps = 4/45 (8%)

Query: 293 LMIMCCSPIVGRLADAWGRRVF----LSGSALAILVLAWPMFSWI 333
LM C+P++G L+D +GRR L+G+A+ ++A F W+
Sbjct: 54 LMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWV 98


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Bcenmc03_3786 | OMPADOMAIN | 64 | 4e-14 | OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 63.8 bits (155), Expect = 4e-14
Identities = 45/166 (27%), Positives = 66/166 (39%), Gaps = 34/166 (20%)

Query: 67 GGAATAAVDTAKADPAPPWLALLDSLKSNGRISLVKAPHGVEIGIDAKILFNVGDARLLP 126
G A V PAP V+ H + + +LFN A L P
Sbjct: 192 GQGEAAPVVAPAPAPAPE----------------VQTKHFT---LKSDVLFNFNKATLKP 232

Query: 127 DSSPVLNQIAQALSAH--TTGDILVEGHTDSVPIANAKYESNWELSSARAGSVVRYLSER 184
+ L+Q+ LS G ++V G+TD I + Y N LS RA SVV YL +
Sbjct: 233 EGQAALDQLYSQLSNLDPKDGSVVVLGYTDR--IGSDAY--NQGLSERRAQSVVDYLISK 288

Query: 185 GVAPHRLAAIGRADTQPLVAGGDAGSRAR---------NRRVTIFV 221
G+ +++A G ++ P+ + R +RRV I V
Sbjct: 289 GIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIEV 334


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Bcenmc03_3788 | SECA | 32 | 0.005 | SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 32.2 bits (73), Expect = 0.005
Identities = 32/206 (15%), Positives = 69/206 (33%), Gaps = 34/206 (16%)

Query: 231 RVIDRAQQVADHDLDVSINSELELIELADLESLIGVDKTLLLNKLSNGEIAEEVFAFRKS 290
+ + V +D I + L E+ D+ L K L E ++ +
Sbjct: 675 TINSIREDVFKATIDAYIPPQ-SLEEMWDIPGLQERLKNDFDLDLPIAEWLDKEPELHEE 733

Query: 291 KVLNYVEQKGV-AYFSRRNGRRGGDVRALE--------DLVWIDQRTLFDVISAFDTYDL 341
+ + + + Y + +R E D +W + +
Sbjct: 734 TLRERILAQSIEVYQRKEEVVGAEMMRHFEKGVMLQTLDSLWKEH-----------LAAM 782

Query: 342 AKLIANLDDRAVADK------------LFSVMTEA-RRNEVSWVMRRELKLDPVEIDEIE 388
L + R A K +F+ M E+ + +S + + ++++ + +
Sbjct: 783 DYLRQGIHLRGYAQKDPKQEYKRESFSMFAAMLESLKYEVISTLSKVQVRMPEEVEELEQ 842

Query: 389 QRVLEAVRVAKAPAAGATAVASASAS 414
QR +EA R+A+ SA+A+
Sbjct: 843 QRRMEAERLAQMQQLSHQDDDSAAAA 868


10 | Bcenmc03_3877 | Bcenmc03_3893 | Y | Y | N | Pathogenicity Island (biased-composition)
LocusTag | DNBias | CDNBias | %GCBias | Product
Bcenmc03_3877 | 2 | 19 | -1.838457 | RNA-directed DNA polymerase
Bcenmc03_3878 | 0 | 16 | -1.288037 | hypothetical protein
Bcenmc03_3879 | 1 | 12 | 0.188087 | GCN5-related N-acetyltransferase
Bcenmc03_3880 | 0 | 13 | 0.648704 | AraC family transcriptional regulator
Bcenmc03_3881 | 0 | 10 | 0.804506 | nuclear transport factor 2
Bcenmc03_3882 | -2 | 11 | 1.962131 | ThiJ/PfpI domain-containing protein
Bcenmc03_3883 | -1 | 12 | 2.905210 | short-chain dehydrogenase/reductase SDR
Bcenmc03_3884 | -1 | 12 | 3.252280 | 6-aminohexanoate-dimer hydrolase
Bcenmc03_3885 | -2 | 11 | 2.612235 | transport-associated
Bcenmc03_3886 | -1 | 12 | 3.120099 | porin
Bcenmc03_3887 | -1 | 12 | 3.474440 | major facilitator transporter
Bcenmc03_3888 | -1 | 11 | 3.402906 | L-carnitine dehydratase/bile acid-inducible
Bcenmc03_3889 | 0 | 12 | 3.226679 | acyl-CoA dehydrogenase domain-containing
Bcenmc03_3890 | 0 | 10 | 3.795437 | AMP-dependent synthetase and ligase
Bcenmc03_3891 | 0 | 11 | 4.420900 | LysR family transcriptional regulator
Bcenmc03_3892 | 0 | 12 | 3.765218 | major facilitator transporter
Bcenmc03_3893 | 1 | 11 | 3.330663 | 3-hydroxybutyryl-CoA dehydrogenase
ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Bcenmc03_3879 | SACTRNSFRASE | 41 | 3e-07 | Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 41.5 bits (97), Expect = 3e-07
Identities = 16/52 (30%), Positives = 21/52 (40%), Gaps = 1/52 (1%)

Query: 107 QGTGAGRALFDAALEYLAETRPGPVWLGVWSGNAKAIAFYEKAGFKRVGTYD 158
+ G G AL A+E+ E + L N A FY K F +G D
Sbjct: 102 RKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHF-IIGAVD 152


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Bcenmc03_3883 | DHBDHDRGNASE | 107 | 2e-30 | 2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 107 bits (269), Expect = 2e-30
Identities = 66/253 (26%), Positives = 113/253 (44%), Gaps = 9/253 (3%)

Query: 3 LKSKSALITGGTSGIGLATAKRFIAEGARVAVTGRDESVFERVKAEL---GEHAVVLKGD 59
++ K A ITG GIG A A+ ++GA +A + E+V + L HA D
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 60 VRSLDDMRVIAGEVNERFGGLDVVFANAGWAFPSAVNDIDDTLYNDIMDVNVKGVVFTLQ 119
VR + I + G +D++ AG P ++ + D + VN GV +
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 120 AALPYL--REGSSVILNTSFVAQTGKHGISLTAAAKAAVRSLARSWSYEFLDRKIRFNAI 177
+ Y+ R S++ S A + ++ A++KAA + E + IR N +
Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185

Query: 178 APGAIDTPL-LSKWGKPD---EWVRDRKAEFAETIPVGRMGHADDIAYAALYLASDESSF 233
+PG+ +T + S W + + ++ F IP+ ++ DIA A L+L S ++
Sbjct: 186 SPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGH 245

Query: 234 VVGTELVVDGGAS 246
+ L VDGGA+
Sbjct: 246 ITMHNLCVDGGAT 258


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Bcenmc03_3886 | ECOLNEIPORIN | 69 | 4e-15 | E.coli/Neisseria porin superfamily signature.
		>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature.

Length = 331

Score = 68.7 bits (168), Expect = 4e-15
Identities = 73/350 (20%), Positives = 123/350 (35%), Gaps = 38/350 (10%)

Query: 1 MKKMIAATLVGALCASAHAQSSVTLFGRIGGGVRWVNGLPG------GSQIGFNNIIAGN 54
MKK + A + AL +A A VTL+G I GV + + G + G+
Sbjct: 1 MKKSLIALTLAALPVAAMAD--VTLYGTIKAGVETSRSVAHNGAQAASVETGTGIVDLGS 58

Query: 55 EFGIRGREDLGNGLKALFVLDGAFNSGTGALKTSGTLFSQAAYVGLAGDFGRLTLGRQFN 114
+ G +G+EDLGNGLKA++ ++ T ++ +++GL G FG+L +GR +
Sbjct: 59 KIGFKGQEDLGNGLKAIWQVE----QKASIAGTDSGWGNRQSFIGLKGGFGKLRVGRLNS 114

Query: 115 VAEDLGIALDPLGGRGQSLAVEPGVLFDGNFFTLDSRFNNTIKYLG-QVGGLRFGANYSP 173
V +D G ++P + N +++Y + GL Y+
Sbjct: 115 VLKDTG-DINPWDSK--------SDYLGVNKIAEPEARLISVRYDSPEFAGLSGSVQYAL 165

Query: 174 GGVAGRSHAGTSYSTAAMYTYSDLMGGVSYAKTYSPDGSQSA--QTIQA-GGTLQLGRAR 230
AGR + SY Y A ++ + Q
Sbjct: 166 NDNAGRHN-SESYHAGFNYKNGGFFVQYGGAYKRHHQVQENVNIEKYQIHRLVSGYDNDA 224

Query: 231 FYLSYASLNVRANKAGAPSRRDDIPSFGVVAQPLPSV-QLSAAAFYDRARNLGNVSGADG 289
Y S A + K + + V A ++ Y G+ +
Sbjct: 225 LYASVAV-QQQDAKLVEENYSHN-SQTEVAATLAYRFGNVTPRVSYAHGFK-GSFDATNY 281

Query: 290 HKLTTYAI--AEYFLSKRTELYVE---IDRNGLTGAYMRDPATIAALNLR 334
+ + AEY SKRT V + ++ +T + LR
Sbjct: 282 NNDYDQVVVGAEYDFSKRTSALVSAGWLQEGKGESKFV---STAGGVGLR 328


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
Bcenmc03_3892  TCRTETA  33     0.002    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 32.9 bits (75), Expect = 0.002
Identities = 58/351 (16%), Positives = 119/351 (33%), Gaps = 42/351 (11%)

Query: 85 VFGHVGDRYGRKASLVWTLLIMGASTFAIGLLPTYAQVGLWAPAALVVLRLLQGIASGGE 144
V G + DR+GR+ L+ +L + P LW L + R++ GI +G
Sbjct: 62 VLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPF-----LW---VLYIGRIVAGI-TGAT 112

Query: 145 WGGGVLMISENAPPEQRGYYAAWSQLGVGGGFVLSSAA--FLAAQALPDDAFRTWGWRLP 202
I++ ++R + + G G V + + P
Sbjct: 113 GAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGG----------FSPHAP 162

Query: 203 FLASVAI----FAIGIYIRR--HLPESRDFEQAGKRGAHAHLPIVECIRRHPKEILLAMG 256
F A+ A+ F G ++ H E R + A + + +
Sbjct: 163 FFAAAALNGLNFLTGCFLLPESHKGERRPLRREAL-NPLASFRWARGMTVVAALMAVFFI 221

Query: 257 LRVAENGGAYIFLAFSLVYGKYLGIPNGTMLTGVMIAMIVEMGAMLAWGRLSDRIGRKPV 316
+++ A +++ F + G L I + + G ++ R+G +
Sbjct: 222 MQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQA--MITGPVAARLGERRA 279

Query: 317 YLIGALSLVACAFPFFWLLDTRVTPLVWLALTVGTAVSHGAMIGTLPALVGELFSTEV-- 374
++G ++A + L + + + + ++ G + +PAL + S +V
Sbjct: 280 LMLG---MIADGTGYILLAFATRGWMAFPIMVL---LASGGIG--MPALQA-MLSRQVDE 330

Query: 375 RYSGVALGHEVASIFAGGM-SPLIATALLARYHASWPVSLFLIALGLVTVA 424
G G A + PL+ TA+ A +W ++ L +
Sbjct: 331 ERQGQLQGSLAALTSLTSIVGPLLFTAIYAASITTWNGWAWIAGAALYLLC 381


11  Bcenmc03_3903  Bcenmc03_3923  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias    %GCBias  Product
Bcenmc03_3903       1       11   3.581212  4-oxalocrotonate tautomerase
Bcenmc03_3904       1       11   3.294268  PAS/PAC sensor signal transduction histidine
Bcenmc03_3905       1       12   3.168658  acyl-CoA dehydrogenase type 2
Bcenmc03_3906       1       13   2.439594  hypothetical protein
Bcenmc03_3907      -1       11   3.299775  hypothetical protein
Bcenmc03_3908       1       11   3.423075  amidase
Bcenmc03_3909       1       11   2.603580  short-chain dehydrogenase/reductase SDR
Bcenmc03_3910       0       13   2.442329  two component LuxR family transcriptional
Bcenmc03_3911       1       12   2.160273  4-hydroxyphenylacetate 3-monooxygenase,
Bcenmc03_3912       2       12   2.134554  major facilitator transporter
Bcenmc03_3913       0       11   1.226030  hypothetical protein
Bcenmc03_3914      -3        9   1.308880  AsnC family transcriptional regulator
Bcenmc03_3915      -2        6   2.014173  hypothetical protein
Bcenmc03_3916      -3        6   2.005618  hypothetical protein
Bcenmc03_3917      -3        6   2.344174  GCN5-related N-acetyltransferase
Bcenmc03_3918       1        8   3.153271  AraC family transcriptional regulator
Bcenmc03_3919       2       10   3.194776  glucose-methanol-choline oxidoreductase
Bcenmc03_3920       2       10   3.246074  alpha/beta hydrolase domain-containing protein
Bcenmc03_3921       0        9   3.577486  addiction module killer protein
Bcenmc03_3922       1        9   2.953648  putative transcriptional regulator
Bcenmc03_3923       2       10   2.679609  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_3909  DHBDHDRGNASE  88     6e-23    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 88.2 bits (218), Expect = 6e-23
Identities = 64/240 (26%), Positives = 100/240 (41%), Gaps = 18/240 (7%)

Query: 25 RVVLVTGAARGLGAVIAERFHAAGYRVALADIAADAIHAHARDLDPSGERAIALPLDVTS 84
++ +TGAA+G+G +A + G +A D + + L A A P DV
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 85 KRDFEAARDALVARW----GTIDVLVNNAGASKVVPAMEITAEQFDQVIDVNLRSVLFGC 140
AA D + AR G ID+LVN AG + ++ E+++ VN V
Sbjct: 69 S----AAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNAS 124

Query: 141 QVFGQYFAQRGAGRIVNIASLAGQNGGSATGAHYAAAKGGTLTLTKVFARDLAAQGVTVN 200
+ +Y R +G IV + S ++ A YA++K + TK +LA + N
Sbjct: 125 RSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAA-YASSKAAAVMFTKCLGLELAEYNIRCN 183

Query: 201 AISPGPLDLPIVY---------ESVAPEKLRQVLASLPGGKLGSAGFVADAAVLLASGDA 251
+SPG + + + E V L +P KL +ADA + L SG A
Sbjct: 184 IVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQA 243


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
Bcenmc03_3910  HTHFIS  104    3e-28    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 104 bits (260), Expect = 3e-28
Identities = 36/150 (24%), Positives = 69/150 (46%)

Query: 4 PAPIVYIVDDDSGMRTSLAWLLESVGIASEGFANAADFLARFDVNLPACLVLDVRMPEKS 63
+ + DDD+ +RT L L G +NAA +V DV MP+++
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 64 GFDVQAELNARGATLPVIFVSGHGDIPMSVRALQNGAIDFVEKPYNSQQMLERVQRALRL 123
FD+ + LPV+ +S +++A + GA D++ KP++ +++ + RAL
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 124 AQQRHAVSQRHRELRQRLDALTAREKEVLR 153
++R + + + L +A +E+ R
Sbjct: 122 PKRRPSKLEDDSQDGMPLVGRSAAMQEIYR 151


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_3917  SACTRNSFRASE  31     9e-04    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 31.5 bits (71), Expect = 9e-04
Identities = 12/55 (21%), Positives = 21/55 (38%), Gaps = 1/55 (1%)

Query: 84 VSEAARGRGVARLMCEHSQQVARERGFLAMQFNSVVATNEVAVALWQKLGFEIVG 138
V++ R +GV + + + A+E F + N A + K F I
Sbjct: 97 VAKDYRKKGVGTALLHKAIEWAKENHFCGLML-ETQDINISACHFYAKHHFIIGA 150


12  Bcenmc03_3934  Bcenmc03_3945  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias    %GCBias  Product
Bcenmc03_3934       0       13   3.078388  ABC transporter-like protein
Bcenmc03_3935       1       11   3.467455  ABC transporter-like protein
Bcenmc03_3936       0       11   2.966019  inner-membrane translocator
Bcenmc03_3937       1       10   2.505386  extracellular ligand-binding receptor
Bcenmc03_3938       2       10   2.673604  LysR family transcriptional regulator
Bcenmc03_3939       2       10   1.403292  monooxygenase FAD-binding
Bcenmc03_3940       2        8   1.846144  hypothetical protein
Bcenmc03_3941       0       10   2.069713  peroxidase-like protein
Bcenmc03_3942      -1        9   2.498621  AsnC family transcriptional regulator
Bcenmc03_3943      -1        9   2.674931  ornithine cyclodeaminase
Bcenmc03_3944      -1        9   2.840792  amidinotransferase
Bcenmc03_3945      -1        9   3.212249  molecular chaperone-like protein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_3942  PHPHTRNFRASE  27     0.030    Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 27.1 bits (60), Expect = 0.030
Identities = 20/65 (30%), Positives = 32/65 (49%), Gaps = 9/65 (13%)

Query: 2 ITLDDVD---RQLIALLRDNAR------LPVVALAKELRVARATVQNRLTRLEKNGVIVG 52
+ L+ D QL ALLR + P++A +ELR A+A +Q +L GV V
Sbjct: 363 LCLEKQDIFRTQLRALLRASTYGNLKVMFPMIATLEELRQAKAIMQEEKDKLLSEGVDVS 422

Query: 53 YTVRL 57
++ +
Sbjct: 423 DSIEV 427


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_3945  SHAPEPROTEIN  43     5e-06    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 42.8 bits (101), Expect = 5e-06
Identities = 59/279 (21%), Positives = 99/279 (35%), Gaps = 88/279 (31%)

Query: 1 MKRYTVGIDLGTSNTVVAYVEAGSDAIRVFDVEQLVGPGAMAAQPLLPSVRYHPAAGELP 60
M + IDLGT+NT++ YV+ + PSV
Sbjct: 8 MFSNDLSIDLGTANTLI-YVKGQGIVLNE------------------PSV---------- 38

Query: 61 PDALRLPWAADPKVNTKANAADAPAAV--IGRYARMLGAQVPGRLVSSAKSWLSHAAVDR 118
V + + A +P +V +G A+ + + PG +
Sbjct: 39 -------------VAIRQDRAGSPKSVAAVGHDAKQMLGRTPGNIA-------------- 71

Query: 119 LAAILPWGAADGVDKVSPVDASASYLAHV--RDAWNARFPDAPLAQQDVILTVPASFDDG 176
AI P DGV ++ + L H + N+ +P V++ VP
Sbjct: 72 --AIRP--MKDGV--IADFFVTEKMLQHFIKQVHSNSFMRPSP----RVLVCVPVGATQV 121

Query: 177 ARALTVEAARRAKLPALRLLEEPQAAFYDWLYGQRDMLRDTFTAARRVLICDVGGGTTDL 236
R E+A+ A + L+EEP AA G + A ++ D+GGGTT++
Sbjct: 122 ERRAIRESAQGAGAREVFLIEEPMAAAIG--AGLP------VSEATGSMVVDIGGGTTEV 173

Query: 237 TLVDVAPGDDGEPTFTRVGVGNHLMLGGDNMDLALARLV 275
++ + V + + +GGD D A+ V
Sbjct: 174 AVI----------SLNGVVYSSSVRIGGDRFDEAIINYV 202


13  Bcenmc03_4020  Bcenmc03_4039  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias    %GCBias  Product
Bcenmc03_4020      -1       22  -3.337146  GntR family transcriptional regulator
Bcenmc03_4021      -1       19  -4.120685  alpha/beta hydrolase fold protein
Bcenmc03_4022      -1       16  -3.644423  hypothetical protein
Bcenmc03_4023      -1       15  -3.938533  LysR family transcriptional regulator
Bcenmc03_4024       0       15  -3.452414  GCN5-related N-acetyltransferase
Bcenmc03_4025       0       12  -3.132301  GCN5-related N-acetyltransferase
Bcenmc03_4026       1       11  -3.144888  malic enzyme
Bcenmc03_4027       1       13  -4.044610  LysR family transcriptional regulator
Bcenmc03_4028       3       15  -4.690028  hypothetical protein
Bcenmc03_4029       4       15  -3.603934  GntR family transcriptional regulator
Bcenmc03_4030       0       16  -2.297499  extracellular ligand-binding receptor
Bcenmc03_4031       1       17  -1.559726  hypothetical protein
Bcenmc03_4032       3       19   0.123740  chlorinating enzyme
Bcenmc03_4033       5       19   1.747505  hypothetical protein
Bcenmc03_4034       4       18   0.988280  thioesterase
Bcenmc03_4035       2       17   0.229464  amino acid adenylation domain-containing
Bcenmc03_4036       3       16  -0.998675  YbaK/prolyl-tRNA synthetase associated region
Bcenmc03_4037       1       19  -1.819788  acyl-CoA dehydrogenase domain-containing
Bcenmc03_4038       1       21  -3.298542  sodium/hydrogen exchanger
Bcenmc03_4039       0       25  -4.652632  glutathione S-transferase domain-containing
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4020  BCTLIPOCALIN  30     0.008    Bacterial lipocalin signature.
		>BCTLIPOCALIN#Bacterial lipocalin signature.

Length = 171

Score = 30.4 bits (68), Expect = 0.008
Identities = 22/91 (24%), Positives = 34/91 (37%), Gaps = 29/91 (31%)

Query: 287 DGALERHLPRITAAYRRKCDAMCDALRDGFGDAIEFHRPEGGMFVWARLGAVSTDVLLQQ 346
D + ER L ++TA YR + D L G+ E G + ++
Sbjct: 45 DHSFERGLSQVTAEYRVRNDGGISVLNRGYS-------EEKGEW--------------KE 83

Query: 347 AIANKIVFVPGKAFFADNVDAASLRLSFAAP 377
A GKA+F + L++SF P
Sbjct: 84 A--------EGKAYFVNGSTDGYLKVSFFGP 106


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4024  SACTRNSFRASE  31     6e-04    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 31.5 bits (71), Expect = 6e-04
Identities = 17/59 (28%), Positives = 30/59 (50%), Gaps = 9/59 (15%)

Query: 67 VVDIAVLPIHQKKGVGDLIMRALMDYIHENAP-----PTAYVSLMADHGTPKFYERYGF 120
+ DIAV ++KKGVG ++ +++ EN T +++ A H FY ++ F
Sbjct: 92 IEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACH----FYAKHHF 146


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4025  SACTRNSFRASE  40     8e-07    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 39.9 bits (93), Expect = 8e-07
Identities = 11/63 (17%), Positives = 23/63 (36%), Gaps = 3/63 (4%)

Query: 78 VDAIFVRPSHMGRGIGRTMLRFLEALAAEHGVVEMRLDATLNAAP---FYRSCGWTGDSI 134
++ I V + +G+G +L A E+ + L+ FY + ++
Sbjct: 92 IEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFIIGAV 151

Query: 135 STY 137
T
Sbjct: 152 DTM 154


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
Bcenmc03_4027  PF05043  31     0.005    Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 31.5 bits (71), Expect = 0.005
Identities = 15/55 (27%), Positives = 31/55 (56%)

Query: 22 KIRHLVLLLQIQQHGSLTRVAEHMASSQPAVTNALSELESMFGTPLFERSSRGMR 76
++ L LL + ++ + +AE + ++ AV + LS ++S F +F S+ G+R
Sbjct: 12 QLELLELLFEHKRWFHRSELAELLNCTERAVKDDLSHVKSAFPDLIFHSSTNGIR 66


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
Bcenmc03_4035  PF07212  31     0.011    Hyaluronoglucosaminidase
		>PF07212#Hyaluronoglucosaminidase

Length = 336

Score = 30.8 bits (69), Expect = 0.011
Identities = 17/39 (43%), Positives = 24/39 (61%), Gaps = 5/39 (12%)

Query: 146 ALPVSIDALREQGRREAPHGRATAASIAYINFTSGSTGQ 184
A +SID +++Q G+ TAA YIN TSG+TG+
Sbjct: 238 AAALSIDIVKKQK-----GGKGTAAQGIYINSTSGTTGK 271


14  Bcenmc03_4070  Bcenmc03_4081  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias    %GCBias  Product
Bcenmc03_4070       2       10   1.541078  major facilitator transporter
Bcenmc03_4071       1       11   0.030406  AraC family transcriptional regulator
Bcenmc03_4072       0       17  -3.605973  methylated-DNA--protein-cysteine
Bcenmc03_4073      -2       24  -4.221991  AraC family transcriptional regulator
Bcenmc03_4074      -2       28  -5.279637  3-methyl-2-oxobutanoate
Bcenmc03_4075      -1       31  -5.673046  AsnC family transcriptional regulator
Bcenmc03_4076       0       31  -5.918571  glucosamine--fructose-6-phosphate
Bcenmc03_4077       3       31  -6.337396  hypothetical protein
Bcenmc03_4078       3       27  -4.239440  hypothetical protein
Bcenmc03_4079       2       23  -3.729835  histidine kinase
Bcenmc03_4080       3       17  -1.699390  two component transcriptional regulator
Bcenmc03_4081       2       12  -1.221873  MltA-interacting MipA family protein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
Bcenmc03_4070  TCRTETB  43     2e-06    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 42.6 bits (100), Expect = 2e-06
Identities = 37/163 (22%), Positives = 67/163 (41%), Gaps = 11/163 (6%)

Query: 20 VCALLFFATVINYMDRQILGLLAPMLQHDIGWSQVQYGRIVMAFSAFYALGLLGFGRIVD 79
+C L FF+ ++ +L + P + +D + AF +++G +G++ D
Sbjct: 19 LCILSFFSV----LNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSD 74

Query: 80 WLGTRVSYAVAMLVW---SIAAMLHAAVGSVTGFAFVRALLGIGEGGNFPAAIK-TTAEW 135
LG + +++ S+ + + S+ A R + G G FPA + A +
Sbjct: 75 QLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMA--RFIQGAG-AAAFPALVMVVVARY 131

Query: 136 FPRRERALATGIFNSGANIGAVFAPAIIPAIAVAYGWRAAFVI 178
P+ R A G+ S +G PAI IA W +I
Sbjct: 132 IPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLI 174


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
Bcenmc03_4080  HTHFIS  75     5e-18    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 75.3 bits (185), Expect = 5e-18
Identities = 36/164 (21%), Positives = 69/164 (42%), Gaps = 7/164 (4%)

Query: 2 PRVAIVEDHERLAGLLSQALAAAGIESDRFGNAREAAYGVDRADYALLIIDRGLPDGDGL 61
+ + +D + +L+QAL+ AG + NA + D L++ D +PD +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 AFLRTLRAAGRMMPCLMLTARDALHDRIDGLESGADDYVTKPFEMSELVARVRTLM---- 117
L ++ A +P L+++A++ I E GA DY+ KPF+++EL+ + +
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 118 -RRPALLTTLVASFADVTVDPPQRAMRCGDRTVLLAPAELQIML 160
R L V + + L +L +M+
Sbjct: 124 RRPSKLEDDSQDGMPLVGRSAAMQEIY--RVLARLMQTDLTLMI 165


15  Bcenmc03_4176  Bcenmc03_4204  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias    %GCBias  Product
Bcenmc03_4176      -2       23  -3.418060  ABC transporter-like protein
Bcenmc03_4177       1       37  -5.558676  hypothetical protein
Bcenmc03_4178       0       33  -5.934236  helix-turn-helix domain-containing protein
Bcenmc03_4179      -1       23  -3.543974  hypothetical protein
Bcenmc03_4180      -1       18  -3.333248  hypothetical protein
Bcenmc03_4181      -1       11  -1.217748  type 11 methyltransferase
Bcenmc03_4182      -2       12  -1.444856  PilT domain-containing protein
Bcenmc03_4183       1       19  -2.334656  prevent-host-death family protein
Bcenmc03_4184       2       22  -2.369639  hypothetical protein
Bcenmc03_4185       2       32  -3.802240  hypothetical protein
Bcenmc03_4186       2       37  -3.540315  redoxin domain-containing protein
Bcenmc03_4187       3       43  -5.654220  hypothetical protein
Bcenmc03_4188       3       45  -5.834364  putative DNA relaxase/nickase, TraS/VirD2-like
Bcenmc03_4189       3       43  -6.091614  plasmid-like protein
Bcenmc03_4190       2       43  -6.141458  hypothetical protein
Bcenmc03_4191       2       42  -6.536730  primase 2
Bcenmc03_4192       2       43  -7.608861  TRAG family protein
Bcenmc03_4193       2       42  -7.562962  hypothetical protein
Bcenmc03_4194       3       43  -7.464177  P-type DNA transfer ATPase VirB11
Bcenmc03_4195       4       44  -8.118796  conjugation TrbI family protein
Bcenmc03_4196       4       44  -8.513083  conjugal transfer protein TrbG/VirB9/CagX
Bcenmc03_4197       5       46  -9.125000  VirB8 family protein
Bcenmc03_4198       5       47  -9.033764  P-type DNA transfer protein VirB5
Bcenmc03_4199       5       48  -8.781059  TrbL/VirB6 plasmid conjugal transfer protein
Bcenmc03_4200       5       50  -9.488776  type IV secretion/conjugal transfer ATPase
Bcenmc03_4201       3       44  -7.746951  type IV secretory pathway VirB3 family protein
Bcenmc03_4202       2       36  -6.064812  conjugal transfer protein TrbC
Bcenmc03_4203       0       28  -3.790636  lytic transglycosylase catalytic
Bcenmc03_4204      -1       20  -3.040024  transposase IS3/IS911 family protein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits         Score  E-value  Comments
Bcenmc03_4196  TYPE4SSCAGX  42     2e-06    Type IV secretion system CagX conjugation protein si...
		>TYPE4SSCAGX#Type IV secretion system CagX conjugation protein signature.

Length = 522

Score = 41.7 bits (97), Expect = 2e-06
Identities = 45/192 (23%), Positives = 68/192 (35%), Gaps = 55/192 (28%)

Query: 83 DHDVYLKPKLAAHDTNLIVRTDRRSYSFDLLV----------LPLKERFGNAHEMYRV-- 130
D+ + L P +A TNL+VRT++ Y F L + L +K + HE+ V
Sbjct: 265 DNSIELSPSDSAWRTNLVVRTNKALYQFILRIAQKDNFASAYLTVKLEYPQRHEVSSVIE 324

Query: 131 ------------------------SFVYPDTAASDVSI-------------------AAR 147
+++ AS+ I A
Sbjct: 325 EELKKREEAKRQRELIKQENLNTTAYINRVMMASNEQIINKEKIREEKQKIILDQAKALE 384

Query: 148 LACLQKRLSQPSVVRNAAYSMQVMPHAEDIAPSAVWDDGRFTYIRIPNNRRIPAIFQVED 207
+ L + V RN Y ++ I PS ++DDG FTY N PAIF V+
Sbjct: 385 TQYVHNALKRNPVPRNYNYYQAPEKRSKHIMPSEIFDDGTFTYFGFKNITLQPAIFVVQP 444

Query: 208 DDTERVVDKHMD 219
D + D +D
Sbjct: 445 DGKLSMTDAAID 456


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
Bcenmc03_4197  PF04335  193    2e-64    VirB8 type IV secretion protein
		>PF04335#VirB8 type IV secretion protein

Length = 227

Score = 193 bits (493), Expect = 2e-64
Identities = 51/220 (23%), Positives = 86/220 (39%), Gaps = 9/220 (4%)

Query: 4 QSDYRRALDFEASLTALQACSERRAWQVAFAAVIVAIGSAAALAVMTPFYRVVPLPIEVN 63
++ + A +E A S++ AW VA A +A A+A +TP V P I V+
Sbjct: 11 KAYFEEAASWERDKLAAAERSKKLAWVVAGVAGALATAGVVAVAALTPLKTVEPYVITVD 70

Query: 64 KLTGEAQLIEVLDA-KHVPLREIEDKHWVEVYVRARERYDWGLLQMDYDRVLEMSDESVA 122
+ TGEA + L + E K+++ YVR RE + + +D V+ MS
Sbjct: 71 RNTGEASIAAKLHGDATITYDEAVRKYFLATYVRYREGWIAAAREEYFDAVMVMSARPEQ 130

Query: 123 RAYRQIYSG--PNALDQQLGASVQYRTRIVSTTLVPDEPGHAVVHLERTVRKNGIDTGEP 180
+ + Y P + L I + + A V+ + +
Sbjct: 131 DRWSRFYKTDNPQSPQNILANRTDVFVEIKRVSFLGGNV--AQVYFTKESVT---GSNST 185

Query: 181 AKRFVITLAFTYRPTVLVRERSAIENPFGFKVTAYSRDAE 220
V T+ + T +E +NP G++V +Y D E
Sbjct: 186 KTDAVATIKYKVDGTPS-KEVDRFKNPLGYQVESYRADVE 224


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4203  ACRIFLAVINRP  29     0.018    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 29.0 bits (65), Expect = 0.018
Identities = 3/29 (10%), Positives = 11/29 (37%)

Query: 75 INRENWSRYGLTVGTVFDVCRNLSAGAAI 103
+++E G+++ + G +
Sbjct: 730 VDQEKAQALGVSLSDINQTISTALGGTYV 758


16  Bcenmc03_4217  Bcenmc03_4227  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias    %GCBias  Product
Bcenmc03_4217      -1        9   3.527865  hypothetical protein
Bcenmc03_4218      -2       14   2.411725  porin opacity type
Bcenmc03_4219      -1       13   2.426095  two component LuxR family transcriptional
Bcenmc03_4220      -1       13   1.977619  histidine kinase
Bcenmc03_4221      -2       15   1.498035  two component transcriptional regulator
Bcenmc03_4222       3       13   0.071432  Hpt sensor hybrid histidine kinase
Bcenmc03_4223       3       14  -0.303177  YadA domain-containing protein
Bcenmc03_4224       3       13  -0.851879  two component LuxR family transcriptional
Bcenmc03_4225       3       14  -0.799114  two component LuxR family transcriptional
Bcenmc03_4226       3       13  -0.452013  OmpA/MotB domain-containing protein
Bcenmc03_4227       3       13  -0.705939  YadA domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4218  OUTRMMBRANEA  37     4e-05    Outer membrane protein A signature.
		>OUTRMMBRANEA#Outer membrane protein A signature.

Length = 346

Score = 36.8 bits (85), Expect = 4e-05
Identities = 28/142 (19%), Positives = 44/142 (30%), Gaps = 20/142 (14%)

Query: 68 GGPDTGSNVTGSLGLGYQFGNGWRAEGEYV-FKRTNNFTSYWAPFDANANEFHVSAQRLM 126
GP + + GYQ E Y R P+ + AQ +
Sbjct: 48 NGPTHENQLGAGAFGGYQVNPYVGFEMGYDWLGR--------MPYKGSVENGAYKAQGVQ 99

Query: 127 LNGYKDFDLGRGFSVYGTLGIGVAIVSADGWQTNDTRRFASKTQTNLAYS--AGAGVSYA 184
L + + +Y LG V DT+ + S GV YA
Sbjct: 100 LTAKLGYPITDDLDIYTRLGGMVW--------RADTKSNVYGKNHDTGVSPVFAGGVEYA 151

Query: 185 INKRFSIDLGYRYV-DMGNVET 205
I + L Y++ ++G+ T
Sbjct: 152 ITPEIATRLEYQWTNNIGDAHT 173


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
Bcenmc03_4219  HTHFIS  104    2e-28    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 104 bits (261), Expect = 2e-28
Identities = 38/154 (24%), Positives = 63/154 (40%), Gaps = 1/154 (0%)

Query: 10 DRPIVAIVDDDEPVRDGLALLLRTVGLPTRCYADAQAFLADADDRALGCVLLDLRMPGMS 69
+ + DDD +R L L G R ++A V+ D+ MP +
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 70 GLDALDRLSARRA-LPVIVLTGHGNVDACRRAFKRGALDFLRKPVDDDELIDTVQQALRR 128
D L R+ R LPV+V++ +A ++GA D+L KP D ELI + +AL
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 129 QAAQRGQDDAGQTRAARVATLSAREREVLEGIVR 162
+ + + + SA +E+ + R
Sbjct: 122 PKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLAR 155


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
Bcenmc03_4220  PF06580  31     0.013    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 30.6 bits (69), Expect = 0.013
Identities = 19/102 (18%), Positives = 38/102 (37%), Gaps = 16/102 (15%)

Query: 374 ILHNLIRNA-RDALAGMP-LGEIRISGGRAGRHYRFSVVDNGPGVPDDALPRLFEPFFTT 431
++ L+ N + +A +P G+I + G + V + G + T
Sbjct: 259 LVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKN----------TK 308

Query: 432 RENGLGLGLPLCDTLAQR--QDGSLMIRNRPSGGVEATLLLP 471
G GL + + L + + + + G V A +L+P
Sbjct: 309 ESTGTGLQN-VRERLQMLYGTEAQIKLSEKQ-GKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
Bcenmc03_4222  HTHFIS  88     5e-20    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 87.6 bits (217), Expect = 5e-20
Identities = 37/137 (27%), Positives = 58/137 (42%), Gaps = 2/137 (1%)

Query: 771 AARVLVVDDHPVNRTLQQSQLVTLGYAADAADDGASALRRCADTRYDLVMTDLNMPGMDG 830
A +LV DD RT+ L GY + A+ R A DLV+TD+ MP +
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 831 YTLARVLRARYPDLPVIAITAHASAAEHARCAEAGIVAVLVKPVLLDTIDRTVRRFAKIS 890
+ L ++ PDLPV+ ++A + + +E G L KP L + + R ++
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR--ALA 120

Query: 891 ATSRPARNTLVDLAEGP 907
R D +G
Sbjct: 121 EPKRRPSKLEDDSQDGM 137


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits       Score  E-value  Comments
Bcenmc03_4223  OMADHESIN  69     7e-14    Yersinia outer membrane adhesin signature.
		>OMADHESIN#Yersinia outer membrane adhesin signature.

Length = 455

Score = 68.8 bits (167), Expect = 7e-14
Identities = 63/205 (30%), Positives = 102/205 (49%), Gaps = 14/205 (6%)

Query: 509 GSDSVATGPQSTAIGTSATANSTGSVALGNGATSSGANATAIGRLATAGAANATVLGGSA 568
G ++ A G S AIG +A A +VA+G G+ ++G N+ AIG L+ A +A G ++
Sbjct: 62 GLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAAS 121

Query: 569 TA------------TGQSAIAIGQSAHANGLSAIGIGFLSDAQADN--SVALGARSVANR 614
TA T + +A+G ++ A+ +++ IG S A++ S+A+G RS +R
Sbjct: 122 TAQKDGVAIGARASTSDTGVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDR 181

Query: 615 DNTVSVGSASQQRQIVNVAAGTQGSDAVNVSQLAPVVTALGGGATIDSATGAVTGPTYTL 674
+N+VS+G S RQ+ ++AAGT+ +DAVNV+QL + SA Y
Sbjct: 182 ENSVSIGHESLNRQLTHLAAGTKDTDAVNVAQLKKEIEKTQENTNKRSAELLANANAYAD 241

Query: 675 TNGGAQTTVGGALGALDSALTTANG 699
+ + SA T N
Sbjct: 242 NKSSSVLGIANNYTDSKSAETLENA 266



Score = 63.0 bits (152), Expect = 5e-12
Identities = 67/217 (30%), Positives = 99/217 (45%), Gaps = 3/217 (1%)

Query: 106 NAALAADSATAIGANASAAGQSSVAIGGNTMAT-VNGVAIGTLSQATGANSTAVGVGAAA 164
NA+ + AIGA A AA ++VA+G ++AT VN VAIG LS+A G ++ G + A
Sbjct: 64 NASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAASTA 123

Query: 165 TASAASALGRGAAASGTNTSAFGNGARASGDSASALGRGAVASEANSVALGANSTADRAN 224
+ R + + F + A A A A+ S+A+G S DR N
Sbjct: 124 QKDGVAIGARASTSDTGVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDREN 183

Query: 225 AVSVGTTSAQRQIINVADGTAGTDAVNLNQLNAAIAASDQYFMVNSTSANLANATGANAV 284
+VS+G S RQ+ ++A GT TDAVN+ QL I + + N SA L A A
Sbjct: 184 SVSIGHESLNRQLTHLAAGTKDTDAVNVAQLKKEIEKTQEN--TNKRSAELLANANAYAD 241

Query: 285 AIGQAVTSSGSSSVAIGSGTSADGLHSMALGASSRVV 321
+V ++ S + + A S V+
Sbjct: 242 NKSSSVLGIANNYTDSKSAETLENARKEAFAQSKDVL 278



Score = 43.0 bits (100), Expect = 8e-06
Identities = 54/159 (33%), Positives = 75/159 (47%), Gaps = 8/159 (5%)

Query: 1391 GSLVVGDGSAASGENSSAIGQGSAASGDGSTAVGQGSNASGGNSSAIGQGSNASGSNSS- 1449
++ VG GS A+G NS AIG S A GD + G S A + G+ AS S++
Sbjct: 85 AAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAASTA---QKDGVAIGARASTSDTGV 141

Query: 1450 AIGQGAVASGDSSTAIGQGS--SATGSGSVAIGAGSVATEANTVSFGDGTAEGNRRLVNI 1507
A+G + A +S AIG S +A S+AIG S N+VS G + NR+L ++
Sbjct: 142 AVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDRENSVSIGHESL--NRQLTHL 199

Query: 1508 ADGVNASDAATKGQLDRAINGMQGQINDVAKNAYAGVAA 1546
A G +DA QL + I Q N + A A
Sbjct: 200 AAGTKDTDAVNVAQLKKEIEKTQENTNKRSAELLANANA 238



Score = 42.2 bits (98), Expect = 1e-05
Identities = 36/100 (36%), Positives = 59/100 (59%), Gaps = 6/100 (6%)

Query: 276 ANATGANAVAIGQAVTSSGSSSVAIGSGTSADGLHSMALGASSRVVGDSNLAVGDGATVT 335
A+A G +++AIG ++ ++VA+G+G+ A G++S+A+G S+ +GDS + G
Sbjct: 65 ASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAA---- 120

Query: 336 SLTTAPTNAIAIGKGANVSDAGVAGATGSIAIGTNAAAGG 375
+TA + +AIG A+ SD GVA S A N+ A G
Sbjct: 121 --STAQKDGVAIGARASTSDTGVAVGFNSKADAKNSVAIG 158



Score = 40.3 bits (93), Expect = 7e-05
Identities = 36/128 (28%), Positives = 63/128 (49%)

Query: 411 GINSAAAGNASVAMGPNATAAGAGSIVIGNQAATTAANSVALGNAATGSAVNTTAIGSNA 470
G+N++A G S+A+G A AA ++ +G + T NSVA+G + + G+ +
Sbjct: 62 GLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAAS 121

Query: 471 KATGNNSTVLGQSSSASGLSAVGIGFRANASGQEAISLGSDSVATGPQSTAIGTSATANS 530
A + + ++S++ AVG +A+A AI S A S AIG + +
Sbjct: 122 TAQKDGVAIGARASTSDTGVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDR 181

Query: 531 TGSVALGN 538
SV++G+
Sbjct: 182 ENSVSIGH 189



Score = 37.2 bits (85), Expect = 5e-04
Identities = 32/78 (41%), Positives = 46/78 (58%), Gaps = 1/78 (1%)

Query: 1410 GQGSAASGDGSTAVGQGSNASGGNSSAIGQGSNASGSNSSAIGQGAVASGDSSTAIGQGS 1469
G ++A G S A+G + A+ G + A+G GS A+G NS AIG + A GDS+ G S
Sbjct: 62 GLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAAS 121

Query: 1470 SATGSGSVAIGAGSVATE 1487
+A G VAIGA + ++
Sbjct: 122 TAQKDG-VAIGARASTSD 138



Score = 35.3 bits (80), Expect = 0.002
Identities = 45/162 (27%), Positives = 76/162 (46%), Gaps = 8/162 (4%)

Query: 157 AVGVGAAATASAASALGRGAAASGTNTSAFGNGARASGDSASALGRGAVASEANSVALGA 216
A+G+ A G A+A G ++ A G A A+ +A A+G G++A+ NSVA+G
Sbjct: 46 ALGLEYPVRPPVPGAGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGP 105

Query: 217 NSTADRANAVSVGTTSAQRQIINVADGTAGTDAVNLNQLNAAIAASDQYFMVNSTSANLA 276
S A +AV+ G S ++ DG A + + A+ + + NS + +
Sbjct: 106 LSKALGDSAVTYGAASTAQK-----DGVAIGARASTSDTGVAVGFNSKADAKNSVAIGHS 160

Query: 277 NATGAN---AVAIGQAVTSSGSSSVAIGSGTSADGLHSMALG 315
+ AN ++AIG + +SV+IG + L +A G
Sbjct: 161 SHVAANHGYSIAIGDRSKTDRENSVSIGHESLNRQLTHLAAG 202



Score = 32.2 bits (72), Expect = 0.019
Identities = 25/60 (41%), Positives = 36/60 (60%)

Query: 1438 GQGSNASGSNSSAIGQGAVASGDSSTAIGQGSSATGSGSVAIGAGSVATEANTVSFGDGT 1497
G ++A G +S AIG A A+ ++ A+G GS ATG SVAIG S A + V++G +
Sbjct: 62 GLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAAS 121


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
Bcenmc03_4224  HTHFIS  46     4e-08    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 46.0 bits (109), Expect = 4e-08
Identities = 21/84 (25%), Positives = 37/84 (44%), Gaps = 5/84 (5%)

Query: 7 IRIVIADDHPAVVIGARYELSATNTLAVVASANNSTELMETLANHPCDVLVSDYAMPGTE 66
I++ADD A I + + V +N+ L +A D++V+D MP
Sbjct: 4 ATILVADDDAA--IRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPD-- 59

Query: 67 YGDGLAMFSAILKRYPNLRIVVMT 90
+ + I K P+L ++VM+
Sbjct: 60 -ENAFDLLPRIKKARPDLPVLVMS 82


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
Bcenmc03_4225  HTHFIS  38     3e-05    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 37.5 bits (87), Expect = 3e-05
Identities = 25/121 (20%), Positives = 49/121 (40%), Gaps = 7/121 (5%)

Query: 6 IRVVLADDHPATLGGVQHGLSSVP-TIRLTGSAGNSTELIALLDAGVCDVLVSDYAMPGG 64
+++ADD A + LS +R+T +A L + AG D++V+D MP
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNA---ATLWRWIAAGDGDLVVTDVVMPD- 59

Query: 65 AYGDGIALFSYLQRNYPAVKLVVLTMLDNPAVIKGLLGLGISCIVSKSDAVDHLIPAVHA 124
+ L +++ P + ++V++ + G + K + LI +
Sbjct: 60 --ENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117

Query: 125 A 125
A
Sbjct: 118 A 118


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits        Score  E-value  Comments
Bcenmc03_4226  OMPADOMAIN  106    2e-29    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 106 bits (266), Expect = 2e-29
Identities = 54/149 (36%), Positives = 79/149 (53%), Gaps = 16/149 (10%)

Query: 94 FMCGKPQPVVQPAPAPAPQPAPQPVPQRQVLLQGDANFATDSAALTSQARNDLDRFIA-- 151
F G+ PVV PAPAPAP V + L+ D F + A L + + LD+ +
Sbjct: 191 FGQGEAAPVVAPAPAPAP-----EVQTKHFTLKSDVLFNFNKATLKPEGQAALDQLYSQL 245

Query: 152 ANRGVEFARVAITGFTDSTGSAAHNRTLSEARARTVVNYLRSNGLQARSFSAEGLGAADP 211
+N + V + G+TD GS A+N+ LSE RA++VV+YL S G+ A SA G+G ++P
Sbjct: 246 SNLDPKDGSVVVLGYTDRIGSDAYNQGLSERRAQSVVDYLISKGIPADKISARGMGESNP 305

Query: 212 VASNATADGR---------AQNRRVEIRL 231
V N + + A +RRVEI +
Sbjct: 306 VTGNTCDNVKQRAALIDCLAPDRRVEIEV 334


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits       Score  E-value  Comments
Bcenmc03_4227  OMADHESIN  82     1e-17    Yersinia outer membrane adhesin signature.
		>OMADHESIN#Yersinia outer membrane adhesin signature.

Length = 455

Score = 81.9 bits (201), Expect = 1e-17
Identities = 73/197 (37%), Positives = 114/197 (57%), Gaps = 7/197 (3%)

Query: 1205 GAQSIATGSGAVALGAGASAAGTGAVALGHG-VATGTNALALGNGTVASGNNAIAEGFNA 1263
G + A G ++A+GA A AA AVA+G G +ATG N++A+G + A G++A+ G A
Sbjct: 62 GLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYG-AA 120

Query: 1264 RAAGVNGLALGNTARANAADS-LAFGTTAQVDPLATNSIAIGRQANVTSTALNSVALGAS 1322
A +G+A+G ARA+ +D+ +A G ++ D A NS+AIG ++V + S+A+G
Sbjct: 121 STAQKDGVAIG--ARASTSDTGVAVGFNSKAD--AKNSVAIGHSSHVAANHGYSIAIGDR 176

Query: 1323 SVADRLNSVSVGSTGQQRQIIYVARGTANTDAVNVSQLKEAVAAFGGNASVDANGAIVNP 1382
S DR NSVS+G RQ+ ++A GT +TDAVNV+QLK+ + N + + + N
Sbjct: 177 SKTDRENSVSIGHESLNRQLTHLAAGTKDTDAVNVAQLKKEIEKTQENTNKRSAELLANA 236

Query: 1383 TYTIGGTTYRNVGDALN 1399
+ +G A N
Sbjct: 237 NAYADNKSSSVLGIANN 253



Score = 71.5 bits (174), Expect = 3e-14
Identities = 59/182 (32%), Positives = 86/182 (47%), Gaps = 28/182 (15%)

Query: 635 ATAVGANASATGRSAVALGGNTVASAQNAVALGTLSRATGLESTAVGVGAAAT------G 688
+ A+GA A A +AVA+G ++A+ N+VA+G LS+A G + G + A G
Sbjct: 72 SIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAASTAQKDGVAIG 131

Query: 689 ANTSAFGRGASAG----------------------AGNSVALGTFSVADRANSVSVGSAS 726
A S G + G G S+A+G S DR NSVS+G S
Sbjct: 132 ARASTSDTGVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDRENSVSIGHES 191

Query: 727 ARRQITNVAAGTQDNDAVNVAQLNEAIANVDGNSPYFKANGAGDGSDAASATGVGSVAVG 786
RQ+T++AAGT+D DAVNVAQL + I N+ A + + A + +
Sbjct: 192 LNRQLTHLAAGTKDTDAVNVAQLKKEIEKTQENTNKRSAELLANANAYADNKSSSVLGIA 251

Query: 787 SN 788
+N
Sbjct: 252 NN 253



Score = 68.8 bits (167), Expect = 2e-13
Identities = 88/309 (28%), Positives = 129/309 (41%), Gaps = 66/309 (21%)

Query: 347 FSTGTYAANAAATVGNNSATAIGPNASATGISAVALGGNTVASAQNAVALGTLARATGLE 406
FS+ A+ + N +A I PNA ALG A G A A G+
Sbjct: 18 FSSPYAFADDYDGIPNLTAVQISPNADP------ALGLEYPVRPPVPGAGGLNASAKGIH 71

Query: 407 TTAVGVGAAATGASATALGRGAAATGTNSTALGTFASAIGTGNLAVGGGAAVAQTGRFVP 466
+ A+G A A +A A+G G+ ATG NS A+G + A+G + G + + G
Sbjct: 72 SIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAASTAQKDG---- 127

Query: 467 TNMIAIGTGARAIDGDVTGANGSIAIGSSATAAGTGSIAIGTNARHSGSSATVLGGEASA 526
+AIG+ A+ + TG
Sbjct: 128 -----------------------VAIGARASTSDTG------------------------ 140

Query: 527 MGGGGGVAVGYGAMASGSSGVSLGTNSTASATS--SVALGTNSVANRANAVSVGSAAQQR 584
VAVG+ + A + V++G +S +A S+A+G S +R N+VS+G + R
Sbjct: 141 ------VAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDRENSVSIGHESLNR 194

Query: 585 QITNVAAGTAGTDAVNVNQLNAAIAATDNKYVSISTGLYAADVAATAAESATAVG-ANAS 643
Q+T++AAGT TDAVNV QL I T S L A A +S++ +G AN
Sbjct: 195 QLTHLAAGTKDTDAVNVAQLKKEIEKTQENTNKRSAELLANANAYADNKSSSVLGIANNY 254

Query: 644 ATGRSAVAL 652
+SA L
Sbjct: 255 TDSKSAETL 263



Score = 63.4 bits (153), Expect = 8e-12
Identities = 64/207 (30%), Positives = 97/207 (46%), Gaps = 30/207 (14%)

Query: 763 FKANGAGDGSDAASATGVGSVAVGSNAKALVAGGVAIGGSATASMANSIAIGNDVIAAQD 822
+ G G ASA G+ S+A+G+ A+A VA+G + A+ NS
Sbjct: 53 VRPPVPGAGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNS------------ 100

Query: 823 GVAIGHGARAQNSNAISIGTQSAAGPNGVSLGNNALSVSDGIALGTNASAAGANSVALGS 882
VAIG ++A +A++ G S A +GV++G A + G+A+G N+ A NSVA+G
Sbjct: 101 -VAIGPLSKALGDSAVTYGAASTAQKDGVAIGARASTSDTGVAVGFNSKADAKNSVAIGH 159

Query: 883 GSIAAS-----------------NTVSVGSVGNERKITNVAAGTDRQDAVNLGQLQDTGL 925
S A+ N+VS+G R++T++AAGT DAVN+ QL+
Sbjct: 160 SSHVAANHGYSIAIGDRSKTDRENSVSIGHESLNRQLTHLAAGTKDTDAVNVAQLKKEIE 219

Query: 926 VAPVDPTNPGAGLTSLAVTYGTNADGS 952
+ A L + A Y N S
Sbjct: 220 KTQENTNKRSAELLANANAYADNKSSS 246



Score = 37.6 bits (86), Expect = 8e-04
Identities = 50/183 (27%), Positives = 79/183 (43%), Gaps = 23/183 (12%)

Query: 3118 GLELRPGTPGDGNGGGTGTNPYFGATDLTAGGSSAANPGTGTGNVAAG-SGASIG----- 3171
+RP PG G + + A TA + A G G++A G + +IG
Sbjct: 50 EYPVRPPVPGAGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKA 109

Query: 3172 ------------TGVNNGTVIGSGSTVGSSGGTAIGAGSAANGENATAIGQGSN--ATGS 3217
T +G IG+ ++ S G A+G S A+ +N+ AIG S+ A
Sbjct: 110 LGDSAVTYGAASTAQKDGVAIGARAST-SDTGVAVGFNSKADAKNSVAIGHSSHVAANHG 168

Query: 3218 GSVAIGSGSVANEANTVSFGNGTDTGNRRIVNIADGVGANDAATKGQLDRAVGGLGSQIN 3277
S+AIG S + N+VS G+ ++ NR++ ++A G DA QL + + N
Sbjct: 169 YSIAIGDRSKTDRENSVSIGH--ESLNRQLTHLAAGTKDTDAVNVAQLKKEIEKTQENTN 226

Query: 3278 DLS 3280
S
Sbjct: 227 KRS 229



Score = 34.5 bits (78), Expect = 0.009
Identities = 38/142 (26%), Positives = 55/142 (38%), Gaps = 2/142 (1%)

Query: 2913 QVKNVAAGTDDTDAVNVAQLKSAGLVAPVDPTNPGSGLTSLAVTYSTNDDESANFDEVKL 2972
Q+ ++AAGT DTDAVNVAQLK + + L + A Y+ N S
Sbjct: 195 QLTHLAAGTKDTDAVNVAQLKKEIEKTQENTNKRSAELLANANAYADNKSSSVLGIANNY 254

Query: 2973 KGTNGTTITNVAAGAVNSTSTDAINGSQLHGTAQSVADTIGGGTTVGADGTLGNTAIEVN 3032
+ A + S D +N ++ H + SVA T A+ T
Sbjct: 255 TDSKSAETLENARKEAFAQSKDVLNMAKAH--SNSVARTTLETAEEHANSVARTTLETAE 312

Query: 3033 GQKYSTVAEAVQAAAAYGATDS 3054
AEA+ +A Y + S
Sbjct: 313 EHANKKSAEALASANVYADSKS 334



Score = 32.9 bits (74), Expect = 0.023
Identities = 53/207 (25%), Positives = 81/207 (39%), Gaps = 21/207 (10%)

Query: 1297 ATNSIAIGRQANVTSTALNSVALGASSVADRLNSVSVGSTGQ---QRQIIYVARGTANTD 1353
+SIAIG A A +VA+GA S+A +NSV++G + + Y A TA D
Sbjct: 69 GIHSIAIGATAEAAKGA--AVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAASTAQKD 126

Query: 1354 AVNVSQLKEAVAAFGGNASVDANGAIVNPTYTIGGTTYRNVGDALNALSNLGGGGTDPLA 1413
V A G AS G V +G + + +N G +
Sbjct: 127 GV----------AIGARASTSDTGVAVGFNSKADAKNSVAIGHSSHVAANHG------YS 170

Query: 1414 VTYGTNEDGTPNFAVVTLKGTDGTTLSNVKAGVADTDAVNVSQLKDSGLIGDDGKAIAAV 1473
+ G +V + L+++ AG DTDAVNV+QLK + +
Sbjct: 171 IAIGDRSKTDRENSVSIGHESLNRQLTHLAAGTKDTDAVNVAQLKKEIEKTQENTNKRSA 230

Query: 1474 TYDRNADGTPNLGSVTLAGGTDGTTLS 1500
NA+ + S ++ G + T S
Sbjct: 231 ELLANANAYADNKSSSVLGIANNYTDS 257


17  Bcenmc03_4239  Bcenmc03_4303  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
Bcenmc03_4239  0  11  3.093279  amino acid permease-associated protein
Bcenmc03_4240  0  13  4.430021  undecaprenyl-phosphate glucose
Bcenmc03_4241  0  14  5.485373  polysaccharide export protein
Bcenmc03_4242  0  13  5.067310  protein tyrosine phosphatase
Bcenmc03_4243  1  10  4.801962  exopolysaccharide transport protein family
Bcenmc03_4244  2  11  4.437916  group 1 glycosyl transferase
Bcenmc03_4245  1  11  2.773392  hypothetical protein
Bcenmc03_4246  -1  8  1.336497  polysaccharide biosynthesis protein
Bcenmc03_4247  -2  8  -0.072754  hypothetical protein
Bcenmc03_4248  -2  9  -1.019416  hypothetical protein
Bcenmc03_4249  -1  20  -4.163381  hypothetical protein
Bcenmc03_4250  1  27  -5.452235  2-isopropylmalate synthase
Bcenmc03_4251  6  42  -6.240389  integrase catalytic subunit
Bcenmc03_4252  7  45  -7.046462  transposase IS3/IS911 family protein
Bcenmc03_4253  6  45  -6.405122  transposase IS3/IS911 family protein
Bcenmc03_4255  6  44  -5.425080  integrase catalytic subunit
Bcenmc03_4256  5  39  -4.403708  prevent-host-death family protein
Bcenmc03_4257  5  40  -5.208394  PilT domain-containing protein
Bcenmc03_4258  5  42  -6.157463  porin
Bcenmc03_4259  5  43  -6.000163  hypothetical protein
Bcenmc03_4260  5  43  -6.492791  amidase
Bcenmc03_4261  6  43  -7.375972  branched chain amino acid ABC transporter
Bcenmc03_4262  8  43  -7.706052  ABC transporter-like protein
Bcenmc03_4263  8  42  -7.416972  ABC transporter-like protein
Bcenmc03_4264  7  39  -6.320097  inner-membrane translocator
Bcenmc03_4265  6  38  -6.724256  inner-membrane translocator
Bcenmc03_4266  6  41  -6.273795  XRE family transcriptional regulator
Bcenmc03_4267  5  42  -6.387835  XRE family transcriptional regulator
Bcenmc03_4268  4  41  -5.919520  GntR domain-containing protein
Bcenmc03_4269  4  43  -5.875076  acyl-CoA dehydrogenase domain-containing
Bcenmc03_4270  5  45  -6.266791  phosphopantetheine-binding
Bcenmc03_4271  5  45  -6.102251  3-oxoacyl-(ACP) synthase III
Bcenmc03_4272  4  44  -5.920722  3-oxoacyl-(ACP) synthase III
Bcenmc03_4273  5  42  -5.725705  acyl-CoA ligase
Bcenmc03_4274  5  44  -6.147723  pyridoxal-dependent decarboxylase, exosortase
Bcenmc03_4275  5  45  -6.131616  acyl-CoA dehydrogenase domain-containing
Bcenmc03_4276  5  46  -6.440708  hypothetical protein
Bcenmc03_4277  5  47  -6.166029  3-oxoacyl-(ACP) synthase III
Bcenmc03_4278  3  40  -5.838137  3-oxoacyl-(ACP) synthase III
Bcenmc03_4279  1  24  -3.797329  LuxR family transcriptional regulator
Bcenmc03_4280  0  13  -1.146464  autoinducer synthesis protein
Bcenmc03_4281  -2  9  1.507479  hypothetical protein
Bcenmc03_4282  -1  8  2.573374  histone family protein nucleoid-structuring
Bcenmc03_4283  0  9  2.925834  Ion transport protein
Bcenmc03_4286  -1  10  3.364910  EmrB/QacA family drug resistance transporter
Bcenmc03_4287  0  11  4.677622  secretion protein HlyD family protein
Bcenmc03_4288  1  11  4.099558  RND efflux system outer membrane lipoprotein
Bcenmc03_4289  2  12  3.198527  AraC family transcriptional regulator
Bcenmc03_4290  2  12  2.722028  LysR family transcriptional regulator
Bcenmc03_4291  2  13  2.713352  hypothetical protein
Bcenmc03_4292  2  13  2.834247  amino acid adenylation domain-containing
Bcenmc03_4293  2  13  1.552540  hypothetical protein
Bcenmc03_4294  0  11  1.657778  cupin 2 domain-containing protein
Bcenmc03_4295  -1  10  0.922665  hypothetical protein
Bcenmc03_4296  -1  10  0.815204  condensation domain-containing protein
Bcenmc03_4297  -1  9  0.004060  hypothetical protein
Bcenmc03_4298  1  8  -0.690507  AraC family transcriptional regulator
Bcenmc03_4299  3  10  -4.389038  LuxR family transcriptional regulator
Bcenmc03_4300  2  12  -4.767305  2-isopropylmalate synthase
Bcenmc03_4301  3  18  -5.450741  fucose-binding lectin II
Bcenmc03_4302  3  12  -3.267576  fucose-binding lectin II
Bcenmc03_4303  2  12  -2.215537  fucose-binding lectin II
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bcenmc03_4243  SSPAMPROTEIN  32  0.003  Salmonella surface presentation of antigen gene typ...
		>SSPAMPROTEIN#Salmonella surface presentation of antigen gene type M signature.

Length = 147

Score = 32.0 bits (72), Expect = 0.003
Identities = 24/87 (27%), Positives = 40/87 (45%), Gaps = 16/87 (18%)

Query: 270 RYVEQNVERRSAEAA-----HSLAFLRDQLPTVKRQLEEAEQRYATLRSSGHSIDLAEEG 324
RY +++ + E A L L D L RQL E+ YA LR
Sbjct: 27 RYQDEDRRLQVEEEAIVEQIAGLKLLLDTLRAENRQLSR-EEIYALLRKQ---------- 75

Query: 325 KLALQQGADLQTRVLELQQKRDELSRR 351
+ +Q DL+ +++++Q+KR EL ++
Sbjct: 76 SIVRRQIKDLELQIIQIQEKRSELEKK 102


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bcenmc03_4258  ECOLNEIPORIN  73  9e-17  E.coli/Neisseria porin superfamily signature.
		>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature.

Length = 331

Score = 73.3 bits (180), Expect = 9e-17
Identities = 84/339 (24%), Positives = 126/339 (37%), Gaps = 41/339 (12%)

Query: 24 VVHAQSSVSLYGLIDAFVGETHAPGAAGSAWQVGSGGMTT-----SYWGVSGSEDLGHGM 78
V A + V+LYG I A V ET A A T S G G EDLG+G+
Sbjct: 14 PVAAMADVTLYGTIKAGV-ETSRSVAHNGAQAASVETGTGIVDLGSKIGFKGQEDLGNGL 72

Query: 79 KAVFTLESFFRVNAGQLGSFNGQSFFGRNAFVGLSGRFGEVTVGRNTAPLFVSTLLFNPF 138
KA++ +E + G N R +F+GL G FG++ VGR + L T NP+
Sbjct: 73 KAIWQVEQKASIAGTDSGWGN------RQSFIGLKGGFGKLRVGRLNSVLK-DTGDINPW 125

Query: 139 GNSFAFSPMISHGYLGSAMGAASVQSDTAIDSSVLYQTPEIGGLSGSLLYSNAGVAGHTG 198
+ +G + A SV Y +PE GLSGS+ Y+ AG
Sbjct: 126 DSKS------------DYLGVNKIAEPEARLISVRYDSPEFAGLSGSVQYALNDNAGRHN 173

Query: 199 QANYSANVLY-FAG-AFSATAAVQSLHTASLFLNGATAQTAWLVGGAY---RIHA----- 248
+Y A Y G A + H +N Q + Y ++A
Sbjct: 174 SESYHAGFNYKNGGFFVQYGGAYKRHHQVQENVNIEKYQ-IHRLVSGYDNDALYASVAVQ 232

Query: 249 TKLMFQYERVSDNSNVTDDTVQVGASARFGMGSMLLSWAGTRRRPVTGSGI--RWSTFAL 306
+ E + S+ + V + RFG + +S+A + + + +
Sbjct: 233 QQDAKLVEE--NYSHNSQTEVAATLAYRFGNVTPRVSYAHGFKGSFDATNYNNDYDQVVV 290

Query: 307 GYDYPLSKRTDIYAAWRY-DTITHASSGNGIGAGVRMRF 344
G +Y SKRT + + S GV +R
Sbjct: 291 GAEYDFSKRTSALVSAGWLQEGKGESKFVSTAGGVGLRH 329


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bcenmc03_4262  PF05272  28  0.028  Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.5 bits (63), Expect = 0.028
Identities = 12/49 (24%), Positives = 21/49 (42%), Gaps = 5/49 (10%)

Query: 34 GCNGAGKTTLVKAIMGFLPHVSGEVRFGEH-----VINGMRAHEIARLG 77
G G GK+TL+ ++G G I G+ A+E++ +
Sbjct: 603 GTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQIAGIVAYELSEMT 651


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bcenmc03_4270  ISCHRISMTASE  26  0.032  Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 25.7 bits (56), Expect = 0.032
Identities = 9/28 (32%), Positives = 15/28 (53%)

Query: 29 LSAELIEGDTNLFDLGVDSMNMTELLLQ 56
+ E I +L D G+DS+ + L+ Q
Sbjct: 245 ETPEDITDQEDLLDRGLDSVRIMTLVEQ 272


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bcenmc03_4280  AUTOINDCRSYN  109  7e-32  Autoinducer synthesis protein signature.
		>AUTOINDCRSYN#Autoinducer synthesis protein signature.

Length = 216

Score = 109 bits (273), Expect = 7e-32
Identities = 34/180 (18%), Positives = 65/180 (36%), Gaps = 10/180 (5%)

Query: 1 MSSIFAGSFDDMPTMMHRRLGMFRYDVFVGRLGWQLPGADATSLTEWDQFDRGRTIHVVS 60
M IF + + L R + F RL W + D E+DQ+D T ++
Sbjct: 1 MLEIFDVNHTLLSETKSGELFTLRKETFKDRLNWAVQCTDGM---EFDQYDNNNTTYLFG 57

Query: 61 IDQAQHICGCARLIPTTQPYLLQTLCAPSAA-LDLPCAPTVWELSRFAARSAANPTM--- 116
I + R I T P ++ P +++P E SRF + +
Sbjct: 58 IKD-NTVICSLRFIETKYPNMITGTFFPYFKEINIPE-GNYLESSRFFVDKSRAKDILGN 115

Query: 117 RASTGMQLFPSILAIAASLGATCVIGAMTRAVARLYQRCGLSLQLLSTAETA-GRPAYLI 175
LF S++ + G + ++ + + +R G ++++ + YL+
Sbjct: 116 EYPISSMLFLSMINYSKDKGYDGIYTIVSHPMLTILKRSGWGIRVVEQGLSEKEERVYLV 175


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bcenmc03_4286  TCRTETB  101  3e-25  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 101 bits (253), Expect = 3e-25
Identities = 80/400 (20%), Positives = 158/400 (39%), Gaps = 17/400 (4%)

Query: 30 FMAGMNVHVTNASLPDIRGSLGASFEEGSWITTAYLVAEIVVIPLTGWLVQVFSARRVLL 89
F + +N V N SLPDI +W+ TA+++ + + G L +R+LL
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 90 VGASGFLAFSLACSVAPS-ISTMIIARALQGAFGGVLIPLSFQLIVTELPPSKHPLGMAL 148
G S+ V S S +I+AR +QGA L ++ +P L
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143

Query: 149 FAIANNVAQAAGPSVGGWLTDMYSWRWIFYLQIPPAIALVAAIGWAIRPLPVQLGMLRRA 208
+ + GP++GG + W ++ + + + + + ++ L ++ +
Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHYIHWSYLLLI----PMITIITVPFLMKLLKKEVRIKGHF 199

Query: 209 DWFGIATMAVGLSALQIVLEEGGRKDWFASDLIVELSIVAALGLAAFVAIELRRKEPFIN 268
D GI M+VG+ + F + + IV+ L FV + +PF++
Sbjct: 200 DIKGIILMSVGIVFFML----------FTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVD 249

Query: 269 LRLLGQYNFGIASLMQFLFGAVVFGVVFLVPNYFAELHGYSARDIG-LAMIPYGLVQFAM 327
L F I L + V G V +VP ++H S +IG + + P +
Sbjct: 250 PGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIF 309

Query: 328 SFLTPPLMRRTSPRTTIVLGFVLVAAGCLMNIHLDADAASNVIVPSLIVRGIGQSFVVIA 387
++ L+ R P + +G ++ L L + S + ++ G SF
Sbjct: 310 GYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLL-ETTSWFMTIIIVFVLGGLSFTKTV 368

Query: 388 LAVMAVDGIEKAQLGSASGVFNMVRNVGGAIGIAVMSQIV 427
++ + +++ + G+ + N + GIA++ ++
Sbjct: 369 ISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLL 408


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bcenmc03_4287  RTXTOXIND  135  7e-38  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 135 bits (342), Expect = 7e-38
Identities = 65/413 (15%), Positives = 130/413 (31%), Gaps = 68/413 (16%)

Query: 14 RFSRRQLIAAGVVLAVIALAVFGWHWWT-VGRFIESTDDAYVRADVVTVSSRVSGYVTQV 72
SRR + A ++ + +A F V + + + V ++
Sbjct: 52 PVSRRPRLVAYFIMGFLVIA-FILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEI 110

Query: 73 AVDDNQPVKRGDVLVRLDDRDYRAKVDDAQAAVAAADAT--------------------- 111
V + + V++GDVL++L A Q+++ A
Sbjct: 111 IVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKL 170

Query: 112 ------------------------LQAEQAAAATLDAQIGQQRSQIAQADADAAAARAEA 147
Q + + ++R++ A +
Sbjct: 171 PDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLS 230

Query: 148 ARRDADATRYKQLLAESAASGQRWEQAHADALKARAELTRAGAAV--------RVQTDQQ 199
+ + LL + A + + ++A EL + + + + Q
Sbjct: 231 RVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQ 290

Query: 200 TVLQRRREQSTAAIAQARARLAAAQAKLALAQLDLDHTVIRATRDGSVGQRAVRA-GQYV 258
V Q + + + Q + +LA + +VIRA V Q V G V
Sbjct: 291 LVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVV 350

Query: 259 EVGMPLLAVVPLSDVYVV-ANFKETQLGAMHDGQPVQIDVDTYSGHTLHGRVIGLAPGSG 317
L+ +VP D V A + +G ++ GQ I V+ + +T +G ++G
Sbjct: 351 TTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFP-YTRYGYLVG------ 403

Query: 318 AQFALLPPDNATGNFTKIVQRIPVKIRVDAPPA---GVVLRPGMSVIARVDTR 367
+ + D +V + + I + + L GM+V A + T
Sbjct: 404 -KVKNINLDAIEDQRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTG 455


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bcenmc03_4292  ISCHRISMTASE  33  0.006  Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 33.1 bits (75), Expect = 0.006
Identities = 26/100 (26%), Positives = 40/100 (40%), Gaps = 6/100 (6%)

Query: 674 IDRAALARLPVSRAGSDTRDAPRGALEMRLAEMWATLLELAPDEIGRDASFFELGGHSLL 733
+D+ A V + ++T E + + A LL+ P++I + G S+
Sbjct: 207 LDQLQNAPADVQKTSANTGKKNVFTCE-NIRKQIAELLQETPEDITDQEDLLDRGLDSV- 264

Query: 734 VSRLMLAVK--RELGGNAALARFMERPTIAALAALLTDES 771
R+M V+ R G ERPTI LLT S
Sbjct: 265 --RIMTLVEQWRREGAEVTFVELAERPTIEEWQKLLTTRS 302


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bcenmc03_4301  PF07472  148  5e-48  Fucose-binding lectin II
		>PF07472#Fucose-binding lectin II

Length = 245

Score = 148 bits (375), Expect = 5e-48
Identities = 50/128 (39%), Positives = 70/128 (54%), Gaps = 10/128 (7%)

Query: 4 SQTSSNRAGEFSIPPNTDFRAIFFANAAEQQHIKLFIGDSQEPAA-YHKLTTRDGPREA- 61
+ R G F++PPN F N++ QQ I++++ D+ +PAA + T+D
Sbjct: 126 TTGGGERDGIFNLPPNIAFGVTALVNSSAQQTIEVYVDDNPKPAATFQGAGTQDANLNTQ 185

Query: 62 TLNSGNGKIRFEVSVNGKPSATDARLAPINGKKSDGSPFTVNFGIVVSEDGHDSDYNDGI 121
+NSG GK+R V+ NGKPS +R I K FG+V SEDG D DYNDGI
Sbjct: 186 IVNSGKGKVRVVVTANGKPSKIGSRQVDIFKK--------TYFGLVGSEDGTDGDYNDGI 237

Query: 122 VVLQWPIG 129
+L WP+G
Sbjct: 238 AILNWPLG 245


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bcenmc03_4302  PF07472  203  2e-67  Fucose-binding lectin II
		>PF07472#Fucose-binding lectin II

Length = 245

Score = 203 bits (518), Expect = 2e-67
Identities = 89/159 (55%), Positives = 114/159 (71%), Gaps = 16/159 (10%)

Query: 113 GSAMHIDSYASLSAIGETAAPSSSQGGGNQGAETGGAGAGNIGGERDGTFNLPPHIKFGV 172
G ++ ++ + E P ++ GGG ERDG FNLPP+I FGV
Sbjct: 103 GVGAVVNYFSKATPQPEPTQPGTTTGGG----------------ERDGIFNLPPNIAFGV 146

Query: 173 TALTNAANDQTIDIYIDDDPKPAATFKGAGAQDQNLGTKVLDSGNGRVRVIVMANGKPSR 232
TAL N++ QTI++Y+DD+PKPAATF+GAG QD NL T++++SG G+VRV+V ANGKPS+
Sbjct: 147 TALVNSSAQQTIEVYVDDNPKPAATFQGAGTQDANLNTQIVNSGKGKVRVVVTANGKPSK 206

Query: 233 LGSRQVDIFKKSYFGIVGSEDGADDDYNDGIVFLNWPLG 271
+GSRQVDIFKK+YFG+VGSEDG D DYNDGI LNWPLG
Sbjct: 207 IGSRQVDIFKKTYFGLVGSEDGTDGDYNDGIAILNWPLG 245


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bcenmc03_4303  PF07472  407  e-148  Fucose-binding lectin II
		>PF07472#Fucose-binding lectin II

Length = 245

Score = 407 bits (1046), Expect = e-148
Identities = 230/245 (93%), Positives = 235/245 (95%), Gaps = 1/245 (0%)

Query: 1 MSQPFTHDDLYALLQLAGNDATAVQANGDQAVLDRMRRFMTAQLVEKLPQYDVFVDIATI 60
MSQPFTHDDLYALLQLAGNDATAVQANGDQAVLDRMR+FMT QLVEKLPQYDVFVDIATI
Sbjct: 1 MSQPFTHDDLYALLQLAGNDATAVQANGDQAVLDRMRQFMTTQLVEKLPQYDVFVDIATI 60

Query: 61 PYSFDVGSWQNKVKADAAGEVVACTVTWAGAPGVLPGAAAKFGVGAVVNYFSKATPQPVP 120
PYSFDVGSWQNKVKADAAG+V+ACTVTWAGAPGVLPGAAAKFGVGAVVNYFSKATPQP P
Sbjct: 61 PYSFDVGSWQNKVKADAAGQVIACTVTWAGAPGVLPGAAAKFGVGAVVNYFSKATPQPEP 120

Query: 121 -PAPAPTGGGERDGVFNLPPNIAFGVTALVNSSAPQTIEVFVDDNPKPAATFQGAGTQDA 179
TGGGERDG+FNLPPNIAFGVTALVNSSA QTIEV+VDDNPKPAATFQGAGTQDA
Sbjct: 121 TQPGTTTGGGERDGIFNLPPNIAFGVTALVNSSAQQTIEVYVDDNPKPAATFQGAGTQDA 180

Query: 180 NLNTQIVNSGKGKVRVVVTANGKPSKIGSRQVDIFKKTYFGLVGSEDGGDGDYNDGIAIL 239
NLNTQIVNSGKGKVRVVVTANGKPSKIGSRQVDIFKKTYFGLVGSEDG DGDYNDGIAIL
Sbjct: 181 NLNTQIVNSGKGKVRVVVTANGKPSKIGSRQVDIFKKTYFGLVGSEDGTDGDYNDGIAIL 240

Query: 240 NWPLG 244
NWPLG
Sbjct: 241 NWPLG 245


18  Bcenmc03_4393  Bcenmc03_4468  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
Bcenmc03_4393  -2  13  -3.904809  sodium:dicarboxylate symporter
Bcenmc03_4394  0  20  -4.223333  TetR family transcriptional regulator
Bcenmc03_4395  0  23  -4.678750  group 1 glycosyl transferase
Bcenmc03_4396  1  24  -5.106831  hypothetical protein
Bcenmc03_4397  1  26  -4.965630  polysaccharide deacetylase
Bcenmc03_4398  1  27  -5.097381  hypothetical protein
Bcenmc03_4399  1  23  -3.156972  hypothetical protein
Bcenmc03_4400  1  23  -3.209591  OmpW family protein
Bcenmc03_4401  1  21  -2.839607  AMP-binding domain-containing protein
Bcenmc03_4402  1  23  -3.413148  fumarylacetoacetate (FAA) hydrolase
Bcenmc03_4403  1  22  -3.358138  glyoxalase/bleomycin resistance
Bcenmc03_4404  1  22  -3.769250  monooxygenase FAD-binding
Bcenmc03_4405  2  25  -5.663444  TetR family transcriptional regulator
Bcenmc03_4406  1  25  -6.387965  major facilitator transporter
Bcenmc03_4407  4  43  -10.939145  hypothetical protein
Bcenmc03_4408  7  38  -9.736872  translation initiation factor IF-1
Bcenmc03_4409  7  36  -9.686812  cold-shock DNA-binding domain-containing
Bcenmc03_4410  6  38  -9.842843  hypothetical protein
Bcenmc03_4411  6  38  -8.639326  hypothetical protein
Bcenmc03_4412  8  42  -8.831486  hypothetical protein
Bcenmc03_4413  4  37  -6.779019  hypothetical protein
Bcenmc03_4414  5  36  -7.021887  hypothetical protein
Bcenmc03_4415  3  35  -7.445960  hypothetical protein
Bcenmc03_4416  5  34  -6.719103  hypothetical protein
Bcenmc03_4418  5  35  -7.109782  hypothetical protein
Bcenmc03_4419  5  33  -7.033340  fatty acid desaturase
Bcenmc03_4420  6  36  -7.537903  hypothetical protein
Bcenmc03_4421  7  44  -8.249944  Crp/FNR family transcriptional regulator
Bcenmc03_4422  6  42  -7.415470  hypothetical protein
Bcenmc03_4423  3  33  -5.705535  hypothetical protein
Bcenmc03_4424  1  29  -4.000855  transposase IS3/IS911 family protein
Bcenmc03_4425  0  29  -4.073538  integrase, catalytic region
Bcenmc03_4426  0  29  -4.570761  integrase catalytic subunit
Bcenmc03_4427  -1  20  -3.370831  hypothetical protein
Bcenmc03_4428  0  21  -3.862914  major facilitator transporter
Bcenmc03_4429  2  28  -5.433803  mandelate racemase/muconate lactonizing protein
Bcenmc03_4430  3  34  -8.417356  5-carboxymethyl-2-hydroxymuconate
Bcenmc03_4431  5  34  -8.332740  short-chain dehydrogenase/reductase SDR
Bcenmc03_4432  5  33  -8.340763  OsmC family protein
Bcenmc03_4433  6  36  -8.967056  TetR family transcriptional regulator
Bcenmc03_4434  5  33  -8.496555  alkylhydroperoxidase
Bcenmc03_4435  5  31  -8.331977  DSBA oxidoreductase
Bcenmc03_4436  7  31  -8.695259  NADH:flavin oxidoreductase/NADH oxidase
Bcenmc03_4437  7  34  -9.177458  class I and II aminotransferase
Bcenmc03_4438  9  36  -9.099393  4-oxalocrotonate tautomerase
Bcenmc03_4440  7  37  -8.274366  transposase IS3/IS911 family protein
Bcenmc03_4441  6  34  -7.556368  integrase catalytic subunit
Bcenmc03_4442  5  36  -8.240303  integrase catalytic subunit
Bcenmc03_4443  4  40  -8.200335  hypothetical protein
Bcenmc03_4444  4  41  -9.572092  putative lipoprotein
Bcenmc03_4445  4  41  -9.095912  LysR family transcriptional regulator
Bcenmc03_4446  4  37  -8.775935  hypothetical protein
Bcenmc03_4447  4  40  -9.311122  NmrA family protein
Bcenmc03_4448  3  41  -8.832967  hypothetical protein
Bcenmc03_4449  3  40  -8.255627  AraC family transcriptional regulator
Bcenmc03_4450  0  31  -5.403334  LysR family transcriptional regulator
Bcenmc03_4451  -1  30  -4.738693  transposase mutator type
Bcenmc03_4452  -1  32  -4.470086  hypothetical protein
Bcenmc03_4453  -1  28  -4.388889  hypothetical protein
Bcenmc03_4454  -1  22  -3.028998  hypothetical protein
Bcenmc03_4455  0  23  -3.439463  hypothetical protein
Bcenmc03_4456  -2  25  -4.645049  polysaccharide deacetylase
Bcenmc03_4457  -2  28  -5.169209  group 1 glycosyl transferase
Bcenmc03_4458  -1  28  -5.174106  AraC family transcriptional regulator
Bcenmc03_4459  1  33  -5.815447  hypothetical protein
Bcenmc03_4460  3  39  -6.941800  hypothetical protein
Bcenmc03_4461  4  42  -7.557466  hypothetical protein
Bcenmc03_4462  4  37  -6.857062  hypothetical protein
Bcenmc03_4465  4  34  -6.905663  hypothetical protein
Bcenmc03_4466  1  27  -5.901518  peptidase S8/S53 subtilisin kexin sedolisin
Bcenmc03_4468  1  22  -5.026904  integrase catalytic subunit
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bcenmc03_4394  HTHTETR  76  2e-19  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 76.2 bits (187), Expect = 2e-19
Identities = 30/175 (17%), Positives = 58/175 (33%), Gaps = 8/175 (4%)

Query: 1 MARTRNENLHQQRREQILTAAARVFKAKGFHGARTEDICAAADMSAGAVFRYFADKREMI 60
MAR + Q+ R+ IL A R+F +G +I AA ++ GA++ +F DK ++
Sbjct: 1 MARKTKQEA-QETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLF 59

Query: 61 DAIIAVEVERYTQDFDRILSKDGLRWLANITADE---LTGMLAQGDDGLGVDSWLELARD 117
I + + +K L+ + L + + L ++
Sbjct: 60 SEIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEF 119

Query: 118 AERRPDIVGLDR----KMRADLAGLLASGQAEGWIRPSLDPTGTTNIVFAMFNGL 168
+ R + + L + L I+ +GL
Sbjct: 120 VGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGL 174


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bcenmc03_4405  HTHTETR  66  3e-15  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 66.2 bits (161), Expect = 3e-15
Identities = 26/109 (23%), Positives = 47/109 (43%), Gaps = 1/109 (0%)

Query: 33 EPRGARRKRETRARLLDAAFVLMAQKGMEGVAINEITEAADVGFGSFYNHFESKEAIHAA 92
+ + +ETR +LD A L +Q+G+ ++ EI +AA V G+ Y HF+ K + +
Sbjct: 2 ARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61

Query: 93 VLEIVFEEFADTLDRIAGSLT-DPAEIISVSLRHTLLRARSEPVWGQFL 140
+ E+ + DP ++ L H L +E +
Sbjct: 62 IWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLM 110


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bcenmc03_4428  TCRTETA  40  1e-05  Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 39.8 bits (93), Expect = 1e-05
Identities = 73/385 (18%), Positives = 125/385 (32%), Gaps = 32/385 (8%)

Query: 30 VAPIIKRELGIDD---AQMGILFSSFFIGYCVFCFIGGWAADRFGPRRVFACAAGVWSLF 86
V P + R+L + A GIL + + + + G +DRFG R V + ++
Sbjct: 27 VLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVD 86

Query: 87 CGATALAGSFAHLLVVRVAFGIGEGPMGTTTNKAISNWFPRREAGRAVGWTNAGQPLGAA 146
A A L + R+ GI I++ E R G+ +A G
Sbjct: 87 YAIMATAPFLWVLYIGRIVAGITGATGAVAGA-YIADITDGDERARHFGFMSACFGFGM- 144

Query: 147 IAAPIVGLVALQFGWRVSFIVIAALGFVWLAAWWTLFRDDPASHPRVSPDEAREIASDRM 206
+A P++G + F F AAL L F + P +
Sbjct: 145 VAGPVLGGLMGGFSPHAPFFAAAALNG--LNFLTGCFLLPESHKGERRPLRREAL----- 197

Query: 207 IDVSPDVHAIDRAARPLLRDLLSRPVLGVALAFFSFNYVLYFFLSWLPSYLTDYQHLDIK 266
V + FF V + + D H D
Sbjct: 198 -----------NPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDAT 246

Query: 267 QMSVVGILPWLGATVGFVAGGTVSDRIYRRTGDVLFARKIVIVVGLAVAAVCVLLASRVS 326
+GI + +A ++ + R G+ R+ +++ +A +LLA
Sbjct: 247 ---TIGISLAAFGILHSLAQAMITGPVAARLGE----RRALMLGMIADGTGYILLAFATR 299

Query: 327 SLGAAVTLIAIASLFAFMAPQACWSLLQEIVPRERVGSAGGFVHLLANLAGILSPSLTGW 386
A ++ +AS P A ++L V ER G G + L +L I+ P L
Sbjct: 300 GWMAFPIMVLLAS-GGIGMP-ALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTA 357

Query: 387 LVEYGGGYASAFVLAGASALAGAVI 411
+ + + +AL +
Sbjct: 358 IYAASITTWNGWAWIAGAALYLLCL 382


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bcenmc03_4431  DHBDHDRGNASE  104  4e-29  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 104 bits (261), Expect = 4e-29
Identities = 68/251 (27%), Positives = 113/251 (45%), Gaps = 8/251 (3%)

Query: 6 GKKLLVVGGTSGIGLATAKQVLKSGGSVVLTGNRKDKAEAVRAELSGLGPVS-VIAANLM 64
GK + G GIG A A+ + G + +K E V + L + A++
Sbjct: 8 GKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVR 67

Query: 65 TEEGMNAIRAEINANHKDISLLVNSAGIFVPKAFIDHEESDYDMYLSLNRATFFITQDVV 124
++ I A I I +LVN AG+ P + +++ S+N F V
Sbjct: 68 DSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127

Query: 125 RNMLAAKRQGAIVNVGSIGAQAALGDSAASAYSMAKAGLHALTRNLAIELADAGIRVNAV 184
+ +R G+IV VGS A ++ +AY+ +KA T+ L +ELA+ IR N V
Sbjct: 128 SKYMMDRRSGSIVTVGSNPAGVP--RTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185

Query: 185 SPAIVQTSIYEGFMAKED-----IAGAMKALDSFHPLGRVGTPEDVANTIVFLLSDKTSW 239
SP +T + A E+ I G+++ + PL ++ P D+A+ ++FL+S +
Sbjct: 186 SPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGH 245

Query: 240 VTGAIWNVDAG 250
+T VD G
Sbjct: 246 ITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bcenmc03_4433  HTHTETR  65  1e-15  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 65.4 bits (159), Expect = 1e-15
Identities = 29/173 (16%), Positives = 56/173 (32%), Gaps = 6/173 (3%)

Query: 2 AVGTRDALVQAGEGLMRSMGYAAFSYADLAETVGIRKASIHHHFPTKEDLGVAIVEAYVA 61
A TR ++ L G ++ S ++A+ G+ + +I+ HF K DL I E +
Sbjct: 9 AQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSES 68

Query: 62 RVVEAF-ERIDRENEDFWGRL-NGFFDTFRASSDGSLLPL---CGALAAEMAALPPELQK 116
+ E E + D L ++ L E +Q+
Sbjct: 69 NIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQ 128

Query: 117 LTHRFFELQLCWLTKVIDKGIGDGEIPAGVGSYQKAHQVLSVLEGASFVEWAM 169
+ + + I +PA + + + A + + G W
Sbjct: 129 AQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGL-MENWLF 180


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bcenmc03_4466  SUBTILISIN  136  1e-37  Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 136 bits (344), Expect = 1e-37
Identities = 89/440 (20%), Positives = 132/440 (30%), Gaps = 138/440 (31%)

Query: 63 PSLVQYQWHLQNTGQSAFAKAAGTPGFDLDVASLFAQGETGTGVRVLVLDDGLDIHHPDL 122
P V Q N G ++ A G GV+V VLD G D HPDL
Sbjct: 9 PYQVIKQEQQVNEIP---------RGVEMIQAPAVWNQTRGRGVKVAVLDTGCDADHPDL 59

Query: 123 KDRIDSSMLYNFEANANSGDPTPLNNDAHGTTIGGIIGAT--GIGVRGVAPRVTLGGARY 180
K RI NF + + + HGT + G I AT GV GVAP L +
Sbjct: 60 KARIIGG--RNFTDDDEGDPEIFKDYNGHGTHVAGTIAATENENGVVGVAPEADLLIIKV 117

Query: 181 L-CKACDTTKNKLDAFGAASFSANADIINASFGIDSTTRVEQFNPDDSTNQNVLAARLLE 239
L + + A DII+ S G P+D + + +
Sbjct: 118 LNKQGSGQYDWIIQGIYYA-IEQKVDIISMSLG----------GPEDVPELHEAVKKAVA 166

Query: 240 KGRQGKGVVLVKAAGNDYIGIEGSTEGQCTAGVSCGNANYDPQ-NTMPQTIVVAAVNALG 298
++++ AAGN+ G + + I V A+N
Sbjct: 167 -----SQILVMCAAGNE--------------GDGDDRTDELGYPGCYNEVISVGAINFDR 207

Query: 299 KKSSYSSASSAVLVSGFGGEWGNQRAADWNGVFDPGPAILTTDLAGCGRGDVRAGNADAP 358
S +S++++ V + PG IL+T G
Sbjct: 208 HASEFSNSNNEVDLVA------------------PGEDILSTVPGGK------------- 236

Query: 359 LPSNLDGYNPFDDPGSSVARSLNPSCNYTAKMNGTSAATPTVAGVVALMLHANPNLTWRD 418
A +GTS ATP VAG +AL+ RD
Sbjct: 237 ----------------------------YATFSGTSMATPHVAGALALIKQLANASFERD 268

Query: 419 -----VRAILMKTARRIDSTRQASVMPLPDGESYTPEPTWTQNHAGFWFDNWYGFGLVDA 473
+ A L+K + ++ G GL+
Sbjct: 269 LTEPELYAQLIKRTIPLGNS-----------------------------PKMEGNGLLYL 299

Query: 474 AAAVSMARNYTTYLTGPMKS 493
A ++R + T + S
Sbjct: 300 TAVEELSRIFDTQRVAGILS 319


19  Bcenmc03_4503  Bcenmc03_4519  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias  Product
Bcenmc03_4503  0  12  3.175492  aldehyde dehydrogenase
Bcenmc03_4504  1  14  1.348731  hypothetical protein
Bcenmc03_4505  0  16  0.082366  hypothetical protein
Bcenmc03_4506  0  19  -0.420503  hypothetical protein
Bcenmc03_4507  2  21  -0.489197  serine/threonine protein kinase
Bcenmc03_4508  3  33  -4.405211  FHA domain-containing protein
Bcenmc03_4509  3  37  -6.318905  fatty acid desaturase
Bcenmc03_4510  1  33  -4.869859  hypothetical protein
Bcenmc03_4511  0  32  -4.575276  hypothetical protein
Bcenmc03_4512  1  30  -4.420173  hypothetical protein
Bcenmc03_4513  1  30  -5.473304  metal-dependent hydrolase
Bcenmc03_4514  2  32  -6.381277  fatty acid desaturase
Bcenmc03_4515  2  32  -6.117255  GH3 auxin-responsive promoter
Bcenmc03_4516  2  29  -6.028248  Rieske (2Fe-2S) domain-containing protein
Bcenmc03_4517  2  26  -5.133493  PAAR repeat-containing protein
Bcenmc03_4518  2  26  -5.014524  hypothetical protein
Bcenmc03_4519  2  23  -3.695857  hypothetical protein
20  Bcenmc03_4532  Bcenmc03_4542  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
Bcenmc03_4532  2  10  -0.312786  tartrate dehydrogenase
Bcenmc03_4533  2  12  -0.651904  hypothetical protein
Bcenmc03_4534  3  12  -1.188540  TetR family transcriptional regulator
Bcenmc03_4535  2  15  -1.869392  L-threonine 3-dehydrogenase
Bcenmc03_4536  3  13  -1.815099  2-amino-3-ketobutyrate coenzyme A ligase
Bcenmc03_4537  3  14  -2.572929  XRE family transcriptional regulator
Bcenmc03_4538  4  17  -3.701437  hypothetical protein
Bcenmc03_4539  4  16  -3.834928  integrase family protein
Bcenmc03_4540  4  15  -3.742139  hypothetical protein
Bcenmc03_4541  3  11  -3.952466  hypothetical protein
Bcenmc03_4542  1  12  -4.178935  parB-like partition protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bcenmc03_4534  HTHTETR  69  1e-16  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 68.9 bits (168), Expect = 1e-16
Identities = 28/183 (15%), Positives = 56/183 (30%), Gaps = 27/183 (14%)

Query: 14 RDRLLDAAEALIYSGGIHATGVDAIVKRSGAARKSFYSHFESKEALVVAALERRDERWMR 73
R +LD A L G+ +T + I K +G R + Y HF+ K L E +
Sbjct: 13 RQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGE 72

Query: 74 WFVDGTLARGKAPRAQLLGMFDVLRDWFGQPDFHGCAFLNASGEIPDADDPVRVVARMHK 133
++ P + L + + + + +
Sbjct: 73 LELEYQAKFPGDPLSVLREILIHVLESTVTEE---------------------------R 105

Query: 134 ARLLAFVRERFDAYADETGIERRGLARLARQWLVLIDGAIGVALVSGDANAARDARATAE 193
RLL + + E + ++ L + I+ + + + A R A
Sbjct: 106 RRLLMEIIFHKCEFVGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAI 165

Query: 194 LLL 196
++
Sbjct: 166 IMR 168


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4540  RTXTOXIND     42     2e-06    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 42.1 bits (99), Expect = 2e-06
Identities = 20/162 (12%), Positives = 50/162 (30%), Gaps = 14/162 (8%)

Query: 92 AGALLGALYEEALKAARDSLDADREQVRANMADAEQRLRDATIRQETLEGALARGEARNE 151
+ EE + + + E L + T+ + R E +
Sbjct: 172 DEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSR 231

Query: 152 QLQARVTELEVQLASQTTHGSASEATLLT---TVARLEKELAAAAGRIDAEQAQNAALRD 208
++R+ + L + + ++ +L EL ++ +
Sbjct: 232 VEKSRLDDFS-SLLHK---QAIAKHAVLEQENKYVEAVNELRVYKSQL-------EQIES 280

Query: 209 RIDALQAELQQRTEHYAQQIKDAVAEAERRVKPMLVELDSLR 250
I + + E Q T+ + +I D + + + + +EL
Sbjct: 281 EILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNE 322


21  Bcenmc03_4626  Bcenmc03_4707  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias     Product
Bcenmc03_4626  0       26       -5.365348   cupin 2 domain-containing protein
Bcenmc03_4627  2       39       -8.684937   alkylhydroperoxidase
Bcenmc03_4628  3       43       -9.023782   ECF subfamily RNA polymerase sigma-24 factor
Bcenmc03_4629  5       34       -5.713080   diaminopimelate epimerase
Bcenmc03_4630  6       38       -6.419519   hypothetical protein
Bcenmc03_4631  6       37       -6.071384   hypothetical protein
Bcenmc03_4632  5       38       -6.599368   hypothetical protein
Bcenmc03_4633  5       35       -5.710450   histone family protein nucleoid-structuring
Bcenmc03_4636  5       35       -5.998405   YadA domain-containing protein
Bcenmc03_4637  5       45       -9.900121   hypothetical protein
Bcenmc03_4638  5       42       -9.386307   hypothetical protein
Bcenmc03_4640  6       42       -9.674971   PAS/PAC sensor hybrid histidine kinase
Bcenmc03_4641  7       36       -8.732165   two component LuxR family transcriptional
Bcenmc03_4643  7       34       -8.634874   hypothetical protein
Bcenmc03_4644  7       33       -7.867634   type III secretion FHIPEP protein
Bcenmc03_4645  8       36       -8.104576   flagellar biosynthesis protein FlhB
Bcenmc03_4646  7       34       -7.901587   flagellar biosynthetic protein FliR
Bcenmc03_4647  6       33       -7.873698   flagellar biosynthetic protein FliQ
Bcenmc03_4648  4       31       -6.800126   flagellar biosynthesis protein FliP
Bcenmc03_4649  4       33       -6.767784   flagellar motor switch protein FliN
Bcenmc03_4650  5       32       -6.292038   surface presentation of antigens (SPOA) protein
Bcenmc03_4651  4       30       -6.106193   flagellin domain-containing protein
Bcenmc03_4652  4       32       -5.372056   two component transcriptional regulator
Bcenmc03_4653  4       31       -4.594121   histidine kinase
Bcenmc03_4654  4       32       -4.712082   flagellar hook-basal body complex subunit FliE
Bcenmc03_4655  5       30       -4.967244   flagellar MS-ring protein
Bcenmc03_4656  3       30       -5.292538   flagellar motor switch protein G
Bcenmc03_4657  3       33       -5.840998   flagellar assembly protein FliH
Bcenmc03_4658  2       34       -5.728248   FliI/YscN family ATPase
Bcenmc03_4659  3       35       -6.565883   hypothetical protein
Bcenmc03_4660  6       33       -7.074113   flagellar hook-associated 2 domain-containing
Bcenmc03_4661  6       34       -6.467412   flagellar protein FliS
Bcenmc03_4662  7       36       -6.329059   hypothetical protein
Bcenmc03_4663  7       35       -5.771585   flagellar hook-length control protein
Bcenmc03_4664  8       32       -5.967681   RNA polymerase sigma-28 subunit FliA/WhiG
Bcenmc03_4665  8       35       -5.202861   MotA/TolQ/ExbB proton channel
Bcenmc03_4666  6       36       -4.539415   OmpA/MotB domain-containing protein
Bcenmc03_4667  7       35       -4.614608   hypothetical protein
Bcenmc03_4668  7       32       -4.836734   hypothetical protein
Bcenmc03_4669  5       27       -5.359192   mannosyl-glycoprotein
Bcenmc03_4670  5       27       -5.785672   flagella basal body P-ring formation protein
Bcenmc03_4671  7       27       -6.422977   flagellar basal-body rod protein FlgB
Bcenmc03_4672  7       26       -5.615016   flagellar basal-body rod protein FlgC
Bcenmc03_4673  7       27       -5.595163   flagellar basal body rod modification protein
Bcenmc03_4674  7       27       -5.687277   flagellar basal body FlaE domain-containing
Bcenmc03_4675  8       28       -5.361608   flagellar basal-body rod protein FlgF
Bcenmc03_4676  7       29       -5.871846   flagellar basal body rod protein FlgG
Bcenmc03_4677  6       29       -4.996515   flagellar basal body L-ring protein
Bcenmc03_4678  7       28       -5.427399   flagellar P-ring protein
Bcenmc03_4679  7       26       -5.496866   hypothetical protein
Bcenmc03_4680  8       30       -6.495526   flagellar hook-associated protein FlgK
Bcenmc03_4681  7       32       -6.848329   flagellar hook-associated protein 3
Bcenmc03_4682  10      37       -6.952389   hypothetical protein
Bcenmc03_4683  9       32       -7.396316   hypothetical protein
Bcenmc03_4684  8       36       -6.954068   hypothetical protein
Bcenmc03_4685  8       36       -7.090372   hypothetical protein
Bcenmc03_4686  7       32       -6.109131   hypothetical protein
Bcenmc03_4687  6       34       -6.545532   hypothetical protein
Bcenmc03_4688  6       38       -7.624806   hypothetical protein
Bcenmc03_4689  6       43       -8.562506   hypothetical protein
Bcenmc03_4690  5       42       -8.553725   hypothetical protein
Bcenmc03_4691  5       46       -9.427187   hypothetical protein
Bcenmc03_4692  3       44       -8.892909   two component LuxR family transcriptional
Bcenmc03_4693  4       44       -8.760299   histidine kinase
Bcenmc03_4696  4       45       -9.818213   4-oxalocrotonate tautomerase
Bcenmc03_4698  4       44       -8.859962   hypothetical protein
Bcenmc03_4699  4       47       -10.102885  two component LuxR family transcriptional
Bcenmc03_4701  3       42       -8.391121   multi-sensor hybrid histidine kinase
Bcenmc03_4702  5       45       -9.353708   two component LuxR family transcriptional
Bcenmc03_4703  5       46       -9.367388   hypothetical protein
Bcenmc03_4704  3       41       -7.902060   YadA domain-containing protein
Bcenmc03_4705  1       25       -6.190588   HvnC; halovibrin
Bcenmc03_4706  -1      18       -4.328495   integrase family protein
Bcenmc03_4707  0       13       -5.084528   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4636  OMADHESIN     68     5e-13    Yersinia outer membrane adhesin signature.
		>OMADHESIN#Yersinia outer membrane adhesin signature.

Length = 455

Score = 67.6 bits (164), Expect = 5e-13
Identities = 63/158 (39%), Positives = 91/158 (57%), Gaps = 11/158 (6%)

Query: 309 GKDAYATG-NNIAVGMGAKADTGGA---GTGVIAIGEGAAAGRPGYS--GAIAIGYGARA 362
G +A A G ++IA+G A+A G A G G IA G + A P G A+ YGA +
Sbjct: 62 GLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAAS 121

Query: 363 VDGTQGGYLRPVAIGAGAQAQGVSVAIGPDAQSSPGNGVALGHQASVGADATWAVALGTG 422
G VAIGA A VA+G ++++ N VA+GH + V A+ +++A+G
Sbjct: 122 TAQKDG-----VAIGARASTSDTGVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDR 176

Query: 423 SSADRADTVSVGNAVSQRQIVNVAAGTQGTDAVNVAQL 460
S DR ++VS+G+ RQ+ ++AAGT+ TDAVNVAQL
Sbjct: 177 SKTDRENSVSIGHESLNRQLTHLAAGTKDTDAVNVAQL 214



Score = 63.8 bits (154), Expect = 8e-12
Identities = 58/178 (32%), Positives = 97/178 (54%), Gaps = 9/178 (5%)

Query: 1088 AIGVGTIASGANTVAIGVRSYANSDGAVAIGNMAQTGASQPNSVAIGSNVTTNGASALAV 1147
A+G+ A G+ + A ++AIG A+ A++ +VA+G+ G +++A+
Sbjct: 46 ALGLEYPVRPPVPGAGGLNASAKGIHSIAIGATAE--AAKGAAVAVGAGSIATGVNSVAI 103

Query: 1148 GSQAKANGDNAIALGNNNVMAVGEGSIAIGNKAVSAAGTTNGIALGAGANVARSVADSMA 1207
G +KA GD+A+ G + + +AIG +A +T+ + G N +S+A
Sbjct: 104 GPLSKALGDSAVTYGAASTAQ--KDGVAIGARA-----STSDTGVAVGFNSKADAKNSVA 156

Query: 1208 LGAKSSVEKGANGAVALGTGSKATRANTVSVGNTGTERQIVNVAAGTQGTDAVNVAQL 1265
+G S V ++A+G SK R N+VS+G+ RQ+ ++AAGT+ TDAVNVAQL
Sbjct: 157 IGHSSHVAANHGYSIAIGDRSKTDRENSVSIGHESLNRQLTHLAAGTKDTDAVNVAQL 214



Score = 61.1 bits (147), Expect = 5e-11
Identities = 56/162 (34%), Positives = 91/162 (56%), Gaps = 9/162 (5%)

Query: 2044 GLNTGALSNESVAIGNMAQTGSDQPFSVAIGSWATTNGAHALAIGSHAKANGENAVAVGS 2103
GLN A S+AIG A+ + +VA+G+ + G +++AIG +KA G++AV G+
Sbjct: 62 GLNASAKGIHSIAIGATAEAA--KGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGA 119

Query: 2104 NGIKAIGASSIAIGNAAEASVGATNGIALGTGASVEPNVTDAMALGANTIVDDKANGAVA 2163
+AIG A S G+A+G + + +++A+G ++ V ++A
Sbjct: 120 ASTAQ--KDGVAIGARASTS---DTGVAVGFNSKAD--AKNSVAIGHSSHVAANHGYSIA 172

Query: 2164 LGAGSKATRANTISVGSAGSERQIVNIAAGTQSTDAVNVAQL 2205
+G SK R N++S+G RQ+ ++AAGT+ TDAVNVAQL
Sbjct: 173 IGDRSKTDRENSVSIGHESLNRQLTHLAAGTKDTDAVNVAQL 214



Score = 60.7 bits (146), Expect = 7e-11
Identities = 56/162 (34%), Positives = 90/162 (55%), Gaps = 9/162 (5%)

Query: 2975 GLNTGALSNESVAIGNMAQTGSDQPFSVAIGSWVTTNGAHALAIGSHAKANGENAVAVGS 3034
GLN A S+AIG A+ + +VA+G+ G +++AIG +KA G++AV G+
Sbjct: 62 GLNASAKGIHSIAIGATAEAA--KGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGA 119

Query: 3035 NGIKAIGASSIAIGNAAEASVGATNGIALGTGASVEPNVTDAMALGANTIVDDKANGAVA 3094
+AIG A S G+A+G + + +++A+G ++ V ++A
Sbjct: 120 ASTAQ--KDGVAIGARASTS---DTGVAVGFNSKAD--AKNSVAIGHSSHVAANHGYSIA 172

Query: 3095 LGASSKATRANTISVGSAGSERQIVNIAAGTQSTDAVNVAQL 3136
+G SK R N++S+G RQ+ ++AAGT+ TDAVNVAQL
Sbjct: 173 IGDRSKTDRENSVSIGHESLNRQLTHLAAGTKDTDAVNVAQL 214



Score = 57.6 bits (138), Expect = 6e-10
Identities = 56/149 (37%), Positives = 78/149 (52%), Gaps = 20/149 (13%)

Query: 75 AVRSVAIGLNAVAGRFSQVVIGDGASASEDYAVAIGVNANGAGQYGVAVGEDASAHEAAV 134
+ S+AIG A A + + V +G G+ A+ +VAIG + G V G ++A + V
Sbjct: 69 GIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAASTAQKDGV 128

Query: 135 AIGAGAVAQDEGVAIGVRATA-VSGSVAIGHD----------------AKADRVDTVSVG 177
AIGA A D GVA+G + A SVAIGH +K DR ++VS+G
Sbjct: 129 AIGARASTSDTGVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDRENSVSIG 188

Query: 178 GGKWGPPQRQIVNVAAGTQDTDVVNVGQL 206
RQ+ ++AAGT+DTD VNV QL
Sbjct: 189 HESL---NRQLTHLAAGTKDTDAVNVAQL 214



Score = 48.4 bits (114), Expect = 4e-07
Identities = 62/168 (36%), Positives = 89/168 (52%), Gaps = 19/168 (11%)

Query: 891 AAGVAETDAVNVGQLSDATKSIRDSLSDGSLSMRYIKVKATGQAANPMGTNTVAIGAGAN 950
A A+ AV VG S AT +S++ G LS T AA+ + VAIGA A+
Sbjct: 78 TAEAAKGAAVAVGAGSIATGV--NSVAIGPLSKALGDSAVTYGAASTAQKDGVAIGARAS 135

Query: 951 ATGNGSLALGTGSRANGLNSVAIGFNS-VATD---------------ANQVSVGDIGNER 994
+ G +A+G S+A+ NSVAIG +S VA + N VS+G R
Sbjct: 136 TSDTG-VAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDRENSVSIGHESLNR 194

Query: 995 RISNVADGTEDTDAVNVNQLTEAIEKMSARTDKLSSELKSRHSSLMAN 1042
+++++A GT+DTDAVNV QL + IEK T+K S+EL + ++ N
Sbjct: 195 QLTHLAAGTKDTDAVNVAQLKKEIEKTQENTNKRSAELLANANAYADN 242



Score = 38.7 bits (89), Expect = 4e-04
Identities = 71/296 (23%), Positives = 125/296 (42%), Gaps = 15/296 (5%)

Query: 2754 GSVTLGGAGAAAPVALKNVAAGVDDTDAVNVGQLNTGLSDMKRELAEGNIDLKYIKVRAD 2813
G+ GAA V ++A GV+ +V +G L+ L D + K
Sbjct: 76 GATAEAAKGAAVAVGAGSIATGVN---SVAIGPLSKALGDSAVTYGAASTAQKDGVAIGA 132

Query: 2814 GAPATATGAQSVAIGSKALAGGPNSLALGAGARALGNG--SVALGSNSIATEPMTVSVGD 2871
A + TG VA+G + A NS+A+G + N S+A+G S +VS+G
Sbjct: 133 RASTSDTG---VAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDRENSVSIGH 189

Query: 2872 DGTERKIIHVKAGDVTAKSTDAINGSQLFDALGQLKSHVAAEQSHLLSRVNALADSGEPN 2931
+ R++ H+ AG K TDA+N +QL + + + + + LL+ NA AD+ +
Sbjct: 190 ESLNRQLTHLAAG---TKDTDAVNVAQLKKEIEKTQENTNKRSAELLANANAYADNKSSS 246

Query: 2932 SLVVVEGMGGTNTA-SLSGGDPESTTAAAIGVEAHAAGANAIALGLNTGALSNESVAIGN 2990
L + + +A +L E+ + + A +N++A A + +
Sbjct: 247 VLGIANNYTDSKSAETLENARKEAFAQSKDVLNMAKAHSNSVARTTLETAEEHANSVART 306

Query: 2991 MAQTGSDQPFSVAIGSWVTTNGAHALAIGSHA--KANGENAVAVGSNGIKAIGASS 3044
+T + + + + N +A + SH AN V V ++ KAI S+
Sbjct: 307 TLETAEEHANKKSAEALASAN-VYADSKSSHTLKTANSYTDVTVSNSTKKAIRESN 361



Score = 38.7 bits (89), Expect = 4e-04
Identities = 43/147 (29%), Positives = 61/147 (41%), Gaps = 14/147 (9%)

Query: 3509 AGQTVMAANAGNGSNNVAIGSSSTISDDAGNATAVGANSTVRAAG------------GTA 3556
A V A + G N+VAIG S D+ A GA ST + G G A
Sbjct: 85 AAVAVGAGSIATGVNSVAIGPLSKALGDS--AVTYGAASTAQKDGVAIGARASTSDTGVA 142

Query: 3557 IGAGADAHAANSTAIGQGSKARGDNSVAIGAGSVANEDNTVSFGDGSDQGKRRIVNIADG 3616
+G + A A NS AIG S ++ +I G + D S G + R++ ++A G
Sbjct: 143 VGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDRENSVSIGHESLNRQLTHLAAG 202

Query: 3617 VNASDAATKGQLDRAVGGLQGQINGVS 3643
+DA QL + + Q N S
Sbjct: 203 TKDTDAVNVAQLKKEIEKTQENTNKRS 229



Score = 37.6 bits (86), Expect = 0.001
Identities = 24/56 (42%), Positives = 33/56 (58%)

Query: 3539 NATAVGANSTVRAAGGTAIGAGADAHAANSTAIGQGSKARGDNSVAIGAGSVANED 3594
++ A+GA + A+GAG+ A NS AIG SKA GD++V GA S A +D
Sbjct: 71 HSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAASTAQKD 126



Score = 36.0 bits (82), Expect = 0.003
Identities = 27/64 (42%), Positives = 36/64 (56%)

Query: 3544 GANSTVRAAGGTAIGAGADAHAANSTAIGQGSKARGDNSVAIGAGSVANEDNTVSFGDGS 3603
G N++ + AIGA A+A + A+G GS A G NSVAIG S A D+ V++G S
Sbjct: 62 GLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAAS 121

Query: 3604 DQGK 3607
K
Sbjct: 122 TAQK 125



Score = 35.6 bits (81), Expect = 0.004
Identities = 75/385 (19%), Positives = 136/385 (35%), Gaps = 24/385 (6%)

Query: 3327 ASVTLGDAGTAVGLHNVATGAVSATSTDAVNGSQLHGMATSVANAIGGDTTVDENGQVAV 3386
A+V +G A G+++VA G +S D+ A AIG + + G
Sbjct: 85 AAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAASTAQKDGVAIGARASTSDTGVAVG 144

Query: 3387 NSIEVGGHKYATVSQAVQAAAAYGATDSLAVRYDVDSHGNPNYGSVTLGGPAAAPVTLTN 3446
+ + + + AA +G + ++ R D + + G +L LT+
Sbjct: 145 FNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDRENSVSIGHESLNR------QLTH 198

Query: 3447 VADGKSQYDAVNYGQLSSLQSDFENRLGSMDDRVSKIETTGGGSGGEARRTVSNDLISGS 3506
+A G DAVN QL ++ A N S
Sbjct: 199 LAAGTKDTDAVNVAQLKK----------EIEKTQENTNKRSAELLANANAYADNKSSSVL 248

Query: 3507 GDAGQTVMAANAGNGSNNVAIGSSSTISDDAGNATAVGANSTVRAAGGTAI-GAGADAHA 3565
G A + +A N A + S D N +NS R TA A + A
Sbjct: 249 GIANNYTDSKSAETLEN--ARKEAFAQSKDVLNMAKAHSNSVARTTLETAEEHANSVART 306

Query: 3566 ANSTAIGQGSKARGDNSVAIGAGSVANEDNTVSFGDGSDQGKRRIVNIADGVNASDAATK 3625
TA +K + + + + +T+ + V +++ + +
Sbjct: 307 TLETAEEHANKKSAEALASANVYADSKSSHTLKTANSYTD-----VTVSNSTKKAIRESN 361

Query: 3626 GQLDRAVGGLQGQINGVSRNAYSGIAAATALTMIPGVDPGKTLSFGIGSASYKGYQAVAF 3685
D L +++ + G+A++ AL + ++F G Y+ QA+A
Sbjct: 362 QYTDHKFRQLDNRLDKLDTRVDKGLASSAALNSLFQPYGVGKVNFTAGVGGYRSSQALAI 421

Query: 3686 GGEARINKNLKMKAGVGLSSGGNTV 3710
G R+N+N+ +KAGV + + +
Sbjct: 422 GSGYRVNENVALKAGVAYAGSSDVM 446



Score = 34.1 bits (77), Expect = 0.012
Identities = 26/59 (44%), Positives = 36/59 (61%)

Query: 930 ATGQAANPMGTNTVAIGAGANATGNGSLALGTGSRANGLNSVAIGFNSVATDANQVSVG 988
A G A+ G +++AIGA A A ++A+G GS A G+NSVAIG S A + V+ G
Sbjct: 60 AGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYG 118



Score = 33.7 bits (76), Expect = 0.015
Identities = 32/133 (24%), Positives = 65/133 (48%), Gaps = 4/133 (3%)

Query: 1983 AAAASGSGGGGKGPSFVTIDGMGSDGSRFNTASITTGDPESTTAAAIGVDAHAAGAN-AI 2041
AA A G+G G + V I G +++T G + + + A A+ ++ +
Sbjct: 85 AAVAVGAGSIATGVNSVAI---GPLSKALGDSAVTYGAASTAQKDGVAIGARASTSDTGV 141

Query: 2042 ALGLNTGALSNESVAIGNMAQTGSDQPFSVAIGSWATTNGAHALAIGSHAKANGENAVAV 2101
A+G N+ A + SVAIG+ + ++ +S+AIG + T+ ++++IG + +A
Sbjct: 142 AVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDRENSVSIGHESLNRQLTHLAA 201

Query: 2102 GSNGIKAIGASSI 2114
G+ A+ + +
Sbjct: 202 GTKDTDAVNVAQL 214



Score = 33.7 bits (76), Expect = 0.015
Identities = 27/66 (40%), Positives = 37/66 (56%)

Query: 2812 ADGAPATATGAQSVAIGSKALAGGPNSLALGAGARALGNGSVALGSNSIATEPMTVSVGD 2871
A G A+A G S+AIG+ A A ++A+GAG+ A G SVA+G S A V+ G
Sbjct: 60 AGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGA 119

Query: 2872 DGTERK 2877
T +K
Sbjct: 120 ASTAQK 125


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4638  BACINVASINB   35     4e-04    Salmonella/Shigella invasin protein B signature.
		>BACINVASINB#Salmonella/Shigella invasin protein B signature.

Length = 593

Score = 34.7 bits (79), Expect = 4e-04
Identities = 37/129 (28%), Positives = 60/129 (46%), Gaps = 13/129 (10%)

Query: 3 QLNVTEGVLASSLQVLTTFDLIWIVAALVLGGMAKGITGIGVPL-VAMPIVS-----QFM 56
+ N G + L L T ++ +VAA+ GG + + +G+ + VA IV F+
Sbjct: 309 ETNRIMGCIGKVLGALLT--IVSVVAAVFTGGASLALAAVGLAVMVADEIVKAATGVSFI 366

Query: 57 -----PIRDAVLLLSMPIILGNIPQALEGGQVLATARKIAAPIAGTVFGNIAGVAILLSL 111
PI + VL M +I I +ALEG V ++A I G + IA VA+++ +
Sbjct: 367 QQALNPIMEHVLKPLMELIGKAITKALEGLGVDKKTAEMAGSIVGAIVAAIAMVAVIVVV 426

Query: 112 NSGHAQAAS 120
AA+
Sbjct: 427 AVVGKGAAA 435


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4640  HTHFIS        80     2e-17    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 79.9 bits (197), Expect = 2e-17
Identities = 36/119 (30%), Positives = 52/119 (43%), Gaps = 3/119 (2%)

Query: 953 SGLQILVVDDHPVNRLVTKAQLERLGYTAVAVSNGMDALRVLDNSDFALILTDCAMPEMD 1012
+G ILV DD R V L R GY SN R + D L++TD MP+ +
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 1013 GYELTKRIRSREHRSRDTPIVALTANALPDEAIRCAEAGMDGLLIKPTTLAVLRDQLAH 1071
++L RI+ D P++ ++A AI+ +E G L KP L L +
Sbjct: 62 AFDLLPRIKKARP---DLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4641  HTHFIS        50     1e-09    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 50.2 bits (120), Expect = 1e-09
Identities = 25/119 (21%), Positives = 49/119 (41%), Gaps = 4/119 (3%)

Query: 4 RVVLADDHPIMLLGCRILIEQGGLEVVGEARDSRELMSILARVACDVVITDFSMPNTGRV 63
+++ADD + + + G +V ++ L +A D+V+TD MP+
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDE--- 60

Query: 64 DGLPMLSMIRREHVALPVIVLTNMANAGLLRAMLNEGVLGIVEKGAERNELFAAVRAAL 122
+ +L I++ LPV+V++ +G + K + EL + AL
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4644  PF04647       29     0.042    Accessory gene regulator B
		>PF04647#Accessory gene regulator B

Length = 212

Score = 29.4 bits (66), Expect = 0.042
Identities = 16/119 (13%), Positives = 36/119 (30%), Gaps = 20/119 (16%)

Query: 12 FAAPLALFTILAMVILPLPPAALDVMFTFNIVLSIVVVMV---AVTVKRP---------L 59
L +F +LA + + PA ++ + S++ ++ + L
Sbjct: 81 TLTSLLVFNVLAYIAHLIDPAYFQLLILIAFITSLLALLFLVPVDNPRNLISNTEQRKTL 140

Query: 60 DFSAF-PTVILAATLMRLTLNVASTRVVLLNGYTGASAAGQVIESFANVVIGSNFVVGL 117
++L + A G + ++F +G F+VG
Sbjct: 141 KLKTSMVLMVLFGGSIGAYRLYTHQ-------IALAILLGVLWQTFTLTALGHKFIVGW 192


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4645  TYPE3IMSPROT  292    5e-99    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 292 bits (749), Expect = 5e-99
Identities = 97/349 (27%), Positives = 178/349 (51%), Gaps = 7/349 (2%)

Query: 6 TGDKTEKATPQKLRKARMEGQVARSRDIGTCVGILVALKLIVVLTPAWLVELKHIFALSF 65
+G+KTE+ TP+K+R AR +GQVA+S+++ + I+ +++ L+ + + +
Sbjct: 2 SGEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIPA 61

Query: 66 ADLSGDDRLGNAVSMLFPAAVLLMCKMLAPL---AAVPAGIIVASLIPGGWIISHKNIMP 122
A+S + +L + PL AA+ A I + ++ G++IS + I P
Sbjct: 62 E--QSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMA--IASHVVQYGFLISGEAIKP 117

Query: 123 KLNRLNPLSGLKRLVSGKHYMQFGTTVLKALVLMATLFIVCRSNLSGFIRLQGAPLADAL 182
+ ++NP+ G KR+ S K ++F ++LK ++L ++I+ + NL ++L +
Sbjct: 118 DIKKINPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECIT 177

Query: 183 TGGANLFLSSAVTLSCIITVFALVDIPVQQIIFKRGQRMSKRDIKEEMKQSEGRPEVKSR 242
+ V + V ++ D + + + +MSK +IK E K+ EG PE+KS+
Sbjct: 178 PLLGQILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSK 237

Query: 243 IRQIQRQLARQGIRKTVPTADLVVMNPTHYAVALKYDVTRAQAPYVVAKGVDEVALFIRD 302
RQ +++ + +R+ V + +VV NPTH A+ + Y P V K D +R
Sbjct: 238 RRQFHQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRK 297

Query: 303 VARGHNVEVLELPPLARAIYHTSQVNQQIPAELYRAVAQVLSYVLQIKA 351
+A V +L+ PLARA+Y + V+ IPAE A A+VL ++ +
Sbjct: 298 IAEEEGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQNI 346


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4646  TYPE3IMRPROT  118    2e-34    Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature.

Length = 261

Score = 118 bits (298), Expect = 2e-34
Identities = 81/248 (32%), Positives = 127/248 (51%), Gaps = 1/248 (0%)

Query: 7 QLLPLANTIFWPFCRIAAALAASPILGDVMVPVRLRLLIALFLALAIQPGIPTMPVIDLL 66
Q L N FWP R+ A ++ +PIL + VP R++L +A+ + AI P +P V +
Sbjct: 8 QWLSWLNLYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPANDVP-VF 66

Query: 67 QLEGVAAMAEQVLIGGLLGFVFHLVLCALQIFGTIASSQLGLSMAQINDPMNGQMADVLT 126
+ +Q+LIG LGF A++ G I Q+GLS A DP + VL
Sbjct: 67 SFFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHLNMPVLA 126

Query: 127 SVMYVVFILLFFAVDGHLILTSVIARSFAVWPVGRFAFDLDALKHLAFAVGWIFSAAVAL 186
+M ++ +LLF +GHL L S++ +F P+G + +A L A IF + L
Sbjct: 127 RIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLIFLNGLML 186

Query: 187 ALPVMFATLVVQVGLGLLNRVAPALNIFALGFSITTMFGLLLLTLLLPSLPDHYGRMVEH 246
ALP++ L + + LGLLNR+AP L+IF +GF +T G+ L+ L+P + +
Sbjct: 187 ALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFCEHLFSE 246

Query: 247 VLELYDRL 254
+ L +
Sbjct: 247 IFNLLADI 254


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4647  TYPE3IMQPROT  53     5e-13    Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature.

Length = 86

Score = 52.8 bits (127), Expect = 5e-13
Identities = 23/79 (29%), Positives = 36/79 (45%)

Query: 9 GFAVDALRLVLVIILVLITPGLITGVLVAIFQAATQINEQTMSFLPRLITTLVALALAGP 68
AL LVL++ I G+LV +FQ TQ+ EQT+ F +L+ + L L
Sbjct: 6 FAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLFLLSG 65

Query: 69 WMTGRVMHYTVEIFSRAAQ 87
W ++ Y ++ A
Sbjct: 66 WYGEVLLSYGRQVIFLALA 84


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4648  FLGBIOSNFLIP  205    2e-68    Escherichia coli: Flagellar biosynthetic protein Fl...
		>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature.

Length = 245

Score = 205 bits (523), Expect = 2e-68
Identities = 108/242 (44%), Positives = 151/242 (62%), Gaps = 3/242 (1%)

Query: 6 LLRYLGLALAAFAMPALV---QAETLTLASDGVGGQGFTVKTQILVLMTLLGLLPALLMT 62
+ R L +A + + Q +T GGQ +++ Q LV +T L +PA+L+
Sbjct: 1 MRRLLSVAPVLLWLITPLAFAQLPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILLM 60

Query: 63 MTSFLRYVIVLSLVKQALGLQQGLPGRIVTGVALVLTMLTMRPVGEEIWQKAFVPYDQGK 122
MTSF R +IV L++ ALG P +++ G+AL LT M PV ++I+ A+ P+ + K
Sbjct: 61 MTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEEK 120

Query: 123 ISMQTALATSEQPLGRYMLAQTNKATLAQMAKLSGTEKVMDPEKQPFLVKLSAFVLSELK 182
ISMQ AL QPL +ML QT +A L A+L+ T + PE P + L A+V SELK
Sbjct: 121 ISMQEALEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSELK 180

Query: 183 TAFQMGAMLFIPFLIVDIIVASVLMAMGMMMLSPLVISLPLKLLLFVLVDGWSLTVNTLV 242
TAFQ+G +FIPFLI+D+++ASVLMA+GMMM+ P I+LP KL+LFVLVDGW L V +L
Sbjct: 181 TAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLA 240

Query: 243 TS 244
S
Sbjct: 241 QS 242


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4649  FLGMOTORFLIN  74     3e-20    Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 73.8 bits (181), Expect = 3e-20
Identities = 31/72 (43%), Positives = 47/72 (65%)

Query: 34 MRMLRRIPVRLTLEVGEATVPLADLLSYETGSTVELNRLAGEPLVIKVNGTPVGLGEVVV 93
+ ++ IPV+LT+E+G + + +LL GS V L+ LAGEPL I +NG + GEVVV
Sbjct: 54 IDLIMDIPVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVV 113

Query: 94 SGEHYGLRIIEL 105
+ YG+RI ++
Sbjct: 114 VADKYGVRITDI 125


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4650  FLGMOTORFLIN  36     3e-05    Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 36.4 bits (84), Expect = 3e-05
Identities = 26/91 (28%), Positives = 40/91 (43%), Gaps = 19/91 (20%)

Query: 194 ALDDMWLDHLFARLDAQHLRPAPDSAQANVS----------------IPVTISVHVLSKN 237
ALDD+W D L A + A D+ + IPV ++V +
Sbjct: 14 ALDDLWADAL-NEQKATTTKSAADAVFQQLGGGDVSGAMQDIDLIMDIPVKLTVELGRTR 72

Query: 238 MRLDELLRMRPGDVLPV-RLP-ETVDVLVNN 266
M + ELLR+ G V+ + L E +D+L+N
Sbjct: 73 MTIKELLRLTQGSVVALDGLAGEPLDILING 103


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4651  FLAGELLIN     110    2e-29    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 110 bits (276), Expect = 2e-29
Identities = 59/290 (20%), Positives = 116/290 (40%), Gaps = 25/290 (8%)

Query: 6 TNAAAMNIKKAIGSTSNSLNTTMTRLGTGLRINSAKDDAAGLQIAVRLQAQTRGMGMAMQ 65
TN+ ++ + + + +SL++ + RL +GLRINSAKDDAAG IA R + +G+ A +
Sbjct: 6 TNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQASR 65

Query: 66 NTQNASSMLQTADGAMKEVTNILYRMKDLATQAADGSSSANEKTAMQAEYDALGKELSNI 125
N + S+ QT +GA+ E+ N L R+++L+ QA +G++S ++ ++Q E +E+ +
Sbjct: 66 NANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEIDRV 125

Query: 126 MKNTTYGGAKLLQKEVKNKDDGTVVSEGGRLTKEITFQIGATKDETMAADFSKHVANAHD 185
T + G K+L + ++ Q+GA ET+ D K +
Sbjct: 126 SNQTQFNGVKVLSQ-----------------DNQMKIQVGANDGETITIDLQKIDVKS-- 166

Query: 186 KFEGLSASYTGPEAGKTEEPGKELTDNANATIDLINNVLDDVGALRSAIGAAENRLAHTH 245
G +E ++ + + R + + T
Sbjct: 167 ------LGLDGFNVNGPKEATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTA 220

Query: 246 NNLANMSTNTADAEGRIMDADMASESAKMSSQQVLLQASMSMLKQTSSMN 295
+ + A D + + + + ++
Sbjct: 221 PTVPDKVYVNAANGQLTTDDAENNTAVDLFKTTKSTAGTAEAKAIAGAIK 270



Score = 75.1 bits (184), Expect = 3e-17
Identities = 58/306 (18%), Positives = 103/306 (33%), Gaps = 8/306 (2%)

Query: 6 TNAAAMNIKKAIGSTSNSLNTTMTRLGTGLRINSAKDDAAGLQIAVRLQAQTRGMGMAMQ 65
N +++ T + T ++ D A AV L T+ +
Sbjct: 202 ANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDDAENNTAVDLFKTTKSTAGTAE 261

Query: 66 NTQNASSMLQTADGAMKEVTNILYRMKDLATQAADG--SSSANEKTAMQAEYDALGKELS 123
A ++ +G + + + + +G S++ N + D +
Sbjct: 262 AKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVSTTINGEKVTLTVADITAGAAN 321

Query: 124 NIMKNTTYGGAKLLQKEVKNKDDGTVVSEGGRLTKEITFQIGATKDETMAADFSKHVANA 183
++ + + + +++ ANA
Sbjct: 322 VDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEANNAVKGESKITVNGAEYTANA 381

Query: 184 HDKFEGLSASYTGPEAGKTEEPGKELTDN------ANATIDLINNVLDDVGALRSAIGAA 237
L+ + + D + I++ L V A+RS++GA
Sbjct: 382 AGDKVTLAGKTMFIDKTASGVSTLINEDAAAAKKSTANPLASIDSALSKVDAVRSSLGAI 441

Query: 238 ENRLAHTHNNLANMSTNTADAEGRIMDADMASESAKMSSQQVLLQASMSMLKQTSSMNQM 297
+NR NL N TN A RI DAD A+E + MS Q+L QA S+L Q + + Q
Sbjct: 442 QNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSNMSKAQILQQAGTSVLAQANQVPQN 501

Query: 298 VLSLLQ 303
VLSLL+
Sbjct: 502 VLSLLR 507


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4652  HTHFIS        61     6e-13    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 61.0 bits (148), Expect = 6e-13
Identities = 26/123 (21%), Positives = 52/123 (42%), Gaps = 4/123 (3%)

Query: 11 KPRIALLEDNVAHARTVRHWLEAAGYDAIVEYDGRRFIDRIGREKVDMLLLDWDVPGMTG 70
I + +D+ A + L AGYD + + I D+++ D +P
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 71 IDVLIDMRKRVDYLIPIVLLTQHDDERDILHGLSCGADDYLVKPISE---RMLIARVIAQ 127
D+L ++K +P+++++ + + GA DYL KP +I R +A+
Sbjct: 63 FDLLPRIKKA-RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 128 LRK 130
++
Sbjct: 122 PKR 124


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4654  FLGHOOKFLIE   48     9e-11    Flagellar hook-basal body complex protein FliE signa...
		>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE signature.

Length = 103

Score = 48.1 bits (114), Expect = 9e-11
Identities = 25/81 (30%), Positives = 35/81 (43%), Gaps = 1/81 (1%)

Query: 27 QAPSADIAGGFADLLKQAVRRTDAQQHHADDLVTAVETGASD-DLVGAMLASQQASLSFS 85
Q FA L A+ R Q A G L M Q+AS+S
Sbjct: 23 QESLPQPTISFAGQLHAALDRISDTQTAARTQAEKFTLGEPGVALNDVMTDMQKASVSMQ 82

Query: 86 TMIQVRNKVMSAFDDIIKMQV 106
IQVRNK+++A+ +++ MQV
Sbjct: 83 MGIQVRNKLVAAYQEVMSMQV 103


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4655  FLGMRINGFLIF  298    2e-96    Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 298 bits (764), Expect = 2e-96
Identities = 154/548 (28%), Positives = 255/548 (46%), Gaps = 44/548 (8%)

Query: 30 ALSKLAPIVILAISLAALTMMLMHRQDSRYKPLFGSQEAVVAADMMAALDAEGIPYRIHP 89
A ++ IV + ++A + M++ + Y+ LF + ++A L IPYR
Sbjct: 21 ANPRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQDGGAIVAQLTQMNIPYRFAN 80

Query: 90 DSGQVLVPEQKLGAARMMLAAKGVVGKLPEGLEQVDKSDPLGVSQFVQDVRFRRGLEGEL 149
SG + VP K+ R+ LA +G+ G E +D+ G+SQF + V ++R LEGEL
Sbjct: 81 GSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQE-KFGISQFSEQVNYQRALEGEL 139

Query: 150 TQSIMALEPVSSARVHLSIAKSASFILADGDKSSASVVLTLKPNRKLNKEQIAAIVALVA 209
++I L PV SARVHL++ K + F+ + SASV +TL+P R L++ QI+A+V LV+
Sbjct: 140 ARTIETLGPVKSARVHLAMPKPSLFV-REQKSPSASVTVTLEPGRALDEGQISAVVHLVS 198

Query: 210 GSVANLDPARVTVIDQSGNHLSAQIDLVLGNSTLDSELG--AQMREQVLRNIRELLTPVL 267
+VA L P VT++DQSG+ L+ G D++L + ++ R I +L+P++
Sbjct: 199 SAVAGLPPGNVTLVDQSGHLLTQSNT--SGRDLNDAQLKFANDVESRIQRRIEAILSPIV 256

Query: 268 GDGNFRASVAVELDHDRVEETREQYGEAPKVTQEAIR------DEKDIGQAALGVPGSLS 321
G+GN A V +LD E+T E Y ++ +R E+ GVPG+LS
Sbjct: 257 GNGNVHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNISEQVGAGYPGGVPGALS 316

Query: 322 NRPAPPSTASMPEAPHSAKNAQ-----------------------TRQYAYDRNVVQIKR 358
N+PAPP+ A + P + +NAQ T Y DR + K
Sbjct: 317 NQPAPPNEAPIATPPTNQQNAQNTPQTSTSTNSNSAGPRSTQRNETSNYEVDRTIRHTKM 376

Query: 359 SPVRVKRLNVAVVLNNAAAPGG-GKAWAPAQLAQVDTILRDGLGIDADRDDALTVSSLDF 417
+ ++RL+VAVV+N G Q+ Q++ + R+ +G R D L V + F
Sbjct: 377 NVGDIERLSVAVVVNYKTLADGKPLPLTADQMKQIEDLTREAMGFSDKRGDTLNVVNSPF 436

Query: 418 RGT-PVTESQPWWKQPDNLVTIGTWAAWALGALLGFVFIFRPLLKVLRIWANGGRDPLSQ 476
P+W+Q + + W L ++ ++ + + L + Q
Sbjct: 437 SAVDNTGGELPFWQQQSFIDQLLAAGRWLLVLVVAWILWRKAVRPQLTRRVEEAKAAQEQ 496

Query: 477 GANAVSGGPEAPALAAAADADTQPLLLADANLPPIGSGADVLIAHLKHLAAQDPERVAEV 536
+ + Q A+ L GA+V+ ++ ++ DP VA V
Sbjct: 497 AQVRQETEEAVEVRLSKDEQLQQ--RRANQRL-----GAEVMSQRIREMSDNDPRVVALV 549

Query: 537 IKPWIRDD 544
I+ W+ +D
Sbjct: 550 IRQWMSND 557


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4656  FLGMOTORFLIG  173    1e-53    Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 173 bits (441), Expect = 1e-53
Identities = 85/331 (25%), Positives = 167/331 (50%), Gaps = 6/331 (1%)

Query: 23 SLAAVERAAIILLSIGEEAAAGVLRCLSREELLDVTLAMSRMQGVKVDAVQNTIERFFTN 82
+L ++AAI+L+SIG E ++ V + LS+EE+ +T +++++ + + N + F
Sbjct: 14 ALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITSELKDNVLLEFKEL 73

Query: 83 FREQSGVRGASRSFLQRSLEMALGGVVANSVLNKIYGDAIGPKMARLQWAQPQWLADRLR 142
Q ++ + + LE +LG A ++N + ++ A P + + ++
Sbjct: 74 MMAQEFIQKGGIDYARELLEKSLGTQKAVDIINNLGSALQSRPFEFVRRADPANILNFIQ 133

Query: 143 DEHVRMQAMFLVFLPPEQASRVIQALPEARREQVLLDIARLTEIDHDLLRDLEDVVDSCV 202
EH + A+ L +L P++AS ++ +LP + V IA + +++R++E V++ +
Sbjct: 134 QEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEVVREVERVLEKKL 193

Query: 203 ANLGT-QSTAVEGVRQAADIINRMPGDRTQ---MVEILRARDPELVAAVEDLIYDFAVIA 258
A+L + T+ GV +IIN DR ++E L DPEL ++ ++ F I
Sbjct: 194 ASLSSEDYTSAGGVDNVVEIINMA--DRKTEKFIIESLEEEDPELAEEIKKKMFVFEDIV 251

Query: 259 NQDDEVISVILEHVDTALWGVALKGADPAVRDALLRSMPRRAVQAFEEMLRRTEPALPSK 318
DD I +L +D ALK D V++ + ++M +RA +E + P
Sbjct: 252 LLDDRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKEDMEFLGPTRRKD 311

Query: 319 VESARREIMDIIRGLADDGDIELRLVAEEEL 349
VE ++++I+ +IR L + G+I + EE++
Sbjct: 312 VEESQQKIVSLIRKLEEQGEIVISRGGEEDV 342


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4657  FLGFLIH       55     2e-11    Flagellar assembly protein FliH signature.
		>FLGFLIH#Flagellar assembly protein FliH signature.

Length = 228

Score = 55.2 bits (132), Expect = 2e-11
Identities = 48/191 (25%), Positives = 80/191 (41%), Gaps = 5/191 (2%)

Query: 15 IRAASEQLGEFAAPD---EVGLLSEQLHAPPTGELLDEARQAGYADGFAAGERVGAEGAR 71
I E + E A P ++ L Q H + E RQ G+ G+ G G E
Sbjct: 25 IVEPEETIIEEAEPSLEQQLAQLQMQAHEQGYQAGIAEGRQQGHKQGYQEGLAQGLEQGL 84

Query: 72 RDVRSGFDALVAPVDALVRGFQRVQQAYRAKVRSEVAKLVGDVARQVVRAELETRPERIL 131
+ +S + A + LV FQ A + + S + ++ + ARQV+ ++
Sbjct: 85 AEAKSQQAPIHARMQQLVSEFQTTLDALDSVIASRLMQMALEAARQVIGQTPTVDNSALI 144

Query: 132 AFVDEAVGTLTKPPESVSVRLNPSDYARLAQ--AAPDRVHGWQLVPDDRLEPGECRVRAD 189
+ + + +R++P D R+ A +HGW+L D L PG C+V AD
Sbjct: 145 KQIQQLLQQEPLFSGKPQLRVHPDDLQRVDDMLGATLSLHGWRLRGDPTLHPGGCKVSAD 204

Query: 190 DIEMDAGCGQR 200
+ ++DA R
Sbjct: 205 EGDLDASVATR 215


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4663  FLGHOOKFLIK   29     0.050    Flagellar hook-length control protein signature.
		>FLGHOOKFLIK#Flagellar hook-length control protein signature.

Length = 375

Score = 28.6 bits (63), Expect = 0.050
Identities = 22/84 (26%), Positives = 37/84 (44%), Gaps = 1/84 (1%)

Query: 237 ASVPETGAPTAPQQRLIDALGERLSVQMAQGTRQAVIRLEPGSNGSIHIELRQNANGMAV 296
+ P AP + +L + +S+ QG + A +RL P G + I L+ + N +
Sbjct: 226 VAAPVLSAPLGSHE-WQQSLSQHISLFTRQGQQSAELRLHPQDLGEVQISLKVDDNQAQI 284

Query: 297 HLSATHPEVVFQLQAIGESLRQDL 320
+ + H V L+A LR L
Sbjct: 285 QMVSPHQHVRAALEAALPVLRTQL 308


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4666  OMPADOMAIN    34     9e-04    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 33.8 bits (77), Expect = 9e-04
Identities = 21/80 (26%), Positives = 39/80 (48%), Gaps = 10/80 (12%)

Query: 216 VMGHTDSVPYRNSDSGSRS-NWDLSVDRAMSARSWVLQGGVNSEQVMQVIGMADRAPLVN 274
V+G+TD + GS + N LS RA S +++ G+ ++++ GM + P+
Sbjct: 257 VLGYTDRI-------GSDAYNQGLSERRAQSVVDYLISKGIPADKI-SARGMGESNPVTG 308

Query: 275 NPRAAINRRIEFLV-LTPER 293
N + +R + L P+R
Sbjct: 309 NTCDNVKQRAALIDCLAPDR 328


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4669  FLGFLGJ       134    4e-40    Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 134 bits (339), Expect = 4e-40
Identities = 75/200 (37%), Positives = 98/200 (49%), Gaps = 4/200 (2%)

Query: 55 ATEPHLSSQAATWTNMMRARAEALAEPQQGVLGNGAPATGPNFADS---DQQAFLAEIMP 111
E L ++ M + Q + A N+ DS D +AFLA++
Sbjct: 99 TPEQPLPEESTPAAPMKFPLETVVRYQNQALSQLVQKAVPRNYDDSLPGDSKAFLAQLSL 158

Query: 112 HARRAGAMIGAAPELIAAHAALESGWGSKPLKNVRGETTHNLFGIKSAGGWAGESAAAVT 171
A+ A G LI A AALESGWG + ++ GE ++NLFG+K++G W G T
Sbjct: 159 PAQLASQQSGVPHHLILAQAALESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITT 218

Query: 172 TEYVNGSAVKMVDHFRAYRSYSGAFHDYAKLLRDSRRYAGVRNVGDDASAFASALKRGGY 231
TEY NG A K+ FR Y SY A DY LL + RYA V A A AL+ GY
Sbjct: 219 TEYENGEAKKVKAKFRVYSSYLEALSDYVGLLTRNPRYAAV-TTAASAEQGAQALQDAGY 277

Query: 232 ATDPAYATKLVEMVGLVKRM 251
ATDP YA KL M+ +K +
Sbjct: 278 ATDPHYARKLTNMIQQMKSI 297


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4672  FLGHOOKAP1    30     0.002    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 30.3 bits (68), Expect = 0.002
Identities = 8/39 (20%), Positives = 17/39 (43%)

Query: 91 SNVSAVEEMADMMAASRAFSTNVEVLTRIKGMQQDLLRM 129
S V+ EE ++ + + N +VL + L+ +
Sbjct: 507 SGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4674  FLGHOOKAP1    36     3e-04    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 35.7 bits (82), Expect = 3e-04
Identities = 15/43 (34%), Positives = 23/43 (53%)

Query: 345 RAVEQSNVDMTAELVSLMGAQQNYQANSKVLSTENEMMRALMQ 387
+ S V++ E +L QQ Y AN++VL T N + AL+
Sbjct: 502 QQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALIN 544



Score = 29.9 bits (67), Expect = 0.021
Identities = 12/35 (34%), Positives = 18/35 (51%)

Query: 2 SFNIALAGINAINGQLNQISNNIANSGTLGFKSGR 36
N A++G+NA LN SNNI++ G+
Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAGYTRQT 37


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4675  FLGHOOKAP1    33     0.001    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 33.0 bits (75), Expect = 0.001
Identities = 10/46 (21%), Positives = 20/46 (43%)

Query: 5 LYTAMTGAEHSLRALNVRANNLSNAQTSGFRADLASVTSQAARGYG 50
+ AM+G + ALN +NN+S+ +G+ + +
Sbjct: 4 INNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGA 49


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4676  FLGHOOKAP1    38     3e-05    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 38.0 bits (88), Expect = 3e-05
Identities = 12/49 (24%), Positives = 19/49 (38%)

Query: 208 NGIGTIKQGALEGSNVLAVEEMVEMIAAQRTYEMNTKVLSAADNMMQYL 256
N + + S V EE + Q+ Y N +VL A+ + L
Sbjct: 494 NVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDAL 542



Score = 35.7 bits (82), Expect = 2e-04
Identities = 16/74 (21%), Positives = 33/74 (44%), Gaps = 13/74 (17%)

Query: 7 ISKTGIQAQDAKLQAIANNLANVNTVGFKRDRAVFEDMFYRAERQPGAQVSDNATGPGVQ 66
+ +G+ A A L +NN+++ N G+ R + +++ G G
Sbjct: 6 NAMSGLNAAQAALNTASNNISSYNVAGYTRQTTI-------------MAQANSTLGAGGW 52

Query: 67 LGNGTRIAGTQKVF 80
+GNG ++G Q+ +
Sbjct: 53 VGNGVYVSGVQREY 66


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4677  FLGLRINGFLGH  140    1e-43    Flagellar L-ring protein signature.
		>FLGLRINGFLGH#Flagellar L-ring protein signature.

Length = 232

Score = 140 bits (354), Expect = 1e-43
Identities = 61/162 (37%), Positives = 83/162 (51%), Gaps = 6/162 (3%)

Query: 59 LTSDVRAFRAGDVLTVDLEESTQASKKSGTQVGKDS----SLSAKKPSLFGKALPVEAEL 114
L D R GD LT+ L+E+ ASK S +D L G A++
Sbjct: 65 LFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYLQGLFGNARADV 124

Query: 115 GTKSG--FNGAGSSSQQNTLRGSVTVVVQRVMPNGLLQVRGEKRLVLNQGEENVRLAGYV 172
G FNG G ++ NT G++TV V +V+ NG L V GEK++ +NQG E +R +G V
Sbjct: 125 EASGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRFSGVV 184

Query: 173 RAADIDSNNRVSSQRVANARITYAGRGSLADASQPGMLTRFF 214
I +N V S +VA+ARI Y G G + +A G L RFF
Sbjct: 185 NPRTISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFF 226


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4678  FLGPRINGFLGI  309    e-105    Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 309 bits (792), Expect = e-105
Identities = 136/372 (36%), Positives = 199/372 (53%), Gaps = 10/372 (2%)

Query: 7 TIRAAALAIVIGIALSSTIEAHAQTVGNLVDVEGVRENALVGYGIVVGLAGSGDGT-QAK 65
I AA + + + +A + ++ ++ R+N L+GYG+VVGL G+GD +
Sbjct: 6 IIAAALVFSALPFLSTPPAQADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDSLRSSP 65

Query: 66 YTTQSLTNMLKQFGTRLPENINLRSRNAAAVIVSATFPPGYRRGQKVDVTVSSLGDAKSL 125
+T QS+ ML+ G ++N AAV+V+A PP G +VDVTVSSLGDA SL
Sbjct: 66 FTEQSMRAMLQNLGITTQGG-QSNAKNIAAVMVTANLPPFASPGSRVDVTVSSLGDATSL 124

Query: 126 RGGTLLMTPLRAADGDVYALAQGNLVIPGLNVQGRSGTSVTINTPTTGRIPKGATIEREI 185
RGG L+MT L ADG +YA+AQG L++ G + QG ++T T+ R+P GA IERE+
Sbjct: 125 RGGNLIMTSLSGADGQIYAVAQGALIVNGFSAQG-DAATLTQGVTTSARVPNGAIIEREL 183

Query: 186 ATDFADTPTVRLNLKRPDFQTASSIADVINN----ALGSEVASTVDATSVDVVAPQVPSQ 241
+ F D+ + L L+ PDF TA +ADV+N G +A D+ + V P+ +
Sbjct: 184 PSKFKDSVNLVLQLRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQEIAVQKPR-VAD 242

Query: 242 RVAFVARLNALKVSKGAEVPRVVFNSRTGTVVISQGVTVKPAVVSHGSLKVTIAEGTMVS 301
+A + L V +VV N RTGT+VI V + VS+G+L V + E V
Sbjct: 243 LTRLMAEIENLTVETDT-PAKVVINERTGTIVIGADVRISRVAVSYGTLTVQVTESPQVI 301

Query: 302 QPNAFANGSTVTAPVSEIGVTQAGGNAFQWTSGASLQAIVDTITRTGATPDDLMAILQAL 361
QP F+ G T P ++I Q G G L+ +V + G D ++AILQ +
Sbjct: 302 QPAPFSRGQTAVQPQTDIMAMQEGSKVAI-VEGPDLRTLVAGLNSIGLKADGIIAILQGI 360

Query: 362 SEAGALTGDLVV 373
AGAL +LV+
Sbjct: 361 KSAGALQAELVL 372


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4679  FLGFLGJ       36     8e-06    Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 36.2 bits (83), Expect = 8e-06
Identities = 23/97 (23%), Positives = 43/97 (44%), Gaps = 4/97 (4%)

Query: 14 ASIETAAVKAPDEAYRARVEDAAVKFEGLFIAQMLSEMKKATDQFKADNGFADRSSEAMI 73
S+ KA ++ A + A + EG+F+ ML M+ A + D F+ +
Sbjct: 16 QSLNELKAKAGEDP-AANIRPVARQVEGMFVQMMLKSMRDALPK---DGLFSSEHTRLYT 71

Query: 74 DYANRAVADAIAKQRGFGIADTLVAQMLPPDATPSKD 110
++ +A + +G G+A+ +V QM P P +
Sbjct: 72 SMYDQQIAQQMTAGKGLGLAEMMVKQMTPEQPLPEES 108


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4680  FLGHOOKAP1    113    3e-29    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 113 bits (283), Expect = 3e-29
Identities = 77/312 (24%), Positives = 129/312 (41%), Gaps = 17/312 (5%)

Query: 1 MDRSARNTANQQTVGYTRQGVLRTARAS---------GGVDASSVIRFGDH-ANTQQKWA 50
++ ++ N ++ GYTRQ + S GV S V R D Q + A
Sbjct: 18 LNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVSGVQREYDAFITNQLRAA 77

Query: 51 SHGSVGEHRAVESYFRQLEEVMGLKDGSIKVSMGKFFGALDAASADVANSALRQQVLLAA 110
S G A +++ ++ S+ M FF +L ++ + A RQ ++ +
Sbjct: 78 QTQSSG-LTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLVSNAEDPAARQALIGKS 136

Query: 111 GGMAKSFNSVQQMMRGQLDTLRQQSAATVEQINGLSRTAAELNRLVAEAEANGGA--PSE 168
G+ F + Q +R Q + A+V+QIN ++ A LN ++ G P+
Sbjct: 137 EGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLNDQISRLTGVGAGASPNN 196

Query: 169 LIDQRDQAIDQLSALVDIRTVRQPDGTVDVSLAGGTPLIAGHQVAKMRVETLSGGTFELK 228
L+DQRDQ + +L+ +V + Q GT ++++A G L+ G ++ S
Sbjct: 197 LLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTARQLAAVPSSADPSRTT 256

Query: 229 LEL----AGTQYPVDGAKIGGELGGLSSFAKDTLLPQMEAIRSLAAELAGSFNEQVTAGF 284
+ AG + G LGG+ +F L + LA A +FN Q AGF
Sbjct: 257 VAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQLALAFAEAFNTQHKAGF 316

Query: 285 GMGGSPGKALFT 296
G G+ F
Sbjct: 317 DANGDAGEDFFA 328



Score = 63.4 bits (154), Expect = 7e-13
Identities = 36/110 (32%), Positives = 54/110 (49%), Gaps = 2/110 (1%)

Query: 317 SGDPKAPGNSDNLLKLIELRSRRVDLPGFGEASLGDAYVLLVGKLGAQSEQNQSSLAIAV 376
S + ++ N L++L+S G S DAY LV +G ++ ++S A
Sbjct: 436 SEEDAGDSDNRNGQALLDLQSN--SKTVGGAKSFNDAYASLVSDIGNKTATLKTSSATQG 493

Query: 377 NVRQRAEEAWQSLSGVSMDEEAVNFSEALQVYSANMKVISVAKELFDATI 426
NV + QS+SGV++DEE N Q Y AN +V+ A +FDA I
Sbjct: 494 NVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALI 543


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4681  FLAGELLIN     35     2e-04    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 35.4 bits (81), Expect = 2e-04
Identities = 30/138 (21%), Positives = 58/138 (42%), Gaps = 6/138 (4%)

Query: 13 LATLRASNSKAADLTSKISTGQRVQRASDDPIAAARLLLIERDTSV---LQRYQKNIDTL 69
L S S + ++S+G R+ A DD AA + R TS L + +N +
Sbjct: 14 QNNLNKSQSSLSSAIERLSSGLRINSAKDD---AAGQAIANRFTSNIKGLTQASRNANDG 70

Query: 70 SVRLQKNEVHLDGMLDTVMAVHDSLLSAADGSRSAADLNALAAPLRMRLNNLKQAANAKD 129
Q E L+ + + + V + + A +G+ S +DL ++ ++ RL + + +N
Sbjct: 71 ISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEIDRVSNQTQ 130

Query: 130 GDGNFLFSGSQTNTAPIA 147
+G + S +
Sbjct: 131 FNGVKVLSQDNQMKIQVG 148


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4688  PF00577       76     4e-16    Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 76.0 bits (187), Expect = 4e-16
Identities = 101/683 (14%), Positives = 206/683 (30%), Gaps = 100/683 (14%)

Query: 152 LPASGSTGLIAYNY-LNLTGGQGREIGGRYSFDAIG----SVGNWSITTGLQATQNRGM- 205
L G + NY + Q R G + ++G W + + N
Sbjct: 177 LWDPGINAGLL-NYNFSGNSVQNRIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDS 235

Query: 206 RRASEIEYSVPRLYAQREFEGKFAR--AG--FFTPDLATGIRPPRMPGGGAATTLGVMFG 261
S+ ++ + +R+ +R G + D+ GI G
Sbjct: 236 SSGSKNKWQHINTWLERDIIPLRSRLTLGDGYTQGDIFDGIN-----------FRGAQLA 284

Query: 262 TSEAREVEGARPSLYPVYVTANRQGMVEIYRNGALIHSQAVQPGLQVVDTRPLPSGIYEV 321
+ + + R ++ A V I +NG I++ V PG ++ ++
Sbjct: 285 SDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDIYNSTVPPGPFTINDIYAAGNSGDL 344

Query: 322 EVRLVE-DGQVTSTQHELVYKPAAWSDPTQR---WKYAFFAGQERNLIGEDRSHAFTAGG 377
+V + E DG P + QR +Y+ AG+ R+ + F
Sbjct: 345 QVTIKEADGSTQIFT-----VPYSSVPLLQREGHTRYSITAGEYRSGNAQQEKPRFFQST 399

Query: 378 AINYLAHPRVVLGASAQHVADANVFGASLDWQFTDRARTYWNVYHSSKHGVGGDVQILMP 437
++ L + Q F + ++ ++
Sbjct: 400 LLHGLP-AGWTIYGGTQLADRYRAFNFGIGKNMGALGALSVDMTQANSTLPDDSQHDGQS 458

Query: 438 YR----------DGSVSL---SHSQRWQRSDRRHRNGRSSVRSGKVQDSAITWSHRFTPQ 484
R ++ L +S + R + + + QD I +FT
Sbjct: 459 VRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGYNIETQDGVIQVKPKFTDY 518

Query: 485 TSIMSR-------------ASYASGATTGLGVD----------LSVSHRHSLFGNDLTWR 521
++ ++ +G + D+ W
Sbjct: 519 YNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQTYWGTSNVDEQFQAGLNTAF--EDINWT 576

Query: 522 VSGFDRPTGTLARTRNRGVDLGLSIALGKEKRSYNVNLGTRSGSDGQRDHYASLGVRQEL 581
+S + + R++ + L ++I RS + + + + H + +
Sbjct: 577 LS-YSLTKNAWQKGRDQMLALNVNIPFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNL- 634

Query: 582 DAGFFKSVGANGTVDRYGLSMGATTQ-FEHSVARGDAFLQRSSKEGGIAGGMNLSS---- 636
AG + ++ + + Y + G +S + G A L G G + S
Sbjct: 635 -AGVYGTLLEDNNLS-YSVQTGYAGGGDGNSGSTGYATLNYRGGYGNANIGYSHSDDIKQ 692

Query: 637 -TVGTNGKSAAVSGKGNAF---NGESAVIVDLASDFEDVKVHAQDTSGGSIELRPGRNLV 692
G +G G ++ V+V A +D KV ++ +G + R G ++
Sbjct: 693 LYYGVSG-GVLAHANGVTLGQPLNDTVVLVK-APGAKDAKV--ENQTGVRTDWR-GYAVL 747

Query: 693 P-VSAYRAGRLQYFFDRSQAPA-ATLQPAATHYHLNKGGVGYEKVHVMKTMTVIGRLVDH 750
P + YR R+ D + L A + +G + + + ++ L H
Sbjct: 748 PYATEYRENRVA--LDTNTLADNVDLDNAVANVVPTRGAIVRAEFKARVGIKLLMTL-TH 804

Query: 751 SGKPVRAAHLSSPAGRSVTEADG 773
+ KP+ P G VT
Sbjct: 805 NNKPL-------PFGAMVTSESS 820


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4692  HTHFIS        52     4e-10    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 52.1 bits (125), Expect = 4e-10
Identities = 22/122 (18%), Positives = 48/122 (39%), Gaps = 7/122 (5%)

Query: 13 RINVVVADDHPVVSTGVAAILTAEMDINVVGVASTISELLLLLQQQPCDVLICDYSFSGD 72
++VADD + T + L+ +V ++ + L + D+++ D +
Sbjct: 3 GATILVADDDAAIRTVLNQALSRA-GYDVRITSNA-ATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 73 QQPDGVALFKRLRRNHPHVAIVVLTAHQDIALLVSRVMNTGVAGFLRKSSQDFARLAAIV 132
+ L R+++ P + ++V++A + G +L K D L I+
Sbjct: 61 ---NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIK-ASEKGAYDYLPKPF-DLTELIGII 115

Query: 133 RR 134
R
Sbjct: 116 GR 117


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4693  HTHFIS        61     1e-11    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 61.4 bits (149), Expect = 1e-11
Identities = 30/121 (24%), Positives = 53/121 (43%), Gaps = 4/121 (3%)

Query: 898 SDASVLVVEDDRVSAQLMCDQLRMLGIGHVEVVCSAEDGIRRCQSRIYDLVVTDSNLPGK 957
+ A++LV +DD ++ L G V + +A R + DLVVTD +P +
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGY-DVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 958 GGAELLSDLRAAGISWPVVLCTADATLSR-VANVPFDAL--ITKPSTLSDLSHVLQNVLG 1014
+LL ++ A PV++ +A T + A + KP L++L ++ L
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 1015 P 1015

Sbjct: 121 E 121


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4699  HTHFIS        36     2e-04    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 35.6 bits (82), Expect = 2e-04
Identities = 17/110 (15%), Positives = 46/110 (41%), Gaps = 5/110 (4%)

Query: 32 KINIVVADSYPMIVQGLRHVFSSVENMRIVAEAHTLSGMASLLSSCECDVLICDYAFGDD 91
I+VAD I L S + ++ + +++ + D+++ D
Sbjct: 3 GATILVADDDAAIRTVLNQALSR-AGYDVRITSNAATLWR-WIAAGDGDLVVTDVV---M 57

Query: 92 PGPDGMRMLETIRRNHPNVKIILLAELRDGLSVQRVLKKGVSAFVVKSSD 141
P + +L I++ P++ +++++ ++ + +KG ++ K D
Sbjct: 58 PDENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFD 107


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4701  HTHFIS        84     7e-19    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 84.5 bits (209), Expect = 7e-19
Identities = 39/119 (32%), Positives = 56/119 (47%), Gaps = 3/119 (2%)

Query: 995 SGTRILVVDDHPINRLVIEAQLARLGYTAIAVSNGTDALHALDDSDIALVLSDCAMPDMD 1054
+G ILV DD R V+ L+R GY SN + D LV++D MPD +
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 1055 GYDLARRIRSREPRSRHIPILALTANALPDEAIRCAEAGMDGLIVKPTTLTVLREELAR 1113
+DL RI+ P +P+L ++A AI+ +E G + KP LT L + R
Sbjct: 62 AFDLLPRIKKARP---DLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4702  HTHFIS        53     2e-10    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 52.9 bits (127), Expect = 2e-10
Identities = 24/119 (20%), Positives = 50/119 (42%), Gaps = 4/119 (3%)

Query: 4 RVVLADDHPIMLLGCRLLIEQNGMEVVGEARDSSELMSILAHVACDVVITDFSMPNTGRA 63
+++ADD + + + G +V +++ L +A D+V+TD MP
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMP---DE 60

Query: 64 DGLAMLSTLRREHGALPVIVLTNMANAGLLRAMLNEGVLGIVEKGAEKSELFAAVRTAL 122
+ +L +++ LPV+V++ +G + K + +EL + AL
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4704  OMADHESIN     61     1e-11    Yersinia outer membrane adhesin signature.
		>OMADHESIN#Yersinia outer membrane adhesin signature.

Length = 455

Score = 61.1 bits (147), Expect = 1e-11
Identities = 50/163 (30%), Positives = 93/163 (57%), Gaps = 9/163 (5%)

Query: 556 GMRTIANSDNSVAIGNMAQTGSEQPYSVAIGSHVTTNGASALAIGSQARANGENAIAVGN 615
G+ A +S+AIG A+ + + +VA+G+ G +++AIG ++A G++A+ G
Sbjct: 62 GLNASAKGIHSIAIGATAE--AAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGA 119

Query: 616 NNVHAIGESSIAIGNGAEAVVGATNGIALGTGASVARNVTDAMALGAKAFVEKRANGAVA 675
+ + +AIG A + G+A+G + + +++A+G + V ++A
Sbjct: 120 ASTAQ--KDGVAIGARAST---SDTGVAVGFNSKA--DAKNSVAIGHSSHVAANHGYSIA 172

Query: 676 LGAGSQASRANTISVGNAGSERQIVNVAAGTQGTDAVNVAQLQ 718
+G S+ R N++S+G+ RQ+ ++AAGT+ TDAVNVAQL+
Sbjct: 173 IGDRSKTDRENSVSIGHESLNRQLTHLAAGTKDTDAVNVAQLK 215



Score = 47.6 bits (112), Expect = 2e-07
Identities = 50/148 (33%), Positives = 73/148 (49%), Gaps = 6/148 (4%)

Query: 347 AAGSADTDAVNVGQLTAATQPIRDELADVSLAMKSIQIKPQGVDAIAAGTNA----VAVG 402
AA + ++ G + A P+ L D ++ + + AI A + VAVG
Sbjct: 85 AAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAASTAQKDGVAIGARASTSDTGVAVG 144

Query: 403 AGANAMANGSIALGAGSRVTGLN--SVAIGINSVALETNQVSVGDVGRERRISNLAAGTK 460
+ A A S+A+G S V + S+AIG S N VS+G R++++LAAGTK
Sbjct: 145 FNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDRENSVSIGHESLNRQLTHLAAGTK 204

Query: 461 GTDAVNLNQLSDAIGKVSSRTDKLSFDL 488
TDAVN+ QL I K T+K S +L
Sbjct: 205 DTDAVNVAQLKKEIEKTQENTNKRSAEL 232



Score = 45.3 bits (106), Expect = 1e-06
Identities = 35/97 (36%), Positives = 53/97 (54%), Gaps = 4/97 (4%)

Query: 386 PQGVDAIAAGTNAVAVGAGANAMANGSIALGAGSRVTGLNSVAIGINSVALETNQVSVGD 445
G++A A G +++A+GA A A ++A+GAGS TG+NSVAIG S AL + V+ G
Sbjct: 60 AGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGA 119

Query: 446 VGRERR----ISNLAAGTKGTDAVNLNQLSDAIGKVS 478
++ I A+ + AV N +DA V+
Sbjct: 120 ASTAQKDGVAIGARASTSDTGVAVGFNSKADAKNSVA 156



Score = 44.9 bits (105), Expect = 1e-06
Identities = 45/128 (35%), Positives = 67/128 (52%), Gaps = 9/128 (7%)

Query: 865 IGTGSGSV----INSDAGDTTAIGANSTQQASSGTAIGAGADARAINSTAIGQAASAHGE 920
I TG SV ++ GD+ ++ G AIGA A + A+G + A +
Sbjct: 94 IATGVNSVAIGPLSKALGDSAVTYGAASTAQKDGVAIGARAST-SDTGVAVGFNSKADAK 152

Query: 921 NSTAVGQGATAWGNN--SIAIGAGSVADADNAVSFGNSATGMTRTLTNVSAGVAPTDAVN 978
NS A+G + N+ SIAIG S D +N+VS G+ + + R LT+++AG TDAVN
Sbjct: 153 NSVAIGHSSHVAANHGYSIAIGDRSKTDRENSVSIGHES--LNRQLTHLAAGTKDTDAVN 210

Query: 979 VQQLDESV 986
V QL + +
Sbjct: 211 VAQLKKEI 218



Score = 40.7 bits (94), Expect = 3e-05
Identities = 28/62 (45%), Positives = 38/62 (61%)

Query: 898 GAGADARAINSTAIGQAASAHGENSTAVGQGATAWGNNSIAIGAGSVADADNAVSFGNSA 957
G A A+ I+S AIG A A + AVG G+ A G NS+AIG S A D+AV++G ++
Sbjct: 62 GLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAAS 121

Query: 958 TG 959
T
Sbjct: 122 TA 123



Score = 36.8 bits (84), Expect = 5e-04
Identities = 84/389 (21%), Positives = 142/389 (36%), Gaps = 39/389 (10%)

Query: 655 TDAMALGAKAFVEKRANGAVALGAGSQASRANTISVG---NAGSERQIVNVAAGTQGTDA 711
++A+GA A K A AVA+GAGS A+ N++++G A + + AA T D
Sbjct: 70 IHSIAIGATAEAAKGA--AVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAASTAQKDG 127

Query: 712 VNVAQLQGAAASLASVIGGDTTVDGSGHVAIHSIEVSGHKYATVSAAVQAAAAYGATDSL 771
V + + + + +G ++ D VAI GH + AA +G + ++
Sbjct: 128 VAIGA-RASTSDTGVAVGFNSKADAKNSVAI------GH-------SSHVAANHGYSIAI 173

Query: 772 AVRYDLDSHGNPNYGSVTLGGPSAAPVMLTNVADGKSRYDAVNYGQLSSLQSDFENRMGA 831
R D + + G +L LT++A G DAVN QL
Sbjct: 174 GDRSKTDRENSVSIGHESLNR------QLTHLAAGTKDTDAVNVAQLKK----------- 216

Query: 832 MDDRVSKIETDTGDSRDDSRVMTMANLRRDNNDIGTGSGSVINSDAGDTTAIGANSTQQA 891
+ K + +T + A ++ + + + +S + +T
Sbjct: 217 ---EIEKTQENTNKRSAELLANANAYADNKSSSVLGIANNYTDSKSAETLENARKEAFAQ 273

Query: 892 SSGTAIGAGADARAINSTAIGQAASAHGENSTAVGQGATAWGNNSIAIGAGSVADADNAV 951
S A A + ++ T + A + + A N A S ++
Sbjct: 274 SKDVLNMAKAHSNSVARTTLETAEEHANSVARTTLETAEEHANKKSAEALASANVYADSK 333

Query: 952 SFGNSATGMTRTLTNVSAGVAPTDAVNVQQLDESVGGLRSQIEHDRADANGGTASAVAIA 1011
S T + T VS + Q D L ++++ + G AS+ A+
Sbjct: 334 SSHTLKTANSYTDVTVSNSTKKAIRESNQYTDHKFRQLDNRLDKLDTRVDKGLASSAALN 393

Query: 1012 SLPQAPAPGKSVVAVGGGTYAGQSALAVG 1040
SL Q GK G G Y ALA+G
Sbjct: 394 SLFQPYGVGKVNFTAGVGGYRSSQALAIG 422



Score = 36.4 bits (83), Expect = 6e-04
Identities = 26/96 (27%), Positives = 46/96 (47%)

Query: 884 GANSTQQASSGTAIGAGADARAINSTAIGQAASAHGENSTAVGQGATAWGNNSIAIGAGS 943
G N++ + AIGA A+A + A+G + A G NS A+G + A G++++ GA S
Sbjct: 62 GLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAAS 121

Query: 944 VADADNAVSFGNSATGMTRTLTNVSAGVAPTDAVNV 979
A D ++T T ++ ++V +
Sbjct: 122 TAQKDGVAIGARASTSDTGVAVGFNSKADAKNSVAI 157


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4706  MPTASEINHBTR  29     0.024    Metalloprotease inhibitor signature.
		>MPTASEINHBTR#Metalloprotease inhibitor signature.

Length = 122

Score = 28.8 bits (64), Expect = 0.024
Identities = 11/33 (33%), Positives = 16/33 (48%)

Query: 438 DQYLVQRGLPIAPARWNPATPIIASLEADGTGI 470
D ++ L P W+P I + A+GTGI
Sbjct: 63 DVACAEQWLGDKPVSWSPTPDGIWLMNAEGTGI 95


22  Bcenmc03_4721  Bcenmc03_4729  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
Bcenmc03_4721  0       12       3.586529   hopene-associated glycosyltransferase HpnB
Bcenmc03_4722  -1      12       3.658964   hypothetical protein
Bcenmc03_4723  2       15       1.997364   hopanoid-associated sugar epimerase
Bcenmc03_4724  2       12       1.033125   acylphosphatase
Bcenmc03_4725  2       14       -0.164181  transporter DMT superfamily protein
Bcenmc03_4726  2       15       -0.785572  hypothetical protein
Bcenmc03_4727  4       16       -1.276579  hypothetical protein
Bcenmc03_4728  2       17       -1.003103  amino acid/peptide transporter
Bcenmc03_4729  2       16       -0.503095  XRE family transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4722  IGASERPTASE   49     4e-08    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 48.9 bits (116), Expect = 4e-08
Identities = 33/262 (12%), Positives = 78/262 (29%), Gaps = 39/262 (14%)

Query: 24 AQQAARSAEAFARAAAAEARAAAERHAAAEAEAQAAAQRHTAAAADAEAAQ----RRHAD 79
AQ + EA + A + + E Q + TA E A+ +
Sbjct: 1063 AQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEV 1122

Query: 80 ATAAAEAAAQRHTDAATQAEAASQRHAEATALTE---ALAQRHADAEAASQRHAAAIAEA 136
++ + ++ Q +A R + T + + AD E ++ ++ + +
Sbjct: 1123 PKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQP 1182

Query: 137 EAAAQRHHAAATEADAAAQRHAAATAEAEAAAKRHAEATAEAEAAAQRHMAAIAEAEALA 196
+ T + + E T A + + + +
Sbjct: 1183 VTEST-------------------TVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRH 1223

Query: 197 QRHTQALADTQAAAQRHAEATAEAEAITQRHAEAIAQAEAAAQRHAEATAEAEAITQRHT 256
+R +++ E + +A + + ++A A Q
Sbjct: 1224 RRSVRSVPHNV-----------EPATTSSNDRSTVALCDLTSTNTNAVLSDARAKAQ--F 1270

Query: 257 EAIAQAEAAAQHHAKAIADAEA 278
A+ +A +QH ++ + E
Sbjct: 1271 VALNVGKAVSQHISQLEMNNEG 1292


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4723  NUCEPIMERASE  61     2e-12    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 60.6 bits (147), Expect = 2e-12
Identities = 57/350 (16%), Positives = 119/350 (34%), Gaps = 54/350 (15%)

Query: 9 VLVTGASGFVGSAVARIAQQKGYAVRVL------VRPTSPRTNVADL---DAEIVTGDMR 59
LVTGA+GF+G V++ + G+ V + + + + L + D+
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDLA 62

Query: 60 DEASMRAALR--GVRYLLHVAAD--YRLWAPDPDEIERANLEGAVATMRAARAEGVERIV 115
D M + R +P +NL G + + R ++ ++
Sbjct: 63 DREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQHLL 122

Query: 116 Y--TSSVATLKVTSAGDPSDENRPLTAEQAIGVYKRSKVLAERAVERMIADEGLPAVIVN 173
Y +SSV L + P + + + +Y +K E GLPA +
Sbjct: 123 YASSSSVYGL---NRKMPFSTDDS--VDHPVSLYAATKKANELMAHTYSHLYGLPATGLR 177

Query: 174 PSTPIGPR---DVKPTPTGRIIVEAALGKIPAFVDTGLNLVHVDDVAHGHFLALERGRIG 230
T GP D+ + ++E + + + ++DD+A
Sbjct: 178 FFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEA----------- 226

Query: 231 ERYILGGENLPLQQMLADIAQMTGRKAPTIALPRWPLY--------PLAVGAEAVAKFTK 282
I+ +++ AD P ++ + +Y L +A+
Sbjct: 227 ---IIRLQDVIPH---ADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALG 280

Query: 283 KEPFVTVDGLRMSKNKMYFTSA---KAERELGYR-ARPYREGLRDALDWF 328
E + L + + TSA +G+ ++G+++ ++W+
Sbjct: 281 IE--AKKNMLPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWY 328


23  Bcenmc03_4772  Bcenmc03_4809  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
Bcenmc03_4772  -1      11       3.482125   N-acylglucosamine 2-epimerase
Bcenmc03_4773  -1      10       3.654978   MgtC/SapB transporter
Bcenmc03_4774  2       11       2.660185   hypothetical protein
Bcenmc03_4775  2       13       2.433910   hypothetical protein
Bcenmc03_4776  2       13       2.219401   GCN5-related N-acetyltransferase
Bcenmc03_4777  2       12       2.309658   heavy metal translocating P-type ATPase
Bcenmc03_4778  2       14       1.467167   hypothetical protein
Bcenmc03_4779  2       14       1.341997   hypothetical protein
Bcenmc03_4780  -1      14       1.837722   hypothetical protein
Bcenmc03_4781  -1      13       2.320096   type I phosphodiesterase/nucleotide
Bcenmc03_4782  -2      12       3.218329   LysR family transcriptional regulator
Bcenmc03_4783  -2      11       3.178255   binding-protein-dependent transport systems
Bcenmc03_4784  -3      11       3.145889   ABC transporter-like protein
Bcenmc03_4785  -2      11       1.575498   extracellular solute-binding protein
Bcenmc03_4786  -1      12       0.328488   binding-protein-dependent transport systems
Bcenmc03_4787  1       13       -0.216885  hypothetical protein
Bcenmc03_4788  2       15       0.301255   hypothetical protein
Bcenmc03_4789  0       14       0.839968   hypothetical protein
Bcenmc03_4790  0       14       0.895399   cytochrome d ubiquinol oxidase subunit II
Bcenmc03_4791  -1      12       2.061800   cytochrome bd ubiquinol oxidase subunit I
Bcenmc03_4792  1       10       3.637311   hypothetical protein
Bcenmc03_4793  0       10       3.087049   hypothetical protein
Bcenmc03_4794  0       11       1.674187   threonine aldolase
Bcenmc03_4795  2       12       3.174946   hypothetical protein
Bcenmc03_4796  3       12       3.199140   chromosome replication initiation inhibitor
Bcenmc03_4797  3       11       2.555427   beta-lactamase domain-containing protein
Bcenmc03_4798  1       11       2.898035   glutathione S-transferase domain-containing
Bcenmc03_4799  0       9        3.263021   PA-phosphatase-like protein
Bcenmc03_4800  -1      10       3.812834   major facilitator transporter
Bcenmc03_4801  -2      9        2.574441   LysR family transcriptional regulator
Bcenmc03_4802  -2      8        2.106940   DSBA oxidoreductase
Bcenmc03_4803  -2      9        2.481429   IclR family transcriptional regulator
Bcenmc03_4804  -3      9        2.100875   pyruvate carboxyltransferase
Bcenmc03_4805  -3      10       1.864941   L-carnitine dehydratase/bile acid-inducible
Bcenmc03_4806  -2      10       1.775733   major facilitator transporter
Bcenmc03_4807  -2      14       2.273553   amidohydrolase 2
Bcenmc03_4808  -1      15       3.345453   activator of Hsp90 ATPase 1 family protein
Bcenmc03_4809  -1      14       3.157619   activator of Hsp90 ATPase 1 family protein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4776  SACTRNSFRASE  33     2e-04    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 33.4 bits (76), Expect = 2e-04
Identities = 13/60 (21%), Positives = 20/60 (33%)

Query: 86 NRTLFIYDLDIVPSRRRQGWATRALDALDAEAHRYGVTEIGLSVFNHNAAARALYRSCGF 145
N I D+ + R++G T L A + L + N +A Y F
Sbjct: 87 NGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHF 146


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4793  cloacin       27     0.041    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 27.4 bits (60), Expect = 0.041
Identities = 25/82 (30%), Positives = 34/82 (41%), Gaps = 1/82 (1%)

Query: 88 GAAFNGGTGAAVGAGAGLLAGSVVGAGAAQGSAYDVQRR-YDYAYLQCMYATGNRVPVPG 146
G N G + G G G VG GA+ GS + + + ++ G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 147 GMSGGSGGGYGGGGYGTAPRAA 168
G +G SGGG G GG +A A
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAP 87


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4794  PRTACTNFAMLY  30     0.014    Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 30.0 bits (67), Expect = 0.014
Identities = 46/198 (23%), Positives = 66/198 (33%), Gaps = 18/198 (9%)

Query: 8 TVTRPGQAMLAAMTAAETGDDVWGDDPTVLRLQAVTAERAGKEAGLFFPSGTQSNLAALM 67
TVT ++A D W DD L + A+ + ++ L G Q
Sbjct: 112 TVTVKAGKLVADHATLANVGDTWDDDGIALYVAGEQAQASIADSTLQGAGGVQ------- 164

Query: 68 SHCERGDEYIVGQLAHTYKYEGGGAAVLGSIQPQPIENAPDGTLPLAKIAAAIKPLDNHF 127
ERG V + A GG + G++Q E+ P + L P
Sbjct: 165 --IERGANVTVQRSA----IVDGGLHI-GALQSLQPEDLPPSRVVLRDTNVTAVPASGAP 217

Query: 128 ARTRLL-ALENTIGGQVLPEGYVQEAVAFARSRGLSTHLDGARVCNAAVASGRPIAELCA 186
A +L A E T+ G + G A A +G HL A + +G +
Sbjct: 218 AAVSVLGASELTLDGGHITGG---RAAGVAAMQGAVVHLQRATIRRGDAPAGGAVPGGAV 274

Query: 187 PFDTVSICFSKGLGAPVG 204
P V F G PV
Sbjct: 275 PGGAVPGGFGPGGFGPVL 292


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4800  TCRTETB       46     2e-07    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 45.6 bits (108), Expect = 2e-07
Identities = 57/377 (15%), Positives = 122/377 (32%), Gaps = 54/377 (14%)

Query: 57 LAPDLGASARAIGFVPTLTQLGYALGILLLAPLGDRFDRRRVIVTKAAALVVALLLASIA 116
+A D + +V T L +++G + L D+ +R+++ ++ +
Sbjct: 40 IANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVG 99

Query: 117 PS-LGLLLAASF--AIGLAATMAQDVVPAAATLAHDAHRGRIVGTVMTGLLLGILLSRVV 173
S LL+ A F G AA A +V A +RG+ G + + + +G + +
Sbjct: 100 HSFFSLLIMARFIQGAGAAAFPALVMV-VVARYIPKENRGKAFGLIGSIVAMGEGVGPAI 158

Query: 174 AGFVAETAGWRAMFALAAASVAVIGAVAARGLPRFEPTTRLPYRA-------------LI 220
G +A W + + + +I L + E + + ++
Sbjct: 159 GGMIAHYIHWSYLLLIPMIT--IITVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVFFML 216

Query: 221 GSLGALWR-----------------------------AHSALRRAALAQGLLAVGFSAFW 251
+ + L G++ + F
Sbjct: 217 FTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFV 276

Query: 252 STLAVMLHGAPFHLGSAAAGAFGL--AGAAGALAAPVAGRLADHHGPERVTRIGIGIATL 309
S + M+ L +A G+ + + + + G L D GP V IG+ ++
Sbjct: 277 SMVPYMM-KDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSV 335

Query: 310 SFALMAAAPLMSPHAQLVLLAVATIGFDLGVQATLIAHQSIVYRIDPASRSRLNAVLFVG 369
SF + + +++ V +G + + + + ++L
Sbjct: 336 SFLTASFLLETTSWFMTIII-VFVLGGLSFTK--TVISTIVSSSLKQQEAGAGMSLLNFT 392

Query: 370 MFIGMAAGAAIGSLLLA 386
F+ G AI LL+
Sbjct: 393 SFLSEGTGIAIVGGLLS 409


24  Bcenmc03_4824  Bcenmc03_4846  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
Bcenmc03_4824  2       23       -3.599386  hypothetical protein
Bcenmc03_4825  1       21       -2.966404  isoprenylcysteine carboxyl methyltransferase
Bcenmc03_4826  -1      14       -1.196400  17 kDa surface antigen
Bcenmc03_4827  -2      13       -1.660002  hypothetical protein
Bcenmc03_4828  -2      15       -1.420860  hypothetical protein
Bcenmc03_4829  0       9        0.594366   LysR family transcriptional regulator
Bcenmc03_4830  2       11       3.193595   aminoglycoside/hydroxyurea antibiotic resistance
Bcenmc03_4831  3       12       4.352294   1A family penicillin-binding protein
Bcenmc03_4832  2       14       4.473172   hypothetical protein
Bcenmc03_4833  -1      16       3.649661   hypothetical protein
Bcenmc03_4834  -1      16       3.827797   hemin importer ATP-binding subunit
Bcenmc03_4835  0       17       3.393565   transport system permease
Bcenmc03_4836  0       18       2.062852   periplasmic binding protein
Bcenmc03_4837  3       19       0.052191   hemin-degrading family protein
Bcenmc03_4838  3       19       -0.827961  TonB-dependent
Bcenmc03_4839  0       20       -1.714120  hypothetical protein
Bcenmc03_4840  2       19       -1.982576  hypothetical protein
Bcenmc03_4841  1       16       -1.345523  hypothetical protein
Bcenmc03_4842  0       16       -0.163504  porin
Bcenmc03_4843  0       14       0.453747   YaeC family lipoprotein
Bcenmc03_4844  -2      13       1.201188   succinylglutamate desuccinylase/aspartoacylase
Bcenmc03_4845  -1      13       1.951078   cationic amino acid ABC transporter periplasmic
Bcenmc03_4846  2       12       2.766899   LysR family transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4836  FERRIBNDNGPP  59     3e-12    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 59.2 bits (143), Expect = 3e-12
Identities = 76/293 (25%), Positives = 119/293 (40%), Gaps = 26/293 (8%)

Query: 1 MSARPFDPRRRSLLGGAAACALAGALPGGVLAQVAAAAPKRVIVIGGALAETAFAL---- 56
MS P RRR L A AL+ L A AA P R++ + E AL
Sbjct: 1 MSGLPLISRRRLL----TAMALSPLLWQMNTAHAAAIDPNRIVALEWLPVELLLALGIVP 56

Query: 57 -GGAETPRYRLVGADTTCTYPDAAKRLPKVGYQRALSAEGLLSLRPDLVLASAEAGP-PT 114
G A+T YRL ++ P + VG + + E L ++P ++ SA GP P
Sbjct: 57 YGVADTINYRLWVSE-----PPLPDSVIDVGLRTEPNLELLTEMKPSFMVWSAGYGPSPE 111

Query: 115 AIAQVKGAGVTVTTFDERHDVESVRAKITGVAQALDVRDAGTALLQRFDRDWQAARDAVA 174
+A++ G D + + R +T +A L+++ A L +++ ++ +
Sbjct: 112 MLARI-APGRGFNFSDGKQPLAMARKSLTEMADLLNLQSAAETHLAQYEDFIRSMK---- 166

Query: 175 ARVPGGAQPPRVLFVLNHTGAQALVAGQRTAADAMIRYAGARNAMQGFDHYKPLTT---E 231
R P +L L LV G + ++ G NA QG ++ T +
Sbjct: 167 PRFVKRGARPLLLTTLIDP-RHMLVFGPNSLFQEILDEYGIPNAWQGETNFWGSTAVSID 225

Query: 232 ALAAAAPDVVLISDEGLAAVGGHAALLATPGFGATPAGRARRVVSLDALFLLG 284
LAA VL D + AL+ATP + A P RA R + A++ G
Sbjct: 226 RLAAYKDVDVLCFDHDNSKD--MDALMATPLWQAMPFVRAGRFQRVPAVWFYG 276


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4842  ECOLNEIPORIN  96     2e-24    E.coli/Neisseria porin superfamily signature.
		>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature.

Length = 331

Score = 96.0 bits (239), Expect = 2e-24
Identities = 86/390 (22%), Positives = 134/390 (34%), Gaps = 64/390 (16%)

Query: 1 MNKKLLTIAALAATAGTAHAQSSVTLYGVIDAGISYVNHSKTANGGTGKLFKYDDGVAQG 60
M K L+ AL A A + VTLYG I AG+ + V G
Sbjct: 1 MKKSLI---ALTLAALPVAAMADVTLYGTIKAGVETSRSVAHNGAQAASVETGTGIVDLG 57

Query: 61 SRWGLRGTEDLGGGLKAIFVLENGFNSGNGTIGQGGAIFGRQAYVGLSQSQYGTVTFGRQ 120
S+ G +G EDLG GLKAI+ +E G RQ+++GL +G + GR
Sbjct: 58 SKIGFKGQEDLGNGLKAIWQVE----QKASIAGTDSGWGNRQSFIGLK-GGFGKLRVGRL 112

Query: 121 YSFSTDILGSNYSTGGNTVAGNYAYHVNDIDQLTSSRINNAVKFQSANYSGFTFGALYGF 180
S D N + G + +L S V++ S ++G + Y
Sbjct: 113 NSVLKDTGDINPWDSKSDYLGVNKIAEPE-ARLIS------VRYDSPEFAGLSGSVQYAL 165

Query: 181 SNSTDFAGAPATTTGTTTTAGSSRAYSFGLNYANGPVSVGAAYTDIRYPSQSTPGFSTTI 240
+++ +S +Y G NY NG V R+
Sbjct: 166 NDNAGR--------------HNSESYHAGFNYKNGGFFVQYGGAYKRHHQVQ-------- 203

Query: 241 ANLSTGNVRDLRTYGVGGRYVWGPATAWLLWTRTQFSTVSGAGGTFYNAYEAGAKYAF-- 298
N+ + + + Y A + + Q + + + + E A A+
Sbjct: 204 ---ENVNIEKYQIHRLVSGYDNDALYA-SVAVQQQDAKLVEENYSHNSQTEVAATLAYRF 259

Query: 299 ---TPALSGGLGYTYTNATQNGNSWHWNQVNGIADYALSKRTDVYGLVVYQQASGKGVQA 355
TP +S G+ + N N+ ++QV A+Y SKRT + Q
Sbjct: 260 GNVTPRVSYAHGFKGSFDATNYNN-DYDQVVVGAEYDFSKRTSALVSAGWLQ-------- 310

Query: 356 QIGSSTSYFNTSGTGSKNQIAARIGIRHKF 385
G A +G+RHKF
Sbjct: 311 ---------EGKGESKFVSTAGGVGLRHKF 331


25  Bcenmc03_4881  Bcenmc03_4891  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias     Product
Bcenmc03_4881  0       29       -4.392944   amidohydrolase 2
Bcenmc03_4882  2       37       -6.172209   LysR family transcriptional regulator
Bcenmc03_4883  2       42       -7.021035   short chain dehydrogenase
Bcenmc03_4884  3       37       -6.173892   hypothetical protein
Bcenmc03_4885  3       37       -5.935245   LysR family transcriptional regulator
Bcenmc03_4886  2       35       -5.612222   LysR family transcriptional regulator
Bcenmc03_4887  3       34       -4.898281   major facilitator transporter
Bcenmc03_4888  2       32       -4.232806   2Fe-2S iron-sulfur cluster binding
Bcenmc03_4889  2       30       -4.341632   aldehyde oxidase and xanthine dehydrogenase
Bcenmc03_4890  2       32       -5.045844   gluconate 2-dehydrogenase (acceptor)
Bcenmc03_4891  1       21       -3.733300   LysR family transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4887  TCRTETB       47     8e-08    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 46.8 bits (111), Expect = 8e-08
Identities = 30/155 (19%), Positives = 61/155 (39%), Gaps = 3/155 (1%)

Query: 40 LTPIAHDLNATEGIAGQAISISGFFAVLASLFVAPLAGRFD-RRHVLMSMTVLMLISIVL 98
L IA+D N + + + L+ + +R +L + + S++
Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIG 96

Query: 99 IAVSPNFAVLMIARAFLGLAVGGFWSLSTATVIQLVPAQRVPKALGTIYMGNAIATAFAA 158
F++L++AR G F +L V + +P + KA G I A+
Sbjct: 97 FVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGP 156

Query: 159 PIGAYVGGHLGWRFVFAALVPLVLVNLVWQAVSLP 193
IG + ++ W ++ L+P++ + V + L
Sbjct: 157 AIGGMIAHYIHWSYLL--LIPMITIITVPFLMKLL 189


26  Bcenmc03_4995  Bcenmc03_5002  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias     Product
Bcenmc03_4995  1       11       -3.551136   TetR family transcriptional regulator
Bcenmc03_4996  3       20       -5.347640   FAD dependent oxidoreductase
Bcenmc03_4997  3       29       -8.242354   amino acid permease-associated protein
Bcenmc03_4998  3       34       -8.958269   aldehyde dehydrogenase
Bcenmc03_4999  3       40       -9.625880   hypothetical protein
Bcenmc03_5000  2       37       -8.110724   hypothetical protein
Bcenmc03_5001  0       27       -5.990322   hypothetical protein
Bcenmc03_5002  -1      20       -4.599273   hemolysin-type calcium-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4995  HTHTETR       68     3e-16    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 68.1 bits (166), Expect = 3e-16
Identities = 31/159 (19%), Positives = 72/159 (45%), Gaps = 8/159 (5%)

Query: 20 RRKYDPEQTKRNILDVATQEFSAMGLAGARVDAIAERTNTTKRMLYYYFESKEGLYEAVL 79
+ K + ++T+++ILDVA + FS G++ + IA+ T+ +Y++F+ K L+ +
Sbjct: 4 KTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIW 63

Query: 80 EKVYGDIRALEQELHVGDM-EPREGMRRLVEFTFD--YHDKHRDFVRLV---SIENIHGA 133
E +I LE E +P +R ++ + ++ R + + E +
Sbjct: 64 ELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEM 123

Query: 134 KYVEQLKSFKNRNVSIIKTLEELVERGAASGVFRKDIDA 172
V+Q + +N + +E+ ++ + + D+
Sbjct: 124 AVVQQAQ--RNLCLESYDRIEQTLKHCIEAKMLPADLMT 160


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5002  RTXTOXINA     128    8e-32    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 128 bits (323), Expect = 8e-32
Identities = 83/295 (28%), Positives = 117/295 (39%), Gaps = 28/295 (9%)

Query: 711 IMYGGAGNDRMWGGVGHDYMDGGDGADFVSGGDGND-TVFGGAGNDELHGDAGNDRLLGE 769
+ G G+D+++ G + G G D V + + G+ R+LG
Sbjct: 613 ESHLGDGDDKVFLSAGSANIYAGKGHDVVYYDKTDTGYLTIDGTKATEAGNYTVTRVLGG 672

Query: 770 AGNDRIFGEAGDDVLWGGDGDDV------LVGFTASN-DAKQTLSWGESDNDMLYGGNGN 822
+V G + N L E L G
Sbjct: 673 DVKVLQEVVKEQEVSVGKRTEKTQYRSYEFTHINGKNLTETDNLYSVEE----LIGTTRA 728

Query: 823 DALYGGLGNDYLDGGNDNDFLDGGDGDDRLFGGAGDDELNGGNGHDALSGETGNDKIFGG 882
D +G D G + +D ++G DG+DRL+G G+D L+GGNG D L G GNDK+ G
Sbjct: 729 DKFFGSKFTDIFHGADGDDLIEGNDGNDRLYGDKGNDTLSGGNGDDQLYGGDGNDKLIGV 788

Query: 883 AGNDTIWGGDGDDILVGFTASNDAKQSLAAWETDDDTIYGGAGNDLILGGLGNDLLHGEA 942
AGN+ + GGDGDD S +D +YG G DL+ GG G+DLL G
Sbjct: 789 AGNNYLNGGDGDDEFQVQGNSLAKNVLFGG--KGNDKLYGSEGADLLDGGEGDDLLKGGY 846

Query: 943 GNDE------------IQGGDGHDKLYGGDGN--DRLFGQVGNDILYGGAGDDLL 983
GND G DKL D + D F + GND++ ++L
Sbjct: 847 GNDIYRYLSGYGHHIIDDDGGKEDKLSLADIDFRDVAFKREGNDLIMYKGEGNVL 901



Score = 127 bits (320), Expect = 2e-31
Identities = 68/191 (35%), Positives = 92/191 (48%), Gaps = 7/191 (3%)

Query: 609 LAGDGNDMMGGSSRNDNLWGGTGNDTLFGYDGDDRLYGEEGDDELNGGAGNDVLDGGIGN 668
+ D GS D G G+D + G DG+DRLYG++G+D L+GG G+D L GG GN
Sbjct: 723 IGTTRADKFFGSKFTDIFHGADGDDLIEGNDGNDRLYGDKGNDTLSGGNGDDQLYGGDGN 782

Query: 669 DKLFGHVGNDIMNGGDGDDIMLGFTASNDSKQTLAWGETDDDIMYGGAGNDRMWGGVGHD 728
DKL G GN+ +NGGDGDD S G +D +YG G D + GG G D
Sbjct: 783 DKLIGVAGNNYLNGGDGDDEFQVQGNSLAKNVLF--GGKGNDKLYGSEGADLLDGGEGDD 840

Query: 729 YMDGGDGADFV--SGGDGNDTVF-GGAGNDELH-GDAGNDRLLGE-AGNDRIFGEAGDDV 783
+ GG G D G G+ + G D+L D + + GND I + +V
Sbjct: 841 LLKGGYGNDIYRYLSGYGHHIIDDDGGKEDKLSLADIDFRDVAFKREGNDLIMYKGEGNV 900

Query: 784 LWGGDGDDVLV 794
L G + +
Sbjct: 901 LSIGHKNGITF 911



Score = 125 bits (315), Expect = 6e-31
Identities = 86/288 (29%), Positives = 117/288 (40%), Gaps = 48/288 (16%)

Query: 756 ELHGDAGNDRLLGEAGNDRIFGEAGDDVLWGGDGDDVLVGFTASNDAKQTLSWGESDNDM 815
E H G+D++ AG+ I+ G DV++ D + + + G
Sbjct: 613 ESHLGDGDDKVFLSAGSANIYAGKGHDVVYYDKTDTGYLTIDGTKATEA----GNYTVTR 668

Query: 816 LYGGNGNDALYGGLGNDYLDGGNDNDFLDGGDGDDRLFGGAGDDELNGGNGHDALSGETG 875
+ GG+ L + + G + + G E + + L G T
Sbjct: 669 VLGGDVK-VLQEVVKEQEVSVGKRTEKTQYRSYEFTHINGKNLTETDNLYSVEELIGTTR 727

Query: 876 NDKIFGGAGNDTIWGGDGDDILVGFTASNDAKQSLAAWETDDDTIYGGAGNDLILGGLGN 935
DK FG D G DGDD LI G GN
Sbjct: 728 ADKFFGSKFTDIFHGADGDD--------------------------------LIEGNDGN 755

Query: 936 DLLHGEAGNDEIQGGDGHDKLYGGDGNDRLFGQVGNDILYGGAGDDLLVGFTGDNEAKRT 995
D L+G+ GND + GG+G D+LYGGDGND+L G GN+ L GG GDD
Sbjct: 756 DRLYGDKGNDTLSGGNGDDQLYGGDGNDKLIGVAGNNYLNGGDGDDEF-----------Q 804

Query: 996 LGPGETDDDYLYGGEGNDTLLGGLGDDYLDGGAGADHMEGGEGNDTYI 1043
+ + L+GG+GND L G G D LDGG G D ++GG GND Y
Sbjct: 805 VQGNSLAKNVLFGGKGNDKLYGSEGADLLDGGEGDDLLKGGYGNDIYR 852



Score = 124 bits (313), Expect = 9e-31
Identities = 91/313 (29%), Positives = 133/313 (42%), Gaps = 42/313 (13%)

Query: 639 DGDDRLYGEEGDDELNGGAGNDVLDGGIGNDKLFGHVGNDIMNGGDGDDIMLGFTASNDS 698
DGDD+++ G + G G+DV+ + G++ D + + + D
Sbjct: 618 DGDDKVFLSAGSANIYAGKGHDVVYYDKTD---TGYLTIDGTKATEAGNYTVTRVLGGDV 674

Query: 699 K--------QTLAWGETDDDIMYGGAGNDRMWGGVGHDYMDGGDGADFVSGGDGNDTVFG 750
K Q ++ G+ + Y + G D + + G D FG
Sbjct: 675 KVLQEVVKEQEVSVGKRTEKTQYRSYEFTHI-NGKNLTETDNLYSVEELIGTTRADKFFG 733

Query: 751 GAGNDELHGDAGNDRLLGEAGNDRIFGEAGDDVLWGGDGDDVLVGFTASNDAKQTLSWGE 810
D HG G+D + G GNDR++G+ G+D L GG+GD
Sbjct: 734 SKFTDIFHGADGDDLIEGNDGNDRLYGDKGNDTLSGGNGD-------------------- 773

Query: 811 SDNDMLYGGNGNDALYGGLGNDYLDGGNDND---FLDGGDGDDRLFGGAGDDELNGGNGH 867
D LYGG+GND L G GN+YL+GG+ +D + LFGG G+D+L G G
Sbjct: 774 ---DQLYGGDGNDKLIGVAGNNYLNGGDGDDEFQVQGNSLAKNVLFGGKGNDKLYGSEGA 830

Query: 868 DALSGETGNDKIFGGAGNDT--IWGGDGDDILVGFTASNDAKQSLAAWETDDDTIYGGAG 925
D L G G+D + GG GND G G I+ D K SLA + D + G
Sbjct: 831 DLLDGGEGDDLLKGGYGNDIYRYLSGYGHHIIDDDGGKED-KLSLADIDFRDVA-FKREG 888

Query: 926 NDLILGGLGNDLL 938
NDLI+ ++L
Sbjct: 889 NDLIMYKGEGNVL 901



Score = 108 bits (271), Expect = 9e-26
Identities = 63/215 (29%), Positives = 97/215 (45%), Gaps = 29/215 (13%)

Query: 635 LFGYDGDDRLYGEEGDDELNGGAGNDVLDGGIGNDKLFGHVGNDIMNGGDGDDIMLGFTA 694
L G D+ +G + D +G G+D+++G GND+L+G GND ++GG+G
Sbjct: 722 LIGTTRADKFFGSKFTDIFHGADGDDLIEGNDGNDRLYGDKGNDTLSGGNG--------- 772

Query: 695 SNDSKQTLAWGETDDDIMYGGAGNDRMWGGVGHDYMDGGDGADFVSGGDGNDTVFGGAGN 754
DD +YGG GND++ G G++Y++GGDG D
Sbjct: 773 --------------DDQLYGGDGNDKLIGVAGNNYLNGGDGDDEFQVQG------NSLAK 812

Query: 755 DELHGDAGNDRLLGEAGNDRIFGEAGDDVLWGGDGDDVLVGFTASNDAKQTLSWGESDND 814
+ L G GND+L G G D + G GDD+L GG G+D+ + G+ D
Sbjct: 813 NVLFGGKGNDKLYGSEGADLLDGGEGDDLLKGGYGNDIYRYLSGYGHHIIDDDGGKEDKL 872

Query: 815 MLYGGNGNDALYGGLGNDYLDGGNDNDFLDGGDGD 849
L + D + GND + + + L G +
Sbjct: 873 SLADIDFRDVAFKREGNDLIMYKGEGNVLSIGHKN 907



Score = 85.0 bits (210), Expect = 1e-18
Identities = 43/136 (31%), Positives = 69/136 (50%), Gaps = 16/136 (11%)

Query: 1377 GNALDNTIIGNRGNNVLDGGAGNDILIGGLGNDTYRFGRGSGCDTIRDDDETLGNSDVIS 1436
G ++ + G+ G ++LDGG G+D+L GG GND YR+ G G I DD G D +S
Sbjct: 817 GGKGNDKLYGSEGADLLDGGEGDDLLKGGYGNDIYRYLSGYGHHIIDDDG---GKEDKLS 873

Query: 1437 IGAGVSADQLWFRHVGNDL-------EISILGTGDTATVRDWYL-----GSRYQIEQIRV 1484
+ A + + F+ GNDL + +G + T R+W+ S ++IEQI
Sbjct: 874 L-ADIDFRDVAFKREGNDLIMYKGEGNVLSIGHKNGITFRNWFEKESGDISNHEIEQIFD 932

Query: 1485 DDGRTLVNADVEKLVQ 1500
GR + ++K ++
Sbjct: 933 KSGRIITPDSLKKALE 948



Score = 53.0 bits (127), Expect = 9e-09
Identities = 57/180 (31%), Positives = 76/180 (42%), Gaps = 36/180 (20%)

Query: 550 NDKIYWINSRDFIMFGPTQIKVSPSNRSYLIGTDGNDVFDANYYAAYGHWIDSNLLVNFL 609
ND++Y D + G + L G DGND G+ N L
Sbjct: 755 NDRLYGDKGNDTLSGG--------NGDDQLYGGDGNDKL----IGVAGN----NYLN--- 795

Query: 610 AGDGNDMM---GGSSRNDNLWGGTGNDTLFGYDGDDRLYGEEGDDELNGGAGND--VLDG 664
GDG+D G S + L+GG GND L+G +G D L G EGDD L GG GND
Sbjct: 796 GGDGDDEFQVQGNSLAKNVLFGGKGNDKLYGSEGADLLDGGEGDDLLKGGYGNDIYRYLS 855

Query: 665 GIGNDKLFGHVGNDIMNGGDGDDIMLGFTASNDSKQTLAWGETDDD-IMYGGAGNDRMWG 723
G G+ + GG D + L ++ + +A+ +D IMY G GN G
Sbjct: 856 GYGHHIIDDD-------GGKEDKLSL----ADIDFRDVAFKREGNDLIMYKGEGNVLSIG 904



Score = 36.5 bits (84), Expect = 0.001
Identities = 24/84 (28%), Positives = 36/84 (42%), Gaps = 8/84 (9%)

Query: 1381 DNTIIGNRGNNVLDGGAGNDILIGGLGNDTYRFGRGSGCDTIRDDDETLGNSDVISIGAG 1440
D+ I GN GN+ L G GND L GG G+D G G +D+ +G + + G
Sbjct: 746 DDLIEGNDGNDRLYGDKGNDTLSGGNGDDQLYGGDG--------NDKLIGVAGNNYLNGG 797

Query: 1441 VSADQLWFRHVGNDLEISILGTGD 1464
D+ + + G G+
Sbjct: 798 DGDDEFQVQGNSLAKNVLFGGKGN 821



Score = 31.5 bits (71), Expect = 0.036
Identities = 15/40 (37%), Positives = 22/40 (55%)

Query: 1377 GNALDNTIIGNRGNNVLDGGAGNDILIGGLGNDTYRFGRG 1416
G+ + G G+++++G GND L G GNDT G G
Sbjct: 733 GSKFTDIFHGADGDDLIEGNDGNDRLYGDKGNDTLSGGNG 772


27  Bcenmc03_5043  Bcenmc03_5102  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias     Product
Bcenmc03_5043  10      25       -2.211403   hypothetical protein
Bcenmc03_5044  10      25       -1.973973   hypothetical protein
Bcenmc03_5045  9       25       -2.464263   hypothetical protein
Bcenmc03_5046  8       27       -3.097387   hypothetical protein
Bcenmc03_5047  9       29       -3.386989   OmpA/MotB domain-containing protein
Bcenmc03_5048  9       31       -4.734839   hemagluttinin domain-containing protein
Bcenmc03_5049  7       46       -9.124219   hypothetical protein
Bcenmc03_5050  8       51       -9.679212   AsnC family transcriptional regulator
Bcenmc03_5051  7       58       -11.845276  ATPase central domain-containing protein
Bcenmc03_5052  7       62       -13.645629  XRE family transcriptional regulator
Bcenmc03_5053  7       68       -15.122570  hypothetical protein
Bcenmc03_5054  7       70       -15.828476  transposase IS3/IS911 family protein
Bcenmc03_5055  7       73       -16.302097  integrase catalytic subunit
Bcenmc03_5056  10      72       -16.045628  integrase catalytic subunit
Bcenmc03_5057  11      67       -14.515152  hypothetical protein
Bcenmc03_5058  10      64       -13.434306  hypothetical protein
Bcenmc03_5059  10      52       -10.799544  hypothetical protein
Bcenmc03_5060  11      45       -8.806212   histone family protein nucleoid-structuring
Bcenmc03_5061  11      44       -8.537418   KAP P-loop domain-containing protein
Bcenmc03_5062  10      44       -8.182025   hypothetical protein
Bcenmc03_5063  9       38       -7.469514   hypothetical protein
Bcenmc03_5064  9       37       -6.767538   integrase catalytic subunit
Bcenmc03_5065  9       37       -6.502073   transposase IS3/IS911 family protein
Bcenmc03_5066  8       36       -6.573972   hypothetical protein
Bcenmc03_5067  7       33       -5.221491   TatD-related deoxyribonuclease
Bcenmc03_5068  6       32       -4.885021   FRG domain-containing protein
Bcenmc03_5069  6       34       -5.140353   AraC family transcriptional regulator
Bcenmc03_5070  4       38       -5.875948   alpha/beta hydrolase fold protein
Bcenmc03_5071  4       41       -6.324173   short-chain dehydrogenase/reductase SDR
Bcenmc03_5072  4       45       -6.847428   major facilitator transporter
Bcenmc03_5073  3       34       -6.710773   alkylhydroperoxidase
Bcenmc03_5074  2       31       -6.768444   hypothetical protein
Bcenmc03_5075  3       33       -7.085206   AraC family transcriptional regulator
Bcenmc03_5076  2       33       -7.531337   transposase
Bcenmc03_5077  1       29       -6.797156   hypothetical protein
Bcenmc03_5078  0       20       -4.489004   catalase/peroxidase HPI
Bcenmc03_5079  -1      17       -3.583029   alpha/beta hydrolase fold protein
Bcenmc03_5080  -1      18       -3.897847   LysR family transcriptional regulator
Bcenmc03_5081  -1      16       -3.380859   cyclic nucleotide-regulated FAD-dependent
Bcenmc03_5082  -1      13       -2.624798   2Fe-2S iron-sulfur cluster binding
Bcenmc03_5083  -1      17       -3.870415   aldehyde oxidase and xanthine dehydrogenase
Bcenmc03_5084  1       37       -8.338527   gluconate 2-dehydrogenase (acceptor)
Bcenmc03_5085  3       50       -11.202556  hypothetical protein
Bcenmc03_5086  4       53       -11.910534  transport-associated
Bcenmc03_5087  4       51       -11.371567  LysR family transcriptional regulator
Bcenmc03_5088  4       53       -12.100227  response regulator receiver protein
Bcenmc03_5089  4       48       -10.678496  PAS/PAC sensor signal transduction histidine
Bcenmc03_5090  6       47       -10.414976  two component LuxR family transcriptional
Bcenmc03_5091  6       47       -10.227333  major facilitator transporter
Bcenmc03_5092  5       43       -8.911491   pyridine nucleotide-disulfide oxidoreductase
Bcenmc03_5093  8       43       -9.354241   hypothetical protein
Bcenmc03_5094  8       41       -9.161889   integrase catalytic subunit
Bcenmc03_5095  9       48       -11.114459  transposase IS3/IS911 family protein
Bcenmc03_5096  7       47       -10.844742  hypothetical protein
Bcenmc03_5097  4       37       -8.147758   hypothetical protein
Bcenmc03_5098  3       29       -6.237296   hypothetical protein
Bcenmc03_5099  1       18       -1.802610   XRE family transcriptional regulator
Bcenmc03_5100  1       12       2.807468    hypothetical protein
Bcenmc03_5101  1       14       3.573448    hypothetical protein
Bcenmc03_5102  0       12       3.094716    NADPH-dependent FMN reductase
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5047  OMPADOMAIN    113    4e-32    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 113 bits (284), Expect = 4e-32
Identities = 55/145 (37%), Positives = 75/145 (51%), Gaps = 11/145 (7%)

Query: 87 FQCGAAPAEAASAATPAPAAVIERVNLSGDALFATDHATLAPTARESLDRLLSE--RADR 144
F G A A A PAP + L D LF + ATL P + +LD+L S+ D
Sbjct: 191 FGQGEAAPVVAPAPAPAPEVQTKHFTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDP 250

Query: 145 TYSQVTVTGFTDSVGSDDYNLALSKRRAESVAAYLKAHGLKTDSITVSGRGKADPVASN- 203
V V G+TD +GSD YN LS+RRA+SV YL + G+ D I+ G G+++PV N
Sbjct: 251 KDGSVVVLGYTDRIGSDAYNQGLSERRAQSVVDYLISKGIPADKISARGMGESNPVTGNT 310

Query: 204 --------ATPEGRASNRRVEIRLQ 220
A + A +RRVEI ++
Sbjct: 311 CDNVKQRAALIDCLAPDRRVEIEVK 335


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5048  OMADHESIN     36     0.001    Yersinia outer membrane adhesin signature.
		>OMADHESIN#Yersinia outer membrane adhesin signature.

Length = 455

Score = 36.4 bits (83), Expect = 0.001
Identities = 40/131 (30%), Positives = 68/131 (51%)

Query: 2331 GQNASATGGKAVSIGSGNTASGDGAVAIGDPNVATGTGAVAMGANNTATGDGAVSLGNQN 2390
G NASA G +++IG+ A+ AVA+G ++ATG +VA+G + A GD AV+ G +
Sbjct: 62 GLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAAS 121

Query: 2391 TATGASALALGSANQATADNTIALGSQATASATGAQAYGSAAKATAADALAFGTNAQANV 2450
TA A+ + + S+A A + A + S A ++A G ++ +
Sbjct: 122 TAQKDGVAIGARASTSDTGVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDR 181

Query: 2451 ANSIALGANSV 2461
NS+++G S+
Sbjct: 182 ENSVSIGHESL 192



Score = 35.3 bits (80), Expect = 0.004
Identities = 64/294 (21%), Positives = 109/294 (37%)

Query: 2301 AAGVNASAAGASSVAVGDGSNAQTAGAVAIGQNASATGGKAVSIGSGNTASGDGAVAIGD 2360
A G+NASA G S+A+G + A AVA+G + ATG +V+IG + A GD AV G
Sbjct: 60 AGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGA 119

Query: 2361 PNVATGTGAVAMGANNTATGDGAVSLGNQNTATGASALALGSANQATADNTIALGSQATA 2420
+ A G +T+ AV ++ A + A+ S A +IA+G ++
Sbjct: 120 ASTAQKDGVAIGARASTSDTGVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKT 179

Query: 2421 SATGAQAYGSAAKATAADALAFGTNAQANVANSIALGANSVTAAAVGTSSATIGGVTYPF 2480
+ + G + LA GT V + T SA + +
Sbjct: 180 DRENSVSIGHESLNRQLTHLAAGTKDTDAVNVAQLKKEIEKTQENTNKRSAELLANANAY 239

Query: 2481 AGGSPVGVVSVGAPGQERQITNVAAGRISATSTDAINGSQLNATNNAINTLSTSTASNVA 2540
A V+ + + + + + + ++ +T +
Sbjct: 240 ADNKSSSVLGIANNYTDSKSAETLENARKEAFAQSKDVLNMAKAHSNSVARTTLETAEEH 299

Query: 2541 SLSTGINSLSTGLSTTNSNVASLSTSTSTAINSLSTGLSTTNNNVNSLSTSTST 2594
+ S +L T N A S + +S S+ T N+ ++ S ST
Sbjct: 300 ANSVARTTLETAEEHANKKSAEALASANVYADSKSSHTLKTANSYTDVTVSNST 353



Score = 35.3 bits (80), Expect = 0.004
Identities = 44/129 (34%), Positives = 65/129 (50%), Gaps = 2/129 (1%)

Query: 244 GATGRNVAIGSSGTTANGATAAGGAVAIGRGQVATGDGAVAIGDPNSATGTGALAIGAND 303
G I S A A G AVA+G G +ATG +VAIG + A G A+ GA
Sbjct: 62 GLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAAS 121

Query: 304 TSNGSGAIALGNSNSASGTGSVALGNSSTATNSAVAIGSSASATGTNG-AIAIGNAATAN 362
T+ G +A+G S S TG NS ++VAIG S+ +G +IAIG+ + +
Sbjct: 122 TAQKDG-VAIGARASTSDTGVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTD 180

Query: 363 GTGAIALGN 371
++++G+
Sbjct: 181 RENSVSIGH 189


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5051  HTHFIS        30     0.016    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 30.2 bits (68), Expect = 0.016
Identities = 19/115 (16%), Positives = 36/115 (31%), Gaps = 11/115 (9%)

Query: 75 SRRPRARRVQIFSPDALEQANAALDGADASQQQCARPLLEKAGSNDGCRKLPDIQKALKR 134
RP + + + + A A + A L + G R L + ++ +
Sbjct: 71 KARPDLPVLVMSAQNTFMTAIKASE-KGAYDYLPKPFDLTELIGIIG-RALAEPKRRPSK 128

Query: 135 LDVARGSFANL---SEPIGKLMVDLVLASAVRSREFRVRPILLMGEPGVGKTHFA 186
L+ L S + ++ L +++ GE G GK A
Sbjct: 129 LEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDL------TLMITGESGTGKELVA 177


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5071  DHBDHDRGNASE  73     2e-17    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 72.8 bits (178), Expect = 2e-17
Identities = 43/190 (22%), Positives = 83/190 (43%), Gaps = 5/190 (2%)

Query: 3 MTGNTIFITGGTSGIGRALAEQFHALGNKVIIAGRRKALLDEVTTANPGM----EGVALD 58
+ G FITG GIG A+A + G + L++V ++ E D
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 59 ISDAADIDRVAAQLIRDYPSLNVLINNAGIMPFDDPSGRIDDSVSRQILDTNLLGPIRLT 118
+ D+A ID + A++ R+ +++L+N AG++ + D N G +
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPG-LIHSLSDEEWEATFSVNSTGVFNAS 124

Query: 119 SALIEHLKAQPRATIIHNTSVLAYVPIATNAVYSASKAALHSYALSQRFMLKGTSVSVQE 178
++ +++ + +I+ S A VP + A Y++SKAA + L ++
Sbjct: 125 RSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNI 184

Query: 179 IAPPWVDTDL 188
++P +TD+
Sbjct: 185 VSPGSTETDM 194


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5072  TCRTETB       58     3e-11    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 58.0 bits (140), Expect = 3e-11
Identities = 36/151 (23%), Positives = 64/151 (42%), Gaps = 5/151 (3%)

Query: 37 LPVMAKDFGLPVPTVAVLVIVFTLVLALSSPISTVATGRMARKWVLLAAMSLFAIGNVTA 96
LP +A DF P + + F L ++ + + + ++ K +LL + + G+V
Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIG 96

Query: 97 AVSASFA-LLIGARVLMAIAAGLYVPAANGLAGVIVPPSMRGRALAIVSAGQTLAIALGL 155
V SF LLI AR + A + + +P RG+A ++ + + +G
Sbjct: 97 FVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGP 156

Query: 156 PLGGMIGHAFGWRATFLLVGAMSVIAIAGIF 186
+GGMI H W L+ +I I +
Sbjct: 157 AIGGMIAHYIHWSYLLLI----PMITIITVP 183


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5088  HTHFIS        76     2e-19    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 76.0 bits (187), Expect = 2e-19
Identities = 21/101 (20%), Positives = 43/101 (42%)

Query: 9 IIDDDQSVRRATGSLVRSLGWEVRTYESGEEFLSAERIADVACIISDVQMPGISGLEMYE 68
+ DDD ++R + G++VR + D +++DV MP + ++
Sbjct: 8 VADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDLLP 67

Query: 69 MLLERGVAPPVIFITSFPSEATHRQAMKLGAICVFSKPVDP 109
+ + PV+ +++ + T +A + GA KP D
Sbjct: 68 RIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDL 108


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5090  HTHFIS        100    1e-26    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 99.5 bits (248), Expect = 1e-26
Identities = 32/143 (22%), Positives = 62/143 (43%)

Query: 16 VVDDDDSMRSALGMLLRSVGLRVELFSSAQEFLAFDKPDVSSCLILDVRLKGQSGLVLQE 75
V DDD ++R+ L L G V + S+A + ++ DV + ++ L
Sbjct: 8 VADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDLLP 67

Query: 76 QIVAGDMGLPIIFITAHGDVAMSVKAMKNGALDFLSKPFRDQEMLDAVEGALLKHEARRR 135
+I LP++ ++A ++KA + GA D+L KPF E++ + AL + + R
Sbjct: 68 RIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRRPS 127

Query: 136 TDGRVAEVRRRYESLTPREREVM 158
++ + +E+
Sbjct: 128 KLEDDSQDGMPLVGRSAAMQEIY 150


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5091  TCRTETB       38     5e-05    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 38.3 bits (89), Expect = 5e-05
Identities = 28/156 (17%), Positives = 62/156 (39%), Gaps = 1/156 (0%)

Query: 252 IFSSKIFWIFGIIYFLDVFGIYGYTLWAPTIIKSLGVERNSLIGLLAALPNAVAVIVM-I 310
+ + F I + + + G+ P ++K + + IG + P ++VI+
Sbjct: 252 LGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGY 311

Query: 311 IAGRKADSRRERRLLVAALFLMAAAGLTLALVWHGTLWLSIAALCIANAGLLSIPPIFWG 370
I G D R +L + ++ + LT + + T W + GL +
Sbjct: 312 IGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVIST 371

Query: 371 MPTAVLSPRNAASGIAWISAIGNIGGFFGPYVVGLL 406
+ ++ L + A +G++ ++ + G +VG L
Sbjct: 372 IVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGL 407



Score = 35.2 bits (81), Expect = 5e-04
Identities = 32/161 (19%), Positives = 60/161 (37%), Gaps = 2/161 (1%)

Query: 246 SINQNNIFSSKIFWIFGIIYFLDVFGIYGYTLWAPTIIKSLGVERNSLIGLLAALPNAVA 305
S +Q+N+ ++I I+ F V + P I S + A +
Sbjct: 4 SYSQSNLRHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFS 63

Query: 306 VIVMIIAGRKADSRRERRLLVAALFLMAAAGLTLALVWHGTLWLSIAALCIANAGLLSIP 365
+ + G+ +D +RLL+ + + + + V H L I A I AG + P
Sbjct: 64 IGTAVY-GKLSDQLGIKRLLLFGIIINCFGSV-IGFVGHSFFSLLIMARFIQGAGAAAFP 121

Query: 366 PIFWGMPTAVLSPRNAASGIAWISAIGNIGGFFGPYVVGLL 406
+ + + N I +I +G GP + G++
Sbjct: 122 ALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMI 162


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5092  GPOSANCHOR    33     0.003    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 32.7 bits (74), Expect = 0.003
Identities = 16/38 (42%), Positives = 26/38 (68%)

Query: 200 IHDAPRVAMREDEDVSREIQQALEADGIKLELQSRIAN 237
+ +A R ++R D D SRE ++ LEA+ KLE Q++I+
Sbjct: 306 VLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISE 343


28  Bcenmc03_5186  Bcenmc03_5197  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias     Product
Bcenmc03_5186  1       14       3.257809    isoprenylcysteine carboxyl methyltransferase
Bcenmc03_5187  1       13       2.830128    glycosyl transferase family protein
Bcenmc03_5188  0       10       2.477239    EmrB/QacA family drug resistance transporter
Bcenmc03_5189  -2      8        2.571718    rhamnosyltransferase
Bcenmc03_5190  -3      7        1.833198    RND efflux system outer membrane lipoprotein
Bcenmc03_5191  -2      9        0.119701    secretion protein HlyD family protein
Bcenmc03_5192  0       11       -1.561619   formaldehyde dehydrogenase,
Bcenmc03_5193  1       19       -3.100115   hypothetical protein
Bcenmc03_5194  -1      14       -3.576196   transcriptional regulator
Bcenmc03_5195  0       12       -3.885459   coagulation factor 5/8 type domain-containing
Bcenmc03_5196  -2      12       -4.780623   hypothetical protein
Bcenmc03_5197  -2      11       -3.875785   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5188  TCRTETB       104    4e-26    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 104 bits (261), Expect = 4e-26
Identities = 87/421 (20%), Positives = 169/421 (40%), Gaps = 21/421 (4%)

Query: 32 IRLALLTFALSLATFIEVLDSTVTNVAVPAISGSLGVSNSQGTWVISSYSVAAAIAVPLT 91
+R + L + +F VL+ V NV++P I+ + WV +++ + +I +
Sbjct: 10 LRHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVY 69

Query: 92 GWLARRVGELRLFVGAVLLFTLTSLLCGLARD-LHVLVICRALQGLCSGPMVPLSQTILL 150
G L+ ++G RL + +++ S++ + +L++ R +QG + L ++
Sbjct: 70 GKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVA 129

Query: 151 RTFPPDKRTIALALWAMTVLLAPIFGPVVGGWIVDNFSWPWIFLINLPIGLFSFAVCTAM 210
R P + R A L V + GP +GG I W ++ LI + I + + +
Sbjct: 130 RYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPM-ITIITVPFLMKL 188

Query: 211 LRPDAQRGAAGPIDVPGIVLLVIGVGSLQAMLDLGHDRGWFDSPLIVTLAVVAALAIVSL 270
L+ + + G D+ GI+L+ +G+ F + ++ +V+ L+ +
Sbjct: 189 LKKEVRI--KGHFDIKGIILMSVGIVFFM----------LFTTSYSISFLIVSVLSFLIF 236

Query: 271 LIWEAGEAHPVVDLSLFRDRTFSFCVLIISLGMMSFSVVGVVFPLWMQAVMGYNAFHAGL 330
+ P VD L ++ F VL + + + + P M+ V + G
Sbjct: 237 VKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGS 296

Query: 331 ATASLGVLA-LVFSILVGLHAHRFDARVLATFGFLVFAAVLAWDAQFTLKMTFAQIAAPG 389
G ++ ++F + G+ R + G + F +V A F L+ T +
Sbjct: 297 VIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIG-VTFLSVSFLTASFLLETTSWFMTIII 355

Query: 390 LIQGIGLPCFFIPLTAATLSRIPDDRLAAASSLSNFLRTLSAAFGTA-----MSVTLWDN 444
+ GL ++ S + A SL NF LS G A +S+ L D
Sbjct: 356 VFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLLDQ 415

Query: 445 R 445
R
Sbjct: 416 R 416


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5191  RTXTOXIND     73     5e-16    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 72.6 bits (178), Expect = 5e-16
Identities = 45/273 (16%), Positives = 80/273 (29%), Gaps = 32/273 (11%)

Query: 107 AFAQAKAQLAQAVRQVANARISNTMYVEAVNARRADLSLAQRALAA-RSGASVEIVAPEE 165
F+ + Q Q + R + +N + + L S + +A
Sbjct: 194 QFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHA 253

Query: 166 LARARAAVAGAQANLAAAQAQLDAARA--LGSKLPVDESPAVVQAAAQFKLAYR------ 217
+ A L ++QL+ + L +K + + KL
Sbjct: 254 VLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGL 313

Query: 218 ----------NLKRTTIVAPVDGTIGQRSVQ-VGQQVGPGVPLMSIVQLN-RLWVEANFK 265
+ + I APV + Q V G V LM IV + L V A +
Sbjct: 314 LTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQ 373

Query: 266 EGQIRHMRVGQPVEVVSDLYGSRIA--YRGRVQGFSAGTGSAFSMLPSQNAAGNWIKVVQ 323
I + VGQ + + + G+V+ + G V+
Sbjct: 374 NKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDA-------IEDQRLGLVFNVII 426

Query: 324 RVPVVIALDPRDVAAHPLRVGLSMRATVDTHDR 356
+ PL G+++ A + T R
Sbjct: 427 SIEENCLS--TGNKNIPLSSGMAVTAEIKTGMR 457



Score = 61.0 bits (148), Expect = 3e-12
Identities = 30/189 (15%), Positives = 67/189 (35%), Gaps = 8/189 (4%)

Query: 12 PAALNDPALDARRATRRKRFTVFFAI-VLLAAIAWIAYWLLSDRYYEDTDDAYVAGSIVQ 70
PA L L +RR R +F + L+ A + + +G +
Sbjct: 43 PAHL---ELIETPVSRRPRLVAYFIMGFLVIAFIL-SVLGQVEIVATANGKLTHSGRSKE 98

Query: 71 VAAQIPGAVTDVVVADTQAVRAGQPLVRLDDTEASVAFAQAKAQLAQAVRQVANARISNT 130
+ V +++V + ++VR G L++L A + ++ L QA + +I +
Sbjct: 99 IKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQIL-S 157

Query: 131 MYVEAVNARRADLS--LAQRALAARSGASVEIVAPEELARARAAVAGAQANLAAAQAQLD 188
+E L + ++ + + E+ + + + NL +A+
Sbjct: 158 RSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERL 217

Query: 189 AARALGSKL 197
A ++
Sbjct: 218 TVLARINRY 226


29  Bcenmc03_5257  Bcenmc03_5265  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias     Product
Bcenmc03_5257  1       21       -4.092419   diguanylate cyclase
Bcenmc03_5258  2       26       -6.043717   OmpA/MotB domain-containing protein
Bcenmc03_5259  2       27       -6.773138   type VI secretion system Vgr family protein
Bcenmc03_5260  3       38       -7.592741   hypothetical protein
Bcenmc03_5261  3       31       -6.893709   YD repeat-containing protein
Bcenmc03_5262  1       25       -7.695191   hypothetical protein
Bcenmc03_5265  0       15       -4.758936   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5258  OMPADOMAIN    85     4e-22    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 84.6 bits (209), Expect = 4e-22
Identities = 35/116 (30%), Positives = 57/116 (49%), Gaps = 11/116 (9%)

Query: 65 ILFDFDRYNLKPDVRRIVERIGRTLRSAGING--VRVYGYSDDEGTAAYDAELSRRRAEV 122
+LF+F++ LKP+ + ++++ L + V V GY+D G+ AY+ LS RRA+
Sbjct: 221 VLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGLSERRAQS 280

Query: 123 VAVELVDVGLDAKRIAIVGKGKLDPVGDN---------RTPAGRAQNRRAAIVVSP 169
V L+ G+ A +I+ G G+ +PV N A +RR I V
Sbjct: 281 VVDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIEVKG 336


30  Bcenmc03_5279  Bcenmc03_5296  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias     Product
Bcenmc03_5279  -1      13       3.155285    AraC family transcriptional regulator
Bcenmc03_5280  0       12       3.065768    AraC family transcriptional regulator
Bcenmc03_5281  0       10       2.980102    short-chain dehydrogenase/reductase SDR
Bcenmc03_5282  -1      11       3.331026    class III aminotransferase
Bcenmc03_5283  -1      8        3.018282    LysR family transcriptional regulator
Bcenmc03_5284  0       10       3.766529    salicylate biosynthesis isochorismate synthase
Bcenmc03_5285  1       11       3.561890    isochorismate-pyruvate lyase
Bcenmc03_5286  1       11       3.768721    thioesterase
Bcenmc03_5287  1       11       4.024615    AMP-dependent synthetase and ligase
Bcenmc03_5288  2       12       4.434168    AraC family transcriptional regulator
Bcenmc03_5289  1       12       4.282346    amino acid adenylation domain-containing
Bcenmc03_5290  1       12       4.208538    amino acid adenylation domain-containing
Bcenmc03_5291  2       12       5.005588    thiazolinyl imide reductase
Bcenmc03_5292  1       11       4.671770    ABC transporter-like protein
Bcenmc03_5293  1       11       4.035215    ABC transporter-like protein
Bcenmc03_5294  -1      11       3.226288    TonB-dependent siderophore receptor
Bcenmc03_5295  0       11       3.563189    hypothetical protein
Bcenmc03_5296  0       10       3.204966    PepSY-associated TM helix domain-containing
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5281  DHBDHDRGNASE  86     3e-22    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 86.3 bits (213), Expect = 3e-22
Identities = 47/180 (26%), Positives = 78/180 (43%), Gaps = 10/180 (5%)

Query: 2 KTVLITGCSSGFGLEIARHFLARDWQVVATMRKPN-DDVLPPSERLRV-----LPLDVTN 55
K ITG + G G +AR ++ + A P + + S + P DV +
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 56 ADSIRAAID----AAGPIDVLVNNAGFGAAAPAELMPLDTVRALFDTNTIGTIAVTQAVL 111
+ +I GPID+LVN AG + + A F N+ G +++V
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 112 PQFRARGAGVVVNVTSSVTLKALPLVSAYRASKAAVNAYTESMAAELEPFGVRAHLVLPG 171
R +G +V V S+ ++AY +SKAA +T+ + EL + +R ++V PG
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5289  ISCHRISMTASE  56     5e-10    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 55.8 bits (134), Expect = 5e-10
Identities = 30/63 (47%), Positives = 38/63 (60%), Gaps = 3/63 (4%)

Query: 31 IAELLDESVDEIASLDDDEDLLSCGLDSIRLMYLQTRVNRLGHALTFDALARTPTLGAWT 90
IAELL E+ ++I D EDLL GLDS+R+M L + R G +TF LA PT+ W
Sbjct: 239 IAELLQETPEDI---TDQEDLLDRGLDSVRIMTLVEQWRREGAEVTFVELAERPTIEEWQ 295

Query: 91 TLL 93
LL
Sbjct: 296 KLL 298


31  Bcenmc03_5337  Bcenmc03_5354  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
Bcenmc03_5337  0       23       -4.947847  hypothetical protein
Bcenmc03_5338  2       29       -6.855986  ribonuclease T2
Bcenmc03_5339  3       38       -7.760909  isochorismatase hydrolase
Bcenmc03_5340  3       41       -8.401832  hypothetical protein
Bcenmc03_5341  5       45       -9.141293  alpha/beta hydrolase
Bcenmc03_5342  6       42       -8.701994  hypothetical protein
Bcenmc03_5343  4       32       -6.795695  cupin 2 domain-containing protein
Bcenmc03_5344  2       25       -5.734559  integrase catalytic subunit
Bcenmc03_5345  -2      19       -4.093808  transposase IS3/IS911 family protein
Bcenmc03_5346  -3      11       -1.992721  hypothetical protein
Bcenmc03_5347  -3      9        -0.342481  short-chain dehydrogenase/reductase SDR
Bcenmc03_5348  -3      12       0.740005   glutaminase
Bcenmc03_5349  1       11       2.383264   two component transcriptional regulator
Bcenmc03_5350  1       12       2.795190   histidine kinase
Bcenmc03_5351  1       12       2.987775   patatin
Bcenmc03_5352  2       12       3.179419   rod shape-determining protein MreB
Bcenmc03_5353  3       12       3.102334   hypothetical protein
Bcenmc03_5354  3       12       3.080533   outer membrane autotransporter
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5339  ISCHRISMTASE  51     5e-10    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 51.2 bits (122), Expect = 5e-10
Identities = 36/140 (25%), Positives = 60/140 (42%), Gaps = 9/140 (6%)

Query: 34 IAPSKTALLVMHYQTDILGLFPSVAP---ELLANTRRLCDAARAAGVGVWFANLRFSPG- 89
P++ LL+ Q + F + A EL AN R+L + G+ V + PG
Sbjct: 26 PDPNRAVLLIHDMQNYFVDAFTAGASPVTELSANIRKLKNQCVQLGIPVVYTA---QPGS 82

Query: 90 -YPEVSPLNKNGQGIKQLGLFIDDAPCPELAKRPDEPLIVAHRASVFFGTDLQARLIAQG 148
P+ L + G ++ ELA D+ ++ R S F T+L + +G
Sbjct: 83 QNPDDRALLTDFWGPGLNSGPYEEKIITELAPEDDDLVLTKWRYSAFKRTNLLEMMRKEG 142

Query: 149 VDTLIMVGI-ASTGVMLSSI 167
D LI+ GI A G ++++
Sbjct: 143 RDQLIITGIYAHIGCLVTAC 162


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5347  DHBDHDRGNASE  63     1e-13    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 63.1 bits (153), Expect = 1e-13
Identities = 51/207 (24%), Positives = 88/207 (42%), Gaps = 19/207 (9%)

Query: 9 GRRIVITGANSGTGKEATRRLVAAGADVIMAVRSESKGDAARRDIRKEFPGTSIEVRTLD 68
G+ ITGA G G+ R L + GA + + K + ++ E E D
Sbjct: 8 GKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAE--ARHAEAFPAD 65

Query: 69 LSSLASVRNFGRQLLEEGRPLDVLVNNAGIMMP-PTRVLSSDGFELQLATNFLGHFALTN 127
+ A++ ++ E P+D+LVN AG++ P LS + +E + N G F +
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 128 LLLPLLLEAKSPRVATMTSSAAMGATINFDDLQGERSYKPMTAYAQSKLACLLLANRLA- 186
+ +++ +S + T+ S+ A + M AYA SK A ++ L
Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTS------------MAAYASSKAAAVMFTKCLGL 173

Query: 187 EIARERGWPLLSTSAHPGHTRTNLQTS 213
E+A + + PG T T++Q S
Sbjct: 174 ELA---EYNIRCNIVSPGSTETDMQWS 197


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5349  HTHFIS        70     4e-16    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 69.9 bits (171), Expect = 4e-16
Identities = 29/124 (23%), Positives = 59/124 (47%), Gaps = 1/124 (0%)

Query: 4 RILLVEDDTRLSTLIAGYLRKNDYEVDTVLHGDAAVPAILSIRPDLVILDVNLPGKDGFE 63
IL+ +DD + T++ L + Y+V + I + DLV+ DV +P ++ F+
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 64 ICREARKQYDGV-IIMVTARDEPFDELLGLEFGADDYVHKPVEPRILLARIKAQLRRAPA 122
+ +K + +++++A++ + E GA DY+ KP + L+ I L
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 123 RAAE 126
R ++
Sbjct: 125 RPSK 128


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5350  PF06580       31     0.005    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 31.4 bits (71), Expect = 0.005
Identities = 33/184 (17%), Positives = 70/184 (38%), Gaps = 35/184 (19%)

Query: 201 DSIAQDVTELEELIDMSLTYARLEYSSLQSNLEMTAPVAWFEHQVNDAQLLYPDRAIESR 260
+ +T L EL+ SL Y+ SL L + + + A + + DR ++
Sbjct: 191 TKAREMLTSLSELMRYSLRYSNARQVSLADELTVV------DSYLQLASIQFEDR-LQFE 243

Query: 261 IEIGADLRVKMDRRLMSYAMRNLLRNASKYA------KSRIVVGISLVHGNIGIFVEDDG 314
+I + D ++ ++ L+ N K+ +I++ + +G + + VE+ G
Sbjct: 244 NQINPAIM---DVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTG 300

Query: 315 PGVPESERERIFDAFVRLDRRTGGYGLGLSITR---QVLHAHNGRIAVVDPVELGGARFE 371
++ +E G GL R Q+L+ +I + + + G
Sbjct: 301 SLALKNTKE--------------STGTGLQNVRERLQMLYGTEAQIKLSE--KQGKVNAM 344

Query: 372 ISWP 375
+ P
Sbjct: 345 VLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5352  SHAPEPROTEIN  354    e-124    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 354 bits (910), Expect = e-124
Identities = 166/340 (48%), Positives = 226/340 (66%), Gaps = 2/340 (0%)

Query: 1 MSTPLFGKLFAQPVAIDPGTASTRIYTHERGVVLNQPSVVCFRKGGASDARPTLEAVGEL 60
M G +F+ ++ID GTA+T IY +G+VLN+PSVV R+ A + AVG
Sbjct: 1 MLKKFRG-MFSNDLSIDLGTANTLIYVKGQGIVLNEPSVVAIRQDRAGSPKSVA-AVGHD 58

Query: 61 AKALLGREPGHLEAVRPMRHGVIADAHAAEQMIRSFIDMSRTRSRFGRRVEVTLCVPSDA 120
AK +LGR PG++ A+RPM+ GVIAD E+M++ FI + S V +CVP A
Sbjct: 59 AKQMLGRTPGNIAAIRPMKDGVIADFFVTEKMLQHFIKQVHSNSFMRPSPRVLVCVPVGA 118

Query: 121 TAVERRAIREAAFAAGVSEVELIEESLAAGLGAGLPVTEPVGSMVIDIGGGTTEVAVIAL 180
T VERRAIRE+A AG EV LIEE +AA +GAGLPV+E GSMV+DIGGGTTEVAVI+L
Sbjct: 119 TQVERRAIRESAQGAGAREVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISL 178

Query: 181 GGIVYREAIRVGGSQFDAAIVNHVRNLYGVLLGEQTAEHVKKAIGSATSAVPRTSTRAVG 240
G+VY ++R+GG +FD AI+N+VR YG L+GE TAE +K IGSA G
Sbjct: 179 NGVVYSSSVRIGGDRFDEAIINYVRRNYGSLIGEATAERIKHEIGSAYPGDEVREIEVRG 238

Query: 241 RSIGDGLPRSVELSNHDVADALAAPLKQVIGAVKSVLENAPAELVTDIANRGVVLTGGGA 300
R++ +G+PR L+++++ +AL PL ++ AV LE P EL +DI+ RG+VLTGGGA
Sbjct: 239 RNLAEGVPRGFTLNSNEILEALQEPLTGIVSAVMVALEQCPPELASDISERGMVLTGGGA 298

Query: 301 LLADLERLLYDETGLVARIADEPATCAVRGAGEAMGRLAM 340
LL +L+RLL +ETG+ +A++P TC RG G+A+ + M
Sbjct: 299 LLRNLDRLLMEETGIPVVVAEDPLTCVARGGGKALEMIDM 338


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5353  OMPADOMAIN    31     0.004    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 30.7 bits (69), Expect = 0.004
Identities = 9/24 (37%), Positives = 17/24 (70%)

Query: 103 GLNEATAMRDYLVARGVPADRIAV 126
A ++ DYL+++G+PAD+I+
Sbjct: 274 SERRAQSVVDYLISKGIPADKISA 297


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5354  INTIMIN       44     1e-05    Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 43.5 bits (102), Expect = 1e-05
Identities = 66/279 (23%), Positives = 96/279 (34%), Gaps = 22/279 (7%)

Query: 1667 STGAVNLAGTGATFDVSGATGTQTVGALSGAAGTNVNLGANALALNGSGSSTFGGTIGGA 1726
T A+ T V+ A + +SG A L AN+ NGSG +T
Sbjct: 574 GTEAITYTATVKKNGVAQANVPVSFNIVSGTA----VLSANSANTNGSGKATVTLKSDKP 629

Query: 1727 GGVTVASGTQ----------VLTGDNTYTGGTTIAAGGTLQLGNGGTSGSVAGNVVDNGA 1776
G V V++ T V+ D T T I A T + NG + + V+
Sbjct: 630 GQVVVSAKTAEMTSALNANAVIFVDQTKASITEIKADKTTAVANGQDAITYTVKVMKGDK 689

Query: 1777 LIVNQSGNVTIASVLSGTGSLTQAGSGRLTLTGTSTLSGPTTVGAGTLAVNGSLGQSTVT 1836
+ NQ T + +G +T TST G + V A V + V
Sbjct: 690 PVSNQEVTFTTTLGKLSNSTEKTDTNGYAKVTLTSTTPGKSLVSARVSDVAVDVKAPEVE 749

Query: 1837 VQNGATLTGTG-TIGGLVVQGGATAAATQPGAALNV--GGNVTFQPGSTFQVAATPQQSG 1893
T+ I G V+G Q G GGN + S A+
Sbjct: 750 FFTTLTIDDGNIEIVGTGVKGKLPTVWLQYGQVNLKASGGNGKYTWRSANPAIASVD--- 806

Query: 1894 SLAASGTATLNGGTVQVLANQSGYQPSTTYTILSASSGV 1932
A+SG TL ++ S + TYTI + +S +
Sbjct: 807 --ASSGQVTLKEKGTTTISVISSDNQTATYTIATPNSLI 843


32  Bcenmc03_5365  Bcenmc03_5372  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
Bcenmc03_5365  1       11       3.479068   RpiR family transcriptional regulator
Bcenmc03_5366  1       13       3.565091   GCN5-related N-acetyltransferase
Bcenmc03_5367  2       13       3.740203   hypothetical protein
Bcenmc03_5368  2       11       3.689413   PEP phosphonomutase
Bcenmc03_5369  1       10       4.160803   hypothetical protein
Bcenmc03_5370  -1      9        3.861599   cytochrome P450-like protein
Bcenmc03_5371  -1      8        3.276266   hypothetical protein
Bcenmc03_5372  0       10       3.002989   rhodanese domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5366  SACTRNSFRASE  34     7e-05    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 34.1 bits (78), Expect = 7e-05
Identities = 15/59 (25%), Positives = 25/59 (42%), Gaps = 2/59 (3%)

Query: 61 GWLHVDLLVVPESARGQGAGTRIMDLAEREAVARGCHSAWLDTFDFQ--ARPFYEKRGY 117
G+ ++ + V + R +G GT ++ A A L+T D A FY K +
Sbjct: 88 GYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHF 146


33  Bcenmc03_5431  Bcenmc03_5442  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
Bcenmc03_5431  -1      14       3.911450   YscD/HrpQ family type III secretion apparatus
Bcenmc03_5432  2       17       3.223528   LigA
Bcenmc03_5433  3       16       4.350171   hypothetical protein
Bcenmc03_5434  4       15       3.852075   YscJ/HrcJ family type III secretion apparatus
Bcenmc03_5435  4       14       3.465187   hypothetical protein
Bcenmc03_5436  4       16       3.077377   HrpE/YscL family type III secretion apparatus
Bcenmc03_5437  -1      12       3.479348   type III secretion apparatus H+-transporting
Bcenmc03_5438  -1      11       3.496834   hypothetical protein
Bcenmc03_5439  -1      11       3.150478   type III secretion protein SpaR/YscT/HrcT
Bcenmc03_5440  -1      11       3.598970   type III secretion exporter
Bcenmc03_5441  -1      11       3.420249   asparagine synthase
Bcenmc03_5442  -2      12       3.421459   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5434  FLGMRINGFLIF  84     3e-20    Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 83.9 bits (207), Expect = 3e-20
Identities = 46/184 (25%), Positives = 77/184 (41%), Gaps = 10/184 (5%)

Query: 20 ALLAGLLVLLAGCQKELYSGLSERDANQMVAVLGDAGISASKDNDARDTSDRNAWLVSVA 79
++A +L + L+S LS++D +VA L I N + + V
Sbjct: 37 IVVAMVLWAKTPDYRTLFSNLSDQDGGAIVAQLTQMNIPYRFANGSGA--------IEVP 88

Query: 80 DGDMQAALTVLQANGLPKPSYASLGELFQKQGLVSTPAEERVRYLYGVSQDLSRTLQDIE 139
+ L GLPK EL ++ + E+V Y + +L+RT++ +
Sbjct: 89 ADKVHELRLRLAQQGLPKGGAVGF-ELLDQEKFGISQFSEQVNYQRALEGELARTIETLG 147

Query: 140 GVVVARVQVVIPENDPLADKIKPSSAAVYIRYRPGVDL-RAMAPMVKDLVAHSIEGLQYD 198
V ARV + +P+ + K SA+V + PG L V LV+ ++ GL
Sbjct: 148 PVKSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRALDEGQISAVVHLVSSAVAGLPPG 207

Query: 199 NVSL 202
NV+L
Sbjct: 208 NVTL 211


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5439  TYPE3IMRPROT  134    3e-40    Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature.

Length = 261

Score = 134 bits (339), Expect = 3e-40
Identities = 61/248 (24%), Positives = 114/248 (45%), Gaps = 3/248 (1%)

Query: 15 LRPLLYVMPRLLPIMFVVPVFNEQIITGLVRNGIAVVIAAFVAPTIDAAQVAALPFLMWC 74
L + + R+L ++ P+ +E+ + V+ G+A++I +AP++ A V F
Sbjct: 13 LNLYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPANDVPVFSFF-AL 71

Query: 75 LLVAKEAMVGMLLAGAFSAVLFAIQGVGYLIDFQTGSGSAAFFDPMGGHEGGPTSGFLNF 134
L ++ ++G+ L A++ G +I Q G A F DP + ++
Sbjct: 72 WLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHLNMPVLARIMDM 131

Query: 135 VAIALFVTAGGLQVLVQLFAQSYAWWPIGSLGPDFSSMLQTFIVRQTDTIFEWMVKLAAP 194
+A+ LF+T G L+ L ++ PIG +S + + IF + LA P
Sbjct: 132 LALLLFLTFNGHLWLISLLVDTFHTLPIGG--EPLNSNAFLALTKAGSLIFLNGLMLALP 189

Query: 195 VTIVLVLVELGIGLVGRAVPQLNIFVFSQPLKSALAVLMMILFLPVVYASLHSLLSPDSG 254
+ +L+ + L +GL+ R PQL+IFV PL + + +M +P++ L S
Sbjct: 190 LITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFCEHLFSEIFN 249

Query: 255 LMALLRAL 262
L+A + +
Sbjct: 250 LLADIISE 257


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5440  TYPE3IMSPROT  244    6e-81    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 244 bits (625), Expect = 6e-81
Identities = 93/339 (27%), Positives = 174/339 (51%), Gaps = 3/339 (0%)

Query: 2 AEKDQKPTAKRLREAREKGDVPKSAETVSSAFFVGVCVALAVGIGALFARVQALFRLVFD 61
EK ++PT K++R+AR+KG V KS E VS+A V + L F L + +
Sbjct: 3 GEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIPAE 62

Query: 62 AVGAADPSARLAALIDGAARDWATLSAQIVAAGLQAGLLAGFVQVGGVMAWSRLVPQLSR 121
S L+ ++D ++ L ++ + + VQ G +++ + P + +
Sbjct: 63 QSYL-PFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDIKK 121

Query: 122 LNPAEGMKNLWSLRNLVNLAKMLMKTALLVATLGWLIVESLDPSVQSGFTRPASILALIV 181
+NP EG K ++S+++LV K ++K LL + +I +L +Q I L+
Sbjct: 122 INPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPLLG 181

Query: 182 KLLMLLFGWAALIYIVMALIDIVHQRHEFNQKMKMSIDEVRREHKEDEGDPHIQAKRRQL 241
++L L + ++V+++ D + +++ +++KMS DE++RE+KE EG P I++KRRQ
Sbjct: 182 QILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRRQF 241

Query: 242 AREAQFASLPDRIGYASVVVYSP-RVAVALYYG-GMGSLPWVLARGEGDAAERIVRLARD 299
+E Q ++ + + +SVVV +P +A+ + Y G LP V + + + ++A +
Sbjct: 242 HQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIAEE 301

Query: 300 ALRPTLANVGLAQALYETTPENGTIQPQHFRAVAQLLKW 338
P L + LA+ALY + I + A A++L+W
Sbjct: 302 EGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRW 340


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5442  PF03544       37     0.001    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 36.9 bits (85), Expect = 0.001
Identities = 13/67 (19%), Positives = 20/67 (29%)

Query: 23 PVVAPPPPPPPPPKKDDPAAGPANPTAAPPIPVTASLATDPSKPTNAEIQSATSLIQSMA 82
PVV P P P PK P+ + + + P +AT+
Sbjct: 91 PVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPV 150

Query: 83 AQYTAPP 89
+ P
Sbjct: 151 TSVASGP 157


34  Bcenmc03_5454  Bcenmc03_5467  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
Bcenmc03_5454  0       15       3.079017   hypothetical protein
Bcenmc03_5455  3       14       1.219039   hypothetical protein
Bcenmc03_5456  2       13       1.883357   sigma-54 dependent transcriptional regulator
Bcenmc03_5457  2       13       1.410754   hypothetical protein
Bcenmc03_5458  2       13       0.674261   hypothetical protein
Bcenmc03_5459  3       12       0.603959   two component LuxR family transcriptional
Bcenmc03_5460  4       12       0.672790   methyl-accepting chemotaxis sensory transducer
Bcenmc03_5461  4       12       0.551799   2-dehydropantoate 2-reductase
Bcenmc03_5462  3       12       0.407418   AraC family transcriptional regulator
Bcenmc03_5463  4       11       0.648125   periplasmic binding protein/LacI transcriptional
Bcenmc03_5464  4       11       0.739054   ABC transporter-like protein
Bcenmc03_5465  4       12       1.484094   monosaccharide-transporting ATPase
Bcenmc03_5466  3       12       1.021036   alcohol dehydrogenase
Bcenmc03_5467  2       13       0.715235   xylulokinase
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5454  IGASERPTASE   46     3e-07    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 46.2 bits (109), Expect = 3e-07
Identities = 47/263 (17%), Positives = 73/263 (27%), Gaps = 28/263 (10%)

Query: 264 ASMRAAPVSVPVPVPALAPVAAAPAVATPAVAAAA--------PAVAVPTVAAAVPAAAA 315
R V A P+V + A PA A P+ A +
Sbjct: 985 VEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENS 1044

Query: 316 PAAAAAPAVAAAPAASVVPAAAMAAVPAAA-VIAAPAVADKAAPAPAAPVADTKAAEPVQ 374
+ A A A + V A + A T + +
Sbjct: 1045 KQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETK--E 1102

Query: 375 PVADKAPEPAPAVADKTPEPAPAVADKAP-----EPAQPVADKAPEPMPA-------ATD 422
+ E A +KT E + +P E QP A+ A E P +
Sbjct: 1103 TATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQT 1162

Query: 423 TTQAAGEPVAEPMPAAAVVAAPAADAKAAEPAPQATAEAPAPAAPQPAVAAAPADMPAAD 482
T A E A+ + + + E PA QP V + ++ P
Sbjct: 1163 NTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNR 1222

Query: 483 AK-----APDAVESAGTAAAQAA 500
+ P VE A T++ +
Sbjct: 1223 HRRSVRSVPHNVEPATTSSNDRS 1245



Score = 43.1 bits (101), Expect = 3e-06
Identities = 42/251 (16%), Positives = 67/251 (26%), Gaps = 16/251 (6%)

Query: 305 TVAAAVPAAAAPAAAAAPAVAAAPAASVVPAAAMAAVPAAAVIAAPAVADKAAP---APA 361
TV A P+V + A PA A + +
Sbjct: 991 TVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKT 1050

Query: 362 APVADTKAAEPVQPVADKAPEPAPAVADKTPEPAPAVADKAPEPAQPVADKAPEPMPAA- 420
+ A E + A E V T A + + Q K +
Sbjct: 1051 VEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEE 1110

Query: 421 -----TDTTQAAGEPVAE--PMPAAAVVAAPAADAKAAEPAPQATAEAPAPAAPQPAVAA 473
T+ TQ + ++ P + P A+ A E P + P A
Sbjct: 1111 KAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEP-ARENDPTVNIKEPQSQTNTTADTE 1169

Query: 474 APADMPAADAKAPDAVESAGTAAAQAAGMPALTDPAQALPPATVDQQAAP----AAPVAP 529
PA +++ + P + P T PA P + P V
Sbjct: 1170 QPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRS 1229

Query: 530 APTVISTSTSS 540
P + +T+S
Sbjct: 1230 VPHNVEPATTS 1240



Score = 32.3 bits (73), Expect = 0.006
Identities = 24/186 (12%), Positives = 48/186 (25%), Gaps = 20/186 (10%)

Query: 313 AAAPAAAAAPAVAAAPAASVVPAAAMAAVPAAAVIAAPAVADKAAPAPAAPVADTKAAEP 372
P + + + +V P A A V + A A ++
Sbjct: 1120 QEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNV 1179

Query: 373 VQPVADKAPEPAPAVADKTPEPAPAVADKAPEPAQPVADKAPEPMPAATDTTQAAGEPVA 432
QPV + + + P QP T ++++ +P
Sbjct: 1180 EQPVTESTTV------NTGNSVVENPENTTPATTQP------------TVNSESSNKP-- 1219

Query: 433 EPMPAAAVVAAPAADAKAAEPAPQATAEAPAPAAPQPAVAAAPADMPAADAKAPDAVESA 492
+ +V + P A + + A A A A + ++
Sbjct: 1220 KNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTNAVLSDARAKAQFVALNVGKAV 1279

Query: 493 GTAAAQ 498
+Q
Sbjct: 1280 SQHISQ 1285


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5456  HTHFIS        360    e-122    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 360 bits (926), Expect = e-122
Identities = 145/466 (31%), Positives = 211/466 (45%), Gaps = 48/466 (10%)

Query: 47 AALVDVLASRGWDVWRAKTVADALNLVKANRPHAGIVDFDSFASPDVASFEAL----LRD 102
L L+ G+DV A + A + D PD +F+ L
Sbjct: 17 TVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDV---VMPDENAFDLLPRIKKAR 73

Query: 103 PRVGWVALADGERLRNITIARLIRHCCFDYVRNAAAYTTIGYLVGHAYGMLKLADGDPAA 162
P + + ++ A +DY+ T + ++G A K
Sbjct: 74 PDLPVLVMSAQNTFMTAIKA--SEKGAYDYLPKPFDLTELIGIIGRALAEPK-RRPSKLE 130

Query: 163 EAPPPGGAMIGACGAMRRLFATIRKVANTEATVFIAGESGTGKELTAAAIHRQSSRADAP 222
+ G ++G AM+ ++ + ++ T+ T+ I GESGTGKEL A A+H R + P
Sbjct: 131 DDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGP 190

Query: 223 FVAVNCAAIPTTLLQAELFGHERGAFTGAHQRKIGRIEAAHGGTLFLDEIGDMPFESQAS 282
FVA+N AAIP L+++ELFGHE+GAFTGA R GR E A GGTLFLDEIGDMP ++Q
Sbjct: 191 FVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTR 250

Query: 283 LLRFLQEGKIERLGGHASIPVDVRIVSATHVDLEAAMQAGRFRADLYYRLCVLRIDEPPL 342
LLR LQ+G+ +GG I DVRIV+AT+ DL+ ++ G FR DLYYRL V+ + PPL
Sbjct: 251 LLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPPL 310

Query: 343 RMRGRDIMLLADDVLRRYRDDGSYRIRGFTPCAIEAIHNYPWPGNVRELINRIRFAVVMT 402
R R DI L +++ +G ++ F A+E + +PWPGNVREL N +R +
Sbjct: 311 RDRAEDIPDLVRHFVQQAEKEG-LDVKRFDQEALELMKAHPWPGNVRELENLVRRLTALY 369

Query: 403 NGPLISAADLELR-------------------------------------PYTSLRPPTL 425
+I+ +E
Sbjct: 370 PQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLY 429

Query: 426 AQARRQAERHAIEETLLRHRHQHADVAAELGISRATLYRLMIAHGL 471
+ + E I L R A LG++R TL + + G+
Sbjct: 430 DRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGV 475


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5459  HTHFIS        29     0.018    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 29.0 bits (65), Expect = 0.018
Identities = 13/61 (21%), Positives = 28/61 (45%), Gaps = 3/61 (4%)

Query: 34 VAVYRSAAELVASLGGVDCDIVLVDYAIRGDEQMDGLALFDWLRRTRPNVGIVVLVANEN 93
V + +AA L + D D+V+ D + + L +++ RP++ ++V+ A
Sbjct: 30 VRITSNAATLWRWIAAGDGDLVVTD--VV-MPDENAFDLLPRIKKARPDLPVLVMSAQNT 86

Query: 94 P 94

Sbjct: 87 F 87


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5461  NUCEPIMERASE  30     0.015    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 29.8 bits (67), Expect = 0.015
Identities = 18/64 (28%), Positives = 26/64 (40%), Gaps = 13/64 (20%)

Query: 1 MRILVVG-AGAVGGYFGGRLAAAGRDVTFL----------VRDGRAAALARDGLLIRSPR 49
M+ LV G AG +G + RL AG V + ++ R LA+ G +
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFH--K 58

Query: 50 GDLT 53
DL
Sbjct: 59 IDLA 62


35  Bcenmc03_5488  Bcenmc03_5497  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
Bcenmc03_5488  2       14       2.917364   endoribonuclease L-PSP
Bcenmc03_5489  2       15       2.724164   major facilitator transporter
Bcenmc03_5490  1       14       2.357532   LysR family transcriptional regulator
Bcenmc03_5491  0       11       2.332485   hypothetical protein
Bcenmc03_5492  1       10       3.337963   LysR family transcriptional regulator
Bcenmc03_5493  -1      11       2.919524   NAD-dependent epimerase/dehydratase
Bcenmc03_5494  0       14       3.488087   hypothetical protein
Bcenmc03_5495  -1      13       3.609302   hypothetical protein
Bcenmc03_5496  -2      12       3.716671   major facilitator transporter
Bcenmc03_5497  0       12       3.353058   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5489  TCRTETB       106    6e-27    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 106 bits (266), Expect = 6e-27
Identities = 74/416 (17%), Positives = 158/416 (37%), Gaps = 23/416 (5%)

Query: 24 HSWALVVLLVGAILPPLDYFIVNLALPAIRDGIGAHQAELQLVVSAYACANAVVQITGGR 83
H+ L+ L + + L+ ++N++LP I + A V +A+ ++ G+
Sbjct: 12 HNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGK 71

Query: 84 LGDLYGRKRMFMIGMAGFVLASTLCGLADNG-TVLVGGRVLQGLFAAILAPQVLATIRSV 142
L D G KR+ + G+ S + + + ++L+ R +QG AA V+ +
Sbjct: 72 LSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARY 131

Query: 143 FSPQEQVRVMGFYGFAFGLAAVIGQLGGGALISLHPFGLGWRAIFLVNLPIGILALIGSW 202
+ + + G G + +G GG + H +L+ +P+ + +
Sbjct: 132 IPKENRGKAFGLIGSIVAMGEGVGPAIGGMI--AHYIHWS----YLLLIPMITIITVPFL 185

Query: 203 RFIPENRAPRGQRIDVPGTVLMSLFLLMLVYPLTHGREAGWPLWMIACGVGALPMLGALL 262
+ + D+ G +LMS+ ++ + T + + +++
Sbjct: 186 MKLLKKEVRIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLS------------F 233

Query: 263 AVEARRLARGHDPLLDVRLLRNPVVALGLLLAFL-FYTLSAFFLSYGIYLQGCLNWSPLA 321
+ + + + DP +D L +N +G+L + F T++ F ++ S
Sbjct: 234 LIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAE 293

Query: 322 SGFAIL-PLGLGFLASPLLTTRLVARFGGYRVLTLGFAMLAAGVAIAAALARDGAPGPGF 380
G I+ P + + + LV R G VL +G L+ A+ L +
Sbjct: 294 IGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLL-ETTSWFMT 352

Query: 381 YAGIAAIGIGQGLVLPSVVRIVLAEVDAARAGVASGMVSAMLQIGAAVGAATIGGV 436
+ +G G + IV + + AG +++ + G A +GG+
Sbjct: 353 IIIVFVLG-GLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGL 407


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5493  NUCEPIMERASE  44     6e-07    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 43.6 bits (103), Expect = 6e-07
Identities = 42/203 (20%), Positives = 75/203 (36%), Gaps = 37/203 (18%)

Query: 13 LVLGASGGIGGEVARQLRDAGWQVRA-----------LKRGLDAEVVERDGIAWVRGDAL 61
LV GA+G IG V+++L +AG QV LK+ E++ + G + + D
Sbjct: 4 LVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQA-RLELLAQPGFQFHKIDLA 62

Query: 62 DRDAVVRAAR--GCSVIVHAVNPPGYR----NWATQVLPMID---NTIAAARAAQ-ATVV 111
DR+ + + + + R N + N + R + ++
Sbjct: 63 DREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQHLL 122

Query: 112 LPGTVYNFGADA-FPVLREDAPQHPATRKGAIRVELERRLQDASA-HGVPAIVVRAGDFF 169
+ +G + P +D+ HP + A + E S +G+PA +R +
Sbjct: 123 YASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFFTVY 182

Query: 170 GPQLGNSWFSQGLVKAGRPVAAI 192
GP GRP A+
Sbjct: 183 GP-------------WGRPDMAL 192


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5496  TCRTETA       32     0.003    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 32.5 bits (74), Expect = 0.003
Identities = 26/82 (31%), Positives = 32/82 (39%), Gaps = 6/82 (7%)

Query: 247 LLIAGGSRIGSDVLYALIVVFTLTYVTTVLHLSRPVALTAVMIGTACNALAVPFFGALSD 306
L A G + VL L+ + H +AL A+M P GALSD
Sbjct: 15 ALDAVGIGLIMPVLPGLLRDLVHS-NDVTAHYGILLALYALM-----QFACAPVLGALSD 68

Query: 307 RFGRRPVYLAGAIAGIVWAFVF 328
RFGRRPV L V +
Sbjct: 69 RFGRRPVLLVSLAGAAVDYAIM 90


36  Bcenmc03_5541  Bcenmc03_5579  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
Bcenmc03_5541  2       10       1.278195   glutathione S-transferase domain-containing
Bcenmc03_5542  4       19       -0.558337  major facilitator transporter
Bcenmc03_5543  5       26       -1.491233  isochorismatase hydrolase
Bcenmc03_5544  6       28       -2.485565  hypothetical protein
Bcenmc03_5545  4       27       -3.613405  HxlR family transcriptional regulator
Bcenmc03_5546  4       26       -4.131758  hypothetical protein
Bcenmc03_5547  3       23       -5.940251  porin
Bcenmc03_5548  0       22       -5.704276  GreA/GreB family elongation factor
Bcenmc03_5549  1       25       -6.240333  hypothetical protein
Bcenmc03_5550  3       26       -6.679378  signal-transduction protein
Bcenmc03_5551  4       29       -6.566823  GTP cyclohydrolase II
Bcenmc03_5552  6       39       -9.026059  integrase family protein
Bcenmc03_5553  6       41       -9.761821  hypothetical protein
Bcenmc03_5554  4       34       -7.609526  hypothetical protein
Bcenmc03_5555  4       33       -7.011823  integrase catalytic subunit
Bcenmc03_5556  4       36       -7.345525  transposase IS3/IS911 family protein
Bcenmc03_5557  4       36       -7.543886  hypothetical protein
Bcenmc03_5558  3       27       -4.538648  hypothetical protein
Bcenmc03_5560  5       29       -6.804398  transposase IS66
Bcenmc03_5561  6       36       -8.017128  IS66 Orf2 family protein
Bcenmc03_5562  7       39       -8.633485  transposase IS3/IS911 family protein
Bcenmc03_5563  7       39       -9.416664  transposase IS3/IS911 family protein
Bcenmc03_5564  4       37       -8.504048  integrase catalytic subunit
Bcenmc03_5565  4       38       -9.264228  hypothetical protein
Bcenmc03_5566  3       38       -6.808457  integrase catalytic subunit
Bcenmc03_5567  1       31       -4.716945  histone family protein nucleoid-structuring
Bcenmc03_5569  -1      26       -3.360772  hypothetical protein
Bcenmc03_5570  2       19       0.664318   GCN5-related N-acetyltransferase
Bcenmc03_5571  0       14       0.194756   glyoxalase/bleomycin resistance
Bcenmc03_5572  0       14       0.373737   hypothetical protein
Bcenmc03_5573  -3      10       1.657080   hypothetical protein
Bcenmc03_5574  -2      12       2.006884   autoinducer synthesis protein
Bcenmc03_5575  0       14       2.439617   hypothetical protein
Bcenmc03_5576  0       13       2.480928   LuxR family transcriptional regulator
Bcenmc03_5577  1       13       2.880911   MgtC/SapB transporter
Bcenmc03_5578  1       10       3.173934   outer membrane efflux protein
Bcenmc03_5579  1       11       3.027964   fusaric acid resistance protein region
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5542  TCRTETA       119    8e-32    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 119 bits (299), Expect = 8e-32
Identities = 93/360 (25%), Positives = 154/360 (42%), Gaps = 23/360 (6%)

Query: 42 LLAIALDAMGFGLVYPMMSAIFSDPHAGILPADAGAHARNFYLGLGYGVYPLCMFFGSSL 101
L +ALDA+G GL+ P++ + D D AH G+ +Y L F + +
Sbjct: 11 LSTVALDAVGIGLIMPVLPGLLRDLV---HSNDVTAH-----YGILLALYALMQFACAPV 62

Query: 102 MGELSDRYGRRRVLLLCVLGLAAGYAMMAAGAWHASVALLLAGRGLTGLMAGCQGIAQAA 161
+G LSDR+GRR VLL+ + G A YA+MA + +L GR + G+ +A A
Sbjct: 63 LGALSDRFGRRPVLLVSLAGAAVDYAIMATA---PFLWVLYIGRIVAGITGATGAVAGAY 119

Query: 162 ITDLSTPDTKAYNMSIMSLAFSAGVIVGPVLGGVTSDRTISPLFDYGTPFMLVAALSLIC 221
I D++ D +A + MS F G++ GPVLGG+ SP PF AAL+ +
Sbjct: 120 IADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGG--FSP----HAPFFAAAALNGLN 173

Query: 222 ACWTWVSYRDSAAPRGDT-RIDPLLPLRIIVEAARQRDVAFLSVVFFLMQVGYGLYLQTI 280
+S R + L PL A VA L VFF+MQ+ +
Sbjct: 174 FLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALW 233

Query: 281 MLLLQAKFGYTSARLGLFSGVIGLCFVFGLLCVVRLMLRVWRVIDIAKTGLLVAGLGQIL 340
++ + +F + + +G+ G+ + + G++ G G IL
Sbjct: 234 VIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYIL 293

Query: 341 SALFPHEPVLWALAMVVGCFDMV--AYTTMYTAFSDAVSDDRQGWALGVAGSVMAVAWVV 398
A + + + +++ + A M S V ++RQG G ++ ++ +V
Sbjct: 294 LAFATRGWMAFPIMVLLASGGIGMPALQAM---LSRQVDEERQGQLQGSLAALTSLTSIV 350


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5543  ISCHRISMTASE  60     5e-13    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 59.6 bits (144), Expect = 5e-13
Identities = 46/208 (22%), Positives = 71/208 (34%), Gaps = 25/208 (12%)

Query: 4 PTIRTLAGASAPTSIAAARTALLVIDFQNEYFSGRLP--IPDGPGALGNARRVIAFADRA 61
PT + R LL+ D Q YF N R++ +
Sbjct: 12 PTASDMPQNKVSWVPDPNRAVLLIHDMQ-NYFVDAFTAGASPVTELSANIRKLKNQCVQL 70

Query: 62 GIPVFHVQHVGT---ADSPIFAD----GSDGFRFH----SDLHPAPQHTVVQKTSVSVFP 110
GIPV + G+ D + D G + + ++L P V+ K S F
Sbjct: 71 GIPVVYTAQPGSQNPDDRALLTDFWGPGLNSGPYEEKIITELAPEDDDLVLTKWRYSAFK 130

Query: 111 TTDLDARLKAAGIDTLIVTGLMTHACVAGAARDAVPLGYAVIVVDDACATRDLDVADGGT 170
T+L ++ G D LI+TG+ H A +A V DA VAD
Sbjct: 131 RTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDA-------VADFS- 182

Query: 171 VPHRDLHRATLAALSDTFGDVLTTEQVL 198
+ H+ L + + T+ +L
Sbjct: 183 ---LEKHQMALEYAAGRCAFTVMTDSLL 207


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5547  ECOLNEIPORIN  208    4e-67    E.coli/Neisseria porin superfamily signature.
		>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature.

Length = 331

Score = 208 bits (532), Expect = 4e-67
Identities = 97/373 (26%), Positives = 147/373 (39%), Gaps = 53/373 (14%)

Query: 1 MNKTLIVAAAAASFATVAHAQSSVTLYGVLDAGITYQSNVGGKSLWS----MGSGIDQ-- 54
M K+LI AA A + VTLYG + AG+ +V + G+GI
Sbjct: 1 MKKSLIALTLAA---LPVAAMADVTLYGTIKAGVETSRSVAHNGAQAASVETGTGIVDLG 57

Query: 55 SRFGLRGSEDLGGGLKAIFTLESGFNIGNGRFANGNGGMFNRQAFVGLSSQYGTVTLGKQ 114
S+ G +G EDLG GLKAI+ +E +I A + G NRQ+F+GL +G + +G+
Sbjct: 58 SKIGFKGQEDLGNGLKAIWQVEQKASI-----AGTDSGWGNRQSFIGLKGGFGKLRVGRL 112

Query: 115 YDATQDY--LAPLTATGSW-GGTYFAHPLNNDRLSTNGDVALNNSIKYTSANYAGLQFGG 171
+D + P + + G A P RL S++Y S +AGL
Sbjct: 113 NSVLKDTGDINPWDSKSDYLGVNKIAEP--EARLI---------SVRYDSPEFAGLSGSV 161

Query: 172 TYSFSNNTNFGNNRAYSGGLSYQFQGLKLGAAYSQANLGDGTNTNGASTLGGQGRVRTYG 231
Y+ ++N N+ +Y G +Y+ G + + V Y
Sbjct: 162 QYALNDNAGRHNSESYHAGFNYKNGGFFVQYGGAYKRHHQVQENVNIEKYQIHRLVSGYD 221

Query: 232 AAAGYAFGPAQVGAA--WTQSRIDNQAAGVPTLRADNYEVNAKYNLTPALGLGAAYTYTN 289
A YA Q A ++ N V A + N ++ A G ++ T
Sbjct: 222 NDALYASVAVQQQDAKLVEENYSHNSQTEVAATLAYRFG-NVTPRVSYAHGFKGSFDAT- 279

Query: 290 AKVNNGSSHWNQFGVQADYALSKRTDVYAQAVYQRGAKGNNIVGTGIYNGDNTTASSSSV 349
N ++ ++Q V A+Y SKRT A + + KG S
Sbjct: 280 ----NYNNDYDQVVVGAEYDFSKRTSALVSAGWLQEGKG-----------------ESKF 318

Query: 350 NQTAATVGLRHRF 362
TA VGLRH+F
Sbjct: 319 VSTAGGVGLRHKF 331


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bcenmc03_5574  AUTOINDCRSYN  131  1e-40  Autoinducer synthesis protein signature.
		>AUTOINDCRSYN#Autoinducer synthesis protein signature.

Length = 216

Score = 131 bits (332), Expect = 1e-40
Identities = 30/155 (19%), Positives = 60/155 (38%), Gaps = 10/155 (6%)

Query: 11 LPHELAADLGRYRRRVFVEQLGWALPSANESFERDQFDRDDTVYVFARNADGDMCGCARL 70
L + +L R+ F ++L WA+ + E DQ+D ++T Y+F D + R
Sbjct: 12 LSETKSGELFTLRKETFKDRLNWAVQCTD-GMEFDQYDNNNTTYLFGIK-DNTVICSLRF 69

Query: 71 LPTTRPYLLKSLFADLVAEDMPLPQSAAVWELSRFAATDDEGGPGNAEWAVRP----MLA 126
+ T P ++ F ++ +P+ E SRF D+ + P +
Sbjct: 70 IETKYPNMITGTFFPYFK-EINIPEG-NYLESSRFFV--DKSRAKDILGNEYPISSMLFL 125

Query: 127 AVVECAAQLGARQLIGVTFASMERLFRRIGIHAHR 161
+++ + G + + M + +R G
Sbjct: 126 SMINYSKDKGYDGIYTIVSHPMLTILKRSGWGIRV 160


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bcenmc03_5578  CHANLCOLICIN  32  0.009  Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 31.6 bits (71), Expect = 0.009
Identities = 27/101 (26%), Positives = 43/101 (42%), Gaps = 1/101 (0%)

Query: 402 SAVLMQARNDAESASARLTRTKEEAVRQVVAAQNAVQTSLASHDAAKALVDAAQTSYDAA 461
+ +A +AE + R K E RQ+ A+ + A + AKA V+ AQ AA
Sbjct: 143 AEAAEKAFQEAEQRRKEIEREKAETERQLKLAEAEEKRLAALSEEAKA-VEIAQKKLSAA 201

Query: 462 LTAYRNGVGSVTDATIAQSQLLAARNAEVDSYAGALSAAAA 502
+ G + S + AR+AE+ + AG + A
Sbjct: 202 QSEVVKMDGEIKTLNSRLSSSIHARDAEMKTLAGKRNELAQ 242


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bcenmc03_5579  adhesinmafb  30  0.027  Neisseria meningitidis: adhesin MafB signature.
		>adhesinmafb#Neisseria meningitidis: adhesin MafB signature.

Length = 467

Score = 30.4 bits (68), Expect = 0.027
Identities = 27/108 (25%), Positives = 34/108 (31%), Gaps = 13/108 (12%)

Query: 281 AILERGGYPVDVTLALPPADALPPLARIAATDLQDAITHFAEPGATA--------PTVDA 332
IL Y +D A+ LP + A ++ F + A P
Sbjct: 250 DILYGTRYAIDKA-AMRNIAPLPAEGKFAVIGGLGSVAGFEKNTREAVDRWIQENPNAAE 308

Query: 333 TAEASANATPAAAAATPEAPAAAPAPAPHGGFFLPDARTN---PDHIR 377
T EA N AA A A AA P A G F + D R
Sbjct: 309 TVEAVFNVAAAAKVAK-LAKAAKPGKAAVSGDFADSYKKKLALSDSAR 355


37  Bcenmc03_5671  Bcenmc03_5677  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
Bcenmc03_5671  -1      14       -4.014056  urea ABC transporter ATP-binding protein UrtD
Bcenmc03_5672  0       13       -4.050268  urea ABC transporter ATP-binding protein UrtE
Bcenmc03_5673  2       14       -4.220834  formamidase
Bcenmc03_5674  3       15       -3.972062  FmdB family regulatory protein
Bcenmc03_5675  3       13       -3.553178  acylamide amidohydrolase
Bcenmc03_5676  1       14       -2.450061  porin
Bcenmc03_5677  2       17       -0.884801  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bcenmc03_5676  ECOLNEIPORIN  102  7e-27  E.coli/Neisseria porin superfamily signature.
		>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature.

Length = 331

Score = 102 bits (257), Expect = 7e-27
Identities = 72/343 (20%), Positives = 123/343 (35%), Gaps = 52/343 (15%)

Query: 15 PALLLAGTAHAQQSITLYGLIDEGLNFTSNAGGHRAWQMSSGDT-----FGSRWGLKGSE 69
AL +A A +TLYG I G+ + + + A S GS+ G KG E
Sbjct: 11 AALPVAAMA----DVTLYGTIKAGVETSRSVAHNGAQAASVETGTGIVDLGSKIGFKGQE 66

Query: 70 DLGGGDKAIFQLENGFNVNSGKLGQDSSMFGRQAFVGLSSSRYGTLTLGRQYDTSVDALG 129
DLG G KAI+Q+E ++ G DS RQ+F+GL +G L +GR D
Sbjct: 67 DLGNGLKAIWQVEQKASIA----GTDSGWGNRQSFIGLKGG-FGKLRVGRLNSVLKDTGD 121

Query: 130 FGGITAAGNWAGDIATHPFDNDNTDWDFRVNNAVKYVTPTYRGLTAEAMYGFSNQPGGFS 189
+ ++ G + +V+Y +P + GL+ Y ++ G
Sbjct: 122 INPWDSKSDYLGVNKIAEPEARLI--------SVRYDSPEFAGLSGSVQYALNDN-AGRH 172

Query: 190 NNRVWGATLNYQSGNLTAAASYLKLNNPGLAAGGTVNSGDLFNGSSQQDIGVAASYQFTH 249
N+ + A NY++G + + N Q + + Y
Sbjct: 173 NSESYHAGFNYKNGGFFVQYGGAYKRHH--------QVQENVNIEKYQIHRLVSGY---- 220

Query: 250 VLVGAAWSHVDVYNPDGNAWIDNTALQNGATWNAWKFDNFELNAQYYFTHALWLGASYTF 309
+ V +A ++ + N+ E+ A + + +
Sbjct: 221 ---DNDALYASVAVQQQDA----KLVEENYSHNS----QTEVAATLAYR---FGNVTPRV 266

Query: 310 TIAHLYTSDTKYV---PKWHQIGMMLDYDLSKRTSLYLQGAWQ 349
+ AH + + Q+ + +YD SKRTS + W
Sbjct: 267 SYAHGFKGSFDATNYNNDYDQVVVGAEYDFSKRTSALVSAGWL 309


38  Bcenmc03_5780  Bcenmc03_5792  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
Bcenmc03_5780  -1      11       3.071406   hypothetical protein
Bcenmc03_5781  -1      9        4.033688   alcohol dehydrogenase
Bcenmc03_5782  0       8        4.722032   GCN5-related N-acetyltransferase
Bcenmc03_5784  -1      10       4.257298   hypothetical protein
Bcenmc03_5785  -1      10       4.158939   GCN5-related N-acetyltransferase
Bcenmc03_5786  -1      10       4.369016   transcriptional regulator
Bcenmc03_5787  -1      11       4.066431   peptidase S1 and S6 chymotrypsin/Hap
Bcenmc03_5788  -3      12       4.223244   Bcr/CflA subfamily drug resistance transporter
Bcenmc03_5789  -2      13       3.906886   hypothetical protein
Bcenmc03_5790  -1      15       4.763956   fumarylacetoacetate (FAA) hydrolase
Bcenmc03_5791  0       14       4.215520   AraC family transcriptional regulator
Bcenmc03_5792  -2      16       3.278164   uracil-DNA glycosylase superfamily protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bcenmc03_5785  SACTRNSFRASE  40  4e-06  Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 40.3 bits (94), Expect = 4e-06
Identities = 20/63 (31%), Positives = 29/63 (46%), Gaps = 2/63 (3%)

Query: 322 RSCWTEGPYCYLQDLYTAPDARGQGAGGALIEAVYERAREAGASRVYWLTHETNTTARAL 381
RS W Y ++D+ A D R +G G AL+ E A+E + T + N +A
Sbjct: 83 RSNWNG--YALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHF 140

Query: 382 YDK 384
Y K
Sbjct: 141 YAK 143


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bcenmc03_5787  V8PROTEASE  61  3e-12  V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 60.8 bits (147), Expect = 3e-12
Identities = 31/157 (19%), Positives = 55/157 (35%), Gaps = 26/157 (16%)

Query: 124 SGSGSGFIVSADGLILTSAHVVDEATDVTVRLTDRR-----------EFKAT-VLAVDPQ 171
+ SG +V +LT+ HVVD L F A + +
Sbjct: 101 TFIASGVVV-GKDTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGE 159

Query: 172 SDVAVLRVDATK--------LPFVRIGDSSKVRAGEPVMTIGAPDGSGNTVTAGIVSATS 223
D+A+++ + + + ++++ + + + G P G T
Sbjct: 160 GDLAIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQNITVTGYP-GDKPVATMWESKGKI 218

Query: 224 RRLPDGSAFPFFETDIAPNPDNSGGPVFNRAGDVIGI 260
L + D++ NSG PVFN +VIGI
Sbjct: 219 TYLKG----EAMQYDLSTTGGNSGSPVFNEKNEVIGI 251


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bcenmc03_5788  TCRTETA  55  2e-10  Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 54.8 bits (132), Expect = 2e-10
Identities = 74/318 (23%), Positives = 117/318 (36%), Gaps = 21/318 (6%)

Query: 18 TLIVLCALSVLPLSLFLPSLPAIVRDLHTDYALVA---LSLGGYAAVAASLECVTGPLSD 74
++ AL + + L +P LP ++RDL + A + L YA + + V G LSD
Sbjct: 9 VILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSD 68

Query: 75 RFGRRPIVLTSVALFALGSLGCAMATDIRVFLGCRLMQAAITSVYPVSMAAIRDSGGGAR 134
RFGRRP++L S+A A+ A A + V R++ + V+ A I D G
Sbjct: 69 RFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADITDGDE 128

Query: 135 AASRIGYAAMAAAFAPMLGPTLGGALDETVGWRASFWLLAVIGTALLAWCVRDLAETHTH 194
A G+ + F + GP LGG + A F+ A + L E+H
Sbjct: 129 RARHFGFMSACFGFGMVAGPVLGGLMGG-FSPHAPFFAAAALNGLNFLTGCFLLPESHKG 187

Query: 195 RPSSFGQQLRAYPALLRARRFWAYALCMAFSTGAFYAFLAGAPLAATTLFGI-----PPA 249
++ A R R + + + P A +FG
Sbjct: 188 ERRPLRREALNPLASFRWARGMT-VVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDAT 246

Query: 250 EIGFYMGTITAGFVCGSF----LAARVARRHALATTILCGRIVACAGPLIGLALLFGGVT 305
IG ++ A + S + VA R ++ G I G LL
Sbjct: 247 TIGI---SLAAFGILHSLAQAMITGPVAARLGERRALMLGMIAD----GTGYILLAFATR 299

Query: 306 HALAWFGPCVLVGVGNGL 323
+A+ +L G G+
Sbjct: 300 GWMAFPIMVLLASGGIGM 317


39  Bcenmc03_5818  Bcenmc03_5847  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
Bcenmc03_5818  2       9        0.404586   LysR family transcriptional regulator
Bcenmc03_5819  1       9        0.957443   lysine exporter protein LysE/YggA
Bcenmc03_5820  0       9        1.646146   hypothetical protein
Bcenmc03_5821  0       8        2.580576   metallophosphoesterase
Bcenmc03_5822  0       14       3.486418   hypothetical protein
Bcenmc03_5823  1       13       4.022058   hypothetical protein
Bcenmc03_5824  0       12       4.157715   RNA polymerase sigma factor
Bcenmc03_5825  -1      11       4.232625   hypothetical protein
Bcenmc03_5826  -2      10       3.962846   integral membrane protein
Bcenmc03_5827  -1      10       3.669406   hypothetical protein
Bcenmc03_5828  -1      12       0.727406   hypothetical protein
Bcenmc03_5829  -1      14       -0.862875  hypothetical protein
Bcenmc03_5830  -2      13       -1.295606  integrase family protein
Bcenmc03_5831  0       18       -2.949315  hypothetical protein
Bcenmc03_5832  -1      20       -3.453897  major facilitator transporter
Bcenmc03_5833  2       27       -5.936793  putative lipoprotein
Bcenmc03_5834  1       19       -3.697254  hypothetical protein
Bcenmc03_5835  0       14       -2.879696  acyltransferase 3
Bcenmc03_5836  0       15       -3.101196  hypothetical protein
Bcenmc03_5837  -2      12       -0.278272  PilT domain-containing protein
Bcenmc03_5838  -1      12       -0.197845  cold-shock DNA-binding domain-containing
Bcenmc03_5839  -1      12       -0.308806  translation initiation factor IF-1
Bcenmc03_5840  0       12       -0.129029  phosphoesterase
Bcenmc03_5841  1       13       -0.589831  hypothetical protein
Bcenmc03_5842  2       13       -0.492542  major facilitator transporter
Bcenmc03_5843  4       20       -3.204785  hypothetical protein
Bcenmc03_5844  4       21       -1.038218  hypothetical protein
Bcenmc03_5845  3       19       -0.599780  hypothetical protein
Bcenmc03_5846  3       18       0.043539   hypothetical protein
Bcenmc03_5847  3       18       0.133900   metal-binding integral membrane protein-like
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bcenmc03_5830  PF05272  29  0.033  Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.3 bits (65), Expect = 0.033
Identities = 26/145 (17%), Positives = 44/145 (30%), Gaps = 9/145 (6%)

Query: 238 RDANGHERWW-LDVTGKGGRQRLVPATDEMMAE-LTRYRRTHGLPALPLDGEPTPLVLPF 295
D G+ R+W + V G+ L ++ AE L Y P D E
Sbjct: 700 FDITGNRRFWPVLVPGRANLVWLQKFRGQLFAEALHLYLAGERYFPSPEDEEIYFRP--- 756

Query: 296 GQARKPLTRAALHRIVKQVFRHAAGRLRANGETGEQAARVLEQ----ASAHWLRHSAGSH 351
Q + + R+ + R A + G A S
Sbjct: 757 EQELRLVETGVQGRLWALLTREGAPAAEGAAQKGYSVNTTFVTIADLVQALGADPGKSSP 816

Query: 352 MADGRVDLRLVRDNLGHVSLTTTSQ 376
M +G+V L + ++ T+ +
Sbjct: 817 MLEGQVRDWLNENGWEYLRETSGQR 841


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bcenmc03_5832  TCRTETB  142  8e-40  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 142 bits (360), Expect = 8e-40
Identities = 92/412 (22%), Positives = 173/412 (41%), Gaps = 18/412 (4%)

Query: 7 HSVLLWIVAAAFFMQSLDTTIVNTALPSIAQSLHASPLAMQPVVVVYTLTMAMLTPASGW 66
+ +L+W+ +FF L+ ++N +LP IA + P + V + LT ++ T G
Sbjct: 13 NQILIWLCILSFF-SVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGK 71

Query: 67 LADRFGTRRVFSVAILVFVLASIGCAASHTLGQ-LVVARAVQGIGGSMLLPIGRLAVLRR 125
L+D+ G +R+ I++ S+ H+ L++AR +QG G + + + V R
Sbjct: 72 LSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARY 131

Query: 126 VPGEQYVAAIAFVSIAGQLGPIVGPTLGGWLTQAISWHWVFIVNVPVGVVGFIAVQRYLP 185
+P E A + +G VGP +GG + I HW +++ +P+ + + L
Sbjct: 132 IPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYI--HWSYLLLIPMITIITVPFLMKLL 189

Query: 186 HDQATQPPPFDFVGCALLSAAMIALSLAIDPPMSTHRAAWSAALAGLGLASALAYLPHAR 245
+ FD G L+S ++ L + + L ++ H R
Sbjct: 190 KKEVRIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSF--------LIFVKHIR 241

Query: 246 RRTQPLFRLGLFREPNFGSGLLGNLLCRIGTSSVPFMLPLLMQVQLGYTPLRSG-LMMLP 304
+ T P GL + F G+L + + M+P +M+ + G +++ P
Sbjct: 242 KVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFP 301

Query: 305 AAIAGVIAKRWIAPLVKRFG--YAAFLVVNTGIVGCAIAGFALVSARPAPVLEGVLLIVF 362
++ +I LV R G Y + V V A F L + + +++ V
Sbjct: 302 GTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTI--IIVFVL 359

Query: 363 GAANSMQFAAMNGVTLKGLSHADAGSGNSLFTMMQMLAMGLGVSIGGGLVNL 414
G + + + + L +AG+G SL L+ G G++I GGL+++
Sbjct: 360 GGLSFTKTVI-STIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSI 410


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bcenmc03_5842  TCRTETB  116  2e-30  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 116 bits (292), Expect = 2e-30
Identities = 78/406 (19%), Positives = 166/406 (40%), Gaps = 13/406 (3%)

Query: 14 RRGVTLLTLCIAVLVAQVDTAVVNLATRAIGAYFHAGVGALQWVVDSYNLTYAVLLLTGG 73
R L+ LCI + ++ V+N++ I F+ + WV ++ LT+++ G
Sbjct: 11 RHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYG 70

Query: 74 LLADLYGRRRVFMAGTAVFTIASLLCALAPS-VSVLIAARALAGVGAALLLPASLAVVRV 132
L+D G +R+ + G + S++ + S S+LI AR + G GAA PA + VV
Sbjct: 71 KLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAA-AFPALVMVVVA 129

Query: 133 VWRDPVERGRALGIWAACNGVAMAIGPTLGGVLIRHFGWRSIFFVVVPLSIAAMLLAIPA 192
+ RG+A G+ + + +GP +GG++ + W + +++P+ + +
Sbjct: 130 RYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWS--YLLLIPMITIITVPFLMK 187

Query: 193 VPESSDPHGRHFDGAAQVTGALALGALAYAAIVFREAPVACAIAGCIAVASFAGFVAIER 252
+ + HFD + ++ + + + + ++V SF FV R
Sbjct: 188 LLKKEVRIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLI------VSVLSFLIFVKHIR 241

Query: 253 RHGESALVPLDIFRISAFRGAIVATTGMTFGMYGVLFLLPLTWQSIGRLDSTGAGLALLP 312
+ V + + F ++ + + G + ++P + + +L + G ++
Sbjct: 242 K-VTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIF 300

Query: 313 MALVFVVVS-PCSGPLSERIGTRATTAGGVAVIASGLAVISVSASSSSLLGAEIGLALTG 371
+ V++ G L +R G GV ++ S ++S I +
Sbjct: 301 PGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMT-IIIVFVL 359

Query: 372 LGMGIATGPLMTVAVGAVDAARSGTASALVNVARMAGATLGIAVLG 417
G+ + T+ ++ +G +L+N GIA++G
Sbjct: 360 GGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVG 405


40  Bcenmc03_5914  Bcenmc03_5925  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
Bcenmc03_5914  2       8        1.561223   PilS domain-containing protein
Bcenmc03_5915  2       8        1.129040   type II secretion system protein E
Bcenmc03_5916  2       7        1.907986   hypothetical protein
Bcenmc03_5917  2       7        1.366218   type IV prepilin
Bcenmc03_5918  2       7        1.025029   YscC/HrcC family type III secretion outer
Bcenmc03_5919  4       9        1.997255   histidine kinase
Bcenmc03_5920  4       11       2.165670   two component transcriptional regulator
Bcenmc03_5921  4       11       4.149193   type II/III secretion system family protein
Bcenmc03_5922  4       15       1.798128   hypothetical protein
Bcenmc03_5923  3       14       1.471238   hypothetical protein
Bcenmc03_5924  2       12       2.707452   hypothetical protein
Bcenmc03_5925  2       12       2.927225   putative type III secretion apparatus protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bcenmc03_5914  PilS_PF08805  132  2e-41  PilS N terminal
		>PilS_PF08805#PilS N terminal

Length = 185

Score = 132 bits (333), Expect = 2e-41
Identities = 41/165 (24%), Positives = 84/165 (50%), Gaps = 8/165 (4%)

Query: 38 RGASLLEAISYLGIAAIVVIGAIALLAGAFSSANTNSITEQVNAIQSGVKKLYMGQSASY 97
+GA+L+E + +G+ ++ A L + S+ +++ V + + +K L +
Sbjct: 26 KGATLMEVLLVVGVIVVLAASAYKLYSMVQSNIQSSNEQNNVLTVIANMKSLKFQGRYTD 85

Query: 98 ANLSNSVLASAGVFPSTLAPASGSGAITNMWNGTITVAAATNNSNQFTITYTNVPRSVCV 157
+N + L + G+ PS + + + N W G++T+ +++ + F + NVP+ C+
Sbjct: 86 SNYIKT-LYAQGLLPSDMIADTTGASAKNPWGGSVTITTSSDKYS-FNVVEANVPQKNCM 143

Query: 158 NSVTAGGSWISIT-VNETALTLPATPDSAATACASGDTNTVAWTS 201
V A S +I+ +N T + SAAT CAS D+NT+ +++
Sbjct: 144 AMVNALRSSSAISKINNT----STSTVSAATVCAS-DSNTLTFST 183


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bcenmc03_5917  BCTERIALGSPH  45  2e-07  Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 44.6 bits (105), Expect = 2e-07
Identities = 17/59 (28%), Positives = 28/59 (47%)

Query: 7 RQRGFTIVEMLAALAIASLMIVGVTAMIDTSLADAKGQQAAAWQAQMTQAAAQLITQNQ 65
RQRGFT++EM+ L + + V S D+ Q A ++AQ+ + + Q
Sbjct: 2 RQRGFTLLEMMLILLLMGVSAGMVLLAFPASRDDSAAQTLARFEAQLRFVQQRGLQTGQ 60


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bcenmc03_5918  TYPE3OMGPROT  264  2e-82  Type III secretion system outer membrane G protein ...
		>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature.

Length = 607

Score = 264 bits (675), Expect = 2e-82
Identities = 93/312 (29%), Positives = 153/312 (49%), Gaps = 27/312 (8%)

Query: 277 SQQLSSAVASSGPTAGGQASGGGDEEALPVIEADQRTNSVLIRDTPDRMYQYPALIQRLD 336
+ Q + P A +AS A +EAD N++++RD+P+RM Y LI LD
Sbjct: 223 TIQQVTVDNQRIPQAATRAS------AQARVEADPSLNAIIVRDSPERMPMYQRLIHALD 276

Query: 337 VKPRLIEIEAHIFEVDTSSIRQLGVNWTAHNSHIDLQTGNGLGAQNTYGGTLTQNFGNTT 396
IE+ I +++ + +LGV+W ++TGN G + N
Sbjct: 277 KPSARIEVALSIVDINADQLTELGVDWRV-----GIRTGNNHQVVIKTTGDQSNIASN-- 329

Query: 397 LAGNVTAAAMPVGGVLSAVIGNAGRYLMANVSALEEQNLAKIDASPKVTTLDNIEADMAN 456
G + S V YL+A V+ LE + A++ + P + T +N +A + +
Sbjct: 330 ------------GALGSLVDARGLDYLLARVNLLENEGSAQVVSRPTLLTQENAQAVIDH 377

Query: 457 QTQFFVRVSGYTSADLYSVSTGVSLRVLPMVVDEGGRTQIKLDVAIQDGQL--TSRTVDN 514
++V+V+G A+L ++ G LR+ P V+ +G +++I L++ I+DG S ++
Sbjct: 378 SETYYVKVTGKEVAELKGITYGTMLRMTPRVLTQGDKSEISLNLHIEDGNQKPNSSGIEG 437

Query: 515 IPVISSTNINTSAFVNEGEALLIAGYKNDGRIDTTTGVPVLSKIPVIGNLFKYTDRENTR 574
IP IS T ++T A V G++L+I G D + VP+L IP IG LF+ R
Sbjct: 438 IPTISRTVVDTVARVGHGQSLIIGGIYRDELSVALSKVPLLGDIPYIGALFRRKSELTRR 497

Query: 575 MERLFLLTPRII 586
RLF++ PRII
Sbjct: 498 TVRLFIIEPRII 509



Score = 177 bits (449), Expect = 3e-50
Identities = 70/244 (28%), Positives = 110/244 (45%), Gaps = 11/244 (4%)

Query: 4 FRFGLLFILVAAC----VAATVHAAPVNWHTRMVDYTADSKDIKDVLRDFAASQGIPADI 59
F F V +++ A ++W Y A + ++D+L DF A+ +
Sbjct: 3 FPLHSFFKRVLTGTLLLLSSYSWAQELDWLPIPYVYVAKGESLRDLLTDFGANYDATVVV 62

Query: 60 SKDVQGSVTGKF-HMPPQRLLDTLASSFGFVWYYDGQVLDIVTPDEMKSTLIKLDHGSTA 118
S + V+G+F H PQ L +AS + VWYYDG VL I E+ S LI+L A
Sbjct: 63 SDKINDKVSGQFEHDNPQDFLQHIASLYNLVWYYDGNVLYIFKNSEVASRLIRLQESEAA 122

Query: 119 QLRSTLAAMNVTDPRFRITYDDVQGAAIVNGPPNYVKLVGDVAQRLDTTTRHRA----GT 174
+L+ L + +PRF D V+GPP Y++LV A L+ T+ R+
Sbjct: 123 ELKQALQRSGIWEPRFGWRPDASNRLVYVSGPPRYLELVEQTAAALEQQTQIRSEKTGAL 182

Query: 175 VVQVYPLHHAWAMDRSVVADGQSMTLLGVATVLNNVYH--PQQGGGGNSGGGGRAPNVQR 232
++++PL +A A DR++ + GVAT+L V Q ++ +A
Sbjct: 183 AIEIFPLKYASASDRTIHYRDDEVAAPGVATILQRVLSDATIQQVTVDNQRIPQAATRAS 242

Query: 233 AQPM 236
AQ
Sbjct: 243 AQAR 246


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bcenmc03_5920  HTHFIS  79  6e-19  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 78.7 bits (194), Expect = 6e-19
Identities = 29/129 (22%), Positives = 53/129 (41%), Gaps = 1/129 (0%)

Query: 5 TRIMLLEDDRIQQTMLVSWLKAEGYQVEAFDNGIEARNHLSDHWADLMILDWDVPGLSGD 64
I++ +DD +T+L L GY V N ++ DL++ D +P +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 65 KLLSWVRGRSRSTVPVIFQTVHSDEEEIVRILDTGADDFLIKPVDRIVFLARIRALLRRF 124
LL ++ R +PV+ + + ++ + GA D+L KP D + I L
Sbjct: 64 DLLPRIKKA-RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122

Query: 125 QTAGSERRR 133
+ S+
Sbjct: 123 KRRPSKLED 131


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bcenmc03_5921  TYPE3OMGPROT  164  2e-46  Type III secretion system outer membrane G protein ...
		>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature.

Length = 607

Score = 164 bits (417), Expect = 2e-46
Identities = 83/273 (30%), Positives = 126/273 (46%), Gaps = 14/273 (5%)

Query: 77 PPWSSAPYRYSTSGASLPDTLRALSAATHVPIAFDAGLPGRVEGRFEL-PPQRFVEMLAH 135
W PY Y G SL D L A + + +V G+FE PQ F++ +A
Sbjct: 29 LDWLPIPYVYVAKGESLRDLLTDFGANYDATVVVSDKINDKVSGQFEHDNPQDFLQHIAS 88

Query: 136 GYGLVWYYDGTVLHVDAAGTQTTLIVRLNYARPTDLHALLAQTGIDDVRFVARDDAPARG 195
Y LVWYYDG VL++ + ++RL + +L L ++GI + RF R D +
Sbjct: 89 LYNLVWYYDGNVLYIFKNSEVASRLIRLQESEAAELKQALQRSGIWEPRFGWRPD-ASNR 147

Query: 196 LITFRGPPAWIALVGRAAQRLDADARARV----KTAVRIVPLHYGNAADRSAFANGRSNV 251
L+ GPP ++ LV + A L+ + R A+ I PL Y +A+DR+
Sbjct: 148 LVYVSGPPRYLELVEQTAAALEQQTQIRSEKTGALAIEIFPLKYASASDRTIHYRDDEVA 207

Query: 252 VQGVASRAARVLDPHDSLRATITEYEAP--------LPVLGADAGTNAVLVRDRPERLDA 303
GVA+ RVL + T+ P + AD NA++VRD PER+
Sbjct: 208 APGVATILQRVLSDATIQQVTVDNQRIPQAATRASAQARVEADPSLNAIIVRDSPERMPM 267

Query: 304 DVRAIIALDRPRQHVGLGLLVAEVDTDALGAIG 336
R I ALD+P + + L + +++ D L +G
Sbjct: 268 YQRLIHALDKPSARIEVALSIVDINADQLTELG 300


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bcenmc03_5922  PF05932  29  0.003  Tir chaperone protein (CesT)
		>PF05932#Tir chaperone protein (CesT)

Length = 127

Score = 29.0 bits (65), Expect = 0.003
Identities = 16/96 (16%), Positives = 29/96 (30%), Gaps = 8/96 (8%)

Query: 37 DVRLD-----HFENDPEAMYVNFHYGTVTAGRTLVIFRLMLEANLLIYAQDQAQLGLDAD 91
++ +D D + G + + + L L L LGLD
Sbjct: 31 NMIIDNTFALTLSCDYARERLLL-IGLLEPHKDIPQQCL-LAGALNPLLNAGPGLGLDEK 88

Query: 92 TGGIILILRLPLTPDVDGAVVADTVSHYTEHGRYWR 127
+G +P + + ++ E R WR
Sbjct: 89 SGLYHAYQSIPRE-KLSVPTLKREMAGLLEWMRGWR 123


41  Bcenmc03_3223  Bcenmc03_3228  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
Bcenmc03_3223  2       16       2.125873   cyclic nucleotide-binding protein
Bcenmc03_3224  1       14       1.561508   acriflavin resistance protein
Bcenmc03_3225  -1      11       1.995945   RND family efflux transporter MFP subunit
Bcenmc03_3226  -1      11       1.069407   RND efflux system outer membrane lipoprotein
Bcenmc03_3227  -1      10       -0.547458  two component heavy metal response
Bcenmc03_3228  -1      9        -0.667042  heavy metal sensor signal transduction histidine
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bcenmc03_3223  FLGPRINGFLGI  28  0.036  Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 28.0 bits (62), Expect = 0.036
Identities = 14/42 (33%), Positives = 21/42 (50%), Gaps = 1/42 (2%)

Query: 71 IGVHQGLLKLAIFNVSGRGCTFS-GVPSGGWFGEGSVIKREL 111
V QG L + F+ G T + GV + G++I+REL
Sbjct: 142 YAVAQGALIVNGFSAQGDAATLTQGVTTSARVPNGAIIEREL 183


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bcenmc03_3224  ACRIFLAVINRP  637  0.0  Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 637 bits (1645), Expect = 0.0
Identities = 247/1056 (23%), Positives = 444/1056 (42%), Gaps = 51/1056 (4%)

Query: 3 IVNLALRRPYTFIVMAIMIVLATPLALMRTPVDVLPAINIPVISVIWNYSGFSATEMTNR 62
+ N +RRP V+AI++++A LA+++ PV P I P +SV NY G A + +
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 63 ITSVHERILTTTVNNIQHVESTSLP-GIAVVKVFLQPGANVQTAIAQTVSSAQAIVRQMP 121
+T V E+ + ++N+ ++ STS G + + Q G + A Q + Q +P
Sbjct: 61 VTQVIEQNMNG-IDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLP 119

Query: 122 QGATPPLVITYSASSIPVIQLGLSS--KTLSEQSLADIALNFLRPQLITVPGVQIPFPYG 179
Q + +SS ++ G S ++ ++D + ++ L + GV +G
Sbjct: 120 QEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFG 179

Query: 180 GRTRVVAIDLDPQALQAKGLTPADIVNAVNAQNLVLPTGT-----AKMGQT-EYRIDTNA 233
+ + I LD L LTP D++N + QN + G A GQ I
Sbjct: 180 AQ-YAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQT 238

Query: 234 SADTVADISNLPVQT-INGATTYLREVAAVRDGFAPQTNVVRQNGQRGVLISILKSGDAS 292
+ + ++ +G+ L++VA V G + R NG+ + I + A+
Sbjct: 239 RFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGAN 298

Query: 293 TLKVVSDLKALLPKVIPTLPEGLTITPLFDQSVFVNAAVQGVIHEALIAAVLTAMMILLF 352
L +KA L ++ P P+G+ + +D + FV ++ V+ A +L +++ LF
Sbjct: 299 ALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLF 358

Query: 353 LGNWRSTLIIAISIPLSIFTSLIALSALGETINIMTLGGLALAVGILVDDATVTIENIER 412
L N R+TLI I++P+ + + L+A G +IN +T+ G+ LA+G+LVDDA V +EN+ER
Sbjct: 359 LQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVER 418

Query: 413 HLH-LGTNLHDAILEGAGEIAVPALVSTLCICIVFVPMFFLTGVARFLFVPLAEAVVFAM 471
+ +A + +I + + + VF+PM F G ++ + +V AM
Sbjct: 419 VMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAM 478

Query: 472 LASYVLSRTLVPTLAMLLFRPQQANTGADHSTSRFARIHHAFNHAFERLRAWYIVLLSIL 531
S +++ L P L L +P A+H ++ FN F+ Y + +
Sbjct: 479 ALSVLVALILTPALCATLLKPV----SAEHHENK-GGFFGWFNTTFDHSVNHYTNSVGKI 533

Query: 532 LVRRRFYALCFLGFCVLSTGLVFMLGRDFFPNADSGNLRLHVRAPTGYRIEETARLADQV 591
L Y L + L L F P D G ++ P G E T ++ DQV
Sbjct: 534 LGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQV 593

Query: 592 ERVIRETVPPDELGAIVDNLGLPVSGINLSYSNAGTIGTLDGELLIALKPGHRATGH--- 648
L N+ + S+S G ++LKP G
Sbjct: 594 TDYY--------LKNEKANVESVFTVNGFSFSGQAQN---AGMAFVSLKPWEERNGDENS 642

Query: 649 ---YVQTLRTLLPQRFPGVEFFFQPSDIITQILNFGQPAAIDVQVLGNDLASNMTIAS-S 704
+ + L + G F I+ L ++ +T A
Sbjct: 643 AEAVIHRAKMELGKIRDGFVIPFNMPAIVE--LGTATGFDFELIDQAGLGHDALTQARNQ 700

Query: 705 LMKKIRQIPGAV-DVHVLQRNDEPTLLADMDRTRMQQLNLSAQNVAQNMLISLSGSSQTT 763
L+ Q P ++ V D ++D+ + Q L +S ++ Q + +L G+
Sbjct: 701 LLGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVND 760

Query: 764 PSFWINPRTGVQYPLQIQTPQYNLSSVDDLLGTPISASGRTGTPLQLLGNLVQVRSTVNP 823
G L +Q +D+ + ++ P
Sbjct: 761 -----FIDRGRVKKLYVQADAKFRMLPEDVDKLYVRSANGEMVP---FSAFTTSHWVYGS 812

Query: 824 AVITHYNIRPAIDVYVSVEGRDLGAVAGEIDRIVADARATLPRGTDLTMRGQIETMRTSY 883
+ YN P++++ G +G+ ++ + + LP G G R S
Sbjct: 813 PRLERYNGLPSMEIQGEAAP---GTSSGDAMALMENLASKLPAGIGYDWTGMSYQERLSG 869

Query: 884 IGLGAGVAMAIVLVYLLIVVNFQSWLDPLIIISAMPAALAGIAWMLFITGTHLSVPALTG 943
A VA++ V+V+L + ++SW P+ ++ +P + G+ + V + G
Sbjct: 870 NQAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVG 929

Query: 944 AIMTVGVATANSILVVSFARQRLAA-GAPPLTAALEAGATRIRPVLMTALAMIIGMVPMA 1002
+ T+G++ N+IL+V FA+ + G + A L A R+RP+LMT+LA I+G++P+A
Sbjct: 930 LLTTIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLA 989

Query: 1003 LGLGEGAEQNAPLGRAVIGGLLFATVSTLLFVPLVF 1038
+ G G+ +G V+GG++ AT+ + FVP+ F
Sbjct: 990 ISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFF 1025



Score = 126 bits (317), Expect = 2e-31
Identities = 82/517 (15%), Positives = 179/517 (34%), Gaps = 43/517 (8%)

Query: 3 IVNLALRRPYTFIVMAIMIVLATPLALMRTPVDVLPAINIPVISVIWNY-SGFSATEMTN 61
V L ++++ +IV + +R P LP + V + +G +
Sbjct: 529 SVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQK 588

Query: 62 RITSVHERILTTTVNNIQHVESTSLPGIAVVKVFLQPGANVQTAIAQTV----------- 110
+ V + L N++ V V F G +A
Sbjct: 589 VLDQVTDYYLKNEKANVESV--------FTVNGFSFSGQAQNAGMAFVSLKPWEERNGDE 640

Query: 111 SSAQAIV-RQMPQGATPPLVITYSASSIPVIQLGLSS----KTLSEQSLADIALNFLRPQ 165
+SA+A++ R + + +++LG ++ + + + L AL R Q
Sbjct: 641 NSAEAVIHRAKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQ 700

Query: 166 LITVPGVQIPFPYGGRTRVVA------IDLDPQALQAKGLTPADIVNAVNAQNLVLPTGT 219
L+ + R + +++D + QA G++ +DI ++
Sbjct: 701 LLGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVND 760

Query: 220 AKMGQTEYRIDTNASAD---TVADISNLPVQTINGATTYLREVAAVRDGFAPQTNVVRQN 276
++ A A D+ L V++ NG + + R N
Sbjct: 761 FIDRGRVKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGS-PRLERYN 819

Query: 277 GQRGVLISILKSGDASTLKVVSDLKALLPKVIPTLPEGLTITPLFDQSVFVNAAVQGVIH 336
G + I + S+ ++ ++ L K LP G+ + Q
Sbjct: 820 GLPSMEIQGEAAPGTSSGDAMALMENLASK----LPAGIGYDWTGMSYQERLSGNQAPA- 874

Query: 337 EALIAAVLTAMMILLFL-GNWRSTLIIAISIPLSIFTSLIALSALGETINIMTLGGLALA 395
+ + + + L L +W + + + +PL I L+A + + ++ + GL
Sbjct: 875 -LVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTT 933

Query: 396 VGILVDDATVTIENI-ERHLHLGTNLHDAILEGAGEIAVPALVSTLCICIVFVPMFFLTG 454
+G+ +A + +E + G + +A L P L+++L + +P+ G
Sbjct: 934 IGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNG 993

Query: 455 VARFLFVPLAEAVVFAMLASYVLSRTLVPTLAMLLFR 491
+ V+ M+++ +L+ VP +++ R
Sbjct: 994 AGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVIRR 1030


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bcenmc03_3225  RTXTOXIND  39  2e-05  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 39.4 bits (92), Expect = 2e-05
Identities = 21/193 (10%), Positives = 53/193 (27%), Gaps = 22/193 (11%)

Query: 86 ASGYVLRWQADIGAHVKQGQTLAELDTPELNQELAQATAQRQQAQAALALAKTS------ 139
+ V G V++G L +L + + + QA+ +
Sbjct: 103 ENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIEL 162

Query: 140 ----------FDRAQQLRQRDAVSQQELDDRQGAFSQGSANLAAADANMRRLT-ELKGFQ 188
Q + + + + L + FS + N+ + E
Sbjct: 163 NKLPELKLPDEPYFQNVSEEEVLRLTSL--IKEQFSTWQNQKYQKELNLDKKRAERLTVL 220

Query: 189 RIVAPID---GIVTQRNVDVGDLVNSGNAGRSLFTVVQADRLRLYVQVPQAYAQQVKVGQ 245
+ + + R D L++ + + + ++ +Q ++
Sbjct: 221 ARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIES 280

Query: 246 HVSVAQAELPGRT 258
+ A+ E T
Sbjct: 281 EILSAKEEYQLVT 293


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bcenmc03_3226  RTXTOXIND  30  0.020  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 30.2 bits (68), Expect = 0.020
Identities = 14/102 (13%), Positives = 37/102 (36%)

Query: 162 RNVEAAQASTEQSRDDFANARLVLSADLASSYFTLRELDTEIDVVKRSIDLQQKALDYVS 221
V + ++ + N + +L + I+ + +++ LD S
Sbjct: 182 EEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFS 241

Query: 222 ARHDLGAVSGLDLLQQRAQLDATRTQAQLLIQQRAQVETAIA 263
+ A++ +L+Q + + ++ Q Q+E+ I
Sbjct: 242 SLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEIL 283


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bcenmc03_3227  HTHFIS  85  4e-21  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 84.9 bits (210), Expect = 4e-21
Identities = 35/139 (25%), Positives = 66/139 (47%), Gaps = 3/139 (2%)

Query: 2 KVLIVEDEPKVVEYLKSGLTEEGWVVDTALDGEDGAWKAVE-FDYDVVVLDVMLPKLDGF 60
+L+ +D+ + L L+ G+ V + W+ + D D+VV DV++P + F
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAAT-LWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 61 GVLRALRA-QKQTPVIMLTARDRVDDRVRGLRGGADDYLTKPFSFLELIERLRALTRRAR 119
+L ++ + PV++++A++ ++ GA DYL KPF ELI + +
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 120 VQESTLISIGDLRVDLIGR 138
+ S L + L+GR
Sbjct: 124 RRPSKLEDDSQDGMPLVGR 142


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bcenmc03_3228  PF06580  36  2e-04  Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 36.4 bits (84), Expect = 2e-04
Identities = 29/133 (21%), Positives = 54/133 (40%), Gaps = 31/133 (23%)

Query: 303 RASDLKDVSLADEVRRMLDFLEIPLDEAQLRAELHGDARAAVDPSLFRRAMTNLLI---- 358
R S+ + VSLADE+ + +L++ A ++ E ++P++ + +L+
Sbjct: 209 RYSNARQVSLADELTVVDSYLQL----ASIQFEDRLQFENQINPAIMDVQVPPMLVQTLV 264

Query: 359 -NAIQH----SAPGATLNVTITRRDTLVEMAVSNPGEPIDPVQRSHVFERFYRLEEARAN 413
N I+H G + + T+ + V + V N G N
Sbjct: 265 ENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALK------------------N 306

Query: 414 SKENHGLGLSIVK 426
+KE+ G GL V+
Sbjct: 307 TKESTGTGLQNVR 319


42  Bcenmc03_3269  Bcenmc03_3276  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
Bcenmc03_3269  -1      11       1.363176   histidine kinase
Bcenmc03_3270  -1      13       0.970454   two component transcriptional regulator
Bcenmc03_3271  -1      13       0.650985   Bcr/CflA subfamily drug resistance transporter
Bcenmc03_3272  -1      12       0.591577   transcriptional regulator
Bcenmc03_3273  -1      12       -0.347856  extracellular solute-binding protein
Bcenmc03_3274  -3      11       0.779072   binding-protein-dependent transport systems
Bcenmc03_3275  -3      11       1.217818   ABC transporter-like protein
Bcenmc03_3276  1       13       0.982915   porin
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bcenmc03_3269  PF06580  43  2e-06  Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 42.5 bits (100), Expect = 2e-06
Identities = 19/104 (18%), Positives = 40/104 (38%), Gaps = 24/104 (23%)

Query: 367 LIDNAIRYA----GDRAVITVRVSRDGADARLDVIDNGPGIPADERDAVFERFHRGSKTQ 422
L++N I++ I ++ ++D L+V + G + +
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTK-------------- 308

Query: 423 TVEGTGLGLSIVRE-IARVH--QGSVTLADAAGGGLIVTIRLPA 463
E TG GL VRE + ++ + + L++ G + +P
Sbjct: 309 --ESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVN-AMVLIPG 349


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
Bcenmc03_3270  HTHFIS  97     2e-25    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 96.8 bits (241), Expect = 2e-25
Identities = 36/128 (28%), Positives = 63/128 (49%), Gaps = 1/128 (0%)

Query: 2 RVLLVEDNPNLAQSLNDALSAARFAVDHMADGEAADHVLRTQDYALVILDLGLPKLDGLE 61
+L+ +D+ + LN ALS A + V ++ + D LV+ D+ +P + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 VLRRLRARRNPVPVLILTAHGSVEDRVKGLDLGADDYLAKPFELTE-LEARARALIRRSL 120
+L R++ R +PVL+++A + +K + GA DYL KPF+LTE + RAL
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 121 GHEHSRVE 128
+
Sbjct: 125 RPSKLEDD 132


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
Bcenmc03_3271  TCRTETB  63     5e-13    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 63.4 bits (154), Expect = 5e-13
Identities = 41/161 (25%), Positives = 65/161 (40%), Gaps = 3/161 (1%)

Query: 32 SLPAMADALHGTDAQLQLTLTLYMVGYALSMLVSGPLSDRYGRRPVLLGGLCVYVVASVA 91
SLP +A+ + A T +M+ +++ V G LSD+ G + +LL G+ + SV
Sbjct: 36 SLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVI 95

Query: 92 CAWSTS-IPALIAARMFQALGGCCGTVIGRVIVRERFPAATQATMLGHISAGMALSPVVA 150
S LI AR Q G + V+V P + G I + +A+ V
Sbjct: 96 GFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVG 155

Query: 151 PLAGSAIAQWLGWRGVFGWLAAGGLVATAMVLRYLPETRER 191
P G IA ++ W + + T L L + R
Sbjct: 156 PAIGGMIAHYIHWSYLLLIPMI--TIITVPFLMKLLKKEVR 194


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_3276  ECOLNEIPORIN  81     3e-19    E.coli/Neisseria porin superfamily signature.
		>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature.

Length = 331

Score = 80.6 bits (199), Expect = 3e-19
Identities = 78/324 (24%), Positives = 126/324 (38%), Gaps = 35/324 (10%)

Query: 20 AACLAAPAAHAQSSVTMYGIMDAGIEFTNHAAPEGGNSVKLKSGNKNT---SRWGLRGVE 76
A LAA A + VT+YG + AG+E + A G + +++G S+ G +G E
Sbjct: 7 ALTLAALPVAAMADVTLYGTIKAGVETSRSVAHNGAQAASVETGTGIVDLGSKIGFKGQE 66

Query: 77 DLGGGLKAVFRLESGIDLANGASDDGPDSIFARRATVGLKGKWGELSLGRNFTVTYDY-- 134
DLG GLKA++++E +A S G R++ +GLKG +G+L +GR +V D
Sbjct: 67 DLGNGLKAIWQVEQKASIAGTDSGWG-----NRQSFIGLKGGFGKLRVGRLNSVLKDTGD 121

Query: 135 MLPFDPMGYAQNYSWATSSMATGGRKDGLFTRSSNAVRYDG-EFSGFKFGALYGFGNVPG 193
+ P+D + +A + +VRYD EF+G Y + G
Sbjct: 122 INPWDSKS----DYLGVNKIA---EPEARLI----SVRYDSPEFAGLSGSVQYALNDNAG 170

Query: 194 SMKTSSKYDFAVGYETGPFAAVVTFDRQNGAADSVTPADTVNYIQGIHAGLSYDFGNL-K 252
S Y Y+ G F + V + Q YD L
Sbjct: 171 R-HNSESYHAGFNYKNGGFFVQYGGAYKR--HHQVQENVNIEKYQIHRLVSGYDNDALYA 227

Query: 253 TMAG-YRNYKRTFHTTAANQLSDMYWLGGSYQF-----TPTFSLIAAVYHQNIKGGTDAD 306
++A ++ K + N ++ + + + TP S + D
Sbjct: 228 SVAVQQQDAKLVEENYSHNSQTE---VAATLAYRFGNVTPRVSYAHGFKGSFDATNYNND 284

Query: 307 PTLVSVRAQYALSKRTVLYAAGAF 330
V V A+Y SKRT + +
Sbjct: 285 YDQVVVGAEYDFSKRTSALVSAGW 308


43  Bcenmc03_3604  Bcenmc03_3609  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias   Product
Bcenmc03_3604  -1      13       0.116704  peptidase S53 propeptide
Bcenmc03_3605  -2      12       0.624810  major facilitator transporter
Bcenmc03_3606  0       10       0.564748  binding-protein-dependent transport systems
Bcenmc03_3607  -1      9        0.422908  binding-protein-dependent transport systems
Bcenmc03_3608  -2      8        0.719682  extracellular solute-binding protein
Bcenmc03_3609  1       8        1.301597  spermidine/putrescine ABC transporter ATPase
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits        Score  E-value  Comments
Bcenmc03_3604  SUBTILISIN  44     1e-06    Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 43.7 bits (103), Expect = 1e-06
Identities = 51/344 (14%), Positives = 97/344 (28%), Gaps = 75/344 (21%)

Query: 235 TAAGVTVGIITIGGVSQTLQDLKQFTSSNGYGTVSTQTVKTNGTGGSYTDDQDGQGEWDL 294
GV V ++ G DLK + T+ G +D G
Sbjct: 39 RGRGVKVAVLD-TGCDADHPDLK--------ARIIGGRNFTDDDEGDPEIFKDYNGHGTH 89

Query: 295 DSQSIVGSAGGQVGKLVFYMADLNA---------AGNTGLTQAFNRAVSDNTAKVINVSL 345
+ +I + V ADL + Q A+ +I++SL
Sbjct: 90 VAGTIAATENENGVVGVAPEADLLIIKVLNKQGSGQYDWIIQGIYYAIEQK-VDIISMSL 148

Query: 346 GWCETDANADGTLDAEEQIFTTAAAQGQTFSVSSGDEGVYECNNRGYPDGSNYTVSWPAS 405
G + + A A ++G+EG + + +P
Sbjct: 149 G-------GPEDVPELHEAVKKAVASQILVMCAAGNEGDGDDRTD--------ELGYPGC 193

Query: 406 SPHVLAIGGTTLYTTSAGAFSNETVWNEGLDSNGKLWATGGGVSTILPAPSWQSGSNRQL 465
V+++G ++ FSN + L A G + + +P
Sbjct: 194 YNEVISVGAINFDRHAS-EFSNSN-------NEVDLVAPGEDILSTVP------------ 233

Query: 466 PDVAFDAAQSTGAYIYNYGQLQQIGGTSLAAPIFTGFWARLLAANGTGLGFPASNFYADI 525
G+ GTS+A P G A + + ++
Sbjct: 234 -----------------GGKYATFSGTSMATPHVAGALALIKQLANASFERDLTE--PEL 274

Query: 526 PSHPSLVRYDVVSGNNGYQGYGY-KAGTGWDLTTGFGSLNIANL 568
+ + R + + +G G +L+ F + +A +
Sbjct: 275 YAQ-LIKRTIPLGNSPKMEGNGLLYLTAVEELSRIFDTQRVAGI 317


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
Bcenmc03_3605  TCRTETB  115    4e-30    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 115 bits (290), Expect = 4e-30
Identities = 81/398 (20%), Positives = 155/398 (38%), Gaps = 12/398 (3%)

Query: 27 VALATLDTAIANTALPAIAADLHASPAASVWIINAYQLAMVATLLPFASLGDIVGHKRVY 86
+ L+ + N +LP IA D + PA++ W+ A+ L + L D +G KR+
Sbjct: 23 SFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLL 82

Query: 87 VAGLAVFTLASL-GCSLASTLPMLTAARIVQGFGASAIMSVNVALIRGLFPAHRLGRGVG 145
+ G+ + S+ G S +L AR +QG GA+A ++ + ++ P G+ G
Sbjct: 83 LFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFG 142

Query: 146 FNALVVGVSFAVGPTIASLILSVAAWPWLFAVNVPLGVFALAVAIPSLPQTARGKHAFDP 205
+V + VGP I +I W +L + + + + + + L + R K FD
Sbjct: 143 LIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPM-ITIITVPFLMKLLKKEVRIKGHFDI 201

Query: 206 VAALFNVITFASLIFALGEFAQRGPLSVVFAAAAVAFSFGWLLIRRQAGHPAPMLPVDLF 265
+ + + + S+ F +V + ++ P + L
Sbjct: 202 KGIILMSVGIVFFMLFTTSY------SISFLIVSVLSFL--IFVKHIRKVTDPFVDPGLG 253

Query: 266 RRPVFTLSALTAVCAFAAQGLAFVSLPFYFETVLHRSAVETGF-LMTPWSAIVALAAPIA 324
+ F + L F +P+ + V S E G ++ P + V + I
Sbjct: 254 KNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIG 313

Query: 325 GRLSDRYPPGLLGAIGLALLSAGMVSLAALPVSPGVVDIGWRMMLCGAGFGFFQSPNLKA 384
G L DR P + IG+ LS ++ + L + + ++ G F ++
Sbjct: 314 GILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWF-MTIIIVFVLGGLSFTKTVISTI 372

Query: 385 LMSSAPPERSGGASGIIATARLIGQATGAALVALSFGI 422
+ SS + +G ++ + + TG A+V I
Sbjct: 373 VSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSI 410


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
Bcenmc03_3607  PF06580  29     0.040    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 28.7 bits (64), Expect = 0.040
Identities = 17/109 (15%), Positives = 43/109 (39%), Gaps = 12/109 (11%)

Query: 195 SIYLAIFGRTFVIGIAVTLFALLLGYPLAYWISTLSERRANLVMILVLIPFWTSVLVRVA 254
S+Y + + + IA++L L+L + +I + N+ I++ + + +V
Sbjct: 32 SLYGSPKLHSMIFNIAISLMGLVLTHAYRSFIKRQGWLKLNMGQIILRV--LPACVVIGM 89

Query: 255 AWIV----------LLQSEGLINTALIGSGLISHPLTLLFNRVGVYISM 293
W V + ++ + T + +I + + + F +Y
Sbjct: 90 VWFVANTSIWRLLAFINTKPVAFTLPLALSIIFNVVVVTFMWSLLYFGW 138


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
Bcenmc03_3609  PF05272  31     0.011    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.8 bits (69), Expect = 0.011
Identities = 9/31 (29%), Positives = 15/31 (48%)

Query: 37 LTLLGPSGSGKTTCLMMLAGFEFPTGGEIRL 67
+ L G G GK+T + L G +F + +
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDI 629


44  Bcenmc03_3624  Bcenmc03_3641  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
Bcenmc03_3624  0       11       0.918545   MscS mechanosensitive ion channel
Bcenmc03_3625  -2      10       0.447609   TetR family transcriptional regulator
Bcenmc03_3626  -2      12       0.759735   secretion protein HlyD family protein
Bcenmc03_3627  -2      12       -0.305557  EmrB/QacA family drug resistance transporter
Bcenmc03_3628  -2      12       0.233712   RND efflux system outer membrane lipoprotein
Bcenmc03_3629  -1      13       -0.010654  catalase
Bcenmc03_3630  -1      13       0.626134   ankyrin
Bcenmc03_3631  -1      13       0.326833   RND family efflux transporter MFP subunit
Bcenmc03_3632  -1      10       0.128933   hydrophobe/amphiphile efflux-1 (HAE1) family
Bcenmc03_3633  -1      11       0.436297   RND efflux system outer membrane lipoprotein
Bcenmc03_3634  0       11       0.302512   two component transcriptional regulator
Bcenmc03_3635  -1      9        -0.118774  periplasmic sensor signal transduction histidine
Bcenmc03_3636  -3      8        -1.028127  pseudomonalisin
Bcenmc03_3637  -2      7        -1.218871  hypothetical protein
Bcenmc03_3638  1       7        -1.845044  flavin-nucleotide-binding protein
Bcenmc03_3639  1       8        -1.834870  transcriptional regulator
Bcenmc03_3640  2       9        -1.820065  phospholipase D/transphosphatidylase
Bcenmc03_3641  5       10       -1.206487  *RNA polymerase sigma factor RpoD
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_3624  PYOCINKILLER  31     0.024    Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 30.9 bits (69), Expect = 0.024
Identities = 37/177 (20%), Positives = 60/177 (33%), Gaps = 18/177 (10%)

Query: 9 LLAAVVSAAHAAAPAPAAASAASGAAPALTPQEARQALNVLENPRDRAQVETTLRAIAAV 68
L +S+ AA A+ AA A +E A + + E R AA+
Sbjct: 192 LFTEAISSLQIRMNTLTAAKASIEAAAANKAREQAAA-------EAKRKAEEQARQQAAI 244

Query: 69 GALSAPAVPASAAPATSGASAAAAPAALTSNGLASML---VRQGSRWATQIGNALQESLR 125
A + A+PA+ + + A A + LA + + R + +
Sbjct: 245 RAANTYAMPANGSVVATAAGRGLIQVAQGAASLAQAISDAIAVLGRVLASAPSVMAVGFA 304

Query: 126 SLLDIGSVGSWWHDKLVSADQRADLTRTLGILVAVLLPALIVEWLAKRLLRRALATV 182
SL W D+ + + A LG+ A L V A + +A TV
Sbjct: 305 SLTYSSRTAEQWQDQTPDSVRYA-----LGMDAAKLGLPPSVNLNA---VAKASGTV 353


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
Bcenmc03_3625  HTHTETR  63     5e-14    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 63.1 bits (153), Expect = 5e-14
Identities = 20/93 (21%), Positives = 41/93 (44%), Gaps = 2/93 (2%)

Query: 16 ARGDETRQRIIEAAIELFGERGFAGASTREIAAMAGVNAPALQYYFENKEGVYRACVETI 75
ETRQ I++ A+ LF ++G + S EIA AGV A+ ++F++K ++ E
Sbjct: 7 QEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELS 66

Query: 76 AEHGWQVFAPAVGHARAMLDGHASVDALIDAFI 108
+ ++ + + ++ +
Sbjct: 67 ESNIGELELEYQAKFPGDP--LSVLREILIHVL 97


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits       Score  E-value  Comments
Bcenmc03_3626  RTXTOXIND  129    1e-35    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 129 bits (326), Expect = 1e-35
Identities = 56/370 (15%), Positives = 114/370 (30%), Gaps = 81/370 (21%)

Query: 69 SMTAAPKVAGYVTDVYVRDNQPVKAGDPLVRLD-------VRQYQVALAQAQATV----- 116
S P V ++ V++ + V+ GD L++L + Q +L QA+
Sbjct: 96 SKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQI 155

Query: 117 ----------------------------------------DARRADIARAEADISQQRAN 136
+ + E ++ ++RA
Sbjct: 156 LSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAE 215

Query: 137 LEQADAQAKVSRINAQHASDEYTRYAPLAATGAETHERVADLKSTRDQAAATLAANNASI 196
A+ ++ ++ L A V + ++ +A L + +
Sbjct: 216 RLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQL 275

Query: 197 AAARTQIASFTA---------------QLQQARAQLEAAQASAAQAQLDLDNTIVRSTLA 241
++I S +L+Q + A+ + +++R+ ++
Sbjct: 276 EQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVS 335

Query: 242 GRVGDRTVR-VGQYVQPGTRLLTVVPVDSIYLV-ANFKETQIGNMRIGQPVELHVDALPD 299
+V V G V L+ +VP D V A + IG + +GQ + V+A P
Sbjct: 336 VKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPY 395

Query: 300 ---GPLSGVVDSFAPGTGAQFALLPPENATGNFTKIVQRVPVRIRLAANARAQRMLLPGL 356
G L G V + + G ++ + N L G+
Sbjct: 396 TRYGYLVGKVKNINLDA-------IEDQRLGLVFNVIISIEENCLSTGN--KNIPLSSGM 446

Query: 357 SVTVDVDTRS 366
+VT ++ T
Sbjct: 447 AVTAEIKTGM 456


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
Bcenmc03_3627  TCRTETB  87     2e-20    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 86.9 bits (215), Expect = 2e-20
Identities = 66/400 (16%), Positives = 158/400 (39%), Gaps = 15/400 (3%)

Query: 32 ALMATLDISITNSALPQIQGEIGATGTEGTWISTGYLMSEIVMIPLAAWLTRVFGLRNFL 91
+ + L+ + N +LP I + W++T ++++ + + L+ G++ L
Sbjct: 23 SFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLL 82

Query: 92 LTNSALFIAFSMMCGWSHS-LPMMIAGRIGQGFTGGALIPTAQTIIRTRLPLSQLPVGMT 150
L + S++ HS ++I R QG A ++ +P
Sbjct: 83 LFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFG 142

Query: 151 LFGLIVLLGPLFGPVLGGWLAENVNWSWCFFLNLPVCLLLMALLVFGLPSDRPQWSAFFN 210
L G IV +G GP +GG +A ++WS + L +P+ ++ + L + + +
Sbjct: 143 LIGSIVAMGEGVGPAIGGMIAHYIHWS--YLLLIPMITIITVPFLMKLL--KKEVRIKGH 198

Query: 211 ADWLGILGLAIGLSSLTVVLEEGQRERWFESQMIVTLSIVSFIGMVLIALSQRFAKRPIM 270
D GI+ +++G+ + +L F + ++ IVS + ++ R P +
Sbjct: 199 FDIKGIILMSVGI--VFFML--------FTTSYSISFLIVSVLSFLIFVKHIRKVTDPFV 248

Query: 271 RLSLMRNPRYASVIVIVSAVGAGLYGVSYLLPQFLAIVAGYNAEQAGAIMLLSGLPAFLV 330
L +N + ++ + + G ++P + V + + G++++ G + ++
Sbjct: 249 DPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVII 308

Query: 331 MPILPRLLGKVDFRILVITGLLLFCLSCMLDISLTAQSVGHDFVWSQLIRGLAQMLAMMP 390
+ +L + V+ + F L S ++ +
Sbjct: 309 FGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTV 368

Query: 391 LNQASMAAVAREDSGDAAGLYNMARNLGGSIGLAIIGTVI 430
++ +++ ++++G L N L G+AI+G ++
Sbjct: 369 ISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLL 408


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits       Score  E-value  Comments
Bcenmc03_3631  RTXTOXIND  50     9e-09    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 49.8 bits (119), Expect = 9e-09
Identities = 33/213 (15%), Positives = 74/213 (34%), Gaps = 32/213 (15%)

Query: 43 AATRVDVTEDLPGRVAAV-RVAEIRPQVSGIVQRRLFEQGTEVRAGQPLFQINPAPFKAE 101
+V++ G++ R EI+P + IV+ + ++G VR G L ++ +A+
Sbjct: 76 VLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEAD 135

Query: 102 MDTAAASLQRAQAALERAKVQ----------------TARFKPLVEADAISRQVYDDAVS 145
+SL +A+ R ++ F+ + E + +
Sbjct: 136 TLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSL--IKE 193

Query: 146 QRDQAAADVAQARATLARRQLDLKFATVEAPIAGRIDQALVTEGALVSSSDSQPMA---- 201
Q Q L +++ + TV A I + + V + L D +
Sbjct: 194 QFSTWQNQKYQKELNLDKKRAER--LTVLARINRYENLSRVEKSRL---DDFSSLLHKQA 248

Query: 202 ----RIQQIDQVYVDVRQPAASLEALRDALASQ 230
+ + + YV+ ++ + + S+
Sbjct: 249 IAKHAVLEQENKYVEAVNELRVYKSQLEQIESE 281



Score = 29.0 bits (65), Expect = 0.034
Identities = 11/36 (30%), Positives = 18/36 (50%), Gaps = 1/36 (2%)

Query: 61 RVAEIRPQVSGIVQR-RLFEQGTEVRAGQPLFQINP 95
+ + IR VS VQ+ ++ +G V + L I P
Sbjct: 326 QASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVP 361


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_3632  ACRIFLAVINRP  1090   0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 1090 bits (2821), Expect = 0.0
Identities = 532/1028 (51%), Positives = 718/1028 (69%), Gaps = 7/1028 (0%)

Query: 1 MAEFFIRRPVFAWVIALFIILTGLIAIPQLPVARYPSVAPPSVTITASYPGATPQTMNDG 60
MA FFIRRP+FAWV+A+ +++ G +AI QLPVA+YP++APP+V+++A+YPGA QT+ D
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 61 VLSLIERELSGVKNLLYFESSADTSGQAQITVTFKPGTNPEMAQVDVQNKIKSVEPRLPA 120
V +IE+ ++G+ NL+Y S++D++G IT+TF+ GT+P++AQV VQNK++ P LP
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 121 AVRQNGLIVESASSGFLMIVSLRSDNGRFDEGALADYMARSVSEELRRIDGVGRVLQFGS 180
V+Q G+ VE +SS +LM+ SDN + ++DY+A +V + L R++GVG V FG+
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 181 ERAMRIWVDPQKLINFGLSMSDLTTAIGQQNVQIAPGSLGALPALPGQRVTVPLTAQGQL 240
+ AMRIW+D L + L+ D+ + QN QIA G LG PALPGQ++ + AQ +
Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240

Query: 241 TTPEAFAKVVLRANADGSKVVLGDVARVELGSQNYTFVSRENNKPATLAGVQLAPGANAV 300
PE F KV LR N+DGS V L DVARVELG +NY ++R N KPA G++LA GANA+
Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300

Query: 301 KTADAIRARMAELSKSMPSGMSYSIPLDTSPFVKISIEKVLHTLLEAMVLVFLVMYLFLQ 360
TA AI+A++AEL P GM P DT+PFV++SI +V+ TL EA++LVFLVMYLFLQ
Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360

Query: 361 NVRYTLIPAIVAPVAMLGTFTVMLLTGFSINVLTMFGMVLAIGIIVDDAIVVVENVERLM 420
N+R TLIP I PV +LGTF ++ G+SIN LTMFGMVLAIG++VDDAIVVVENVER+M
Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420

Query: 421 AEEGLSPKDATSKAMKEITGAIIGVTLVLTAVFLPMAMASGSVGVIYKQFTLSMAVSILF 480
E+ L PK+AT K+M +I GA++G+ +VL+AVF+PMA GS G IY+QF++++ ++
Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480

Query: 481 SALLALTLTPALCATMLKPIAAGHHE-KRGFFGWFNRRFDRLTKWYETRVGRLVGRTGRV 539
S L+AL LTPALCAT+LKP++A HHE K GFFGWFN FD Y VG+++G TGR
Sbjct: 481 SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540

Query: 540 MLVFVAISGALVLGFRSLPSSFLPDEDQGYFITSFLLPADATAERTHDVVKTLEKHL--A 597
+L++ I +V+ F LPSSFLP+EDQG F+T LPA AT ERT V+ + +
Sbjct: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600

Query: 598 SRPAIQSSISVIGYGFSGQGSNAAINWSVMKDWKNRGGASTIEEGML--AQQAMAGVTEG 655
+ ++S +V G+ FSGQ NA + + +K W+ R G E ++ A+ + + +G
Sbjct: 601 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDG 660

Query: 656 TVMSLLPPAIDELGNSSGFSMRLEDRANQGAAALKAAEVKLLELAAQSKV-VTGVYPDSL 714
V+ PAI ELG ++GF L D+A G AL A +LL +AAQ + V P+ L
Sbjct: 661 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL 720

Query: 715 PAGTSIRLEIDRAKAQALGVSFTTLSDTLSTAMGSTYVNDFPNAGRMQQVIIQADAPARM 774
+LE+D+ KAQALGVS + ++ T+STA+G TYVNDF + GR++++ +QADA RM
Sbjct: 721 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRM 780

Query: 775 QIDNVMKLYVRNAAGGMVPLSEVVRPVWTDTPLQMVRFKGYPSARIAGNAAPGQSSGAAM 834
++V KLYVR+A G MVP S W ++ R+ G PS I G AAPG SSG AM
Sbjct: 781 LPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAM 840

Query: 835 AEMERLAAQLPPGFAVEWTGQSLQERQSASQAPMLMVLSMIVVFLVLAALYESWSIPLSV 894
A ME LA++LP G +WTG S QER S +QAP L+ +S +VVFL LAALYESWSIP+SV
Sbjct: 841 ALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSV 900

Query: 895 MLVVPLGLIGAIGAVLLRGMPNDVFFKVGMITVIGLSAKNAILIVEFAKQLRE-EGKGLI 953
MLVVPLG++G + A L NDV+F VG++T IGLSAKNAILIVEFAK L E EGKG++
Sbjct: 901 MLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVV 960

Query: 954 EAAVQASKLRLRPILMTSLAFGLGVVPLMIATGASAETQHAIGTGVFGGMVTATVLAIFF 1013
EA + A ++RLRPILMTSLAF LGV+PL I+ GA + Q+A+G GV GGMV+AT+LAIFF
Sbjct: 961 EATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFF 1020

Query: 1014 VPVFFVFV 1021
VPVFFV +
Sbjct: 1021 VPVFFVVI 1028



Score = 80.7 bits (199), Expect = 2e-17
Identities = 57/323 (17%), Positives = 124/323 (38%), Gaps = 17/323 (5%)

Query: 719 SIRLEIDRAKAQALGVSFTTLSDTLSTA---MGSTYVNDFPNA-GRMQQVIIQADAPARM 774
++R+ +D ++ + + L + + + P G+ I A R
Sbjct: 183 AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQT--RF 240

Query: 775 Q-IDNVMKLYVR-NAAGGMVPLSEVVRPVWTDTPLQ--MVRFKGYPSARIAGNAAPGQS- 829
+ + K+ +R N+ G +V L +V R V + R G P+A + A G +
Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVAR-VELGGENYNVIARINGKPAAGLGIKLATGANA 299

Query: 830 ---SGAAMAEMERLAAQLPPGFAVEWT-GQSLQERQSASQAPMLMVLSMIVVFLVLAALY 885
+ A A++ L P G V + + + S + + ++++VFLV+
Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359

Query: 886 ESWSIPLSVMLVVPLGLIGAIGAVLLRGMPNDVFFKVGMITVIGLSAKNAILIVE-FAKQ 944
++ L + VP+ L+G + G + GM+ IGL +AI++VE +
Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419

Query: 945 LREEGKGLIEAAVQASKLRLRPILMTSLAFGLGVVPLMIATGASAETQHAIGTGVFGGMV 1004
+ E+ EA ++ ++ ++ +P+ G++ + M
Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479

Query: 1005 TATVLAIFFVPVFFVFVMSIQER 1027
+ ++A+ P ++
Sbjct: 480 LSVLVALILTPALCATLLKPVSA 502


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
Bcenmc03_3634  HTHFIS  87     6e-22    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 86.8 bits (215), Expect = 6e-22
Identities = 43/182 (23%), Positives = 85/182 (46%), Gaps = 16/182 (8%)

Query: 1 MTNSLILIAEDEPDISDILDAYLKHDGFRTYRVADGQAVLDMQPHLKPDLILLDVKMPRK 60
MT + IL+A+D+ I +L+ L G+ ++ + DL++ DV MP +
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 NGWDVLAELRRRD-NTPVVVLTAFDRDLDRLQALHAGADDYIVKPFNPAEVVARL-RAIL 118
N +D+L +++ + PV+V++A + + ++A GA DY+ KPF+ E++ + RA+
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 119 RRSAAPPTLRMLRVGDLEIDTDSYLARVRTAGTEVPITLTLTEFRLLAHMARSPSRVFTR 178
P L + + S A ++ +R+LA + ++ +
Sbjct: 121 EPKRRPSKLEDDSQDGMPLVGRS--AAMQEI------------YRVLARLMQTDLTLMIT 166

Query: 179 GE 180
GE
Sbjct: 167 GE 168


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits        Score  E-value  Comments
Bcenmc03_3636  SUBTILISIN  45     6e-07    Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 44.8 bits (106), Expect = 6e-07
Identities = 51/243 (20%), Positives = 75/243 (30%), Gaps = 40/243 (16%)

Query: 302 TDTDGTVEWNLDSQSIVGAAGG----SVKQVVFYVAPSMTLTAITAAYNKAVTDNVAKVI 357
T GT+ + +VG A +K V S I A+ V +I
Sbjct: 88 THVAGTIAATENENGVVGVAPEADLLIIK--VLNKQGSGQYDWIIQGIYYAIEQKV-DII 144

Query: 358 NVSLGVCESSANSTGSQATDDTIFKQAVAQGQTFSVSAGDHGAYECASGTPSRSTYTVSE 417
++SLG K+AVA +AG+ G + T +
Sbjct: 145 SMSLG-------GPEDVPELHEAVKKAVASQILVMCAAGNEGDGDD-------RTDELGY 190

Query: 418 PATSPYVIAVGGTTLFTNTSTNAYNSEIVWNDPSWQPGT-VWST--GGGYSKYE----AA 470
P VI+VG + S PG + ST GG Y+ + A
Sbjct: 191 PGCYNEVISVGAINF---DRHASEFSNSNNEVDLVAPGEDILSTVPGGKYATFSGTSMAT 247

Query: 471 P------TWQSSTLTGSTKRALPDVGFDADLRTGAILVVNGQTSDTLWGSGYLNNEGGTS 524
P S +R L + A L I + N S + G+G L
Sbjct: 248 PHVAGALALIKQLANASFERDLTEPELYAQLIKRTIPLGN---SPKMEGNGLLYLTAVEE 304

Query: 525 LAA 527
L+
Sbjct: 305 LSR 307


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits         Score  E-value  Comments
Bcenmc03_3637  IGASERPTASE  54     2e-09    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 53.9 bits (129), Expect = 2e-09
Identities = 30/179 (16%), Positives = 54/179 (30%), Gaps = 21/179 (11%)

Query: 419 APEAFAQHRANAPHAEVPVPHPPGATHDTRQQAMQREHAAQETRAAAQQQQQQRAEMQRH 478
P + + A E PVP P AT + + + +Q Q
Sbjct: 1007 VPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNR 1066

Query: 479 DAAQQ-----REALQQRNAAQQQEHAQAQQRDEAQQQQRVEAQ--QRDEARQQQRS---- 527
+ A++ + Q AQ + Q E ++ VE + + E + Q
Sbjct: 1067 EVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVT 1126

Query: 528 --EAAQQQQRKEMQPHPEAPP--------REHAAPQPRQAAPEHAHAPHPAESHPPHES 576
+ +Q+Q + +QP E +E + A E + P
Sbjct: 1127 SQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTE 1185



Score = 42.0 bits (98), Expect = 7e-06
Identities = 27/188 (14%), Positives = 50/188 (26%), Gaps = 38/188 (20%)

Query: 419 APEAFAQHRANAPHAEVPVPHPPGATHDTRQQAMQREHAAQETRAAAQQ----------- 467
A E AQ+R A A+ V + + +E ET+ A
Sbjct: 1058 ATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETE 1117

Query: 468 QQQQRAEMQRHDAAQQREALQQRNAAQ-QQEHAQAQQRDEAQQQQRVEAQQRDEARQQQR 526
+ Q+ ++ + +Q ++ + A+ +E+ E Q Q A A++
Sbjct: 1118 KTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSS 1177

Query: 527 --------------------------SEAAQQQQRKEMQPHPEAPPREHAAPQPRQAAPE 560
Q E P+ R P P
Sbjct: 1178 NVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPA 1237

Query: 561 HAHAPHPA 568
+ +
Sbjct: 1238 TTSSNDRS 1245



Score = 36.2 bits (83), Expect = 4e-04
Identities = 20/158 (12%), Positives = 41/158 (25%), Gaps = 8/158 (5%)

Query: 423 FAQHRANAPHAEVPVPHPPGATHDTRQQAMQREHAAQETRAAAQQQQQQRAEMQRHDAAQ 482
+ + V T + Q + + E A +
Sbjct: 978 YDLYNPEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETT 1037

Query: 483 QREALQQRNAAQQQEHAQAQQRDEAQQQQRV--EAQQRDEARQQQRSEAAQQQQRKEMQP 540
+ A + ++ E + + Q + V EA+ +A Q +E AQ +
Sbjct: 1038 ETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQT-NEVAQSGSETK--- 1093

Query: 541 HPEAPPREHAAPQPRQAAPEHAHAPHPAESHPPHESRE 578
E E + + + P S+
Sbjct: 1094 --ETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQV 1129


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_3641  ADHESNFAMILY  32     0.006    Adhesin family signature.
		>ADHESNFAMILY#Adhesin family signature.

Length = 309

Score = 32.1 bits (73), Expect = 0.006
Identities = 32/170 (18%), Positives = 53/170 (31%), Gaps = 16/170 (9%)

Query: 242 KEAEAIENEDEEAEEEEEEEEEEEDDGAAQASANAAQLEALKRASLEKFSQISE---WFD 298
A+ I + + +E E +L+ L + S +KF++I
Sbjct: 149 IFAKNIAKQLSAKDPNNKEFYE------KNLKEYTDKLDKLDKESKDKFNKIPAEKKLIV 202

Query: 299 KMRRAFEKEGYKSKAYLKAQETIQSELMTIRFTARTVERLCDTLRAQVDEVRQVERQILH 358
AF Y SKAY I T ++ L + LR VE +
Sbjct: 203 TSEGAF---KYFSKAYGVPSAYIWEINTEEEGTPEQIKTLVEKLRQTKVPSLFVESSVDD 259

Query: 359 IVVDKCGMPRSEFIARFPGSETDLDWAEKITAEGHPYSAVLSRNVPAIRE 408
+ + AE+ EG Y +++ N+ I E
Sbjct: 260 RPMKTV---SQDTNIPIYAQIFTDSIAEQGK-EGDSYYSMMKYNLDKIAE 305


45  Bcenmc03_3687  Bcenmc03_3694  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias   Product
Bcenmc03_3687  0       10       2.472518  sigma-54 dependent transcriptional regulator
Bcenmc03_3688  0       10       2.796372  metal-dependent hydrolase-like protein
Bcenmc03_3689  2       12       3.974804  FKBP-type peptidylprolyl isomerase
Bcenmc03_3690  2       11       4.079278  AraC family transcriptional regulator
Bcenmc03_3691  0       12       3.287338  GCN5-related N-acetyltransferase
Bcenmc03_3692  -1      11       2.741309  major facilitator transporter
Bcenmc03_3693  -1      11       2.130834  HxlR family transcriptional regulator
Bcenmc03_3694  0       11       2.476150  major facilitator transporter
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
Bcenmc03_3687  HTHFIS  384    e-131    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 384 bits (988), Expect = e-131
Identities = 139/383 (36%), Positives = 202/383 (52%), Gaps = 40/383 (10%)

Query: 122 FDYVTLPLPYEWISHVLGHARGMAALDRVDGAAYAASIGEHGMIGNCEAMQQLFSTIRKV 181
+DY+ P + ++G A +A R S ++G AMQ+++ + ++
Sbjct: 99 YDYLPKPFDLTELIGIIGRA--LAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARL 156

Query: 182 AKTDASVFISGESGTGKELTALAIHERSGRGKGPFVAINCGAIPHHLLQSELFGYERGAF 241
+TD ++ I+GESGTGKEL A A+H+ R GPFVAIN AIP L++SELFG+E+GAF
Sbjct: 157 MQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAF 216

Query: 242 TGANQRRAGRIESANGGTLFLDEIGDMPVESQASLLRFLQEGKIERLGGQESIVVDVRII 301
TGA R GR E A GGTLFLDEIGDMP+++Q LLR LQ+G+ +GG+ I DVRI+
Sbjct: 217 TGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIV 276

Query: 302 SATHVDLDGAVEAGRFRADLYHRLCVLRIHEPPLRARGKDIDILAHYVLQKFKADSGRKI 361
+AT+ DL ++ G FR DLY+RL V+ + PPLR R +DI L + +Q+ + + G +
Sbjct: 277 AATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKE-GLDV 335

Query: 362 SGFTSAALDAMRRYEWPGNVRELINRVRRAIVMAESRLLTPHDLGLDTPGET-------- 413
F AL+ M+ + WPGNVREL N VRR + ++T + + E
Sbjct: 336 KRFDQEALELMKAHPWPGNVRELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKA 395

Query: 414 -----------------------------EPVTLEQARALAERTAIENALLRNDHRINKA 444
++ A E I AL KA
Sbjct: 396 AARSGSLSISQAVEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKA 455

Query: 445 AAELGISRVTLYRMMIEHGLNDH 467
A LG++R TL + + E G++ +
Sbjct: 456 ADLLGLNRNTLRKKIRELGVSVY 478


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_3689  INFPOTNTIATR  93     1e-26    Macrophage infectivity potentiator signature.
		>INFPOTNTIATR#Macrophage infectivity potentiator signature.

Length = 233

Score = 92.7 bits (230), Expect = 1e-26
Identities = 46/111 (41%), Positives = 63/111 (56%), Gaps = 2/111 (1%)

Query: 3 VITTESGLKYEDLTEGTGAEAQAGKTVSVHYTGWLTDGQKFDSSKDRNDPFAFVLGGGMV 62
++ SGL+Y+ + GTGA+ TV+V YTG L DG FDS++ P F + V
Sbjct: 121 IVVLPSGLQYKIIDAGTGAKPGKSDTVTVEYTGTLIDGTVFDSTEKAGKPATFQVS--QV 178

Query: 63 IKGWDEGVQGMKVGGVRRLTIPPQLGYGPRGAGGVIPPNATLVFEVELLDV 113
I GW E +Q M G + +P L YGPR GG I PN TL+F++ L+ V
Sbjct: 179 IPGWTEALQLMPAGSTWEVFVPADLAYGPRSVGGPIGPNETLIFKIHLISV 229


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_3691  SACTRNSFRASE  45     2e-08    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 44.6 bits (105), Expect = 2e-08
Identities = 17/55 (30%), Positives = 27/55 (49%)

Query: 80 ITSLVVDESCRGQGVGGALIAAAHSWFESVGCVKLEVTSGDHRLDAHRFYARYGF 134
I + V + R +GVG AL+ A W + L + + D + A FYA++ F
Sbjct: 92 IEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHF 146


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
Bcenmc03_3692  TCRTETB  132    4e-36    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 132 bits (334), Expect = 4e-36
Identities = 82/399 (20%), Positives = 156/399 (39%), Gaps = 21/399 (5%)

Query: 30 LDVTIVNIALAHLAADLHLPVAGLQWVVDAYTLAFAVLMLSAGALGDRFGTRRLYVAGLL 89
L+ ++N++L +A D + P A WV A+ L F++ G L D+ G +RL + G++
Sbjct: 28 LNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGII 87

Query: 90 LFAFASLACGAAVAPA-MLIAARALQGVGAAAMLPNSLALLNDACRHDPRLRARAVSGWT 148
+ F S+ + +LI AR +QG GAAA + ++ R +A
Sbjct: 88 INCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYI--PKENRGKAFGLIG 145

Query: 149 AAGSIAIAAGPVVGGLLIAAWGWRGIFLVNLPLCAAGLAATFAWVPARREQAAPARSTRS 208
+ ++ GP +GG++ W + L+ + R
Sbjct: 146 SIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKE-------VRIKGH 198

Query: 209 LDPRGQFIAIAMLTVLTGAVIEWRPLGFTHPVVAGGFVLAALAALAFVAVESRTATPMLP 268
D +G + + + FT +++ L+ L FV + P +
Sbjct: 199 FDIKGIILMSVGIVFF---------MLFTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVD 249

Query: 269 LSLFRHPAFSTAVLFGICVNLTYYGTVFVLALYLQRARGESALQAGLAFL-PLTGGFLLS 327
L ++ F VL G + T G V ++ ++ S + G + P T ++
Sbjct: 250 PGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIF 309

Query: 328 NLASGRVVARHGPRVPMVAGALVAALGYGSLHFVDAATPLGVLLVPFLLIPSGMGFAVPA 387
G +V R GP + G ++ + + F+ T + + + + G+ F
Sbjct: 310 GYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSW-FMTIIIVFVLGGLSFTKTV 368

Query: 388 MTTAVLASVAPERAGIASAVLNTARQAGGAIGVAAFGAL 426
++T V +S+ + AG ++LN G+A G L
Sbjct: 369 ISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGL 407


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
Bcenmc03_3694  TCRTETB  35     6e-04    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 34.9 bits (80), Expect = 6e-04
Identities = 34/158 (21%), Positives = 59/158 (37%), Gaps = 1/158 (0%)

Query: 53 LDSIARDFGVSQAAVGGVITATQLGCALALLFVVPLGDLLNRKRLIAVQLVLLSAACIGV 112
L IA DF A+ V TA L ++ L D L KRL+ +++ +
Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIG 96

Query: 113 ATAPTRGALLAGMVAIGLLGTAMTQGLIACSAA-LVGAGERGRVVGAAQGGVVVGLLAAR 171
+ +LL I G A L+ A + RG+ G V +G
Sbjct: 97 FVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGP 156

Query: 172 SLAGVVTDIAGWRAVYLVSGALAIAMLVVLSRLLPDMR 209
++ G++ W + L+ I + ++ L ++R
Sbjct: 157 AIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVR 194


46  Bcenmc03_3838  Bcenmc03_3844  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias   Product
Bcenmc03_3838  -3      12       0.475174  porin
Bcenmc03_3839  -3      13       1.130194  hypothetical protein
Bcenmc03_3840  -2      12       1.611617  hypothetical protein
Bcenmc03_3841  -2      11       1.854280  heavy metal sensor signal transduction histidine
Bcenmc03_3842  -2      12       2.130766  two component heavy metal response
Bcenmc03_3843  -2      11       1.990917  CzcA family heavy metal efflux protein
Bcenmc03_3844  -2      9        2.083356  RND family efflux transporter MFP subunit
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_3838  ECOLNEIPORIN  88     9e-22    E.coli/Neisseria porin superfamily signature.
		>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature.

Length = 331

Score = 88.3 bits (219), Expect = 9e-22
Identities = 91/381 (23%), Positives = 132/381 (34%), Gaps = 66/381 (17%)

Query: 14 LLTASHAAHATEVTLYG----LFDTSLTVVWNADAQGRNLVGLGNGNLLGNRFGVKGAED 69
L A A +VTLYG +TS +V N G G +L G++ G KG ED
Sbjct: 9 TLAALPVAAMADVTLYGTIKAGVETSRSVAHNGAQAASVETGTGIVDL-GSKIGFKGQED 67

Query: 70 LGGGLKAIFTLENGFNPNTGALGQGNRMFGRQAFVGLESARWGTLTLGRQYDALADV--- 126
LG GLKAI+ +E G + RQ+F+GL +G L +GR L D
Sbjct: 68 LGNGLKAIWQVEQ----KASIAGTDSGWGNRQSFIGL-KGGFGKLRVGRLNSVLKDTGDI 122

Query: 127 -AWPITGDFYFGSVYATPGDVDNYDTSSRTDNAVKYTSPVVGGFQFVGMYALGGVAGKSG 185
W D+ + A P + S V+Y SP G YAL AG+
Sbjct: 123 NPWDSKSDYLGVNKIAEP---EARLIS------VRYDSPEFAGLSGSVQYALNDNAGRHN 173

Query: 186 AGQTWSAGLSYNHGPFDAAGGYYHAANRASLANGVRTGWNGTSDGTFDGSLVNGGYISAK 245
+++ AG +Y +G F G + + N + + + GY
Sbjct: 174 -SESYHAGFNYKNGGFFVQYGG-------AYKRHHQVQENVNIE-KYQIHRLVSGY-DND 223

Query: 246 SIGIARGALRYTFAPFTIGIDYSNAQYKADAMSAFRSTQKYDTARGFFNYQATASLLVGV 305
++ + + N+Q + A A+R F N S G
Sbjct: 224 ALYASVAVQQQDAKLVEENYS-HNSQTEVAATLAYR----------FGNVTPRVSYAHGF 272

Query: 306 GYSYTKARGDTSATYHQVSAGADYVLSKRTDLYAVGAWQRANGEQRTLDGGTQAAQASIG 365
S+ + Y QV GA+Y SKRT W +
Sbjct: 273 KGSFDATNYNN--DYDQVVVGAEYDFSKRTSALVSAGWLQEG------------------ 312

Query: 366 SYGYGGTRTQGIVNLGLRHRF 386
+GLRH+F
Sbjct: 313 --KGESKFVSTAGGVGLRHKF 331


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_3842  HTHFIS        82     3e-20    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 81.8 bits (202), Expect = 3e-20
Identities = 40/135 (29%), Positives = 63/135 (46%), Gaps = 1/135 (0%)

Query: 2 RILIVEDEPKTGAYLRKGLTEAGYVVDWVEDGITGQHQAETEEYDLLVLDVMLPGQDGWT 61
IL+ +D+ L + L+ AGY V + T + DL+V DV++P ++ +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 LLQNLR-RSKSTPVLFLTARDDVGDRVKGLELGADDYLAKPFDFVELTARIKSILRRGQP 120
LL ++ PVL ++A++ +K E GA DYL KPFD EL I L +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 121 RDSNTLRVADLELDL 135
R S + + L
Sbjct: 125 RPSKLEDDSQDGMPL 139


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_3843  ACRIFLAVINRP  819    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 819 bits (2116), Expect = 0.0
Identities = 234/1065 (21%), Positives = 433/1065 (40%), Gaps = 62/1065 (5%)

Query: 5 LIRFAIAHRWLVMLAIAAVAALGVFSYQRLPIDAVPDITNVQVQINTSAPGYSPLEAEQR 64
+ F I + + G + +LP+ P I V ++ + PG +
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 65 ITYPVETVMAGLPGLEQTRSIS-RYGLSQVTVIFKDGTDIYFARQLVNERIQEAKDKLPP 123
+T +E M G+ L S S G +T+ F+ GTD A+ V ++Q A LP
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 124 GIAPAMGPTSTGLGEIYLWTVEADANARKPDGTRYTAADLRELQDWVVRPQLRNVRGVTE 183
+ YL D T D+ + V+ L + GV +
Sbjct: 121 EVQQQGISVEKSSS-SYLMVA-----GFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGD 174

Query: 184 VNSIGGYVKEYRVAPNPAKLMSYGLTLADVVRALERNNDNVGAGYI------EKRGEQYL 237
V G R+ + L Y LT DV+ L+ ND + AG + +
Sbjct: 175 VQLFGAQ-YAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNAS 233

Query: 238 VRVPGQARTVDDIANIVL-TNVGGVPVRMKDVGVVDIGRELRTGAATSNGEEVVLGTVFM 296
+ + + ++ + L N G VR+KDV V++G E A NG+ + +
Sbjct: 234 IIAQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKL 293

Query: 297 LMGENSRTVSKAVAAKMEDVNRTLPAGVKAIPVYDRTVLVEKAVATVKKNLLEGAVLVIA 356
G N+ +KA+ AK+ ++ P G+K + YD T V+ ++ V K L E +LV
Sbjct: 294 ATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFL 353

Query: 357 VLFLFLGNIRAALITALVIPLSMLMTFTGMVNAKVSANLMSLG--ALDFGIIVDGAVVIV 414
V++LFL N+RA LI + +P+ +L TF + S N +++ L G++VD A+V+V
Sbjct: 354 VMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVV 413

Query: 415 ENCVRRLAHAQSAAGRPLTRDERFAEVFGASQEARRALIFGQLIIMVVYLPIFALTGVEG 474
EN R + + + + + AL+ +++ V++P+ G G
Sbjct: 414 ENVERVMMEDKLP---------PKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTG 464

Query: 475 KMFHPMAITVVMALAAAMVLTVTFIPAAVALFIGERVE---EKENRLMGWARRA------ 525
++ +IT+V A+A ++++ + PA A + E + GW
Sbjct: 465 AIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVN 524

Query: 526 -YEPVLAAFMTRPARVMIGAGAIVLVTLGLATRLGSEFIPSLNEGDLAVSALRIPGTSLS 584
Y + + R ++ IV + L RL S F+P ++G G +
Sbjct: 525 HYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQE 584

Query: 585 QSVE-MQKSIEKTLKARFPEIERVFARTGTAEIAADPMPPNLSDGYIMLKPADTWPDPKK 643
++ + + + + LK +E VF G + N ++ LKP + +
Sbjct: 585 RTQKVLDQVTDYYLKNEKANVESVFTVNG---FSFSGQAQNAGMAFVSLKPWEERNGDEN 641

Query: 644 PRDRLVREIEEALAELP-GNAYEFSQPIQLRFNELISGVRSDVA-VKIFGDDMAVLNQTG 701
+ ++ + L ++ G F+ P EL + D + G L Q
Sbjct: 642 SAEAVIHRAKMELGKIRDGFVIPFNMP---AIVELGTATGFDFELIDQAGLGHDALTQAR 698

Query: 702 EQIAAALQKVPGA-SEVKVEQTTGLPVLTVNLDRDKLARYGVSVADLQDSVAAAVGGQKA 760
Q+ + P + V+ + +D++K GVS++D+ +++ A+GG
Sbjct: 699 NQLLGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYV 758

Query: 761 GTLFQGDRRFDIVVRLPDELRSDIEAIKRLPIALPAPAAGASAPLAAAPYVPLAELATID 820
R + V+ + R E + +L + A VP + T
Sbjct: 759 NDFIDRGRVKKLYVQADAKFRMLPEDVDKLYV-----------RSANGEMVPFSAFTTSH 807

Query: 821 VAPGPNQISREDGKRRVVVSANVRGRDVGSFVADAREQLQQ-DVRVPAGYWVSWGGQFEQ 879
G ++ R +G + + G+ DA ++ ++PAG W G Q
Sbjct: 808 WVYGSPRLERYNGLPSMEIQGEAAP---GTSSGDAMALMENLASKLPAGIGYDWTGMSYQ 864

Query: 880 LQSASERLKLVVPLALFMVFVLLFVMFNNVKDGLLVFTGIPFALSGGVVSLWLRGIPLSI 939
+ + + +V ++ +VF+ L ++ + + V +P + G +++ L +
Sbjct: 865 ERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDV 924

Query: 940 TAAVGFIALSGVAVLNGLVMISFIRNLRD-EGMPLDAAVHDGALTRLRPVLMTALVASLG 998
VG + G++ N ++++ F ++L + EG + A RLRP+LMT+L LG
Sbjct: 925 YFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILG 984

Query: 999 FLPMAFATGTGAEVQRPLATVVIGGILSSTALTLLVLPVLYRVSH 1043
LP+A + G G+ Q + V+GG++S+T L + +PV + V
Sbjct: 985 VLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVIR 1029



Score = 87.2 bits (216), Expect = 2e-19
Identities = 76/532 (14%), Positives = 154/532 (28%), Gaps = 56/532 (10%)

Query: 2 FERLIRFAIAHRWLVMLAIAAVAALGVFSYQRLPIDAVPDITNVQVQINTSAPGYSPLEA 61
+ + + +L A + A V + RLP +P+ P + E
Sbjct: 526 YTNSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQER 585

Query: 62 EQRITYPVETVMAGLPGLEQTRSISRYGLSQ---------VTVIFKDGTDIYFARQLVNE 112
Q++ V + G S V K +
Sbjct: 586 TQKVLDQVTDYYLKNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEA 645

Query: 113 RIQEAKDKLPP----GIAPAMGPTSTGLGEIYLWTVEADANARKPDGTRYTAADLRELQD 168
I AK +L + P P LG A + L
Sbjct: 646 VIHRAKMELGKIRDGFVIPFNMPAIVELG---------TATGFDFELIDQAGLGHDALTQ 696

Query: 169 WV---------VRPQLRNVRGVTEVNSIGGYVKEYRVAPNPAKLMSYGLTLADVVRALER 219
L +VR ++ ++++ + K + G++L+D+ + +
Sbjct: 697 ARNQLLGMAAQHPASLVSVRPNGLEDT-----AQFKLEVDQEKAQALGVSLSDINQTIST 751

Query: 220 NNDNVGAGYIEKRGEQY--LVRVPGQAR-TVDDIANIVLTNVGGVPVRMKDVGVVDIGRE 276
RG V+ + R +D+ + + + G V
Sbjct: 752 ALGGTYVNDFIDRGRVKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWV-- 809

Query: 277 LRTGAATSNGEEVVLGTVFMLMGENSRTVSKAVAAKMEDVNRTLPAGVKAIPVYDRTVLV 336
G+ + ++ + T S A ME++ LPAG+ +
Sbjct: 810 --YGSPRLERYNGLP-SMEIQGEAAPGTSSGDAMALMENLASKLPAGI-GYDWTGMSYQE 865

Query: 337 EKAVATVKKNLLEGAVLVIAVLFLFLGNIRAALITALVIPLSMLMTFTGMVNAKVSANLM 396
+ + V+V L + + LV+PL ++ ++
Sbjct: 866 RLSGNQAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVY 925

Query: 397 SLGAL--DFGIIVDGAVVIVENCVRRLAHAQSAAGRPLTRDERFAEVFGASQEARRALIF 454
+ L G+ A++IVE G+ + A + R ++
Sbjct: 926 FMVGLLTTIGLSAKNAILIVE----FAKDLMEKEGKGV-----VEATLMAVRMRLRPILM 976

Query: 455 GQLIIMVVYLPIFALTGVEGKMFHPMAITVVMALAAAMVLTVTFIPAAVALF 506
L ++ LP+ G + + I V+ + +A +L + F+P +
Sbjct: 977 TSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVI 1028


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_3844  RTXTOXIND     54     8e-10    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 53.7 bits (129), Expect = 8e-10
Identities = 54/361 (14%), Positives = 115/361 (31%), Gaps = 42/361 (11%)

Query: 83 DGKPVDKG---VTVSGTLVRYDRTHAPLRFDAAGQKFVSAQSIAKPHVFDATIDVKAGND 139
+G+ V KG + ++ D A + Q +++ + ++K ++
Sbjct: 114 EGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDE 173

Query: 140 AASFPFARADGAIALTDAQLATSRIALAKAGPAQIATPFQLPGEIKFNEDRTAHVVPRVA 199
F L L + + + Q +L + K E T
Sbjct: 174 PY---FQNVSEEEVLRLTSLIKEQFSTWQNQKYQK----ELNLDKKRAERLTVLARINRY 226

Query: 200 GIVEQVSVSLGQNVAKGQVLA---VIASTDLADRRSELLTAERRLS---GARATYERERT 253
E +S + L IA + ++ ++ + A L E E
Sbjct: 227 ---ENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEIL 283

Query: 254 LWKERIS-AEQDYQQ-AQVQLREAEIAVQNARQKLAALNAPVGAGALNRYELRAPFAGTI 311
KE Q ++ +LR+ + +LA +RAP + +
Sbjct: 284 SAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEE-----RQQASVIRAPVSVKV 338

Query: 312 VE-KHATPGEAI-AADASMFVISDLSTVWAEMAVPAQRLNDVRVGRDATVSATAFESRSS 369
+ K T G + A+ M ++ + T+ V + + + VG++A + AF
Sbjct: 339 QQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRY 398

Query: 370 GPI----AYVG--SLLGEQTRT-APARVVLP-------NPDRVWRPGMFVNVSVDAGRQA 415
G + + ++ ++ + + N + GM V + G ++
Sbjct: 399 GYLVGKVKNINLDAIEDQRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGMRS 458

Query: 416 V 416
V
Sbjct: 459 V 459



Score = 38.7 bits (90), Expect = 5e-05
Identities = 23/132 (17%), Positives = 46/132 (34%), Gaps = 11/132 (8%)

Query: 181 PGEIKFNEDRTAHVVPRVAGIVEQVSVSLGQNVAKGQVLAVIASTDLADRRSELLTAERR 240
G++ + + P IV+++ V G++V KG VL + + ++ L +
Sbjct: 87 NGKLTHSGRSKE-IKPIENSIVKEIIVKEGESVRKGDVLLKLTALGA---EADTLKTQSS 142

Query: 241 LSGARATYERERTLWKE-------RISAEQDYQQAQVQLREAEIAVQNARQKLAALNAPV 293
L AR R + L + + + V E +++ +
Sbjct: 143 LLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQK 202

Query: 294 GAGALNRYELRA 305
LN + RA
Sbjct: 203 YQKELNLDKKRA 214


47  Bcenmc03_3855  Bcenmc03_3862  N        Y        N        Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
Bcenmc03_3855  -2      14       3.463322   TetR family transcriptional regulator
Bcenmc03_3856  0       15       2.916214   short-chain dehydrogenase/reductase SDR
Bcenmc03_3857  0       11       2.050798   FAD dependent oxidoreductase
Bcenmc03_3858  0       10       2.203782   alpha/beta hydrolase fold protein
Bcenmc03_3859  0       10       1.760376   carboxymuconolactone decarboxylase
Bcenmc03_3860  -2      11       1.839324   lipoprotein
Bcenmc03_3861  -3      11       0.341342   hypothetical protein
Bcenmc03_3862  -1      14       0.967874   Bcr/CflA subfamily drug resistance transporter
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_3855  HTHTETR       55     1e-11    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 55.0 bits (132), Expect = 1e-11
Identities = 24/179 (13%), Positives = 48/179 (26%), Gaps = 15/179 (8%)

Query: 22 RAAERRDALIRAATRVFGTVGFRKATVRSICQEAKLNDRYFYAAFDSTEDLLRCTYLHHA 81
A E R ++ A R+F G ++ I + A + Y F DL +
Sbjct: 8 EAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSE 67

Query: 82 QQLHDAVAQAVAARGGELRERVDAGLAAFFAFLRDPCAARVLLLEVMGVSADT------- 134
+ + + A G+ + L R LL+E++ +
Sbjct: 68 SNIGELELEYQAKFPGDPLSVLREILIHVLESTVTE-ERRRLLMEIIFHKCEFVGEMAVV 126

Query: 135 ----DMTYQRMLIDFGKLIMAIGAPGEAVTPAERTEQRLIGLALVGAMTNVGAAWLLTD 189
+ + R + + G ++ + WL
Sbjct: 127 QQAQRNLCLESYDRIEQTLKHCIEAKMLPAD---LMTRRAAIIMRGYISGLMENWLFAP 182


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_3856  DHBDHDRGNASE  67     6e-15    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 66.6 bits (162), Expect = 6e-15
Identities = 50/193 (25%), Positives = 82/193 (42%), Gaps = 2/193 (1%)

Query: 2 KGFSGKVAAITGAGSGMGRSLAVELARRGCEVALADVNETGLAGTAAACAQHGVRVSTRR 61
KG GK+A ITGA G+G ++A LA +G +A D N L ++
Sbjct: 4 KGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFP 63

Query: 62 LDVADRDAVFAWADFVRAEHGKVNLIFNNAGVSLAASAETARLADLEWIVGINFWGVVHG 121
DV D A+ + E G ++++ N AGV + + E +N GV +
Sbjct: 64 ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNA 123

Query: 122 TQAFLPHLRASGDGHVVNTSSLFGLVAMPTQSAYNATKFAVRGFTEALRMELELDGAPVS 181
+++ ++ G +V S V + +AY ++K A FT+ L +EL +
Sbjct: 124 SRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELA--EYNIR 181

Query: 182 ATCVHPGGVATSI 194
V PG T +
Sbjct: 182 CNIVSPGSTETDM 194


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_3860  cloacin       34     0.001    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 34.3 bits (78), Expect = 0.001
Identities = 36/111 (32%), Positives = 49/111 (44%), Gaps = 11/111 (9%)

Query: 248 GTLTGGLGGGSSSGSGGTSGTSSGGPLAPITGLLGTVTGALGGIGSSGTSGTGGTSGTGG 307
G G+GGG+S GSG +S + G G + G SG GG +GG
Sbjct: 23 GPTGLGVGGGASDGSGWSSENNPWG---------GGSGSGIHWGGGSGHGNGGGNGNSGG 73

Query: 308 TSGTGGAGLGGLLAPVTNLVNALTPLGASLTGTVTTPGGNLSGTLGGVLTS 358
SGTGG L + APV AL+ GA V+ G LS + ++ +
Sbjct: 74 GSGTGG-NLSAVAAPVAFGFPALSTPGAGGLA-VSISAGALSAAIADIMAA 122



Score = 30.8 bits (69), Expect = 0.011
Identities = 28/115 (24%), Positives = 43/115 (37%)

Query: 165 GGVTLLGTPLNGLLSTLGSGLGLAGTKVGGATDNPVGAGLGGVVTQLGNTVTSTGGLVHD 224
G +NG + LG G G + + +NP G G G + G + GG +
Sbjct: 11 TGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGN 70

Query: 225 NNAGSSSSGTGGSNPLAPITGLLGTLTGGLGGGSSSGSGGTSGTSSGGPLAPITG 279
+ GS + G + G T G GG + S S G + +A + G
Sbjct: 71 SGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMAALKG 125


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_3862  TCRTETB       64     3e-13    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 64.1 bits (156), Expect = 3e-13
Identities = 35/138 (25%), Positives = 64/138 (46%), Gaps = 2/138 (1%)

Query: 40 VPEMPSALHTSPAMVQLTLSVYMVVLGLGQLMFGPLSDRLGRRPVLLGGALLFSVASLAL 99
+P++ + + PA + +M+ +G ++G LSD+LG + +LL G ++ S+
Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIG 96

Query: 100 AMAGSG-GVFVALRLLQALGASAALVATFATVRDVYADRPEGSTLYSQFGAILAFVPALG 158
+ S + + R +Q GA AA A V Y + + G+I+A +G
Sbjct: 97 FVGHSFFSLLIMARFIQGAGA-AAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVG 155

Query: 159 PMLGAGIAHGFGWRAIFM 176
P +G IAH W + +
Sbjct: 156 PAIGGMIAHYIHWSYLLL 173


48  Bcenmc03_3925  Bcenmc03_3932  N        Y        N        Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
Bcenmc03_3925  0       12       1.121145   two component LuxR family transcriptional
Bcenmc03_3926  0       12       1.040804   histidine kinase
Bcenmc03_3927  0       12       0.467200   plasmid stabilization system protein
Bcenmc03_3928  0       12       0.686862   hypothetical protein
Bcenmc03_3929  0       14       0.730339   major facilitator transporter
Bcenmc03_3930  0       13       1.506092   ABC nitrate/sulfonate/bicarbonate transporter,
Bcenmc03_3931  1       14       1.807935   two component transcriptional regulator
Bcenmc03_3932  0       14       1.638994   histidine kinase
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_3925  HTHFIS        68     1e-15    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 68.3 bits (167), Expect = 1e-15
Identities = 32/116 (27%), Positives = 53/116 (45%), Gaps = 2/116 (1%)

Query: 2 IRVILADDHAVMRDGLRHILERAGGFEIVGEASDGSGTLALAERAAADVLLLDLSMPAPT 61
+++ADD A +R L L RAG V S+ + D+++ D+ MP
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAG--YDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 62 GIELIRLVKRRAPSLRTLVLTMHAETQYAARAFKAGATGYLTKDSATAELVEAVGK 117
+L+ +K+ P L LV++ A +A + GA YL K EL+ +G+
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_3926  GPOSANCHOR    36     3e-04    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 36.2 bits (83), Expect = 3e-04
Identities = 28/109 (25%), Positives = 50/109 (45%), Gaps = 5/109 (4%)

Query: 303 RALAGARYQHLLVVQKSAQLNDANERLEQRVAARTAQ---LSASNRDLRREVEERVRAER 359
+AL GA K L LE A Q L+A+ + LRR+++ A++
Sbjct: 267 KALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKK 326

Query: 360 ALQASREELREIAAISASAREAEQRRI--ARELHDELAQTLATLKNDLE 406
L+A ++L E IS ++R++ +R + +RE +L L+ +
Sbjct: 327 QLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQNK 375


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_3931  HTHFIS        94     2e-24    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 93.7 bits (233), Expect = 2e-24
Identities = 42/145 (28%), Positives = 69/145 (47%), Gaps = 3/145 (2%)

Query: 2 RILVVEDDAEIGAAIRSRLARLGHAVDLETDGATANGLLRVERFDLVVLDANLPGMDGFT 61
ILV +DDA I + L+R G+ V + ++ AT + DLVV D +P + F
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 VLRNLRASGSTTPVLLVTARSAIDDRVSGLGLGADDYLVKPFD---YRELDARVQALLRR 118
+L ++ + PVL+++A++ + GA DYL KPFD + R A +R
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 119 NSGHANDVLTLGGLVIDRSSRLAEL 143
D G ++ RS+ + E+
Sbjct: 125 RPSKLEDDSQDGMPLVGRSAAMQEI 149


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_3932  PF06580       30     0.016    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 30.2 bits (68), Expect = 0.016
Identities = 17/102 (16%), Positives = 32/102 (31%), Gaps = 24/102 (23%)

Query: 367 NALLHGHA---DDIVVSVATMGNADQAVTLTVTDNGRGMPREHWDAALQPFVRIAPDGSE 423
N + HG A + + + VTL V + G +
Sbjct: 266 NGIKHGIAQLPQGGKILLKGTKDNGT-VTLEVENTGSLALK------------------N 306

Query: 424 RRTGSGLGLAIVQEVMKA-HGGRVGFAFPDA-GGFAVVLTFP 463
+ +G GL V+E ++ +G + G ++ P
Sbjct: 307 TKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIP 348


49  Bcenmc03_4020  Bcenmc03_4027  N        Y        N        Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
Bcenmc03_4020  -1      22       -3.337146  GntR family transcriptional regulator
Bcenmc03_4021  -1      19       -4.120685  alpha/beta hydrolase fold protein
Bcenmc03_4022  -1      16       -3.644423  hypothetical protein
Bcenmc03_4023  -1      15       -3.938533  LysR family transcriptional regulator
Bcenmc03_4024  0       15       -3.452414  GCN5-related N-acetyltransferase
Bcenmc03_4025  0       12       -3.132301  GCN5-related N-acetyltransferase
Bcenmc03_4026  1       11       -3.144888  malic enzyme
Bcenmc03_4027  1       13       -4.044610  LysR family transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4020  BCTLIPOCALIN  30     0.008    Bacterial lipocalin signature.
		>BCTLIPOCALIN#Bacterial lipocalin signature.

Length = 171

Score = 30.4 bits (68), Expect = 0.008
Identities = 22/91 (24%), Positives = 34/91 (37%), Gaps = 29/91 (31%)

Query: 287 DGALERHLPRITAAYRRKCDAMCDALRDGFGDAIEFHRPEGGMFVWARLGAVSTDVLLQQ 346
D + ER L ++TA YR + D L G+ E G + ++
Sbjct: 45 DHSFERGLSQVTAEYRVRNDGGISVLNRGYS-------EEKGEW--------------KE 83

Query: 347 AIANKIVFVPGKAFFADNVDAASLRLSFAAP 377
A GKA+F + L++SF P
Sbjct: 84 A--------EGKAYFVNGSTDGYLKVSFFGP 106


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4024  SACTRNSFRASE  31     6e-04    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 31.5 bits (71), Expect = 6e-04
Identities = 17/59 (28%), Positives = 30/59 (50%), Gaps = 9/59 (15%)

Query: 67 VVDIAVLPIHQKKGVGDLIMRALMDYIHENAP-----PTAYVSLMADHGTPKFYERYGF 120
+ DIAV ++KKGVG ++ +++ EN T +++ A H FY ++ F
Sbjct: 92 IEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACH----FYAKHHF 146


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4025  SACTRNSFRASE  40     8e-07    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 39.9 bits (93), Expect = 8e-07
Identities = 11/63 (17%), Positives = 23/63 (36%), Gaps = 3/63 (4%)

Query: 78 VDAIFVRPSHMGRGIGRTMLRFLEALAAEHGVVEMRLDATLNAAP---FYRSCGWTGDSI 134
++ I V + +G+G +L A E+ + L+ FY + ++
Sbjct: 92 IEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFIIGAV 151

Query: 135 STY 137
T
Sbjct: 152 DTM 154


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4027  PF05043       31     0.005    Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 31.5 bits (71), Expect = 0.005
Identities = 15/55 (27%), Positives = 31/55 (56%)

Query: 22 KIRHLVLLLQIQQHGSLTRVAEHMASSQPAVTNALSELESMFGTPLFERSSRGMR 76
++ L LL + ++ + +AE + ++ AV + LS ++S F +F S+ G+R
Sbjct: 12 QLELLELLFEHKRWFHRSELAELLNCTERAVKDDLSHVKSAFPDLIFHSSTNGIR 66


50  Bcenmc03_4103  Bcenmc03_4108  N        Y        N        Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
Bcenmc03_4103  -1      9        0.729311   outer membrane efflux protein
Bcenmc03_4104  0       12       0.636693   hypothetical protein
Bcenmc03_4105  2       10       0.336360   hypothetical protein
Bcenmc03_4106  1       10       -0.253331  two component heavy metal response
Bcenmc03_4107  2       12       0.288166   heavy metal sensor signal transduction histidine
Bcenmc03_4108  3       11       -0.405902  major facilitator transporter
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4103  PF05616       30     0.021    Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 30.1 bits (67), Expect = 0.021
Identities = 15/47 (31%), Positives = 22/47 (46%), Gaps = 2/47 (4%)

Query: 452 LPLPGAVPSATHSAESAPQSPSSADAAPQPASTPAAPSAPATQPHPE 498
+P P P + + + P S A PA+ PA P T+P+PE
Sbjct: 310 IPRPDLTPGSAEAPNAQPLPEVSP--AENPANNPAPNENPGTRPNPE 354


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4106  HTHFIS        88     3e-22    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 87.6 bits (217), Expect = 3e-22
Identities = 36/123 (29%), Positives = 59/123 (47%), Gaps = 1/123 (0%)

Query: 2 RILIVEDEPKMASYLRKGLTEASYTVDVAENGQDGLFLALHEDFDLIVLDVMLPALDGFE 61
IL+ +D+ + + L + L+ A Y V + N D DL+V DV++P + F+
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 VLRRLRA-QKQTPVLLLTAREAIEDKVAGLELGADDYLLKPFAYAEFLARIRSLLRRAPR 120
+L R++ + PVL+++A+ + E GA DYL KPF E + I L R
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 121 NVR 123

Sbjct: 125 RPS 127


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4107  PF06580       38     9e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 37.5 bits (87), Expect = 9e-05
Identities = 29/187 (15%), Positives = 59/187 (31%), Gaps = 38/187 (20%)

Query: 291 IEECERLQRMIENM--LFLARTDNARQHLKTVELDAGSELRRLASYFHA----LADEAGV 344
+E+ + + M+ ++ L + ++ EL + SY D
Sbjct: 187 LEDPTKAREMLTSLSELMRYSLRYSNARQVSLA----DELTVVDSYLQLASIQFEDRLQF 242

Query: 345 HIDVHGEAPVVADATLFRRAVSNLASNALEHA----EAASTIELAVSTQGGYAVVEVTNR 400
+ P + D + V L N ++H I L + G +EV N
Sbjct: 243 ENQI---NPAIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENT 299

Query: 401 GVAIPPEQVERIFERFYRIDSSRHGAARNAGLGLAIVKSIIELHRGK---VEVASRDGRT 457
G E + G GL V+ +++ G ++++ + G+
Sbjct: 300 GSLALKNTKE------------------STGTGLQNVRERLQMLYGTEAQIKLSEKQGKV 341

Query: 458 TFALYFP 464
+ P
Sbjct: 342 NAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4108  TCRTETA       44     7e-07    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 44.0 bits (104), Expect = 7e-07
Identities = 65/358 (18%), Positives = 128/358 (35%), Gaps = 26/358 (7%)

Query: 47 GASATTIGLIEGIAEATSPVVKVFSGTLSDYLRNRKWLAVAGYALGALSKPLFAIAPTIG 106
G++ + G LSD R+ + + A A+ + A AP +
Sbjct: 39 NDVTAHYGILLALYALMQFACAPVLGALSDRF-GRRPVLLVSLAGAAVDYAIMATAPFLW 97

Query: 107 VVVTARIVDRVGKGIRGAPRDALVADVTPVHLRGAAYGLRQSLDTVGAFLGPLLAVAIML 166
V+ RIV + G GA A +AD+T R +G + G GP+L L
Sbjct: 98 VLYIGRIVAGI-TGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLG---GL 153

Query: 167 MWRDDFRLAFWLAVIPGVLAVALLAVGIDEPARAPGEKRVNPIRREVVAQLGARYWWVVA 226
M F+ A L + E + P+RRE + A + W
Sbjct: 154 MGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKGER----RPLRREAL-NPLASFRWARG 208

Query: 227 VGGV-------FVLARFSEAFLVLRAMGS----GVPVALVPLVMVAMNVVYAL-SAYPFG 274
+ V F++ + L + + + + A ++++L A G
Sbjct: 209 MTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITG 268

Query: 275 KLADTTSHTKLMVVGLALLIAADLVLAHGAHWPTVLVGVALWGLHMGMTQGLLAMMVAQA 334
+A + +++G+ ++LA + L G+ L M+++
Sbjct: 269 PVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLAS-GGIGMPALQAMLSRQ 327

Query: 335 APAELRGTAFGVFNLISGIVTLVSSVVAGVLWDRAGAAATFYAGAIFSAATIVLLVCV 392
E +G G ++ + ++V ++ ++ A+ T + G + A + L+C+
Sbjct: 328 VDEERQGQLQGSLAALTSLTSIVGPLLFTAIYA---ASITTWNGWAWIAGAALYLLCL 382



Score = 29.4 bits (66), Expect = 0.027
Identities = 30/165 (18%), Positives = 61/165 (36%), Gaps = 19/165 (11%)

Query: 241 LVLRAMGSGVPVALVPLVM--------VAMNVVYALSAYPF---------GKLADTTSHT 283
+ L A+G G+ + ++P ++ V + L+ Y G L+D
Sbjct: 14 VALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRR 73

Query: 284 KLMVVGLALLIAADLVLAHGAHWPTVLVGVALWGLHMGMTQGLLAMMVAQAAPAELRGTA 343
+++V LA ++A + +G + G+ G T + +A + R
Sbjct: 74 PVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGI-TGATGAVAGAYIADITDGDERARH 132

Query: 344 FGVFNLISGIVTLVSSVVAGVLWDRAGAAATFYAGAIFSAATIVL 388
FG + G + V+ G++ A F+A A + +
Sbjct: 133 FGFMSACFGFGMVAGPVLGGLMGG-FSPHAPFFAAAALNGLNFLT 176


51  Bcenmc03_4214  Bcenmc03_4228  N        Y        N        Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
Bcenmc03_4214  -1      11       0.646195   arsenical-resistance protein
Bcenmc03_4215  0       9        2.657788   protein tyrosine phosphatase
Bcenmc03_4216  -1      9        2.506414   ArsR family transcriptional regulator
Bcenmc03_4217  -1      9        3.527865   hypothetical protein
Bcenmc03_4218  -2      14       2.411725   porin opacity type
Bcenmc03_4219  -1      13       2.426095   two component LuxR family transcriptional
Bcenmc03_4220  -1      13       1.977619   histidine kinase
Bcenmc03_4221  -2      15       1.498035   two component transcriptional regulator
Bcenmc03_4222  3       13       0.071432   Hpt sensor hybrid histidine kinase
Bcenmc03_4223  3       14       -0.303177  YadA domain-containing protein
Bcenmc03_4224  3       13       -0.851879  two component LuxR family transcriptional
Bcenmc03_4225  3       14       -0.799114  two component LuxR family transcriptional
Bcenmc03_4226  3       13       -0.452013  OmpA/MotB domain-containing protein
Bcenmc03_4227  3       13       -0.705939  YadA domain-containing protein
Bcenmc03_4228  1       8        0.463018   Hpt sensor hybrid histidine kinase
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4214  ACRIFLAVINRP  29     0.043    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 28.7 bits (64), Expect = 0.043
Identities = 12/66 (18%), Positives = 26/66 (39%), Gaps = 3/66 (4%)

Query: 204 IVMPVILAQMLRKRLLANGQAAFDAAMTRIGP---WSIAALLATLVLLFAFQGEAILKQP 260
I++ ++ K +A A R+ P S+A +L L L + + +
Sbjct: 942 ILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNA 1001

Query: 261 LVIALL 266
+ I ++
Sbjct: 1002 VGIGVM 1007


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4218  OUTRMMBRANEA  37     4e-05    Outer membrane protein A signature.
		>OUTRMMBRANEA#Outer membrane protein A signature.

Length = 346

Score = 36.8 bits (85), Expect = 4e-05
Identities = 28/142 (19%), Positives = 44/142 (30%), Gaps = 20/142 (14%)

Query: 68 GGPDTGSNVTGSLGLGYQFGNGWRAEGEYV-FKRTNNFTSYWAPFDANANEFHVSAQRLM 126
GP + + GYQ E Y R P+ + AQ +
Sbjct: 48 NGPTHENQLGAGAFGGYQVNPYVGFEMGYDWLGR--------MPYKGSVENGAYKAQGVQ 99

Query: 127 LNGYKDFDLGRGFSVYGTLGIGVAIVSADGWQTNDTRRFASKTQTNLAYS--AGAGVSYA 184
L + + +Y LG V DT+ + S GV YA
Sbjct: 100 LTAKLGYPITDDLDIYTRLGGMVW--------RADTKSNVYGKNHDTGVSPVFAGGVEYA 151

Query: 185 INKRFSIDLGYRYV-DMGNVET 205
I + L Y++ ++G+ T
Sbjct: 152 ITPEIATRLEYQWTNNIGDAHT 173


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4219  HTHFIS        104    2e-28    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 104 bits (261), Expect = 2e-28
Identities = 38/154 (24%), Positives = 63/154 (40%), Gaps = 1/154 (0%)

Query: 10 DRPIVAIVDDDEPVRDGLALLLRTVGLPTRCYADAQAFLADADDRALGCVLLDLRMPGMS 69
+ + DDD +R L L G R ++A V+ D+ MP +
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 70 GLDALDRLSARRA-LPVIVLTGHGNVDACRRAFKRGALDFLRKPVDDDELIDTVQQALRR 128
D L R+ R LPV+V++ +A ++GA D+L KP D ELI + +AL
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 129 QAAQRGQDDAGQTRAARVATLSAREREVLEGIVR 162
+ + + + SA +E+ + R
Sbjct: 122 PKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLAR 155


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4220  PF06580       31     0.013    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 30.6 bits (69), Expect = 0.013
Identities = 19/102 (18%), Positives = 38/102 (37%), Gaps = 16/102 (15%)

Query: 374 ILHNLIRNA-RDALAGMP-LGEIRISGGRAGRHYRFSVVDNGPGVPDDALPRLFEPFFTT 431
++ L+ N + +A +P G+I + G + V + G + T
Sbjct: 259 LVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKN----------TK 308

Query: 432 RENGLGLGLPLCDTLAQR--QDGSLMIRNRPSGGVEATLLLP 471
G GL + + L + + + + G V A +L+P
Sbjct: 309 ESTGTGLQN-VRERLQMLYGTEAQIKLSEKQ-GKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4222  HTHFIS        88     5e-20    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 87.6 bits (217), Expect = 5e-20
Identities = 37/137 (27%), Positives = 58/137 (42%), Gaps = 2/137 (1%)

Query: 771 AARVLVVDDHPVNRTLQQSQLVTLGYAADAADDGASALRRCADTRYDLVMTDLNMPGMDG 830
A +LV DD RT+ L GY + A+ R A DLV+TD+ MP +
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 831 YTLARVLRARYPDLPVIAITAHASAAEHARCAEAGIVAVLVKPVLLDTIDRTVRRFAKIS 890
+ L ++ PDLPV+ ++A + + +E G L KP L + + R ++
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR--ALA 120

Query: 891 ATSRPARNTLVDLAEGP 907
R D +G
Sbjct: 121 EPKRRPSKLEDDSQDGM 137


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4223  OMADHESIN     69     7e-14    Yersinia outer membrane adhesin signature.
		>OMADHESIN#Yersinia outer membrane adhesin signature.

Length = 455

Score = 68.8 bits (167), Expect = 7e-14
Identities = 63/205 (30%), Positives = 102/205 (49%), Gaps = 14/205 (6%)

Query: 509 GSDSVATGPQSTAIGTSATANSTGSVALGNGATSSGANATAIGRLATAGAANATVLGGSA 568
G ++ A G S AIG +A A +VA+G G+ ++G N+ AIG L+ A +A G ++
Sbjct: 62 GLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAAS 121

Query: 569 TA------------TGQSAIAIGQSAHANGLSAIGIGFLSDAQADN--SVALGARSVANR 614
TA T + +A+G ++ A+ +++ IG S A++ S+A+G RS +R
Sbjct: 122 TAQKDGVAIGARASTSDTGVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDR 181

Query: 615 DNTVSVGSASQQRQIVNVAAGTQGSDAVNVSQLAPVVTALGGGATIDSATGAVTGPTYTL 674
+N+VS+G S RQ+ ++AAGT+ +DAVNV+QL + SA Y
Sbjct: 182 ENSVSIGHESLNRQLTHLAAGTKDTDAVNVAQLKKEIEKTQENTNKRSAELLANANAYAD 241

Query: 675 TNGGAQTTVGGALGALDSALTTANG 699
+ + SA T N
Sbjct: 242 NKSSSVLGIANNYTDSKSAETLENA 266



Score = 63.0 bits (152), Expect = 5e-12
Identities = 67/217 (30%), Positives = 99/217 (45%), Gaps = 3/217 (1%)

Query: 106 NAALAADSATAIGANASAAGQSSVAIGGNTMAT-VNGVAIGTLSQATGANSTAVGVGAAA 164
NA+ + AIGA A AA ++VA+G ++AT VN VAIG LS+A G ++ G + A
Sbjct: 64 NASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAASTA 123

Query: 165 TASAASALGRGAAASGTNTSAFGNGARASGDSASALGRGAVASEANSVALGANSTADRAN 224
+ R + + F + A A A A+ S+A+G S DR N
Sbjct: 124 QKDGVAIGARASTSDTGVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDREN 183

Query: 225 AVSVGTTSAQRQIINVADGTAGTDAVNLNQLNAAIAASDQYFMVNSTSANLANATGANAV 284
+VS+G S RQ+ ++A GT TDAVN+ QL I + + N SA L A A
Sbjct: 184 SVSIGHESLNRQLTHLAAGTKDTDAVNVAQLKKEIEKTQEN--TNKRSAELLANANAYAD 241

Query: 285 AIGQAVTSSGSSSVAIGSGTSADGLHSMALGASSRVV 321
+V ++ S + + A S V+
Sbjct: 242 NKSSSVLGIANNYTDSKSAETLENARKEAFAQSKDVL 278



Score = 43.0 bits (100), Expect = 8e-06
Identities = 54/159 (33%), Positives = 75/159 (47%), Gaps = 8/159 (5%)

Query: 1391 GSLVVGDGSAASGENSSAIGQGSAASGDGSTAVGQGSNASGGNSSAIGQGSNASGSNSS- 1449
++ VG GS A+G NS AIG S A GD + G S A + G+ AS S++
Sbjct: 85 AAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAASTA---QKDGVAIGARASTSDTGV 141

Query: 1450 AIGQGAVASGDSSTAIGQGS--SATGSGSVAIGAGSVATEANTVSFGDGTAEGNRRLVNI 1507
A+G + A +S AIG S +A S+AIG S N+VS G + NR+L ++
Sbjct: 142 AVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDRENSVSIGHESL--NRQLTHL 199

Query: 1508 ADGVNASDAATKGQLDRAINGMQGQINDVAKNAYAGVAA 1546
A G +DA QL + I Q N + A A
Sbjct: 200 AAGTKDTDAVNVAQLKKEIEKTQENTNKRSAELLANANA 238



Score = 42.2 bits (98), Expect = 1e-05
Identities = 36/100 (36%), Positives = 59/100 (59%), Gaps = 6/100 (6%)

Query: 276 ANATGANAVAIGQAVTSSGSSSVAIGSGTSADGLHSMALGASSRVVGDSNLAVGDGATVT 335
A+A G +++AIG ++ ++VA+G+G+ A G++S+A+G S+ +GDS + G
Sbjct: 65 ASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAA---- 120

Query: 336 SLTTAPTNAIAIGKGANVSDAGVAGATGSIAIGTNAAAGG 375
+TA + +AIG A+ SD GVA S A N+ A G
Sbjct: 121 --STAQKDGVAIGARASTSDTGVAVGFNSKADAKNSVAIG 158



Score = 40.3 bits (93), Expect = 7e-05
Identities = 36/128 (28%), Positives = 63/128 (49%)

Query: 411 GINSAAAGNASVAMGPNATAAGAGSIVIGNQAATTAANSVALGNAATGSAVNTTAIGSNA 470
G+N++A G S+A+G A AA ++ +G + T NSVA+G + + G+ +
Sbjct: 62 GLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAAS 121

Query: 471 KATGNNSTVLGQSSSASGLSAVGIGFRANASGQEAISLGSDSVATGPQSTAIGTSATANS 530
A + + ++S++ AVG +A+A AI S A S AIG + +
Sbjct: 122 TAQKDGVAIGARASTSDTGVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDR 181

Query: 531 TGSVALGN 538
SV++G+
Sbjct: 182 ENSVSIGH 189



Score = 37.2 bits (85), Expect = 5e-04
Identities = 32/78 (41%), Positives = 46/78 (58%), Gaps = 1/78 (1%)

Query: 1410 GQGSAASGDGSTAVGQGSNASGGNSSAIGQGSNASGSNSSAIGQGAVASGDSSTAIGQGS 1469
G ++A G S A+G + A+ G + A+G GS A+G NS AIG + A GDS+ G S
Sbjct: 62 GLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAAS 121

Query: 1470 SATGSGSVAIGAGSVATE 1487
+A G VAIGA + ++
Sbjct: 122 TAQKDG-VAIGARASTSD 138



Score = 35.3 bits (80), Expect = 0.002
Identities = 45/162 (27%), Positives = 76/162 (46%), Gaps = 8/162 (4%)

Query: 157 AVGVGAAATASAASALGRGAAASGTNTSAFGNGARASGDSASALGRGAVASEANSVALGA 216
A+G+ A G A+A G ++ A G A A+ +A A+G G++A+ NSVA+G
Sbjct: 46 ALGLEYPVRPPVPGAGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGP 105

Query: 217 NSTADRANAVSVGTTSAQRQIINVADGTAGTDAVNLNQLNAAIAASDQYFMVNSTSANLA 276
S A +AV+ G S ++ DG A + + A+ + + NS + +
Sbjct: 106 LSKALGDSAVTYGAASTAQK-----DGVAIGARASTSDTGVAVGFNSKADAKNSVAIGHS 160

Query: 277 NATGAN---AVAIGQAVTSSGSSSVAIGSGTSADGLHSMALG 315
+ AN ++AIG + +SV+IG + L +A G
Sbjct: 161 SHVAANHGYSIAIGDRSKTDRENSVSIGHESLNRQLTHLAAG 202



Score = 32.2 bits (72), Expect = 0.019
Identities = 25/60 (41%), Positives = 36/60 (60%)

Query: 1438 GQGSNASGSNSSAIGQGAVASGDSSTAIGQGSSATGSGSVAIGAGSVATEANTVSFGDGT 1497
G ++A G +S AIG A A+ ++ A+G GS ATG SVAIG S A + V++G +
Sbjct: 62 GLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAAS 121


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
Bcenmc03_4224  HTHFIS  46     4e-08    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 46.0 bits (109), Expect = 4e-08
Identities = 21/84 (25%), Positives = 37/84 (44%), Gaps = 5/84 (5%)

Query: 7 IRIVIADDHPAVVIGARYELSATNTLAVVASANNSTELMETLANHPCDVLVSDYAMPGTE 66
I++ADD A I + + V +N+ L +A D++V+D MP
Sbjct: 4 ATILVADDDAA--IRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPD-- 59

Query: 67 YGDGLAMFSAILKRYPNLRIVVMT 90
+ + I K P+L ++VM+
Sbjct: 60 -ENAFDLLPRIKKARPDLPVLVMS 82


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
Bcenmc03_4225  HTHFIS  38     3e-05    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 37.5 bits (87), Expect = 3e-05
Identities = 25/121 (20%), Positives = 49/121 (40%), Gaps = 7/121 (5%)

Query: 6 IRVVLADDHPATLGGVQHGLSSVP-TIRLTGSAGNSTELIALLDAGVCDVLVSDYAMPGG 64
+++ADD A + LS +R+T +A L + AG D++V+D MP
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNA---ATLWRWIAAGDGDLVVTDVVMPD- 59

Query: 65 AYGDGIALFSYLQRNYPAVKLVVLTMLDNPAVIKGLLGLGISCIVSKSDAVDHLIPAVHA 124
+ L +++ P + ++V++ + G + K + LI +
Sbjct: 60 --ENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117

Query: 125 A 125
A
Sbjct: 118 A 118


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits        Score  E-value  Comments
Bcenmc03_4226  OMPADOMAIN  106    2e-29    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 106 bits (266), Expect = 2e-29
Identities = 54/149 (36%), Positives = 79/149 (53%), Gaps = 16/149 (10%)

Query: 94 FMCGKPQPVVQPAPAPAPQPAPQPVPQRQVLLQGDANFATDSAALTSQARNDLDRFIA-- 151
F G+ PVV PAPAPAP V + L+ D F + A L + + LD+ +
Sbjct: 191 FGQGEAAPVVAPAPAPAP-----EVQTKHFTLKSDVLFNFNKATLKPEGQAALDQLYSQL 245

Query: 152 ANRGVEFARVAITGFTDSTGSAAHNRTLSEARARTVVNYLRSNGLQARSFSAEGLGAADP 211
+N + V + G+TD GS A+N+ LSE RA++VV+YL S G+ A SA G+G ++P
Sbjct: 246 SNLDPKDGSVVVLGYTDRIGSDAYNQGLSERRAQSVVDYLISKGIPADKISARGMGESNP 305

Query: 212 VASNATADGR---------AQNRRVEIRL 231
V N + + A +RRVEI +
Sbjct: 306 VTGNTCDNVKQRAALIDCLAPDRRVEIEV 334


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits       Score  E-value  Comments
Bcenmc03_4227  OMADHESIN  82     1e-17    Yersinia outer membrane adhesin signature.
		>OMADHESIN#Yersinia outer membrane adhesin signature.

Length = 455

Score = 81.9 bits (201), Expect = 1e-17
Identities = 73/197 (37%), Positives = 114/197 (57%), Gaps = 7/197 (3%)

Query: 1205 GAQSIATGSGAVALGAGASAAGTGAVALGHG-VATGTNALALGNGTVASGNNAIAEGFNA 1263
G + A G ++A+GA A AA AVA+G G +ATG N++A+G + A G++A+ G A
Sbjct: 62 GLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYG-AA 120

Query: 1264 RAAGVNGLALGNTARANAADS-LAFGTTAQVDPLATNSIAIGRQANVTSTALNSVALGAS 1322
A +G+A+G ARA+ +D+ +A G ++ D A NS+AIG ++V + S+A+G
Sbjct: 121 STAQKDGVAIG--ARASTSDTGVAVGFNSKAD--AKNSVAIGHSSHVAANHGYSIAIGDR 176

Query: 1323 SVADRLNSVSVGSTGQQRQIIYVARGTANTDAVNVSQLKEAVAAFGGNASVDANGAIVNP 1382
S DR NSVS+G RQ+ ++A GT +TDAVNV+QLK+ + N + + + N
Sbjct: 177 SKTDRENSVSIGHESLNRQLTHLAAGTKDTDAVNVAQLKKEIEKTQENTNKRSAELLANA 236

Query: 1383 TYTIGGTTYRNVGDALN 1399
+ +G A N
Sbjct: 237 NAYADNKSSSVLGIANN 253



Score = 71.5 bits (174), Expect = 3e-14
Identities = 59/182 (32%), Positives = 86/182 (47%), Gaps = 28/182 (15%)

Query: 635 ATAVGANASATGRSAVALGGNTVASAQNAVALGTLSRATGLESTAVGVGAAAT------G 688
+ A+GA A A +AVA+G ++A+ N+VA+G LS+A G + G + A G
Sbjct: 72 SIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAASTAQKDGVAIG 131

Query: 689 ANTSAFGRGASAG----------------------AGNSVALGTFSVADRANSVSVGSAS 726
A S G + G G S+A+G S DR NSVS+G S
Sbjct: 132 ARASTSDTGVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDRENSVSIGHES 191

Query: 727 ARRQITNVAAGTQDNDAVNVAQLNEAIANVDGNSPYFKANGAGDGSDAASATGVGSVAVG 786
RQ+T++AAGT+D DAVNVAQL + I N+ A + + A + +
Sbjct: 192 LNRQLTHLAAGTKDTDAVNVAQLKKEIEKTQENTNKRSAELLANANAYADNKSSSVLGIA 251

Query: 787 SN 788
+N
Sbjct: 252 NN 253



Score = 68.8 bits (167), Expect = 2e-13
Identities = 88/309 (28%), Positives = 129/309 (41%), Gaps = 66/309 (21%)

Query: 347 FSTGTYAANAAATVGNNSATAIGPNASATGISAVALGGNTVASAQNAVALGTLARATGLE 406
FS+ A+ + N +A I PNA ALG A G A A G+
Sbjct: 18 FSSPYAFADDYDGIPNLTAVQISPNADP------ALGLEYPVRPPVPGAGGLNASAKGIH 71

Query: 407 TTAVGVGAAATGASATALGRGAAATGTNSTALGTFASAIGTGNLAVGGGAAVAQTGRFVP 466
+ A+G A A +A A+G G+ ATG NS A+G + A+G + G + + G
Sbjct: 72 SIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAASTAQKDG---- 127

Query: 467 TNMIAIGTGARAIDGDVTGANGSIAIGSSATAAGTGSIAIGTNARHSGSSATVLGGEASA 526
+AIG+ A+ + TG
Sbjct: 128 -----------------------VAIGARASTSDTG------------------------ 140

Query: 527 MGGGGGVAVGYGAMASGSSGVSLGTNSTASATS--SVALGTNSVANRANAVSVGSAAQQR 584
VAVG+ + A + V++G +S +A S+A+G S +R N+VS+G + R
Sbjct: 141 ------VAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDRENSVSIGHESLNR 194

Query: 585 QITNVAAGTAGTDAVNVNQLNAAIAATDNKYVSISTGLYAADVAATAAESATAVG-ANAS 643
Q+T++AAGT TDAVNV QL I T S L A A +S++ +G AN
Sbjct: 195 QLTHLAAGTKDTDAVNVAQLKKEIEKTQENTNKRSAELLANANAYADNKSSSVLGIANNY 254

Query: 644 ATGRSAVAL 652
+SA L
Sbjct: 255 TDSKSAETL 263



Score = 63.4 bits (153), Expect = 8e-12
Identities = 64/207 (30%), Positives = 97/207 (46%), Gaps = 30/207 (14%)

Query: 763 FKANGAGDGSDAASATGVGSVAVGSNAKALVAGGVAIGGSATASMANSIAIGNDVIAAQD 822
+ G G ASA G+ S+A+G+ A+A VA+G + A+ NS
Sbjct: 53 VRPPVPGAGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNS------------ 100

Query: 823 GVAIGHGARAQNSNAISIGTQSAAGPNGVSLGNNALSVSDGIALGTNASAAGANSVALGS 882
VAIG ++A +A++ G S A +GV++G A + G+A+G N+ A NSVA+G
Sbjct: 101 -VAIGPLSKALGDSAVTYGAASTAQKDGVAIGARASTSDTGVAVGFNSKADAKNSVAIGH 159

Query: 883 GSIAAS-----------------NTVSVGSVGNERKITNVAAGTDRQDAVNLGQLQDTGL 925
S A+ N+VS+G R++T++AAGT DAVN+ QL+
Sbjct: 160 SSHVAANHGYSIAIGDRSKTDRENSVSIGHESLNRQLTHLAAGTKDTDAVNVAQLKKEIE 219

Query: 926 VAPVDPTNPGAGLTSLAVTYGTNADGS 952
+ A L + A Y N S
Sbjct: 220 KTQENTNKRSAELLANANAYADNKSSS 246



Score = 37.6 bits (86), Expect = 8e-04
Identities = 50/183 (27%), Positives = 79/183 (43%), Gaps = 23/183 (12%)

Query: 3118 GLELRPGTPGDGNGGGTGTNPYFGATDLTAGGSSAANPGTGTGNVAAG-SGASIG----- 3171
+RP PG G + + A TA + A G G++A G + +IG
Sbjct: 50 EYPVRPPVPGAGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKA 109

Query: 3172 ------------TGVNNGTVIGSGSTVGSSGGTAIGAGSAANGENATAIGQGSN--ATGS 3217
T +G IG+ ++ S G A+G S A+ +N+ AIG S+ A
Sbjct: 110 LGDSAVTYGAASTAQKDGVAIGARAST-SDTGVAVGFNSKADAKNSVAIGHSSHVAANHG 168

Query: 3218 GSVAIGSGSVANEANTVSFGNGTDTGNRRIVNIADGVGANDAATKGQLDRAVGGLGSQIN 3277
S+AIG S + N+VS G+ ++ NR++ ++A G DA QL + + N
Sbjct: 169 YSIAIGDRSKTDRENSVSIGH--ESLNRQLTHLAAGTKDTDAVNVAQLKKEIEKTQENTN 226

Query: 3278 DLS 3280
S
Sbjct: 227 KRS 229



Score = 34.5 bits (78), Expect = 0.009
Identities = 38/142 (26%), Positives = 55/142 (38%), Gaps = 2/142 (1%)

Query: 2913 QVKNVAAGTDDTDAVNVAQLKSAGLVAPVDPTNPGSGLTSLAVTYSTNDDESANFDEVKL 2972
Q+ ++AAGT DTDAVNVAQLK + + L + A Y+ N S
Sbjct: 195 QLTHLAAGTKDTDAVNVAQLKKEIEKTQENTNKRSAELLANANAYADNKSSSVLGIANNY 254

Query: 2973 KGTNGTTITNVAAGAVNSTSTDAINGSQLHGTAQSVADTIGGGTTVGADGTLGNTAIEVN 3032
+ A + S D +N ++ H + SVA T A+ T
Sbjct: 255 TDSKSAETLENARKEAFAQSKDVLNMAKAH--SNSVARTTLETAEEHANSVARTTLETAE 312

Query: 3033 GQKYSTVAEAVQAAAAYGATDS 3054
AEA+ +A Y + S
Sbjct: 313 EHANKKSAEALASANVYADSKS 334



Score = 32.9 bits (74), Expect = 0.023
Identities = 53/207 (25%), Positives = 81/207 (39%), Gaps = 21/207 (10%)

Query: 1297 ATNSIAIGRQANVTSTALNSVALGASSVADRLNSVSVGSTGQ---QRQIIYVARGTANTD 1353
+SIAIG A A +VA+GA S+A +NSV++G + + Y A TA D
Sbjct: 69 GIHSIAIGATAEAAKGA--AVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAASTAQKD 126

Query: 1354 AVNVSQLKEAVAAFGGNASVDANGAIVNPTYTIGGTTYRNVGDALNALSNLGGGGTDPLA 1413
V A G AS G V +G + + +N G +
Sbjct: 127 GV----------AIGARASTSDTGVAVGFNSKADAKNSVAIGHSSHVAANHG------YS 170

Query: 1414 VTYGTNEDGTPNFAVVTLKGTDGTTLSNVKAGVADTDAVNVSQLKDSGLIGDDGKAIAAV 1473
+ G +V + L+++ AG DTDAVNV+QLK + +
Sbjct: 171 IAIGDRSKTDRENSVSIGHESLNRQLTHLAAGTKDTDAVNVAQLKKEIEKTQENTNKRSA 230

Query: 1474 TYDRNADGTPNLGSVTLAGGTDGTTLS 1500
NA+ + S ++ G + T S
Sbjct: 231 ELLANANAYADNKSSSVLGIANNYTDS 257


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
Bcenmc03_4228  HTHFIS  72     6e-15    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 71.8 bits (176), Expect = 6e-15
Identities = 27/141 (19%), Positives = 55/141 (39%), Gaps = 4/141 (2%)

Query: 770 RVLFVDDNPVNRSLVHDQLDVLGYRADVASSVAEALDLVERHDYAIVMTDLNMPGLDGYA 829
+L DD+ R++++ L GY + S+ A + D +V+TD+ MP + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 830 LARMLRERGRALPILAVTAHAEPDEMRLSKEAGIDEIVTKP----TSLRSLEQAIAKYAG 885
L +++ LP+L ++A + E G + + KP + + +A+A+
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 886 PWRPGRAIPPASASTRGPLPR 906
G
Sbjct: 125 RPSKLEDDSQDGMPLVGRSAA 145


52  Bcenmc03_4301  Bcenmc03_4307  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
Bcenmc03_4301  3       18       -5.450741  fucose-binding lectin II
Bcenmc03_4302  3       12       -3.267576  fucose-binding lectin II
Bcenmc03_4303  2       12       -2.215537  fucose-binding lectin II
Bcenmc03_4304  1       12       -2.061218  tryptophan 2-monooxygenase
Bcenmc03_4305  0       8        -0.293481  cytosine/purines uracil thiamine allantoin
Bcenmc03_4306  1       8        0.224648   Nitrilase/cyanide hydratase and apolipoprotein
Bcenmc03_4307  0       11       0.623141   outer membrane autotransporter
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
Bcenmc03_4301  PF07472  148    5e-48    Fucose-binding lectin II
		>PF07472#Fucose-binding lectin II

Length = 245

Score = 148 bits (375), Expect = 5e-48
Identities = 50/128 (39%), Positives = 70/128 (54%), Gaps = 10/128 (7%)

Query: 4 SQTSSNRAGEFSIPPNTDFRAIFFANAAEQQHIKLFIGDSQEPAA-YHKLTTRDGPREA- 61
+ R G F++PPN F N++ QQ I++++ D+ +PAA + T+D
Sbjct: 126 TTGGGERDGIFNLPPNIAFGVTALVNSSAQQTIEVYVDDNPKPAATFQGAGTQDANLNTQ 185

Query: 62 TLNSGNGKIRFEVSVNGKPSATDARLAPINGKKSDGSPFTVNFGIVVSEDGHDSDYNDGI 121
+NSG GK+R V+ NGKPS +R I K FG+V SEDG D DYNDGI
Sbjct: 186 IVNSGKGKVRVVVTANGKPSKIGSRQVDIFKK--------TYFGLVGSEDGTDGDYNDGI 237

Query: 122 VVLQWPIG 129
+L WP+G
Sbjct: 238 AILNWPLG 245


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
Bcenmc03_4302  PF07472  203    2e-67    Fucose-binding lectin II
		>PF07472#Fucose-binding lectin II

Length = 245

Score = 203 bits (518), Expect = 2e-67
Identities = 89/159 (55%), Positives = 114/159 (71%), Gaps = 16/159 (10%)

Query: 113 GSAMHIDSYASLSAIGETAAPSSSQGGGNQGAETGGAGAGNIGGERDGTFNLPPHIKFGV 172
G ++ ++ + E P ++ GGG ERDG FNLPP+I FGV
Sbjct: 103 GVGAVVNYFSKATPQPEPTQPGTTTGGG----------------ERDGIFNLPPNIAFGV 146

Query: 173 TALTNAANDQTIDIYIDDDPKPAATFKGAGAQDQNLGTKVLDSGNGRVRVIVMANGKPSR 232
TAL N++ QTI++Y+DD+PKPAATF+GAG QD NL T++++SG G+VRV+V ANGKPS+
Sbjct: 147 TALVNSSAQQTIEVYVDDNPKPAATFQGAGTQDANLNTQIVNSGKGKVRVVVTANGKPSK 206

Query: 233 LGSRQVDIFKKSYFGIVGSEDGADDDYNDGIVFLNWPLG 271
+GSRQVDIFKK+YFG+VGSEDG D DYNDGI LNWPLG
Sbjct: 207 IGSRQVDIFKKTYFGLVGSEDGTDGDYNDGIAILNWPLG 245


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
Bcenmc03_4303  PF07472  407    e-148    Fucose-binding lectin II
		>PF07472#Fucose-binding lectin II

Length = 245

Score = 407 bits (1046), Expect = e-148
Identities = 230/245 (93%), Positives = 235/245 (95%), Gaps = 1/245 (0%)

Query: 1 MSQPFTHDDLYALLQLAGNDATAVQANGDQAVLDRMRRFMTAQLVEKLPQYDVFVDIATI 60
MSQPFTHDDLYALLQLAGNDATAVQANGDQAVLDRMR+FMT QLVEKLPQYDVFVDIATI
Sbjct: 1 MSQPFTHDDLYALLQLAGNDATAVQANGDQAVLDRMRQFMTTQLVEKLPQYDVFVDIATI 60

Query: 61 PYSFDVGSWQNKVKADAAGEVVACTVTWAGAPGVLPGAAAKFGVGAVVNYFSKATPQPVP 120
PYSFDVGSWQNKVKADAAG+V+ACTVTWAGAPGVLPGAAAKFGVGAVVNYFSKATPQP P
Sbjct: 61 PYSFDVGSWQNKVKADAAGQVIACTVTWAGAPGVLPGAAAKFGVGAVVNYFSKATPQPEP 120

Query: 121 -PAPAPTGGGERDGVFNLPPNIAFGVTALVNSSAPQTIEVFVDDNPKPAATFQGAGTQDA 179
TGGGERDG+FNLPPNIAFGVTALVNSSA QTIEV+VDDNPKPAATFQGAGTQDA
Sbjct: 121 TQPGTTTGGGERDGIFNLPPNIAFGVTALVNSSAQQTIEVYVDDNPKPAATFQGAGTQDA 180

Query: 180 NLNTQIVNSGKGKVRVVVTANGKPSKIGSRQVDIFKKTYFGLVGSEDGGDGDYNDGIAIL 239
NLNTQIVNSGKGKVRVVVTANGKPSKIGSRQVDIFKKTYFGLVGSEDG DGDYNDGIAIL
Sbjct: 181 NLNTQIVNSGKGKVRVVVTANGKPSKIGSRQVDIFKKTYFGLVGSEDGTDGDYNDGIAIL 240

Query: 240 NWPLG 244
NWPLG
Sbjct: 241 NWPLG 245


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4307  PRTACTNFAMLY  87     2e-19    Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 87.4 bits (216), Expect = 2e-19
Identities = 205/939 (21%), Positives = 324/939 (34%), Gaps = 99/939 (10%)

Query: 203 APAVYMQGNNDTLVNSGTIQ--TTGTATSGGSVDAVVSNTLGSSFTATITNQAGGRIISN 260
APA + NN ++V +G Q + G V T+ S QA G ++ N
Sbjct: 29 APAAHADWNNQSIVKTGERQHGIHIQGSDPGGVRTASGTTIKVS-----GRQAQGILLEN 83

Query: 261 NGIGVRSTNGATTITNAGLIQGGGGTAIQGGNGNVTLILQTGSQIVGTANGGAGTNTVTL 320
++ NG ++T++G + G G L + + + L
Sbjct: 84 PAAELQFRNG--SVTSSGQLSDDGIRRFLGTVTVKAGKLVADHATLANVGDTWDDDGIAL 141

Query: 321 QGTGTASNAFTNFQSLTMAGADWTWAGTG-TFSTALVQSGTLNLTGTLGTTTASVVATVN 379
G + A +L AG G T + + G L++ + +
Sbjct: 142 YVAGEQAQASIADSTLQGAGGVQIERGANVTVQRSAIVDGGLHIGALQSLQPEDLPPSRV 201

Query: 380 AGATLQANA---SNLPLSVTDNGLVRFQQDSAGTYTGTISGSGAVEKTGAGTLTLTPSAA 436
A S P +V+ G D G +G A++ GA +
Sbjct: 202 VLRDTNVTAVPASGAPAAVSVLGASELTLDGGHITGGRAAGVAAMQ--GAVVHLQRATIR 259

Query: 437 GGNTYSGGTTITQGTLSVAADNALGAASGPLTFNG--GTLQLGSAFDLAASRAMSITSNN 494
G+ +GG A G +G G GS+ +LA S +
Sbjct: 260 RGDAPAGGAVPGGAVPGGAVPGGFGPGGFGPVLDGWYGVDVSGSSVELAQSIVEA--PEL 317

Query: 495 GTIDTQGFNSTITQNITGTGGLTKLGAGTLTLNGANTYAGGTALNAGTLVVGDAAHASAA 554
G G + +T G L+ + GA +A A + TL G A A
Sbjct: 318 GAAIRVGRGARVT---VSGGSLSAPHGNVIETGGARRFAPQAAPLSITLQAGAHAQGKAL 374

Query: 555 LGGGGP----VTVASGTMLGGYGGVTGNVTNNGTLSVANALASLASGAT----------- 599
L P +T+ G G V + + S+ +LAS A
Sbjct: 375 LYRVLPEPVKLTLTGGADAQG-DIVATELPSIPGTSIGPLDVALASQARWTGATRAVDSL 433

Query: 600 ----GNFRINGNLTNAGLVQLGGSG---------VGNTLTVAGNYVGQNATLALNTTLAG 646
+ + N +N G ++L G G + N + + +N
Sbjct: 434 SIDNATWVMTDN-SNVGALRLASDGSVDFQQPAEAGRFKVLTVNTLAGSGLFRMNVFA-- 490

Query: 647 DGAPSDKLVVSGGTASGSSTLKVTNVGGAGAQTTGDGIQVVQATNGATTSANAFSLSGGT 706
D SDKLVV ASG L V N +G++ ++ T + + + G
Sbjct: 491 DLGLSDKLVVMQD-ASGQHRLWVRN---SGSEPASANTLLLVQTPLGSAATFTLANKDGK 546

Query: 707 VSAGAYTYFLAKGGESNGTGDSWYLRNTVPPKPQPPVVQPGQPTPPAEPPIMAAEGTPES 766
V G Y Y LA +NG G + PP P+P QP P +P A P +
Sbjct: 547 VDIGTYRYRLA----ANGNGQWSLVGAKAPPAPKPAPQPGPQPPQPPQPQPEAPAPQPPA 602

Query: 767 IVEAVDHAGVGGTPEPVYRPEVPLYAEAPAVARQLDLLQIDTFHDRQGEQGLLAENGSVP 826
E A V YAE+ A++++L G L N
Sbjct: 603 GRELSAAANAAVNTGGVGLASTLWYAESNALSKRL---------------GELRLNPDAG 647

Query: 827 ASWVRVWGGNSNIKQKGDATPSFDGTVWGMQVGQDLYADTTASGHRNHYGFFLGFSRAVG 886
+W R + + + A FD V G ++G D +G R H G G++R
Sbjct: 648 GAWGRGFAQRQQLDNR--AGRRFDQKVAGFELGAD--HAVAVAGGRWHLGGLAGYTRGDR 703

Query: 887 DVKGFALAQPDLGVGSLQVNAYNLGGYWTHIGPGGWYTDAVLMGS----ALTVRTHSSDI 942
G G ++ ++GGY T+I G+Y DA L S V
Sbjct: 704 GFTGD---------GGGHTDSVHVGGYATYIADSGFYLDATLRASRLENDFKVAGSDGYA 754

Query: 943 VNGSTNGNAFTGSVEAGLPIALGNGLTLEPQAQLVWQWLSLDRFNDGVS-SIAWNNGNTF 1001
V G + S+EAG +G LEPQA+L + + G++
Sbjct: 755 VKGKYRTHGVGASLEAGRRFTHADGWFLEPQAELAVFRAGGGAYRAANGLRVRDEGGSSV 814

Query: 1002 VGRIGARLQYAFD-ASGVSWKPYLRVNVLRSFGTDDKTTFGGSTTIGTQVGQTAGQIGVG 1060
+GR+G + + A G +PY++ +VL+ F G G A ++G+G
Sbjct: 815 LGRLGLEVGKRIELAGGRQVQPYIKASVLQEFDGAGTVHTNGIAHRTELRGTRA-ELGLG 873

Query: 1061 LVAQLTKRGSAYATVSYLTNLGGEHQRTITGNAGVRWAW 1099
+ A L + S YA+ Y G + T +AG R++W
Sbjct: 874 MAAALGRGHSLYASYEYSK--GPKLAMPWTFHAGYRYSW 910


53  Bcenmc03_4336  Bcenmc03_4342  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias   Product
Bcenmc03_4336  -1      11       0.295699  short-chain dehydrogenase/reductase SDR
Bcenmc03_4337  0       13       0.154297  short chain dehydrogenase
Bcenmc03_4338  0       12       0.974112  aminoglycoside phosphotransferase
Bcenmc03_4339  1       14       0.812351  phosphoglycerate mutase
Bcenmc03_4340  1       16       1.344901  hypothetical protein
Bcenmc03_4341  -1      12       2.290668  hypothetical protein
Bcenmc03_4342  0       13       2.271624  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4336  DHBDHDRGNASE  112    4e-32    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 112 bits (281), Expect = 4e-32
Identities = 82/259 (31%), Positives = 121/259 (46%), Gaps = 10/259 (3%)

Query: 1 MRLDSYAGQAVMITGAASGFGALLASELAAMGARLALGDLNGEALERVAAPLRAAGADVI 60
M G+ ITGAA G G +A LA+ GA +A D N E LE+V + L+A
Sbjct: 1 MNAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAE 60

Query: 61 AQRCDVRVETEVAALVQEAVARFGRLDVGINNAGIAPPMKALIDTDEADLDLNFAVNAKG 120
A DVR + + G +D+ +N AG+ P + + + + F+VN+ G
Sbjct: 61 AFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRP-GLIHSLSDEEWEATFSVNSTG 119

Query: 121 VFFGMKHQIRQMLAQREGVILNVASMAGLGGAPKLAAYAASKHAVVGLTKTAALEYARHG 180
VF + + M+ +R G I+ V S +AAYA+SK A V TK LE A +
Sbjct: 120 VFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYN 179

Query: 181 IRVNAVCPFYSTTSM---VTDSEIGDRQ---DFLAQ---GSPMKRLGRPDEIVATMLMLC 231
IR N V P + T M + E G Q L G P+K+L +P +I +L L
Sbjct: 180 IRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLV 239

Query: 232 AKDNTYLTGQAVAVDGGVS 250
+ ++T + VDGG +
Sbjct: 240 SGQAGHITMHNLCVDGGAT 258


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4337  DHBDHDRGNASE  125    6e-37    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 125 bits (314), Expect = 6e-37
Identities = 87/255 (34%), Positives = 121/255 (47%), Gaps = 9/255 (3%)

Query: 8 LTGKIALVTGASRGIGEEIAKLLAEQGAYVIVSSRKLDDCQAVADAIVAAGGRAEALACH 67
+ GKIA +TGA++GIGE +A+ LA QGA++ + + V ++ A AEA
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 68 VGRLEDIAATFEHIRGKHGRLDILVNNAAANPYFGHILDTDLAAYEKTVDVNIRGYFFMS 127
V I I + G +DILVN A G I +E T VN G F S
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVN-VAGVLRPGLIHSLSDEEWEATFSVNSTGVFNAS 124

Query: 128 VEAGKLMKTHGGGAIVNTASVNALQPGDRQGIYSITKAAVVNMTKAFAKECGPLGIRVNA 187
K M G+IV S A P Y+ +KAA V TK E IR N
Sbjct: 125 RSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNI 184

Query: 188 LLPGLTKTKFAGALFADKD--------IYETWMTKIPLRRHAEPREMAGTVLYLVSDAAS 239
+ PG T+T +L+AD++ ET+ T IPL++ A+P ++A VL+LVS A
Sbjct: 185 VSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAG 244

Query: 240 YTNGECIVVDGGLTI 254
+ + VDGG T+
Sbjct: 245 HITMHNLCVDGGATL 259


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
Bcenmc03_4341  PF06580  29     4e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 29.4 bits (66), Expect = 4e-04
Identities = 10/46 (21%), Positives = 23/46 (50%)

Query: 1 MLGTILIIVLILLLIGAFPAWPHSRSWGYWPSGTVGLIVVIVVILV 46
M+ I I ++ L+L A+ ++ + W G + L V+ +++
Sbjct: 42 MIFNIAISLMGLVLTHAYRSFIKRQGWLKLNMGQIILRVLPACVVI 87


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits         Score  E-value  Comments
Bcenmc03_4342  TYPE4SSCAGX  27     0.022    Type IV secretion system CagX conjugation protein si...
		>TYPE4SSCAGX#Type IV secretion system CagX conjugation protein signature.

Length = 522

Score = 27.1 bits (59), Expect = 0.022
Identities = 19/90 (21%), Positives = 43/90 (47%), Gaps = 1/90 (1%)

Query: 38 EQAASAAASPRAAKKAERAADRAFAKKVRQAIVRAPGVGNAQ-VTVFAKAKTGDVTLAGQ 96
EQA A R +K ERA +RA + + A+ + N + ++ K + + +
Sbjct: 156 EQAQKAQKDKREKRKEERAKNRANLENLTNAMSNPQNLSNNKNLSELIKQQRENELDQME 215

Query: 97 IADESQDRAAVDAARQVPGVTSVKSKLQLR 126
++ Q++A +A +Q+ + +++ +R
Sbjct: 216 RLEDMQEQAQANALKQIEELNKKQAEEAVR 245


54  Bcenmc03_4364  Bcenmc03_4371  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias   Product
Bcenmc03_4364  -2      10       1.608305  TetR family transcriptional regulator
Bcenmc03_4365  -3      10       1.411498  hypothetical protein
Bcenmc03_4366  -2      9        2.592933  cytochrome c class I
Bcenmc03_4367  -1      10       2.587230  amine oxidase
Bcenmc03_4368  0       12       2.763250  two component transcriptional regulator
Bcenmc03_4369  -1      12       1.560412  histidine kinase
Bcenmc03_4370  0       12       1.521640  hypothetical protein
Bcenmc03_4371  1       12       1.349635  major facilitator transporter
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
Bcenmc03_4364  HTHTETR  61     9e-14    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 61.2 bits (148), Expect = 9e-14
Identities = 30/188 (15%), Positives = 61/188 (32%), Gaps = 7/188 (3%)

Query: 8 RAERREATRERLLAAARTIFAEKGYAAASVEDIAAAAGHTRGAFYSNFRGKADVLFELLG 67
+ + TR+ +L A +F+++G ++ S+ +IA AAG TRGA Y +F+ K+D+ E+
Sbjct: 5 TKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWE 64

Query: 68 RDQDEAAAALQRIVGAHDPDDDAQ-----RAMLAYWRRGTTQPASRLMWLDAQLQAARDP 122
+ D + +L + +
Sbjct: 65 LSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMA 124

Query: 123 QFRARFGALLHDRRAFAAACIDAYAARVGVSLPLPTQVLALGLTALCDGMHSHGAAEARP 182
+ L + + + L T+ A+ + G+ + P
Sbjct: 125 VVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMEN--WLFAP 182

Query: 183 TDDTLADT 190
L
Sbjct: 183 QSFDLKKE 190


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
Bcenmc03_4368  HTHFIS  81     1e-19    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 81.0 bits (200), Expect = 1e-19
Identities = 31/137 (22%), Positives = 58/137 (42%), Gaps = 6/137 (4%)

Query: 18 VLIAEDEPEIAEILTAYFARNGLRTVHAADGRRALELHLSLKPDLVLLDVQMPHVDGWKV 77
+L+A+D+ I +L +R G ++ + DLV+ DV MP + + +
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 78 LAEIR-HRGDTPVIMLTALDQDIDKLTGLRIGADDYVVKPFNPAEVVARAQAVLRRSMAG 136
L I+ R D PV++++A + + + GA DY+ KPF+ E++ L
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE---- 121

Query: 137 SRQEEQRVLRAAPFEID 153
+ L +
Sbjct: 122 -PKRRPSKLEDDSQDGM 137


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
Bcenmc03_4369  PF06580  39     3e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 38.7 bits (90), Expect = 3e-05
Identities = 16/71 (22%), Positives = 26/71 (36%), Gaps = 20/71 (28%)

Query: 254 LLENARRHAV-----PGAIRIQTRIEDGMCRLRVEDDGPGIPAEFAPHVFQAFRRVDESQ 308
L+EN +H + G I ++ ++G L VE+ G ++
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLAL---------------KNT 307

Query: 309 PGGTGLGLAVV 319
TG GL V
Sbjct: 308 KESTGTGLQNV 318


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
Bcenmc03_4371  TCRTETA  39     1e-05    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 39.4 bits (92), Expect = 1e-05
Identities = 49/306 (16%), Positives = 95/306 (31%), Gaps = 37/306 (12%)

Query: 32 VATFTTGLVHSLTLLFAARLLLGIGEGATLPAQARAVTHWFPRERRGVVQGFTHSFSRLG 91
V L +L+ R++ GI GAT + + R F +
Sbjct: 85 VDYAIMATAPFLWVLYIGRIVAGI-TGATGAVAGAYIADITDGDER------ARHFGFMS 137

Query: 92 NA-----VTPPIVAALTTWLSWRAAFFVIGAVTLVWLAWWIVGFREHPAGDDDGRPRAAR 146
V P++ L S A FF A+ L + F + + RP
Sbjct: 138 ACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNG--LNFLTGCFLLPESHKGERRPLRRE 195

Query: 147 PVAPSGPTPWGPLFRRMAPTIFVYFC------YGWTAWLFFTWLPTFFLNGQGLNLKSTA 200
+ P W +A + V+F W+ F F + + + A
Sbjct: 196 ALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFG-EDRFHWDATTIGISLAA 254

Query: 201 LFASGVFFAGVVGDTLGGWLCDRIYRKTGNLALSRQSVIVTSFVGALVCLLPLAFVHSTA 260
++ + L +R G + I+ +F P+ + ++
Sbjct: 255 FGILHSLAQAMITGPVAARLGERRALMLG-MIADGTGYILLAFATRGWMAFPIMVLLASG 313

Query: 261 GVALCLSGSFLCLELTIGPIWAVPSDIAPTHA-GIASGMMNAGSAISGILSPILFGYLVD 319
G+ + + A+ S G G + A ++++ I+ P+LF +
Sbjct: 314 GIGM-------------PALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYA 360

Query: 320 RT-GSW 324
+ +W
Sbjct: 361 ASITTW 366


55  Bcenmc03_4496  Bcenmc03_4501  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias   Product
Bcenmc03_4496  -1      8        1.347440  major facilitator transporter
Bcenmc03_4497  0       9        0.861120  alcohol dehydrogenase
Bcenmc03_4498  -1      12       1.082971  short-chain dehydrogenase/reductase SDR
Bcenmc03_4499  -2      10       1.285376  outer membrane protein (porin)
Bcenmc03_4500  -2      11       1.273726  AraC family transcriptional regulator
Bcenmc03_4501  -2      11       0.985065  MetA-pathway phenol degradation-like protein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
Bcenmc03_4496  TCRTETA  50     8e-09    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 50.2 bits (120), Expect = 8e-09
Identities = 73/394 (18%), Positives = 134/394 (34%), Gaps = 32/394 (8%)

Query: 21 LFMTFGFVFFDRLALSFL---FPFMSAELHLTNTQ---LGMVSSALALTWALSGAATGAW 74
L + V D + + + P + +L +N G++ + AL GA
Sbjct: 7 LIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGAL 66

Query: 75 SDARGTRKPLLVAAVLGFSVCSALSGLVGGFASLIAFRALMGIAEGPVLPLSQSLMVESS 134
SD G R+P+L+ ++ G +V A+ L R + GI G ++ + + + +
Sbjct: 67 SDRFG-RRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGIT-GATGAVAGAYIADIT 124

Query: 135 TPSRRGLNMGLLQGSAAGLLGAMIGPPVVIGIATAYGWREAFYVSCIPGFLIAFCIWRWV 194
R + G + +A M+ PV+ G+ + F+ + L +
Sbjct: 125 DGDERARHFGFM---SACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLL 181

Query: 195 REMPPGGVRHVQADTAAVPAASGAEFSRWALLKERNILLCVLISCFFLTWFVVIISFAP- 253
E G R A P AS RWA L + FF+ V + A
Sbjct: 182 PE-SHKGERRPLRREALNPLAS----FRWARGMTVVAALMAV---FFIMQLVGQVPAALW 233

Query: 254 VFLVESR-HLAPSDMGVVMTCLGAA-WVFWGFAVPAISDRIGRKPTMIGFALIAAMCPVV 311
V E R H + +G+ + G + ++ R+G + + +IA +
Sbjct: 234 VIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGER-RALMLGMIADGTGYI 292

Query: 312 LI-HVGSVWALGALVFVTYTGLGCFTLFMATIPAETVPPRAIASALGLIMGAGEL---IG 367
L+ W + F L + M + A + + G + G+ +
Sbjct: 293 LLAFATRGW----MAFPIMVLLASGGIGMPALQA-MLSRQVDEERQGQLQGSLAALTSLT 347

Query: 368 GFVAPTVAGFAADRYGLQFAMWTSTTGAILACVL 401
V P + + W GA L +
Sbjct: 348 SIVGPLLFTAIYAASITTWNGWAWIAGAALYLLC 381


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4498  DHBDHDRGNASE  89     2e-23    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 89.0 bits (220), Expect = 2e-23
Identities = 65/250 (26%), Positives = 106/250 (42%), Gaps = 21/250 (8%)

Query: 8 DGQSVVVTGGTSGIGARTALRFAQAGASVVALGLDATGPHAPVHAGV------RCVELDV 61
+G+ +TG GIG A A GA + A+ + V + DV
Sbjct: 7 EGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADV 66

Query: 62 TDGAAL----TRTIAALPRLDVLVNGVGISRHAG--EYRMDQFEHVLNVNLMSVMRASDA 115
D AA+ R + +D+LVN G+ R +++E +VN V AS +
Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRS 126

Query: 116 ALPALTAN-GGSIVNIASMYTYFGSKDRPAYSASKGGIAQLTRSLAQAWADHGIRVNAVA 174
+ GSIV + S AY++SK T+ L A++ IR N V+
Sbjct: 127 VSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVS 186

Query: 175 PGWIDTPLSSALMADTLASRRILERT--------PLGRWGTADEVAEVILFLCSPGASFV 226
PG +T + +L AD + ++++ + PL + ++A+ +LFL S A +
Sbjct: 187 PGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHI 246

Query: 227 TGAVVPVDGG 236
T + VDGG
Sbjct: 247 TMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4499  ECOLNEIPORIN  72     4e-16    E.coli/Neisseria porin superfamily signature.
		>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature.

Length = 331

Score = 71.8 bits (176), Expect = 4e-16
Identities = 74/355 (20%), Positives = 124/355 (34%), Gaps = 61/355 (17%)

Query: 3 KKTFLALAACTLPAAAFAQSSVTMFGLMDTGISYVSNQGGHGAAKFDDNIF-----FPNL 57
KK+ +AL LP AA A VT++G + G+ + +GA +
Sbjct: 2 KKSLIALTLAALPVAAMAD--VTLYGTIKAGVETSRSVAHNGAQAASVETGTGIVDLGSK 59

Query: 58 LGFEGKEDLGAGTRAIFRLVNQYSLGNGSIIGGGLFARTAYVGLQNDRYGTLTLGNQYEF 117
+GF+G+EDLG G +AI+++ + S+ G R +++GL+ +G L +G
Sbjct: 60 IGFKGQEDLGNGLKAIWQVEQKASIAGT---DSGWGNRQSFIGLKGG-FGKLRVGRLN-- 113

Query: 118 MVDALAASGNEIAQDLVGLYGFRNGPFDRLGLPNNPTGAFDW---DRVAGSNRVANSVKY 174
G N D+ +++A SV+Y
Sbjct: 114 ------------------------SVLKDTGDINPWDSKSDYLGVNKIAEPEARLISVRY 149

Query: 175 TSPSLSGLTFGALYGFGNVAGSIGANNTVSVGASYDHGPFG--AGAAYTNQKYGAADGLP 232
SP +GL+ Y + AG + + G +Y +G F G AY +
Sbjct: 150 DSPEFAGLSGSVQYALNDNAGR-HNSESYHAGFNYKNGGFFVQYGGAYKRHHQVQENV-N 207

Query: 233 PTSVRNWGAGVHY-------TVGTVTGKALVTT---VHNARNGAGAWSAEAGAAWRP--S 280
+ Y +V A + HN++ A A P S
Sbjct: 208 IEKYQIHRLVSGYDNDALYASVAVQQQDAKLVEENYSHNSQTEVAATLAYRFGNVTPRVS 267

Query: 281 PVWVIGASYTYMKGNDTLDDAHAHQLLAAVQYWLSKRTMVYVAGVHQRASHGSNA 335
S+ N+ D Q++ +Y SKRT V+ + G +
Sbjct: 268 YAHGFKGSFDATNYNNDYD-----QVVVGAEYDFSKRTSALVSAGWLQEGKGESK 317


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits        Score  E-value  Comments
Bcenmc03_4501  ECOLIPORIN  31     0.006    E.coli/Salmonella-type porin signature.
		>ECOLIPORIN#E.coli/Salmonella-type porin signature.

Length = 383

Score = 31.0 bits (70), Expect = 0.006
Identities = 25/91 (27%), Positives = 36/91 (39%), Gaps = 5/91 (5%)

Query: 174 GLSYGGRTGPVFDAKVMWTINRRNPATDYRSGQEFIVDYSAGWGFGNGLTVGAGGYFYRQ 233
L Y G+ V N RN D R S + G G + G Y
Sbjct: 169 ALQYQGKNESQSADDVNIGTNNRNNGDDIRYDNGDGFGISTTYDIGMGFSAG-AAYTTSD 227

Query: 234 MTDD-AQAGQTIA-GNRGRSFAIGPSIKYDS 262
T++ AG TIA G++ ++ G +KYD+
Sbjct: 228 RTNEQVNAGGTIAGGDKADAWTAG--LKYDA 256
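
Each alignment in this report carries a statistics line of the form "Identities = a/n (p%), Positives = b/n (q%), Gaps = c/n (r%)", where n is the alignment length. As a minimal sketch, assuming that line format holds throughout, the counts can be parsed and the percentages recomputed as below. The helper name `parse_stats` is hypothetical and is not part of PredictBias or BLAST; the sample line is copied from the ECOLIPORIN hit above.

```python
import re

# Hypothetical helper (not part of PredictBias or BLAST): parse one
# BLAST-style statistics line into its raw counts and reported percentages.
STAT_RE = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_stats(line):
    """Return {field: (count, alignment_length, reported_percent)}."""
    return {
        name: (int(count), int(length), int(pct))
        for name, count, length, pct in STAT_RE.findall(line)
    }


# Statistics line of the Bcenmc03_4501 / ECOLIPORIN alignment.
line = "Identities = 25/91 (27%), Positives = 36/91 (39%), Gaps = 5/91 (5%)"
stats = parse_stats(line)

for name, (count, length, pct) in stats.items():
    # Recompute the percentage from the raw counts for comparison
    # with the integer value printed in the report.
    recomputed = 100 * count / length
    print(f"{name}: {count}/{length} = {recomputed:.1f}% (reported {pct}%)")
```

Working from the raw counts rather than the printed percentages avoids depending on how the report rounds them.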


56  Bcenmc03_4558  Bcenmc03_4569  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
Bcenmc03_4558  -1      10       1.353745   VacJ family lipoprotein
Bcenmc03_4559  2       10       1.342974   hopanoid biosynthesis associated RND transporter
Bcenmc03_4560  2       11       0.702213   toluene tolerance family protein
Bcenmc03_4561  1       11       1.025482   TetR family transcriptional regulator
Bcenmc03_4562  1       11       1.193450   EmrB/QacA family drug resistance transporter
Bcenmc03_4563  -1      12       0.738077   two component LuxR family transcriptional
Bcenmc03_4564  -1      12       0.502203   histidine kinase
Bcenmc03_4565  -3      9        -0.348611  diguanylate cyclase
Bcenmc03_4566  -1      10       -0.847564  malate synthase G
Bcenmc03_4567  0       19       -2.361306  hypothetical protein
Bcenmc03_4569  0       19       -3.146235  DEAD/DEAH box helicase domain-containing
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4558  VACJLIPOPROT  218    5e-72    VacJ lipoprotein signature.
		>VACJLIPOPROT#VacJ lipoprotein signature.

Length = 251

Score = 218 bits (557), Expect = 5e-72
Identities = 75/238 (31%), Positives = 117/238 (49%), Gaps = 8/238 (3%)

Query: 3 KVRIIAATVAASAMLTGCATGP--NRNPNDPLEPMNRAMYKFN-DTVDTNIAQPIAKGYQ 59
K+R+ A + + +L GCA+ + +DPLE NR MY FN + +D I +P+A ++
Sbjct: 2 KLRLSALALGTT-LLVGCASSGTDQQGRSDPLEGFNRTMYNFNFNVLDPYIVRPVAVAWR 60

Query: 60 KVTPTPVRTAISNFFSNLGDLGNMANNLLQLRITDATQDLMRVAMNSLFGVAGLIDIATP 119
P P R +SNF NL + M N LQ R +N++ G+ G ID+A
Sbjct: 61 DYVPQPARNGLSNFTGNLEEPAVMVNYFLQGDPYQGMVHFTRFFLNTILGMGGFIDVAGM 120

Query: 120 AGLPKH---HQDFGLTMARWGMPSGPYLVLPVFGPSTIRDGVGRAVDVRFNLLNYIEPAA 176
A FG T+ +G+ GPY+ LP +G T+RD G D + +L+++
Sbjct: 121 ANPKLQRTEPHRFGSTLGHYGVGYGPYVQLPFYGSFTLRDDGGDMADALYPVLSWLTWPM 180

Query: 177 RNPMYIAQFISARSDLLGATDLLKQAALDPYSFVRDAYLQQRKSLTYHGQSASAAAPN 234
+ + I R+ LL + LL+Q + DPY VR+AY Q+ + G+ PN
Sbjct: 181 SVGKWTLEGIETRAQLLDSDGLLRQ-SSDPYIMVREAYFQRHDFIANGGELKPQENPN 237


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4559  ACRIFLAVINRP  58     1e-10    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 58.3 bits (141), Expect = 1e-10
Identities = 49/205 (23%), Positives = 82/205 (40%), Gaps = 18/205 (8%)

Query: 284 TLLAVLVILWLALRSKRMIGSVLVTLFVGLVVTAALGLAMVGSLNMISVAFMVLFVGLGV 343
+L LV+ L L++ R + + V L+ T A+ A S+N +++ MVL +GL V
Sbjct: 348 IMLVFLVMY-LFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLV 406

Query: 344 DFSIQYGVKYREERFRDERID--HALIGAAHSMGMPLALATAAVAASFFSFIPTAYRGVS 401
D +I V+ E ++++ A + + L ++A FIP A+ G S
Sbjct: 407 DDAIVV-VENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSA---VFIPMAFFGGS 462

Query: 402 E------LGLIAGVGMFVALLTTLTLLPALLRLF-----APPGESKTPGFPWLAPVDDYL 450
+ M +++L L L PAL A E+K F W D+
Sbjct: 463 TGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHS 522

Query: 451 DRHRKPILIGTLAVVIGALPLLAFL 475
H + L L + A +
Sbjct: 523 VNHYTNSVGKILGSTGRYLLIYALI 547



Score = 32.1 bits (73), Expect = 0.013
Identities = 31/182 (17%), Positives = 63/182 (34%), Gaps = 36/182 (19%)

Query: 268 DEFASVEDGAALNGVLTLLAVLVILWLALRSKRMIGSVLVTLFVGLVVTAALGLAMVGSL 327
+ + A ++ + V + L S + SV++ + +G+V L +
Sbjct: 863 YQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIV-GVLLAATLFNQ- 920

Query: 328 NMISVAFMVLFVGLG----------VDFSIQYGVKYREERFRDERIDHA---------LI 368
V FMV + V+F+ + +E + E A +
Sbjct: 921 -KNDVYFMVGLLTTIGLSAKNAILIVEFAKD--LMEKEGKGVVEATLMAVRMRLRPILMT 977

Query: 369 GAAHSMGM-PLALATAAVAASFFSFIPTAYRGVSELGLIAGVGMFVALLTTLTLLPALLR 427
A +G+ PLA++ A + + + G+ +G GM A L + +P
Sbjct: 978 SLAFILGVLPLAISNGAGSGAQNAV------GIGVMG-----GMVSATLLAIFFVPVFFV 1026

Query: 428 LF 429
+
Sbjct: 1027 VI 1028


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
Bcenmc03_4561  HTHTETR  63     3e-14    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 62.7 bits (152), Expect = 3e-14
Identities = 31/201 (15%), Positives = 73/201 (36%), Gaps = 13/201 (6%)

Query: 20 KPRTKPAEVRLEELMAAAETLFLAQGVEATTISEIVEHAQVAKGTFYHYFESKADMLAAL 79
+ + A+ + ++ A LF QGV +T++ EI + A V +G Y +F+ K+D+ + +
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 80 AQRYTASFLDQLQHAVNGCDADDWLARLRAWIRASIE----------IYAATYRTHDIVY 129
+ ++ + D + LR + +E + + + V
Sbjct: 63 WELSESNIGELELEYQAKFPGDPL-SVLREILIHVLESTVTEERRRLLMEIIFHKCEFVG 121

Query: 130 TNHHHHDRENADKNAILEQLGAILDGGVRAGAWRLAHPP-VTALLIYAGVHGATDHIIAS 188
+ +++ L + A A+++ + G ++ + +
Sbjct: 122 EMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFA 181

Query: 189 PET-DRVAFVDAVVDDCLRML 208
P++ D V L M
Sbjct: 182 PQSFDLKKEARDYVAILLEMY 202


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
Bcenmc03_4562  TCRTETB  127    5e-34    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 127 bits (320), Expect = 5e-34
Identities = 83/421 (19%), Positives = 163/421 (38%), Gaps = 15/421 (3%)

Query: 2 SRYRRAALVLAACLGTFLATLDISIVNVALPTLQTALDTDIGGLQWVINAYALALSAFML 61
S R +++ C+ +F + L+ ++NV+LP + + WV A+ L S
Sbjct: 8 SNLRHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTA 67

Query: 62 SAGPLGDRYGHKRVWLASVILFTAGSIVCACAGRIEPLL-AGRAIQGLAGALLIPGAMPI 120
G L D+ G KR+ L +I+ GS++ LL R IQG AGA P + +
Sbjct: 68 VYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQG-AGAAAFPALVMV 126

Query: 121 LTHAFPDARERARVIGGWSAFSALALIVGPLLGGLLVEHGGWQDIFLVNVPIGIVAVLLG 180
+ + R + G + A+ VGP +GG++ + W + L+ + I L
Sbjct: 127 VVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLM 186

Query: 181 TWGIPERRHPEHAAFDPFGQVLSVAWLGLLTYGLIGIGEVGTSHVKVLAPLAGAAVVFVV 240
E R H FD G +L + TS+ + + + F++
Sbjct: 187 KLLKKEVRIKGH--FDIKGIILMSVGIVFFMLFT-------TSYSISFLIV--SVLSFLI 235

Query: 241 FVRVETRVARPLLPVWLFRDRRLVRANLASFVLGFSGYSSLFFLSLFLQQAQGRAPAAAG 300
FV+ +V P + L ++ + L ++ + + + ++ + A G
Sbjct: 236 FVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIG 295

Query: 301 WQLM-PQFVMTAITSVLFGRIAARIPLRALMVAGYGLIGAMLVVMAGFGAATPYAALGVV 359
++ P + I + G + R ++ G + + + T + ++
Sbjct: 296 SVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIII 355

Query: 360 LALLGVGMGLAVPATGMTVMELAPAERAGMASATMNALRQTGMSLGIAVLGSMMSVGALH 419
+ +LG + + L E AG + +N GIA++G ++S+ L
Sbjct: 356 VFVLGGLSFTKTVISTIVSSSLKQQE-AGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLLD 414

Query: 420 R 420
+
Sbjct: 415 Q 415


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
Bcenmc03_4563  HTHFIS  71     1e-16    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 70.6 bits (173), Expect = 1e-16
Identities = 33/120 (27%), Positives = 48/120 (40%), Gaps = 1/120 (0%)

Query: 9 PAARLLLVDDHPLVRDGLRMRLEAADLSVVGEAGNADEALALAESLEPDLALMDVGMNGM 68
A +L+ DD +R L L A V NA + + DL + DV M
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 69 NGITLAGVFHERFPGIRVLMLSMHDNIEYVTQAVRAGASGYLLKDSPASEIVRAIGAVLA 128
N L + P + VL++S + +A GA YL K +E++ IG LA
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
Bcenmc03_4564  PF06580  38     6e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 37.9 bits (88), Expect = 6e-05
Identities = 17/62 (27%), Positives = 28/62 (45%), Gaps = 6/62 (9%)

Query: 397 HTVTLTIADNGCGFDAERSQADVRHGIGLRNMRERLDALGG---TLTITSQVGHTIVAAS 453
TVTL + + G ++ G GL+N+RERL L G + ++ + G
Sbjct: 290 GTVTLEVENTGSLALKNTKES---TGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVL 346

Query: 454 VP 455
+P
Sbjct: 347 IP 348


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits         Score  E-value  Comments
Bcenmc03_4569  TONBPROTEIN  30     0.024    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 29.6 bits (66), Expect = 0.024
Identities = 28/146 (19%), Positives = 44/146 (30%), Gaps = 19/146 (13%)

Query: 338 AGASGVAVSLVCADEAPQLAAIEALIRQTLRREEEPGFEAEHRVPETSATGEIIKKPKKP 397
A A ++V++V + A++ + E EP E + KPK
Sbjct: 40 APAQPISVTMVTPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPK 99

Query: 398 KKPKVAQAVPVGKQASGKKPPAQGGEGKRKAVAQHAKPAVGTGTDYTIGSPFSVQKPRSK 457
KP K Q + ++ A P T S +
Sbjct: 100 PKPVK-------------KVQEQPKRDVKPVESRPASPFENTAPARLTSS------TATA 140

Query: 458 PAGKAGTGKPAGKSAGGRGKPGKPSR 483
K T +G A R +P P+R
Sbjct: 141 ATSKPVTSVASGPRALSRNQPQYPAR 166


57  Bcenmc03_4636  Bcenmc03_4657  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
Bcenmc03_4636  5       35       -5.998405  YadA domain-containing protein
Bcenmc03_4637  5       45       -9.900121  hypothetical protein
Bcenmc03_4638  5       42       -9.386307  hypothetical protein
Bcenmc03_4640  6       42       -9.674971  PAS/PAC sensor hybrid histidine kinase
Bcenmc03_4641  7       36       -8.732165  two component LuxR family transcriptional
Bcenmc03_4643  7       34       -8.634874  hypothetical protein
Bcenmc03_4644  7       33       -7.867634  type III secretion FHIPEP protein
Bcenmc03_4645  8       36       -8.104576  flagellar biosynthesis protein FlhB
Bcenmc03_4646  7       34       -7.901587  flagellar biosynthetic protein FliR
Bcenmc03_4647  6       33       -7.873698  flagellar biosynthetic protein FliQ
Bcenmc03_4648  4       31       -6.800126  flagellar biosynthesis protein FliP
Bcenmc03_4649  4       33       -6.767784  flagellar motor switch protein FliN
Bcenmc03_4650  5       32       -6.292038  surface presentation of antigens (SPOA) protein
Bcenmc03_4651  4       30       -6.106193  flagellin domain-containing protein
Bcenmc03_4652  4       32       -5.372056  two component transcriptional regulator
Bcenmc03_4653  4       31       -4.594121  histidine kinase
Bcenmc03_4654  4       32       -4.712082  flagellar hook-basal body complex subunit FliE
Bcenmc03_4655  5       30       -4.967244  flagellar MS-ring protein
Bcenmc03_4656  3       30       -5.292538  flagellar motor switch protein G
Bcenmc03_4657  3       33       -5.840998  flagellar assembly protein FliH
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits       Score  E-value  Comments
Bcenmc03_4636  OMADHESIN  68     5e-13    Yersinia outer membrane adhesin signature.
		>OMADHESIN#Yersinia outer membrane adhesin signature.

Length = 455

Score = 67.6 bits (164), Expect = 5e-13
Identities = 63/158 (39%), Positives = 91/158 (57%), Gaps = 11/158 (6%)

Query: 309 GKDAYATG-NNIAVGMGAKADTGGA---GTGVIAIGEGAAAGRPGYS--GAIAIGYGARA 362
G +A A G ++IA+G A+A G A G G IA G + A P G A+ YGA +
Sbjct: 62 GLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAAS 121

Query: 363 VDGTQGGYLRPVAIGAGAQAQGVSVAIGPDAQSSPGNGVALGHQASVGADATWAVALGTG 422
G VAIGA A VA+G ++++ N VA+GH + V A+ +++A+G
Sbjct: 122 TAQKDG-----VAIGARASTSDTGVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDR 176

Query: 423 SSADRADTVSVGNAVSQRQIVNVAAGTQGTDAVNVAQL 460
S DR ++VS+G+ RQ+ ++AAGT+ TDAVNVAQL
Sbjct: 177 SKTDRENSVSIGHESLNRQLTHLAAGTKDTDAVNVAQL 214



Score = 63.8 bits (154), Expect = 8e-12
Identities = 58/178 (32%), Positives = 97/178 (54%), Gaps = 9/178 (5%)

Query: 1088 AIGVGTIASGANTVAIGVRSYANSDGAVAIGNMAQTGASQPNSVAIGSNVTTNGASALAV 1147
A+G+ A G+ + A ++AIG A+ A++ +VA+G+ G +++A+
Sbjct: 46 ALGLEYPVRPPVPGAGGLNASAKGIHSIAIGATAE--AAKGAAVAVGAGSIATGVNSVAI 103

Query: 1148 GSQAKANGDNAIALGNNNVMAVGEGSIAIGNKAVSAAGTTNGIALGAGANVARSVADSMA 1207
G +KA GD+A+ G + + +AIG +A +T+ + G N +S+A
Sbjct: 104 GPLSKALGDSAVTYGAASTAQ--KDGVAIGARA-----STSDTGVAVGFNSKADAKNSVA 156

Query: 1208 LGAKSSVEKGANGAVALGTGSKATRANTVSVGNTGTERQIVNVAAGTQGTDAVNVAQL 1265
+G S V ++A+G SK R N+VS+G+ RQ+ ++AAGT+ TDAVNVAQL
Sbjct: 157 IGHSSHVAANHGYSIAIGDRSKTDRENSVSIGHESLNRQLTHLAAGTKDTDAVNVAQL 214



Score = 61.1 bits (147), Expect = 5e-11
Identities = 56/162 (34%), Positives = 91/162 (56%), Gaps = 9/162 (5%)

Query: 2044 GLNTGALSNESVAIGNMAQTGSDQPFSVAIGSWATTNGAHALAIGSHAKANGENAVAVGS 2103
GLN A S+AIG A+ + +VA+G+ + G +++AIG +KA G++AV G+
Sbjct: 62 GLNASAKGIHSIAIGATAEAA--KGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGA 119

Query: 2104 NGIKAIGASSIAIGNAAEASVGATNGIALGTGASVEPNVTDAMALGANTIVDDKANGAVA 2163
+AIG A S G+A+G + + +++A+G ++ V ++A
Sbjct: 120 ASTAQ--KDGVAIGARASTS---DTGVAVGFNSKAD--AKNSVAIGHSSHVAANHGYSIA 172

Query: 2164 LGAGSKATRANTISVGSAGSERQIVNIAAGTQSTDAVNVAQL 2205
+G SK R N++S+G RQ+ ++AAGT+ TDAVNVAQL
Sbjct: 173 IGDRSKTDRENSVSIGHESLNRQLTHLAAGTKDTDAVNVAQL 214



Score = 60.7 bits (146), Expect = 7e-11
Identities = 56/162 (34%), Positives = 90/162 (55%), Gaps = 9/162 (5%)

Query: 2975 GLNTGALSNESVAIGNMAQTGSDQPFSVAIGSWVTTNGAHALAIGSHAKANGENAVAVGS 3034
GLN A S+AIG A+ + +VA+G+ G +++AIG +KA G++AV G+
Sbjct: 62 GLNASAKGIHSIAIGATAEAA--KGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGA 119

Query: 3035 NGIKAIGASSIAIGNAAEASVGATNGIALGTGASVEPNVTDAMALGANTIVDDKANGAVA 3094
+AIG A S G+A+G + + +++A+G ++ V ++A
Sbjct: 120 ASTAQ--KDGVAIGARASTS---DTGVAVGFNSKAD--AKNSVAIGHSSHVAANHGYSIA 172

Query: 3095 LGASSKATRANTISVGSAGSERQIVNIAAGTQSTDAVNVAQL 3136
+G SK R N++S+G RQ+ ++AAGT+ TDAVNVAQL
Sbjct: 173 IGDRSKTDRENSVSIGHESLNRQLTHLAAGTKDTDAVNVAQL 214



Score = 57.6 bits (138), Expect = 6e-10
Identities = 56/149 (37%), Positives = 78/149 (52%), Gaps = 20/149 (13%)

Query: 75 AVRSVAIGLNAVAGRFSQVVIGDGASASEDYAVAIGVNANGAGQYGVAVGEDASAHEAAV 134
+ S+AIG A A + + V +G G+ A+ +VAIG + G V G ++A + V
Sbjct: 69 GIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAASTAQKDGV 128

Query: 135 AIGAGAVAQDEGVAIGVRATA-VSGSVAIGHD----------------AKADRVDTVSVG 177
AIGA A D GVA+G + A SVAIGH +K DR ++VS+G
Sbjct: 129 AIGARASTSDTGVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDRENSVSIG 188

Query: 178 GGKWGPPQRQIVNVAAGTQDTDVVNVGQL 206
RQ+ ++AAGT+DTD VNV QL
Sbjct: 189 HESL---NRQLTHLAAGTKDTDAVNVAQL 214



Score = 48.4 bits (114), Expect = 4e-07
Identities = 62/168 (36%), Positives = 89/168 (52%), Gaps = 19/168 (11%)

Query: 891 AAGVAETDAVNVGQLSDATKSIRDSLSDGSLSMRYIKVKATGQAANPMGTNTVAIGAGAN 950
A A+ AV VG S AT +S++ G LS T AA+ + VAIGA A+
Sbjct: 78 TAEAAKGAAVAVGAGSIATGV--NSVAIGPLSKALGDSAVTYGAASTAQKDGVAIGARAS 135

Query: 951 ATGNGSLALGTGSRANGLNSVAIGFNS-VATD---------------ANQVSVGDIGNER 994
+ G +A+G S+A+ NSVAIG +S VA + N VS+G R
Sbjct: 136 TSDTG-VAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDRENSVSIGHESLNR 194

Query: 995 RISNVADGTEDTDAVNVNQLTEAIEKMSARTDKLSSELKSRHSSLMAN 1042
+++++A GT+DTDAVNV QL + IEK T+K S+EL + ++ N
Sbjct: 195 QLTHLAAGTKDTDAVNVAQLKKEIEKTQENTNKRSAELLANANAYADN 242



Score = 38.7 bits (89), Expect = 4e-04
Identities = 71/296 (23%), Positives = 125/296 (42%), Gaps = 15/296 (5%)

Query: 2754 GSVTLGGAGAAAPVALKNVAAGVDDTDAVNVGQLNTGLSDMKRELAEGNIDLKYIKVRAD 2813
G+ GAA V ++A GV+ +V +G L+ L D + K
Sbjct: 76 GATAEAAKGAAVAVGAGSIATGVN---SVAIGPLSKALGDSAVTYGAASTAQKDGVAIGA 132

Query: 2814 GAPATATGAQSVAIGSKALAGGPNSLALGAGARALGNG--SVALGSNSIATEPMTVSVGD 2871
A + TG VA+G + A NS+A+G + N S+A+G S +VS+G
Sbjct: 133 RASTSDTG---VAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDRENSVSIGH 189

Query: 2872 DGTERKIIHVKAGDVTAKSTDAINGSQLFDALGQLKSHVAAEQSHLLSRVNALADSGEPN 2931
+ R++ H+ AG K TDA+N +QL + + + + + LL+ NA AD+ +
Sbjct: 190 ESLNRQLTHLAAG---TKDTDAVNVAQLKKEIEKTQENTNKRSAELLANANAYADNKSSS 246

Query: 2932 SLVVVEGMGGTNTA-SLSGGDPESTTAAAIGVEAHAAGANAIALGLNTGALSNESVAIGN 2990
L + + +A +L E+ + + A +N++A A + +
Sbjct: 247 VLGIANNYTDSKSAETLENARKEAFAQSKDVLNMAKAHSNSVARTTLETAEEHANSVART 306

Query: 2991 MAQTGSDQPFSVAIGSWVTTNGAHALAIGSHA--KANGENAVAVGSNGIKAIGASS 3044
+T + + + + N +A + SH AN V V ++ KAI S+
Sbjct: 307 TLETAEEHANKKSAEALASAN-VYADSKSSHTLKTANSYTDVTVSNSTKKAIRESN 361



Score = 38.7 bits (89), Expect = 4e-04
Identities = 43/147 (29%), Positives = 61/147 (41%), Gaps = 14/147 (9%)

Query: 3509 AGQTVMAANAGNGSNNVAIGSSSTISDDAGNATAVGANSTVRAAG------------GTA 3556
A V A + G N+VAIG S D+ A GA ST + G G A
Sbjct: 85 AAVAVGAGSIATGVNSVAIGPLSKALGDS--AVTYGAASTAQKDGVAIGARASTSDTGVA 142

Query: 3557 IGAGADAHAANSTAIGQGSKARGDNSVAIGAGSVANEDNTVSFGDGSDQGKRRIVNIADG 3616
+G + A A NS AIG S ++ +I G + D S G + R++ ++A G
Sbjct: 143 VGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDRENSVSIGHESLNRQLTHLAAG 202

Query: 3617 VNASDAATKGQLDRAVGGLQGQINGVS 3643
+DA QL + + Q N S
Sbjct: 203 TKDTDAVNVAQLKKEIEKTQENTNKRS 229



Score = 37.6 bits (86), Expect = 0.001
Identities = 24/56 (42%), Positives = 33/56 (58%)

Query: 3539 NATAVGANSTVRAAGGTAIGAGADAHAANSTAIGQGSKARGDNSVAIGAGSVANED 3594
++ A+GA + A+GAG+ A NS AIG SKA GD++V GA S A +D
Sbjct: 71 HSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAASTAQKD 126



Score = 36.0 bits (82), Expect = 0.003
Identities = 27/64 (42%), Positives = 36/64 (56%)

Query: 3544 GANSTVRAAGGTAIGAGADAHAANSTAIGQGSKARGDNSVAIGAGSVANEDNTVSFGDGS 3603
G N++ + AIGA A+A + A+G GS A G NSVAIG S A D+ V++G S
Sbjct: 62 GLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAAS 121

Query: 3604 DQGK 3607
K
Sbjct: 122 TAQK 125



Score = 35.6 bits (81), Expect = 0.004
Identities = 75/385 (19%), Positives = 136/385 (35%), Gaps = 24/385 (6%)

Query: 3327 ASVTLGDAGTAVGLHNVATGAVSATSTDAVNGSQLHGMATSVANAIGGDTTVDENGQVAV 3386
A+V +G A G+++VA G +S D+ A AIG + + G
Sbjct: 85 AAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAASTAQKDGVAIGARASTSDTGVAVG 144

Query: 3387 NSIEVGGHKYATVSQAVQAAAAYGATDSLAVRYDVDSHGNPNYGSVTLGGPAAAPVTLTN 3446
+ + + + AA +G + ++ R D + + G +L LT+
Sbjct: 145 FNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDRENSVSIGHESLNR------QLTH 198

Query: 3447 VADGKSQYDAVNYGQLSSLQSDFENRLGSMDDRVSKIETTGGGSGGEARRTVSNDLISGS 3506
+A G DAVN QL ++ A N S
Sbjct: 199 LAAGTKDTDAVNVAQLKK----------EIEKTQENTNKRSAELLANANAYADNKSSSVL 248

Query: 3507 GDAGQTVMAANAGNGSNNVAIGSSSTISDDAGNATAVGANSTVRAAGGTAI-GAGADAHA 3565
G A + +A N A + S D N +NS R TA A + A
Sbjct: 249 GIANNYTDSKSAETLEN--ARKEAFAQSKDVLNMAKAHSNSVARTTLETAEEHANSVART 306

Query: 3566 ANSTAIGQGSKARGDNSVAIGAGSVANEDNTVSFGDGSDQGKRRIVNIADGVNASDAATK 3625
TA +K + + + + +T+ + V +++ + +
Sbjct: 307 TLETAEEHANKKSAEALASANVYADSKSSHTLKTANSYTD-----VTVSNSTKKAIRESN 361

Query: 3626 GQLDRAVGGLQGQINGVSRNAYSGIAAATALTMIPGVDPGKTLSFGIGSASYKGYQAVAF 3685
D L +++ + G+A++ AL + ++F G Y+ QA+A
Sbjct: 362 QYTDHKFRQLDNRLDKLDTRVDKGLASSAALNSLFQPYGVGKVNFTAGVGGYRSSQALAI 421

Query: 3686 GGEARINKNLKMKAGVGLSSGGNTV 3710
G R+N+N+ +KAGV + + +
Sbjct: 422 GSGYRVNENVALKAGVAYAGSSDVM 446



Score = 34.1 bits (77), Expect = 0.012
Identities = 26/59 (44%), Positives = 36/59 (61%)

Query: 930 ATGQAANPMGTNTVAIGAGANATGNGSLALGTGSRANGLNSVAIGFNSVATDANQVSVG 988
A G A+ G +++AIGA A A ++A+G GS A G+NSVAIG S A + V+ G
Sbjct: 60 AGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYG 118



Score = 33.7 bits (76), Expect = 0.015
Identities = 32/133 (24%), Positives = 65/133 (48%), Gaps = 4/133 (3%)

Query: 1983 AAAASGSGGGGKGPSFVTIDGMGSDGSRFNTASITTGDPESTTAAAIGVDAHAAGAN-AI 2041
AA A G+G G + V I G +++T G + + + A A+ ++ +
Sbjct: 85 AAVAVGAGSIATGVNSVAI---GPLSKALGDSAVTYGAASTAQKDGVAIGARASTSDTGV 141

Query: 2042 ALGLNTGALSNESVAIGNMAQTGSDQPFSVAIGSWATTNGAHALAIGSHAKANGENAVAV 2101
A+G N+ A + SVAIG+ + ++ +S+AIG + T+ ++++IG + +A
Sbjct: 142 AVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDRENSVSIGHESLNRQLTHLAA 201

Query: 2102 GSNGIKAIGASSI 2114
G+ A+ + +
Sbjct: 202 GTKDTDAVNVAQL 214



Score = 33.7 bits (76), Expect = 0.015
Identities = 27/66 (40%), Positives = 37/66 (56%)

Query: 2812 ADGAPATATGAQSVAIGSKALAGGPNSLALGAGARALGNGSVALGSNSIATEPMTVSVGD 2871
A G A+A G S+AIG+ A A ++A+GAG+ A G SVA+G S A V+ G
Sbjct: 60 AGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGA 119

Query: 2872 DGTERK 2877
T +K
Sbjct: 120 ASTAQK 125


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits         Score  E-value  Comments
Bcenmc03_4638  BACINVASINB  35     4e-04    Salmonella/Shigella invasin protein B signature.
		>BACINVASINB#Salmonella/Shigella invasin protein B signature.

Length = 593

Score = 34.7 bits (79), Expect = 4e-04
Identities = 37/129 (28%), Positives = 60/129 (46%), Gaps = 13/129 (10%)

Query: 3 QLNVTEGVLASSLQVLTTFDLIWIVAALVLGGMAKGITGIGVPL-VAMPIVS-----QFM 56
+ N G + L L T ++ +VAA+ GG + + +G+ + VA IV F+
Sbjct: 309 ETNRIMGCIGKVLGALLT--IVSVVAAVFTGGASLALAAVGLAVMVADEIVKAATGVSFI 366

Query: 57 -----PIRDAVLLLSMPIILGNIPQALEGGQVLATARKIAAPIAGTVFGNIAGVAILLSL 111
PI + VL M +I I +ALEG V ++A I G + IA VA+++ +
Sbjct: 367 QQALNPIMEHVLKPLMELIGKAITKALEGLGVDKKTAEMAGSIVGAIVAAIAMVAVIVVV 426

Query: 112 NSGHAQAAS 120
AA+
Sbjct: 427 AVVGKGAAA 435


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
Bcenmc03_4640  HTHFIS  80     2e-17    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 79.9 bits (197), Expect = 2e-17
Identities = 36/119 (30%), Positives = 52/119 (43%), Gaps = 3/119 (2%)

Query: 953 SGLQILVVDDHPVNRLVTKAQLERLGYTAVAVSNGMDALRVLDNSDFALILTDCAMPEMD 1012
+G ILV DD R V L R GY SN R + D L++TD MP+ +
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 1013 GYELTKRIRSREHRSRDTPIVALTANALPDEAIRCAEAGMDGLLIKPTTLAVLRDQLAH 1071
++L RI+ D P++ ++A AI+ +E G L KP L L +
Sbjct: 62 AFDLLPRIKKARP---DLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
Bcenmc03_4641  HTHFIS  50     1e-09    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 50.2 bits (120), Expect = 1e-09
Identities = 25/119 (21%), Positives = 49/119 (41%), Gaps = 4/119 (3%)

Query: 4 RVVLADDHPIMLLGCRILIEQGGLEVVGEARDSRELMSILARVACDVVITDFSMPNTGRV 63
+++ADD + + + G +V ++ L +A D+V+TD MP+
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDE--- 60

Query: 64 DGLPMLSMIRREHVALPVIVLTNMANAGLLRAMLNEGVLGIVEKGAERNELFAAVRAAL 122
+ +L I++ LPV+V++ +G + K + EL + AL
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
Bcenmc03_4644  PF04647  29     0.042    Accessory gene regulator B
		>PF04647#Accessory gene regulator B

Length = 212

Score = 29.4 bits (66), Expect = 0.042
Identities = 16/119 (13%), Positives = 36/119 (30%), Gaps = 20/119 (16%)

Query: 12 FAAPLALFTILAMVILPLPPAALDVMFTFNIVLSIVVVMV---AVTVKRP---------L 59
L +F +LA + + PA ++ + S++ ++ + L
Sbjct: 81 TLTSLLVFNVLAYIAHLIDPAYFQLLILIAFITSLLALLFLVPVDNPRNLISNTEQRKTL 140

Query: 60 DFSAF-PTVILAATLMRLTLNVASTRVVLLNGYTGASAAGQVIESFANVVIGSNFVVGL 117
++L + A G + ++F +G F+VG
Sbjct: 141 KLKTSMVLMVLFGGSIGAYRLYTHQ-------IALAILLGVLWQTFTLTALGHKFIVGW 192


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4645  TYPE3IMSPROT  292    5e-99    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 292 bits (749), Expect = 5e-99
Identities = 97/349 (27%), Positives = 178/349 (51%), Gaps = 7/349 (2%)

Query: 6 TGDKTEKATPQKLRKARMEGQVARSRDIGTCVGILVALKLIVVLTPAWLVELKHIFALSF 65
+G+KTE+ TP+K+R AR +GQVA+S+++ + I+ +++ L+ + + +
Sbjct: 2 SGEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIPA 61

Query: 66 ADLSGDDRLGNAVSMLFPAAVLLMCKMLAPL---AAVPAGIIVASLIPGGWIISHKNIMP 122
A+S + +L + PL AA+ A I + ++ G++IS + I P
Sbjct: 62 E--QSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMA--IASHVVQYGFLISGEAIKP 117

Query: 123 KLNRLNPLSGLKRLVSGKHYMQFGTTVLKALVLMATLFIVCRSNLSGFIRLQGAPLADAL 182
+ ++NP+ G KR+ S K ++F ++LK ++L ++I+ + NL ++L +
Sbjct: 118 DIKKINPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECIT 177

Query: 183 TGGANLFLSSAVTLSCIITVFALVDIPVQQIIFKRGQRMSKRDIKEEMKQSEGRPEVKSR 242
+ V + V ++ D + + + +MSK +IK E K+ EG PE+KS+
Sbjct: 178 PLLGQILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSK 237

Query: 243 IRQIQRQLARQGIRKTVPTADLVVMNPTHYAVALKYDVTRAQAPYVVAKGVDEVALFIRD 302
RQ +++ + +R+ V + +VV NPTH A+ + Y P V K D +R
Sbjct: 238 RRQFHQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRK 297

Query: 303 VARGHNVEVLELPPLARAIYHTSQVNQQIPAELYRAVAQVLSYVLQIKA 351
+A V +L+ PLARA+Y + V+ IPAE A A+VL ++ +
Sbjct: 298 IAEEEGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQNI 346


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4646  TYPE3IMRPROT  118    2e-34    Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature.

Length = 261

Score = 118 bits (298), Expect = 2e-34
Identities = 81/248 (32%), Positives = 127/248 (51%), Gaps = 1/248 (0%)

Query: 7 QLLPLANTIFWPFCRIAAALAASPILGDVMVPVRLRLLIALFLALAIQPGIPTMPVIDLL 66
Q L N FWP R+ A ++ +PIL + VP R++L +A+ + AI P +P V +
Sbjct: 8 QWLSWLNLYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPANDVP-VF 66

Query: 67 QLEGVAAMAEQVLIGGLLGFVFHLVLCALQIFGTIASSQLGLSMAQINDPMNGQMADVLT 126
+ +Q+LIG LGF A++ G I Q+GLS A DP + VL
Sbjct: 67 SFFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHLNMPVLA 126

Query: 127 SVMYVVFILLFFAVDGHLILTSVIARSFAVWPVGRFAFDLDALKHLAFAVGWIFSAAVAL 186
+M ++ +LLF +GHL L S++ +F P+G + +A L A IF + L
Sbjct: 127 RIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLIFLNGLML 186

Query: 187 ALPVMFATLVVQVGLGLLNRVAPALNIFALGFSITTMFGLLLLTLLLPSLPDHYGRMVEH 246
ALP++ L + + LGLLNR+AP L+IF +GF +T G+ L+ L+P + +
Sbjct: 187 ALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFCEHLFSE 246

Query: 247 VLELYDRL 254
+ L +
Sbjct: 247 IFNLLADI 254


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4647  TYPE3IMQPROT  53     5e-13    Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature.

Length = 86

Score = 52.8 bits (127), Expect = 5e-13
Identities = 23/79 (29%), Positives = 36/79 (45%)

Query: 9 GFAVDALRLVLVIILVLITPGLITGVLVAIFQAATQINEQTMSFLPRLITTLVALALAGP 68
AL LVL++ I G+LV +FQ TQ+ EQT+ F +L+ + L L
Sbjct: 6 FAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLFLLSG 65

Query: 69 WMTGRVMHYTVEIFSRAAQ 87
W ++ Y ++ A
Sbjct: 66 WYGEVLLSYGRQVIFLALA 84


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4648  FLGBIOSNFLIP  205    2e-68    Escherichia coli: Flagellar biosynthetic protein Fl...
		>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature.

Length = 245

Score = 205 bits (523), Expect = 2e-68
Identities = 108/242 (44%), Positives = 151/242 (62%), Gaps = 3/242 (1%)

Query: 6 LLRYLGLALAAFAMPALV---QAETLTLASDGVGGQGFTVKTQILVLMTLLGLLPALLMT 62
+ R L +A + + Q +T GGQ +++ Q LV +T L +PA+L+
Sbjct: 1 MRRLLSVAPVLLWLITPLAFAQLPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILLM 60

Query: 63 MTSFLRYVIVLSLVKQALGLQQGLPGRIVTGVALVLTMLTMRPVGEEIWQKAFVPYDQGK 122
MTSF R +IV L++ ALG P +++ G+AL LT M PV ++I+ A+ P+ + K
Sbjct: 61 MTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEEK 120

Query: 123 ISMQTALATSEQPLGRYMLAQTNKATLAQMAKLSGTEKVMDPEKQPFLVKLSAFVLSELK 182
ISMQ AL QPL +ML QT +A L A+L+ T + PE P + L A+V SELK
Sbjct: 121 ISMQEALEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSELK 180

Query: 183 TAFQMGAMLFIPFLIVDIIVASVLMAMGMMMLSPLVISLPLKLLLFVLVDGWSLTVNTLV 242
TAFQ+G +FIPFLI+D+++ASVLMA+GMMM+ P I+LP KL+LFVLVDGW L V +L
Sbjct: 181 TAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLA 240

Query: 243 TS 244
S
Sbjct: 241 QS 242


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4649  FLGMOTORFLIN  74     3e-20    Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 73.8 bits (181), Expect = 3e-20
Identities = 31/72 (43%), Positives = 47/72 (65%)

Query: 34 MRMLRRIPVRLTLEVGEATVPLADLLSYETGSTVELNRLAGEPLVIKVNGTPVGLGEVVV 93
+ ++ IPV+LT+E+G + + +LL GS V L+ LAGEPL I +NG + GEVVV
Sbjct: 54 IDLIMDIPVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVV 113

Query: 94 SGEHYGLRIIEL 105
+ YG+RI ++
Sbjct: 114 VADKYGVRITDI 125


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4650  FLGMOTORFLIN  36     3e-05    Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 36.4 bits (84), Expect = 3e-05
Identities = 26/91 (28%), Positives = 40/91 (43%), Gaps = 19/91 (20%)

Query: 194 ALDDMWLDHLFARLDAQHLRPAPDSAQANVS----------------IPVTISVHVLSKN 237
ALDD+W D L A + A D+ + IPV ++V +
Sbjct: 14 ALDDLWADAL-NEQKATTTKSAADAVFQQLGGGDVSGAMQDIDLIMDIPVKLTVELGRTR 72

Query: 238 MRLDELLRMRPGDVLPV-RLP-ETVDVLVNN 266
M + ELLR+ G V+ + L E +D+L+N
Sbjct: 73 MTIKELLRLTQGSVVALDGLAGEPLDILING 103


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits       Score  E-value  Comments
Bcenmc03_4651  FLAGELLIN  110    2e-29    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 110 bits (276), Expect = 2e-29
Identities = 59/290 (20%), Positives = 116/290 (40%), Gaps = 25/290 (8%)

Query: 6 TNAAAMNIKKAIGSTSNSLNTTMTRLGTGLRINSAKDDAAGLQIAVRLQAQTRGMGMAMQ 65
TN+ ++ + + + +SL++ + RL +GLRINSAKDDAAG IA R + +G+ A +
Sbjct: 6 TNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQASR 65

Query: 66 NTQNASSMLQTADGAMKEVTNILYRMKDLATQAADGSSSANEKTAMQAEYDALGKELSNI 125
N + S+ QT +GA+ E+ N L R+++L+ QA +G++S ++ ++Q E +E+ +
Sbjct: 66 NANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEIDRV 125

Query: 126 MKNTTYGGAKLLQKEVKNKDDGTVVSEGGRLTKEITFQIGATKDETMAADFSKHVANAHD 185
T + G K+L + ++ Q+GA ET+ D K +
Sbjct: 126 SNQTQFNGVKVLSQ-----------------DNQMKIQVGANDGETITIDLQKIDVKS-- 166

Query: 186 KFEGLSASYTGPEAGKTEEPGKELTDNANATIDLINNVLDDVGALRSAIGAAENRLAHTH 245
G +E ++ + + R + + T
Sbjct: 167 ------LGLDGFNVNGPKEATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTA 220

Query: 246 NNLANMSTNTADAEGRIMDADMASESAKMSSQQVLLQASMSMLKQTSSMN 295
+ + A D + + + + ++
Sbjct: 221 PTVPDKVYVNAANGQLTTDDAENNTAVDLFKTTKSTAGTAEAKAIAGAIK 270



Score = 75.1 bits (184), Expect = 3e-17
Identities = 58/306 (18%), Positives = 103/306 (33%), Gaps = 8/306 (2%)

Query: 6 TNAAAMNIKKAIGSTSNSLNTTMTRLGTGLRINSAKDDAAGLQIAVRLQAQTRGMGMAMQ 65
N +++ T + T ++ D A AV L T+ +
Sbjct: 202 ANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDDAENNTAVDLFKTTKSTAGTAE 261

Query: 66 NTQNASSMLQTADGAMKEVTNILYRMKDLATQAADG--SSSANEKTAMQAEYDALGKELS 123
A ++ +G + + + + +G S++ N + D +
Sbjct: 262 AKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVSTTINGEKVTLTVADITAGAAN 321

Query: 124 NIMKNTTYGGAKLLQKEVKNKDDGTVVSEGGRLTKEITFQIGATKDETMAADFSKHVANA 183
++ + + + +++ ANA
Sbjct: 322 VDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEANNAVKGESKITVNGAEYTANA 381

Query: 184 HDKFEGLSASYTGPEAGKTEEPGKELTDN------ANATIDLINNVLDDVGALRSAIGAA 237
L+ + + D + I++ L V A+RS++GA
Sbjct: 382 AGDKVTLAGKTMFIDKTASGVSTLINEDAAAAKKSTANPLASIDSALSKVDAVRSSLGAI 441

Query: 238 ENRLAHTHNNLANMSTNTADAEGRIMDADMASESAKMSSQQVLLQASMSMLKQTSSMNQM 297
+NR NL N TN A RI DAD A+E + MS Q+L QA S+L Q + + Q
Sbjct: 442 QNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSNMSKAQILQQAGTSVLAQANQVPQN 501

Query: 298 VLSLLQ 303
VLSLL+
Sbjct: 502 VLSLLR 507


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
Bcenmc03_4652  HTHFIS  61     6e-13    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 61.0 bits (148), Expect = 6e-13
Identities = 26/123 (21%), Positives = 52/123 (42%), Gaps = 4/123 (3%)

Query: 11 KPRIALLEDNVAHARTVRHWLEAAGYDAIVEYDGRRFIDRIGREKVDMLLLDWDVPGMTG 70
I + +D+ A + L AGYD + + I D+++ D +P
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 71 IDVLIDMRKRVDYLIPIVLLTQHDDERDILHGLSCGADDYLVKPISE---RMLIARVIAQ 127
D+L ++K +P+++++ + + GA DYL KP +I R +A+
Sbjct: 63 FDLLPRIKKA-RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 128 LRK 130
++
Sbjct: 122 PKR 124


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits         Score  E-value  Comments
Bcenmc03_4654  FLGHOOKFLIE  48     9e-11    Flagellar hook-basal body complex protein FliE signa...
		>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE signature.

Length = 103

Score = 48.1 bits (114), Expect = 9e-11
Identities = 25/81 (30%), Positives = 35/81 (43%), Gaps = 1/81 (1%)

Query: 27 QAPSADIAGGFADLLKQAVRRTDAQQHHADDLVTAVETGASD-DLVGAMLASQQASLSFS 85
Q FA L A+ R Q A G L M Q+AS+S
Sbjct: 23 QESLPQPTISFAGQLHAALDRISDTQTAARTQAEKFTLGEPGVALNDVMTDMQKASVSMQ 82

Query: 86 TMIQVRNKVMSAFDDIIKMQV 106
IQVRNK+++A+ +++ MQV
Sbjct: 83 MGIQVRNKLVAAYQEVMSMQV 103


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4655  FLGMRINGFLIF  298    2e-96    Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 298 bits (764), Expect = 2e-96
Identities = 154/548 (28%), Positives = 255/548 (46%), Gaps = 44/548 (8%)

Query: 30 ALSKLAPIVILAISLAALTMMLMHRQDSRYKPLFGSQEAVVAADMMAALDAEGIPYRIHP 89
A ++ IV + ++A + M++ + Y+ LF + ++A L IPYR
Sbjct: 21 ANPRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQDGGAIVAQLTQMNIPYRFAN 80

Query: 90 DSGQVLVPEQKLGAARMMLAAKGVVGKLPEGLEQVDKSDPLGVSQFVQDVRFRRGLEGEL 149
SG + VP K+ R+ LA +G+ G E +D+ G+SQF + V ++R LEGEL
Sbjct: 81 GSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQE-KFGISQFSEQVNYQRALEGEL 139

Query: 150 TQSIMALEPVSSARVHLSIAKSASFILADGDKSSASVVLTLKPNRKLNKEQIAAIVALVA 209
++I L PV SARVHL++ K + F+ + SASV +TL+P R L++ QI+A+V LV+
Sbjct: 140 ARTIETLGPVKSARVHLAMPKPSLFV-REQKSPSASVTVTLEPGRALDEGQISAVVHLVS 198

Query: 210 GSVANLDPARVTVIDQSGNHLSAQIDLVLGNSTLDSELG--AQMREQVLRNIRELLTPVL 267
+VA L P VT++DQSG+ L+ G D++L + ++ R I +L+P++
Sbjct: 199 SAVAGLPPGNVTLVDQSGHLLTQSNT--SGRDLNDAQLKFANDVESRIQRRIEAILSPIV 256

Query: 268 GDGNFRASVAVELDHDRVEETREQYGEAPKVTQEAIR------DEKDIGQAALGVPGSLS 321
G+GN A V +LD E+T E Y ++ +R E+ GVPG+LS
Sbjct: 257 GNGNVHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNISEQVGAGYPGGVPGALS 316

Query: 322 NRPAPPSTASMPEAPHSAKNAQ-----------------------TRQYAYDRNVVQIKR 358
N+PAPP+ A + P + +NAQ T Y DR + K
Sbjct: 317 NQPAPPNEAPIATPPTNQQNAQNTPQTSTSTNSNSAGPRSTQRNETSNYEVDRTIRHTKM 376

Query: 359 SPVRVKRLNVAVVLNNAAAPGG-GKAWAPAQLAQVDTILRDGLGIDADRDDALTVSSLDF 417
+ ++RL+VAVV+N G Q+ Q++ + R+ +G R D L V + F
Sbjct: 377 NVGDIERLSVAVVVNYKTLADGKPLPLTADQMKQIEDLTREAMGFSDKRGDTLNVVNSPF 436

Query: 418 RGT-PVTESQPWWKQPDNLVTIGTWAAWALGALLGFVFIFRPLLKVLRIWANGGRDPLSQ 476
P+W+Q + + W L ++ ++ + + L + Q
Sbjct: 437 SAVDNTGGELPFWQQQSFIDQLLAAGRWLLVLVVAWILWRKAVRPQLTRRVEEAKAAQEQ 496

Query: 477 GANAVSGGPEAPALAAAADADTQPLLLADANLPPIGSGADVLIAHLKHLAAQDPERVAEV 536
+ + Q A+ L GA+V+ ++ ++ DP VA V
Sbjct: 497 AQVRQETEEAVEVRLSKDEQLQQ--RRANQRL-----GAEVMSQRIREMSDNDPRVVALV 549

Query: 537 IKPWIRDD 544
I+ W+ +D
Sbjct: 550 IRQWMSND 557


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4656  FLGMOTORFLIG  173    1e-53    Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 173 bits (441), Expect = 1e-53
Identities = 85/331 (25%), Positives = 167/331 (50%), Gaps = 6/331 (1%)

Query: 23 SLAAVERAAIILLSIGEEAAAGVLRCLSREELLDVTLAMSRMQGVKVDAVQNTIERFFTN 82
+L ++AAI+L+SIG E ++ V + LS+EE+ +T +++++ + + N + F
Sbjct: 14 ALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITSELKDNVLLEFKEL 73

Query: 83 FREQSGVRGASRSFLQRSLEMALGGVVANSVLNKIYGDAIGPKMARLQWAQPQWLADRLR 142
Q ++ + + LE +LG A ++N + ++ A P + + ++
Sbjct: 74 MMAQEFIQKGGIDYARELLEKSLGTQKAVDIINNLGSALQSRPFEFVRRADPANILNFIQ 133

Query: 143 DEHVRMQAMFLVFLPPEQASRVIQALPEARREQVLLDIARLTEIDHDLLRDLEDVVDSCV 202
EH + A+ L +L P++AS ++ +LP + V IA + +++R++E V++ +
Sbjct: 134 QEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEVVREVERVLEKKL 193

Query: 203 ANLGT-QSTAVEGVRQAADIINRMPGDRTQ---MVEILRARDPELVAAVEDLIYDFAVIA 258
A+L + T+ GV +IIN DR ++E L DPEL ++ ++ F I
Sbjct: 194 ASLSSEDYTSAGGVDNVVEIINMA--DRKTEKFIIESLEEEDPELAEEIKKKMFVFEDIV 251

Query: 259 NQDDEVISVILEHVDTALWGVALKGADPAVRDALLRSMPRRAVQAFEEMLRRTEPALPSK 318
DD I +L +D ALK D V++ + ++M +RA +E + P
Sbjct: 252 LLDDRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKEDMEFLGPTRRKD 311

Query: 319 VESARREIMDIIRGLADDGDIELRLVAEEEL 349
VE ++++I+ +IR L + G+I + EE++
Sbjct: 312 VEESQQKIVSLIRKLEEQGEIVISRGGEEDV 342


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
Bcenmc03_4657  FLGFLIH  55     2e-11    Flagellar assembly protein FliH signature.
		>FLGFLIH#Flagellar assembly protein FliH signature.

Length = 228

Score = 55.2 bits (132), Expect = 2e-11
Identities = 48/191 (25%), Positives = 80/191 (41%), Gaps = 5/191 (2%)

Query: 15 IRAASEQLGEFAAPD---EVGLLSEQLHAPPTGELLDEARQAGYADGFAAGERVGAEGAR 71
I E + E A P ++ L Q H + E RQ G+ G+ G G E
Sbjct: 25 IVEPEETIIEEAEPSLEQQLAQLQMQAHEQGYQAGIAEGRQQGHKQGYQEGLAQGLEQGL 84

Query: 72 RDVRSGFDALVAPVDALVRGFQRVQQAYRAKVRSEVAKLVGDVARQVVRAELETRPERIL 131
+ +S + A + LV FQ A + + S + ++ + ARQV+ ++
Sbjct: 85 AEAKSQQAPIHARMQQLVSEFQTTLDALDSVIASRLMQMALEAARQVIGQTPTVDNSALI 144

Query: 132 AFVDEAVGTLTKPPESVSVRLNPSDYARLAQ--AAPDRVHGWQLVPDDRLEPGECRVRAD 189
+ + + +R++P D R+ A +HGW+L D L PG C+V AD
Sbjct: 145 KQIQQLLQQEPLFSGKPQLRVHPDDLQRVDDMLGATLSLHGWRLRGDPTLHPGGCKVSAD 204

Query: 190 DIEMDAGCGQR 200
+ ++DA R
Sbjct: 205 EGDLDASVATR 215


58  Bcenmc03_4669  Bcenmc03_4681  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
Bcenmc03_4669  5       27       -5.359192  mannosyl-glycoprotein
Bcenmc03_4670  5       27       -5.785672  flagella basal body P-ring formation protein
Bcenmc03_4671  7       27       -6.422977  flagellar basal-body rod protein FlgB
Bcenmc03_4672  7       26       -5.615016  flagellar basal-body rod protein FlgC
Bcenmc03_4673  7       27       -5.595163  flagellar basal body rod modification protein
Bcenmc03_4674  7       27       -5.687277  flagellar basal body FlaE domain-containing
Bcenmc03_4675  8       28       -5.361608  flagellar basal-body rod protein FlgF
Bcenmc03_4676  7       29       -5.871846  flagellar basal body rod protein FlgG
Bcenmc03_4677  6       29       -4.996515  flagellar basal body L-ring protein
Bcenmc03_4678  7       28       -5.427399  flagellar P-ring protein
Bcenmc03_4679  7       26       -5.496866  hypothetical protein
Bcenmc03_4680  8       30       -6.495526  flagellar hook-associated protein FlgK
Bcenmc03_4681  7       32       -6.848329  flagellar hook-associated protein 3
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
Bcenmc03_4669  FLGFLGJ  134    4e-40    Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 134 bits (339), Expect = 4e-40
Identities = 75/200 (37%), Positives = 98/200 (49%), Gaps = 4/200 (2%)

Query: 55 ATEPHLSSQAATWTNMMRARAEALAEPQQGVLGNGAPATGPNFADS---DQQAFLAEIMP 111
E L ++ M + Q + A N+ DS D +AFLA++
Sbjct: 99 TPEQPLPEESTPAAPMKFPLETVVRYQNQALSQLVQKAVPRNYDDSLPGDSKAFLAQLSL 158

Query: 112 HARRAGAMIGAAPELIAAHAALESGWGSKPLKNVRGETTHNLFGIKSAGGWAGESAAAVT 171
A+ A G LI A AALESGWG + ++ GE ++NLFG+K++G W G T
Sbjct: 159 PAQLASQQSGVPHHLILAQAALESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITT 218

Query: 172 TEYVNGSAVKMVDHFRAYRSYSGAFHDYAKLLRDSRRYAGVRNVGDDASAFASALKRGGY 231
TEY NG A K+ FR Y SY A DY LL + RYA V A A AL+ GY
Sbjct: 219 TEYENGEAKKVKAKFRVYSSYLEALSDYVGLLTRNPRYAAV-TTAASAEQGAQALQDAGY 277

Query: 232 ATDPAYATKLVEMVGLVKRM 251
ATDP YA KL M+ +K +
Sbjct: 278 ATDPHYARKLTNMIQQMKSI 297


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits        Score  E-value  Comments
Bcenmc03_4672  FLGHOOKAP1  30     0.002    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 30.3 bits (68), Expect = 0.002
Identities = 8/39 (20%), Positives = 17/39 (43%)

Query: 91 SNVSAVEEMADMMAASRAFSTNVEVLTRIKGMQQDLLRM 129
S V+ EE ++ + + N +VL + L+ +
Sbjct: 507 SGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits        Score  E-value  Comments
Bcenmc03_4674  FLGHOOKAP1  36     3e-04    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 35.7 bits (82), Expect = 3e-04
Identities = 15/43 (34%), Positives = 23/43 (53%)

Query: 345 RAVEQSNVDMTAELVSLMGAQQNYQANSKVLSTENEMMRALMQ 387
+ S V++ E +L QQ Y AN++VL T N + AL+
Sbjct: 502 QQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALIN 544



Score = 29.9 bits (67), Expect = 0.021
Identities = 12/35 (34%), Positives = 18/35 (51%)

Query: 2 SFNIALAGINAINGQLNQISNNIANSGTLGFKSGR 36
N A++G+NA LN SNNI++ G+
Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAGYTRQT 37


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits        Score  E-value  Comments
Bcenmc03_4675  FLGHOOKAP1  33     0.001    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 33.0 bits (75), Expect = 0.001
Identities = 10/46 (21%), Positives = 20/46 (43%)

Query: 5 LYTAMTGAEHSLRALNVRANNLSNAQTSGFRADLASVTSQAARGYG 50
+ AM+G + ALN +NN+S+ +G+ + +
Sbjct: 4 INNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGA 49


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits        Score  E-value  Comments
Bcenmc03_4676  FLGHOOKAP1  38     3e-05    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 38.0 bits (88), Expect = 3e-05
Identities = 12/49 (24%), Positives = 19/49 (38%)

Query: 208 NGIGTIKQGALEGSNVLAVEEMVEMIAAQRTYEMNTKVLSAADNMMQYL 256
N + + S V EE + Q+ Y N +VL A+ + L
Sbjct: 494 NVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDAL 542



Score = 35.7 bits (82), Expect = 2e-04
Identities = 16/74 (21%), Positives = 33/74 (44%), Gaps = 13/74 (17%)

Query: 7 ISKTGIQAQDAKLQAIANNLANVNTVGFKRDRAVFEDMFYRAERQPGAQVSDNATGPGVQ 66
+ +G+ A A L +NN+++ N G+ R + +++ G G
Sbjct: 6 NAMSGLNAAQAALNTASNNISSYNVAGYTRQTTI-------------MAQANSTLGAGGW 52

Query: 67 LGNGTRIAGTQKVF 80
+GNG ++G Q+ +
Sbjct: 53 VGNGVYVSGVQREY 66


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4677  FLGLRINGFLGH  140    1e-43    Flagellar L-ring protein signature.
		>FLGLRINGFLGH#Flagellar L-ring protein signature.

Length = 232

Score = 140 bits (354), Expect = 1e-43
Identities = 61/162 (37%), Positives = 83/162 (51%), Gaps = 6/162 (3%)

Query: 59 LTSDVRAFRAGDVLTVDLEESTQASKKSGTQVGKDS----SLSAKKPSLFGKALPVEAEL 114
L D R GD LT+ L+E+ ASK S +D L G A++
Sbjct: 65 LFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYLQGLFGNARADV 124

Query: 115 GTKSG--FNGAGSSSQQNTLRGSVTVVVQRVMPNGLLQVRGEKRLVLNQGEENVRLAGYV 172
G FNG G ++ NT G++TV V +V+ NG L V GEK++ +NQG E +R +G V
Sbjct: 125 EASGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRFSGVV 184

Query: 173 RAADIDSNNRVSSQRVANARITYAGRGSLADASQPGMLTRFF 214
I +N V S +VA+ARI Y G G + +A G L RFF
Sbjct: 185 NPRTISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFF 226


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4678  FLGPRINGFLGI  309    e-105    Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 309 bits (792), Expect = e-105
Identities = 136/372 (36%), Positives = 199/372 (53%), Gaps = 10/372 (2%)

Query: 7 TIRAAALAIVIGIALSSTIEAHAQTVGNLVDVEGVRENALVGYGIVVGLAGSGDGT-QAK 65
I AA + + + +A + ++ ++ R+N L+GYG+VVGL G+GD +
Sbjct: 6 IIAAALVFSALPFLSTPPAQADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDSLRSSP 65

Query: 66 YTTQSLTNMLKQFGTRLPENINLRSRNAAAVIVSATFPPGYRRGQKVDVTVSSLGDAKSL 125
+T QS+ ML+ G ++N AAV+V+A PP G +VDVTVSSLGDA SL
Sbjct: 66 FTEQSMRAMLQNLGITTQGG-QSNAKNIAAVMVTANLPPFASPGSRVDVTVSSLGDATSL 124

Query: 126 RGGTLLMTPLRAADGDVYALAQGNLVIPGLNVQGRSGTSVTINTPTTGRIPKGATIEREI 185
RGG L+MT L ADG +YA+AQG L++ G + QG ++T T+ R+P GA IERE+
Sbjct: 125 RGGNLIMTSLSGADGQIYAVAQGALIVNGFSAQG-DAATLTQGVTTSARVPNGAIIEREL 183

Query: 186 ATDFADTPTVRLNLKRPDFQTASSIADVINN----ALGSEVASTVDATSVDVVAPQVPSQ 241
+ F D+ + L L+ PDF TA +ADV+N G +A D+ + V P+ +
Sbjct: 184 PSKFKDSVNLVLQLRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQEIAVQKPR-VAD 242

Query: 242 RVAFVARLNALKVSKGAEVPRVVFNSRTGTVVISQGVTVKPAVVSHGSLKVTIAEGTMVS 301
+A + L V +VV N RTGT+VI V + VS+G+L V + E V
Sbjct: 243 LTRLMAEIENLTVETDT-PAKVVINERTGTIVIGADVRISRVAVSYGTLTVQVTESPQVI 301

Query: 302 QPNAFANGSTVTAPVSEIGVTQAGGNAFQWTSGASLQAIVDTITRTGATPDDLMAILQAL 361
QP F+ G T P ++I Q G G L+ +V + G D ++AILQ +
Sbjct: 302 QPAPFSRGQTAVQPQTDIMAMQEGSKVAI-VEGPDLRTLVAGLNSIGLKADGIIAILQGI 360

Query: 362 SEAGALTGDLVV 373
AGAL +LV+
Sbjct: 361 KSAGALQAELVL 372


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
Bcenmc03_4679  FLGFLGJ  36     8e-06    Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 36.2 bits (83), Expect = 8e-06
Identities = 23/97 (23%), Positives = 43/97 (44%), Gaps = 4/97 (4%)

Query: 14 ASIETAAVKAPDEAYRARVEDAAVKFEGLFIAQMLSEMKKATDQFKADNGFADRSSEAMI 73
S+ KA ++ A + A + EG+F+ ML M+ A + D F+ +
Sbjct: 16 QSLNELKAKAGEDP-AANIRPVARQVEGMFVQMMLKSMRDALPK---DGLFSSEHTRLYT 71

Query: 74 DYANRAVADAIAKQRGFGIADTLVAQMLPPDATPSKD 110
++ +A + +G G+A+ +V QM P P +
Sbjct: 72 SMYDQQIAQQMTAGKGLGLAEMMVKQMTPEQPLPEES 108


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits        Score  E-value  Comments
Bcenmc03_4680  FLGHOOKAP1  113    3e-29    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 113 bits (283), Expect = 3e-29
Identities = 77/312 (24%), Positives = 129/312 (41%), Gaps = 17/312 (5%)

Query: 1 MDRSARNTANQQTVGYTRQGVLRTARAS---------GGVDASSVIRFGDH-ANTQQKWA 50
++ ++ N ++ GYTRQ + S GV S V R D Q + A
Sbjct: 18 LNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVSGVQREYDAFITNQLRAA 77

Query: 51 SHGSVGEHRAVESYFRQLEEVMGLKDGSIKVSMGKFFGALDAASADVANSALRQQVLLAA 110
S G A +++ ++ S+ M FF +L ++ + A RQ ++ +
Sbjct: 78 QTQSSG-LTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLVSNAEDPAARQALIGKS 136

Query: 111 GGMAKSFNSVQQMMRGQLDTLRQQSAATVEQINGLSRTAAELNRLVAEAEANGGA--PSE 168
G+ F + Q +R Q + A+V+QIN ++ A LN ++ G P+
Sbjct: 137 EGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLNDQISRLTGVGAGASPNN 196

Query: 169 LIDQRDQAIDQLSALVDIRTVRQPDGTVDVSLAGGTPLIAGHQVAKMRVETLSGGTFELK 228
L+DQRDQ + +L+ +V + Q GT ++++A G L+ G ++ S
Sbjct: 197 LLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTARQLAAVPSSADPSRTT 256

Query: 229 LEL----AGTQYPVDGAKIGGELGGLSSFAKDTLLPQMEAIRSLAAELAGSFNEQVTAGF 284
+ AG + G LGG+ +F L + LA A +FN Q AGF
Sbjct: 257 VAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQLALAFAEAFNTQHKAGF 316

Query: 285 GMGGSPGKALFT 296
G G+ F
Sbjct: 317 DANGDAGEDFFA 328



Score = 63.4 bits (154), Expect = 7e-13
Identities = 36/110 (32%), Positives = 54/110 (49%), Gaps = 2/110 (1%)

Query: 317 SGDPKAPGNSDNLLKLIELRSRRVDLPGFGEASLGDAYVLLVGKLGAQSEQNQSSLAIAV 376
S + ++ N L++L+S G S DAY LV +G ++ ++S A
Sbjct: 436 SEEDAGDSDNRNGQALLDLQSN--SKTVGGAKSFNDAYASLVSDIGNKTATLKTSSATQG 493

Query: 377 NVRQRAEEAWQSLSGVSMDEEAVNFSEALQVYSANMKVISVAKELFDATI 426
NV + QS+SGV++DEE N Q Y AN +V+ A +FDA I
Sbjct: 494 NVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALI 543


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits       Score  E-value  Comments
Bcenmc03_4681  FLAGELLIN  35     2e-04    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 35.4 bits (81), Expect = 2e-04
Identities = 30/138 (21%), Positives = 58/138 (42%), Gaps = 6/138 (4%)

Query: 13 LATLRASNSKAADLTSKISTGQRVQRASDDPIAAARLLLIERDTSV---LQRYQKNIDTL 69
L S S + ++S+G R+ A DD AA + R TS L + +N +
Sbjct: 14 QNNLNKSQSSLSSAIERLSSGLRINSAKDD---AAGQAIANRFTSNIKGLTQASRNANDG 70

Query: 70 SVRLQKNEVHLDGMLDTVMAVHDSLLSAADGSRSAADLNALAAPLRMRLNNLKQAANAKD 129
Q E L+ + + + V + + A +G+ S +DL ++ ++ RL + + +N
Sbjct: 71 ISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEIDRVSNQTQ 130

Query: 130 GDGNFLFSGSQTNTAPIA 147
+G + S +
Sbjct: 131 FNGVKVLSQDNQMKIQVG 148


59  Bcenmc03_4692  Bcenmc03_4715  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias     Product
Bcenmc03_4692  3       44       -8.892909   two component LuxR family transcriptional
Bcenmc03_4693  4       44       -8.760299   histidine kinase
Bcenmc03_4696  4       45       -9.818213   4-oxalocrotonate tautomerase
Bcenmc03_4698  4       44       -8.859962   hypothetical protein
Bcenmc03_4699  4       47       -10.102885  two component LuxR family transcriptional
Bcenmc03_4701  3       42       -8.391121   multi-sensor hybrid histidine kinase
Bcenmc03_4702  5       45       -9.353708   two component LuxR family transcriptional
Bcenmc03_4703  5       46       -9.367388   hypothetical protein
Bcenmc03_4704  3       41       -7.902060   YadA domain-containing protein
Bcenmc03_4705  1       25       -6.190588   HvnC; halovibrin
Bcenmc03_4706  -1      18       -4.328495   integrase family protein
Bcenmc03_4707  0       13       -5.084528   hypothetical protein
Bcenmc03_4708  -1      10       -1.302498   transposase IS3/IS911 family protein
Bcenmc03_4709  0       8        -0.744600   integrase catalytic subunit
Bcenmc03_4710  0       10       0.451037    carbon starvation protein CstA
Bcenmc03_4711  -1      10       1.038237    hypothetical protein
Bcenmc03_4712  0       10       1.366550    hemolysin III family channel protein
Bcenmc03_4713  -1      11       1.729048    major facilitator transporter
Bcenmc03_4714  -1      11       -1.399872   hypothetical protein
Bcenmc03_4715  -1      12       -1.798437   DOPA 45-dioxygenase
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
Bcenmc03_4692  HTHFIS  52     4e-10    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 52.1 bits (125), Expect = 4e-10
Identities = 22/122 (18%), Positives = 48/122 (39%), Gaps = 7/122 (5%)

Query: 13 RINVVVADDHPVVSTGVAAILTAEMDINVVGVASTISELLLLLQQQPCDVLICDYSFSGD 72
++VADD + T + L+ +V ++ + L + D+++ D +
Sbjct: 3 GATILVADDDAAIRTVLNQALSRA-GYDVRITSNA-ATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 73 QQPDGVALFKRLRRNHPHVAIVVLTAHQDIALLVSRVMNTGVAGFLRKSSQDFARLAAIV 132
+ L R+++ P + ++V++A + G +L K D L I+
Sbjct: 61 ---NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIK-ASEKGAYDYLPKPF-DLTELIGII 115

Query: 133 RR 134
R
Sbjct: 116 GR 117


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
Bcenmc03_4693  HTHFIS  61     1e-11    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 61.4 bits (149), Expect = 1e-11
Identities = 30/121 (24%), Positives = 53/121 (43%), Gaps = 4/121 (3%)

Query: 898 SDASVLVVEDDRVSAQLMCDQLRMLGIGHVEVVCSAEDGIRRCQSRIYDLVVTDSNLPGK 957
+ A++LV +DD ++ L G V + +A R + DLVVTD +P +
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGY-DVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 958 GGAELLSDLRAAGISWPVVLCTADATLSR-VANVPFDAL--ITKPSTLSDLSHVLQNVLG 1014
+LL ++ A PV++ +A T + A + KP L++L ++ L
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 1015 P 1015

Sbjct: 121 E 121


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
Bcenmc03_4699  HTHFIS  36     2e-04    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 35.6 bits (82), Expect = 2e-04
Identities = 17/110 (15%), Positives = 46/110 (41%), Gaps = 5/110 (4%)

Query: 32 KINIVVADSYPMIVQGLRHVFSSVENMRIVAEAHTLSGMASLLSSCECDVLICDYAFGDD 91
I+VAD I L S + ++ + +++ + D+++ D
Sbjct: 3 GATILVADDDAAIRTVLNQALSR-AGYDVRITSNAATLWR-WIAAGDGDLVVTDVV---M 57

Query: 92 PGPDGMRMLETIRRNHPNVKIILLAELRDGLSVQRVLKKGVSAFVVKSSD 141
P + +L I++ P++ +++++ ++ + +KG ++ K D
Sbjct: 58 PDENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFD 107


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
Bcenmc03_4701  HTHFIS  84     7e-19    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 84.5 bits (209), Expect = 7e-19
Identities = 39/119 (32%), Positives = 56/119 (47%), Gaps = 3/119 (2%)

Query: 995 SGTRILVVDDHPINRLVIEAQLARLGYTAIAVSNGTDALHALDDSDIALVLSDCAMPDMD 1054
+G ILV DD R V+ L+R GY SN + D LV++D MPD +
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 1055 GYDLARRIRSREPRSRHIPILALTANALPDEAIRCAEAGMDGLIVKPTTLTVLREELAR 1113
+DL RI+ P +P+L ++A AI+ +E G + KP LT L + R
Sbjct: 62 AFDLLPRIKKARP---DLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
Bcenmc03_4702  HTHFIS  53     2e-10    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 52.9 bits (127), Expect = 2e-10
Identities = 24/119 (20%), Positives = 50/119 (42%), Gaps = 4/119 (3%)

Query: 4 RVVLADDHPIMLLGCRLLIEQNGMEVVGEARDSSELMSILAHVACDVVITDFSMPNTGRA 63
+++ADD + + + G +V +++ L +A D+V+TD MP
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMP---DE 60

Query: 64 DGLAMLSTLRREHGALPVIVLTNMANAGLLRAMLNEGVLGIVEKGAEKSELFAAVRTAL 122
+ +L +++ LPV+V++ +G + K + +EL + AL
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits       Score  E-value  Comments
Bcenmc03_4704  OMADHESIN  61     1e-11    Yersinia outer membrane adhesin signature.
		>OMADHESIN#Yersinia outer membrane adhesin signature.

Length = 455

Score = 61.1 bits (147), Expect = 1e-11
Identities = 50/163 (30%), Positives = 93/163 (57%), Gaps = 9/163 (5%)

Query: 556 GMRTIANSDNSVAIGNMAQTGSEQPYSVAIGSHVTTNGASALAIGSQARANGENAIAVGN 615
G+ A +S+AIG A+ + + +VA+G+ G +++AIG ++A G++A+ G
Sbjct: 62 GLNASAKGIHSIAIGATAE--AAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGA 119

Query: 616 NNVHAIGESSIAIGNGAEAVVGATNGIALGTGASVARNVTDAMALGAKAFVEKRANGAVA 675
+ + +AIG A + G+A+G + + +++A+G + V ++A
Sbjct: 120 ASTAQ--KDGVAIGARAST---SDTGVAVGFNSKA--DAKNSVAIGHSSHVAANHGYSIA 172

Query: 676 LGAGSQASRANTISVGNAGSERQIVNVAAGTQGTDAVNVAQLQ 718
+G S+ R N++S+G+ RQ+ ++AAGT+ TDAVNVAQL+
Sbjct: 173 IGDRSKTDRENSVSIGHESLNRQLTHLAAGTKDTDAVNVAQLK 215



Score = 47.6 bits (112), Expect = 2e-07
Identities = 50/148 (33%), Positives = 73/148 (49%), Gaps = 6/148 (4%)

Query: 347 AAGSADTDAVNVGQLTAATQPIRDELADVSLAMKSIQIKPQGVDAIAAGTNA----VAVG 402
AA + ++ G + A P+ L D ++ + + AI A + VAVG
Sbjct: 85 AAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAASTAQKDGVAIGARASTSDTGVAVG 144

Query: 403 AGANAMANGSIALGAGSRVTGLN--SVAIGINSVALETNQVSVGDVGRERRISNLAAGTK 460
+ A A S+A+G S V + S+AIG S N VS+G R++++LAAGTK
Sbjct: 145 FNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDRENSVSIGHESLNRQLTHLAAGTK 204

Query: 461 GTDAVNLNQLSDAIGKVSSRTDKLSFDL 488
TDAVN+ QL I K T+K S +L
Sbjct: 205 DTDAVNVAQLKKEIEKTQENTNKRSAEL 232



Score = 45.3 bits (106), Expect = 1e-06
Identities = 35/97 (36%), Positives = 53/97 (54%), Gaps = 4/97 (4%)

Query: 386 PQGVDAIAAGTNAVAVGAGANAMANGSIALGAGSRVTGLNSVAIGINSVALETNQVSVGD 445
G++A A G +++A+GA A A ++A+GAGS TG+NSVAIG S AL + V+ G
Sbjct: 60 AGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGA 119

Query: 446 VGRERR----ISNLAAGTKGTDAVNLNQLSDAIGKVS 478
++ I A+ + AV N +DA V+
Sbjct: 120 ASTAQKDGVAIGARASTSDTGVAVGFNSKADAKNSVA 156



Score = 44.9 bits (105), Expect = 1e-06
Identities = 45/128 (35%), Positives = 67/128 (52%), Gaps = 9/128 (7%)

Query: 865 IGTGSGSV----INSDAGDTTAIGANSTQQASSGTAIGAGADARAINSTAIGQAASAHGE 920
I TG SV ++ GD+ ++ G AIGA A + A+G + A +
Sbjct: 94 IATGVNSVAIGPLSKALGDSAVTYGAASTAQKDGVAIGARAST-SDTGVAVGFNSKADAK 152

Query: 921 NSTAVGQGATAWGNN--SIAIGAGSVADADNAVSFGNSATGMTRTLTNVSAGVAPTDAVN 978
NS A+G + N+ SIAIG S D +N+VS G+ + + R LT+++AG TDAVN
Sbjct: 153 NSVAIGHSSHVAANHGYSIAIGDRSKTDRENSVSIGHES--LNRQLTHLAAGTKDTDAVN 210

Query: 979 VQQLDESV 986
V QL + +
Sbjct: 211 VAQLKKEI 218



Score = 40.7 bits (94), Expect = 3e-05
Identities = 28/62 (45%), Positives = 38/62 (61%)

Query: 898 GAGADARAINSTAIGQAASAHGENSTAVGQGATAWGNNSIAIGAGSVADADNAVSFGNSA 957
G A A+ I+S AIG A A + AVG G+ A G NS+AIG S A D+AV++G ++
Sbjct: 62 GLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAAS 121

Query: 958 TG 959
T
Sbjct: 122 TA 123



Score = 36.8 bits (84), Expect = 5e-04
Identities = 84/389 (21%), Positives = 142/389 (36%), Gaps = 39/389 (10%)

Query: 655 TDAMALGAKAFVEKRANGAVALGAGSQASRANTISVG---NAGSERQIVNVAAGTQGTDA 711
++A+GA A K A AVA+GAGS A+ N++++G A + + AA T D
Sbjct: 70 IHSIAIGATAEAAKGA--AVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAASTAQKDG 127

Query: 712 VNVAQLQGAAASLASVIGGDTTVDGSGHVAIHSIEVSGHKYATVSAAVQAAAAYGATDSL 771
V + + + + +G ++ D VAI GH + AA +G + ++
Sbjct: 128 VAIGA-RASTSDTGVAVGFNSKADAKNSVAI------GH-------SSHVAANHGYSIAI 173

Query: 772 AVRYDLDSHGNPNYGSVTLGGPSAAPVMLTNVADGKSRYDAVNYGQLSSLQSDFENRMGA 831
R D + + G +L LT++A G DAVN QL
Sbjct: 174 GDRSKTDRENSVSIGHESLNR------QLTHLAAGTKDTDAVNVAQLKK----------- 216

Query: 832 MDDRVSKIETDTGDSRDDSRVMTMANLRRDNNDIGTGSGSVINSDAGDTTAIGANSTQQA 891
+ K + +T + A ++ + + + +S + +T
Sbjct: 217 ---EIEKTQENTNKRSAELLANANAYADNKSSSVLGIANNYTDSKSAETLENARKEAFAQ 273

Query: 892 SSGTAIGAGADARAINSTAIGQAASAHGENSTAVGQGATAWGNNSIAIGAGSVADADNAV 951
S A A + ++ T + A + + A N A S ++
Sbjct: 274 SKDVLNMAKAHSNSVARTTLETAEEHANSVARTTLETAEEHANKKSAEALASANVYADSK 333

Query: 952 SFGNSATGMTRTLTNVSAGVAPTDAVNVQQLDESVGGLRSQIEHDRADANGGTASAVAIA 1011
S T + T VS + Q D L ++++ + G AS+ A+
Sbjct: 334 SSHTLKTANSYTDVTVSNSTKKAIRESNQYTDHKFRQLDNRLDKLDTRVDKGLASSAALN 393

Query: 1012 SLPQAPAPGKSVVAVGGGTYAGQSALAVG 1040
SL Q GK G G Y ALA+G
Sbjct: 394 SLFQPYGVGKVNFTAGVGGYRSSQALAIG 422



Score = 36.4 bits (83), Expect = 6e-04
Identities = 26/96 (27%), Positives = 46/96 (47%)

Query: 884 GANSTQQASSGTAIGAGADARAINSTAIGQAASAHGENSTAVGQGATAWGNNSIAIGAGS 943
G N++ + AIGA A+A + A+G + A G NS A+G + A G++++ GA S
Sbjct: 62 GLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAAS 121

Query: 944 VADADNAVSFGNSATGMTRTLTNVSAGVAPTDAVNV 979
A D ++T T ++ ++V +
Sbjct: 122 TAQKDGVAIGARASTSDTGVAVGFNSKADAKNSVAI 157


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4706  MPTASEINHBTR  29     0.024    Metalloprotease inhibitor signature.
		>MPTASEINHBTR#Metalloprotease inhibitor signature.

Length = 122

Score = 28.8 bits (64), Expect = 0.024
Identities = 11/33 (33%), Positives = 16/33 (48%)

Query: 438 DQYLVQRGLPIAPARWNPATPIIASLEADGTGI 470
D ++ L P W+P I + A+GTGI
Sbjct: 63 DVACAEQWLGDKPVSWSPTPDGIWLMNAEGTGI 95


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4710  ACRIFLAVINRP  30     0.035    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 30.2 bits (68), Expect = 0.035
Identities = 10/70 (14%), Positives = 25/70 (35%)

Query: 165 FGAFLIMVIILAVLALIVVKALTNSPWGTFTVAATIPIALFMGVYTRYIRPGRIGEVSII 224
+V I V+ + + AL S +V +P+ + + + + ++
Sbjct: 869 GNQAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMV 928

Query: 225 GFIGLMAAIA 234
G + + A
Sbjct: 929 GLLTTIGLSA 938


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
Bcenmc03_4712  PF06580  30     0.007    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 29.8 bits (67), Expect = 0.007
Identities = 20/95 (21%), Positives = 29/95 (30%), Gaps = 8/95 (8%)

Query: 101 WSLFGVSWGLAVFGIVQELTLGRRTRLLSMILYV---LMGWLALVAVRPLIHALP----- 152
W G+ WG+ +L +L SMI + LMG + A R I
Sbjct: 13 WYCQGIGWGVYTLTGFGFASLYGSPKLHSMIFNIAISLMGLVLTHAYRSFIKRQGWLKLN 72

Query: 153 PIGTAWLVAGGVIYSAGIYFFINDERIRHGHGIWH 187
V + ++F N R I
Sbjct: 73 MGQIILRVLPACVVIGMVWFVANTSIWRLLAFINT 107


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
Bcenmc03_4713  TCRTETB  123    2e-33    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 123 bits (310), Expect = 2e-33
Identities = 72/363 (19%), Positives = 138/363 (38%), Gaps = 13/363 (3%)

Query: 13 RVLAATCVSYMLVLLDASIVNVALTDIAHTFGSRVAGLQWIVNAYTLAFASLLLTGGTLG 72
++L C+ +L+ ++NV+L DIA+ F A W+ A+ L F+ G L
Sbjct: 14 QILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLS 73

Query: 73 DRLGDRTVYVAGLAVFVTASALCGVAPT-LPALAVARALQGVGSAMLVPCSLALINRAFP 131
D+LG + + + G+ + S + V + L +AR +QG G+A P + ++ +
Sbjct: 74 DQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAF-PALVMVVVARYI 132

Query: 132 EPAARASAISVWMGCGGIAMASGPLIGGLLIDLSGWRSLFFVNLPLGLAGIWLGRTVAPA 191
R A + + GP IGG++ W L + + + + + +
Sbjct: 133 PKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPM-ITIITVPFLMKLLKK 191

Query: 192 AVDRSRQFDWGGQAAAIVAIGALIGTLIEGPSLGWRSAPIVGGAVASVVAWIAFIAIEAR 251
V FD G V I + L S I + SV++++ F+ +
Sbjct: 192 EVRIKGHFDIKGIILMSVGIVFFM--------LFTTSYSI-SFLIVSVLSFLIFVKHIRK 242

Query: 252 RREPMLPLAFFRNRLFAGSTFVSMASAFVFYGLLFVLSLFYRQVRGASPLDTGLAFL-PM 310
+P + +N F G + ++ + V S + G + P
Sbjct: 243 VTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPG 302

Query: 311 TAMVALGGLTSGRIVARFGARGTMCAAFGLYAAGALGMTAIGATTPAWLAVAPMLAIGFA 370
T V + G G +V R G + + L + + TT ++ + + +G
Sbjct: 303 TMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGL 362

Query: 371 SGS 373
S +
Sbjct: 363 SFT 365


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
Bcenmc03_4714  PF06340  28     0.017    Vibrio cholerae toxin co-regulated pilus biosynthesis pr...
		>PF06340#Vibrio cholerae toxin co-regulated pilus biosynthesis protein F (TcpF)

Length = 338

Score = 27.7 bits (61), Expect = 0.017
Identities = 12/34 (35%), Positives = 19/34 (55%), Gaps = 1/34 (2%)

Query: 3 PAD-TDADARRIHEDWHAAVVARDFDALMSLYAD 35
+ +D R ED+ A ++A D+D+L LY D
Sbjct: 84 SSTESDGAKTRTKEDFSARLLAGDYDSLQKLYID 117


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4715  ENTEROTOXINA  27     0.025    Heat-labile enterotoxin A chain signature.
		>ENTEROTOXINA#Heat-labile enterotoxin A chain signature.

Length = 258

Score = 26.5 bits (58), Expect = 0.025
Identities = 17/57 (29%), Positives = 23/57 (40%), Gaps = 10/57 (17%)

Query: 43 LGRFHERLVGPHPAWSYQIAFDAARFDDIVPWLVLNHGALDIFLHPNTHDELRDHRD 99
LG + PHP A + I W +N G +D LH N R++RD
Sbjct: 119 LGVY-----SPHPYEQEVSALGGIPYSQIYGWYRVNFGVIDERLHRN-----REYRD 165


60  Bcenmc03_4738  Bcenmc03_4745  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
Bcenmc03_4738  -2      10       1.792276   phospholipase C, phosphocholine-specific
Bcenmc03_4739  -2      9        1.044184   GCN5-related N-acetyltransferase
Bcenmc03_4740  -3      10       0.965689   thioredoxin domain-containing protein
Bcenmc03_4741  -2      10       -0.319521  hypothetical protein
Bcenmc03_4742  -2      10       0.598491   citrate transporter
Bcenmc03_4743  -2      12       0.317726   putative sigma54 specific transcriptional
Bcenmc03_4744  -2      13       -0.357292  hypothetical protein
Bcenmc03_4745  -3      11       -0.237212  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits         Score  E-value  Comments
Bcenmc03_4738  STREPTOPAIN  33     0.005    Streptopain (C10) cysteine protease family signature.
		>STREPTOPAIN#Streptopain (C10) cysteine protease family signature.

Length = 398

Score = 32.7 bits (74), Expect = 0.005
Identities = 36/202 (17%), Positives = 65/202 (32%), Gaps = 36/202 (17%)

Query: 91 YTKATPAATVTPYHLDARQGNAQRAGGTPHTWADAQAAWDHGRMNRWPDAKTPLSMGYYD 150
Y + P +TP + G G T A A + + +P+ K Y
Sbjct: 160 YNQGNPYNLLTPVIEKVKPGEQSFVGQHAATGCVATATAQIMKYHNYPN-KGLKDYTYTL 218

Query: 151 AAEVPFQRALADAFTLCDHYHCGMHTGTIANRLFYWSGTNG--PNGISPADGSRVNIAAL 208
++ P+ + F I+ R + W+ S ++
Sbjct: 219 SSNNPYFNHPKNLF------------AAISTRQYNWNNILPTYSGRESNVQKMAISELMA 266

Query: 209 NNQFNGGNDIGPSSQGWTWTTYADRLQQAGVNWKVYQSLIDNFGCNEMMSFRH------- 261
+ + D GPSS + R+Q+A L +NFG N+ + +
Sbjct: 267 DVGISVDMDYGPSSGS----AGSSRVQRA---------LKENFGYNQSVHQINRGDFSKQ 313

Query: 262 -WRAAIEQMPAARRPAYVASTD 282
W A I++ + +P Y
Sbjct: 314 DWEAQIDKELSQNQPVYYQGVG 335


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_4739  SACTRNSFRASE  40     1e-06    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 39.9 bits (93), Expect = 1e-06
Identities = 25/120 (20%), Positives = 44/120 (36%), Gaps = 10/120 (8%)

Query: 36 PSLEKWQKRLDSVR----EHGMSLVAEIDGTVVGHLGLHPEPNPRRRHAAALGMMVDAAR 91
P ++++ V E + + ++ +G + + N +A + V
Sbjct: 45 PYFKQYEDDDMDVSYVEEEGKAAFLYYLENNCIGRIKIRSNWN---GYALIEDIAVAKDY 101

Query: 92 HGRGIGGRLLAAAIDLAENWLNITRLELTVFTDNRAAIALYEKHGFRIEGESPDYALRDG 151
+G+G LL AI+ A+ + L L N +A Y KH F I D L
Sbjct: 102 RKKGVGTALLHKAIEWAKE-NHFCGLMLETQDINISACHFYAKHHFIIGAV--DTMLYSN 158


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
Bcenmc03_4743  HTHFIS  369    e-126    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 369 bits (949), Expect = e-126
Identities = 159/488 (32%), Positives = 229/488 (46%), Gaps = 37/488 (7%)

Query: 31 SAGTV-VVDRDARVVWMNERYAARFGFADPQQAVGLDCEAVIPNSLMREVVSTGQP--IL 87
+ T+ V D DA + + + +R G D + + ++ G ++
Sbjct: 2 TGATILVADDDAAIRTVLNQALSR---------AGYDVRITSNAATLWRWIAAGDGDLVV 52

Query: 88 LDIMETGREPLVV--------TRLPL-----KNEAGETVGAIGFALFDQLKTLTPIFSRY 134
D++ + LP+ +N + A +D L +
Sbjct: 53 TDVVMPDENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYL-PKPFDLTEL 111

Query: 135 AQLQQQ-LIATQRSLAQARRAKYTFASFVGTSAASLETKRQARRAAQVDSPVLLLGETGT 193
+ + L +R ++ VG SAA E R R Q D +++ GE+GT
Sbjct: 112 IGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGT 171

Query: 194 GKELLAHAIHAASARALQPLVTVNVAAIPDTLLETEFFGAAPGAYTGADRKGRVGKFELA 253
GKEL+A A+H R P V +N+AAIP L+E+E FG GA+TGA + G+FE A
Sbjct: 172 GKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTR-STGRFEQA 230

Query: 254 DRGTLFLDEIGDMPLPLQGKLLRVLQDKEFEPVGSNRIVRADVRIIAATSADLPALVAAG 313
+ GTLFLDEIGDMP+ Q +LLRVLQ E+ VG +R+DVRI+AAT+ DL + G
Sbjct: 231 EGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQG 290

Query: 314 RFRADLYYRLNVLTIHAPPLRERASDIAALVYATLEELAAQHGRAAHCELTDDALRMLCA 373
FR DLYYRLNV+ + PPLR+RA DI LV +++ A + G +AL ++ A
Sbjct: 291 LFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQ-AEKEGLDVKR-FDQEALELMKA 348

Query: 374 YPWPGNVRELRNTLERALMLSDRSVIDARALAPFLG------PVRISPDGAAPAQRVGAV 427
+PWPGNVREL N + R L + VI + L P+ + + AV
Sbjct: 349 HPWPGNVRELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAV 408

Query: 428 PAEQRAVDAPTASAAPA-ASYADALAAWERQFLTDALAACDGKVVDAAARIGIGRATLYK 486
R A A P Y LA E + AL A G + AA +G+ R TL K
Sbjct: 409 EENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRK 468

Query: 487 KLAALGID 494
K+ LG+
Sbjct: 469 KIRELGVS 476


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
Bcenmc03_4745	CHLAMIDIAOM6	25	0.044	Chlamydia cysteine-rich outer membrane protein 6 si...
		>CHLAMIDIAOM6#Chlamydia cysteine-rich outer membrane protein 6 signature.

Length = 547

Score = 24.7 bits (53), Expect = 0.044
Identities = 13/35 (37%), Positives = 16/35 (45%), Gaps = 3/35 (8%)

Query: 5 VAGLGGCGG---GAPLFTSDGRPTTQVQCTGNDWS 36
+A + CGG A + T P QV G DWS
Sbjct: 293 IATVSYCGGHKNTASVTTVINEPCVQVSIAGADWS 327


61	Bcenmc03_4813	Bcenmc03_4820	N	Y	N	Pathogenicity Island (unbiased-composition)
LocusTag	DNBias	CDNBias	%GCBias	Product
Bcenmc03_4813-2111.130849TetR family transcriptional regulator
Bcenmc03_4814-290.621108short chain dehydrogenase
Bcenmc03_4815-291.042724trans-2-enoyl-CoA reductase
Bcenmc03_4816-171.546517transcriptional regulators-like protein
Bcenmc03_4817082.568708NAD-dependent epimerase/dehydratase
Bcenmc03_4818-171.835377integral membrane protein-like protein
Bcenmc03_4819072.027560putative thiol-disulfide oxidoreductase DCC
Bcenmc03_4820-282.270241hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
Bcenmc03_4813	HTHTETR	61	8e-14	TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 60.8 bits (147), Expect = 8e-14
Identities = 42/205 (20%), Positives = 67/205 (32%), Gaps = 15/205 (7%)

Query: 1 MGVSRQQAAENRHAIVAAAERLFRLRGVDAVGLTELMKEAGFTQGGFYNHFKSKDALVAE 60
++Q+A E R I+ A RLF +GV + L E+ K AG T+G Y HFK K L +E
Sbjct: 2 ARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61

Query: 61 VMDKAMQ------DRADSPNAGSVAKQVTAYLSGAHRDNVEGG---------CPLSGFAG 105
+ + + + G + L V F G
Sbjct: 62 IWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVG 121

Query: 106 DAPRLTDAARACYTRGVAAYLERLERMVATEGSVAADARDDAIAVLSQMVGALVLSRAVA 165
+ + A R + L+ + + A A ++ + L+ + A
Sbjct: 122 EMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFA 181

Query: 166 GTDPALADEILDAARRALVGQPDDP 190
L E D L P
Sbjct: 182 PQSFDLKKEARDYVAILLEMYLLCP 206


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
Bcenmc03_4814	DHBDHDRGNASE	64	2e-14	2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 63.9 bits (155), Expect = 2e-14
Identities = 52/208 (25%), Positives = 88/208 (42%), Gaps = 17/208 (8%)

Query: 3 IEGAVVFITGANRGLGLEFAKQALERGARKVYAGARDP-------ASVTLPGVVP--VKL 53
IEG + FITGA +G+G A+ +GA + A +P +S+
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGA-HIAAVDYNPEKLEKVVSSLKAEARHAEAFPA 64

Query: 54 DVTDPAAVAA-----AADAARDVTLLINNAGIARLGSLTDDGAVDALRAHLETNVFGMLA 108
DV D AA+ + + +L+N AG+ R G L + + A N G+
Sbjct: 65 DVRDSAAIDEITARIEREMGP-IDILVNVAGVLRPG-LIHSLSDEEWEATFSVNSTGVFN 122

Query: 109 MSRAFAGTLAAHGGGAILNILSVASWVNRPILSGYGVSKSAAWALTNGLRHSLREQHTQV 168
SR+ + + G+I+ + S + V R ++ Y SK+AA T L L E + +
Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 169 VGLHAGFIDTDLTAGLDVPKATPADVVR 196
+ G +TD+ L + V++
Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIK 210


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
Bcenmc03_4817	NUCEPIMERASE	45	4e-07	Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 44.8 bits (106), Expect = 4e-07
Identities = 62/292 (21%), Positives = 100/292 (34%), Gaps = 62/292 (21%)

Query: 20 TVLVCGANGFIGRALCAQLEAGGHRVLRGVRHAAGPYDVAIDFAH--------------D 65
LV GA GFIG + +L GH+V+ G+ + YDV++ A D
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVV-GIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 66 VDPHAWLARL---KGVDVVINAVGMLADRR----GATLDAVHRAAPSALFTACCRAGVRR 118
+ + L + V + LA R + + C ++
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120

Query: 119 VIQISALGVERGDTR----------------YFASKQAADRFLQTLP----IDFRIVRPA 158
++ S+ V G R Y A+K+A + T + +R
Sbjct: 121 LLYASSSSV-YGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFF 179

Query: 159 LVYGAAG----ASARFFR-MLASLPVHVLPAGGHQRLRPVHVDDLAEVVARLVMQPSDSP 213
VYG G A +F + ML + V G +R ++DD+AE + RL +
Sbjct: 180 TVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKR-DFTYIDDIAEAIIRLQDVIPHAD 238

Query: 214 PARAR------------RVIDVVGRDEVEYREMLAAYRAALGFPPAARVTLP 253
RV ++ VE + + A ALG A + LP
Sbjct: 239 TQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALG-IEAKKNMLP 289


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
Bcenmc03_4820	NUCEPIMERASE	44	5e-07	Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 44.4 bits (105), Expect = 5e-07
Identities = 60/320 (18%), Positives = 103/320 (32%), Gaps = 76/320 (23%)

Query: 187 LVTGGTGFIGETLVNQLLDAGQTVTLL---------ARDPLRAAYL-------FQGRVRS 230
LVTG GFIG + +LL+AG V + + R L + +
Sbjct: 4 LVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDLAD 63

Query: 231 VTSIDQLQPHERFDTVVNLAGAPVLGARWSKRRQALLLASRVGVTQALMRWVETAEVKPR 290
+ L F+ V L R+S S + ++ +++
Sbjct: 64 REGMTDLFASGHFERVFISPHR--LAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQHL 121

Query: 291 TWIQASAIGYYGVRPG-----DERLDEGSS--AGTGFMSDLCRRWEAAAEPL-----ERH 338
+ +S++ YG+ D+ +D S A T + A E + +
Sbjct: 122 LYASSSSV--YGLNRKMPFSTDDSVDHPVSLYAAT----------KKANELMAHTYSHLY 169

Query: 339 GVRAVVLRLGIVFGPGGALRPMLLPHYFG---LGGR----FGDGAQVMSWIHRDDVLRIV 391
G+ A LR V+GP G RP + F L G+ + G + + DD+ +
Sbjct: 170 GLPATGLRFFTVYGPWG--RPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAI 227

Query: 392 ARAMSNP------------------GMHGVYN--AVAPVPLTQRAFVQVVTKVLRRPA-- 429
R + VYN +PV L ++Q + L A
Sbjct: 228 IRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMD--YIQALEDALGIEAKK 285

Query: 430 -FLHMPAAPLRAAMGEMAEL 448
L + + + L
Sbjct: 286 NMLPLQPGDVLETSADTKAL 305


62	Bcenmc03_4864	Bcenmc03_4879	N	Y	N	Pathogenicity Island (unbiased-composition)
LocusTag	DNBias	CDNBias	%GCBias	Product
Bcenmc03_4864-1111.547565TetR family transcriptional regulator
Bcenmc03_4865-1101.174580short-chain dehydrogenase/reductase SDR
Bcenmc03_4866-1100.330221LysR family transcriptional regulator
Bcenmc03_4867-1110.475607beta-lactamase
Bcenmc03_4868-1130.776030peroxidase-like protein
Bcenmc03_4869-1120.524566quinone oxidoreductase
Bcenmc03_4870012-0.632824TetR family transcriptional regulator
Bcenmc03_4871-211-0.754785IclR family transcriptional regulator
Bcenmc03_4872-112-0.262799dimethylmenaquinone methyltransferase
Bcenmc03_4873-211-0.592645D-isomer specific 2-hydroxyacid dehydrogenase
Bcenmc03_4874-38-0.396259SMP-30/gluconolaconase/LRE domain-containing
Bcenmc03_4875-27-0.449802major facilitator transporter
Bcenmc03_4876-29-0.021080porin
Bcenmc03_4877-29-0.175501amidohydrolase 2
Bcenmc03_4878-211-0.795239major facilitator transporter
Bcenmc03_4879-113-0.958066major facilitator transporter
ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
Bcenmc03_4864	HTHTETR	48	2e-09	TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 48.5 bits (115), Expect = 2e-09
Identities = 35/189 (18%), Positives = 67/189 (35%), Gaps = 10/189 (5%)

Query: 3 RPRSPDKHDAILAAAARALAEDGASATTAR-IAKLAGVAEGTVFTYFETKDALLNALYLS 61
+ + + IL A R ++ G S+T+ IAK AGV G ++ +F+ K L + ++
Sbjct: 6 KQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWEL 65

Query: 62 LKAGLREAMMTGFPE-HAPAEQAVRHAWNGYVSWGVANPDGRRAL-------QQLGVSGR 113
++ + E + + +R + V R + + +G
Sbjct: 66 SESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAV 125

Query: 114 IDDAHRAAGAEGFGGIGSLLREQVGASGTLNRDEAHAFCSALFTSIAETAMESIARDPAR 173
+ A R E + I L+ + A + I+ ME+ P
Sbjct: 126 VQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGL-MENWLFAPQS 184

Query: 174 ADAYREAGF 182
D +EA
Sbjct: 185 FDLKKEARD 193


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
Bcenmc03_4865	DHBDHDRGNASE	99	1e-26	2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 99.0 bits (246), Expect = 1e-26
Identities = 51/188 (27%), Positives = 84/188 (44%), Gaps = 9/188 (4%)

Query: 3 KVWLVTGAARGLGRAISEAVLAAGDRLVAGARDPARLADLAE------RYGDRLLPVELD 56
K+ +TGAA+G+G A++ + + G + A +P +L + R+ + D
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAF---PAD 65

Query: 57 VTDEAAAAQAVSAARAAFGRIDVLVNNAGYGHTAPFEQMSADAFRDQIETNLFGVINLTR 116
V D AA + + G ID+LVN AG +S + + N GV N +R
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 117 AVLPTMRAQRAGHIFQVSSVGGRTSTPGLSAYQAAKWAVGGFSDVLAKEAAPFGVRVCTL 176
+V M +R+G I V S ++AY ++K A F+ L E A + +R +
Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185

Query: 177 EPGGMRTE 184
PG T+
Sbjct: 186 SPGSTETD 193


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
Bcenmc03_4869	NUCEPIMERASE	34	6e-04	Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 34.0 bits (78), Expect = 6e-04
Identities = 15/53 (28%), Positives = 24/53 (45%)

Query: 151 IVVTGAAGGVGSVATALLARLGYRVVAVTGRPADADYLRQLGAAEILDRGQFS 203
+VTGAAG +G + L G++VV + D + E+L + F
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQ 55


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
Bcenmc03_4870	HTHTETR	78	4e-20	TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 77.7 bits (191), Expect = 4e-20
Identities = 30/180 (16%), Positives = 58/180 (32%), Gaps = 1/180 (0%)

Query: 2 LLRTGLEILTEKGFSATGLDEILGRAGVPKGSFYHYFDSKEAFGLKLIDRYAEFFARKLD 61
+L L + +++G S+T L EI AGV +G+ Y +F K ++ +
Sbjct: 16 ILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELEL 75

Query: 62 RHFSQLERSPLARVRAFVDDARDGMARHAYNRGCL-IGNLGQEMGTLPESFRARLRATFE 120
+ ++ PL+ +R + + R + I E + R
Sbjct: 76 EYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRNLCL 135

Query: 121 DWQRRLAECLDAAQQAGELAESADPAALAAFFWIGWEGAVLRAKLERSDQPLALFAQFFF 180
+ R+ + L +A L A G + L A+ +
Sbjct: 136 ESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDLKKEARDYV 195


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
Bcenmc03_4875	TCRTETB	38	5e-05	Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 38.3 bits (89), Expect = 5e-05
Identities = 33/182 (18%), Positives = 69/182 (37%), Gaps = 8/182 (4%)

Query: 258 STLRDAMTNWRVFVLAFVNFCGIVGSLGVGLWMPQIIKQFGVEHAVVGWLTAIPYAIGAG 317
S LR N + L ++F ++ + + + +P I F A W+ +
Sbjct: 8 SNLRH---NQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSI 64

Query: 318 AMLWWARLANRAANRIPYVAGALALAAAALCASAFMHAPVFKLI-ALCVTVSGILAFQAT 376
+ +L+++ + + G + ++ H+ LI A + +G AF A
Sbjct: 65 GTAVYGKLSDQLGIKRLLLFGIIINCFGSVIG-FVGHSFFSLLIMARFIQGAGAAAFPAL 123

Query: 377 YWAIPSGFLTGRAAAGGLALIVSVGNLGGFVGPSMIGALKQFSGG---FTAPLIAVSGVL 433
+ + ++ LI S+ +G VGP++ G + + P+I + V
Sbjct: 124 VMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVP 183

Query: 434 LL 435
L
Sbjct: 184 FL 185


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
Bcenmc03_4876	ECOLNEIPORIN	66	5e-14	E.coli/Neisseria porin superfamily signature.
		>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature.

Length = 331

Score = 65.6 bits (160), Expect = 5e-14
Identities = 77/373 (20%), Positives = 123/373 (32%), Gaps = 82/373 (21%)

Query: 35 AQSSVTLYGIADVGVEHINNTNTGGAQTRE----ASGNLSGSRWGLKGVEDLGGGMKAIF 90
A + VTLYG GVE + GAQ GS+ G KG EDLG G+KAI+
Sbjct: 17 AMADVTLYGTIKAGVETSRSVAHNGAQAASVETGTGIVDLGSKIGFKGQEDLGNGLKAIW 76

Query: 91 QLENGFNINDGTTAQSTKGLGSNAATTSRIFGRQAWVGLAYRGQQLTFGRQNALFYEQAV 150
Q+E +I + RQ+++GL +L GR N++ +
Sbjct: 77 QVEQKASIAGTDSGWGN---------------RQSFIGLKGGFGKLRVGRLNSVLKD--- 118

Query: 151 AFDPMGASSRYSVLSVDYAAAARIDN---SVKYTG-VFGPLTAQAMYSTRYDTGYGAEVP 206
D S+ L V+ A + SV+Y F L+ Y+ + G
Sbjct: 119 TGDINPWDSKSDYLGVNK--IAEPEARLISVRYDSPEFAGLSGSVQYALNDNAGRH---- 172

Query: 207 GAQLTGRFFSGALTFSQGPLAASVSYEQRNSNTVATNTGTERRATAAASYAIGPVKGFAG 266
+ + G + + V N E+ + + +G
Sbjct: 173 ----NSESYHAGFNYKNGGFFVQYGGAYKRHHQVQENVNIEK----YQIHRLV-----SG 219

Query: 267 YRYLRASNAFLPANPIRVANGAEASAANLYWAG----------AQYAVSPAFVVTATAYY 316
Y + + + + A L A A V +Y
Sbjct: 220 YD----------NDALYASVAVQQQDAKLVEENYSHNSQTEVAATLAYRFGNVTPRVSYA 269

Query: 317 Q----DVHSTSADPWL--AVLCADYLLSKRTDIYATAGFARNKGGSALGVNGYGTVAPDH 370
+T+ + V+ A+Y SKRT +AG+ + G + V
Sbjct: 270 HGFKGSFDATNYNNDYDQVVVGAEYDFSKRTSALVSAGWLQEGKGESKFV---------- 319

Query: 371 NQTGVVIGMRQKF 383
T +G+R KF
Sbjct: 320 -STAGGVGLRHKF 331


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
Bcenmc03_4878	TCRTETA	34	0.001	Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 34.0 bits (78), Expect = 0.001
Identities = 63/364 (17%), Positives = 112/364 (30%), Gaps = 36/364 (9%)

Query: 53 LPTLALQFGLN---KAQLGMFTSVTAAGQIIGGILFGFVSDRIGRVRTALLCVGIYSLFS 109
LP L + A G+ ++ A Q + G +SDR GR L+ + ++
Sbjct: 28 LPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDY 87

Query: 110 GLIAFAPDAHAFATLRFFGALGMGGTWTAGAALIAETWHPSRRGKGGALMQMGLPIGAIL 169
++A AP R + G T A IA+ R + M G +
Sbjct: 88 AIMATAPFLWVLYIGRIVAGI-TGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVA 146

Query: 170 AIAISGIVGATHGGLGGDSWRLLFLIGASPFFILFWVARKTPESPIWLERRHAKPQARKP 229
+ G+ +GG S F A+ + F ERR + +A P
Sbjct: 147 GPVLGGL-------MGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNP 199

Query: 230 DTAGHEKLNVRGLLTAFCFIFFLQYLYWGV------------FTWTPTFLITVKHLDFVH 277
+ + + A +FF+ L V F W T +
Sbjct: 200 LASFRWARGMTV-VAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTI---------- 248

Query: 278 SLKFVLALQFGAIAGFLLFSAWVDRIGRRPMFLAYLLVGALAVGVYIVSANPLLLMTAIF 337
+ ++A ++ R+G R + ++ + + + +
Sbjct: 249 GISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMV 308

Query: 338 LTGFSVNGIFAGAGPFLAEIIGNTASRGFFMGLAYNGGRLGGFIAPLIIGALASTSGGFV 397
L GI A + + +G G L + PL+ A+ + S
Sbjct: 309 LLASG--GIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAASITTW 366

Query: 398 LGLA 401
G A
Sbjct: 367 NGWA 370



Score = 32.5 bits (74), Expect = 0.003
Identities = 29/125 (23%), Positives = 48/125 (38%), Gaps = 7/125 (5%)

Query: 298 AWVDRIGRRPMFLAYLLVGALAVGVYIVSANPLLLMTAIFLTGFSVNGIFAGAGPFLAEI 357
A DR GRRP+ L L A+ + + +L + G + A AG ++A+I
Sbjct: 65 ALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGIT-GATGAVAGAYIADI 123

Query: 358 I-GNTASRGF-FMGLAYNGGRLGGFIAPLIIGALASTSGGFVLGLATTIVAFVAAAAVVL 415
G+ +R F FM + G +A ++G L A + +
Sbjct: 124 TDGDERARHFGFMSACFG----FGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCF 179

Query: 416 FAPET 420
PE+
Sbjct: 180 LLPES 184


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
Bcenmc03_4879	TCRTETA	31	0.012	Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 30.6 bits (69), Expect = 0.012
Identities = 58/311 (18%), Positives = 97/311 (31%), Gaps = 16/311 (5%)

Query: 68 QLIVAPVAGSRIGSANARRVTLCVLTTSLLLTGSLFTLSLLGLLGVKLILAHALAI-GVS 126
Q APV G + RR L V L G+ +++ +L + G++
Sbjct: 56 QFACAPVLG-ALSDRFGRRPVLLVS-----LAGAAVDYAIMATAPFLWVLYIGRIVAGIT 109

Query: 127 SAVETPARQVLLLTSLQDATHTSNAVAMNTMVYNVGRMVGPTIAGFVYPTLGPRTSFAIY 186
A T A + + D + + + G + GP + G + P F
Sbjct: 110 GA--TGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLM-GGFSPHAPFFAA 166

Query: 187 AL--ALCFMAACVRSIRTATVDRPSRAESGLRDAVGYVLSDAFS--ARYLPILACIGLFA 242
A L F+ C + +R L + + + A + + + L
Sbjct: 167 AALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVG 226

Query: 243 GSYQTLVPLLADQGFHDAARFTGVFFACAGAGSLSAAVLLSSAFGPRAS-RRFIAYAPWT 301
L + + FH A G+ A G A +++ R RR +
Sbjct: 227 QVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIA 286

Query: 302 AVGALAVLAATTDAAGSIPAFYALGFSLTFAATSTNATIQRQCPEHVRGGLVGMYGMAYN 361
+LA T + P L + A + RQ E +G L G +
Sbjct: 287 DGTGYILLAFATRGWMAFPIMVLLASG-GIGMPALQAMLSRQVDEERQGQLQGSLAALTS 345

Query: 362 GTMPFGYLLVG 372
T G LL
Sbjct: 346 LTSIVGPLLFT 356


63	Bcenmc03_5002	Bcenmc03_5009	N	Y	N	Pathogenicity Island (unbiased-composition)
LocusTag	DNBias	CDNBias	%GCBias	Product
Bcenmc03_5002-120-4.599273hemolysin-type calcium-binding protein
Bcenmc03_5003-115-1.910508type I secretion system ATPase
Bcenmc03_5004-112-0.959732HlyD family type I secretion membrane fusion
Bcenmc03_5005-39-0.061331outer membrane efflux protein
Bcenmc03_5006-1110.656235MscS mechanosensitive ion channel
Bcenmc03_5007-3121.093662major facilitator transporter
Bcenmc03_5008-2110.999418hypothetical protein
Bcenmc03_5009-2141.320253porin
ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
Bcenmc03_5002	RTXTOXINA	128	8e-32	Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 128 bits (323), Expect = 8e-32
Identities = 83/295 (28%), Positives = 117/295 (39%), Gaps = 28/295 (9%)

Query: 711 IMYGGAGNDRMWGGVGHDYMDGGDGADFVSGGDGND-TVFGGAGNDELHGDAGNDRLLGE 769
+ G G+D+++ G + G G D V + + G+ R+LG
Sbjct: 613 ESHLGDGDDKVFLSAGSANIYAGKGHDVVYYDKTDTGYLTIDGTKATEAGNYTVTRVLGG 672

Query: 770 AGNDRIFGEAGDDVLWGGDGDDV------LVGFTASN-DAKQTLSWGESDNDMLYGGNGN 822
+V G + N L E L G
Sbjct: 673 DVKVLQEVVKEQEVSVGKRTEKTQYRSYEFTHINGKNLTETDNLYSVEE----LIGTTRA 728

Query: 823 DALYGGLGNDYLDGGNDNDFLDGGDGDDRLFGGAGDDELNGGNGHDALSGETGNDKIFGG 882
D +G D G + +D ++G DG+DRL+G G+D L+GGNG D L G GNDK+ G
Sbjct: 729 DKFFGSKFTDIFHGADGDDLIEGNDGNDRLYGDKGNDTLSGGNGDDQLYGGDGNDKLIGV 788

Query: 883 AGNDTIWGGDGDDILVGFTASNDAKQSLAAWETDDDTIYGGAGNDLILGGLGNDLLHGEA 942
AGN+ + GGDGDD S +D +YG G DL+ GG G+DLL G
Sbjct: 789 AGNNYLNGGDGDDEFQVQGNSLAKNVLFGG--KGNDKLYGSEGADLLDGGEGDDLLKGGY 846

Query: 943 GNDE------------IQGGDGHDKLYGGDGN--DRLFGQVGNDILYGGAGDDLL 983
GND G DKL D + D F + GND++ ++L
Sbjct: 847 GNDIYRYLSGYGHHIIDDDGGKEDKLSLADIDFRDVAFKREGNDLIMYKGEGNVL 901



Score = 127 bits (320), Expect = 2e-31
Identities = 68/191 (35%), Positives = 92/191 (48%), Gaps = 7/191 (3%)

Query: 609 LAGDGNDMMGGSSRNDNLWGGTGNDTLFGYDGDDRLYGEEGDDELNGGAGNDVLDGGIGN 668
+ D GS D G G+D + G DG+DRLYG++G+D L+GG G+D L GG GN
Sbjct: 723 IGTTRADKFFGSKFTDIFHGADGDDLIEGNDGNDRLYGDKGNDTLSGGNGDDQLYGGDGN 782

Query: 669 DKLFGHVGNDIMNGGDGDDIMLGFTASNDSKQTLAWGETDDDIMYGGAGNDRMWGGVGHD 728
DKL G GN+ +NGGDGDD S G +D +YG G D + GG G D
Sbjct: 783 DKLIGVAGNNYLNGGDGDDEFQVQGNSLAKNVLF--GGKGNDKLYGSEGADLLDGGEGDD 840

Query: 729 YMDGGDGADFV--SGGDGNDTVF-GGAGNDELH-GDAGNDRLLGE-AGNDRIFGEAGDDV 783
+ GG G D G G+ + G D+L D + + GND I + +V
Sbjct: 841 LLKGGYGNDIYRYLSGYGHHIIDDDGGKEDKLSLADIDFRDVAFKREGNDLIMYKGEGNV 900

Query: 784 LWGGDGDDVLV 794
L G + +
Sbjct: 901 LSIGHKNGITF 911



Score = 125 bits (315), Expect = 6e-31
Identities = 86/288 (29%), Positives = 117/288 (40%), Gaps = 48/288 (16%)

Query: 756 ELHGDAGNDRLLGEAGNDRIFGEAGDDVLWGGDGDDVLVGFTASNDAKQTLSWGESDNDM 815
E H G+D++ AG+ I+ G DV++ D + + + G
Sbjct: 613 ESHLGDGDDKVFLSAGSANIYAGKGHDVVYYDKTDTGYLTIDGTKATEA----GNYTVTR 668

Query: 816 LYGGNGNDALYGGLGNDYLDGGNDNDFLDGGDGDDRLFGGAGDDELNGGNGHDALSGETG 875
+ GG+ L + + G + + G E + + L G T
Sbjct: 669 VLGGDVK-VLQEVVKEQEVSVGKRTEKTQYRSYEFTHINGKNLTETDNLYSVEELIGTTR 727

Query: 876 NDKIFGGAGNDTIWGGDGDDILVGFTASNDAKQSLAAWETDDDTIYGGAGNDLILGGLGN 935
DK FG D G DGDD LI G GN
Sbjct: 728 ADKFFGSKFTDIFHGADGDD--------------------------------LIEGNDGN 755

Query: 936 DLLHGEAGNDEIQGGDGHDKLYGGDGNDRLFGQVGNDILYGGAGDDLLVGFTGDNEAKRT 995
D L+G+ GND + GG+G D+LYGGDGND+L G GN+ L GG GDD
Sbjct: 756 DRLYGDKGNDTLSGGNGDDQLYGGDGNDKLIGVAGNNYLNGGDGDDEF-----------Q 804

Query: 996 LGPGETDDDYLYGGEGNDTLLGGLGDDYLDGGAGADHMEGGEGNDTYI 1043
+ + L+GG+GND L G G D LDGG G D ++GG GND Y
Sbjct: 805 VQGNSLAKNVLFGGKGNDKLYGSEGADLLDGGEGDDLLKGGYGNDIYR 852



Score = 124 bits (313), Expect = 9e-31
Identities = 91/313 (29%), Positives = 133/313 (42%), Gaps = 42/313 (13%)

Query: 639 DGDDRLYGEEGDDELNGGAGNDVLDGGIGNDKLFGHVGNDIMNGGDGDDIMLGFTASNDS 698
DGDD+++ G + G G+DV+ + G++ D + + + D
Sbjct: 618 DGDDKVFLSAGSANIYAGKGHDVVYYDKTD---TGYLTIDGTKATEAGNYTVTRVLGGDV 674

Query: 699 K--------QTLAWGETDDDIMYGGAGNDRMWGGVGHDYMDGGDGADFVSGGDGNDTVFG 750
K Q ++ G+ + Y + G D + + G D FG
Sbjct: 675 KVLQEVVKEQEVSVGKRTEKTQYRSYEFTHI-NGKNLTETDNLYSVEELIGTTRADKFFG 733

Query: 751 GAGNDELHGDAGNDRLLGEAGNDRIFGEAGDDVLWGGDGDDVLVGFTASNDAKQTLSWGE 810
D HG G+D + G GNDR++G+ G+D L GG+GD
Sbjct: 734 SKFTDIFHGADGDDLIEGNDGNDRLYGDKGNDTLSGGNGD-------------------- 773

Query: 811 SDNDMLYGGNGNDALYGGLGNDYLDGGNDND---FLDGGDGDDRLFGGAGDDELNGGNGH 867
D LYGG+GND L G GN+YL+GG+ +D + LFGG G+D+L G G
Sbjct: 774 ---DQLYGGDGNDKLIGVAGNNYLNGGDGDDEFQVQGNSLAKNVLFGGKGNDKLYGSEGA 830

Query: 868 DALSGETGNDKIFGGAGNDT--IWGGDGDDILVGFTASNDAKQSLAAWETDDDTIYGGAG 925
D L G G+D + GG GND G G I+ D K SLA + D + G
Sbjct: 831 DLLDGGEGDDLLKGGYGNDIYRYLSGYGHHIIDDDGGKED-KLSLADIDFRDVA-FKREG 888

Query: 926 NDLILGGLGNDLL 938
NDLI+ ++L
Sbjct: 889 NDLIMYKGEGNVL 901



Score = 108 bits (271), Expect = 9e-26
Identities = 63/215 (29%), Positives = 97/215 (45%), Gaps = 29/215 (13%)

Query: 635 LFGYDGDDRLYGEEGDDELNGGAGNDVLDGGIGNDKLFGHVGNDIMNGGDGDDIMLGFTA 694
L G D+ +G + D +G G+D+++G GND+L+G GND ++GG+G
Sbjct: 722 LIGTTRADKFFGSKFTDIFHGADGDDLIEGNDGNDRLYGDKGNDTLSGGNG--------- 772

Query: 695 SNDSKQTLAWGETDDDIMYGGAGNDRMWGGVGHDYMDGGDGADFVSGGDGNDTVFGGAGN 754
DD +YGG GND++ G G++Y++GGDG D
Sbjct: 773 --------------DDQLYGGDGNDKLIGVAGNNYLNGGDGDDEFQVQG------NSLAK 812

Query: 755 DELHGDAGNDRLLGEAGNDRIFGEAGDDVLWGGDGDDVLVGFTASNDAKQTLSWGESDND 814
+ L G GND+L G G D + G GDD+L GG G+D+ + G+ D
Sbjct: 813 NVLFGGKGNDKLYGSEGADLLDGGEGDDLLKGGYGNDIYRYLSGYGHHIIDDDGGKEDKL 872

Query: 815 MLYGGNGNDALYGGLGNDYLDGGNDNDFLDGGDGD 849
L + D + GND + + + L G +
Sbjct: 873 SLADIDFRDVAFKREGNDLIMYKGEGNVLSIGHKN 907



Score = 85.0 bits (210), Expect = 1e-18
Identities = 43/136 (31%), Positives = 69/136 (50%), Gaps = 16/136 (11%)

Query: 1377 GNALDNTIIGNRGNNVLDGGAGNDILIGGLGNDTYRFGRGSGCDTIRDDDETLGNSDVIS 1436
G ++ + G+ G ++LDGG G+D+L GG GND YR+ G G I DD G D +S
Sbjct: 817 GGKGNDKLYGSEGADLLDGGEGDDLLKGGYGNDIYRYLSGYGHHIIDDDG---GKEDKLS 873

Query: 1437 IGAGVSADQLWFRHVGNDL-------EISILGTGDTATVRDWYL-----GSRYQIEQIRV 1484
+ A + + F+ GNDL + +G + T R+W+ S ++IEQI
Sbjct: 874 L-ADIDFRDVAFKREGNDLIMYKGEGNVLSIGHKNGITFRNWFEKESGDISNHEIEQIFD 932

Query: 1485 DDGRTLVNADVEKLVQ 1500
GR + ++K ++
Sbjct: 933 KSGRIITPDSLKKALE 948



Score = 53.0 bits (127), Expect = 9e-09
Identities = 57/180 (31%), Positives = 76/180 (42%), Gaps = 36/180 (20%)

Query: 550 NDKIYWINSRDFIMFGPTQIKVSPSNRSYLIGTDGNDVFDANYYAAYGHWIDSNLLVNFL 609
ND++Y D + G + L G DGND G+ N L
Sbjct: 755 NDRLYGDKGNDTLSGG--------NGDDQLYGGDGNDKL----IGVAGN----NYLN--- 795

Query: 610 AGDGNDMM---GGSSRNDNLWGGTGNDTLFGYDGDDRLYGEEGDDELNGGAGND--VLDG 664
GDG+D G S + L+GG GND L+G +G D L G EGDD L GG GND
Sbjct: 796 GGDGDDEFQVQGNSLAKNVLFGGKGNDKLYGSEGADLLDGGEGDDLLKGGYGNDIYRYLS 855

Query: 665 GIGNDKLFGHVGNDIMNGGDGDDIMLGFTASNDSKQTLAWGETDDD-IMYGGAGNDRMWG 723
G G+ + GG D + L ++ + +A+ +D IMY G GN G
Sbjct: 856 GYGHHIIDDD-------GGKEDKLSL----ADIDFRDVAFKREGNDLIMYKGEGNVLSIG 904



Score = 36.5 bits (84), Expect = 0.001
Identities = 24/84 (28%), Positives = 36/84 (42%), Gaps = 8/84 (9%)

Query: 1381 DNTIIGNRGNNVLDGGAGNDILIGGLGNDTYRFGRGSGCDTIRDDDETLGNSDVISIGAG 1440
D+ I GN GN+ L G GND L GG G+D G G +D+ +G + + G
Sbjct: 746 DDLIEGNDGNDRLYGDKGNDTLSGGNGDDQLYGGDG--------NDKLIGVAGNNYLNGG 797

Query: 1441 VSADQLWFRHVGNDLEISILGTGD 1464
D+ + + G G+
Sbjct: 798 DGDDEFQVQGNSLAKNVLFGGKGN 821



Score = 31.5 bits (71), Expect = 0.036
Identities = 15/40 (37%), Positives = 22/40 (55%)

Query: 1377 GNALDNTIIGNRGNNVLDGGAGNDILIGGLGNDTYRFGRG 1416
G+ + G G+++++G GND L G GNDT G G
Sbjct: 733 GSKFTDIFHGADGDDLIEGNDGNDRLYGDKGNDTLSGGNG 772


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
Bcenmc03_5004	RTXTOXIND	312	e-104	Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 312 bits (801), Expect = e-104
Identities = 141/472 (29%), Positives = 233/472 (49%), Gaps = 33/472 (6%)

Query: 9 AWHELVRRYRDVWRHCWKRRHWMTLPAFDANEAEFLPAALSVQAAPVSPAGRWVARILTL 68
+ E + RY+ VW WK R + P + +E EFLPA L + PVS R VA +
Sbjct: 7 GFSEFLLRYKLVWSETWKIRKQLDTPVREKDENEFLPAHLELIETPVSRRPRLVAYFIMG 66

Query: 69 LVTTAILWSYFGKIDIVVDGGGKIIPSVRTKTLAAVEVASVRALHVRDGQIVKTGDALID 128
+ A + S G+++IV GK+ S R+K + +E + V+ + V++G+ V+ GD L+
Sbjct: 67 FLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLK 126

Query: 129 LDTRVIDAERQRANGDRLNAVLQVERARALIDAIDTGR--------PPRLADVEGVSPER 180
L +A+ + L A L+ R + L +I+ + P +V R
Sbjct: 127 LTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLR 186

Query: 181 RREVERHLLDQWRD-----------FVARRDRLDSEVHRYGQAAPLAARRADDYARLMRT 229
+ + W++ A R + + ++RY + + R DD++ L+
Sbjct: 187 LTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHK 246

Query: 230 GDVAEHAWIEAEQQRIDMAGQLADARHQRAAL--------------VAETRRNIQDALND 275
+A+HA +E E + ++ +L + Q + + I D L
Sbjct: 247 QAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQ 306

Query: 276 AQRIVDASSGDMRRAEAHGELLRLTAPIDGTVQQLAVHTIGTAVPAAQPLMQIVPREGAV 335
+ + ++ + E + + AP+ VQQL VHT G V A+ LM IVP + +
Sbjct: 307 TTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTL 366

Query: 336 EMEAFVDNRDIGFVKEGQVASVKIDAFEYTKYGTVSAIVSHVSRDAIDDDKKGLVYSVRI 395
E+ A V N+DIGF+ GQ A +K++AF YT+YG + V +++ DAI+D + GLV++V I
Sbjct: 367 EVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNVII 426

Query: 396 MLDRSALVVDGREIALSPGMSGSAEIRTGTRRVIEYVLSPLLQHARESLRER 447
++ + L + I LS GM+ +AEI+TG R VI Y+LSPL + ESLRER
Sbjct: 427 SIEENCLSTGNKNIPLSSGMAVTAEIKTGMRSVISYLLSPLEESVTESLRER 478


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
Bcenmc03_5006	IGASERPTASE	34	0.003	IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 33.9 bits (77), Expect = 0.003
Identities = 18/122 (14%), Positives = 43/122 (35%), Gaps = 8/122 (6%)

Query: 33 AASAPVASGVGAAAPAISLPDAIAQLKQMQAELDRIKQQTSTASNSKELDALDDSAQEL- 91
A + A ++ + Q+ + + QT+ + ++ + + E
Sbjct: 1058 ATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETE 1117

Query: 92 -STDVAKLQSDLTPQRAQVQAQLDVLGPAPAEGAAPEAPAV-AKQRAALDARKTQIDAAL 149
+ +V K+ S ++P++ Q + AE A P V K+ + +
Sbjct: 1118 KTQEVPKVTSQVSPKQEQSETVQ-----PQAEPARENDPTVNIKEPQSQTNTTADTEQPA 1172

Query: 150 KQ 151
K+
Sbjct: 1173 KE 1174


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
Bcenmc03_5007	TCRTETA	45	3e-07	Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 45.2 bits (107), Expect = 3e-07
Identities = 42/143 (29%), Positives = 55/143 (38%), Gaps = 9/143 (6%)

Query: 59 VAPGLVKSGILSATTHGLFGTTGVASFIAALFSGLFIGTIACGFLADRFGRRAIFTWSLL 118
V PGL++ + S +G +A F G L+DRFGRR + SL
Sbjct: 27 VLPGLLRDLVHSNDVTAHYGI-----LLALYALMQFACAPVLGALSDRFGRRPVLLVSLA 81

Query: 119 WYTAANVVMAFQDTATGLNFWRFVVGLGLGVEMVTIGTYISELVPKQIRGRAFACEQA-- 176
+MA L R V G+ G G YI+++ R R F A
Sbjct: 82 GAAVDYAIMATAPFLWVLYIGRIVAGIT-GATGAVAGAYIADITDGDERARHFGFMSACF 140

Query: 177 -VGFVAVPVVAFLAYLLVPHAPL 198
G VA PV+ L PHAP
Sbjct: 141 GFGMVAGPVLGGLMGGFSPHAPF 163


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
Bcenmc03_5009	ECOLNEIPORIN	91	8e-23	E.coli/Neisseria porin superfamily signature.
		>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature.

Length = 331

Score = 91.4 bits (227), Expect = 8e-23
Identities = 72/323 (22%), Positives = 119/323 (36%), Gaps = 47/323 (14%)

Query: 20 AGAQSSITLYGLISAGVGFATNQGGKNAWQALSGT-----NQNPRWGLKGKEDLGQGLSA 74
A + +TLYG I AGV + + A A T + + G KG+EDLG GL A
Sbjct: 15 VAAMADVTLYGTIKAGVETSRSVAHNGAQAASVETGTGIVDLGSKIGFKGQEDLGNGLKA 74

Query: 75 VFQLENGFNVMTGTAAQNGREFGRMAYVGLADRTYGALTFGRQYDAIHDYIGPVIIASNG 134
++Q+E ++ A + R +++GL +G L GR + D S
Sbjct: 75 IWQVEQKASI----AGTDSGWGNRQSFIGLK-GGFGKLRVGRLNSVLKDTGDINPWDSKS 129

Query: 135 VNIGDNDNGYNDIRVQNSVKYVSPVHYGLKFTALYGFSNTTGFANNSAYSFGLGYERGPL 194
+G N + R+ SV+Y SP GL + Y ++ G N+ +Y G Y+ G
Sbjct: 130 DYLGVNKIAEPEARLI-SVRYDSPEFAGLSGSVQYALNDNAGRHNSESYHAGFNYKNGGF 188

Query: 195 RWSVVYAQYNHPYSATNQDGAIANDYASPLLIFSKSAMSPAAYASRQRIAGTGGFYTIGR 254
A H N + + Y
Sbjct: 189 FVQYGGAYKRHHQVQENVNIEKYQIHR------------------------LVSGYDND- 223

Query: 255 AQFAAMFTDVR-YDYLDNSHLHLQNLGVNVVY-----TMTPQLFLGAAYAFTNGK-YDVI 307
A +A++ + ++ ++ H V +TP++ +YA +D
Sbjct: 224 ALYASVAVQQQDAKLVEENYSHNSQTEVAATLAYRFGNVTPRV----SYAHGFKGSFDAT 279

Query: 308 DKRPKWHQVNLQADYFLSKRTDV 330
+ + QV + A+Y SKRT
Sbjct: 280 NYNNDYDQVVVGAEYDFSKRTSA 302


64	Bcenmc03_5088	Bcenmc03_5092	N	Y	N	Pathogenicity Island (unbiased-composition)
LocusTag	DNBias	CDNBias	%GCBias	Product
Bcenmc03_5088453-12.100227response regulator receiver protein
Bcenmc03_5089448-10.678496PAS/PAC sensor signal transduction histidine
Bcenmc03_5090647-10.414976two component LuxR family transcriptional
Bcenmc03_5091647-10.227333major facilitator transporter
Bcenmc03_5092543-8.911491pyridine nucleotide-disulfide oxidoreductase
ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
Bcenmc03_5088	HTHFIS	76	2e-19	FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 76.0 bits (187), Expect = 2e-19
Identities = 21/101 (20%), Positives = 43/101 (42%)

Query: 9 IIDDDQSVRRATGSLVRSLGWEVRTYESGEEFLSAERIADVACIISDVQMPGISGLEMYE 68
+ DDD ++R + G++VR + D +++DV MP + ++
Sbjct: 8 VADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDLLP 67

Query: 69 MLLERGVAPPVIFITSFPSEATHRQAMKLGAICVFSKPVDP 109
+ + PV+ +++ + T +A + GA KP D
Sbjct: 68 RIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDL 108


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
Bcenmc03_5090	HTHFIS	100	1e-26	FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 99.5 bits (248), Expect = 1e-26
Identities = 32/143 (22%), Positives = 62/143 (43%)

Query: 16 VVDDDDSMRSALGMLLRSVGLRVELFSSAQEFLAFDKPDVSSCLILDVRLKGQSGLVLQE 75
V DDD ++R+ L L G V + S+A + ++ DV + ++ L
Sbjct: 8 VADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDLLP 67

Query: 76 QIVAGDMGLPIIFITAHGDVAMSVKAMKNGALDFLSKPFRDQEMLDAVEGALLKHEARRR 135
+I LP++ ++A ++KA + GA D+L KPF E++ + AL + + R
Sbjct: 68 RIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRRPS 127

Query: 136 TDGRVAEVRRRYESLTPREREVM 158
++ + +E+
Sbjct: 128 KLEDDSQDGMPLVGRSAAMQEIY 150


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
Bcenmc03_5091	TCRTETB	38	5e-05	Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 38.3 bits (89), Expect = 5e-05
Identities = 28/156 (17%), Positives = 62/156 (39%), Gaps = 1/156 (0%)

Query: 252 IFSSKIFWIFGIIYFLDVFGIYGYTLWAPTIIKSLGVERNSLIGLLAALPNAVAVIVM-I 310
+ + F I + + + G+ P ++K + + IG + P ++VI+
Sbjct: 252 LGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGY 311

Query: 311 IAGRKADSRRERRLLVAALFLMAAAGLTLALVWHGTLWLSIAALCIANAGLLSIPPIFWG 370
I G D R +L + ++ + LT + + T W + GL +
Sbjct: 312 IGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVIST 371

Query: 371 MPTAVLSPRNAASGIAWISAIGNIGGFFGPYVVGLL 406
+ ++ L + A +G++ ++ + G +VG L
Sbjct: 372 IVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGL 407



Score = 35.2 bits (81), Expect = 5e-04
Identities = 32/161 (19%), Positives = 60/161 (37%), Gaps = 2/161 (1%)

Query: 246 SINQNNIFSSKIFWIFGIIYFLDVFGIYGYTLWAPTIIKSLGVERNSLIGLLAALPNAVA 305
S +Q+N+ ++I I+ F V + P I S + A +
Sbjct: 4 SYSQSNLRHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFS 63

Query: 306 VIVMIIAGRKADSRRERRLLVAALFLMAAAGLTLALVWHGTLWLSIAALCIANAGLLSIP 365
+ + G+ +D +RLL+ + + + + V H L I A I AG + P
Sbjct: 64 IGTAVY-GKLSDQLGIKRLLLFGIIINCFGSV-IGFVGHSFFSLLIMARFIQGAGAAAFP 121

Query: 366 PIFWGMPTAVLSPRNAASGIAWISAIGNIGGFFGPYVVGLL 406
+ + + N I +I +G GP + G++
Sbjct: 122 ALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMI 162


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5092  GPOSANCHOR    33     0.003    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 32.7 bits (74), Expect = 0.003
Identities = 16/38 (42%), Positives = 26/38 (68%)

Query: 200 IHDAPRVAMREDEDVSREIQQALEADGIKLELQSRIAN 237
+ +A R ++R D D SRE ++ LEA+ KLE Q++I+
Sbjct: 306 VLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISE 343


65  Bcenmc03_5215  Bcenmc03_5221  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias    %GCBias  Product
Bcenmc03_5215       1       11   0.353196  porin
Bcenmc03_5216       1       10   0.981629  major facilitator transporter
Bcenmc03_5217       2       12   0.051660  hypothetical protein
Bcenmc03_5218       1       12   0.516887  leucyl aminopeptidase
Bcenmc03_5219       2       13   0.855633  peptidase
Bcenmc03_5220       1        9   0.555674  AraC family transcriptional regulator
Bcenmc03_5221       0        9   0.357501  AraC family transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5215  ECOLNEIPORIN  88     1e-21    E.coli/Neisseria porin superfamily signature.
		>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature.

Length = 331

Score = 88.3 bits (219), Expect = 1e-21
Identities = 74/369 (20%), Positives = 125/369 (33%), Gaps = 67/369 (18%)

Query: 24 AQSSVTLYGILDAGITYVNNTGGSHVVKFDDGVA-----YGNRFGLKGTEDLGGGLKAVF 78
A + VTLYG + AG+ + + G++ G KG EDLG GLKA++
Sbjct: 17 AMADVTLYGTIKAGVETSRSVAHNGAQAASVETGTGIVDLGSKIGFKGQEDLGNGLKAIW 76

Query: 79 TLESGFHLGNGQLGFGGAEFGRQAYVGLQNDWGTLSFGNQLDITNELVSIYNISAWGSGY 138
+E G RQ+++GL+ +G L G + L +I+ W S
Sbjct: 77 QVEQ----KASIAGTDSGWGNRQSFIGLKGGFGKLRVGRLNSV---LKDTGDINPWDSKS 129

Query: 139 AIHQGDFDRFNGDRLPNSVKFLSNDLSGFKFGAMYSFGNVAGNFHRNSAWSAGASFTKGD 198
+ RL SV++ S + +G Y+ + AG H + ++ AG ++ G
Sbjct: 130 DYLGVNKIAEPEARL-ISVRYDSPEFAGLSGSVQYALNDNAG-RHNSESYHAGFNYKNGG 187

Query: 199 FSIGAAYTRLNNPNGIYAFDPYAMIGTHTFLGQQTVTVDPATGARTDLFANTPMDVDSQG 258
F + + Q+ V ++ R
Sbjct: 188 FFVQYGGAYKRHHQ-----------------VQENVNIEKYQIHRL------------VS 218

Query: 259 TFGIGTSYTIGKLTLDANYSYTTIKGFGQSSHMQVYEGGGLYQF-----TPALSFIAGYQ 313
+ Y + SH E + TP +S+ G++
Sbjct: 219 GYDNDALYASVAVQQQDAKLVE-----ENYSHNSQTEVAATLAYRFGNVTPRVSYAHGFK 273

Query: 314 HTRF---EGHHWNQGTAGLHYLLSKRTDIYISGDYLRASQGVDAVVGYSFTPSTTQTQAD 370
+ + ++Q G Y SKRT +S +L+ +G T
Sbjct: 274 GSFDATNYNNDYDQVVVGAEYDFSKRTSALVSAGWLQEGKGES---------KFVSTAGG 324

Query: 371 VRIGMRHSF 379
V G+RH F
Sbjct: 325 V--GLRHKF 331


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5216  TCRTETA       32     0.005    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 31.7 bits (72), Expect = 0.005
Identities = 47/270 (17%), Positives = 95/270 (35%), Gaps = 14/270 (5%)

Query: 70 MFALTLVVGQVADRYDRRRIATICQSVEALAAGVFLLGAVQGWLAAPAVYAL--AAIVGT 127
FA V+G ++DR+ RR + + S+ A ++ AP ++ L IV
Sbjct: 56 QFACAPVLGALSDRFGRRPV--LLVSLAGAAVDYAIMAT------APFLWVLYIGRIVAG 107

Query: 128 ARAFESPSVSSLLPAVVPRTDLPRATALSTSANQAAQILGPAFGGLLYGVGAPVAFGTSV 187
+ + + + R ++ + GP GGL+ G F +
Sbjct: 108 ITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAA 167

Query: 188 AAFAVAAMLSGTIPLRSAPPAREPVTLRSV--FSGIAFIRREPAILGALSLDLFAVLFGG 245
A + + + S R P+ ++ + + R + +++ L G
Sbjct: 168 ALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQ 227

Query: 246 A-TALLPIYARDILQVGPWGLG-ALRAAPAVGALAGTLWLTRFPLKGRPGRAMFGGVIAF 303
AL I+ D +G +L A + +LA + + RA+ G+IA
Sbjct: 228 VPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIAD 287

Query: 304 GIATIVFGLSRHFALSLVALAALGASDVIS 333
G I+ + ++ + L + +
Sbjct: 288 GTGYILLAFATRGWMAFPIMVLLASGGIGM 317


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5219  THERMOLYSIN   63     1e-12    Thermolysin metalloprotease (M4) family signature.
		>THERMOLYSIN#Thermolysin metalloprotease (M4) family signature.

Length = 544

Score = 63.5 bits (154), Expect = 1e-12
Identities = 89/488 (18%), Positives = 167/488 (34%), Gaps = 65/488 (13%)

Query: 109 VVTSERNDADFTVVRLQQQAAGLPVYGSDIAVTVAKDGRILYVASNTIGGVVA-TTRKSQ 167
++ ++ ++ TV+R +Q A G+ + V DG + ++ I + T +
Sbjct: 78 LIGNKLDELGHTVMRFEQAIAASLCMGAVLVAHV-NDGELSSLSGTLIPNLDKRTLKTEA 136

Query: 168 AVDQQQALDRARAYLGVSGFTHL-------DAQLVAFVDQAGTHTAWKVRGRPQDGPKGD 220
A+ QQA A+ + +LV + D+ A++V R G+
Sbjct: 137 AISIQQAEMIAKQDVADRVTKERPAAEEGKPTRLVIYPDEETPRLAYEVNVRFLTPVPGN 196

Query: 221 WELLIDSGSGEVLRAEDKAFYA-TDGTGFVFRPDPLSPTKSSYGSTGYKDSSDADSTQLT 279
W +ID+ G+VL ++ A G V + + G Y + T +
Sbjct: 197 WIYMIDAADGKVLNKWNQMDEAKPGGAQPVAGTSTVGVGRGVLGDQKYIN------TTYS 250

Query: 280 AARVRVTLKELAQSGTRYTLTGPYAACVDFDAPLDKACP----SQSTPAFEFTRGNLYFE 335
+ L++ + +T +D P + F +
Sbjct: 251 SYYGYYYLQDNTRGSGIFT----------YDGRNRTVLPGSLWADGDNQFF---ASYDAA 297

Query: 336 AVNVYYH---IDTFLRYVNQTLGIKALPYQYTGGVQYDPHGESGDDNSSYSSSSGRLTFG 392
AV+ +Y+ + + + V+ L V Y G +N+ ++ S + +G
Sbjct: 298 AVDAHYYAGVVYDYYKNVHGRLSYDGSNAAIRSTVHYG----RGYNNAFWNGSQ--MVYG 351

Query: 393 QGGVDD----AEDADVVIHELGHGIHDWVTNGGLSQQEG-LSEGTGD---YLAAAYSRDF 444
G + DVV HEL H + D+ + G ++E D L Y+
Sbjct: 352 DGDGQTFLPFSGGIDVVGHELTHAVTDYTAGLVYQNESGAINEAMSDIFGTLVEFYANRN 411

Query: 445 NQWSPSDAQYHWVYNWDG----HNEFWGGRVTNWNVGRTYAQARGAEIHTAGQY------ 494
W + Y D + G +++ T Q G +G
Sbjct: 412 PDWEIGEDIYTPGVAGDALRSMSDPAKYGDPDHYSKRYTGTQDNGGVHTNSGIINKAAYL 471

Query: 495 ---WASCNLVARDAIGAQAMDKAFLKGLSM-TNSSTNQKAAAQAVLTAASALGYS-SAQL 549
V+ IG M K F + L ++N A + AA+ L S S ++
Sbjct: 472 LSQGGVHYGVSVTGIGRDKMGKIFYRALVYYLTPTSNFSQLRAACVQAAADLYGSTSQEV 531

Query: 550 TAIGNAYN 557
++ A+N
Sbjct: 532 NSVKQAFN 539


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5221  TYPE4SSCAGA   31     0.009    Type IV secretion system CagA exotoxin signature.
		>TYPE4SSCAGA#Type IV secretion system CagA exotoxin signature.

Length = 1147

Score = 30.8 bits (69), Expect = 0.009
Identities = 15/48 (31%), Positives = 25/48 (52%), Gaps = 1/48 (2%)

Query: 96 GGIPFELHSP-DDMSLIGVVVEPELMQQIEDAADVRLDARALRHGVVE 142
G P + H DD+S +G+ EL Q+I++ +A+A G +E
Sbjct: 940 AGFPLKRHDKVDDLSKVGLSRNQELAQKIDNLNQAVSEAKAGFFGNLE 987


66  Bcenmc03_5347  Bcenmc03_5366  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias    %GCBias  Product
Bcenmc03_5347      -3        9  -0.342481  short-chain dehydrogenase/reductase SDR
Bcenmc03_5348      -3       12   0.740005  glutaminase
Bcenmc03_5349       1       11   2.383264  two component transcriptional regulator
Bcenmc03_5350       1       12   2.795190  histidine kinase
Bcenmc03_5351       1       12   2.987775  patatin
Bcenmc03_5352       2       12   3.179419  rod shape-determining protein MreB
Bcenmc03_5353       3       12   3.102334  hypothetical protein
Bcenmc03_5354       3       12   3.080533  outer membrane autotransporter
Bcenmc03_5355      -1       10   2.262710  Amylo-alpha-16-glucosidase
Bcenmc03_5356      -2       11   0.280062  hypothetical protein
Bcenmc03_5357      -2       11   1.630253  LysR family transcriptional regulator
Bcenmc03_5358      -2       12   1.586538  Beta-lactamase
Bcenmc03_5359      -1       16   0.738167  hypothetical protein
Bcenmc03_5360      -1       14   1.044631  antibiotic biosynthesis monooxygenase
Bcenmc03_5361      -1       14   1.516884  MarR family transcriptional regulator
Bcenmc03_5362      -1        9   2.831242  histidine kinase
Bcenmc03_5363       0       11   2.741433  two component transcriptional regulator
Bcenmc03_5364       0       13   2.683683  hypothetical protein
Bcenmc03_5365       1       11   3.479068  RpiR family transcriptional regulator
Bcenmc03_5366       1       13   3.565091  GCN5-related N-acetyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5347  DHBDHDRGNASE  63     1e-13    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 63.1 bits (153), Expect = 1e-13
Identities = 51/207 (24%), Positives = 88/207 (42%), Gaps = 19/207 (9%)

Query: 9 GRRIVITGANSGTGKEATRRLVAAGADVIMAVRSESKGDAARRDIRKEFPGTSIEVRTLD 68
G+ ITGA G G+ R L + GA + + K + ++ E E D
Sbjct: 8 GKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAE--ARHAEAFPAD 65

Query: 69 LSSLASVRNFGRQLLEEGRPLDVLVNNAGIMMP-PTRVLSSDGFELQLATNFLGHFALTN 127
+ A++ ++ E P+D+LVN AG++ P LS + +E + N G F +
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 128 LLLPLLLEAKSPRVATMTSSAAMGATINFDDLQGERSYKPMTAYAQSKLACLLLANRLA- 186
+ +++ +S + T+ S+ A + M AYA SK A ++ L
Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTS------------MAAYASSKAAAVMFTKCLGL 173

Query: 187 EIARERGWPLLSTSAHPGHTRTNLQTS 213
E+A + + PG T T++Q S
Sbjct: 174 ELA---EYNIRCNIVSPGSTETDMQWS 197


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5349  HTHFIS        70     4e-16    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 69.9 bits (171), Expect = 4e-16
Identities = 29/124 (23%), Positives = 59/124 (47%), Gaps = 1/124 (0%)

Query: 4 RILLVEDDTRLSTLIAGYLRKNDYEVDTVLHGDAAVPAILSIRPDLVILDVNLPGKDGFE 63
IL+ +DD + T++ L + Y+V + I + DLV+ DV +P ++ F+
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 64 ICREARKQYDGV-IIMVTARDEPFDELLGLEFGADDYVHKPVEPRILLARIKAQLRRAPA 122
+ +K + +++++A++ + E GA DY+ KP + L+ I L
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 123 RAAE 126
R ++
Sbjct: 125 RPSK 128


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5350  PF06580       31     0.005    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 31.4 bits (71), Expect = 0.005
Identities = 33/184 (17%), Positives = 70/184 (38%), Gaps = 35/184 (19%)

Query: 201 DSIAQDVTELEELIDMSLTYARLEYSSLQSNLEMTAPVAWFEHQVNDAQLLYPDRAIESR 260
+ +T L EL+ SL Y+ SL L + + + A + + DR ++
Sbjct: 191 TKAREMLTSLSELMRYSLRYSNARQVSLADELTVV------DSYLQLASIQFEDR-LQFE 243

Query: 261 IEIGADLRVKMDRRLMSYAMRNLLRNASKYA------KSRIVVGISLVHGNIGIFVEDDG 314
+I + D ++ ++ L+ N K+ +I++ + +G + + VE+ G
Sbjct: 244 NQINPAIM---DVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTG 300

Query: 315 PGVPESERERIFDAFVRLDRRTGGYGLGLSITR---QVLHAHNGRIAVVDPVELGGARFE 371
++ +E G GL R Q+L+ +I + + + G
Sbjct: 301 SLALKNTKE--------------STGTGLQNVRERLQMLYGTEAQIKLSE--KQGKVNAM 344

Query: 372 ISWP 375
+ P
Sbjct: 345 VLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5352  SHAPEPROTEIN  354    e-124    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 354 bits (910), Expect = e-124
Identities = 166/340 (48%), Positives = 226/340 (66%), Gaps = 2/340 (0%)

Query: 1 MSTPLFGKLFAQPVAIDPGTASTRIYTHERGVVLNQPSVVCFRKGGASDARPTLEAVGEL 60
M G +F+ ++ID GTA+T IY +G+VLN+PSVV R+ A + AVG
Sbjct: 1 MLKKFRG-MFSNDLSIDLGTANTLIYVKGQGIVLNEPSVVAIRQDRAGSPKSVA-AVGHD 58

Query: 61 AKALLGREPGHLEAVRPMRHGVIADAHAAEQMIRSFIDMSRTRSRFGRRVEVTLCVPSDA 120
AK +LGR PG++ A+RPM+ GVIAD E+M++ FI + S V +CVP A
Sbjct: 59 AKQMLGRTPGNIAAIRPMKDGVIADFFVTEKMLQHFIKQVHSNSFMRPSPRVLVCVPVGA 118

Query: 121 TAVERRAIREAAFAAGVSEVELIEESLAAGLGAGLPVTEPVGSMVIDIGGGTTEVAVIAL 180
T VERRAIRE+A AG EV LIEE +AA +GAGLPV+E GSMV+DIGGGTTEVAVI+L
Sbjct: 119 TQVERRAIRESAQGAGAREVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISL 178

Query: 181 GGIVYREAIRVGGSQFDAAIVNHVRNLYGVLLGEQTAEHVKKAIGSATSAVPRTSTRAVG 240
G+VY ++R+GG +FD AI+N+VR YG L+GE TAE +K IGSA G
Sbjct: 179 NGVVYSSSVRIGGDRFDEAIINYVRRNYGSLIGEATAERIKHEIGSAYPGDEVREIEVRG 238

Query: 241 RSIGDGLPRSVELSNHDVADALAAPLKQVIGAVKSVLENAPAELVTDIANRGVVLTGGGA 300
R++ +G+PR L+++++ +AL PL ++ AV LE P EL +DI+ RG+VLTGGGA
Sbjct: 239 RNLAEGVPRGFTLNSNEILEALQEPLTGIVSAVMVALEQCPPELASDISERGMVLTGGGA 298

Query: 301 LLADLERLLYDETGLVARIADEPATCAVRGAGEAMGRLAM 340
LL +L+RLL +ETG+ +A++P TC RG G+A+ + M
Sbjct: 299 LLRNLDRLLMEETGIPVVVAEDPLTCVARGGGKALEMIDM 338


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5353  OMPADOMAIN    31     0.004    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 30.7 bits (69), Expect = 0.004
Identities = 9/24 (37%), Positives = 17/24 (70%)

Query: 103 GLNEATAMRDYLVARGVPADRIAV 126
A ++ DYL+++G+PAD+I+
Sbjct: 274 SERRAQSVVDYLISKGIPADKISA 297


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5354  INTIMIN       44     1e-05    Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 43.5 bits (102), Expect = 1e-05
Identities = 66/279 (23%), Positives = 96/279 (34%), Gaps = 22/279 (7%)

Query: 1667 STGAVNLAGTGATFDVSGATGTQTVGALSGAAGTNVNLGANALALNGSGSSTFGGTIGGA 1726
T A+ T V+ A + +SG A L AN+ NGSG +T
Sbjct: 574 GTEAITYTATVKKNGVAQANVPVSFNIVSGTA----VLSANSANTNGSGKATVTLKSDKP 629

Query: 1727 GGVTVASGTQ----------VLTGDNTYTGGTTIAAGGTLQLGNGGTSGSVAGNVVDNGA 1776
G V V++ T V+ D T T I A T + NG + + V+
Sbjct: 630 GQVVVSAKTAEMTSALNANAVIFVDQTKASITEIKADKTTAVANGQDAITYTVKVMKGDK 689

Query: 1777 LIVNQSGNVTIASVLSGTGSLTQAGSGRLTLTGTSTLSGPTTVGAGTLAVNGSLGQSTVT 1836
+ NQ T + +G +T TST G + V A V + V
Sbjct: 690 PVSNQEVTFTTTLGKLSNSTEKTDTNGYAKVTLTSTTPGKSLVSARVSDVAVDVKAPEVE 749

Query: 1837 VQNGATLTGTG-TIGGLVVQGGATAAATQPGAALNV--GGNVTFQPGSTFQVAATPQQSG 1893
T+ I G V+G Q G GGN + S A+
Sbjct: 750 FFTTLTIDDGNIEIVGTGVKGKLPTVWLQYGQVNLKASGGNGKYTWRSANPAIASVD--- 806

Query: 1894 SLAASGTATLNGGTVQVLANQSGYQPSTTYTILSASSGV 1932
A+SG TL ++ S + TYTI + +S +
Sbjct: 807 --ASSGQVTLKEKGTTTISVISSDNQTATYTIATPNSLI 843


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5358  BLACTAMASEA   369    e-131    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 369 bits (949), Expect = e-131
Identities = 118/270 (43%), Positives = 162/270 (60%), Gaps = 1/270 (0%)

Query: 41 AAAAADAIAPAAAATTLADLERDAGGRLGVCAIDTASGR-VIEHRAGERFPFCSTFKAML 99
A A + E GR+G+ +D ASGR + RA ERFP STFK +L
Sbjct: 13 ATLPLAVHASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRADERFPMMSTFKVVL 72

Query: 100 SAAVLAQSVERPGLLQQRVTYTKADLVNYSPVSEKHVGSGMTVAALCEAAIQYSDNSAAN 159
AVLA+ L++++ Y + DLV+YSPVSEKH+ GMTV LC AAI SDNSAAN
Sbjct: 73 CGAVLARVDAGDEQLERKIHYRQQDLVDYSPVSEKHLADGMTVGELCAAAITMSDNSAAN 132

Query: 160 LLMKLIGGPSAVTAYARSIGDDAFRLDRWETELNTALPGDPRDTTTPAAMAASLRVLTLG 219
LL+ +GGP+ +TA+ R IGD+ RLDRWETELN ALPGD RDTTTPA+MAA+LR L
Sbjct: 133 LLLATVGGPAGLTAFLRQIGDNVTRLDRWETELNEALPGDARDTTTPASMAATLRKLLTS 192

Query: 220 DALPAAQRAQLVAWLRGNKVGDKRMRAGVPAGWVVGDKTGTGDYGTTNDAGVIWPTSRAP 279
L A + QL+ W+ ++V +R+ +PAGW + DKTG G+ G ++ P ++A
Sbjct: 193 QRLSARSQRQLLQWMVDDRVAGPLIRSVLPAGWFIADKTGAGERGARGIVALLGPNNKAE 252

Query: 280 IVLAVYYTQTRADARAKDDVIASVARIVAQ 309
++ +Y T A ++ IA + + +
Sbjct: 253 RIVVIYLRDTPASMAERNQQIAGIGAALIE 282


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5359  ACRIFLAVINRP  24     0.048    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 24.0 bits (52), Expect = 0.048
Identities = 16/50 (32%), Positives = 27/50 (54%), Gaps = 8/50 (16%)

Query: 9 LLISLVLVAIVVYPYVRIVRRTGHSGWWILTMFVPVLNFVMLWVFAFARW 58
L +++LV +V+Y +++ +R T I T+ VPV V+L FA
Sbjct: 344 LFEAIMLVFLVMYLFLQNMRAT-----LIPTIAVPV---VLLGTFAILAA 385


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5362  PF06580       39     2e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 39.5 bits (92), Expect = 2e-05
Identities = 19/108 (17%), Positives = 37/108 (34%), Gaps = 23/108 (21%)

Query: 350 LLNNLVDNAIRYA----GEGARVDVSARIDGTTPVLEVADDGPGIPEAERTDVWERFYRG 405
L+ LV+N I++ +G ++ + D T LEV + G + +
Sbjct: 259 LVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTK---------- 308

Query: 406 EGAQAATSSGSGLGLSIV-KRIAEQHRASVALGTTRGGRGLTVTVRFP 452
+G GL V +R+ + + + + V P
Sbjct: 309 --------ESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5363  HTHFIS        93     3e-24    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 93.0 bits (231), Expect = 3e-24
Identities = 31/127 (24%), Positives = 59/127 (46%)

Query: 2 RVLLVEDDPLIGSGLEQGLKQEGFAVDWVKDGDAASLALRATGYGLLLLDLGLPNRDGLS 61
+L+ +DD I + L Q L + G+ V + + A L++ D+ +P+ +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 VLAALRRRDENLPAIIITARDGVPDRIAGLDSGADDYLVKPFELDELLARIRAVNRRHAG 121
+L +++ +LP ++++A++ I + GA DYL KPF+L EL+ I
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 122 RAQTTLA 128
R
Sbjct: 125 RPSKLED 131


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5366  SACTRNSFRASE  34     7e-05    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 34.1 bits (78), Expect = 7e-05
Identities = 15/59 (25%), Positives = 25/59 (42%), Gaps = 2/59 (3%)

Query: 61 GWLHVDLLVVPESARGQGAGTRIMDLAEREAVARGCHSAWLDTFDFQ--ARPFYEKRGY 117
G+ ++ + V + R +G GT ++ A A L+T D A FY K +
Sbjct: 88 GYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHF 146


67  Bcenmc03_5415  Bcenmc03_5420  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias    %GCBias  Product
Bcenmc03_5415      -1       10   1.555601  acetate kinase
Bcenmc03_5416      -2        9   1.026438  PHB de-polymerase domain-containing protein
Bcenmc03_5417      -3        7   1.525023  enoyl-(acyl carrier protein) reductase
Bcenmc03_5418      -2        9   1.594277  hypothetical protein
Bcenmc03_5419      -2        8   1.525112  major facilitator transporter
Bcenmc03_5420      -1       12   1.122234  TetR family transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5415  ACETATEKNASE  357    e-124    Acetate kinase family signature.
		>ACETATEKNASE#Acetate kinase family signature.

Length = 400

Score = 357 bits (919), Expect = e-124
Identities = 152/398 (38%), Positives = 226/398 (56%), Gaps = 16/398 (4%)

Query: 5 VLVLNAGSSSLKFSVYDTREDRSLDAGLHGQVENLHDTPHLFVTDAEGATLADSAVARPG 64
+LV+N GSSSLK+ + ++++ L GL E + +T
Sbjct: 3 ILVINCGSSSLKYQLIESKDGNVLAKGL---AERIG-INDSLLTHNANGEKIKIKKDMKD 58

Query: 65 HQGAI-EALHAWFAAHVG---REAAFDGVGHRVVHGGPYFTAPVRIDARVLDAIASLAPL 120
H+ AI L A + G + D VGHRVVHGG YFT+ V I VL AI L
Sbjct: 59 HKDAIKLVLDALVNSDYGVIKDMSEIDAVGHRVVHGGEYFTSSVLITDDVLKAITDCIEL 118

Query: 121 APLHQPHHVDAIRAVAAVAPNLPQVACFDTAFHATVPALEREFALPRAL-TEQGIVRYGF 179
APLH P +++ I+A + P++P VA FDTAFH T+P + +P T+ I +YGF
Sbjct: 119 APLHNPANIEGIKACTQIMPDVPMVAVFDTAFHQTMPDYAYLYPIPYEYYTKYKIRKYGF 178

Query: 180 HGLSYEYIATALAA-LDPSWMQHRTVVAHLGNGASLCALANGRSVATTMGFTAVDGLPMG 238
HG S++Y++ A L+ + + HLGNG+S+ A+ NG+S+ T+MGFT ++GL MG
Sbjct: 179 HGTSHKYVSQRAAEILNKPIESLKIITCHLGNGSSIAAVKNGKSIDTSMGFTPLEGLAMG 238

Query: 239 TRTGALDPGVILYLQRHTGRSLDEVEHLIYAESGLLGVSGVSSDMRTLLASDA----PSA 294
TR+G++DP +I YL S +EV +++ +SG+ G+SG+SSD R L + A
Sbjct: 239 TRSGSIDPSIISYLMEKENISAEEVVNILNKKSGVYGISGISSDFRDLEDAAFKNGDKRA 298

Query: 295 AHAVELFAYRAARELAALAGVLGGLDTLVFTAGIGEHAPRVRERICRRAAWLGIVLDDAA 354
A+ +FAYR + + + A +GG+D +VFTAGIGE+ P +RE I +LG LD
Sbjct: 299 QLALNVFAYRVKKTIGSYAAAMGGVDVIVFTAGIGENGPEIREFILDGLEFLGFKLDKEK 358

Query: 355 N-AAGLP-VISSDASRVTVRVIPTDENLMIARHTRRVL 390
N G +IS+ S+V V V+PT+E MIA+ T +++
Sbjct: 359 NKVRGEEAIISTADSKVNVMVVPTNEEYMIAKDTEKIV 396


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5417  DHBDHDRGNASE  51     1e-09    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 50.8 bits (121), Expect = 1e-09
Identities = 44/185 (23%), Positives = 67/185 (36%), Gaps = 7/185 (3%)

Query: 6 KRGLVVGIANGQSIAWGCARAFCRAGATL-AVTWQSDKTLPHVEPLFAQLDAPIRMPLDV 64
K + G A G I AR GA + AV + +K V L A+ P DV
Sbjct: 9 KIAFITGAAQG--IGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADV 66

Query: 65 GRPDQMAAVFDAIAVQWGAIDFVLHSVAYAPKADLQGRVVDSSPEGFSLAMDTSCHSFIR 124
+ + I + G ID +++ + + FS+ S F
Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSV---NSTGVFNA 123

Query: 125 MARLAEPLMT-RGGSLMAMSYLGAEQVVANYGVMGPVKAALEASVRYLAAELGGAGIRVN 183
+++ +M R GS++ + A + KAA + L EL IR N
Sbjct: 124 SRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCN 183

Query: 184 AVSPG 188
VSPG
Sbjct: 184 IVSPG 188


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5419  TCRTETB       115    4e-30    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 115 bits (289), Expect = 4e-30
Identities = 86/407 (21%), Positives = 166/407 (40%), Gaps = 15/407 (3%)

Query: 9 LIVACAL-FMESVDANIIVTALPAMARDFGHNPVTLNIAITAYVVGLGVFIPICGWLADR 67
LI C L F ++ ++ +LP +A DF P + N TA+++ + + G L+D+
Sbjct: 16 LIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQ 75

Query: 68 FSARAVFRTAIGIFVVGSLMCAASNS-LGVLTFARFIQGVGGAMMVPVGRIIIFRAVPRA 126
+ + I I GS++ +S +L ARFIQG G A + +++ R +P+
Sbjct: 76 LGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKE 135

Query: 127 DLVRAMNYLAIPALFGPTVGPLVGGFITTYLHWRMIFFINVPIGIYGIYLASKHIANTHE 186
+ +A + G VGP +GG I Y+HW + I + I I + K +
Sbjct: 136 NRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPM-ITIITVPFLMKLLKKEVR 194

Query: 187 PDPGPLDWFGFLLSASGAALLLMGLTLIDGSLTSRSNAIIMCAAGAAMLALYVPYARRKE 246
G D G +L + G ++ T S +I ++V + R+
Sbjct: 195 IK-GHFDIKGIILMSVGIVFFMLFTT---------SYSISFLIVSVLSFLIFVKHIRKVT 244

Query: 247 RPVLDLSFLKIPTYHASVVGGSLFRIGLGAVPFLLPLALQEGLGMSAFHSG-LITCASAL 305
P +D K + V+ G + + ++P +++ +S G +I +
Sbjct: 245 DPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTM 304

Query: 306 GGAVSRSTATHTLRRFGFRTVLIYNAAFAGLAIAAYGVFHPGMATWAIWLIVLVGGIFPA 365
+ + R G VL F ++ + + +IV V G
Sbjct: 305 SVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSF 364

Query: 366 LQFTSLNSMIYADISPRDAGRATSLGSVVQQMSLGLGVTVAGLVLHV 412
+ T +++++ + + ++AG SL + +S G G+ + G +L +
Sbjct: 365 TK-TVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSI 410


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5420  HTHTETR       54     3e-11    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 53.9 bits (129), Expect = 3e-11
Identities = 28/181 (15%), Positives = 66/181 (36%), Gaps = 13/181 (7%)

Query: 9 PRQRRSVATVDAIVEAAARILERDGFAGYTTNAVAALAGVSIGSLYQYFPNRDALTAALV 68
++ + T I++ A R+ + G + + +A AGV+ G++Y +F ++ L + +
Sbjct: 4 KTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIW 63

Query: 69 ERESAHLLDDVD----------RAAALSSCDDVLRALVRGAVAHQMRRPVLARLIDFEEA 118
E +++ + + VL + V + + + E
Sbjct: 64 ELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEM 123

Query: 119 RLPLGART---ERVADRIHATLLHALRLRDAPRVAAPDVVAHDLLAIVKGIVDAAGARGE 175
+ A+ DRI TL H + + P A + + G+++ +
Sbjct: 124 AVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQ 183

Query: 176 T 176
+
Sbjct: 184 S 184


68  Bcenmc03_5439  Bcenmc03_5445  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias    %GCBias  Product
Bcenmc03_5439      -1       11   3.150478  type III secretion protein SpaR/YscT/HrcT
Bcenmc03_5440      -1       11   3.598970  type III secretion exporter
Bcenmc03_5441      -1       11   3.420249  asparagine synthase
Bcenmc03_5442      -2       12   3.421459  hypothetical protein
Bcenmc03_5443      -3       13   2.844845  lytic transglycosylase catalytic
Bcenmc03_5444      -3       14   2.437694  type III secretion system protein
Bcenmc03_5445      -3       13   2.375103  type III secretion system apparatus protein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5439  TYPE3IMRPROT  134    3e-40    Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature.

Length = 261

Score = 134 bits (339), Expect = 3e-40
Identities = 61/248 (24%), Positives = 114/248 (45%), Gaps = 3/248 (1%)

Query: 15 LRPLLYVMPRLLPIMFVVPVFNEQIITGLVRNGIAVVIAAFVAPTIDAAQVAALPFLMWC 74
L + + R+L ++ P+ +E+ + V+ G+A++I +AP++ A V F
Sbjct: 13 LNLYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPANDVPVFSFF-AL 71

Query: 75 LLVAKEAMVGMLLAGAFSAVLFAIQGVGYLIDFQTGSGSAAFFDPMGGHEGGPTSGFLNF 134
L ++ ++G+ L A++ G +I Q G A F DP + ++
Sbjct: 72 WLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHLNMPVLARIMDM 131

Query: 135 VAIALFVTAGGLQVLVQLFAQSYAWWPIGSLGPDFSSMLQTFIVRQTDTIFEWMVKLAAP 194
+A+ LF+T G L+ L ++ PIG +S + + IF + LA P
Sbjct: 132 LALLLFLTFNGHLWLISLLVDTFHTLPIGG--EPLNSNAFLALTKAGSLIFLNGLMLALP 189

Query: 195 VTIVLVLVELGIGLVGRAVPQLNIFVFSQPLKSALAVLMMILFLPVVYASLHSLLSPDSG 254
+ +L+ + L +GL+ R PQL+IFV PL + + +M +P++ L S
Sbjct: 190 LITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFCEHLFSEIFN 249

Query: 255 LMALLRAL 262
L+A + +
Sbjct: 250 LLADIISE 257


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5440  TYPE3IMSPROT  244    6e-81    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 244 bits (625), Expect = 6e-81
Identities = 93/339 (27%), Positives = 174/339 (51%), Gaps = 3/339 (0%)

Query: 2 AEKDQKPTAKRLREAREKGDVPKSAETVSSAFFVGVCVALAVGIGALFARVQALFRLVFD 61
EK ++PT K++R+AR+KG V KS E VS+A V + L F L + +
Sbjct: 3 GEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIPAE 62

Query: 62 AVGAADPSARLAALIDGAARDWATLSAQIVAAGLQAGLLAGFVQVGGVMAWSRLVPQLSR 121
S L+ ++D ++ L ++ + + VQ G +++ + P + +
Sbjct: 63 QSYL-PFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDIKK 121

Query: 122 LNPAEGMKNLWSLRNLVNLAKMLMKTALLVATLGWLIVESLDPSVQSGFTRPASILALIV 181
+NP EG K ++S+++LV K ++K LL + +I +L +Q I L+
Sbjct: 122 INPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPLLG 181

Query: 182 KLLMLLFGWAALIYIVMALIDIVHQRHEFNQKMKMSIDEVRREHKEDEGDPHIQAKRRQL 241
++L L + ++V+++ D + +++ +++KMS DE++RE+KE EG P I++KRRQ
Sbjct: 182 QILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRRQF 241

Query: 242 AREAQFASLPDRIGYASVVVYSP-RVAVALYYG-GMGSLPWVLARGEGDAAERIVRLARD 299
+E Q ++ + + +SVVV +P +A+ + Y G LP V + + + ++A +
Sbjct: 242 HQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIAEE 301

Query: 300 ALRPTLANVGLAQALYETTPENGTIQPQHFRAVAQLLKW 338
P L + LA+ALY + I + A A++L+W
Sbjct: 302 EGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRW 340


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5442  PF03544       37     0.001    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 36.9 bits (85), Expect = 0.001
Identities = 13/67 (19%), Positives = 20/67 (29%)

Query: 23 PVVAPPPPPPPPPKKDDPAAGPANPTAAPPIPVTASLATDPSKPTNAEIQSATSLIQSMA 82
PVV P P P PK P+ + + + P +AT+
Sbjct: 91 PVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPV 150

Query: 83 AQYTAPP 89
+ P
Sbjct: 151 TSVASGP 157


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5444  TYPE3IMPPROT  226    2e-77    Type III secretion system inner membrane P protein ...
		>TYPE3IMPPROT#Type III secretion system inner membrane P protein family signature.

Length = 224

Score = 226 bits (578), Expect = 2e-77
Identities = 82/220 (37%), Positives = 129/220 (58%), Gaps = 10/220 (4%)

Query: 6 NPVALIAVIAALGIAPFAALMVTSYTKLVVVLGLLRSALGIQQVPPNLVLNGIALILSLF 65
N ++LIA++A + PF T + K +V ++R+ALG+QQ+P N+ LNG+AL+LS+F
Sbjct: 3 NDISLIALLAFSTLLPFIIASGTCFVKFSIVFVMVRNALGLQQIPSNMTLNGVALLLSMF 62

Query: 66 IMAPVGMSIRDALQARHFDASGQLSTSDIGALADAALPPIKDFLVSHTRQRDREFFVRTA 125
+M P+ + + S + D L +D+L+ ++ + +FF
Sbjct: 63 VMWPIMHDAYVYFEDEDVTFNDI---SSLSKHVDEGLDGYRDYLIKYSDRELVQFFENAQ 119

Query: 126 TSVWPKNRA-------DGIKDDDLLVLVPSFTLAELTKAFQIGFVIYIVFIVVDLLVANI 178
D I+ + L+P++ L+E+ AF+IGF +Y+ F+VVDL+V+++
Sbjct: 120 LKRQYGEETETVKRDKDEIEKPSIFALLPAYALSEIKSAFKIGFYLYLPFVVVDLVVSSV 179

Query: 179 LLALGMQMISPTTISVPFKLLLFVALDGWSLLVHGLVLSY 218
LLALGM M+SP TIS P KL+LFVALDGW+LL GL+L Y
Sbjct: 180 LLALGMMMMSPVTISTPIKLVLFVALDGWTLLSKGLILQY 219


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5445  FLGMOTORFLIN  55     2e-11    Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 54.5 bits (131), Expect = 2e-11
Identities = 25/73 (34%), Positives = 37/73 (50%), Gaps = 1/73 (1%)

Query: 312 VDLRFELPPTSMPLGELSALQPGAVIELQQGINQSVIHLVANGMLIGTGHLIAVGQKLGV 371
V L EL T M + EL L G+V+ L + + ++ NG LI G ++ V K GV
Sbjct: 62 VKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPL-DILINGYLIAQGEVVVVADKYGV 120

Query: 372 RVVTLTQPAPRER 384
R+ + P+ R R
Sbjct: 121 RITDIITPSERMR 133



Score = 29.9 bits (67), Expect = 0.007
Identities = 17/57 (29%), Positives = 27/57 (47%), Gaps = 3/57 (5%)

Query: 189 ALAVFFAAAPAALADARAAYANL---PVPLVFEIGRTELTTAELADVVGGDIIAIER 242
A AVF ++ A + PV L E+GRT +T EL + G ++A++
Sbjct: 35 ADAVFQQLGGGDVSGAMQDIDLIMDIPVKLTVELGRTRMTIKELLRLTQGSVVALDG 91


69  Bcenmc03_5454  Bcenmc03_5461  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias    %GCBias  Product
Bcenmc03_5454       0       15   3.079017  hypothetical protein
Bcenmc03_5455       3       14   1.219039  hypothetical protein
Bcenmc03_5456       2       13   1.883357  sigma-54 dependent trancsriptional regulator
Bcenmc03_5457       2       13   1.410754  hypothetical protein
Bcenmc03_5458       2       13   0.674261  hypothetical protein
Bcenmc03_5459       3       12   0.603959  two component LuxR family transcriptional
Bcenmc03_5460       4       12   0.672790  methyl-accepting chemotaxis sensory transducer
Bcenmc03_5461       4       12   0.551799  2-dehydropantoate 2-reductase
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5454  IGASERPTASE   46     3e-07    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 46.2 bits (109), Expect = 3e-07
Identities = 47/263 (17%), Positives = 73/263 (27%), Gaps = 28/263 (10%)

Query: 264 ASMRAAPVSVPVPVPALAPVAAAPAVATPAVAAAA--------PAVAVPTVAAAVPAAAA 315
R V A P+V + A PA A P+ A +
Sbjct: 985 VEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENS 1044

Query: 316 PAAAAAPAVAAAPAASVVPAAAMAAVPAAA-VIAAPAVADKAAPAPAAPVADTKAAEPVQ 374
+ A A A + V A + A T + +
Sbjct: 1045 KQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETK--E 1102

Query: 375 PVADKAPEPAPAVADKTPEPAPAVADKAP-----EPAQPVADKAPEPMPA-------ATD 422
+ E A +KT E + +P E QP A+ A E P +
Sbjct: 1103 TATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQT 1162

Query: 423 TTQAAGEPVAEPMPAAAVVAAPAADAKAAEPAPQATAEAPAPAAPQPAVAAAPADMPAAD 482
T A E A+ + + + E PA QP V + ++ P
Sbjct: 1163 NTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNR 1222

Query: 483 AK-----APDAVESAGTAAAQAA 500
+ P VE A T++ +
Sbjct: 1223 HRRSVRSVPHNVEPATTSSNDRS 1245



Score = 43.1 bits (101), Expect = 3e-06
Identities = 42/251 (16%), Positives = 67/251 (26%), Gaps = 16/251 (6%)

Query: 305 TVAAAVPAAAAPAAAAAPAVAAAPAASVVPAAAMAAVPAAAVIAAPAVADKAAP---APA 361
TV A P+V + A PA A + +
Sbjct: 991 TVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKT 1050

Query: 362 APVADTKAAEPVQPVADKAPEPAPAVADKTPEPAPAVADKAPEPAQPVADKAPEPMPAA- 420
+ A E + A E V T A + + Q K +
Sbjct: 1051 VEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEE 1110

Query: 421 -----TDTTQAAGEPVAE--PMPAAAVVAAPAADAKAAEPAPQATAEAPAPAAPQPAVAA 473
T+ TQ + ++ P + P A+ A E P + P A
Sbjct: 1111 KAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEP-ARENDPTVNIKEPQSQTNTTADTE 1169

Query: 474 APADMPAADAKAPDAVESAGTAAAQAAGMPALTDPAQALPPATVDQQAAP----AAPVAP 529
PA +++ + P + P T PA P + P V
Sbjct: 1170 QPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRS 1229

Query: 530 APTVISTSTSS 540
P + +T+S
Sbjct: 1230 VPHNVEPATTS 1240



Score = 32.3 bits (73), Expect = 0.006
Identities = 24/186 (12%), Positives = 48/186 (25%), Gaps = 20/186 (10%)

Query: 313 AAAPAAAAAPAVAAAPAASVVPAAAMAAVPAAAVIAAPAVADKAAPAPAAPVADTKAAEP 372
P + + + +V P A A V + A A ++
Sbjct: 1120 QEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNV 1179

Query: 373 VQPVADKAPEPAPAVADKTPEPAPAVADKAPEPAQPVADKAPEPMPAATDTTQAAGEPVA 432
QPV + + + P QP T ++++ +P
Sbjct: 1180 EQPVTESTTV------NTGNSVVENPENTTPATTQP------------TVNSESSNKP-- 1219

Query: 433 EPMPAAAVVAAPAADAKAAEPAPQATAEAPAPAAPQPAVAAAPADMPAADAKAPDAVESA 492
+ +V + P A + + A A A A + ++
Sbjct: 1220 KNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTNAVLSDARAKAQFVALNVGKAV 1279

Query: 493 GTAAAQ 498
+Q
Sbjct: 1280 SQHISQ 1285


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5456  HTHFIS        360    e-122    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 360 bits (926), Expect = e-122
Identities = 145/466 (31%), Positives = 211/466 (45%), Gaps = 48/466 (10%)

Query: 47 AALVDVLASRGWDVWRAKTVADALNLVKANRPHAGIVDFDSFASPDVASFEAL----LRD 102
L L+ G+DV A + A + D PD +F+ L
Sbjct: 17 TVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDV---VMPDENAFDLLPRIKKAR 73

Query: 103 PRVGWVALADGERLRNITIARLIRHCCFDYVRNAAAYTTIGYLVGHAYGMLKLADGDPAA 162
P + + ++ A +DY+ T + ++G A K
Sbjct: 74 PDLPVLVMSAQNTFMTAIKA--SEKGAYDYLPKPFDLTELIGIIGRALAEPK-RRPSKLE 130

Query: 163 EAPPPGGAMIGACGAMRRLFATIRKVANTEATVFIAGESGTGKELTAAAIHRQSSRADAP 222
+ G ++G AM+ ++ + ++ T+ T+ I GESGTGKEL A A+H R + P
Sbjct: 131 DDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGP 190

Query: 223 FVAVNCAAIPTTLLQAELFGHERGAFTGAHQRKIGRIEAAHGGTLFLDEIGDMPFESQAS 282
FVA+N AAIP L+++ELFGHE+GAFTGA R GR E A GGTLFLDEIGDMP ++Q
Sbjct: 191 FVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTR 250

Query: 283 LLRFLQEGKIERLGGHASIPVDVRIVSATHVDLEAAMQAGRFRADLYYRLCVLRIDEPPL 342
LLR LQ+G+ +GG I DVRIV+AT+ DL+ ++ G FR DLYYRL V+ + PPL
Sbjct: 251 LLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPPL 310

Query: 343 RMRGRDIMLLADDVLRRYRDDGSYRIRGFTPCAIEAIHNYPWPGNVRELINRIRFAVVMT 402
R R DI L +++ +G ++ F A+E + +PWPGNVREL N +R +
Sbjct: 311 RDRAEDIPDLVRHFVQQAEKEG-LDVKRFDQEALELMKAHPWPGNVRELENLVRRLTALY 369

Query: 403 NGPLISAADLELR-------------------------------------PYTSLRPPTL 425
+I+ +E
Sbjct: 370 PQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLY 429

Query: 426 AQARRQAERHAIEETLLRHRHQHADVAAELGISRATLYRLMIAHGL 471
+ + E I L R A LG++R TL + + G+
Sbjct: 430 DRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGV 475


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5459  HTHFIS        29     0.018    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 29.0 bits (65), Expect = 0.018
Identities = 13/61 (21%), Positives = 28/61 (45%), Gaps = 3/61 (4%)

Query: 34 VAVYRSAAELVASLGGVDCDIVLVDYAIRGDEQMDGLALFDWLRRTRPNVGIVVLVANEN 93
V + +AA L + D D+V+ D + + L +++ RP++ ++V+ A
Sbjct: 30 VRITSNAATLWRWIAAGDGDLVVTD--VV-MPDENAFDLLPRIKKARPDLPVLVMSAQNT 86

Query: 94 P 94

Sbjct: 87 F 87


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5461  NUCEPIMERASE  30     0.015    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 29.8 bits (67), Expect = 0.015
Identities = 18/64 (28%), Positives = 26/64 (40%), Gaps = 13/64 (20%)

Query: 1 MRILVVG-AGAVGGYFGGRLAAAGRDVTFL----------VRDGRAAALARDGLLIRSPR 49
M+ LV G AG +G + RL AG V + ++ R LA+ G +
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFH--K 58

Query: 50 GDLT 53
DL
Sbjct: 59 IDLA 62


70  Bcenmc03_5482  Bcenmc03_5489  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
Bcenmc03_5482  -2      11       1.241443   3-ketoacyl-(acyl-carrier-protein) reductase
Bcenmc03_5483  -2      11       0.670256   hypothetical protein
Bcenmc03_5484  0       14       1.607156   short chain dehydrogenase
Bcenmc03_5485  1       13       1.439350   DNA-binding response regulator CreB
Bcenmc03_5486  1       15       1.155833   sensory histidine kinase CreC
Bcenmc03_5487  1       14       1.564111   hypothetical protein
Bcenmc03_5488  2       14       2.917364   endoribonuclease L-PSP
Bcenmc03_5489  2       15       2.724164   major facilitator transporter
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5482  DHBDHDRGNASE  121    2e-35    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 121 bits (305), Expect = 2e-35
Identities = 79/255 (30%), Positives = 120/255 (47%), Gaps = 14/255 (5%)

Query: 16 GRVVLVTGAAQGIGAAIARRFAESDAFVAVADLNGDAAAAQADALASAGGDARAYRVDAA 75
G++ +TGAAQGIG A+AR A A +A D N + +L + A A+ D
Sbjct: 8 GKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVR 67

Query: 76 SRGDLDALVAAVERDGGRLDVVIHNAAYFPLTPFMQIDEPTLDRTLSVNLSALFWLAQAA 135
+D + A +ER+ G +D++++ A + + + T SVN + +F +++
Sbjct: 68 DSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127

Query: 136 LPAFQRAGAGRLLVTSSVTG--PRVVYPGLAHYAASKAGVNGFIRAAALELARRNVTVNG 193
+G ++ S PR +A YA+SKA F + LELA N+ N
Sbjct: 128 SKYMMDRRSGSIVTVGSNPAGVPRT---SMAAYASSKAAAVMFTKCLGLELAEYNIRCNI 184

Query: 194 VEPGMIRTPAAGNL-----GDASV----AAQIAHDIPLARMGEPEDIANAMLFLASADAA 244
V PG T +L G V IPL ++ +P DIA+A+LFL S A
Sbjct: 185 VSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAG 244

Query: 245 YITGQTIVVDGGATL 259
+IT + VDGGATL
Sbjct: 245 HITMHNLCVDGGATL 259


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5484  DHBDHDRGNASE  80     5e-20    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 79.7 bits (196), Expect = 5e-20
Identities = 61/253 (24%), Positives = 102/253 (40%), Gaps = 26/253 (10%)

Query: 5 LKGKAVAITGGFGHLGVATAAWLGERGARVALIGRGA-----APAAQTLPGVPADALRIG 59
++GK ITG +G A A L +GA +A + ++ A+A
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPA- 64

Query: 60 GIDLVDPQAAVQALDTVNREFGRVDALLNIAGAFVWQTIADGDAATWDRMYELNVKTALN 119
D+ D A + + RE G +D L+N+AG I W+ + +N N
Sbjct: 65 --DVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFN 122

Query: 120 ASKAALPYLVASQAGRIVNIGAGAAFKAGAGMGAYAAAKAGVTRLTEALAAELLDRGVTV 179
AS++ Y++ ++G IV +G+ A M AYA++KA T+ L EL + +
Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 180 NALLPSIIDTPPNRKDMPDAD------------------FSRWVRPEQLAATIGFLLSAD 221
N + P +T D + + +P +A + FL+S
Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242

Query: 222 AQAITGASIPVSG 234
A IT ++ V G
Sbjct: 243 AGHITMHNLCVDG 255


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5485  HTHFIS        86     2e-21    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 85.7 bits (212), Expect = 2e-21
Identities = 38/134 (28%), Positives = 60/134 (44%), Gaps = 3/134 (2%)

Query: 1 MTQPTILIVEDEQAIADTIVYALGTDGMQTVHCTLAQAALDRLRDTHVDLVVLDVGLPDL 60
MT TIL+ +D+ AI + AL G + A + DLVV DV +PD
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 SGFEVCRRLRT-FTDIPVIFLTARHDEIDRIV-GLEIGADDYVVKPFSPRELAARVRVIL 118
+ F++ R++ D+PV+ ++A + + E GA DY+ KPF EL + L
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSA-QNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119

Query: 119 RRFHRTAAPEPAPA 132
R + +
Sbjct: 120 AEPKRRPSKLEDDS 133


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5489  TCRTETB       106    6e-27    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 106 bits (266), Expect = 6e-27
Identities = 74/416 (17%), Positives = 158/416 (37%), Gaps = 23/416 (5%)

Query: 24 HSWALVVLLVGAILPPLDYFIVNLALPAIRDGIGAHQAELQLVVSAYACANAVVQITGGR 83
H+ L+ L + + L+ ++N++LP I + A V +A+ ++ G+
Sbjct: 12 HNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGK 71

Query: 84 LGDLYGRKRMFMIGMAGFVLASTLCGLADNG-TVLVGGRVLQGLFAAILAPQVLATIRSV 142
L D G KR+ + G+ S + + + ++L+ R +QG AA V+ +
Sbjct: 72 LSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARY 131

Query: 143 FSPQEQVRVMGFYGFAFGLAAVIGQLGGGALISLHPFGLGWRAIFLVNLPIGILALIGSW 202
+ + + G G + +G GG + H +L+ +P+ + +
Sbjct: 132 IPKENRGKAFGLIGSIVAMGEGVGPAIGGMI--AHYIHWS----YLLLIPMITIITVPFL 185

Query: 203 RFIPENRAPRGQRIDVPGTVLMSLFLLMLVYPLTHGREAGWPLWMIACGVGALPMLGALL 262
+ + D+ G +LMS+ ++ + T + + +++
Sbjct: 186 MKLLKKEVRIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLS------------F 233

Query: 263 AVEARRLARGHDPLLDVRLLRNPVVALGLLLAFL-FYTLSAFFLSYGIYLQGCLNWSPLA 321
+ + + + DP +D L +N +G+L + F T++ F ++ S
Sbjct: 234 LIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAE 293

Query: 322 SGFAIL-PLGLGFLASPLLTTRLVARFGGYRVLTLGFAMLAAGVAIAAALARDGAPGPGF 380
G I+ P + + + LV R G VL +G L+ A+ L +
Sbjct: 294 IGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLL-ETTSWFMT 352

Query: 381 YAGIAAIGIGQGLVLPSVVRIVLAEVDAARAGVASGMVSAMLQIGAAVGAATIGGV 436
+ +G G + IV + + AG +++ + G A +GG+
Sbjct: 353 IIIVFVLG-GLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGL 407


71  Bcenmc03_5507  Bcenmc03_5518  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
Bcenmc03_5507  0       13       1.663367   porin
Bcenmc03_5508  0       11       1.831823   extracellular solute-binding protein
Bcenmc03_5509  1       13       1.747159   binding-protein-dependent transport systems
Bcenmc03_5510  0       11       1.571607   binding-protein-dependent transport systems
Bcenmc03_5511  0       10       1.675792   LacI family transcription regulator
Bcenmc03_5512  0       9        0.944690   phosphatidylinositol-specific phospholipase C X
Bcenmc03_5513  -1      11       0.721813   dienelactone hydrolase-like protein
Bcenmc03_5514  0       9        0.942369   GCN5-related N-acetyltransferase
Bcenmc03_5515  0       9        0.748231   LysR family transcriptional regulator
Bcenmc03_5516  0       9        0.972834   amidohydrolase
Bcenmc03_5517  2       11       0.767654   porin
Bcenmc03_5518  1       13       0.723301   major facilitator transporter
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5507  ECOLNEIPORIN  70     2e-15    E.coli/Neisseria porin superfamily signature.
		>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature.

Length = 331

Score = 70.2 bits (172), Expect = 2e-15
Identities = 78/355 (21%), Positives = 117/355 (32%), Gaps = 45/355 (12%)

Query: 21 AQTSGSVTLYGTVDTGIIYSTNQQFTRADGSTGGGHAWQMGGGNLVPSRFGFQGAEPLGG 80
VTLYGT+ G+ S + + G + S+ GF+G E LG
Sbjct: 15 VAAMADVTLYGTIKAGVETSRSVA-HNGAQAASVETG---TGIVDLGSKIGFKGQEDLGN 70

Query: 81 GLDAVFTLEQQFLSANGQALQGGTAFSRQAWVGLRQEGIGTLGLGRQYDSYTDMLGAYVS 140
GL A++ +EQ+ A +RQ+++GL+ G G L +GR D
Sbjct: 71 GLKAIWQVEQK----ASIAGTDSGWGNRQSFIGLKG-GFGKLRVGRLNSVLKD----TGD 121

Query: 141 SNNWSTPYGSHLGDVDNLNAAFNFNNAVKFTSADFNGLTFGGTFSFGGQAGDFSAKRGYA 200
N W +LG V+ + +V++ S +F GL+ ++ AG Y
Sbjct: 122 INPW-DSKSDYLG-VNKIAEPEARLISVRYDSPEFAGLSGSVQYALNDNAG-RHNSESYH 178

Query: 201 VAATYTRAPVAFSVGYLDLHQPLDAALGGASSYIGDFACSNPGAMYCLLQDAGSMRAFGA 260
Y G Y + Y D ++ A A
Sbjct: 179 AGFNYKNGGFFVQYGGAYKRHHQVQENVNIEKY----QIHRLVSGY----DNDALYASVA 230

Query: 261 GGSVTLGAATVALTYTHTRLGDSRYFSTAAQPRTQAFTFDIGELNVTYMFTPALQGGVAY 320
A V Y+H AA + G + TP + A+
Sbjct: 231 --VQQQDAKLVEENYSHN-----SQTEVAAT-----LAYRFGNV------TPRV--SYAH 270

Query: 321 IFNAAHTDGRGTTRYHQVNVGANYSLSKRTALYAVAIGQVASGTGLGTDADGNAA 375
F + Y QV VGA Y SKRT+ V+ G + G G
Sbjct: 271 GFKGSFDATNYNNDYDQVVVGAEYDFSKRTSAL-VSAGWLQEGKGESKFVSTAGG 324


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5510  VACCYTOTOXIN  30     0.009    Helicobacter pylori vacuolating cytotoxin signature.
		>VACCYTOTOXIN#Helicobacter pylori vacuolating cytotoxin signature.

Length = 1291

Score = 30.4 bits (68), Expect = 0.009
Identities = 14/40 (35%), Positives = 17/40 (42%), Gaps = 3/40 (7%)

Query: 62 APTVEHFASVWANGIGAPLFNSLLVGFGTTLLALALAFPA 101
AP E +VWAN IG NS G +L + A
Sbjct: 1016 APKYEKPTNVWANAIGGTSLNS---GGNASLYGTSAGVDA 1052


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5511  HTHTETR       29     0.017    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 29.2 bits (65), Expect = 0.017
Identities = 21/114 (18%), Positives = 39/114 (34%), Gaps = 4/114 (3%)

Query: 10 VTASDVAARAGVSRSAVSRAFSPTASIAPQTRERVMVAARAL--GYQVNLIARDMITQRS 67
+ ++A AGV+R A+ F + + + E L YQ + R
Sbjct: 32 TSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELEYQAKFPGDPLSVLRE 91

Query: 68 SMIGVVTAGFENPFRARLLSDLMAALGQRALTPLVTNAED--PRQVRQSLEQLL 119
+I V+ + R L+ + +V A+ + +EQ L
Sbjct: 92 ILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRNLCLESYDRIEQTL 145


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5514  SACTRNSFRASE  38     7e-06    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 37.6 bits (87), Expect = 7e-06
Identities = 25/109 (22%), Positives = 41/109 (37%), Gaps = 18/109 (16%)

Query: 41 EPTDAAAVLVRIDDGRAYVAVDPQGTCVGFAFYRLLDAQRLYLEELDVAPSHAGQRIGAR 100
E AA L +++ C+G R +E++ VA + + +G
Sbjct: 61 EEEGKAAFLYYLEN-----------NCIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTA 109

Query: 101 LIEQVIARAAREHVEQVVLSTFRDAPWNAP---YYARLGFRIID-DTAL 145
L+ + I A H ++L T N +YA+ F I DT L
Sbjct: 110 LLHKAIEWAKENHFCGLMLETQDI---NISACHFYAKHHFIIGAVDTML 155


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5516  OMPTIN        33     0.001    Omptin serine protease signature.
		>OMPTIN#Omptin serine protease signature.

Length = 317

Score = 33.4 bits (76), Expect = 0.001
Identities = 24/119 (20%), Positives = 48/119 (40%), Gaps = 17/119 (14%)

Query: 235 AAVVTVGTFHAGTAPNVIPETATLQLSVRSLDAATRDEVEARIRRIADAQARAYGTVAQV 294
+ + +F + + P+ +S+ +L T++ V +A+ R V+Q+
Sbjct: 11 TTPIAISSFASTETLSFTPDNINADISLGTLSGKTKERV-----YLAEEGGR---KVSQL 62

Query: 295 DYQAISRVVVNDAA--AADLAVETITALA-GAGGLTLLADGVMGSEDFSWMTERVPGCY 350
D++ N+AA + + + ++ GA G T L D WM PG +
Sbjct: 63 DWK------FNNAAIIKGAINWDLMPQISIGAAGWTTLGSRGGNMVDQDWMDSSNPGTW 115


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5517  ECOLNEIPORIN  55     2e-10    E.coli/Neisseria porin superfamily signature.
		>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature.

Length = 331

Score = 54.8 bits (132), Expect = 2e-10
Identities = 65/323 (20%), Positives = 117/323 (36%), Gaps = 46/323 (14%)

Query: 43 VALYGSVDMGINYQS-VGGRSTWQTQSG-----GEWTSKFGFFGRENLGGGWRAEFNLES 96
V LYG++ G+ V + SK GF G+E+LG G +A + +E
Sbjct: 21 VTLYGTIKAGVETSRSVAHNGAQAASVETGTGIVDLGSKIGFKGQEDLGNGLKAIWQVEQ 80

Query: 97 GFLANNGAQQDTQSFFNRQSWIGLMSDRYGRLRLGKQIGTGLPLFIDVFGTVGTNSVYTW 156
+ A D+ NRQS+IGL +G+LR+G+ + D +S +
Sbjct: 81 K---ASIAGTDSGW-GNRQSFIGLKGG-FGKLRVGRLNS----VLKDTGDINPWDSKSDY 131

Query: 157 LGAAAVQTARGVGYNSDLGPGATQLPARVDN---AITYRTPIVAGTTTLMLMYAPSNVAG 213
LG + A + ++ Y +P AG + + YA ++ AG
Sbjct: 132 LGVNKI--------------------AEPEARLISVRYDSPEFAGLSG-SVQYALNDNAG 170

Query: 214 RAPAASAQGALLQWYNGTTYLAASY---NQVWGVNGASTVRNDLYGLGAVYDTGRLVLSA 270
R + + A + NG ++ + + ++ L + YD L S
Sbjct: 171 R-HNSESYHAGFNYKNGGFFVQYGGAYKRHHQVQENVNIEKYQIHRLVSGYDNDALYASV 229

Query: 271 SFNQYAPKLAGDGIARVYT--LGTIVPFGVNAVRASIVYRDTSGVRDAAGRPAKDSALGV 328
+ Q KL + + + + + V + Y + V
Sbjct: 230 AVQQQDAKLVEENYSHNSQTEVAATLAYRFGNVTPRVSYAHGFK-GSFDATNYNNDYDQV 288

Query: 329 MLGYDYLLSKRTGLYARTGFIRN 351
++G +Y SKRT G+++
Sbjct: 289 VVGAEYDFSKRTSALVSAGWLQE 311


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5518  TCRTETA       31     0.014    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 30.6 bits (69), Expect = 0.014
Identities = 75/359 (20%), Positives = 136/359 (37%), Gaps = 52/359 (14%)

Query: 82 IGAYADRVGRKPALVLTVALMALGTGIIGFAPTYAQIGIAAPLLIVIGRLLQGFSAGGEV 141
+GA +DR GR+P L++++A A+ I+ AP ++ IGR++ G + G
Sbjct: 63 LGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLW--------VLYIGRIVAGIT-GATG 113

Query: 142 GAATTLLMESGGARRSGELVSWQMASQGGAALAGALVALTLSRWLPSDALQGWGWRVPFV 201
A + + + A G +AG ++ + + P PF
Sbjct: 114 AVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSP---------HAPFF 164

Query: 202 LGLLIGPVGFYLRRHLDDTLPHPAAGAPRVSRRIPWRQVAAGTLLVIGGTSTMYTIVFFL 261
+ + F L LP G R P R+ A L M + +
Sbjct: 165 AAAALNGLNFLTGCFL---LPESHKG-----ERRPLRREALNPLASFRWARGMTVVAALM 216

Query: 262 PSFLTLTL--GMPASVALLSG--------------CTAGAVM--LVGSPFAGRFADRLRR 303
F + L +PA++ ++ G A ++ L + G A RL
Sbjct: 217 AVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGE 276

Query: 304 RKPMLRTVCAISTALVLPAFHAMRTWPSVVTVLAVVVVLIGLMTLSSPAGFVMILEALRP 363
R+ ++ + A T +L AF A R W + ++VL+ + PA M+ +
Sbjct: 277 RRALMLGMIADGTGYILLAF-ATRGW-----MAFPIMVLLASGGIGMPALQAMLSRQVDE 330

Query: 364 EVRATSLGMIYALGVTIFGGFAQLIVSALWRATGSFYAPAWYVLAGGSASLVGLALFRE 422
E + G + AL ++ L+ +A++ A+ + W +AG + L+ L R
Sbjct: 331 ERQGQLQGSLAAL-TSLTSIVGPLLFTAIYAASIT-TWNGWAWIAGAALYLLCLPALRR 387


72  Bcenmc03_5533  Bcenmc03_5543  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
Bcenmc03_5533  -3      12       1.736985   TetR family transcriptional regulator
Bcenmc03_5534  -3      11       2.316302   LysR family transcriptional regulator
Bcenmc03_5535  -2      11       1.826018   short-chain dehydrogenase/reductase SDR
Bcenmc03_5536  -2      10       1.341940   MerR family transcriptional regulator
Bcenmc03_5537  -1      10       1.256630   RND family efflux transporter MFP subunit
Bcenmc03_5538  -1      11       0.988923   hydrophobe/amphiphile efflux-1 (HAE1) family
Bcenmc03_5539  -1      11       1.826779   RND efflux system outer membrane lipoprotein
Bcenmc03_5540  -1      10       0.471118   MarR family transcriptional regulator
Bcenmc03_5541  2       10       1.278195   glutathione S-transferase domain-containing
Bcenmc03_5542  4       19       -0.558337  major facilitator transporter
Bcenmc03_5543  5       26       -1.491233  isochorismatase hydrolase
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5533  HTHTETR       69     2e-16    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 68.9 bits (168), Expect = 2e-16
Identities = 34/205 (16%), Positives = 66/205 (32%), Gaps = 12/205 (5%)

Query: 9 PSKRRTRGRPLADASVGPDVILRAARRTFAKRGYDATSVREVARELGIDAALIAHHFGTK 68
K + + IL A R F+++G +TS+ E+A+ G+ I HF K
Sbjct: 2 ARKTKQEAQETRQH------ILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDK 55

Query: 69 ETLWLAVVEQIVELAEPMFDALRALRASSLPH--RDRVRRALELCVDHEFAEPDI--GMF 124
L+ + E + +A R+ + LE V E + +F
Sbjct: 56 SDLFSEIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTV-TEERRRLLMEIIF 114

Query: 125 FSTAATEEGGRLDRLQERIVRPYHDAMFPLLADAVEAGAIRP-VDPNVLFFMIASAIGTT 183
E + + Q + +D + L +EA + + ++ I
Sbjct: 115 HKCEFVGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGL 174

Query: 184 VSYSHMMLEYTSLPTRPEAFREAVL 208
+ + L + +L
Sbjct: 175 MENWLFAPQSFDLKKEARDYVAILL 199


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5535  DHBDHDRGNASE  76     2e-18    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 75.9 bits (186), Expect = 2e-18
Identities = 74/258 (28%), Positives = 110/258 (42%), Gaps = 18/258 (6%)

Query: 8 RTAIVTGGSSGIGFAIASRLVQDGYRVAIVGRDAARLEAAVARLGGAAIGQVGDLSVRHD 67
+ A +TG + GIG A+A L G +A V + +LE V+ L A + D
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 68 AEAVVAAIVARW----PRIDVLVNNAGLTGRVGADTKAGEAEAVWDAVLHANLKSLFLTT 123
+ A+ I AR ID+LVN AG+ R G + E W+A N +F +
Sbjct: 69 SAAI-DEITARIEREMGPIDILVNVAGVL-RPGLIHSLSDEE--WEATFSVNSTGVFNAS 124

Query: 124 MAVLPHVAD-RAARIVNIGSIAARAGSLLPGGLAYAAAKAGVEGFTVALARELGPRGATV 182
+V ++ D R+ IV +GS A AYA++KA FT L EL
Sbjct: 125 RSVSKYMMDRRSGSIVTVGSNPAGVPRTSMA--AYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 183 NTVAPGYIAG-------TRFFGDSGVAPAVAAMIRVQTPVGRAGQPDDVADAVAWLAGPR 235
N V+PG G V + P+ + +P D+ADAV +L +
Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242

Query: 236 ASFVTGATIAVNGGWRVG 253
A +T + V+GG +G
Sbjct: 243 AGHITMHNLCVDGGATLG 260


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5537  RTXTOXIND     44     8e-07    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 43.7 bits (103), Expect = 8e-07
Identities = 18/130 (13%), Positives = 46/130 (35%), Gaps = 28/130 (21%)

Query: 67 ELRPRVSGYLQRVAYKEGDVVAQGALLFEIDPRPYRIALDRANAQQQRARAAA------- 119
E++P + ++ + KEG+ V +G +L ++ + + +AR
Sbjct: 98 EIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILS 157

Query: 120 --------------------SLANVQLKRVQTLIDAH-ATSQEELDNARATAEQARADLQ 158
+++ ++ R+ +LI +T Q + ++ RA+
Sbjct: 158 RSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERL 217

Query: 159 AADAAVADAK 168
A + +
Sbjct: 218 TVLARINRYE 227



Score = 43.7 bits (103), Expect = 1e-06
Identities = 19/114 (16%), Positives = 42/114 (36%), Gaps = 10/114 (8%)

Query: 109 NAQQQRARAAASLANVQLKRVQTLIDAHATSQEELDNARATAE--------QARADLQAA 160
+ + A L V +++ + +++EE + Q ++
Sbjct: 256 EQENKYVEAVNELR-VYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLL 314

Query: 161 DAAVADAKLNLGFTEVRAPIAGRV-GRAVATVGNLARADDTLLTTVVSQDPVYV 213
+A + + +RAP++ +V V T G + +TL+ V D + V
Sbjct: 315 TLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEV 368


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5538  ACRIFLAVINRP  1000   0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 1000 bits (2586), Expect = 0.0
Identities = 433/1045 (41%), Positives = 621/1045 (59%), Gaps = 20/1045 (1%)

Query: 4 SRFFIDRPIFAVVLSIVIFALGLISIPMLPAGEYPEVVPPSVVVRATYPGANPKEIAESV 63
+ FFI RPIFA VL+I++ G ++I LP +YP + PP+V V A YPGA+ + + ++V
Sbjct: 2 ANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTV 61

Query: 64 AEPLEEAINGVEGIMYMKSVAGSDGSLQVVVTFLQGVDPDTAAVRVQNRVSQALSRLPDE 123
+ +E+ +NG++ +MYM S + S GS+ + +TF G DPD A V+VQN++ A LP E
Sbjct: 62 TQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE 121

Query: 124 VRQYGVTTQKQSPTPLMYVSLYSPDNSRDSLYLRNYLTLHVKDELSRLTGIGDVGVYGSG 183
V+Q G++ +K S + LM S + + +Y+ +VKD LSRL G+GDV ++G
Sbjct: 122 VQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFG-A 180

Query: 184 DYAMRLWLDPNRLASRGLTASDVIAAVREQNVQVSAGQLGAEPSPKKNDFLVSINVRGRL 243
YAMR+WLD + L LT DVI ++ QN Q++AGQLG P+ SI + R
Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240

Query: 244 RTVQEFSDIVLRNGDDGQVVKLSDVARIELGAGDYTLRSYFNDRHSAVVGIFLSPGANAL 303
+ +EF + LR DG VV+L DVAR+ELG +Y + + N + +A +GI L+ GANAL
Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300

Query: 304 DVAKAVYAKLDELSKRFPPGVAYRPVWDPTVFVRESIRAVQHTLIEAVVLVVLVVILFLQ 363
D AKA+ AKL EL FP G+ +D T FV+ SI V TL EA++LV LV+ LFLQ
Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360

Query: 364 TWRASIIPLVAVPVSVVGTFAWLYLLGYSINTLTLFGLVLAIGIVVDDAIVVVENVERNI 423
RA++IP +AVPV ++GTFA L GYSINTLT+FG+VLAIG++VDDAIVVVENVER +
Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420

Query: 424 A-QGLSPRDAAHQAMREVSGPIVAIALVLCAVFVPMAFMSGVTGQFYKQFAVTIAISTVI 482
L P++A ++M ++ G +V IA+VL AVF+PMAF G TG Y+QF++TI + +
Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480

Query: 483 SAINSLTLSPALAAKLLRPHGAPKDALTRALDRAFGWLFHPFNRFFERSSDRYHGVVGRT 542
S + +L L+PAL A LL+P A FGW FN F+ S + Y VG+
Sbjct: 481 SVLVALILTPALCATLLKPVSAEHHENKGGF---FGW----FNTTFDHSVNHYTNSVGKI 533

Query: 543 LKRRGVVFAVYAALLAATALLFNAVPGGFIPVQDKLYLFAGAKLPEGASLARTSAVTEQM 602
L G +YA ++A +LF +P F+P +D+ +LP GA+ RT V +Q+
Sbjct: 534 LGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQV 593

Query: 603 TKIALGT--DGVEMVPAFAGLNALQGVNTPNITNSYVILKPFDQRHR---TAAQINADLN 657
T L VE V G + N ++V LKP+++R+ +A +
Sbjct: 594 TDYYLKNEKANVESVFTVNGFSF--SGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAK 651

Query: 658 ARFAAIDGGITYALMPPPIQGLGNGSGYSLYLEDRGGLGYGELQKALTAFQAAVAKTPGM 717
I G P I LG +G+ L D+ GLG+ L +A A+ P
Sbjct: 652 MELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPAS 711

Query: 718 SYPV-SSYQANIPQLEVKVDRLKAKAQGVALTDLFNTLQVYLGSMYVNDFNVFGRVYRVM 776
V + + Q +++VD+ KA+A GV+L+D+ T+ LG YVNDF GRV ++
Sbjct: 712 LVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLY 771

Query: 777 AQADAGHRQTAADIANLRTRNAKGEMVPIGSMVTVGPAYGPDPVVRYNGYPAADLIGDAD 836
QADA R D+ L R+A GEMVP + T YG + RYNG P+ ++ G+A
Sbjct: 772 VQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAA 831

Query: 837 PKAMSSSQAIAKLQQIAKDVLPPGITLEWTDLSYQQVTQSNAAIVVFPLAVMLVFLVLAS 896
P SS A+A ++ +A LP GI +WT +SYQ+ N A + ++ ++VFL LA+
Sbjct: 832 PGT-SSGDAMALMENLASK-LPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAA 889

Query: 897 LYESWTLPLAVILIVPVCMCAALFGVWLSGGDNNVFVQVGLVVLMGLACKNAILIVEFAR 956
LYESW++P++V+L+VP+ + L L N+V+ VGL+ +GL+ KNAILIVEFA+
Sbjct: 890 LYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAK 949

Query: 957 EL-EIQGKGVVEAALEACKLRLRPIVMTSVAFIAGSVPLLIGSGAGSEVRAATGVTVFAG 1015
+L E +GKGVVEA L A ++RLRPI+MTS+AFI G +PL I +GAGS + A G+ V G
Sbjct: 950 DLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGG 1009

Query: 1016 MLGVTLFGLFLTPVFYVAIRKLAGG 1040
M+ TL +F PVF+V IR+ G
Sbjct: 1010 MVSATLLAIFFVPVFFVVIRRCFKG 1034



Score = 85.3 bits (211), Expect = 7e-19
Identities = 63/329 (19%), Positives = 117/329 (35%), Gaps = 23/329 (6%)

Query: 730 QLEVKVDRLKAKAQGVALTDLFNTLQ---VYLGSMYVNDFNVFGRVYRVMAQADAGHRQT 786
+ + +D + D+ N L+ + + + + +
Sbjct: 183 AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKN 242

Query: 787 AADIANLRTR-NAKGEMVPIGSMVTVGPAYGPDP-VVRYNGYPAADLI----GDADPKAM 840
+ + R N+ G +V + + V + R NG PAA L A+
Sbjct: 243 PEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALDT 302

Query: 841 SSS--QAIAKLQQIAKDVLPPGITLEWTDLSYQQVTQSNAAI--VVFPL--AVMLVFLVL 894
+ + +A+LQ P G+ + + Y +I VV L A+MLVFLV+
Sbjct: 303 AKAIKAKLAELQPF----FPQGMKVLYP---YDTTPFVQLSIHEVVKTLFEAIMLVFLVM 355

Query: 895 ASLYESWTLPLAVILIVPVCMCAALFGVWLSGGDNNVFVQVGLVVLMGLACKNAILIVE- 953
++ L + VPV + + G N G+V+ +GL +AI++VE
Sbjct: 356 YLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVEN 415

Query: 954 FARELEIQGKGVVEAALEACKLRLRPIVMTSVAFIAGSVPLLIGSGAGSEVRAATGVTVF 1013
R + EA ++ +V ++ A +P+ G+ + +T+
Sbjct: 416 VERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIV 475

Query: 1014 AGMLGVTLFGLFLTPVFYVAIRKLAGGTP 1042
+ M L L LTP + K
Sbjct: 476 SAMALSVLVALILTPALCATLLKPVSAEH 504


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5542  TCRTETA       119    8e-32    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 119 bits (299), Expect = 8e-32
Identities = 93/360 (25%), Positives = 154/360 (42%), Gaps = 23/360 (6%)

Query: 42 LLAIALDAMGFGLVYPMMSAIFSDPHAGILPADAGAHARNFYLGLGYGVYPLCMFFGSSL 101
L +ALDA+G GL+ P++ + D D AH G+ +Y L F + +
Sbjct: 11 LSTVALDAVGIGLIMPVLPGLLRDLV---HSNDVTAH-----YGILLALYALMQFACAPV 62

Query: 102 MGELSDRYGRRRVLLLCVLGLAAGYAMMAAGAWHASVALLLAGRGLTGLMAGCQGIAQAA 161
+G LSDR+GRR VLL+ + G A YA+MA + +L GR + G+ +A A
Sbjct: 63 LGALSDRFGRRPVLLVSLAGAAVDYAIMATA---PFLWVLYIGRIVAGITGATGAVAGAY 119

Query: 162 ITDLSTPDTKAYNMSIMSLAFSAGVIVGPVLGGVTSDRTISPLFDYGTPFMLVAALSLIC 221
I D++ D +A + MS F G++ GPVLGG+ SP PF AAL+ +
Sbjct: 120 IADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGG--FSP----HAPFFAAAALNGLN 173

Query: 222 ACWTWVSYRDSAAPRGDT-RIDPLLPLRIIVEAARQRDVAFLSVVFFLMQVGYGLYLQTI 280
+S R + L PL A VA L VFF+MQ+ +
Sbjct: 174 FLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALW 233

Query: 281 MLLLQAKFGYTSARLGLFSGVIGLCFVFGLLCVVRLMLRVWRVIDIAKTGLLVAGLGQIL 340
++ + +F + + +G+ G+ + + G++ G G IL
Sbjct: 234 VIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYIL 293

Query: 341 SALFPHEPVLWALAMVVGCFDMV--AYTTMYTAFSDAVSDDRQGWALGVAGSVMAVAWVV 398
A + + + +++ + A M S V ++RQG G ++ ++ +V
Sbjct: 294 LAFATRGWMAFPIMVLLASGGIGMPALQAM---LSRQVDEERQGQLQGSLAALTSLTSIV 350


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5543  ISCHRISMTASE  60     5e-13    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 59.6 bits (144), Expect = 5e-13
Identities = 46/208 (22%), Positives = 71/208 (34%), Gaps = 25/208 (12%)

Query: 4 PTIRTLAGASAPTSIAAARTALLVIDFQNEYFSGRLP--IPDGPGALGNARRVIAFADRA 61
PT + R LL+ D Q YF N R++ +
Sbjct: 12 PTASDMPQNKVSWVPDPNRAVLLIHDMQ-NYFVDAFTAGASPVTELSANIRKLKNQCVQL 70

Query: 62 GIPVFHVQHVGT---ADSPIFAD----GSDGFRFH----SDLHPAPQHTVVQKTSVSVFP 110
GIPV + G+ D + D G + + ++L P V+ K S F
Sbjct: 71 GIPVVYTAQPGSQNPDDRALLTDFWGPGLNSGPYEEKIITELAPEDDDLVLTKWRYSAFK 130

Query: 111 TTDLDARLKAAGIDTLIVTGLMTHACVAGAARDAVPLGYAVIVVDDACATRDLDVADGGT 170
T+L ++ G D LI+TG+ H A +A V DA VAD
Sbjct: 131 RTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDA-------VADFS- 182

Query: 171 VPHRDLHRATLAALSDTFGDVLTTEQVL 198
+ H+ L + + T+ +L
Sbjct: 183 ---LEKHQMALEYAAGRCAFTVMTDSLL 207


73  Bcenmc03_5574  Bcenmc03_5580  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
Bcenmc03_5574  -2      12       2.006884   autoinducer synthesis protein
Bcenmc03_5575  0       14       2.439617   hypothetical protein
Bcenmc03_5576  0       13       2.480928   LuxR family transcriptional regulator
Bcenmc03_5577  1       13       2.880911   MgtC/SapB transporter
Bcenmc03_5578  1       10       3.173934   outer membrane efflux protein
Bcenmc03_5579  1       11       3.027964   fusaric acid resistance protein region
Bcenmc03_5580  0       15       2.610000   multidrug resistance protein MdtN
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5574  AUTOINDCRSYN  131    1e-40    Autoinducer synthesis protein signature.
		>AUTOINDCRSYN#Autoinducer synthesis protein signature.

Length = 216

Score = 131 bits (332), Expect = 1e-40
Identities = 30/155 (19%), Positives = 60/155 (38%), Gaps = 10/155 (6%)

Query: 11 LPHELAADLGRYRRRVFVEQLGWALPSANESFERDQFDRDDTVYVFARNADGDMCGCARL 70
L + +L R+ F ++L WA+ + E DQ+D ++T Y+F D + R
Sbjct: 12 LSETKSGELFTLRKETFKDRLNWAVQCTD-GMEFDQYDNNNTTYLFGIK-DNTVICSLRF 69

Query: 71 LPTTRPYLLKSLFADLVAEDMPLPQSAAVWELSRFAATDDEGGPGNAEWAVRP----MLA 126
+ T P ++ F ++ +P+ E SRF D+ + P +
Sbjct: 70 IETKYPNMITGTFFPYFK-EINIPEG-NYLESSRFFV--DKSRAKDILGNEYPISSMLFL 125

Query: 127 AVVECAAQLGARQLIGVTFASMERLFRRIGIHAHR 161
+++ + G + + M + +R G
Sbjct: 126 SMINYSKDKGYDGIYTIVSHPMLTILKRSGWGIRV 160


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5578  CHANLCOLICIN  32     0.009    Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 31.6 bits (71), Expect = 0.009
Identities = 27/101 (26%), Positives = 43/101 (42%), Gaps = 1/101 (0%)

Query: 402 SAVLMQARNDAESASARLTRTKEEAVRQVVAAQNAVQTSLASHDAAKALVDAAQTSYDAA 461
+ +A +AE + R K E RQ+ A+ + A + AKA V+ AQ AA
Sbjct: 143 AEAAEKAFQEAEQRRKEIEREKAETERQLKLAEAEEKRLAALSEEAKA-VEIAQKKLSAA 201

Query: 462 LTAYRNGVGSVTDATIAQSQLLAARNAEVDSYAGALSAAAA 502
+ G + S + AR+AE+ + AG + A
Sbjct: 202 QSEVVKMDGEIKTLNSRLSSSIHARDAEMKTLAGKRNELAQ 242


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5579  adhesinmafb   30     0.027    Neisseria meningitidis: adhesin MafB signature.
		>adhesinmafb#Neisseria meningitidis: adhesin MafB signature.

Length = 467

Score = 30.4 bits (68), Expect = 0.027
Identities = 27/108 (25%), Positives = 34/108 (31%), Gaps = 13/108 (12%)

Query: 281 AILERGGYPVDVTLALPPADALPPLARIAATDLQDAITHFAEPGATA--------PTVDA 332
IL Y +D A+ LP + A ++ F + A P
Sbjct: 250 DILYGTRYAIDKA-AMRNIAPLPAEGKFAVIGGLGSVAGFEKNTREAVDRWIQENPNAAE 308

Query: 333 TAEASANATPAAAAATPEAPAAAPAPAPHGGFFLPDARTN---PDHIR 377
T EA N AA A A AA P A G F + D R
Sbjct: 309 TVEAVFNVAAAAKVAK-LAKAAKPGKAAVSGDFADSYKKKLALSDSAR 355


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5580  RTXTOXIND     75     5e-17    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 74.9 bits (184), Expect = 5e-17
Identities = 58/423 (13%), Positives = 121/423 (28%), Gaps = 102/423 (24%)

Query: 8 RPSVKGRVIALAIVALGIAALAYAY--HRTTAYPSTDDASIDADVVHVASPVGGRIVQLA 65
S + R++A I+ + A + + + + + ++
Sbjct: 52 PVSRRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEII 111

Query: 66 VHENQRVAKGDLLYVIDPVPYRLTVAQAQADLELAR-------ASLDTRRR------SLI 112
V E + V KGD+L + + + Q+ L AR + L
Sbjct: 112 VKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLP 171

Query: 113 GERSNASVAAEQVKRATQNY-------------------------DLATRDVNRL----- 142
E +V+ E+V R T +NR
Sbjct: 172 DEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSR 231

Query: 143 ---------APLAAQGYVSAQQF----------------DQAKVRQRDASVSLAQAQEQQ 177
+ L + ++ ++++ Q ++ + A+ + Q
Sbjct: 232 VEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQL 291

Query: 178 RAS---AQTIGDDADAIATLHAREAALARAQHALDDTVVRAPHDGLVTGLSVL-PGETLA 233
+ + + LA+ + +V+RAP V L V G +
Sbjct: 292 VTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVT 351

Query: 234 PNQSIFTLIDASEWFAV-GNFRETSLNRIAVGDCATV-YSMIDRSR--PLTGKVVGIGAG 289
+++ ++ + V + + I VG A + +R L GKV I
Sbjct: 352 TAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLD 411

Query: 290 IADSARINLPRSLPIVQNSVNWVHVAQRFPVRVKLDEP------DGKLVRVGASAIVEVR 343
+ R+ L F V + ++E + G + E++
Sbjct: 412 AIEDQRLGLV------------------FNVIISIEENCLSTGNKNIPLSSGMAVTAEIK 453

Query: 344 HGS 346
G
Sbjct: 454 TGM 456


74  Bcenmc03_5661  Bcenmc03_5667  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
Bcenmc03_5661  0       14       -0.108426  branched-chain amino acid aminotransferase
Bcenmc03_5662  0       14       -0.751377  hypothetical protein
Bcenmc03_5663  -2      13       -1.414361  mannitol dehydrogenase domain-containing
Bcenmc03_5664  -1      13       -1.376042  HAD family hydrolase
Bcenmc03_5665  -1      12       -1.483323  sugar transporter
Bcenmc03_5666  -2      13       -1.962567  two component LuxR family transcriptional
Bcenmc03_5667  -1      14       -2.087427  histidine kinase
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits  Score  E-value  Comments
Bcenmc03_5661  SECA  30     0.020    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 29.8 bits (67), Expect = 0.020
Identities = 16/54 (29%), Positives = 25/54 (46%), Gaps = 1/54 (1%)

Query: 193 TRAAPGGTGDAKCGGNYAASLAAQAEAIREGCEQVVFLDAVERRWIEELGGMNV 246
T A GT D GG++ A +AA E E++ V + E GG+++
Sbjct: 504 TNMAGRGT-DIVLGGSWQAEVAALENPTAEQIEKIKADWQVRHDAVLEAGGLHI 556


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
Bcenmc03_5665  TCRTETA  43     2e-06    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 42.5 bits (100), Expect = 2e-06
Identities = 36/133 (27%), Positives = 58/133 (43%), Gaps = 8/133 (6%)

Query: 42 GIISGALPLIARDFGLDYRAQE----LVAAAILLGAVIGALAGTRMSAAFGRRKTITIVS 97
G+I LP + RD L+A L+ + G +S FGRR + +
Sbjct: 22 GLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLG-ALSDRFGRRPVLLVSL 80

Query: 98 AIYAAGVLAAALSPDAWSLAASRLVLGFAVGGSTQIVPT-YIAELAEPDKRGRLVTYFNV 156
A A A +P W L R+V G + G+T V YIA++ + D+R R + +
Sbjct: 81 AGAAVDYAIMATAPFLWVLYIGRIVAG--ITGATGAVAGAYIADITDGDERARHFGFMSA 138

Query: 157 SIGIGILLAALIG 169
G G++ ++G
Sbjct: 139 CFGFGMVAGPVLG 151


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
Bcenmc03_5666  HTHFIS  90     3e-22    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 89.9 bits (223), Expect = 3e-22
Identities = 34/125 (27%), Positives = 54/125 (43%), Gaps = 2/125 (1%)

Query: 2 ILIVDDTPENLAFLSDTLQAHGYVVIVALSGEDALKRLARVTPDVVLLDAMMPDMDGFET 61
IL+ DD L+ L GY V + + + +A D+V+ D +MPD + F+
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 62 CVRIKQDGRHEHLPVIFMTALTESEHVVRGFRVGGIDYVTKPVQPEELCARIGAHVRRSR 121
RIK+ LPV+ M+A ++ G DY+ KP EL IG + +
Sbjct: 66 LPRIKK--ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 122 AQLYA 126
+
Sbjct: 124 RRPSK 128


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
Bcenmc03_5667  HTHFIS  74     1e-15    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 74.1 bits (182), Expect = 1e-15
Identities = 33/120 (27%), Positives = 57/120 (47%), Gaps = 1/120 (0%)

Query: 931 RRTILVVDDLDDQRDIVVQLLTPLGFDVAEAASGTDALRWLAMHTADAIIMDISMPLMDG 990
TILV DD R ++ Q L+ G+DV ++ RW+A D ++ D+ MP +
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 991 YETSRLISENQLSNAPIVLLSANAFADDRDRASATGCKGYLVKPLQVNLLLDKLAQLLAL 1050
++ I + + P++++SA +AS G YL KP + L+ + + LA
Sbjct: 63 FDLLPRIKK-ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121


75  Bcenmc03_5684  Bcenmc03_5691  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
Bcenmc03_5684  -1      11       0.726931   porin
Bcenmc03_5685  1       11       0.651563   RND efflux system outer membrane lipoprotein
Bcenmc03_5686  1       11       -0.012483  RND family efflux transporter MFP subunit
Bcenmc03_5687  0       10       -0.019602  acriflavin resistance protein
Bcenmc03_5688  0       10       0.489949   two component transcriptional regulator
Bcenmc03_5689  0       11       0.096134   LysR family transcriptional regulator
Bcenmc03_5690  -1      11       1.090066   hypothetical protein
Bcenmc03_5691  -1      12       1.674050   short-chain dehydrogenase/reductase SDR
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5684  ECOLNEIPORIN  101    2e-26    E.coli/Neisseria porin superfamily signature.
		>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature.

Length = 331

Score = 101 bits (253), Expect = 2e-26
Identities = 76/395 (19%), Positives = 138/395 (34%), Gaps = 68/395 (17%)

Query: 1 MKKYLAIPAAVACLLASAAHAQSSVTLYGTIDAGLDYISNQKSAAGAGPVYGVQSGNVST 60
MKK L I +A L +A + VTLYGTI AG++ + +G V
Sbjct: 1 MKKSL-IALTLAALPVAAM---ADVTLYGTIKAGVETSRSVAHNGAQAASVETGTGIVDL 56

Query: 61 -SRWGLRGNEDLGGGLAAVFTLENGFNVANGKLGNGGDEFGRQAWVGLASRQWGTVTLGR 119
S+ G +G EDLG GL A++ +E ++A G RQ+++GL +G + +GR
Sbjct: 57 GSKIGFKGQEDLGNGLKAIWQVEQKASIA----GTDSGWGNRQSFIGLKG-GFGKLRVGR 111

Query: 120 QYDFLVDF--VAPLSATGSGFGGNLADHPYDNDNLANDTRMNNAVKFRSANYGGFTFGGA 177
L D + P + G N P + R+ + ++ S + G +
Sbjct: 112 LNSVLKDTGDINPWDSKSDYLGVNKIAEP--------EARLISV-RYDSPEFAGLSGSVQ 162

Query: 178 YGFSNQGGGFSNDNAYSVGAQYVNGPVDLAVAYLQSNQPGGVDAPQNTGGSLSSADGDAM 237
Y ++ G N +Y G Y NG +
Sbjct: 163 YALND-NAGRHNSESYHAGFNYKNGGFFVQYGGAYKR----------------HHQVQEN 205

Query: 238 LTGGRWRTFGAGAHYAFDHAAI-GFVYTRTILNDPRELSQGGAYGRVNGQLLTFSNYELN 296
+ +++ + Y D+ A+ V + + + ++ T + N
Sbjct: 206 VNIEKYQIHRLVSGY--DNDALYASVAVQQQDAKL--VEENYSHNSQTEVAATLAYRFGN 261

Query: 297 GRYFLTPAFSLGGAYTFTQGRFDDADHGIAPKWNQFMLQADYALSRRTDLYLEGVYQRVT 356
++ A G++ + + ++Q ++ A+Y S+RT + + +
Sbjct: 262 VTPRVSYAHGFKGSF----DATNYNND-----YDQVVVGAEYDFSKRTSALVSAGWLQ-- 310

Query: 357 GADGVAVLGHAGIFNLAASGNDRQAVVAAGIRHKF 391
G G+RHKF
Sbjct: 311 EGKG--------------ESKFVSTAGGVGLRHKF 331


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits       Score  E-value  Comments
Bcenmc03_5686  RTXTOXIND  36     3e-04    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 35.6 bits (82), Expect = 3e-04
Identities = 22/148 (14%), Positives = 45/148 (30%), Gaps = 6/148 (4%)

Query: 117 ELDQQLQQARADLQSSLANEKLAASTAARWTRMLAQDSVSQQETDEKSSDLAAKQAIVAA 176
E + + +A +L+ + + S V+Q +E L +
Sbjct: 256 EQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQL--VTQLFKNEILDKLRQTTDNIGL 313

Query: 177 NEANVRRLDALEAFKRIVAPFDGVVTARKT-DIGQLISAGGGAGPELFAVSDVHRMRVYV 235
+ + + + I AP V K G +++ + V + + V
Sbjct: 314 LTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTA---ETLMVIVPEDDTLEVTA 370

Query: 236 SVPQNEAAAIRPGMTATLTVPEHPGETF 263
V + I G A + V P +
Sbjct: 371 LVQNKDIGFINVGQNAIIKVEAFPYTRY 398



Score = 34.4 bits (79), Expect = 7e-04
Identities = 18/103 (17%), Positives = 32/103 (31%), Gaps = 11/103 (10%)

Query: 85 IHAQVSGYLHAWYTDIGAHVKSGQLLGLIDTPELDQQLQQARADLQSSLANEKLAASTAA 144
I + + G V+ G +L + A AD + ++ A
Sbjct: 99 IKPIENSIVKEIIVKEGESVRKGDVLLKLTAL-------GAEADTLKTQSSLLQARLEQT 151

Query: 145 RWTRMLAQDSVSQQETDEKSSDLAAKQAIVAANEANVRRLDAL 187
R+ + S S + L + +E V RL +L
Sbjct: 152 RYQIL----SRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSL 190


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5687  ACRIFLAVINRP  641    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 641 bits (1655), Expect = 0.0
Identities = 247/1079 (22%), Positives = 435/1079 (40%), Gaps = 68/1079 (6%)

Query: 4 IVRLALTRPYTFVVLALLILIAGPLAAVRTPIDIFPDIRIPVISVVWNYAGLQPDDMSGR 63
+ + RP VLA+++++AG LA ++ P+ +P I P +SV NY G +
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 64 VITYYERTLGTTVNDVQHIESQSFR-GYGIVKIFFQPTVDIRTATAQVTSISQTVLKQMP 122
V E+ + ++++ ++ S S G + + FQ D A QV + Q +P
Sbjct: 61 VTQVIEQNM-NGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLP 119

Query: 123 PGTTPPQILNYNASTVPVLQIALTSNTLDEQK--LGDYAVNFIRPQLLSVPGVAIPTPYG 180
I +S+ ++ S+ + + DY + ++ L + GV +G
Sbjct: 120 QEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFG 179

Query: 181 GKTREVQIDLDPQALQSKGLSAQDVAHALAQQNQIIPAGT------QKIGRFEYNIKLNN 234
+ ++I LD L L+ DV + L QN I AG + +I
Sbjct: 180 AQ-YAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQT 238

Query: 235 SPLSLDALNDLPIKSVG-GTTIYIRDVAHVRDGYPPQGNIVRVDGHRAVLMSILKNGSAS 293
+ + + ++ G+ + ++DVA V G I R++G A + I A+
Sbjct: 239 RFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGAN 298

Query: 294 TLDIIAGVKAKLPLVEQTLPPGLKLVTMGDQSTFVNGAVSGVAREGIIAAALTSLMILLF 353
LD +KAKL ++ P G+K++ D + FV ++ V + A L L++ LF
Sbjct: 299 ALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLF 358

Query: 354 LGSWRSTLIIAASIPLAVLSAIALLAATGETLNVMTLGGLALAVGILVDDATVTIENV-N 412
L + R+TLI ++P+ +L A+LAA G ++N +T+ G+ LA+G+LVDDA V +ENV
Sbjct: 359 LQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVER 418

Query: 413 WHLEQGKDTRTAIVDGAKQIVMPALVSLLCICIVFVPMLMLDGISRFLFVPMAKAVIFSM 472
+E + A QI + + + VF+PM G + ++ + ++ +M
Sbjct: 419 VMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAM 478

Query: 473 VSSFVLSRTFVPMLAQYLLKPHASAGHASGELAAVMDPHAGHAGAHDVPPSRNPLVRFQR 532
S +++ P L LLKP ++ H F
Sbjct: 479 ALSVLVALILTPALCATLLKPVSAEHH-------------------------ENKGGFFG 513

Query: 533 AFERRFESVRASYRILLGLALTRRKPFVVAFLCIVAASFLLAPSLGRNFFPTIDSGEIAL 592
F F+ Y +G L +++ + IVA +L L +F P D G
Sbjct: 514 WFNTTFDHSVNHYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLT 573

Query: 593 HVRAPVGTRVEETAAELDRIENTIRGVIPPAQLREVIDNIGLPNSGINLTYNNSGTLGPQ 652
++ P G E T LD++ + L+ N+ + +++
Sbjct: 574 MIQLPAGATQERTQKVLDQVTDYY--------LKNEKANVESVFTVNGFSFSGQ---AQN 622

Query: 653 DGDILISL-----SRDHAPTADYV-HTLRERLPRAYPGTTFSFLPADIVSQILNFGAPAP 706
G +SL +A+ V H + L + G F IV L
Sbjct: 623 AGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDGFVIPFNMPAIVE--LGTATGFD 680

Query: 707 VDLQVAGPNQQANLAYAHELYRKLR--LIAGVADPRIQQASTYPQFTVTVDRTRADQLGI 764
+L L A + A + R QF + VD+ +A LG+
Sbjct: 681 FELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGV 740

Query: 765 TEQDVTNSVVATLAGTSQVDPTYWLNPRNGVSYPIVAQTPQYRMTTLSALQNLPVTGANG 824
+ D+ ++ L GT D G + Q + L V ANG
Sbjct: 741 SLSDINQTISTALGGTYVNDFI-----DRGRVKKLYVQADAKFRMLPEDVDKLYVRSANG 795

Query: 825 QSQLLGGLATITRGVGNAVVSHYNIEPLFDIYATTQGRDLGAVATDIDDVVKATAKDLPK 884
+ T G+ + YN P +I G + D +++ A LP
Sbjct: 796 EMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAP---GTSSGDAMALMENLASKLPA 852

Query: 885 GSTVTLRGQVQTMNGAFAGLLLGLVGAIVLIYLLIVVNFHSWADAFVIVSALPAALAGIV 944
G G + + + V+++L + + SW+ ++ +P + G++
Sbjct: 853 GIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVL 912

Query: 945 WMLFTTHTPLSVPALTGAILCMGVATANSILVVSFARERLAETGNALASA-LEAGFTRFR 1003
+ V + G + +G++ N+IL+V FA++ + + G + A L A R R
Sbjct: 913 LAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLR 972

Query: 1004 PVLMTALAMIIGMAPMALGLGDGGEQNAPLGRAVIGGLACATIATLFFVPVVFSLVHRR 1062
P+LMT+LA I+G+ P+A+ G G +G V+GG+ AT+ +FFVPV F ++ R
Sbjct: 973 PILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVIRRC 1031



Score = 92.6 bits (230), Expect = 5e-21
Identities = 93/535 (17%), Positives = 189/535 (35%), Gaps = 50/535 (9%)

Query: 550 GLALTRRKPFVVAFLCIVAASFLLAPSLGRNFFPTIDSGEIALHVRAPVGTRVEETAAEL 609
+ R V + ++ A L L +PTI +++ P G + +
Sbjct: 3 NFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYP-GADAQTVQDTV 61

Query: 610 DR-IENTIRGVIPPAQLREVIDNIGLPNSGINLTYNNSGTLGPQDGDILISLSRDHAPTA 668
+ IE + G+ + D+ G I LT+ D I+ +
Sbjct: 62 TQVIEQNMNGIDNLMYMSSTSDSAGS--VTITLTFQ-------SGTDPDIAQVQ------ 106

Query: 669 DYVHTLRERLPRAYPGTTFSFLPADIVSQILNFGAPAPVDLQV------AGPNQQANLAY 722
++ +L A P LP ++ Q ++ + L V Q +++
Sbjct: 107 -----VQNKLQLATP-----LLPQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISD 156

Query: 723 --AHELYRKLRLIAGVADPRIQQASTYPQFTVTVDRTRADQLGITEQDVTNSVVATLA-- 778
A + L + GV D +Q + +D ++ +T DV N +
Sbjct: 157 YVASNVKDTLSRLNGVGD--VQLFGAQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQI 214

Query: 779 GTSQVDPTYWLNPRNGVSYPIVAQTPQYRMTTLSALQNLPV-TGANGQSQLLGGLATITR 837
Q+ T L P ++ I+AQT R + + ++G L +A +
Sbjct: 215 AAGQLGGTPAL-PGQQLNASIIAQT---RFKNPEEFGKVTLRVNSDGSVVRLKDVARVEL 270

Query: 838 GVGN-AVVSHYNIEP--LFDIYATTQGRDLGAVATDIDDVVKATAKDLPKG-STVTLRGQ 893
G N V++ N +P I T L A I + P+G +
Sbjct: 271 GGENYNVIARINGKPAAGLGIKLATGANAL-DTAKAIKAKLAELQPFFPQGMKVLYPYDT 329

Query: 894 VQTMNGAFAGLLLGLVGAIVLIYLLIVVNFHSWADAFVIVSALPAALAGIVWMLFTTHTP 953
+ + ++ L AI+L++L++ + + + A+P L G +L
Sbjct: 330 TPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYS 389

Query: 954 LSVPALTGAILCMGVATANSILVVSFARERLAETGNALASALEAGFTRFRPVLMTALAMI 1013
++ + G +L +G+ ++I+VV + E A E ++ + L+ ++
Sbjct: 390 INTLTMFGMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVL 449

Query: 1014 IGM-APMALGLGDGGEQNAPLGRAVIGGLACATIATLFFVPVVFSLVHRRDALKH 1067
+ PMA G G ++ +A + + L P + + + + + +H
Sbjct: 450 SAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEH 504


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
Bcenmc03_5688  HTHFIS  84     4e-21    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 84.5 bits (209), Expect = 4e-21
Identities = 38/139 (27%), Positives = 59/139 (42%), Gaps = 2/139 (1%)

Query: 2 AHILTIEDDPLIADHIAHTLRAAGHQIDVARTGRDGMARAMSANYDVVTLDRMLPDLDGL 61
A IL +DD I + L AG+ + + + + D+V D ++PD +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 TILATMRGVGLDTPVLVMSAMSGVDQRIEGLRAGGDDYLVKPFSLEEMCARIDVLIRRRP 121
+L ++ D PVLVMSA + I+ G DYL KPF L E+ I +
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 122 RGARVETVLRAGELALDLV 140
R R + + + LV
Sbjct: 124 R--RPSKLEDDSQDGMPLV 140


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5691  DHBDHDRGNASE  98     1e-26    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 97.8 bits (243), Expect = 1e-26
Identities = 62/196 (31%), Positives = 95/196 (48%), Gaps = 3/196 (1%)

Query: 3 VSKKFAAVTGAGSGIGRAAAIALARAGFTVALLGRTEASLSETQNAIRAAGGDAQVFPVD 62
+ K A +TGA GIG A A LA G +A + L + ++++A A+ FP D
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 63 VTDEASVDHAFAQIAQRFGRLDVLFNNAGRNAPVVALDEYELDVWNSVVATNLTGVFLCA 122
V D A++D A+I + G +D+L N AG P + + W + + N TGVF +
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRP-GLIHSLSDEEWEATFSVNSTGVFNAS 124

Query: 123 RAAWRLMKTQTPQGGRIINNGSISAHAPRPDTIAYTATKHAVTGITKSLALDGRRYNIAC 182
R+ + M + + G I+ GS A PR AY ++K A TK L L+ YNI C
Sbjct: 125 RSVSKYMMDR--RSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 183 GQIDIGNAATALTERM 198
+ G+ T + +
Sbjct: 183 NIVSPGSTETDMQWSL 198


76  Bcenmc03_5794  Bcenmc03_5805  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias   Product
Bcenmc03_5794  -2      15       2.613069  response regulator receiver/ANTAR
Bcenmc03_5795  -2      14       2.594263  uroporphyrin-III C-methyltransferase
Bcenmc03_5796  -1      15       2.349234  major facilitator transporter
Bcenmc03_5797  0       14       2.410866  nitrite reductase (NAD(P)H), large subunit
Bcenmc03_5798  0       12       2.560268  nitrite reductase (NAD(P)H), small subunit
Bcenmc03_5799  -2      10       2.415059  molybdopterin oxidoreductase
Bcenmc03_5800  -2      8        2.124595  short chain dehydrogenase
Bcenmc03_5801  -3      7        2.017999  hypothetical protein
Bcenmc03_5802  -3      9        1.741394  LysR family transcriptional regulator
Bcenmc03_5803  -3      9        1.631201  hypothetical protein
Bcenmc03_5804  -3      11       1.897777  hypothetical protein
Bcenmc03_5805  -2      14       1.847343  virulence factor family protein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
Bcenmc03_5794  HTHFIS  46     4e-08    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 45.6 bits (108), Expect = 4e-08
Identities = 33/156 (21%), Positives = 54/156 (34%), Gaps = 7/156 (4%)

Query: 8 TRLRVLLVTDTDKPIGELGDALARLGYEMLNDVATPARLPAAVEEQRPDVVIIDTDSPSR 67
T +L+ D L AL+R GY++ + A L + D+V+ D P
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 68 DTLEQLAVMHATAPR-PVLMFSHDADQELIRAAVGAGVSAYLVEGLSAERLAPILEVALA 126
+ + L + P PVL+ S A G YL + L I+ ALA
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 127 RFSHDDALRRRLADVEREL-----AERKLIDRAKRV 157
+ + L A +++ R+
Sbjct: 121 EPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARL 156


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
Bcenmc03_5796  TCRTETA  38     5e-05    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 38.3 bits (89), Expect = 5e-05
Identities = 69/312 (22%), Positives = 116/312 (37%), Gaps = 28/312 (8%)

Query: 40 PLMPLIAREFHLTAAQVANINI--AAVAAT-IAVRLLVGPLCDRFGPRRVYAGLLLLGAI 96
P++P + R+ + A+ I A A A ++G L DRFG R V L A+
Sbjct: 26 PVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAV 85

Query: 97 PVFAVSFTHDYLWFLICRLGIGAIGA-GFVITQYHTSVMFAPNVVGTANATTAGWGNAGA 155
++ I R+ G GA G V Y A G A G+ +A
Sbjct: 86 DYAIMATAPFLWVLYIGRIVAGITGATGAVAGAY-----IADITDGDERARHFGFMSACF 140

Query: 156 GATQALMPLLVAAGLMLGFGEDSSWRIALVVPGVAMLAMAWAYWRFTQDCPQGDFVALRK 215
G P+L GLM GF + + A + G+ L + + +G+ LR+
Sbjct: 141 GFGMVAGPVL--GGLMGGFSPHAPFFAAAALNGLNFLTGCF----LLPESHKGERRPLRR 194

Query: 216 QGVTVDSGKKGGWASFFRACGNYRVWMLFVTYGACFGVEVFIHNIAALYYVDHFKLSLKD 275
+ + + + WA ++ V + +V + ++ D F
Sbjct: 195 EALNPLASFR--WARGMTVVA----ALMAVFFIMQLVGQVPA-ALWVIFGEDRFHWDATT 247

Query: 276 AGFAVGMFGLLALFARALGGWLSDKIAARRSLDVRATLLCALIIGEGLGLIWFSHAQGIG 335
G ++ FG+L A+A+ ++ +AAR L +I +G G I + A
Sbjct: 248 IGISLAAFGILHSLAQAM---ITGPVAARLG---ERRALMLGMIADGTGYILLAFATRGW 301

Query: 336 MALVAMLAFGLF 347
MA M+
Sbjct: 302 MAFPIMVLLASG 313


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5798  SECBCHAPRONE  29     0.004    Bacterial protein-transport SecB chaperone protein ...
		>SECBCHAPRONE#Bacterial protein-transport SecB chaperone protein signature.

Length = 170

Score = 28.7 bits (64), Expect = 0.004
Identities = 10/56 (17%), Positives = 18/56 (32%), Gaps = 4/56 (7%)

Query: 10 TRVCPLDD----IVPNTGVCALVNGEQVAVFHVAHADGGVFAIDNVDPVSQAAVMS 61
T + D + N V + F GVF I ++ + A ++
Sbjct: 55 TEAKQVGDDLYEVCLNISVETTMESSGDVAFICEVKQAGVFTISGLEEMQMAHCLT 110


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5800  DHBDHDRGNASE  96     8e-26    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 95.9 bits (238), Expect = 8e-26
Identities = 52/184 (28%), Positives = 77/184 (41%), Gaps = 6/184 (3%)

Query: 4 KRILITGAGTGFGREVALRLAERGHDVTAGVRTAVEIDALTDAAAQRGTALRAVKLDVTS 63
K ITGA G G VA LA +G + A +++ + + A DV
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 64 A------YDRARAAELDVDVLVNNAGVGEAGALVDLPVEIVRELFDVNVFGPLELTQQIA 117
+ R +D+LVN AGV G + L E F VN G ++ ++
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 118 RGMLARKHGKIVFVSSIAGLITGPFTGAYCASKHAIESVAEAMHAELAPHGIRVAVVNPG 177
+ M+ R+ G IV V S + AY +SK A + + ELA + IR +V+PG
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188

Query: 178 PYRT 181
T
Sbjct: 189 STET 192


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
Bcenmc03_5804  PF06580  33     0.006    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 32.9 bits (75), Expect = 0.006
Identities = 26/154 (16%), Positives = 46/154 (29%), Gaps = 16/154 (10%)

Query: 438 WWMTFA----LTLASLALSLAKG---LAFVEAGVLGTLLVLLLVSRRRFNRHSSLLAERF 490
W+ TL + G L + + +L+ L+L R +R
Sbjct: 13 WYCQGIGWGVYTLTGFGFASLYGSPKLHSMIFNIAISLMGLVLTHAYRS------FIKRQ 66

Query: 491 TVSWFVSVAMVLMLAVWVLFFAFRDVPYTRDLWSHFSFDARAPRALRATLAAGVF---VA 547
++L + + +W +F P A LA + V
Sbjct: 67 GWLKLNMGQIILRVLPACVVIGMVWFVANTSIWRLLAFINTKPVAFTLPLALSIIFNVVV 126

Query: 548 LFALWQLLRPAPGRFVKPAQQDLIDAEQIIRAQE 581
+ +W LL F Q ++ + AQE
Sbjct: 127 VTFMWSLLYFGWHFFKNYKQAEIDQWKMASMAQE 160


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
Bcenmc03_5805  PF06057  307    e-106    Type IV secretory pathway VirJ component
		>PF06057#Type IV secretory pathway VirJ component

Length = 243

Score = 307 bits (789), Expect = e-106
Identities = 73/211 (34%), Positives = 104/211 (49%), Gaps = 3/211 (1%)

Query: 215 ELDVSDLPLVELPAKGGSDRLAIVISGDGGWRDLDKTIAEALQRDGVSVVGIDSLRYFWS 274
L V V + L I +SGDGGW LDK + LQ+ G VVG SL+Y+W
Sbjct: 33 LLPVEPSTQVNAASSHTKPPLVIFLSGDGGWATLDKAVGGILQQQGWPVVGWSSLKYYWK 92

Query: 275 EKPPAQVSRDLARVMRTYMARWHASRVALVGYSFGADVMPFAYNRLPADLRDKVAVMSLL 334
+K P V++D ++ Y A + +V L+GYSFGA+V+PF N +PA R V LL
Sbjct: 93 QKDPKDVTQDTLAIIDKYQAEFGTQKVILIGYSFGAEVIPFVLNEMPARYRKNVLGAVLL 152

Query: 335 GFAPSADFQIRVTGWLGMPASDKALKVAPEIAKVPPTLVQCFYGAEETDT--MCPALANT 392
+ S+DF+I V+ + PE+ K + C YG E+ +CP +
Sbjct: 153 SPSQSSDFEIHVSEMVTSDNQSARYLTLPEVNKQTTVPMLCLYGKEDDAPLHLCPEVKQP 212

Query: 393 GADVIKTQGDHHFGRDYIALEKKILGAFGKP 423
V++ G H F DY + K I G + KP
Sbjct: 213 NVTVMELSGGHSFDDDYDKVVKLIKG-WLKP 242



Score = 39.8 bits (93), Expect = 1e-05
Identities = 17/76 (22%), Positives = 25/76 (32%)

Query: 17 MMLAAAVCAAQPADVKAETVSGGRYGPVTVTKPSGPLRGFVVLFSREAGWHAADQQAADA 76
+ A A + AD T+ S V+ S + GW D+
Sbjct: 14 LCSTANAFADEFADNLGLTLLPVEPSTQVNAASSHTKPPLVIFLSGDGGWATLDKAVGGI 73

Query: 77 LAKAGAMTVGVDSGRY 92
L + G VG S +Y
Sbjct: 74 LQQQGWPVVGWSSLKY 89


77  Bcenmc03_5913  Bcenmc03_5941  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
Bcenmc03_5913  1       10       0.885611   type II secretion system protein
Bcenmc03_5914  2       8        1.561223   PilS domain-containing protein
Bcenmc03_5915  2       8        1.129040   type II secretion system protein E
Bcenmc03_5916  2       7        1.907986   hypothetical protein
Bcenmc03_5917  2       7        1.366218   type IV prepilin
Bcenmc03_5918  2       7        1.025029   YscC/HrcC family type III secretion outer
Bcenmc03_5919  4       9        1.997255   histidine kinase
Bcenmc03_5920  4       11       2.165670   two component transcriptional regulator
Bcenmc03_5921  4       11       4.149193   type II/III secretion system family protein
Bcenmc03_5922  4       15       1.798128   hypothetical protein
Bcenmc03_5923  3       14       1.471238   hypothetical protein
Bcenmc03_5924  2       12       2.707452   hypothetical protein
Bcenmc03_5925  2       12       2.927225   putative type III secretion apparatus protein
Bcenmc03_5926  0       10       2.026038   hypothetical protein
Bcenmc03_5927  0       10       0.758484   HrpO family type III secretion protein
Bcenmc03_5928  -1      10       0.777595   type III secretion system protein
Bcenmc03_5929  -1      10       1.135315   type III secretion system apparatus protein
Bcenmc03_5930  -1      10       -0.762219  hypothetical protein
Bcenmc03_5931  -1      11       -1.025039  HrcV family type III secretion protein
Bcenmc03_5932  0       14       -1.074142  type III secretion system protein HrcU
Bcenmc03_5933  -2      15       -0.430175  hypothetical protein
Bcenmc03_5934  -2      12       0.222549   HrpB2-like protein
Bcenmc03_5935  -2      11       0.833647   hypothetical protein
Bcenmc03_5936  0       9        1.476586   YscJ/HrcJ family type III secretion apparatus
Bcenmc03_5937  -1      11       1.358435   type III secretion protein HrpB4
Bcenmc03_5938  -1      12       0.248490   HrpE/YscL family type III secretion apparatus
Bcenmc03_5939  -1      14       -0.612503  type III secretion system ATPase
Bcenmc03_5940  -1      20       -1.975946  putative type III secretory pathway protein
Bcenmc03_5941  -1      22       -4.073145  type III secretion protein SpaR/YscT/HrcT
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5913  BCTERIALGSPF  67     2e-14    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 66.8 bits (163), Expect = 2e-14
Identities = 53/267 (19%), Positives = 116/267 (43%), Gaps = 16/267 (5%)

Query: 24 RKIEKMLSNGLPLLKVLEELELRASHDGRKPTLPEAILFGEWRRTVQNGGSLAEGMDGWV 83
R++ +++ +PL + L+ + + S L A+ R V G SLA+ M +
Sbjct: 75 RQLATLVAASMPLEEALDAVA-KQSEKPHLSQLMAAV-----RSKVMEGHSLADAMKCFP 128

Query: 84 PQAEQM---IVLAGEQSGRLEGALRSVTGIVTSGRRIRNAIAQGLAYPVALLAMMLAYLY 140
E++ +V AGE SG L+ L + +++R+ I Q + YP L + +A +
Sbjct: 129 GSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLTVVAIAVVS 188

Query: 141 LFGARLVPQFAA--IEDPERWHGSARLLYDLSVFVQAWLPECLIVVVVLVALLVWSMPRW 198
+ + +VP+ I + S R+L +S V+ + P L+ ++ + +
Sbjct: 189 ILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMAFRVMLRQE 248

Query: 199 SGRLR-SRFDNYAPW--SLYRLMVGSSFLTAFASMQAAGFTVEKSLAQLAD-HAKPWLRE 254
R+ R + P + R + + + + + A+ + +++ D + + R
Sbjct: 249 KRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDVMSNDYARH 308

Query: 255 RIDDTLFGVKSGLNVGEAMRMSGHRFP 281
R+ V+ G+++ +A+ + FP
Sbjct: 309 RLSLATDAVREGVSLHKALEQTA-LFP 334


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5914  PilS_PF08805  132    2e-41    PilS N terminal
		>PilS_PF08805#PilS N terminal

Length = 185

Score = 132 bits (333), Expect = 2e-41
Identities = 41/165 (24%), Positives = 84/165 (50%), Gaps = 8/165 (4%)

Query: 38 RGASLLEAISYLGIAAIVVIGAIALLAGAFSSANTNSITEQVNAIQSGVKKLYMGQSASY 97
+GA+L+E + +G+ ++ A L + S+ +++ V + + +K L +
Sbjct: 26 KGATLMEVLLVVGVIVVLAASAYKLYSMVQSNIQSSNEQNNVLTVIANMKSLKFQGRYTD 85

Query: 98 ANLSNSVLASAGVFPSTLAPASGSGAITNMWNGTITVAAATNNSNQFTITYTNVPRSVCV 157
+N + L + G+ PS + + + N W G++T+ +++ + F + NVP+ C+
Sbjct: 86 SNYIKT-LYAQGLLPSDMIADTTGASAKNPWGGSVTITTSSDKYS-FNVVEANVPQKNCM 143

Query: 158 NSVTAGGSWISIT-VNETALTLPATPDSAATACASGDTNTVAWTS 201
V A S +I+ +N T + SAAT CAS D+NT+ +++
Sbjct: 144 AMVNALRSSSAISKINNT----STSTVSAATVCAS-DSNTLTFST 183


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5917  BCTERIALGSPH  45     2e-07    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 44.6 bits (105), Expect = 2e-07
Identities = 17/59 (28%), Positives = 28/59 (47%)

Query: 7 RQRGFTIVEMLAALAIASLMIVGVTAMIDTSLADAKGQQAAAWQAQMTQAAAQLITQNQ 65
RQRGFT++EM+ L + + V S D+ Q A ++AQ+ + + Q
Sbjct: 2 RQRGFTLLEMMLILLLMGVSAGMVLLAFPASRDDSAAQTLARFEAQLRFVQQRGLQTGQ 60


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5918  TYPE3OMGPROT  264    2e-82    Type III secretion system outer membrane G protein ...
		>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature.

Length = 607

Score = 264 bits (675), Expect = 2e-82
Identities = 93/312 (29%), Positives = 153/312 (49%), Gaps = 27/312 (8%)

Query: 277 SQQLSSAVASSGPTAGGQASGGGDEEALPVIEADQRTNSVLIRDTPDRMYQYPALIQRLD 336
+ Q + P A +AS A +EAD N++++RD+P+RM Y LI LD
Sbjct: 223 TIQQVTVDNQRIPQAATRAS------AQARVEADPSLNAIIVRDSPERMPMYQRLIHALD 276

Query: 337 VKPRLIEIEAHIFEVDTSSIRQLGVNWTAHNSHIDLQTGNGLGAQNTYGGTLTQNFGNTT 396
IE+ I +++ + +LGV+W ++TGN G + N
Sbjct: 277 KPSARIEVALSIVDINADQLTELGVDWRV-----GIRTGNNHQVVIKTTGDQSNIASN-- 329

Query: 397 LAGNVTAAAMPVGGVLSAVIGNAGRYLMANVSALEEQNLAKIDASPKVTTLDNIEADMAN 456
G + S V YL+A V+ LE + A++ + P + T +N +A + +
Sbjct: 330 ------------GALGSLVDARGLDYLLARVNLLENEGSAQVVSRPTLLTQENAQAVIDH 377

Query: 457 QTQFFVRVSGYTSADLYSVSTGVSLRVLPMVVDEGGRTQIKLDVAIQDGQL--TSRTVDN 514
++V+V+G A+L ++ G LR+ P V+ +G +++I L++ I+DG S ++
Sbjct: 378 SETYYVKVTGKEVAELKGITYGTMLRMTPRVLTQGDKSEISLNLHIEDGNQKPNSSGIEG 437

Query: 515 IPVISSTNINTSAFVNEGEALLIAGYKNDGRIDTTTGVPVLSKIPVIGNLFKYTDRENTR 574
IP IS T ++T A V G++L+I G D + VP+L IP IG LF+ R
Sbjct: 438 IPTISRTVVDTVARVGHGQSLIIGGIYRDELSVALSKVPLLGDIPYIGALFRRKSELTRR 497

Query: 575 MERLFLLTPRII 586
RLF++ PRII
Sbjct: 498 TVRLFIIEPRII 509



Score = 177 bits (449), Expect = 3e-50
Identities = 70/244 (28%), Positives = 110/244 (45%), Gaps = 11/244 (4%)

Query: 4 FRFGLLFILVAAC----VAATVHAAPVNWHTRMVDYTADSKDIKDVLRDFAASQGIPADI 59
F F V +++ A ++W Y A + ++D+L DF A+ +
Sbjct: 3 FPLHSFFKRVLTGTLLLLSSYSWAQELDWLPIPYVYVAKGESLRDLLTDFGANYDATVVV 62

Query: 60 SKDVQGSVTGKF-HMPPQRLLDTLASSFGFVWYYDGQVLDIVTPDEMKSTLIKLDHGSTA 118
S + V+G+F H PQ L +AS + VWYYDG VL I E+ S LI+L A
Sbjct: 63 SDKINDKVSGQFEHDNPQDFLQHIASLYNLVWYYDGNVLYIFKNSEVASRLIRLQESEAA 122

Query: 119 QLRSTLAAMNVTDPRFRITYDDVQGAAIVNGPPNYVKLVGDVAQRLDTTTRHRA----GT 174
+L+ L + +PRF D V+GPP Y++LV A L+ T+ R+
Sbjct: 123 ELKQALQRSGIWEPRFGWRPDASNRLVYVSGPPRYLELVEQTAAALEQQTQIRSEKTGAL 182

Query: 175 VVQVYPLHHAWAMDRSVVADGQSMTLLGVATVLNNVYH--PQQGGGGNSGGGGRAPNVQR 232
++++PL +A A DR++ + GVAT+L V Q ++ +A
Sbjct: 183 AIEIFPLKYASASDRTIHYRDDEVAAPGVATILQRVLSDATIQQVTVDNQRIPQAATRAS 242

Query: 233 AQPM 236
AQ
Sbjct: 243 AQAR 246


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
Bcenmc03_5920  HTHFIS  79     6e-19    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 78.7 bits (194), Expect = 6e-19
Identities = 29/129 (22%), Positives = 53/129 (41%), Gaps = 1/129 (0%)

Query: 5 TRIMLLEDDRIQQTMLVSWLKAEGYQVEAFDNGIEARNHLSDHWADLMILDWDVPGLSGD 64
I++ +DD +T+L L GY V N ++ DL++ D +P +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 65 KLLSWVRGRSRSTVPVIFQTVHSDEEEIVRILDTGADDFLIKPVDRIVFLARIRALLRRF 124
LL ++ R +PV+ + + ++ + GA D+L KP D + I L
Sbjct: 64 DLLPRIKKA-RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122

Query: 125 QTAGSERRR 133
+ S+
Sbjct: 123 KRRPSKLED 131


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5921  TYPE3OMGPROT  164    2e-46    Type III secretion system outer membrane G protein ...
		>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature.

Length = 607

Score = 164 bits (417), Expect = 2e-46
Identities = 83/273 (30%), Positives = 126/273 (46%), Gaps = 14/273 (5%)

Query: 77 PPWSSAPYRYSTSGASLPDTLRALSAATHVPIAFDAGLPGRVEGRFEL-PPQRFVEMLAH 135
W PY Y G SL D L A + + +V G+FE PQ F++ +A
Sbjct: 29 LDWLPIPYVYVAKGESLRDLLTDFGANYDATVVVSDKINDKVSGQFEHDNPQDFLQHIAS 88

Query: 136 GYGLVWYYDGTVLHVDAAGTQTTLIVRLNYARPTDLHALLAQTGIDDVRFVARDDAPARG 195
Y LVWYYDG VL++ + ++RL + +L L ++GI + RF R D +
Sbjct: 89 LYNLVWYYDGNVLYIFKNSEVASRLIRLQESEAAELKQALQRSGIWEPRFGWRPD-ASNR 147

Query: 196 LITFRGPPAWIALVGRAAQRLDADARARV----KTAVRIVPLHYGNAADRSAFANGRSNV 251
L+ GPP ++ LV + A L+ + R A+ I PL Y +A+DR+
Sbjct: 148 LVYVSGPPRYLELVEQTAAALEQQTQIRSEKTGALAIEIFPLKYASASDRTIHYRDDEVA 207

Query: 252 VQGVASRAARVLDPHDSLRATITEYEAP--------LPVLGADAGTNAVLVRDRPERLDA 303
GVA+ RVL + T+ P + AD NA++VRD PER+
Sbjct: 208 APGVATILQRVLSDATIQQVTVDNQRIPQAATRASAQARVEADPSLNAIIVRDSPERMPM 267

Query: 304 DVRAIIALDRPRQHVGLGLLVAEVDTDALGAIG 336
R I ALD+P + + L + +++ D L +G
Sbjct: 268 YQRLIHALDKPSARIEVALSIVDINADQLTELG 300


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
Bcenmc03_5922  PF05932  29     0.003    Tir chaperone protein (CesT)
		>PF05932#Tir chaperone protein (CesT)

Length = 127

Score = 29.0 bits (65), Expect = 0.003
Identities = 16/96 (16%), Positives = 29/96 (30%), Gaps = 8/96 (8%)

Query: 37 DVRLD-----HFENDPEAMYVNFHYGTVTAGRTLVIFRLMLEANLLIYAQDQAQLGLDAD 91
++ +D D + G + + + L L L LGLD
Sbjct: 31 NMIIDNTFALTLSCDYARERLLL-IGLLEPHKDIPQQCL-LAGALNPLLNAGPGLGLDEK 88

Query: 92 TGGIILILRLPLTPDVDGAVVADTVSHYTEHGRYWR 127
+G +P + + ++ E R WR
Sbjct: 89 SGLYHAYQSIPRE-KLSVPTLKREMAGLLEWMRGWR 123


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5927  TYPE3IMQPROT  56     2e-14    Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature.

Length = 86

Score = 56.3 bits (136), Expect = 2e-14
Identities = 26/78 (33%), Positives = 43/78 (55%)

Query: 4 DTLIGLTSQGLLLCLYISLPAIIVSALSGLIVAFLQAITSLQDQTISHSVKLFAVIGTLL 63
D L+ ++ L L L +S IV+ + GL+V Q +T LQ+QT+ +KL V L
Sbjct: 2 DDLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLF 61

Query: 64 LTAGWGGTTILRFALRLL 81
L +GW G +L + +++
Sbjct: 62 LLSGWYGEVLLSYGRQVI 79


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5928  TYPE3IMPPROT  215    6e-73    Type III secretion system inner membrane P protein ...
		>TYPE3IMPPROT#Type III secretion system inner membrane P protein family signature.

Length = 224

Score = 215 bits (548), Expect = 6e-73
Identities = 72/213 (33%), Positives = 117/213 (54%), Gaps = 7/213 (3%)

Query: 10 LLIAVIAIGLIPFVAMVVTSYAKIVVVLGLLRNALGVQQVPPNMVLNGIAILVTLYVMAP 69
L+ + L+PF+ T + K +V ++RNALG+QQ+P NM LNG+A+L++++VM P
Sbjct: 7 LIALLAFSTLLPFIIASGTCFVKFSIVFVMVRNALGLQQIPSNMTLNGVALLLSMFVMWP 66

Query: 70 IGMSAADTMRHEQLANSPAEIMVQAFGASQAPFRTFLKAHSRERERLFFMRSAHVIWPES 129
I A E + + + + +R +L +S FF +
Sbjct: 67 IMHDAYVYFEDEDVTFNDISSLSKHVDEGLDGYRDYLIKYSDRELVQFFENAQLKRQYGE 126

Query: 130 LA-------NALHEDDLIVLAPAFTLTELADAFKIGFLLYIAFIIVDFVIANVLLAMGLN 182
+ + + + L PA+ L+E+ AFKIGF LY+ F++VD V+++VLLA+G+
Sbjct: 127 ETETVKRDKDEIEKPSIFALLPAYALSEIKSAFKIGFYLYLPFVVVDLVVSSVLLALGMM 186

Query: 183 QIQPTNVAIPFKLLLFVAMDGWSALIHGLILTY 215
+ P ++ P KL+LFVA+DGW+ L GLIL Y
Sbjct: 187 MMSPVTISTPIKLVLFVALDGWTLLSKGLILQY 219


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5929  TYPE3OMOPROT  59     5e-12    Type III secretion system outer membrane O protein ...
		>TYPE3OMOPROT#Type III secretion system outer membrane O protein family signature.

Length = 303

Score = 59.2 bits (143), Expect = 5e-12
Identities = 39/179 (21%), Positives = 72/179 (40%), Gaps = 22/179 (12%)

Query: 180 PAPLPARLGRIRVPGYLIVGEKALPIATLRRLRPGDVVLRFADPAFAGWDGAAPTHALAL 239
PA R +R P ++G + L R+ GDV+L A
Sbjct: 138 PAVGGGRPKMLRWPLRFVIGSSDTQRSLLGRIGIGDVLLIRTSRAEV------------- 184

Query: 240 RWGVAGMHQYVAHATLDGPRLVLDTEPA-MTDRNDHPVPGAPDAETQTSLDELELPVSFE 298
+ ++G +V + + + N+ AET L++L + + F
Sbjct: 185 ---YCYAKKLGHFNRVEGGIIVETLDIQHIEEENNTTET----AETLPGLNQLPVKLEFV 237

Query: 299 IETVTLPLAQLSALRPGYVIELRGTLRDARIRLLAYGQLIGIGELVTVGEQLGVRVIDV 357
+ + LA+L A+ ++ L T + + ++A G L+G GELV + + LGV + +
Sbjct: 238 LYRKNVTLAELEAMGQQQLLSL-PTNAELNVEIMANGVLLGNGELVQMNDTLGVEIHEW 295


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
Bcenmc03_5930  PF05616  27     0.050    Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 27.4 bits (60), Expect = 0.050
Identities = 14/42 (33%), Positives = 20/42 (47%), Gaps = 1/42 (2%)

Query: 67 PLRRPADADAPDDVLADTADP-PDTDADADDDLDADAPPAER 107
P PA+ AP++ +P PD D + D + D D P R
Sbjct: 333 PAENPANNPAPNENPGTRPNPEPDPDLNPDANPDTDGQPGTR 374


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5932  TYPE3IMSPROT  284    3e-96    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 284 bits (728), Expect = 3e-96
Identities = 114/348 (32%), Positives = 193/348 (55%), Gaps = 9/348 (2%)

Query: 1 MSDEKTEQPTRRKLKEARKKGTVAKSVDLVAAVLVLVSLA-LFTFAWHPLLDALHQNISR 59
MS EKTEQPT +K+++ARKKG VAKS ++V+ L++ A L + + +
Sbjct: 1 MSGEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIP 60

Query: 60 AIDFCGS-ERSMHTLWSVLMHELANGAIACCAISVAAALAAALSIGAQVGLQVSFDPVMP 118
A +++ + + + L C + AAL A S Q G +S + + P
Sbjct: 61 AEQSYLPFSQALSYV---VDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKP 117

Query: 119 KMDRLSPAAGLKQIFSKRALIDLLKMSVKAVVIAAALWHVIVSLFPLITSAMAEPLAALS 178
+ +++P G K+IFS ++L++ LK +K V+++ +W +I + + ++
Sbjct: 118 DIKKINPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECIT 177

Query: 179 QILWHALLKLLFAAAVLFLVVGVVDWKIQHWLMMRAQRMSKDEVKRELKEREGEPKLKHE 238
+L L +L+ V F+V+ + D+ +++ ++ +MSKDE+KRE KE EG P++K +
Sbjct: 178 PLLGQILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSK 237

Query: 239 RRKRAKELVNGGGESLSGAVGRANVVVVNPTHYAVALRYVRGETPLPVVLAKGIDDAARE 298
RR+ +E+ + ++ V R++VVV NPTH A+ + Y RGETPLP+V K D +
Sbjct: 238 RRQFHQEIQS---RNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQT 294

Query: 299 IRALATQRGVPIVANPPVARALY-EVAEHRPIPSELFEVVAAILRWVE 345
+R +A + GVPI+ P+ARALY + IP+E E A +LRW+E
Sbjct: 295 VRKIAEEEGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLE 342


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5936  FLGMRINGFLIF  92     4e-23    Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 92.3 bits (229), Expect = 4e-23
Identities = 53/199 (26%), Positives = 82/199 (41%), Gaps = 9/199 (4%)

Query: 1 MNARIRIVGRASVHACVVLCVALCIALAGCKQE-LYGNLSEQDCNEIVAALLQAGIDAKK 59
+ A RI + A V + VA+ + L+ NLS+QD IVA L Q I +
Sbjct: 19 LRANPRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQDGGAIVAQLTQMNIPYRF 78

Query: 60 ESADGGKTWSAKVDDAHIVQAMNVLREHGLPAHKYDDLGDLFKKDGLVSTPTEERVRFIY 119
+ G D H + L + GLP +L ++ + E+V +
Sbjct: 79 AN--GSGAIEVPADKVH--ELRLRLAQQGLPKGGAVGF-ELLDQEKFGISQFSEQVNYQR 133

Query: 120 GVSQELSKTLSQIDGVMVARVHIVLPDNDPLAMHVKPSSASVFIKYLPTANL--AMIEPQ 177
+ EL++T+ + V ARVH+ +P K SASV + P L I
Sbjct: 134 ALEGELARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRALDEGQIS-A 192

Query: 178 IKNLVVHSVEGLSYDKVSL 196
+ +LV +V GL V+L
Sbjct: 193 VVHLVSSAVAGLPPGNVTL 211


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
Bcenmc03_5938  FLGFLIH  29     0.022    Flagellar assembly protein FliH signature.
		>FLGFLIH#Flagellar assembly protein FliH signature.

Length = 228

Score = 28.6 bits (63), Expect = 0.022
Identities = 23/76 (30%), Positives = 34/76 (44%), Gaps = 12/76 (15%)

Query: 163 QVRVHPDEHDAAVETFSHAVA--QWRARGQPVQLTVHADRTLDRGACVCDTDIGSVDASL 220
Q+RVHPD+ + ++ WR RG P TL G C D G +DAS+
Sbjct: 162 QLRVHPDDLQRVDDMLGATLSLHGWRLRGDP---------TLHPGGCKVSADEGDLDASV 212

Query: 221 KVQIDAV-RMAADAAL 235
+ + R+AA +
Sbjct: 213 ATRWQELCRLAAPGVV 228


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits       Score  E-value  Comments
Bcenmc03_5940  RTXTOXIND  30     0.006    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 29.8 bits (67), Expect = 0.006
Identities = 15/131 (11%), Positives = 40/131 (30%), Gaps = 6/131 (4%)

Query: 4 RRAMAFSTIRNRLTRARAALRDTLAEQQRERDEADARLAEQQRVLTHAAQEVDRRAARID 63
R + + L +++ ER AR+ + + +D ++
Sbjct: 186 RLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSS--- 242

Query: 64 RLLDGHGPVRIDELLDWEKLLADAHARRARELDTLQQLRDGVAAINHAIGTTRTAILRHD 123
L + +L+ E +A L+Q+ + + T + +++
Sbjct: 243 --LLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEY-QLVTQLFKNE 299

Query: 124 VRIDLCSARLD 134
+ L +
Sbjct: 300 ILDKLRQTTDN 310


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5941  TYPE3IMRPROT  137    1e-41    Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature.

Length = 261

Score = 137 bits (348), Expect = 1e-41
Identities = 47/240 (19%), Positives = 106/240 (44%), Gaps = 2/240 (0%)

Query: 17 LSFMILVAVCGVRLLVLMTIFPPTGGELITKRIRNAMVVLWSIYVAYGQQALMTQLHGGF 76
LS++ L +R+L L++ P + KR++ + ++ + +A A + F
Sbjct: 10 LSWLNLYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPANDVPVFSFF 69

Query: 77 LLAVVVKEAVIGLVIAFVASPVFWVAEAVGTYIDDLTGYNNVQITNPSLGQQTTLTSTLL 136
L + V++ +IG+ + F F G I G + +P+ + + ++
Sbjct: 70 ALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHLNMPVLARIM 129

Query: 137 MQCATVAFWTLGGMTFLLGAVFQTYVWWPLGSLTPVPRAFIEAFVMQQTDSLMVTIAKLA 196
A + F T G +L+ + T+ P+G AF+ + + + + LA
Sbjct: 130 DMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFL--ALTKAGSLIFLNGLMLA 187

Query: 197 GPAVLLLLLVDVGVGLLSRIASKLDLVSLAQPVKGALAVLLLALMIGMFIGQVKDQVALL 256
P + LLL +++ +GLL+R+A +L + + P+ + + L+A ++ + + + +
Sbjct: 188 LPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFCEHLFSEI 247


78  Bcenmc03_5974  Bcenmc03_5979  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias   Product
Bcenmc03_5974  -1      11       3.188892  PRC-barrel domain-containing protein
Bcenmc03_5975  0       11       3.153559  hypothetical protein
Bcenmc03_5976  0       11       2.954873  hypothetical protein
Bcenmc03_5977  0       11       3.057497  transport-associated
Bcenmc03_5978  1       10       2.814762  PAS/PAC sensor hybrid histidine kinase
Bcenmc03_5979  1       11       1.110667  sigma-54 dependent transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
Bcenmc03_5974  PF05272  29     0.009    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.9 bits (64), Expect = 0.009
Identities = 11/43 (25%), Positives = 14/43 (32%)

Query: 109 DKSRWPTMADPEWAEALHAYYGSSPYWLIEEGETALDSPPYEA 151
+ +AEALH Y Y+ E E P E
Sbjct: 718 NLVWLQKFRGQLFAEALHLYLAGERYFPSPEDEEIYFRPEQEL 760


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
Bcenmc03_5976  CHANLCOLICIN  29     0.013    Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 28.9 bits (64), Expect = 0.013
Identities = 13/45 (28%), Positives = 19/45 (42%), Gaps = 1/45 (2%)

Query: 66 LEGVAIGAIVGLVGAALLYLGGLHSPLAWIGVPLVGGYVGALCGA 110
LE A A V V ALL+ + L G+ +V G + +
Sbjct: 467 LEKKAADAGVSYV-VALLFSLLAGTTLGIWGIAIVTGILCSYIDK 510


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
Bcenmc03_5978  HTHFIS  91     2e-21    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 91.0 bits (226), Expect = 2e-21
Identities = 35/118 (29%), Positives = 53/118 (44%), Gaps = 3/118 (2%)

Query: 520 LDGQRVLVVDDDATSRTSLAAALETMGAQVSTARSGHDALEAVERQPPSVVLSDLAMPDG 579
+ G +LV DDDA RT L AL G V + + +V++D+ MPD
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 580 DGYWLLDRIRRLPNGGGHLPVVAVTAHAGKADRRRVMAAGFDAYLCKPVDMPTLASVI 637
+ + LL RI++ LPV+ ++A + G YL KP D+ L +I
Sbjct: 61 NAFDLLPRIKKA---RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGII 115


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
Bcenmc03_5979  HTHFIS  326    e-111    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 326 bits (836), Expect = e-111
Identities = 124/344 (36%), Positives = 172/344 (50%), Gaps = 32/344 (9%)

Query: 5 ANRTKQRADGRRLSGRSAAMRTLLGRIEKIAPTRASVMIAGESGVGKDIVARRLHDLSAR 64
+ DG L GRSAAM+ + + ++ T ++MI GESG GK++VAR LHD R
Sbjct: 127 SKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKR 186

Query: 65 HDGPFVPMNCGAIPAELAEAQLFGHEKGSFTGAITQREGFFEAARGGTLLLDEIAEMPAA 124
+GPFV +N AIP +L E++LFGHEKG+FTGA T+ G FE A GGTL LDEI +MP
Sbjct: 187 RNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMD 246

Query: 125 LQVKLLRAIESNTIVRVGGTEPIPLDVRFVSATRHNPADAVRDGRLREDLFYRLAAFAIY 184
Q +LLR ++ VGG PI DVR V+AT + ++ G REDL+YRL +
Sbjct: 247 AQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLR 306

Query: 185 VPPLRQRDGDVEMIAQEFVDTLNARHRAHKRLTDAAIAALRAYSWPGNVRELHNTIERAY 244
+PPLR R D+ + + FV KR A+ ++A+ WPGNVREL N + R
Sbjct: 307 LPPLRDRAEDIPDLVRHFVQQAEKEGLDVKRFDQEALELMKAHPWPGNVRELENLVRRLT 366

Query: 245 ILADEGI---------------DVALPKQALPAAESTAEGAM-------------ALPVG 276
L + + D + K A + + A+ ALP
Sbjct: 367 ALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPS 426

Query: 277 ATLHHAQQRF----IAETLRHFDGNKPRAAKALGISLKTLYNRL 316
I L GN+ +AA LG++ TL ++
Sbjct: 427 GLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKI 470
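
Each hit above carries a "Score = ..., Expect = ..." line followed by an "Identities = ..." line. The following is a minimal Python sketch for reading those two lines back into numbers. It is not part of PredictBias; the names `parse_hit_stats` and `percent_matches` are hypothetical. It assumes the printed percentages are truncated rather than rounded, which agrees with the figures in this report (for example 72/213 is shown as 33%), and that an abbreviated E-value such as "e-111" means 1e-111.

```python
import re

# Patterns for the two statistics lines that precede each alignment in this report.
SCORE_RE = re.compile(r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)")
FRACTION_RE = re.compile(r"(Identities|Positives|Gaps)\s*=\s*(\d+)/(\d+)\s*\((\d+)%\)")


def parse_hit_stats(text):
    """Return bit score, raw score, E-value and the printed fractions of one hit."""
    score = SCORE_RE.search(text)
    if score is None:
        raise ValueError("no 'Score = ... Expect = ...' line found")
    expect = score.group(3)
    # Assumption: a bare exponent such as 'e-111' stands for 1e-111.
    if expect.startswith("e"):
        expect = "1" + expect
    stats = {
        "bits": float(score.group(1)),
        "raw_score": int(score.group(2)),
        "evalue": float(expect),
    }
    # The Gaps field is absent for ungapped alignments, so it is optional here.
    for name, num, den, pct in FRACTION_RE.findall(text):
        stats[name.lower()] = (int(num), int(den), int(pct))
    return stats


def percent_matches(num, den, pct):
    """Check a printed percentage against its fraction, assuming truncation."""
    return (100 * num) // den == pct


example = """Score = 326 bits (836), Expect = e-111
Identities = 124/344 (36%), Positives = 172/344 (50%), Gaps = 32/344 (9%)"""

stats = parse_hit_stats(example)
print(stats["bits"], stats["raw_score"], stats["evalue"])
print(all(percent_matches(*stats[k]) for k in ("identities", "positives", "gaps")))
```

Applied to the weak hits in this section (E-values between 0.003 and 0.050), the same function makes it easy to filter results against the E-value threshold of 0.05 listed in the input parameters.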



 
Contact Sachin Pundhir for Bugs/Comments.