PredictBias

Identification of genomic and pathogenicity islands in prokaryotic genomes
 
A) Input parameters
Genome: 256.gbk
Threshold dinucleotide bias: 2
Threshold codon bias: 4
Threshold %GC bias: 3
E-value (RPSBlast): 0.05
Genome (non-pathogenic):
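
The E-value cutoff listed above (0.05) can be checked against the E-value strings reported in the hit tables of this page. The following is a minimal Python sketch, not the PredictBias implementation. It assumes a hit is kept when its E-value is at most the cutoff, which this page does not state, and the names `parse_evalue` and `passes_cutoff` are hypothetical helpers. BLAST-style output abbreviates 1e-126 as "e-126", which Python's float() rejects, so the sketch restores the leading 1 first.

```python
def parse_evalue(text: str) -> float:
    """Convert a BLAST-style E-value string to a float.

    Values such as 1e-126 are printed as "e-126", so a leading "1"
    is restored before conversion.
    """
    text = text.strip()
    if text.lower().startswith("e"):
        text = "1" + text
    return float(text)


def passes_cutoff(evalue_text: str, cutoff: float = 0.05) -> bool:
    # Assumption: a hit is kept when its E-value does not exceed the cutoff.
    return parse_evalue(evalue_text) <= cutoff


# E-value strings copied from hit tables on this page.
reported = ["3e-23", "e-126", "0.007", "0.048"]
kept = [e for e in reported if passes_cutoff(e)]
```

Under this assumption every E-value in the sample list passes, which is consistent with those hits being shown in the report.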
 
 
C) Potential GIs and PAIs in NC_010184
S.No  Start           End             Bias  Virulence  Insertion elements  Prediction
1     BcerKBAB4_0243  BcerKBAB4_0270  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_0243  1       28       3.505450   putative lipoprotein
BcerKBAB4_0244  -1      25       3.296076   abortive infection protein
BcerKBAB4_0245  1       22       2.653442   co-chaperonin GroES
BcerKBAB4_0246  0       18       2.240302   chaperonin GroEL
BcerKBAB4_0247  -1      13       1.107093   GMP synthase
BcerKBAB4_0248  -1      12       0.424259   xanthine/uracil/vitamin C permease
BcerKBAB4_0249  2       15       -0.421589  two component transcriptional regulator
BcerKBAB4_0250  1       17       -0.668487  histidine kinase
BcerKBAB4_0251  3       16       -1.670080  putative alpha/beta hydrolase
BcerKBAB4_0252  3       17       -2.020330  DSBA oxidoreductase
BcerKBAB4_0253  4       19       -2.259237  hypothetical protein
BcerKBAB4_0254  3       19       -2.570480  methyltransferase type 12
BcerKBAB4_0255  3       18       -2.691864  methyltransferase type 11
BcerKBAB4_0257  4       19       -2.851335  NAD-dependent epimerase/dehydratase
BcerKBAB4_0258  3       21       -1.822097  methyltransferase type 12
BcerKBAB4_0259  3       22       -1.656394  glycosyl transferase family protein
BcerKBAB4_0260  4       22       -1.727480  hypothetical protein
BcerKBAB4_0262  4       18       -1.920349  undecaprenyl pyrophosphate phosphatase
BcerKBAB4_0263  4       20       -2.291231  bacitracin ABC transporter, permease
BcerKBAB4_0264  2       18       -1.462878  ABC transporter
BcerKBAB4_0265  0       15       -0.772294  histidine kinase
BcerKBAB4_0266  -1      10       -0.081885  two component transcriptional regulator
BcerKBAB4_0267  0       11       0.398466   hypothetical protein
BcerKBAB4_0268  0       11       0.699173   hypothetical protein
BcerKBAB4_0269  2       14       1.367102   phosphoribosylaminoimidazole carboxylase
BcerKBAB4_0270  2       16       2.109798   phosphoribosylaminoimidazole carboxylase ATPase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0249  HTHFIS        91     3e-23    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 90.7 bits (225), Expect = 3e-23
Identities = 39/122 (31%), Positives = 62/122 (50%), Gaps = 1/122 (0%)

Query: 1 MAGETILVVDDEKEIRNLITIYLKNEGYKVLQAGDGEEGLRILEENEVHLVVLDIMMPKV 60
M G TILV DD+ IR ++ L GY V + R + + LVV D++MP
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 DGIHMCMKVREA-KEMPIIMLSAKTQDMDKILGLTTGADDYVAKPFNPLELIARIKSQLR 119
+ + ++++A ++P++++SA+ M I GA DY+ KPF+ ELI I L
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 120 RY 121

Sbjct: 121 EP 122
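
The "Identities / Positives / Gaps" summary line printed with each alignment can be read programmatically. The following is a minimal Python sketch under stated assumptions: the regular expression and the helper name `parse_stats` are illustrative and not part of PredictBias or BLAST, and the sketch only covers summary lines in the form shown on this page. It also checks an observation from the sample line: the printed percentage matches integer truncation of count/length (39/122 is 31.97%, shown as 31%).

```python
import re

# Matches entries such as "Identities = 39/122 (31%)".
STAT = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_stats(line: str) -> dict:
    """Return {name: (count, length, percent)} for one summary line."""
    return {
        name: (int(count), int(length), int(percent))
        for name, count, length, percent in STAT.findall(line)
    }


# Summary line copied from the first alignment on this page.
line = "Identities = 39/122 (31%), Positives = 62/122 (50%), Gaps = 1/122 (0%)"
stats = parse_stats(line)

# The printed percentage equals the truncated ratio for this line.
for name, (count, length, percent) in stats.items():
    assert percent == count * 100 // length
```

Summary lines without a Gaps entry (for example ungapped alignments) simply yield a dictionary with two keys.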


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0257  NUCEPIMERASE  149    1e-44    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 149 bits (377), Expect = 1e-44
Identities = 74/336 (22%), Positives = 136/336 (40%), Gaps = 37/336 (11%)

Query: 5 NYLIVGGNSFIGINLALGLLKQGQNVKVFSRHINNFPQNIISE----------VEFIKGD 54
YL+ G FIG +++ LL+ G V ++N++ + + +F K D
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGID-NLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 55 LANVKDIYK--ALVSVDIIIYLAATSNVATSIEDVFGDINSSFF-FLNFMESVKNFPVKK 111
LA+ + + A + + V S+E+ +S+ FLN +E ++ ++
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120

Query: 112 IVLASSGGTVYGEPEYLPIDEEHPL-KPLSPYGITKVSLENYLYFYKKKYGIDYVVCRYS 170
++ ASS +VYG +P + + P+S Y TK + E + Y YG+ R+
Sbjct: 121 LLYASSS-SVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFF 179

Query: 171 NPYGKYQNPLKKVGAINCFLYQHLSNEKINIYGNPQEIIRDYIYIDDLVEITIQLSQLNR 230
YG + P A+ F L + I++Y ++ RD+ YIDD+ E I+L +
Sbjct: 180 TVYGPWGRPDM---ALFKFTKAMLEGKSIDVYNY-GKMKRDFTYIDDIAEAIIRLQDVIP 235

Query: 231 LKSC-----------------VYNIGSGKGLSLKRIIVELEKLTERKVEFTCYKQRQENV 273
VYNIG+ + L I LE + + + +V
Sbjct: 236 HADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNMLPLQPGDV 295

Query: 274 QKIILNIDRVRRECNWEPKIDFKSGIRLNKLWIEEF 309
+ + + + P+ K G++ W +F
Sbjct: 296 LETSADTKALYEVIGFTPETTVKDGVKNFVNWYRDF 331


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0266  HTHFIS        91     4e-23    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 90.7 bits (225), Expect = 4e-23
Identities = 32/121 (26%), Positives = 61/121 (50%), Gaps = 1/121 (0%)

Query: 1 MKDIRILIADDDKEIRDLLKRYLERELYMVDTAINGEEALCLFNQNNYNLVILDLMMPKV 60
M IL+ADDD IR +L + L R Y V N + +LV+ D++MP
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 DGIEVCRKLRDT-TNIPILMLTAKDHEVDKILGLSIGADDYITKPFSIHEVVARVKALMR 119
+ ++ +++ ++P+L+++A++ + I GA DY+ KPF + E++ + +
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 120 R 120

Sbjct: 121 E 121


2     BcerKBAB4_0331  BcerKBAB4_0338  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_0331  2       13       -2.582320  periplasmic binding protein
BcerKBAB4_0332  1       14       -3.268071  hypothetical protein
BcerKBAB4_0333  2       15       -3.695372  hypothetical protein
BcerKBAB4_0334  3       16       -4.456413  DNA binding domain-containing protein
BcerKBAB4_0335  2       15       -4.221167  spore coat protein B
BcerKBAB4_0336  1       13       -4.227023  spore coat protein B
BcerKBAB4_0337  1       13       -3.630904  hypothetical protein
BcerKBAB4_0338  1       14       -3.187519  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0331  FERRIBNDNGPP  52     6e-10    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 52.3 bits (125), Expect = 6e-10
Identities = 47/250 (18%), Positives = 101/250 (40%), Gaps = 25/250 (10%)

Query: 52 NPQRVVVLS-SFAGNVMSLGVNLVGV------DSWSKQNPRFDSKLKNVAEVSDENVEKI 104
+P R+V L +++LG+ GV W + P DS + +V ++ N+E +
Sbjct: 34 DPNRIVALEWLPVELLLALGIVPYGVADTINYRLWVSEPPLPDSVI-DVGLRTEPNLELL 92

Query: 105 AELNPDLIIGLSNIK-NVDKLKKIAPTVTYTYG----KVDYLTQHL-EIGKLLNKEKEAK 158
E+ P ++ + + + L +IAP + + + + L E+ LLN + A+
Sbjct: 93 TEMKPSFMVWSAGYGPSPEMLARIAPGRGFNFSDGKQPLAMARKSLTEMADLLNLQSAAE 152

Query: 159 TWVDDFKKRAQTAGKDIKAKIGEDATVSVVENFN--KQLYVYGENWGRGTEILYQEMKLK 216
T + ++ ++ + A ++ + + V+G N EIL +
Sbjct: 153 THLAQYEDFIRSMKPRF---VKRGARPLLLTTLIDPRHMLVFGPN-SLFQEIL---DEYG 205

Query: 217 MPEKVKEKALKEGYYALSTEVLPEFAGDYLIV--SKNKDTDNSFQETESYKNIPAVKNNR 274
+P + + G A+S + L + ++ N ++ T ++ +P V+ R
Sbjct: 206 IPNAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDHDNSKDMDALMATPLWQAMPFVRAGR 265

Query: 275 VYEANMMEFY 284
+ FY
Sbjct: 266 FQRVPAVWFY 275


3     BcerKBAB4_0595  BcerKBAB4_0608  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_0595  4       19       -3.298698  VanZ family protein
BcerKBAB4_0596  6       17       -2.729944  hypothetical protein
BcerKBAB4_0597  5       15       -2.460929  undecaprenyl pyrophosphate phosphatase
BcerKBAB4_0598  7       14       -0.775872  methyl-accepting chemotaxis sensory transducer
BcerKBAB4_0599  3       11       0.689222   hypothetical protein
BcerKBAB4_0600  1       11       0.451524   XRE family transcriptional regulator
BcerKBAB4_0601  1       11       -0.280187  hypothetical protein
BcerKBAB4_0602  0       8        0.694090   sortase family protein
BcerKBAB4_0603  1       9        1.138228   hypothetical protein
BcerKBAB4_0604  2       10       0.459052   amino acid/peptide transporter
BcerKBAB4_0605  2       9        0.182697   branched-chain amino acid transport system II
BcerKBAB4_0606  1       10       -0.019222  hypothetical protein
BcerKBAB4_0607  1       16       0.780770   hypothetical protein
BcerKBAB4_0608  2       16       -0.117916  amino acid permease
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0596  SURFACELAYER  27     0.007    Lactobacillus surface layer protein signature.
		>SURFACELAYER#Lactobacillus surface layer protein signature.

Length = 439

Score = 26.6 bits (58), Expect = 0.007
Identities = 7/31 (22%), Positives = 20/31 (64%)

Query: 10 GDHITTFESTWRFQEGEQIFVHDDNTKRNFI 40
G +TT+ +++F+ G++ + NT++ ++
Sbjct: 403 GTEVTTYGGSYKFKNGQRYYKIGANTEKTYV 433


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0599  IGASERPTASE   45     4e-07    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 45.4 bits (107), Expect = 4e-07
Identities = 49/280 (17%), Positives = 95/280 (33%), Gaps = 26/280 (9%)

Query: 70 NLNNTIAKNKEDQAAIQRKIDETHKQI-----EQKKNEIVVLEDKVLARKDIMKKRMVSV 124
+LN ++ N D A + K+ + + E +K V + +I
Sbjct: 952 HLNVSLVGNTVDLGAWKYKLRNVNGRYDLYNPEVEKRNQTVDTTNITTPNNIQADVPSVP 1011

Query: 125 QNSSNTSLVVEVVVESKNFADFIQRMNAVTTILEADKEILRLQEQDLRQIEEDKKEIDEK 184
N+ + V E V A + V + + + + EQD + +E+ ++
Sbjct: 1012 SNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKE 1071

Query: 185 EASLVVDKQKLAKAQADLQDNLKKRQDNLQTVQAKYNQVASQLNLAAEEKAKVEANMKAV 244
S V + + QT + K + EEKAKVE
Sbjct: 1072 AKSNVKANTQTNEV-----AQSGSETKETQTTETKETA-----TVEKEEKAKVET----- 1116

Query: 245 QETIAREQEAARIAAEERAKAEAAAKAEQEALAKAQAEFAEKQKQEKANKPAEPVANKPA 304
+ QE ++ ++ K E + + +A + + K+ ++ +PA
Sbjct: 1117 ----EKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPA 1172

Query: 305 EPVANNSSKVEPEQPKVTAGGK--EMYVHATAYTADPSEN 342
+ ++N + E V G E + T T P+ N
Sbjct: 1173 KETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVN 1212


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0603  IGASERPTASE   42     3e-06    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 42.0 bits (98), Expect = 3e-06
Identities = 25/141 (17%), Positives = 49/141 (34%), Gaps = 14/141 (9%)

Query: 50 GEYHYHNKPASSGGTTSPAPSQNNNGAVEAERQAEAQRNAE------------AEKQRAA 97
G Y +N T + ++A+ + N E A +
Sbjct: 976 GRYDLYNPEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSE 1035

Query: 98 EAQRKAEEERQRAAEEQRKAEEERQRVAEEQRKAEEARKQEEAQRQADMEKGQLEGQKSG 157
+ AE +Q + ++ ++ + A+ + A+EA+ +A Q + E Q G ++
Sbjct: 1036 TTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTN-EVAQS-GSETK 1093

Query: 158 ETDFKAGKNNAEGHVAGKSDA 178
ET K A K+
Sbjct: 1094 ETQTTETKETATVEKEEKAKV 1114



Score = 39.7 bits (92), Expect = 2e-05
Identities = 30/191 (15%), Positives = 59/191 (30%), Gaps = 10/191 (5%)

Query: 78 EAERQAEAQRNAEAEKQRAAEAQRKAEEERQRAAEEQRKAEEERQRVAEEQRKAEEARKQ 137
A + AE KQ + ++ ++ + A+ + A+E + V + E A+
Sbjct: 1030 PATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSG 1089

Query: 138 ---EEAQRQADMEKGQLEGQKSGETDFKAGKNNAEGHVAGKSDAYKQAFTTAYAAAWSLE 194
+E Q E +E ++ + + + S +Q+ T A + E
Sbjct: 1090 SETKETQTTETKETATVEKEEKAKVE-TEKTQEVPKVTSQVSPKQEQSETVQPQAEPARE 1148

Query: 195 EQ-----KKAHFEKGKEQGLTQEAMDDSQITPEFKVNFAEGFQVGNKERTEKIEKEQAEL 249
K+ + Q A + S + V + GN A
Sbjct: 1149 NDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQ-PVTESTTVNTGNSVVENPENTTPATT 1207

Query: 250 GEKAGKELAEK 260
E + K
Sbjct: 1208 QPTVNSESSNK 1218



Score = 36.2 bits (83), Expect = 2e-04
Identities = 17/154 (11%), Positives = 50/154 (32%), Gaps = 3/154 (1%)

Query: 56 NKPASSGGTTSPAPSQNNNGAVEAERQAEAQRNAEAEKQRAAEAQRKAEEERQR--AAEE 113
P +P+ + + ++N + + A+ + A+E + A +
Sbjct: 1022 EAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQ 1081

Query: 114 QRKAEEERQRVAEEQRKAEEARKQEEAQRQADMEKGQLEGQKSGETDFKAGKNNAEGHVA 173
+ + E Q + E + +A +E + + + + +E V
Sbjct: 1082 TNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSET-VQ 1140

Query: 174 GKSDAYKQAFTTAYAAAWSLEEQKKAHFEKGKEQ 207
+++ ++ T + A E+ ++
Sbjct: 1141 PQAEPARENDPTVNIKEPQSQTNTTADTEQPAKE 1174


4     BcerKBAB4_0796  BcerKBAB4_0837  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_0796  1       17       3.554186   hypothetical protein
BcerKBAB4_0797  1       16       3.106566   hypothetical protein
BcerKBAB4_0798  2       14       2.473157   hydroxyglutarate oxidase
BcerKBAB4_0799  1       15       1.970610   glyoxalase family protein
BcerKBAB4_0800  1       17       2.048833   proline racemase
BcerKBAB4_0801  2       18       2.006391   ornithine cyclodeaminase
BcerKBAB4_0802  0       20       2.046538   hypothetical protein
BcerKBAB4_0803  -1      19       2.420864   extracellular solute-binding protein
BcerKBAB4_0804  -1      21       2.829053   binding-protein-dependent transport system inner
BcerKBAB4_0805  -2      21       4.025066   binding-protein-dependent transport system inner
BcerKBAB4_0806  -2      19       3.549148   oligopeptide/dipeptide ABC transporter ATPase
BcerKBAB4_0807  -2      19       2.918575   oligopeptide/dipeptide ABC transporter ATPase
BcerKBAB4_0808  -2      17       2.805380   hypothetical protein
BcerKBAB4_0809  -2      18       2.538403   Beta-lactamase
BcerKBAB4_0810  0       21       2.025894   general substrate transporter
BcerKBAB4_0811  0       22       -0.059081  signal transduction histidine kinase regulating
BcerKBAB4_0812  3       18       -1.514248  response regulator receiver
BcerKBAB4_0813  4       17       -3.129604  hypothetical protein
BcerKBAB4_0814  7       19       -4.693548  XRE family transcriptional regulator
BcerKBAB4_0815  6       20       -5.055641  integrase family protein
BcerKBAB4_0816  6       19       -3.612839  DNA-cytosine methyltransferase
BcerKBAB4_0817  8       20       -3.872047  ATPase
BcerKBAB4_0818  6       21       -3.732945  hypothetical protein
BcerKBAB4_0819  7       20       -3.476095  hypothetical protein
BcerKBAB4_0820  6       20       -3.926039  hypothetical protein
BcerKBAB4_0821  5       19       -3.568852  hypothetical protein
BcerKBAB4_0822  5       22       -4.809925  hypothetical protein
BcerKBAB4_0823  4       22       -4.219089  hypothetical protein
BcerKBAB4_0824  4       23       -3.888318  HNH endonuclease
BcerKBAB4_0825  2       31       -0.031938  hypothetical protein
BcerKBAB4_0827  2       42       4.430213   group-specific protein
BcerKBAB4_0828  2       37       3.905002   hypothetical protein
BcerKBAB4_0829  1       33       3.477338   MerR family transcriptional regulator
BcerKBAB4_0830  -1      30       3.395305   hypothetical protein
BcerKBAB4_0831  -1      28       3.329574   primosome subunit DnaD
BcerKBAB4_0832  -2      19       2.725719   replicative DNA helicase
BcerKBAB4_0833  -1      15       2.148874   transcriptional regulator TrmB
BcerKBAB4_0834  0       15       2.583892   hypothetical protein
BcerKBAB4_0835  0       18       2.914579   hypothetical protein
BcerKBAB4_0836  1       15       3.015557   TetR family transcriptional regulator
BcerKBAB4_0837  2       15       2.738346   NADH:flavin oxidoreductase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0809  BLACTAMASEA   356    e-126    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 356 bits (916), Expect = e-126
Identities = 91/291 (31%), Positives = 150/291 (51%), Gaps = 17/291 (5%)

Query: 20 VLLSCVSLIGCSNSNTQSEPPKQTNQANQIKQENTGNQSFAKLEKEYDAKLGIYALDTGT 79
+ L +SL+ + P + E + ++G+ +D +
Sbjct: 4 IRLCIISLLATLPLAVHASPQPL--------------EQIKLSESQLSGRVGMIEMDLAS 49

Query: 80 NQTV-AYHSDDRFAFASTSKSLAVGALLRKNSL--EALDQRITYTHEDLSNYNPITEKHV 136
+T+ A+ +D+RF ST K + GA+L + E L+++I Y +DL +Y+P++EKH+
Sbjct: 50 GRTLTAWRADERFPMMSTFKVVLCGAVLARVDAGDEQLERKIHYRQQDLVDYSPVSEKHL 109

Query: 137 DTGMTLKELADASVRYSDSTAHNLILKQLGGPSEFEKILREMGDTVTTSERFEPELNEVH 196
GMT+ EL A++ SD++A NL+L +GGP+ LR++GD VT +R+E ELNE
Sbjct: 110 ADGMTVGELCAAAITMSDNSAANLLLATVGGPAGLTAFLRQIGDNVTRLDRWETELNEAL 169

Query: 197 PGETHDTSTPEAIAKTLQSFTLGTALPIEKRELLVDWMKRNTTGDNLIRAGVPKGWEVAD 256
PG+ DT+TP ++A TL+ L + L+ WM + LIR+ +P GW +AD
Sbjct: 170 PGDARDTTTPASMAATLRKLLTSQRLSARSQRQLLQWMVDDRVAGPLIRSVLPAGWFIAD 229

Query: 257 KTGAGSYGTRNDIAIIWPPNKKPIVLAILSNHDKEDAKYDDKLIADATKVV 307
KTGAG G R +A++ P NK ++ I ++ IA +
Sbjct: 230 KTGAGERGARGIVALLGPNNKAERIVVIYLRDTPASMAERNQQIAGIGAAL 280


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0810  TCRTETA       34     8e-04    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 34.4 bits (79), Expect = 8e-04
Identities = 30/129 (23%), Positives = 57/129 (44%), Gaps = 16/129 (12%)

Query: 40 EFFPKGDPTSQLLNTAAIFAVGFLMRPIGSLLMGRYADRHGRRAALTLSITVMAGGSLII 99
+ D T+ A++A LM+ + ++G +DR GRR L +S+ A I+
Sbjct: 34 DLVHSNDVTAHYGILLALYA---LMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIM 90

Query: 100 ACTPSYESIGIMAPIILVLARLLQGLSLGGEYGTSATYLSEMASSGRR----GFYSSFQY 155
A P +L + R++ G++ G + Y++++ R GF S+
Sbjct: 91 ATAPFLW--------VLYIGRIVAGIT-GATGAVAGAYIADITDGDERARHFGFMSACFG 141

Query: 156 VTLVAGQMV 164
+VAG ++
Sbjct: 142 FGMVAGPVL 150



Score = 29.0 bits (65), Expect = 0.037
Identities = 21/82 (25%), Positives = 39/82 (47%), Gaps = 11/82 (13%)

Query: 285 VVLQPIAGLLSDKIGRRPLLMAFGILGTLLTAPIFFFMEKTTEPMVAFLLMMVGLII--V 342
P+ G LSD+ GRRP+L+ +L A + + + T ++ +G I+ +
Sbjct: 57 FACAPVLGALSDRFGRRPVLLV-----SLAGAAVDYAIMATAP---FLWVLYIGRIVAGI 108

Query: 343 TGYT-SINAIVKAELFPTEIRA 363
TG T ++ A++ + RA
Sbjct: 109 TGATGAVAGAYIADITDGDERA 130


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0811  PF06580       36     3e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 36.0 bits (83), Expect = 3e-04
Identities = 23/131 (17%), Positives = 53/131 (40%), Gaps = 24/131 (18%)

Query: 397 EKKIDFHIEGDSALHPLPDHIKVSHLITILGNIIDNAFD-AVSERGEKN-VSFFVTDIGH 454
E ++ F + + A+ ++V ++ + +++N +++ + + T
Sbjct: 237 EDRLQFENQINPAIM----DVQVPPML--VQTLVENGIKHGIAQLPQGGKILLKGTKDNG 290

Query: 455 DIVFEVIDSGAGILAEKITNIFQKGFSTKGNDRGYGLANVKEMVDLL---EGTIEIQNEK 511
+ EV ++G+ L G GL NV+E + +L E I++ +EK
Sbjct: 291 TVTLEVENTGSLALKNT------------KESTGTGLQNVRERLQMLYGTEAQIKL-SEK 337

Query: 512 NGGAIFTIYLP 522
G + +P
Sbjct: 338 QGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0812  HTHFIS        63     1e-13    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 62.5 bits (152), Expect = 1e-13
Identities = 37/151 (24%), Positives = 69/151 (45%), Gaps = 5/151 (3%)

Query: 3 KVAIAEDDFRVAQIQEEFLSKIK-DVKVIGKALNAKETMELLQKEEIDLLLLDNYLPDGI 61
+ +A+DD + + + LS+ DV++ NA + + DL++ D +PD
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITS---NAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 62 GTDLLPKIHADFPNVDVIMVTAANENHMLEKAIRNGVSNYLIKPVTLEKFVRTIEDYKRK 121
DLLP+I P++ V++++A N KA G +YL KP L + + I +
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 122 KQLLHSNNEVNQEIIDNFFGTS-QIQDMKNL 151
+ S E + + G S +Q++ +
Sbjct: 122 PKRRPSKLEDDSQDGMPLVGRSAAMQEIYRV 152


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0830  SECA          29     0.003    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 29.1 bits (65), Expect = 0.003
Identities = 12/51 (23%), Positives = 21/51 (41%)

Query: 26 RILNEDEYIEELCKKTQEELTEYIEAKIKPHKLEELSDLLELINALAEHEG 76
+L+ + E + ++ I+A I P LEE+ D+ L L
Sbjct: 665 ELLDVSDVSETINSIREDVFKATIDAYIPPQSLEEMWDIPGLQERLKNDFD 715


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0836  HTHTETR       54     2e-11    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 53.9 bits (129), Expect = 2e-11
Identities = 29/199 (14%), Positives = 62/199 (31%), Gaps = 21/199 (10%)

Query: 1 MEKVDRRIIKSKEAIKNAFIELMAEKGFDKITVKDICSGADVGNRTFYLHYLDKFDLLDK 60
K + ++++ I + + L +++G ++ +I A V Y H+ DK DL +
Sbjct: 2 ARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61

Query: 61 LVIERIEALKTLCAPLHD-------LSFREACIAWFENM--EQHYFF-----FSTMLAGK 106
+ + L RE I E+ E+ F
Sbjct: 62 IWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVG 121

Query: 107 GASAFRKHFFDYIIEQIKDDVD-IKEGINKG-----FSEDMIITFFGSAIVGVVETYFM- 159
+ ++ + +E +K I I G++E +
Sbjct: 122 EMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFA 181

Query: 160 KGLPDPPEIVAEQLGMLLD 178
D + + + +LL+
Sbjct: 182 PQSFDLKKEARDYVAILLE 200


5     BcerKBAB4_0847  BcerKBAB4_0931  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_0847  3       21       1.090110   hypothetical protein
BcerKBAB4_0848  0       20       1.261217   hypothetical protein
BcerKBAB4_0849  -1      17       0.280173   PadR-like family transcriptional regulator
BcerKBAB4_0850  -1      20       -1.341067  hypothetical protein
BcerKBAB4_0851  -3      20       -2.780564  protein-export membrane protein-related protein
BcerKBAB4_0853  -1      18       -3.901012  MarR family transcriptional regulator
BcerKBAB4_0854  2       20       -3.073696  CDP-diacylglycerol--serine
BcerKBAB4_0855  5       20       -4.262045  ATP synthase protein I
BcerKBAB4_0856  3       20       -5.100281  hypothetical protein
BcerKBAB4_0857  4       18       -4.401309  group-specific protein
BcerKBAB4_0858  4       20       -4.007558  hypothetical protein
BcerKBAB4_0859  5       21       -3.757575  hypothetical protein
BcerKBAB4_0860  3       23       -5.097166  hypothetical protein
BcerKBAB4_0861  2       22       -3.440630  hypothetical protein
BcerKBAB4_0863  1       21       -3.134755  hypothetical protein
BcerKBAB4_0864  1       18       -3.205192  barstar (barnase inhibitor)
BcerKBAB4_0865  2       18       -3.008874  hypothetical protein
BcerKBAB4_0866  1       22       -0.357396  hypothetical protein
BcerKBAB4_0867  -1      21       -1.091464  major facilitator transporter
BcerKBAB4_0868  0       17       1.274419   ArsR family transcriptional regulator
BcerKBAB4_0870  0       16       0.949647   VanZ family protein
BcerKBAB4_0872  0       17       1.824543   hypothetical protein
BcerKBAB4_0873  0       14       2.178596   DhaKLM operon coactivator DhaQ
BcerKBAB4_0874  -1      11       1.611593   TetR family transcriptional regulator
BcerKBAB4_0875  -1      11       3.100102   dihydroxyacetone kinase
BcerKBAB4_0876  -2      12       2.106600   hypothetical protein
BcerKBAB4_0877  -2      14       2.238799   hypothetical protein
BcerKBAB4_0878  -2      14       1.826825   luciferase family protein
BcerKBAB4_0879  -2      14       0.762685   RimK domain-containing protein ATP-grasp
BcerKBAB4_0880  -2      14       0.430149   peptidase M6 immune inhibitor A
BcerKBAB4_0881  -1      17       -0.890919  S-layer protein
BcerKBAB4_0882  -1      17       -2.122939  PadR-like family transcriptional regulator
BcerKBAB4_0883  2       12       -2.063075  hypothetical protein
BcerKBAB4_0884  2       17       -1.143902  phosphoribulokinase/uridine kinase
BcerKBAB4_0885  3       18       0.240188   NUDIX hydrolase
BcerKBAB4_0886  2       17       -0.164475  hypothetical protein
BcerKBAB4_0888  4       16       -1.391675  hypothetical protein
BcerKBAB4_0889  4       16       -1.541340  phospholipase D/transphosphatidylase
BcerKBAB4_0890  3       20       -1.273591  hypothetical protein
BcerKBAB4_0891  2       18       -2.002451  transposase IS4 family protein
BcerKBAB4_0892  3       16       -2.920385  erythromycin esterase
BcerKBAB4_0893  6       18       -3.171341  MarR family transcriptional regulator
BcerKBAB4_0894  4       16       -2.103745  acetyl-CoA carboxylase carboxyltransferase
BcerKBAB4_0895  5       20       -2.163280  hypothetical protein
BcerKBAB4_0896  5       20       -3.158126  group-specific protein
BcerKBAB4_0897  6       20       -3.262879  hypothetical protein
BcerKBAB4_0898  5       21       -2.158341  CsbD family protein
BcerKBAB4_0899  3       20       -2.298636  transglycosylase-associated protein
BcerKBAB4_0900  2       18       -3.372105  hypothetical protein
BcerKBAB4_0901  0       15       -3.746388  anti-sigma-factor antagonist
BcerKBAB4_0902  0       12       -3.719531  serine-protein kinase RsbW
BcerKBAB4_0903  0       13       -3.933348  RNA polymerase sigma factor SigB
BcerKBAB4_0904  0       13       -4.626383  ferritin Dps family protein
BcerKBAB4_0905  0       13       -4.222871  response regulator receiver modulated serine
BcerKBAB4_0906  0       13       -4.170343  chemotaxis protein CheR
BcerKBAB4_0907  0       14       -3.409951  GAF sensor hybrid histidine kinase
BcerKBAB4_0908  1       22       -2.322374  hypothetical protein
BcerKBAB4_0909  3       16       -0.620415  hypothetical protein
BcerKBAB4_0910  2       13       0.413235   hypothetical protein
BcerKBAB4_0911  0       14       0.935194   hypothetical protein
BcerKBAB4_0912  1       15       0.856787   hypothetical protein
BcerKBAB4_0913  1       13       -0.623701  hypothetical protein
BcerKBAB4_0914  1       12       -0.954635  major facilitator transporter
BcerKBAB4_0915  -1      13       -1.582385  GntR family transcriptional regulator
BcerKBAB4_0916  1       14       -2.217760  alcohol dehydrogenase
BcerKBAB4_0917  2       18       -4.023612  hypothetical protein
BcerKBAB4_0918  -1      19       -2.998023  histidine kinase
BcerKBAB4_0919  -2      16       -1.866069  two component LuxR family transcriptional
BcerKBAB4_0920  -1      24       2.212252   hypothetical protein
BcerKBAB4_0921  6       25       3.286937   hypothetical protein
BcerKBAB4_0922  6       21       3.303545   hypothetical protein
BcerKBAB4_0923  6       21       3.130683   hypothetical protein
BcerKBAB4_0924  4       17       2.586799   hypothetical protein
BcerKBAB4_0925  3       17       1.435684   metallophosphoesterase
BcerKBAB4_0926  3       14       -0.022526  hypothetical protein
BcerKBAB4_0927  2       17       -3.449141  3'-5' exoribonuclease YhaM
BcerKBAB4_0928  8       27       -5.785704  hypothetical protein
BcerKBAB4_0929  8       27       -6.142249  hypothetical protein
BcerKBAB4_0930  9       30       -9.066607  ankyrin repeat-containing protein
BcerKBAB4_0931  4       18       -2.966524  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0850  cloacin       25     0.008    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 25.4 bits (55), Expect = 0.008
Identities = 11/22 (50%), Positives = 12/22 (54%)

Query: 2 GYGGSCGGYGGSCGGGCGFGGG 23
G G GG G+ GGG G GG
Sbjct: 59 GSGHGNGGGNGNSGGGSGTGGN 80


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0864  PF05272       26     0.030    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 26.2 bits (57), Expect = 0.030
Identities = 11/44 (25%), Positives = 15/44 (34%), Gaps = 2/44 (4%)

Query: 60 FDGWSKFEK-RLPRDTK-IMKECLLDYNEEDRKDPAWKSEFLFN 101
F W E RL + ++K E R PA F+
Sbjct: 429 FGEWLDDEVARLRLRGRWLLKPRRAALIEALRSAPALAGCVAFD 472


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0867  TCRTETA       69     8e-15    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 68.7 bits (168), Expect = 8e-15
Identities = 70/390 (17%), Positives = 139/390 (35%), Gaps = 23/390 (5%)

Query: 1 MKKVNPLLILTLAIGVFGIITTEMGIVGVLPQITEKFGISTTQA---GFLVSIFALVVAI 57
MK PL+++ + + + I+ VLP + S G L++++AL+
Sbjct: 1 MKPNRPLIVILSTVALDAVGIGL--IMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFA 58

Query: 58 SGPFLILLVSSINRKIILLTAIFAFVISNIIYAYTTQFEIMLIFRILPAALHPLFFSIAL 117
P L L R+ +LL ++ + I A ++ I RI+ A + ++A
Sbjct: 59 CAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIV-AGITGATGAVAG 117

Query: 118 VTAAKLVPPEKSGQAVTKVFMGITVGFALGVPLTSYLADQFSLEIAFLFGALVNTLAFIG 177
A + ++ + + G G P+ L FS F A +N L F+
Sbjct: 118 AYIADITDGDERARHFGFMSACFGFGMVAG-PVLGGLMGGFSPHAPFFAAAALNGLNFLT 176

Query: 178 ILILLPSMPVKEKMSFGKQIRILGKPGLWLNILTVTFLFAAMFSVYSYFAEYLAKVTSM- 236
LLP E+ ++ W +TV A+F + + A + +
Sbjct: 177 GCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIF 236

Query: 237 -------NGSLISIMLFIFGIVMILGN-HLFGSLLQKSIVNTVILFPILYS-IVYILVYY 287
+ + I I L FGI+ L + G + + ++ ++ YIL+ +
Sbjct: 237 GEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAF 296

Query: 288 LGSYLVPMIFIVFIWGIVHAGGLIVGQTWL-ISEAKEAPEFGNSLFVSFSNLGITLGTTI 346
M F + + G+ Q L +E + ++L +G +
Sbjct: 297 ATRG--WMAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLL 354

Query: 347 GGWFISNLGIHQLIWSGFIFTLLSFLLIII 376
+ W+G+ + + L ++
Sbjct: 355 FTAIYA---ASITTWNGWAWIAGAALYLLC 381


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0874  HTHTETR       41     6e-07    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 41.2 bits (96), Expect = 6e-07
Identities = 26/179 (14%), Positives = 56/179 (31%), Gaps = 29/179 (16%)

Query: 8 KKIIANSLKYLMETESFHKISVSDIMLHCQMRRQTFYYHFKDKFELLSWIYKEETK---E 64
+ I+ +L+ L + S+ +I + R Y+HFKDK +L S I++ E
Sbjct: 14 QHILDVALR-LFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGE 72

Query: 65 NIIDFLD------YETWENIFDLLFDYFYEN-------------QKFYRNAFKVIE-QNS 104
+++ I + + +F V + Q +
Sbjct: 73 LELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRN 132

Query: 105 FNHYLFEHTKNLYMKIIDELSMSCGFSLSDETKNTIASFYSHGFVGTIKDWIESKCEVD 163
++ + I+ +D A G +++W+ + D
Sbjct: 133 LCLESYDRIEQTLKHCIEA-----KMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFD 186


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0876  ENTEROVIROMP  26     0.048    Enterobacterial virulence outer membrane protein si...
		>ENTEROVIROMP#Enterobacterial virulence outer membrane protein signature.

Length = 171

Score = 26.4 bits (58), Expect = 0.048
Identities = 22/90 (24%), Positives = 30/90 (33%), Gaps = 10/90 (11%)

Query: 69 GESAKGGYYLATVQSAYLNDSNDSIVVNVSVKNVRGQMIGLSELKYNLKDEKDGKAYEGK 128
G+ K YY T AY + SI V V + Q Y G +Y
Sbjct: 78 GDYNKNQYYGITAGPAYRINDWASIYGVVGVGYGKFQTTEYPT--YKHDTSDYGFSYG-- 133

Query: 129 VIDQNPSDIQVNPNETVELKIAFEVPGTTD 158
+ +Q NP E V L ++E
Sbjct: 134 ------AGLQFNPMENVALDFSYEQSRIRS 157


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0902  PF06580       27     0.040    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 26.8 bits (59), Expect = 0.040
Identities = 6/35 (17%), Positives = 13/35 (37%), Gaps = 2/35 (5%)

Query: 53 NIVQHAY--KEDVGEITIVFGLYEDRLEIMVADNG 85
N ++H G+I + + + V + G
Sbjct: 266 NGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTG 300


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0905  HTHFIS        83     1e-19    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 83.3 bits (206), Expect = 1e-19
Identities = 35/147 (23%), Positives = 74/147 (50%), Gaps = 12/147 (8%)

Query: 2 SILIVDDNPVNIFVIEKILKQAGYQDLVSLNSAQELFEYIHFGKDSSRHNEIDLILLDIM 61
+IL+ DD+ V+ + L +AGY D+ ++A L+ +I + DL++ D++
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGY-DVRITSNAATLWRWI-------AAGDGDLVVTDVV 56

Query: 62 MPEIDGLEVCRRLQNEEKFKDIPIIFVTALEDANKLAEALDIGAMDYITKPINKVELLAR 121
MP+ + ++ R++ D+P++ ++A +A + GA DY+ KP + EL+
Sbjct: 57 MPDENAFDLLPRIKKARP--DLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGI 114

Query: 122 MRVALRLKSELNWHKEQEENLRNELDL 148
+ AL + E++ ++ + L
Sbjct: 115 IGRALAEPKRR--PSKLEDDSQDGMPL 139


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0906  VACCYTOTOXIN  28     0.041    Helicobacter pylori vacuolating cytotoxin signature.
		>VACCYTOTOXIN#Helicobacter pylori vacuolating cytotoxin signature.

Length = 1291

Score = 28.5 bits (63), Expect = 0.041
Identities = 22/76 (28%), Positives = 30/76 (39%), Gaps = 4/76 (5%)

Query: 184 SNYYSTDNRFAYFNPSLLQN-IIFAQHNLVTDQSFNEFHIILCRNVLIYFTSKLQNQVQQ 242
+ YY D + Y N +LQ F N V S N F + RN L + +
Sbjct: 1201 ARYYYGDTSYFYMNAGVLQEFANFGSSNAV---SLNTFKVNATRNPLNTHARVMMGGELK 1257

Query: 243 LFYESLGHNGFLCLGN 258
L E + GF+ L N
Sbjct: 1258 LAKEVFLNLGFVYLHN 1273


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0907  HTHFIS        73     2e-15    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 72.9 bits (179), Expect = 2e-15
Identities = 26/107 (24%), Positives = 51/107 (47%), Gaps = 3/107 (2%)

Query: 777 TILIVDDDHRNIFALQNALEKQHANIITAQNGIECLEILKSNTNIDLILMDIMMPNMDGY 836
TIL+ DDD L AL + ++ N + + DL++ D++MP+ + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDG-DLVVTDVVMPDENAF 63

Query: 837 ETMEHIRMNLGLHEIPIIALTAKAMPNDKEKCLSAGASDYISKPLNL 883
+ + I+ ++P++ ++A+ K GA DY+ KP +L
Sbjct: 64 DLLPRIKK--ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDL 108


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0910  PF07132       29     0.015    Harpin protein (HrpN)
		>PF07132#Harpin protein (HrpN)

Length = 356

Score = 28.5 bits (63), Expect = 0.015
Identities = 22/79 (27%), Positives = 36/79 (45%), Gaps = 6/79 (7%)

Query: 51 SGEKVNSETAHKADIFSATGLVAGGVAGGLGGLLTGLGILAVSGMGPIVAAGPIAAAIGG 110
G + ++ +DI + + + GGLGG L GLG G ++ G G
Sbjct: 40 FGGQRSNIAEQLSDIMTTMMFMGSMMGGGLGGGLGGLGSSLGGLGGGLLGGG------LG 93

Query: 111 AGIGGGAGSLIGAFIGLGI 129
G+G GS +G+ +G G+
Sbjct: 94 GGLGSSLGSGLGSALGGGL 112


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0914  TCRTETA       40     1e-05    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 40.2 bits (94), Expect = 1e-05
Identities = 62/345 (17%), Positives = 112/345 (32%), Gaps = 23/345 (6%)

Query: 25 IGITSVSPLLETIRQDLNISNFSVS---FLTAIPVFCMGTFALLTGKVIKKYGAERAIMT 81
+GI + P+L + +DL SN + L A+ A + G + ++G R +
Sbjct: 19 VGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFG--RRPVL 76

Query: 82 CLILIGFAT--CMRAFTSSISTLFASSLFIGIGIALAGPLLSGFIKEKFPTK-----IGL 134
+ L G A + A + L+ + GI A G + +I + G
Sbjct: 77 LVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGA-TGAVAGAYIADITDGDERARHFGF 135

Query: 135 MIGIYSVGMGTGASLSAGLTIPLQHVLKDDWNMALAFWGVLTIIAIIFWYPVMKRKKNTS 194
M + GM G L GL + A A G+ + K ++
Sbjct: 136 MSACFGFGMVAGPVL-GGLMGGFSPHAP--FFAAAALNGLNFLTGCFLLPESHKGERRPL 192

Query: 195 TQNKKNNSLPLRNKK-----AWLFTIFFGLQSGIFYSITTWLAPANQNMGVSSEQAGTLI 249
+ N R + A L +FF +Q W+ + G +
Sbjct: 193 RREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISL 252

Query: 250 TVFTVVQMIC-SFLIPTLADIYKNRALWLLGSICFVLVGLSLMIYPLTTPWIPSILLGIG 308
F ++ + + + +A R +LG I G L+ + I++ +
Sbjct: 253 AAFGILHSLAQAMITGPVAARLGERRALMLGMIADG-TGYILLAFATRGWMAFPIMVLLA 311

Query: 309 LGGVFPLALMLPLYETKTSEDASAWTAMMQSGGYIMGGFIPVLAG 353
GG+ AL L E + + + P+L
Sbjct: 312 SGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFT 356



Score = 32.9 bits (75), Expect = 0.002
Identities = 29/150 (19%), Positives = 57/150 (38%), Gaps = 9/150 (6%)

Query: 246 GTLITVFTVVQMICSFLIPTLADIYKNRALWLLGSICFVLVGLSLMIYP-LTTPWIPSIL 304
G L+ ++ ++Q C+ ++ L+D + R + L+ + + P L +I I+
Sbjct: 46 GILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIV 105

Query: 305 LGIGLGGVFPLALMLPLYETKTSEDASAWTAMMQSGGYIMGGFI--PVLAGIARDYFNSY 362
GI G +A + + ++ + M G + PVL G+ + S
Sbjct: 106 AGIT-GATGAVAGAY-IADITDGDERARHFGFM--SACFGFGMVAGPVLGGLMGGF--SP 159

Query: 363 TQVFIIMALLSFILFLLTLVMNKRKRNAED 392
F A L+ + FL + E
Sbjct: 160 HAPFFAAAALNGLNFLTGCFLLPESHKGER 189


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0918  PF06580       34     6e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 34.5 bits (79), Expect = 6e-04
Identities = 26/123 (21%), Positives = 46/123 (37%), Gaps = 15/123 (12%)

Query: 279 RFSEATSIHVRFTIQSPPHISS-LVKEHCLYVISECLTNIAKH---SQATDVNLKVEYID 334
+F + ++F Q P I V + + E N KH + ++
Sbjct: 235 QFED----RLQFENQINPAIMDVQVPPMLVQTLVE---NGIKHGIAQLPQGGKILLKGTK 287

Query: 335 DLEKLTIEVEDNGIGFDTRYIGKNPGHYGLIGLNERVRLIKGEIHIL--SEKMKGTKVYI 392
D +T+EVE+ G K GL + ER++++ G + SEK +
Sbjct: 288 DNGTVTLEVENTGSLALKN--TKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMV 345

Query: 393 QVP 395
+P
Sbjct: 346 LIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0919  HTHFIS        79     2e-19    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 79.1 bits (195), Expect = 2e-19
Identities = 24/116 (20%), Positives = 50/116 (43%), Gaps = 2/116 (1%)

Query: 5 VLIVDDHFVVREGLKLIIETSDSFQIIGEAANGEEALSFIEKKKPDVILMDLNMPKMSGL 64
+L+ DD +R L + + + + +N +I D+++ D+ MP +
Sbjct: 6 ILVADDDAAIRTVLNQALSRAG-YDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 65 ETIEALNKKQNHTPIIILTTYNEDELMLKGIELGAKGYLLKDTDRENLFRTLEAAI 120
+ + + K + P+++++ N +K E GA YL K D L + A+
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0926  GPOSANCHOR    40     4e-05    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 40.0 bits (93), Expect = 4e-05
Identities = 32/256 (12%), Positives = 79/256 (30%), Gaps = 14/256 (5%)

Query: 187 MKKIEEKMKEWQGKIGTYEKQVEQLKESEEKLASVRAEKESAERRKQDYEILVALEPLVI 246
+++ ++ + + ++ +++E E + A + E A +
Sbjct: 91 TEELSNAKEKLRKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEK 150

Query: 247 EKSTHEKVLENENGQFPVNGMARYEAVKAKIEPLQVQVDSLQKKIETV---------QSE 297
K + + +N A +E + +++ Q ++E
Sbjct: 151 AALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADS 210

Query: 298 IDSTEIDEAFLQKESYVEELRMQHMSYENARQEMRD----MTGSIANIKEEIAELQQQIG 353
++ + +L N + A ++ AEL++ +
Sbjct: 211 AKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALE 270

Query: 354 ATFEQETVLSFDISLATKELIMQTVQKARELESQKAQLDERFKTAQEQLEEQEENIRQIS 413
T S I E + +LE Q L+ ++ + L+ E +Q+
Sbjct: 271 GAMNFSTADSAKIKTLEAE-KAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLE 329

Query: 414 QQMLADEEKNTLVEKE 429
+ EE+N + E
Sbjct: 330 AEHQKLEEQNKISEAS 345



Score = 33.1 bits (75), Expect = 0.006
Identities = 39/293 (13%), Positives = 96/293 (32%), Gaps = 17/293 (5%)

Query: 153 SDALLQLDKKLEKEMNQRFKPSGRNPEINVSLQEMKKIEEKMKEWQGKIGTYEKQVEQLK 212
S + L+ + ++ + ++ + + + ++E+
Sbjct: 140 SAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKAL 199

Query: 213 ESEEKLASVRAEKESAERRKQDYEILVALEPLVIEKSTHEKVLENENGQFPVNGMARYEA 272
E ++ + K + L A + + + + A A
Sbjct: 200 EGAMNFSTADSAKIKTLEA--EKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAA 257

Query: 273 VKAKIEPLQVQVDSLQKKIETVQSEID--STEIDEAFLQKESYVEELRMQHMSYENARQE 330
++A+ L+ ++ ++I E +K + ++ + + ++ R++
Sbjct: 258 LEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRD 317

Query: 331 MRDMTGSIANIKEEIAELQQQIGATFEQETVLSFDI--SLATKELIMQTVQKARELESQ- 387
+ + ++ E +L++Q + L D+ S K+ + QK E
Sbjct: 318 LDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQNKIS 377

Query: 388 -------KAQLD---ERFKTAQEQLEEQEENIRQISQQMLADEEKNTLVEKEK 430
+ LD E K ++ LEE + + + EE L EKEK
Sbjct: 378 EASRQSLRRDLDASREAKKQVEKALEEANSKLAALEKLNKELEESKKLTEKEK 430


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0927  MICOLLPTASE   30     0.010    Microbial collagenase metalloprotease (M9) signature.
		>MICOLLPTASE#Microbial collagenase metalloprotease (M9) signature.

Length = 1104

Score = 30.5 bits (68), Expect = 0.010
Identities = 20/108 (18%), Positives = 40/108 (37%), Gaps = 7/108 (6%)

Query: 20 IKTATKGLASNGKPFLTVILQDPSGDIEAKLWDV-------SPEVEKQYVAETIVKVAGD 72
IK+ + + F +D G+I+A WD + +Y +V
Sbjct: 779 IKSDSSVIVEEEINFDGTESKDEDGEIKAYEWDFGDGEKSNEAKATHKYNKTGEYEVKLT 838

Query: 73 ILNYKGRIQLRVKQIRVANENEVTDISDFVEKAPIKKEDMVEKITQYI 120
+ + G I K+I+V + V I++ +K + + K +
Sbjct: 839 VTDNNGGINTESKKIKVVEDKPVEVINESEPNNDFEKANQIAKSNMLV 886


6  BcerKBAB4_0945  BcerKBAB4_0960  Y  N  N  Genomic Island
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_0945  0       22       3.062133   XRE family transcriptional regulator
BcerKBAB4_0946  0       18       2.339493   glycerol-3-phosphate responsive antiterminator
BcerKBAB4_0947  -2      17       1.849559   MIP family channel protein
BcerKBAB4_0948  -1      16       1.603890   glycerol kinase
BcerKBAB4_0949  -2      12       0.157086   FAD dependent oxidoreductase
BcerKBAB4_0950  -1      17       -0.005732  PadR-like family transcriptional regulator
BcerKBAB4_0951  1       16       -0.004131  teicoplanin resistance protein VanZ
BcerKBAB4_0953  2       18       0.709465   RNA polymerase factor sigma C
BcerKBAB4_0954  2       18       0.910835   lineage-specific thermal regulator protein
BcerKBAB4_0955  3       19       0.631793   cell cycle protein FtsW
BcerKBAB4_0956  7       24       0.997071   UvrD/REP helicase
BcerKBAB4_0957  7       26       -0.821913  peptidyl-prolyl isomerase
BcerKBAB4_0958  3       20       0.146080   hypothetical protein
BcerKBAB4_0959  3       18       0.371849   hypothetical protein
BcerKBAB4_0960  2       19       1.025406   transcriptional regulator Hpr
7  BcerKBAB4_0970  BcerKBAB4_0982  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_0970  2       16       -1.868710  hypothetical protein
BcerKBAB4_0971  0       16       -0.209043  hypothetical protein
BcerKBAB4_0972  -1      18       0.614498   hypothetical protein
BcerKBAB4_0973  -2      16       -0.431199  hypothetical protein
BcerKBAB4_0974  -2      14       -0.682290  TetR family transcriptional regulator
BcerKBAB4_0975  -2      13       -0.430694  hypothetical protein
BcerKBAB4_0976  -2      12       -0.476296  membrane-flanked domain-containing protein
BcerKBAB4_0977  4       17       -1.140043  membrane-flanked domain-containing protein
BcerKBAB4_0978  5       17       -1.470004  hypothetical protein
BcerKBAB4_0979  6       17       -1.676598  hypothetical protein
BcerKBAB4_0980  6       17       -1.338718  hypothetical protein
BcerKBAB4_0981  4       14       -1.002152  TetR family transcriptional regulator
BcerKBAB4_0982  4       14       -0.768741  cell wall anchor domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0974  HTHTETR       75     7e-19    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 74.7 bits (183), Expect = 7e-19
Identities = 31/160 (19%), Positives = 65/160 (40%), Gaps = 5/160 (3%)

Query: 6 QTSQNIVEASFKLMAEHGIEKMSLSMIAKEVGISKPAIYYHFSSKEALVDFLFEEIFS-- 63
+T Q+I++ + +L ++ G+ SL IAK G+++ AIY+HF K L ++E S
Sbjct: 11 ETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNI 70

Query: 64 GYHFASYFDKEQYTKENFAEKLIADGLHMLSEYEGQEGILRVINEFIVTASRNEKYQKRL 123
G Y K + +++ L E + ++ +I Q+
Sbjct: 71 GELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQ 130

Query: 124 FEIQEDFLNGFHDLLKKGVELDVVSQQATEENAHTLALVI 163
+ + + LK +E ++ + A+++
Sbjct: 131 RNLCLESYDRIEQTLKHCIEAKML---PADLMTRRAAIIM 167


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0975  FLGMRINGFLIF  27     0.003    Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 26.9 bits (59), Expect = 0.003
Identities = 8/33 (24%), Positives = 15/33 (45%)

Query: 16 YKIPGMIEAFQADKGWLALISLVWLLWFGYFIP 48
++ I+ A WL ++ + W+LW P
Sbjct: 449 WQQQSFIDQLLAAGRWLLVLVVAWILWRKAVRP 481


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0977  60KDINNERMP   29     0.041    60kDa inner membrane protein signature.
		>60KDINNERMP#60kDa inner membrane protein signature.

Length = 548

Score = 29.1 bits (65), Expect = 0.041
Identities = 14/76 (18%), Positives = 29/76 (38%), Gaps = 24/76 (31%)

Query: 15 KITDFIPLLIFMFSLNGKFPFWYLIPAGFGLLTIFSAFEKWYYTTYWVENNVLHVKQGLF 74
KI F+P++ +F L P+G L Y++ +N++ + Q
Sbjct: 493 KIMTFMPVIFTVFFLW--------FPSGLVL--------------YYIVSNLVTIIQQQL 530

Query: 75 VKKESYLNKERVQTIN 90
+ + L K + +
Sbjct: 531 IYRG--LEKRGLHSRE 544


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0981  HTHTETR       57     1e-12    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 57.3 bits (138), Expect = 1e-12
Identities = 26/86 (30%), Positives = 39/86 (45%), Gaps = 4/86 (4%)

Query: 6 TRQKILAAASQIVQFKGVAKLTLEAVAKEAGVSKGGLLYHFSNKEALIEGMILKGTEEYH 65
TRQ IL A ++ +GV+ +L +AK AGV++G + +HF +K L + E
Sbjct: 12 TRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIW----ELSE 67

Query: 66 GAIHNRVTEDTEKKGRWIRSFVEERL 91
I E K S + E L
Sbjct: 68 SNIGELELEYQAKFPGDPLSVLREIL 93


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0982  TONBPROTEIN   40     7e-05    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 39.6 bits (92), Expect = 7e-05
Identities = 21/79 (26%), Positives = 34/79 (43%), Gaps = 7/79 (8%)

Query: 2038 LAPPGPEKPDPEKPEKPDPEKPEKPDPEKPGTTDPEKPGTTDPEKPETTDPEKPGTTDPE 2097
L PP +P PE +P+PE P+P E P + KP+ KP E
Sbjct: 55 LEPPQAVQPPPEPVVEPEPEPEPIPEPP------KEAPVVIEKPKPKPKPKPKPVKKVQE 108

Query: 2098 KPEKELPKTGQKMPVEPYM 2116
+P++++ + P P+
Sbjct: 109 QPKRDVKPVESR-PASPFE 126


8  BcerKBAB4_1031  BcerKBAB4_1082  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_1031  2       15       2.861504   S-layer protein
BcerKBAB4_1032  3       14       2.548156   malate synthase
BcerKBAB4_1033  2       16       1.592003   isocitrate lyase
BcerKBAB4_1034  2       13       -0.761516  hypothetical protein
BcerKBAB4_1035  4       15       1.913025   cold-shock DNA-binding domain-containing
BcerKBAB4_1036  5       14       2.596836   hypothetical protein
BcerKBAB4_1037  5       14       2.769593   ComK family protein
BcerKBAB4_1038  4       14       3.078603   hypothetical protein
BcerKBAB4_1039  5       15       3.268942   signal peptidase I
BcerKBAB4_1040  5       14       3.220342   ATP-dependent nuclease subunit AddB
BcerKBAB4_1041  5       14       3.225606   recombination helicase AddA
BcerKBAB4_1042  1       20       1.386103   hypothetical protein
BcerKBAB4_1043  1       19       0.913594   hypothetical protein
BcerKBAB4_1044  4       21       0.639632   spore germination protein PF
BcerKBAB4_1045  2       22       0.043357   spore germination protein PE
BcerKBAB4_1046  2       14       0.707808   spore germination protein PD
BcerKBAB4_1047  0       15       0.121592   spore germination protein PC
BcerKBAB4_1048  -3      18       2.005246   spore germination protein GerPB
BcerKBAB4_1049  -2      19       2.425786   spore germination protein GerPA
BcerKBAB4_1050  -2      19       1.382514   putative stage 0 sporulation regulatory protein
BcerKBAB4_1051  -2      15       1.839898   5-carboxymethyl-2-hydroxymuconate
BcerKBAB4_1052  -1      21       2.731247   hypothetical protein
BcerKBAB4_1053  -1      19       3.748027   ornithine--oxo-acid transaminase
BcerKBAB4_1054  -1      14       2.501848   hypothetical protein
BcerKBAB4_1055  1       16       2.264714   hypothetical protein
BcerKBAB4_1056  1       15       2.676237   asparagine synthase
BcerKBAB4_1058  2       17       2.316651   catalase
BcerKBAB4_1059  1       15       1.238073   ammonium transporter
BcerKBAB4_1060  3       15       -1.068235  alpha amylase
BcerKBAB4_1061  2       16       -0.632064  putative nucleotide-binding protein
BcerKBAB4_1062  -1      16       -1.163777  RNA-binding S1 domain-containing protein
BcerKBAB4_1063  0       19       -2.823519  hypothetical protein
BcerKBAB4_1064  -1      19       -2.955772  fatty acid desaturase type 2
BcerKBAB4_1065  0       23       -4.527875  CarD family transcriptional regulator
BcerKBAB4_1066  3       21       -4.147546  hypothetical protein
BcerKBAB4_1067  -2      16       -3.121861  hypothetical protein
BcerKBAB4_1068  -1      16       -2.280756  hypothetical protein
BcerKBAB4_1069  0       16       -1.401813  hypothetical protein
BcerKBAB4_1070  0       17       -0.397247  peptidyl-prolyl isomerase
BcerKBAB4_1071  1       20       1.103346   hypothetical protein
BcerKBAB4_1072  1       15       2.019935   hypothetical protein
BcerKBAB4_1073  4       17       2.860088   hypothetical protein
BcerKBAB4_1074  3       16       2.079651   hypothetical protein
BcerKBAB4_1075  2       12       1.195653   cof family hydrolase
BcerKBAB4_1076  1       13       1.220279   hypothetical protein
BcerKBAB4_1077  1       13       1.781414   ATPase
BcerKBAB4_1078  -1      14       2.412673   hypothetical protein
BcerKBAB4_1079  -2      12       1.788819   hydrolase
BcerKBAB4_1080  -1      13       1.987342   NAD-dependent epimerase/dehydratase
BcerKBAB4_1081  -1      14       3.611795   hypothetical protein
BcerKBAB4_1082  -2      14       3.367312   3-oxoacyl-ACP synthase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1047  RTXTOXIND     29     0.013    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 29.0 bits (65), Expect = 0.013
Identities = 5/33 (15%), Positives = 11/33 (33%)

Query: 9 LHQLQQALQIQQQTILNLEEQVRLLQEELNELK 41
L + + I E R+ + L++
Sbjct: 209 LDKKRAERLTVLARINRYENLSRVEKSRLDDFS 241


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1068  PF06580       26     0.017    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 26.4 bits (58), Expect = 0.017
Identities = 10/48 (20%), Positives = 21/48 (43%), Gaps = 3/48 (6%)

Query: 44 EKMDMMISLVTTYMRIE-SGSTEELEALQEEIIHAQAY--IQKRKFEE 88
K M++ ++ MR S +L +E+ +Y + +FE+
Sbjct: 191 TKAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFED 238


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1077  GPOSANCHOR    37     2e-04    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 37.4 bits (86), Expect = 2e-04
Identities = 33/118 (27%), Positives = 58/118 (49%), Gaps = 5/118 (4%)

Query: 405 RTEIDSMPTELDEVTRRIMQLEIEEAALGKEKDLGSQERLKTLQRELSDLKEVASSMRAK 464
S+ +LD QLE E L ++ + R ++L+R+L +E + A+
Sbjct: 308 NANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASR-QSLRRDLDASREAKKQLEAE 366

Query: 465 WEKEKEDIHKVRDLREHLERLRRELEEA-EGNYDLNKAAELRHGKIPAIEKELKEAEE 521
+K +E +K+ + + LRR+L+ + E + KA E + K+ A+EK KE EE
Sbjct: 367 HQKLEEQ-NKISEAS--RQSLRRDLDASREAKKQVEKALEEANSKLAALEKLNKELEE 421


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1080  NUCEPIMERASE  49     1e-08    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 48.6 bits (116), Expect = 1e-08
Identities = 49/233 (21%), Positives = 93/233 (39%), Gaps = 44/233 (18%)

Query: 12 RVLIIGALTFVGYHLVNKMIAEEVEVYGLD-FDEFDSMTKINEEKLLLIGRNALFTYYS- 69
+ L+ GA F+G+H+ +++ +V G+D +++ + + + +L L+ + F ++
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDV-SLKQARLELLAQ-PGFQFHKI 59

Query: 70 -IRDEDGWRSV-EEESFDTVYFCLYEPNQ--QSGFR---------NERVILQYLKRIVRM 116
+ D +G + F+ V+ + R + + +L I+
Sbjct: 60 DLADREGMTDLFASGHFERVF------ISPHRLAVRYSLENPHAYADSNLTGFLN-ILEG 112

Query: 117 CEKNKVK-LNLISSIEI-GS------TEESENKHLFSKVEEGLKKGEL---QYSA-YRVP 164
C NK++ L SS + G + + H S K EL YS Y +P
Sbjct: 113 CRHNKIQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLP 172

Query: 165 -------TLYGPWQPSFMMYHQLILSELHEKECRCIN-GEKGSDLLYVEDVCE 209
T+YGPW M + + L K N G+ D Y++D+ E
Sbjct: 173 ATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAE 225


9  BcerKBAB4_1118  BcerKBAB4_1128  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_1118  2       21       -0.483266  glycosyl transferase family protein
BcerKBAB4_1119  4       19       -0.300267  methyltransferase type 12
BcerKBAB4_1120  3       21       0.054891   hypothetical protein
BcerKBAB4_1121  0       16       0.550578   streptomycin biosynthesis StrF domain-containing
BcerKBAB4_1122  0       17       0.297412   nucleotidyl transferase
BcerKBAB4_1123  -1      13       0.258581   dTDP-4-dehydrorhamnose 3,5-epimerase
BcerKBAB4_1124  -1      15       0.300669   dTDP-glucose 4,6-dehydratase
BcerKBAB4_1125  1       18       0.799918   dTDP-4-dehydrorhamnose reductase
BcerKBAB4_1126  4       18       1.675764   enoyl-(acyl carrier protein) reductase
BcerKBAB4_1127  7       20       1.371133   hypothetical protein
BcerKBAB4_1128  2       15       1.029662   spore coat protein Z
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1124  NUCEPIMERASE  195    2e-62    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 195 bits (498), Expect = 2e-62
Identities = 78/335 (23%), Positives = 142/335 (42%), Gaps = 26/335 (7%)

Query: 1 MNILVTGGAGFIGSNFIHYMLKNYETYKIINYDALT--YSGNLNNVK-SIQENPNYSFVK 57
M LVTG AGFIG + +L+ ++ D L Y +L + + P + F K
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQ--VVGIDNLNDYYDVSLKQARLELLAQPGFQFHK 58

Query: 58 GKIQNGELLEHVVKECDVQVIVNFAAESHVDRSIENPIPFYDTNVIGTVTLLELVKKYSH 117
+ + E + + + + V S+ENP + D+N+ G + +LE +
Sbjct: 59 IDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKI 118

Query: 118 IKLVQVSTDEVYGSLGKTGKFTEETPLA-PNSPYSSSKASSDMIALSYYETYQLPVIITR 176
L+ S+ VYG L + F+ + + P S Y+++K +++++A +Y Y LP R
Sbjct: 119 QHLLYASSSSVYG-LNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLR 177

Query: 177 CSNNYGPYQYPEKLIPLMVTNALEGKKLPLYGDGLNVRDWLHVTDHCSAIDTVLHKGCV- 235
YGP+ P+ + LEGK + +Y G RD+ ++ D AI +
Sbjct: 178 FFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIPHA 237

Query: 236 -----------------GEVYNIGGNNEKTNIDVVEQIIKILGKTKKDIEFVTDRLGHDR 278
VYNIG ++ +D ++ + LG + + + G
Sbjct: 238 DTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGI-EAKKNMLPLQPGDVL 296

Query: 279 RYAIDAQKMKNEFEWEPKYTFEQGLKETVEWYKNN 313
+ D + + + P+ T + G+K V WY++
Sbjct: 297 ETSADTKALYEVIGFTPETTVKDGVKNFVNWYRDF 331


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1125  NUCEPIMERASE  45     1e-07    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 45.2 bits (107), Expect = 1e-07
Identities = 44/237 (18%), Positives = 82/237 (34%), Gaps = 44/237 (18%)

Query: 4 RIIITGANGQLGKQLQEELNSEEYDIYPFDK--------------KLL----------DV 39
+ ++TGA G +G + + L + + D +LL D+
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 40 TNISRIKQVVQEIKPHTIVHCAAYTKVDGAEKEQDLAYL-INAIGARNVAVASQLVGAK- 97
+ + + + V + E AY N G N+ + +
Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYS-LENPHAYADSNLTGFLNILEGCRHNKIQH 120

Query: 98 LVYVSTDYVFPGDKPDGYHEFHNPA-PINIYGASKFAGEQFVKELHNKYFIVRTSW---- 152
L+Y S+ V+ ++ + + P+++Y A+K A E + Y + T
Sbjct: 121 LLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFFT 180

Query: 153 LYGKYGN------NFVKTMLRLGKERENISVVAD--QVGSPTYVADLITVINKLIHT 201
+YG +G F K ML E ++I V TY+ D+ I +L
Sbjct: 181 VYGPWGRPDMALFKFTKAML----EGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDV 233


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1126  DHBDHDRGNASE  62     2e-13    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 61.6 bits (149), Expect = 2e-13
Identities = 60/259 (23%), Positives = 106/259 (40%), Gaps = 19/259 (7%)

Query: 4 LQGKTFVVMGVANQRSIAWGIARSLHNAGAKLI-FTYAGERLERNVRELAETLEGQESLV 62
++GK + G A + I +AR+L + GA + Y E+LE+ V + E + +
Sbjct: 6 IEGKIAFITGAA--QGIGEAVARTLASQGAHIAAVDYNPEKLEKVVS--SLKAEARHAEA 61

Query: 63 LPCDVTNDEELTACFETIKQEVGTIHGVAHCIAFANRDDLKGEFVDTSRDGFLLAQNISA 122
P DV + + I++E+G I + + G S + + ++++
Sbjct: 62 FPADVRDSAAIDEITARIEREMGPIDILVNVAGVLR----PGLIHSLSDEEWEATFSVNS 117

Query: 123 FSLTAVAREAKKVMT--EGGNILTLTYLGGERVVKNYNVMGVAKASLEASVKYLANDLGQ 180
+ +R K M G+I+T+ + +KA+ K L +L +
Sbjct: 118 TGVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAE 177

Query: 181 HGIRVNAISAGPIRT-----LSAKGVGDFNSILKEIEE---RAPLRRATTPEEVGDTAVF 232
+ IR N +S G T L A G I +E PL++ P ++ D +F
Sbjct: 178 YNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLF 237

Query: 233 LFSDLARGVTGENIHVDSG 251
L S A +T N+ VD G
Sbjct: 238 LVSGQAGHITMHNLCVDGG 256


10  BcerKBAB4_1161  BcerKBAB4_1195  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_1161  -2      18       3.229881   methyltransferase type 11
BcerKBAB4_1162  -1      19       3.255859   hypothetical protein
BcerKBAB4_1163  -1      19       3.308620   hypothetical protein
BcerKBAB4_1164  0       22       2.774992   group-specific protein
BcerKBAB4_1165  2       25       2.670978   dihydrolipoamide succinyltransferase
BcerKBAB4_1166  1       20       1.751600   2-oxoglutarate dehydrogenase E1 component
BcerKBAB4_1167  -2      18       -3.938977  XRE family transcriptional regulator
BcerKBAB4_1168  -1      18       -4.087251  hypothetical protein
BcerKBAB4_1169  1       19       -3.694821  hypothetical protein
BcerKBAB4_1171  1       20       -3.642447  hypothetical protein
BcerKBAB4_1172  3       20       -3.673421  hypothetical protein
BcerKBAB4_1173  7       23       -4.141108  *hypothetical protein
BcerKBAB4_1174  7       22       -3.968806  helix-turn-helix domain-containing protein
BcerKBAB4_1175  7       22       -4.014269  HNH endonuclease
BcerKBAB4_1176  7       22       -4.028681  hypothetical protein
BcerKBAB4_1177  7       23       -3.518324  restriction endonuclease-like protein
BcerKBAB4_1178  6       24       -3.513897  hypothetical protein
BcerKBAB4_1179  5       21       -3.862958  phosphatidylserine/phosphatidylglycerophosphate/
BcerKBAB4_1181  4       20       -4.235771  hypothetical protein
BcerKBAB4_1182  4       22       -3.023736  hypothetical protein
BcerKBAB4_1183  2       20       -2.972135  resolvase domain-containing protein
BcerKBAB4_1184  3       21       -3.196941  hypothetical protein
BcerKBAB4_1185  -2      23       -2.004367  hypothetical protein
BcerKBAB4_1186  -2      21       -0.749855  hypothetical protein
BcerKBAB4_1187  -2      20       -0.162763  integrase family protein
BcerKBAB4_1188  -1      21       -0.433146  *hypothetical protein
BcerKBAB4_1189  -1      18       -0.651799  hypothetical protein
BcerKBAB4_1190  3       19       -0.711138  serine-type D-Ala-D-Ala carboxypeptidase
BcerKBAB4_1191  6       15       -0.860016  peptidase S26B, signal peptidase
BcerKBAB4_1192  7       16       -0.839635  camelysin
BcerKBAB4_1193  9       21       0.185265   transposase IS3/IS911 family protein
BcerKBAB4_1194  2       18       2.052866   integrase catalytic region
BcerKBAB4_1195  2       16       2.625318   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1165  RTXTOXIND     29     0.026    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 29.4 bits (66), Expect = 0.026
Identities = 33/198 (16%), Positives = 65/198 (32%), Gaps = 24/198 (12%)

Query: 52 SGIVSKLLGEPGDTVEVGATIAILDANGAAAAVSTPAPPAEQPKQETTEAPKAAAPSAEQ 111
+ IV +++ + G++V G + L A GA A Q + E + + +
Sbjct: 104 NSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLE--QTRYQILSRSIE 161

Query: 112 NKALQGLPNTNRPIASPAARKMARELGIDLNEVRSTDPLGRVRPHDVQAHAAAPKEAPAA 171
L L + P + + L + E ST + KE
Sbjct: 162 LNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFST-----------WQNQKYQKELNLD 210

Query: 172 PKQSP-----APVAKTEFEKPVERVKMSRRRQTIAK------RLVEVQQTSAMLTTFNEV 220
K++ A + + E VE+ ++ + K ++E + V
Sbjct: 211 KKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRV 270

Query: 221 DMSAIMELRKERKDAFEK 238
S + ++ E A E+
Sbjct: 271 YKSQLEQIESEILSAKEE 288


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1182  BACSURFANTGN  28     0.045    Yersinia/Haemophilus virulence surface antigen sign...
		>BACSURFANTGN#Yersinia/Haemophilus virulence surface antigen signature.

Length = 322

Score = 28.1 bits (62), Expect = 0.045
Identities = 16/75 (21%), Positives = 23/75 (30%), Gaps = 4/75 (5%)

Query: 26 ITETQDKAIKTIDSLTQSGDWNQYDDFRAISAFTMKFLDTAKYDLKEVIDGIHTEIKEYH 85
I Q LT G NQ R+ F+M + L + + + Y
Sbjct: 55 IKHNQSGRSMLDRKLTSDGKANQ----RSSFTFSMIMYRMIHFVLSTRVPAVRESVANYG 110

Query: 86 TKILFSLAKTAREIH 100
I F A+T
Sbjct: 111 GNINFKFAQTKGAFL 125


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1195  IGASERPTASE   40     7e-06    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 40.4 bits (94), Expect = 7e-06
Identities = 19/123 (15%), Positives = 50/123 (40%)

Query: 167 DKQIQAFEKSLVDETAKQVSEAEKKKKLEESKKQEEAKKLEESKKQEEAKKLEESKKQEE 226
+ A E + Q +E + + + E K+ +K+E+AK E ++
Sbjct: 1064 QNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVP 1123

Query: 227 AKKLEESKKQEEAKKLEESKKQEEAKKLEESKKQEEAKKLEESKKQEEAKKLEESKKQEE 286
+ S KQE+++ ++ + + K+ +++ + ++ AK+ + +Q
Sbjct: 1124 KVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPV 1183

Query: 287 QKE 289
+
Sbjct: 1184 TES 1186



Score = 37.7 bits (87), Expect = 4e-05
Identities = 22/127 (17%), Positives = 51/127 (40%), Gaps = 2/127 (1%)

Query: 165 EIDKQIQAFEKSLVDET-AKQVSEAEKKKKLEESKKQEEAKKL-EESKKQEEAKKLEESK 222
E+ + +++ ET E E+K K+E K QE K + S KQE+++ ++
Sbjct: 1084 EVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQA 1143

Query: 223 KQEEAKKLEESKKQEEAKKLEESKKQEEAKKLEESKKQEEAKKLEESKKQEEAKKLEESK 282
+ + K+ +++ + ++ AK+ + +Q + + + E +
Sbjct: 1144 EPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTT 1203

Query: 283 KQEEQKE 289
Q
Sbjct: 1204 PATTQPT 1210


11  BcerKBAB4_1529  BcerKBAB4_1538  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_1529  -1      12       -4.010893  hypothetical protein
BcerKBAB4_1530  0       13       -4.721492  hypothetical protein
BcerKBAB4_1531  -2      14       -3.854293  hypothetical protein
BcerKBAB4_1532  -2      15       -3.479864  nitroreductase
BcerKBAB4_1533  1       17       -4.101318  RNA polymerase factor sigma-70
BcerKBAB4_1534  0       18       -3.398637  hypothetical protein
BcerKBAB4_1535  0       21       -2.113420  hypothetical protein
BcerKBAB4_1536  0       20       -1.696553  major facilitator transporter
BcerKBAB4_1537  3       18       -2.441963  ArsR family transcriptional regulator
BcerKBAB4_1538  2       17       -2.816138  major facilitator transporter
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1536  TCRTETA       54     3e-10    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 54.4 bits (131), Expect = 3e-10
Identities = 50/258 (19%), Positives = 101/258 (39%), Gaps = 9/258 (3%)

Query: 56 WSGSIVDRLNKRSIMLITDIIRAALIGCIPLFDSIWVIYIFIFLTRIATSFFDPASFTYK 115
G++ DR +R ++L++ A + +WV+YI + I + + Y
Sbjct: 62 VLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATG-AVAGAYI 120

Query: 116 TMLIRAEERAQFNAWNNFCTSGAFIIGPALAGILLTTYSASFVIYCNSLSFLLSTILIYF 175
+ +ERA+ + + C + GP L G L+ +S + + L+ + F
Sbjct: 121 ADITDGDERARHFGFMSACFGFGMVAGPVLGG-LMGGFSPHAPFFAAAALNGLNFLTGCF 179

Query: 176 LPNITLQTKQNEEVSNTFLQTLRSDWKQVFSFARTETYIILIFVLFQATMLVSMALDSQE 235
L + + ++ L+ + F +AR T + + +F LV +
Sbjct: 180 LLPESHKGERRP------LRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALW 233

Query: 236 VVFTNQVLFLSDIDYSMLVSITGAAY-VFGSFLVSLFAKRLPIQHCIGLGMIFTAIGYVI 294
V+F + ++ G + + + + A RL + + LGMI GY++
Sbjct: 234 VIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYIL 293

Query: 295 FAFSNSFIVAASGFILLG 312
AF+ +A +LL
Sbjct: 294 LAFATRGWMAFPIMVLLA 311


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1538  TCRTETA       48     4e-08    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 47.9 bits (114), Expect = 4e-08
Identities = 52/292 (17%), Positives = 110/292 (37%), Gaps = 17/292 (5%)

Query: 59 LPQLLLSPFIGGIVDRFSKKKIMIFTDILRGIIVLTYLLAFYK-IEIIFVSNICLAVLSC 117
L Q +P +G + DRF ++ +++ + L G V ++A + ++++ I +A ++
Sbjct: 54 LMQFACAPVLGALSDRFGRRPVLLVS--LAGAAVDYAIMATAPFLWVLYIGRI-VAGITG 110

Query: 118 LFEPAKQSTLKNIVHQKHLVTANSLSSTINGFMSIMGASLGGLIAQ---SLSIEIAFFIN 174
+ + +I S GF + G LGGL+ A +N
Sbjct: 111 ATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALN 170

Query: 175 SLSYFISAYIIYKIKIPSRDTFSTKKAFFTDIKDGYTYILQRKIILSLILVGISWGIIGG 234
L++ +++ + R + + + + ++ +L+ V ++G
Sbjct: 171 GLNFLTGCFLLPESHKGERRPLRREA---LNPLASFRWARGMTVVAALMAVFFIMQLVGQ 227

Query: 235 AYQLLLTIYAERIFH---SNIGILYTVQGAGLMIGSLLVNLYISKN-EDKMKRAFGWAYL 290
L I+ E FH + IGI G + ++ ++ ++ G
Sbjct: 228 VPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIAD 287

Query: 291 LQGIFFLGFVLSDQLIIGIITLLCMRIAGGIIVPLDTTLLQMYTEENMIGKV 342
G L F + I+ LL +GGI +P +L +E G++
Sbjct: 288 GTGYILLAFATRGWMAFPIMVLL---ASGGIGMPALQAMLSRQVDEERQGQL 336


12  BcerKBAB4_1548  BcerKBAB4_1564  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_1548  2       13       -2.031705  flagellar motor switch protein
BcerKBAB4_1549  4       16       -3.154308  hypothetical protein
BcerKBAB4_1550  3       15       -3.088986  hypothetical protein
BcerKBAB4_1551  4       15       -3.193218  chemotaxis protein CheR
BcerKBAB4_1552  1       14       -3.406068  hypothetical protein
BcerKBAB4_1553  1       13       -3.153466  hypothetical protein
BcerKBAB4_1554  1       13       -2.953163  hypothetical protein
BcerKBAB4_1555  1       13       -2.453766  flagellar hook-associated protein FlgK
BcerKBAB4_1556  3       16       -2.213762  flagellar hook-associated protein FlgL
BcerKBAB4_1557  2       15       -2.430357  flagellar capping protein
BcerKBAB4_1558  3       17       0.184396   flagellar protein fliS
BcerKBAB4_1559  2       14       -0.164211  hypothetical protein
BcerKBAB4_1560  3       14       -0.274298  flagellar basal body rod protein FlgB
BcerKBAB4_1561  2       13       0.057121   flagellar basal body rod protein FlgC
BcerKBAB4_1562  3       14       -0.622699  flagellar hook-basal body protein FliE
BcerKBAB4_1563  4       12       -0.651162  flagellar MS-ring protein
BcerKBAB4_1564  3       11       -0.921100  flagellar motor switch protein G
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1548  FLGMOTORFLIN  57     6e-12    Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 56.8 bits (137), Expect = 6e-12
Identities = 23/71 (32%), Positives = 40/71 (56%)

Query: 475 DTSILQNVEMNVKFVFGSTVRTIQDILSLQENEAVVLDEDIDEPIQIYVNDVLVAYGELV 534
D ++ ++ + + G T TI+++L L + V LD EP+ I +N L+A GE+V
Sbjct: 53 DIDLIMDIPVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVV 112

Query: 535 NVDGFFGVKVT 545
V +GV++T
Sbjct: 113 VVADKYGVRIT 123


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1555  FLGHOOKAP1    101    2e-25    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 101 bits (253), Expect = 2e-25
Identities = 69/249 (27%), Positives = 110/249 (44%), Gaps = 14/249 (5%)

Query: 4 SDYNTPLSGMLAAQMGLQTTKQNLSNIHTPGYVRQMVNYGSVGASNGHTPEQRIGYGVQT 63
S N +SG+ AAQ L T N+S+ + GY RQ ++ G +G GV
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAG--GWVGNGVYV 59

Query: 64 LGVDRITDEVKTKQFNDQLSQFSYYAYMNSTLSRVESMVGTTGKNSLSSLMDGFFNAFRE 123
GV R D T Q +Q S +S++++M+ T+ SL++ M FF + +
Sbjct: 60 SGVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTS-SLATQMQDFFTSLQT 118

Query: 124 VAKNPEQPNYYDTLVSETGKFTSQINRLAKNLDTAEAQTTEDIEAHVNEFNRLGASLAEA 183
+ N E P L+ ++ +Q + L + Q I A V++ N +A
Sbjct: 119 LVSNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASL 178

Query: 184 NKKI----GQAGTQVPNQLLDERDRIVTEMSKYANIEVS---YESMNPNIASVRMNGVLT 236
N +I G PN LLD+RD++V+E+++ +EVS + N +A NG
Sbjct: 179 NDQISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMA----NGYSL 234

Query: 237 VNGQDTYPL 245
V G L
Sbjct: 235 VQGSTARQL 243



Score = 58.8 bits (142), Expect = 2e-11
Identities = 23/74 (31%), Positives = 43/74 (58%), Gaps = 1/74 (1%)

Query: 358 QFIVGVASDKSAVNAY-QNIHKDLLEGIQQEKMSIEGVNMEEEMVNLMAFQKYFVANSKA 416
+V +K+A +++ + ++ SI GVN++EE NL FQ+Y++AN++
Sbjct: 472 ASLVSDIGNKTATLKTSSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQV 531

Query: 417 ITTMNEVFDSLFSI 430
+ T N +FD+L +I
Sbjct: 532 LQTANAIFDALINI 545


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1556  FLAGELLIN     38     3e-05    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 38.1 bits (88), Expect = 3e-05
Identities = 28/127 (22%), Positives = 60/127 (47%)

Query: 1 MRVSTFQNANWAKNQMMDLNVQQQYHRNQVTSGKKNLFMSEDPLAASKSFAIQHSLANIE 60
++T + +N + +++SG + +D + + ++ +
Sbjct: 2 QVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLT 61

Query: 61 QMQKDLADSKNVLTQTENTLQGVFKSLTRADQLTLQALNGTNSEKELKAIGAEIDQILKQ 120
Q ++ D ++ TE L + +L R +L++QA NGTNS+ +LK+I EI Q L++
Sbjct: 62 QASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEE 121

Query: 121 VVYLANT 127
+ ++N
Sbjct: 122 IDRVSNQ 128


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits        Score  E-value  Comments
BcerKBAB4_1560  FLGHOOKAP1     30  0.004    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 29.5 bits (66), Expect = 0.004
Identities = 10/27 (37%), Positives = 15/27 (55%)

Query: 20 NTVSSNIANANTPGYKAQDVTFAEKMN 46
NT S+NI++ N GY Q A+ +
Sbjct: 19 NTASNNISSYNVAGYTRQTTIMAQANS 45


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits        Score  E-value  Comments
BcerKBAB4_1561  FLGHOOKAP1     35  5e-05    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 34.9 bits (80), Expect = 5e-05
Identities = 20/75 (26%), Positives = 34/75 (45%), Gaps = 7/75 (9%)

Query: 5 INASGSGLTTARKWMEVTSNNIVNANTTGAPGAEPYHRRSVVLESNNSFASMLDGAPTNG 64
IN + SGL A+ + SNNI + N G Y R++ ++ NS G NG
Sbjct: 4 INNAMSGLNAAQAALNTASNNISSYNVAG------YTRQTTIMAQANSTLGA-GGWVGNG 56

Query: 65 VKIKSIETDRNENLV 79
V + ++ + + +
Sbjct: 57 VYVSGVQREYDAFIT 71



Score = 28.4 bits (63), Expect = 0.009
Identities = 10/38 (26%), Positives = 17/38 (44%)

Query: 97 NIDVTAEMTNVMVAQKMYEANTSVLNANKKMLDKDLEI 134
+++ E N+ Q+ Y AN VL + D + I
Sbjct: 508 GVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits         Score  E-value  Comments
BcerKBAB4_1562  FLGHOOKFLIE     35  5e-06    Flagellar hook-basal body complex protein FliE signa...
		>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE signature.

Length = 103

Score = 35.4 bits (81), Expect = 5e-06
Identities = 17/63 (26%), Positives = 32/63 (50%), Gaps = 1/63 (1%)

Query: 38 LEDMNQTQNNAQTAVYDLLTKGVG-ETHDVLIQQKKAESQMKTAALVRDNLIENYKSLIN 96
L+ ++ TQ A+T G +DV+ +KA M+ VR+ L+ Y+ +++
Sbjct: 41 LDRISDTQTAARTQAEKFTLGEPGVALNDVMTDMQKASVSMQMGIQVRNKLVAAYQEVMS 100

Query: 97 MQI 99
MQ+
Sbjct: 101 MQV 103


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1563  FLGMRINGFLIF    159  9e-45    Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 159 bits (404), Expect = 9e-45
Identities = 97/540 (17%), Positives = 215/540 (39%), Gaps = 46/540 (8%)

Query: 17 LVIGAALLAIATGALLYFTLPDKYVVVYQNLNDTDKQEITAELSKLGVDYQLAADG-SIR 75
+V G+A +AI +L+ PD Y ++ NL+D D I A+L+++ + Y+ A +I
Sbjct: 28 IVAGSAAVAIVVAMVLWAKTPD-YRTLFSNLSDQDGGAIVAQLTQMNIPYRFANGSGAIE 86

Query: 76 VQKNDAPWVRKEMNGMGLPFNSKSGEEILLESSLGSSEQDKKMKQIVGTKKQLEQDIVRN 135
V + +R + GLP G E+L + G S+ +++ + +L + I
Sbjct: 87 VPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQFSEQVNYQRALEGELARTI-ET 145

Query: 136 FATIETANVQITLPEKETIFDEEKAKGTAAITVGVKRGQLLTADQVAGIQQMISAAVPGV 195
+++A V + +P+ ++F E+ +A++TV ++ G+ L Q++ + ++S+AV G+
Sbjct: 146 LGPVKSARVHLAMPKP-SLFVREQKSPSASVTVTLEPGRALDEGQISAVVHLVSSAVAGL 204

Query: 196 KAEEVSVIDSKKGIISKGADEAHSNSSSSYEKEVEMQHQIEGKLKQDIDATLMTMFKSNE 255
V+++D ++++ + + ++ + +E ++++ I+A L + +
Sbjct: 205 PPGNVTLVDQSGHLLTQSNTSGRDLNDAQ----LKFANDVESRIQRRIEAILSPIVGNGN 260

Query: 256 YKVNTKVSVNYDEVTRQSEKYG-DKGVLRSKQEQEESSTA-QEGADTKQGAGITANGEVP 313
+++ + E Y + ++ + + + Q GA G + +
Sbjct: 261 VHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNISEQVGAGYPGGVPGALSNQPA 320

Query: 314 NYGT----------NNNQNGKVVYDNKNGNKI----------ENYEIDKTVETIKKHP-E 352
N QN + N N NYE+D+T+ K + +
Sbjct: 321 PPNEAPIATPPTNQQNAQNTPQTSTSTNSNSAGPRSTQRNETSNYEVDRTIRHTKMNVGD 380

Query: 353 LTKTNVVVWVDNDTLVKRKI------DMTTFKEAIGTAAGLQADPNGNFTNGQVNVVTVQ 406
+ + +V V V+ TL K M ++ A G +NVV
Sbjct: 381 IERLSVAVVVNYKTLADGKPLPLTADQMKQIEDLTREAMGFSDK-----RGDTLNVVNSP 435

Query: 407 FDQPKVEKEKEPEKSGMNWWLFGGITAGLLALIGLVWFFLARRKKKREEEEYEEYLAEEE 466
F + P ++ L ++ + W + + + EE A +E
Sbjct: 436 FSAVDNTGGELPFWQQQSFIDQLLAAGRWLLVLVVAWILWRKAVRPQLTRRVEEAKAAQE 495

Query: 467 VAASSESIFEIPEEKI----VPEPKPEPVEPSEPTLDDQVQEATKEHVEGTAKVIKKWLN 522
A + E E ++ + + + +++E + A VI++W++
Sbjct: 496 QAQVRQETEEAVEVRLSKDEQLQQRRANQRLGAEVMSQRIREMSDNDPRVVALVIRQWMS 555


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1564  FLGMOTORFLIG    205  7e-66    Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 205 bits (522), Expect = 7e-66
Identities = 116/336 (34%), Positives = 196/336 (58%), Gaps = 6/336 (1%)

Query: 2 LDEISSKEKAAILIRTLNEEVAAKVIEYMTAEEKEVLLREIAKFRVYKPETLENVLGEFL 61
+ ++ K+KAAIL+ ++ E+++KV +Y++ EE E L EIAK E +NVL EF
Sbjct: 12 VSALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITSELKDNVLLEFK 71

Query: 62 YELNVKELNLVTPDKEYIRRIF-KNMPEEDLEKLLEDLWYN-KDNPFEFLNSLTDLEPLL 119
EL + + + +Y R + K++ + ++ +L + PFEF+ D +L
Sbjct: 72 -ELMMAQEFIQKGGIDYARELLEKSLGTQKAVDIINNLGSALQSRPFEFVRRA-DPANIL 129

Query: 120 TVLNDESPQTIAIIASYIKPQLASQLIERLPDHKRVETVMGIAKLEQVDGELINQIGELL 179
+ E PQTIA+I SY+ PQ AS ++ LP + IA +++ E++ ++ +L
Sbjct: 130 NFIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEVVREVERVL 189

Query: 180 KAKLNNMAFSAINKTDGLKTIVNILNNVSRGVEKTVFQKLDEVDYALSEKIKENMFVFED 239
+ KL +++ G+ +V I+N R EK + + L+E D L+E+IK+ MFVFED
Sbjct: 190 EKKLASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEIKKKMFVFED 249

Query: 240 LLGLEDLALRRVLEEITDNGVIAKALKIAKEEIKEKLFTCMSSNRREMILEELDGLGPLK 299
++ L+D +++RVL EI D +AKALK ++EK+F MS M+ E+++ LGP +
Sbjct: 250 IVLLDDRSIQRVLREI-DGQELAKALKSVDIPVQEKIFKNMSKRAASMLKEDMEFLGPTR 308

Query: 300 MTDAEKAQQTITGTVKKLEKEGRIIVQRG-EDDVLI 334
D E++QQ I ++KLE++G I++ RG E+DVL+
Sbjct: 309 RKDVEESQQKIVSLIRKLEEQGEIVISRGGEEDVLV 344


13  BcerKBAB4_1577  BcerKBAB4_1601  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias    %GCBias  Product
BcerKBAB4_1577       3       20   0.307286  PAS/PAC sensor-containing diguanylate
BcerKBAB4_1578       6       28   3.139472  integral membrane protein TerC
BcerKBAB4_1580       7       32   2.552770  flagellin domain-containing protein
BcerKBAB4_1581       6       29   2.657102  flagellin domain-containing protein
BcerKBAB4_1582       3       24   2.292438  flagellin domain-containing protein
BcerKBAB4_1583       3       24   2.139800  flagellin domain-containing protein
BcerKBAB4_1584       3       29   1.437005  lytic transglycosylase
BcerKBAB4_1585       4       32   0.805502  flagellar motor switch protein
BcerKBAB4_1586       3       30   0.769220  flagellar motor switch protein FliM
BcerKBAB4_1587       3       25  -0.567798  flagellar motor switch protein
BcerKBAB4_1588       4       21  -0.047383  flagellar motor switch protein
BcerKBAB4_1589       3       16  -0.450123  flagellar biosynthesis protein FliP
BcerKBAB4_1590       4       14   0.009239  flagellar biosynthesis protein FliQ
BcerKBAB4_1591       2       12   0.662963  flagellar biosynthesis protein FliR
BcerKBAB4_1592       1       11   0.777719  flagellar biosynthesis protein FlhB
BcerKBAB4_1593       0       11   1.366508  flagellar biosynthesis protein FlhA
BcerKBAB4_1594      -1       12   1.267327  flagellar biosynthesis regulator FlhF
BcerKBAB4_1595      -1       14   1.634769  flagellar basal body rod protein FlgG
BcerKBAB4_1596      -2       16  -0.036661  NtaA/SnaA/SoxA family monooxygenase
BcerKBAB4_1597       1       15  -1.697887  transcriptional regulator TrmB
BcerKBAB4_1598       3       16  -2.699461  AzlC family protein
BcerKBAB4_1599       4       18  -4.275981  branched-chain amino acid transport
BcerKBAB4_1600       3       19  -3.914000  hypothetical protein
BcerKBAB4_1601      -1       17  -3.651585  VanZ family protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits       Score  E-value  Comments
BcerKBAB4_1580  FLAGELLIN     84  5e-22    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 84.3 bits (208), Expect = 5e-22
Identities = 40/142 (28%), Positives = 64/142 (45%)

Query: 3 IGTNVLSMNASQSLYENEKRMKMATDKKLNTASDTPANVAIVTRMHARASGIQVAIRKIE 62
+ + A + ++ +A + + + I A + I+
Sbjct: 366 GESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAAKKSTANPLASID 425

Query: 63 EALQNISLYRADLGAMMKRLQFNIENLNNQSLALTGASSRIEDADMAQEMSDFFKFKLLT 122
AL + R+ LGA+ R I NL N L A SRIEDAD A E+S+ K ++L
Sbjct: 426 SALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSNMSKAQILQ 485

Query: 123 EVALSMVSQANQIPQMVSKLLQ 144
+ S+++QANQ+PQ V LL+
Sbjct: 486 QAGTSVLAQANQVPQNVLSLLR 507



Score = 40.0 bits (93), Expect = 1e-06
Identities = 19/88 (21%), Positives = 37/88 (42%), Gaps = 5/88 (5%)

Query: 1 MRIGTNVLSMNASQSLYENEKRM-----KMATDKKLNTASDTPANVAIVTRMHARASGIQ 55
I TN LS+ +L +++ + ++++ ++N+A D A AI R + G+
Sbjct: 2 QVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLT 61

Query: 56 VAIRKIEEALQNISLYRADLGAMMKRLQ 83
A R + + L + LQ
Sbjct: 62 QASRNANDGISIAQTTEGALNEINNNLQ 89


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits       Score  E-value  Comments
BcerKBAB4_1581  FLAGELLIN    128  6e-36    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 128 bits (322), Expect = 6e-36
Identities = 78/271 (28%), Positives = 122/271 (45%), Gaps = 3/271 (1%)

Query: 1 MRINTNINSMRTQEYMRQNQDKMNTSMNRLSSGKSINSAADDAAGLAIATRMRAKEGGLN 60
INTN S+ TQ + ++Q +++++ RLSSG INSA DDAAG AIA R + GL
Sbjct: 2 QVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLT 61

Query: 61 VGARNTQDAMSALRTGDAALGSVSNILLRMRDLATQASSGTNNDKDIASMDKEYQALAQE 120
+RN D +S +T + AL ++N L R+R+L+ QA++GTN+D D+ S+ E Q +E
Sbjct: 62 QASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEE 121

Query: 121 IDHIADKTNFNGNAFLGGTGAGGKDITIQLSDASSDTMKITAIDTKAITTATLATATATG 180
ID ++++T FNG L I + +D + T+ + ID K++
Sbjct: 122 IDRVSNQTQFNGVKVLSQD--NQMKIQVGANDGETITIDLQKIDVKSLGLDGF-NVNGPK 178

Query: 181 PDKALNAASAPAQITALDTAIQGIADARATFGSQLNRLDHNLNNVTSQATNMAASASQIE 240
+ S+ +T DT G R S D V + AA+
Sbjct: 179 EATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTT 238

Query: 241 DADMAKEMSNMTKFKILNEAGISMLSQANQT 271
D ++ K + A
Sbjct: 239 DDAENNTAVDLFKTTKSTAGTAEAKAIAGAI 269



Score = 80.9 bits (199), Expect = 2e-19
Identities = 57/252 (22%), Positives = 99/252 (39%), Gaps = 1/252 (0%)

Query: 30 LSSGKSINSAADDAAGLAIATRMRAKEGGLNVGARNTQDAMSALRTGDAALGSVSNILLR 89
+ + A G K + + D + T +
Sbjct: 256 TAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVSTTINGEKVTLTVADI 315

Query: 90 MRDLATQASSGTNNDKDIASMDKEYQALAQEIDHIADKTNFNGNAFLGGTGAGGKDITIQ 149
A ++ + K++ + Q + + A G +
Sbjct: 316 TAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEANNAVKGESKITVNGA 375

Query: 150 LSDASSDTMKIT-AIDTKAITTATLATATATGPDKALNAASAPAQITALDTAIQGIADAR 208
A++ K+T A T I +T D A S + ++D+A+ + R
Sbjct: 376 EYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAAKKSTANPLASIDSALSKVDAVR 435

Query: 209 ATFGSQLNRLDHNLNNVTSQATNMAASASQIEDADMAKEMSNMTKFKILNEAGISMLSQA 268
++ G+ NR D + N+ + TN+ ++ S+IEDAD A E+SNM+K +IL +AG S+L+QA
Sbjct: 436 SSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSNMSKAQILQQAGTSVLAQA 495

Query: 269 NQTPQMVSKLLQ 280
NQ PQ V LL+
Sbjct: 496 NQVPQNVLSLLR 507


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits       Score  E-value  Comments
BcerKBAB4_1582  FLAGELLIN    126  3e-35    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 126 bits (317), Expect = 3e-35
Identities = 77/272 (28%), Positives = 121/272 (44%), Gaps = 4/272 (1%)

Query: 1 MRINTNINSMRTQEYMRQNQDKMNTSMNRLSSGKSINSAADDAAGLAIATRMRAKEGGLN 60
INTN S+ TQ + ++Q +++++ RLSSG INSA DDAAG AIA R + GL
Sbjct: 2 QVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLT 61

Query: 61 VGARNTQDAMSALRTGDAALGSVSNILLRMRDLATQASSGTNNDKDIASMNKEYQALAQE 120
+RN D +S +T + AL ++N L R+R+L+ QA++GTN+D D+ S+ E Q +E
Sbjct: 62 QASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEE 121

Query: 121 IDHIADKTNFNGNAFLGGTGAGGKDITIQLSDASSDTMTIAAIDTKDITTTKLAVGADPA 180
ID ++++T FNG L I + +D + T+ + ID K + V
Sbjct: 122 IDRVSNQTQFNGVKVLSQD--NQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNG--P 177

Query: 181 SATKNLNATTAATEITALDTAIQNIADARATFGSQLNRLDHNLNNVTSQATNMAASASQI 240
+ ++ +T DT R S D V + AA+
Sbjct: 178 KEATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLT 237

Query: 241 EDADMAKEMSNMTKFKILNEAGISMLSQANQT 272
D ++ K + A
Sbjct: 238 TDDAENNTAVDLFKTTKSTAGTAEAKAIAGAI 269



Score = 78.9 bits (194), Expect = 1e-18
Identities = 49/252 (19%), Positives = 94/252 (37%)

Query: 30 LSSGKSINSAADDAAGLAIATRMRAKEGGLNVGARNTQDAMSALRTGDAALGSVSNILLR 89
+ + A G K + + D + T +
Sbjct: 256 TAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVSTTINGEKVTLTVADI 315

Query: 90 MRDLATQASSGTNNDKDIASMNKEYQALAQEIDHIADKTNFNGNAFLGGTGAGGKDITIQ 149
A ++ + K++ + Q + + A G +
Sbjct: 316 TAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEANNAVKGESKITVNGA 375

Query: 150 LSDASSDTMTIAAIDTKDITTTKLAVGADPASATKNLNATTAATEITALDTAIQNIADAR 209
A++ + + + + + A + ++D+A+ + R
Sbjct: 376 EYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAAKKSTANPLASIDSALSKVDAVR 435

Query: 210 ATFGSQLNRLDHNLNNVTSQATNMAASASQIEDADMAKEMSNMTKFKILNEAGISMLSQA 269
++ G+ NR D + N+ + TN+ ++ S+IEDAD A E+SNM+K +IL +AG S+L+QA
Sbjct: 436 SSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSNMSKAQILQQAGTSVLAQA 495

Query: 270 NQTPQMVSKLLQ 281
NQ PQ V LL+
Sbjct: 496 NQVPQNVLSLLR 507


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits       Score  E-value  Comments
BcerKBAB4_1583  FLAGELLIN    131  4e-37    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 131 bits (330), Expect = 4e-37
Identities = 78/272 (28%), Positives = 125/272 (45%), Gaps = 7/272 (2%)

Query: 1 MRINTNINSMRTQEYMRQNQDKMNTSMNRLSSGKSINSAADDAAGLAIATRMRAKEGGLN 60
INTN S+ TQ + ++Q +++++ RLSSG INSA DDAAG AIA R + GL
Sbjct: 2 QVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLT 61

Query: 61 VGARNTQDAMSALRTGDAALGSVSNILLRMRDLATQASSGTNNDKDIASMDKEYQALAQE 120
+RN D +S +T + AL ++N L R+R+L+ QA++GTN+D D+ S+ E Q +E
Sbjct: 62 QASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEE 121

Query: 121 INHIADKTNFNGNAFLNKGTNPGEGKDITIQLSDASSDTMTIAAIDTKDITTTKLATDGT 180
I+ ++++T FNG L++ I + +D + T+ + ID K + +G
Sbjct: 122 IDRVSNQTQFNGVKVLSQDNQ----MKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGP 177

Query: 181 K---KLDATTAATEITALDTAIQEIADARATFGSQLNRLDHNLNNVTSQATNMAASASQI 237
K D ++ +T DT R S D V + AA+
Sbjct: 178 KEATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLT 237

Query: 238 EDADMAKEMSNMTKFKILNEAGISMLSQANQT 269
D ++ K + A
Sbjct: 238 TDDAENNTAVDLFKTTKSTAGTAEAKAIAGAI 269



Score = 78.9 bits (194), Expect = 9e-19
Identities = 52/252 (20%), Positives = 99/252 (39%), Gaps = 3/252 (1%)

Query: 30 LSSGKSINSAADDAAGLAIATRMRAKEGGLNVGARNTQDAMSALRTGDAALGSVSNILLR 89
+ + A G K + + D + T +
Sbjct: 256 TAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVSTTINGEKVTLTVADI 315

Query: 90 MRDLATQASSGTNNDKDIASMDKEYQALAQEINHIADKTNFNGNAFLNKGTNPGEGKDIT 149
A ++ + K++ + Q + + A +
Sbjct: 316 TAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEANNAVKGESKITVNGA 375

Query: 150 IQLSDASSDTMTIAAIDTKDITTTKLATDGTKKLDATTAAT---EITALDTAIQEIADAR 206
++A+ D +T+A T + + A + + ++D+A+ ++ R
Sbjct: 376 EYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAAKKSTANPLASIDSALSKVDAVR 435

Query: 207 ATFGSQLNRLDHNLNNVTSQATNMAASASQIEDADMAKEMSNMTKFKILNEAGISMLSQA 266
++ G+ NR D + N+ + TN+ ++ S+IEDAD A E+SNM+K +IL +AG S+L+QA
Sbjct: 436 SSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSNMSKAQILQQAGTSVLAQA 495

Query: 267 NQTPQMVSKLLQ 278
NQ PQ V LL+
Sbjct: 496 NQVPQNVLSLLR 507


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
BcerKBAB4_1584  PF06580     31  0.005    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 31.0 bits (70), Expect = 0.005
Identities = 9/43 (20%), Positives = 22/43 (51%), Gaps = 1/43 (2%)

Query: 121 ELTNKY-NIQKIRSSNEGKYEDIIDRASSTYGIPKTLIQKMIE 162
+ + Y + I+ + ++E+ I+ A +P L+Q ++E
Sbjct: 223 TVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPMLVQTLVE 265


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1585  TYPE3OMOPROT     42  4e-08    Type III secretion system outer membrane O protein ...
		>TYPE3OMOPROT#Type III secretion system outer membrane O protein family signature.

Length = 303

Score = 41.5 bits (97), Expect = 4e-08
Identities = 14/67 (20%), Positives = 31/67 (46%)

Query: 5 DDIPLTIYFEIGNTKKKIEDLLHITKGTLYRLENSTKNTVRLMLENEEIGTGKILTKNGK 64
+ +P+ + F + + +L + + L L + + V +M +G G+++ N
Sbjct: 228 NQLPVKLEFVLYRKNVTLAELEAMGQQQLLSLPTNAELNVEIMANGVLLGNGELVQMNDT 287

Query: 65 MYVEIVE 71
+ VEI E
Sbjct: 288 LGVEIHE 294


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1586  FLGMOTORFLIM    144  2e-42    Flagellar motor switch protein FliM signature.
		>FLGMOTORFLIM#Flagellar motor switch protein FliM signature.

Length = 344

Score = 144 bits (364), Expect = 2e-42
Identities = 93/329 (28%), Positives = 165/329 (50%), Gaps = 10/329 (3%)

Query: 4 EKLSQEQIDALLKAVNEGEEMPAFAQEAGKQDKFQEYDFNRPEKFGVEHLRSLQAIASTF 63
E LSQ++ID LL A++ G+ A+ K YDF RP+KF E +R+L + TF
Sbjct: 3 EVLSQDEIDQLLTAISSGDASIEDARPISDTRKITLYDFRRPDKFSKEQMRTLSLMHETF 62

Query: 64 GKQTSQTLSARMRIPIELEPSTVEQVPFTSEYVEKMPKDYYLYCVIDLGLPELGEIVIEI 123
+ T+ +LSA++R + + ++V+Q+ + E++ +P L VI + P G V+E+
Sbjct: 63 ARLTTTSLSAQLRSMVHVHVASVDQLTY-EEFIRSIPTPSTL-AVITMD-PLKGNAVLEV 119

Query: 124 DLAFVIYIHECWLGGDSKRNFTMRRPLTAFEFLTLDNIFLLLCKNLEQSFESVVAIEPKF 183
D + I + GG + ++R LT E ++ + + + N+ +S+ V+ + P+
Sbjct: 120 DPSITFSIIDRLFGGT-GQAAKVQRDLTDIENSVMEGVIVRILANVRESWTQVIDLRPRL 178

Query: 184 VTTETDPNALKITTASDIISLLNVNMKTEFWNTTVRIGIPFLSVEEIMDKLTSENIVEHS 243
ET+P +I S+++ L+ + K + IP++++E I+ KL+S+ S
Sbjct: 179 GQIETNPQFAQIVPPSEMVVLVTLETKVGEEEGMMNFCIPYITIEPIISKLSSQFW--FS 236

Query: 244 SDKRKK---YTSEVEVKVNQVYKPVHVAVGEQKMTMGEIEQIEEGDIIPLH-TKVSDQLR 299
S +R Y + K++ V V VG ++++ +I + GDII LH T V D
Sbjct: 237 SVRRSSTTQYMGVLRDKLSTVDMDVVAEVGSLRLSVRDILGLRVGDIIRLHDTHVGDPFV 296

Query: 300 GYVDGKHKFNCFIGKDGTRKALLFKSFIE 328
+ + KF C G G + A IE
Sbjct: 297 LSIGNRKKFLCQPGVVGKKIAAQILERIE 325


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1587  FLGMOTORFLIN     58  5e-14    Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 58.0 bits (140), Expect = 5e-14
Identities = 22/94 (23%), Positives = 51/94 (54%)

Query: 13 LEEFAGKRNEAGKAHIDTVSDISIELGVKLGKSSITLGDVKQLKVGDVLEVEKNLGHKVD 72
++ G ID + DI ++L V+LG++ +T+ ++ +L G V+ ++ G +D
Sbjct: 39 FQQLGGGDVSGAMQDIDLIMDIPVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLD 98

Query: 73 VYLSDMKVGIGEAIVMDEKFGIIISEIEADKKHA 106
+ ++ + GE +V+ +K+G+ I++I +
Sbjct: 99 ILINGYLIAQGEVVVVADKYGVRITDIITPSERM 132


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1589  FLGBIOSNFLIP    163  4e-52    Escherichia coli: Flagellar biosynthetic protein Fl...
		>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature.

Length = 245

Score = 163 bits (415), Expect = 4e-52
Identities = 70/203 (34%), Positives = 127/203 (62%)

Query: 48 SSVQLFALVTLLSLSSSIVLLFTHFTYFMIVLGITRQGLGVMNLPPNQVLVGLALFLSLF 107
VQ +T L+ +I+L+ T FT +IV G+ R LG + PPNQVL+GLALFL+ F
Sbjct: 40 LPVQTLVFITSLTFIPAILLMMTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFF 99

Query: 108 TMQPVLGQLKSDVWDPMTKEKITVSQAAETTAPIMKDYMSKHTYKHDLKMMLKVRGEELP 167
M PV+ ++ D + P ++EKI++ +A E A ++++M + T + DL + ++
Sbjct: 100 IMSPVIDKIYVDAYQPFSEEKISMQEALEKGAQPLREFMLRQTREADLGLFARLANTGPL 159

Query: 168 KDLKDLSLFTLVPSFTLTQIQKGLLTGMFIYLAFVFIDLIISTLLMYLGMMMVPPMILSL 227
+ + + + L+P++ ++++ G I++ F+ IDL+I+++LM LGMMMVPP ++L
Sbjct: 160 QGPEAVPMRILLPAYVTSELKTAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIAL 219

Query: 228 PFKILVFVYLGGYTKIVDIMFKT 250
PFK+++FV + G+ +V + ++
Sbjct: 220 PFKLMLFVLVDGWQLLVGSLAQS 242


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1590  TYPE3IMQPROT     38  3e-07    Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature.

Length = 86

Score = 38.2 bits (89), Expect = 3e-07
Identities = 15/81 (18%), Positives = 35/81 (43%)

Query: 4 SPIIDIFQTFFYKGVMILMPIAVVSMIVVIIIAVIMAMMQIQEQTLTFLPKMASIVLVII 63
++ Y +++ +V+ I+ +++ + + Q+QEQTL F K+ + L +
Sbjct: 2 DDLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLF 61

Query: 64 ILGPWMFQELTMLILDLFDKI 84
+L W + L +
Sbjct: 62 LLSGWYGEVLLSYGRQVIFLA 82


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1591  TYPE3IMRPROT     96  7e-26    Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature.

Length = 261

Score = 96.0 bits (239), Expect = 7e-26
Identities = 51/233 (21%), Positives = 113/233 (48%), Gaps = 1/233 (0%)

Query: 10 FFAFCRITSFLYFLPFFSGRSIPAMAKVTFGLALSITVADQVDVSHIKTVWDVAA-YAGT 68
F+ R+ + + P S RS+P K+ + ++ +A + + + A A
Sbjct: 17 FWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPANDVPVFSFFALWLAVQ 76

Query: 69 QIVIGLSLSKIVEMLWNIPKMAGHILDFDIGLSQASLFDVNAGSQSTLLSTIFDIFFLII 128
QI+IG++L ++ + + AG I+ +GLS A+ D + +L+ I D+ L++
Sbjct: 77 QILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHLNMPVLARIMDMLALLL 136

Query: 129 FISLGGINYFVATILKSFQYTEAISKLLTTSFLDSLLATLLFAITSAVEIALPLMGSLFI 188
F++ G + ++ ++ +F + L ++ +L + + +ALPL+ L
Sbjct: 137 FLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLIFLNGLMLALPLITLLLT 196

Query: 189 INFVLILIAKNAPQLNVFMNAYVIKITCGILFIAMSVPMLGYVFKNMTDVLLE 241
+N L L+ + APQL++F+ + + +T GI +A +P++ +++ +
Sbjct: 197 LNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFCEHLFSEIFN 249


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1592  TYPE3IMSPROT    287  1e-97    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 287 bits (737), Expect = 1e-97
Identities = 90/343 (26%), Positives = 184/343 (53%), Gaps = 2/343 (0%)

Query: 4 DNKTEKATPQKRKKSREEGNIARSKDLNNLFSILVLAVVVYFFGDWLGYEIAHSVAVLFD 63
KTE+ TP+K + +R++G +A+SK++ + I+ L+ ++ D+ + + + +
Sbjct: 3 GEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIPAE 62

Query: 64 QIGKNTDS--TEYFYLMGILLLKVSAPILILVYAFHLFNYMIQVGFLFSSKVLKPKASRI 121
Q + + + + P+L + + ++++Q GFL S + +KP +I
Sbjct: 63 QSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDIKKI 122

Query: 122 NPKNYFTRLFSRKSLVDILKSLFYMGLIGYVSYVLFKKNLEKIVSMIGFNWTASLTEIIS 181
NP R+FS KSLV+ LKS+ + L+ + +++ K NL ++ + +
Sbjct: 123 NPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPLLGQ 182

Query: 182 QIKFIFLAILIILIVLSIIDFIYQKWEYEQDIKMKKEEVKQEHKDNEGDPQVKGKRKNFM 241
++ + + + +V+SI D+ ++ ++Y +++KM K+E+K+E+K+ EG P++K KR+ F
Sbjct: 183 ILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRRQFH 242

Query: 242 HAILQGTIAKKMDGATFIVNNPTHISVVLRYNKQVDAAPIVVAKGEDELALYIRTLAREQ 301
I + + + ++ +V NPTHI++ + Y + P+V K D +R +A E+
Sbjct: 243 QEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIAEEE 302

Query: 302 EIPMVENRPLARSLYYQVEEDETIPEDLYVAVIEVMRYLIQTK 344
+P+++ PLAR+LY+ D IP + A EV+R+L +
Sbjct: 303 GVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQN 345


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1593  TYPE3IMSPROT     39  7e-05    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 38.6 bits (90), Expect = 7e-05
Identities = 26/163 (15%), Positives = 58/163 (35%), Gaps = 19/163 (11%)

Query: 192 IFGIVILFVNIIFGLIVGMMQQGMSFADAAI-----------HYTQLTVGDGIVNQIGSL 240
+L V + + ++Q G + AI ++ +V + S+
Sbjct: 85 YLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDIKKINPIEGAKRIFSIKSLVEFLKSI 144

Query: 241 MLAISTGIIVTRVFDGSADTVTEGIFKELLAHEVVVYALGGLFIAMGVFTPLPFLPFALV 300
+ + I++ + G+ T+ + E + LG + + V + F+ ++
Sbjct: 145 LKVVLLSILIWIIIKGNLVTLLQLPT---CGIECITPLLGQILRQLMVICTVGFVVISIA 201

Query: 301 GGTI-IFLGVRNKKRIKKEKEDELQKELE---MIQGDEEQLQQ 339
+ ++ K K E + E KE+E I+ Q Q
Sbjct: 202 DYAFEYYQYIKELKMSKDEIKREY-KEMEGSPEIKSKRRQFHQ 243


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits        Score  E-value  Comments
BcerKBAB4_1595  FLGHOOKAP1     30  0.015    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 29.5 bits (66), Expect = 0.015
Identities = 6/40 (15%), Positives = 15/40 (37%)

Query: 2 NGLYIGSMGMMNYMQHINVHSNNVANAQTTGFKAENMTSK 41
+ + G+ +N SNN+++ G+ +
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMA 41


14  BcerKBAB4_1652  BcerKBAB4_1657  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias    %GCBias  Product
BcerKBAB4_1652       0       12  -3.448065  putative lipoprotein
BcerKBAB4_1653       1       10  -3.429642  hypothetical protein
BcerKBAB4_1654       0       11  -3.029785  hypothetical protein
BcerKBAB4_1655       0       12  -3.197457  RNA polymerase sigma factor SigW
BcerKBAB4_1656      -2       12  -3.372780  two component LuxR family transcriptional
BcerKBAB4_1657      -1       12  -3.214874  histidine kinase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
BcerKBAB4_1656  HTHFIS     73  3e-17    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 73.3 bits (180), Expect = 3e-17
Identities = 25/116 (21%), Positives = 49/116 (42%), Gaps = 3/116 (2%)

Query: 14 KIKILIADDNSFIREGMKIILNTYEEFEVLDTVNDGKEAVAYCKKYEVDIALLDVRMPNM 73
IL+ADD++ IR + L+ ++V ++ + + D+ + DV MP+
Sbjct: 3 GATILVADDDAAIRTVLNQALS-RAGYDVR-ITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 74 NGVEATKLICEETKTKPLILTT-FDDDEYILDAVKNGAKGYLLKNNDPERIRDAIK 128
N + I + P+++ + + + A + GA YL K D + I
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116


15  BcerKBAB4_1695  BcerKBAB4_1705  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias    %GCBias  Product
BcerKBAB4_1695       0       15  -3.527230  sodium:neurotransmitter symporter
BcerKBAB4_1696       1       19  -0.994309  polysaccharide deacetylase
BcerKBAB4_1697       2       23  -0.243943  hypothetical protein
BcerKBAB4_1698       2       24  -0.216461  hypothetical protein
BcerKBAB4_1699       2       21   1.575951  hypothetical protein
BcerKBAB4_1700       1       17   2.167105  fibronectin-binding family protein
BcerKBAB4_1701       0       16   3.669158  beta-lactamase inhibitory protein II
BcerKBAB4_1702      -2       11   2.396741  hypothetical protein
BcerKBAB4_1703      -2       12   2.407230  hypothetical protein
BcerKBAB4_1704      -2       10   2.959208  peptide methionine sulfoxide reductase
BcerKBAB4_1705      -2       12   3.384038  short chain dehydrogenase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
BcerKBAB4_1700  PF07299    340  e-122    Fibronectin-binding protein (FBP)
		>PF07299#Fibronectin-binding protein (FBP)

Length = 219

Score = 340 bits (873), Expect = e-122
Identities = 208/213 (97%), Positives = 212/213 (99%)

Query: 1 MEAFIRSDQYNFIKSQAYILANGHATANDRGVIQALKSLAIEKIIHVFENLTDEQKELID 60
MEAFIRSDQYNFIKSQAYILANGHATANDRGVIQALKSLAIEKIIHVFENLTDEQKELID
Sbjct: 7 MEAFIRSDQYNFIKSQAYILANGHATANDRGVIQALKSLAIEKIIHVFENLTDEQKELID 66

Query: 61 TVLTVQNREDAESFLLKINPYVIPFQEVTAQTLKKLFPKAKKLKLPDMEELDMKELTYLS 120
TVLTVQNREDAESFLLKINPYVIPFQEVTAQTLKKLFPKAKKLKLPDMEELDMKEL+YLS
Sbjct: 67 TVLTVQNREDAESFLLKINPYVIPFQEVTAQTLKKLFPKAKKLKLPDMEELDMKELSYLS 126

Query: 121 WIDKGSSRKFIIAKNDENKFVGLQGTFQSLNKKSICSLCHGHEEVGMFLVEIKGDVPGTF 180
WIDKGSSRKFIIAKND+NKFVGLQGTFQSLNKKSICSLCHGHEEVGMFLVEIKGD+PGTF
Sbjct: 127 WIDKGSSRKFIIAKNDKNKFVGLQGTFQSLNKKSICSLCHGHEEVGMFLVEIKGDIPGTF 186

Query: 181 VRKGNYICKDGVACNHNMKSLDKLQDFIERLKK 213
V+KGNYICKDGVACN NMKSLDKLQDFIERLKK
Sbjct: 187 VKKGNYICKDGVACNQNMKSLDKLQDFIERLKK 219


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1705  DHBDHDRGNASE     88  5e-23    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 88.2 bits (218), Expect = 5e-23
Identities = 70/263 (26%), Positives = 120/263 (45%), Gaps = 21/263 (7%)

Query: 2 LKGKIALVTGASRGIGRAIAKRLANDGALV-AVHYGNRKEDAEETVHEIQSNGGSAFSIG 60
++GKIA +TGA++GIG A+A+ LA+ GA + AV Y K + + + ++ AF
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFP-- 63

Query: 61 ANLESLHGVDNLYISLDNELQKRTGGTQFDILINNAGIGPGAFIEETTEQFFDRMVSVNA 120
A++ +D + +++ G DIL+N AG+ I +++ ++ SVN+
Sbjct: 64 ADVRDSAAIDEIT----ARIEREMGP--IDILVNVAGVLRPGLIHSLSDEEWEATFSVNS 117

Query: 121 KAPFFIIQQALPRLRD--NSRIINISSAATRISLPDFVAYSMTKGAINTMTFTLAKQLGA 178
F + + D + I+ + S + AY+ +K A T L +L
Sbjct: 118 TGVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAE 177

Query: 179 RGITVNAILPGFIKTDMNAELLSDP---------MMKQYATTISAFNRLGEVEDIADTAA 229
I N + PG +TDM L +D ++ + T I +L + DIAD
Sbjct: 178 YNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIP-LKKLAKPSDIADAVL 236

Query: 230 FLSSPDSRWVTGQLIDVSGGSCL 252
FL S + +T + V GG+ L
Sbjct: 237 FLVSGQAGHITMHNLCVDGGATL 259


16  BcerKBAB4_1773  BcerKBAB4_1784  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias    %GCBias  Product
BcerKBAB4_1773       2       14  -2.910627  branched-chain amino acid transport system II
BcerKBAB4_1774       5       18  -3.982335  glyoxalase/bleomycin resistance
BcerKBAB4_1775       6       21  -3.216553  amidinotransferase
BcerKBAB4_1776       7       21  -2.357684  4'-phosphopantetheinyl transferase
BcerKBAB4_1777       6       21  -2.821285  beta-lactamase
BcerKBAB4_1778       5       21  -2.992713  fatty acid desaturase
BcerKBAB4_1779       5       20  -2.950732  oleoyl-(acyl-carrier-protein) hydrolase
BcerKBAB4_1780       5       20  -2.755152  amino acid adenylation domain-containing
BcerKBAB4_1781       5       19  -2.750185  Beta-ketoacyl synthase
BcerKBAB4_1782       4       18  -3.064605  amino acid adenylation domain-containing
BcerKBAB4_1783       3       18  -2.465465  cyclic peptide transporter
BcerKBAB4_1784       3       17  -1.918949  amino acid adenylation domain-containing
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1775  ARGDEIMINASE    103  2e-27    Bacterial arginine deiminase signature.
		>ARGDEIMINASE#Bacterial arginine deiminase signature.

Length = 409

Score = 103 bits (258), Expect = 2e-27
Identities = 63/261 (24%), Positives = 107/261 (40%), Gaps = 51/261 (19%)

Query: 66 HTLMLPPEVRFPEQVFTRDVGFTIGETVFISNMKNEVRKGE----ERIFK---KTLSEQD 118
+ ++ P P +FTRD +IG V I+ M +VR+ E E IFK
Sbjct: 149 NLFIIDP---MPNVLFTRDPFASIGNGVTINKMFTKVRQRETIFAEYIFKYHPVYKENVP 205

Query: 119 IAYIDCINSHIEGGDV-IIDQDIVYIGVSNRTLFNSVKKLQQLLTHYK-------IIPVP 170
I + +EGGD ++++ ++ IG+S RT SV+KL L K +P
Sbjct: 206 IWLNRWEEASLEGGDELVLNKGLLVIGISERTEAKSVEKLAISLFKNKTSFDTILAFQIP 265

Query: 171 FSKDFLHLDCVFNIISKEEALIYP------------HAFSNSTLKMLSDRYNLIEVSKK- 217
++ ++HLD VF I + + S+S + + ++ + +V
Sbjct: 266 KNRSYMHLDTVFTQIDYSVFTSFTSDDMYFSIYVLTYNPSSSKIHIKKEKARIKDVLSFY 325

Query: 218 --------------------EQFTLATNVLSLGNKKMISLPSNTKTNKELRLRGYEVIEI 257
EQ+ NVL++ ++I+ N TNK G +V I
Sbjct: 326 LGRKIDIIKCAGGDLIHGAREQWNDGANVLAIAPGEIIAYSRNHVTNKLFEENGIKVHRI 385

Query: 258 DFGEIIKSGGSFRCCTLPIQR 278
E+ + G RC ++P+ R
Sbjct: 386 PSSELSRGRGGPRCMSMPLIR 406


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1776  BINARYTOXINB     30  0.006    Binary toxin B family signature.
		>BINARYTOXINB#Binary toxin B family signature.

Length = 764

Score = 30.4 bits (68), Expect = 0.006
Identities = 16/63 (25%), Positives = 26/63 (41%), Gaps = 8/63 (12%)

Query: 158 TLKESFVKAIG----KGLLY----PLDSFGFNMDDWSQNKITLRNTNSDFSQFYFCLNRL 209
TLKE+ A G G L + F FN D + I + + + Y L+++
Sbjct: 551 TLKEALKIAFGFNEPNGNLQYQGKDITEFDFNFDQQTSQNIKNQLAELNATNIYTVLDKI 610

Query: 210 EQN 212
+ N
Sbjct: 611 KLN 613


17  BcerKBAB4_2022  BcerKBAB4_2037  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias    %GCBias  Product
BcerKBAB4_2022       2       12  -0.771988  hypothetical protein
BcerKBAB4_2023       2       14  -1.128048  group-specific protein
BcerKBAB4_2024       2       15  -1.443086  hypothetical protein
BcerKBAB4_2025       2       13  -0.960051  cell division protein FtsK
BcerKBAB4_2026       3       14  -1.280381  hypothetical protein
BcerKBAB4_2027       3       15  -1.242865  hypothetical protein
BcerKBAB4_2028       2       14  -0.512990  group-specific protein
BcerKBAB4_2029       0       13  -0.365105  hypothetical protein
BcerKBAB4_2030       0       14  -0.259368  hypothetical protein
BcerKBAB4_2031       0       20  -1.422539  ankyrin
BcerKBAB4_2032       0       23  -1.692128  hypothetical protein
BcerKBAB4_2033       1       21  -0.375335  resolvase domain-containing protein
BcerKBAB4_2034       4       20  -1.962568  transposase IS116/IS110/IS902 family protein
BcerKBAB4_2035       4       19  -1.306641  hypothetical protein
BcerKBAB4_2036       4       17  -1.443268  hypothetical protein
BcerKBAB4_2037       2       16  -1.057208  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_2027  SYCDCHAPRONE     31  0.017    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 30.7 bits (69), Expect = 0.017
Identities = 18/92 (19%), Positives = 29/92 (31%), Gaps = 13/92 (14%)

Query: 739 YQQASQVLQAAIQKDMKNIELLNQLGIVYYEAGQFYETRDGAKSNAAYQQALDAYNRVVN 798
Y+ A +V QA D + LG GQ Y A+ +Y+
Sbjct: 52 YEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQ-------------YDLAIHSYSYGAI 98

Query: 799 SGTRDINTLVNIGILYDKVGQGNEAEKFFTEA 830
++ + + G+ EAE A
Sbjct: 99 MDIKEPRFPFHAAECLLQKGELAEAESGLFLA 130



Score = 29.5 bits (66), Expect = 0.037
Identities = 20/109 (18%), Positives = 38/109 (34%), Gaps = 6/109 (5%)

Query: 596 REIDKENNEAAYLLASANFRIGKYQEAVLNFEQALANNAKGIEPYKKDAMRDLAVSHMKM 655
EI + E Y LA ++ GKY++A F+ ++ Y L M
Sbjct: 29 NEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCV-----LDHYDSRFFLGLGACRQAM 83

Query: 656 KEFEKAEDVIVKMSTKTNEDKAIVSYLKGQLSTATVQLDKAESFFKEAI 704
+++ A + ++ + + +L +AES A
Sbjct: 84 GQYDLAIHSYSYGAIMDIKE-PRFPFHAAECLLQKGELAEAESGLFLAQ 131


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BcerKBAB4_2028  cloacin  42  7e-06  Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 42.4 bits (99), Expect = 7e-06
Identities = 26/80 (32%), Positives = 29/80 (36%), Gaps = 1/80 (1%)

Query: 166 GTGTEGTGTEGTGTEGTGTGGTGTGGTGTGGTGTGGTGTEGTGTGGTGTEGTGTGGTGTG 225
G G T T G GG G G G + G +E GG G GG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 226 GTGTGGTGTGGTGTGGTGTG 245
G G GG G G G+G G
Sbjct: 63 GNG-GGNGNSGGGSGTGGNL 81



Score = 42.4 bits (99), Expect = 9e-06
Identities = 24/80 (30%), Positives = 32/80 (40%), Gaps = 1/80 (1%)

Query: 162 TGTEGTGTEGTGTEGTGTEGTGTGGTGTGGTGTGGTGTGGTGTE-GTGTGGTGTEGTGTG 220
+G +G G +G G G G GG + G+G G G+G G G+G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 221 GTGTGGTGTGGTGTGGTGTG 240
GG G G G+G G
Sbjct: 62 HGNGGGNGNSGGGSGTGGNL 81



Score = 35.8 bits (82), Expect = 8e-04
Identities = 21/70 (30%), Positives = 29/70 (41%)

Query: 184 TGGTGTGGTGTGGTGTGGTGTEGTGTGGTGTEGTGTGGTGTGGTGTGGTGTGGTGTGGTG 243
+GG G G + +G TG G G G+G + GG+G+G GG+G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 244 TGGTGTGGTG 253
G G G
Sbjct: 62 HGNGGGNGNS 71


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BcerKBAB4_2030  RTXTOXIND  35  5e-04  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 35.2 bits (81), Expect = 5e-04
Identities = 27/170 (15%), Positives = 60/170 (35%), Gaps = 29/170 (17%)

Query: 9 QLEQ-IAKNISEMQTHSQNIQQNLNQSMFSIQMQW--------QGATSQHFY----GEYM 55
Q E + K +E T I + + + A ++H +Y+
Sbjct: 204 QKELNLDKKRAERLTVLARIN-RYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYV 262

Query: 56 RSMRLMESYIRNLQVTEKELRRIAQKFRQADEEYQKKQTEKLKETHKKEKKHEKSWWEKG 115
++ + Y L+ E E+ ++++ + ++ + +KL++T
Sbjct: 263 EAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLE----- 317

Query: 116 IEGAAEFIGVNDAIRAVTGKDPITG--KELS--TKERLIAAGWTLLNFVP 161
E IRA P++ ++L T+ ++ TL+ VP
Sbjct: 318 -LAKNEERQQASVIRA-----PVSVKVQQLKVHTEGGVVTTAETLMVIVP 361


18  BcerKBAB4_2118  BcerKBAB4_2147  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias  Product
BcerKBAB4_2118  3  17  -2.095398  lysine 2,3-aminomutase YodO family protein
BcerKBAB4_2119  1  17  -3.641890  hypothetical protein
BcerKBAB4_2120  0  17  -3.172721  hypothetical protein
BcerKBAB4_2121  -1  16  -2.625813  hypothetical protein
BcerKBAB4_2122  0  13  -1.012448  hypothetical protein
BcerKBAB4_2123  0  13  -0.584289  hypothetical protein
BcerKBAB4_2124  0  11  -0.056828  serine/threonine protein kinase
BcerKBAB4_2125  -1  12  1.628934  SpoOM family protein
BcerKBAB4_2126  -1  16  1.917423  PA-phosphatase like phosphoesterase
BcerKBAB4_2127  0  15  2.073805  cation diffusion facilitator family transporter
BcerKBAB4_2128  2  17  1.839407  thioredoxin domain-containing protein
BcerKBAB4_2130  2  21  0.297026  hypothetical protein
BcerKBAB4_2131  1  18  -1.069072  hypothetical protein
BcerKBAB4_2134  0  18  -2.739864  hypothetical protein
BcerKBAB4_2135  0  17  -3.250556  hypothetical protein
BcerKBAB4_2136  0  18  -3.531849  hypothetical protein
BcerKBAB4_2137  2  13  -3.296360  hypothetical protein
BcerKBAB4_2139  2  16  -3.109413  hypothetical protein
BcerKBAB4_2140  4  18  -3.451254  group-specific protein
BcerKBAB4_2141  4  18  -3.746747  hypothetical protein
BcerKBAB4_2142  3  16  -2.709566  hypothetical protein
BcerKBAB4_2143  1  17  -3.523556  glycoside hydrolase family protein
BcerKBAB4_2144  0  16  -2.705825  peptidyl-prolyl isomerase
BcerKBAB4_2145  1  19  -2.498760  hypothetical protein
BcerKBAB4_2146  0  20  -2.012887  hypothetical protein
BcerKBAB4_2147  2  18  -1.929138  spore germination protein PF
19  BcerKBAB4_2208  BcerKBAB4_2233  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
BcerKBAB4_2208  0  14  -3.579462  inosine/uridine-preferring nucleoside hydrolase
BcerKBAB4_2209  -2  16  -2.654896  hypothetical protein
BcerKBAB4_2210  0  16  -2.427830  hypothetical protein
BcerKBAB4_2211  -1  17  -3.167814  hypothetical protein
BcerKBAB4_2212  -1  18  -3.080055  cyclic nucleotide-binding protein
BcerKBAB4_2213  1  19  -1.326018  hypothetical protein
BcerKBAB4_2214  -1  18  -0.869182  N-acetyltransferase GCN5
BcerKBAB4_2215  0  15  -2.491044  O-methyltransferase domain-containing protein
BcerKBAB4_2216  -1  16  -2.582413  MarR family transcriptional regulator
BcerKBAB4_2217  1  14  -2.464508  N-acetyltransferase GCN5
BcerKBAB4_2218  1  15  -2.801127  beta-lactamase domain-containing protein
BcerKBAB4_2219  0  17  -4.007199  TetR family transcriptional regulator
BcerKBAB4_2220  1  16  -3.735788  MMPL domain-containing protein
BcerKBAB4_2221  4  17  -4.483727  hypothetical protein
BcerKBAB4_2222  1  15  -3.688211  chloramphenicol O-acetyltransferase
BcerKBAB4_2223  1  16  -3.423355  N-acetyltransferase GCN5
BcerKBAB4_2224  1  14  -3.575210  N-acetyltransferase GCN5
BcerKBAB4_2225  1  12  -2.930376  N-acetyltransferase GCN5
BcerKBAB4_2226  0  15  -2.851637  hypothetical protein
BcerKBAB4_2227  -1  14  -3.565127  hypothetical protein
BcerKBAB4_2228  1  16  -4.518104  XRE family transcriptional regulator
BcerKBAB4_2229  0  17  -4.558034  hypothetical protein
BcerKBAB4_2230  2  19  -3.929989  hypothetical protein
BcerKBAB4_2231  1  17  -3.056730  alpha/beta hydrolase
BcerKBAB4_2232  0  17  -3.572329  protoporphyrinogen oxidase
BcerKBAB4_2233  -1  18  -3.506636  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BcerKBAB4_2212  RTXTOXINA  33  0.001  Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 32.6 bits (74), Expect = 0.001
Identities = 14/53 (26%), Positives = 27/53 (50%), Gaps = 4/53 (7%)

Query: 31 VNHFEKGKLICDKDDEIHRLYF-VIKGKVKVYTITPEGKKLILRFINPLAIVG 82
++++E+GK + K DE + F +KG + + +L+F+ PL G
Sbjct: 504 IDYYEEGKRLEKKXDEFQKQVFDPLKGNIDLSDS---KSSTLLKFVTPLLTPG 553


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BcerKBAB4_2216  YERSSTKINASE  27  0.024  Yersinia serine/threonine protein kinase signature.
		>YERSSTKINASE#Yersinia serine/threonine protein kinase signature.

Length = 732

Score = 27.4 bits (60), Expect = 0.024
Identities = 13/56 (23%), Positives = 36/56 (64%)

Query: 11 MNLLLNMSGTFKLLSEKSTEFTHLEQHIVEYIAQQKVAVNLKMIASYLNIPKQQLS 66
+++L+N SG++ ++ +S + + +V++ +Q A++ +M+A++ I Q++S
Sbjct: 622 LSILINRSGSWADVARQSLQRFDSTRPVVKFGTEQYTAIHRQMMAAHAAITLQEVS 677


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BcerKBAB4_2219  HTHTETR  79  3e-20  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 78.5 bits (193), Expect = 3e-20
Identities = 40/190 (21%), Positives = 75/190 (39%), Gaps = 12/190 (6%)

Query: 18 KSTKEIILEVATRLFLTQNYQVVSMDEVAKECGVTKATVYYYYSTKADLFTATMIQMMVR 77
+ T++ IL+VA RLF Q S+ E+AK GVT+ +Y+++ K+DLF+
Sbjct: 10 QETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESN 69

Query: 78 IRENMDQILS-TNKTLEERLLNFATVYLHATMDIDMNNFMKDAKLSLSEEQLKEL----- 131
I E + + L L +T+ + + + + E + E+
Sbjct: 70 IGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEI-IFHKCEFVGEMAVVQQ 128

Query: 132 --KNAEDNMYEVLEKALDNAILLGEIPKG-NPKFAAHAFVALLS--IGNFKDENHNPTLA 186
+N Y+ +E+ L + I +P + AA +S + N+ + L
Sbjct: 129 AQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDLK 188

Query: 187 NIDELAKEIV 196
I+
Sbjct: 189 KEARDYVAIL 198


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BcerKBAB4_2220  ACRIFLAVINRP  53  3e-09  Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 53.3 bits (128), Expect = 3e-09
Identities = 40/232 (17%), Positives = 90/232 (38%), Gaps = 25/232 (10%)

Query: 203 LLVATVLLVLVLLILLYRSPILAILPLLVVGFAYGIISPTLGFLADNGWIKVDAQAISIM 262
L A +L+ LV+ + L ++ ++P + V + T LA G+ +I+ +
Sbjct: 344 LFEAIMLVFLVMYLFL-QNMRATLIPTIAVPVV---LLGTFAILAAFGY------SINTL 393

Query: 263 T----VLLFGAGTDYCLFLISRYKEYLLEEESKYK-ALQLAIKASGGAIIMSALTVVLGL 317
T VL G D + ++ + ++E++ K A + ++ GA++ A+ +
Sbjct: 394 TMFGMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVF 453

Query: 318 GTLLL--AHYGAFHR-FAVPFSVAVFIMGIAALTILPALLLIFGRVVFFPFIPRTAEMNE 374
+ GA +R F++ A+ + + AL + PAL + P +AE +E
Sbjct: 454 IPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLK-------PVSAEHHE 506

Query: 375 EFARKKKKVVKVKNTKGFFSKKLGDIVVRKPWTIIMLTVFLLGGLASFVPRI 426
+ ++ +++ ++ G+ R+
Sbjct: 507 NKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRL 558



Score = 38.7 bits (90), Expect = 1e-04
Identities = 29/161 (18%), Positives = 67/161 (41%), Gaps = 9/161 (5%)

Query: 203 LLVATVLLVLVLLILLYRSPILAILPLLVVGFAYGIISPTLGFLADNGWIKVDAQAISIM 262
L+ + ++V + L LY S + + +LVV I+ L N V +
Sbjct: 875 LVAISFVVVFLCLAALYESWSIPVSVMLVVPLG--IVGVLLAATLFNQKNDVYFM---VG 929

Query: 263 TVLLFGAGTDYCLFLISRYKEYLLEE-ESKYKALQLAIKASGGAIIMSALTVVLGLGTLL 321
+ G + ++ K+ + +E + +A +A++ I+M++L +LG+ L
Sbjct: 930 LLTTIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLA 989

Query: 322 LAH---YGAFHRFAVPFSVAVFIMGIAALTILPALLLIFGR 359
+++ GA + + + + A+ +P ++ R
Sbjct: 990 ISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVIRR 1030


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BcerKBAB4_2224  SACTRNSFRASE  36  4e-05  Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 36.5 bits (84), Expect = 4e-05
Identities = 24/100 (24%), Positives = 41/100 (41%), Gaps = 6/100 (6%)

Query: 49 YSSVEMMKYLIEELD--TYKVIMDEKVIGGIIVTISGKSYGRIDRIFVEPFLQGKGIGSR 106
Y +M +EE + ++ IG I + + Y I+ I V + KG+G+
Sbjct: 50 YEDDDMDVSYVEEEGKAAFLYYLENNCIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTA 109

Query: 107 VIKLIEE---KFPNIRIWDLETSSRQINNHHFYKKMGYEI 143
++ E + + LET I+ HFY K + I
Sbjct: 110 LLHKAIEWAKENHFCGLM-LETQDINISACHFYAKHHFII 148


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BcerKBAB4_2225  SACTRNSFRASE  37  1e-05  Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 37.2 bits (86), Expect = 1e-05
Identities = 25/91 (27%), Positives = 35/91 (38%), Gaps = 11/91 (12%)

Query: 54 FVAEYDGEVVGFVGLTQSPGRRSHSGDLFIGVDSEYHNKGIGKALLTKMLDLADNWLMLE 113
F+ + +G + + + + D I V +Y KG+G AL L A W E
Sbjct: 68 FLYYLENNCIGRIKIRSNWNGYALIED--IAVAKDYRKKGVGTAL----LHKAIEW-AKE 120

Query: 114 RVELGV-LET---NPRAKVLYEKFGFEEEGV 140
G+ LET N A Y K F V
Sbjct: 121 NHFCGLMLETQDINISACHFYAKHHFIIGAV 151


20  BcerKBAB4_2251  BcerKBAB4_2258  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
BcerKBAB4_2251  8  25  0.459846  hypothetical protein
BcerKBAB4_2252  9  24  1.492941  hypothetical protein
BcerKBAB4_2253  7  27  3.530931  lysine exporter protein LysE/YggA
BcerKBAB4_2254  5  26  4.147076  XRE family transcriptional regulator
BcerKBAB4_2255  5  26  1.658596  hypothetical protein
BcerKBAB4_2256  4  26  1.687001  triple helix repeat-containing collagen
BcerKBAB4_2257  4  19  -0.334080  hypothetical protein
BcerKBAB4_2258  2  18  -0.922020  triple helix repeat-containing collagen
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BcerKBAB4_2258  cloacin  39  5e-05  Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 38.5 bits (89), Expect = 5e-05
Identities = 35/134 (26%), Positives = 51/134 (38%), Gaps = 5/134 (3%)

Query: 180 GGATGATGATGAT--GATGATGATGATGATGGGATGATGATGATGATGATGATGATGATG 237
G TGA +G G TG GA+ +G + G+ G +G G
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 238 ATGATGATGATGATGATGATGATGATGATGVTGGGAIIPFASGTTPSALVNALIANTGTL 297
+ G +G G A A A G + GG + ++G +A+ + + A G
Sbjct: 68 NGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMAALKGP- 126

Query: 298 LGFGFSQPGVALTG 311
F F GVAL G
Sbjct: 127 --FKFGLWGVALYG 138



Score = 30.8 bits (69), Expect = 0.012
Identities = 20/83 (24%), Positives = 26/83 (31%), Gaps = 4/83 (4%)

Query: 153 INATPGATGPTGP----TGPTGPTGPTGATGGGATGATGATGATGATGATGATGATGATG 208
IN P G G +G + P G G G +G G + G +G G
Sbjct: 20 INGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGG 79

Query: 209 GGATGATGATGATGATGATGATG 231
+ A A GA G
Sbjct: 80 NLSAVAAPVAFGFPALSTPGAGG 102


21  BcerKBAB4_2274  BcerKBAB4_2349  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
BcerKBAB4_2274  4  14  -0.491926  putative glycerol-3-phosphate acyltransferase
BcerKBAB4_2275  3  15  -0.693556  class V aminotransferase
BcerKBAB4_2276  2  16  1.256018  hypothetical protein
BcerKBAB4_2277  1  15  0.599104  hypothetical protein
BcerKBAB4_2278  0  14  -0.079011  hypothetical protein
BcerKBAB4_2280  0  14  -1.946113  triple helix repeat-containing collagen
BcerKBAB4_2283  -3  14  -3.469902  hypothetical protein
BcerKBAB4_2284  -3  14  -3.209013  DEAD/DEAH box helicase
BcerKBAB4_2285  -2  16  -3.984588  TetR family transcriptional regulator
BcerKBAB4_2286  -2  16  -3.968682  major facilitator transporter
BcerKBAB4_2287  -1  17  -3.876558  hypothetical protein
BcerKBAB4_2288  2  20  -1.940455  ABC transporter
BcerKBAB4_2289  2  16  -2.053585  methyltransferase type 11
BcerKBAB4_2290  1  15  -2.109423  3-demethylubiquinone-9 3-methyltransferase
BcerKBAB4_2291  1  15  -2.400204  ThiJ/PfpI domain-containing protein
BcerKBAB4_2292  1  15  -3.372280  MarR family transcriptional regulator
BcerKBAB4_2293  2  16  -5.043007  phosphoglyceromutase
BcerKBAB4_2294  4  19  -6.989076  SpoIISA like protein
BcerKBAB4_2295  4  25  -8.132512  hypothetical protein
BcerKBAB4_2296  0  16  -4.674132  hypothetical protein
BcerKBAB4_2297  0  15  -4.262777  group-specific protein
BcerKBAB4_2298  -1  14  -2.926762  hypothetical protein
BcerKBAB4_2299  -1  12  -1.959267  hypothetical protein
BcerKBAB4_2300  1  11  -2.365913  hypothetical protein
BcerKBAB4_2301  2  13  -1.852881  aminoacyl-histidine dipeptidase
BcerKBAB4_2302  1  14  -3.238230  hypothetical protein
BcerKBAB4_2303  2  16  -3.706966  hexapaptide repeat-containing transferase
BcerKBAB4_2304  1  14  -3.421265  two component transcriptional regulator
BcerKBAB4_2305  -1  11  -3.275333  histidine kinase
BcerKBAB4_2306  -1  11  -3.414697  ABC transporter
BcerKBAB4_2307  0  11  -3.715013  hypothetical protein
BcerKBAB4_2308  1  11  -3.496896  hypothetical protein
BcerKBAB4_2310  -1  8  -2.868129  ABC transporter
BcerKBAB4_2311  -1  9  -3.295247  hypothetical protein
BcerKBAB4_2312  1  13  -3.990516  hypothetical protein
BcerKBAB4_2313  0  16  -4.036451  two component transcriptional regulator
BcerKBAB4_2314  0  17  -3.968507  histidine kinase
BcerKBAB4_2315  0  15  -2.873071  serine-type D-Ala-D-Ala carboxypeptidase
BcerKBAB4_2316  1  17  -3.154332  invasion protein LagB
BcerKBAB4_2317  1  14  -2.969332  acyltransferase 3
BcerKBAB4_2319  1  14  -2.414994  VanZ family protein
BcerKBAB4_2320  1  11  -1.827357  GDSL family lipase
BcerKBAB4_2321  2  11  -1.924766  penicillin-binding protein transpeptidase
BcerKBAB4_2322  1  14  -2.602501  two component transcriptional regulator
BcerKBAB4_2323  1  14  -3.024509  histidine kinase
BcerKBAB4_2324  2  13  -3.532905  ABC transporter
BcerKBAB4_2325  2  13  -3.135984  hypothetical protein
BcerKBAB4_2326  1  13  -2.547667  hypothetical protein
BcerKBAB4_2327  1  14  -2.887102  ECF subfamily RNA polymerase sigma-24 factor
BcerKBAB4_2328  0  13  -2.402607  ECF-type sigma factor negative effector
BcerKBAB4_2329  0  12  -2.040084  peptidoglycan glycosyltransferase
BcerKBAB4_2330  0  13  -1.733219  penicillin-binding protein transpeptidase
BcerKBAB4_2331  1  13  -1.850143  Beta-lactamase
BcerKBAB4_2332  1  14  -2.416265  hemolytic enterotoxin
BcerKBAB4_2333  2  14  -1.934735  hemolytic enterotoxin
BcerKBAB4_2334  2  16  -2.417116  hemolytic enterotoxin
BcerKBAB4_2335  3  17  -2.897843  S-layer protein
BcerKBAB4_2336  5  19  -3.069625  hypothetical protein
BcerKBAB4_2337  5  19  -3.508130  FAD-dependent pyridine nucleotide-disulfide
BcerKBAB4_2338  4  18  -3.916230  beta-lactamase domain-containing protein
BcerKBAB4_2339  2  18  -2.973987  spore germination B3 GerAC family protein
BcerKBAB4_2340  2  17  -2.550557  spore germination protein
BcerKBAB4_2341  1  15  -1.261576  GerA spore germination protein
BcerKBAB4_2342  0  15  -1.425968  filamentation induced by cAMP protein fic
BcerKBAB4_2343  1  14  -0.260967  ArsR family transcriptional regulator
BcerKBAB4_2344  1  14  -0.715217  alpha/beta fold family hydrolase
BcerKBAB4_2345  2  17  -1.985967  TetR family transcriptional regulator
BcerKBAB4_2346  2  17  -1.970378  chitin-binding domain-containing protein
BcerKBAB4_2347  1  17  -3.029165  phosphatidylinositol-specific phospholipase C X
BcerKBAB4_2348  0  18  -2.657562  hypothetical protein
BcerKBAB4_2349  -2  19  -3.340600  metallophosphoesterase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BcerKBAB4_2275  AUTOINDCRSYN  29  0.042  Autoinducer synthesis protein signature.
		>AUTOINDCRSYN#Autoinducer synthesis protein signature.

Length = 216

Score = 28.7 bits (64), Expect = 0.042
Identities = 10/52 (19%), Positives = 23/52 (44%), Gaps = 1/52 (1%)

Query: 15 ESIHKLNYKTFVEEIPQHEETKDRVRIDRFHEENT-YLICLDDDKLVGMVAL 65
+ L +TF + + + D + D++ NT YL + D+ ++ +
Sbjct: 18 GELFTLRKETFKDRLNWAVQCTDGMEFDQYDNNNTTYLFGIKDNTVICSLRF 69


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BcerKBAB4_2285  HTHTETR  69  1e-16  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 68.9 bits (168), Expect = 1e-16
Identities = 35/173 (20%), Positives = 67/173 (38%), Gaps = 13/173 (7%)

Query: 9 ERRNEILETAERLFVTKGYTKTTVNDILKEIGIAKGTFYHYFKSKEEVMDEII----MRI 64
E R IL+ A RLF +G + T++ +I K G+ +G Y +FK K ++ EI I
Sbjct: 11 ETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNI 70

Query: 65 IKADVAKAKAIVSNPNIPVLDKLFRVLMEQSPKSGDVKDKMIEQFHQPNNA-EMHQKSIV 123
+ ++ +P + VL ++ ++E + + M FH+ EM
Sbjct: 71 GELELEYQAKFPGDP-LSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQA 129

Query: 124 QSIIHLSPV--LAEILEQGIAEGIFSTPY-PQETIELLLSSAQVIFDEGLFQW 173
Q + L + + L+ I + + ++ + W
Sbjct: 130 QRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGY----ISGLMENW 178


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BcerKBAB4_2286  TCRTETA  42  3e-06  Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 42.1 bits (99), Expect = 3e-06
Identities = 53/344 (15%), Positives = 126/344 (36%), Gaps = 26/344 (7%)

Query: 39 DITGRADIFAGLYAVTSIPFLLAPLGGAIADRFNRRDLMVIFDFINAAIVLSFIVLLFTG 98
D+T I LYA+ F AP+ GA++DRF RR +++ AA+ + ++
Sbjct: 40 DVTAHYGILLALYALMQ--FACAPVLGALSDRFGRR-PVLLVSLAGAAV--DYAIMATAP 94

Query: 99 PVSIILIGTIMFLLAVVSAMYSPVVMASIPQLVPENKLEQANGIVNGVQSLSNIVAPVFG 158
+ ++ IG I +A ++ V A I + ++ + G ++ + PV G
Sbjct: 95 FLWVLYIGRI---VAGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLG 151

Query: 159 GILYGIIGLKMLVIISCLAFFLSAILEMFIKIPFIKRARESHIIPTIVKDMKEGFIYVLK 218
G++ G + L+ + F+ +P + + + +
Sbjct: 152 GLM-GGFSPHAPFFAAAALNGLNFLTGCFL-LPESHKGERRPLRREALNPLASFRWARGM 209

Query: 219 QPFI--LKSMLLAALLNLILTPLFVVGAPIIIRVTMESSDTLYGIGMGLIDFATILGALS 276
+ + L+ + L+V+ R +++ + I ++ A+
Sbjct: 210 TVVAALMAVFFIMQLVGQVPAALWVIFG--EDRFHWDATTIGISLAAFGI-LHSLAQAMI 266

Query: 277 IGFFAKKLQMKTLYYWMLLIALLVLPMALSVTPVILNLGYYPPFILFILSSILIAMIVTI 336
G A +L + ++ + T + +P +L I + + +
Sbjct: 267 TGPVAARLGERRALMLGMIADGTGYILLAFATRGWM---AFPIMVLLASGGIGMPALQAM 323

Query: 337 VSIYVITVVQKKTPNENLGKVMAIITAVSQCMAPIGQVIYGFMF 380
+S V E G++ + A++ + +G +++ ++
Sbjct: 324 LSRQV--------DEERQGQLQGSLAALTSLTSIVGPLLFTAIY 359


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BcerKBAB4_2304  HTHFIS  70  5e-16  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 69.9 bits (171), Expect = 5e-16
Identities = 29/117 (24%), Positives = 54/117 (46%), Gaps = 1/117 (0%)

Query: 3 TIMIVEDDIKIAELLGTHIEKYGYQSVMIEDFENVLDTFQKIKPDLVLLDVNLPNFDGYY 62
TI++ +DD I +L + + GY + + + DLV+ DV +P+ + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 63 WCRQIRAI-STCPIVFLSARSGEMDQVMALENGGDDYITKPFYYEVVMSKIRSQLRR 118
+I+ P++ +SA++ M + A E G DY+ KPF ++ I L
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BcerKBAB4_2305  PF06580  36  2e-04  Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 35.6 bits (82), Expect = 2e-04
Identities = 21/111 (18%), Positives = 40/111 (36%), Gaps = 26/111 (23%)

Query: 221 RFILNQVLSNAIKYSSSKKE---KVIVSAYSKGRAIILEVRDYGVGIPTADLPRVFHPFY 277
++ ++ N IK+ ++ K+++ + LEV + G
Sbjct: 257 PMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNT--------- 307

Query: 278 TGENGRKFKESTGMGLYLVKE----VCGKLNHKIELESEVDKGTIVRIIFP 324
KESTG GL V+E + G +I+L + ++ P
Sbjct: 308 --------KESTGTGLQNVRERLQMLYGT-EAQIKLSEK-QGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BcerKBAB4_2313  HTHFIS  100  8e-27  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 100 bits (251), Expect = 8e-27
Identities = 33/118 (27%), Positives = 64/118 (54%), Gaps = 1/118 (0%)

Query: 3 SKILIVDDDKEIRNLISVYLENEGLKTQKAEDAIEALKLLEEKKFDLIILDIMMPNMDGI 62
+ IL+ DDD IR +++ L G + +A + + DL++ D++MP+ +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 63 EACMKIREER-NMPIIMLSAKSEDIDKIQGLASGADDYLSKPFNPLELIARVKSQLRR 119
+ +I++ R ++P++++SA++ + I+ GA DYL KPF+ ELI + L
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BcerKBAB4_2315  BLACTAMASEA  55  1e-10  Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 55.2 bits (133), Expect = 1e-10
Identities = 31/160 (19%), Positives = 56/160 (35%), Gaps = 16/160 (10%)

Query: 5 RFITIVTVLTLFCSMAVPFGRASA-ETVPAIDVEAGSAI---LVEANSGKILYQKNADES 60
R+I + + L E + + + + ++ SG+ L ADE
Sbjct: 2 RYIRLCIISLLATLPLAVHASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRADER 61

Query: 61 LAIASMTKMMSEYLVHEAVDKGKLKWDQKVKISEYAYKISQDRSLSNVPLEN---GGSYT 117
+ S K++ V VD G + ++K+ Q + P+ T
Sbjct: 62 FPMMSTFKVVLCGAVLARVDAGDEQLERKI-------HYRQQDLVDYSPVSEKHLADGMT 114

Query: 118 VKELYEAMVIYSANGATIALAEEIAG-KEVN-FVKMMNDK 155
V EL A + S N A L + G + F++ + D
Sbjct: 115 VGELCAAAITMSDNSAANLLLATVGGPAGLTAFLRQIGDN 154


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BcerKBAB4_2322  HTHFIS  76  6e-18  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 75.6 bits (186), Expect = 6e-18
Identities = 35/138 (25%), Positives = 61/138 (44%), Gaps = 3/138 (2%)

Query: 3 KIMIVEDDQKISKLLQSHINKYGYEGNITEDFENILANFEKIQPDLVLLDVNLPSFDGFY 62
I++ +DD I +L +++ GY+ IT + + DLV+ DV +P + F
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 63 WCRQIRTV-STCPIIFISARSGEMEQVMALEHGADDYITKPFHYEVVMAKIRSHLRRIYG 121
+I+ P++ +SA++ M + A E GA DY+ KPF ++ I L
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 122 EYAPKAEERVVQQSGLIL 139
P E Q ++
Sbjct: 125 R--PSKLEDDSQDGMPLV 140


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BcerKBAB4_2323  PF06580  44  4e-07  Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 44.1 bits (104), Expect = 4e-07
Identities = 22/108 (20%), Positives = 42/108 (38%), Gaps = 23/108 (21%)

Query: 221 RFVIHQVISNAIKYSAGSRKN---VTITTLEEERSVILEIHDHGVGIPKEDLPRVFRPFY 277
++ ++ N IK+ + + ++ +V LE+ + G K
Sbjct: 257 PMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNT--------- 307

Query: 278 TGINGRKFKESTGMGLYLVKNIIKRL---EHDIEIKSEVGKGTTICII 322
KESTG GL V+ ++ L E I++ + GK + +I
Sbjct: 308 --------KESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLI 347


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BcerKBAB4_2331  BLACTAMASEA  382  e-136  Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 382 bits (982), Expect = e-136
Identities = 97/263 (36%), Positives = 145/263 (55%), Gaps = 3/263 (1%)

Query: 47 HKNHATYKEFSQLEKKFDARLGVYAIDTGTNRTI-AYRPNERFAFASTYKALAAGVLLQQ 105
H + ++ E + R+G+ +D + RT+ A+R +ERF ST+K + G +L +
Sbjct: 20 HASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRADERFPMMSTFKVVLCGAVLAR 79

Query: 106 NSIDK--LNEVITYTKDDLVEYSPITEKHVDTGMKLGEIAEAAVRSSDNTAGNILFNKIG 163
L I Y + DLV+YSP++EKH+ GM +GE+ AA+ SDN+A N+L +G
Sbjct: 80 VDAGDEQLERKIHYRQQDLVDYSPVSEKHLADGMTVGELCAAAITMSDNSAANLLLATVG 139

Query: 164 GPKGYEKALRQMGDRVTIADRFEPELNEATPGDIRDTSTAKAIATNLKAFTVGNALPADK 223
GP G LRQ+GD VT DR+E ELNEA PGD RDT+T ++A L+ L A
Sbjct: 140 GPAGLTAFLRQIGDNVTRLDRWETELNEALPGDARDTTTPASMAATLRKLLTSQRLSARS 199

Query: 224 RKVLTEWMKGNATGDKLIRAGVPTDWIVGDKSGAGSYGTRNDIAIVWPPNRAPIIIAILS 283
++ L +WM + LIR+ +P W + DK+GAG G R +A++ P N+A I+ I
Sbjct: 200 QRQLLQWMVDDRVAGPLIRSVLPAGWFIADKTGAGERGARGIVALLGPNNKAERIVVIYL 259

Query: 284 SKDEKEAAYDNQLIAEATEVIVK 306
A NQ IA +++
Sbjct: 260 RDTPASMAERNQQIAGIGAALIE 282


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BcerKBAB4_2345  HTHTETR  44  7e-08  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 44.2 bits (104), Expect = 7e-08
Identities = 13/53 (24%), Positives = 28/53 (52%)

Query: 10 TKKNISNTLITLLNEKEFGRITTKDICIHAMVSKSTFYSHFADKYDLLEKLVQ 62
T+++I + + L +++ + +I A V++ Y HF DK DL ++ +
Sbjct: 12 TRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWE 64


22  BcerKBAB4_2361  BcerKBAB4_2403  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
BcerKBAB4_2361  4  16  1.355491  AMP-dependent synthetase and ligase
BcerKBAB4_2362  6  28  -0.054595  group-specific protein
BcerKBAB4_2363  5  26  -1.295415  hypothetical protein
BcerKBAB4_2364  2  21  -3.898328  septum formation initiator
BcerKBAB4_2365  1  20  -3.088849  hypothetical protein
BcerKBAB4_2366  1  19  -4.297122  hypothetical protein
BcerKBAB4_2367  -1  16  -4.518204  hypothetical protein
BcerKBAB4_2368  -1  16  -4.678869  N-acetyltransferase GCN5
BcerKBAB4_2369  0  17  -5.166206  hypothetical protein
BcerKBAB4_2370  -1  16  -4.455712  DinB family protein
BcerKBAB4_2371  -1  15  -5.326588  serine-type D-Ala-D-Ala carboxypeptidase
BcerKBAB4_2372  -1  16  -4.651531  histidine kinase
BcerKBAB4_2373  -1  16  -4.492548  two component transcriptional regulator
BcerKBAB4_2375  0  12  -4.626383  hypothetical protein
BcerKBAB4_2376  0  11  -4.367086  S-adenosylhomocysteine nucleosidase
BcerKBAB4_2378  1  12  -5.365460  N-acetyltransferase GCN5
BcerKBAB4_2379  1  12  -4.911812  hypothetical protein
BcerKBAB4_2380  1  13  -5.381145  activator of Hsp90 ATPase 1 family protein
BcerKBAB4_2381  1  13  -4.876814  ABC transporter
BcerKBAB4_2382  1  16  -4.602887  PadR-like family transcriptional regulator
BcerKBAB4_2383  1  16  -3.926827  hypothetical protein
BcerKBAB4_2384  1  16  -3.207914  beta-lactamase
BcerKBAB4_2385  1  14  -2.407708  NUDIX hydrolase
BcerKBAB4_2386  1  14  -2.154812  short-chain dehydrogenase/reductase SDR
BcerKBAB4_2387  1  14  -2.061852  HxlR family transcriptional regulator
BcerKBAB4_2388  0  15  -1.359605  methyltransferase type 11
BcerKBAB4_2389  1  15  -1.187272  alpha/beta hydrolase
BcerKBAB4_2390  2  16  -1.683416  glyoxalase/bleomycin resistance
BcerKBAB4_2391  1  14  -2.662120  aminoglycoside phosphotransferase
BcerKBAB4_2392  1  14  -2.878268  TetR family transcriptional regulator
BcerKBAB4_2393  -1  12  -3.547013  dihydrolipoamide dehydrogenase
BcerKBAB4_2394  0  13  -3.704623  cupin
BcerKBAB4_2396  -1  14  -4.162677  AraC family transcriptional regulator
BcerKBAB4_2397  -2  13  -3.974424  M3 family oligoendopeptidase
BcerKBAB4_2398  -2  15  -3.581204  hypothetical protein
BcerKBAB4_2399  -2  18  -3.234962  transcriptional regulator
BcerKBAB4_2400  -1  18  -2.825854  glyoxalase/bleomycin resistance
BcerKBAB4_2402  -1  16  -4.371962  abortive infection protein
BcerKBAB4_2403  -1  18  -3.353630  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BcerKBAB4_2364  BORPETOXINA  28  0.009  Bordetella pertussis toxin A subunit signature.
		>BORPETOXINA#Bordetella pertussis toxin A subunit signature.

Length = 269

Score = 27.8 bits (61), Expect = 0.009
Identities = 12/41 (29%), Positives = 23/41 (56%)

Query: 8 SSQQSNPKIPSQQVNANTKQGNNKKTRRRIITTLAFILPII 48
+++ SN + SQQ AN ++++ I+ TL + P+I
Sbjct: 192 TTEYSNARYVSQQTRANPNPYTSRRSVASIVGTLVRMAPVI 232


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BcerKBAB4_2368  SACTRNSFRASE  30  0.005  Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 29.5 bits (66), Expect = 0.005
Identities = 21/110 (19%), Positives = 37/110 (33%), Gaps = 7/110 (6%)

Query: 36 TYPLNEQQLEKYTESANTLAFKIMDEETKAVIGHISLGQIDNINKSARIGKVLVGNTNMR 95
Y ++ + E ++ IG I + N N A I + V R
Sbjct: 49 QYEDDDMDVSYVEEEGKAAFLYYLENN---CIGRIKIRS--NWNGYALIEDIAVAKDY-R 102

Query: 96 GRSIGKHMMKAVLRIAFDELKLHRVTLGVYDFNKSAISCYERIGFVKEGL 145
+ +G ++ + A E + L D N SA Y + F+ +
Sbjct: 103 KKGVGTALLHKAIEWA-KENHFCGLMLETQDINISACHFYAKHHFIIGAV 151


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BcerKBAB4_2371  BLACTAMASEA  37  9e-05  Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 36.7 bits (85), Expect = 9e-05
Identities = 32/159 (20%), Positives = 61/159 (38%), Gaps = 19/159 (11%)

Query: 39 ILIDTNSGEIV--YKKNEETPIQSATLSKLMTEYIALEQLNEGKIQLDELVKISNEVFRA 96
I +D SG + ++ +E P+ S K++ L +++ G QL+ + +
Sbjct: 43 IEMDLASGRTLTAWRADERFPMMST--FKVVLCGAVLARVDAGDEQLERKIHYRQQDLV- 99

Query: 97 ETSPIQVTSKDKT-TVRDLLHALFLTGNNRSALALAEHIAGNEDNFTLLMNDKAKQL--- 152
+ SP+ TV +L A +N +A L + G + +Q+
Sbjct: 100 DYSPVSEKHLADGMTVGELCAAAITMSDNSAANLLLATVGGPAG-----LTAFLRQIGDN 154

Query: 153 --KLSQPSPFLNATGINNDTNKQSTTTSIDAAKLATQLV 189
+L + LN + D + TTT A +L+
Sbjct: 155 VTRLDRWETELN-EALPGD--ARDTTTPASMAATLRKLL 190


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BcerKBAB4_2373  HTHFIS  92  8e-24  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 92.2 bits (229), Expect = 8e-24
Identities = 36/131 (27%), Positives = 65/131 (49%), Gaps = 2/131 (1%)

Query: 5 ILIIDDDKDIVELLAVYLRNEGYNIYKAYDGDEALQMICTYEVDLMLLDIMMPKRNGLEV 64
IL+ DDD I +L L GY++ + + I + DL++ D++MP N ++
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 65 CQEVRE-NNTVPILMLSAKAEDMDKILGLMTGADDYMIKPFNPLELVARV-KALLRRSSF 122
+++ +P+L++SA+ M I GA DY+ KPF+ EL+ + +AL
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRR 125

Query: 123 QNATSQKNEDG 133
+ ++DG
Sbjct: 126 PSKLEDDSQDG 136


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BcerKBAB4_2378  SACTRNSFRASE  41  9e-07  Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 41.5 bits (97), Expect = 9e-07
Identities = 25/90 (27%), Positives = 35/90 (38%), Gaps = 7/90 (7%)

Query: 47 NSLEHSFLALDGDKCVGVIL--GGIKVYESIKTMRCGTLAVHPEFRGIGVSQKLFELHKE 104
+ +FL + C+G I Y I+ +AV ++R GV L E
Sbjct: 62 EEGKAAFLYYLENNCIGRIKIRSNWNGYALIED-----IAVAKDYRKKGVGTALLHKAIE 116

Query: 105 EATQNKCKQLFLEVIVGNDRAIHFYNKLGY 134
A +N L LE N A HFY K +
Sbjct: 117 WAKENHFCGLMLETQDINISACHFYAKHHF 146



Score = 40.3 bits (94), Expect = 2e-06
Identities = 33/136 (24%), Positives = 54/136 (39%), Gaps = 17/136 (12%)

Query: 150 KIMNKDNKKIEVKQLEFPTFKVEI---------QKWLNFHINWQNDIDYIEKTNHTFYGA 200
K NK N+ V P F+ + + + + + D+ Y+E+ +
Sbjct: 11 KDFNKPNEPFVVFGRMIPAFENGVWTYTEERFSKPYFKQYEDDDMDVSYVEEEGKAAFLY 70

Query: 201 YVDSDIKGSVCV----NEQGKISFIFIDKDYRNIGVGTKLLQVASE---ELKLSSLSIGF 253
Y++++ G + + N I I + KDYR GVGT LL A E E L +
Sbjct: 71 YLENNCIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLET 130

Query: 254 PNNNLLK-GFLKKSGF 268
+ N+ F K F
Sbjct: 131 QDINISACHFYAKHHF 146


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BcerKBAB4_2386DHBDHDRGNASE761e-18 2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 75.9 bits (186), Expect = 1e-18
Identities = 66/261 (25%), Positives = 107/261 (40%), Gaps = 28/261 (10%)

Query: 1 MSITTLKDTRIVIIGGSSGIGLVTVKQALEQGAHVIIAGRSEEKLK--ISRELINNNHLQ 58
M+ ++ I G + GIG + QGAH+ + EKL+ +S H +
Sbjct: 1 MNAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAE 60

Query: 59 TYVLD----NQNKEQLQDFFKTVGNFDHLFTPGASYTLGPI-TATEEIAESSFIGKFWPQ 113
+ D E + +G D L G I + ++E E++F
Sbjct: 61 AFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGV 120

Query: 114 YYAVKYAIPFLSN--SGSIVLMSGAFSQRPLKGAPAYGACNGAIESLGKALAVELAP--I 169
+ A + ++ + SGSIV + + P AY + A K L +ELA I
Sbjct: 121 FNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNI 180

Query: 170 RVNVVSPGTIRRENEQS--------EKRLAAY-EDYKS---LSLVQRPGYNDEIAHTVLY 217
R N+VSPG+ + + S E+ + E +K+ L + +P +IA VL+
Sbjct: 181 RCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPS---DIADAVLF 237

Query: 218 LM--QNGFTTGNVLFPDGGYT 236
L+ Q G T + L DGG T
Sbjct: 238 LVSGQAGHITMHNLCVDGGAT 258


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_2392  HTHTETR       79     2e-20    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 78.5 bits (193), Expect = 2e-20
Identities = 39/170 (22%), Positives = 77/170 (45%), Gaps = 2/170 (1%)

Query: 2 RKGELTKRMILDRSSALFNVKGYSGSSISDIMRETGLEKGGIYRHFKNKDDLAVQAFQHA 61
++ + T++ ILD + LF+ +G S +S+ +I + G+ +G IY HFK+K DL + ++ +
Sbjct: 7 QEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELS 66

Query: 62 TSQMGVRYVEEIKKASNTLDKL--KTFISVFTSLIHNDPLPGGCPIMNVTLEADDSHPLM 119
S +G +E K + + I V S + + I+ E ++
Sbjct: 67 ESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVV 126

Query: 120 AEQAQIAMNQLLGIIEKIITYGIEQGELKPDTQAKQFAIIWISSLEGALA 169
+ + + IE+ + + IE L D ++ AII + G +
Sbjct: 127 QQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLME 176


23  BcerKBAB4_2434  BcerKBAB4_2452  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_2434  2       22       1.588667   thiamine pyrophosphate binding domain-containing
BcerKBAB4_2435  2       17       -2.907596  hypothetical protein
BcerKBAB4_2436  -1      17       -2.858934  hypothetical protein
BcerKBAB4_2437  -3      17       -3.124752  N-acetyltransferase GCN5
BcerKBAB4_2438  -3      16       -2.245088  excinuclease ABC subunit C
BcerKBAB4_2439  -2      15       -2.231544  hypothetical protein
BcerKBAB4_2440  -1      13       -2.280063  undecaprenyl pyrophosphate phosphatase
BcerKBAB4_2441  0       16       -2.534743  hypothetical protein
BcerKBAB4_2442  1       18       -3.807192  N-acetyltransferase GCN5
BcerKBAB4_2443  1       18       -3.523556  metal-dependent hydrolase
BcerKBAB4_2444  0       16       -3.550641  N-acetyltransferase GCN5
BcerKBAB4_2445  -1      16       -3.592207  hypothetical protein
BcerKBAB4_2446  0       17       -3.478644  patatin
BcerKBAB4_2447  -1      17       -3.540057  hypothetical protein
BcerKBAB4_2448  0       13       -3.791412  hypothetical protein
BcerKBAB4_2449  -1      13       -3.145919  putative esterase
BcerKBAB4_2450  -2      14       -3.717996  hypothetical protein
BcerKBAB4_2451  -1      13       -3.342969  hypothetical protein
BcerKBAB4_2452  0       15       -3.635874  endoribonuclease L-PSP
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_2444  SACTRNSFRASE  26     0.047    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 26.4 bits (58), Expect = 0.047
Identities = 19/97 (19%), Positives = 35/97 (36%), Gaps = 7/97 (7%)

Query: 43 ERTISRINNLDGGFYKIFVNTNLVGAICISRKEEASKFWISPMFIIPNYQGNGIAQKVLI 102
+ +S + + ++ N +G I I R I + + +Y+ G+ +L
Sbjct: 54 DMDVSYVEEEGKAAFLYYLENNCIGRIKI-RSNWNGYALIEDIAVAKDYRKKGVGTALLH 112

Query: 103 LIEEMFPEITNWELATILEEERN----CFLYEKMGYI 135
E E N +LE + C Y K +I
Sbjct: 113 KAIEWAKE--NHFCGLMLETQDINISACHFYAKHHFI 147


24  BcerKBAB4_2462  BcerKBAB4_2529  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_2462  2       16       -0.749401  hypothetical protein
BcerKBAB4_2463  3       16       -1.683255  NUDIX hydrolase
BcerKBAB4_2465  3       14       -1.916589  hypothetical protein
BcerKBAB4_2466  5       15       -2.452359  N-acetyltransferase GCN5
BcerKBAB4_2467  5       16       -3.102168  inosine/uridine-preferring nucleoside hydrolase
BcerKBAB4_2468  4       16       -3.598296  hypothetical protein
BcerKBAB4_2469  3       18       -3.978480  transcriptional regulator
BcerKBAB4_2470  -1      16       -5.126120  hypothetical protein
BcerKBAB4_2471  -1      17       -4.768494  hypothetical protein
BcerKBAB4_2472  -1      17       -3.712530  glycerophosphodiester phosphodiesterase
BcerKBAB4_2473  -2      16       -1.772697  N-acetyltransferase GCN5
BcerKBAB4_2474  -2      16       -1.843744  hypothetical protein
BcerKBAB4_2475  -2      15       -1.678801  hypothetical protein
BcerKBAB4_2476  0       15       -1.667352  HxlR family transcriptional regulator
BcerKBAB4_2477  2       16       -2.341442  NmrA family protein
BcerKBAB4_2480  2       16       -3.055100  ABC transporter
BcerKBAB4_2481  3       17       -3.887385  DegV family protein
BcerKBAB4_2482  2       13       -3.087253  hypothetical protein
BcerKBAB4_2483  2       14       -3.270206  hypothetical protein
BcerKBAB4_2484  3       15       -3.037765  VanZ family protein
BcerKBAB4_2486  0       15       -2.871225  penicillin-binding protein
BcerKBAB4_2487  0       13       -2.638961  major facilitator transporter
BcerKBAB4_2488  -1      14       -3.063850  peptidoglycan glycosyltransferase
BcerKBAB4_2489  0       17       -3.688886  PA-phosphatase like phosphoesterase
BcerKBAB4_2490  1       17       -3.737501  alcohol dehydrogenase
BcerKBAB4_2491  0       17       -4.875694  histidine kinase
BcerKBAB4_2492  3       19       -5.017869  two component transcriptional regulator
BcerKBAB4_2493  6       19       -5.802006  hypothetical protein
BcerKBAB4_2494  4       16       -4.695619  hypothetical protein
BcerKBAB4_2495  4       15       -4.307713  hypothetical protein
BcerKBAB4_2496  2       15       -4.229839  histidine kinase
BcerKBAB4_2497  2       13       -3.279566  hypothetical protein
BcerKBAB4_2498  2       14       -3.682459  cobalt transport protein
BcerKBAB4_2499  2       14       -2.795359  ABC transporter
BcerKBAB4_2500  1       13       -3.204169  hypothetical protein
BcerKBAB4_2501  0       14       -3.759722  aspartate racemase
BcerKBAB4_2502  0       15       -3.788514  glycosyl transferase family protein
BcerKBAB4_2503  0       16       -3.782964  signal peptidase I
BcerKBAB4_2504  0       15       -3.460547  peptidoglycan glycosyltransferase
BcerKBAB4_2505  -2      17       -4.058079  PAS/PAC sensor signal transduction histidine
BcerKBAB4_2506  -2      18       -3.643070  cof family hydrolase
BcerKBAB4_2507  -1      16       -2.780564  cysteine dioxygenase type I
BcerKBAB4_2508  -2      14       -2.383028  cytochrome P450
BcerKBAB4_2509  -1      13       -1.359100  PadR-like family transcriptional regulator
BcerKBAB4_2510  0       14       -1.511528  hypothetical protein
BcerKBAB4_2511  0       15       -2.005857  virginiamycin A acetyltransferase
BcerKBAB4_2512  -2      16       -2.042897  major facilitator transporter
BcerKBAB4_2513  -2      17       -2.534106  cytochrome P450
BcerKBAB4_2514  0       18       -3.944321  hypothetical protein
BcerKBAB4_2515  -1      19       -5.386023  hypothetical protein
BcerKBAB4_2516  2       19       -5.531595  hypothetical protein
BcerKBAB4_2517  1       19       -5.762899  group-specific protein
BcerKBAB4_2518  1       18       -5.155615  hypothetical protein
BcerKBAB4_2519  0       17       -4.588655  hypothetical protein
BcerKBAB4_2520  1       15       -3.955439  hypothetical protein
BcerKBAB4_2521  1       16       -3.718754  hypothetical protein
BcerKBAB4_2522  1       16       -3.402164  hypothetical protein
BcerKBAB4_2523  0       14       -2.111785  TenA family transcription regulator
BcerKBAB4_2524  0       14       -2.657042  D-alanine--D-alanine ligase
BcerKBAB4_2525  -2      15       -3.425919  transcriptional regulator
BcerKBAB4_2526  -2      15       -3.599277  homoserine dehydrogenase
BcerKBAB4_2527  -2      15       -4.060505  lipase, putative
BcerKBAB4_2528  -2      16       -5.000020  hypothetical protein
BcerKBAB4_2529  -2      13       -4.286785  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_2470  UREASE        27     0.037    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 27.4 bits (61), Expect = 0.037
Identities = 13/39 (33%), Positives = 17/39 (43%), Gaps = 8/39 (20%)

Query: 62 TIPIRVDTLHSDLDIVMEVHNYD--------FFEQEIRS 92
T P V+TL LD++M H+ F E IR
Sbjct: 303 TRPYTVNTLAEHLDMLMVCHHLSPTIPEDIAFAESRIRK 341


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_2473  SACTRNSFRASE  28     0.004    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 28.0 bits (62), Expect = 0.004
Identities = 14/59 (23%), Positives = 26/59 (44%), Gaps = 3/59 (5%)

Query: 27 IYDIATKEEMRGKGFGSTMFNFLLQEAKQLKNTYCVLQASPD---GINIYKKSGFQVVG 82
I DIA ++ R KG G+ + + ++ AK+ +L+ + Y K F +
Sbjct: 92 IEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFIIGA 150


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_2477  NUCEPIMERASE  40     5e-06    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 40.2 bits (94), Expect = 5e-06
Identities = 25/129 (19%), Positives = 47/129 (36%), Gaps = 26/129 (20%)

Query: 1 MKVLVTGANGNLGSKIVEYLLTRLSIEEIIVGVRDD---------KSEKALSYKEQGLEV 51
MK LVTGA G +G + + LL +VG+ D+ K + + G +
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEA---GHQVVGI-DNLNDYYDVSLKQARLELLAQPGFQF 56

Query: 52 RVTDFENQETLFSAFKDVD-------------RLFIMSTFGDFDTVIRQHTNAVEAAKAT 98
D ++E + F R + + D+ + N +E +
Sbjct: 57 HKIDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHN 116

Query: 99 GVKQIIYPS 107
++ ++Y S
Sbjct: 117 KIQHLLYAS 125


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_2487  TCRTETA       44     7e-07    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 44.0 bits (104), Expect = 7e-07
Identities = 69/392 (17%), Positives = 140/392 (35%), Gaps = 28/392 (7%)

Query: 13 LLLSGVGIANLGAWIYLIALNVLVYNMGGSALAVA---TLYVIKPLATLFTNAWSGSMID 69
++LS V + +G + + L L+ ++ S A L + L G++ D
Sbjct: 9 VILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSD 68

Query: 70 RLNKRKLMIHLDIYRVVFIAILPLVPSLWIVYLLVFFISMASAIYEPTAMTYMTKLIPVE 129
R +R +++ V AI+ P LW++Y+ + A A Y+ + +
Sbjct: 69 RFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATG-AVAGAYIADITDGD 127

Query: 130 QRQRFNSLRSLIGSGAFLIGPAVAGILLITGTPE---FAIYMNAIAFLLSGFITLLLPNL 186
+R R S + GP + G++ A +N + FL F LLP
Sbjct: 128 ERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCF---LLPES 184

Query: 187 DKKDDSNTSNDKFSLTVLKKDWNIVLNFSEKSMYIVCVYFLFQGMMVLATAIDSLELSFA 246
K + + + + + + +F M ++ +L + F
Sbjct: 185 HKGERRPLRREALNPLA-----SFRWARGMTVVAA--LMAVFFIMQLVGQVPAALWVIFG 237

Query: 247 KEVLLLTDSEYGFLVSIAGAGFILGAITNTILSKKLAPSF----LIGIGSLFIAIGYLIY 302
++ + G S+A G IL ++ +++ +A + +G + GY++
Sbjct: 238 EDRFHWDATTIGI--SLAAFG-ILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILL 294

Query: 303 AFSSEFLIAAIGFFILSFSMAYANTGFYTFYQNNVPVHMMGRIGSIYGLIIAVLTIFITI 362
AF++ +A +L+ V G++ + ++ +I +
Sbjct: 295 AFATRGWMAFPIMVLLASG-GIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPL 353

Query: 363 LFGV---ATQFTSMQLVVIVGVLVMLLITIIL 391
LF A+ T I G + LL L
Sbjct: 354 LFTAIYAASITTWNGWAWIAGAALYLLCLPAL 385


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_2492  HTHFIS        86     9e-22    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 86.4 bits (214), Expect = 9e-22
Identities = 30/125 (24%), Positives = 60/125 (48%), Gaps = 1/125 (0%)

Query: 3 HILIIEDEESLADFLELELKYEGYIVDIQLDGRKGLEAALEKNYDLILLDLMLPGLNGLE 62
IL+ +D+ ++ L L GY V I + + DL++ D+++P N +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 63 VCRRLRAT-KSTPIIMLTARDSIMDRVTGLDSGADDYLPKPFAIEELLARMRVIFRREEN 121
+ R++ P+++++A+++ M + + GA DYLPKPF + EL+ + +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 122 TEQKH 126
K
Sbjct: 125 RPSKL 129


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_2499  HTHFIS        29     0.040    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 29.4 bits (66), Expect = 0.040
Identities = 7/18 (38%), Positives = 12/18 (66%)

Query: 35 VLIAGRSGSGKSTLAHCI 52
++I G SG+GK +A +
Sbjct: 163 LMITGESGTGKELVARAL 180


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_2505  PF06580       37     1e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 37.2 bits (86), Expect = 1e-04
Identities = 18/102 (17%), Positives = 39/102 (38%), Gaps = 18/102 (17%)

Query: 474 QVFI-NILQNSIEAMLDGGKISIHIKEIHKKGVIISVIDEGIGIPEERIKRLGEPFYSTK 532
Q + N +++ I + GGKI + + + V + V + G +
Sbjct: 261 QTLVENGIKHGIAQLPQGGKILLKGTKDNGT-VTLEVENTGSLALKN------------T 307

Query: 533 EKGTGIGLMLSY---KIIEGHQGTISIMSEVGVGTTVTIYLP 571
++ TG GL +++ G + I + + G + +P
Sbjct: 308 KESTGTGLQNVRERLQMLYGTEAQIKLSEKQG-KVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_2506  RTXTOXINA     29     0.026    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 28.8 bits (64), Expect = 0.026
Identities = 20/84 (23%), Positives = 39/84 (46%), Gaps = 11/84 (13%)

Query: 61 IIAENGALIYTDKKMMKRYPIQNTQALEIIEYLEENDLYYQLYTNKGVYVPDYGVESVRN 120
I ++G +I D +K+ ++ Q Y+ ND Y ++G + + N
Sbjct: 930 IFDKSGRIITPDS--LKKA-LEYQQRNNKASYVYGNDA--LAYGSQG------DLNPLIN 978

Query: 121 EIEYVKNSKENFNLKELETIAALY 144
EI + ++ +F++KE T A+L
Sbjct: 979 EISKIISAAGSFDVKEERTAASLL 1002


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_2512  TCRTETA       35     4e-04    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 35.2 bits (81), Expect = 4e-04
Identities = 45/302 (14%), Positives = 105/302 (34%), Gaps = 25/302 (8%)

Query: 70 FIFSFIGGTFADRWKPKKTMIWCETLSSISVFAVLITLMFGTWKIVFFVTLISAILSQFS 129
F + + G +DR+ + ++ +++ + T +V I I++ +
Sbjct: 57 FACAPVLGALSDRFGRRPVLLVSLAGAAVDYA------IMATAP-FLWVLYIGRIVAGIT 109

Query: 130 QPSG---MKLFKQHLSTEQIQLAMSLYQTIFAIFMVLGPIIGTF---IFHSFGIYISIII 183
+G ++ F MV GP++G + + +
Sbjct: 110 GATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAAL 169

Query: 184 TGIAFLLAAAVLLFLPKDLENDNEKKEITLLQEMLDGIKYVKKKKALTLLGLCFMAAGLG 243
G+ FL LL E ++E L ++ + + L F L
Sbjct: 170 NGLNFLT-GCFLLPESHKGERRPLRREAL---NPLASFRWARGMTVVAALMAVFFIMQLV 225

Query: 244 IGLIQPLGIFIVTEQLGLSKESLQWLLTVNGAGMIVGGALAM-VFAKNVAPQKMLIIGML 302
+ L + ++ ++ L G + A+ A + ++ L++GM+
Sbjct: 226 GQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMI 285

Query: 303 GQAIGIGIIGYSTNLWVTLTAQLF---SGLALPCIQIGINTLIIQNSDTDFIGRVNGILS 359
G ++ ++T W+ + G+ +P +Q ++ + D + G++ G L+
Sbjct: 286 ADGTGYILLAFATRGWMAFPIMVLLASGGIGMPALQ----AMLSRQVDEERQGQLQGSLA 341

Query: 360 PL 361
L
Sbjct: 342 AL 343



Score = 31.3 bits (71), Expect = 0.008
Identities = 32/182 (17%), Positives = 65/182 (35%), Gaps = 6/182 (3%)

Query: 224 VKKKKALTLLGLCFMAAGLGIGLIQPLGIFIVTEQL--GLSKESLQWLLTVNGAGMIVGG 281
+K + L ++ +GIGLI P+ ++ + + LL +
Sbjct: 1 MKPNRPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACA 60

Query: 282 ALAMVFAKNVAPQKMLIIGMLGQAIGIGIIGYSTNLWVTLTAQLFSGLALPCIQIGINTL 341
+ + + +L++ + G A+ I+ + LWV ++ +G+
Sbjct: 61 PVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATG-AVAGAY 119

Query: 342 IIQNSDTDFIGRVNGILSPLFTGSMVVTMSIAGSLKEMFSLSTMYEGTAL---LFIIGLL 398
I +D D R G +S F MV + G + + + AL F+ G
Sbjct: 120 IADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCF 179

Query: 399 FI 400
+
Sbjct: 180 LL 181



Score = 30.6 bits (69), Expect = 0.012
Identities = 15/113 (13%), Positives = 38/113 (33%), Gaps = 4/113 (3%)

Query: 60 MISVAEFAPIFIFSFIGGTFADRWKPKKTMIWCETLSSISVFAVLITLMFGTWKIVFFVT 119
++ + I G A R ++ ++ I L F T + F
Sbjct: 251 SLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTG----YILLAFATRGWMAFPI 306

Query: 120 LISAILSQFSQPSGMKLFKQHLSTEQIQLAMSLYQTIFAIFMVLGPIIGTFIF 172
++ P+ + + + E+ + ++ ++GP++ T I+
Sbjct: 307 MVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIY 359


25  BcerKBAB4_2579  BcerKBAB4_2585  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_2579  3       15       -2.689922  hypothetical protein
BcerKBAB4_2580  5       17       -3.062850  major facilitator transporter
BcerKBAB4_2581  4       17       -3.951524  AraC family transcriptional regulator
BcerKBAB4_2582  2       16       -2.956931  short chain dehydrogenase
BcerKBAB4_2583  4       18       -3.373156  hypothetical protein
BcerKBAB4_2584  4       20       -2.492142  hypothetical protein
BcerKBAB4_2585  3       17       -0.727676  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_2580  TCRTETA       48     4e-08    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 47.9 bits (114), Expect = 4e-08
Identities = 66/362 (18%), Positives = 128/362 (35%), Gaps = 22/362 (6%)

Query: 17 MFLIITTTGFARMAYGII---LPFMQEGLHLSTAQAGMLGTILFLGYLLTVGTS---GIL 70
+ +I++T + G+I LP + L S G +L L L+ + G L
Sbjct: 7 LIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGAL 66

Query: 71 TIRFGAKSVLLIGSWLVVISLIGLAFVSSFWIASICMLCAGAGSALVYTPLMSITVGWFP 130
+ RFG + VLL+ + +A W+ I + AG A I
Sbjct: 67 SDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADITDG 126

Query: 131 EKRGTAMGLLLSGAGIGMLFSGIIVPYIVRAFPEYSWRGSWLLFGVITCIVVFVASIVLK 190
++R G + + G GM + P + +S + + + +L
Sbjct: 127 DERARHFGFMSACFGFGM----VAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLP 182

Query: 191 NPEVTEDEGERNN-----KSFLWKTKELYIIAWMYFIVGVVYLIPNLYQTSFMI--NNGI 243
E R SF W + ++A + + ++ L+ + ++I +
Sbjct: 183 ESHKGERRPLRREALNPLASFRWARG-MTVVAALMAVFFIMQLVGQVPAALWVIFGEDRF 241

Query: 244 AASISGTVYSIAGMFSIVGAPVWGLISDRVGIKKTLCIALLLAVIGDMIPIIFGHTIG-- 301
+ S+A F I+ + +I+ V + AL+L +I D I
Sbjct: 242 HWDATTIGISLAA-FGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRG 300

Query: 302 -FIMSAIIWGSSLGGILLLIQVAASKQVSPKYVSMAISFISVFYAVGQMIGPGLAGWIIG 360
++ +S G + +Q S+QV + ++ ++ ++GP L I
Sbjct: 301 WMAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYA 360

Query: 361 ES 362
S
Sbjct: 361 AS 362


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_2582  DHBDHDRGNASE  100    5e-27    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 99.7 bits (248), Expect = 5e-27
Identities = 55/188 (29%), Positives = 85/188 (45%), Gaps = 3/188 (1%)

Query: 14 KIAIITGASSGFGLLTTLELAKKDYFVIATMRNLEKQIDLISQATKLDLQQNIKVQQLDV 73
KIA ITGA+ G G LA + + A N EK ++S + DV
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHA--EAFPADV 66

Query: 74 TDQGSIHNF-QLFLNEINRIDILINNAGYANGGFIEEIPVEDYRKQFETNLFGAISITQL 132
D +I E+ IDIL+N AG G I + E++ F N G + ++
Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRS 126

Query: 133 VLPYMRKQKSGKIINISSISGKVGFPGLSPYVSSKYALEGWSESLRLEVKPFGIDVALLE 192
V YM ++SG I+ + S V ++ Y SSK A +++ L LE+ + I ++
Sbjct: 127 VSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVS 186

Query: 193 PGSYNTNI 200
PGS T++
Sbjct: 187 PGSTETDM 194


26  BcerKBAB4_2599  BcerKBAB4_2609  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_2599  0       17       -3.106676  hypothetical protein
BcerKBAB4_2600  0       16       -2.559161  glycosyl transferase family protein
BcerKBAB4_2601  0       19       -2.963882  hypothetical protein
BcerKBAB4_2602  0       21       -4.241387  topology modulation protein
BcerKBAB4_2603  -1      23       -4.215067  aminoglycoside phosphotransferase
BcerKBAB4_2604  6       21       0.536782   hypothetical protein
BcerKBAB4_2605  6       20       0.236831   hypothetical protein
BcerKBAB4_2606  6       20       0.075619   N-acetyltransferase GCN5
BcerKBAB4_2607  7       18       0.158143   N-acetyltransferase GCN5
BcerKBAB4_2608  6       22       0.501400   hypothetical protein
BcerKBAB4_2609  4       20       0.262088   triple helix repeat-containing collagen
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_2606  SACTRNSFRASE  31     0.001    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 31.1 bits (70), Expect = 0.001
Identities = 15/46 (32%), Positives = 24/46 (52%)

Query: 43 IAIGVWEENELIGFARVVTDGVFRAYIEDVVVHESVRNKGIGEKML 88
A + EN IG ++ ++ A IED+ V + R KG+G +L
Sbjct: 66 AAFLYYLENNCIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALL 111


27  BcerKBAB4_2665  BcerKBAB4_2679  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_2665  0       14       -5.006083  histidine kinase
BcerKBAB4_2666  1       16       -3.967860  two component transcriptional regulator
BcerKBAB4_2667  0       16       -4.139698  group-specific protein
BcerKBAB4_2668  0       16       -3.698643  hypothetical protein
BcerKBAB4_2669  -1      17       -3.057380  N-acetyltransferase GCN5
BcerKBAB4_2670  0       15       -3.920000  hydrolase
BcerKBAB4_2671  2       16       -3.755967  bifunctional
BcerKBAB4_2672  0       14       -3.395948  peptidase M3A and M3B thimet/oligopeptidase F
BcerKBAB4_2673  0       15       -3.241643  hypothetical protein
BcerKBAB4_2674  1       16       -4.070054  N-acetyltransferase GCN5
BcerKBAB4_2675  1       17       -4.523895  XRE family transcriptional regulator
BcerKBAB4_2676  2       19       -3.425263  hypothetical protein
BcerKBAB4_2677  0       15       -2.281756  degV family protein
BcerKBAB4_2678  1       17       -3.155591  N-acetyltransferase GCN5
BcerKBAB4_2679  1       17       -3.187619  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_2665  PF06580       36     2e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 36.4 bits (84), Expect = 2e-04
Identities = 36/182 (19%), Positives = 71/182 (39%), Gaps = 38/182 (20%)

Query: 282 IIKQTDQISNLIEELLRFSKLERDILQKEEFPIEPLVQSI--IDKHKIELESKEL--KLQ 337
I++ + ++ L S+L R L+ L + +D + ++L S + +LQ
Sbjct: 186 ILEDPTKAREMLTSL---SELMRYSLRYSNARQVSLADELTVVDSY-LQLASIQFEDRLQ 241

Query: 338 VNYSVGDTIVYADLNKMRMVFQNLISNAIKYTTNQ-----NIKIILEEKNGIVYFQIQN- 391
+ I+ + M + Q L+ N IK+ Q I + + NG V +++N
Sbjct: 242 FENQINPAIMDVQVPPM--LVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENT 299

Query: 392 -GIAAEGIKEIDKIWEPFYVLESSRSKEKSGTGLGLAIVKSILE-RHGFEYGVSVEDGEI 449
+A + KE TG GL V+ L+ +G E + + + +
Sbjct: 300 GSLALKNTKE--------------------STGTGLQNVRERLQMLYGTEAQIKLSEKQG 339

Query: 450 QF 451
+
Sbjct: 340 KV 341


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_2666  HTHFIS        89     8e-23    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 89.1 bits (221), Expect = 8e-23
Identities = 27/118 (22%), Positives = 54/118 (45%), Gaps = 1/118 (0%)

Query: 2 KVLIADDEQDMLKILKAYFEKEGFEVLLAKDGEEALQIFYDEKIDLAILDWMMPKSSGIT 61
+L+ADD+ + +L + G++V + + + DL + D +MP +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 VCQEIKK-NSSVKVLMLTAKSESEDELAALQTGADEYVKKPFHPGVLITRAKKLVQHE 118
+ IKK + VL+++A++ + A + GA +Y+ KPF LI + +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_2669  SACTRNSFRASE  28     0.008    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 28.4 bits (63), Expect = 0.008
Identities = 24/91 (26%), Positives = 39/91 (42%), Gaps = 7/91 (7%)

Query: 38 TGECLYGIFREDTLIGIGGLNQDPYTKNNKIGRLRRFYIAKDYRRKGLGKLLLGRILSDA 97
G+ + + E+ IG + + N + +AKDYR+KG+G LL + + A
Sbjct: 63 EGKAAFLYYLENNCIGRIKIRSNW----NGYALIEDIAVAKDYRKKGVGTALLHKAIEWA 118

Query: 98 K-IYFTIVVLHTDTEQ--GDKFYTSSGFTKG 125
K +F ++L T FY F G
Sbjct: 119 KENHFCGLMLETQDINISACHFYAKHHFIIG 149


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_2672  STREPKINASE   30     0.020    Streptococcus streptokinase protein signature.
		>STREPKINASE#Streptococcus streptokinase protein signature.

Length = 440

Score = 30.4 bits (68), Expect = 0.020
Identities = 31/130 (23%), Positives = 52/130 (40%), Gaps = 15/130 (11%)

Query: 16 LYPQEQNFTFSIETIERLKIEYKATKDSVILSQLIQAIEKAEYYLYCRAAEDEKHPEN-- 73
+ P +Q FT+ ++ E+ Y+ K S + ++ +E Y + E P +
Sbjct: 260 ILPMDQEFTYRVKNREQ---AYRINKKSGLNEEINNTDLISEKYYVLKKGEKPYDPFDRS 316

Query: 74 -------TLLSVKVNQLKKEVQLLIESSKG---QSVNTNHSSIKLIENELKAWEDMYTQL 123
+ V N+L K QLL S + + + KL+ N L A+ M L
Sbjct: 317 HLKLFTIKYVDVDTNELLKSEQLLTASERNLDFRDLYDPRDKAKLLYNNLDAFGIMDYTL 376

Query: 124 RNKIEVIHDK 133
K+E HD
Sbjct: 377 TGKVEDNHDD 386


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_2674  SACTRNSFRASE  31     0.002    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 30.7 bits (69), Expect = 0.002
Identities = 21/108 (19%), Positives = 41/108 (37%), Gaps = 9/108 (8%)

Query: 31 PDFDSTPKETISETHLKEFNVRSFYSYIDGKLVSYAGVVRKTIHHNGQIFNIAGLSCVAT 90
P F + + ++++E +F Y++ + + R + I +IA
Sbjct: 45 PYFKQYEDDDMDVSYVEEEGKAAFLYYLENNCIGRIKI-RSNWNGYALIEDIA------V 97

Query: 91 DPDYHGQGLGLQTVAAATEWIEKQ--CNIDFGIFTCKPSLAHFYNRAG 136
DY +G+G + A EW ++ C + S HFY +
Sbjct: 98 AKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHH 145


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_2679  IGASERPTASE   31     0.010    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 31.2 bits (70), Expect = 0.010
Identities = 32/164 (19%), Positives = 62/164 (37%), Gaps = 28/164 (17%)

Query: 26 SREIREKKESKETESKKEDKVIGIEKESEKTGLTKENKAMGIEKESKETESKKEDKVIGI 85
E + T S+ + V K+ KT E A TE+ +++ +
Sbjct: 1020 VDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDA---------TETTAQNREVAK 1070

Query: 86 EKESEKTGLTKENKAMGIEKESKETESKKEDKVIGIEKESEKTGLTKENKAIGLEEEEVK 145
E +S T+ N+ E+KET++ + + A +EE+ K
Sbjct: 1071 EAKSNVKANTQTNEVAQSGSETKETQTTE-----------------TKETATVEKEEKAK 1113

Query: 146 IESNKESEETEPAKEN--KVIGLEKVEVANESEKKSEETESKRE 187
+E+ K E + + K E V+ E ++++ T + +E
Sbjct: 1114 VETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKE 1157


28  BcerKBAB4_2691  BcerKBAB4_2720  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_2691  2       12       -1.856294  putative lipoprotein
BcerKBAB4_2692  1       12       -2.519240  hypothetical protein
BcerKBAB4_2693  0       13       -2.152949  hypothetical protein
BcerKBAB4_2694  1       14       -1.969440  transglycosylase-associated protein
BcerKBAB4_2695  1       15       -2.534561  inosine/uridine-preferring nucleoside hydrolase
BcerKBAB4_2696  2       16       -2.308866  hypothetical protein
BcerKBAB4_2697  -2      16       -2.061878  methyltransferase type 11
BcerKBAB4_2698  -1      20       -1.710605  group-specific protein
BcerKBAB4_2699  -2      21       -3.018659  hypothetical protein
BcerKBAB4_2700  -3      22       -2.114452  hypothetical protein
BcerKBAB4_2701  -3      21       -2.089850  hypothetical protein
BcerKBAB4_2702  -3      21       -2.245182  major facilitator transporter
BcerKBAB4_2703  -2      19       -2.725054  hypothetical protein
BcerKBAB4_2704  -2      17       -2.330828  hypothetical protein
BcerKBAB4_2705  -2      16       -2.682452  aspartate aminotransferase
BcerKBAB4_2706  -1      17       -4.087230  Beta-hydroxyacyl-(acyl-carrier-protein)
BcerKBAB4_2707  0       18       -4.173918  pantothenate kinase
BcerKBAB4_2708  0       20       -4.835358  hypothetical protein
BcerKBAB4_2709  -1      19       -4.885827  CcdC protein
BcerKBAB4_2710  -1      19       -5.817363  hypothetical protein
BcerKBAB4_2711  1       17       -5.020353  ABC transporter
BcerKBAB4_2712  2       21       -4.912347  GntR family transcriptional regulator
BcerKBAB4_2713  3       23       -5.755392  hypothetical protein
BcerKBAB4_2714  9       37       -5.694317  hypothetical protein
BcerKBAB4_2715  10      36       -5.927417  hypothetical protein
BcerKBAB4_2716  2       24       -3.058341  hypothetical protein
BcerKBAB4_2717  1       16       -3.143760  hypothetical protein
BcerKBAB4_2718  2       16       -3.409494  hypothetical protein
BcerKBAB4_2719  1       15       -4.025927  hypothetical protein
BcerKBAB4_2720  1       15       -3.559721  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_2691  CHLAMIDIAOM6  28     0.040    Chlamydia cysteine-rich outer membrane protein 6 si...
		>CHLAMIDIAOM6#Chlamydia cysteine-rich outer membrane protein 6 signature.

Length = 547

Score = 27.7 bits (61), Expect = 0.040
Identities = 28/91 (30%), Positives = 41/91 (45%), Gaps = 14/91 (15%)

Query: 1 MMKVYMKVLIFFLTLTCVAALTAC----TNAEEKKQTNSTSENSTEKKSDTSEKSKK--- 53
M K+ + + F +T VA+L A T+ E TN S T+ K +TS KSKK
Sbjct: 1 MNKLIRRAVTIF-AVTSVASLFASGVLETSMAESLSTNVISLADTKAKDNTSHKSKKARK 59

Query: 54 ---NEETAPKKENEPVEKPKGQESVKPSTES 81
E +KE PV + ++ P +S
Sbjct: 60 NHSKETPVDRKEVAPVHE---SKATGPKQDS 87


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_2694  CHANLCOLICIN  34     4e-05    Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 33.5 bits (76), Expect = 4e-05
Identities = 13/49 (26%), Positives = 22/49 (44%), Gaps = 3/49 (6%)

Query: 6 IVGGILGWFASLITGKDVPGGVIG-NIIAGIVGSWLGTALLGKFGPVIG 53
V ++ SL+ G G+ G I+ GI+ S++ L V+G
Sbjct: 475 GVSYVVALLFSLLAG--TTLGIWGIAIVTGILCSYIDKNKLNTINEVLG 521


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_2702  TCRTETA       47     7e-08    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 47.1 bits (112), Expect = 7e-08
Identities = 66/342 (19%), Positives = 118/342 (34%), Gaps = 11/342 (3%)

Query: 1 MWRNKNVWIVLIGEFIAGLGLWLGILGNLEFMQKYVPSDFMKS---VILFIGLLAGVLVG 57
M N+ + ++L + +G+ L + ++ V S+ + + ++L + L
Sbjct: 1 MKPNRPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACA 60

Query: 58 PMAGRVIDQYEKKKVLLYAGFGRVISVIFMFFAIQYESIAFMIAFMVALQISAAFYFPAL 117
P+ G + D++ ++ VLL + G + M A + I +VA A
Sbjct: 61 PVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVL--YIGRIVAGITGATGAVAG- 117

Query: 118 QSVIPLIVREHELLQMNGVHMNVGTIARIAGTSLGGILLVVMSLQYMYAFSMAAYALLFL 177
+ I I E + G +AG LGG L+ S + + A L FL
Sbjct: 118 -AYIADITDGDERARHFGFMSACFGFGMVAGPVLGG-LMGGFSPHAPFFAAAALNGLNFL 175

Query: 178 STFFLQFEDKKSTTSSKESAKDNSFMEVFRILKGIPIAFTALILSIIPLLFIAGFNLMVI 237
+ FL E K N +A + I+ L+ L VI
Sbjct: 176 TGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVI 235

Query: 238 -NISEMQHDPTIKGFIYTIEGVAFMLG-AFVIKRLSDHFKPEKLLYFFAVCTAFAHLSLF 295
D T G G+ L A + ++ + L + ++ L
Sbjct: 236 FGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLA 295

Query: 296 FSDIKWMALTSFGLFGFSVGCFFPIMSTIFQTKVEKSYHGRL 337
F+ WMA L G P + + +V++ G+L
Sbjct: 296 FATRGWMAFPIMVLLAS-GGIGMPALQAMLSRQVDEERQGQL 336



Score = 28.6 bits (64), Expect = 0.044
Identities = 22/108 (20%), Positives = 43/108 (39%), Gaps = 4/108 (3%)

Query: 48 IGLLAGVLVGPMAGRVIDQYEKKKVLLYAGFGRVISVIFMFFAIQYESIAFMIAFMVALQ 107
G+L + + G V + +++ L+ I + FA + +M ++ L
Sbjct: 255 FGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRG----WMAFPIMVLL 310

Query: 108 ISAAFYFPALQSVIPLIVREHELLQMNGVHMNVGTIARIAGTSLGGIL 155
S PALQ+++ V E Q+ G + ++ I G L +
Sbjct: 311 ASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAI 358


29  BcerKBAB4_2830  BcerKBAB4_2845  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_2830  3       18       -0.375561  hypothetical protein
BcerKBAB4_2831  3       17       0.239418   NUDIX hydrolase
BcerKBAB4_2832  3       16       0.282745   WGR domain-containing protein
BcerKBAB4_2833  3       17       -0.291144  NAD-dependent epimerase/dehydratase
BcerKBAB4_2834  2       16       -0.261931  hypothetical protein
BcerKBAB4_2835  4       15       0.909983   beta-lactamase
BcerKBAB4_2836  2       14       0.991366   AraC family transcriptional regulator
BcerKBAB4_2837  1       12       1.059798   N-acetyltransferase GCN5
BcerKBAB4_2838  0       11       -0.388219  group-specific protein
BcerKBAB4_2840  0       12       -1.577918  hypothetical protein
BcerKBAB4_2841  1       12       -3.033248  transcriptional regulator
BcerKBAB4_2842  3       18       -5.971805  hypothetical protein
BcerKBAB4_2843  3       18       -5.795317  hypothetical protein
BcerKBAB4_2844  3       18       -5.365016  hypothetical protein
BcerKBAB4_2845  2       20       -4.248336  histidine kinase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_2832  ALARACEMASE   28     0.033    Alanine racemase signature.
		>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 28.2 bits (63), Expect = 0.033
Identities = 21/109 (19%), Positives = 42/109 (38%), Gaps = 13/109 (11%)

Query: 19 VVKDRDYVVFYGKIGTAGSVKAKECETEEECIKEANKLIASKRKKGYTDPIL-GEDYIKE 77
VVK Y +G ++ A + ++EA L R++G+ PIL E +
Sbjct: 33 VVKANAY--GHGIERIWSAIGATDG-FALLNLEEAITL----RERGWKGPILMLEGFFHA 85

Query: 78 KTITEEEFWELLARAKSKGEDQEEQIEWLTSHLAKRTVHEIVAFDTHMH 126
+ + + L S Q++ L + K + + ++ M+
Sbjct: 86 QDLEIYDQHRLTTCVHS-----NWQLKALQNARLKAPLDIYLKVNSGMN 129


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_2833  NUCEPIMERASE  31     0.005    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 31.3 bits (71), Expect = 0.005
Identities = 10/27 (37%), Positives = 14/27 (51%)

Query: 1 MKILILGGTRFLGRAFVEEALNRGHEV 27
MK L+ G F+G + L GH+V
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQV 27


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_2837  SACTRNSFRASE  38     6e-06    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 37.6 bits (87), Expect = 6e-06
Identities = 16/70 (22%), Positives = 29/70 (41%)

Query: 61 SQTHKDEAYVHFIGVNPKFRRKGIASTLYSYFFDSARANNRKVVKAITSSVNKKSIRFHQ 120
A + I V +R+KG+ + L + A+ N+ + T +N + F+
Sbjct: 83 RSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYA 142

Query: 121 EIGFKIEAGD 130
+ F I A D
Sbjct: 143 KHHFIIGAVD 152


30  BcerKBAB4_2900  BcerKBAB4_2910  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_2900  3       14       -1.532060  5'-nucleotidase domain-containing protein
BcerKBAB4_2901  5       15       -1.751508  Mg2 transporter protein CorA family protein
BcerKBAB4_2902  5       15       -1.422539  hypothetical protein
BcerKBAB4_2904  6       15       -1.828815  hypothetical protein
BcerKBAB4_2905  3       15       -3.084823  putative hydrolase
BcerKBAB4_2906  2       16       -3.175508  hypothetical protein
BcerKBAB4_2907  1       16       -3.596484  hypothetical protein
BcerKBAB4_2908  0       13       -3.143298  polysaccharide deacetylase
BcerKBAB4_2909  1       15       -3.302652  cell wall hydrolase/autolysin
BcerKBAB4_2910  2       16       -2.433341  histidine kinase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
BcerKBAB4_2910  PF06580  33     0.004    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 32.5 bits (74), Expect = 0.004
Identities = 22/110 (20%), Positives = 45/110 (40%), Gaps = 21/110 (19%)

Query: 469 QLHVHKQLSKIEVVANPHRIEQVVTNFITNAIRYTPEHEDIIISTIEENKRVKVCVENKG 528
+ ++ + ++V P ++ +V N I + I P+ I++ ++N V + VEN G
Sbjct: 243 ENQINPAIMDVQVP--PMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTG 300

Query: 529 AHIAPEHVEKIWDRFYRGDTSRQRSKGGTGLGLA-ISKNILELHGAEYGV 577
+ +K TG GL + + + L+G E +
Sbjct: 301 SLALKN------------------TKESTGTGLQNVRERLQMLYGTEAQI 332


31  BcerKBAB4_2959  BcerKBAB4_2978  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_2959  2       14       -2.906922  3-oxoacyl-ACP synthase
BcerKBAB4_2960  3       16       -4.004669  hypothetical protein
BcerKBAB4_2961  3       16       -3.875886  hypothetical protein
BcerKBAB4_2962  3       18       -4.137307  hypothetical protein
BcerKBAB4_2963  3       17       -3.783650  hypothetical protein
BcerKBAB4_2964  1       16       -4.622603  hypothetical protein
BcerKBAB4_2965  1       13       -3.750966  hypothetical protein
BcerKBAB4_2966  -1      11       -3.726618  abortive infection protein
BcerKBAB4_2967  0       11       -2.717472  MerR family transcriptional regulator
BcerKBAB4_2969  0       16       -1.018112  hypothetical protein
BcerKBAB4_2970  0       14       -0.595400  cell wall anchor domain-containing protein
BcerKBAB4_2971  1       11       -0.275535  ABC transporter
BcerKBAB4_2972  0       10       -0.586378  hypothetical protein
BcerKBAB4_2973  1       12       0.010797   putative transcriptional regulator
BcerKBAB4_2974  1       13       -0.087355  major facilitator transporter
BcerKBAB4_2975  2       14       -0.893771  hypothetical protein
BcerKBAB4_2976  3       13       -1.038403  histidine kinase
BcerKBAB4_2977  1       12       -0.002786  two component transcriptional regulator
BcerKBAB4_2978  2       12       -0.347474  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
BcerKBAB4_2970  cloacin  46     1e-07    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 46.2 bits (109), Expect = 1e-07
Identities = 49/198 (24%), Positives = 87/198 (43%), Gaps = 21/198 (10%)

Query: 18 AGANSAHAEVTDATPQQSTGDRLAEIKQHKQELDAK--LQQHKENVDQTLNELNQVKENV 75
+G N+ + V+D R E + +QE DA ++ + N ++ ELNQ E+V
Sbjct: 278 SGHNAVYVSVSDVLSPDQVKQRQDEENRRQQEWDATHPVEAAERNYERARAELNQANEDV 337

Query: 76 DTKVNELHERKQVADEKIGEIKQHKQELDAKLQQDKQIAEDKIAEIKEHKKQVDEKVAEI 135
++ +Q ++ K ELDA +K +A D IAEIK+ + + +A
Sbjct: 338 AR-----NQERQAKAVQV--YNSRKSELDAA---NKTLA-DAIAEIKQFNRFAHDPMAGG 386

Query: 136 KEHKQTVDEKVNEIKEHKQTVDEKVNEIKQHKENIDAKVNELKEVKKQVDDKLAELKKAK 195
Q K + + + K + DA ++ E +K+ +DK K
Sbjct: 387 HRMWQMAGLKAQRAQTDVNNKQAAFDAAAKEKSDADAALSSAMESRKKKEDK-------K 439

Query: 196 QTAEDKLAEIKENKPNTG 213
++AE+ L + ++NKP G
Sbjct: 440 RSAENNLND-EKNKPRKG 456


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
BcerKBAB4_2976  PF06580  38     4e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 38.3 bits (89), Expect = 4e-05
Identities = 16/102 (15%), Positives = 32/102 (31%), Gaps = 22/102 (21%)

Query: 359 NIFTNSIKFSNDGGTIEFVVEELESGIVISISDNGIGMEKEEMDRIFDRFYKVDTARARN 418
N + I GG I + + + + + G K
Sbjct: 266 NGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNT------------------ 307

Query: 419 IEGSGLGLSIVQKIVELHNGN---VSVYSTKGEGTTVRVELP 457
E +G GL V++ +++ G + + +G V +P
Sbjct: 308 KESTGTGLQNVRERLQMLYGTEAQIKLSEKQG-KVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
BcerKBAB4_2977  HTHFIS  78     4e-19    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 78.3 bits (193), Expect = 4e-19
Identities = 33/120 (27%), Positives = 60/120 (50%), Gaps = 1/120 (0%)

Query: 2 IHILLADDDKHIRELLHYHLQKEGFKVFEAEDGKVAQGVLEKENIHLAIVDIMMPFVDGY 61
IL+ADDD IR +L+ L + G+ V + + + L + D++MP + +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 TLCEEIRK-YHDIPVILLTAKDQLVDKEKGFISGTDDYIVKPFEPAEVIFRMKALLRRYQ 120
L I+K D+PV++++A++ + K G DY+ KPF+ E+I + L +
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123


32  BcerKBAB4_3013  BcerKBAB4_3034  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_3013  -2      21       3.360167   major facilitator transporter
BcerKBAB4_3014  -3      24       4.080278   6-aminohexanoate-dimer hydrolase
BcerKBAB4_3015  -3      25       4.477199   hypothetical protein
BcerKBAB4_3016  -3      24       4.201294   MerR family transcriptional regulator
BcerKBAB4_3017  -3      27       5.874264   beta-lactamase
BcerKBAB4_3018  0       20       2.528576   serine-type D-Ala-D-Ala carboxypeptidase
BcerKBAB4_3021  0       17       -0.333306  hypothetical protein
BcerKBAB4_3022  1       20       -1.731613  hypothetical protein
BcerKBAB4_3023  4       14       -1.672092  hypothetical protein
BcerKBAB4_3024  4       18       -1.569910  hypothetical protein
BcerKBAB4_3025  5       16       -1.698484  hypothetical protein
BcerKBAB4_3026  4       17       -2.805020  N-acetyltransferase GCN5
BcerKBAB4_3027  3       17       -0.804738  serine/threonine protein kinase
BcerKBAB4_3028  0       14       0.488590   peptidase M20
BcerKBAB4_3029  -2      16       0.625762   hypothetical protein
BcerKBAB4_3030  -1      15       0.454426   hypothetical protein
BcerKBAB4_3031  -1      14       0.394040   hypothetical protein
BcerKBAB4_3032  -2      13       1.726859   HemK family modification methylase
BcerKBAB4_3033  -1      14       -0.120307  short chain fatty acid transporter
BcerKBAB4_3034  -1      18       -3.341461  group-specific protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
BcerKBAB4_3013  TCRTETA  75     5e-17    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 75.2 bits (185), Expect = 5e-17
Identities = 68/358 (18%), Positives = 124/358 (34%), Gaps = 21/358 (5%)

Query: 16 FSSLFL-FLTFYMLMTTLPVYVIDSLKGK--PEEIGLVATVFLISSVLCRPFTGKWLDDL 72
S++ L + ++M LP + D + G++ ++ + C P G D
Sbjct: 11 LSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRF 70

Query: 73 GRKKILFISLSLFLAATVMYFGTQSLFLLLALRFLHGIGFGMATTATGTIVTDVAPAHRR 132
GR+ +L +SL+ + L++L R + GI G G + D+ R
Sbjct: 71 GRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGIT-GATGAVAGAYIADITDGDER 129

Query: 133 GEALAYYGVFMSLPMVIGPFLGLTIISHFSFTVLFIVCSVFSLLAFLLG-LLVNIPHEAP 191
+ MV GP LG ++ FS F + + L FL G L+ H+
Sbjct: 130 ARHFGFMSACFGFGMVAGPVLG-GLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKGE 188

Query: 192 VNKQKRE------KMKWKELIEPSSIPIALTGFVLAFSYSGILSFIPIYAKELGLSEIA- 244
+RE +W + + +A+ + ++
Sbjct: 189 RRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTI 248

Query: 245 ----SYFFILYALVVVISRPFTGKIFDRFGENVLIYPAIIIFTIGMFILSQAQTSFWFLG 300
+ F IL++L + TG + R GE + +I G +IL T W
Sbjct: 249 GISLAAFGILHSLAQAM---ITGPVAARLGERRALMLGMIADGTG-YILLAFATRGWMAF 304

Query: 301 AGMLIGLGYGTLIPSFQTIAISAAPNHRRGSATATYYSFFDSGIGFGSFILGIVAAKS 358
M++ G +P+ Q + R+G + + G + + A S
Sbjct: 305 PIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAAS 362


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits         Score  E-value  Comments
BcerKBAB4_3016  THERMOLYSIN  31     0.006    Thermolysin metalloprotease (M4) family signature.
		>THERMOLYSIN#Thermolysin metalloprotease (M4) family signature.

Length = 544

Score = 30.8 bits (69), Expect = 0.006
Identities = 11/63 (17%), Positives = 24/63 (38%), Gaps = 12/63 (19%)

Query: 141 SVKSFFDKFRSIFKQGEFFQEQFITMCPIKNFFNDDLGLSVY--------YPVLNDTEME 192
V + D+ ++ F+ G +E+ + D+LG +V + +
Sbjct: 54 LVYRYLDQEKNTFQLGGQARERL----SLIGNKLDELGHTVMRFEQAIAASLCMGAVLVA 109

Query: 193 HVD 195
HV+
Sbjct: 110 HVN 112


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits         Score  E-value  Comments
BcerKBAB4_3017  BLACTAMASEA  30     0.038    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 29.8 bits (67), Expect = 0.038
Identities = 11/41 (26%), Positives = 17/41 (41%), Gaps = 1/41 (2%)

Query: 92 NKDTLYGIGSVSKMYATAAVMKLVDEGKVDLDAPVVHYVPD 132
D + + S K+ AV+ VD G L+ +HY
Sbjct: 57 RADERFPMMSTFKVVLCGAVLARVDAGDEQLER-KIHYRQQ 96


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits         Score  E-value  Comments
BcerKBAB4_3018  BLACTAMASEA  31     0.010    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 30.5 bits (69), Expect = 0.010
Identities = 12/56 (21%), Positives = 18/56 (32%)

Query: 75 GGKTWSYAAGVADLSSKQPMKTDFRFRIGSVTKTFTATVVLQLVGENRLNLDDYIE 130
G+ +A + + D RF + S K VL V L+ I
Sbjct: 37 SGRVGMIEMDLASGRTLTAWRADERFPMMSTFKVVLCGAVLARVDAGDEQLERKIH 92


33  BcerKBAB4_3114  BcerKBAB4_3161  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_3114  2       15       -0.396458  PhzF family phenazine biosynthesis protein
BcerKBAB4_3115  1       14       -0.676381  short-chain dehydrogenase/reductase SDR
BcerKBAB4_3116  3       17       0.767400   N-acetyltransferase GCN5
BcerKBAB4_3117  2       18       0.224727   AMP-dependent synthetase and ligase
BcerKBAB4_3118  3       22       0.530510   NAD(P)H dehydrogenase (quinone)
BcerKBAB4_3119  1       23       1.252358   HxlR family transcriptional regulator
BcerKBAB4_3120  2       24       1.092070   ArsR family transcriptional regulator
BcerKBAB4_3121  3       25       1.354585   hypothetical protein
BcerKBAB4_3122  1       20       -0.462334  putative RNA polymerase sigma factor SigI
BcerKBAB4_3123  3       14       -1.689184  hypothetical protein
BcerKBAB4_3124  4       13       -3.082582  resolvase domain-containing protein
BcerKBAB4_3125  4       13       -3.600882  hypothetical protein
BcerKBAB4_3126  4       13       -3.476268  peptidase M15B and M15C DD-carboxypeptidase
BcerKBAB4_3127  4       14       -3.964255  hypothetical protein
BcerKBAB4_3128  3       14       -4.199302  cell wall anchor domain-containing protein
BcerKBAB4_3129  3       15       -5.660145  histidine kinase
BcerKBAB4_3130  3       16       -4.583938  hypothetical protein
BcerKBAB4_3131  3       14       -2.912420  NUDIX hydrolase
BcerKBAB4_3132  0       18       0.247814   putative transmembrane anti-sigma factor
BcerKBAB4_3133  0       28       3.035232   RNA polymerase sigma factor SigX
BcerKBAB4_3134  0       27       2.788951   abortive infection protein
BcerKBAB4_3135  -2      27       3.840155   hypothetical protein
BcerKBAB4_3136  -2      27       3.424343   hypothetical protein
BcerKBAB4_3137  -1      28       3.604867   histidine kinase
BcerKBAB4_3138  0       23       1.741175   two component transcriptional regulator
BcerKBAB4_3140  5       22       -0.747391  LysR family transcriptional regulator
BcerKBAB4_3141  2       22       0.145096   NAD-dependent epimerase/dehydratase
BcerKBAB4_3142  0       25       2.194202   hypothetical protein
BcerKBAB4_3143  -1      26       4.189325   hypothetical protein
BcerKBAB4_3144  -1      27       4.236980   transcription activator effector binding
BcerKBAB4_3145  -2      29       4.375112   helix-turn-helix type 11 domain-containing
BcerKBAB4_3146  -1      21       4.371256   aromatic amino acid permease
BcerKBAB4_3147  -2      22       4.750126   ABC-2 type transporter
BcerKBAB4_3148  -3      18       2.978460   ABC transporter
BcerKBAB4_3149  -2      13       0.586133   hypothetical protein
BcerKBAB4_3150  -3      14       0.216038   PadR-like family transcriptional regulator
BcerKBAB4_3151  -2      13       -0.787635  hydroxylamine reductase
BcerKBAB4_3152  -1      15       -1.522702  Beta-lactamase
BcerKBAB4_3153  0       17       -2.309875  hypothetical protein
BcerKBAB4_3154  2       17       -1.647878  hypothetical protein
BcerKBAB4_3155  3       17       -1.999048  hypothetical protein
BcerKBAB4_3156  5       16       -2.515224  group-specific protein
BcerKBAB4_3157  3       16       -2.524372  beta-lactamase
BcerKBAB4_3158  3       19       -3.367175  hypothetical protein
BcerKBAB4_3159  2       16       -3.133807  hypothetical protein
BcerKBAB4_3160  1       17       -4.617219  hypothetical protein
BcerKBAB4_3161  1       15       -3.411877  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_3115  DHBDHDRGNASE  66     5e-15    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 65.8 bits (160), Expect = 5e-15
Identities = 48/196 (24%), Positives = 81/196 (41%), Gaps = 17/196 (8%)

Query: 3 VLITGGNRGLGLQLVKVFHENGHIV----YRLVRTEVAVTQLKKM-FSSRCFPILADLSI 57
ITG +G+G + + G + Y + E V+ LK + FP AD+
Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFP--ADVRD 68

Query: 58 DESTEQIKNQLEEYTKYIDLVINNAGITGKETEISRTNSEELMELFNVHCLGVIRAVKGT 117
+ ++I ++E ID+++N AG+ + I + EE F+V+ GV A +
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVL-RPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127

Query: 118 YVALTKSNQPRIINVSSRLGSLHKMANKEFPQGKFSYSYRIAKAAQNMLTLCLQQEFENK 177
+ I+ V S + + + +Y +KAA M T CL E
Sbjct: 128 SKYMMDRRSGSIVTVGSNPAGVPRTSMA---------AYASSKAAAVMFTKCLGLELAEY 178

Query: 178 GIRVTAIHPGKLKTDI 193
IR + PG +TD+
Sbjct: 179 NIRCNIVSPGSTETDM 194


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
BcerKBAB4_3121  cloacin  35     8e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 34.7 bits (79), Expect = 8e-04
Identities = 21/81 (25%), Positives = 29/81 (35%), Gaps = 1/81 (1%)

Query: 293 GNNGRGSQGNNGNGQGNNGRGSQGNNGNQQGNNGHQQENTGRESQGNNGNGQGNNGRESQ 352
G +GRG + GN G G ++G + G +G+G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 353 GNNGNGQGNNGRGSQGNNGNQ 373
GN G GN+G GS
Sbjct: 63 GNGGGN-GNSGGGSGTGGNLS 82



Score = 34.3 bits (78), Expect = 0.001
Identities = 24/87 (27%), Positives = 30/87 (34%), Gaps = 9/87 (10%)

Query: 280 NGRGLQGNNGNGQGNNGRGSQGNNGNGQGNNGRGSQGNNGNQQGNNGHQQENTGRESQGN 339
+GRG + GN G G G ++G G N G +G G GN
Sbjct: 5 DGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGN 64

Query: 340 NGNGQGNNGRESQGNNGNGQGNNGRGS 366
G GN+G G G G S
Sbjct: 65 GGGN---------GNSGGGSGTGGNLS 82



Score = 33.9 bits (77), Expect = 0.001
Identities = 26/87 (29%), Positives = 36/87 (41%), Gaps = 9/87 (10%)

Query: 360 GNNGRGSQGNNGNQQGNNGRESQGNNGNGQGNNGRE-SQGNNGNGQGNNGRESQGNNGNQ 418
G +GRG + GN G G ++G S NN G G+ G+ +
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGS------GSGIHW 56

Query: 419 QGNNGRGSQGNNGNQQENNGRGSQGNN 445
G +G G+ G NGN G G+ GN
Sbjct: 57 GGGSGHGNGGGNGNS--GGGSGTGGNL 81



Score = 33.9 bits (77), Expect = 0.001
Identities = 20/82 (24%), Positives = 28/82 (34%), Gaps = 2/82 (2%)

Query: 308 GNNGRGSQGNNGNQQGNNGHQQENTGRESQGNNGNGQGNNGRESQGNNGNGQGNNGRGSQ 367
G +GRG + GN G ++G+G + G +G+G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 368 GNNGNQQGNNGRESQGNNGNGQ 389
GN G N G GN
Sbjct: 63 GNGGG--NGNSGGGSGTGGNLS 82



Score = 33.5 bits (76), Expect = 0.002
Identities = 19/80 (23%), Positives = 29/80 (36%)

Query: 338 GNNGNGQGNNGRESQGNNGNGQGNNGRGSQGNNGNQQGNNGRESQGNNGNGQGNNGRESQ 397
G +G G + GN G G G ++G+ + G +G+G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 398 GNNGNGQGNNGRESQGNNGN 417
GN G + G G N +
Sbjct: 63 GNGGGNGNSGGGSGTGGNLS 82



Score = 32.0 bits (72), Expect = 0.005
Identities = 20/82 (24%), Positives = 28/82 (34%), Gaps = 2/82 (2%)

Query: 353 GNNGNGQGNNGRGSQGNNGNQQGNNGRESQGNNGNGQGNNGRESQGNNGNGQGNNGRESQ 412
G +G G + GN G ++G+G + G +G+G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 413 GNNGNQQGNNGRGSQGNNGNQQ 434
GN G N G G GN
Sbjct: 63 GNGGG--NGNSGGGSGTGGNLS 82



Score = 29.3 bits (65), Expect = 0.032
Identities = 17/80 (21%), Positives = 27/80 (33%)

Query: 323 GNNGHQQENTGRESQGNNGNGQGNNGRESQGNNGNGQGNNGRGSQGNNGNQQGNNGRESQ 382
G +G + GN G G ++G+G + G +G+ G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 383 GNNGNGQGNNGRESQGNNGN 402
GN G + G G N +
Sbjct: 63 GNGGGNGNSGGGSGTGGNLS 82


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
BcerKBAB4_3124  HTHTETR  28     0.017    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 28.1 bits (62), Expect = 0.017
Identities = 10/36 (27%), Positives = 20/36 (55%)

Query: 156 ALDLLANRKENKFTVKKICEVTGVSRTVLYERAKEK 191
AL L + + + ++ +I + GV+R +Y K+K
Sbjct: 20 ALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDK 55


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
BcerKBAB4_3129  PF06580  44     5e-07    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 44.5 bits (105), Expect = 5e-07
Identities = 61/343 (17%), Positives = 115/343 (33%), Gaps = 69/343 (20%)

Query: 104 FIIFTIISVITIMFRYLSPTYFKEKKILLSILLILICTTSL-SICGIVTQINTG------ 156
I + V+T +R K + I+L ++ + + V +
Sbjct: 46 IAISLMGLVLTHAYRSFIKRQGWLKLNMGQIILRVLPACVVIGMVWFVANTSIWRLLAFI 105

Query: 157 GKIDSTLIEFLFNYIIINIFTVLLS---VYLIEGMIEKYKIKERMQ-RAEKFYIASELAA 212
L II N+ V +Y + YK E Q + ++L A
Sbjct: 106 NTKPVAFTLPLALSIIFNVVVVTFMWSLLYFGWHFFKNYKQAEIDQWKMASMAQEAQLMA 165

Query: 213 SIA----HEIHNPLTTVRGFTQLLNEDESAKMSQD--KYLEIMLLEMQQIQST------- 259
A H + N L +R L + ++ +M + + L Q +
Sbjct: 166 LKAQINPHFMFNALNNIRALI-LEDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTV 224

Query: 260 INNYLSLTKPQ--NIIKEELDINYILNQVKDNISPLALSYNVEIKQHITTDSLYINANTE 317
+++YL L Q + ++ E IN + V+ + P+ + VE
Sbjct: 225 VDSYLQLASIQFEDRLQFENQINPAIMDVQ--VPPMLVQTLVE----------------- 265

Query: 318 KLKICLTNIIQNGIEAMKNGGVLQINIQKIKGNIVIDIIDTGIGMSSQQIKRIALPFYST 377
N I++GI + GG + + K G + +++ +TG
Sbjct: 266 -------NGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLA------------LKN 306

Query: 378 TEKGTGLGTMIAYSIIKELNGD---IEIESELGKGTHFSITIP 417
T++ TG G ++ L G I++ + GK + IP
Sbjct: 307 TKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVN-AMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
BcerKBAB4_3137  PF06580  36     2e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 36.0 bits (83), Expect = 2e-04
Identities = 18/99 (18%), Positives = 39/99 (39%), Gaps = 21/99 (21%)

Query: 351 NILRNSIKFSEDAGVINVSIKQDIKNVTILISDTGIGIHLDDQKRIFDRFFKADRSHSRK 410
N +++ I G I + +D VT+ + +TG + L + K
Sbjct: 266 NGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTG-SLALKNTKE--------------- 309

Query: 411 YDGSGMGLAIVKQIVSLHQGD---IRVKSEPGQGTTFIV 446
+G GL V++ + + G I++ + G+ ++
Sbjct: 310 --STGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVL 346


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
BcerKBAB4_3138  HTHFIS  94     2e-24    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 94.1 bits (234), Expect = 2e-24
Identities = 42/120 (35%), Positives = 63/120 (52%)

Query: 2 PTILVADDDANIRELVCLFLRNDGFATAEAADGKEALAVYISTNVDLVVLDIMMPIMDGW 61
TILVADDDA IR ++ L G+ ++ + + DLVV D++MP + +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 TLSKELRRANPDLPLLMLTARGETWEKVKGFELGTDDYLTKPFDPLELTVRVRALLKRYK 121
L +++A PDLP+L+++A+ +K E G DYL KPFD EL + L K
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_3141  NUCEPIMERASE  71     3e-16    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 70.6 bits (173), Expect = 3e-16
Identities = 46/220 (20%), Positives = 83/220 (37%), Gaps = 42/220 (19%)

Query: 3 KILVTGAAGKIGQDVTRFLDKQGN-----------YQLRLADINLSALETFKDTNHEIIY 51
K LVTGAAG IG V++ L + G+ Y + L L L +
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLA---QPGFQFHK 58

Query: 52 LDVSDLAACQEF--TKGIDIVIHLAG---------NPSPNADFYGSLLENHIKGTYNIFR 100
+D++D + + + V NP + ++++ G NI
Sbjct: 59 IDLADREGMTDLFASGHFERVFISPHRLAVRYSLENP-------HAYADSNLTGFLNILE 111

Query: 101 AASDNNVSKVIVASSAQTIEGYPLDYQPHAYSPTRPKNMYGVSKCFAEAVASYFAYEEGL 160
N + ++ ASS+ S P ++Y +K E +A +++ GL
Sbjct: 112 GCRHNKIQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGL 171

Query: 161 QSLAIRIGAYDDYNPYGKPLTARDMSAYLSPEDFMDLLLK 200
+ +R + Y P+G+P DM+ + F +L+
Sbjct: 172 PATGLRF--FTVYGPWGRP----DMALFK----FTKAMLE 201


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits         Score  E-value  Comments
BcerKBAB4_3157  BLACTAMASEA  30     0.013    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 29.8 bits (67), Expect = 0.013
Identities = 12/58 (20%), Positives = 23/58 (39%), Gaps = 6/58 (10%)

Query: 39 LESGTTRTVTT---NSIFNSCSISKFITSMLVLTLSDQGIVHLDEDVN---DRLTSWN 90
++ + RT+T + F S K + VL D G L+ ++ L ++
Sbjct: 45 MDLASGRTLTAWRADERFPMMSTFKVVLCGAVLARVDAGDEQLERKIHYRQQDLVDYS 102


34  BcerKBAB4_3170  BcerKBAB4_3175  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_3170  -1      12       3.370555   Bcr/CflA subfamily drug resistance transporter
BcerKBAB4_3171  -1      16       4.543565   amidohydrolase 3
BcerKBAB4_3172  -1      19       5.318996   aminotransferase
BcerKBAB4_3173  -2      19       4.011889   methylmalonate-semialdehyde dehydrogenase
BcerKBAB4_3174  -1      18       3.370136   hypothetical protein
BcerKBAB4_3175  -1      17       3.304092   multi anti extrusion protein MatE
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
BcerKBAB4_3170  TCRTETB  78     7e-18    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 78.4 bits (193), Expect = 7e-18
Identities = 43/181 (23%), Positives = 82/181 (45%), Gaps = 1/181 (0%)

Query: 9 LWLMIILVAFPQISETIYTPSLPDISKALHVSNNEVQLTLSVYFAGFALGVFFIGWLSDI 68
L + IL F ++E + SLPDI+ + + + F++G G LSD
Sbjct: 16 LIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQ 75

Query: 69 IGRRPAMLFGIVVYGAGSFLCYIANS-IEFLLFSRFIQAFGASAGSVVTQTILRESVEGH 127
+G + +LFGI++ GS + ++ +S L+ +RFIQ GA+A + ++ +
Sbjct: 76 LGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKE 135

Query: 128 KRHVMFAQISAVIAFTPAIGPLIGGFLDQMFGFKIVFLSLVVMSVGILLYTFVSLPETKT 187
R F I +++A +GP IGG + + + L ++ + + + E +
Sbjct: 136 NRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVRI 195

Query: 188 D 188

Sbjct: 196 K 196


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
BcerKBAB4_3171  UREASE  40     2e-05    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 40.5 bits (95), Expect = 2e-05
Identities = 21/75 (28%), Positives = 34/75 (45%), Gaps = 8/75 (10%)

Query: 4 DVVLINGEVITVDQKNTVVEAVAIKNNHIVVVGSN------QEVKSFIGENTDVIDLQGK 57
D V+ N + +D V + +K+ I +G V +G T+VI +GK
Sbjct: 69 DTVITN--ALILDHWGIVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAGEGK 126

Query: 58 TILPGFIDSHLHIIS 72
+ G +DSH+H I
Sbjct: 127 IVTAGGMDSHIHFIC 141



Score = 36.3 bits (84), Expect = 3e-04
Identities = 22/57 (38%), Positives = 28/57 (49%)

Query: 459 EVGANQSISVMEAIKLYTWNGAYASFEEEIKGSIEAGKLADLVILNDSILSVNPNQI 515
E G N + V I YT N A A GS+E GK ADLV+ N + V P+ +
Sbjct: 393 ETGDNDNFRVKRYIAKYTINPAIAHGLSHEIGSLEVGKRADLVLWNPAFFGVKPDMV 449


35  BcerKBAB4_3203  BcerKBAB4_3215  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_3203  3       16       -2.382554  agmatine deiminase
BcerKBAB4_3204  3       17       -2.748027  amidohydrolase 3
BcerKBAB4_3205  2       19       -3.865162  MerR family transcriptional regulator
BcerKBAB4_3206  1       19       -3.813370  peptidyl-arginine deiminase
BcerKBAB4_3207  1       19       -2.780564  hypothetical protein
BcerKBAB4_3208  0       15       -2.163648  phosphate-starvation-inducible protein PsiE
BcerKBAB4_3209  -1      15       -1.528607  agmatine deiminase
BcerKBAB4_3210  0       17       -1.328233  beta-lactamase
BcerKBAB4_3211  -1      19       -1.194661  hypothetical protein
BcerKBAB4_3212  1       19       -1.709508  major facilitator transporter
BcerKBAB4_3213  2       20       -1.640725  LysR family transcriptional regulator
BcerKBAB4_3214  2       20       -3.193446  hypothetical protein
BcerKBAB4_3215  2       17       -1.957779  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
BcerKBAB4_3204  UREASE  44     1e-06    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 44.3 bits (105), Expect = 1e-06
Identities = 19/31 (61%), Positives = 22/31 (70%)

Query: 517 IDAYTIKPAKALGLDNVTGSIEVGKSADMVL 547
I YTI PA A GL + GS+EVGK AD+VL
Sbjct: 406 IAKYTINPAIAHGLSHEIGSLEVGKRADLVL 436


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
BcerKBAB4_3212  TCRTETA  36     2e-04    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 36.3 bits (84), Expect = 2e-04
Identities = 54/324 (16%), Positives = 104/324 (32%), Gaps = 10/324 (3%)

Query: 55 GLLISAVNIGPLFCMLFAGRLLDRYNERILIGVSAVLLGGALLLAHMARG-FIGLLFVLL 113
G+L++ + C G L DR+ R ++ VS L G A+ A MA F+ +L++
Sbjct: 46 GILLALYALMQFACAPVLGALSDRFGRRPVLLVS--LAGAAVDYAIMATAPFLWVLYIGR 103

Query: 114 LIGTFYSVSQPGGSKVILKWFPKEMRGLAMGIRQAGIPIGGALAGVTIPLLTLKFSLSYT 173
++ + I + R G A G +AG + L FS
Sbjct: 104 IVAGITGATGAVAGAYIADITDGDERARHFGFMSACFGF-GMVAGPVLGGLMGGFSPHAP 162

Query: 174 INVIATICIIGGLLFFIFYKESYVREHTSREDVRLSFWMQLKKVICKKELYPIFITGICM 233
A + + L ES+ E L+ + + + M
Sbjct: 163 FFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIM 222

Query: 234 SSLQMILVGHFIKFLVIEKSITPIVAGKVFSVMLVAGMVGRIILATISDTYYKGDRRTPL 293
+ + ++ F G + + + + ++ G+RR +
Sbjct: 223 QLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARL-GERRALM 281

Query: 294 LITVCISFCLILVVVMSINTMTLEALFVMSGLLGFFAIGWFSLFMVEVAESASEELVGVT 353
L + IL+ + M + +++ G ++ +V E +L G
Sbjct: 282 LGMIADGTGYILLAFATRGWMAFPIMVLLAS-GGIGMPALQAMLSRQVDEERQGQLQGSL 340

Query: 354 VSFALTLNQIAIIFAPALFGYIVD 377
+ + I P LF I
Sbjct: 341 AAL----TSLTSIVGPLLFTAIYA 360


36  BcerKBAB4_3237  BcerKBAB4_3257  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_3237  2       20       0.809687   hypothetical protein
BcerKBAB4_3238  0       15       0.200466   hypothetical protein
BcerKBAB4_3239  0       17       -0.610938  hypothetical protein
BcerKBAB4_3240  -2      14       -0.608618  hypothetical protein
BcerKBAB4_3241  -1      16       0.332505   exonuclease
BcerKBAB4_3242  -1      15       -0.402073  cold-shock DNA-binding domain-containing
BcerKBAB4_3243  -1      15       -0.417151  BNR repeat-containing protein
BcerKBAB4_3244  4       16       2.650090   flavodoxin
BcerKBAB4_3245  4       15       2.514549   GPR1/FUN34/YaaH family protein
BcerKBAB4_3246  3       15       2.248676   MutT/NUDIX family protein
BcerKBAB4_3247  2       15       1.766875   hypothetical protein
BcerKBAB4_3248  1       14       1.603894   chloramphenicol acetyltransferase
BcerKBAB4_3249  1       15       1.513087   hypothetical protein
BcerKBAB4_3250  -1      12       -0.756211  short chain dehydrogenase
BcerKBAB4_3251  1       15       -0.862216  N-acetyltransferase GCN5
BcerKBAB4_3252  0       10       0.819436   argininosuccinate lyase
BcerKBAB4_3253  0       10       1.549092   D-alanyl-D-alanine carboxypeptidase
BcerKBAB4_3254  1       12       2.516131   inosine/uridine-preferring nucleoside hydrolase
BcerKBAB4_3255  -1      10       3.276983   sodium/panthothenate symporter
BcerKBAB4_3256  -2      13       3.845576   hypothetical protein
BcerKBAB4_3257  -2      12       3.489533   aldehyde dehydrogenase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_3249  CHLAMIDIAOM6  47     1e-06    Chlamydia cysteine-rich outer membrane protein 6 si...
		>CHLAMIDIAOM6#Chlamydia cysteine-rich outer membrane protein 6 signature.

Length = 547

Score = 46.6 bits (110), Expect = 1e-06
Identities = 40/165 (24%), Positives = 66/165 (40%), Gaps = 38/165 (23%)

Query: 623 YTVTIENTGNVLATNVIFQDPTPIGTTFIPNSVTVDGVSQPGANPATGFTVANISPGGSR 682
Y + I N G A NV+ ++P P DG + FT+ ++ PG R
Sbjct: 229 YKINIVNQGTATARNVVVENPVP------------DGYAHSSGQRVLTFTLGDMQPGEHR 276

Query: 683 TVTFQV------RVTSTPSGGTIANRGNVAANFVVIPNQPPVTINRQTNTVVTQVNTGGL 736
T+T + R T+ + N A+ + N+P V QV+ G
Sbjct: 277 TITVEFCPLKRGRATNIATVSYCGGHKN-TASVTTVINEPCV-----------QVSIAGA 324

Query: 737 NVIKEVNTAQAAVGDTLTYTIAVQNTGNVPLTNVFFQDTISSAVS 781
+ + V + Y I+V N G++ L +V +DT+S V+
Sbjct: 325 D--------WSYVCKPVEYVISVSNPGDLVLRDVVVEDTLSPGVT 361



Score = 37.4 bits (86), Expect = 7e-04
Identities = 36/160 (22%), Positives = 66/160 (41%), Gaps = 28/160 (17%)

Query: 359 YTITVPNTGTGSAENVVLQDSIPNGTTFIAGSVTVGGVTQPSANPASGINLGTIPNNAQR 418
Y I + N GT +A NVV+++ +P+ G S LG + R
Sbjct: 229 YKINIVNQGTATARNVVVENPVPD------------GYAHSSGQRVLTFTLGDMQPGEHR 276

Query: 419 VVTFQVRVMSFPSPNPISNRAMVSYQFRPFVGSPPITSTASSNTVQTTVNRANVS-LQKS 477
+T + + +N A VSY + +T+ + VQ ++ A+ S + K
Sbjct: 277 TITVEFCPL---KRGRATNIATVSY-CGGHKNTASVTTVINEPCVQVSIAGADWSYVCKP 332

Query: 478 VDLQTATLNDILTYTVNVTNNGNVAANNVIFVDSIPAGTT 517
V+ Y ++V+N G++ +V+ D++ G T
Sbjct: 333 VE-----------YVISVSNPGDLVLRDVVVEDTLSPGVT 361



Score = 32.4 bits (73), Expect = 0.028
Identities = 63/339 (18%), Positives = 118/339 (34%), Gaps = 58/339 (17%)

Query: 481 QTATLNDILTYTVNVTNNGNVAANNVIFVDSIPAGTTFVTNSVTVNGVARPGANPASSIN 540
+ A L + Y +N+ N G A NV+ + +P +G A +
Sbjct: 219 ENACLRCPVVYKINIVNQGTATARNVVVENPVP------------DGYAHSSGQRVLTFT 266

Query: 541 LGSINASQTTVVRFQVRVTSNPLVNPIPNRASATFNFIPVPGQQPVSGQATSNTVFTTIN 600
LG + + + V P + N V G + +V T IN
Sbjct: 267 LGDMQPGEHRTI----------TVEFCPLKRGRATNIATV---SYCGGHKNTASVTTVIN 313

Query: 601 IADIRTRKTVDRAFATINDVLTYTVTIENTGNVLATNVIFQDPTPIGTTFIPNSVTVDGV 660
++ ++ + + Y +++ N G+++ +V+ +D G T +
Sbjct: 314 EPCVQV-SIAGADWSYVCKPVEYVISVSNPGDLVLRDVVVEDTLSPGVTVL--------- 363

Query: 661 SQPGANPATG---FTVANISPGGSRTVTFQVRVTSTPSGGTIANRGNVAANFVVIPNQPP 717
GA + +TV ++PG ++ ++V + + G N V + +
Sbjct: 364 EAAGAQISCNKVVWTVKELNPG--ESLQYKV-LVRAQTPGQFTNNVVVKSC----SDCGT 416

Query: 718 VTINRQTNTVVTQVNTGGLNVIKEVNTAQAAVGDTLTYTIAVQNTGNVPLTNVFFQDTIS 777
T + T V + V+ + VG+ Y I V N G+ TNV S
Sbjct: 417 CTSCAEATTYWKGVAATHMCVVDTCDP--VCVGENTVYRICVTNRGSAEDTNVSLMLKFS 474

Query: 778 SAVSFVA-----------NTVTINGVPQSGLNPNTGFSL 805
+ V+ NTV + +P+ G FS+
Sbjct: 475 KELQPVSFSGPTKGTITGNTVVFDSLPRLGSKETVEFSV 513



Score = 31.6 bits (71), Expect = 0.046
Identities = 32/161 (19%), Positives = 59/161 (36%), Gaps = 26/161 (16%)

Query: 885 LVYTIEVINAGSVPATNVFFQDSIPQGTLFIENSVFVNGVLQEGADPELGFPLNNLPTGA 944
+VY I ++N G+ A NV ++ +P +G L F L ++ G
Sbjct: 227 VVYKINIVNQGTATARNVVVENPVP------------DGYAHSSGQRVLTFTLGDMQPGE 274

Query: 945 SVIVTFEVLIDEIPQGNNVVNNANVTGDFLVNPTEPPITVTVPSNTVMTVVNSSGLNVMK 1004
+T E + + N+ + G + +V TV+N + V
Sbjct: 275 HRTITVEFCPLKRGRATNIATVSYCGGH-------------KNTASVTTVINEPCVQVSI 321

Query: 1005 SVSATEAGVGDTLTYTVRIQNSGTVAATNVSFLDPIPSGTT 1045
+ A + V + Y + + N G + +V D + G T
Sbjct: 322 A-GADWSYVCKPVEYVISVSNPGDLVLRDVVVEDTLSPGVT 361


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_3250  DHBDHDRGNASE  70     2e-16    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 70.1 bits (171), Expect = 2e-16
Identities = 56/241 (23%), Positives = 100/241 (41%), Gaps = 18/241 (7%)

Query: 2 RYVIVTGTSQGLGEAIATQLLEENTSIISISRRENKELAKLAEQYNSNCIFHS----IDL 57
+ +TG +QG+GEA+A L + I ++ N E + H+ D+
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDY--NPEKLEKVVSSLKAEARHAEAFPADV 66

Query: 58 QDVHNLETHFNEVFSSIQKDNVSSIHLINNAGTVAPMKPIEKSESEQFITNVHINLIAPM 117
+D ++ E+ + I+++ L+N AG V I E++ +N
Sbjct: 67 RDSAAID----EITARIEREMGPIDILVNVAG-VLRPGLIHSLSDEEWEATFSVNSTGVF 121

Query: 118 ILTSTFMKQTKDWKVDKRIINISSGAGKNPYFGWGAYCTTKAGVNMFTQCVATEEVEKEY 177
+ + K D + I+ + S P AY ++KA MFT+C+ E EY
Sbjct: 122 NASRSVSKYMMD-RRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELA--EY 178

Query: 178 PVKTVAFAPGVVDTNMQAQI--RDTNKEDFI--NLDRFTALKEEGKLLSPEYVAKAIRNL 233
++ +PG +T+MQ + + E I +L+ F KL P +A A+ L
Sbjct: 179 NIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFL 238

Query: 234 L 234
+
Sbjct: 239 V 239


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_3251  56KDTSANTIGN  29     0.019    Rickettsia 56kDa type-specific antigen protein sign...
		>56KDTSANTIGN#Rickettsia 56kDa type-specific antigen protein signature.

Length = 533

Score = 29.2 bits (65), Expect = 0.019
Identities = 22/73 (30%), Positives = 33/73 (45%), Gaps = 14/73 (19%)

Query: 206 NNITVNFVYTPKEARKKG---------YASSCVAALSQRMLDEGYKTTTLYTDLANPTSN 256
N I +NFV P+ +++G A VAA + R+L+ + LY DL
Sbjct: 328 NQIHLNFVMPPQAQQQQGQGQQQQAQATAQEAVAAAAVRLLNGSDQIAQLYKDLV----- 382

Query: 257 KIYQEIGYEKMME 269
K+ + G K ME
Sbjct: 383 KLQRHAGIRKAME 395


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits         Score  E-value  Comments
BcerKBAB4_3253  BLACTAMASEA  35     4e-04    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 34.8 bits (80), Expect = 4e-04
Identities = 21/163 (12%), Positives = 49/163 (30%), Gaps = 17/163 (10%)

Query: 13 IFVLLISGNFLVKKVWSSNNDDAQYIASFIE-EHKDEKNSALLIKRNDKVVYSVNPDVVL 71
I + +IS + + E + + + + + + D
Sbjct: 4 IRLCIIS-LLATLPLAVHASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRADERF 62

Query: 72 PVASTMKLIVALEYTKQVTEGKIDPSSFVSINDVNRYYVPGTDGGAQDRWQNYLQKKEKI 131
P+ ST K+++ +V G + Q +Y EK
Sbjct: 63 PMMSTFKVVLCGAVLARVDAGDEQLERKIHYR--------------QQDLVDYSPVSEKH 108

Query: 132 TEGAVSLEEVAKGMVKFSSNANTEYLMEVL-GLDNINRNLQSL 173
+++ E+ + S N+ L+ + G + L+ +
Sbjct: 109 LADGMTVGELCAAAITMSDNSAANLLLATVGGPAGLTAFLRQI 151


37  BcerKBAB4_3271  BcerKBAB4_3278  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_3271  1       14       -3.954445  extracellular solute-binding protein
BcerKBAB4_3272  1       16       -4.706551  extracellular solute-binding protein
BcerKBAB4_3273  2       19       -5.443515  cell envelope-related transcriptional
BcerKBAB4_3274  2       18       -5.974870  hypothetical protein
BcerKBAB4_3275  1       19       -6.617734  ECF subfamily RNA polymerase sigma-24 factor
BcerKBAB4_3277  2       19       -6.777854  hypothetical protein
BcerKBAB4_3278  -2      17       -4.043988  ABC transporter permease
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_3274  BCTERIALGSPD  28     0.043    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 28.3 bits (63), Expect = 0.043
Identities = 14/108 (12%), Positives = 36/108 (33%), Gaps = 9/108 (8%)

Query: 45 RVHMMFISAAVLTLSIVSMSEFSIQSFAQNGGMFNYVKKNAFSDYENEEEIKSNVNYPDK 104
R + + A + + IQ +N GM + N+ I Y
Sbjct: 343 RRPQVLVEAIIAEVQDADGLNLGIQWANKNAGMTQF--TNSGLPIST--AIAGANQYNKD 398

Query: 105 EKIHQKMLDSIDHFQNVSGQFEEYSSSTRIATTYKYAIETEKQSGISS 152
+ + ++ F ++ F + + + + A+ + ++ I +
Sbjct: 399 GTVSSSLASALSSFNGIAAGFYQGNWAMLLT-----ALSSSTKNDILA 441


38  BcerKBAB4_3379  BcerKBAB4_3414  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_3379  2       20       -1.788115  hypothetical protein
BcerKBAB4_3380  1       20       -1.833258  PadR-like family transcriptional regulator
BcerKBAB4_3381  1       21       -1.343653  ABC transporter
BcerKBAB4_3382  3       21       -0.524051  hypothetical protein
BcerKBAB4_3383  3       21       -0.551032  hypothetical protein
BcerKBAB4_3384  4       22       -0.767380  hypothetical protein
BcerKBAB4_3385  4       21       -1.510598  HPr kinase
BcerKBAB4_3386  5       21       -2.853717  asparagine synthase
BcerKBAB4_3387  7       24       -4.092680  DegT/DnrJ/EryC1/StrS aminotransferase
BcerKBAB4_3388  8       25       -5.154366  hypothetical protein
BcerKBAB4_3389  7       25       -5.804837  sugar transferase
BcerKBAB4_3390  6       23       -5.956339  hypothetical protein
BcerKBAB4_3391  7       26       -6.490634  polysaccharide biosynthesis protein
BcerKBAB4_3392  7       25       -5.399074  hypothetical protein
BcerKBAB4_3393  7       24       -4.854831  glycosyl transferase family protein
BcerKBAB4_3394  7       25       -4.813782  group 1 glycosyl transferase
BcerKBAB4_3395  7       26       -4.109120  group 1 glycosyl transferase
BcerKBAB4_3396  6       26       -3.552239  group 1 glycosyl transferase
BcerKBAB4_3397  4       24       -3.099761  group 1 glycosyl transferase
BcerKBAB4_3398  3       21       -2.905460  group 1 glycosyl transferase
BcerKBAB4_3399  2       21       -2.780564  capsular polysaccharide biosynthesis protein
BcerKBAB4_3400  2       20       -2.311521  glycosyl transferase family protein
BcerKBAB4_3401  1       20       -2.133145  group 1 glycosyl transferase
BcerKBAB4_3402  1       18       -2.534071  group 1 glycosyl transferase
BcerKBAB4_3403  0       16       -2.937902  polysaccharide biosynthesis protein CapD
BcerKBAB4_3404  1       22       -3.373932  exopolysaccharide tyrosine-protein kinase
BcerKBAB4_3405  3       25       -3.563423  lipopolysaccharide biosynthesis protein
BcerKBAB4_3406  5       19       -1.291446  hypothetical protein
BcerKBAB4_3407  4       18       -0.344123  hypothetical protein
BcerKBAB4_3409  7       26       3.093622   hypothetical protein
BcerKBAB4_3410  5       24       2.932090   hypothetical protein
BcerKBAB4_3411  4       21       2.921085   hypothetical protein
BcerKBAB4_3412  4       20       2.681395   N-acetylmuramoyl-L-alanine amidase
BcerKBAB4_3413  4       21       2.750661   toxin secretion/phage lysis holin
BcerKBAB4_3414  3       20       2.671079   phage minor structural protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_3403  NUCEPIMERASE  94     4e-23    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 94.1 bits (234), Expect = 4e-23
Identities = 69/355 (19%), Positives = 120/355 (33%), Gaps = 71/355 (20%)

Query: 278 ILVTGAGGSIGSEICRQIAKFRPKKLILLGHGE------NSIYSIEMELNRKYGDMKGTF 331
LVTGA G IG + K+L+ GH N Y + ++ R + F
Sbjct: 3 YLVTGAAGFIGFHVS--------KRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGF 54

Query: 332 IPEIADIQDEKKMQLIMSKHLPNVVYHAAAHKHVPLMENNPEEAVKNNLIGTMNVSEAAR 391
D+ D + M + + V+ + V NP +NL G +N+ E R
Sbjct: 55 QFHKIDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCR 114

Query: 392 AHGVGTFVMISS---------------DKAVNPTSVMGATKKLAEMVIQHKDKISHTKFV 436
+ + + SS D +P S+ ATKK E++ +
Sbjct: 115 HNKIQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPAT 174

Query: 437 TVRFGNVLGSRGS---VIPLFKKQIQNGGPVTV-THPDMVRYFMTI-------------- 478
+RF V G G + F K + G + V + M R F I
Sbjct: 175 GLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVI 234

Query: 479 ----PEASRLVVQAGALAKGGEIFVLDMGEPVKIVDLAKNLIRLSGNSVDEIGIEFTGIR 534
+ + A ++ + PV+++D + L G E ++
Sbjct: 235 PHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGI---EAKKNMLPLQ 291

Query: 535 PGEKLFE----ELLKEDEMNEKQIHPRIYVGQEMNVRIEE-IEEFISSYTDLNEV 584
PG+ L + L E +G +++ ++ F++ Y D +V
Sbjct: 292 PGDVLETSADTKALYEV------------IGFTPETTVKDGVKNFVNWYRDFYKV 334


39  BcerKBAB4_3423  BcerKBAB4_3463  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_3423  3       18       -1.196770  prophage pi2 protein 38
BcerKBAB4_3424  4       17       -0.356321  prophage pi2 protein 37
BcerKBAB4_3425  4       16       -0.324973  phage head-tail adaptor, putative
BcerKBAB4_3426  1       15       -0.116197  phage protein DNA packaging protein
BcerKBAB4_3427  2       13       0.059918   hypothetical protein
BcerKBAB4_3428  2       14       -0.037234  phage major capsid protein
BcerKBAB4_3429  4       16       -0.002786  HK97 family phage prohead protease
BcerKBAB4_3430  4       18       -1.158942  HK97 family phage portal protein
BcerKBAB4_3431  2       15       -2.611604  terminase
BcerKBAB4_3432  3       21       -4.299082  hypothetical protein
BcerKBAB4_3433  3       20       -5.155861  HNH endonuclease
BcerKBAB4_3434  2       18       -6.451037  hypothetical protein
BcerKBAB4_3435  3       20       -6.405799  hypothetical protein
BcerKBAB4_3436  5       20       -6.220486  hypothetical protein
BcerKBAB4_3437  7       22       -6.074854  hypothetical protein
BcerKBAB4_3438  6       22       -6.558564  hypothetical protein
BcerKBAB4_3439  7       22       -5.786289  hypothetical protein
BcerKBAB4_3440  7       23       -5.010715  hypothetical protein
BcerKBAB4_3441  7       23       -4.885312  MazG nucleotide pyrophosphohydrolase
BcerKBAB4_3442  7       23       -4.226980  hypothetical protein
BcerKBAB4_3443  6       23       -4.123597  SMC domain-containing protein
BcerKBAB4_3444  7       24       -0.970609  positive control sigma-like factor
BcerKBAB4_3445  6       21       0.079956   hypothetical protein
BcerKBAB4_3446  4       23       2.809285   ArpU family phage transcriptional regulator
BcerKBAB4_3447  4       19       2.410953   group-specific protein
BcerKBAB4_3448  4       20       2.117530   hypothetical protein
BcerKBAB4_3449  4       19       1.938962   thymidylate synthase, flavin-dependent
BcerKBAB4_3450  5       19       0.767823   C-5 cytosine-specific DNA methylase
BcerKBAB4_3451  5       18       -0.865210  dUTPase
BcerKBAB4_3452  4       16       -0.322530  group-specific protein
BcerKBAB4_3453  6       18       -0.021078  hypothetical protein
BcerKBAB4_3454  5       18       -0.464902  ATPase AAA
BcerKBAB4_3455  3       19       -0.902629  hypothetical protein
BcerKBAB4_3456  2       19       -1.123105  group-specific protein
BcerKBAB4_3457  2       20       -1.072346  hypothetical protein
BcerKBAB4_3458  2       22       -1.596593  Gp157 family protein
BcerKBAB4_3459  2       23       -2.678627  hypothetical protein
BcerKBAB4_3460  4       22       -2.169374  group-specific protein
BcerKBAB4_3461  3       20       -2.512467  group-specific protein
BcerKBAB4_3462  3       21       -2.379494  group-specific protein
BcerKBAB4_3463  3       20       -3.164810  Rha family regulatory protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
BcerKBAB4_3430  PF05043  32     0.003    Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 32.2 bits (73), Expect = 0.003
Identities = 22/123 (17%), Positives = 43/123 (34%), Gaps = 19/123 (15%)

Query: 196 AIKNSAVVKWILKFKSVLKQEDIDS------QVKNFVNNYLNISNDGGAAASSDPRYDLE 249
I+N + W L + L ++++ + Q N + N+ NI SD + +L
Sbjct: 308 EIENKDNLIWHLHNTAHLYRQELFTEFILFDQKGNTIRNFQNIF----PKFVSDVKKELS 363

Query: 250 QVKPEAFVPDSKQMQETIQRIYNFFNTNENIIQSKYNEDEWNAYYESEIEVFAMQLAGEY 309
V S M Y F ++++ + +++V M +Y
Sbjct: 364 HYLETLEVCSSSMM--VNHLSYTFITHTKHLVINLLQNQP-------KLKVLVMSNFDQY 414

Query: 310 TRK 312
K
Sbjct: 415 HAK 417


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
BcerKBAB4_3437  UREASE  27     0.006    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 27.4 bits (61), Expect = 0.006
Identities = 12/29 (41%), Positives = 18/29 (62%), Gaps = 2/29 (6%)

Query: 16 DEVLTTPEVMDVLGISKARISKMIKDGKL 44
D V+T ++D GI KA I +KDG++
Sbjct: 69 DTVITNALILDHWGIVKADIG--LKDGRI 95


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits         Score  E-value  Comments
BcerKBAB4_3444  HELNAPAPROT  27     0.027    Helicobacter neutrophil-activating protein A family ...
		>HELNAPAPROT#Helicobacter neutrophil-activating protein A family signature.

Length = 153

Score = 27.1 bits (60), Expect = 0.027
Identities = 8/47 (17%), Positives = 17/47 (36%), Gaps = 3/47 (6%)

Query: 1 MQDLIKQYNTTLKQLRETQMDAKQEDVKILTDIISDITYSLE---WM 44
+Q L+ Y + + A++ D+ + +E WM
Sbjct: 101 VQALVNDYKQISSESKFVIGLAEENQDNATADLFVGLIEEVEKQVWM 147


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_3456  FLGMOTORFLIG  25     0.047    Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 25.2 bits (55), Expect = 0.047
Identities = 7/29 (24%), Positives = 15/29 (51%)

Query: 23 LRDKRLSELLKRCRRLENEGFDYLFPIRK 51
+++K + KR + E ++L P R+
Sbjct: 281 VQEKIFKNMSKRAASMLKEDMEFLGPTRR 309


40  BcerKBAB4_3472  BcerKBAB4_3477  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_3472  3       29       6.051219   GTP1/OBG protein
BcerKBAB4_3473  5       33       6.705809   hypothetical protein
BcerKBAB4_3474  5       30       6.287821   hypothetical protein
BcerKBAB4_3475  5       30       5.901682   sporulation stage V protein K
BcerKBAB4_3476  6       32       6.310345   integrase domain-containing protein
BcerKBAB4_3477  6       27       6.486556   triple helix repeat-containing collagen
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
BcerKBAB4_3475  HTHFIS  33     0.002    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 32.9 bits (75), Expect = 0.002
Identities = 26/83 (31%), Positives = 34/83 (40%), Gaps = 16/83 (19%)

Query: 90 LHMLFRGNPGTGKTTVARMIGKLLFEMNILSKGHLVEAERA----DLVG-EYIGH----- 139
L ++ G GTGK VAR + + G V A DL+ E GH
Sbjct: 161 LTLMITGESGTGKELVARAL----HDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAF 216

Query: 140 -TAQKTRD-LIKKAMGGILFIDE 160
AQ ++A GG LF+DE
Sbjct: 217 TGAQTRSTGRFEQAEGGTLFLDE 239


41  BcerKBAB4_3512  BcerKBAB4_3520  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_3512  2       12       -2.750561  peptidase T
BcerKBAB4_3513  4       15       -3.337279  hypothetical protein
BcerKBAB4_3514  4       15       -3.595600  phosphoglycerate mutase
BcerKBAB4_3515  1       13       -3.438149  glyoxalase/bleomycin resistance
BcerKBAB4_3516  0       13       -3.700296  PAS/PAC sensor-containing diguanylate
BcerKBAB4_3517  1       16       -3.336462  undecaprenyldiphospho-muramoylpentapeptide
BcerKBAB4_3518  -1      13       -3.634656  hypothetical protein
BcerKBAB4_3519  -1      13       -3.305498  NAD-dependent epimerase/dehydratase
BcerKBAB4_3520  0       14       -3.228325  polysaccharide biosynthesis protein CapD
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_3519  NUCEPIMERASE  73     8e-17    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 72.9 bits (179), Expect = 8e-17
Identities = 59/285 (20%), Positives = 94/285 (32%), Gaps = 54/285 (18%)

Query: 8 LITGANGFTGRHACQYFLEQGFHVI----------PMF-QNRSHREKIENG--ITCDLTN 54
L+TGA GF G H + LE G V+ Q R DL +
Sbjct: 4 LVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDLAD 63

Query: 55 KSEVMKVIKQIKPDYVLHLAGRNSVNESWTASLEYIEINVIGTLYLLEAIKQEASHCKTL 114
+ + + + V R +V S Y + N+ G L +LE + + + L
Sbjct: 64 REGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRH--NKIQHL 121

Query: 115 VIGSA-----------LQADSMKNIKVSNPYSLSKTMQVIIAEAWGGLMDSNIIIAKPSN 163
+ S+ D + VS Y+ +K ++A + L +
Sbjct: 122 LYASSSSVYGLNRKMPFSTDDSVDHPVS-LYAATKKANELMAHTYSHLYGLPATGLRFFT 180

Query: 164 LIGP-GVSNGVCSILAKKMIDIESGRSKAIIEVNSLKDSRDFLDVRDAVKA----YHVLL 218
+ GP G + K M++ K+I N K RDF + D +A V+
Sbjct: 181 VYGPWGRPDMALFKFTKAMLE-----GKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIP 235

Query: 219 RDGIN--------------GKQYNIGSGVKRSLLD---VLEQYKG 246
+ YNIG+ L+D LE G
Sbjct: 236 HADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALG 280


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_3520  NUCEPIMERASE  80     5e-19    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 79.8 bits (197), Expect = 5e-19
Identities = 47/223 (21%), Positives = 85/223 (38%), Gaps = 25/223 (11%)

Query: 6 ILITGGTGSWGHELIKQLLEKSPKEIRVFSRNE--TVQFEM-QQQFINEERLKFIIGDIR 62
L+TG G G + K+LLE + + + + N+ V + + + + + +F D+
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDLA 62

Query: 63 DKEQL--VYACQGVHYVFHLAALKHVPVCEYYPYEAIKTNIHGTQNVIEASIQNQVEKVI 120
D+E + ++A VF V P+ +N+ G N++E N+++ ++
Sbjct: 63 DREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQHLL 122

Query: 121 YVST---------------DKAADPSNTYGMTKAIGEKLMVHANVQTKKTKFICVRGGNV 165
Y S+ D P + Y TK E LM H +R V
Sbjct: 123 YASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANE-LMAHTYSHLYGLPATGLRFFTV 181

Query: 166 LGTSGS---VVPLFKKQIKQSSKVGI-TDANMTRFFLTIEDAV 204
G G + F K + + + + M R F I+D
Sbjct: 182 YGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIA 224


42  BcerKBAB4_3592  BcerKBAB4_3617  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_3592  0       20       -3.374356  terminase
BcerKBAB4_3593  3       23       -3.942315  hypothetical protein
BcerKBAB4_3594  3       23       -3.226088  HNH endonuclease
BcerKBAB4_3595  0       21       -3.048148  phage protein
BcerKBAB4_3596  0       23       -2.622586  hypothetical protein
BcerKBAB4_3597  0       25       -1.483025  hypothetical protein
BcerKBAB4_3598  0       26       0.879346   cold-shock DNA-binding domain-containing
BcerKBAB4_3599  5       35       6.257124   integrase family protein
BcerKBAB4_3600  7       36       7.754624   ArpU family phage transcriptional regulator
BcerKBAB4_3601  9       40       8.265380   hypothetical protein
BcerKBAB4_3602  9       39       7.709632   hypothetical protein
BcerKBAB4_3603  10      37       7.074509   spore germination protein PF
BcerKBAB4_3604  8       39       6.953137   triple helix repeat-containing collagen
BcerKBAB4_3605  1       25       -0.878165  hypothetical protein
BcerKBAB4_3606  0       20       0.369713   hypothetical protein
BcerKBAB4_3607  0       19       0.552770   hypothetical protein
BcerKBAB4_3608  0       17       0.827999   phage protein
BcerKBAB4_3609  1       19       0.795194   hypothetical protein
BcerKBAB4_3610  1       18       0.810907   hypothetical protein
BcerKBAB4_3611  2       17       0.999790   replicative DNA helicase
BcerKBAB4_3612  3       16       0.224901   primosome subunit DnaD
BcerKBAB4_3613  5       19       -0.150316  hypothetical protein
BcerKBAB4_3614  4       18       -1.265412  hypothetical protein
BcerKBAB4_3615  1       16       -2.355727  hypothetical protein
BcerKBAB4_3616  2       20       -3.317657  hypothetical protein
BcerKBAB4_3617  2       21       -4.545974  XRE family transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_3592  BCTERIALGSPD  30     0.034    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 29.9 bits (67), Expect = 0.034
Identities = 21/100 (21%), Positives = 38/100 (38%), Gaps = 11/100 (11%)

Query: 357 FAACGLLFR--QNGEYIFKTHSFVRKEFVDIYYGYSKKAGEYKKQKFAPIKDWEEQGLLT 414
LLFR E+ +EF++ SK + I D +G +T
Sbjct: 15 LIFAALLFRPAAAEEFSASFKGTDIQEFIN---TVSKNLNK------TVIIDPSVRGTIT 65

Query: 415 VVDEPTINPQHIVDWFVEMREQYGVKKIIADNFRMEAIRP 454
V +N + +F+ + + YG I +N ++ +R
Sbjct: 66 VRSYDMLNEEQYYQFFLSVLDVYGFAVINMNNGVLKVVRS 105


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_3599  ARGDEIMINASE  30     0.004    Bacterial arginine deiminase signature.
		>ARGDEIMINASE#Bacterial arginine deiminase signature.

Length = 409

Score = 30.2 bits (68), Expect = 0.004
Identities = 7/39 (17%), Positives = 16/39 (41%), Gaps = 3/39 (7%)

Query: 6 PIRNPEQIQQIKEYLKEKSNRNYILFVMGINTGLRISDI 44
I+ I +K+Y + N I ++ G+ ++
Sbjct: 99 EIKTDFTINLLKDYFSSLTIDNMISKMIS---GVVTEEL 134


43  BcerKBAB4_3629  BcerKBAB4_3634  Y  N  N  Genomic Island
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_3629  3       20       1.272673   polynucleotide phosphorylase/polyadenylase
BcerKBAB4_3630  3       17       0.647187   30S ribosomal protein S15
BcerKBAB4_3631  3       16       0.841740   bifunctional riboflavin kinase/FMN
BcerKBAB4_3632  4       21       1.062853   tRNA pseudouridine synthase B
BcerKBAB4_3633  5       23       1.835171   ribosome-binding factor A
BcerKBAB4_3634  5       21       2.095323   hypothetical protein
44  BcerKBAB4_3644  BcerKBAB4_3650  Y  N  N  Genomic Island
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_3644  2       19       1.849635   1-deoxy-D-xylulose 5-phosphate reductoisomerase
BcerKBAB4_3645  2       26       1.611328   phosphatidate cytidylyltransferase
BcerKBAB4_3646  5       30       2.368488   undecaprenyl pyrophosphate synthase
BcerKBAB4_3647  5       28       2.533733   ribosome recycling factor
BcerKBAB4_3648  4       26       3.007026   uridylate kinase
BcerKBAB4_3649  4       21       2.592685   elongation factor Ts
BcerKBAB4_3650  2       14       2.247866   30S ribosomal protein S2
45  BcerKBAB4_3662  BcerKBAB4_3671  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_3662  3       22       2.806450   signal peptidase I
BcerKBAB4_3663  2       20       3.427733   50S ribosomal protein L19
BcerKBAB4_3664  1       17       2.692508   tRNA (guanine-N(1)-)-methyltransferase
BcerKBAB4_3665  1       16       2.124585   16S rRNA-processing protein RimM
BcerKBAB4_3666  4       14       1.979223   RNA binding protein
BcerKBAB4_3667  4       14       1.989219   30S ribosomal protein S16
BcerKBAB4_3668  4       14       1.959561   signal recognition particle protein
BcerKBAB4_3669  4       15       1.276354   putative DNA-binding protein
BcerKBAB4_3670  2       14       1.722069   signal recognition particle-docking protein
BcerKBAB4_3671  2       13       1.941774   chromosome segregation protein SMC
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits        Score  E-value  Comments
BcerKBAB4_3668  FLGHOOKAP1  30     0.017    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 30.3 bits (68), Expect = 0.017
Identities = 10/39 (25%), Positives = 18/39 (46%)

Query: 399 KKRIAKGSGTTVQEINRLIKQFDDMKKMMKTMTGMQKGK 437
K++ G +V +IN KQ + + +TG+ G
Sbjct: 154 DKQVNIAIGASVDQINNYAKQIASLNDQISRLTGVGAGA 192


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits        Score  E-value  Comments
BcerKBAB4_3671  GPOSANCHOR  48     2e-07    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 47.8 bits (113), Expect = 2e-07
Identities = 38/274 (13%), Positives = 82/274 (29%)

Query: 667 KQAKSSLLGRQRELEEWTKKLTDMEEKTTKLENFVKAVKQEIQEKEVQIRELRQGVETER 726
++ ++ ++ LE T K LE A+ + E + +
Sbjct: 116 QELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADS 175

Query: 727 VDEQKLREEINRLELDEHRINDRLSIYDLEIEGFLQDQVKMQGRKEELEKILATLQAEIG 786
+ L E LE + + L ++ K L A L+ +
Sbjct: 176 AKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALE 235

Query: 787 ELDSKIVVLTKQKSEQYSSKEKVQKEMTELKVQAAEQQQRLSNQKEKVERLTKEKEETDA 846
+ + + + K ++ EL+ + K++ L EK +A
Sbjct: 236 GAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEA 295

Query: 847 TLVKTKEDLAFLKQEMTSNSSGEEQITNMIEKKAYDRNQTSELIRSRREQRVSLQERVEH 906
+ L S + ++ + + E + R SL+ ++
Sbjct: 296 EKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDA 355

Query: 907 LERGVKETIGKHKYILEMLKDQEVKINRLDVELE 940
K+ +H+ + E K E L +L+
Sbjct: 356 SREAKKQLEAEHQKLEEQNKISEASRQSLRRDLD 389



Score = 47.4 bits (112), Expect = 3e-07
Identities = 36/291 (12%), Positives = 86/291 (29%), Gaps = 11/291 (3%)

Query: 232 HEIEELHEKWEALRNQFGHNKDEEAKMSANLQKSEEELEELRGQLQAVDESVNSLQEVLL 291
++ + L+ + + + + EEL + +L+ D+S++ +
Sbjct: 57 ERADKFEIENNTLKLKNSDLSFNNKALKDHNDELTEELSNAKEKLRKNDKSLSEKASKIQ 116

Query: 292 LSSKELEKLEGQRELLKERKQNATTHCAQLEKLIVELTEKTTSYDGEIETSTEALMQFVN 351
LE E + LE L + + +E +
Sbjct: 117 ELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSA 176

Query: 352 QVKELEKKLHDNEQLLATFAENLEEQIENLKGDYIELLNQQASLRNELSMIEEQSKQQNS 411
++K LE + LE + L+ +N + ++ +E + +
Sbjct: 177 KIKTLEAEK-----------AALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAA 225

Query: 412 KNERLDEENEKYVQMRVEITAKKAKLVDSYEQAREKIAGIISNIQKTETALGKCKSQYSE 471
+ L++ E + +AK L + A + ++ ++
Sbjct: 226 RKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKT 285

Query: 472 NETKLYQAYQFVQQARSRKEMLEEMQEDYSGFYQGVREVLKARENRLQGIE 522
E + + ++L ++ RE K E Q +E
Sbjct: 286 LEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLE 336



Score = 39.7 bits (92), Expect = 7e-05
Identities = 28/161 (17%), Positives = 55/161 (34%), Gaps = 1/161 (0%)

Query: 153 KSEERRGVFEEAAGVLKYKLRKKKAEGKLAETQEN-LNRVQDIIHELSSQVEPLERQASI 211
E + E L+ L + L + + + +E A
Sbjct: 180 TLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMN 239

Query: 212 AKDYLEKKEELEKVEAALIVHEIEELHEKWEALRNQFGHNKDEEAKMSANLQKSEEELEE 271
K + + E A + EL + E N + + + A E E +
Sbjct: 240 FSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKAD 299

Query: 272 LRGQLQAVDESVNSLQEVLLLSSKELEKLEGQRELLKERKQ 312
L Q Q ++ + SL+ L S + ++LE + + L+E+ +
Sbjct: 300 LEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNK 340



Score = 35.4 bits (81), Expect = 0.001
Identities = 43/330 (13%), Positives = 99/330 (30%), Gaps = 34/330 (10%)

Query: 150 LSSKSEERRGVFEEAAGVLKYKLRKKKAEGKLAETQENLNRVQDIIHELSSQVEPLERQA 209
+SK +E + L+ + + + L + + + +E A
Sbjct: 111 KASKIQELEARKADLEKALEGAMNFST---ADSAKIKTLEAEKAALAARKADLEKALEGA 167

Query: 210 SIAKDYLEKKEELEKVEAALIVHEIEELHEKWEALRNQFGHNKDEEAKMSANLQKSEEEL 269
K + + E A + EL + E N + + + A
Sbjct: 168 MNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARK 227

Query: 270 EELRGQLQAVDESVNSLQEVLLLSSKELEKLEGQRELLKERKQNATTHCAQLEKLIVELT 329
+L L+ + + LE L
Sbjct: 228 ADLEKALEGAMNFSTADSAKI----------------------------KTLEAEKAALE 259

Query: 330 EKTTSYDGEIETSTEALMQFVNQVKELEKKLHDNEQLLA---TFAENLEEQIENLKGDYI 386
+ + +E + ++K LE + E A ++ L ++L+ D
Sbjct: 260 ARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLD 319

Query: 387 ELLNQQASLRNELSMIEEQSKQQNSKNERLDEENEKYVQMRVEITAKKAKLVDSYEQARE 446
+ L E +EEQ+K + + L + + + + ++ A+ KL + + +
Sbjct: 320 ASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEA 379

Query: 447 KIAGIISNIQKTETALGKCKSQYSENETKL 476
+ ++ + A + + E +KL
Sbjct: 380 SRQSLRRDLDASREAKKQVEKALEEANSKL 409



Score = 32.3 bits (73), Expect = 0.014
Identities = 33/183 (18%), Positives = 74/183 (40%), Gaps = 7/183 (3%)

Query: 666 VKQAKSSLLGRQRELEEWTKKLTDMEEKTTKLENFVKAVKQEIQEKEVQIRELRQGVETE 725
+ + +L G + K+ +E + LE ++ + Q + LR+ ++
Sbjct: 262 QAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDAS 321

Query: 726 RVDEQKLREEINRLELDEHRINDRLSIYDLEIEGFLQDQVKMQGRKEELEKILATLQAEI 785
R +++L E +LE ++ I + + +D + K++LE L+ +
Sbjct: 322 REAKKQLEAEHQKLE-------EQNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQN 374

Query: 786 GELDSKIVVLTKQKSEQYSSKEKVQKEMTELKVQAAEQQQRLSNQKEKVERLTKEKEETD 845
++ L + +K++V+K + E + A ++ +E + KEK E
Sbjct: 375 KISEASRQSLRRDLDASREAKKQVEKALEEANSKLAALEKLNKELEESKKLTEKEKAELQ 434

Query: 846 ATL 848
A L
Sbjct: 435 AKL 437


46  BcerKBAB4_3700  BcerKBAB4_3726  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_3700  3       17       -3.392480  hypothetical protein
BcerKBAB4_3701  4       19       -3.627125  hypothetical protein
BcerKBAB4_3702  1       15       -2.102926  hypothetical protein
BcerKBAB4_3703  0       12       -0.548680  hypothetical protein
BcerKBAB4_3704  -2      10       1.433131   hypothetical protein
BcerKBAB4_3705  -2      15       2.323356   hypothetical protein
BcerKBAB4_3708  0       23       3.280042   hypothetical protein
BcerKBAB4_3709  -1      25       3.211032   orotate phosphoribosyltransferase
BcerKBAB4_3710  0       25       3.370601   orotidine 5'-phosphate decarboxylase
BcerKBAB4_3711  1       24       3.380574   dihydroorotate dehydrogenase 1B
BcerKBAB4_3712  1       26       3.178468   dihydroorotate dehydrogenase electron transfer
BcerKBAB4_3713  1       25       2.784966   carbamoyl phosphate synthase large subunit
BcerKBAB4_3714  2       20       2.728121   carbamoyl phosphate synthase small subunit
BcerKBAB4_3715  2       19       2.435863   dihydroorotase
BcerKBAB4_3716  2       17       1.377674   aspartate carbamoyltransferase catalytic
BcerKBAB4_3717  3       20       1.457532   uracil-xanthine permease
BcerKBAB4_3718  2       19       0.592668   bifunctional pyrimidine regulatory protein
BcerKBAB4_3719  1       18       0.634665   RluA family pseudouridine synthase
BcerKBAB4_3720  1       17       0.458541   lipoprotein signal peptidase
BcerKBAB4_3721  2       17       0.979718   TraR/DksA family transcriptional regulator
BcerKBAB4_3722  1       16       1.108631   isoleucyl-tRNA synthetase
BcerKBAB4_3723  3       11       0.223728   DivIVA family protein
BcerKBAB4_3724  1       13       0.525221   RNA-binding S4 domain-containing protein
BcerKBAB4_3725  2       18       0.268578   hypothetical protein
BcerKBAB4_3726  2       17       -0.511393  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_3703  ACRIFLAVINRP  27     0.023    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 27.1 bits (60), Expect = 0.023
Identities = 8/30 (26%), Positives = 13/30 (43%)

Query: 22 FFILRKDTTIICIVFIFLLALTATSSLPLA 51
FFI R + + + + A LP+A
Sbjct: 4 FFIRRPIFAWVLAIILMMAGALAILQLPVA 33


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
BcerKBAB4_3715  UREASE  33     0.003    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 32.8 bits (75), Expect = 0.003
Identities = 25/83 (30%), Positives = 36/83 (43%), Gaps = 20/83 (24%)

Query: 17 IVATDLLVQDGKIAKV--AEN---------ITADNAEVIDVNGKLIAPGLVDVHVHLREP 65
IV D+ ++DG+IA + A N I EVI GK++ G +D H+H P
Sbjct: 83 IVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAGEGKIVTAGGMDSHIHFICP 142

Query: 66 GGEHKETIETGTLAAAKGGFTTI 88
+ IE A G T +
Sbjct: 143 -----QQIEE----ALMSGLTCM 156


47  BcerKBAB4_3778  BcerKBAB4_3797  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_3778  4       19       0.684631   hypothetical protein
BcerKBAB4_3779  3       17       1.361777   hypothetical protein
BcerKBAB4_3780  2       16       1.319855   hypothetical protein
BcerKBAB4_3781  2       17       1.436916   hypothetical protein
BcerKBAB4_3782  2       17       1.190740   GTP-binding protein TypA
BcerKBAB4_3783  0       9        0.675401   hypothetical protein
BcerKBAB4_3784  -1      10       0.828778   inositol-phosphate phosphatase
BcerKBAB4_3785  -1      13       0.211606   hypothetical protein
BcerKBAB4_3786  -1      14       0.319550   hypothetical protein
BcerKBAB4_3787  0       16       0.566023   hypothetical protein
BcerKBAB4_3788  0       18       0.864757   Orn/Lys/Arg decarboxylase major region
BcerKBAB4_3789  1       23       -0.422073  transglutaminase
BcerKBAB4_3790  4       28       -2.780564  hypothetical protein
BcerKBAB4_3791  1       24       0.966057   hypothetical protein
BcerKBAB4_3792  3       32       1.939436   hypothetical protein
BcerKBAB4_3793  4       37       2.481450   hypothetical protein
BcerKBAB4_3794  4       42       2.941105   hypothetical protein
BcerKBAB4_3795  4       45       3.634895   hypothetical protein
BcerKBAB4_3796  3       44       4.107841   dihydrolipoamide dehydrogenase
BcerKBAB4_3797  0       34       3.080016   branched-chain alpha-keto acid dehydrogenase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits       Score  E-value  Comments
BcerKBAB4_3782  TCRTETOQM  180    4e-51    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 180 bits (458), Expect = 4e-51
Identities = 100/476 (21%), Positives = 196/476 (41%), Gaps = 96/476 (20%)

Query: 8 LRNIAIIAHVDHGKTTLVDQLLRQAGTFRANEHIEE--RAMDSNDLERERGITILAKNTA 65
+ NI ++AHVD GKTTL + LL +G +++ D+ LER+RGITI T+
Sbjct: 3 IINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITS 62

Query: 66 IHYEDKRINILDTPGHADFGGEVERIMKMVDGVLLVVDAYEGCMPQTRFVLKKALEQNLT 125
+E+ ++NI+DTPGH DF EV R + ++DG +L++ A +G QTR + + +
Sbjct: 63 FQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGIP 122

Query: 126 PIVVVNKIDRDFARPDEVVDEVVDLF---------IELG-------------------AN 157
I +NKID++ V ++ + +EL N
Sbjct: 123 TIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIEGN 182

Query: 158 EDQLE--------------------------FPVVFASAMNGTASLDSNPANQEENMKSL 191
+D LE FPV SA N + +L
Sbjct: 183 DDLLEKYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNN------------IGIDNL 230

Query: 192 FDTIIEHIPAPIDNSEEPLQFQVALLDYNDYVGRIGVGRIFRGTMKVGQQVALMKVDGSV 251
+ I + + L +V ++Y++ R+ R++ G + + V + + +
Sbjct: 231 IEVITNKFYSSTHRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKE--- 287

Query: 252 KQFRVTKLFGYIGLKRQEIEEAKAGDLVAVSGMEDINVGETVCPVEHEEALPLLRIDEPT 311
+ ++T+++ I + +I++A +G++V + E + + + + + P
Sbjct: 288 -KIKITEMYTSINGELCKIDKAYSGEIVILQN-EFLKLNSVLGDTKLLPQRERIENPLPL 345

Query: 312 LQMTFLVNNSPFAGREGKFITSRKIEER------LRSQLETDVSLRVDNTDSPDAWIVSG 365
LQ T + K ++R L ++D LR + I+S
Sbjct: 346 LQTT---------------VEPSKPQQREMLLDALLEISDSDPLLRYYVDSATHEIILSF 390

Query: 366 RGELHLSILIENMRRE-GYELQVSKPEVIIKDVDGVRSEPVERVQIDVPEEYTGSI 420
G++ + + ++ + E+++ +P VI + ++E +++ P + SI
Sbjct: 391 LGKVQMEVTCALLQEKYHVEIEIKEPTVIYMERPLKKAEYTIHIEVP-PNPFWASI 445



Score = 39.8 bits (93), Expect = 3e-05
Identities = 17/77 (22%), Positives = 28/77 (36%), Gaps = 1/77 (1%)

Query: 403 EPVERVQIDVPEEYTGSIMESMGARKGEMLDMVNNGNGQVRLTFMVPARGLIGYTTEFLT 462
EP +I P+EY ++D N +V L+ +PAR + Y ++
Sbjct: 537 EPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQLKNN-EVILSGEIPARCIQEYRSDLTF 595

Query: 463 LTRGYGILNHTFDCYQP 479
T G + Y
Sbjct: 596 FTNGRSVCLTELKGYHV 612


48  BcerKBAB4_3812  BcerKBAB4_3823  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_3812  3       21       -2.837253  hypothetical protein
BcerKBAB4_3813  2       21       -4.702854  hypothetical protein
BcerKBAB4_3814  1       16       -2.942376  hypothetical protein
BcerKBAB4_3815  2       16       -2.511095  YruB family glutaredoxin-like protein
BcerKBAB4_3816  2       15       -1.460165  hypothetical protein
BcerKBAB4_3817  0       12       -1.467489  hypothetical protein
BcerKBAB4_3818  -1      12       -2.103296  diguanylate phosphodiesterase
BcerKBAB4_3819  -2      11       -1.321920  short chain dehydrogenase
BcerKBAB4_3820  -1      12       -2.407746  metallophosphoesterase
BcerKBAB4_3821  -1      16       -2.938542  hypothetical protein
BcerKBAB4_3822  -1      16       -3.851867  polyphosphate kinase
BcerKBAB4_3823  0       15       -4.591626  Ppx/GppA phosphatase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_3818  FbpA_PF05833  34     0.001    Fibronectin-binding protein
		>FbpA_PF05833#Fibronectin-binding protein

Length = 577

Score = 34.1 bits (78), Expect = 0.001
Identities = 18/151 (11%), Positives = 43/151 (28%), Gaps = 1/151 (0%)

Query: 133 EQFNHLLMYYRTYGIQISINKVGTGTSN-LERISVLAPDILKVDLTNLRQTALLQSYQDI 191
+ + +K+ TG S L +DL+ +++ +D+
Sbjct: 179 DMIENFTKENSLQLNDNIFSKIFTGVSKTLSSEICFRLKNNSIDLSLSNLKEIVEVCKDL 238

Query: 192 LYSLSLLARRIGATLLYEEIDAFYQLQYAWKNGGRYYQGNYLKECLPDFIETNVLKERLG 251
+ FY L K + Q + + L +F +RL
Sbjct: 239 FKEIQSNKFEFNCYTKNNSFVGFYCLNLMSKEDYKKIQYDSSSKLLENFYYAKDKSDRLK 298

Query: 252 NECHQFILHEKKKLQKIYNLTEMLRDRIGDV 282
++ + + ++L + +
Sbjct: 299 SKSSDLQKIVMNNINRCTKKDKILNNTLKKC 329


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_3819  DHBDHDRGNASE  99     4e-27    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 99.4 bits (247), Expect = 4e-27
Identities = 68/257 (26%), Positives = 122/257 (47%), Gaps = 11/257 (4%)

Query: 11 VKEKVVIITGGSSGMGKGMAIRFAKEGARVVITGRTKEKLEEAKLEIEQFPGQVLSVQMD 70
++ K+ ITG + G+G+ +A A +GA + EKLE+ ++ + D
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 71 VRNTDDIQKMIEHIDEKFGRIDILINNAAGNFICPAEDLSVNGWNSVINIVLNGTFYCSQ 130
VR++ I ++ I+ + G IDIL+N A LS W + ++ G F S+
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 131 AIGKYWIEKGIKGNIINMVATYAWDAGPGVIHSAAAKAGVLAMTKTLAVEWGRKYGIRVN 190
++ KY +++ G+I+ + + A + A++KA + TK L +E +Y IR N
Sbjct: 126 SVSKYMMDRR-SGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELA-EYNIRCN 183

Query: 191 AIAPGPIERTGGADKLWISEEMAKRTLQ--------SVPLGRMGTPEEIAGLAYYLCSDE 242
++PG T LW E A++ ++ +PL ++ P +IA +L S +
Sbjct: 184 IVSPGST-ETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242

Query: 243 AAYINGTCMTMDGGQHL 259
A +I + +DGG L
Sbjct: 243 AGHITMHNLCVDGGATL 259


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_3821  PERTACTIN     25     0.041    Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 25.1 bits (54), Expect = 0.041
Identities = 13/38 (34%), Positives = 15/38 (39%)

Query: 38 GLLGGALAFGPRPFYPPYPPPFPPPAPFPCYGGPCQQP 75
L+G P+P P P P P P P P Q P
Sbjct: 561 SLVGAKAPPAPKPAPQPGPQPGPQPPQPPQPPQPPQPP 598


49  BcerKBAB4_3895  BcerKBAB4_3919  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_3895  2       20       2.546947   stage V sporulation protein AE, C-terminal
BcerKBAB4_3896  2       20       1.904303   sporulation stage V protein AE
BcerKBAB4_3897  1       15       1.974445   stage V sporulation protein AD
BcerKBAB4_3898  -3      15       1.466541   sporulation stage V protein AC
BcerKBAB4_3899  -2      14       1.279431   stage V sporulation protein AB
BcerKBAB4_3900  -1      15       1.560399   stage V sporulation protein AA
BcerKBAB4_3901  -1      15       1.612024   hypothetical protein
BcerKBAB4_3902  -1      15       1.335453   sodium:neurotransmitter symporter
BcerKBAB4_3903  2       15       0.013699   sporulation sigma factor SigF
BcerKBAB4_3904  -2      14       -1.578529  anti-sigma F factor
BcerKBAB4_3905  -2      11       -2.457723  anti-sigma-factor antagonist
BcerKBAB4_3906  -3      10       -2.683495  serine-type D-Ala-D-Ala carboxypeptidase
BcerKBAB4_3907  -3      13       -3.347072  GntR family transcriptional regulator
BcerKBAB4_3908  -2      11       -2.866309  ABC transporter
BcerKBAB4_3909  1       13       -0.986753  acetoin transport permease
BcerKBAB4_3910  1       12       -0.332239  acetoin transport permease
BcerKBAB4_3911  2       11       1.398083   hypothetical protein
BcerKBAB4_3912  1       15       1.679658   MarR family transcriptional regulator
BcerKBAB4_3913  1       16       1.862007   hypothetical protein
BcerKBAB4_3914  0       17       1.678745   xanthine/uracil/vitamin C permease
BcerKBAB4_3915  -1      18       0.960933   magnesium and cobalt transport protein CorA
BcerKBAB4_3916  0       20       1.386103   pyrimidine-nucleoside phosphorylase
BcerKBAB4_3917  1       19       0.977265   purine nucleoside phosphorylase
BcerKBAB4_3918  -1      15       0.095917   phosphopentomutase
BcerKBAB4_3919  2       16       -0.126454  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_3904  PF06580       36     2e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 36.0 bits (83), Expect = 2e-05
Identities = 17/82 (20%), Positives = 32/82 (39%), Gaps = 13/82 (15%)

Query: 29 QLDPTMEELTEIKTVVSEAVTNAIIHGYEGNAE-GVVYISVILEEAMVKLTIRD------ 81
Q++P + ++ +V V N I HG + G + + + V L + +
Sbjct: 245 QINPAIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLAL 304

Query: 82 ------EGIGIFNLDEARQPLF 97
G G+ N+ E Q L+
Sbjct: 305 KNTKESTGTGLQNVRERLQMLY 326


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_3906  BLACTAMASEA   46     1e-07    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 45.9 bits (109), Expect = 1e-07
Identities = 30/145 (20%), Positives = 59/145 (40%), Gaps = 14/145 (9%)

Query: 7 ILVCFM-LLLSGTSVSFAQSEKTKQEKTEETTPKLAEQASSA----IVIEQDTGKVLFDK 61
I +C + LL + A + +Q K L+E S I ++ +G+ L
Sbjct: 4 IRLCIISLLATLPLAVHASPQPLEQIK-------LSESQLSGRVGMIEMDLASGRTLTAW 56

Query: 62 NPNEKLPPASMTKIMTMLLIMEQVEKGKLKLNDKVRASEHAASMGGSQIFLE-PGEEMTV 120
+E+ P S K++ ++ +V+ G +L K+ + + S + + + MTV
Sbjct: 57 RADERFPMMSTFKVVLCGAVLARVDAGDEQLERKIHYRQ-QDLVDYSPVSEKHLADGMTV 115

Query: 121 NEMLKGIAIASGNDASVAVAEHIAG 145
E+ S N A+ + + G
Sbjct: 116 GELCAAAITMSDNSAANLLLATVGG 140


50  BcerKBAB4_3971  BcerKBAB4_3977  Y  N  N  Genomic Island
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_3971  -1      15       -3.531435  L-serine dehydratase, iron-sulfur-dependent
BcerKBAB4_3973  3       20       -6.116443  ribonuclease Z
BcerKBAB4_3974  7       22       -8.427493  hypothetical protein
BcerKBAB4_3975  8       21       -8.745745  hypothetical protein
BcerKBAB4_3976  1       14       -5.747827  hypothetical protein
BcerKBAB4_3977  1       14       -5.113185  hypothetical protein
51  BcerKBAB4_4001  BcerKBAB4_4015  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_4001  2       27       3.150172   branched-chain alpha-keto acid dehydrogenase
BcerKBAB4_4002  4       24       3.115337   transketolase central region
BcerKBAB4_4003  0       14       2.362820   3-methyl-2-oxobutanoate dehydrogenase
BcerKBAB4_4004  -1      11       2.223826   dihydrolipoamide dehydrogenase
BcerKBAB4_4005  -2      9        2.137202   butyrate kinase
BcerKBAB4_4006  -1      10       2.041967   Glu/Leu/Phe/Val dehydrogenase
BcerKBAB4_4007  0       13       1.673923   phosphate butyryltransferase
BcerKBAB4_4008  0       14       0.525427   PAS modulated sigma54 specific transcriptional
BcerKBAB4_4009  1       17       -1.161635  hypothetical protein
BcerKBAB4_4010  2       18       -2.252752  glycerophosphodiester phosphodiesterase
BcerKBAB4_4011  5       19       -3.832233  hypothetical protein
BcerKBAB4_4012  7       19       -4.570540  hypothetical protein
BcerKBAB4_4014  5       21       -3.757249  group-specific protein
BcerKBAB4_4015  5       20       -3.675418  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4005  ACETATEKNASE  229    4e-74    Acetate kinase family signature.
		>ACETATEKNASE#Acetate kinase family signature.

Length = 400

Score = 229 bits (585), Expect = 4e-74
Identities = 85/344 (24%), Positives = 148/344 (43%), Gaps = 36/344 (10%)

Query: 5 RILVINPGSTSTKIGVFDNE------RPVLEE--------TIRHDVEQIGKYKRIIDQYE 50
+ILVIN GS+S K + +++ + + E T + E+I K + D +
Sbjct: 2 KILVINCGSSSLKYQLIESKDGNVLAKGLAERIGINDSLLTHNANGEKIKIKKDMKDHKD 61

Query: 51 FRKETILEVLHSHGINISKLNAVCGRGGLLRPIEGGTYTVNDAMLED--LKNGFSG---- 104
K + +++S I ++ + G R + GG Y + ++ D LK
Sbjct: 62 AIKLVLDALVNSDYGVIKDMSEIDAVGH--RVVHGGEYFTSSVLITDDVLKAITDCIELA 119

Query: 105 --HHASNLGGILAY-EIASGLNIPAFIVDPVVVDEMEPIARISGI------AGMERKSIF 155
H+ +N+ GI A +I + + A + D M A + I RK F
Sbjct: 120 PLHNPANIEGIKACTQIMPDVPMVA-VFDTAFHQTMPDYAYLYPIPYEYYTKYKIRKYGF 178

Query: 156 HALNQKAVARKVADELNHKYEDLNLLVTHMGGGITVGAHKKGKVIDVNNGLNG-EGPFSP 214
H + K V+++ A+ LN E L ++ H+G G ++ A K GK ID + G EG
Sbjct: 179 HGTSHKYVSQRAAEILNKPIESLKIITCHLGNGSSIAAVKNGKSIDTSMGFTPLEGLAMG 238

Query: 215 ERAGTVPVGQLVEMCFSGEYYRDEMVKKLVGQGGLVSLIGTNDAIK--VEQMVEKGDPEA 272
R+G++ + + +E+V L + G+ + G + + + + GD A
Sbjct: 239 TRSGSIDPSIISYLMEKENISAEEVVNILNKKSGVYGISGISSDFRDLEDAAFKNGDKRA 298

Query: 273 TLIYKAMAYQVAKEIGGASAVLHGKIDAIVLTGGLAYSKILVDE 316
L AY+V K IG +A + G +D IV T G+ + + E
Sbjct: 299 QLALNVFAYRVKKTIGSYAAAM-GGVDVIVFTAGIGENGPEIRE 341


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4008  HTHFIS        399    e-135    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 399 bits (1027), Expect = e-135
Identities = 135/497 (27%), Positives = 221/497 (44%), Gaps = 73/497 (14%)

Query: 229 VTDLKEIQTLLEAIIN----------SSEEAISVVDEKGRGLVINPAYTKLTGLTEEEII 278
D I+T+L ++ ++ + LV+
Sbjct: 9 ADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVV---------------- 52

Query: 279 GKPATTDIVEGESMHMKVLRTRRAVRGIHMKIGQKKRDVIV----NVAPVIVDGILKGSV 334
TD+V + +L + R V+V N + KG+
Sbjct: 53 -----TDVVMPDENAFDLLPRIKKAR--------PDLPVLVMSAQNTFMTAIKASEKGAY 99

Query: 335 GVIRDVSEIQKLTNELNRA-----RQIIRTLEAKYSFDDIVGDSDETTAAIEQAKLGANT 389
+ ++ +L + RA R+ + + +VG S T
Sbjct: 100 DYLPKPFDLTELIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQT 159

Query: 390 PATVLLRGESGTGKELFAHAIHNSSNRKYNKFVRVNCAAISETLLESELFGYEEGAFSGA 449
T+++ GESGTGKEL A A+H+ R+ FV +N AAI L+ESELFG+E+GAF+GA
Sbjct: 160 DLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGA 219

Query: 450 KRGGKRGFFEEANNGSVFLDEIGELSANTQAKLLRVLQEKEIVKVGGTKAIPINVRVIAA 509
+ G FE+A G++FLDEIG++ + Q +LLRVLQ+ E VGG I +VR++AA
Sbjct: 220 QTRST-GRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAA 278

Query: 510 THVNLEKGILEGEFREDLYYRLNKIPIQIPSLRQRKGDIPAIAERLIQKINQDYGRNVEG 569
T+ +L++ I +G FREDLYYRLN +P+++P LR R DIP + +Q+ ++ G +V+
Sbjct: 279 TNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKE-GLDVKR 337

Query: 570 LTDSAVSYLQSYEWPGNVRELENILGRAIIFMNYNEIYIDVHH----------------- 612
A+ ++++ WPGNVRELEN++ R + I ++
Sbjct: 338 FDQEALELMKAHPWPGNVRELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAA 397

Query: 613 ------LPPLHKEEQVEPKQNNLLPELEEKALEHLVTEFEGNIICEYLEKFDGNKTKTAK 666
+ +E + + + ++ E E +I L GN+ K A
Sbjct: 398 RSGSLSISQAVEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAAD 457

Query: 667 ALGISVRNLYYKLEKYD 683
LG++ L K+ +
Sbjct: 458 LLGLNRNTLRKKIRELG 474


52  BcerKBAB4_4030  BcerKBAB4_4039  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_4030  2       17       1.116798   polyprenyl synthetase
BcerKBAB4_4031  1       16       1.816386   exodeoxyribonuclease VII small subunit
BcerKBAB4_4032  1       17       1.736763   exodeoxyribonuclease VII large subunit
BcerKBAB4_4033  1       16       2.128577   bifunctional 5,10-methylene-tetrahydrofolate
BcerKBAB4_4034  2       15       1.497358   transcription antitermination protein NusB
BcerKBAB4_4035  1       12       1.216239   hypothetical protein
BcerKBAB4_4036  2       12       0.425046   acetyl-CoA carboxylase biotin carboxylase
BcerKBAB4_4037  2       11       -0.708704  acetyl-CoA carboxylase biotin carboxyl carrier
BcerKBAB4_4038  2       14       -0.839355  hypothetical protein
BcerKBAB4_4039  2       14       -1.119434  stage III sporulation protein AH
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4037  RTXTOXIND     28     0.017    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 27.9 bits (62), Expect = 0.017
Identities = 11/27 (40%), Positives = 14/27 (51%), Gaps = 1/27 (3%)

Query: 137 EGEIV-EILVNNGQLVEYGQPLFLVKA 162
E IV EI+V G+ V G L + A
Sbjct: 103 ENSIVKEIIVKEGESVRKGDVLLKLTA 129


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4039  IGASERPTASE   28     0.027    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 28.5 bits (63), Expect = 0.027
Identities = 24/120 (20%), Positives = 47/120 (39%), Gaps = 3/120 (2%)

Query: 24 VTTPDKMNTAAPATGEKMGQEKQGVDKAVTKETTKETTNKETTKENTSKETTNKETDKKE 83
+TTP+ + P+ +E VD+A T ++ T + + +K +K E
Sbjct: 997 ITTPNNIQADVPSVPSN-NEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNE 1055

Query: 84 NDKKETSKKEASVTVQSSDENFTALRMQMEDQRSVEKEKLQNVMKSSKSS--AEEKSKAK 141
D ET+ + V ++ + Q E ++ Q ++ EEK+K +
Sbjct: 1056 QDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVE 1115


53  BcerKBAB4_4059  BcerKBAB4_4091  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_4059  3       17       -1.077631  PadR-like family transcriptional regulator
BcerKBAB4_4060  2       14       -0.892658  hypothetical protein
BcerKBAB4_4061  1       14       0.114458   hypothetical protein
BcerKBAB4_4062  2       15       0.567221   hypothetical protein
BcerKBAB4_4063  2       14       0.237269   biotin/lipoate A/B protein ligase
BcerKBAB4_4064  3       19       1.029875   rhodanese domain-containing protein
BcerKBAB4_4065  1       17       0.622840   hypothetical protein
BcerKBAB4_4066  1       20       0.507418   LacI family transcriptional regulator
BcerKBAB4_4067  3       21       0.239755   TetR family transcriptional regulator
BcerKBAB4_4068  1       18       -2.617697  small multidrug resistance protein
BcerKBAB4_4069  2       20       -4.181124  small multidrug resistance protein
BcerKBAB4_4070  3       21       -5.024153  group-specific protein
BcerKBAB4_4071  1       17       -4.291332  hypothetical protein
BcerKBAB4_4072  -1      16       -3.901776  hypothetical protein
BcerKBAB4_4073  -1      17       -3.649087  hypothetical protein
BcerKBAB4_4074  0       18       -2.578339  ABC transporter permease
BcerKBAB4_4075  0       14       0.001589   ABC transporter permease
BcerKBAB4_4076  0       19       0.675401   ABC transporter
BcerKBAB4_4077  1       24       1.378146   GntR family transcriptional regulator
BcerKBAB4_4078  1       20       0.956357   hypothetical protein
BcerKBAB4_4079  1       19       1.038881   hypothetical protein
BcerKBAB4_4080  1       19       1.139204   glycine dehydrogenase subunit 2
BcerKBAB4_4081  2       17       0.645520   glycine dehydrogenase subunit 1
BcerKBAB4_4082  0       15       0.039108   glycine cleavage system aminomethyltransferase
BcerKBAB4_4083  2       15       -0.522060  non-specific serine/threonine protein kinase
BcerKBAB4_4084  0       16       -0.491407  hypothetical protein
BcerKBAB4_4085  2       15       -1.331288  hypothetical protein
BcerKBAB4_4086  1       13       -2.200517  hypothetical protein
BcerKBAB4_4087  0       15       -2.425954  hypothetical protein
BcerKBAB4_4088  -1      19       -2.498419  hypothetical protein
BcerKBAB4_4089  0       20       -2.524716  shikimate kinase
BcerKBAB4_4090  -1      21       -2.701761  2OG-Fe(II) oxygenase
BcerKBAB4_4091  -2      21       -3.395948  ComG operon protein 7
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4062  IGASERPTASE   36     1e-04    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 36.2 bits (83), Expect = 1e-04
Identities = 20/110 (18%), Positives = 29/110 (26%), Gaps = 7/110 (6%)

Query: 104 KENKEAAEQEETVVEATPKKEVVVEAPKAVTPAPKPITRVETPAAPKPTPVPTPKSVSVE 163
E KE A E+ ++ +PK +P P E
Sbjct: 1098 TETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKE 1157

Query: 164 AAIELSTPAP-------VKKAVPTPVTKQETAPVTPVKPKQPALTETNTK 206
+ +T A V PVT+ T + P T T
Sbjct: 1158 PQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATT 1207


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4063  DHBDHDRGNASE  30     0.010    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 29.6 bits (66), Expect = 0.010
Identities = 26/98 (26%), Positives = 41/98 (41%), Gaps = 8/98 (8%)

Query: 93 VIVSEDHPNMPKTVTEAYRIISQGLLDGFKALGLE-AYYAVPKTEADRENLKNPRSG-VC 150
V V + +P+T AY + K LGLE A Y + R N+ +P S
Sbjct: 140 VTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNI------RCNIVSPGSTETD 193

Query: 151 FDAPSWYEIVVEGRKIAGSAQTRQKGVILQHGSIPLEI 188
W + + I GS +T + G+ L+ + P +I
Sbjct: 194 MQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDI 231


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4067  HTHTETR       63     1e-14    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 62.7 bits (152), Expect = 1e-14
Identities = 35/166 (21%), Positives = 64/166 (38%), Gaps = 6/166 (3%)

Query: 2 TANRIKAVALSHFARYGYEGTSLANIAQEVGIKKPSIYAHFKGKEELYFTCLESALQKDL 61
T I VAL F++ G TSL IA+ G+ + +IY HFK K +L+ E +
Sbjct: 12 TRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIG 71

Query: 62 QSFTDDIENFSKSSTEELLVNLLKGYAKRFGESEESMFWLRTSYFPPDAFRE-QIIDK-- 118
+ + F +L +L + E + + + E ++ +
Sbjct: 72 ELELEYQAKFPGDP-LSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQ 130

Query: 119 ANVHIENVGKLLFPVFKRASEQDELH-NIEVKDALEAFLCLLDGLM 163
N+ +E+ ++ K E L ++ + A + GLM
Sbjct: 131 RNLCLESYDRIE-QTLKHCIEAKMLPADLMTRRAAIIMRGYISGLM 175


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4080  HELNAPAPROT   29     0.017    Helicobacter neutrophil-activating protein A family ...
		>HELNAPAPROT#Helicobacter neutrophil-activating protein A family signature.

Length = 153

Score = 29.5 bits (66), Expect = 0.017
Identities = 15/64 (23%), Positives = 24/64 (37%)

Query: 399 DIAKRLLDFGYHPPTIYFPLNVEECIMIEPTETESKETLDGFIDKMIQIAKEVEENPEVV 458
IA+RLL G P I ET + E + ++ QI+ E + +
Sbjct: 63 TIAERLLAIGGQPVATVKEYTEHASITDGGNETSASEMVQALVNDYKQISSESKFVIGLA 122

Query: 459 QEAP 462
+E
Sbjct: 123 EENQ 126


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4081  PF04605       28     0.031    Virulence-associated protein D (VapD)
		>PF04605#Virulence-associated protein D (VapD)

Length = 125

Score = 28.3 bits (63), Expect = 0.031
Identities = 14/77 (18%), Positives = 32/77 (41%), Gaps = 14/77 (18%)

Query: 5 YLPMTEEDKKEMLQTIGVQTIDELFSDIPESVR------------FKGDLKIKEAKSEPE 52
Y +++ +++ + + + F+ + E V+ K ++ AK +
Sbjct: 48 YTSKEPINERRVIRIV--NKLTKKFTWLGECVKEFDITEIGEQYSLKETIQDLCAKDFHQ 105

Query: 53 LLKELSQMASKNANLKE 69
LKE ++ KN LK+
Sbjct: 106 KLKEFTEKTPKNQKLKD 122


54  BcerKBAB4_4159  BcerKBAB4_4165  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_4159  0       18       3.281949   GatB/YqeY domain-containing protein
BcerKBAB4_4160  2       22       3.843888   30S ribosomal protein S21
BcerKBAB4_4161  2       22       3.828105   RNA modification protein
BcerKBAB4_4162  2       19       3.525743   16S ribosomal RNA methyltransferase RsmE
BcerKBAB4_4163  1       17       2.774992   50S ribosomal protein L11 methyltransferase
BcerKBAB4_4164  2       17       2.515696   chaperone protein DnaJ
BcerKBAB4_4165  2       20       1.953865   molecular chaperone DnaK
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4165  SHAPEPROTEIN  153    1e-43    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 153 bits (388), Expect = 1e-43
Identities = 75/355 (21%), Positives = 133/355 (37%), Gaps = 44/355 (12%)

Query: 2 SKIIGIDLGTTNSCVAVMEGGEPKVIPNPEGNRTTPSVVAFKNEERQVGEVAKRQAITNP 61
S + IDLGT N+ + V G P+ R VG AK+ P
Sbjct: 10 SNDLSIDLGTANTLIYVKGQGIVLNEPSVVAIRQDR--AGSPKSVAAVGHDAKQMLGRTP 67

Query: 62 NTIMSVKRHMGTDYKVEVEGKDFTPQEISAIILQNLKASAEAYLGETVTKAVITVPAYFN 121
I +++ K V F +++ ++ + +++ + ++ VP
Sbjct: 68 GNIAAIR-----PMKDGVIADFFVTEKMLQHFIKQVHSNS---FMRPSPRVLVCVPVGAT 119

Query: 122 DAERQATKDAGRIAGLEVERIINEPTAAALAYGLEKQDEEQKILVYDLGGGTFDVSILEL 181
ER+A +++ + AG +I EP AAA+ GL E +V D+GGGT +V+++ L
Sbjct: 120 QVERRAIRESAQGAGAREVFLIEEPMAAAIGAGL-PVSEATGSMVVDIGGGTTEVAVISL 178

Query: 182 ADGTFEVISTAGDNRLGGDDFDQVIIDHLVAEFKKENSIDLSQDKMALQRLKDAAEKAKK 241
+ + R+GGD FD+ II+++ + + AE+ K
Sbjct: 179 NG-----VVYSSSVRIGGDRFDEAIINYVRRNYG-------------SLIGEATAERIKH 220

Query: 242 DLSGVTQ----TQISLPFISAGAAGPLHLELTLTRAKFEEISAGLVERTLEPTRRALKDA 297
++ +I + + P L + E + L + AL+
Sbjct: 221 EIGSAYPGDEVREIEVRGRNLAEGVPRGFTLN-SNEILEALQEPLTG-IVSAVMVALEQ- 277

Query: 298 GFAPSELDK------VILVGGSTRIPAVQEAIKRETGKEPYKGVNPDEVVALGAA 346
P EL ++L GG + + + ETG +P VA G
Sbjct: 278 --CPPELASDISERGMVLTGGGALLRNLDRLLMEETGIPVVVAEDPLTCVARGGG 330


55  BcerKBAB4_4242  BcerKBAB4_4267  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_4242  2       14       2.629459   hypothetical protein
BcerKBAB4_4243  2       16       2.595180   tRNA-specific 2-thiouridylase MnmA
BcerKBAB4_4244  3       21       3.448727   class V aminotransferase
BcerKBAB4_4245  3       26       3.467408   BadM/Rrf2 family transcriptional regulator
BcerKBAB4_4246  4       25       3.264691   recombination factor protein RarA
BcerKBAB4_4247  4       26       2.956151   transcription factor RsfA
BcerKBAB4_4248  3       28       3.219017   UBA/THIF-type NAD/FAD binding protein
BcerKBAB4_4249  2       21       2.702212   aspartyl-tRNA synthetase
BcerKBAB4_4250  2       19       1.548705   histidyl-tRNA synthetase
BcerKBAB4_4251  -1      14       1.373540   hypothetical protein
BcerKBAB4_4252  -1      13       1.466483   hypothetical protein
BcerKBAB4_4253  0       14       1.371605   D-tyrosyl-tRNA(Tyr) deacylase
BcerKBAB4_4254  0       14       1.200065   (p)ppGpp synthetase I SpoT/RelA
BcerKBAB4_4255  0       12       1.567817   adenine phosphoribosyltransferase
BcerKBAB4_4256  0       10       1.264014   single-stranded-DNA-specific exonuclease RecJ
BcerKBAB4_4257  1       14       0.790865   cation diffusion facilitator family transporter
BcerKBAB4_4258  2       16       0.672807   bifunctional preprotein translocase subunit
BcerKBAB4_4259  0       18       1.698093   hypothetical protein
BcerKBAB4_4260  0       18       2.382641   sporulation stage V protein B
BcerKBAB4_4261  0       19       1.900172   hypothetical protein
BcerKBAB4_4262  0       21       3.999936   hypothetical protein
BcerKBAB4_4263  1       20       4.120021   preprotein translocase subunit YajC
BcerKBAB4_4264  0       19       3.148051   queuine tRNA-ribosyltransferase
BcerKBAB4_4265  0       14       2.179067   S-adenosylmethionine--tRNA
BcerKBAB4_4266  0       13       0.851398   hypothetical protein
BcerKBAB4_4267  2       14       1.160797   Holliday junction DNA helicase RuvB
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4242  SYCDCHAPRONE  33     4e-04    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 33.0 bits (75), Expect = 4e-04
Identities = 20/122 (16%), Positives = 36/122 (29%)

Query: 83 EQFAEAKAVFEQAMQVGLQSADVTFMLGITHVQLGNDRLALPFLQRATELDEGDVEAVFQ 142
E F + ++ + + + L Q G A Q LD D
Sbjct: 16 ESFLKGGGTIAMLNEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLG 75

Query: 143 CGLCFARLEHIQEAKPYFEKVLEMDEEHADAYYNLGVAYVFEENNEKALTLFKKATEIQA 202
G C + A + MD + ++ + + +A + A E+ A
Sbjct: 76 LGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQELIA 135

Query: 203 DH 204
D
Sbjct: 136 DK 137



Score = 31.8 bits (72), Expect = 0.001
Identities = 17/91 (18%), Positives = 33/91 (36%)

Query: 8 GIKYMQEGNWEEAAKNFTEAIEENPKDALGYINFANLLDVLGDSERAIVFYKRALELDGK 67
Q G +E+A K F + D+ ++ +G + AI Y +D K
Sbjct: 43 AFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIK 102

Query: 68 SAAAYYGLGNVYYGQEQFAEAKAVFEQAMQV 98
+ + + AEA++ A ++
Sbjct: 103 EPRFPFHAAECLLQKGELAEAESGLFLAQEL 133


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4251  PF05043       25     0.019    Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 24.9 bits (54), Expect = 0.019
Identities = 9/32 (28%), Positives = 14/32 (43%)

Query: 25 FISKEQNNTSMELASEFGISLQDVKRLKKQIE 56
FI + + + EF IS + R+ QI
Sbjct: 94 FIFFNEGCQAESICKEFYISSSSLYRIISQIN 125


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4253  THERMOLYSIN   28     0.011    Thermolysin metalloprotease (M4) family signature.
		>THERMOLYSIN#Thermolysin metalloprotease (M4) family signature.

Length = 544

Score = 28.4 bits (63), Expect = 0.011
Identities = 24/118 (20%), Positives = 46/118 (38%), Gaps = 16/118 (13%)

Query: 16 DGEIVGQIPFGLTLLVGITHEDTEKDATYIAEKIANLRIFEDESGKMNHSVLDMKGQVLS 75
DG+ +PF + V + HE T + + A L ++++ESG +N ++ D+ G ++
Sbjct: 352 DGDGQTFLPFSGGIDV-VGHELTHA----VTDYTAGL-VYQNESGAINEAMSDIFGTLVE 405

Query: 76 ----------ISQFTLYGDCRKGRRPNFMDAAKPDYAEHLYDFFNEEVRKQGLHVETG 123
I + + D AK +H + G+H +G
Sbjct: 406 FYANRNPDWEIGEDIYTPGVAGDALRSMSDPAKYGDPDHYSKRYTGTQDNGGVHTNSG 463


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4258  SECFTRNLCASE  270    1e-86    Bacterial translocase SecF protein signature.
		>SECFTRNLCASE#Bacterial translocase SecF protein signature.

Length = 333

Score = 270 bits (693), Expect = 1e-86
Identities = 96/312 (30%), Positives = 161/312 (51%), Gaps = 20/312 (6%)

Query: 450 INFVNIGHKFFLFSIVVVIAGAIILPIFKMNLGIDFASGTRIDLQSKQATTVSNIHKDLK 509
+F F +IV++IA I+ + +N GIDF GT I +S A V L+
Sbjct: 14 FDFFRWQWATFGAAIVMMIASVILPLVIGLNFGIDFKGGTTIRTESTTAIDVGVYRAALE 73

Query: 510 ELNID---VKEEDIVPTGDDNKGFAVR-----------TLGVLSKDEIAKTKTFFN--DK 553
L + + E +D +R G ++ + K +T D
Sbjct: 74 PLELGDVIISEVRDPSFREDQHVAMIRIQMQEDGQGAEGQGAQGQELVNKVETALTAVDP 133

Query: 554 YGTEPNVSTVSPTIGKEIARNAFIAVLIASLVIILYVSIRFRFTYALSAVIALLHDAFVM 613
+ +V P + E+ A ++L A++VI+ Y+ +RF + +AL AV+AL+HD +
Sbjct: 134 ALKITSFESVGPKVSGELVWTAVWSLLAATVVIMFYIWVRFEWQFALGAVVALVHDVLLT 193

Query: 614 IVMFSLFQIEVDLTFIAAILTIIGYSINDSIVTFDRNRELYKQKKRVRDIKDLEEIVNSS 673
+ +F++ Q++ DLT +AA+LTI GYSIND++V FDR RE + K L +++N S
Sbjct: 194 VGLFAVLQLKFDLTTVAALLTITGYSINDTVVVFDRLRENLIKYKT----MPLRDVMNLS 249

Query: 674 IRQTIGRSINTVLTVLFPVIALLIFGSESLRNFSLALLIGLVVGTYSSIFVASQIWLMLE 733
+ +T+ R++ T +T L ++ +LI+G + +R F A++ G+ GTYSS++VA I L +
Sbjct: 250 VNETLSRTVMTGMTTLLALVPMLIWGGDVIRGFVFAMVWGVFTGTYSSVYVAKNIVLFIG 309

Query: 734 NRRLKRGKTKKK 745
R K K
Sbjct: 310 LDRNKEKKDPSD 321



Score = 63.7 bits (155), Expect = 7e-13
Identities = 35/176 (19%), Positives = 81/176 (46%), Gaps = 3/176 (1%)

Query: 250 SVGAKFGQQALEQTIFASAIGIAIIFLFMLV-FYRLPGLVAVIMLGLYIFVTLLVFNWMH 308
SVG K + + +++ +I ++ V F L AV+ L + +T+ +F +
Sbjct: 142 SVGPKVSGELVWTAVWSLLAATVVIMFYIWVRFEWQFALGAVVALVHDVLLTVGLFAVLQ 201

Query: 309 AVLTLPGIAALVLGVGIAVDANIITYERLKEELKI--GKSMMSAFRAGNHRSLSTILDAN 366
L +AAL+ G +++ ++ ++RL+E L + + +LS +
Sbjct: 202 LKFDLTTVAALLTITGYSINDTVVVFDRLRENLIKYKTMPLRDVMNLSVNETLSRTVMTG 261

Query: 367 ITTIAAAGVLFAYGNSSVKGFATSLIVSILVGFITNVFGTRFLLGLLVKSRYFDKK 422
+TT+ A + +G ++GF +++ + G ++V+ + ++ + R +KK
Sbjct: 262 MTTLLALVPMLIWGGDVIRGFVFAMVWGVFTGTYSSVYVAKNIVLFIGLDRNKEKK 317


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4263  PF06580       28     0.006    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 27.5 bits (61), Expect = 0.006
Identities = 8/39 (20%), Positives = 19/39 (48%)

Query: 7 NIVMIVAMFAIFYFLLIRPQQKRQKAVAQMQNEIKKGDA 45
N+V++ M+++ YF + +Q + Q + +A
Sbjct: 123 NVVVVTFMWSLLYFGWHFFKNYKQAEIDQWKMASMAQEA 161


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4266  ACRIFLAVINRP  24     0.037    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 24.4 bits (53), Expect = 0.037
Identities = 13/59 (22%), Positives = 29/59 (49%), Gaps = 5/59 (8%)

Query: 1 MTEMPKLLITAGILLIVVGLAWKFIGRLPGDIFVKKGNVTFYFPIITCIVLSIALSFIM 59
M+++ L+ ++L V + F G G I+ + F I++ + LS+ ++ I+
Sbjct: 435 MSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQ-----FSITIVSAMALSVLVALIL 488


56  BcerKBAB4_4303  BcerKBAB4_4324  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_4303  -1      16       3.154618   valyl-tRNA synthetase
BcerKBAB4_4304  -2      12       2.733095   spore coat protein YsxE
BcerKBAB4_4305  -1      12       2.050605   sporulation stage VI protein D
BcerKBAB4_4306  1       14       1.967356   glutamate-1-semialdehyde aminotransferase
BcerKBAB4_4307  1       13       0.809761   delta-aminolevulinic acid dehydratase
BcerKBAB4_4308  2       11       0.567603   uroporphyrinogen-III synthase
BcerKBAB4_4309  2       13       0.090355   porphobilinogen deaminase
BcerKBAB4_4310  1       13       0.796263   cytochrome c assembly protein
BcerKBAB4_4311  0       14       2.122263   glutamyl-tRNA reductase
BcerKBAB4_4312  0       17       2.164249   MarR family transcriptional regulator
BcerKBAB4_4313  1       20       2.586890   OsmC family protein
BcerKBAB4_4314  1       17       2.010951   ribosome biogenesis GTP-binding protein YsxC
BcerKBAB4_4315  1       17       1.994073   ATP-dependent protease La
BcerKBAB4_4316  0       18       1.522787   sporulation protease LonB
BcerKBAB4_4317  3       20       0.058621   ATP-dependent protease ATP-binding subunit ClpX
BcerKBAB4_4318  3       18       -0.375065  trigger factor
BcerKBAB4_4319  2       16       -0.693216  hypothetical protein
BcerKBAB4_4320  1       17       0.794620   hypothetical protein
BcerKBAB4_4321  1       15       1.078221   hypothetical protein
BcerKBAB4_4322  1       14       0.981106   hypothetical protein
BcerKBAB4_4323  1       15       2.074566   **phosphodiesterase
BcerKBAB4_4324  2       14       2.089932   nucleoside-triphosphatase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4315  HTHFIS        37     2e-04    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 37.1 bits (86), Expect = 2e-04
Identities = 29/101 (28%), Positives = 43/101 (42%), Gaps = 14/101 (13%)

Query: 349 LCLVGPPGVGKTSLARSI-ATSLNRN--FVRVSLGGVRD---ESEIRGHRRTYVGAMPGR 402
L + G G GK +AR++ RN FV +++ + ESE+ GH + GA G
Sbjct: 163 LMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEK---GAFTGA 219

Query: 403 IIQGMKKAKTVNP-VFLLDEIDKMSNDFRGDPSAALLEVLD 442
+ + + LDEI M D + LL VL
Sbjct: 220 QTRSTGRFEQAEGGTLFLDEIGDMPMDAQ----TRLLRVLQ 256


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4316  HTHFIS        57     9e-11    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 57.1 bits (138), Expect = 9e-11
Identities = 43/214 (20%), Positives = 76/214 (35%), Gaps = 41/214 (19%)

Query: 43 ELEQLRKMREVSLTEPLAEKVR----PTSFIDIVGQEDGIKSLK--AALCGPNPQHVIIY 96
+L +L + +L EP + + +VG+ ++ + A ++I
Sbjct: 107 DLTELIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMIT 166

Query: 97 GPPGVGKTAAARLVLEEAKRNPKSPFRTNATFIELDATTARFDERGIADPLIGSVHDPIY 156
G G GK AR + + KR F+ ++ A I L G
Sbjct: 167 GESGTGKELVARALHDYGKRRNGP-------FVAINM--AAIPRDLIESELFGHE----- 212

Query: 157 QGAGAMGQAGIPQPKKGAVTDAHGGILFIDEIGELHPIQMNKMLKVLEDRKVFLESAYYS 216
GA G G A GG LF+DEIG++ ++L+VL+ +
Sbjct: 213 --KGAF--TGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGG--- 265

Query: 217 EENSMIPTYIHDIFQKGLPADFRLVGATTRSPEE 250
+ + +D R+V AT + ++
Sbjct: 266 --------------RTPIRSDVRIVAATNKDLKQ 285


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4317  HTHFIS        31     0.014    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 30.6 bits (69), Expect = 0.014
Identities = 38/198 (19%), Positives = 77/198 (38%), Gaps = 38/198 (19%)

Query: 86 VPKPVEIREILDEY--VIGQDNAK-KALAVAVYNHYKRINSNSKIDDV-----ELAKSNI 137
+PKP ++ E++ + + + L + + ++ + ++ L ++++
Sbjct: 102 LPKPFDLTELIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDL 161

Query: 138 S--LIGPTGSGKTLLAQTL---ARILNVPF------AIADATSLTEAGYVGEDVENILLK 186
+ + G +G+GK L+A+ L + N PF AI L E+ G +
Sbjct: 162 TLMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPR--DLIESELFGH-EKGAFTG 218

Query: 187 LIQAADYDVEKAEKGIIYIDEIDKVARKSENPSITRDVSGEGVQQALLKILEGTVASVPP 246
+ E+AE G +++DEI D+ Q LL++L+
Sbjct: 219 AQTRSTGRFEQAEGGTLFLDEIG-------------DMP-MDAQTRLLRVLQQG--EYTT 262

Query: 247 QGGRKHPHQEFIQIDTTN 264
GGR + + TN
Sbjct: 263 VGGRTPIRSDVRIVAATN 280


57  BcerKBAB4_4350  BcerKBAB4_4362  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_4350  0       14       4.846698   electron transfer flavoprotein alpha subunit
BcerKBAB4_4351  0       13       4.207592   electron transfer flavoprotein
BcerKBAB4_4352  0       14       4.315491   enoyl-CoA hydratase
BcerKBAB4_4353  0       14       4.349871   TetR family transcriptional regulator
BcerKBAB4_4354  0       14       4.172032   long-chain-fatty-acid--CoA ligase
BcerKBAB4_4355  -1      17       3.886103   triple helix repeat-containing collagen
BcerKBAB4_4356  -1      17       0.813275   periplasmic binding protein
BcerKBAB4_4357  0       19       1.155559   iron-hydroxamate transporter permease subunit
BcerKBAB4_4358  -1      15       -1.790465  DinB family protein
BcerKBAB4_4359  -2      16       -3.311772  hypothetical protein
BcerKBAB4_4360  -1      15       -3.047230  hypothetical protein
BcerKBAB4_4361  -2      16       -3.307902  hypothetical protein
BcerKBAB4_4362  0       16       -3.007542  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4353  HTHTETR       112    3e-33    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 112 bits (281), Expect = 3e-33
Identities = 35/192 (18%), Positives = 75/192 (39%), Gaps = 10/192 (5%)

Query: 5 RPKYNQIIDAAVIVIAENGYHQAQVSKIAKQAGVADGTIYLYFKNKEDILISLFQEKMGE 64
+ I+D A+ + ++ G + +IAK AGV G IY +FK+K D+ +++
Sbjct: 10 QETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESN 69

Query: 65 FVETIRQKIAGIESAVAKLFMLVETHFLLLSQNDPL--AIVTQLELRQSNQDLRLKINEV 122
E + A + + H L + + ++ + + + +
Sbjct: 70 IGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQA 129

Query: 123 LKGY----LQVMDEILETGIKQGEFQADLNVRVARQMIFGTVDEVVTNWVMSDHKYDLVA 178
+ +++ L+ I+ ADL R A ++ G + ++ NW+ + +DL
Sbjct: 130 QRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDL-- 187

Query: 179 LSKTVHGLLIAA 190
K +A
Sbjct: 188 --KKEARDYVAI 197


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4356  FERRIBNDNGPP  180    5e-57    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 180 bits (458), Expect = 5e-57
Identities = 60/258 (23%), Positives = 115/258 (44%), Gaps = 11/258 (4%)

Query: 52 AKKVVVLEWVYSEDLLALGVQPVGMADIKNYNKWVNTATKPGKDVVDVGTRQQPNLEEIS 111
++V LEW+ E LLALG+ P G+AD NY WV+ P V+DVG R +PNLE ++
Sbjct: 35 PNRIVALEWLPVELLLALGIVPYGVADTINYRLWVSEPPLP-DSVIDVGLRTEPNLELLT 93

Query: 112 RLKPDLIITASFRSKAIKNELEQIAPTVMFDPSTSNNDHFAEMTETFKQIAKAVGKEEEG 171
+KP ++ S L +IAP F+ S A ++ ++A + +
Sbjct: 94 EMKPSFMVW-SAGYGPSPEMLARIAPGRGFNFSDGKQP-LAMARKSLTEMADLLNLQSAA 151

Query: 172 KKVLADMDKTFADAKAKIDKADLKDKNIAMVQAFTAKNVPTFRILTDNSTALQVTKKLGL 231
+ LA + K + K + + + +++ + NS ++ + G+
Sbjct: 152 ETHLAQYEDFIRSMKPRFVKRGARP--LLLTTLIDPRHM---LVFGPNSLFQEILDEYGI 206

Query: 232 TNTFEAGKSEADGFKQTTVESLQSVQNSNFIYIVADDDNIFDTQLKGNPAWEGLNFKKED 291
N ++ G++ G +++ L + ++ + + D+ D L P W+ + F +
Sbjct: 207 PNAWQ-GETNFWGSTAVSIDRLAAYKDVDVLCFDHDNSKDMD-ALMATPLWQAMPFVRAG 264

Query: 292 KMYKLKGDTWIFGGPESA 309
+ ++ W +G SA
Sbjct: 265 RFQRVP-AVWFYGATLSA 281


58  BcerKBAB4_4389  BcerKBAB4_4409  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_4389  2       25       2.285882   ATPase central domain-containing protein
BcerKBAB4_4390  3       33       3.660433   asparaginyl-tRNA synthetase
BcerKBAB4_4391  2       30       2.655172   phenylalanyl-tRNA synthetase subunit beta
BcerKBAB4_4392  0       21       1.543465   phenylalanyl-tRNA synthetase subunit alpha
BcerKBAB4_4393  1       18       0.310834   tRNA/rRNA methyltransferase SpoU
BcerKBAB4_4394  0       15       0.065878   small acid-soluble spore protein SspI
BcerKBAB4_4395  1       13       0.074010   metal dependent phosphohydrolase
BcerKBAB4_4396  2       14       -0.363176  abortive infection protein
BcerKBAB4_4397  2       14       1.091594   hypothetical protein
BcerKBAB4_4398  2       16       0.658748   hypothetical protein
BcerKBAB4_4399  3       20       0.938692   EmrB/QacA family drug resistance transporter
BcerKBAB4_4400  4       24       0.945962   hypothetical protein
BcerKBAB4_4401  5       26       1.397624   TetR family transcriptional regulator
BcerKBAB4_4402  3       33       2.115611   cellulase
BcerKBAB4_4403  4       27       0.010560   dUTPase
BcerKBAB4_4404  2       23       0.706360   50S ribosomal protein L20
BcerKBAB4_4405  2       15       1.121958   50S ribosomal protein L35
BcerKBAB4_4406  3       14       0.890508   translation initiation factor IF-3
BcerKBAB4_4407  2       13       1.056008   threonyl-tRNA synthetase
BcerKBAB4_4408  2       12       0.564507   sporulation protein YtxC
BcerKBAB4_4409  2       12       1.502490   primosomal protein DnaI
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4389  HTHFIS        36     4e-04    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 35.6 bits (82), Expect = 4e-04
Identities = 35/137 (25%), Positives = 55/137 (40%), Gaps = 31/137 (22%)

Query: 185 LEKKEVISVEKAMAELDELIGLQSVKQKVKEIYNLVIFNQMRKEQGMKTDNLSLHMIFTG 244
K+ +E + L+G + Q++ + M+TD L ++ TG
Sbjct: 121 EPKRRPSKLEDDSQDGMPLVGRSAAMQEIYR----------VLARLMQTD---LTLMITG 167

Query: 245 NPGTGKTTVARLV-------AKIFKAL--GVLSKGHLVETDRSELVG----EFIGHTAPK 291
GTGK VAR + F A+ + + L+E SEL G F G
Sbjct: 168 ESGTGKELVARALHDYGKRRNGPFVAINMAAIPR-DLIE---SELFGHEKGAFTGAQTRS 223

Query: 292 TMKKIKEALGGVLFVDE 308
+ ++A GG LF+DE
Sbjct: 224 -TGRFEQAEGGTLFLDE 239


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4394  DNABINDINGHU  25     0.025    Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 24.7 bits (54), Expect = 0.025
Identities = 10/33 (30%), Positives = 15/33 (45%), Gaps = 1/33 (3%)

Query: 19 DQLQETIVDAIQSGEEKMLPGLGVLFEVIWKNA 51
D + + + GE+ L G G FEV + A
Sbjct: 27 DAVFSAVSSYLAKGEKVQLIGFGN-FEVRERAA 58


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4399  TCRTETB       142    5e-39    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 142 bits (360), Expect = 5e-39
Identities = 86/402 (21%), Positives = 175/402 (43%), Gaps = 18/402 (4%)

Query: 106 FVSILNQTIINVALPPLMNEFNVSTSTAQWLITGFMLVNGILVPISAFLVSRFTYRKLFI 165
F S+LN+ ++NV+LP + N+FN ++ W+ T FML I + L + ++L +
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 166 AAMLFFTVGSIICATSGN-FTMMMTGRVVQAVGAGILMPVGMNIFMTLFPPNKRGAAMGL 224
++ GS+I + F++++ R +Q GA + M + P RG A GL
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143

Query: 225 LGVAMILAPAIGPTVTGWVIENYSWNLMFYGMFVIGLIITFLSFKFFTLAQPVSKTKLDV 284
+G + + +GP + G + W+ + + + IIT + K D+
Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHYIHWSYLL--LIPMITIITVPFLMKLLKKEVRIKGHFDI 201

Query: 285 FGVISSSIGLGSLLYGFSEAGNNGWTSAEVVITLIIGVIGLAVFIWRELTTDNKMLDLQV 344
G+I S+G+ + + + LI+ V+ +F+ + +D +
Sbjct: 202 KGIILMSVGIVFFMLF---TTSYSISF------LIVSVLSFLIFVKHIRKVTDPFVDPGL 252

Query: 345 FKYPTFTFTLVINAIVTMALFGGMLLLPVYLQNIRGFTPMESG-LLLLPGSLIMGIMGPV 403
K F ++ I+ + G + ++P ++++ + E G +++ PG++ + I G +
Sbjct: 253 GKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYI 312

Query: 404 AGKLFDKYGIRPLAIVGLAITTFATYKFTTLSMDTPYSVIMTDYIIRSI--GMSFIMMPI 461
G L D+ G + +G+ + + F T S + II + G+SF I
Sbjct: 313 GGILVDRRGPLYVLNIGVTFLSVS---FLTASFLLETTSWFMTIIIVFVLGGLSFTKTVI 369

Query: 462 MTAGMNALPMKLISHGTATQNTSRQVAGSIGTAILITLMTQQ 503
T ++L + G + N + ++ G AI+ L++
Sbjct: 370 STIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIP 411


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4400  RTXTOXIND     75     4e-18    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 75.3 bits (185), Expect = 4e-18
Identities = 29/140 (20%), Positives = 47/140 (33%), Gaps = 22/140 (15%)

Query: 87 QTVDVTIPQNATVVQSNATT-NAFVGAGSPIAYAFDMS------NLWVTANIEETNIDDV 139
Q + P + V Q T V + M L VTA ++ +I +
Sbjct: 326 QASVIRAPVSVKVQQLKVHTEGGVVTTAETL-----MVIVPEDDTLEVTALVQNKDIGFI 380

Query: 140 QKGQTVDVYVDAYPDTT---LTGKVEQVGLTTANTFSMLPSSNATANYTKVKQVVPVKVS 196
GQ + V+A+P T L GKV+ + L V +
Sbjct: 381 NVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAI-------EDQRLGLVFNVIISIEENCL 433

Query: 197 LDHSKSVNIVPGMNVSVRIH 216
+K++ + GM V+ I
Sbjct: 434 STGNKNIPLSSGMAVTAEIK 453


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4401  HTHTETR       61     9e-14    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 61.2 bits (148), Expect = 9e-14
Identities = 22/98 (22%), Positives = 37/98 (37%), Gaps = 6/98 (6%)

Query: 9 PRVKRTRQLIQDAFVALVGEKGFENVTVQHIAERAPVNRATFYSHYHDKYDLLEKSIEEM 68
+ TRQ I D + L ++G + ++ IA+ A V R Y H+ DK DL + E
Sbjct: 7 QEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELS 66

Query: 69 LEKLAAVIKPQNRNKEDFQLTFDSPHPTFLALFEHIAD 106
+ + E P + H+ +
Sbjct: 67 ESNIGELE------LEYQAKFPGDPLSVLREILIHVLE 98


59  BcerKBAB4_4474  BcerKBAB4_4492  Y        Y        N        Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_4474  0       14       3.231167   amidohydrolase 3
BcerKBAB4_4475  0       15       2.188380   glyoxalase/bleomycin resistance
BcerKBAB4_4476  0       14       1.706023   AMP-dependent synthetase and ligase
BcerKBAB4_4477  2       16       1.216462   small acid-soluble spore protein alpha/beta
BcerKBAB4_4478  1       14       1.321275   thiamine biosynthesis protein ThiI
BcerKBAB4_4479  1       12       2.050858   class V aminotransferase
BcerKBAB4_4480  1       12       1.417749   septation ring formation regulator EzrA
BcerKBAB4_4482  1       14       1.779892   TetR family transcriptional regulator
BcerKBAB4_4483  2       22       2.661097   putative GAF sensor protein
BcerKBAB4_4484  2       23       2.411796   methionine gamma-lyase
BcerKBAB4_4485  3       27       2.050824   30S ribosomal protein S4
BcerKBAB4_4486  0       17       0.011401   hypothetical protein
BcerKBAB4_4487  0       17       0.695372   hypothetical protein
BcerKBAB4_4488  -1      18       2.489198   tyrosyl-tRNA synthetase
BcerKBAB4_4489  1       13       2.245891   hypothetical protein
BcerKBAB4_4490  0       14       2.288561   ECF subfamily RNA polymerase sigma-24 factor
BcerKBAB4_4491  0       17       3.206304   putative lipoprotein
BcerKBAB4_4492  -1      17       3.413493   hexapaptide repeat-containing transferase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4482  HTHTETR       71     3e-17    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 70.8 bits (173), Expect = 3e-17
Identities = 29/174 (16%), Positives = 61/174 (35%), Gaps = 9/174 (5%)

Query: 2 KQTKQKVIDAAIALFNTKGYDGTSVREIAKRADVNVANISYYFAGKQGLLEQLITDFLEG 61
++T+Q ++D A+ LF+ +G TS+ EIAK A V I ++F K L ++
Sbjct: 10 QETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESN 69

Query: 62 YIHVIETSFEQREYLSAKDVMVQMVRGI-LRYQFDNRELTRFFYRELSL---DTTLIREV 117
+ + + ++ + + R L + ++++
Sbjct: 70 IGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQA 129

Query: 118 MTVYFSRERYYIEQIIKQGQMNQEFK-----KVSFTMFMTQLKGMMNMPFLYPQ 166
IEQ +K + + + + + G+M PQ
Sbjct: 130 QRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQ 183


60  BcerKBAB4_4501  BcerKBAB4_4521  Y        Y        N        Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_4501  2       20       -1.339987  large-conductance mechanosensitive channel
BcerKBAB4_4502  0       13       -2.395207  putative transcriptional regulator
BcerKBAB4_4503  1       14       -1.896725  hypothetical protein
BcerKBAB4_4504  4       18       -0.621439  hypothetical protein
BcerKBAB4_4505  3       23       0.797460   hypothetical protein
BcerKBAB4_4506  2       23       0.856238   global transcriptional regulator, catabolite
BcerKBAB4_4508  2       25       0.254849   CamS sex pheromone cAM373 family protein
BcerKBAB4_4509  4       26       2.033152   hypothetical protein
BcerKBAB4_4510  2       23       2.042785   hypothetical protein
BcerKBAB4_4511  -1      19       1.167656   peptidase M29 aminopeptidase II
BcerKBAB4_4512  0       15       -2.060048  hypothetical protein
BcerKBAB4_4513  -1      14       -0.708851  hypothetical protein
BcerKBAB4_4514  -1      13       0.032564   N-acetyltransferase GCN5
BcerKBAB4_4515  2       13       2.311859   N-acetyltransferase GCN5
BcerKBAB4_4516  3       11       2.197867   PadR-like family transcriptional regulator
BcerKBAB4_4517  3       11       2.174549   hypothetical protein
BcerKBAB4_4518  4       12       2.735868   UDP-N-acetylmuramate--L-alanine ligase
BcerKBAB4_4519  4       12       2.704353   nicotinate phosphoribosyltransferase
BcerKBAB4_4520  5       12       2.650304   cell division protein FtsK
BcerKBAB4_4521  2       11       1.026543   cell wall hydrolase/autolysin
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4501  MECHCHANNEL   141    1e-46    Bacterial mechano-sensitive ion channel signature.
		>MECHCHANNEL#Bacterial mechano-sensitive ion channel signature.

Length = 136

Score = 141 bits (358), Expect = 1e-46
Identities = 74/135 (54%), Positives = 96/135 (71%), Gaps = 10/135 (7%)

Query: 1 MWNEFKKFAFKGNVIDLAVGVVIGAAFGKIVSSLVKDIITPLLGMVLGGVNFTDLKLTFG 60
+ EF++FA +GNV+DLAVGV+IGAAFGKIVSSLV DII P LG+++GG++F +T
Sbjct: 3 IIKEFREFAMRGNVVDLAVGVIIGAAFGKIVSSLVADIIMPPLGLLIGGIDFKQFAVTLR 62

Query: 61 KS-------SIMYGNFIQTIFDFLIIAAAIFMFVKVFNKLTSKREEEEKKEELPEPTKEE 113
+ + YG FIQ +FDFLI+A AIFM +K+ NKL K+EE P PTKEE
Sbjct: 63 DAQGDIPAVVMHYGVFIQNVFDFLIVAFAIFMAIKLINKLNRKKEEP---AAAPAPTKEE 119

Query: 114 EILGEIRDLLKQQNS 128
+L EIRDLLK+QN+
Sbjct: 120 VLLTEIRDLLKEQNN 134


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4514  SACTRNSFRASE  49     6e-10    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 48.8 bits (116), Expect = 6e-10
Identities = 29/123 (23%), Positives = 57/123 (46%), Gaps = 4/123 (3%)

Query: 30 YNVPITKEEQPDLLEIETFYQRDNGNFWVATYDGKVVGTIALLDIGKHQVALRKMFVKKE 89
++ P K+ + D +++ + + + ++ + +G I + + + V K+
Sbjct: 42 FSKPYFKQYEDDDMDVS-YVEEEGKAAFLYYLENNCIGRIKIRSNWNGYALIEDIAVAKD 100

Query: 90 FRGKEWGASYTLLQTAISWAKEKNLKDIYLGTTVKFLAAHRFYEKNNFQSVSIDE-LPKS 148
+R K G LL AI WAKE + + L T ++A FY K++F ++D L +
Sbjct: 101 YRKK--GVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFIIGAVDTMLYSN 158

Query: 149 FPV 151
FP
Sbjct: 159 FPT 161


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4520  IGASERPTASE   72     2e-14    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 72.0 bits (176), Expect = 2e-14
Identities = 63/329 (19%), Positives = 106/329 (32%), Gaps = 6/329 (1%)

Query: 378 VELEKSEKATEEVVELEKSEEAAEEVVELEKSEEAAEEVVELEKPEEATEEVVELEETEE 437
VE T + + A V E A + + P AT E
Sbjct: 985 VEKRNQTVDTTNITTPNNIQ-ADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAEN 1043

Query: 438 AIEEVAELEEAEEAAEE--VVELEETKEATEEVVELEKSEEVTEEVVELEETKEATEEVA 495
+ +E +E+ E+ A E E KEA V ++ EV + E +ET+ +
Sbjct: 1044 SKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKET 1103

Query: 496 ELEEAEEEAAEEVVELEKSEEATEEVVELEEAEEATEEVAELEKSEEVTEEVVEL-EETK 554
E EE+A E + ++ + T +V +E E + AE + + T + E +T
Sbjct: 1104 ATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTN 1163

Query: 555 EATEEVAELEEAEEAAEEVVELEKSEEATEEVVE--LEEAEEATEEVVELEEAEEATEEV 612
+ +E E+ V + VVE T+ V E + +
Sbjct: 1164 TTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRH 1223

Query: 613 AELEGTKEEEPTSQETVIEETMNTDLVENTPVAEQPVISQQETITFKEESEVFVPVSETD 672
+ T + L + T V+S V VS+
Sbjct: 1224 RRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTNAVLSDARAKAQFVALNVGKAVSQHI 1283

Query: 673 EQTKKDVQNFANVLVEEAEEKKQVAEEQP 701
Q + + + NV V K + Q
Sbjct: 1284 SQLEMNNEGQYNVWVSNTSMNKNYSSSQY 1312



Score = 67.0 bits (163), Expect = 4e-13
Identities = 58/325 (17%), Positives = 110/325 (33%), Gaps = 9/325 (2%)

Query: 319 AEVAANQVEEETLEDVVIVKADEKLEETITIEIPDAFEEAKEAEEVVELEATEEAIEEVV 378
E V+ + ++AD + EI +EA + E E V
Sbjct: 985 VEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIA-RVDEAPVPPPAPATPS--ETTETVA 1041

Query: 379 ELEKSEKATEEVVELEKSEEAAEEVVELEKSEEAAEEVVELEKPEEATEEVVELEETEEA 438
E K E T E E + +E A+ E ++EA V + E + E +ET+
Sbjct: 1042 ENSKQESKTVEKNEQDATETTAQNR---EVAKEAKSNVKANTQTNEVAQSGSETKETQTT 1098

Query: 439 IEEVAELEEAEEAAEEVVELEETKEATEEVVELEKSEEVTEEVVELEETKEATEEVAELE 498
+ E EE A+ VE E+T+E + ++ +E +E V E + ++
Sbjct: 1099 ETKETATVEKEEKAK--VETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIK 1156

Query: 499 EAEEEAAEEVVELEKSEEATEEVVELEEAEEATEEVAELEKSEEVTEEVVELEETKEATE 558
E + + + ++E + V + + ++ E T +
Sbjct: 1157 EPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESS 1216

Query: 559 EVAELEEAEEAAEEVVELEKSEEATEEVVELEEAEE-ATEEVVELEEAEEATEEVAELEG 617
+ +E + ++ + + + +T L +A + VA G
Sbjct: 1217 NKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTNAVLSDARAKAQFVALNVG 1276

Query: 618 TKEEEPTSQETVIEETMNTDLVENT 642
+ SQ + E V NT
Sbjct: 1277 KAVSQHISQLEMNNEGQYNVWVSNT 1301



Score = 59.3 bits (143), Expect = 1e-10
Identities = 55/290 (18%), Positives = 87/290 (30%), Gaps = 33/290 (11%)

Query: 632 ETMNTDLVENTPVAEQPVISQQETITFKEESEVFVPVSETDEQTKKDVQNFANVLVEEAE 691
+T N N V S E I +E+ V P T +T + V + + E
Sbjct: 993 DTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVE 1052

Query: 692 EKKQVAEEQPALQIEEPKREKKRHVPFNVVMLKQDRKKLMERHAARTNVMQSTVSERVEE 751
+ +Q A E Q E +E K +V N + + +
Sbjct: 1053 KNEQDATE-TTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQT-------------- 1097

Query: 752 KPMQQVVVEPQAEEKPMQQVVVEPQAEEKPMQQVVVEPQAEEKPMQQVVVEPQAEEKPMQ 811
E EK + V + +E P V P+ E+ Q EP E P
Sbjct: 1098 ----TETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTV 1153

Query: 812 QVVVEPQAE-------EKPMQQMVVDPQVEEKPVQQVVVDPQVEESPVQQVVVEAQVEEK 864
+ EPQ++ E+P ++ + + V V E+P Q
Sbjct: 1154 N-IKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQ---- 1208

Query: 865 PMPQVVVEPQVEEKPMQQVVVAGQVQEPISSTEVQEKAYVVNQKENDMRN 914
P V E + K + V +T V + N
Sbjct: 1209 --PTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTN 1256



Score = 44.3 bits (104), Expect = 4e-06
Identities = 51/298 (17%), Positives = 89/298 (29%), Gaps = 36/298 (12%)

Query: 496 ELEEAEEEAAEEVVELEKSEEATEEVVELEEAEEATEEVAELEKSEEVTEEVVELEETKE 555
+L E E + V+ ++ EE+A + +E E
Sbjct: 979 DLYNPEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIA---RVDEAPVPPPAPATPSE 1035

Query: 556 ATEEVAELEEAEEAAEEVVELEKSE------EATEEVVELEEAEEATEEVVEL-EEAEEA 608
TE VAE + E E E + +E E +E +A T EV + E +E
Sbjct: 1036 TTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKET 1095

Query: 609 TEEVAELEGTKEEEPTSQETVIEETMNTDLVENTPVAEQPVISQQETITFKEESEVFVPV 668
+ T E+E ++ VE E P ++ Q + +E+SE P
Sbjct: 1096 QTTETKETATVEKEEKAK------------VETEKTQEVPKVTSQVSPK-QEQSETVQP- 1141

Query: 669 SETDEQTKKDVQNFANVLVEEAEEKKQVAEEQPALQIEEPKREKKRHVPFNVVMLKQDRK 728
Q + +N V ++E + + + E+P +E +V V +
Sbjct: 1142 -----QAEPARENDPTVNIKEPQSQTNTTADT-----EQPAKETSSNVEQPVT--ESTTV 1189

Query: 729 KLMERHAARTNVMQSTVSERVEEKPMQQVVVEPQAEEKPMQQVVVEPQAEEKPMQQVV 786
++ VEP + V
Sbjct: 1190 NTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTV 1247


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4521  IGASERPTASE   40     1e-05    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 40.0 bits (93), Expect = 1e-05
Identities = 17/86 (19%), Positives = 36/86 (41%), Gaps = 5/86 (5%)

Query: 23 EEPKKETTSSIQEKNKDNKEDAPVEKQQEEQEKKEQPQAIQTNEQVEHKQEEVPAEEKKE 82
E K+ + ++++ +D E ++ ++ K QTNE + E + +
Sbjct: 1041 AENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTET 1100

Query: 83 ETTPLQPTEQPLQNNEQKVESNEKQE 108
+ T E+ + KVE+ + QE
Sbjct: 1101 KETATVEKEE-----KAKVETEKTQE 1121



Score = 31.2 bits (70), Expect = 0.007
Identities = 22/88 (25%), Positives = 32/88 (36%), Gaps = 4/88 (4%)

Query: 22 QEEPKKETTSSIQEKNKDNKEDAPVEKQQEEQEKKEQPQAIQTNEQVEHKQEEVPAEEKK 81
+EE K T QE K + +P ++Q E QPQA E + P +
Sbjct: 1108 KEEKAKVETEKTQEVPKVTSQVSPKQEQSET----VQPQAEPARENDPTVNIKEPQSQTN 1163

Query: 82 EETTPLQPTEQPLQNNEQKVESNEKQEK 109
QP ++ N EQ V +
Sbjct: 1164 TTADTEQPAKETSSNVEQPVTESTTVNT 1191



Score = 30.4 bits (68), Expect = 0.012
Identities = 19/92 (20%), Positives = 36/92 (39%), Gaps = 5/92 (5%)

Query: 23 EEPKKETTSSIQEKNKDNKEDAPVEKQQEE-----QEKKEQPQAIQTNEQVEHKQEEVPA 77
+E K ++ Q E Q E +KE+ ++T + E +
Sbjct: 1070 KEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQV 1129

Query: 78 EEKKEETTPLQPTEQPLQNNEQKVESNEKQEK 109
K+E++ +QP +P + N+ V E Q +
Sbjct: 1130 SPKQEQSETVQPQAEPARENDPTVNIKEPQSQ 1161



Score = 28.5 bits (63), Expect = 0.043
Identities = 19/110 (17%), Positives = 35/110 (31%), Gaps = 4/110 (3%)

Query: 21 SQEEPKKETTSSIQEKN-KDNKEDAPVEKQQEEQEKKEQPQAIQTNEQVEHKQEEVPAEE 79
++ + + T +I+E + N + +E EQP T + E P
Sbjct: 1143 AEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENT 1202

Query: 80 KKEETTPLQPTE---QPLQNNEQKVESNEKQEKFLVVIDPGHQQKANLNL 126
T P +E +P + + V S + A +L
Sbjct: 1203 TPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDL 1252


61  BcerKBAB4_4536  BcerKBAB4_4568  Y        Y        N        Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_4536  0       16       3.067390   DeoR family transcriptional regulator
BcerKBAB4_4537  0       16       2.891741   pseudouridine synthase
BcerKBAB4_4538  1       18       3.329302   polysaccharide biosynthesis protein
BcerKBAB4_4539  -1      20       2.938138   hypothetical protein
BcerKBAB4_4540  -1      19       1.961270   EmrB/QacA family drug resistance transporter
BcerKBAB4_4541  2       17       -0.020479  ArsR family transcriptional regulator
BcerKBAB4_4542  0       17       0.927876   activator of Hsp90 ATPase 1 family protein
BcerKBAB4_4543  2       19       1.151615   hypothetical protein
BcerKBAB4_4544  0       17       0.758365   hypothetical protein
BcerKBAB4_4545  0       18       1.261849   hypothetical protein
BcerKBAB4_4546  -1      19       1.963737   CarD family transcriptional regulator
BcerKBAB4_4547  -2      18       2.429674   glucose-1-dehydrogenase
BcerKBAB4_4548  -2      16       3.083145   DMT superfamily L-rhamnose-proton symporter
BcerKBAB4_4549  0       15       3.106958   hypothetical protein
BcerKBAB4_4550  -1      12       3.611446   molybdopterin converting factor subunit 1
BcerKBAB4_4551  -2      13       3.384536   molybdopterin biosynthesis MoaE protein
BcerKBAB4_4552  3       16       8.531717   molybdopterin-guanine dinucleotide biosynthesis
BcerKBAB4_4553  3       17       8.859419   molybdenum cofactor synthesis domain-containing
BcerKBAB4_4554  5       19       9.455292   molybdenum cofactor biosynthesis protein MoaC
BcerKBAB4_4555  5       22       9.508466   thiamine/molybdopterin biosynthesis MoeB-like
BcerKBAB4_4556  6       21       10.252173  molybdenum cofactor biosynthesis protein A
BcerKBAB4_4557  6       21       9.940853   triple helix repeat-containing collagen
BcerKBAB4_4558  -2      11       0.785666   hypothetical protein
BcerKBAB4_4559  -1      12       -0.287206  hypothetical protein
BcerKBAB4_4560  -1      12       -1.180007  rhodanese domain-containing protein
BcerKBAB4_4561  -1      13       -1.394636  hypothetical protein
BcerKBAB4_4562  -2      14       -0.735574  homoserine O-acetyltransferase
BcerKBAB4_4563  2       19       1.782625   GerA spore germination protein
BcerKBAB4_4564  4       20       1.781564   spore germination protein
BcerKBAB4_4565  2       24       3.843820   spore germination B3 GerAC family protein
BcerKBAB4_4566  1       22       5.215982   hypothetical protein
BcerKBAB4_4567  -1      19       4.849782   hypothetical protein
BcerKBAB4_4568  -1      17       4.491637   VrrB protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4540  TCRTETB       122    2e-32    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 122 bits (308), Expect = 2e-32
Identities = 95/416 (22%), Positives = 174/416 (41%), Gaps = 28/416 (6%)

Query: 4 KVMMSLMLMTFLSAVEGTIVSTAIPRITSDLSGVE-LVSWVYAIYMLATAVSTPIYGKLA 62
++++ L +++F S + +++ ++P I +D + +WV +ML ++ T +YGKL+
Sbjct: 14 QILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLS 73

Query: 63 DLFGRKKVLLIGATIFLIGSALCGVVTSM-EQLIFFRALQGIGAGAVMPITMTIIGDLYS 121
D G K++LL G I GS + V S LI R +QG GA A + M ++
Sbjct: 74 DQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIP 133

Query: 122 EAKDRAKAQGWMSAVWGVSGVIGPLVGGFLVDSLSWRYIFFLNVPFGIIACLMFATSYKE 181
+ R KA G + ++ + +GP +GG + + W Y+ +P I + F +
Sbjct: 134 KEN-RGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLL--LIPMITIITVPFLMKLLK 190

Query: 182 SVKSAAKQHIDYLGATVFSLSTIALLYALLTGSSKQNWGDISIIGLLIFAAVSFIIFLYI 241
K H D G + S+ + + + S I LI + +SF+IF+
Sbjct: 191 KEVR-IKGHFDIKGIILMSVGIVFFMLFTTSYS----------ISFLIVSVLSFLIFVKH 239

Query: 242 EKKSPEPLIPLALFSNRTLSTINILTLIAGAMIISITV-----YLPIWSQGVLGKNATEA 296
+K +P + L N + + II TV +P + V + E
Sbjct: 240 IRKVTDPFVDPGLGKNIP-----FMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEI 294

Query: 297 GLILM-PIPVMWTFGAIFSGNLVGKLQTKQIILLGASILSVATFLLFTLSTDSPSFLIYV 355
G +++ P + G LV + ++ +G + LSV+ L + F+ +
Sbjct: 295 GSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTII 354

Query: 356 AVGLFGLGMGLVTPIYMVTIQAAVPANTRGTAVGLNTFINTFSQTLGAAIFGTIFN 411
V + G T I + + G + L F + S+ G AI G + +
Sbjct: 355 IVFVLGGLSFTKTVISTIVSSSLKQQEA-GAGMSLLNFTSFLSEGTGIAIVGGLLS 409


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4547  DHBDHDRGNASE  122    1e-35    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 122 bits (306), Expect = 1e-35
Identities = 76/257 (29%), Positives = 121/257 (47%), Gaps = 12/257 (4%)

Query: 5 LEGKVVVITGSATGLGRAMGVRFAKEKAKVV-INYRSRESEANDVLEEIKKVGGEAIAVK 63
+EGK+ ITG+A G+G A+ A + A + ++Y + E V+ +K A A
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEK--VVSSLKAEARHAEAFP 63

Query: 64 GDVTVEADVVNLIQSAVKEFGTLDVMINNAGIENAVPSHEMPLEDWNKVIHTNLTGAFLG 123
DV A + + +E G +D+++N AG+ H + E+W N TG F
Sbjct: 64 ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNA 123

Query: 124 SREAIKYFVEHDIKGSVINMSSVHEKIPWPLFVHYAASKGGIKLMTETLALEYAPKGIRV 183
SR KY ++ GS++ + S +P YA+SK + T+ L LE A IR
Sbjct: 124 SRSVSKYMMDRR-SGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 184 NNIGPGAINTPINAEKFADPKKRDDV--------ESMIPMGYIGKPEEIAAVATWLASAE 235
N + PG+ T + +AD + V ++ IP+ + KP +IA +L S +
Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242

Query: 236 ASYVTGITLFADGGMTL 252
A ++T L DGG TL
Sbjct: 243 AGHITMHNLCVDGGATL 259


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4557  cloacin       33     0.007    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 32.8 bits (74), Expect = 0.007
Identities = 29/92 (31%), Positives = 35/92 (38%), Gaps = 7/92 (7%)

Query: 169 IQGPQGPSGNTGATGVTGQGISGPTGITGPTGITGPSGGPPGPTGATGATGPGGGPSGST 228
+ G G NTGA +G GPTG+ G GG +G + P GG SGS
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGL-------GVGGGASDGSGWSSENNPWGGGSGSG 53

Query: 229 GATGATGNTGVTGSAGVTGNTGSTGSTGETGA 260
G G G G +G TG A
Sbjct: 54 IHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVA 85


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4559  PF07675       25     0.023    Cleaved Adhesin
		>PF07675#Cleaved Adhesin

Length = 1358

Score = 25.4 bits (55), Expect = 0.023
Identities = 12/41 (29%), Positives = 18/41 (43%)

Query: 33 IYAGAGGSSAAVFLNGKRQPEAVIRTSVFLPPLATSTRTLG 73
+YA + G+ A+ F N + +T V P TR G
Sbjct: 1175 VYASSTGNDASNFANALLEEVLTAKTVVTAPEAIRGTRAQG 1215


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4563  IGASERPTASE   55     5e-10    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 55.5 bits (133), Expect = 5e-10
Identities = 28/134 (20%), Positives = 66/134 (49%), Gaps = 5/134 (3%)

Query: 18 KETDDTKQQSNDQKGNNQKQTRSMKHNQDKNSQQKNVEQTEGSSQNKQQQSTQEDSSQSK 77
K + +Q + + N++ + K N N+Q V Q+ ++ Q T+E ++ K
Sbjct: 1049 KTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEK 1108

Query: 78 QQQSTQEDSSQNKQQQSTQEDSSQDKQQQSTQEDSSQDKQQPAAQKNPSQDKQQPAAQKN 137
++++ E + + T + S + +Q ++ Q + +PA + +P+ + ++P +Q N
Sbjct: 1109 EEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQA-----EPARENDPTVNIKEPQSQTN 1163

Query: 138 PSQDKQQPAAQKNP 151
+ D +QPA + +
Sbjct: 1164 TTADTEQPAKETSS 1177



Score = 39.3 bits (91), Expect = 6e-05
Identities = 27/172 (15%), Positives = 68/172 (39%), Gaps = 25/172 (14%)

Query: 4 NWLRKKKKSNTADRKETDDTKQQSNDQKGNNQKQTRSMKHNQDKNSQQKNVEQTEGSSQN 63
N T + + D SN N++ R + + E TE ++N
Sbjct: 989 NQTVDTTNITTPNNIQADVPSVPSN-----NEEIARVDEAPVPPPAPATPSETTETVAEN 1043

Query: 64 KQQQSTQEDSSQSKQQQSTQEDSSQNKQQQSTQEDSSQ----DKQQQSTQEDSSQDKQQP 119
+Q+S + ++ ++T ++ K+ +S + ++Q + T+E + + ++
Sbjct: 1044 SKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKET 1103

Query: 120 A---------AQKNPSQDKQQPAAQKNPSQDKQ-------QPAAQKNPSQSA 155
A + +Q+ + +Q +P Q++ +PA + +P+ +
Sbjct: 1104 ATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNI 1155


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4565  PF06291       28     0.027    Lambda prophage Bor protein
		>PF06291#Lambda prophage Bor protein

Length = 102

Score = 27.7 bits (61), Expect = 0.027
Identities = 28/108 (25%), Positives = 43/108 (39%), Gaps = 16/108 (14%)

Query: 1 MEKIKRKLMLLSCISVLSLTGCLQKNIIDDVNLIQGTVFDTAKDNKVKVTFVCPI-QKKG 59
M+ K K ML S + +TGC Q+ + T K+ FV I QKK
Sbjct: 1 MQDNKMKKMLFSAALAMLITGCAQQTF----TVGNKPTAVTPKETITHHFFVSGIGQKKT 56

Query: 60 -NKVQVFEGVGNAVKQVKADTSLESSQPFASGQM---RFALFTTRIAK 103
+ ++ G N VK E+ Q F +G + ++T A+
Sbjct: 57 VDAAKICGGAENVVK-------TETQQTFVNGLLGFITLGIYTPLEAR 97


62  BcerKBAB4_4590  BcerKBAB4_4598  Y        N        N        Genomic Island
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_4590  2       25       3.130766   molybdopterin-guanine dinucleotide biosynthesis
BcerKBAB4_4591  2       27       3.154154   molybdenum cofactor synthesis domain-containing
BcerKBAB4_4592  2       30       2.650746   hypothetical protein
BcerKBAB4_4593  2       25       1.971779   hypothetical protein
BcerKBAB4_4594  2       24       1.978251   S-adenosylmethionine synthetase
BcerKBAB4_4595  -1      19       0.470125   phosphoenolpyruvate carboxykinase
BcerKBAB4_4596  7       25       1.901270   ATP synthase I
BcerKBAB4_4597  4       20       -0.323879  hypothetical protein
BcerKBAB4_4598  2       17       -1.221589  hypothetical protein
63  BcerKBAB4_4653  BcerKBAB4_4663  Y        Y        N        Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_4653  2       12       -2.687778  hypothetical protein
BcerKBAB4_4654  1       11       -3.385809  hypothetical protein
BcerKBAB4_4655  -1      11       -2.382686  hypothetical protein
BcerKBAB4_4657  -1      11       -2.699024  hypothetical protein
BcerKBAB4_4658  0       11       -3.197230  hypothetical protein
BcerKBAB4_4659  0       13       -2.894329  ABC transporter
BcerKBAB4_4660  1       15       -3.610991  putative lipoprotein
BcerKBAB4_4661  1       17       -2.912446  polysaccharide deacetylase
BcerKBAB4_4662  1       18       -3.122444  hypothetical protein
BcerKBAB4_4663  2       19       -2.977673  rod shape-determining protein MreC
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4660  RTXTOXIND     36     8e-05    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 36.0 bits (83), Expect = 8e-05
Identities = 25/157 (15%), Positives = 56/157 (35%), Gaps = 20/157 (12%)

Query: 66 NDGKDSNQAIMPKLEQATKSIDEREKVWNKEK----EAFGKAQEEVKSVHKTIDKMEDAA 121
D ++ + T I E+ W +K K + E +V I++ E+ +
Sbjct: 171 PDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLS 230

Query: 122 --LQKQAKNIQDIYKKRYASFSKINDSYQKLMKSERELYKSLGEKETNLKKVSEKIKGVN 179
+ + + + K+ + + + K +++ EL + +
Sbjct: 231 RVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQ--------------LE 276

Query: 180 QMNEDIQREKEKFNRYTQEYNKEKLDFYKQAKIKMKE 216
Q+ +I KE++ TQ + E LD +Q +
Sbjct: 277 QIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGL 313


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4663  LIPPROTEIN48  32     0.003    Mycoplasma P48 major surface lipoprotein signature.
		>LIPPROTEIN48#Mycoplasma P48 major surface lipoprotein signature.

Length = 428

Score = 31.5 bits (71), Expect = 0.003
Identities = 26/76 (34%), Positives = 38/76 (50%), Gaps = 4/76 (5%)

Query: 8 KKILLFLSIILLALLMVYVCTNNKHVQNIVHNIEDIYKVYKENQVLKEKIEHQESLKSKV 67
KKILL LS I L V V N NI +DI K N K+ +++ E LK K
Sbjct: 5 KKILLGLSPIAAILPAVAVSCGNNDESNISFKEKDISKYTTTNANGKQVVKNAELLKLKP 64

Query: 68 QMLSEE----KENFNK 79
++++E ++FN+
Sbjct: 65 VLITDEGKIDDKSFNQ 80


64  BcerKBAB4_4682  BcerKBAB4_4697  Y        Y        N        Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_4682  2       11       0.804562   hypothetical protein
BcerKBAB4_4683  1       14       3.136249   hypothetical protein
BcerKBAB4_4684  0       13       2.891078   gluconate transporter
BcerKBAB4_4685  0       13       2.807936   GntR family transcriptional regulator
BcerKBAB4_4686  1       15       2.248027   ribokinase-like domain-containing protein
BcerKBAB4_4687  0       15       1.537134   hypothetical protein
BcerKBAB4_4688  1       17       1.902494   pyridoxal phosphate-dependent enzyme
BcerKBAB4_4689  0       18       2.392668   dihydroorotase
BcerKBAB4_4690  -1      13       2.832949   serine-type D-Ala-D-Ala carboxypeptidase
BcerKBAB4_4691  0       14       3.645742   histidine kinase
BcerKBAB4_4692  0       14       4.585003   two component transcriptional regulator
BcerKBAB4_4693  0       13       4.404167   mandelate racemase/muconate lactonizing protein
BcerKBAB4_4694  -1      13       4.209133   O-succinylbenzoic acid--CoA ligase
BcerKBAB4_4695  0       15       3.666561   naphthoate synthase
BcerKBAB4_4696  1       14       3.151060   alpha/beta hydrolase
BcerKBAB4_4697  0       16       3.284728   2-succinyl-5-enolpyruvyl-6-hydroxy-3-
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4683  cloacin       25     0.015    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 25.4 bits (55), Expect = 0.015
Identities = 12/26 (46%), Positives = 13/26 (50%)

Query: 36 GASCFGGGGGGCGYGGYGGYGGGYGG 61
G+ GGG G G GG G GG G
Sbjct: 51 GSGIHWGGGSGHGNGGGNGNSGGGSG 76


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4689  UREASE        32     0.004    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 32.0 bits (73), Expect = 0.004
Identities = 21/85 (24%), Positives = 38/85 (44%), Gaps = 17/85 (20%)

Query: 19 DIVIENNKIAQVTKAG-----------AGEGGKVLDYSGTYVSSGWIDLHVHAFPEFDPY 67
DI +++ +IA + KAG G G +V+ G V++G +D H+H
Sbjct: 87 DIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAGEGKIVTAGGMDSHIHFI------ 140

Query: 68 GDEVDEIGVKQGVTTIVDAGSCGAD 92
+ E + G+T ++ G+ A
Sbjct: 141 CPQQIEEALMSGLTCMLGGGTGPAH 165


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4691  TYPE3OMBPROT  32     0.003    Type III secretion system outer membrane B protein ...
		>TYPE3OMBPROT#Type III secretion system outer membrane B protein

family signature.
Length = 538

Score = 32.4 bits (73), Expect = 0.003
Identities = 43/204 (21%), Positives = 80/204 (39%), Gaps = 31/204 (15%)

Query: 94 IHHLANGDFSNQVRVSSNDEFGYIAREINVASEKLKEAVERGDFAESSKDQLIVNL---- 149
H+AN S V + F I + A K + ER A + ++L+
Sbjct: 200 SDHIANMWLSKVVDDEGKEIFSGIRHGVISAYGLKKNSSERAVAARNKAEELVSAALYSR 259

Query: 150 -----------AHDLRTPLTSVLGYLDLILKDENLTKEQIKHFSTIAFTKSERLEILIDE 198
DL+ TS+L L +E++ K+Q+ + + E ++LI
Sbjct: 260 PELLSQALSGKTVDLKIVSTSLLTPTSLTGGEESMLKDQVNALKGLNSKRGEPTKLLI-- 317

Query: 199 LFEITRMNYGMLQIEKRSIDISELLIQLDEELYPLLEKKGLGARLNMDSYLPINGDGKLL 258
R + G+L+ ++ + ++E L K GLG R N+D D +
Sbjct: 318 -----RNSDGLLKEVSVNLKVVTFNFGVNE----LALKMGLGWR-NVDKL----NDESIC 363

Query: 259 ARVFENLLTNAIRYGYDGKFVDVN 282
+ + +N L N + G+ + ++ N
Sbjct: 364 SLLGDNFLKNGVIGGWAAEAIEKN 387


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4692  HTHFIS        99     2e-26    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 99 bits (249), Expect = 2e-26
Identities = 34/121 (28%), Positives = 61/121 (50%), Gaps = 1/121 (0%)

Query: 1 MKRISILIADDEAEIADLIEIHLEKEGYHVVKAADGEEAMHIIQTQPIDLVVLDIMMPKM 60
M +IL+ADD+A I ++ L + GY V ++ I DLVV D++MP
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 DGYEVTRQIR-GKHHMPIIFLSAKTSDFDKVTGLVLGADDYMTKPFTPIELVARVNAQLR 119
+ +++ +I+ + +P++ +SA+ + + GA DY+ KPF EL+ + L
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 120 R 120

Sbjct: 121 E 121


65  BcerKBAB4_4798  BcerKBAB4_4805  Y        N        N        Genomic Island
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_4798  2       19       2.311770   sporulation protein YunB
BcerKBAB4_4799  1       23       3.366008   hypothetical protein
BcerKBAB4_4800  1       26       3.308429   5'-nucleotidase
BcerKBAB4_4801  2       31       2.794643   FeS assembly protein SufB
BcerKBAB4_4802  3       26       1.591634   NifU family SUF system FeS assembly protein
BcerKBAB4_4803  3       24       1.659107   SufS subfamily cysteine desulfurase
BcerKBAB4_4804  3       21       1.634202   FeS assembly protein SufD
BcerKBAB4_4805  2       16       0.302938   FeS assembly ATPase SufC
66  BcerKBAB4_4814  BcerKBAB4_4835  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_4814  2       18       -3.878861  glycine cleavage system protein H
BcerKBAB4_4815  4       23       -7.446003  arsenate reductase-like protein
BcerKBAB4_4816  8       32       -5.117235  hypothetical protein
BcerKBAB4_4817  8       30       -4.753695  group-specific protein
BcerKBAB4_4818  4       24       -3.690928  putative lipoprotein
BcerKBAB4_4819  3       19       -0.941483  hypothetical protein
BcerKBAB4_4820  1       15       -0.068781  group-specific protein
BcerKBAB4_4821  1       14       0.628527   hypothetical protein
BcerKBAB4_4823  0       15       0.631069   putative lipoprotein
BcerKBAB4_4824  1       15       0.116831   PAP2 family protein
BcerKBAB4_4825  2       13       0.783033   hypothetical protein
BcerKBAB4_4826  0       15       0.007657   L-lactate dehydrogenase
BcerKBAB4_4827  0       17       0.863161   coat F domain-containing protein
BcerKBAB4_4828  -1      12       1.732867   hypothetical protein
BcerKBAB4_4829  -1      15       2.970928   abortive infection protein
BcerKBAB4_4830  1       20       3.819409   permease
BcerKBAB4_4831  2       22       3.827366   TetR family transcriptional regulator
BcerKBAB4_4832  1       21       3.691518   beta-lactamase domain-containing protein
BcerKBAB4_4833  3       24       2.996848   acyl-CoA dehydrogenase domain-containing
BcerKBAB4_4834  2       20       3.125500   acetyl-CoA acetyltransferase
BcerKBAB4_4835  2       18       1.975433   3-hydroxyacyl-CoA dehydrogenase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4831  TETREPRESSOR  41     1e-06    Tetracycline repressor protein signature.
		>TETREPRESSOR#Tetracycline repressor protein signature.

Length = 218

Score = 40.7 bits (95), Expect = 1e-06
Identities = 17/43 (39%), Positives = 26/43 (60%)

Query: 7 LTLQKIVETAAEIADINGIQEVTLASLAQTLGVRSPSLYNHVK 49
L + +++ A E+ + GI +T LAQ LG+ P+LY HVK
Sbjct: 4 LNRESVIDAALELLNETGIDGLTTRKLAQKLGIEQPTLYWHVK 46


67  BcerKBAB4_4891  BcerKBAB4_4906  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_4891  2       13       0.650475   hypothetical protein
BcerKBAB4_4892  3       20       2.781710   hypothetical protein
BcerKBAB4_4893  1       19       2.743462   2,5-didehydrogluconate reductase
BcerKBAB4_4894  2       17       2.756349   major facilitator transporter
BcerKBAB4_4895  1       19       2.323947   HxlR family transcriptional regulator
BcerKBAB4_4896  2       16       1.605401   hypothetical protein
BcerKBAB4_4897  0       13       1.255524   FAD-dependent pyridine nucleotide-disulfide
BcerKBAB4_4898  -2      15       0.412077   tyrosyl-tRNA synthetase
BcerKBAB4_4899  -2      14       0.065491   UDP-N-acetylenolpyruvoylglucosamine reductase
BcerKBAB4_4900  -1      17       0.058473   nuclear protein SET
BcerKBAB4_4901  -2      17       0.039335   methyl-accepting chemotaxis sensory transducer
BcerKBAB4_4902  0       19       0.177589   endonuclease/exonuclease/phosphatase
BcerKBAB4_4903  4       23       -1.316919  camphor resistance protein CrcB
BcerKBAB4_4904  3       23       -2.201311  camphor resistance protein CrcB
BcerKBAB4_4905  2       25       -2.729673  hypothetical protein
BcerKBAB4_4906  1       18       -3.297360  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4893  HELNAPAPROT   31     0.003    Helicobacter neutrophil-activating protein A family ...
		>HELNAPAPROT#Helicobacter neutrophil-activating protein A family signature.

Length = 153

Score = 31.0 bits (70), Expect = 0.003
Identities = 22/85 (25%), Positives = 29/85 (34%), Gaps = 16/85 (18%)

Query: 92 TTLAAYEESLKKLELDYLDLYL----VHWPVEGK-YKDSWRALETLYKE---------ER 137
T E SL ++ LY HW V+G + E LY ER
Sbjct: 8 TNQTLVENSLNTQLSNWFLLYSKLHRFHWYVKGPHFFTLHEKFEELYDHAAETVDTIAER 67

Query: 138 VRAIGVSNFQIHHLKDVLEGAEIKP 162
+ AIG +K+ E A I
Sbjct: 68 LLAIGGQPVAT--VKEYTEHASITD 90


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4894  TCRTETA       66     5e-14    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 66.0 bits (161), Expect = 5e-14
Identities = 72/336 (21%), Positives = 126/336 (37%), Gaps = 20/336 (5%)

Query: 11 VQTNRRSMFALLALAISAFGIGTTEFISVGLLPSISEDLHVSVTTA---GLTVSLYALGV 67
++ NR + L +A+ A GIG + + +LP + DL S G+ ++LYAL
Sbjct: 1 MKPNRPLIVILSTVALDAVGIG----LIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQ 56

Query: 68 AFGAPVLTSLTASMSRKTLLMWIMIVFIIGNGIAAVATSFTVLLIARVVSAFAHGVFMSI 127
APVL +L+ R+ +L+ + + I A A VL I R+V+
Sbjct: 57 FACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVA 116

Query: 128 GSTIAAALVPENKRASAIAFMFTGLTVATITGVPIGTFIGQQFGWRASFMVIVVIGIIAL 187
G+ I A + ++RA FM + G +G +G F A F + +
Sbjct: 117 GAYI-ADITDGDERARHFGFMSACFGFGMVAGPVLGGLMG-GFSPHAPFFAAAALNGLNF 174

Query: 188 VANSMLIPSNLKKGTRVSFRDQFKLITNGRLLLVFVITALGYGGTF-------VTFTYLS 240
+ L+P + K R R+ + + R + A F V
Sbjct: 175 LTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWV 234

Query: 241 PLLQEVTGFKSSAVTIILLVYGIAIAIGN-MVGGKLSNH-NPIRALFYMFFIQAIVLFVL 298
++ + ++ + I L +GI ++ M+ G ++ RAL +L
Sbjct: 235 IFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILL 294

Query: 299 TFTAPFQVAGLITIIFMGLFTFMNVPGLQVYVVILA 334
F +A I ++ M P LQ +
Sbjct: 295 AFATRGWMAFPIMVLLASGGIGM--PALQAMLSRQV 328


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4896  PF07299       27     0.024    Fibronectin-binding protein (FBP)
		>PF07299#Fibronectin-binding protein (FBP)

Length = 219

Score = 27.1 bits (60), Expect = 0.024
Identities = 15/82 (18%), Positives = 33/82 (40%), Gaps = 3/82 (3%)

Query: 53 VNILTKSYDFAQTVATDEVLKSDTVSAITELVEPVKDTVKSMAATAIEAKDRADESNEVI 112
IL + A + LKS + I + E + D K + T + ++R D + ++
Sbjct: 23 AYILANGHATANDRGVIQALKSLAIEKIIHVFENLTDEQKELIDTVLTVQNREDAESFLL 82

Query: 113 GLFGLL---KLLKDPQAQKMFR 131
+ + + + +K+F
Sbjct: 83 KINPYVIPFQEVTAQTLKKLFP 104


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4898  TACYTOLYSIN   30     0.021    Bacterial thiol-activated pore-forming cytolysin sig...
		>TACYTOLYSIN#Bacterial thiol-activated pore-forming cytolysin signature.

Length = 574

Score = 29.9 bits (67), Expect = 0.021
Identities = 23/92 (25%), Positives = 36/92 (39%), Gaps = 18/92 (19%)

Query: 333 DEIEQGFKEMPTFQSSKETKNIVEWLVDLGIEPSRRQAREDINNGAISMN---------- 382
D I+ KEMP + KE K + + S E+IN+ S+N
Sbjct: 77 DMIKLAPKEMPLESAEKEEKKSED------NKKSEEDHTEEINDKIYSLNYNELEVLAKN 130

Query: 383 GEKVTDVSRDVTVENSFDGRFIIIRKGKKNYS 414
GE + + +FI+I + KKN +
Sbjct: 131 GETIENFV--PKEGVKKADKFIVIERKKKNIN 160


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4899  CHANLCOLICIN  29     0.040    Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 28.5 bits (63), Expect = 0.040
Identities = 23/99 (23%), Positives = 39/99 (39%), Gaps = 6/99 (6%)

Query: 18 HVKQDEMLKNHTHIKVGGKADVFVSPTNYDEIQEVIKYANQYNIPVTFLGNGSNVIIKDG 77
VK D+ K+ K VS YD + +++K + + FL D
Sbjct: 418 SVKYDDWAKHLDQFAKYLKITGHVS-FGYDVVSDILKIKDTGDWKPLFLTLEKKAA--DA 474

Query: 78 GLRGITVSLIHITNVT---VTGTAIVAGCGAAIIDVSRI 113
G+ + L + T + G AIV G + ID +++
Sbjct: 475 GVSYVVALLFSLLAGTTLGIWGIAIVTGILCSYIDKNKL 513


68  BcerKBAB4_4923  BcerKBAB4_4941  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_4923  -1      33       3.507629   preprotein translocase subunit SecG
BcerKBAB4_4924  0       35       4.154200   LrgB family protein
BcerKBAB4_4925  0       41       4.513670   holin-like protein
BcerKBAB4_4926  2       46       4.641718   inosine/uridine-preferring nucleoside hydrolase
BcerKBAB4_4927  4       44       5.071223   phosphopyruvate hydratase
BcerKBAB4_4928  4       36       4.296466   phosphoglyceromutase
BcerKBAB4_4929  3       26       3.467199   triosephosphate isomerase
BcerKBAB4_4930  2       22       2.732483   phosphoglycerate kinase
BcerKBAB4_4931  2       19       2.276572   glyceraldehyde-3-phosphate dehydrogenase, type
BcerKBAB4_4932  2       19       1.804655   DeoR family transcriptional regulator
BcerKBAB4_4933  2       22       -0.050679  glutaredoxin
BcerKBAB4_4934  1       20       1.299038   RNA polymerase factor sigma-54
BcerKBAB4_4935  -1      25       4.043583   hypothetical protein
BcerKBAB4_4936  -1      26       5.180178   *hypothetical protein
BcerKBAB4_4937  0       28       5.126582   hypothetical protein
BcerKBAB4_4938  0       27       3.915343   putative lipoprotein
BcerKBAB4_4939  -1      29       4.953455   sporulation stage V protein AC
BcerKBAB4_4940  -1      26       4.911744   stage V sporulation protein AD
BcerKBAB4_4941  2       18       1.954639   sporulation stage V protein AE
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4923  SECGEXPORT    40     7e-08    Protein-export SecG membrane protein signature.
		>SECGEXPORT#Protein-export SecG membrane protein signature.

Length = 110

Score = 39.5 bits (92), Expect = 7e-08
Identities = 21/77 (27%), Positives = 44/77 (57%), Gaps = 4/77 (5%)

Query: 1 MHTLLSVLLIIVSILMIVMVLMQSSNSSGLSGAISGGAE-QLFGKQKARGIEAVLNRVTV 59
M+ L V+ +IV+I ++ ++++Q + + + GA LFG + G + R+T
Sbjct: 1 MYEALLVVFLIVAIGLVGLIMLQQGKGADMGASFGAGASATLFG---SSGSGNFMTRMTA 57

Query: 60 VLAVLFFVLTIGVTYLN 76
+LA LFF++++ + +N
Sbjct: 58 LLATLFFIISLVLGNIN 74


69  BcerKBAB4_4953  BcerKBAB4_4985  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_4953  2       22       2.674867   transferase
BcerKBAB4_4954  3       23       2.882040   pyrophosphatase PpaX
BcerKBAB4_4955  2       23       2.885159   prolipoprotein diacylglyceryl transferase
BcerKBAB4_4956  2       23       2.509562   HPr kinase/phosphorylase
BcerKBAB4_4957  1       22       2.040080   membrane protein
BcerKBAB4_4958  1       22       2.320886   excinuclease ABC subunit A
BcerKBAB4_4959  0       21       0.573164   excinuclease ABC subunit B
BcerKBAB4_4960  0       25       -2.178607  hypothetical protein
BcerKBAB4_4961  0       26       -1.707956  integral membrane protein
BcerKBAB4_4962  3       24       1.890349   MerR family transcriptional regulator
BcerKBAB4_4963  5       24       3.845535   hypothetical protein
BcerKBAB4_4964  4       26       3.899010   hypothetical protein
BcerKBAB4_4965  4       28       4.753262   hypothetical protein
BcerKBAB4_4967  3       26       4.164774   XRE family transcriptional regulator
BcerKBAB4_4968  3       26       4.584994   hypothetical protein
BcerKBAB4_4969  2       23       2.701892   LysR family transcriptional regulator
BcerKBAB4_4970  1       18       2.355650   MerR family transcriptional regulator
BcerKBAB4_4971  0       15       2.167467   NAD(P)H dehydrogenase (quinone)
BcerKBAB4_4972  0       13       1.704741   ABC transporter
BcerKBAB4_4973  0       15       1.793341   hypothetical protein
BcerKBAB4_4974  -1      15       1.512244   ABC transporter
BcerKBAB4_4975  0       15       2.103242   hypothetical protein
BcerKBAB4_4976  1       19       2.240663   carboxyl-terminal protease
BcerKBAB4_4977  2       21       2.215942   hypothetical protein
BcerKBAB4_4978  3       21       2.281785   cell division ATP-binding protein FtsE
BcerKBAB4_4979  3       20       2.270561   cytochrome c-551
BcerKBAB4_4980  3       19       2.038256   peptide chain release factor 2
BcerKBAB4_4982  1       13       2.234499   preprotein translocase subunit SecA
BcerKBAB4_4983  2       12       1.257812   hypothetical protein
BcerKBAB4_4984  2       12       1.450464   sigma 54 modulation protein/ribosomal protein
BcerKBAB4_4985  2       12       1.750588   cold-shock DNA-binding domain-containing
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4953  INVEPROTEIN   30     0.003    Salmonella/Shigella invasion protein E (InvE) signat...
		>INVEPROTEIN#Salmonella/Shigella invasion protein E (InvE) signature.

Length = 372

Score = 30.5 bits (68), Expect = 0.003
Identities = 16/73 (21%), Positives = 37/73 (50%)

Query: 24 FWKVMKNFIIIQIARYTPFLSVKNWLYRTFLRMEVGKKTSFALMVMPDIMFPEKITVGDN 83
F ++++ +++ R L V L +F + +++S+ L+++ + P ++
Sbjct: 243 FGQLLRRLTQLKMLRSADLLFVSTLLSYSFTKAFNAEESSWLLLMLSLLQQPHEVDSLLA 302

Query: 84 SIIGYNTTLLAHE 96
IIG N LL+H+
Sbjct: 303 DIIGLNALLLSHK 315


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4970  SYCECHAPRONE  26     0.028    Gram-negative bacterial type III secretion SycE cha...
		>SYCECHAPRONE#Gram-negative bacterial type III secretion SycE chaperone signature.

Length = 130

Score = 26.2 bits (57), Expect = 0.028
Identities = 15/57 (26%), Positives = 30/57 (52%), Gaps = 3/57 (5%)

Query: 50 QMYLQIGLNLEETERMLRCLEIEP---HLYENPCSSILALYEDKLDEVTKQITLLSN 103
Q++ Q+ L++ +T + +++ H+ E+P IL LD ++ TLLS+
Sbjct: 10 QLFQQLSLSIPDTIEPVIGVKVGEFACHITEHPVGQILMFTLPSLDNNDEKETLLSH 66


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4976  BINARYTOXINB  30     0.028    Binary toxin B family signature.
		>BINARYTOXINB#Binary toxin B family signature.

Length = 764

Score = 29.7 bits (66), Expect = 0.028
Identities = 14/44 (31%), Positives = 22/44 (50%), Gaps = 1/44 (2%)

Query: 210 GKDIGYMQITSFAENTAKEFKDQLKELEKKNIKGLVIDVRGNPG 253
GKDI F + T++ K+QL EL NI ++ ++ N
Sbjct: 573 GKDITEFDFN-FDQQTSQNIKNQLAELNATNIYTVLDKIKLNAK 615


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4982  SECA          1156   0.0      SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 1156 bits (2993), Expect = 0.0
Identities = 437/897 (48%), Positives = 592/897 (65%), Gaps = 65/897 (7%)

Query: 1 MIGILKKVF-DVNQRQIKRMQKTVEQIDALESSIKPLTDEQLKGKTIEFKERLTKGETVD 59
+I +L KVF N R ++RM+K V I+A+E ++ L+DE+LKGKT EF+ RL KGE ++
Sbjct: 2 LIKLLTKVFGSRNDRTLRRMRKVVNIINAMEPEMEKLSDEELKGKTAEFRARLEKGEVLE 61

Query: 60 DLLPEAFAVVREAANRVLGMRPYGVQLMGGIALHEGNISEMKTGEGKTLTSTLPVYLNAL 119
+L+PEAFAVVREA+ RV GMR + VQL+GG+ L+E I+EM+TGEGKTLT+TLP YLNAL
Sbjct: 62 NLIPEAFAVVREASKRVFGMRHFDVQLLGGMVLNERCIAEMRTGEGKTLTATLPAYLNAL 121

Query: 120 TGKGVHVVTVNEYLAQRDASEMGQLHEFLGLTVGINLNSMSREEKQAAYAADITYSTNNE 179
TGKGVHVVTVN+YLAQRDA L EFLGLTVGINL M K+ AYAADITY TNNE
Sbjct: 122 TGKGVHVVTVNDYLAQRDAENNRPLFEFLGLTVGINLPGMPAPAKREAYAADITYGTNNE 181

Query: 180 LGFDYLRDNMVLYREQCVQRPLNFAIIDEVDSILVDEARTPLIISGQAQKSAELYMFANA 239
GFDYLRDNM E+ VQR L++A++DEVDSIL+DEARTPLIISG A+ S+E+Y N
Sbjct: 182 YGFDYLRDNMAFSPEERVQRKLHYALVDEVDSILIDEARTPLIISGPAEDSSEMYKRVNK 241

Query: 240 FVRTL-----------ENEKEYSFDVKTKNVMLTEDGITKAEKAFHI-------DNLFDL 281
+ L + E +S D K++ V LTE G+ E+ ++L+
Sbjct: 242 IIPHLIRQEKEDSETFQGEGHFSVDEKSRQVNLTERGLVLIEELLVKEGIMDEGESLYSP 301

Query: 282 KHVALLHHINQALRAHVVMHLDTDYVVQEGEIVIVDQFTGRLMKGRRYSEGLHQAIEAKE 341
++ L+HH+ ALRAH + D DY+V++GE++IVD+ TGR M+GRR+S+GLHQA+EAKE
Sbjct: 302 ANIMLMHHVTAALRAHALFTRDVDYIVKDGEVIIVDEHTGRTMQGRRWSDGLHQAVEAKE 361

Query: 342 GVEIQNESMTLATITFQNYFRMYEKLSGMTGTAKTEEEEFRSIYNMNVIVIPTNKDIIRD 401
GV+IQNE+ TLA+ITFQNYFR+YEKL+GMTGTA TE EF SIY ++ +V+PTN+ +IR
Sbjct: 362 GVQIQNENQTLASITFQNYFRLYEKLAGMTGTADTEAFEFSSIYKLDTVVVPTNRPMIRK 421

Query: 402 DRADLIFKSMEGKFNAVVEDIVNRHKQGQPILVGTVAIETSELISKMLTRKGVRHNILNA 461
D DL++ + K A++EDI R +GQP+LVGT++IE SEL+S LT+ G++HN+LNA
Sbjct: 422 DLPDLVYMTEAEKIQAIIEDIKERTAKGQPVLVGTISIEKSELVSNELTKAGIKHNVLNA 481

Query: 462 KNHAREADIIAEAGMKGAVTIATNMAGRGTDIKLGDDIRNV------------------- 502
K HA EA I+A+AG AVTIATNMAGRGTDI LG +
Sbjct: 482 KFHANEAAIVAQAGYPAAVTIATNMAGRGTDIVLGGSWQAEVAALENPTAEQIEKIKADW 541

Query: 503 -----------GLAVIGTERHESRRIDNQLRGRSGRQGDPGVTQFYLSMEDELMRRFGSD 551
GL +IGTERHESRRIDNQLRGRSGRQGD G ++FYLSMED LMR F SD
Sbjct: 542 QVRHDAVLEAGGLHIIGTERHESRRIDNQLRGRSGRQGDAGSSRFYLSMEDALMRIFASD 601

Query: 552 NMKAMMDRLGMDDSQPIESKMVSRAVESAQKRVEGNNYDARKQLLQYDDVLRQQREVIYK 611
+ MM +LGM + IE V++A+ +AQ++VE N+D RKQLL+YDDV QR IY
Sbjct: 602 RVSGMMRKLGMKPGEAIEHPWVTKAIANAQRKVESRNFDIRKQLLEYDDVANDQRRAIYS 661

Query: 612 QRQEVMDSENLRSIIEGMMKSTIERAV-ALHTQEEIEEDWSIKGLVDYLNTNLLEEGDVK 670
QR E++D ++ I + + + + A + +EE W I GL + L + + +
Sbjct: 662 QRNELLDVSDVSETINSIREDVFKATIDAYIPPQSLEEMWDIPGLQERLKNDFDLDLPIA 721

Query: 671 E--EELRRLAPEEMSESIIAKLLERYNEREKLLPEEQTREFEKVVVFRVVDTKWTDHIDA 728
E ++ L E + E I+A+ +E Y +E+++ E R FEK V+ + +D+ W +H+ A
Sbjct: 722 EWLDKEPELHEETLRERILAQSIEVYQRKEEVVGAEMMRHFEKGVMLQTLDSLWKEHLAA 781

Query: 729 MDHLREGIHLRAYGQIDPLREYQMEGFAMFESMIASIEEEISRYIMKAEI---------- 778
MD+LR+GIHLR Y Q DP +EY+ E F+MF +M+ S++ E+ + K ++
Sbjct: 782 MDYLRQGIHLRGYAQKDPKQEYKRESFSMFAAMLESLKYEVISTLSKVQVRMPEEVEELE 841

Query: 779 -EQNLERQEVVQGEAVHPSSDGEDAKKKPVVKGDQ--MGRNDLCKCGSGKKYKNCCG 832
++ +E + + Q + + D A + + +GRND C CGSGKKYK C G
Sbjct: 842 QQRRMEAERLAQMQQLSHQDDDSAAAAALAAQTGERKVGRNDPCPCGSGKKYKQCHG 898


70  BcerKBAB4_5013  BcerKBAB4_5023  Y  N  N  Genomic Island
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_5013  -2      17       3.541787   hypothetical protein
BcerKBAB4_5014  0       18       4.178693   group-specific protein
BcerKBAB4_5015  -1      18       4.182815   cytochrome bd ubiquinol oxidase subunit I
BcerKBAB4_5016  0       18       3.656426   cytochrome d ubiquinol oxidase, subunit II
BcerKBAB4_5017  1       18       3.563847   arsenical pump membrane protein
BcerKBAB4_5018  3       20       3.712668   thiamine biosynthesis protein ThiC
BcerKBAB4_5019  2       23       1.386103   L-lactate transport
BcerKBAB4_5020  2       26       0.076579   hypothetical protein
BcerKBAB4_5021  2       23       0.156857   hypothetical protein
BcerKBAB4_5022  3       29       1.754320   hypothetical protein
BcerKBAB4_5023  2       31       1.731591   hypothetical protein
71  BcerKBAB4_5052  BcerKBAB4_5067  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_5052  1       31       3.346503   putative lipoprotein
BcerKBAB4_5053  0       27       2.606103   hypothetical protein
BcerKBAB4_5054  -1      24       1.188491   histidine kinase
BcerKBAB4_5055  1       19       -1.083655  two component transcriptional regulator
BcerKBAB4_5056  1       16       -3.817810  UDP-glucose 4-epimerase
BcerKBAB4_5057  4       17       -5.557070  EPSX protein
BcerKBAB4_5058  5       19       -6.463324  membrane-bound transcriptional regulator LytR
BcerKBAB4_5059  7       20       -7.281514  UDP-galactopyranose mutase
BcerKBAB4_5060  7       21       -7.207183  polysaccharide biosynthesis protein
BcerKBAB4_5061  6       18       -6.029279  hypothetical protein
BcerKBAB4_5062  5       17       -5.098096  group 1 glycosyl transferase
BcerKBAB4_5063  4       18       -4.772974  group 1 glycosyl transferase
BcerKBAB4_5064  4       18       -4.457604  hypothetical protein
BcerKBAB4_5065  2       15       -3.741818  NAD-dependent epimerase/dehydratase
BcerKBAB4_5066  1       15       -2.885811  UDP-glucose 6-dehydrogenase
BcerKBAB4_5067  0       14       -3.202710  group 1 glycosyl transferase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_5055  HTHFIS        102    1e-27    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 102 bits (257), Expect = 1e-27
Identities = 36/136 (26%), Positives = 69/136 (50%), Gaps = 1/136 (0%)

Query: 2 KILVVDDESSIRNLIRMQLEMEGYEVLTAADGREALERW-NEQPDVLILDVMLPDTDGYE 60
ILV DD+++IR ++ L GY+V ++ D+++ DV++PD + ++
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 61 LLRLFREKERDIPVLMLTAKSQMNDKLLGLQLGADDYVTKPFNYAELILRVKNMARRVKK 120
LL ++ D+PVL+++A++ + + GA DY+ KPF+ ELI + K+
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 121 KEVPLNHEVIGAGNLL 136
+ L + L+
Sbjct: 125 RPSKLEDDSQDGMPLV 140


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_5056  NUCEPIMERASE  175    2e-54    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 175 bits (445), Expect = 2e-54
Identities = 87/344 (25%), Positives = 152/344 (44%), Gaps = 44/344 (12%)

Query: 3 SILICGGAGYIGSHAVKKLVDEGLSVVVVDNLQTGHEDAI---------TEGAKFYNGDL 53
L+ G AG+IG H K+L++ G VV +DNL ++ ++ G +F+ DL
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 54 RDKEFLRDVFKQENIEAVMHFAADSLVGVSMEKPLQYYNNNVYGALCLLEVMDEFKVEKF 113
D+E + D+F + E V V S+E P Y ++N+ G L +LE K++
Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQHL 121

Query: 114 IFSSTAATYGEVDVDLITEETKTN-PTNTYGETKLAIEKMLHWYSQASNLRYKIFRYFNV 172
+++S+++ YG + + + P + Y TK A E M H YS L R+F V
Sbjct: 122 LYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFFTV 181

Query: 173 AGATPSGIIGEDHRPETHLIPLVLQVALGQREKIMMFGDDYNTPDGTCIRDYIHVEDLVA 232
G P G RP+ L + G+ + YN G RD+ +++D+
Sbjct: 182 YG--PWG------RPDMALFKFTKAMLEGKSIDV------YN--YGKMKRDFTYIDDIAE 225

Query: 233 AHFLGLKDLQNGGESDF----------------YNLGNGNGFSVKEIVDAVREVTKHEIP 276
A + L+D+ ++ + YN+GN + + + + A+ + E
Sbjct: 226 A-IIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAK 284

Query: 277 AEMAPRRAGDPARLVASSQKAKEKLGWNPEYVNVKTIIEHAWDW 320
M P + GD A ++ E +G+ PE VK +++ +W
Sbjct: 285 KNMLPLQPGDVLETSADTKALYEVIGFTPE-TTVKDGVKNFVNW 327


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_5057  TYPE4SSCAGA   29     0.040    Type IV secretion system CagA exotoxin signature.
		>TYPE4SSCAGA#Type IV secretion system CagA exotoxin signature.

Length = 1147

Score = 28.5 bits (63), Expect = 0.040
Identities = 34/108 (31%), Positives = 53/108 (49%), Gaps = 17/108 (15%)

Query: 19 VVSGKLYWNKKVANA--TGQTSEVTKTKAEVKDSGAKKE--EKKEEKKQDAKSSFNEAYA 74
+V L +NK VA+A TG EV K + +++ S K+E EK+ EKK ++KS
Sbjct: 584 LVGKTLNFNKAVADAKNTGNYDEVKKAQKDLEKSLRKREHLEKEVEKKLESKSG-----N 638

Query: 75 KNLPDAVKEKLKKAAQDKKAVNLVIVGDEASSSEKDAWVAKFTANLEA 122
KN K + K A +K ++ EA+ +DA + NL+
Sbjct: 639 KN-----KMEAKAQANSQKDEIFALINKEAN---RDARAIAYAQNLKG 678


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_5065  NUCEPIMERASE  503    0.0      Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 503 bits (1296), Expect = 0.0
Identities = 192/329 (58%), Positives = 242/329 (73%), Gaps = 12/329 (3%)

Query: 11 TYLITGAAGFIGMHLSKKLLEMGCKVIGYDNLNDYYDISLKESRLNILNQYNNFTFHKAD 70
YL+TGAAGFIG H+SK+LLE G +V+G DNLNDYYD+SLK++RL +L F FHK D
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELL-AQPGFQFHKID 60

Query: 71 LTDKEYLEKLFNENNIHIVVNLAAQAGVRYSIENPDAYIQSNVVGFLNILEMCRHHKVEH 130
L D+E + LF + V + VRYS+ENP AY SN+ GFLNILE CRH+K++H
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120

Query: 131 LLYASSSSVYGANKKIPFSTEDKVDNPVSLYAATKKSNELMAHTYSHLYNVPTTGLRFFT 190
LLYASSSSVYG N+K+PFST+D VD+PVSLYAATKK+NELMAHTYSHLY +P TGLRFFT
Sbjct: 121 LLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFFT 180

Query: 191 VYGPYGRPDMAYFSFTKAITEGKPIKVFNEGDMYRDFTYIDDIVDGIIKLLENSPVLNNK 250
VYGP+GRPDMA F FTKA+ EGK I V+N G M RDFTYIDDI + II+L + P + +
Sbjct: 181 VYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIPHADTQ 240

Query: 251 -----------ELPYKVYNIGNNKPVKLLDFIQAIESAVGKEAVKEYYPMQPGDVYQTYA 299
PY+VYNIGN+ PV+L+D+IQA+E A+G EA K P+QPGDV +T A
Sbjct: 241 WTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNMLPLQPGDVLETSA 300

Query: 300 DVSDLINDVGFKPDTPIQEGINKFVDWFK 328
D L +GF P+T +++G+ FV+W++
Sbjct: 301 DTKALYEVIGFTPETTVKDGVKNFVNWYR 329


72  BcerKBAB4_5076  BcerKBAB4_5145  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_5076  2       19       1.522269   (3R)-hydroxymyristoyl-ACP dehydratase
BcerKBAB4_5077  1       17       0.846887   rod shape-determining protein Mbl
BcerKBAB4_5078  2       15       0.523401   sporulation stage III transcriptional regulator
BcerKBAB4_5079  1       13       1.319713   hypothetical protein
BcerKBAB4_5080  0       13       2.153927   peptidase M23B
BcerKBAB4_5081  -1      14       1.636461   ABC-2 type transporter
BcerKBAB4_5082  0       14       2.215578   ABC transporter
BcerKBAB4_5083  -3      16       2.831855   LytTr DNA-binding protein
BcerKBAB4_5084  -1      16       3.731942   sporulation stage II protein D
BcerKBAB4_5085  1       17       4.094436   UDP-N-acetylglucosamine
BcerKBAB4_5086  3       22       3.998824   hypothetical protein
BcerKBAB4_5087  4       23       4.457152   hypothetical protein
BcerKBAB4_5088  3       23       4.279791   NADH dehydrogenase subunit N
BcerKBAB4_5089  4       25       4.806501   NADH dehydrogenase subunit M
BcerKBAB4_5090  3       27       5.428392   NADH dehydrogenase subunit L
BcerKBAB4_5091  5       30       6.802510   NADH dehydrogenase subunit K
BcerKBAB4_5092  3       30       6.970680   NADH dehydrogenase subunit J
BcerKBAB4_5093  3       29       7.077723   NADH dehydrogenase subunit I
BcerKBAB4_5094  1       18       3.808098   NADH dehydrogenase subunit H
BcerKBAB4_5095  1       15       2.679837   NADH dehydrogenase subunit D
BcerKBAB4_5096  1       12       0.948718   NADH dehydrogenase subunit C
BcerKBAB4_5097  -2      9        -1.223787  NADH dehydrogenase subunit B
BcerKBAB4_5098  -2      13       -0.245530  NADH dehydrogenase subunit A
BcerKBAB4_5099  0       13       0.142131   PAS/PAC sensor-containing diguanylate
BcerKBAB4_5100  2       25       3.000297   hypothetical protein
BcerKBAB4_5101  3       28       3.461498   hypothetical protein
BcerKBAB4_5102  4       31       4.004605   F0F1 ATP synthase subunit epsilon
BcerKBAB4_5103  4       32       3.834608   F0F1 ATP synthase subunit beta
BcerKBAB4_5104  3       28       3.000579   F0F1 ATP synthase subunit gamma
BcerKBAB4_5105  1       26       2.946195   F0F1 ATP synthase subunit alpha
BcerKBAB4_5106  0       19       1.548938   F0F1 ATP synthase subunit delta
BcerKBAB4_5107  -2      20       1.987891   F0F1 ATP synthase subunit B
BcerKBAB4_5108  1       24       2.861797   F0F1 ATP synthase subunit C
BcerKBAB4_5109  1       21       2.960344   F0F1 ATP synthase subunit A
BcerKBAB4_5110  2       19       3.316997   ATP synthase I
BcerKBAB4_5111  3       20       3.473287   hypothetical protein
BcerKBAB4_5112  2       19       3.487489   hypothetical protein
BcerKBAB4_5113  2       19       3.329681   uracil phosphoribosyltransferase
BcerKBAB4_5114  2       21       3.297704   serine hydroxymethyltransferase
BcerKBAB4_5115  1       21       3.879376   hypothetical protein
BcerKBAB4_5116  2       22       2.893995   ribose-5-phosphate isomerase B
BcerKBAB4_5117  2       16       2.027824   protein tyrosine phosphatase
BcerKBAB4_5118  0       11       2.923536   PTS system glucose subfamily transporter subunit
BcerKBAB4_5119  0       14       2.509487   DoxX family protein
BcerKBAB4_5120  0       14       2.033959   putative flavin reductase
BcerKBAB4_5121  0       18       1.921243   hypothetical protein
BcerKBAB4_5122  -2      16       2.519643   hypothetical protein
BcerKBAB4_5123  -1      17       2.911997   Sua5/YciO/YrdC/YwlC family protein
BcerKBAB4_5124  1       17       2.353310   hypothetical protein
BcerKBAB4_5125  0       17       3.095767   sporulation stage II protein R
BcerKBAB4_5126  -1      18       4.214826   HemK family modification methylase
BcerKBAB4_5127  -1      16       4.387301   peptide chain release factor 1
BcerKBAB4_5128  0       20       4.790838   thymidine kinase
BcerKBAB4_5129  0       21       4.517977   50S ribosomal protein L31
BcerKBAB4_5130  0       20       4.170194   transcription termination factor Rho
BcerKBAB4_5131  0       25       4.181687   fructose 1,6-bisphosphatase II
BcerKBAB4_5132  0       24       3.457366   UDP-N-acetylglucosamine
BcerKBAB4_5133  2       24       2.398719   fructose-bisphosphate aldolase
BcerKBAB4_5134  -1      15       2.823603   response regulator receiver protein
BcerKBAB4_5135  -1      15       3.781824   hypothetical protein
BcerKBAB4_5136  -1      15       3.824972   CTP synthetase
BcerKBAB4_5137  1       14       5.035655   DNA-directed RNA polymerase subunit delta
BcerKBAB4_5138  0       16       5.222842   TetR family transcriptional regulator
BcerKBAB4_5139  1       13       4.603486   acyl-CoA dehydrogenase domain-containing
BcerKBAB4_5140  0       13       3.253106   acyl-CoA dehydrogenase domain-containing
BcerKBAB4_5141  -1      13       0.863702   3-hydroxybutyryl-CoA dehydrogenase
BcerKBAB4_5142  -1      14       0.618082   acetyl-CoA acetyltransferase
BcerKBAB4_5143  -1      13       -1.359180  hypothetical protein
BcerKBAB4_5144  -2      14       -3.306949  phospholipase D/transphosphatidylase
BcerKBAB4_5145  -3      13       -3.123029  putative UV damage endonuclease
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_5077  SHAPEPROTEIN  476    e-172    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 476 bits (1226), Expect = e-172
Identities = 180/330 (54%), Positives = 244/330 (73%), Gaps = 5/330 (1%)

Query: 1 MFARDIGIDLGTANVLIHVKGKGIVLNEPSVVAIDRNSG----KVLAVGEEARSMVGRTP 56
MF+ D+ IDLGTAN LI+VKG+GIVLNEPSVVAI ++ V AVG +A+ M+GRTP
Sbjct: 8 MFSNDLSIDLGTANTLIYVKGQGIVLNEPSVVAIRQDRAGSPKSVAAVGHDAKQMLGRTP 67

Query: 57 GNIVAIRPLKDGVIADFEITEAMLKYFINKLDVKSFFS-KPRILICCPTNITSVEQKAIR 115
GNI AIRP+KDGVIADF +TE ML++FI ++ SF PR+L+C P T VE++AIR
Sbjct: 68 GNIAAIRPMKDGVIADFFVTEKMLQHFIKQVHSNSFMRPSPRVLVCVPVGATQVERRAIR 127

Query: 116 EAAERSGGKTVFLEEEPKVAAVGAGMEIFQPSGNMVVDIGGGTTDIAVLSMGDIVTSSSI 175
E+A+ +G + VFL EEP AA+GAG+ + + +G+MVVDIGGGTT++AV+S+ +V SSS+
Sbjct: 128 ESAQGAGAREVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLNGVVYSSSV 187

Query: 176 KMAGDKFDMEILNYVKRKYKLLIGERTSENIKIKVGTVFPGARSEELEIRGRDMVTGLPR 235
++ GD+FD I+NYV+R Y LIGE T+E IK ++G+ +PG E+E+RGR++ G+PR
Sbjct: 188 RIGGDRFDEAIINYVRRNYGSLIGEATAERIKHEIGSAYPGDEVREIEVRGRNLAEGVPR 247

Query: 236 TITVCSEEITEALKEDAAIIVQAAKGVLERTPPELSADIIDRGVILTGGGALLHGIDMLL 295
T+ S EI EAL+E IV A LE+ PPEL++DI +RG++LTGGGALL +D LL
Sbjct: 248 GFTLNSNEILEALQEPLTGIVSAVMVALEQCPPELASDISERGMVLTGGGALLRNLDRLL 307

Query: 296 AEELKVPVLIAENPMQCVAIGTGIMLENID 325
EE +PV++AE+P+ CVA G G LE ID
Sbjct: 308 MEETGIPVVVAEDPLTCVARGGGKALEMID 337


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_5096  IGASERPTASE   40     1e-05    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 40.4 bits (94), Expect = 1e-05
Identities = 31/229 (13%), Positives = 65/229 (28%), Gaps = 25/229 (10%)

Query: 16 ARHAKEEARKRL-AAKHGAEMSKLEGEHREKEKALPKDKGINVEEAKAKAAA-----AAK 69
R +EA+ + A E+++ E +E + K+ +E KAK K
Sbjct: 1065 NREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPK 1124

Query: 70 AAALAKQKREGSEEVTDEERAKAKAKAAAAAKAKAAVLAKQ----------KREGSEEVT 119
+ K+E SE V + + K + + VT
Sbjct: 1125 VTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVT 1184

Query: 120 ---------DEEKAKAKAKAAAAAKAKAAALAKQKREGSEEVTDEEKAKAKAKAAAAAKA 170
+ A + + + + + ++
Sbjct: 1185 ESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDR 1244

Query: 171 KAAALAKQKREGSEEVTDEEKAKAKAKAAAAAKAKAAALAKQKREGSEE 219
AL + V + +AKA+ A KA + +++ + +
Sbjct: 1245 STVALCDLTSTNTNAVLSDARAKAQFVALNVGKAVSQHISQLEMNNEGQ 1293



Score = 38.5 bits (89), Expect = 6e-05
Identities = 25/151 (16%), Positives = 52/151 (34%), Gaps = 11/151 (7%)

Query: 178 QKREGSEEVTDEEKAKAKAKAAAAAKAKAAALAKQKREGSEEVTDEEKVKAKAKAAAAAK 237
+KR + + T+ + + +A+ + A +K
Sbjct: 986 EKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSK 1045

Query: 238 AKAAVLAKQKREGLEEVTDEEKVKAKAKAAAAAKAKAAALAKQKASQGEGDSGDEKAKAI 297
++ + K +++ E +V +AK+ A + +A+ + E + + K A
Sbjct: 1046 QESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETAT 1105

Query: 298 AAAKAKAAAAARAKGSVNKIEDELQQEEPSV 328
+ KA K+E E QE P V
Sbjct: 1106 VEKEEKA-----------KVETEKTQEVPKV 1125


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_5107  IGASERPTASE   30     0.003    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 30.4 bits (68), Expect = 0.003
Identities = 19/83 (22%), Positives = 38/83 (45%), Gaps = 5/83 (6%)

Query: 49 TNEIDAAERSNAEAKKLVEEQREMLKQSRVEAQELIERAKKQAVDQKDVIVAAAKEEAES 108
TNE+ +S +E K+ + + E + +E K Q V + V+ +E++E+
Sbjct: 1082 TNEVA---QSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSET 1138

Query: 109 IKTSAVQEIQREKEQAIAALQEQ 131
++ A E RE + + + Q
Sbjct: 1139 VQPQA--EPARENDPTVNIKEPQ 1159



Score = 28.9 bits (64), Expect = 0.012
Identities = 18/103 (17%), Positives = 43/103 (41%), Gaps = 2/103 (1%)

Query: 53 DAAERSNAEAKKLVEEQREMLKQSRVEAQELIERAKKQAVDQKDVIVAAAKEEAESIKTS 112
+ +++ + +K ++ E Q+R A+E ++ +A Q + + + E E+ T
Sbjct: 1042 ENSKQESKTVEKNEQDATETTAQNREVAKE--AKSNVKANTQTNEVAQSGSETKETQTTE 1099

Query: 113 AVQEIQREKEQAIAALQEQVASLSVHIASKVIEKELKEEDQVK 155
+ EKE+ E+ + + ++E E Q +
Sbjct: 1100 TKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQ 1142


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_5125  IGASERPTASE   36     2e-04    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 36.2 bits (83), Expect = 2e-04
Identities = 28/95 (29%), Positives = 35/95 (36%), Gaps = 5/95 (5%)

Query: 178 ESPEEGEVKEQDEEVVDVPEKKEGKVKETKVVKPEKAEKVTAQEKKVVKHETQVEEQPVK 237
S V E ++ EK E ET E A+ + K VK TQ E
Sbjct: 1033 PSETTETVAENSKQESKTVEKNEQDATETTAQNREVAK----EAKSNVKANTQTNEVAQS 1088

Query: 238 KVETKTVEKVEKPVEQKQEKQNEYVTVEEEEKPEV 272
ETK + E EK+ E VE E+ EV
Sbjct: 1089 GSETKETQTTETKETATVEKE-EKAKVETEKTQEV 1122



Score = 32.7 bits (74), Expect = 0.002
Identities = 25/105 (23%), Positives = 42/105 (40%), Gaps = 1/105 (0%)

Query: 167 AVRKEEHVVKAESPEEGEVKEQDEEVVDVPEKKEGKVKETKVVKPEKAEKVTAQEKKVVK 226
++E V+ + E Q+ EV + +T V +E Q K
Sbjct: 1043 NSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTT-ETK 1101

Query: 227 HETQVEEQPVKKVETKTVEKVEKPVEQKQEKQNEYVTVEEEEKPE 271
VE++ KVET+ ++V K Q KQ + TV+ + +P
Sbjct: 1102 ETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPA 1146



Score = 29.3 bits (65), Expect = 0.025
Identities = 21/117 (17%), Positives = 42/117 (35%), Gaps = 5/117 (4%)

Query: 167 AVRKEEHVVKAESPEEGEVKEQDEEVVDVPEKKEGKVKETKVVKPEKAEKVTAQEKKVVK 226
V K E + + EV ++ + V + +V ++ + ++ E K
Sbjct: 1050 TVEKNEQDATETTAQNREVAKEAKSNVKA-NTQTNEVAQSG----SETKETQTTETKETA 1104

Query: 227 HETQVEEQPVKKVETKTVEKVEKPVEQKQEKQNEYVTVEEEEKPEVKLFIVEAFTSL 283
+ E+ V+ +T+ V KV V KQE+ E + ++ S
Sbjct: 1105 TVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQ 1161


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_5134  HTHFIS        109    1e-31    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 109 bits (275), Expect = 1e-31
Identities = 31/117 (26%), Positives = 56/117 (47%)

Query: 3 GKILIVDDQYGIRVLLHEVFQKEGYQTFQAANGFQALDIVKKDNPDLVVLDMKIPGMDGI 62
IL+ DD IR +L++ + GY +N + + DLVV D+ +P +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 63 EILKHVKEIDESIKVILMTAYGELDMIQEAKDLGALMHFAKPFDIDEIRQAVRDQLA 119
++L +K+ + V++M+A +A + GA + KPFD+ E+ + LA
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits         Score  E-value  Comments
BcerKBAB4_5138  HTHTETR      65     2e-15    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 65.4 bits (159), Expect = 2e-15
Identities = 27/141 (19%), Positives = 62/141 (43%), Gaps = 6/141 (4%)

Query: 19 RREQMIKGAVQLFKQKGFPRTTTREIAKAAGFSIGTLYEYIRTKDDVLYLVCDSIYEHVK 78
R+ ++ A++LF Q+G T+ EIAKAAG + G +Y + + K D+ + + ++
Sbjct: 12 TRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIG 71

Query: 79 ERLEEV-VCTEKGSIESLKIAIMNYFKVMDELQEE---VLIMYQEVRFLPKESLPYVLEK 134
E E + L+ +++ + + + I++ + F+ + ++ ++
Sbjct: 72 ELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQR 131

Query: 135 EF--QMVGMFEDILEQCTENG 153
+ E L+ C E
Sbjct: 132 NLCLESYDRIEQTLKHCIEAK 152


73  BcerKBAB4_5198  BcerKBAB4_5222  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_5198  2       13       0.411356   homoserine O-succinyltransferase
BcerKBAB4_5199  3       12       0.323697   O-acetylhomoserine
BcerKBAB4_5200  3       13       -0.989519  hypothetical protein
BcerKBAB4_5201  1       12       -0.833824  chloride channel core
BcerKBAB4_5202  -1      10       -0.518659  HAD family hydrolase
BcerKBAB4_5203  -2      11       -0.595378  DNA-binding transcriptional activator YeiL
BcerKBAB4_5204  -1      13       -0.347474  azoreductase
BcerKBAB4_5205  0       13       0.385084   two component LuxR family transcriptional
BcerKBAB4_5206  -1      12       -0.160409  GAF sensor signal transduction histidine kinase
BcerKBAB4_5207  2       11       0.230295   pyridoxal kinase
BcerKBAB4_5208  3       12       -0.094544  diguanylate cyclase
BcerKBAB4_5209  2       12       0.112955   hypothetical protein
BcerKBAB4_5210  1       12       -0.587581  carbon starvation protein CstA
BcerKBAB4_5211  0       11       -1.516255  LytTR family two component transcriptional
BcerKBAB4_5212  -1      11       -1.646143  major facilitator transporter
BcerKBAB4_5213  -1      9        -3.406613  WecB/TagA/CpsF family glycosyl transferase
BcerKBAB4_5214  -1      10       -3.524117  group 1 glycosyl transferase
BcerKBAB4_5215  -2      10       -3.655985  hypothetical protein
BcerKBAB4_5216  -1      12       -4.063411  hypothetical protein
BcerKBAB4_5217  -2      12       -5.291616  methyl-accepting chemotaxis sensory transducer
BcerKBAB4_5218  -1      11       -5.072935  hypothetical protein
BcerKBAB4_5219  -3      13       -3.086209  thioesterase superfamily protein
BcerKBAB4_5220  -1      13       -3.622315  glycosyl transferase family protein
BcerKBAB4_5221  -1      13       -4.016747  hypothetical protein
BcerKBAB4_5222  -2      11       -3.712492  polysaccharide biosynthesis protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits         Score  E-value  Comments
BcerKBAB4_5205  HTHFIS       93     3e-24    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 93.0 bits (231), Expect = 3e-24
Identities = 39/219 (17%), Positives = 85/219 (38%), Gaps = 31/219 (14%)

Query: 2 KIKVLLVDDHTVVLKGLAFFLSTQEDFELVGEANNGKEALVKVGETSPDVVLMDLYMPEM 61
+L+ DD + L L ++ +++ +N + D+V+ D+ MP+
Sbjct: 3 GATILVADDDAAIRTVLNQAL-SRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 62 DGIEATACIKKEYPNVKVIVLTSFSDQAHVLPALKAGASGYILKDIEPDQLVEAIRSAYK 121
+ + IKK P++ V+V+++ + + A + GA Y+ K + +L+ I
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG---- 116

Query: 122 GNIQLHPDIANALLSQTLPQEEKEEEPSVQVDVL--TARENEVLQLLAKGMSNKEIASVL 179
AL + E++ + ++ +A E+ ++LA+ M +++
Sbjct: 117 ----------RALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTD--LTLM 164

Query: 180 VITE----KTVKAHMSSILSKLH-LSDRTQAALYAVKNG 213
+ E K + A LH R A+
Sbjct: 165 ITGESGTGKELVARA------LHDYGKRRNGPFVAINMA 197


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits         Score  E-value  Comments
BcerKBAB4_5206  PF06580      32     0.005    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 32.1 bits (73), Expect = 0.005
Identities = 20/83 (24%), Positives = 39/83 (46%), Gaps = 7/83 (8%)

Query: 447 NVSKHA---NVREATIYFKVTEKNVSLEIVDQGNGFVE-KDIKEKKSLGMTTMRERVELV 502
N KH + I K T+ N ++ + + G + K+ KE G+ +RER++++
Sbjct: 266 NGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGTGLQNVRERLQML 325

Query: 503 GG---TIKIVSSKKRTSIKVNVP 522
G IK+ + + + V +P
Sbjct: 326 YGTEAQIKLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits         Score  E-value  Comments
BcerKBAB4_5211  HTHFIS       47     3e-08    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 47.1 bits (112), Expect = 3e-08
Identities = 20/134 (14%), Positives = 44/134 (32%), Gaps = 12/134 (8%)

Query: 2 KILLIMEETEERRKLVENFTENIRNVECFEAKTGTESLLIMKKHTPDFVFLNSQLMDGTG 61
IL+ ++ R L + + + + D V + + D
Sbjct: 5 TILVADDDAAIRTVLNQAL--SRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 62 FEYSSLLREVNCYTKFIFIGE--DIEEAITAFRFQAFYYLLRPFREEDLQFILYKMGKEQ 119
F+ +++ + + AI A A+ YL +PF +L I+
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGII------- 115

Query: 120 GEKAKSHLRKLPIE 133
+A + ++ P +
Sbjct: 116 -GRALAEPKRRPSK 128


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits         Score  E-value  Comments
BcerKBAB4_5212  TCRTETA      56     1e-10    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 55.6 bits (134), Expect = 1e-10
Identities = 64/369 (17%), Positives = 132/369 (35%), Gaps = 13/369 (3%)

Query: 7 ISKRKLLGIAGLGWLFDAMDVGMLSFVIVALQKDWGLSTQEMGWIG---SVNSIGMAVGA 63
+ + L + DA+ +G++ V+ L +D S G ++ ++ A
Sbjct: 1 MKPNRPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACA 60

Query: 64 LLFGILSDKIGRKSVFIITLLLFSIGSGLTALTTTFAMFLVLRFLIGMGLGGELPVASTL 123
+ G LSD+ GR+ V +++L ++ + A + + R + G+ G VA
Sbjct: 61 PVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGIT-GATGAVAGAY 119

Query: 124 VSESVEAHERGKIVVLLESFWAGGWLIAALISYF---VIPKYGWEVAMVLSAVPALYALY 180
+++ + ER + + + + G + ++ P + A L+ + L +
Sbjct: 120 IADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCF 179

Query: 181 LRWNLPDS-PRFQKVAKRPSVIENIKSVWSVEYRKATIMLWILWFCVVFSYYGMFLWLPS 239
L LP+S ++ +R ++ W+ ++ + + + LW+
Sbjct: 180 L---LPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIF 236

Query: 240 VMVLKGFSLIK-SFQYVLIMTLAQLPGYFTAAWFIERLGRKFVLVTYLIGTACSAYVFGI 298
+ L L RLG + L+ +I +
Sbjct: 237 GEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAF 296

Query: 299 ADSLTALIVAGMLLSFFNLGAWGALYAYTPEQYPTTIRGTGAGMAAAFGRIGGILGPLLV 358
A +LL+ +G AL A Q +G G AA + I+GPLL
Sbjct: 297 ATRGWMAFPIMVLLASGGIGM-PALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLF 355

Query: 359 GYLVASQAS 367
+ A+ +
Sbjct: 356 TAIYAASIT 364



Score = 33.3 bits (76), Expect = 0.001
Identities = 29/125 (23%), Positives = 45/125 (36%), Gaps = 5/125 (4%)

Query: 274 ERLGRKFVLVTYLIGTACSAYVFGIADSLTALIVAGMLLSFFNLGAWGALYAYTPEQYPT 333
+R GR+ VL+ L G A + A L L + G +++ AY +
Sbjct: 68 DRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYI-GRIVAGITGATGAVAGAYIADITDG 126

Query: 334 TIRGTGAG-MAAAFGRIGGILGPLLVGYLVASQASLSLIFTIFCGSILIGALAVVILGQE 392
R G M+A FG G + GP+L G + + L L E
Sbjct: 127 DERARHFGFMSACFG-FGMVAGPVLGGLMGGFSPHAPFFAAAALN--GLNFLTGCFLLPE 183

Query: 393 TKQRE 397
+ + E
Sbjct: 184 SHKGE 188


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits         Score  E-value  Comments
BcerKBAB4_5215  IGASERPTASE  37     1e-04    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 37.4 bits (86), Expect = 1e-04
Identities = 21/53 (39%), Positives = 26/53 (49%), Gaps = 3/53 (5%)

Query: 289 LSEQNVAQQGQKEKEIKEKIKKEEKQHKVEKPEEKAKVEAEVKKELEKEQKKE 341
VAQ G + KE + KE VEK EEKAKVE E +E+ K +
Sbjct: 1080 TQTNEVAQSGSETKETQTTETKETAT--VEK-EEKAKVETEKTQEVPKVTSQV 1129


74  BcerKBAB4_0060  BcerKBAB4_0067  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_0060  -3      19       3.410574   ATP-dependent metalloprotease FtsH
BcerKBAB4_0061  -1      19       2.560846   pantothenate kinase
BcerKBAB4_0062  0       19       2.944074   Hsp33-like chaperonin
BcerKBAB4_0063  1       18       2.571943   cysteine synthase A
BcerKBAB4_0064  1       19       1.742159   anthranilate synthase
BcerKBAB4_0065  2       19       1.800934   para-aminobenzoate/anthranilate synthase
BcerKBAB4_0066  -2      17       2.119856   4-amino-4-deoxychorismate lyase
BcerKBAB4_0067  0       16       2.396119   dihydropteroate synthase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits         Score  E-value  Comments
BcerKBAB4_0060  HTHFIS       36     4e-04    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 36.3 bits (84), Expect = 4e-04
Identities = 38/179 (21%), Positives = 57/179 (31%), Gaps = 41/179 (22%)

Query: 185 RKFAEVGARIPKGVLLVGPPGTGKTLLARAV---AGEAGVPFFS-----ISGSDFVEMFV 236
+ + +++ G GTGK L+ARA+ PF + I
Sbjct: 150 YRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELF 209

Query: 237 GV------GASRVRD-LFENAKKNAPCIIFIDEIDAVGRQRGAGLGGGHDEREQTLNQLL 289
G GA FE A+ +F+DEI + L +
Sbjct: 210 GHEKGAFTGAQTRSTGRFEQAEGGT---LFLDEIGDMPMDAQTRLLRVLQQG-------- 258

Query: 290 VEMDGFGANEGII----IIAATNRPDILDPALLRPGRFDRQITVDRPDVNGREAVLKVH 344
E G I I+AATN+ L + G F R D+ R V+ +
Sbjct: 259 -EYTTVGGRTPIRSDVRIVAATNKD--L-KQSINQGLF-------REDLYYRLNVVPLR 306


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits         Score  E-value  Comments
BcerKBAB4_0061  PF03309      377    e-135    Bvg accessory factor
		>PF03309#Bvg accessory factor

Length = 271

Score = 377 bits (970), Expect = e-135
Identities = 97/269 (36%), Positives = 163/269 (60%), Gaps = 12/269 (4%)

Query: 1 MIFVLDVGNTNAVLGVF----EEGELRQHWRMETDRHKTEDEYGMLVKQLLEHEGLSFED 56
M+ +DV NT+ V+G+ + ++ Q WR+ T+ T DE + + L+ G E
Sbjct: 1 MLLAIDVRNTHTVVGLISGSGDHAKVVQQWRIRTEPEVTADELALTIDGLI---GDDAER 57

Query: 57 VKGIIVSSVVPPIMFALERMCEKYFKIKP-LVVGPGIKTGLNIKYENPREVGADRIVNAV 115
+ G S VP ++ + M E+Y+ P +++ PG++TG+ + +NP+EVGADRIVN +
Sbjct: 58 LTGASGLSTVPSVLHEVRVMLEQYWPNVPHVLIEPGVRTGIPLLVDNPKEVGADRIVNCL 117

Query: 116 AGIHLYGSPLIIVDFGTATTYCYINEEKHYMGGVITPGIMISAEALYSRAAKLPRIEITK 175
A H YG+ I+VDFG++ ++ + ++GG I PG+ +S++A +R+A L R+E+T+
Sbjct: 118 AAYHKYGTAAIVVDFGSSICVDVVSAKGEFLGGAIAPGVQVSSDAAAARSAALRRVELTR 177

Query: 176 PGSVIGKNTVSAMQSGILYGYVGQVEGIVKRMKEEA----KQEPKVIATGGLAKLISEES 231
P SVIGKNTV MQ+G ++G+ G V+G+V R++++ + V+ATG A L+ +
Sbjct: 178 PRSVIGKNTVECMQAGAVFGFAGLVDGLVNRIRDDVDGFSGADVAVVATGHTAPLVLPDL 237

Query: 232 NVIDIVDPFLTLKGLYMLYERNANLQHEK 260
++ D LTL GL +++ERN Q K
Sbjct: 238 RTVEHYDRHLTLDGLRLVFERNRANQRGK 266


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits         Score  E-value  Comments
BcerKBAB4_0066  RTXTOXINA    28     0.040    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 28.4 bits (63), Expect = 0.040
Identities = 14/59 (23%), Positives = 29/59 (49%), Gaps = 6/59 (10%)

Query: 191 ILYTPSLETGILNGITRAFIIKAAEELDIEVKEGFFTKDELLSADEVFVTNSIQEIVPL 249
IL P G + + +++ A+EL IEV+ + K+ +VF + ++++ L
Sbjct: 50 ILLIPKDYKGQGSSLND--LVRTADELGIEVQ--YDEKNGTAITKQVF--GTAEKLIGL 102


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits         Score  E-value  Comments
BcerKBAB4_0067  PF07201      30     0.010    Hypersensitivity response secretion protein HrpJ
		>PF07201#Hypersensitivity response secretion protein HrpJ

Length = 293

Score = 29.8 bits (67), Expect = 0.010
Identities = 11/72 (15%), Positives = 26/72 (36%), Gaps = 4/72 (5%)

Query: 152 ILMHNRDNMNYRNLMADMIADLYESIKIAKDAGVRDENIILDPGIGFAKTPEQNLEAMRN 211
L + + +L+ + + E G R I +++ L+ +R+
Sbjct: 145 ALKGRPELAHLSHLVEQALVSMAEEQGETIVLGAR----ITPEAYRESQSGVNPLQPLRD 200

Query: 212 LEQLNVLGYPVL 223
+ V+GY +
Sbjct: 201 TYRDAVMGYQGI 212


75  BcerKBAB4_0365  BcerKBAB4_0377  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_0365  -1      12       -0.378516  ABC transporter
BcerKBAB4_0366  -2      12       0.326069   glycoside hydrolase family protein
BcerKBAB4_0367  0       17       -0.769657  hypothetical protein
BcerKBAB4_0368  -1      14       -0.440470  hypothetical protein
BcerKBAB4_0369  -1      14       -0.056105  hypothetical protein
BcerKBAB4_0370  -1      14       -0.415630  TetR family transcriptional regulator
BcerKBAB4_0371  -3      11       -0.173681  major facilitator transporter
BcerKBAB4_0372  1       21       3.055398   XRE family transcriptional regulator
BcerKBAB4_0373  1       19       2.602848   major facilitator transporter
BcerKBAB4_0374  3       21       2.988330   type I phosphodiesterase/nucleotide
BcerKBAB4_0375  4       25       3.572472   hypothetical protein
BcerKBAB4_0376  3       26       3.668587   prolyl-tRNA synthetase
BcerKBAB4_0377  4       30       4.064286   cell wall anchor domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits         Score  E-value  Comments
BcerKBAB4_0365  PF05272      35     8e-04    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 35.0 bits (80), Expect = 8e-04
Identities = 12/28 (42%), Positives = 15/28 (53%)

Query: 34 LVGKNGIGKSTLLRLLTGELIHDDGNIE 61
L G GIGKSTL+ L G D + +
Sbjct: 601 LEGTGGIGKSTLINTLVGLDFFSDTHFD 628


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits         Score  E-value  Comments
BcerKBAB4_0369  cloacin      27     0.006    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 27.4 bits (60), Expect = 0.006
Identities = 10/27 (37%), Positives = 13/27 (48%)

Query: 57 HDGGSHDCGGSFGGDSGGSCGGGGGGD 83
H+ G+H G+ G G GGG D
Sbjct: 9 HNTGAHSTSGNINGGPTGLGVGGGASD 35



Score = 25.8 bits (56), Expect = 0.022
Identities = 13/33 (39%), Positives = 15/33 (45%)

Query: 50 GSDNDSSHDGGSHDCGGSFGGDSGGSCGGGGGG 82
GS + GGS G G+SGG G GG
Sbjct: 49 GSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNL 81


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BcerKBAB4_0370HTHTETR842e-22 TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 83.9 bits (207), Expect = 2e-22
Identities = 46/198 (23%), Positives = 85/198 (42%), Gaps = 8/198 (4%)

Query: 1 MRRSAEEIKKEIAYKAESLFSQKGYAATSMEEICEITERSKGSIYYHFKSKEELFLFVVK 60
++ A+E ++ I A LFSQ+G ++TS+ EI + ++G+IY+HFK K +LF + +
Sbjct: 5 TKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWE 64

Query: 61 QHTYDWLEKWNEK-EKLYSTSTEKLYGLAEYYVEDIQQPISN----AIEEFSMSQVVSKE 115
+ E E K L + + +E I V
Sbjct: 65 LSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMA 124

Query: 116 ILDELLALT-RESYVMVEKLIEAGIQSGEFQED-DTRDLMYIVNGLLSGL-GVLYYELDY 172
++ + ESY +E+ ++ I++ D TR I+ G +SGL +
Sbjct: 125 VVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQS 184

Query: 173 KELKRIYKKAIDVLLKGM 190
+LK+ + + +LL+
Sbjct: 185 FDLKKEARDYVAILLEMY 202


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits         Score  E-value  Comments
BcerKBAB4_0371  TCRTETA      39     3e-05    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 38.7 bits (90), Expect = 3e-05
Identities = 27/130 (20%), Positives = 50/130 (38%), Gaps = 5/130 (3%)

Query: 34 FIMERTNNDPVSVSL-LSVMEYAPIFIFSFIGGALADRWNPKRTMVAGDVLSVLSIVGIV 92
F +R + D ++ + L+ + I G +A R +R ++ G + I+
Sbjct: 236 FGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGY--IL 293

Query: 93 LLLKLDYWQAIFFATLISAIVGQFSQPSSSRIFKRYVKEEQVANAIAFNQTLQSLFMIFG 152
L W + F ++ G P+ + R V EE+ L SL I G
Sbjct: 294 LAFATRGW--MAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVG 351

Query: 153 PVVGSLVYTQ 162
P++ + +Y
Sbjct: 352 PLLFTAIYAA 361



Score = 33.6 bits (77), Expect = 0.001
Identities = 58/342 (16%), Positives = 122/342 (35%), Gaps = 22/342 (6%)

Query: 58 FIFSFIGGALADRWNPKRTMVAGDVLSVLSIVGIVLLLKLDYWQAIFFATLISAIVGQFS 117
F + + GAL+DR+ + ++ + V ++ + ++ +++ I G +
Sbjct: 57 FACAPVLGALSDRFGRRPVLLVS---LAGAAVDYAIMATAPFLWVLYIGRIVAGITGA-T 112

Query: 118 QPSSSRIFKRYVKEEQVANAIAFNQTLQSLFMIFGPVVGSL---VYTQLGLFTSLYSLII 174
+ ++ A F M+ GPV+G L F + +
Sbjct: 113 GAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGL 172

Query: 175 LFLLSAIALSFLPKWVEQEQVARGSLKNDIKEGWKYVLHTKNLHMITITFTIMGLAVGLT 234
FL LP+ + E+ + +++ + + F IM L +
Sbjct: 173 NFLTGCF---LLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVP 229

Query: 235 NPLEVFLVIERLGMEKEAVQYLAAADGI-GMLIGGIVAAVFASKVNPKKMFVFGMSILAM 293
L V +R + + AA GI L ++ A+++ ++ + GM
Sbjct: 230 AALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGT 289

Query: 294 SFLVEGLSTSFWITSFMRFGTGICLACVNI---VVGTLMIQLVPENMIGRVNGTILPLFM 350
+++ +T W M F + LA I + ++ + V E G++ G++ L
Sbjct: 290 GYILLAFATRGW----MAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTS 345

Query: 351 GAMLIGTSLAGGLKEMTSLV---TVFCIAMALILFAIGPVLR 389
++G L + + + AL L + P LR
Sbjct: 346 LTSIVGPLLFTAIYAASITTWNGWAWIAGAALYLLCL-PALR 386


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits         Score  E-value  Comments
BcerKBAB4_0373  TCRTETB      48     4e-08    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 48.0 bits (114), Expect = 4e-08
Identities = 31/158 (19%), Positives = 58/158 (36%), Gaps = 3/158 (1%)

Query: 264 DLGISATNLLIILFVTQIVACPFALLYGKLSETFTGKKMLYVGIIIYIIICTYAYFLKTT 323
D + + + +YGKLS+ K++L GIII + +
Sbjct: 43 DFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSF 102

Query: 324 FDFWILAMLV-ATSQGGIQALSRSYFAKLVPKESANEFFGFYNIFGKFAAIMGPVLVGVT 382
F I+A + AL A+ +PKE+ + FG +GP + G+
Sbjct: 103 FSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMI 162

Query: 383 TQLTGKTNAGVLSIIVLFIIGGFLLTRVPENDTSVTPP 420
+ +L I ++ II L ++ + + +
Sbjct: 163 AHYIHWSY--LLLIPMITIITVPFLMKLLKKEVRIKGH 198


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits         Score  E-value  Comments
BcerKBAB4_0377  cloacin      49     3e-08    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 48.6 bits (115), Expect = 3e-08
Identities = 36/81 (44%), Positives = 41/81 (50%)

Query: 231 SGGNGSGGNGSGGNGSGGNGSGGNGSGGSGSGGSGSGGSGSGGSGSGGSGSGGSGSGGSG 290
SGG+G G N + SG G G G G GSG S GGSGSG GGSG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 291 SGGSGSGGSGSGGSGSGGSGS 311
G G G+ GGSG+GG+ S
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLS 82



Score = 48.6 bits (115), Expect = 3e-08
Identities = 36/81 (44%), Positives = 41/81 (50%)

Query: 226 SGGNGSGGNGSGGNGSGGNGSGGNGSGGNGSGGSGSGGSGSGGSGSGGSGSGGSGSGGSG 285
SGG+G G N + SG G G G G GSG S GGSGSG GGSG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 286 SGGSGSGGSGSGGSGSGGSGS 306
G G G+ GGSG+GG+ S
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLS 82



Score = 48.2 bits (114), Expect = 3e-08
Identities = 36/81 (44%), Positives = 42/81 (51%)

Query: 221 SGGNGSGGNGSGGNGSGGNGSGGNGSGGNGSGGNGSGGSGSGGSGSGGSGSGGSGSGGSG 280
SGG+G G N + SG G G G G +GSG S GGSGSG GGSG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 281 SGGSGSGGSGSGGSGSGGSGS 301
G G G+ GGSG+GG+ S
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLS 82



Score = 47.0 bits (111), Expect = 8e-08
Identities = 38/103 (36%), Positives = 47/103 (45%)

Query: 211 SGGSGSGDNGSGGNGSGGNGSGGNGSGGNGSGGNGSGGNGSGGNGSGGSGSGGSGSGGSG 270
SGG G G N + SG G G G G +GSG + GGSGSG GGSG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 271 SGGSGSGGSGSGGSGSGGSGSGGSGSGGSGSGGSGSGGSGSQG 313
G G G+ GGSG+GG+ S + G + G+G
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 46.6 bits (110), Expect = 1e-07
Identities = 35/81 (43%), Positives = 42/81 (51%)

Query: 166 SGDNGSGGNGSGGNGSGGNGSGGRGSGGNGSGGNGSGGSGSGGNGSGGSGSGDNGSGGNG 225
SG +G G N + SG G G G G +GSG S GGSGSG + GG+G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 226 SGGNGSGGNGSGGNGSGGNGS 246
G G GN GG+G+GGN S
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLS 82



Score = 46.6 bits (110), Expect = 1e-07
Identities = 34/80 (42%), Positives = 39/80 (48%)

Query: 236 SGGNGSGGNGSGGNGSGGNGSGGSGSGGSGSGGSGSGGSGSGGSGSGGSGSGGSGSGGSG 295
SGG+G G N + SG G +G G G GSG S GGSGSG GGSG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 296 SGGSGSGGSGSGGSGSQGGN 315
G G G+ GGSG+ G
Sbjct: 62 HGNGGGNGNSGGGSGTGGNL 81



Score = 46.2 bits (109), Expect = 1e-07
Identities = 35/81 (43%), Positives = 42/81 (51%)

Query: 116 SGGNGSDGNGSGGNGSGDNGSGGNGSGGNGSGGNGSGGSGSGGNGSGGSGSGDNGSGGNG 175
SGG+G N + SG+ G G G G +GSG S GGSGSG + GG+G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 176 SGGNGSGGNGSGGRGSGGNGS 196
G G GN GG G+GGN S
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLS 82



Score = 46.2 bits (109), Expect = 2e-07
Identities = 40/116 (34%), Positives = 51/116 (43%), Gaps = 2/116 (1%)

Query: 176 SGGNGSGGNGSGGRGSGGNGSGGNGSGGSGSGGNGSGGSGSGDNGSGGNGSGGNGSGGNG 235
SGG+G G N G + GN +GG G G G + G S +N GG G GG
Sbjct: 2 SGGDGRGHNT-GAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 236 SGGNGSGGNGSGGNGSGGNGSGGSGSGGSGSGGSGSGGSGSGGSGSGGSGSGGSGS 291
GNG GGNG+ G GSG G+ + + G G+GG S S +
Sbjct: 61 GHGNG-GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 45.9 bits (108), Expect = 2e-07
Identities = 35/81 (43%), Positives = 43/81 (53%)

Query: 156 SGGNGSGGSGSGDNGSGGNGSGGNGSGGNGSGGRGSGGNGSGGNGSGGSGSGGNGSGGSG 215
SGG+G G + + SG G G G G GSG + GGSGSG + GGSG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 216 SGDNGSGGNGSGGNGSGGNGS 236
G+ G GN GG+G+GGN S
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLS 82



Score = 45.9 bits (108), Expect = 2e-07
Identities = 33/81 (40%), Positives = 41/81 (50%)

Query: 171 SGGNGSGGNGSGGNGSGGRGSGGNGSGGNGSGGSGSGGNGSGGSGSGDNGSGGNGSGGNG 230
SGG+G G N + SG G G G G GSG + G +GSG + GG+G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 231 SGGNGSGGNGSGGNGSGGNGS 251
G G GN GG+G+GGN S
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLS 82



Score = 45.1 bits (106), Expect = 3e-07
Identities = 34/81 (41%), Positives = 41/81 (50%)

Query: 131 SGDNGSGGNGSGGNGSGGNGSGGSGSGGNGSGGSGSGDNGSGGNGSGGNGSGGNGSGGRG 190
SG +G G N + SG G +G G G GSG + GG+GSG + GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 191 SGGNGSGGNGSGGSGSGGNGS 211
G G GN GGSG+GGN S
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLS 82



Score = 45.1 bits (106), Expect = 3e-07
Identities = 37/109 (33%), Positives = 49/109 (44%)

Query: 206 SGGNGSGGSGSGDNGSGGNGSGGNGSGGNGSGGNGSGGNGSGGNGSGGNGSGGSGSGGSG 265
SGG+G G + + SG G G G G +GSG + GG+GSG GGSG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 266 SGGSGSGGSGSGGSGSGGSGSGGSGSGGSGSGGSGSGGSGSGGSGSQGG 314
G G G+ GGSG+GG+ S + G + G+G G
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 43.9 bits (103), Expect = 7e-07
Identities = 32/81 (39%), Positives = 42/81 (51%)

Query: 146 SGGNGSGGSGSGGNGSGGSGSGDNGSGGNGSGGNGSGGNGSGGRGSGGNGSGGNGSGGSG 205
SGG+G G + + SG G G G G +GSG + GG+GSG + GGSG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 206 SGGNGSGGSGSGDNGSGGNGS 226
G G G+ G +G+GGN S
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLS 82



Score = 43.5 bits (102), Expect = 9e-07
Identities = 34/81 (41%), Positives = 43/81 (53%)

Query: 106 SGGNGSGGSGSGGNGSDGNGSGGNGSGDNGSGGNGSGGNGSGGNGSGGSGSGGNGSGGSG 165
SGG+G G + + S G G G G +GSG + GGSGSG + GGSG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 166 SGDNGSGGNGSGGNGSGGNGS 186
G+ G GN GG+G+GGN S
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLS 82



Score = 42.8 bits (100), Expect = 2e-06
Identities = 33/81 (40%), Positives = 41/81 (50%)

Query: 81 SGGNGSGGSGSDGSGSDGNGSDGNGSGGNGSGGSGSGGNGSDGNGSGGNGSGDNGSGGNG 140
SGG+G G + S S G G G GSG + + GG+GSG + GG+G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 141 SGGNGSGGNGSGGSGSGGNGS 161
G G GN GGSG+GGN S
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLS 82



Score = 42.8 bits (100), Expect = 2e-06
Identities = 32/81 (39%), Positives = 39/81 (48%)

Query: 101 SDGNGSGGNGSGGSGSGGNGSDGNGSGGNGSGDNGSGGNGSGGNGSGGNGSGGSGSGGNG 160
S G+G G N S SG G G G +GSG + GG+GSG GG+G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 161 SGGSGSGDNGSGGNGSGGNGS 181
G G N GG+G+GGN S
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLS 82



Score = 42.4 bits (99), Expect = 2e-06
Identities = 40/116 (34%), Positives = 50/116 (43%), Gaps = 2/116 (1%)

Query: 191 SGGNGSGGNGSGGSGSGGNGSGGSGSGDNGSGGNGSGGNGSGGNGSGGNGSGGNGSGGNG 250
SGG+G G N S SG G +G G G +GSG S N G GSG G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGW--SSENNPWGGGSGSGIHWGGG 59

Query: 251 SGGNGSGGSGSGGSGSGGSGSGGSGSGGSGSGGSGSGGSGSGGSGSGGSGSGGSGS 306
SG GG+G+ G GSG G+ + + G G+GG S S +
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 39.3 bits (91), Expect = 2e-05
Identities = 36/103 (34%), Positives = 43/103 (41%), Gaps = 2/103 (1%)

Query: 136 SGGNGSGGNGSGGNGSGGSGSGGNGSGGSGSGDNGSGGNGSGGNGSGGNGSGGRGSGGNG 195
SGG+G G N + SG G G G G +GSG S N GG G GG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGW-SSENNPWGGGSGSGIHWGGGS 60

Query: 196 SGGNGSGGSGSGGNGSGGSGSGDNGSGGNGSGGNGSGGNGSGG 238
GNG G G+ G GSG G+ + G G+GG
Sbjct: 61 GHGNGGGN-GNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 38.2 bits (88), Expect = 5e-05
Identities = 27/68 (39%), Positives = 34/68 (50%)

Query: 64 SGSGGNGSGGNGSGGSGSGGNGSGGSGSDGSGSDGNGSDGNGSGGNGSGGSGSGGNGSDG 123
S SG G G G G +GSG S + G+GS + GG+G G G GN G
Sbjct: 15 STSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGG 74

Query: 124 NGSGGNGS 131
+G+GGN S
Sbjct: 75 SGTGGNLS 82



Score = 36.6 bits (84), Expect = 2e-04
Identities = 37/85 (43%), Positives = 42/85 (49%), Gaps = 5/85 (5%)

Query: 246 SGGNGSGGNGSGGSGSGGSGSGGSGSGGSGSGGS-GSGGSGSGGSGSGGSGSGGSGSGGS 304
SGG+G G N G + G+ +GG G G G S GSG S GGSGSG GGS
Sbjct: 2 SGGDGRGHNT-GAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 305 GSGGSGSQGGNRVKGDSSSQGGNGS 329
G G G G + G S GGN S
Sbjct: 61 GHGNGGGNGNS---GGGSGTGGNLS 82



Score = 34.7 bits (79), Expect = 6e-04
Identities = 28/74 (37%), Positives = 37/74 (50%), Gaps = 5/74 (6%)

Query: 66 SGGNGSGGNGSGGSGSGGNGSGGSGSDGSGSDGN---GSDGNGSGGNGSGGSGSGGNGSD 122
+G + + GN +GG G G G S G S+ N G G+G G G G+GG
Sbjct: 11 TGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGN-- 68

Query: 123 GNGSGGNGSGDNGS 136
GN GG+G+G N S
Sbjct: 69 GNSGGGSGTGGNLS 82



Score = 30.1 bits (67), Expect = 0.016
Identities = 28/64 (43%), Positives = 31/64 (48%), Gaps = 2/64 (3%)

Query: 58 NGSQDDSGSGGNGSGGNGSGGSGSGGNGSGGSGSDGSGSDGNGSDGNGSGGNGSGGSGSG 117
NG G GG S +GSG S GGSGS G+G G GN GGSG+G
Sbjct: 21 NGGPTGLGVGGGAS--DGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTG 78

Query: 118 GNGS 121
GN S
Sbjct: 79 GNLS 82


76  BcerKBAB4_0467  BcerKBAB4_0473  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_0467  1       11       -1.817868  cell wall anchor domain-containing protein
BcerKBAB4_0468  0       11       -0.660232  collagenase
BcerKBAB4_0469  -1      11       0.194235   hypothetical protein
BcerKBAB4_0470  -1      11       0.352878   hypothetical protein
BcerKBAB4_0471  -1      12       -0.427808  methyl-accepting chemotaxis sensory transducer
BcerKBAB4_0472  0       13       -0.084063  signal transduction histidine kinase regulating
BcerKBAB4_0473  -1      15       0.234978   response regulator receiver
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits         Score  E-value  Comments
BcerKBAB4_0467  IGASERPTASE  46     9e-07    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 45.8 bits (108), Expect = 9e-07
Identities = 49/287 (17%), Positives = 93/287 (32%), Gaps = 26/287 (9%)

Query: 705 QVPVYDLEGETIENIKLTSEGGTFNNGVMKWSTPGEKVYKFDLESDETSICFNGTVIQNI 764
++ ++D +++ ++ G T + G K+ ++DL + E
Sbjct: 939 ELTLFDASKAQRDHLNVSLVGNTVDLGAWKYKLRNVN-GRYDLYNPE------------- 984

Query: 765 VEKEEVTEPTKEVEEI-KEEVKEPIKEVEETKEEVKEPVKEIEEIKEEVKEPVKEVEETK 823
VEK T T + + P + E V E P + E
Sbjct: 985 VEKRNQTVDTTNITTPNNIQADVP---SVPSNNEEIARVDEAPVPPPAPATPSETTETVA 1041

Query: 824 EEVKEPVKEIEETKEEVKEPAKEVAGTKEEVKEPAK------EVAGTKEEVKEPVKEVEE 877
E K+ K +E+ +++ E + +E K K EVA + E KE +
Sbjct: 1042 ENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETK 1101

Query: 878 IKEEVKEPAKEVAGTKEEVKEPAKEVEEIKEEVKEEVKEPAKEVAGTKEEVKESATGLDQ 937
V++ K T++ + P + ++ + E +P E A + Q
Sbjct: 1102 ETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQ 1161

Query: 938 EPKGNNQVGPVKVVGQ--ESAKEKDSNKRHANKQEENQKSLAATGGQ 982
+ P K E + + N EN ++ Q
Sbjct: 1162 TNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQ 1208



Score = 41.2 bits (96), Expect = 2e-05
Identities = 36/173 (20%), Positives = 60/173 (34%), Gaps = 12/173 (6%)

Query: 820 EETKEEVKEPVKEIEETKEEVKEPAKEVAGTKEEVKEPAKEVAGTKEEVKEPVKEVEEIK 879
+ E V E P++ E K+ +K V +++ E + E+
Sbjct: 1010 VPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVA 1069

Query: 880 EEVKEPAK------EVAGTKEEVKEPAKEVEEIKEEVKEEVKEPAKEVAGTKEEVKESAT 933
+E K K EVA + E KE + E KE E +E AK +EV + +
Sbjct: 1070 KEAKSNVKANTQTNEVAQSGSETKE--TQTTETKETATVEKEEKAKVETEKTQEVPKVTS 1127

Query: 934 GLDQEPKGNNQVGPVKVVGQESAKEKDSNKRHANKQEENQKSLAATGGQANTP 986
+ + + + V P E A+E D Q + + T
Sbjct: 1128 QVSPKQEQSETVQP----QAEPARENDPTVNIKEPQSQTNTTADTEQPAKETS 1176



Score = 38.5 bits (89), Expect = 2e-04
Identities = 38/168 (22%), Positives = 64/168 (38%), Gaps = 14/168 (8%)

Query: 769 EVTEPTKEVEEIKEEVKEPIKEVEETKEEVKEPVKEIEEIKEEVKEPVKEVE-ETKEEVK 827
EV + E +E + + VE+ ++ E K E K + K+ + ET +
Sbjct: 1084 EVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQA 1143

Query: 828 EPVKEIEETKEEVKEPAKEVAGTKEEVKEPAKEVAGTKEEVKEPVKE---VEEIKEEVKE 884
EP +E + +KEP + T + ++PAKE T V++PV E V V+
Sbjct: 1144 EPARE-NDPTVNIKEPQSQ-TNTTADTEQPAKE---TSSNVEQPVTESTTVNTGNSVVEN 1198

Query: 885 PAKEVAGTKEEVKEPAKEVEEIKEEVKEEVKEPAKEVAGTKEEVKESA 932
P T + E + K + + V E S+
Sbjct: 1199 PENTTPATTQP-----TVNSESSNKPKNRHRRSVRSVPHNVEPATTSS 1241



Score = 32.0 bits (72), Expect = 0.017
Identities = 26/159 (16%), Positives = 53/159 (33%), Gaps = 14/159 (8%)

Query: 835 ETKEEVKEPAKEVAGTKEEVKEPAKEVAGTKEEVKEPVKEVEEIKEEVKEPAKEVAGTKE 894
E + + + + P+ + E V E P++ E
Sbjct: 986 EKRNQTVDTTNITTPNNIQADVPSVP---SNNEEIARVDEAPVPPPAPATPSETTETVAE 1042

Query: 895 EVKEPAKEVEEIKEEVKEEVKEPAKEVAGTKEEVKESATGLDQEPKGNNQVGPVKVVGQE 954
K+ +K VE+ +++ E T + +E A K N Q V G E
Sbjct: 1043 NSKQESKTVEKNEQDATE-----------TTAQNREVAKEAKSNVKANTQTNEVAQSGSE 1091

Query: 955 SAKEKDSNKRHANKQEENQKSLAATGGQANTPTLLSGLA 993
+ + + + + E+ +K+ T P + S ++
Sbjct: 1092 TKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVS 1130


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits         Score  E-value  Comments
BcerKBAB4_0468  MICOLLPTASE  750    0.0      Microbial collagenase metalloprotease (M9) signature.
		>MICOLLPTASE#Microbial collagenase metalloprotease (M9) signature.

Length = 1104

Score = 750 bits (1936), Expect = 0.0
Identities = 413/912 (45%), Positives = 581/912 (63%), Gaps = 20/912 (2%)

Query: 70 KVGDFSQRPASITKNSAVKQAKEGYSMADLNKMNDQELIETLGSIKWHQITDLFQFNEDA 129
+V + RP + + + + Y+ +LN+MN +L+E + +I + + DLF FN+ +
Sbjct: 69 EVATDNNRPLGPSIAPSRARNNKIYTFDELNRMNYSDLVELIKTISYENVPDLFNFNDGS 128

Query: 130 KAFYKDNGKMQVIIDELAHRGSTFTKDDSKGIQTFTEVLRSAFYLAFYNNELSDLNERSF 189
F+ + ++Q II L G T+T DD KGI T E LR+ +YL FYN +LS LN
Sbjct: 129 YTFFSNRDRVQAIIYGLEDSGRTYTADDDKGIPTLVEFLRAGYYLGFYNKQLSYLNTPQL 188

Query: 190 QDKCLPALKAIAKNPNFKLGTNEQDTVVSAYGKLISNASSDVETVQYASDILKQYNDNFN 249
+++CLPA+KAI N NF+LGT QD VV A G+LI NAS+D E + +L + DN +
Sbjct: 189 KNECLPAMKAIQYNSNFRLGTKAQDGVVEALGRLIGNASADPEVINNCIYVLSDFKDNID 248

Query: 250 AYVDDRMKGQAVYDMMQGIDYDIQSYLNEARKE-ANETMWYGKVDGFINEINRIALL-NQ 307
Y + KG AV+++M+GIDY S + + A T +Y ++D ++ + + + ++
Sbjct: 249 KYGSNYSKGNAVFNLMKGIDYYTNSVIYNTKGYDAKNTEFYNRIDPYMERLESLCTIGDK 308

Query: 308 VTPENKWLVNNGIYFASRLGKFHSNPNKGLEVVTQAMHMYPRLSEPYFVAVEQITTNYNG 367
+ +N WLVNN +Y+ R+GKF +P+ + +AM YP LS Y A + N+ G
Sbjct: 309 LNNDNAWLVNNALYYTGRMGKFREDPSISQRALERAMKEYPYLSYQYIEAANDLDLNFGG 368

Query: 368 KDYSGNTVDLEKIRKEGKEQYLPKTYTFDDGSIVFKTGDKVSEEKIKRLYWAAKEVKAQY 427
K+ SGN +D KI+ + +E+YLPKTYTFDDG V K GDKV+EEKIKRLYWA+KEVKAQ+
Sbjct: 369 KNSSGNDIDFNKIKADAREKYLPKTYTFDDGKFVVKAGDKVTEEKIKRLYWASKEVKAQF 428

Query: 428 HRVIGNDKALEQGNADDVLTIVIYNTPDEYQLNRQLYGYETNNGGIYIEETGTFFTYERT 487
RV+ NDKALE+GN DD+LT+VIYN+P+EY+LNR + G+ T+NGGIYIE GTFFTYERT
Sbjct: 429 MRVVQNDKALEEGNPDDILTVVIYNSPEEYKLNRIINGFSTDNGGIYIENIGTFFTYERT 488

Query: 488 PEQSIYSLEELFRHEFTHYLQGRYEVPGLFGRGDMYQNERLTWFQEGNAEFFAGSTRTNN 547
PE+SIY+LEELFRHEFTHYLQGRY VPG++G+G+ YQ LTW++EG AEFFAGSTRT+
Sbjct: 489 PEESIYTLEELFRHEFTHYLQGRYVVPGMWGQGEFYQEGVLTWYEEGTAEFFAGSTRTDG 548

Query: 548 VVPRKSIISGLSSDPASRYTAERTLFAKYGSWDFYNYSFALQSYLYTHQFETFDKIQDLI 607
+ PRKS+ GL+ D +R + L AKYGSWDFYNY FAL +Y+Y + F+K+ + I
Sbjct: 549 IKPRKSVTQGLAYDRNNRMSLYGVLHAKYGSWDFYNYGFALSNYMYNNNMGMFNKMTNYI 608

Query: 608 RANDVGNYDTYRETLSKDSKLNKEYQEYMQQLIDNQDKYNVPEVADDYLAEHAPKALTEV 667
+ NDV Y Y ++S D LN +YQ+YM L++N D +VP V+D+Y+ H K + E+
Sbjct: 609 KNNDVSGYKDYIASMSSDYGLNDKYQDYMDSLLNNIDNLDVPLVSDEYVNGHEAKDINEI 668

Query: 668 KKEISETLSMKDTKMTKHTSQFFNTFTLEGTFTGSVTKGESEDWKAMSKRVNEALEQLTQ 727
+I E ++KD SQFF T+ + GT+ G ++GE DWK M+ ++N+ L++L++
Sbjct: 669 TNDIKEVSNIKDLSSNVEKSQFFTTYDMRGTYVGGRSQGEENDWKDMNSKLNDILKELSK 728

Query: 728 KEWSGYKTVTAYFVNYRVNSSNEFEYDVVFHG----IAKDDRENKAPTVNVNGPYNGFVK 783
K W+GYKTVTAYFVN++V+ + + YDVVFHG D NK P + + V+
Sbjct: 729 KSWNGYKTVTAYFVNHKVDGNGNYVYDVVFHGMNTDTNTDVHVNKEPKAVIKSDSSVIVE 788

Query: 784 EGIQFKSDGSKDEDGKIVSYLWDFGDGRTSKEVNPVHAYEREGTYKVALVVKDDKGKESR 843
E I F SKDEDG+I +Y WDFGDG S E H Y + G Y+V L V D+ G
Sbjct: 789 EEINFDGTESKDEDGEIKAYEWDFGDGEKSNEAKATHKYNKTGEYEVKLTVTDNNG--GI 846

Query: 844 SETTVTIKDGS------VTESEPNNRPEEANRIG-LNTTIKGSLIGGDHTDVYTFNVASA 896
+ + IK + ESEPNN E+AN+I N +KG+L D++D Y F+VA
Sbjct: 847 NTESKKIKVVEDKPVEVINESEPNNDFEKANQIAKSNMLVKGTLSEEDYSDKYYFDVAKK 906

Query: 897 KDIDISVLNEYGIGMTWVLHHESDMQNYAAYGQANGNHIEAK---FNAKPGKYYLYVYKY 953
++ I++ N +G+TW L+ E D+ NY Y A GN +PG+YYL VY Y
Sbjct: 907 GNVKITLNNLNSVGITWTLYKEGDLNNYVLY--ATGNDGTVLKGEKTLEPGRYYLSVYTY 964

Query: 954 DNGDGTYALSVK 965
DN GTY ++VK
Sbjct: 965 DNQSGTYTVNVK 976



Score = 98.6 bits (245), Expect = 5e-23
Identities = 60/250 (24%), Positives = 99/250 (39%), Gaps = 47/250 (18%)

Query: 762 KDDRENKAPTVNVNGPYNGFVKEGIQFKSD----GSKDEDGKIVSYLWDF---------- 807
K + +N + P N F K KS+ G+ E+ Y +D
Sbjct: 854 KVVEDKPVEVINESEPNNDFEKANQIAKSNMLVKGTLSEEDYSDKYYFDVAKKGNVKITL 913

Query: 808 ---------------GDGRT-SKEVNPVHAYEREGTYKVA-----LVVKDDKGKES---- 842
GD +G + L V +
Sbjct: 914 NNLNSVGITWTLYKEGDLNNYVLYATGNDGTVLKGEKTLEPGRYYLSVYTYDNQSGTYTV 973

Query: 843 ------RSETTVTIKDGSVTESEPNNRPEEANRIGLNTTIKGSLIGGDHTDVYTFNVASA 896
++E T KD ++ E E NN ++A ++ N+ I G+L D D+Y+ ++ +
Sbjct: 974 NVKGNLKNEVKETAKD-AIKEVENNNDFDKAMKVDSNSKIVGTLSNDDLKDIYSIDIQNP 1032

Query: 897 KDIDISVLNEYGIGMTWVLHHESDMQNYAAYGQANGNHIEAKFNAKPGKYYLYVYKYDN- 955
D++I V N I M W+L+ D+ NY Y A+GN + PGKYYL VY+++N
Sbjct: 1033 SDLNIVVENLDNIKMNWLLYSADDLSNYVDYANADGNKLSNTCKLNPGKYYLCVYQFENS 1092

Query: 956 GDGTYALSVK 965
G G Y ++++
Sbjct: 1093 GTGNYIVNLQ 1102
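
The statistics line printed above each alignment (Identities, Positives, Gaps) can be cross-checked mechanically. The following is a minimal sketch, assuming the line layout shown in this report; the helper names `parse_hsp_stats` and `percent_is_consistent` are hypothetical and not part of PredictBias or BLAST. It accepts either truncation or rounding of the percentage, since the report does not say which one is applied.

```python
import re

# Assumed layout of a statistics line, e.g.
# "Identities = 413/912 (45%), Positives = 581/912 (63%), Gaps = 20/912 (2%)"
STAT_RE = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_hsp_stats(line):
    """Return {name: (count, length, reported_percent)} for one statistics line."""
    return {
        name: (int(count), int(length), int(pct))
        for name, count, length, pct in STAT_RE.findall(line)
    }


def percent_is_consistent(count, length, reported):
    """True if the reported percentage equals count/length, truncated or rounded."""
    exact = 100.0 * count / length
    return reported in (int(exact), round(exact))


line = "Identities = 413/912 (45%), Positives = 581/912 (63%), Gaps = 20/912 (2%)"
stats = parse_hsp_stats(line)
for name, (count, length, reported) in stats.items():
    # Print each statistic with a flag showing whether the percentage checks out.
    print(name, count, length, reported, percent_is_consistent(count, length, reported))
```

The same check can be run on any other statistics line in this report by changing `line`.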


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0470  IGASERPTASE   41     1e-05    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 40.8 bits (95), Expect = 1e-05
Identities = 29/194 (14%), Positives = 67/194 (34%), Gaps = 12/194 (6%)

Query: 201 QPQIAMVKRDATVANAEREKEARIEKARAEKEAKEAEYQRDAQIAEAEKHKELKVQSYKR 260
P + T AE K+ + E++A E Q EA+ + + Q+ +
Sbjct: 1026 PPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEV 1085

Query: 261 DQEQARADADLSYELQQAKAQQGVTEEQMRVKIIEREKQIELEEKEIARREKQYDAEVKK 320
Q + E Q + ++ T E+ E + ++E E+ + + + ++
Sbjct: 1086 AQSGSETK-----ETQTTETKETATVEK------EEKAKVETEKTQEVPKVTSQVSPKQE 1134

Query: 321 KADADRYAVEQSAEAEKVKQMKKADADQYKIEAEARARAEEVRVEGLAKAEIEKAQGQAK 380
+++ + E + E + +K+ + A+ A+E
Sbjct: 1135 QSETVQPQAEPARENDPTVNIKEPQSQTNT-TADTEQPAKETSSNVEQPVTESTTVNTGN 1193

Query: 381 AEVQKAQGTAEADV 394
+ V+ + T A
Sbjct: 1194 SVVENPENTTPATT 1207


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0472  PF06580       29     0.041    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 29.1 bits (65), Expect = 0.041
Identities = 20/105 (19%), Positives = 39/105 (37%), Gaps = 19/105 (18%)

Query: 426 ILGNLITNAFE-AIERNHEHNKKVRMFVTDIGEEILIEVEDSGQGVHDEIITSIFYKGFS 484
++ L+ N + I + K+ + T + +EVE++G
Sbjct: 259 LVQTLVENGIKHGIAQL-PQGGKILLKGTKDNGTVTLEVENTGSLALKN----------- 306

Query: 485 TKAGEKRGYGLAKVKELVEDLNG---SIAIEKGDLGGALFIIALP 526
E G GL V+E ++ L G I + + G ++ +P
Sbjct: 307 --TKESTGTGLQNVRERLQMLYGTEAQIKLSEKQ-GKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0473  HTHFIS        76     4e-18    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 76.0 bits (187), Expect = 4e-18
Identities = 24/110 (21%), Positives = 45/110 (40%), Gaps = 2/110 (1%)

Query: 4 GIEVLIVEDDIRIAEIHRRFTEKIEGFKVIGTATTGEQAKEWLEFVKPQLVLLDVYLPDM 63
G +L+ +DD I + + + G+ V + W+ LV+ DV +PD
Sbjct: 3 GATILVADDDAAIRTVLNQALSR-AGYDVR-ITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 64 QGTELVTYIRHNLHDTDIIMITAASETDVVRHALRGGVTDYIVKPLTFDR 113
+L+ I+ D +++++A + A G DY+ KP
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTE 110


77  BcerKBAB4_0480  BcerKBAB4_0491  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_0480  -3      15       -0.426364  ABC transporter
BcerKBAB4_0481  -4      14       -0.901649  binding-protein-dependent transport system inner
BcerKBAB4_0482  -2      14       -0.793691  binding-protein-dependent transport system inner
BcerKBAB4_0483  -1      16       -0.538009  extracellular solute-binding protein
BcerKBAB4_0484  0       15       -1.089507  metallophosphoesterase
BcerKBAB4_0485  1       16       -1.418862  two component transcriptional regulator
BcerKBAB4_0486  1       16       -1.764476  histidine kinase
BcerKBAB4_0487  0       14       -1.446118  hypothetical protein
BcerKBAB4_0488  -1      13       -0.853171  hypothetical protein
BcerKBAB4_0489  -2      12       -0.791734  methyl-accepting chemotaxis sensory transducer
BcerKBAB4_0490  -2      11       -0.708107  sensory histidine kinase DcuS
BcerKBAB4_0491  -2      11       -0.459164  response regulator receiver
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0480  PF05272       37     1e-04    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 37.0 bits (85), Expect = 1e-04
Identities = 14/33 (42%), Positives = 16/33 (48%)

Query: 33 VLVGPSGCGKSTLLRMLAGLEEISSGDLIINEH 65
VL G G GKSTL+ L GL+ S I
Sbjct: 600 VLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTG 632


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0483  MALTOSEBP     37     1e-04    Maltose binding protein signature.
		>MALTOSEBP#Maltose binding protein signature.

Length = 396

Score = 37.0 bits (85), Expect = 1e-04
Identities = 71/327 (21%), Positives = 119/327 (36%), Gaps = 43/327 (13%)

Query: 131 IKKDKYDTSKLEKAITNYYSVDEKMYSMPFNSSTPVLIYNKDAFAKAGLDPEKAPQTYDE 190
I DK KL + + K+ + P LIYNKD P+T++E
Sbjct: 105 ITPDKAFQDKLYPFTWDAVRYNGKLIAYPIAVEALSLIYNKDLLP-------NPPKTWEE 157

Query: 191 LKEAAKKLTIKEGGNVKQYGFSMLNYGWFFEELLATQGALYVDNENGRKDAAKKAVFNGK 250
+ K+L K G + + + W L+A G ENG+ D V N
Sbjct: 158 IPALDKELKAK-GKSALMFNLQEPYFTW---PLIAADGGYAFKYENGKYDIKDVGVDNAG 213

Query: 251 EGQKVFGMLDELNKAGALGKYGASWDDIRAAFQSGQVAMYLDSSAGVRDLIDASKFNVGV 310
+ ++D + + AAF G+ AM ++ + ID SK N GV
Sbjct: 214 AKAGLTFLVDLIKNKHM--NADTDYSIAEAAFNKGETAMTINGPWAWSN-IDTSKVNYGV 270

Query: 311 SYIPYPEDSKQN---GVVIGGASLWMTNMVSEETQQGAWDFMKYLTKPDVQAKWHTATGY 367
+ +P + GV+ G + N K L K ++ T G
Sbjct: 271 TVLPTFKGQPSKPFVGVLSAGINAASPN--------------KELAKEFLENYLLTDEGL 316

Query: 368 FSINPD----AYNEPLVKEQYEKYPQLKVTVEQLQATKQSPATQGALISVFPESRDAVVK 423
++N D A +E+ K P++ T+E Q +G ++ P+
Sbjct: 317 EAVNKDKPLGAVALKSYEEELAKDPRIAATMENAQ--------KGEIMPNIPQMSAFWYA 368

Query: 424 ALEAMYDGKNSKEALDEAAKATDRAIS 450
A+ + + ++ +DEA K I+
Sbjct: 369 VRTAVINAASGRQTVDEALKDAQTRIT 395


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0485  HTHFIS        95     6e-25    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 95.3 bits (237), Expect = 6e-25
Identities = 36/143 (25%), Positives = 69/143 (48%), Gaps = 2/143 (1%)

Query: 2 RLLVVEDNASLLESIVQILRDE-FEVDTAMNGEDGLFLALQNIYDAILLDVMMPEMDGFE 60
+LV +D+A++ + Q L ++V N D ++ DV+MP+ + F+
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 61 VIQKIRDEKIETPVLFLTARDSLEDRVKGLDFGGDDYIVKPFQAPELKARI-RALLRRSG 119
++ +I+ + + PVL ++A+++ +K + G DY+ KPF EL I RAL
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 120 SLTTQQTIRYKGIELFGKDKDIQ 142
+ + G+ L G+ +Q
Sbjct: 125 RPSKLEDDSQDGMPLVGRSAAMQ 147


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0486  PF06580       36     3e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 35.6 bits (82), Expect = 3e-04
Identities = 26/155 (16%), Positives = 57/155 (36%), Gaps = 42/155 (27%)

Query: 268 LLEEI--INPYKEI--ASYQEKAMILKVERDITFMGDRERIHQMMV------ILLDNAMK 317
L +E+ ++ Y ++ ++++ L+ E I I + V L++N +K
Sbjct: 218 LADELTVVDSYLQLASIQFEDR---LQFENQIN-----PAIMDVQVPPMLVQTLVENGIK 269

Query: 318 Y----TNEDGHIQIDCTQTSNSIRIRVKDNGIGVKEEDIPNLFDRFYQGDKARSMSEGAG 373
+ + G I + T+ + ++ + V++ G + E G
Sbjct: 270 HGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTK-----------------ESTG 312

Query: 374 LGLSIANWIVEKHYGK---ISVESKWGEGTCFEVI 405
GL ++ YG I + K G+ +I
Sbjct: 313 TGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLI 347


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0490  PF06580       38     1e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 37.5 bits (87), Expect = 1e-04
Identities = 19/99 (19%), Positives = 41/99 (41%), Gaps = 19/99 (19%)

Query: 434 LIDNALEAVIHCKKKQVEVGIQYQ---NALTITVQDTGKGIEEDAVDALFTKGYSTKGDN 490
L++N ++ I + ++ ++ +T+ V++TG ++ ++
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKN------------TKES 310

Query: 491 RGYGLYLVKESIQRINGE---IHISSLLGEGTTITIEIP 526
G GL V+E +Q + G I +S G + IP
Sbjct: 311 TGTGLQNVRERLQMLYGTEAQIKLSEKQG-KVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0491  HTHFIS        82     4e-20    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 81.8 bits (202), Expect = 4e-20
Identities = 26/105 (24%), Positives = 50/105 (47%), Gaps = 2/105 (1%)

Query: 2 IKVLIVEDDPMVAMLNTHYLEQVGGFELVHAVNSVKEAIEVLERSRVDFILLDIFMPEET 61
+L+ +DD + + L + G V ++ + D ++ D+ MP+E
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAG--YDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 62 GFELLMYIRNQEKEIDIMMISAVHDMGSIKKALQYGVVDYLIKPF 106
F+LL I+ ++ ++++SA + + KA + G DYL KPF
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPF 106


78  BcerKBAB4_0497  BcerKBAB4_0501  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_0497  -2      11       0.318191   N-acetyltransferase GCN5
BcerKBAB4_0498  -2      10       0.215364   histidine kinase
BcerKBAB4_0499  -2      10       0.959274   two component transcriptional regulator
BcerKBAB4_0500  -2      8        0.055368   N-acetyltransferase GCN5
BcerKBAB4_0501  0       9        0.465837   formate dehydrogenase accessory protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0497  SACTRNSFRASE  37     1e-05    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 36.8 bits (85), Expect = 1e-05
Identities = 20/87 (22%), Positives = 31/87 (35%), Gaps = 4/87 (4%)

Query: 59 GAFKDGTLIGVATLETKPYVKQEHKAKIGSVYVSPKARGLGAGKALIKECLELAKSLEVE 118
+ + IG + + A I + V+ R G G AL+ + +E AK
Sbjct: 69 LYYLENNCIGRIKIRSN----WNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFC 124

Query: 119 QVMLDVVVGNDGAKKLYESLGFKTFGV 145
+ML+ N A Y F V
Sbjct: 125 GLMLETQDINISACHFYAKHHFIIGAV 151


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0499  HTHFIS        94     1e-24    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 94.1 bits (234), Expect = 1e-24
Identities = 34/121 (28%), Positives = 63/121 (52%), Gaps = 1/121 (0%)

Query: 3 KILLVDDEERMLRLLDLFLSPRGYFCMKATSGLEALELIEQKDFDIILLDVMMPNMDGWD 62
IL+ DD+ + +L+ LS GY ++ I D D+++ DV+MP+ + +D
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 63 TCYQIRQI-SNVPIIMLTARNQNYDMVKGLTMGADDYITKPFDEHVLVARIEAILRRTKK 121
+I++ ++P+++++A+N +K GA DY+ KPFD L+ I L K+
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 122 D 122

Sbjct: 125 R 125


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0500  SACTRNSFRASE  48     2e-09    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 47.6 bits (113), Expect = 2e-09
Identities = 30/119 (25%), Positives = 42/119 (35%), Gaps = 7/119 (5%)

Query: 27 EAFSSSYEDVLKQENPVAAMAKRLSNPDKYTLGVFKDNDLIGIATLETKPFIKQEHKAKI 86
E FS Y KQ + K + +N+ IG + + A I
Sbjct: 40 ERFSKPY---FKQYEDDDMDVSYVEEEGKAAFLYYLENNCIGRIKIRSN----WNGYALI 92

Query: 87 GSVFVSPKARGLGAGRALIKAIIENADKLHVEQLMLDVVADNTAAKKLYESLGFQTYGV 145
+ V+ R G G AL+ IE A + H LML+ N +A Y F V
Sbjct: 93 EDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFIIGAV 151


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0501  AEROLYSIN     30     0.015    Aerolysin signature.
		>AEROLYSIN#Aerolysin signature.

Length = 493

Score = 29.6 bits (66), Expect = 0.015
Identities = 19/48 (39%), Positives = 27/48 (56%), Gaps = 5/48 (10%)

Query: 215 SKIGCEIVLSK---SAPTKLALQLAHDLGITVVGFIRNGSCNIYTHPE 259
SKI +I L K S P + +++DL T+ GF+R G YTHP+
Sbjct: 312 SKIPVKIELYKADISYPYEFKADVSYDL--TLSGFLRWGGNAWYTHPD 357


79  BcerKBAB4_0529  BcerKBAB4_0535  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_0529  0       14       1.009675   periplasmic binding protein
BcerKBAB4_0530  1       15       1.496493   transport system permease
BcerKBAB4_0531  0       16       1.571449   transport system permease
BcerKBAB4_0532  0       14       0.576033   ABC transporter
BcerKBAB4_0533  0       18       -0.087624  methyltransferase type 11
BcerKBAB4_0534  -1      18       -0.225395  2-amino-3-ketobutyrate coenzyme A ligase
BcerKBAB4_0535  -2      16       0.622491   NAD-dependent epimerase/dehydratase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0529  FERRIBNDNGPP  96     6e-25    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 95.8 bits (238), Expect = 6e-25
Identities = 62/255 (24%), Positives = 106/255 (41%), Gaps = 27/255 (10%)

Query: 58 NPKRVVILTNEGTEALLELGVKPVGAV-----KSWTGDPWYPHIKDKMKDVKVVGDEGQV 112
+P R+V L E LL LG+ P G + W +P P V VG +
Sbjct: 34 DPNRIVALEWLPVELLLALGIVPYGVADTINYRLWVSEPPLP------DSVIDVGLRTEP 87

Query: 113 NVETIASLKPDLIIGNKMRHEKVYEQLKAIAPTV---FSETLR--GEWKDNFTFYAKALN 167
N+E + +KP ++ + + E L IAP FS+ + + + T A LN
Sbjct: 88 NLELLTEMKPSFMVWS-AGYGPSPEMLARIAPGRGFNFSDGKQPLAMARKSLTEMADLLN 146

Query: 168 KEKEGQKVLANYEKRMKDLKGQLGDKVNQEISMVRFM-PGDVRIYHGDTFSGVILKELGF 226
+ + LA YE ++ +K + + + + + + P + ++ ++ IL E G
Sbjct: 147 LQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDPRHMLVFGPNSLFQEILDEYGI 206

Query: 227 KRPGDQNKDDFAERNVSKERISAM-DGDVLFYFTFDKGNEKKGSELEKEYINDPLFKNLN 285
+ + VS +R++A D DVL FD N K L PL++ +
Sbjct: 207 PNAWQGETNFWGSTAVSIDRLAAYKDVDVLC---FDHDNSKDMDALMA----TPLWQAMP 259

Query: 286 AVKNGKAYKVDDVIW 300
V+ G+ +V V W
Sbjct: 260 FVRAGRFQRVPAV-W 273


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0530  TYPE3IMSPROT  33     0.001    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 33.2 bits (76), Expect = 0.001
Identities = 24/173 (13%), Positives = 65/173 (37%), Gaps = 16/173 (9%)

Query: 100 GAAFFIVVAIVLFSVTSLSAFTWIAFL-------GAAVAAVLVFASSSLGKEGTTPLKLT 152
A + ++ +L ++ + + + L + ++ E
Sbjct: 31 STALIVALSAMLMGLSDYYFEHFSKLMLIPAEQSYLPFSQALSYVVDNVLLEFFYLCFPL 90

Query: 153 LAGVAISALFSSLTQGLLVLNEKAFE------EVLFWLAGSVQGRKL-EILQSVFPYLLI 205
L A+ A+ S + Q +++ +A + + + L E L+S+ +L+
Sbjct: 91 LTVAALMAIASHVVQYGFLISGEAIKPDIKKINPIEGAKRIFSIKSLVEFLKSILKVVLL 150

Query: 206 GWIASFMMAGKVNTLMMGEDVAKGLGQRTILMKSFVLLIIVLLSGGSVAVAGP 258
+ ++ G + TL+ + G+ T L+ + ++V+ + G V ++
Sbjct: 151 SILIWIIIKGNLVTLL--QLPTCGIECITPLLGQILRQLMVICTVGFVVISIA 201


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0531  BORPETOXINA   29     0.021    Bordetella pertussis toxin A subunit signature.
		>BORPETOXINA#Bordetella pertussis toxin A subunit signature.

Length = 269

Score = 29.4 bits (65), Expect = 0.021
Identities = 13/31 (41%), Positives = 19/31 (61%)

Query: 286 PHISRRLVGSFYGALLPVAAIIGAILVLAAD 316
P+ SRR V S G L+ +A +IGA + A+
Sbjct: 211 PYTSRRSVASIVGTLVRMAPVIGACMARQAE 241


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0535  NUCEPIMERASE  87     1e-21    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 87.1 bits (216), Expect = 1e-21
Identities = 57/241 (23%), Positives = 102/241 (42%), Gaps = 17/241 (7%)

Query: 3 KILVTGSLGQIGSELVMKLRD----VYGASNVIA---TDIRETDSEVVTSGPFE--TLDV 53
K LVTG+ G IG + +L + V G N+ +++ E++ F+ +D+
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 54 TDGQKLHDIAKRNEVDTIIHLAALLSAT-AEKNPLFAWNLNMGGLVNALEAARELNCKFF 112
D + + D+ + + L+ + +NP + N+ G +N LE R +
Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQHL 121

Query: 113 T-PSSIGAFGPSTPKDNTPQDTIQRPTTMYGVNKVAGELLCDYYHQKFGVDTRGVRFPGL 171
SS +G + + D++ P ++Y K A EL+ Y +G+ G+RF
Sbjct: 122 LYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRF--- 178

Query: 172 ISYVAPPGGGTTDYAVEIYYEAIKKGTYTSYIAEGTYM-DMMYMPDALQAIISLMEADPN 230
V P G D A+ + +A+ +G G D Y+ D +AII L + P+
Sbjct: 179 -FTVYGP-WGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIPH 236

Query: 231 K 231

Sbjct: 237 A 237


80  BcerKBAB4_0741  BcerKBAB4_0747  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_0741  1       15       -0.485366  acetyltransferase
BcerKBAB4_0742  0       14       -0.042369  short-chain dehydrogenase/reductase SDR
BcerKBAB4_0743  0       15       0.174632   MerR family transcriptional regulator
BcerKBAB4_0744  -1      17       0.766175   small multidrug resistance protein
BcerKBAB4_0745  -2      19       -0.052516  small multidrug resistance protein
BcerKBAB4_0746  -2      20       0.212394   TetR family transcriptional regulator
BcerKBAB4_0747  -3      18       0.531880   major facilitator transporter
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0741  SACTRNSFRASE  33     2e-04    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 33.0 bits (75), Expect = 2e-04
Identities = 17/83 (20%), Positives = 30/83 (36%), Gaps = 11/83 (13%)

Query: 42 LYVMKEGEEVVGVAGLHVLGEDLAEVRSLVVSHTYAGKGIGRKLVNHVMNEAAKIKVNRV 101
++ +G + A + + V+ Y KG+G L++ + A K N
Sbjct: 67 AFLYYLENNCIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWA---KENHF 123

Query: 102 ISLTYETE--------FFQKCGF 116
L ET+ F+ K F
Sbjct: 124 CGLMLETQDINISACHFYAKHHF 146


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0742  DHBDHDRGNASE  47     3e-08    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 46.6 bits (110), Expect = 3e-08
Identities = 50/223 (22%), Positives = 80/223 (35%), Gaps = 18/223 (8%)

Query: 2 KYTVITGASSGIGYESALAFASRGKNLILV--ARRQAELEELKLKINEMHPE---LDVVI 56
K ITGA+ GIG A AS+G ++ V + E LK H E DV
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADV-- 66

Query: 57 RRTDLSVTEEVYKFYESLQSFQIETWINNAGFGNFASIAEQNLNKIETMLHVNIEALTIL 116
R ++ E + + I +N AG I + + E VN +
Sbjct: 67 -RDSAAIDEITARIEREMGPIDI--LVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNA 123

Query: 117 SSLFVRDYSMTAGTQLINVSSGGGYTIVADAVTYCATKFYVSAFTEGLSHELKEQGAKLQ 176
S + ++ V S Y ++K FT+ L EL E ++
Sbjct: 124 SRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYN--IR 181

Query: 177 AKVLAPAATETEFAKRSFDIDEFQYDNVVPKFHTAKQMAQFML 219
+++P +TET+ + S DE + V+ + F
Sbjct: 182 CNIVSPGSTETDM-QWSLWADENGAEQVIKGS-----LETFKT 218


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0743  HTHTETR       27     0.013    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 27.3 bits (60), Expect = 0.013
Identities = 19/87 (21%), Positives = 38/87 (43%), Gaps = 12/87 (13%)

Query: 3 TISEVAKLLGVSTHTLRYY--EKENILIANRDVNGNRLYEESHIKWLQFVMKL--KQTQM 58
++ E+AK GV+ + ++ +K ++ ++E S + ++ K
Sbjct: 33 SLGEIAKAAGVTRGAIYWHFKDKSDLFSE--------IWELSESNIGELELEYQAKFPGD 84

Query: 59 PIAKIREYARLYLEGEHTTEARLQLLE 85
P++ +RE LE T E R L+E
Sbjct: 85 PLSVLREILIHVLESTVTEERRRLLME 111


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0746  HTHTETR       78     4e-20    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 77.7 bits (191), Expect = 4e-20
Identities = 29/168 (17%), Positives = 56/168 (33%), Gaps = 8/168 (4%)

Query: 1 MNKKEKIVYAAIEVFQEKGVEKTKISDIVKLAGIAQGTFYLYFPSKLSVMPAIAEVMVEK 60
++ I+ A+ +F ++GV T + +I K AG+ +G Y +F K + I E+
Sbjct: 10 QETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESN 69

Query: 61 MILAVKEKVQNDSPFSSK-VEQVIDAVFTFIAEYREIQALMYAGLASTEHIKEWEAV--- 116
+ E + +++ V + LM E + E V
Sbjct: 70 IGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQA 129

Query: 117 ----YEPLYIWLSDFLSEAKESGEIRDSVHAERTAKLFIALVESAAEQ 160
Y + L E+ + + R A + + E
Sbjct: 130 QRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMEN 177


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0747  TCRTETA       256    5e-84    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 256 bits (655), Expect = 5e-84
Identities = 93/380 (24%), Positives = 170/380 (44%), Gaps = 15/380 (3%)

Query: 12 LIILLSNIFIAFLGIGLIIPVMPSFMNDMNLTGK---TMGYLVAVFAMAQLITSPITGRW 68
LI++LS + + +GIGLI+PV+P + D+ + G L+A++A+ Q +P+ G
Sbjct: 7 LIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGAL 66

Query: 69 VDIYGRKKMIIIGLFIFGVSELLFGLGTDVWMLYAARVLGGISAAFIMPGVTAYVADITS 128
D +GR+ ++++ L V + +W+LY R++ GI+ A AY+ADIT
Sbjct: 67 SDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGA-TGAVAGAYIADITD 125

Query: 129 IQERPKAMGYLSAAISTGFIIGPGIGGFIAEYGIRVPFFVAAVIAFVACVISIFILKEPL 188
ER + G++SA G + GP +GG + + PFF AA + + + F+L E
Sbjct: 126 GDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESH 185

Query: 189 TKEE--LAEISANTKESSFIGDLKKSLNPMYAIAFIIVFVLAFGLSAYETVFSLFSDHKF 246
E L + N S + + A+ FI+ V ++ +F + +F
Sbjct: 186 KGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVP----AALWVIFGEDRF 241

Query: 247 GFTPKDIAAIITISSIFGVVVQVFMFGKLVDMFGEKVLIQICLIVGAVLAFVSTVVFNYW 306
+ I + I + Q + G + GE+ + + +I + W
Sbjct: 242 HWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGW 301

Query: 307 IVLLVTCFIFLAFDLLRPALTTFLSKAAGKE-QGFVAGMNSTYTSLGNIAGPAMGGILFD 365
+ + + + + PAL LS+ +E QG + G + TSL +I GP + ++
Sbjct: 302 MAFPIM-VLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYA 360

Query: 366 ININYPYAFSGVVLIVGLGI 385
+I ++G I G +
Sbjct: 361 ASITT---WNGWAWIAGAAL 377


81  BcerKBAB4_0809  BcerKBAB4_0812  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_0809  -2      18       2.538403   Beta-lactamase
BcerKBAB4_0810  0       21       2.025894   general substrate transporter
BcerKBAB4_0811  0       22       -0.059081  signal transduction histidine kinase regulating
BcerKBAB4_0812  3       18       -1.514248  response regulator receiver
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0809  BLACTAMASEA   356    e-126    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 356 bits (916), Expect = e-126
Identities = 91/291 (31%), Positives = 150/291 (51%), Gaps = 17/291 (5%)

Query: 20 VLLSCVSLIGCSNSNTQSEPPKQTNQANQIKQENTGNQSFAKLEKEYDAKLGIYALDTGT 79
+ L +SL+ + P + E + ++G+ +D +
Sbjct: 4 IRLCIISLLATLPLAVHASPQPL--------------EQIKLSESQLSGRVGMIEMDLAS 49

Query: 80 NQTV-AYHSDDRFAFASTSKSLAVGALLRKNSL--EALDQRITYTHEDLSNYNPITEKHV 136
+T+ A+ +D+RF ST K + GA+L + E L+++I Y +DL +Y+P++EKH+
Sbjct: 50 GRTLTAWRADERFPMMSTFKVVLCGAVLARVDAGDEQLERKIHYRQQDLVDYSPVSEKHL 109

Query: 137 DTGMTLKELADASVRYSDSTAHNLILKQLGGPSEFEKILREMGDTVTTSERFEPELNEVH 196
GMT+ EL A++ SD++A NL+L +GGP+ LR++GD VT +R+E ELNE
Sbjct: 110 ADGMTVGELCAAAITMSDNSAANLLLATVGGPAGLTAFLRQIGDNVTRLDRWETELNEAL 169

Query: 197 PGETHDTSTPEAIAKTLQSFTLGTALPIEKRELLVDWMKRNTTGDNLIRAGVPKGWEVAD 256
PG+ DT+TP ++A TL+ L + L+ WM + LIR+ +P GW +AD
Sbjct: 170 PGDARDTTTPASMAATLRKLLTSQRLSARSQRQLLQWMVDDRVAGPLIRSVLPAGWFIAD 229

Query: 257 KTGAGSYGTRNDIAIIWPPNKKPIVLAILSNHDKEDAKYDDKLIADATKVV 307
KTGAG G R +A++ P NK ++ I ++ IA +
Sbjct: 230 KTGAGERGARGIVALLGPNNKAERIVVIYLRDTPASMAERNQQIAGIGAAL 280


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0810  TCRTETA       34     8e-04    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 34.4 bits (79), Expect = 8e-04
Identities = 30/129 (23%), Positives = 57/129 (44%), Gaps = 16/129 (12%)

Query: 40 EFFPKGDPTSQLLNTAAIFAVGFLMRPIGSLLMGRYADRHGRRAALTLSITVMAGGSLII 99
+ D T+ A++A LM+ + ++G +DR GRR L +S+ A I+
Sbjct: 34 DLVHSNDVTAHYGILLALYA---LMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIM 90

Query: 100 ACTPSYESIGIMAPIILVLARLLQGLSLGGEYGTSATYLSEMASSGRR----GFYSSFQY 155
A P +L + R++ G++ G + Y++++ R GF S+
Sbjct: 91 ATAPFLW--------VLYIGRIVAGIT-GATGAVAGAYIADITDGDERARHFGFMSACFG 141

Query: 156 VTLVAGQMV 164
+VAG ++
Sbjct: 142 FGMVAGPVL 150



Score = 29.0 bits (65), Expect = 0.037
Identities = 21/82 (25%), Positives = 39/82 (47%), Gaps = 11/82 (13%)

Query: 285 VVLQPIAGLLSDKIGRRPLLMAFGILGTLLTAPIFFFMEKTTEPMVAFLLMMVGLII--V 342
P+ G LSD+ GRRP+L+ +L A + + + T ++ +G I+ +
Sbjct: 57 FACAPVLGALSDRFGRRPVLLV-----SLAGAAVDYAIMATAP---FLWVLYIGRIVAGI 108

Query: 343 TGYT-SINAIVKAELFPTEIRA 363
TG T ++ A++ + RA
Sbjct: 109 TGATGAVAGAYIADITDGDERA 130


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0811  PF06580       36     3e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 36.0 bits (83), Expect = 3e-04
Identities = 23/131 (17%), Positives = 53/131 (40%), Gaps = 24/131 (18%)

Query: 397 EKKIDFHIEGDSALHPLPDHIKVSHLITILGNIIDNAFD-AVSERGEKN-VSFFVTDIGH 454
E ++ F + + A+ ++V ++ + +++N +++ + + T
Sbjct: 237 EDRLQFENQINPAIM----DVQVPPML--VQTLVENGIKHGIAQLPQGGKILLKGTKDNG 290

Query: 455 DIVFEVIDSGAGILAEKITNIFQKGFSTKGNDRGYGLANVKEMVDLL---EGTIEIQNEK 511
+ EV ++G+ L G GL NV+E + +L E I++ +EK
Sbjct: 291 TVTLEVENTGSLALKNT------------KESTGTGLQNVRERLQMLYGTEAQIKL-SEK 337

Query: 512 NGGAIFTIYLP 522
G + +P
Sbjct: 338 QGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0812  HTHFIS        63     1e-13    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 62.5 bits (152), Expect = 1e-13
Identities = 37/151 (24%), Positives = 69/151 (45%), Gaps = 5/151 (3%)

Query: 3 KVAIAEDDFRVAQIQEEFLSKIK-DVKVIGKALNAKETMELLQKEEIDLLLLDNYLPDGI 61
+ +A+DD + + + LS+ DV++ NA + + DL++ D +PD
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITS---NAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 62 GTDLLPKIHADFPNVDVIMVTAANENHMLEKAIRNGVSNYLIKPVTLEKFVRTIEDYKRK 121
DLLP+I P++ V++++A N KA G +YL KP L + + I +
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 122 KQLLHSNNEVNQEIIDNFFGTS-QIQDMKNL 151
+ S E + + G S +Q++ +
Sbjct: 122 PKRRPSKLEDDSQDGMPLVGRSAAMQEIYRV 152


82  BcerKBAB4_0902  BcerKBAB4_0910  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_0902  0       12       -3.719531  serine-protein kinase RsbW
BcerKBAB4_0903  0       13       -3.933348  RNA polymerase sigma factor SigB
BcerKBAB4_0904  0       13       -4.626383  ferritin Dps family protein
BcerKBAB4_0905  0       13       -4.222871  response regulator receiver modulated serine
BcerKBAB4_0906  0       13       -4.170343  chemotaxis protein CheR
BcerKBAB4_0907  0       14       -3.409951  GAF sensor hybrid histidine kinase
BcerKBAB4_0908  1       22       -2.322374  hypothetical protein
BcerKBAB4_0909  3       16       -0.620415  hypothetical protein
BcerKBAB4_0910  2       13       0.413235   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0902  PF06580       27     0.040    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 26.8 bits (59), Expect = 0.040
Identities = 6/35 (17%), Positives = 13/35 (37%), Gaps = 2/35 (5%)

Query: 53 NIVQHAY--KEDVGEITIVFGLYEDRLEIMVADNG 85
N ++H G+I + + + V + G
Sbjct: 266 NGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTG 300


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0905  HTHFIS        83     1e-19    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 83.3 bits (206), Expect = 1e-19
Identities = 35/147 (23%), Positives = 74/147 (50%), Gaps = 12/147 (8%)

Query: 2 SILIVDDNPVNIFVIEKILKQAGYQDLVSLNSAQELFEYIHFGKDSSRHNEIDLILLDIM 61
+IL+ DD+ V+ + L +AGY D+ ++A L+ +I + DL++ D++
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGY-DVRITSNAATLWRWI-------AAGDGDLVVTDVV 56

Query: 62 MPEIDGLEVCRRLQNEEKFKDIPIIFVTALEDANKLAEALDIGAMDYITKPINKVELLAR 121
MP+ + ++ R++ D+P++ ++A +A + GA DY+ KP + EL+
Sbjct: 57 MPDENAFDLLPRIKKARP--DLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGI 114

Query: 122 MRVALRLKSELNWHKEQEENLRNELDL 148
+ AL + E++ ++ + L
Sbjct: 115 IGRALAEPKRR--PSKLEDDSQDGMPL 139


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0906  VACCYTOTOXIN  28     0.041    Helicobacter pylori vacuolating cytotoxin signature.
		>VACCYTOTOXIN#Helicobacter pylori vacuolating cytotoxin signature.

Length = 1291

Score = 28.5 bits (63), Expect = 0.041
Identities = 22/76 (28%), Positives = 30/76 (39%), Gaps = 4/76 (5%)

Query: 184 SNYYSTDNRFAYFNPSLLQN-IIFAQHNLVTDQSFNEFHIILCRNVLIYFTSKLQNQVQQ 242
+ YY D + Y N +LQ F N V S N F + RN L + +
Sbjct: 1201 ARYYYGDTSYFYMNAGVLQEFANFGSSNAV---SLNTFKVNATRNPLNTHARVMMGGELK 1257

Query: 243 LFYESLGHNGFLCLGN 258
L E + GF+ L N
Sbjct: 1258 LAKEVFLNLGFVYLHN 1273


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0907  HTHFIS        73     2e-15    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 72.9 bits (179), Expect = 2e-15
Identities = 26/107 (24%), Positives = 51/107 (47%), Gaps = 3/107 (2%)

Query: 777 TILIVDDDHRNIFALQNALEKQHANIITAQNGIECLEILKSNTNIDLILMDIMMPNMDGY 836
TIL+ DDD L AL + ++ N + + DL++ D++MP+ + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDG-DLVVTDVVMPDENAF 63

Query: 837 ETMEHIRMNLGLHEIPIIALTAKAMPNDKEKCLSAGASDYISKPLNL 883
+ + I+ ++P++ ++A+ K GA DY+ KP +L
Sbjct: 64 DLLPRIKK--ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDL 108


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0910  PF07132       29     0.015    Harpin protein (HrpN)
		>PF07132#Harpin protein (HrpN)

Length = 356

Score = 28.5 bits (63), Expect = 0.015
Identities = 22/79 (27%), Positives = 36/79 (45%), Gaps = 6/79 (7%)

Query: 51 SGEKVNSETAHKADIFSATGLVAGGVAGGLGGLLTGLGILAVSGMGPIVAAGPIAAAIGG 110
G + ++ +DI + + + GGLGG L GLG G ++ G G
Sbjct: 40 FGGQRSNIAEQLSDIMTTMMFMGSMMGGGLGGGLGGLGSSLGGLGGGLLGGG------LG 93

Query: 111 AGIGGGAGSLIGAFIGLGI 129
G+G GS +G+ +G G+
Sbjct: 94 GGLGSSLGSGLGSALGGGL 112


83  BcerKBAB4_0974  BcerKBAB4_0983  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_0974  -2      14       -0.682290  TetR family transcriptional regulator
BcerKBAB4_0975  -2      13       -0.430694  hypothetical protein
BcerKBAB4_0976  -2      12       -0.476296  membrane-flanked domain-containing protein
BcerKBAB4_0977  4       17       -1.140043  membrane-flanked domain-containing protein
BcerKBAB4_0978  5       17       -1.470004  hypothetical protein
BcerKBAB4_0979  6       17       -1.676598  hypothetical protein
BcerKBAB4_0980  6       17       -1.338718  hypothetical protein
BcerKBAB4_0981  4       14       -1.002152  TetR family transcriptional regulator
BcerKBAB4_0982  4       14       -0.768741  cell wall anchor domain-containing protein
BcerKBAB4_0983  1       14       0.923140   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0974  HTHTETR       75     7e-19    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 74.7 bits (183), Expect = 7e-19
Identities = 31/160 (19%), Positives = 65/160 (40%), Gaps = 5/160 (3%)

Query: 6 QTSQNIVEASFKLMAEHGIEKMSLSMIAKEVGISKPAIYYHFSSKEALVDFLFEEIFS-- 63
+T Q+I++ + +L ++ G+ SL IAK G+++ AIY+HF K L ++E S
Sbjct: 11 ETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNI 70

Query: 64 GYHFASYFDKEQYTKENFAEKLIADGLHMLSEYEGQEGILRVINEFIVTASRNEKYQKRL 123
G Y K + +++ L E + ++ +I Q+
Sbjct: 71 GELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQ 130

Query: 124 FEIQEDFLNGFHDLLKKGVELDVVSQQATEENAHTLALVI 163
+ + + LK +E ++ + A+++
Sbjct: 131 RNLCLESYDRIEQTLKHCIEAKML---PADLMTRRAAIIM 167


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0975  FLGMRINGFLIF  27     0.003    Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 26.9 bits (59), Expect = 0.003
Identities = 8/33 (24%), Positives = 15/33 (45%)

Query: 16 YKIPGMIEAFQADKGWLALISLVWLLWFGYFIP 48
++ I+ A WL ++ + W+LW P
Sbjct: 449 WQQQSFIDQLLAAGRWLLVLVVAWILWRKAVRP 481


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0977  60KDINNERMP   29     0.041    60kDa inner membrane protein signature.
		>60KDINNERMP#60kDa inner membrane protein signature.

Length = 548

Score = 29.1 bits (65), Expect = 0.041
Identities = 14/76 (18%), Positives = 29/76 (38%), Gaps = 24/76 (31%)

Query: 15 KITDFIPLLIFMFSLNGKFPFWYLIPAGFGLLTIFSAFEKWYYTTYWVENNVLHVKQGLF 74
KI F+P++ +F L P+G L Y++ +N++ + Q
Sbjct: 493 KIMTFMPVIFTVFFLW--------FPSGLVL--------------YYIVSNLVTIIQQQL 530

Query: 75 VKKESYLNKERVQTIN 90
+ + L K + +
Sbjct: 531 IYRG--LEKRGLHSRE 544


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0981  HTHTETR       57     1e-12    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 57.3 bits (138), Expect = 1e-12
Identities = 26/86 (30%), Positives = 39/86 (45%), Gaps = 4/86 (4%)

Query: 6 TRQKILAAASQIVQFKGVAKLTLEAVAKEAGVSKGGLLYHFSNKEALIEGMILKGTEEYH 65
TRQ IL A ++ +GV+ +L +AK AGV++G + +HF +K L + E
Sbjct: 12 TRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIW----ELSE 67

Query: 66 GAIHNRVTEDTEKKGRWIRSFVEERL 91
I E K S + E L
Sbjct: 68 SNIGELELEYQAKFPGDPLSVLREIL 93


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0982  TONBPROTEIN   40     7e-05    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 39.6 bits (92), Expect = 7e-05
Identities = 21/79 (26%), Positives = 34/79 (43%), Gaps = 7/79 (8%)

Query: 2038 LAPPGPEKPDPEKPEKPDPEKPEKPDPEKPGTTDPEKPGTTDPEKPETTDPEKPGTTDPE 2097
L PP +P PE +P+PE P+P E P + KP+ KP E
Sbjct: 55 LEPPQAVQPPPEPVVEPEPEPEPIPEPP------KEAPVVIEKPKPKPKPKPKPVKKVQE 108

Query: 2098 KPEKELPKTGQKMPVEPYM 2116
+P++++ + P P+
Sbjct: 109 QPKRDVKPVESR-PASPFE 126


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0983  IGASERPTASE   36     2e-04    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 35.8 bits (82), Expect = 2e-04
Identities = 25/102 (24%), Positives = 43/102 (42%)

Query: 170 TKIGEEEKTAKSVTTAAKAQKNDAVTSQKVATEKNQKAETEKNQKAVTEKNQKTETEKNQ 229
T+ E +S T Q T+Q K K+ + N + +ET++ Q
Sbjct: 1037 TETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQ 1096

Query: 230 KVVTEKNQKTETEKNQKVVTEKNQKVETGKNQDAFTFEKAKE 271
T++ E E+ KV TEK Q+V +Q + E+++
Sbjct: 1097 TTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSET 1138



Score = 31.2 bits (70), Expect = 0.007
Identities = 30/142 (21%), Positives = 47/142 (33%), Gaps = 6/142 (4%)

Query: 170 TKIGEEEKTAKSVTTAAKAQKNDAVTSQKVATEKNQ---KAETEKNQKAVTEKNQK--TE 224
I + + S V AT AE K + EKN++ TE
Sbjct: 1001 NNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATE 1060

Query: 225 TEKNQKVVTEKNQKTETEKNQKVVTEKNQKVETGKNQDAFTFEKAKEYIKNEYKEEYNYT 284
T + V ++ K+ + N + ET + Q T E A + + K E T
Sbjct: 1061 TTAQNREVAKE-AKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKT 1119

Query: 285 LEDTQVENGKKYYQIRVRTTYK 306
E +V + Q + T
Sbjct: 1120 QEVPKVTSQVSPKQEQSETVQP 1141


84  BcerKBAB4_0995  BcerKBAB4_1002  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_0995  -2      11       -2.317880  TetR family transcriptional regulator
BcerKBAB4_0996  -1      13       -2.443863  ABC-2 type transporter
BcerKBAB4_0997  4       18       -5.115334  hypothetical protein
BcerKBAB4_0998  1       18       -2.292759  group-specific protein
BcerKBAB4_0999  1       18       -1.575744  hypothetical protein
BcerKBAB4_1000  -2      15       -0.265944  integral membrane protein
BcerKBAB4_1001  -1      16       0.676028   PlcR-regulated protein PRP2
BcerKBAB4_1002  0       15       1.083790   N-acetyltransferase GCN5
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0995  HTHTETR       79     2e-20    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 78.5 bits (193), Expect = 2e-20
Identities = 42/194 (21%), Positives = 81/194 (41%), Gaps = 7/194 (3%)

Query: 2 AIDRKRSIIEAATKSFSAFGYKATTMDQVAKLANVGKGTIYTFFKNKEELFGEIISNLIT 61
A + ++ I++ A + FS G +T++ ++AK A V +G IY FK+K +LF EI +
Sbjct: 9 AQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSES 68

Query: 62 EMKQVAENAIRSDIS-----FFENVHRALYSILEFRKEHQLMIKLIQEERDMGTKE-VQE 115
+ ++ E + L S + + LM + + +G VQ+
Sbjct: 69 NIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQ 128

Query: 116 VMQQVDVEIVSVIQSYLEIAIEKGEISK-CDPEITAFIMLRLYVSLIFDWEKNHEPLEKE 174
+ + +E I+ L+ IE + A IM L+ +W + + +
Sbjct: 129 AQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDLK 188

Query: 175 KISELFELYLLKGL 188
K + + LL+
Sbjct: 189 KEARDYVAILLEMY 202


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_0996  ABC2TRNSPORT  35     0.001    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 34.5 bits (79), Expect = 0.001
Identities = 31/103 (30%), Positives = 54/103 (52%), Gaps = 4/103 (3%)

Query: 698 SIPYFILFSIVTSLAFISLIQCLVTAFGDA-GRFIAIITLIIQ--LTTSAGTFPLELIPK 754
S+ Y + +T LAF SL +VTA + FI TL+I L S FP++ +P
Sbjct: 146 SLLYALPVIALTGLAFASL-GMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPI 204

Query: 755 YLQPFNAWLPMTYSVSGLKAVVSSGDFNFMWQNIGILMIFIVV 797
Q +LP+++S+ ++ ++ + Q++G L I+IV+
Sbjct: 205 VFQTAARFLPLSHSIDLIRPIMLGHPVVDVCQHVGALCIYIVI 247


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1000  ACRIFLAVINRP  31     0.004    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 31.0 bits (70), Expect = 0.004
Identities = 19/97 (19%), Positives = 37/97 (38%), Gaps = 11/97 (11%)

Query: 15 PSIFGIFTSPTLQFKRMKNQRNILFPLALLMVLIIISSALMSWN-----------ALNNP 63
I +T + Q + NQ L ++ ++V + +++ SW+ +
Sbjct: 852 AGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGV 911

Query: 64 ALSMFHDKTGFAVPKYITFFTTFGFSAVSGIVAILFA 100
L+ V + TT G SA + I+ + FA
Sbjct: 912 LLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFA 948


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1002  SACTRNSFRASE  38     6e-06    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 38.0 bits (88), Expect = 6e-06
Identities = 24/117 (20%), Positives = 45/117 (38%), Gaps = 6/117 (5%)

Query: 25 TTTKFFISSPNKLPNDIDSEREKIQTSSEKGNLYIVYEVDSKVVGFLVFKRYELERLRHA 84
T T+ S P D + + E + +Y +++ +G + + +A
Sbjct: 36 TYTEERFSKP-YFKQYEDDDMDVSYVEEEGKAAF-LYYLENNCIGRIKIRS---NWNGYA 90

Query: 85 GTMGMGIREAYCNQGIGTKLIEFLISWAKGQKGLEKICLGVVSINDRAIKVYKRTGF 141
+ + + Y +G+GT L+ I WAK + + L IN A Y + F
Sbjct: 91 LIEDIAVAKDYRKKGVGTALLHKAIEWAK-ENHFCGLMLETQDINISACHFYAKHHF 146


85  BcerKBAB4_1215  BcerKBAB4_1221  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_1215  -2      11       1.231616   two component transcriptional regulator
BcerKBAB4_1216  -2      11       0.469745   histidine kinase
BcerKBAB4_1217  -2      11       0.981466   GntR family transcriptional regulator
BcerKBAB4_1218  -2      13       0.512912   hypothetical protein
BcerKBAB4_1219  -2      11       0.523856   iron-sulfur cluster binding protein
BcerKBAB4_1220  -1      14       -0.691956  hypothetical protein
BcerKBAB4_1221  0       15       -0.811406  peptidase A24A domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1215  HTHFIS        110    2e-30    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 110 bits (277), Expect = 2e-30
Identities = 34/130 (26%), Positives = 61/130 (46%), Gaps = 1/130 (0%)

Query: 1 MSKYNVLVVDDESDMRQLVGMYLDNFGYEWGEAENGKEALRKLETDHYDFVVLDIMMPEM 60
M+ +LV DD++ +R ++ L GY+ N R + D VV D++MP+
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 DGLSVCKEIRKT-SDVPIIFLTAKGEEWNRVNGLRMGADDYIVKPFSPGELIARMEAVLR 119
+ + I+K D+P++ ++A+ + GA DY+ KPF ELI + L
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 120 RYTKQEQQEE 129
++ + E
Sbjct: 121 EPKRRPSKLE 130


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1216  PF06580       36     2e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 36.4 bits (84), Expect = 2e-04
Identities = 31/188 (16%), Positives = 72/188 (38%), Gaps = 32/188 (17%)

Query: 275 EKVTQLIHKEAGRMQRLVHDLLD-----LAQLEGEHFPLKKQPIVFSQLIEDVLNTYELQ 329
+ LI ++ + + ++ L + L L + +++ L +Q
Sbjct: 180 NNIRALILEDPTKAREMLTSLSELMRYSLRYSNARQVSLADE----LTVVDSYLQLASIQ 235

Query: 330 FVEKNLRISTNLNPEII-VMIDEDRMQQVLHNVLDNAIRYTNQNGDIIITLKQIDDYCEL 388
F ++ L+ +NP I+ V + +Q ++ N + + I Q G I++ + + L
Sbjct: 236 FEDR-LQFENQINPAIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTL 294

Query: 389 HIKDTGIGIDREHLENLGERFYRVDKARSRQHGGTGLGLAIVRQ-IVHIHDGEWR--IES 445
+++TG + TG GL VR+ + ++ E + +
Sbjct: 295 EVENTGSLALKN------------------TKESTGTGLQNVRERLQMLYGTEAQIKLSE 336

Query: 446 EKGKGTTV 453
++GK +
Sbjct: 337 KQGKVNAM 344


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1219  ANTHRAXTOXNA  30     0.023    Anthrax toxin LF subunit signature.
		>ANTHRAXTOXNA#Anthrax toxin LF subunit signature.

Length = 800

Score = 30.1 bits (67), Expect = 0.023
Identities = 21/87 (24%), Positives = 35/87 (40%), Gaps = 7/87 (8%)

Query: 88 KTKEDAAKYIQDVAKKKQAKKVVKSKSMVTEEISMNHALEEIGCEVLE--SDLGEYILQV 145
KT+++ K + K + K T+++ L++I +VLE S+LG I
Sbjct: 53 KTEKEKFKDSINNLVKTEFTNETLDKIQQTQDL-----LKKIPKDVLEIYSELGGEIYFT 107

Query: 146 DNDPPSHIIAPALHKNRTQIRDVFKEK 172
D D H L + + EK
Sbjct: 108 DIDLVEHKELQDLSEEEKNSMNSRGEK 134


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1220  MALTOSEBP     29     0.018    Maltose binding protein signature.
		>MALTOSEBP#Maltose binding protein signature.

Length = 396

Score = 28.9 bits (64), Expect = 0.018
Identities = 23/66 (34%), Positives = 29/66 (43%), Gaps = 4/66 (6%)

Query: 50 LLEVFKK--QCTNIHTTVVETTNDRLREDIQKVIVENGGGPIILSADERFDSYGLTSLFK 107
L EV KK + T I TV D+L E +V G II A +RF Y + L
Sbjct: 46 LAEVGKKFEKDTGIKVTVEHP--DKLEEKFPQVAATGDGPDIIFWAHDRFGGYAQSGLLA 103

Query: 108 EELPKQ 113
E P +
Sbjct: 104 EITPDK 109


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1221  PREPILNPTASE  133    5e-40    Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 133 bits (337), Expect = 5e-40
Identities = 67/264 (25%), Positives = 124/264 (46%), Gaps = 35/264 (13%)

Query: 13 YLYALLIGMVFGSFFMVIAMRVPV------------------------GESIITPRSYCH 48
+ L ++ GSF V+ R+P+ +++ PRS C
Sbjct: 16 FSLVFLFSLMIGSFLNVVIHRLPIMLEREWQAEYRSYFNPDDEGVDEPPYNLMVPRSCCP 75

Query: 49 YCKYTLQPKELIPIISFCIQKGRCTNCKSKISSLYIVFEFVTGSLFFLTVYVIGMERELI 108
+C + + E IP++S+ +GRC C++ IS+ Y + E +T L + +
Sbjct: 76 HCNHPITALENIPLLSWLWLRGRCRGCQAPISARYPLVELLTALLSVAVAMTLAPGWGTL 135

Query: 109 IILSLFSLLLIISVTDLVYMLIPNCI---LIWFALLLILECIFVPLVTWTDSIVGSGVIF 165
L L +L+ ++ DL ML+P+ + L+W LL L FV L D+++G+ +
Sbjct: 136 AALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNLLGGFVSLG---DAVIGAMAGY 192

Query: 166 ILLYCMQKIY-----PEGLGGGDIKLLSLLGFIVGLKGIFMILFLASCFSLCFFGAGIVL 220
++L+ + + EG+G GD KLL+ LG +G + + ++L L+S I+L
Sbjct: 193 LVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSLVGAFMGIGLILL 252

Query: 221 KRMKKRSQIPFGPFISLGAICYML 244
+ + IPFGP++++ +L
Sbjct: 253 RNHHQSKPIPFGPYLAIAGWIALL 276


86  BcerKBAB4_1545  BcerKBAB4_1548  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_1545  -1      10       -1.181274  flagellar motor protein MotS
BcerKBAB4_1546  0       10       -1.437890  response regulator receiver protein
BcerKBAB4_1547  0       11       -1.734868  CheA signal transduction histidine kinase
BcerKBAB4_1548  2       13       -2.031705  flagellar motor switch protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1545  OMPADOMAIN    67     4e-15    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 66.9 bits (163), Expect = 4e-15
Identities = 29/127 (22%), Positives = 53/127 (41%), Gaps = 17/127 (13%)

Query: 112 SIVIVDNLIFDTGDANVKPEAKEVISQLVGFFQSVPNP---IVVEGHTDSRPIHNENFPS 168
+ +++F+ A +KPE + + QL ++ +VV G+TD N
Sbjct: 214 HFTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQG- 272

Query: 169 NWELSSARAANMIHHLIEVYNVDDKRLAAVGYADTKPVASN---------DSPQNWEKNR 219
LS RA +++ +LI + +++A G ++ PV N +R
Sbjct: 273 ---LSERRAQSVVDYLIS-KGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDR 328

Query: 220 RVVIYIK 226
RV I +K
Sbjct: 329 RVEIEVK 335


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1546  HTHFIS        83     5e-22    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 83.3 bits (206), Expect = 5e-22
Identities = 28/112 (25%), Positives = 46/112 (41%), Gaps = 2/112 (1%)

Query: 4 KILVVDDAMFMRTMIKNLLKSNSEFEVIGEAENGVEAIQKYKELQPDIVTLDITMPEMDG 63
ILV DD +RT++ L + + N + D+V D+ MP+ +
Sbjct: 5 TILVADDDAAIRTVLNQAL--SRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 64 LEALKEIMKIDSSAKVVICSAMGQQGMVLDAIKGGAKDFIVKPFQADRVIEA 115
+ L I K V++ SA + A + GA D++ KPF +I
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGI 114


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1547  PF06580       38     1e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 37.5 bits (87), Expect = 1e-04
Identities = 21/136 (15%), Positives = 38/136 (27%), Gaps = 53/136 (38%)

Query: 385 LIRNAIDHGIETVENRRDAGKNETGTIKLEAFHSGNHVVIQITDDGNGIHKGKVLEKAIK 444
L+ N I HGI + G I L+ V +++ + G+ K
Sbjct: 263 LVENGIKHGIA--------QLPQGGKILLKGTKDNGTVTLEVENTGSLALK--------- 305

Query: 445 NGVVTESEANKLTDREVFDLIFQPGFSTAEVVSDLSGRGVGLDVVKHTIHSLGG---HLI 501
+ G GL V+ + L G +
Sbjct: 306 --------------------------------NTKESTGTGLQNVRERLQMLYGTEAQIK 333

Query: 502 IDSEEGKGSTFRIELP 517
+ ++GK + +P
Sbjct: 334 LSEKQGKV-NAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1548  FLGMOTORFLIN  57     6e-12    Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 56.8 bits (137), Expect = 6e-12
Identities = 23/71 (32%), Positives = 40/71 (56%)

Query: 475 DTSILQNVEMNVKFVFGSTVRTIQDILSLQENEAVVLDEDIDEPIQIYVNDVLVAYGELV 534
D ++ ++ + + G T TI+++L L + V LD EP+ I +N L+A GE+V
Sbjct: 53 DIDLIMDIPVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVV 112

Query: 535 NVDGFFGVKVT 545
V +GV++T
Sbjct: 113 VVADKYGVRIT 123


87  BcerKBAB4_1555  BcerKBAB4_1570  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_1555  1       13       -2.453766  flagellar hook-associated protein FlgK
BcerKBAB4_1556  3       16       -2.213762  flagellar hook-associated protein FlgL
BcerKBAB4_1557  2       15       -2.430357  flagellar capping protein
BcerKBAB4_1558  3       17       0.184396   flagellar protein fliS
BcerKBAB4_1559  2       14       -0.164211  hypothetical protein
BcerKBAB4_1560  3       14       -0.274298  flagellar basal body rod protein FlgB
BcerKBAB4_1561  2       13       0.057121   flagellar basal body rod protein FlgC
BcerKBAB4_1562  3       14       -0.622699  flagellar hook-basal body protein FliE
BcerKBAB4_1563  4       12       -0.651162  flagellar MS-ring protein
BcerKBAB4_1564  3       11       -0.921100  flagellar motor switch protein G
BcerKBAB4_1565  1       13       -0.021555  flagellar assembly protein H
BcerKBAB4_1566  1       12       0.132838   flagellum-specific ATP synthase
BcerKBAB4_1567  1       12       -0.062911  hypothetical protein
BcerKBAB4_1568  0       12       -0.289155  hypothetical protein
BcerKBAB4_1569  0       13       -0.393237  flagellar basal body rod modification protein
BcerKBAB4_1570  -2      11       -0.658290  flagellar hook protein FlgE
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1555  FLGHOOKAP1    101    2e-25    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 101 bits (253), Expect = 2e-25
Identities = 69/249 (27%), Positives = 110/249 (44%), Gaps = 14/249 (5%)

Query: 4 SDYNTPLSGMLAAQMGLQTTKQNLSNIHTPGYVRQMVNYGSVGASNGHTPEQRIGYGVQT 63
S N +SG+ AAQ L T N+S+ + GY RQ ++ G +G GV
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAG--GWVGNGVYV 59

Query: 64 LGVDRITDEVKTKQFNDQLSQFSYYAYMNSTLSRVESMVGTTGKNSLSSLMDGFFNAFRE 123
GV R D T Q +Q S +S++++M+ T+ SL++ M FF + +
Sbjct: 60 SGVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTS-SLATQMQDFFTSLQT 118

Query: 124 VAKNPEQPNYYDTLVSETGKFTSQINRLAKNLDTAEAQTTEDIEAHVNEFNRLGASLAEA 183
+ N E P L+ ++ +Q + L + Q I A V++ N +A
Sbjct: 119 LVSNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASL 178

Query: 184 NKKI----GQAGTQVPNQLLDERDRIVTEMSKYANIEVS---YESMNPNIASVRMNGVLT 236
N +I G PN LLD+RD++V+E+++ +EVS + N +A NG
Sbjct: 179 NDQISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMA----NGYSL 234

Query: 237 VNGQDTYPL 245
V G L
Sbjct: 235 VQGSTARQL 243



Score = 58.8 bits (142), Expect = 2e-11
Identities = 23/74 (31%), Positives = 43/74 (58%), Gaps = 1/74 (1%)

Query: 358 QFIVGVASDKSAVNAY-QNIHKDLLEGIQQEKMSIEGVNMEEEMVNLMAFQKYFVANSKA 416
+V +K+A +++ + ++ SI GVN++EE NL FQ+Y++AN++
Sbjct: 472 ASLVSDIGNKTATLKTSSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQV 531

Query: 417 ITTMNEVFDSLFSI 430
+ T N +FD+L +I
Sbjct: 532 LQTANAIFDALINI 545


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1556  FLAGELLIN     38     3e-05    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 38.1 bits (88), Expect = 3e-05
Identities = 28/127 (22%), Positives = 60/127 (47%)

Query: 1 MRVSTFQNANWAKNQMMDLNVQQQYHRNQVTSGKKNLFMSEDPLAASKSFAIQHSLANIE 60
++T + +N + +++SG + +D + + ++ +
Sbjct: 2 QVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLT 61

Query: 61 QMQKDLADSKNVLTQTENTLQGVFKSLTRADQLTLQALNGTNSEKELKAIGAEIDQILKQ 120
Q ++ D ++ TE L + +L R +L++QA NGTNS+ +LK+I EI Q L++
Sbjct: 62 QASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEE 121

Query: 121 VVYLANT 127
+ ++N
Sbjct: 122 IDRVSNQ 128


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1560  FLGHOOKAP1    30     0.004    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 29.5 bits (66), Expect = 0.004
Identities = 10/27 (37%), Positives = 15/27 (55%)

Query: 20 NTVSSNIANANTPGYKAQDVTFAEKMN 46
NT S+NI++ N GY Q A+ +
Sbjct: 19 NTASNNISSYNVAGYTRQTTIMAQANS 45


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1561  FLGHOOKAP1    35     5e-05    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 34.9 bits (80), Expect = 5e-05
Identities = 20/75 (26%), Positives = 34/75 (45%), Gaps = 7/75 (9%)

Query: 5 INASGSGLTTARKWMEVTSNNIVNANTTGAPGAEPYHRRSVVLESNNSFASMLDGAPTNG 64
IN + SGL A+ + SNNI + N G Y R++ ++ NS G NG
Sbjct: 4 INNAMSGLNAAQAALNTASNNISSYNVAG------YTRQTTIMAQANSTLGA-GGWVGNG 56

Query: 65 VKIKSIETDRNENLV 79
V + ++ + + +
Sbjct: 57 VYVSGVQREYDAFIT 71



Score = 28.4 bits (63), Expect = 0.009
Identities = 10/38 (26%), Positives = 17/38 (44%)

Query: 97 NIDVTAEMTNVMVAQKMYEANTSVLNANKKMLDKDLEI 134
+++ E N+ Q+ Y AN VL + D + I
Sbjct: 508 GVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1562  FLGHOOKFLIE   35     5e-06    Flagellar hook-basal body complex protein FliE signa...
		>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE signature.

Length = 103

Score = 35.4 bits (81), Expect = 5e-06
Identities = 17/63 (26%), Positives = 32/63 (50%), Gaps = 1/63 (1%)

Query: 38 LEDMNQTQNNAQTAVYDLLTKGVG-ETHDVLIQQKKAESQMKTAALVRDNLIENYKSLIN 96
L+ ++ TQ A+T G +DV+ +KA M+ VR+ L+ Y+ +++
Sbjct: 41 LDRISDTQTAARTQAEKFTLGEPGVALNDVMTDMQKASVSMQMGIQVRNKLVAAYQEVMS 100

Query: 97 MQI 99
MQ+
Sbjct: 101 MQV 103


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1563  FLGMRINGFLIF  159    9e-45    Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 159 bits (404), Expect = 9e-45
Identities = 97/540 (17%), Positives = 215/540 (39%), Gaps = 46/540 (8%)

Query: 17 LVIGAALLAIATGALLYFTLPDKYVVVYQNLNDTDKQEITAELSKLGVDYQLAADG-SIR 75
+V G+A +AI +L+ PD Y ++ NL+D D I A+L+++ + Y+ A +I
Sbjct: 28 IVAGSAAVAIVVAMVLWAKTPD-YRTLFSNLSDQDGGAIVAQLTQMNIPYRFANGSGAIE 86

Query: 76 VQKNDAPWVRKEMNGMGLPFNSKSGEEILLESSLGSSEQDKKMKQIVGTKKQLEQDIVRN 135
V + +R + GLP G E+L + G S+ +++ + +L + I
Sbjct: 87 VPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQFSEQVNYQRALEGELARTI-ET 145

Query: 136 FATIETANVQITLPEKETIFDEEKAKGTAAITVGVKRGQLLTADQVAGIQQMISAAVPGV 195
+++A V + +P+ ++F E+ +A++TV ++ G+ L Q++ + ++S+AV G+
Sbjct: 146 LGPVKSARVHLAMPKP-SLFVREQKSPSASVTVTLEPGRALDEGQISAVVHLVSSAVAGL 204

Query: 196 KAEEVSVIDSKKGIISKGADEAHSNSSSSYEKEVEMQHQIEGKLKQDIDATLMTMFKSNE 255
V+++D ++++ + + ++ + +E ++++ I+A L + +
Sbjct: 205 PPGNVTLVDQSGHLLTQSNTSGRDLNDAQ----LKFANDVESRIQRRIEAILSPIVGNGN 260

Query: 256 YKVNTKVSVNYDEVTRQSEKYG-DKGVLRSKQEQEESSTA-QEGADTKQGAGITANGEVP 313
+++ + E Y + ++ + + + Q GA G + +
Sbjct: 261 VHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNISEQVGAGYPGGVPGALSNQPA 320

Query: 314 NYGT----------NNNQNGKVVYDNKNGNKI----------ENYEIDKTVETIKKHP-E 352
N QN + N N NYE+D+T+ K + +
Sbjct: 321 PPNEAPIATPPTNQQNAQNTPQTSTSTNSNSAGPRSTQRNETSNYEVDRTIRHTKMNVGD 380

Query: 353 LTKTNVVVWVDNDTLVKRKI------DMTTFKEAIGTAAGLQADPNGNFTNGQVNVVTVQ 406
+ + +V V V+ TL K M ++ A G +NVV
Sbjct: 381 IERLSVAVVVNYKTLADGKPLPLTADQMKQIEDLTREAMGFSDK-----RGDTLNVVNSP 435

Query: 407 FDQPKVEKEKEPEKSGMNWWLFGGITAGLLALIGLVWFFLARRKKKREEEEYEEYLAEEE 466
F + P ++ L ++ + W + + + EE A +E
Sbjct: 436 FSAVDNTGGELPFWQQQSFIDQLLAAGRWLLVLVVAWILWRKAVRPQLTRRVEEAKAAQE 495

Query: 467 VAASSESIFEIPEEKI----VPEPKPEPVEPSEPTLDDQVQEATKEHVEGTAKVIKKWLN 522
A + E E ++ + + + +++E + A VI++W++
Sbjct: 496 QAQVRQETEEAVEVRLSKDEQLQQRRANQRLGAEVMSQRIREMSDNDPRVVALVIRQWMS 555


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1564  FLGMOTORFLIG  205    7e-66    Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 205 bits (522), Expect = 7e-66
Identities = 116/336 (34%), Positives = 196/336 (58%), Gaps = 6/336 (1%)

Query: 2 LDEISSKEKAAILIRTLNEEVAAKVIEYMTAEEKEVLLREIAKFRVYKPETLENVLGEFL 61
+ ++ K+KAAIL+ ++ E+++KV +Y++ EE E L EIAK E +NVL EF
Sbjct: 12 VSALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITSELKDNVLLEFK 71

Query: 62 YELNVKELNLVTPDKEYIRRIF-KNMPEEDLEKLLEDLWYN-KDNPFEFLNSLTDLEPLL 119
EL + + + +Y R + K++ + ++ +L + PFEF+ D +L
Sbjct: 72 -ELMMAQEFIQKGGIDYARELLEKSLGTQKAVDIINNLGSALQSRPFEFVRRA-DPANIL 129

Query: 120 TVLNDESPQTIAIIASYIKPQLASQLIERLPDHKRVETVMGIAKLEQVDGELINQIGELL 179
+ E PQTIA+I SY+ PQ AS ++ LP + IA +++ E++ ++ +L
Sbjct: 130 NFIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEVVREVERVL 189

Query: 180 KAKLNNMAFSAINKTDGLKTIVNILNNVSRGVEKTVFQKLDEVDYALSEKIKENMFVFED 239
+ KL +++ G+ +V I+N R EK + + L+E D L+E+IK+ MFVFED
Sbjct: 190 EKKLASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEIKKKMFVFED 249

Query: 240 LLGLEDLALRRVLEEITDNGVIAKALKIAKEEIKEKLFTCMSSNRREMILEELDGLGPLK 299
++ L+D +++RVL EI D +AKALK ++EK+F MS M+ E+++ LGP +
Sbjct: 250 IVLLDDRSIQRVLREI-DGQELAKALKSVDIPVQEKIFKNMSKRAASMLKEDMEFLGPTR 308

Query: 300 MTDAEKAQQTITGTVKKLEKEGRIIVQRG-EDDVLI 334
D E++QQ I ++KLE++G I++ RG E+DVL+
Sbjct: 309 RKDVEESQQKIVSLIRKLEEQGEIVISRGGEEDVLV 344


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1568  IGASERPTASE   40     1e-05    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 40.0 bits (93), Expect = 1e-05
Identities = 40/224 (17%), Positives = 76/224 (33%), Gaps = 10/224 (4%)

Query: 13 PPQKEKGLEVQSKNENSSFDNTMRIENKKQPKTEKPKREEAPEEEKQEYILAKKTVTKEE 72
Q+ K +E ++ + + + + + + + E + T TKE
Sbjct: 1044 SKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKET 1103

Query: 73 PIVKKEEK-KEEKKETEQLLLAVSEQMVAIEQLR-VQPELLYQYIQKIQALYKEYGNIKL 130
V+KEEK K E ++T+++ S+ EQ VQP+ KE
Sbjct: 1104 ATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQ--SQ 1161

Query: 131 NELPAAELQQLQELFSNMNIKNAICLEDTMQMVLDKVTMPEQTLQVLKVVETETCNIAKK 190
A Q +E ++ N++ + T+ V PE T T + K
Sbjct: 1162 TNTTADTEQPAKE--TSSNVEQPVTESTTVNTGNSVVENPENTTPA-TTQPTVNSESSNK 1218

Query: 191 QEESKEVELQKAESDDVKLELPEVDQLNDSSSAGAELLNKATGT 234
+ ++ + E + S+ A +L + T
Sbjct: 1219 PKNRHRRSVRSVPHNV---EPATTSSNDRSTVALCDLTSTNTNA 1259


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1570  FLGHOOKAP1    40     1e-05    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 40.3 bits (94), Expect = 1e-05
Identities = 13/34 (38%), Positives = 22/34 (64%)

Query: 5 LYTSITGMNATQNALSVTSNNIANAQTVGYKKQK 38
+ +++G+NA Q AL+ SNNI++ GY +Q
Sbjct: 4 INNAMSGLNAAQAALNTASNNISSYNVAGYTRQT 37



Score = 37.2 bits (86), Expect = 9e-05
Identities = 18/71 (25%), Positives = 38/71 (53%), Gaps = 4/71 (5%)

Query: 335 GNYTATNTTGILALGASGQNGAGKIRGGAQEGANVDLSVEFVDLMLYQRGFQGNAKVIKI 394
GN TAT T A+ N ++ Q + V+L E+ +L +Q+ + NA+V++
Sbjct: 479 GNKTATLKT----SSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQT 534

Query: 395 SDEVLNEVVNL 405
++ + + ++N+
Sbjct: 535 ANAIFDALINI 545


88  BcerKBAB4_1573  BcerKBAB4_1595  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_1573  -3      10       -1.559146  response regulator receiver modulated CheW
BcerKBAB4_1574  -2      11       -0.630026  hypothetical protein
BcerKBAB4_1576  1       15       0.012976   cell wall hydrolase/autolysin
BcerKBAB4_1577  3       20       0.307286   PAS/PAC sensor-containing diguanylate
BcerKBAB4_1578  6       28       3.139472   integral membrane protein TerC
BcerKBAB4_1580  7       32       2.552770   flagellin domain-containing protein
BcerKBAB4_1581  6       29       2.657102   flagellin domain-containing protein
BcerKBAB4_1582  3       24       2.292438   flagellin domain-containing protein
BcerKBAB4_1583  3       24       2.139800   flagellin domain-containing protein
BcerKBAB4_1584  3       29       1.437005   lytic transglycosylase
BcerKBAB4_1585  4       32       0.805502   flagellar motor switch protein
BcerKBAB4_1586  3       30       0.769220   flagellar motor switch protein FliM
BcerKBAB4_1587  3       25       -0.567798  flagellar motor switch protein
BcerKBAB4_1588  4       21       -0.047383  flagellar motor switch protein
BcerKBAB4_1589  3       16       -0.450123  flagellar biosynthesis protein FliP
BcerKBAB4_1590  4       14       0.009239   flagellar biosynthesis protein FliQ
BcerKBAB4_1591  2       12       0.662963   flagellar biosynthesis protein FliR
BcerKBAB4_1592  1       11       0.777719   flagellar biosynthesis protein FlhB
BcerKBAB4_1593  0       11       1.366508   flagellar biosynthesis protein FlhA
BcerKBAB4_1594  -1      12       1.267327   flagellar biosynthesis regulator FlhF
BcerKBAB4_1595  -1      14       1.634769   flagellar basal body rod protein FlgG
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1573  HTHFIS        48     2e-08    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 48.3 bits (115), Expect = 2e-08
Identities = 36/125 (28%), Positives = 55/125 (44%), Gaps = 14/125 (11%)

Query: 176 IYIAEDSAMLRQILEETLSSAGYTKMNFFSNGAEALAQIEKLAKEQGEKMYEHIHLLITD 235
I +A+D A +R +L + LS AGY + A I L++TD
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSN-AATLWRWIAA----------GDGDLVVTD 54

Query: 236 IEMPKMDGHHLTKVIKDSEVMNQLPVIIFSSLITNELFHKGEAVGANAQVSKP-DIQELI 294
+ MP + L IK + LPV++ S+ T K GA + KP D+ ELI
Sbjct: 55 VVMPDENAFDLLPRIKKARPD--LPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELI 112

Query: 295 GLVDK 299
G++ +
Sbjct: 113 GIIGR 117


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1580  FLAGELLIN     84     5e-22    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 84.3 bits (208), Expect = 5e-22
Identities = 40/142 (28%), Positives = 64/142 (45%)

Query: 3 IGTNVLSMNASQSLYENEKRMKMATDKKLNTASDTPANVAIVTRMHARASGIQVAIRKIE 62
+ + A + ++ +A + + + I A + I+
Sbjct: 366 GESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAAKKSTANPLASID 425

Query: 63 EALQNISLYRADLGAMMKRLQFNIENLNNQSLALTGASSRIEDADMAQEMSDFFKFKLLT 122
AL + R+ LGA+ R I NL N L A SRIEDAD A E+S+ K ++L
Sbjct: 426 SALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSNMSKAQILQ 485

Query: 123 EVALSMVSQANQIPQMVSKLLQ 144
+ S+++QANQ+PQ V LL+
Sbjct: 486 QAGTSVLAQANQVPQNVLSLLR 507



Score = 40.0 bits (93), Expect = 1e-06
Identities = 19/88 (21%), Positives = 37/88 (42%), Gaps = 5/88 (5%)

Query: 1 MRIGTNVLSMNASQSLYENEKRM-----KMATDKKLNTASDTPANVAIVTRMHARASGIQ 55
I TN LS+ +L +++ + ++++ ++N+A D A AI R + G+
Sbjct: 2 QVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLT 61

Query: 56 VAIRKIEEALQNISLYRADLGAMMKRLQ 83
A R + + L + LQ
Sbjct: 62 QASRNANDGISIAQTTEGALNEINNNLQ 89


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1581  FLAGELLIN     128    6e-36    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 128 bits (322), Expect = 6e-36
Identities = 78/271 (28%), Positives = 122/271 (45%), Gaps = 3/271 (1%)

Query: 1 MRINTNINSMRTQEYMRQNQDKMNTSMNRLSSGKSINSAADDAAGLAIATRMRAKEGGLN 60
INTN S+ TQ + ++Q +++++ RLSSG INSA DDAAG AIA R + GL
Sbjct: 2 QVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLT 61

Query: 61 VGARNTQDAMSALRTGDAALGSVSNILLRMRDLATQASSGTNNDKDIASMDKEYQALAQE 120
+RN D +S +T + AL ++N L R+R+L+ QA++GTN+D D+ S+ E Q +E
Sbjct: 62 QASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEE 121

Query: 121 IDHIADKTNFNGNAFLGGTGAGGKDITIQLSDASSDTMKITAIDTKAITTATLATATATG 180
ID ++++T FNG L I + +D + T+ + ID K++
Sbjct: 122 IDRVSNQTQFNGVKVLSQD--NQMKIQVGANDGETITIDLQKIDVKSLGLDGF-NVNGPK 178

Query: 181 PDKALNAASAPAQITALDTAIQGIADARATFGSQLNRLDHNLNNVTSQATNMAASASQIE 240
+ S+ +T DT G R S D V + AA+
Sbjct: 179 EATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTT 238

Query: 241 DADMAKEMSNMTKFKILNEAGISMLSQANQT 271
D ++ K + A
Sbjct: 239 DDAENNTAVDLFKTTKSTAGTAEAKAIAGAI 269



Score = 80.9 bits (199), Expect = 2e-19
Identities = 57/252 (22%), Positives = 99/252 (39%), Gaps = 1/252 (0%)

Query: 30 LSSGKSINSAADDAAGLAIATRMRAKEGGLNVGARNTQDAMSALRTGDAALGSVSNILLR 89
+ + A G K + + D + T +
Sbjct: 256 TAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVSTTINGEKVTLTVADI 315

Query: 90 MRDLATQASSGTNNDKDIASMDKEYQALAQEIDHIADKTNFNGNAFLGGTGAGGKDITIQ 149
A ++ + K++ + Q + + A G +
Sbjct: 316 TAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEANNAVKGESKITVNGA 375

Query: 150 LSDASSDTMKIT-AIDTKAITTATLATATATGPDKALNAASAPAQITALDTAIQGIADAR 208
A++ K+T A T I +T D A S + ++D+A+ + R
Sbjct: 376 EYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAAKKSTANPLASIDSALSKVDAVR 435

Query: 209 ATFGSQLNRLDHNLNNVTSQATNMAASASQIEDADMAKEMSNMTKFKILNEAGISMLSQA 268
++ G+ NR D + N+ + TN+ ++ S+IEDAD A E+SNM+K +IL +AG S+L+QA
Sbjct: 436 SSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSNMSKAQILQQAGTSVLAQA 495

Query: 269 NQTPQMVSKLLQ 280
NQ PQ V LL+
Sbjct: 496 NQVPQNVLSLLR 507
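
Note on reading the statistics lines: each alignment above is preceded by a "Score = ..., Expect = ..." line and an "Identities = ..." line. The following is a minimal Python sketch for pulling those numbers out of the report text. It is not part of PredictBias; the function names are hypothetical, and the 0.05 cutoff is taken from the "E-value (RPSBlast)" input parameter shown at the top of the page. The percentages in the report appear to be truncated rather than rounded (78/271 is printed as 28%), so the sketch uses integer division.

```python
import re

# Patterns for the two statistics lines that precede each alignment block.
SCORE_RE = re.compile(r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)")
FRACTION_RE = re.compile(r"(Identities|Positives|Gaps)\s*=\s*(\d+)/(\d+)")

# Assumption: cutoff copied from the page's "E-value (RPSBlast)" input parameter.
EVALUE_CUTOFF = 0.05


def parse_expect(text):
    """Convert an Expect string to float; 'e-117' is shorthand for 1e-117."""
    return float("1" + text if text.startswith("e") else text)


def parse_hsp(score_line, identity_line):
    """Return the score, E-value and match fractions of one alignment."""
    m = SCORE_RE.search(score_line)
    if m is None:
        raise ValueError("not a Score line: " + score_line)
    hsp = {
        "bits": float(m.group(1)),
        "raw_score": int(m.group(2)),
        "expect": parse_expect(m.group(3)),
    }
    for name, num, den in FRACTION_RE.findall(identity_line):
        hsp[name.lower()] = (int(num), int(den))
    # The Gaps field is left out of the report line when there are no gaps.
    hsp.setdefault("gaps", (0, hsp["identities"][1]))
    return hsp


def percent(fraction):
    """Whole-number percentage, truncated the way the report prints it."""
    num, den = fraction
    return 100 * num // den


def is_significant(hsp, cutoff=EVALUE_CUTOFF):
    """True when the alignment's E-value is below the chosen cutoff."""
    return hsp["expect"] < cutoff


hsp = parse_hsp(
    "Score = 128 bits (322), Expect = 6e-36",
    "Identities = 78/271 (28%), Positives = 122/271 (45%), Gaps = 3/271 (1%)",
)
print(hsp["bits"], hsp["expect"], percent(hsp["identities"]), is_significant(hsp))
```

The same two functions can be applied to every Score/Identities pair in the report, for example to list only the alignments whose E-value falls below a stricter cutoff.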


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1582  FLAGELLIN     126    3e-35    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 126 bits (317), Expect = 3e-35
Identities = 77/272 (28%), Positives = 121/272 (44%), Gaps = 4/272 (1%)

Query: 1 MRINTNINSMRTQEYMRQNQDKMNTSMNRLSSGKSINSAADDAAGLAIATRMRAKEGGLN 60
INTN S+ TQ + ++Q +++++ RLSSG INSA DDAAG AIA R + GL
Sbjct: 2 QVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLT 61

Query: 61 VGARNTQDAMSALRTGDAALGSVSNILLRMRDLATQASSGTNNDKDIASMNKEYQALAQE 120
+RN D +S +T + AL ++N L R+R+L+ QA++GTN+D D+ S+ E Q +E
Sbjct: 62 QASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEE 121

Query: 121 IDHIADKTNFNGNAFLGGTGAGGKDITIQLSDASSDTMTIAAIDTKDITTTKLAVGADPA 180
ID ++++T FNG L I + +D + T+ + ID K + V
Sbjct: 122 IDRVSNQTQFNGVKVLSQD--NQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNG--P 177

Query: 181 SATKNLNATTAATEITALDTAIQNIADARATFGSQLNRLDHNLNNVTSQATNMAASASQI 240
+ ++ +T DT R S D V + AA+
Sbjct: 178 KEATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLT 237

Query: 241 EDADMAKEMSNMTKFKILNEAGISMLSQANQT 272
D ++ K + A
Sbjct: 238 TDDAENNTAVDLFKTTKSTAGTAEAKAIAGAI 269



Score = 78.9 bits (194), Expect = 1e-18
Identities = 49/252 (19%), Positives = 94/252 (37%)

Query: 30 LSSGKSINSAADDAAGLAIATRMRAKEGGLNVGARNTQDAMSALRTGDAALGSVSNILLR 89
+ + A G K + + D + T +
Sbjct: 256 TAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVSTTINGEKVTLTVADI 315

Query: 90 MRDLATQASSGTNNDKDIASMNKEYQALAQEIDHIADKTNFNGNAFLGGTGAGGKDITIQ 149
A ++ + K++ + Q + + A G +
Sbjct: 316 TAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEANNAVKGESKITVNGA 375

Query: 150 LSDASSDTMTIAAIDTKDITTTKLAVGADPASATKNLNATTAATEITALDTAIQNIADAR 209
A++ + + + + + A + ++D+A+ + R
Sbjct: 376 EYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAAKKSTANPLASIDSALSKVDAVR 435

Query: 210 ATFGSQLNRLDHNLNNVTSQATNMAASASQIEDADMAKEMSNMTKFKILNEAGISMLSQA 269
++ G+ NR D + N+ + TN+ ++ S+IEDAD A E+SNM+K +IL +AG S+L+QA
Sbjct: 436 SSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSNMSKAQILQQAGTSVLAQA 495

Query: 270 NQTPQMVSKLLQ 281
NQ PQ V LL+
Sbjct: 496 NQVPQNVLSLLR 507


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1583  FLAGELLIN     131    4e-37    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 131 bits (330), Expect = 4e-37
Identities = 78/272 (28%), Positives = 125/272 (45%), Gaps = 7/272 (2%)

Query: 1 MRINTNINSMRTQEYMRQNQDKMNTSMNRLSSGKSINSAADDAAGLAIATRMRAKEGGLN 60
INTN S+ TQ + ++Q +++++ RLSSG INSA DDAAG AIA R + GL
Sbjct: 2 QVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLT 61

Query: 61 VGARNTQDAMSALRTGDAALGSVSNILLRMRDLATQASSGTNNDKDIASMDKEYQALAQE 120
+RN D +S +T + AL ++N L R+R+L+ QA++GTN+D D+ S+ E Q +E
Sbjct: 62 QASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEE 121

Query: 121 INHIADKTNFNGNAFLNKGTNPGEGKDITIQLSDASSDTMTIAAIDTKDITTTKLATDGT 180
I+ ++++T FNG L++ I + +D + T+ + ID K + +G
Sbjct: 122 IDRVSNQTQFNGVKVLSQDNQ----MKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGP 177

Query: 181 K---KLDATTAATEITALDTAIQEIADARATFGSQLNRLDHNLNNVTSQATNMAASASQI 237
K D ++ +T DT R S D V + AA+
Sbjct: 178 KEATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLT 237

Query: 238 EDADMAKEMSNMTKFKILNEAGISMLSQANQT 269
D ++ K + A
Sbjct: 238 TDDAENNTAVDLFKTTKSTAGTAEAKAIAGAI 269



Score = 78.9 bits (194), Expect = 9e-19
Identities = 52/252 (20%), Positives = 99/252 (39%), Gaps = 3/252 (1%)

Query: 30 LSSGKSINSAADDAAGLAIATRMRAKEGGLNVGARNTQDAMSALRTGDAALGSVSNILLR 89
+ + A G K + + D + T +
Sbjct: 256 TAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVSTTINGEKVTLTVADI 315

Query: 90 MRDLATQASSGTNNDKDIASMDKEYQALAQEINHIADKTNFNGNAFLNKGTNPGEGKDIT 149
A ++ + K++ + Q + + A +
Sbjct: 316 TAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEANNAVKGESKITVNGA 375

Query: 150 IQLSDASSDTMTIAAIDTKDITTTKLATDGTKKLDATTAAT---EITALDTAIQEIADAR 206
++A+ D +T+A T + + A + + ++D+A+ ++ R
Sbjct: 376 EYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAAKKSTANPLASIDSALSKVDAVR 435

Query: 207 ATFGSQLNRLDHNLNNVTSQATNMAASASQIEDADMAKEMSNMTKFKILNEAGISMLSQA 266
++ G+ NR D + N+ + TN+ ++ S+IEDAD A E+SNM+K +IL +AG S+L+QA
Sbjct: 436 SSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSNMSKAQILQQAGTSVLAQA 495

Query: 267 NQTPQMVSKLLQ 278
NQ PQ V LL+
Sbjct: 496 NQVPQNVLSLLR 507


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1584  PF06580       31     0.005    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 31.0 bits (70), Expect = 0.005
Identities = 9/43 (20%), Positives = 22/43 (51%), Gaps = 1/43 (2%)

Query: 121 ELTNKY-NIQKIRSSNEGKYEDIIDRASSTYGIPKTLIQKMIE 162
+ + Y + I+ + ++E+ I+ A +P L+Q ++E
Sbjct: 223 TVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPMLVQTLVE 265


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1585  TYPE3OMOPROT  42     4e-08    Type III secretion system outer membrane O protein ...
		>TYPE3OMOPROT#Type III secretion system outer membrane O protein family signature.

Length = 303

Score = 41.5 bits (97), Expect = 4e-08
Identities = 14/67 (20%), Positives = 31/67 (46%)

Query: 5 DDIPLTIYFEIGNTKKKIEDLLHITKGTLYRLENSTKNTVRLMLENEEIGTGKILTKNGK 64
+ +P+ + F + + +L + + L L + + V +M +G G+++ N
Sbjct: 228 NQLPVKLEFVLYRKNVTLAELEAMGQQQLLSLPTNAELNVEIMANGVLLGNGELVQMNDT 287

Query: 65 MYVEIVE 71
+ VEI E
Sbjct: 288 LGVEIHE 294


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1586  FLGMOTORFLIM  144    2e-42    Flagellar motor switch protein FliM signature.
		>FLGMOTORFLIM#Flagellar motor switch protein FliM signature.

Length = 344

Score = 144 bits (364), Expect = 2e-42
Identities = 93/329 (28%), Positives = 165/329 (50%), Gaps = 10/329 (3%)

Query: 4 EKLSQEQIDALLKAVNEGEEMPAFAQEAGKQDKFQEYDFNRPEKFGVEHLRSLQAIASTF 63
E LSQ++ID LL A++ G+ A+ K YDF RP+KF E +R+L + TF
Sbjct: 3 EVLSQDEIDQLLTAISSGDASIEDARPISDTRKITLYDFRRPDKFSKEQMRTLSLMHETF 62

Query: 64 GKQTSQTLSARMRIPIELEPSTVEQVPFTSEYVEKMPKDYYLYCVIDLGLPELGEIVIEI 123
+ T+ +LSA++R + + ++V+Q+ + E++ +P L VI + P G V+E+
Sbjct: 63 ARLTTTSLSAQLRSMVHVHVASVDQLTY-EEFIRSIPTPSTL-AVITMD-PLKGNAVLEV 119

Query: 124 DLAFVIYIHECWLGGDSKRNFTMRRPLTAFEFLTLDNIFLLLCKNLEQSFESVVAIEPKF 183
D + I + GG + ++R LT E ++ + + + N+ +S+ V+ + P+
Sbjct: 120 DPSITFSIIDRLFGGT-GQAAKVQRDLTDIENSVMEGVIVRILANVRESWTQVIDLRPRL 178

Query: 184 VTTETDPNALKITTASDIISLLNVNMKTEFWNTTVRIGIPFLSVEEIMDKLTSENIVEHS 243
ET+P +I S+++ L+ + K + IP++++E I+ KL+S+ S
Sbjct: 179 GQIETNPQFAQIVPPSEMVVLVTLETKVGEEEGMMNFCIPYITIEPIISKLSSQFW--FS 236

Query: 244 SDKRKK---YTSEVEVKVNQVYKPVHVAVGEQKMTMGEIEQIEEGDIIPLH-TKVSDQLR 299
S +R Y + K++ V V VG ++++ +I + GDII LH T V D
Sbjct: 237 SVRRSSTTQYMGVLRDKLSTVDMDVVAEVGSLRLSVRDILGLRVGDIIRLHDTHVGDPFV 296

Query: 300 GYVDGKHKFNCFIGKDGTRKALLFKSFIE 328
+ + KF C G G + A IE
Sbjct: 297 LSIGNRKKFLCQPGVVGKKIAAQILERIE 325


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1587  FLGMOTORFLIN  58     5e-14    Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 58.0 bits (140), Expect = 5e-14
Identities = 22/94 (23%), Positives = 51/94 (54%)

Query: 13 LEEFAGKRNEAGKAHIDTVSDISIELGVKLGKSSITLGDVKQLKVGDVLEVEKNLGHKVD 72
++ G ID + DI ++L V+LG++ +T+ ++ +L G V+ ++ G +D
Sbjct: 39 FQQLGGGDVSGAMQDIDLIMDIPVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLD 98

Query: 73 VYLSDMKVGIGEAIVMDEKFGIIISEIEADKKHA 106
+ ++ + GE +V+ +K+G+ I++I +
Sbjct: 99 ILINGYLIAQGEVVVVADKYGVRITDIITPSERM 132


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1589  FLGBIOSNFLIP  163    4e-52    Escherichia coli: Flagellar biosynthetic protein Fl...
		>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature.

Length = 245

Score = 163 bits (415), Expect = 4e-52
Identities = 70/203 (34%), Positives = 127/203 (62%)

Query: 48 SSVQLFALVTLLSLSSSIVLLFTHFTYFMIVLGITRQGLGVMNLPPNQVLVGLALFLSLF 107
VQ +T L+ +I+L+ T FT +IV G+ R LG + PPNQVL+GLALFL+ F
Sbjct: 40 LPVQTLVFITSLTFIPAILLMMTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFF 99

Query: 108 TMQPVLGQLKSDVWDPMTKEKITVSQAAETTAPIMKDYMSKHTYKHDLKMMLKVRGEELP 167
M PV+ ++ D + P ++EKI++ +A E A ++++M + T + DL + ++
Sbjct: 100 IMSPVIDKIYVDAYQPFSEEKISMQEALEKGAQPLREFMLRQTREADLGLFARLANTGPL 159

Query: 168 KDLKDLSLFTLVPSFTLTQIQKGLLTGMFIYLAFVFIDLIISTLLMYLGMMMVPPMILSL 227
+ + + + L+P++ ++++ G I++ F+ IDL+I+++LM LGMMMVPP ++L
Sbjct: 160 QGPEAVPMRILLPAYVTSELKTAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIAL 219

Query: 228 PFKILVFVYLGGYTKIVDIMFKT 250
PFK+++FV + G+ +V + ++
Sbjct: 220 PFKLMLFVLVDGWQLLVGSLAQS 242


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1590  TYPE3IMQPROT  38     3e-07    Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature.

Length = 86

Score = 38.2 bits (89), Expect = 3e-07
Identities = 15/81 (18%), Positives = 35/81 (43%)

Query: 4 SPIIDIFQTFFYKGVMILMPIAVVSMIVVIIIAVIMAMMQIQEQTLTFLPKMASIVLVII 63
++ Y +++ +V+ I+ +++ + + Q+QEQTL F K+ + L +
Sbjct: 2 DDLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLF 61

Query: 64 ILGPWMFQELTMLILDLFDKI 84
+L W + L +
Sbjct: 62 LLSGWYGEVLLSYGRQVIFLA 82


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1591  TYPE3IMRPROT  96     7e-26    Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature.

Length = 261

Score = 96.0 bits (239), Expect = 7e-26
Identities = 51/233 (21%), Positives = 113/233 (48%), Gaps = 1/233 (0%)

Query: 10 FFAFCRITSFLYFLPFFSGRSIPAMAKVTFGLALSITVADQVDVSHIKTVWDVAA-YAGT 68
F+ R+ + + P S RS+P K+ + ++ +A + + + A A
Sbjct: 17 FWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPANDVPVFSFFALWLAVQ 76

Query: 69 QIVIGLSLSKIVEMLWNIPKMAGHILDFDIGLSQASLFDVNAGSQSTLLSTIFDIFFLII 128
QI+IG++L ++ + + AG I+ +GLS A+ D + +L+ I D+ L++
Sbjct: 77 QILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHLNMPVLARIMDMLALLL 136

Query: 129 FISLGGINYFVATILKSFQYTEAISKLLTTSFLDSLLATLLFAITSAVEIALPLMGSLFI 188
F++ G + ++ ++ +F + L ++ +L + + +ALPL+ L
Sbjct: 137 FLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLIFLNGLMLALPLITLLLT 196

Query: 189 INFVLILIAKNAPQLNVFMNAYVIKITCGILFIAMSVPMLGYVFKNMTDVLLE 241
+N L L+ + APQL++F+ + + +T GI +A +P++ +++ +
Sbjct: 197 LNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFCEHLFSEIFN 249


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1592  TYPE3IMSPROT  287    1e-97    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 287 bits (737), Expect = 1e-97
Identities = 90/343 (26%), Positives = 184/343 (53%), Gaps = 2/343 (0%)

Query: 4 DNKTEKATPQKRKKSREEGNIARSKDLNNLFSILVLAVVVYFFGDWLGYEIAHSVAVLFD 63
KTE+ TP+K + +R++G +A+SK++ + I+ L+ ++ D+ + + + +
Sbjct: 3 GEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIPAE 62

Query: 64 QIGKNTDS--TEYFYLMGILLLKVSAPILILVYAFHLFNYMIQVGFLFSSKVLKPKASRI 121
Q + + + + P+L + + ++++Q GFL S + +KP +I
Sbjct: 63 QSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDIKKI 122

Query: 122 NPKNYFTRLFSRKSLVDILKSLFYMGLIGYVSYVLFKKNLEKIVSMIGFNWTASLTEIIS 181
NP R+FS KSLV+ LKS+ + L+ + +++ K NL ++ + +
Sbjct: 123 NPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPLLGQ 182

Query: 182 QIKFIFLAILIILIVLSIIDFIYQKWEYEQDIKMKKEEVKQEHKDNEGDPQVKGKRKNFM 241
++ + + + +V+SI D+ ++ ++Y +++KM K+E+K+E+K+ EG P++K KR+ F
Sbjct: 183 ILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRRQFH 242

Query: 242 HAILQGTIAKKMDGATFIVNNPTHISVVLRYNKQVDAAPIVVAKGEDELALYIRTLAREQ 301
I + + + ++ +V NPTHI++ + Y + P+V K D +R +A E+
Sbjct: 243 QEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIAEEE 302

Query: 302 EIPMVENRPLARSLYYQVEEDETIPEDLYVAVIEVMRYLIQTK 344
+P+++ PLAR+LY+ D IP + A EV+R+L +
Sbjct: 303 GVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQN 345


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1593  TYPE3IMSPROT  39     7e-05    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 38.6 bits (90), Expect = 7e-05
Identities = 26/163 (15%), Positives = 58/163 (35%), Gaps = 19/163 (11%)

Query: 192 IFGIVILFVNIIFGLIVGMMQQGMSFADAAI-----------HYTQLTVGDGIVNQIGSL 240
+L V + + ++Q G + AI ++ +V + S+
Sbjct: 85 YLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDIKKINPIEGAKRIFSIKSLVEFLKSI 144

Query: 241 MLAISTGIIVTRVFDGSADTVTEGIFKELLAHEVVVYALGGLFIAMGVFTPLPFLPFALV 300
+ + I++ + G+ T+ + E + LG + + V + F+ ++
Sbjct: 145 LKVVLLSILIWIIIKGNLVTLLQLPT---CGIECITPLLGQILRQLMVICTVGFVVISIA 201

Query: 301 GGTI-IFLGVRNKKRIKKEKEDELQKELE---MIQGDEEQLQQ 339
+ ++ K K E + E KE+E I+ Q Q
Sbjct: 202 DYAFEYYQYIKELKMSKDEIKREY-KEMEGSPEIKSKRRQFHQ 243


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1595  FLGHOOKAP1    30     0.015    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 29.5 bits (66), Expect = 0.015
Identities = 6/40 (15%), Positives = 15/40 (37%)

Query: 2 NGLYIGSMGMMNYMQHINVHSNNVANAQTTGFKAENMTSK 41
+ + G+ +N SNN+++ G+ +
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMA 41


89  BcerKBAB4_1738  BcerKBAB4_1743  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_1738  0       11       -1.032807  2-dehydropantoate 2-reductase
BcerKBAB4_1739  1       10       -1.487675  hypothetical protein
BcerKBAB4_1740  0       11       -1.175896  amino acid permease
BcerKBAB4_1741  1       13       -1.720622  hemolytic enterotoxin
BcerKBAB4_1742  1       12       -1.871848  hemolytic enterotoxin
BcerKBAB4_1743  1       16       -2.755744  hemolytic enterotoxin
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1738  NUCEPIMERASE  32     0.003    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 31.7 bits (72), Expect = 0.003
Identities = 18/83 (21%), Positives = 32/83 (38%), Gaps = 14/83 (16%)

Query: 1 MRILVLGAGG-VGGFFGGRLVEKGEDVTFL----------VRSKRKQQLEEKGLVIRSVN 49
M+ LV GA G +G RL+E G V + ++ R + L + G
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQF--HK 58

Query: 50 GDFSFQPKLVTKEDRTTPFDVIL 72
D + + +T + F+ +
Sbjct: 59 IDLADRE-GMTDLFASGHFERVF 80


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1741  PF05844       30     0.018    YopD protein
		>PF05844#YopD protein

Length = 295

Score = 29.6 bits (66), Expect = 0.018
Identities = 24/145 (16%), Positives = 51/145 (35%), Gaps = 22/145 (15%)

Query: 182 SIKADEAI-KTLQGSSGDIVKLREDIKRIQGEIQAELTTILNRPQEIIKGSINIGKQVFT 240
+I ++ + K + G + I + + + E + + + Q ++ + F
Sbjct: 148 AISQEKTLQKNIDGRNELIDAKMQALGKTSDEDRKIVGKVWAADQAQDSVALRAAGRAFE 207

Query: 241 ITNQTAQTKTIDFVSIGTLSNEIVNA------ADSQTREAALRIQQKQK----------- 283
N Q S ++N V A ++ E I Q QK
Sbjct: 208 SRNGALQVANTVIQSFVQMANASVQVRQGESQASAREEEVNATIGQSQKQKVEDQMSFDA 267

Query: 284 ----ELLPLIQKLSQTESEATQITF 304
++L LIQ+ +Q+ ++A +
Sbjct: 268 GFMKDVLQLIQQYTQSHNQAWRAAA 292


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1742  FLAGELLIN     30     0.027    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 29.6 bits (66), Expect = 0.027
Identities = 27/275 (9%), Positives = 65/275 (23%), Gaps = 6/275 (2%)

Query: 92 VSSVDAALKGKVIQHQDTARGNAKQWLDVLKPQLISTNQNIINYNTKFQ-----NYYDTL 146
+ D K + DT A ++ + + T+ K T
Sbjct: 180 ATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTD 239

Query: 147 VAAVDAKDKATLTKGLTRLSSSINENKAQVDQLVEDLKKFRNKMTSDTQNFKGDANQITS 206
A + T T ++ + E +T G+
Sbjct: 240 DAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKV 299

Query: 207 ILASQDAGIPLLQNQITTYNEAISKYNAIIIGSSVATALGPIAIIGGAVVIATGAGTPLG 266
+ L IT + + + + + + L
Sbjct: 300 STTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLE 359

Query: 267 VALIAGGAAAVGGGTAGIVLAKKELDNAQAEIQKITGQVTTAQLEVAGLTNIKTQTEYLT 326
G + + A D + + T + + + +
Sbjct: 360 ANNAVKGESKITVNGAEYTANAAG-DKVTLAGKTMFIDKTASGVSTLINEDAAAAKKSTA 418

Query: 327 NTIDTAITALQNISNQWYTMGSKYNSLLQNVDSIS 361
N + + +AL + ++G+ N + ++
Sbjct: 419 NPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLG 453


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1743  GPOSANCHOR    29     0.036    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 28.9 bits (64), Expect = 0.036
Identities = 45/326 (13%), Positives = 98/326 (30%), Gaps = 24/326 (7%)

Query: 35 IKTLQESAKNYSLGPAGLQDVMAQTTSSIFAMDSYAKLIQNQQETDLSKISSINGNLRGN 94
TL+ + S L+D + T + + SKI +
Sbjct: 66 NNTLKLKNSDLSFNNKALKDHNDELTEELSNAKEKLRKNDKSLSEKASKIQELEARKADL 125

Query: 95 --MIQHQKDAKINAAYWLNNMKPQIMKTDQNIIDYNNTFQAYYSDMLLAIDQKDSVKLKA 152
++ + + + ++ + D + + D L+A
Sbjct: 126 EKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNF--STADSAKIKTLEA 183

Query: 153 DLEKLYADILKNQNEVDVLLGNLKAFRDRMAKDTNSFKEDTNQLTAILASTNAGIPALEQ 212
+ L A + + L F + + + + L A A +
Sbjct: 184 EKAALEARQAELEKA----LEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMN 239

Query: 213 QINTYNDSIKKSNDMVIAGGVLCVALITCLAGGPMIAIAKKDIANAEREIASLKNRISGA 272
+ IK A L A +K + A + +I
Sbjct: 240 FSTADSAKIKTLE-----------AEKAALE--ARQAELEKALEGAMNFSTADSAKIKTL 286

Query: 273 QAEVAILTDVKNKTTNMTETIDAAITALQ---NISNQWYTVGAKYNNLLQNVKGISPEEF 329
+AE A L K + ++ ++A +L+ + S + + L+ IS
Sbjct: 287 EAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASR 346

Query: 330 TFIKEDLNTAKDSWKDVKDYTEKLHE 355
++ DL+ ++++ K ++ +KL E
Sbjct: 347 QSLRRDLDASREAKKQLEAEHQKLEE 372


90  BcerKBAB4_1824  BcerKBAB4_1829  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_1824  -4      14       -0.758505  two component transcriptional regulator
BcerKBAB4_1825  -3      14       -1.003078  histidine kinase
BcerKBAB4_1826  -1      13       -0.801092  3-ketoacyl-ACP reductase
BcerKBAB4_1827  0       16       -0.935637  O-methyltransferase family protein
BcerKBAB4_1828  1       13       -1.343594  adenylyltransferase
BcerKBAB4_1829  -1      12       0.110700   polysaccharide deacetylase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1824  HTHFIS        98     8e-26    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 98.0 bits (244), Expect = 8e-26
Identities = 36/125 (28%), Positives = 62/125 (49%), Gaps = 1/125 (0%)

Query: 3 PKILIVDDDPHIRELVSVFLEREGFQTYEAVDGLDALRKIEEVKVEMAIIDIMMPNMDGF 62
IL+ DDD IR +++ L R G+ + R I ++ + D++MP+ + F
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 63 DLCYELRKYY-DIPILMLTAKGETSQKVKGFHLGTDDYLVKPFDPLELVVRVKALLKRYQ 121
DL ++K D+P+L+++A+ +K G DYL KPFD EL+ + L +
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 122 ITVSQ 126
S+
Sbjct: 124 RRPSK 128


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1825  PF06580       38     5e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 37.5 bits (87), Expect = 5e-05
Identities = 23/105 (21%), Positives = 39/105 (37%), Gaps = 26/105 (24%)

Query: 255 LIHNSIKF----TPNGGMITIHLKEHKEFLEVSIHDTGIGISEEQKQHIFERFYKADSSR 310
L+ N IK P GG I + + + + + +TG + K+
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKE------------- 309

Query: 311 NRAYGGSGLGLAIVKKVLDLHQGK---IKVESVEGNGTEFIVRIP 352
+G GL V++ L + G IK+ +G +V IP
Sbjct: 310 -----STGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNA-MVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1826  DHBDHDRGNASE  108    6e-31    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 108 bits (272), Expect = 6e-31
Identities = 65/208 (31%), Positives = 111/208 (53%)

Query: 2 AELLQGKNALITGAGRGIGRAVAIALAKEGVNVGLLARSEENLKAVAKEVEAEGVKAVIA 61
A+ ++GK A ITGA +GIG AVA LA +G ++ + + E L+ V ++AE A
Sbjct: 3 AKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAF 62

Query: 62 TADVSSYEEVTTAIETLKNGLGSIDILINNAGISKFGKFLELEVADWEKIIQVNLMGVYY 121
ADV + ++ +G IDIL+N AG+ + G L +WE VN GV+
Sbjct: 63 PADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFN 122

Query: 122 ATRAALPSMIEQQSGDIINISSTAGQKGAPVTSAYSASKFGVLGLTESLAMEVRKHNIRV 181
A+R+ M++++SG I+ + S +AY++SK + T+ L +E+ ++NIR
Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 182 TALTPSTVATDMAVDLGLTDGNPDKVMQ 209
++P + TDM L + ++V++
Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIK 210


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1829  THERMOLYSIN   28     0.039    Thermolysin metalloprotease (M4) family signature.
		>THERMOLYSIN#Thermolysin metalloprotease (M4) family signature.

Length = 544

Score = 28.4 bits (63), Expect = 0.039
Identities = 17/66 (25%), Positives = 25/66 (37%), Gaps = 2/66 (3%)

Query: 176 FIRPPYGEILE--NQLKWATEQNFMIVQWSVDTVDWKGVSADTITNNVLGNSFPGSVILQ 233
I G++L NQ+ A V + +GV D N +S+ G LQ
Sbjct: 200 MIDAADGKVLNKWNQMDEAKPGGAQPVAGTSTVGVGRGVLGDQKYINTTYSSYYGYYYLQ 259

Query: 234 HSTPGG 239
+T G
Sbjct: 260 DNTRGS 265


91  BcerKBAB4_1998  BcerKBAB4_2004  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_1998  -1      15       0.803429   hypothetical protein
BcerKBAB4_1999  -1      14       -0.471664  metal dependent phosphohydrolase
BcerKBAB4_2000  -3      15       -0.455922  ABC transporter
BcerKBAB4_2001  -2      14       -0.268001  hypothetical protein
BcerKBAB4_2002  -2      13       -0.666233  hypothetical protein
BcerKBAB4_2003  -1      14       -0.137831  single-stranded DNA-binding protein
BcerKBAB4_2004  0       15       -0.285621  TetR family transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1998  IGASERPTASE   31     0.003    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 31.2 bits (70), Expect = 0.003
Identities = 17/116 (14%), Positives = 32/116 (27%), Gaps = 2/116 (1%)

Query: 10 PQQPLYPQHQEQRYAEQYEEQETQHQEPQYEQNPYATPQNQETEYPQNPYDTRPNYEYPQ 69
+ E+ + E E + P+ + ET PQ +
Sbjct: 1097 TTETKETATVEKE-EKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNI 1155

Query: 70 NPYVAQPSQEPQFQGNPYETQPQYQQNPYQQQMYQPNYDARVSPPKPPTIDPTQPQ 125
+Q + P + + P + ++ V P+ T TQP
Sbjct: 1156 KEPQSQTNTTAD-TEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPT 1210


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_1999  ARGDEIMINASE  28     0.033    Bacterial arginine deiminase signature.
		>ARGDEIMINASE#Bacterial arginine deiminase signature.

Length = 409

Score = 27.9 bits (62), Expect = 0.033
Identities = 8/38 (21%), Positives = 19/38 (50%)

Query: 57 LHDVADEKLNESEEAGMKKVSDWLEELRVEEEESKHVL 94
+ D+ E L S K +S ++ E ++ + + ++L
Sbjct: 72 IEDLISEVLVSSVALENKFISQFILEAEIKTDFTINLL 109


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_2000  TYPE4SSCAGX   30     0.030    Type IV secretion system CagX conjugation protein si...
		>TYPE4SSCAGX#Type IV secretion system CagX conjugation protein signature.

Length = 522

Score = 30.1 bits (67), Expect = 0.030
Identities = 33/125 (26%), Positives = 59/125 (47%), Gaps = 23/125 (18%)

Query: 513 EGEVREFLGSYTEYLEMEKTRELI-----------EKAEVQKEKKVVEEAPKQQRKRKLS 561
E E F +Y E KT++LI +K ++KEK+ E+A K Q+ ++
Sbjct: 109 EKEAVNFALMTRDYQEFLKTKKLIVDAPDPKELEEQKKALEKEKEAKEQAQKAQKDKREK 168

Query: 562 YNEQREWETIEDTIAQLEEKLESIGEELTNVGSDFTKAQELSE-AGQKIEEELEKTMERW 620
E+R A+ LE++ ++N + + + LSE Q+ E EL++ MER
Sbjct: 169 RKEER---------AKNRANLENLTNAMSN-PQNLSNNKNLSELIKQQRENELDQ-MERL 217

Query: 621 SELSD 625
++ +
Sbjct: 218 EDMQE 222


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_2004  HTHTETR       75     6e-19    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 75.0 bits (184), Expect = 6e-19
Identities = 34/201 (16%), Positives = 73/201 (36%), Gaps = 13/201 (6%)

Query: 3 RETRKKELKELIFLKAVQLFQERGYENVTVQDITTACGIAKGTFFNYFPKKENILLFLGD 62
+ +E ++ I A++LF ++G + ++ +I A G+ +G + +F K ++ + +
Sbjct: 5 TKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWE 64

Query: 63 SQIELWNESLKTYENVEH--PKERIKLVLGDLLDRFTGHGELMKHAVFEIIKSNYLVENE 120
E Y+ P ++ +L +L+ T E + + EII E
Sbjct: 65 LSESNIGELELEYQAKFPGDPLSVLREILIHVLE-STVTEERRR-LLMEIIFHKCEFVGE 122

Query: 121 LKSIQQLQ--------ESLSSIITTAKETGKLNSQWDINIITSTIMSTYFYTLMSQSLLN 172
+ +QQ Q + + + E L + + Y LM L
Sbjct: 123 MAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRG-YISGLMENWLFA 181

Query: 173 SNETNAKNILNQQLDVVWEGI 193
+ K + ++ E
Sbjct: 182 PQSFDLKKEARDYVAILLEMY 202


92  BcerKBAB4_2174  BcerKBAB4_2184  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_2174  -2      11       0.974976   major facilitator transporter
BcerKBAB4_2175  -2      11       0.964282   2,3-dihydroxybenzoate-2,3-dehydrogenase
BcerKBAB4_2176  -3      11       0.951494   isochorismate synthase DhbC
BcerKBAB4_2177  -3      12       0.784366   2,3-dihydroxybenzoate-AMP ligase
BcerKBAB4_2178  -3      12       0.351245   isochorismatase
BcerKBAB4_2179  -3      12       0.317079   amino acid adenylation domain-containing
BcerKBAB4_2180  -1      16       -1.856026  MbtH domain-containing protein
BcerKBAB4_2181  -1      16       -2.269886  EmrB/QacA family drug resistance transporter
BcerKBAB4_2182  0       17       -1.622992  4'-phosphopantetheinyl transferase
BcerKBAB4_2183  -1      17       -2.546152  hypothetical protein
BcerKBAB4_2184  0       17       -2.229220  histone family protein DNA-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_2174  TCRTETA       45     4e-07    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 44.8 bits (106), Expect = 4e-07
Identities = 39/195 (20%), Positives = 82/195 (42%), Gaps = 8/195 (4%)

Query: 206 MLGTKQVYLLFIMLFTSCMSGLYLIGMVKDIGVQLVGLSAATAANAVAMVAIFNTLGRI- 264
M + + ++ + + G+ LI V ++ + S A+ ++A++ +
Sbjct: 1 MKPNRPLIVILSTVALDAV-GIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFAC 59

Query: 265 --ILGPLSDKIGRLKIVTGTFVAMAASVLVLSFVDLNYGIYFVCVASVAFCFGGNITIFP 322
+LG LSD+ GR ++ + A +++ + +Y + VA G +
Sbjct: 60 APVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRI--VAGITGATGAVAG 117

Query: 323 AIVGDFYGMKNHSKNYGIVYQGFGFGALAGSFIGAVLGGFKP--TFMVIGVLCVVSFIIA 380
A + D ++++G + FGFG +AG +G ++GGF P F L ++F+
Sbjct: 118 AYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTG 177

Query: 381 LLIQVPKQKKEKEEE 395
+ K E+
Sbjct: 178 CFLLPESHKGERRPL 192



Score = 35.6 bits (82), Expect = 3e-04
Identities = 50/317 (15%), Positives = 104/317 (32%), Gaps = 38/317 (11%)

Query: 63 FASKLQEKWGLRKLIMIAGLALGIGLILSSQASSLLMLYVLAGVVVGYADGT-------- 114
L +++G R +++++ + + + A L +LY + +V G T
Sbjct: 62 VLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLY-IGRIVAGITGATGAVAGAYI 120

Query: 115 AYITSLSNLIKWFPKRKGLIAGISVSAYGSGSLIFKYINAQLIESVGVSQAFIYWGLIVT 174
A IT + F G + +G G + + + F +
Sbjct: 121 ADITDGDERARHF--------GFMSACFGFGMVAGPVLGGLMGG-FSPHAPFFAAAALNG 171

Query: 175 AMIVLGACLI---HQAADQGAVHETKTQEYTTKEMLGTKQVYLLFIMLFTSCMSGLYLIG 231
+ G L+ H+ + E + + G V L + F + G
Sbjct: 172 LNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAA 231

Query: 232 MVKDIGVQLVGLSAATAANAVAMVAIFNTLGRIIL-GPLSDKIGRLKIVTGTFVAMAASV 290
+ G A T ++A I ++L + ++ GP++ ++G + + +A
Sbjct: 232 LWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGY 291

Query: 291 LVLSFVDLNYGIYFVCVASVAFCFGGNITIFPAIVGDFYGMKNHSKNYGIVYQGFGFGAL 350
++L+F + + PA+ S+ QG G+L
Sbjct: 292 ILLAFATRGW-----MAFPIMVLLASGGIGMPALQAML------SRQVDEERQGQLQGSL 340

Query: 351 A-----GSFIGAVLGGF 362
A S +G +L
Sbjct: 341 AALTSLTSIVGPLLFTA 357



Score = 35.6 bits (82), Expect = 3e-04
Identities = 24/151 (15%), Positives = 60/151 (39%), Gaps = 7/151 (4%)

Query: 8 PWLVVLGTVIVQMGLGTIYTWSLFNQPLVSKYGWSLNAVAITFSITSLSLA-FSTLFASK 66
L+ + ++ +G W +F + ++ W + I+ + + + +
Sbjct: 213 AALMAVFFIMQLVGQVPAALWVIFGE---DRFHWDATTIGISLAAFGILHSLAQAMITGP 269

Query: 67 LQEKWGLRKLIMIAGLALGIGLILSSQASSLLMLYVLAGVVVGYADGT-AYITSLSNLIK 125
+ + G R+ +M+ +A G G IL + A+ M + + ++ G A LS +
Sbjct: 270 VAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASGGIGMPALQAMLSRQVD 329

Query: 126 WFPKRKGLIAGISVSAYGSGSLIFKYINAQL 156
+R+G + G + S++ + +
Sbjct: 330 --EERQGQLQGSLAALTSLTSIVGPLLFTAI 358


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_2175  DHBDHDRGNASE  330    e-117    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 330 bits (846), Expect = e-117
Identities = 163/260 (62%), Positives = 195/260 (75%)

Query: 1 MNLGEFDGKTVLVTGAAQGIGSVVAKMFLERGATVIAVDQNEEGLNVFLNENELNETRMK 60
MN +GK +TGAAQGIG VA+ +GA + AVD N E L ++ + +
Sbjct: 1 MNAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAE 60

Query: 61 TFHLDVSDSTAVEDMVNGIENDIAPIDILINVAGVLRMGPIHSLSDEDWNKTFSVNSTGV 120
F DV DS A++++ IE ++ PIDIL+NVAGVLR G IHSLSDE+W TFSVNSTGV
Sbjct: 61 AFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGV 120

Query: 121 FYMSRAVSKHMMQRRSGAIVTVGSNAANTPRMEMAAYAASKAATTMFMKCLGLELAAYNI 180
F SR+VSK+MM RRSG+IVTVGSN A PR MAAYA+SKAA MF KCLGLELA YNI
Sbjct: 121 FNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNI 180

Query: 181 RCNLVSPGSTETEMQRLLWADENGAENIIAGSQNTYRLGIPLQKIAQPSEIAEAVLFLAS 240
RCN+VSPGSTET+MQ LWADENGAE +I GS T++ GIPL+K+A+PS+IA+AVLFL S
Sbjct: 181 RCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVS 240

Query: 241 DKASHITMHNLCVDGGATLG 260
+A HITMHNLCVDGGATLG
Sbjct: 241 GQAGHITMHNLCVDGGATLG 260


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_2178  ISCHRISMTASE  393    e-140    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 393 bits (1010), Expect = e-140
Identities = 175/306 (57%), Positives = 234/306 (76%), Gaps = 9/306 (2%)

Query: 1 MAIPSISVYKMPIESELPKNKVNWTPNPKRAVLLIHDMQEYFLDAYSDTESPKVELISNI 60
MAIP+I Y+MP S++P+NKV+W P+P RAVLLIHDMQ YF+DA++ SP EL +NI
Sbjct: 1 MAIPAIQPYQMPTASDMPQNKVSWVPDPNRAVLLIHDMQNYFVDAFTAGASPVTELSANI 60

Query: 61 KVIRERCKELGIPVVYTAQPGGQTLEQRGLLQDFWGDGIPAGPDKKKIVDELTPDEDDIF 120
+ ++ +C +LGIPVVYTAQPG Q + R LL DFWG G+ +GP ++KI+ EL P++DD+
Sbjct: 61 RKLKNQCVQLGIPVVYTAQPGSQNPDDRALLTDFWGPGLNSGPYEEKIITELAPEDDDLV 120

Query: 121 LTKWRYSAFKKTNLLEILNEQGRDQLIICGIYAHIGCLLTACEAFMDGIQPFFVADAVAD 180
LTKWRYSAFK+TNLLE++ ++GRDQLII GIYAHIGCL+TACEAFM+ I+ FFV DAVAD
Sbjct: 121 LTKWRYSAFKRTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVAD 180

Query: 181 FSLEHHKQALQYASNRCAVTTSTNLLLKDLQSLKGD---------ESEGITLQEVHELVA 231
FSLE H+ AL+YA+ RCA T T+ LL LQ+ D + T + + + +A
Sbjct: 181 FSLEKHQMALEYAAGRCAFTVMTDSLLDQLQNAPADVQKTSANTGKKNVFTCENIRKQIA 240

Query: 232 QLLRESVESIEVDEDLLNRGLDSVRIMSLVEKWRREGKGITFADLAERPTVDDWYRLLSS 291
+LL+E+ E I EDLL+RGLDSVRIM+LVE+WRREG +TF +LAERPT+++W +LL++
Sbjct: 241 ELLQETPEDITDQEDLLDRGLDSVRIMTLVEQWRREGAEVTFVELAERPTIEEWQKLLTT 300

Query: 292 QAAQVL 297
++ QVL
Sbjct: 301 RSQQVL 306


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_2181  TCRTETB       125    1e-33    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 125 bits (315), Expect = 1e-33
Identities = 91/398 (22%), Positives = 173/398 (43%), Gaps = 14/398 (3%)

Query: 20 FMAAMDATIVNVALQTISKELQVPPSAMGTVNVGYLVSLAIFLPISGWLGDRFGTKKVFL 79
F + ++ ++NV+L I+ + PP++ VN ++++ +I + G L D+ G K++ L
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 80 TALFVFTIASALCGIANDITSLNIF-RIIQGAGGGLLTPVGMAMLFRTFSPEERPKISRF 138
+ + S + + + SL I R IQGAG + M ++ R E R K
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143

Query: 139 IVLPIAVAPAIGPIIGGFFVDQMSWRWAFYINLPFGIIALLFGLLFLAEHIEKSAGRFDS 198
I +A+ +GP IGG + W++ + +P I + L+ L + + G FD
Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHYIH--WSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDI 201

Query: 199 LGFVLSAPGFAMLIYALSQGPSKGWVSPEIISIGIAGIVLLTLFIIVELKVKQPMLDLRL 258
G +L + G + + IS I ++ +F+ KV P +D L
Sbjct: 202 KGIILMSVGIVFFMLF---------TTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGL 252

Query: 259 LKEPVFRKMSLISLFSSAGLLGMLFIFPLMYQNVIGVSALESG-LTTFPEAIGLMISSQI 317
K F L + G + + P M ++V +S E G + FP + ++I I
Sbjct: 253 GKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYI 312

Query: 318 VPWSYKKLGARKVISIGLVSTAVIFILLSFVNHDTNPWQIRALLFGIGIFLGQSVGAVQF 377
+ G V++IG+ +V F+ SF+ T+ + ++F +G L + +
Sbjct: 313 GGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLG-GLSFTKTVIST 371

Query: 378 SAFNNIAPPSMGRATTIFNVQNRLGSAIGVAVLASILS 415
+++ G ++ N + L G+A++ +LS
Sbjct: 372 IVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLS 409


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_2182  ENTSNTHTASED  38     1e-05    Enterobactin synthetase component D signature.
		>ENTSNTHTASED#Enterobactin synthetase component D signature.

Length = 234

Score = 38.5 bits (89), Expect = 1e-05
Identities = 28/126 (22%), Positives = 53/126 (42%), Gaps = 17/126 (13%)

Query: 53 RARFIIGCVISRLVLGKVLSMSPVQVPIDRMCPVCKLQHGRPQLPEGMPQISVSHSGEWV 112
+A + G + + L + + + V D+ +P P+G+ S+SH
Sbjct: 47 KAEHLAGRIAAVHALRE-VGVRTVPGMGDK---------RQPLWPDGLFG-SISHCATTA 95

Query: 113 VVAFTKSAPVGVDVEQMNPNVDVMKMAEGVLTDIE--IAQVMKLPDEQRLEGFLTYWTRK 170
+ A +G+D+E++ ++A ++ E I Q LP L L + + K
Sbjct: 96 L-AVISRQRIGIDIEKIMSQHTATELAPSIIDSDERQILQASLLPFPLALT--LAF-SAK 151

Query: 171 EAVLKA 176
E+V KA
Sbjct: 152 ESVYKA 157


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_2184  DNABINDINGHU  124    6e-41    Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 124 bits (312), Expect = 6e-41
Identities = 56/89 (62%), Positives = 74/89 (83%)

Query: 2 NKTELVKNVAQSADISQKDASAAVQSVFDTIANALQSGDKVQLIGFGTFEVRERSARTGR 61
NK +L+ VA++ ++++KD++AAV +VF +++ L G+KVQLIGFG FEVRER+AR GR
Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62

Query: 62 NPQTGEEIQIAAGKVPAFKAGKELKEAVK 90
NPQTGEEI+I A KVPAFKAGK LK+AVK
Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAVK 91


93  BcerKBAB4_2219  BcerKBAB4_2225  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_2219   0      17       -4.007199  TetR family transcriptional regulator
BcerKBAB4_2220   1      16       -3.735788  MMPL domain-containing protein
BcerKBAB4_2221   4      17       -4.483727  hypothetical protein
BcerKBAB4_2222   1      15       -3.688211  chloramphenicol O-acetyltransferase
BcerKBAB4_2223   1      16       -3.423355  N-acetyltransferase GCN5
BcerKBAB4_2224   1      14       -3.575210  N-acetyltransferase GCN5
BcerKBAB4_2225   1      12       -2.930376  N-acetyltransferase GCN5
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
BcerKBAB4_2219  HTHTETR  79     3e-20    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 78.5 bits (193), Expect = 3e-20
Identities = 40/190 (21%), Positives = 75/190 (39%), Gaps = 12/190 (6%)

Query: 18 KSTKEIILEVATRLFLTQNYQVVSMDEVAKECGVTKATVYYYYSTKADLFTATMIQMMVR 77
+ T++ IL+VA RLF Q S+ E+AK GVT+ +Y+++ K+DLF+
Sbjct: 10 QETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESN 69

Query: 78 IRENMDQILS-TNKTLEERLLNFATVYLHATMDIDMNNFMKDAKLSLSEEQLKEL----- 131
I E + + L L +T+ + + + + E + E+
Sbjct: 70 IGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEI-IFHKCEFVGEMAVVQQ 128

Query: 132 --KNAEDNMYEVLEKALDNAILLGEIPKG-NPKFAAHAFVALLS--IGNFKDENHNPTLA 186
+N Y+ +E+ L + I +P + AA +S + N+ + L
Sbjct: 129 AQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDLK 188

Query: 187 NIDELAKEIV 196
I+
Sbjct: 189 KEARDYVAIL 198


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_2220  ACRIFLAVINRP  53     3e-09    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 53.3 bits (128), Expect = 3e-09
Identities = 40/232 (17%), Positives = 90/232 (38%), Gaps = 25/232 (10%)

Query: 203 LLVATVLLVLVLLILLYRSPILAILPLLVVGFAYGIISPTLGFLADNGWIKVDAQAISIM 262
L A +L+ LV+ + L ++ ++P + V + T LA G+ +I+ +
Sbjct: 344 LFEAIMLVFLVMYLFL-QNMRATLIPTIAVPVV---LLGTFAILAAFGY------SINTL 393

Query: 263 T----VLLFGAGTDYCLFLISRYKEYLLEEESKYK-ALQLAIKASGGAIIMSALTVVLGL 317
T VL G D + ++ + ++E++ K A + ++ GA++ A+ +
Sbjct: 394 TMFGMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVF 453

Query: 318 GTLLL--AHYGAFHR-FAVPFSVAVFIMGIAALTILPALLLIFGRVVFFPFIPRTAEMNE 374
+ GA +R F++ A+ + + AL + PAL + P +AE +E
Sbjct: 454 IPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLK-------PVSAEHHE 506

Query: 375 EFARKKKKVVKVKNTKGFFSKKLGDIVVRKPWTIIMLTVFLLGGLASFVPRI 426
+ ++ +++ ++ G+ R+
Sbjct: 507 NKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRL 558



Score = 38.7 bits (90), Expect = 1e-04
Identities = 29/161 (18%), Positives = 67/161 (41%), Gaps = 9/161 (5%)

Query: 203 LLVATVLLVLVLLILLYRSPILAILPLLVVGFAYGIISPTLGFLADNGWIKVDAQAISIM 262
L+ + ++V + L LY S + + +LVV I+ L N V +
Sbjct: 875 LVAISFVVVFLCLAALYESWSIPVSVMLVVPLG--IVGVLLAATLFNQKNDVYFM---VG 929

Query: 263 TVLLFGAGTDYCLFLISRYKEYLLEE-ESKYKALQLAIKASGGAIIMSALTVVLGLGTLL 321
+ G + ++ K+ + +E + +A +A++ I+M++L +LG+ L
Sbjct: 930 LLTTIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLA 989

Query: 322 LAH---YGAFHRFAVPFSVAVFIMGIAALTILPALLLIFGR 359
+++ GA + + + + A+ +P ++ R
Sbjct: 990 ISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVIRR 1030


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_2224  SACTRNSFRASE  36     4e-05    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 36.5 bits (84), Expect = 4e-05
Identities = 24/100 (24%), Positives = 41/100 (41%), Gaps = 6/100 (6%)

Query: 49 YSSVEMMKYLIEELD--TYKVIMDEKVIGGIIVTISGKSYGRIDRIFVEPFLQGKGIGSR 106
Y +M +EE + ++ IG I + + Y I+ I V + KG+G+
Sbjct: 50 YEDDDMDVSYVEEEGKAAFLYYLENNCIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTA 109

Query: 107 VIKLIEE---KFPNIRIWDLETSSRQINNHHFYKKMGYEI 143
++ E + + LET I+ HFY K + I
Sbjct: 110 LLHKAIEWAKENHFCGLM-LETQDINISACHFYAKHHFII 148


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_2225  SACTRNSFRASE  37     1e-05    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 37.2 bits (86), Expect = 1e-05
Identities = 25/91 (27%), Positives = 35/91 (38%), Gaps = 11/91 (12%)

Query: 54 FVAEYDGEVVGFVGLTQSPGRRSHSGDLFIGVDSEYHNKGIGKALLTKMLDLADNWLMLE 113
F+ + +G + + + + D I V +Y KG+G AL L A W E
Sbjct: 68 FLYYLENNCIGRIKIRSNWNGYALIED--IAVAKDYRKKGVGTAL----LHKAIEW-AKE 120

Query: 114 RVELGV-LET---NPRAKVLYEKFGFEEEGV 140
G+ LET N A Y K F V
Sbjct: 121 NHFCGLMLETQDINISACHFYAKHHFIIGAV 151


94  BcerKBAB4_2351  BcerKBAB4_2357  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_2351  -1      13       -0.918271  ABC transporter
BcerKBAB4_2352  -1      14       -0.927672  ABC-2 type transporter
BcerKBAB4_2353  -1      12       -0.220869  TetR family transcriptional regulator
BcerKBAB4_2354  -1      13        0.161708  hypothetical protein
BcerKBAB4_2355  -1      15        1.442893  acyl-CoA dehydrogenase domain-containing
BcerKBAB4_2356  -2      14        1.612431  acetyl-CoA carboxylase biotin carboxylase
BcerKBAB4_2357  -2      14        1.721044  acetyl-CoA carboxylase biotin carboxyl carrier
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
BcerKBAB4_2351  PF05272  30     0.011    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.7 bits (66), Expect = 0.011
Identities = 17/65 (26%), Positives = 27/65 (41%), Gaps = 10/65 (15%)

Query: 36 LVGPSGSGKTTLIKMIAGINESTTGDVIVFNTNMPNLNEMKRIGYMAQADALYE--ELSA 93
L G G GK+TLI + G++ F+ ++ K YE E++A
Sbjct: 601 LEGTGGIGKSTLINTLVGLD--------FFSDTHFDIGTGKDSYEQIAGIVAYELSEMTA 652

Query: 94 YENAD 98
+ AD
Sbjct: 653 FRRAD 657


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_2352  ABC2TRNSPORT  51     1e-09    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 51.5 bits (123), Expect = 1e-09
Identities = 39/163 (23%), Positives = 74/163 (45%), Gaps = 9/163 (5%)

Query: 166 SFVRERLSGTLERLLSTPVRRWEIVLGYIIGFGIFAFIQSIIIVSFSVYILDLYVAGSIW 225
+F R T E +L T +R +IVLG + A + I + + Y
Sbjct: 90 AFGRMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGAGIGVVAAALG--YTQW--L 145

Query: 226 LTLLITCMLSLTAL---TLGTFLSAYANNEFQMIQFIPLVIVPQVFFSG-LFPMESMNTW 281
L +++LT L +LG ++A A + I + LVI P +F SG +FP++ +
Sbjct: 146 SLLYALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPIV 205

Query: 282 LQMLGKLFPLTYGADAMRQVMIRNQGFTEIALDLTVLLLFSLL 324
Q + PL++ D +R +M+ + ++ + L ++ ++
Sbjct: 206 FQTAARFLPLSHSIDLIRPIMLGHPV-VDVCQHVGALCIYIVI 247


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
BcerKBAB4_2353  HTHTETR  84     3e-22    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 84.3 bits (208), Expect = 3e-22
Identities = 32/182 (17%), Positives = 74/182 (40%), Gaps = 5/182 (2%)

Query: 16 DKRNERQMRILEAAVDMFGEKGYASTSTSEIAKRAGVAEGTIFRYYKTKKDLLFAVVMPT 75
+ E + IL+ A+ +F ++G +STS EIAK AGV G I+ ++K K DL + +
Sbjct: 7 QEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELS 66

Query: 76 LTKFAAPFFVQAFAKEIFKTEYESYEVLLRVIIQNRFEFAKKHFPMIKILIQEVPFQPEL 135
+ + + +L ++++ ++ +++I+ + F E+
Sbjct: 67 ESNIGE--LELEYQAKFPGDPLSVLREILIHVLESTVTEERRRL-LMEIIFHKCEFVGEM 123

Query: 136 KSEIQ--QLIETELLVHFKKLIAKFQEEGEIIELPPSSVLRLTLSAVLGLLLTRFLLLPE 193
Q + + E ++ + E + + + + + L+ +L P+
Sbjct: 124 AVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQ 183

Query: 194 EK 195

Sbjct: 184 SF 185


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_2356  PHPHTRNFRASE  34     0.001    Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 34.4 bits (79), Expect = 0.001
Identities = 22/80 (27%), Positives = 33/80 (41%), Gaps = 6/80 (7%)

Query: 97 EEGIVFIGPSEEIITKMGSKIESRIAMQA--ADVPVVPGITTNIETAEEAIEIAKQIGYP 154
EGIV + P+EE + K + + A + P T + +E+A IG P
Sbjct: 224 IEGIVIVNPTEEEVKAYEEKRAAFEKQKQEWAKLVGEPSTTKD----GAHVELAANIGTP 279

Query: 155 LMLKASAGGGGIGMQLMETE 174
+ GG G+ L TE
Sbjct: 280 KDVDGVLANGGEGIGLYRTE 299


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits       Score  E-value  Comments
BcerKBAB4_2357  RTXTOXIND  32     1e-04    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 31.7 bits (72), Expect = 1e-04
Identities = 13/30 (43%), Positives = 19/30 (63%)

Query: 42 IVSEEAGTVMKINVQEGDFVNEGDVLLEIE 71
I E V +I V+EG+ V +GDVLL++
Sbjct: 99 IKPIENSIVKEIIVKEGESVRKGDVLLKLT 128


95  BcerKBAB4_2662  BcerKBAB4_2672  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_2662  -1      14       -2.330113  methyltransferase type 11
BcerKBAB4_2663   0      13       -2.677428  hypothetical protein
BcerKBAB4_2664   0      15       -2.760933  x-prolyl-dipeptidyl aminopeptidase
BcerKBAB4_2665   0      14       -5.006083  histidine kinase
BcerKBAB4_2666   1      16       -3.967860  two component transcriptional regulator
BcerKBAB4_2667   0      16       -4.139698  group-specific protein
BcerKBAB4_2668   0      16       -3.698643  hypothetical protein
BcerKBAB4_2669  -1      17       -3.057380  N-acetyltransferase GCN5
BcerKBAB4_2670   0      15       -3.920000  hydrolase
BcerKBAB4_2671   2      16       -3.755967  bifunctional
BcerKBAB4_2672   0      14       -3.395948  peptidase M3A and M3B thimet/oligopeptidase F
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_2662  DHBDHDRGNASE  30     0.006    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 30.4 bits (68), Expect = 0.006
Identities = 10/49 (20%), Positives = 17/49 (34%), Gaps = 5/49 (10%)

Query: 46 VGSGR-----VIIPFLEAGFKVDGIDYSPEMLDSCRMRCKERGLHPNLY 89
G+ + V G + +DY+PE L+ K H +
Sbjct: 14 TGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAF 62


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
BcerKBAB4_2665  PF06580  36     2e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 36.4 bits (84), Expect = 2e-04
Identities = 36/182 (19%), Positives = 71/182 (39%), Gaps = 38/182 (20%)

Query: 282 IIKQTDQISNLIEELLRFSKLERDILQKEEFPIEPLVQSI--IDKHKIELESKEL--KLQ 337
I++ + ++ L S+L R L+ L + +D + ++L S + +LQ
Sbjct: 186 ILEDPTKAREMLTSL---SELMRYSLRYSNARQVSLADELTVVDSY-LQLASIQFEDRLQ 241

Query: 338 VNYSVGDTIVYADLNKMRMVFQNLISNAIKYTTNQ-----NIKIILEEKNGIVYFQIQN- 391
+ I+ + M + Q L+ N IK+ Q I + + NG V +++N
Sbjct: 242 FENQINPAIMDVQVPPM--LVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENT 299

Query: 392 -GIAAEGIKEIDKIWEPFYVLESSRSKEKSGTGLGLAIVKSILE-RHGFEYGVSVEDGEI 449
+A + KE TG GL V+ L+ +G E + + + +
Sbjct: 300 GSLALKNTKE--------------------STGTGLQNVRERLQMLYGTEAQIKLSEKQG 339

Query: 450 QF 451
+
Sbjct: 340 KV 341


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
BcerKBAB4_2666  HTHFIS  89     8e-23    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 89.1 bits (221), Expect = 8e-23
Identities = 27/118 (22%), Positives = 54/118 (45%), Gaps = 1/118 (0%)

Query: 2 KVLIADDEQDMLKILKAYFEKEGFEVLLAKDGEEALQIFYDEKIDLAILDWMMPKSSGIT 61
+L+ADD+ + +L + G++V + + + DL + D +MP +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 VCQEIKK-NSSVKVLMLTAKSESEDELAALQTGADEYVKKPFHPGVLITRAKKLVQHE 118
+ IKK + VL+++A++ + A + GA +Y+ KPF LI + +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_2669  SACTRNSFRASE  28     0.008    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 28.4 bits (63), Expect = 0.008
Identities = 24/91 (26%), Positives = 39/91 (42%), Gaps = 7/91 (7%)

Query: 38 TGECLYGIFREDTLIGIGGLNQDPYTKNNKIGRLRRFYIAKDYRRKGLGKLLLGRILSDA 97
G+ + + E+ IG + + N + +AKDYR+KG+G LL + + A
Sbjct: 63 EGKAAFLYYLENNCIGRIKIRSNW----NGYALIEDIAVAKDYRKKGVGTALLHKAIEWA 118

Query: 98 K-IYFTIVVLHTDTEQ--GDKFYTSSGFTKG 125
K +F ++L T FY F G
Sbjct: 119 KENHFCGLMLETQDINISACHFYAKHHFIIG 149


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits         Score  E-value  Comments
BcerKBAB4_2672  STREPKINASE  30     0.020    Streptococcus streptokinase protein signature.
		>STREPKINASE#Streptococcus streptokinase protein signature.

Length = 440

Score = 30.4 bits (68), Expect = 0.020
Identities = 31/130 (23%), Positives = 52/130 (40%), Gaps = 15/130 (11%)

Query: 16 LYPQEQNFTFSIETIERLKIEYKATKDSVILSQLIQAIEKAEYYLYCRAAEDEKHPEN-- 73
+ P +Q FT+ ++ E+ Y+ K S + ++ +E Y + E P +
Sbjct: 260 ILPMDQEFTYRVKNREQ---AYRINKKSGLNEEINNTDLISEKYYVLKKGEKPYDPFDRS 316

Query: 74 -------TLLSVKVNQLKKEVQLLIESSKG---QSVNTNHSSIKLIENELKAWEDMYTQL 123
+ V N+L K QLL S + + + KL+ N L A+ M L
Sbjct: 317 HLKLFTIKYVDVDTNELLKSEQLLTASERNLDFRDLYDPRDKAKLLYNNLDAFGIMDYTL 376

Query: 124 RNKIEVIHDK 133
K+E HD
Sbjct: 377 TGKVEDNHDD 386


96  BcerKBAB4_2685  BcerKBAB4_2694  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_2685  -1       8       -0.888547  EmrB/QacA family drug resistance transporter
BcerKBAB4_2686  -1      11       -1.404902  hypothetical protein
BcerKBAB4_2687  -2      11       -1.731913  hypothetical protein
BcerKBAB4_2688  -1      11       -1.980854  hypothetical protein
BcerKBAB4_2689  -1      12       -1.355061  extracellular solute-binding protein
BcerKBAB4_2690   0      13       -1.270612  major facilitator transporter
BcerKBAB4_2691   2      12       -1.856294  putative lipoprotein
BcerKBAB4_2692   1      12       -2.519240  hypothetical protein
BcerKBAB4_2693   0      13       -2.152949  hypothetical protein
BcerKBAB4_2694   1      14       -1.969440  transglycosylase-associated protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
BcerKBAB4_2685  TCRTETB  146    9e-41    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 146 bits (369), Expect = 9e-41
Identities = 93/406 (22%), Positives = 166/406 (40%), Gaps = 16/406 (3%)

Query: 19 ILMASMDNTIVVTAMGTIVGDLGGLENFV-WVVSAYMVAEMAGMPIFGKLSDMYGRKRFF 77
+ ++ ++ ++ I D WV +A+M+ G ++GKLSD G KR
Sbjct: 23 SFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLL 82

Query: 78 IFGLIVFMIGSALCGTAENITQLGIY-RAIQGIGGGALVPIAFTIVFDIFPPEKRGKMGG 136
+FG+I+ GS + + L I R IQG G A + +V P E RGK G
Sbjct: 83 LFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFG 142

Query: 137 LFGAVFGLSSIFGPLLGAYITDYISWHWVFYINLPLGILALIFITLFYKESRVHREQKID 196
L G++ + GP +G I YI HW + + +P+ + + + + V + D
Sbjct: 143 LIGSIVAMGEGVGPAIGGMIAHYI--HWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFD 200

Query: 197 WFGAITLVGAVVCLMFALELGGQKYDWDSSFILSLFAGFAILVIAFIVIERKVEEPIISF 256
G I + +V M L Y S + + F+ RKV +P +
Sbjct: 201 IKGIILMSVGIVFFM----LFTTSYSI------SFLIVSVLSFLIFVKHIRKVTDPFVDP 250

Query: 257 EMFKQRLFGMSTIIALCYGAAFMSATVYIPLFIQGVYG-GTATNSGLLLLPMMLGSVVTA 315
+ K F + + +P ++ V+ TA +++ P + ++
Sbjct: 251 GLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFG 310

Query: 316 QLGGFLTSKLSYRNIMIISAVIMLIGLFLLSTLTPETSRILLTIYMVIIGFGVGFSFSVL 375
+GG L + ++ I + + FL ++ ET+ +TI +V + G+ F+ +V+
Sbjct: 311 YIGGILVDRRGPLYVLNIGVTFLSVS-FLTASFLLETTSWFMTIIIVFVLGGLSFTKTVI 369

Query: 376 SMAAIHNFGMEQRGSATSTSNFIRSLGMTLGITIFGMIQRTGFQDQ 421
S + ++ G+ S NF L GI I G + DQ
Sbjct: 370 STIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLLDQ 415


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_2688  PYOCINKILLER  31     0.004    Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 30.5 bits (68), Expect = 0.004
Identities = 14/53 (26%), Positives = 20/53 (37%), Gaps = 5/53 (9%)

Query: 12 LELTGISYGQLYRWKRKNLIPEDWFVRKSTFTGQETFFPKEKILERIDKIQTM 64
L+ + G KNL P D R T G +K+L KI ++
Sbjct: 97 LDKADAALGPA-----KNLAPLDVINRSLTIVGNALQQKNQKLLLNQKKITSL 144


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
BcerKBAB4_2690  TCRTETA  71     9e-16    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 71.4 bits (175), Expect = 9e-16
Identities = 61/321 (19%), Positives = 115/321 (35%), Gaps = 15/321 (4%)

Query: 50 LLFGLQPLADIVFTLIAGGVTDKYGRKKIMLLGLLLQAFAVSGFVFAESVAFFALLY--V 107
+L L L + G ++D++GR+ ++L+ L AV + A + + L +
Sbjct: 47 ILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAG--AAVDYAIMATAPFLWVLYIGRI 104

Query: 108 VNGIGRSLYIPAQRAQIADLTKEEQQAEIFAVLHTTGAIGSVIGPLIGAFFYTSHPEYLF 167
V GI + A IAD+T +++A F + G V GP++G P F
Sbjct: 105 VAGITGAT-GAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPF 163

Query: 168 ILQGVALTLYAILVWTQLPETVPLRQSVNKTKEVYSPKQFISKHYAVFGLMVSTLPISFF 227
L + LPE+ + + + S +A +V+ L FF
Sbjct: 164 FAAAALNGLNFLTGCFLLPESHKGERRPLRREA---LNPLASFRWARGMTVVAALMAVFF 220

Query: 228 YAQ-----TESNYRIFAESVFPNFLFILVFISTCKAIMEVMLQIFLV-KWSERFSMPKII 281
Q + + IF E F + I+ + Q + + R + +
Sbjct: 221 IMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRAL 280

Query: 282 VISYTCYTLAAIGYGYSTTIWSLFFTLLFLVIGQSIALNHLLRFVSQIAPSHRRGLYFSI 341
++ I ++T W F ++ L G I + L +S+ R+G
Sbjct: 281 MLGMIADGTGYILLAFATRGWMAFPIMVLLASG-GIGMPALQAMLSRQVDEERQGQLQGS 339

Query: 342 YGIHWDISRTCGPFVGALLLS 362
++ GP + + +
Sbjct: 340 LAALTSLTSIVGPLLFTAIYA 360


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_2691  CHLAMIDIAOM6  28     0.040    Chlamydia cysteine-rich outer membrane protein 6 si...
		>CHLAMIDIAOM6#Chlamydia cysteine-rich outer membrane protein 6 signature.

Length = 547

Score = 27.7 bits (61), Expect = 0.040
Identities = 28/91 (30%), Positives = 41/91 (45%), Gaps = 14/91 (15%)

Query: 1 MMKVYMKVLIFFLTLTCVAALTAC----TNAEEKKQTNSTSENSTEKKSDTSEKSKK--- 53
M K+ + + F +T VA+L A T+ E TN S T+ K +TS KSKK
Sbjct: 1 MNKLIRRAVTIF-AVTSVASLFASGVLETSMAESLSTNVISLADTKAKDNTSHKSKKARK 59

Query: 54 ---NEETAPKKENEPVEKPKGQESVKPSTES 81
E +KE PV + ++ P +S
Sbjct: 60 NHSKETPVDRKEVAPVHE---SKATGPKQDS 87


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_2694  CHANLCOLICIN  34     4e-05    Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 33.5 bits (76), Expect = 4e-05
Identities = 13/49 (26%), Positives = 22/49 (44%), Gaps = 3/49 (6%)

Query: 6 IVGGILGWFASLITGKDVPGGVIG-NIIAGIVGSWLGTALLGKFGPVIG 53
V ++ SL+ G G+ G I+ GI+ S++ L V+G
Sbjct: 475 GVSYVVALLFSLLAG--TTLGIWGIAIVTGILCSYIDKNKLNTINEVLG 521


97  BcerKBAB4_2738  BcerKBAB4_2743  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_2738  -3      11       -2.373608  TetR family transcriptional regulator
BcerKBAB4_2739  -2      11       -2.035749  EmrB/QacA family drug resistance transporter
BcerKBAB4_2740  -2      12       -2.062173  sulfatase
BcerKBAB4_2742   0      10       -1.116766  MMPL domain-containing protein
BcerKBAB4_2743   0      13        0.079456  TetR family transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
BcerKBAB4_2738  HTHTETR  42     3e-07    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 41.9 bits (98), Expect = 3e-07
Identities = 15/77 (19%), Positives = 32/77 (41%), Gaps = 1/77 (1%)

Query: 4 PKKEDPRTVRSREMFKNAVFSLLCENPSISSLTVQKVATKAGLNRTTFYLHYQDIQDLLD 63
+K +R+ + L + +SS ++ ++A AG+ R Y H++D DL
Sbjct: 2 ARKTKQEAQETRQHILDVALRLFSQQG-VSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60

Query: 64 QITNEISNELSNKIADL 80
+I + + +
Sbjct: 61 EIWELSESNIGELELEY 77


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
BcerKBAB4_2739  TCRTETB  136    3e-37    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 136 bits (343), Expect = 3e-37
Identities = 91/413 (22%), Positives = 180/413 (43%), Gaps = 14/413 (3%)

Query: 13 ILIVLISGCFLSTLNQTLLNVAMSNLMEVFDVTAATVQWLSTGFMLINGVLVPITAFLMK 72
ILI L F S LN+ +LNV++ ++ F+ A+ W++T FML + + L
Sbjct: 15 ILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSD 74

Query: 73 RFTTRQLFICSMLSLFIGTVLCACAMN-FGILLTGRMIQAVGAGIIMPLMMTVILYLYPS 131
+ ++L + ++ G+V+ + F +L+ R IQ GA L+M V+ P
Sbjct: 75 QLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPK 134

Query: 132 EKRGSIMGTIGFAIIFAPAIAPTLSGFIIEYVSWRWLFIGFAPFVLIVIILALKYLMNVA 191
E RG G IG + + P + G I Y+ W +L + P + I+ + L L+
Sbjct: 135 ENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLL--IPMITIITVPFLMKLLKKE 192

Query: 192 ETTKAKLDIVSVILSTIGFGCIIFGFSSAGSKGWDHPVVITTIIIGIIVTTLFCLRQIKS 251
K DI +IL ++G + +S I+ +I+ ++ +F K
Sbjct: 193 VRIKGHFDIKGIILMSVGIVFFMLFTTSY---------SISFLIVSVLSFLIFVKHIRKV 243

Query: 252 SDPLLNLSVFKYKIFTLTSVINVLITMIMYADLILLPIYLQNGRGFTAFESG-LLLLPGA 310
+DP ++ + K F + + +I + + ++P +++ + E G +++ PG
Sbjct: 244 TDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGT 303

Query: 311 VINAFLSPITGKMFDKYGAKPLFIIGLICIIISMWGVIDLTESTTYMYLMVRTIILRIGL 370
+ I G + D+ G + IG+ + +S TT ++ + + + GL
Sbjct: 304 MSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLT-ASFLLETTSWFMTIIIVFVLGGL 362

Query: 371 SFISMPLNTAGLNALPRELGSHGSAVNNTVRQLAGAIGTAVIITVYTIQSTSH 423
SF ++T ++L ++ G ++ N L+ G A++ + +I
Sbjct: 363 SFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLLDQ 415


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_2742  ACRIFLAVINRP  48     2e-07    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 48.3 bits (115), Expect = 2e-07
Identities = 39/199 (19%), Positives = 80/199 (40%), Gaps = 20/199 (10%)

Query: 170 FLTSSQEGVKKTEVISIIFILVVLIIV---FRSPVVPIISLLTVGVSYLISMGIIAHLVD 226
F+ S V KT +I+ + +V+ + R+ ++P I+ V V L + I+A
Sbjct: 332 FVQLSIHEVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIA---VPVVLLGTFAILAA--- 385

Query: 227 QFNFPFSNFTQVFVVVVLFGVGTDYNILLYTRFKEELSKQENAYL-ATKETFKSAGKTVL 285
F + + T +F +V+ G+ D I++ + + + + AT+++ ++
Sbjct: 386 -FGYSINTLT-MFGMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALV 443

Query: 286 YSGIAVLIGFASLALAS---FKLYQSTS-AVAIGVAVLLLVLTTLNPFFMVLLGKGMFYP 341
+ + F +A +Y+ S + +A+ +LV L P L K P
Sbjct: 444 GIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLK----P 499

Query: 342 VKTFKGHEDSRLWGFFAKN 360
V +G+F
Sbjct: 500 VSAEHHENKGGFFGWFNTT 518



Score = 44.1 bits (104), Expect = 3e-06
Identities = 33/148 (22%), Positives = 68/148 (45%), Gaps = 6/148 (4%)

Query: 184 ISIIFILVVLIIVFRSPVVPIISLLTVGVSYLISMGIIAHLVDQFNFPFSNFTQVFVVVV 243
IS + + + L ++ S +P+ +L V + I ++A + FN + V ++
Sbjct: 878 ISFVVVFLCLAALYESWSIPVSVMLVVPLG--IVGVLLAATL--FNQKNDVYFMV-GLLT 932

Query: 244 LFGVGTDYNILLYTRFKEELSKQ-ENAYLATKETFKSAGKTVLYSGIAVLIGFASLALAS 302
G+ IL+ K+ + K+ + AT + + +L + +A ++G LA+++
Sbjct: 933 TIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISN 992

Query: 303 FKLYQSTSAVAIGVAVLLLVLTTLNPFF 330
+ +AV IGV ++ T L FF
Sbjct: 993 GAGSGAQNAVGIGVMGGMVSATLLAIFF 1020


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
BcerKBAB4_2743  HTHTETR  76     2e-19    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 76.2 bits (187), Expect = 2e-19
Identities = 39/191 (20%), Positives = 76/191 (39%), Gaps = 18/191 (9%)

Query: 1 MAKNKQED-------IFDAAIKLFAERGYDGTTIPMIAEKANVGAGTIYRYFENKEALVN 53
MA+ +++ I D A++LF+++G T++ IA+ A V G IY +F++K L +
Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60

Query: 54 SLFSKSMLQLSETIKT---DFP--VEANIREQFSHTYNRLF-EFARNNVDAFLFT--NSH 105
++ S + E FP + +RE H E R + +F
Sbjct: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120

Query: 106 CDSYFLDEQSKKIFDDFIGFFMNIIEDGIEKGFLRP-LPIIALIIIVYQPLEKLIK--VM 162
+ + + + + + ++ IE L L II+ + L++ +
Sbjct: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLF 180

Query: 163 ATGQLEYSKEL 173
A + KE
Sbjct: 181 APQSFDLKKEA 191


98  BcerKBAB4_2887  BcerKBAB4_2894  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_2887  -1      13       -0.119336  hypothetical protein
BcerKBAB4_2888  -2      12        0.053279  nucleoside recognition domain-containing
BcerKBAB4_2889  -2      11        0.842273  aminoglycoside phosphotransferase
BcerKBAB4_2890  -3      11        0.041028  hypothetical protein
BcerKBAB4_2891  -3      11        0.381305  hypothetical protein
BcerKBAB4_2892  -3       8        0.254580  hypothetical protein
BcerKBAB4_2893  -3      11       -0.331809  TetR family transcriptional regulator
BcerKBAB4_2894  -2      12       -0.201431  phosphoenolpyruvate synthase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits         Score  E-value  Comments
BcerKBAB4_2887  IGASERPTASE  54     4e-10    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 53.5 bits (128), Expect = 4e-10
Identities = 29/154 (18%), Positives = 63/154 (40%), Gaps = 3/154 (1%)

Query: 57 SKSITPEEKLAMDKKREEQQIAKEQEKKKKAEEKKIQQENEKKEKEENERKQKEAKEKKA 116
SK++ E+ A + + +++AKE + KA Q + E + Q ++ A
Sbjct: 1048 SKTVEKNEQDATETTAQNREVAKEAKSNVKAN---TQTNEVAQSGSETKETQTTETKETA 1104

Query: 117 QEEAEVKVKKEAEEQQKQAELEKKKQEQQEKKAQEEAEAKVKKEAEEQQKLAQIEKEKEQ 176
E E K K E E+ Q+ ++ + +QE+ + +A+ +E + + + + +
Sbjct: 1105 TVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNT 1164

Query: 177 PKTPATPNGTTADVNGGSYSKDAIKTTLNRLNDN 210
P T+ ++ T N + +N
Sbjct: 1165 TADTEQPAKETSSNVEQPVTESTTVNTGNSVVEN 1198



Score = 50.1 bits (119), Expect = 5e-09
Identities = 24/130 (18%), Positives = 57/130 (43%), Gaps = 4/130 (3%)

Query: 64 EKLAMDKKREEQQIAKEQEKKKKAEEKKIQQENEKKEKEENERKQKEAKEKKAQ-EEAEV 122
E +A + K+E + + K ++ + + + E K + + E + ++ +E +
Sbjct: 1038 ETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQT 1097

Query: 123 KVKKEAEEQQKQAELEKKKQEQQEKKAQEEAEAKVKKEAEEQQKLAQIEKEKE---QPKT 179
KE +K+ + + + ++ QE + ++++E Q A+ +E + K
Sbjct: 1098 TETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKE 1157

Query: 180 PATPNGTTAD 189
P + TTAD
Sbjct: 1158 PQSQTNTTAD 1167



Score = 48.1 bits (114), Expect = 2e-08
Identities = 31/120 (25%), Positives = 58/120 (48%), Gaps = 15/120 (12%)

Query: 81 QEKKKKAEEKKIQQENEKKEKEENERKQKEAKEKKAQEEAEVKVK---KEAEEQQKQAEL 137
+ + AE K QE++ EK E + + A+ ++ +EA+ VK + E Q +E
Sbjct: 1035 ETTETVAENSK--QESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSET 1092

Query: 138 EKKKQEQQEKKAQEEAEAKVKKEAEEQQKLAQIEKE----KEQPKT------PATPNGTT 187
++ + + ++ A E E K K E E+ Q++ ++ + +EQ +T PA N T
Sbjct: 1093 KETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPT 1152



Score = 38.5 bits (89), Expect = 3e-05
Identities = 23/123 (18%), Positives = 44/123 (35%), Gaps = 9/123 (7%)

Query: 67 AMDKKREEQQIAKEQEKKKKAEEKKIQQENEKKEKEENERKQKEAKEKKAQEEAEVKVKK 126
+ + E + EK++KA+ E ++ +E + + K+ Q E V+ +
Sbjct: 1092 TKETQTTETKETATVEKEEKAKV-------ETEKTQEVPKVTSQVSPKQEQSE-TVQPQA 1143

Query: 127 EAEEQQKQAELEKKKQEQQEKKAQEEAEAKVKKEAEEQQKLAQIEKEKEQPKTPATPNGT 186
E + K+ Q Q A E AK + + +Q + + P T
Sbjct: 1144 EPARENDPTVNIKEPQSQTNTTADTEQPAK-ETSSNVEQPVTESTTVNTGNSVVENPENT 1202

Query: 187 TAD 189
T
Sbjct: 1203 TPA 1205



Score = 37.0 bits (85), Expect = 8e-05
Identities = 30/165 (18%), Positives = 55/165 (33%), Gaps = 11/165 (6%)

Query: 30 ATRTRRRVLLITVPTILVSFIFASYFASKSITPEEKLAMDK---KREEQQIAKEQEKKKK 86
T R V + + S S T E + K E+++ AK + +K +
Sbjct: 1061 TTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQ 1120

Query: 87 AEEKKIQQENEKKEKEENERKQKEAKEKKAQEEAEVKVKKEAEEQQKQAELEKKKQEQQE 146
K Q + K+E+ E + Q E + + V +K+ + A+ EQ
Sbjct: 1121 EVPKVTSQVSPKQEQSETVQPQAEP---ARENDPTVNIKEPQSQTNTTAD-----TEQPA 1172

Query: 147 KKAQEEAEAKVKKEAEEQQKLAQIEKEKEQPKTPATPNGTTADVN 191
K+ E V + + +E + P + N
Sbjct: 1173 KETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSN 1217


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_2892  56KDTSANTIGN  31     0.006    Rickettsia 56kDa type-specific antigen protein sign...
		>56KDTSANTIGN#Rickettsia 56kDa type-specific antigen protein signature.

Length = 533

Score = 31.5 bits (71), Expect = 0.006
Identities = 11/24 (45%), Positives = 17/24 (70%)

Query: 31 PKPKNLPIAIVNEDQGVEIPNQPK 54
P+P PI+I + D G++IPN P+
Sbjct: 137 PQPTMSPISIADRDFGIDIPNIPQ 160


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
BcerKBAB4_2893  HTHTETR  64     8e-15    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 63.9 bits (155), Expect = 8e-15
Identities = 39/195 (20%), Positives = 72/195 (36%), Gaps = 14/195 (7%)

Query: 12 RTKTAIRNALVELIEEKGFDAITVKDITTKANINRGTFYTHYQDKFDLMTKCQEEIMYEF 71
T+ I + + L ++G + ++ +I A + RG Y H++DK DL EI
Sbjct: 11 ETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLF----SEIWELS 66

Query: 72 SSIAKQRLPEVIADLGSSPSPTMPFILIASI-LEFLNENSDFMKAVLSPK----GDLSFQ 126
S + E A P + ILI + E + ++ K G+++
Sbjct: 67 ESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVV 126

Query: 127 TKLKD---FMWKTLFEDTNGPLINKENLL--VPSQYLASYMASAHIGVIQQWLNNGQKET 181
+ + E T I + L + ++ A M G+++ WL Q
Sbjct: 127 QQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFD 186

Query: 182 PEEIARILSTIAVHG 196
++ AR I +
Sbjct: 187 LKKEARDYVAILLEM 201


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_2894  PHPHTRNFRASE  65     6e-13    Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 65.2 bits (159), Expect = 6e-13
Identities = 26/76 (34%), Positives = 40/76 (52%), Gaps = 1/76 (1%)

Query: 793 EEGDILVTAFTDPGWTPLFVS-IKGLVTEVGGLMTHGAVIAREYGLPAVVGVENATKLIK 851
EE I+ T L +KG T++GG +H A+++R +PAVVG + T+ I+
Sbjct: 155 EETVIIAEDLTPSDTAQLNKQFVKGFATDIGGRTSHSAIMSRSLEIPAVVGTKEVTEKIQ 214

Query: 852 DGQRIRVHGTEGYIEV 867
G + V G EG + V
Sbjct: 215 HGDMVIVDGIEGIVIV 230


99  BcerKBAB4_2910  BcerKBAB4_2914  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_2910  2       16       -2.433341  histidine kinase
BcerKBAB4_2911  1       17       -1.116344  two component transcriptional regulator
BcerKBAB4_2913  0       16       -1.012493  hypothetical protein
BcerKBAB4_2914  0       12       -0.061265  1,4-dihydroxy-2-naphthoate
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_2910  PF06580       33     0.004    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 32.5 bits (74), Expect = 0.004
Identities = 22/110 (20%), Positives = 45/110 (40%), Gaps = 21/110 (19%)

Query: 469 QLHVHKQLSKIEVVANPHRIEQVVTNFITNAIRYTPEHEDIIISTIEENKRVKVCVENKG 528
+ ++ + ++V P ++ +V N I + I P+ I++ ++N V + VEN G
Sbjct: 243 ENQINPAIMDVQVP--PMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTG 300

Query: 529 AHIAPEHVEKIWDRFYRGDTSRQRSKGGTGLGLA-ISKNILELHGAEYGV 577
+ +K TG GL + + + L+G E +
Sbjct: 301 SLALKN------------------TKESTGTGLQNVRERLQMLYGTEAQI 332


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_2911  HTHFIS        90     4e-23    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 90.3 bits (224), Expect = 4e-23
Identities = 27/132 (20%), Positives = 61/132 (46%), Gaps = 2/132 (1%)

Query: 1 MQR-TILIVEDEDILREIMKDYLLNEGYNVLEAIDGKEALSIFEEHEVHLIILDIMLPEL 59
M TIL+ +D+ +R ++ L GY+V + + L++ D+++P+
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 60 DGWAVCRRIRK-KSNVPIIMLTARVDEDDTLLGFEMGADDYVTKPYSPPILLARAKRLIE 118
+ + + RI+K + ++P+++++A+ + E GA DY+ KP+ L+ R +
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 119 SRYSSTINVSTA 130
+
Sbjct: 121 EPKRRPSKLEDD 132


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_2913  FERRIBNDNGPP  28     0.008    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 27.6 bits (61), Expect = 0.008
Identities = 15/48 (31%), Positives = 24/48 (50%), Gaps = 6/48 (12%)

Query: 47 AIEYFQKEKQLDVVITKVGVVGEIGYKVWIEGHELNNEQQKIDAIIDV 94
A+E+ E L + I GV I Y++W+ +E D++IDV
Sbjct: 40 ALEWLPVELLLALGIVPYGVADTINYRLWV------SEPPLPDSVIDV 81


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_2914  ACRIFLAVINRP  29     0.033    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 28.7 bits (64), Expect = 0.033
Identities = 22/86 (25%), Positives = 40/86 (46%), Gaps = 5/86 (5%)

Query: 156 LMGTCFVLIAFFIQTNTITIESVLISIPIGILV-GAINMSNNIRDIEEDIKGGRKTLVIL 214
L+GT +L AF NT+T+ + + IG+LV AI + N+ + + K K
Sbjct: 376 LLGTFAILAAFGYSINTLTM--FGMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEK 433

Query: 215 LGRE--KAVVTLAVAFFIAYLWIAVI 238
+ A+V +A+ ++ +A
Sbjct: 434 SMSQIQGALVGIAMVLSAVFIPMAFF 459
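
The statistics lines of each hit (Score, Expect, Identities) can be read programmatically. The following is a minimal Python sketch, not part of PredictBias itself: the function name `parse_hit` and the regular expressions are illustrative assumptions, and the 0.05 cutoff is taken from the "E-value (RPSBlast)" input parameter shown at the top of this page. The percent identity is truncated rather than rounded, which matches the percentages printed in the report (for example, 22/86 is shown as 25%).

```python
import re

# Assumed cutoff: the "E-value (RPSBlast)" input parameter listed on this page.
E_VALUE_CUTOFF = 0.05

# Hypothetical patterns for the two statistics lines printed above each alignment.
STATS_RE = re.compile(r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)")
IDENT_RE = re.compile(r"Identities\s*=\s*(\d+)/(\d+)")


def parse_hit(score_line, identities_line):
    """Return bit score, E-value and truncated percent identity for one alignment."""
    stats = STATS_RE.search(score_line)
    ident = IDENT_RE.search(identities_line)
    if stats is None or ident is None:
        raise ValueError("not a BLAST statistics line")
    matches, length = int(ident.group(1)), int(ident.group(2))
    return {
        "bits": float(stats.group(1)),
        "evalue": float(stats.group(3)),
        # Integer division truncates, as the report's percentages appear to do.
        "identity_pct": 100 * matches // length,
    }


# Statistics lines copied from the BcerKBAB4_2914 / ACRIFLAVINRP hit above.
hit = parse_hit(
    "Score = 28.7 bits (64), Expect = 0.033",
    "Identities = 22/86 (25%), Positives = 40/86 (46%), Gaps = 5/86 (5%)",
)
print(hit, hit["evalue"] < E_VALUE_CUTOFF)
```

A hit passes the assumed cutoff when its E-value is below 0.05, which holds for every hit listed on this page.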


100  BcerKBAB4_3013  BcerKBAB4_3018  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_3013  -2      21       3.360167   major facilitator transporter
BcerKBAB4_3014  -3      24       4.080278   6-aminohexanoate-dimer hydrolase
BcerKBAB4_3015  -3      25       4.477199   hypothetical protein
BcerKBAB4_3016  -3      24       4.201294   MerR family transcriptional regulator
BcerKBAB4_3017  -3      27       5.874264   beta-lactamase
BcerKBAB4_3018  0       20       2.528576   serine-type D-Ala-D-Ala carboxypeptidase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_3013  TCRTETA       75     5e-17    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 75.2 bits (185), Expect = 5e-17
Identities = 68/358 (18%), Positives = 124/358 (34%), Gaps = 21/358 (5%)

Query: 16 FSSLFL-FLTFYMLMTTLPVYVIDSLKGK--PEEIGLVATVFLISSVLCRPFTGKWLDDL 72
S++ L + ++M LP + D + G++ ++ + C P G D
Sbjct: 11 LSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRF 70

Query: 73 GRKKILFISLSLFLAATVMYFGTQSLFLLLALRFLHGIGFGMATTATGTIVTDVAPAHRR 132
GR+ +L +SL+ + L++L R + GI G G + D+ R
Sbjct: 71 GRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGIT-GATGAVAGAYIADITDGDER 129

Query: 133 GEALAYYGVFMSLPMVIGPFLGLTIISHFSFTVLFIVCSVFSLLAFLLG-LLVNIPHEAP 191
+ MV GP LG ++ FS F + + L FL G L+ H+
Sbjct: 130 ARHFGFMSACFGFGMVAGPVLG-GLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKGE 188

Query: 192 VNKQKRE------KMKWKELIEPSSIPIALTGFVLAFSYSGILSFIPIYAKELGLSEIA- 244
+RE +W + + +A+ + ++
Sbjct: 189 RRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTI 248

Query: 245 ----SYFFILYALVVVISRPFTGKIFDRFGENVLIYPAIIIFTIGMFILSQAQTSFWFLG 300
+ F IL++L + TG + R GE + +I G +IL T W
Sbjct: 249 GISLAAFGILHSLAQAM---ITGPVAARLGERRALMLGMIADGTG-YILLAFATRGWMAF 304

Query: 301 AGMLIGLGYGTLIPSFQTIAISAAPNHRRGSATATYYSFFDSGIGFGSFILGIVAAKS 358
M++ G +P+ Q + R+G + + G + + A S
Sbjct: 305 PIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAAS 362


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_3016  THERMOLYSIN   31     0.006    Thermolysin metalloprotease (M4) family signature.
		>THERMOLYSIN#Thermolysin metalloprotease (M4) family signature.

Length = 544

Score = 30.8 bits (69), Expect = 0.006
Identities = 11/63 (17%), Positives = 24/63 (38%), Gaps = 12/63 (19%)

Query: 141 SVKSFFDKFRSIFKQGEFFQEQFITMCPIKNFFNDDLGLSVY--------YPVLNDTEME 192
V + D+ ++ F+ G +E+ + D+LG +V + +
Sbjct: 54 LVYRYLDQEKNTFQLGGQARERL----SLIGNKLDELGHTVMRFEQAIAASLCMGAVLVA 109

Query: 193 HVD 195
HV+
Sbjct: 110 HVN 112


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_3017  BLACTAMASEA   30     0.038    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 29.8 bits (67), Expect = 0.038
Identities = 11/41 (26%), Positives = 17/41 (41%), Gaps = 1/41 (2%)

Query: 92 NKDTLYGIGSVSKMYATAAVMKLVDEGKVDLDAPVVHYVPD 132
D + + S K+ AV+ VD G L+ +HY
Sbjct: 57 RADERFPMMSTFKVVLCGAVLARVDAGDEQLER-KIHYRQQ 96


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_3018  BLACTAMASEA   31     0.010    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 30.5 bits (69), Expect = 0.010
Identities = 12/56 (21%), Positives = 18/56 (32%)

Query: 75 GGKTWSYAAGVADLSSKQPMKTDFRFRIGSVTKTFTATVVLQLVGENRLNLDDYIE 130
G+ +A + + D RF + S K VL V L+ I
Sbjct: 37 SGRVGMIEMDLASGRTLTAWRADERFPMMSTFKVVLCGAVLARVDAGDEQLERKIH 92


101  BcerKBAB4_3041  BcerKBAB4_3051  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_3041  -1      12       -0.301591  ABC transporter
BcerKBAB4_3042  -2      10       0.351120   IS605 family transposase OrfB
BcerKBAB4_3044  -3      10       0.599222   cobalt transport protein
BcerKBAB4_3045  -2      9        1.386103   hypothetical protein
BcerKBAB4_3046  -3      8        0.847222   cell wall anchor domain-containing protein
BcerKBAB4_3047  -2      12       0.928046   ribonuclease activity regulator protein RraA
BcerKBAB4_3048  -2      13       0.886324   C4-dicarboxylate anaerobic carrier
BcerKBAB4_3049  0       14       0.361513   endonuclease I
BcerKBAB4_3050  0       16       0.604853   purine phosphorylase family 1
BcerKBAB4_3051  0       18       0.503750   XRE family transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_3041  PF05272       31     0.017    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.8 bits (69), Expect = 0.017
Identities = 11/20 (55%), Positives = 12/20 (60%)

Query: 327 LVGKNGTGKSTLLTILAGLQ 346
L G G GKSTL+ L GL
Sbjct: 601 LEGTGGIGKSTLINTLVGLD 620


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_3045  IGASERPTASE   63     3e-13    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 62.8 bits (152), Expect = 3e-13
Identities = 33/196 (16%), Positives = 63/196 (32%), Gaps = 14/196 (7%)

Query: 28 EKKAEPKQEEQKEVAKQEEQKDVPKPEEKQAEPEQKP--------------QEEQKDKQE 73
E AE ++E K V K E+ + ++ E K E K+ Q
Sbjct: 1038 ETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQT 1097

Query: 74 QKVEEKPKEEKAQEQPEVKKEEKPKEQPAANKQPEQQKVQEEKPQQPKAPEVKEEAKVVN 133
+ +E EK ++ ++ + + + P+Q++ + +PQ A E +
Sbjct: 1098 TETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKE 1157

Query: 134 QPKETPKQEQKEDPNKEEKTIPTPVVPTPEPPKPVPVPKPQSKQVTISVKGNEGYLLGAK 193
+T E P KE + V + T + +
Sbjct: 1158 PQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSN 1217

Query: 194 KVDVQEGDTVYSVLQS 209
K + +V SV +
Sbjct: 1218 KPKNRHRRSVRSVPHN 1233



Score = 52.0 bits (124), Expect = 1e-09
Identities = 40/175 (22%), Positives = 69/175 (39%), Gaps = 13/175 (7%)

Query: 22 ETAVKPEKKAEPKQEEQKEVAKQEEQKDVPKPEEKQAEPEQKPQEEQKDKQE--QKVEEK 79
T + +E+A+ +E P +E + E K + + +K E+
Sbjct: 998 TTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQD 1057

Query: 80 PKEEKAQEQPEVKKEEKPKEQPAANKQPE----QQKVQEEKPQQPKAP---EVKEEAKV- 131
E AQ EV KE K A + E + +E + + K E +E+AKV
Sbjct: 1058 ATETTAQ-NREVAKEAKS-NVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVE 1115

Query: 132 VNQPKETPKQEQKEDPNKEE-KTIPTPVVPTPEPPKPVPVPKPQSKQVTISVKGN 185
+ +E PK + P +E+ +T+ P E V + +PQS+ T +
Sbjct: 1116 TEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQ 1170



Score = 51.6 bits (123), Expect = 2e-09
Identities = 30/180 (16%), Positives = 55/180 (30%), Gaps = 28/180 (15%)

Query: 24 AVKPEKKAEPKQEEQKEVAKQEEQKDVPKPEEKQAEPEQKPQEEQKDKQEQKVEE----- 78
K KA + E + + ++ + E + + +E+ K + E+ E
Sbjct: 1071 EAKSNVKANTQTNEVAQSGSETKET---QTTETKETATVEKEEKAKVETEKTQEVPKVTS 1127

Query: 79 --KPKEEKA---QEQPEVKKEEKPKE-----QPAANKQPEQQKVQEEKPQQPKAPEVKEE 128
PK+E++ Q Q E +E P Q N + ++ +E + P +
Sbjct: 1128 QVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTEST 1187

Query: 129 -AKVVNQPKETPKQEQKED---------PNKEEKTIPTPVVPTPEPPKPVPVPKPQSKQV 178
N E P+ NK + V P +P V
Sbjct: 1188 TVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTV 1247



Score = 47.8 bits (113), Expect = 2e-08
Identities = 32/151 (21%), Positives = 52/151 (34%), Gaps = 19/151 (12%)

Query: 25 VKPEKKAEPKQEEQKEVAKQEEQKDVPKPEEKQAEPEQKPQEEQKDKQEQKVEEKPKEEK 84
V P + + +++ + +E P P + + V E K+E
Sbjct: 992 VDTTNITTPNNIQADVPSVPSNNEEIARVDEA---PVPPPAPATPSETTETVAENSKQES 1048

Query: 85 AQEQPEVKKEEKPKEQPAANKQPEQQKVQEEKPQQPKAPEVKEEAKVVNQPKETPKQEQK 144
+ K E+ E A N++ +E K + E A+ ++ KET E K
Sbjct: 1049 KTVE---KNEQDATETTAQNRE----VAKEAKSNVKANTQTNEVAQSGSETKETQTTETK 1101

Query: 145 EDPNKE---------EKTIPTPVVPTPEPPK 166
E E EKT P V + PK
Sbjct: 1102 ETATVEKEEKAKVETEKTQEVPKVTSQVSPK 1132



Score = 46.6 bits (110), Expect = 5e-08
Identities = 28/131 (21%), Positives = 43/131 (32%), Gaps = 6/131 (4%)

Query: 21 EETAVKPEKKAE--PKQEEQKEVAKQEEQKDVPKP----EEKQAEPEQKPQEEQKDKQEQ 74
E+T P+ ++ PKQE+ + V Q E P +E Q++ EQ K+
Sbjct: 1117 EKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETS 1176

Query: 75 KVEEKPKEEKAQEQPEVKKEEKPKEQPAANKQPEQQKVQEEKPQQPKAPEVKEEAKVVNQ 134
E+P E E P+ A QP KP+ V+ V
Sbjct: 1177 SNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEP 1236

Query: 135 PKETPKQEQKE 145
+
Sbjct: 1237 ATTSSNDRSTV 1247


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_3046  TONBPROTEIN   43     1e-06    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 43.4 bits (102), Expect = 1e-06
Identities = 11/60 (18%), Positives = 16/60 (26%)

Query: 154 EEPKPEEPKTEKPDGKPEEPKTEKPDGKPEEPKTEKPDGKPEEPKTEKPDGKPDGKPEDK 213
EP+ E + KP+ KP + + K D KP
Sbjct: 64 PPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPVESRPAS 123



Score = 42.3 bits (99), Expect = 3e-06
Identities = 28/97 (28%), Positives = 35/97 (36%), Gaps = 11/97 (11%)

Query: 153 VEEPKPEEPKT---------EKPDGKPEEPKTEKPDGKPEEPKTEKPDGKPEEPKTEKPD 203
+E P P +P + E P P+ EP E P P + KP
Sbjct: 36 IELPAPAQPISVTMVTPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPK 95

Query: 204 GKPDGKPEDKVTEQPKEE--KVEIPAAQLNEAISKTS 238
KP KP KV EQPK + VE A E +
Sbjct: 96 PKPKPKPVKKVQEQPKRDVKPVESRPASPFENTAPAR 132


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_3047  CHLAMIDIAOM6  28     0.020    Chlamydia cysteine-rich outer membrane protein 6 si...
		>CHLAMIDIAOM6#Chlamydia cysteine-rich outer membrane protein 6 signature.

Length = 547

Score = 27.7 bits (61), Expect = 0.020
Identities = 17/46 (36%), Positives = 25/46 (54%), Gaps = 4/46 (8%)

Query: 12 EKELQICRQSFRSFGKKKQFYGKIATVKVKDD-NVLVKEGLQTLPE 56
KE+ +S + K+ +G++ TVKV DD NV E Q +PE
Sbjct: 69 RKEVAPVHESKATGPKQDSCFGRMYTVKVNDDRNV---EITQAVPE 111


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_3051  PF05704       29     0.012    Capsular polysaccharide synthesis protein
		>PF05704#Capsular polysaccharide synthesis protein

Length = 307

Score = 29.5 bits (66), Expect = 0.012
Identities = 13/74 (17%), Positives = 28/74 (37%), Gaps = 4/74 (5%)

Query: 8 KIIADKRKEKGITQEELAAYIGITKASVSKWETGQ----SYPDITFVPLLASYFNISIDE 63
K + K I ++ I + +W+ G+ + DI + LL Y + ID
Sbjct: 94 KKNSGDFKVIIIDGNNYKEWVDIPDFLIKRWQEGKMLDAWFSDILRLFLLCKYGGLWIDA 153

Query: 64 LICYTLQMEQEDIK 77
+ ++ ++
Sbjct: 154 TVYMFDKVPNYIVE 167


102  BcerKBAB4_3249  BcerKBAB4_3253  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_3249  1       15       1.513087   hypothetical protein
BcerKBAB4_3250  -1      12       -0.756211  short chain dehydrogenase
BcerKBAB4_3251  1       15       -0.862216  N-acetyltransferase GCN5
BcerKBAB4_3252  0       10       0.819436   argininosuccinate lyase
BcerKBAB4_3253  0       10       1.549092   D-alanyl-D-alanine carboxypeptidase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_3249  CHLAMIDIAOM6  47     1e-06    Chlamydia cysteine-rich outer membrane protein 6 si...
		>CHLAMIDIAOM6#Chlamydia cysteine-rich outer membrane protein 6 signature.

Length = 547

Score = 46.6 bits (110), Expect = 1e-06
Identities = 40/165 (24%), Positives = 66/165 (40%), Gaps = 38/165 (23%)

Query: 623 YTVTIENTGNVLATNVIFQDPTPIGTTFIPNSVTVDGVSQPGANPATGFTVANISPGGSR 682
Y + I N G A NV+ ++P P DG + FT+ ++ PG R
Sbjct: 229 YKINIVNQGTATARNVVVENPVP------------DGYAHSSGQRVLTFTLGDMQPGEHR 276

Query: 683 TVTFQV------RVTSTPSGGTIANRGNVAANFVVIPNQPPVTINRQTNTVVTQVNTGGL 736
T+T + R T+ + N A+ + N+P V QV+ G
Sbjct: 277 TITVEFCPLKRGRATNIATVSYCGGHKN-TASVTTVINEPCV-----------QVSIAGA 324

Query: 737 NVIKEVNTAQAAVGDTLTYTIAVQNTGNVPLTNVFFQDTISSAVS 781
+ + V + Y I+V N G++ L +V +DT+S V+
Sbjct: 325 D--------WSYVCKPVEYVISVSNPGDLVLRDVVVEDTLSPGVT 361



Score = 37.4 bits (86), Expect = 7e-04
Identities = 36/160 (22%), Positives = 66/160 (41%), Gaps = 28/160 (17%)

Query: 359 YTITVPNTGTGSAENVVLQDSIPNGTTFIAGSVTVGGVTQPSANPASGINLGTIPNNAQR 418
Y I + N GT +A NVV+++ +P+ G S LG + R
Sbjct: 229 YKINIVNQGTATARNVVVENPVPD------------GYAHSSGQRVLTFTLGDMQPGEHR 276

Query: 419 VVTFQVRVMSFPSPNPISNRAMVSYQFRPFVGSPPITSTASSNTVQTTVNRANVS-LQKS 477
+T + + +N A VSY + +T+ + VQ ++ A+ S + K
Sbjct: 277 TITVEFCPL---KRGRATNIATVSY-CGGHKNTASVTTVINEPCVQVSIAGADWSYVCKP 332

Query: 478 VDLQTATLNDILTYTVNVTNNGNVAANNVIFVDSIPAGTT 517
V+ Y ++V+N G++ +V+ D++ G T
Sbjct: 333 VE-----------YVISVSNPGDLVLRDVVVEDTLSPGVT 361



Score = 32.4 bits (73), Expect = 0.028
Identities = 63/339 (18%), Positives = 118/339 (34%), Gaps = 58/339 (17%)

Query: 481 QTATLNDILTYTVNVTNNGNVAANNVIFVDSIPAGTTFVTNSVTVNGVARPGANPASSIN 540
+ A L + Y +N+ N G A NV+ + +P +G A +
Sbjct: 219 ENACLRCPVVYKINIVNQGTATARNVVVENPVP------------DGYAHSSGQRVLTFT 266

Query: 541 LGSINASQTTVVRFQVRVTSNPLVNPIPNRASATFNFIPVPGQQPVSGQATSNTVFTTIN 600
LG + + + V P + N V G + +V T IN
Sbjct: 267 LGDMQPGEHRTI----------TVEFCPLKRGRATNIATV---SYCGGHKNTASVTTVIN 313

Query: 601 IADIRTRKTVDRAFATINDVLTYTVTIENTGNVLATNVIFQDPTPIGTTFIPNSVTVDGV 660
++ ++ + + Y +++ N G+++ +V+ +D G T +
Sbjct: 314 EPCVQV-SIAGADWSYVCKPVEYVISVSNPGDLVLRDVVVEDTLSPGVTVL--------- 363

Query: 661 SQPGANPATG---FTVANISPGGSRTVTFQVRVTSTPSGGTIANRGNVAANFVVIPNQPP 717
GA + +TV ++PG ++ ++V + + G N V + +
Sbjct: 364 EAAGAQISCNKVVWTVKELNPG--ESLQYKV-LVRAQTPGQFTNNVVVKSC----SDCGT 416

Query: 718 VTINRQTNTVVTQVNTGGLNVIKEVNTAQAAVGDTLTYTIAVQNTGNVPLTNVFFQDTIS 777
T + T V + V+ + VG+ Y I V N G+ TNV S
Sbjct: 417 CTSCAEATTYWKGVAATHMCVVDTCDP--VCVGENTVYRICVTNRGSAEDTNVSLMLKFS 474

Query: 778 SAVSFVA-----------NTVTINGVPQSGLNPNTGFSL 805
+ V+ NTV + +P+ G FS+
Sbjct: 475 KELQPVSFSGPTKGTITGNTVVFDSLPRLGSKETVEFSV 513



Score = 31.6 bits (71), Expect = 0.046
Identities = 32/161 (19%), Positives = 59/161 (36%), Gaps = 26/161 (16%)

Query: 885 LVYTIEVINAGSVPATNVFFQDSIPQGTLFIENSVFVNGVLQEGADPELGFPLNNLPTGA 944
+VY I ++N G+ A NV ++ +P +G L F L ++ G
Sbjct: 227 VVYKINIVNQGTATARNVVVENPVP------------DGYAHSSGQRVLTFTLGDMQPGE 274

Query: 945 SVIVTFEVLIDEIPQGNNVVNNANVTGDFLVNPTEPPITVTVPSNTVMTVVNSSGLNVMK 1004
+T E + + N+ + G + +V TV+N + V
Sbjct: 275 HRTITVEFCPLKRGRATNIATVSYCGGH-------------KNTASVTTVINEPCVQVSI 321

Query: 1005 SVSATEAGVGDTLTYTVRIQNSGTVAATNVSFLDPIPSGTT 1045
+ A + V + Y + + N G + +V D + G T
Sbjct: 322 A-GADWSYVCKPVEYVISVSNPGDLVLRDVVVEDTLSPGVT 361


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_3250  DHBDHDRGNASE  70     2e-16    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 70.1 bits (171), Expect = 2e-16
Identities = 56/241 (23%), Positives = 100/241 (41%), Gaps = 18/241 (7%)

Query: 2 RYVIVTGTSQGLGEAIATQLLEENTSIISISRRENKELAKLAEQYNSNCIFHS----IDL 57
+ +TG +QG+GEA+A L + I ++ N E + H+ D+
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDY--NPEKLEKVVSSLKAEARHAEAFPADV 66

Query: 58 QDVHNLETHFNEVFSSIQKDNVSSIHLINNAGTVAPMKPIEKSESEQFITNVHINLIAPM 117
+D ++ E+ + I+++ L+N AG V I E++ +N
Sbjct: 67 RDSAAID----EITARIEREMGPIDILVNVAG-VLRPGLIHSLSDEEWEATFSVNSTGVF 121

Query: 118 ILTSTFMKQTKDWKVDKRIINISSGAGKNPYFGWGAYCTTKAGVNMFTQCVATEEVEKEY 177
+ + K D + I+ + S P AY ++KA MFT+C+ E EY
Sbjct: 122 NASRSVSKYMMD-RRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELA--EY 178

Query: 178 PVKTVAFAPGVVDTNMQAQI--RDTNKEDFI--NLDRFTALKEEGKLLSPEYVAKAIRNL 233
++ +PG +T+MQ + + E I +L+ F KL P +A A+ L
Sbjct: 179 NIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFL 238

Query: 234 L 234
+
Sbjct: 239 V 239


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_3251  56KDTSANTIGN  29     0.019    Rickettsia 56kDa type-specific antigen protein sign...
		>56KDTSANTIGN#Rickettsia 56kDa type-specific antigen protein signature.

Length = 533

Score = 29.2 bits (65), Expect = 0.019
Identities = 22/73 (30%), Positives = 33/73 (45%), Gaps = 14/73 (19%)

Query: 206 NNITVNFVYTPKEARKKG---------YASSCVAALSQRMLDEGYKTTTLYTDLANPTSN 256
N I +NFV P+ +++G A VAA + R+L+ + LY DL
Sbjct: 328 NQIHLNFVMPPQAQQQQGQGQQQQAQATAQEAVAAAAVRLLNGSDQIAQLYKDLV----- 382

Query: 257 KIYQEIGYEKMME 269
K+ + G K ME
Sbjct: 383 KLQRHAGIRKAME 395


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_3253  BLACTAMASEA   35     4e-04    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 34.8 bits (80), Expect = 4e-04
Identities = 21/163 (12%), Positives = 49/163 (30%), Gaps = 17/163 (10%)

Query: 13 IFVLLISGNFLVKKVWSSNNDDAQYIASFIE-EHKDEKNSALLIKRNDKVVYSVNPDVVL 71
I + +IS + + E + + + + + + D
Sbjct: 4 IRLCIIS-LLATLPLAVHASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRADERF 62

Query: 72 PVASTMKLIVALEYTKQVTEGKIDPSSFVSINDVNRYYVPGTDGGAQDRWQNYLQKKEKI 131
P+ ST K+++ +V G + Q +Y EK
Sbjct: 63 PMMSTFKVVLCGAVLARVDAGDEQLERKIHYR--------------QQDLVDYSPVSEKH 108

Query: 132 TEGAVSLEEVAKGMVKFSSNANTEYLMEVL-GLDNINRNLQSL 173
+++ E+ + S N+ L+ + G + L+ +
Sbjct: 109 LADGMTVGELCAAAITMSDNSAANLLLATVGGPAGLTAFLRQI 151


103  BcerKBAB4_3818  BcerKBAB4_3825  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_3818  -1      12       -2.103296  diguanylate phosphodiesterase
BcerKBAB4_3819  -2      11       -1.321920  short chain dehydrogenase
BcerKBAB4_3820  -1      12       -2.407746  metallophosphoesterase
BcerKBAB4_3821  -1      16       -2.938542  hypothetical protein
BcerKBAB4_3822  -1      16       -3.851867  polyphosphate kinase
BcerKBAB4_3823  0       15       -4.591626  Ppx/GppA phosphatase
BcerKBAB4_3824  -1      16       -2.089717  hypothetical protein
BcerKBAB4_3825  -1      17       -1.961681  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_3818  FbpA_PF05833  34     0.001    Fibronectin-binding protein
		>FbpA_PF05833#Fibronectin-binding protein

Length = 577

Score = 34.1 bits (78), Expect = 0.001
Identities = 18/151 (11%), Positives = 43/151 (28%), Gaps = 1/151 (0%)

Query: 133 EQFNHLLMYYRTYGIQISINKVGTGTSN-LERISVLAPDILKVDLTNLRQTALLQSYQDI 191
+ + +K+ TG S L +DL+ +++ +D+
Sbjct: 179 DMIENFTKENSLQLNDNIFSKIFTGVSKTLSSEICFRLKNNSIDLSLSNLKEIVEVCKDL 238

Query: 192 LYSLSLLARRIGATLLYEEIDAFYQLQYAWKNGGRYYQGNYLKECLPDFIETNVLKERLG 251
+ FY L K + Q + + L +F +RL
Sbjct: 239 FKEIQSNKFEFNCYTKNNSFVGFYCLNLMSKEDYKKIQYDSSSKLLENFYYAKDKSDRLK 298

Query: 252 NECHQFILHEKKKLQKIYNLTEMLRDRIGDV 282
++ + + ++L + +
Sbjct: 299 SKSSDLQKIVMNNINRCTKKDKILNNTLKKC 329


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_3819  DHBDHDRGNASE  99     4e-27    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 99.4 bits (247), Expect = 4e-27
Identities = 68/257 (26%), Positives = 122/257 (47%), Gaps = 11/257 (4%)

Query: 11 VKEKVVIITGGSSGMGKGMAIRFAKEGARVVITGRTKEKLEEAKLEIEQFPGQVLSVQMD 70
++ K+ ITG + G+G+ +A A +GA + EKLE+ ++ + D
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 71 VRNTDDIQKMIEHIDEKFGRIDILINNAAGNFICPAEDLSVNGWNSVINIVLNGTFYCSQ 130
VR++ I ++ I+ + G IDIL+N A LS W + ++ G F S+
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 131 AIGKYWIEKGIKGNIINMVATYAWDAGPGVIHSAAAKAGVLAMTKTLAVEWGRKYGIRVN 190
++ KY +++ G+I+ + + A + A++KA + TK L +E +Y IR N
Sbjct: 126 SVSKYMMDRR-SGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELA-EYNIRCN 183

Query: 191 AIAPGPIERTGGADKLWISEEMAKRTLQ--------SVPLGRMGTPEEIAGLAYYLCSDE 242
++PG T LW E A++ ++ +PL ++ P +IA +L S +
Sbjct: 184 IVSPGST-ETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242

Query: 243 AAYINGTCMTMDGGQHL 259
A +I + +DGG L
Sbjct: 243 AGHITMHNLCVDGGATL 259


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_3821  PERTACTIN     25     0.041    Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 25.1 bits (54), Expect = 0.041
Identities = 13/38 (34%), Positives = 15/38 (39%)

Query: 38 GLLGGALAFGPRPFYPPYPPPFPPPAPFPCYGGPCQQP 75
L+G P+P P P P P P P P Q P
Sbjct: 561 SLVGAKAPPAPKPAPQPGPQPGPQPPQPPQPPQPPQPP 598


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_3825  cloacin       29     0.016    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 28.9 bits (64), Expect = 0.016
Identities = 11/57 (19%), Positives = 27/57 (47%)

Query: 66 VQEGKDNNQAVKDKLDQAVKNTAEREKVLIKEKEALNKAQEEVKSADKHVKKIEDNK 122
V+ + N + + +L+QA ++ A ++ K + N + E+ +A+K +
Sbjct: 316 VEAAERNYERARAELNQANEDVARNQERQAKAVQVYNSRKSELDAANKTLADAIAEI 372


104  BcerKBAB4_4094  BcerKBAB4_4097  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_4094  -2      13       -1.112573  comG operon protein 4
BcerKBAB4_4095  0       12       -0.702531  comG operon protein 3
BcerKBAB4_4096  -2      12       -0.909073  type II secretion system protein
BcerKBAB4_4097  -1      14       -0.197168  type II secretion system protein E
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4094  BCTERIALGSPH  41     4e-07    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 40.7 bits (95), Expect = 4e-07
Identities = 17/66 (25%), Positives = 35/66 (53%), Gaps = 1/66 (1%)

Query: 1 MKQKGFTLLEMLLVLFAISVLSVVTHFNVTSLHEKQKVEQFLKQFSNDILYMQQLAIKRQ 60
M+Q+GFTLLEM+L+L + V + + + + + L +F + ++QQ ++
Sbjct: 1 MRQRGFTLLEMMLILLLMGVSAGMVLLAFPASRDDSAAQT-LARFEAQLRFVQQRGLQTG 59

Query: 61 QHYTLR 66
Q + +
Sbjct: 60 QFFGVS 65


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4095  BCTERIALGSPG  50     3e-11    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 50.3 bits (120), Expect = 3e-11
Identities = 18/68 (26%), Positives = 39/68 (57%)

Query: 1 MQNEEGFTLLEMLLVMVVITVLLLLIIPNVVTQRSSVQGKGCAAYVKSIEAQIQAYHLQH 60
+ GFTLLE+++V+V+I VL L++PN++ + + + + ++E + Y L +
Sbjct: 4 TDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLDN 63

Query: 61 NKIPSIEE 68
+ P+ +
Sbjct: 64 HHYPTTNQ 71


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4096  BCTERIALGSPF  74     1e-16    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 73.7 bits (181), Expect = 1e-16
Identities = 61/357 (17%), Positives = 154/357 (43%), Gaps = 26/357 (7%)

Query: 4 FKRKWSLSDQALLCKRLSDLLEKGYSLLQALEFLQLQLPLGKKLQLQRMIEGLKN----G 59
K + S SD ALL ++L+ L+ L +AL+ + Q +K L +++ +++ G
Sbjct: 61 RKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQ---SEKPHLSQLMAAVRSKVMEG 117

Query: 60 QSLHASFHQLMFHSEMLSYLFYA-----ERHGDISFALQQGSVLLYKKDKYRKDMMKVMR 114
SL + + L+ A E G + L + + ++ + R + + M
Sbjct: 118 HSLADA---MKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMI 174

Query: 115 YPMFLSFFLMIMLSVFNLILLPQFEMMYSSLRSTAPPLTEQILVAIKLLPYFIYMIVLIV 174
YP L+ + ++S+ +++P+ + ++ P T ++ + F ++L +
Sbjct: 175 YPCVLTVVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLAL 234

Query: 175 ITGFSIYIFYFRKLPPTQKVKI---MIRIPLMKTFLILNHSHYFSTQLSGLLHGGLSVHE 231
+ GF + R+ ++V ++ +PL+ ++ ++ LS L + + +
Sbjct: 235 LAGFMAFRVMLRQ--EKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQ 292

Query: 232 ALTIMMKQNYHPFFQYEADRIERQLIAGEPLQSIIDKSGYYEKELSYIITHGQANGNLAN 291
A+ I + + ++ + G L ++++ + + ++I G+ +G L +
Sbjct: 293 AMRISGDVMSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDS 352

Query: 292 EL---GDYSELIIEKVEQKIKRMLFVIQPILFTCLGVIVILMYLAMIMPMFQMMNSI 345
L D + + ++ L + +P+L + +V+ + LA++ P+ Q+ +
Sbjct: 353 MLERAADNQD---REFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLM 406


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4097  PF05272       30     0.016    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.0 bits (67), Expect = 0.016
Identities = 12/50 (24%), Positives = 18/50 (36%), Gaps = 6/50 (12%)

Query: 138 GLLVFTGPTGSGKTTTMYALLEVARKWQTRRIITLEDPVEQRKDGLLQIQ 187
+V G G GK+T + L V + + + KD QI
Sbjct: 597 YSVVLEGTGGIGKSTLINTL--VGLDFFSDTHFDIGT----GKDSYEQIA 640


105  BcerKBAB4_4394  BcerKBAB4_4401  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_4394  0       15       0.065878   small acid-soluble spore protein SspI
BcerKBAB4_4395  1       13       0.074010   metal dependent phosphohydrolase
BcerKBAB4_4396  2       14       -0.363176  abortive infection protein
BcerKBAB4_4397  2       14       1.091594   hypothetical protein
BcerKBAB4_4398  2       16       0.658748   hypothetical protein
BcerKBAB4_4399  3       20       0.938692   EmrB/QacA family drug resistance transporter
BcerKBAB4_4400  4       24       0.945962   hypothetical protein
BcerKBAB4_4401  5       26       1.397624   TetR family transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4394  DNABINDINGHU  25     0.025    Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 24.7 bits (54), Expect = 0.025
Identities = 10/33 (30%), Positives = 15/33 (45%), Gaps = 1/33 (3%)

Query: 19 DQLQETIVDAIQSGEEKMLPGLGVLFEVIWKNA 51
D + + + GE+ L G G FEV + A
Sbjct: 27 DAVFSAVSSYLAKGEKVQLIGFGN-FEVRERAA 58


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4399  TCRTETB       142    5e-39    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 142 bits (360), Expect = 5e-39
Identities = 86/402 (21%), Positives = 175/402 (43%), Gaps = 18/402 (4%)

Query: 106 FVSILNQTIINVALPPLMNEFNVSTSTAQWLITGFMLVNGILVPISAFLVSRFTYRKLFI 165
F S+LN+ ++NV+LP + N+FN ++ W+ T FML I + L + ++L +
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 166 AAMLFFTVGSIICATSGN-FTMMMTGRVVQAVGAGILMPVGMNIFMTLFPPNKRGAAMGL 224
++ GS+I + F++++ R +Q GA + M + P RG A GL
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143

Query: 225 LGVAMILAPAIGPTVTGWVIENYSWNLMFYGMFVIGLIITFLSFKFFTLAQPVSKTKLDV 284
+G + + +GP + G + W+ + + + IIT + K D+
Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHYIHWSYLL--LIPMITIITVPFLMKLLKKEVRIKGHFDI 201

Query: 285 FGVISSSIGLGSLLYGFSEAGNNGWTSAEVVITLIIGVIGLAVFIWRELTTDNKMLDLQV 344
G+I S+G+ + + + LI+ V+ +F+ + +D +
Sbjct: 202 KGIILMSVGIVFFMLF---TTSYSISF------LIVSVLSFLIFVKHIRKVTDPFVDPGL 252

Query: 345 FKYPTFTFTLVINAIVTMALFGGMLLLPVYLQNIRGFTPMESG-LLLLPGSLIMGIMGPV 403
K F ++ I+ + G + ++P ++++ + E G +++ PG++ + I G +
Sbjct: 253 GKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYI 312

Query: 404 AGKLFDKYGIRPLAIVGLAITTFATYKFTTLSMDTPYSVIMTDYIIRSI--GMSFIMMPI 461
G L D+ G + +G+ + + F T S + II + G+SF I
Sbjct: 313 GGILVDRRGPLYVLNIGVTFLSVS---FLTASFLLETTSWFMTIIIVFVLGGLSFTKTVI 369

Query: 462 MTAGMNALPMKLISHGTATQNTSRQVAGSIGTAILITLMTQQ 503
T ++L + G + N + ++ G AI+ L++
Sbjct: 370 STIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIP 411


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4400  RTXTOXIND     75     4e-18    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 75.3 bits (185), Expect = 4e-18
Identities = 29/140 (20%), Positives = 47/140 (33%), Gaps = 22/140 (15%)

Query: 87 QTVDVTIPQNATVVQSNATT-NAFVGAGSPIAYAFDMS------NLWVTANIEETNIDDV 139
Q + P + V Q T V + M L VTA ++ +I +
Sbjct: 326 QASVIRAPVSVKVQQLKVHTEGGVVTTAETL-----MVIVPEDDTLEVTALVQNKDIGFI 380

Query: 140 QKGQTVDVYVDAYPDTT---LTGKVEQVGLTTANTFSMLPSSNATANYTKVKQVVPVKVS 196
GQ + V+A+P T L GKV+ + L V +
Sbjct: 381 NVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAI-------EDQRLGLVFNVIISIEENCL 433

Query: 197 LDHSKSVNIVPGMNVSVRIH 216
+K++ + GM V+ I
Sbjct: 434 STGNKNIPLSSGMAVTAEIK 453


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4401  HTHTETR       61     9e-14    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 61.2 bits (148), Expect = 9e-14
Identities = 22/98 (22%), Positives = 37/98 (37%), Gaps = 6/98 (6%)

Query: 9 PRVKRTRQLIQDAFVALVGEKGFENVTVQHIAERAPVNRATFYSHYHDKYDLLEKSIEEM 68
+ TRQ I D + L ++G + ++ IA+ A V R Y H+ DK DL + E
Sbjct: 7 QEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELS 66

Query: 69 LEKLAAVIKPQNRNKEDFQLTFDSPHPTFLALFEHIAD 106
+ + E P + H+ +
Sbjct: 67 ESNIGELE------LEYQAKFPGDPLSVLREILIHVLE 98


106  BcerKBAB4_4644  BcerKBAB4_4648  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_4644  -1      13       -1.586210  putative proton-coupled thiamine transporter
BcerKBAB4_4645  -1      15       -1.807847  histidine kinase
BcerKBAB4_4646  0       14       -0.766675  two component transcriptional regulator
BcerKBAB4_4647  0       19       -0.226409  sortase family protein
BcerKBAB4_4648  1       20       0.039175   cell wall anchor domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4644  ACRIFLAVINRP  29     0.011    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 29.4 bits (66), Expect = 0.011
Identities = 31/226 (13%), Positives = 76/226 (33%), Gaps = 44/226 (19%)

Query: 3 NTNLQAMIESAILAAFALIIDILPLSLKLPTGGSISFAMIPIFIIAYRWGFKIAF--LGG 60
+ ++ + E+ +L + + + + L ++ ++ F I +G+ I + G
Sbjct: 338 HEVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFG 397

Query: 61 LIWGLLQIVVGDAIIV-------------------------------TPTQTIIEYFVAF 89
++ + ++V DAI+V + F+
Sbjct: 398 MVLAI-GLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPM 456

Query: 90 AFI-GFAGLFYKPIQQTLMNNQRKKTIAYIIF-----ATFIGSLARYFWHFIAGIIFWG- 142
AF G G Y+ T+++ + +I AT + ++ G W
Sbjct: 457 AFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFN 516

Query: 143 ---QYAPKGQSAVLYSLIVNGSTMLASFTLCTVLLILLFTTSPRLF 185
++ + + ++ + L + L +++LF P F
Sbjct: 517 TTFDHSVNHYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSF 562


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits         Score  E-value  Comments
BcerKBAB4_4645  IGASERPTASE  30     0.040    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 30.0 bits (67), Expect = 0.040
Identities = 12/45 (26%), Positives = 21/45 (46%)

Query: 168 KPLEKKISEIREEKEKKKADQEPLVKEKKQVTQEQAAQERANQEQ 212
+ E K + E++EK K + E + K +Q QE++ Q
Sbjct: 1096 QTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQ 1140


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
BcerKBAB4_4646  HTHFIS  88     2e-22    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 88.0 bits (218), Expect = 2e-22
Identities = 30/130 (23%), Positives = 59/130 (45%), Gaps = 2/130 (1%)

Query: 1 MSK-NILIVEDEDILREILKDYFLSEQYKVLEARDGKEALVLFEEEEVDLVILDIMLPEL 59
M+ IL+ +D+ +R +L Y V + + DLV+ D+++P+
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 60 DGWSVCRRIRKT-SGVPIIMLTARVDEDDTLLGFELGADDYVAKPYSPPILLARAKRLLE 118
+ + + RI+K +P+++++A+ + E GA DY+ KP+ L+ R L
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 119 SRQASKKPLE 128
+ LE
Sbjct: 121 EPKRRPSKLE 130


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits         Score  E-value  Comments
BcerKBAB4_4648  IGASERPTASE  33     5e-04    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 33.1 bits (75), Expect = 5e-04
Identities = 25/109 (22%), Positives = 35/109 (32%), Gaps = 2/109 (1%)

Query: 24 ASTGTLTKEESAQMQQDKAKKEEAIKEQQKSEVEKKEAAQVQEKSDMAKKEEAIKAGQKN 83
+ T E S Q + K E+ E E + A+ K++ E A +
Sbjct: 1034 SETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETK 1093

Query: 84 DTAKKEV-TPQVQEKEALAKKEAAIKAGQKNDAAKQEVAKQAVQGEKLP 131
+T E EKE AK E K + Q KQ P
Sbjct: 1094 ETQTTETKETATVEKEEKAKVETE-KTQEVPKVTSQVSPKQEQSETVQP 1141


107  BcerKBAB4_4660  BcerKBAB4_4683  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_4660  1       15       -3.610991  putative lipoprotein
BcerKBAB4_4661  1       17       -2.912446  polysaccharide deacetylase
BcerKBAB4_4662  1       18       -3.122444  hypothetical protein
BcerKBAB4_4663  2       19       -2.977673  rod shape-determining protein MreC
BcerKBAB4_4664  1       18       -2.734745  hypothetical protein
BcerKBAB4_4665  -2      12       -2.577621  histidine kinase
BcerKBAB4_4666  -3      12       -1.563172  two component transcriptional regulator
BcerKBAB4_4667  -3      14       -1.433762  hypothetical protein
BcerKBAB4_4668  -2      12       -1.719598  ribosomal RNA adenine dimethylase
BcerKBAB4_4669  -2      11       -1.633810  hypothetical protein
BcerKBAB4_4670  -2      11       -1.584612  hypothetical protein
BcerKBAB4_4671  -2      13       -1.150017  ABC transporter
BcerKBAB4_4672  -2      12       -1.417880  AraC family transcriptional regulator
BcerKBAB4_4673  -2      14       -1.601702  hypothetical protein
BcerKBAB4_4674  -2      13       -2.291474  ABC transporter
BcerKBAB4_4675  -2      14       -2.506090  hypothetical protein
BcerKBAB4_4676  -2      15       -2.031500  ArsR family transcriptional regulator
BcerKBAB4_4677  -1      13       -2.530027  two component transcriptional regulator
BcerKBAB4_4678  -1      13       -2.348982  histidine kinase
BcerKBAB4_4679  0       12       -2.120498  hypothetical protein
BcerKBAB4_4680  0       11       -1.309975  hypothetical protein
BcerKBAB4_4681  1       10       0.355906   ABC transporter
BcerKBAB4_4682  2       11       0.804562   hypothetical protein
BcerKBAB4_4683  1       14       3.136249   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits       Score  E-value  Comments
BcerKBAB4_4660  RTXTOXIND  36     8e-05    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 36.0 bits (83), Expect = 8e-05
Identities = 25/157 (15%), Positives = 56/157 (35%), Gaps = 20/157 (12%)

Query: 66 NDGKDSNQAIMPKLEQATKSIDEREKVWNKEK----EAFGKAQEEVKSVHKTIDKMEDAA 121
D ++ + T I E+ W +K K + E +V I++ E+ +
Sbjct: 171 PDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLS 230

Query: 122 --LQKQAKNIQDIYKKRYASFSKINDSYQKLMKSERELYKSLGEKETNLKKVSEKIKGVN 179
+ + + + K+ + + + K +++ EL + +
Sbjct: 231 RVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQ--------------LE 276

Query: 180 QMNEDIQREKEKFNRYTQEYNKEKLDFYKQAKIKMKE 216
Q+ +I KE++ TQ + E LD +Q +
Sbjct: 277 QIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGL 313


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4663  LIPPROTEIN48  32     0.003    Mycoplasma P48 major surface lipoprotein signature.
		>LIPPROTEIN48#Mycoplasma P48 major surface lipoprotein signature.

Length = 428

Score = 31.5 bits (71), Expect = 0.003
Identities = 26/76 (34%), Positives = 38/76 (50%), Gaps = 4/76 (5%)

Query: 8 KKILLFLSIILLALLMVYVCTNNKHVQNIVHNIEDIYKVYKENQVLKEKIEHQESLKSKV 67
KKILL LS I L V V N NI +DI K N K+ +++ E LK K
Sbjct: 5 KKILLGLSPIAAILPAVAVSCGNNDESNISFKEKDISKYTTTNANGKQVVKNAELLKLKP 64

Query: 68 QMLSEE----KENFNK 79
++++E ++FN+
Sbjct: 65 VLITDEGKIDDKSFNQ 80


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits        Score  E-value  Comments
BcerKBAB4_4664  ECOLIPORIN  30     0.002    E.coli/Salmonella-type porin signature.
		>ECOLIPORIN#E.coli/Salmonella-type porin signature.

Length = 383

Score = 30.3 bits (68), Expect = 0.002
Identities = 16/65 (24%), Positives = 23/65 (35%), Gaps = 3/65 (4%)

Query: 49 DVYYAVVDSDGEYEVKSK--GFGRWSYKFKGY-DESGKEQKIMLTTTKHLRIGAYLKIKS 105
D Y V GE ++ + G+G+W Y + E L+ G Y
Sbjct: 53 DQTYMRVGFKGETQINDQLTGYGQWEYNVQANTTEGEGANSWTRLAFAGLKFGDYGSFDY 112

Query: 106 KRTYG 110
R YG
Sbjct: 113 GRNYG 117


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
BcerKBAB4_4666  HTHFIS  91     3e-23    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 90.7 bits (225), Expect = 3e-23
Identities = 35/122 (28%), Positives = 61/122 (50%), Gaps = 1/122 (0%)

Query: 1 MSKETILIVDDEKEIRKLIAIYLKNEGYEVLQAGDGEEGLEIVKKRDVHLIVLDIMMPKI 60
M+ TIL+ DD+ IR ++ L GY+V + + D L+V D++MP
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 DGIHMCMKVREI-AEMPIIMLSAKTQDMDKILGLTTGADDYVAKPFNPLELIARIKSQLR 119
+ + ++++ ++P++++SA+ M I GA DY+ KPF+ ELI I L
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 120 RY 121

Sbjct: 121 EP 122


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4670  ACRIFLAVINRP  32     0.009    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 31.7 bits (72), Expect = 0.009
Identities = 36/213 (16%), Positives = 72/213 (33%), Gaps = 41/213 (19%)

Query: 116 SLIIGIAIGSVLSKLFLELLVSMMGLNLNVHFEVPMA----AIVDTAIIFFVIILYTSLQ 171
+LI IA+ VL F +++ G ++N M +VD AI+
Sbjct: 365 TLIPTIAVPVVLLGTFA--ILAAFGYSINTLTMFGMVLAIGLLVDDAIV----------- 411

Query: 172 GYRLIYRFKLIE-LFRAEREGEAMPK-------GSVIMALISVFLIGSGYFLALMFMKAV 223
++E + R E + PK + AL+ + ++ S F+ + F
Sbjct: 412 ---------VVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGS 462

Query: 224 ---MYANF-MAVALYILLATVLGTYLLFMFFTVFVLKRAR---NNKSSFYNGMNMVTTSQ 276
+Y F + + + L+ ++ L + + NK F+ N
Sbjct: 463 TGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHS 522

Query: 277 LLYRIKGNAKSLATISILSAVTLTAVGTSVTMY 309
+ + K L + + V V ++
Sbjct: 523 VNHYTNSVGKILGSTGRYLLIYALIVAGMVVLF 555


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
BcerKBAB4_4671  PF05272  32     0.003    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 31.6 bits (71), Expect = 0.003
Identities = 14/32 (43%), Positives = 18/32 (56%), Gaps = 1/32 (3%)

Query: 41 GPSGSGKTTLLNVLSTIDNATDGEILI-DGKD 71
G G GK+TL+N L +D +D I GKD
Sbjct: 603 GTGGIGKSTLINTLVGLDFFSDTHFDIGTGKD 634


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
BcerKBAB4_4676  HTHTETR  30     5e-04    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 30.4 bits (68), Expect = 5e-04
Identities = 14/43 (32%), Positives = 21/43 (48%), Gaps = 7/43 (16%)

Query: 9 ADPTRRKILD----LLKEG---DLTAGEIAEQFNMTKPSISHH 44
A TR+ ILD L + + GEIA+ +T+ +I H
Sbjct: 9 AQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWH 51


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
BcerKBAB4_4677  HTHFIS  80     1e-19    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 80.3 bits (198), Expect = 1e-19
Identities = 34/117 (29%), Positives = 54/117 (46%), Gaps = 1/117 (0%)

Query: 3 KILIVEDDPNISSLLQSHIQKYGYDAVVAENFDDIMESFNAVKPHLVLLDVNLPKFDGFY 62
IL+ +DD I ++L + + GYD + N + A LV+ DV +P + F
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 63 WCRQIR-HESTCPIIFISARAGEMEQIMAIESGADDYITKPFHYDVVMAKIKGQLRR 118
+I+ P++ +SA+ M I A E GA DY+ KPF ++ I L
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
BcerKBAB4_4678  PF06580  42     2e-06    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 42.2 bits (99), Expect = 2e-06
Identities = 23/113 (20%), Positives = 43/113 (38%), Gaps = 23/113 (20%)

Query: 216 DAKWLKFIIYQLMTNAVRY---SGERGKKVFLSAYRNGKDIILEVRDEGVGIPQEDVRRV 272
D + ++ L+ N +++ +G K+ L ++ + LEV + G +
Sbjct: 252 DVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNT---- 307

Query: 273 FEPFYTGKNGRTFGESTGMGLYIVSK-ICDYLG--HSVKLDSEVGKGTTIKII 322
ESTG GL V + + G +KL + GK + +I
Sbjct: 308 -------------KESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLI 347


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
BcerKBAB4_4683  cloacin  25     0.015    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 25.4 bits (55), Expect = 0.015
Identities = 12/26 (46%), Positives = 13/26 (50%)

Query: 36 GASCFGGGGGGCGYGGYGGYGGGYGG 61
G+ GGG G G GG G GG G
Sbjct: 51 GSGIHWGGGSGHGNGGGNGNSGGGSG 76


108  BcerKBAB4_4893  BcerKBAB4_4899  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias   Product
BcerKBAB4_4893  1       19       2.743462  2,5-didehydrogluconate reductase
BcerKBAB4_4894  2       17       2.756349  major facilitator transporter
BcerKBAB4_4895  1       19       2.323947  HxlR family transcriptional regulator
BcerKBAB4_4896  2       16       1.605401  hypothetical protein
BcerKBAB4_4897  0       13       1.255524  FAD-dependent pyridine nucleotide-disulfide
BcerKBAB4_4898  -2      15       0.412077  tyrosyl-tRNA synthetase
BcerKBAB4_4899  -2      14       0.065491  UDP-N-acetylenolpyruvoylglucosamine reductase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits         Score  E-value  Comments
BcerKBAB4_4893  HELNAPAPROT  31     0.003    Helicobacter neutrophil-activating protein A family ...
		>HELNAPAPROT#Helicobacter neutrophil-activating protein A family signature.

Length = 153

Score = 31.0 bits (70), Expect = 0.003
Identities = 22/85 (25%), Positives = 29/85 (34%), Gaps = 16/85 (18%)

Query: 92 TTLAAYEESLKKLELDYLDLYL----VHWPVEGK-YKDSWRALETLYKE---------ER 137
T E SL ++ LY HW V+G + E LY ER
Sbjct: 8 TNQTLVENSLNTQLSNWFLLYSKLHRFHWYVKGPHFFTLHEKFEELYDHAAETVDTIAER 67

Query: 138 VRAIGVSNFQIHHLKDVLEGAEIKP 162
+ AIG +K+ E A I
Sbjct: 68 LLAIGGQPVAT--VKEYTEHASITD 90


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
BcerKBAB4_4894  TCRTETA  66     5e-14    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 66.0 bits (161), Expect = 5e-14
Identities = 72/336 (21%), Positives = 126/336 (37%), Gaps = 20/336 (5%)

Query: 11 VQTNRRSMFALLALAISAFGIGTTEFISVGLLPSISEDLHVSVTTA---GLTVSLYALGV 67
++ NR + L +A+ A GIG + + +LP + DL S G+ ++LYAL
Sbjct: 1 MKPNRPLIVILSTVALDAVGIG----LIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQ 56

Query: 68 AFGAPVLTSLTASMSRKTLLMWIMIVFIIGNGIAAVATSFTVLLIARVVSAFAHGVFMSI 127
APVL +L+ R+ +L+ + + I A A VL I R+V+
Sbjct: 57 FACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVA 116

Query: 128 GSTIAAALVPENKRASAIAFMFTGLTVATITGVPIGTFIGQQFGWRASFMVIVVIGIIAL 187
G+ I A + ++RA FM + G +G +G F A F + +
Sbjct: 117 GAYI-ADITDGDERARHFGFMSACFGFGMVAGPVLGGLMG-GFSPHAPFFAAAALNGLNF 174

Query: 188 VANSMLIPSNLKKGTRVSFRDQFKLITNGRLLLVFVITALGYGGTF-------VTFTYLS 240
+ L+P + K R R+ + + R + A F V
Sbjct: 175 LTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWV 234

Query: 241 PLLQEVTGFKSSAVTIILLVYGIAIAIGN-MVGGKLSNH-NPIRALFYMFFIQAIVLFVL 298
++ + ++ + I L +GI ++ M+ G ++ RAL +L
Sbjct: 235 IFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILL 294

Query: 299 TFTAPFQVAGLITIIFMGLFTFMNVPGLQVYVVILA 334
F +A I ++ M P LQ +
Sbjct: 295 AFATRGWMAFPIMVLLASGGIGM--PALQAMLSRQV 328


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
BcerKBAB4_4896  PF07299  27     0.024    Fibronectin-binding protein (FBP)
		>PF07299#Fibronectin-binding protein (FBP)

Length = 219

Score = 27.1 bits (60), Expect = 0.024
Identities = 15/82 (18%), Positives = 33/82 (40%), Gaps = 3/82 (3%)

Query: 53 VNILTKSYDFAQTVATDEVLKSDTVSAITELVEPVKDTVKSMAATAIEAKDRADESNEVI 112
IL + A + LKS + I + E + D K + T + ++R D + ++
Sbjct: 23 AYILANGHATANDRGVIQALKSLAIEKIIHVFENLTDEQKELIDTVLTVQNREDAESFLL 82

Query: 113 GLFGLL---KLLKDPQAQKMFR 131
+ + + + +K+F
Sbjct: 83 KINPYVIPFQEVTAQTLKKLFP 104


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits         Score  E-value  Comments
BcerKBAB4_4898  TACYTOLYSIN  30     0.021    Bacterial thiol-activated pore-forming cytolysin sig...
		>TACYTOLYSIN#Bacterial thiol-activated pore-forming cytolysin signature.

Length = 574

Score = 29.9 bits (67), Expect = 0.021
Identities = 23/92 (25%), Positives = 36/92 (39%), Gaps = 18/92 (19%)

Query: 333 DEIEQGFKEMPTFQSSKETKNIVEWLVDLGIEPSRRQAREDINNGAISMN---------- 382
D I+ KEMP + KE K + + S E+IN+ S+N
Sbjct: 77 DMIKLAPKEMPLESAEKEEKKSED------NKKSEEDHTEEINDKIYSLNYNELEVLAKN 130

Query: 383 GEKVTDVSRDVTVENSFDGRFIIIRKGKKNYS 414
GE + + +FI+I + KKN +
Sbjct: 131 GETIENFV--PKEGVKKADKFIVIERKKKNIN 160


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_4899  CHANLCOLICIN  29     0.040    Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 28.5 bits (63), Expect = 0.040
Identities = 23/99 (23%), Positives = 39/99 (39%), Gaps = 6/99 (6%)

Query: 18 HVKQDEMLKNHTHIKVGGKADVFVSPTNYDEIQEVIKYANQYNIPVTFLGNGSNVIIKDG 77
VK D+ K+ K VS YD + +++K + + FL D
Sbjct: 418 SVKYDDWAKHLDQFAKYLKITGHVS-FGYDVVSDILKIKDTGDWKPLFLTLEKKAA--DA 474

Query: 78 GLRGITVSLIHITNVT---VTGTAIVAGCGAAIIDVSRI 113
G+ + L + T + G AIV G + ID +++
Sbjct: 475 GVSYVVALLFSLLAGTTLGIWGIAIVTGILCSYIDKNKL 513


109  BcerKBAB4_5050  BcerKBAB4_5057  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_5050  -2      28       2.591273   RND family efflux transporter MFP subunit
BcerKBAB4_5051  -2      27       2.964356   phosphoglycerate transporter
BcerKBAB4_5052  1       31       3.346503   putative lipoprotein
BcerKBAB4_5053  0       27       2.606103   hypothetical protein
BcerKBAB4_5054  -1      24       1.188491   histidine kinase
BcerKBAB4_5055  1       19       -1.083655  two component transcriptional regulator
BcerKBAB4_5056  1       16       -3.817810  UDP-glucose 4-epimerase
BcerKBAB4_5057  4       17       -5.557070  EPSX protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits       Score  E-value  Comments
BcerKBAB4_5050  RTXTOXIND  40     1e-05    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 40.2 bits (94), Expect = 1e-05
Identities = 33/195 (16%), Positives = 68/195 (34%), Gaps = 20/195 (10%)

Query: 80 PTKGKVKDIAVKEGQEVEKGTKLFSYDNEEINLQMKQAE---LDQKMADMRYDQGKKKID 136
VK+I VKEG+ V KG L + + L ++ RY + I+
Sbjct: 102 IENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIE 161

Query: 137 SLKKEIKKVKDSGAGKEVTDPMEEQVSEL----------EMAQKTTDLEKEKGKLQ--KE 184
K K+ D + V++ +++ L + QK +L+K++ +
Sbjct: 162 LNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLA 221

Query: 185 ELSKKQKELTIYSNFTGVVQKLDKDAAQSSSQALGGQGK-----AFLQVASKDPFQVQGT 239
+++ + + + L A + L + K L+V Q++
Sbjct: 222 RINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESE 281

Query: 240 LTELQKSQIQKDQTF 254
+ ++ Q F
Sbjct: 282 ILSAKEEYQLVTQLF 296


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
BcerKBAB4_5051  TCRTETB  36     2e-04    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 36.4 bits (84), Expect = 2e-04
Identities = 32/163 (19%), Positives = 66/163 (40%), Gaps = 11/163 (6%)

Query: 48 FNLSTTYLVDEYGFSTTQIGLLGAVMAVVYGLSKFFMGNLSDKAFAQRFIAVGLFLSGLV 107
N+S + +++ + + + + G LSD+ +R + G+ ++
Sbjct: 33 LNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFG 92

Query: 108 NICFGFASSFGMILTLLVLNGIVQGMGAPP----CSIVMTKWFSKKERGTKTGLWNISHN 163
++ SF +LL++ +QG GA +V+ ++ K+ RG GL S
Sbjct: 93 SVIGFVGHSF---FSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIG-SIV 148

Query: 164 VGGLLVPPFIGIGVGIFGESHWQGGVFIFPAIIVMVIAVLVWI 206
G V P IG G+ + + P I ++ + L+ +
Sbjct: 149 AMGEGVGPAIG---GMIAHYIHWSYLLLIPMITIITVPFLMKL 188


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
BcerKBAB4_5055  HTHFIS  102    1e-27    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 102 bits (257), Expect = 1e-27
Identities = 36/136 (26%), Positives = 69/136 (50%), Gaps = 1/136 (0%)

Query: 2 KILVVDDESSIRNLIRMQLEMEGYEVLTAADGREALERW-NEQPDVLILDVMLPDTDGYE 60
ILV DD+++IR ++ L GY+V ++ D+++ DV++PD + ++
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 61 LLRLFREKERDIPVLMLTAKSQMNDKLLGLQLGADDYVTKPFNYAELILRVKNMARRVKK 120
LL ++ D+PVL+++A++ + + GA DY+ KPF+ ELI + K+
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 121 KEVPLNHEVIGAGNLL 136
+ L + L+
Sbjct: 125 RPSKLEDDSQDGMPLV 140


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_5056  NUCEPIMERASE  175    2e-54    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 175 bits (445), Expect = 2e-54
Identities = 87/344 (25%), Positives = 152/344 (44%), Gaps = 44/344 (12%)

Query: 3 SILICGGAGYIGSHAVKKLVDEGLSVVVVDNLQTGHEDAI---------TEGAKFYNGDL 53
L+ G AG+IG H K+L++ G VV +DNL ++ ++ G +F+ DL
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 54 RDKEFLRDVFKQENIEAVMHFAADSLVGVSMEKPLQYYNNNVYGALCLLEVMDEFKVEKF 113
D+E + D+F + E V V S+E P Y ++N+ G L +LE K++
Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQHL 121

Query: 114 IFSSTAATYGEVDVDLITEETKTN-PTNTYGETKLAIEKMLHWYSQASNLRYKIFRYFNV 172
+++S+++ YG + + + P + Y TK A E M H YS L R+F V
Sbjct: 122 LYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFFTV 181

Query: 173 AGATPSGIIGEDHRPETHLIPLVLQVALGQREKIMMFGDDYNTPDGTCIRDYIHVEDLVA 232
G P G RP+ L + G+ + YN G RD+ +++D+
Sbjct: 182 YG--PWG------RPDMALFKFTKAMLEGKSIDV------YN--YGKMKRDFTYIDDIAE 225

Query: 233 AHFLGLKDLQNGGESDF----------------YNLGNGNGFSVKEIVDAVREVTKHEIP 276
A + L+D+ ++ + YN+GN + + + + A+ + E
Sbjct: 226 A-IIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAK 284

Query: 277 AEMAPRRAGDPARLVASSQKAKEKLGWNPEYVNVKTIIEHAWDW 320
M P + GD A ++ E +G+ PE VK +++ +W
Sbjct: 285 KNMLPLQPGDVLETSADTKALYEVIGFTPE-TTVKDGVKNFVNW 327


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits         Score  E-value  Comments
BcerKBAB4_5057  TYPE4SSCAGA  29     0.040    Type IV secretion system CagA exotoxin signature.
		>TYPE4SSCAGA#Type IV secretion system CagA exotoxin signature.

Length = 1147

Score = 28.5 bits (63), Expect = 0.040
Identities = 34/108 (31%), Positives = 53/108 (49%), Gaps = 17/108 (15%)

Query: 19 VVSGKLYWNKKVANA--TGQTSEVTKTKAEVKDSGAKKE--EKKEEKKQDAKSSFNEAYA 74
+V L +NK VA+A TG EV K + +++ S K+E EK+ EKK ++KS
Sbjct: 584 LVGKTLNFNKAVADAKNTGNYDEVKKAQKDLEKSLRKREHLEKEVEKKLESKSG-----N 638

Query: 75 KNLPDAVKEKLKKAAQDKKAVNLVIVGDEASSSEKDAWVAKFTANLEA 122
KN K + K A +K ++ EA+ +DA + NL+
Sbjct: 639 KN-----KMEAKAQANSQKDEIFALINKEAN---RDARAIAYAQNLKG 678


110  BcerKBAB4_5205  BcerKBAB4_5212  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_5205  0       13       0.385084   two component LuxR family transcriptional
BcerKBAB4_5206  -1      12       -0.160409  GAF sensor signal transduction histidine kinase
BcerKBAB4_5207  2       11       0.230295   pyridoxal kinase
BcerKBAB4_5208  3       12       -0.094544  diguanylate cyclase
BcerKBAB4_5209  2       12       0.112955   hypothetical protein
BcerKBAB4_5210  1       12       -0.587581  carbon starvation protein CstA
BcerKBAB4_5211  0       11       -1.516255  LytTR family two component transcriptional
BcerKBAB4_5212  -1      11       -1.646143  major facilitator transporter
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
BcerKBAB4_5205  HTHFIS  93     3e-24    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 93.0 bits (231), Expect = 3e-24
Identities = 39/219 (17%), Positives = 85/219 (38%), Gaps = 31/219 (14%)

Query: 2 KIKVLLVDDHTVVLKGLAFFLSTQEDFELVGEANNGKEALVKVGETSPDVVLMDLYMPEM 61
+L+ DD + L L ++ +++ +N + D+V+ D+ MP+
Sbjct: 3 GATILVADDDAAIRTVLNQAL-SRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 62 DGIEATACIKKEYPNVKVIVLTSFSDQAHVLPALKAGASGYILKDIEPDQLVEAIRSAYK 121
+ + IKK P++ V+V+++ + + A + GA Y+ K + +L+ I
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG---- 116

Query: 122 GNIQLHPDIANALLSQTLPQEEKEEEPSVQVDVL--TARENEVLQLLAKGMSNKEIASVL 179
AL + E++ + ++ +A E+ ++LA+ M +++
Sbjct: 117 ----------RALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTD--LTLM 164

Query: 180 VITE----KTVKAHMSSILSKLH-LSDRTQAALYAVKNG 213
+ E K + A LH R A+
Sbjct: 165 ITGESGTGKELVARA------LHDYGKRRNGPFVAINMA 197


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
BcerKBAB4_5206  PF06580  32     0.005    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 32.1 bits (73), Expect = 0.005
Identities = 20/83 (24%), Positives = 39/83 (46%), Gaps = 7/83 (8%)

Query: 447 NVSKHA---NVREATIYFKVTEKNVSLEIVDQGNGFVE-KDIKEKKSLGMTTMRERVELV 502
N KH + I K T+ N ++ + + G + K+ KE G+ +RER++++
Sbjct: 266 NGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGTGLQNVRERLQML 325

Query: 503 GG---TIKIVSSKKRTSIKVNVP 522
G IK+ + + + V +P
Sbjct: 326 YGTEAQIKLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
BcerKBAB4_5211  HTHFIS  47     3e-08    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 47.1 bits (112), Expect = 3e-08
Identities = 20/134 (14%), Positives = 44/134 (32%), Gaps = 12/134 (8%)

Query: 2 KILLIMEETEERRKLVENFTENIRNVECFEAKTGTESLLIMKKHTPDFVFLNSQLMDGTG 61
IL+ ++ R L + + + + D V + + D
Sbjct: 5 TILVADDDAAIRTVLNQAL--SRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 62 FEYSSLLREVNCYTKFIFIGE--DIEEAITAFRFQAFYYLLRPFREEDLQFILYKMGKEQ 119
F+ +++ + + AI A A+ YL +PF +L I+
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGII------- 115

Query: 120 GEKAKSHLRKLPIE 133
+A + ++ P +
Sbjct: 116 -GRALAEPKRRPSK 128


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
BcerKBAB4_5212  TCRTETA  56     1e-10    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 55.6 bits (134), Expect = 1e-10
Identities = 64/369 (17%), Positives = 132/369 (35%), Gaps = 13/369 (3%)

Query: 7 ISKRKLLGIAGLGWLFDAMDVGMLSFVIVALQKDWGLSTQEMGWIG---SVNSIGMAVGA 63
+ + L + DA+ +G++ V+ L +D S G ++ ++ A
Sbjct: 1 MKPNRPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACA 60

Query: 64 LLFGILSDKIGRKSVFIITLLLFSIGSGLTALTTTFAMFLVLRFLIGMGLGGELPVASTL 123
+ G LSD+ GR+ V +++L ++ + A + + R + G+ G VA
Sbjct: 61 PVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGIT-GATGAVAGAY 119

Query: 124 VSESVEAHERGKIVVLLESFWAGGWLIAALISYF---VIPKYGWEVAMVLSAVPALYALY 180
+++ + ER + + + + G + ++ P + A L+ + L +
Sbjct: 120 IADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCF 179

Query: 181 LRWNLPDS-PRFQKVAKRPSVIENIKSVWSVEYRKATIMLWILWFCVVFSYYGMFLWLPS 239
L LP+S ++ +R ++ W+ ++ + + + LW+
Sbjct: 180 L---LPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIF 236

Query: 240 VMVLKGFSLIK-SFQYVLIMTLAQLPGYFTAAWFIERLGRKFVLVTYLIGTACSAYVFGI 298
+ L L RLG + L+ +I +
Sbjct: 237 GEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAF 296

Query: 299 ADSLTALIVAGMLLSFFNLGAWGALYAYTPEQYPTTIRGTGAGMAAAFGRIGGILGPLLV 358
A +LL+ +G AL A Q +G G AA + I+GPLL
Sbjct: 297 ATRGWMAFPIMVLLASGGIGM-PALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLF 355

Query: 359 GYLVASQAS 367
+ A+ +
Sbjct: 356 TAIYAASIT 364



Score = 33.3 bits (76), Expect = 0.001
Identities = 29/125 (23%), Positives = 45/125 (36%), Gaps = 5/125 (4%)

Query: 274 ERLGRKFVLVTYLIGTACSAYVFGIADSLTALIVAGMLLSFFNLGAWGALYAYTPEQYPT 333
+R GR+ VL+ L G A + A L L + G +++ AY +
Sbjct: 68 DRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYI-GRIVAGITGATGAVAGAYIADITDG 126

Query: 334 TIRGTGAG-MAAAFGRIGGILGPLLVGYLVASQASLSLIFTIFCGSILIGALAVVILGQE 392
R G M+A FG G + GP+L G + + L L E
Sbjct: 127 DERARHFGFMSACFG-FGMVAGPVLGGLMGGFSPHAPFFAAAALN--GLNFLTGCFLLPE 183

Query: 393 TKQRE 397
+ + E
Sbjct: 184 SHKGE 188


111  BcerKBAB4_5228  BcerKBAB4_5240  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
BcerKBAB4_5228  -2      15       -0.967747  TetR family transcriptional regulator
BcerKBAB4_5229  -2      16       -1.005077  acriflavin resistance protein
BcerKBAB4_5230  -1      14       -1.361369  methionine-R-sulfoxide reductase
BcerKBAB4_5231  -1      14       -1.139847  hypothetical protein
BcerKBAB4_5232  0       13       0.246945   antiholin-like protein LrgB
BcerKBAB4_5233  -1      12       0.342167   murein hydrolase regulator LrgA
BcerKBAB4_5234  0       11       0.722261   LytTR family two component transcriptional
BcerKBAB4_5235  -1      12       1.487729   signal transduction histidine kinase LytS
BcerKBAB4_5236  1       11       1.234918   major facilitator transporter
BcerKBAB4_5237  2       12       0.747073   BCCT transporter
BcerKBAB4_5238  2       10       -1.105793  nitric-oxide synthase
BcerKBAB4_5239  2       9        -1.125717  superoxide dismutase
BcerKBAB4_5240  2       9        -1.530564  NAD-dependent epimerase/dehydratase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
BcerKBAB4_5228  HTHTETR  62     1e-13    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 62.3 bits (151), Expect = 1e-13
Identities = 21/62 (33%), Positives = 38/62 (61%)

Query: 2 KEKERLIIEMAMKLFATKGVNATSVQEIVTACGISKGAFYLYFKSKDELLLATLRYYYDK 61
+E + I+++A++LF+ +GV++TS+ EI A G+++GA Y +FK K +L
Sbjct: 10 QETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESN 69

Query: 62 IQ 63
I
Sbjct: 70 IG 71


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_5229  ACRIFLAVINRP  666    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 666 bits (1719), Expect = 0.0
Identities = 234/1066 (21%), Positives = 458/1066 (42%), Gaps = 68/1066 (6%)

Query: 4 LINFSLKNKFAVWLLTIIVTIAGIYSGMNMKLETIPDITTPIVTVTTVYPGATPEEVADK 63
+ NF ++ W+L II+ +AG + + + + P I P V+V+ YPGA + V D
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 64 ISKPMEEQLQNLSGVNVVSSSSYQNASS-IQVEYDFDKNMEKAETEIKESLANVK--LPE 120
+++ +E+ + + + +SS+S S I + + + + A+ +++ L LP+
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 121 GVKDPKVSRVNF--NAFPVISLSVASKNESLAALTENVEKNVVPGLKGLDGVASVQIAGQ 178
V+ +S + V + + +++ V NV L L+GV VQ+ G
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 179 QVDEVQLVFKKDKMKELGLSEDTVKNMIKGSDVSLPLGLYTFKDT------EKSVVVDGN 232
Q +++ D + + L+ V N +K + + G S++
Sbjct: 181 QY-AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239

Query: 233 ITTIKALKELKIPAIPSSPNGQGSQNAGAGTQAPQMNPAAMNGIPTVTLEEIADIKEVGK 292
+ ++ + + +G V L+++A ++ G+
Sbjct: 240 FKNPEEFGKVTLRV---NSDGS-----------------------VVRLKDVARVELGGE 273

Query: 293 A-ESISRTNGKEAIGIQIVKAADANTVDVVNAVKDKVKDLEKKY-KDLEIISTFDQGAPI 350
I+R NGK A G+ I A AN +D A+K K+ +L+ + + ++++ +D +
Sbjct: 274 NYNVIARINGKPAAGLGIKLATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFV 333

Query: 351 EKSVETMLSKAIFGAIFAIVIIMLFLRNIRTTLISVVSIPLSLLIAVLVIKQMDITLNIM 410
+ S+ ++ + +++ LFL+N+R TLI +++P+ LL ++ ++N +
Sbjct: 334 QLSIHEVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTL 393

Query: 411 TLGAMTVAIGRVVDDSIVVIENIYRRMSLSEEKLRGKDLIREATKEMFIPIMSSTIVTIA 470
T+ M +AIG +VDD+IVV+EN+ R M E+KL K+ ++ ++ ++ +V A
Sbjct: 394 TMFGMVLAIGLLVDDAIVVVENVERVM--MEDKLPPKEATEKSMSQIQGALVGIAMVLSA 451

Query: 471 VFLPLGLVKGMIGEMFLPFALTIVFALLASLLVAVTIVPMLAHSLFKKESMREKEVHH-- 528
VF+P+ G G ++ F++TIV A+ S+LVA+ + P L +L K S E
Sbjct: 452 VFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGF 511

Query: 529 ----EEKPSKLANGYKRILEWALNHKIITSSITVLLLVGSLALVPIIGVSFLPSEEEKMI 584
N Y + L I L++ G + L + SFLP E++ +
Sbjct: 512 FGWFNTTFDHSVNHYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVF 571

Query: 585 IATYKPEPGQTLDDVEKIATKAEKHFQDKKDVKTIQ--FSLGGENPMSPGQTNQAMFFVQ 642
+ + G T + +K+ + ++ K + ++ F++ G + Q N M FV
Sbjct: 572 LTMIQLPAGATQERTQKVLDQVTDYYL-KNEKANVESVFTVNGFSFSGQAQ-NAGMAFVS 629

Query: 643 YD--NDTKNFEKEKEQVVKDLQKMSGKGEWKN---------QDFGAGGGSNEIKLYVYGD 691
+ E E V+ + GK + G G + + G
Sbjct: 630 LKPWEERNGDENSAEAVIHRAKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGL 689

Query: 692 SSEEIKPVVKDIQNIMKKN-KDLKDIDSSVAKTYAEYTLVADQEKLSKMGLTAAQIGMGL 750
+ + + + ++ L + + + A++ L DQEK +G++ + I +
Sbjct: 690 GHDALTQARNQLLGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTI 749

Query: 751 SNQHDRPVLTTIKKDDKDINVYVETEKQNYETIDDLTNRKITTPLGNEVAVKDVMTVKEG 810
S + + +YV+ + + +D+ + + G V T
Sbjct: 750 STALGGTYVNDFIDRGRVKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWV 809

Query: 811 ETSNTVTHRDGRVYAEVSAKLTSDDVSK-ASAAVQKEVDKMDLPSGIDVSMGGVTQDIQE 869
S + +G E+ + S A A ++ K LP+GI G++ +
Sbjct: 810 YGSPRLERYNGLPSMEIQGEAAPGTSSGDAMALMENLASK--LPAGIGYDWTGMSYQERL 867

Query: 870 SFKQLGLAMLAAIAIVYFVLVVTFGGALAPFAILFSLPFTIIGALVALLISGETLSVSAM 929
S Q + + +V+ L + P +++ +P I+G L+A + + V M
Sbjct: 868 SGNQAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFM 927

Query: 930 IGALMLIGIVVTNAIVLIDRVIH-KENDGLSTREALLEAGTTRLRPILMTAIATIGALIP 988
+G L IG+ NAI++++ E +G EA L A RLRPILMT++A I ++P
Sbjct: 928 VGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLP 987

Query: 989 LALGFEGSGLISKGLGVTVIGGLTSSTLLTLLIVPIVYEALSKFKK 1034
LA+ +G+ V+GG+ S+TLL + VP+ + + + K
Sbjct: 988 LAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVIRRCFK 1033


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
BcerKBAB4_5234  HTHFIS  64     6e-14    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 64.1 bits (156), Expect = 6e-14
Identities = 30/130 (23%), Positives = 59/130 (45%), Gaps = 6/130 (4%)

Query: 3 KVLVVDDEMLARDELKYLLEKTK-EVEIIGEADCVEDALEELMKNKPDIVFLDIQLSDDN 61
+LV DD+ R L L + +V I A + D+V D+ + D+N
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNA---ATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 62 GFEIANILKKMKNPPSIVFATAYDQY--ALQAFEVDALDYILKPFDEERIVQTLKKYKKQ 119
F++ +KK + ++ +A + + A++A E A DY+ KPFD ++ + + +
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 120 KQIKLETKQD 129
+ + +D
Sbjct: 122 PKRRPSKLED 131


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
BcerKBAB4_5235  PF06580  224    3e-70    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 224 bits (572), Expect = 3e-70
Identities = 66/216 (30%), Positives = 114/216 (52%), Gaps = 13/216 (6%)

Query: 359 QLELGEAELQSKLLQDAEIKALQAQINPHFLFNAINTVSALCRTDVEKARKLLLQLSVYF 418
Q E+ + ++ + Q+A++ AL+AQINPHF+FNA+N + AL D KAR++L LS
Sbjct: 146 QAEIDQWKMA-SMAQEAQLMALKAQINPHFMFNALNNIRALILEDPTKAREMLTSLSELM 204

Query: 419 RCNLQGARQLLIPLEQELNHVHAYLSLEQARFPNKYEVNMYIEEALKVTLVPPFVLQLLV 478
R +L+ + + L EL V +YL L +F ++ + I A+ VPP ++Q LV
Sbjct: 205 RYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPMLVQTLV 264

Query: 479 ENALRHAFPKKQPVCQVEVHVFEKEGMVHFEVKDNGQGIESERLEQLGKMVVPSKKGTGT 538
EN ++H + ++ + + G V EV++ G + +K+ TGT
Sbjct: 265 ENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSL-----------ALKNTKESTGT 313

Query: 539 ALYNINERLIGLFGKETMLQIESELEQGTKILFVIP 574
L N+ ERL L+G E +++ + + ++ +IP
Sbjct: 314 GLQNVRERLQMLYGTEAQIKLSEKQGKVNAMV-LIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
BcerKBAB4_5236  TCRTETB  54     5e-10    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 54.1 bits (130), Expect = 5e-10
Identities = 77/418 (18%), Positives = 145/418 (34%), Gaps = 42/418 (10%)

Query: 35 LDMLLLSFVLVYILKEFHLSPVEGGNLTLATTIGMLIGSYLFGFIADLFGRIRTMAFTIL 94
L+ ++L+ L I +F+ P + A + IG+ ++G ++D G R + F I+
Sbjct: 28 LNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGII 87

Query: 95 LFSLATALIYFATDYWQLLIL-RFLVGMGVGGEFGIGMAIVTETWSKEMRAKATSVVALG 153
+ + + + ++ LLI+ RF+ G G + M +V KE R KA ++
Sbjct: 88 INCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSI 147

Query: 154 WQFGVLVASLLPAFIVPHFGWRAVFLFGLIPALLAVYVRKSLSEPKIWEQKQRYKKELL- 212
G V + I + W + L +I + ++ K L + + K +L
Sbjct: 148 VAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDIKGIILM 207

Query: 213 -----------QKEAKGNLTTT-------EAEQLKQMKKFPLRKLFANKKVTITTIGLII 254
+ L + K F L N I ++
Sbjct: 208 SVGIVFFMLFTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGLGKNIPFMI----GVL 263

Query: 255 MSFIQNFGYYGIFTWMPTILANKYNYTLAKA-SGWMFISTIGMLIGIATFGILADKIGRR 313
I G + +P ++ + + + A+ S +F T+ ++I GIL D+ G
Sbjct: 264 CGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGILVDRRGPL 323

Query: 314 KTFTIYYVGGTIYCLIYFFL-FTDSTSLLWGSALLGFFANGMMGGFGAVLAENYPAEARS 372
+G T + + F T+ + + F + + +
Sbjct: 324 YVL---NIGVTFLSVSFLTASFLLETTSWF--MTIIIVFVLGGLSFTKTVISTIVSSSLK 378

Query: 373 TAENFIFGTGRGL--------AGFGPVIIGLLAAGGNLMGALSLIFIIYPIGLVTMLL 422
E G G L G G I+G L + L L + + L + LL
Sbjct: 379 QQEA---GAGMSLLNFTSFLSEGTGIAIVGGLLSIPLLDQRLLPMEVDQSTYLYSNLL 433


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
BcerKBAB4_5240  NUCEPIMERASE  36     3e-04    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 35.5 bits (82), Expect = 3e-04
Identities = 13/26 (50%), Positives = 17/26 (65%)

Query: 3 KVLVLGGTRFFGKHLVEVLLQAGHEV 28
         K LV G   F G H+ + LL+AGH+V
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQV 27



 
Contact Sachin Pundhir for Bugs/Comments.