PredictBias

identification of genomic and pathogenicity islands in prokaryotic genome
Home | Help | Analyzed genomes
 
A) Input parameters
GenomeZS7.gbThreshold dinucleotide bias2
Threshold codon bias4Threshold %GC bias3
E-value (RPSBlast)0.05Genome (non-pathogenic)
 
B) Compare a potential GI or PAI in related non-pathogenic sp. (phylogenetic tree)
Potential GI or PAI start    end  
Select Organism     
 
C) Potential GIs and PAIs in NC_011728 (download)
S.NoStartEndBiasVirulenceInsertion elementsPrediction
1BBUZS7_RS01690BBUZS7_RS01740Y        NNGenomic Island
LocusTagDNBiasCDNBias%GCBiasProduct
BBUZS7_RS01690235-2.238111Cof-type HAD-IIB family hydrolase
BBUZS7_RS01695333-1.510909aminopeptidase
BBUZS7_RS01700233-2.752498divergent PAP2 family protein
BBUZS7_RS01705334-3.313303hypothetical protein
BBUZS7_RS01710433-3.419836membrane protein
BBUZS7_RS01715429-3.754118hypothetical protein
BBUZS7_RS01720328-3.263286peptide chain release factor 2
BBUZS7_RS01725231-5.163071hypothetical protein
BBUZS7_RS01730122-4.498810signal recognition particle-docking protein
BBUZS7_RS01735019-4.263750hypothetical protein
BBUZS7_RS01740-117-3.456769ABC transporter permease
2BBUZS7_RS02210BBUZS7_RS02270Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
BBUZS7_RS022102192.011175DUF58 domain-containing protein
BBUZS7_RS022153102.454035MoxR family ATPase
BBUZS7_RS02220392.14626016S rRNA (guanine(527)-N(7))-methyltransferase
BBUZS7_RS022253122.576984tRNA uridine-5-carboxymethylaminomethyl(34)
BBUZS7_RS022304191.341819tRNA uridine-5-carboxymethylaminomethyl(34)
BBUZS7_RS022353261.110379FlbF protein
BBUZS7_RS022402221.423762flagellar hook-associated protein FlgK
BBUZS7_RS022450150.497742flagellar hook-associated protein 3
BBUZS7_RS02250010-1.149928flagellar assembly protein FliW
BBUZS7_RS02255214-1.331665carbon storage regulator CsrA
BBUZS7_RS022601170.201333tRNA
BBUZS7_RS02265211-1.069679tRNA
BBUZS7_RS02270215-2.868895hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BBUZS7_RS02215HTHFIS391e-05 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 39.4 bits (92), Expect = 1e-05
Identities = 36/143 (25%), Positives = 55/143 (38%), Gaps = 15/143 (10%)

Query: 33 KEMIDAILMGLLTDGHVLLEGVPGLAKTL---AIQTVSDVLDLEFKRIQ---FTPDLLPS 86
+E+ + + TD +++ G G K L A+ + F I DL+ S
Sbjct: 147 QEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIES 206

Query: 87 DLTGNMVYKSA-TGTFKVRKGPVFS----NVILADEINRAPAKVQSALLEAMGERQVT-L 140
+L G K A TG G F + DEI P Q+ LL + + + T +
Sbjct: 207 ELFG--HEKGAFTGAQTRSTG-RFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTV 263

Query: 141 GDETHKLPDPFFVLATQNPIEQE 163
G T D V AT ++Q
Sbjct: 264 GGRTPIRSDVRIVAATNKDLKQS 286


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BBUZS7_RS02230TCRTETOQM340.002 Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family

signature.
Length = 639

Score = 33.7 bits (77), Expect = 0.002
Identities = 34/112 (30%), Positives = 48/112 (42%), Gaps = 14/112 (12%)

Query: 229 GSVNAGKSSLFNLFLKKDRSIVSSYPGTTRDYIEASFELDGILFNLFDTAGLRDADNFVE 288
GSV+ G + N L++ R I T + I SF+ + N+ DT G D V
Sbjct: 34 GSVDKGTTRTDNTLLERQRGI------TIQTGI-TSFQWENTKVNIIDTPGHMDFLAEVY 86

Query: 289 RLGIEKSNSLIKEASLVIYVIDVSSNLTKDDFLFIDSNKSNSKILFVLNKID 340
R S S++ A L+I D T+ LF K +F +NKID
Sbjct: 87 R-----SLSVLDGAILLISAKDGVQAQTR--ILFHALRKMGIPTIFFINKID 131


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BBUZS7_RS02240FLGHOOKAP15190.0 Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 519 bits (1338), Expect = 0.0
Identities = 137/633 (21%), Positives = 259/633 (40%), Gaps = 105/633 (16%)

Query: 6 SGIEIGKRSLFAHKDAMNTVGHNLSNATKPGYSRQRVTMKTEIPLYAPQLNRAKKQGQLG 65
S I L A + A+NT +N+S+ GY+RQ M A + G +G
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIM-------AQANSTLGAGGWVG 54

Query: 66 QGIVVQSIDRVKDELLNTRIIEESHRLGYWTSQDKFISILEDVYNEPEDQSIRKRLNDFW 125
G+ V + R D + ++ + T++ + +S ++++ + S+ ++ DF+
Sbjct: 55 NGVYVSGVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLST-STSSLATQMQDFF 113

Query: 126 ESWHDLANQPQGLAERKIILERGKSFCEGIRNRFHSLERIYIMANDEIKI----TTDEAN 181
S L + + A R+ ++ + EG+ N+F + ++ + ++ I + D+ N
Sbjct: 114 TSLQTLVSNAEDPAARQALIGKS----EGLVNQFKTTDQYLRDQDKQVNIAIGASVDQIN 169

Query: 182 NYIRNIANLNKQISKSQAMK--DNPNDLMDARDLMVEKLGNIISVSIENKQDPNEFLIHA 239
NY + IA+LN QIS+ + +PN+L+D RD +V +L I+ V + + + A
Sbjct: 170 NYAKQIASLNDQISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMA 229

Query: 240 EGRHLVQGSIANEF-KLEATNGPTRTRWNIL--WTNN---DKAYLKTGKLGSLLNIRDEE 293
G LVQGS A + + ++ P+RT + N + L TG LG +L R ++
Sbjct: 230 NGYSLVQGSTARQLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQD 289

Query: 294 IKNEINELNNIAANIIEIVNEIHEAGRGMDKKNGRSFFSQELKLTDDRGRYDTNGNGQFD 353
+ N L +A E N H+AG + G FF+ + N + D
Sbjct: 290 LDQTRNTLGQLALAFAEAFNTQHKAGFDANGDAGEDFFAIG------KPAVLQNTKNKGD 343

Query: 354 -SVHIFKINSTNEIFPEEKLGFYGTLKFEATNSNEIVEIPYNAPDTVQDVINRINNSNAQ 412
++ +++ + + K+ F +++ T A +T V N A
Sbjct: 344 VAIGATVTDASAVLATDYKISFDNN-QWQVTRL---------ASNTTFTVTPDANGKVAF 393

Query: 413 VTARINSEGKLEIKAVKEQEDENITFKIKHIEDSGSFLTKYTGILNASGPEGAYDYKNID 472
+ G AV + +F +K + D I+N ++
Sbjct: 394 DGLELTFTGTP---AVND------SFTLKPVSD---------AIVNM----------DVL 425

Query: 473 TTD--KLAPKSTYSISPLKNPAAWIKVADIIDSDPSKIASGIKNPTNEISIGDNQAALRI 530
TD K+A S + A DSD + + +N ++G ++
Sbjct: 426 ITDEAKIAMAS-------EEDAG--------DSDNRNGQALLDLQSNSKTVGGAKSF--N 468

Query: 531 SSFGNSQIMIGKNLTLNDYFANTASNIAIKGQISEITKESQSQILKDLTDLRMSISGVNK 590
++ + IG + T N+ +++++ + Q SISGVN
Sbjct: 469 DAYASLVSDIGNKTATLKTSSATQGNV-----VTQLSNQQQ------------SISGVNL 511

Query: 591 DEELANMIEFQQAFIAASKFITVSVELIDTVIN 623
DEE N+ FQQ ++A ++ + + + D +IN
Sbjct: 512 DEEYGNLQRFQQYYLANAQVLQTANAIFDALIN 544


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BBUZS7_RS02245FLAGELLIN578e-11 Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 56.6 bits (136), Expect = 8e-11
Identities = 37/133 (27%), Positives = 59/133 (44%), Gaps = 5/133 (3%)

Query: 10 TYENFKTSSAEQESKITKLLENLYKGGKRIVKLRNDPTGVTHAIRLDNDIFKLNVYIKNI 69
T N S + S I +L G RI ++D G A R ++I L +N
Sbjct: 13 TQNNLNKSQSSLSSAIERL-----SSGLRINSAKDDAAGQAIANRFTSNIKGLTQASRNA 67

Query: 70 DTSKSNLRYTEGYLRSLTNILTRAKEIAIQGASGTYESDDKKMISKEVNALLEDVVAIAN 129
+ S + TEG L + N L R +E+++Q +GT D K I E+ LE++ ++N
Sbjct: 68 NDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEIDRVSN 127

Query: 130 AKGPDGYSIFSGT 142
+G + S
Sbjct: 128 QTQFNGVKVLSQD 140



Score = 30.8 bits (69), Expect = 0.012
Identities = 33/358 (9%), Positives = 87/358 (24%), Gaps = 12/358 (3%)

Query: 61 KLNVYIKNIDTSKSNLRYTEGYLRSLTNILTRAKEIAIQGASGTYESDDKKMISKEVNAL 120
+ + ++ ID L + TY K +
Sbjct: 154 TITIDLQKIDVKSLGLDGFNVNGPKEATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGA 213

Query: 121 LEDVVAIANAKGPDGYSIFSGTKIDSEAFKVTRENKISKTSKDGAGPQIIKVEYNGNQAE 180
+ + +G +A T + T + + +
Sbjct: 214 VVTDTTAPTVPDKVYVNAANGQLTTDDAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGK 273

Query: 181 KKTEVYNDIHMSNNYPGNVIFFLQNQNIISSINTNGFAVKENTKIYIDNIEIGLTAGDTA 240
+ + + +S+ I + ++
Sbjct: 274 EGDTFDYK---GVTFTIDTKTGNDGNGKVSTTINGEKVTLTVADITAGAANVDAATLQSS 330

Query: 241 LDIVAKINESSAPVEASIDPVLNSLSIKTTTPHQIWITEEKESNVLQTLGILTKNNDTKL 300
++ + + LS + + + K+
Sbjct: 331 KNVYTSVVNGQFTFDDKTKNESAKLSDLEANN----AVKGESKITVNGAEYTANAAGDKV 386

Query: 301 PPYNLSSSTEVRSRSIFDALIELRDTLYNNKEELVGSRSLAEIDESLKRLLISVADLGAK 360
+ + + + + E + + S ID +L ++ + LGA
Sbjct: 387 TLAGKTMFIDKTASGVSTLINEDAAAAKKSTANPLAS-----IDSALSKVDAVRSSLGAI 441

Query: 361 ENRLDRSYERISKEAADMKEDMIQYTDLDVTKAITNLNMASLAYQVSIGISAKIMQTT 418
+NR D + + ++ + D D ++N++ A + Q + A+ Q
Sbjct: 442 QNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSNMSKAQILQQAGTSVLAQANQVP 499


3BBUZS7_RS02735BBUZS7_RS02930Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
BBUZS7_RS027351173.126565flagellar biosynthesis protein FliZ
BBUZS7_RS027401184.130941flagellar motor switch protein FliN
BBUZS7_RS027451174.930465flagellar motor switch protein FliM
BBUZS7_RS027500194.530761flagellar basal body-associated protein FliL
BBUZS7_RS02755-2202.587759flagellar motor protein MotB
BBUZS7_RS02760-2201.994362motility protein A
BBUZS7_RS02765-2160.460954flagellar protein FlbD
BBUZS7_RS02770-2151.231390flagellar hook protein FlgE
BBUZS7_RS02775116-0.264266flagellar hook assembly protein FlgD
BBUZS7_RS027805160.740001flagellar hook-length control protein FliK
BBUZS7_RS027856173.013096flagellar protein
BBUZS7_RS027905183.269873flagellar protein FlbA
BBUZS7_RS027954194.263700flagellar protein export ATPase FliI
BBUZS7_RS028006193.992607flagellar assembly protein FliH
BBUZS7_RS028054193.599347flagellar motor switch protein FliG
BBUZS7_RS028102183.486817flagellar basal body M-ring protein FliF
BBUZS7_RS028151192.204966flagellar hook-basal body complex protein FliE
BBUZS7_RS028201181.529780flagellar basal body rod protein FlgC
BBUZS7_RS028251192.794980flagellar basal body rod protein FlgB
BBUZS7_RS028300183.837032HslU--HslV peptidase ATPase subunit
BBUZS7_RS02835-1193.278012ATP-dependent protease subunit HslV
BBUZS7_RS02840-2172.479007DNA-protecting protein DprA
BBUZS7_RS02845-1162.566078hypothetical protein
BBUZS7_RS02850-1152.485274cell division protein FtsZ
BBUZS7_RS028550130.737662cell division protein FtsA
BBUZS7_RS02860213-0.621829cell division protein FtsQ/DivIB
BBUZS7_RS02865314-0.788059putative lipid II flippase FtsW
BBUZS7_RS02870115-2.322892phospho-N-acetylmuramoyl-pentapeptide-
BBUZS7_RS02875017-3.111035UDP-N-acetylmuramoyl-tripeptide--D-alanyl-D-
BBUZS7_RS02880017-3.410824hypothetical protein
BBUZS7_RS02885016-2.78845616S rRNA (cytosine(1402)-N(4))-methyltransferase
BBUZS7_RS02890117-3.150989hypothetical protein
BBUZS7_RS02895-113-3.050559hypothetical protein
BBUZS7_RS02900-113-2.317948hypothetical protein
BBUZS7_RS02905-113-2.690593NAD(+)/NADH kinase
BBUZS7_RS02910-113-3.330341chemotaxis protein CheW
BBUZS7_RS02915010-3.368782RlmE family RNA methyltransferase
BBUZS7_RS02920-110-3.229630polyprenyl synthetase family protein
BBUZS7_RS02925-111-3.256448hypothetical protein
BBUZS7_RS02930115-3.313227ABC transporter permease
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BBUZS7_RS02740FLGMOTORFLIN1037e-32 Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 103 bits (258), Expect = 7e-32
Identities = 40/77 (51%), Positives = 62/77 (80%)

Query: 35 NFGLLMDVSMQLTVELGRTERKIKDILGMSEGTIITLDKLAGEPVDILVNGKIVAKGEVV 94
+ L+MD+ ++LTVELGRT IK++L +++G+++ LD LAGEP+DIL+NG ++A+GEVV
Sbjct: 53 DIDLIMDIPVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVV 112

Query: 95 VIDENFGVRITEIIKTK 111
V+ + +GVRIT+II
Sbjct: 113 VVADKYGVRITDIITPS 129


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BBUZS7_RS02745FLGMOTORFLIM458e-165 Flagellar motor switch protein FliM signature.
		>FLGMOTORFLIM#Flagellar motor switch protein FliM signature.

Length = 344

Score = 458 bits (1181), Expect = e-165
Identities = 195/345 (56%), Positives = 259/345 (75%), Gaps = 9/345 (2%)

Query: 8 LSQDDIDSLLESINSSESLSLDESLSNVISSPTGKKQKVKVYDFKRPDKFSKEQVRTVSS 67
LSQD+ID LL +I+S ++ D + P +K+ +YDF+RPDKFSKEQ+RT+S
Sbjct: 5 LSQDEIDQLLTAISSGDASIED-------ARPISDTRKITLYDFRRPDKFSKEQMRTLSL 57

Query: 68 FHEAFARYTTTSLSALLRKMVHVHVASVDQLTYEEFIRSIPNPTTLAIINMDPLKGSAIF 127
HE FAR TTTSLSA LR MVHVHVASVDQLTYEEFIRSIP P+TLA+I MDPLKG+A+
Sbjct: 58 MHETFARLTTTSLSAQLRSMVHVHVASVDQLTYEEFIRSIPTPSTLAVITMDPLKGNAVL 117

Query: 128 EVDPTIAFAIVDRLFGGDGDTIKDKSRDLTEIEQSVMESVIIRILANMREAWSQVVDLRP 187
EVDP+I F+I+DRLFGG G K + RDLT+IE SVME VI+RILAN+RE+W+QV+DLRP
Sbjct: 118 EVDPSITFSIIDRLFGGTGQAAKVQ-RDLTDIENSVMEGVIVRILANVRESWTQVIDLRP 176

Query: 188 RFGHIEVNPQFAQIVPPTEMIILVTLEVKIGKVEGLMNFCLPYITIEPIVSKLSTRYWHS 247
R G IE NPQFAQIVPP+EM++LVTLE K+G+ EG+MNFC+PYITIEPI+SKLS+++W S
Sbjct: 177 RLGQIETNPQFAQIVPPSEMVVLVTLETKVGEEEGMMNFCIPYITIEPIISKLSSQFWFS 236

Query: 248 LIGVGTTSENLDALREKLENTAMPLVAEIGEVKLKVREILSLDKGDVLNLESSLINKDLT 307
+ +T++ + LR+KL M +VAE+G ++L VR+IL L GD++ L + +
Sbjct: 237 SVRRSSTTQYMGVLRDKLSTVDMDVVAEVGSLRLSVRDILGLRVGDIIRLHDTHVGDPFV 296

Query: 308 LKVGTKEKFKCRMGLMGNKVSVQITEKIGDIKGFDLLKELTEEVE 352
L +G ++KF C+ G++G K++ QI E+I D +EL+ + E
Sbjct: 297 LSIGNRKKFLCQPGVVGKKIAAQILERIESTSQED-FEELSADEE 340


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BBUZS7_RS02755OMPADOMAIN558e-11 OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 54.6 bits (131), Expect = 8e-11
Identities = 28/132 (21%), Positives = 57/132 (43%), Gaps = 21/132 (15%)

Query: 124 ISLAADAFFDSASADVKLEENRDSIQKIASFIGFLSPRGYNFKIEGHTDNIDTDVNGPWK 183
+L +D F+ A +K E + ++ ++ S + L P+ + + G+TD I +D
Sbjct: 215 FTLKSDVLFNFNKATLK-PEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSD-----A 268

Query: 184 SNWELSAARSVNMLEHILNYLDQSDVKRIENNFEVSGFGGSRPIATDDT---------PE 234
N LS R+ + +++YL + + G G S P+ + +
Sbjct: 269 YNQGLSERRA----QSVVDYLISKGIPA--DKISARGMGESNPVTGNTCDNVKQRAALID 322

Query: 235 GRAYNRRIDILI 246
A +RR++I +
Sbjct: 323 CLAPDRRVEIEV 334


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BBUZS7_RS02770FLGHOOKAP1514e-09 Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 51.5 bits (123), Expect = 4e-09
Identities = 18/89 (20%), Positives = 33/89 (37%), Gaps = 13/89 (14%)

Query: 4 SLYSGVSGLQNHQTRMDVVGNNIANVNTIGFKKGRVNFQDMISQSISGASRPTDARGGTN 63
+ + +SGL Q ++ NNI++ N G+ + I + T GG
Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTT---------IMAQANSTLGAGGW- 52

Query: 64 PKQVGLGMNVASIDTIHTQGAFQSTQKAS 92
VG G+ V+ + + + A
Sbjct: 53 ---VGNGVYVSGVQREYDAFITNQLRAAQ 78



Score = 40.7 bits (95), Expect = 1e-05
Identities = 9/48 (18%), Positives = 28/48 (58%)

Query: 394 IRSGVLEMANVDLAEQFTDMIVTQRGFQANAKTITTSDQLLQELVRLK 441
+ + ++ V+L E++ ++ Q+ + ANA+ + T++ + L+ ++
Sbjct: 499 LSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR 546


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BBUZS7_RS02780FLGHOOKFLIK392e-05 Flagellar hook-length control protein signature.
		>FLGHOOKFLIK#Flagellar hook-length control protein signature.

Length = 375

Score = 39.4 bits (91), Expect = 2e-05
Identities = 24/82 (29%), Positives = 45/82 (54%), Gaps = 5/82 (6%)

Query: 281 KLVLKPKELGSIRINLNLDSNNNLLGKIVVDNQNVKMLFDQNMHSLNKMLGESGFNASLN 340
+L L P++LG ++I+L +D N + ++V +Q+V+ + + L L ESG +
Sbjct: 260 ELRLHPQDLGEVQISLKVDDNQAQI-QMVSPHQHVRAALEAALPVLRTQLAESGIQLGQS 318

Query: 341 LFLAGENLNSFTGDFKDDSKDQ 362
++GE SF+G + S+ Q
Sbjct: 319 -NISGE---SFSGQQQAASQQQ 336


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BBUZS7_RS02785TYPE4SSCAGX290.019 Type IV secretion system CagX conjugation protein si...
		>TYPE4SSCAGX#Type IV secretion system CagX conjugation protein

signature.
Length = 522

Score = 28.6 bits (63), Expect = 0.019
Identities = 34/143 (23%), Positives = 66/143 (46%), Gaps = 20/143 (13%)

Query: 39 RYFPEFVRTKLLGETSLVFDHNSNIILDEAR--LVKEREAIDIKNQQIEKLKEDLKLKED 96
R + EF++TK L+ D L+E + L KE+EA + + Q+ +K K + + +E
Sbjct: 120 RDYQEFLKTK-----KLIVDAPDPKELEEQKKALEKEKEAKE-QAQKAQKDKREKRKEER 173

Query: 97 SLNKLEFELKQKQKDLDLKQKVIDDIINKYNDEEANILQTAVYLMNMPPEDAVKRLEDLN 156
+ N+ E + + + N N L + D ++RLED+
Sbjct: 174 AKNRANLE------------NLTNAMSNPQNLSNNKNLSELIKQQRENELDQMERLEDMQ 221

Query: 157 PELAISYMRKIEELSKKEGRLSI 179
+ + +++IEEL+KK+ ++
Sbjct: 222 EQAQANALKQIEELNKKQAEEAV 244


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BBUZS7_RS02800FLGFLIH473e-08 Flagellar assembly protein FliH signature.
		>FLGFLIH#Flagellar assembly protein FliH signature.

Length = 228

Score = 46.7 bits (110), Expect = 3e-08
Identities = 43/197 (21%), Positives = 96/197 (48%), Gaps = 22/197 (11%)

Query: 115 KESIETESNAEIER-LAREYEEKLKTDLDIAIAKGREEGYSKGY--------ESGFEDFD 165
+E+I E+ +E+ LA+ + + IA+GR++G+ +GY E G +
Sbjct: 29 EETIIEEAEPSLEQQLAQLQMQAHEQGYQAGIAEGRQQGHKQGYQEGLAQGLEQGLAEAK 88

Query: 166 KVMRKLHAIIASLIAERKGILESSSGQIVSLVMQIAIKVIKRITDSQKDI----VLENVN 221
+HA + L++E + L++ I S +MQ+A++ +++ + +++ +
Sbjct: 89 SQQAPIHARMQQLVSEFQTTLDALDSVIASRLMQMALEAARQVIGQTPTVDNSALIKQIQ 148

Query: 222 EVLKR---VKDKTQITIRVNLDDLDIVRHKKSDFISRFDIIENLEIIEDPNIGKGGCIIE 278
++L++ K Q +RV+ DDL V +S + + DP + GGC +
Sbjct: 149 QLLQQEPLFSGKPQ--LRVHPDDLQRVDDMLGATLS----LHGWRLRGDPTLHPGGCKVS 202

Query: 279 TNFGEIDARISSQLDKI 295
+ G++DA ++++ ++
Sbjct: 203 ADEGDLDASVATRWQEL 219


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BBUZS7_RS02805FLGMOTORFLIG427e-153 Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 427 bits (1100), Expect = e-153
Identities = 344/344 (100%), Positives = 344/344 (100%)

Query: 1 MEEKKEKEILDVSALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITS 60
MEEKKEKEILDVSALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITS
Sbjct: 1 MEEKKEKEILDVSALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITS 60

Query: 61 ELKDNVLLEFKELMMAQEFIQKGGIDYARELLEKSLGTQKAVDIINNLGSALQSRPFEFV 120
ELKDNVLLEFKELMMAQEFIQKGGIDYARELLEKSLGTQKAVDIINNLGSALQSRPFEFV
Sbjct: 61 ELKDNVLLEFKELMMAQEFIQKGGIDYARELLEKSLGTQKAVDIINNLGSALQSRPFEFV 120

Query: 121 RRADPANILNFIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPE 180
RRADPANILNFIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPE
Sbjct: 121 RRADPANILNFIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPE 180

Query: 181 VVREVERVLEKKLASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEI 240
VVREVERVLEKKLASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEI
Sbjct: 181 VVREVERVLEKKLASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEI 240

Query: 241 KKKMFVFEDIVLLDDRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKED 300
KKKMFVFEDIVLLDDRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKED
Sbjct: 241 KKKMFVFEDIVLLDDRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKED 300

Query: 301 MEFLGPTRRKDVEESQQKIVSLIRKLEEQGEIVISRGGEEDVLV 344
MEFLGPTRRKDVEESQQKIVSLIRKLEEQGEIVISRGGEEDVLV
Sbjct: 301 MEFLGPTRRKDVEESQQKIVSLIRKLEEQGEIVISRGGEEDVLV 344


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BBUZS7_RS02810FLGMRINGFLIF1621e-45 Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 162 bits (412), Expect = 1e-45
Identities = 112/565 (19%), Positives = 222/565 (39%), Gaps = 55/565 (9%)

Query: 23 QKIALGLIIFFVILALVFLIGFSTKSQSIALF-GVEIKDQYLLDRISQRLDRENVKYFLS 81
+I L + + +V ++ ++ LF + +D I +L + N+ Y +
Sbjct: 23 PRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQD---GGAIVAQLTQMNIPYRFA 79

Query: 82 SDGRIYLDDEKLAKKMRAILVREELVPVHMDPWALFDIDRWTITDFERSINLRRSITRAV 141
+ ++R L ++ L + L D +++ I+ F +N +R++ +
Sbjct: 80 NGSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQFSEQVNYQRALEGEL 139

Query: 142 EQHIVALDDVDAVSVNLVMPEKALFKESQEPVKASVRITPRPGSDIITNRKKVEGLVKLI 201
+ I L V + V+L MP+ +LF Q+ ASV +T PG + + ++ +V L+
Sbjct: 140 ARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRAL--DEGQISAVVHLV 197

Query: 202 QYAIEGLESDNIAIVDNSGTILNDFSNLDGIDRIDLAEKERKLKLKYEAMLRGEIDSALS 261
A+ GL N+ +VD SG +L SN G DL + + K E+ ++ I++ LS
Sbjct: 198 SSAVAGLPPGNVTLVDQSGHLL-TQSNTSG---RDLNDAQLKFANDVESRIQRRIEAILS 253

Query: 262 KVLSVDRFMIARVNVKLDTSKETTESKEYAPIELQSQDPKASYNTRKVSDSTIISSQTQK 321
++ A+V +LD + + + Y+P KA+ +R+++ S + +
Sbjct: 254 PIVGNGNV-HAQVTAQLDFANKEQTEEHYSP---NGDASKATLRSRQLNISEQVGAGYPG 309

Query: 322 KEYQGQGYSPWGPPGQEGNTPPEYQDLSD-------------ITGKYNESQEIKNVALNE 368
P P TPP Q + + + E N ++
Sbjct: 310 GVPGALSNQPAPPNEAPIATPPTNQQNAQNTPQTSTSTNSNSAGPRSTQRNETSNYEVDR 369

Query: 369 KKSTSEKEPARIVGVSLGIFVDGIWNFVYDEKGDFVIENGMRKREYKPMALEEIKNIEDV 428
++ I +S+ + V+ + + P+ +++K IED+
Sbjct: 370 TIRHTKMNVGDIERLSVAVVVNY---------------KTLADGKPLPLTADQMKQIEDL 414

Query: 429 LQSSFEYKPERGDSITVRNISFDRMNEFREIDENYFASER--FKYFLFIASIVFSLLILV 486
+ + + +RGD++ V N F ++ E F ++ L + L++
Sbjct: 415 TREAMGFSDKRGDTLNVVNSPFSAVDN--TGGELPFWQQQSFIDQLLAAGRWLLVLVVAW 472

Query: 487 FTIFFAISRELERRRRLREEELAKQAHLRRQQALMDG-----GDDIGVDDVVGGIREGDE 541
A+ +L RR EE A Q + +Q + D + R G E
Sbjct: 473 ILWRKAVRPQLTRR---VEEAKAAQEQAQVRQETEEAVEVRLSKDEQLQQRRANQRLGAE 529

Query: 542 LQSNA-ELLAREKPEDVAKLIRTWL 565
+ S ++ P VA +IR W+
Sbjct: 530 VMSQRIREMSDNDPRVVALVIRQWM 554


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BBUZS7_RS02815FLGHOOKFLIE862e-25 Flagellar hook-basal body complex protein FliE signa...
		>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE

signature.
Length = 103

Score = 85.9 bits (212), Expect = 2e-25
Identities = 15/71 (21%), Positives = 38/71 (53%)

Query: 40 TFKDVLINSITDVNKSQLNVSKVTEQAILKPSSIDVHDVVIAMSKANMNLSILKAVVERG 99
+F L ++ ++ +Q E+ L + ++DV+ M KA++++ + V +
Sbjct: 32 SFAGQLHAALDRISDTQTAARTQAEKFTLGEPGVALNDVMTDMQKASVSMQMGIQVRNKL 91

Query: 100 VKAYQDIINIR 110
V AYQ++++++
Sbjct: 92 VAAYQEVMSMQ 102


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BBUZS7_RS02820FLGHOOKAP1483e-09 Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 47.6 bits (113), Expect = 3e-09
Identities = 21/80 (26%), Positives = 35/80 (43%), Gaps = 15/80 (18%)

Query: 5 SSINVASTGLTAQRLRIDVISNNIANVSTSRTPDGGPYRRQRIIFAPRVNNPYWKGPFIP 64
S IN A +GL A + ++ SNNI++ + + Y RQ I A N
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVA------GYTRQTTIMAQ--ANSTLGA---- 49

Query: 65 DYLDNGIGQGVRVASIEKDK 84
+G GV V+ ++++
Sbjct: 50 ---GGWVGNGVYVSGVQREY 66



Score = 44.2 bits (104), Expect = 5e-08
Identities = 11/46 (23%), Positives = 23/46 (50%)

Query: 104 KKGYVELPNVNLVEEMVDMISASRAYEANSTVINSSKSMFRSALAI 149
+ VNL EE ++ + Y AN+ V+ ++ ++F + + I
Sbjct: 500 SNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BBUZS7_RS02830HTHFIS310.010 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 31.0 bits (70), Expect = 0.010
Identities = 13/70 (18%), Positives = 30/70 (42%), Gaps = 15/70 (21%)

Query: 20 KYIIGQDEAKKLVSIALVNRYIRSRLPKEIKDEVMPKNIIMIGSTGIGKTEIAR---RLS 76
++G+ A + + ++ R +++ L +++ G +G GK +AR
Sbjct: 137 MPLVGRSAAMQEI-YRVLARLMQTDLT-----------LMITGESGTGKELVARALHDYG 184

Query: 77 KLIKAPFIKV 86
K PF+ +
Sbjct: 185 KRRNGPFVAI 194


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BBUZS7_RS02845SYCDCHAPRONE290.012 Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD

chaperone signature.
Length = 168

Score = 28.7 bits (64), Expect = 0.012
Identities = 16/91 (17%), Positives = 34/91 (37%), Gaps = 2/91 (2%)

Query: 69 ARFFNLIGLEFFKLGQYGPAIEYFAKNLEINPNNYLSHFYIGVASYNLAKNLRVKDEVEK 128
+RFF +G +GQY AI ++ ++ F+ + + +
Sbjct: 70 SRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFL 129

Query: 129 YI-ILAENSFLKSLSIR-DDFKDSLFAISNM 157
++A+ + K LS R +++ M
Sbjct: 130 AQELIADKTEFKELSTRVSSMLEAIKLKKEM 160


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BBUZS7_RS02855SHAPEPROTEIN568e-11 Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein

signature.
Length = 347

Score = 55.9 bits (135), Expect = 8e-11
Identities = 50/226 (22%), Positives = 87/226 (38%), Gaps = 24/226 (10%)

Query: 160 TGSSSSSQNLVR-CVNRAGFAVDEVVLGSLASSYATLSKEEREMGVLFIDMGKGTTDIIL 218
G++ + +R AG ++ +A++ G + +D+G GTT++ +
Sbjct: 116 VGATQVERRAIRESAQGAGAREVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAV 175

Query: 219 YIDGSPYYTGVIPIGVNRVTLDIAQVWK------VPEDVAENIKITAGIAHPSILESQME 272
Y+ + IG +R I + + E AE IK G A+P ++E
Sbjct: 176 ISLNGVVYSSSVRIGGDRFDEAIINYVRRNYGSLIGEATAERIKHEIGSAYPGDEVREIE 235

Query: 273 TVIIPNLGTRPPQ--EKSRKELSVIINSRLREIFEMMKAEI------LKRGLYNKINGGI 324
V NL P+ + E+ + L I + + L + + G+
Sbjct: 236 -VRGRNLAEGVPRGFTLNSNEILEALQEPLTGIVSAVMVALEQCPPELASDISER---GM 291

Query: 325 VLTGGGALFPGISNLIEEVFNYPARIGL-PMSINGIGE----EHID 365
VLTGGGAL + L+ E P + P++ G E ID
Sbjct: 292 VLTGGGALLRNLDRLLMEETGIPVVVAEDPLTCVARGGGKALEMID 337


4BBUZS7_RS03355BBUZS7_RS07015Y        NNGenomic Island
LocusTagDNBiasCDNBias%GCBiasProduct
BBUZS7_RS03355211-0.163898dicarboxylate/amino acid:cation symporter
BBUZS7_RS033601120.958976proline--tRNA ligase
BBUZS7_RS033653122.023306DUF2259 domain-containing protein
BBUZS7_RS033704112.456262hypothetical protein
BBUZS7_RS033753112.395789DUF3996 domain-containing protein
BBUZS7_RS033804110.998496DUF3996 domain-containing protein
BBUZS7_RS03385390.513145mannose-6-phosphate isomerase, class I
BBUZS7_RS07015312-0.237826PTS transporter subunit EIIA
5BBUZS7_RS04535BBUZS7_RS04600Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
BBUZS7_RS04535229-2.899131PTS transporter subunit EIIA
BBUZS7_RS04540229-3.6849621-phosphofructokinase
BBUZS7_RS04545226-3.735708hypothetical protein
BBUZS7_RS04555222-3.339057*exodeoxyribonuclease V subunit alpha
BBUZS7_RS04560216-2.537130exodeoxyribonuclease V subunit beta
BBUZS7_RS04565114-1.572590exodeoxyribonuclease V subunit gamma
BBUZS7_RS045701100.187128nicotinate phosphoribosyltransferase
BBUZS7_RS04575290.103030glucose-6-phosphate dehydrogenase
BBUZS7_RS045802130.643448Na+/H+ antiporter NhaC family protein
BBUZS7_RS04585317-0.142035Na+/H+ antiporter NhaC family protein
BBUZS7_RS04590322-0.373073ABC transporter substrate-binding protein
BBUZS7_RS045951141.658717ABC transporter permease
BBUZS7_RS046002132.123009ABC transporter permease
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BBUZS7_RS04555MYCMG045300.020 Hypothetical mycoplasma lipoprotein (MG045) signature.
		>MYCMG045#Hypothetical mycoplasma lipoprotein (MG045) signature.

Length = 483

Score = 30.4 bits (68), Expect = 0.020
Identities = 30/113 (26%), Positives = 58/113 (51%), Gaps = 12/113 (10%)

Query: 76 LLAKDIQNTIIFTKDNLEKTNKSYNKLIKILKGLETFGNLETIKNIVLLLK--KNNILME 133
L+ +D+ + I +++ NL+K++ S +K+ + F +++IK I K KNN L+
Sbjct: 84 LIERDLLSPIDWSQFNLKKSSSSSDKVNNASDAKDLF--IDSIKEISQQTKDSKNNELLH 141

Query: 134 FNKLKITTPLILENNIYIYTQKNYREEEE---LIKQIIKRLENHKSELNDNKI 183
+ P L+N +++Y + E E+ +IK + HK NDN++
Sbjct: 142 W-----AVPYFLQNLVFVYRGEKISELEQENVSWTDVIKAIVKHKDRFNDNRL 189


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BBUZS7_RS04590MYCMG045346e-04 Hypothetical mycoplasma lipoprotein (MG045) signature.
		>MYCMG045#Hypothetical mycoplasma lipoprotein (MG045) signature.

Length = 483

Score = 34.3 bits (78), Expect = 6e-04
Identities = 25/120 (20%), Positives = 59/120 (49%), Gaps = 4/120 (3%)

Query: 1 MKKIFILIVILTTFACTNKDTITLNVFNWAEYIDETLLDQFEKENNIKINYEIFHNNEEM 60
+K F + + + ++ + T + N+ YI LL++ ++++ + + + +NE++
Sbjct: 5 LKYCFFSLFVSLSSILSSCGSTTFVLANFESYISPLLLERVQEKH--PLTFLTYPSNEKL 62

Query: 61 MAKFNNTKNYYDIIVPSEYLIQELIDEGKIEKLDYSKLPNVTKNITQNLTNLEHDPGNLY 120
+ F N N Y + V S Y + ELI+ + +D+S+ + + + N D +L+
Sbjct: 63 INGFAN--NTYSVAVASTYAVSELIERDLLSPIDWSQFNLKKSSSSSDKVNNASDAKDLF 120


6BBUZS7_RS04865BBUZS7_RS04955Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
BBUZS7_RS048653202.728987signal recognition particle protein
BBUZS7_RS048701201.32093830S ribosomal protein S16
BBUZS7_RS048751210.566348KH domain-containing protein
BBUZS7_RS048802210.00958816S rRNA processing protein RimM
BBUZS7_RS048852220.300401tRNA (guanosine(37)-N1)-methyltransferase TrmD
BBUZS7_RS048900260.13479250S ribosomal protein L19
BBUZS7_RS04895-215-2.580238hypothetical protein
BBUZS7_RS04900-110-2.486331pantetheine-phosphate adenylyltransferase
BBUZS7_RS04905-110-3.01476350S ribosomal protein L32
BBUZS7_RS04910-19-2.886474acyl carrier protein
BBUZS7_RS04915-210-2.729892ribonuclease III
BBUZS7_RS04920010-1.787156CCA tRNA nucleotidyltransferase
BBUZS7_RS04925216-0.740782hypothetical protein
BBUZS7_RS04930314-0.621070hypothetical protein
BBUZS7_RS049452140.824439**endolytic transglycosylase MltG
BBUZS7_RS049503140.990876DNA primase
BBUZS7_RS049553140.946452RNA polymerase sigma factor RpoD
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BBUZS7_RS04900LPSBIOSNTHSS1984e-68 Lipopolysaccharide core biosynthesis protein signat...
		>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein

signature.
Length = 166

Score = 198 bits (505), Expect = 4e-68
Identities = 57/157 (36%), Positives = 91/157 (57%), Gaps = 3/157 (1%)

Query: 4 AVFPGSFDPITWGHIDLIKRSLAIFDKVIVLVAKNKSKKYFLSDIERFSLTKDVISSLNF 63
A++PGSFDPIT+GH+D+I+R +FD+V V V +N +K+ S ER I+ L
Sbjct: 3 AIYPGSFDPITFGHLDIIERGCRLFDQVYVAVLRNPNKQPMFSVQERLEQIAKAIAHL-- 60

Query: 64 SNVLVDRYSGFIVDYALINSIKFIVRGIRAFNDFDIEFERYLVNNKLNFEIDTIFLPSSA 123
N VD + G V+YA I+RG+R +DF++E + N L +++T+FL +S
Sbjct: 61 PNAQVDSFEGLTVNYARQRQAGAILRGLRVLSDFELELQMANTNKTLASDLETVFLTTST 120

Query: 124 EHLYVRSDFVKELMLKKDVDLSNFVPELVFNRLKSKF 160
E+ ++ S VKE+ + ++ +FVP V L +F
Sbjct: 121 EYSFLSSSLVKEVA-RFGGNVEHFVPSHVAAALYDQF 156


7BBUZS7_RS05405BBUZS7_RS05500Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
BBUZS7_RS054052174.080350transcription termination/antitermination
BBUZS7_RS054100152.869129translation initiation factor IF-2
BBUZS7_RS05415-1150.14037530S ribosome-binding factor RbfA
BBUZS7_RS05420-115-0.016237tRNA pseudouridine(55) synthase TruB
BBUZS7_RS054250150.51605030S ribosomal protein S15
BBUZS7_RS054300140.137054polyribonucleotide nucleotidyltransferase
BBUZS7_RS05435114-1.590877hypothetical protein
BBUZS7_RS05440211-1.185323YjgP/YjgQ family permease
BBUZS7_RS05445212-0.480500YjgP/YjgQ family permease
BBUZS7_RS05450312-0.867593tRNA guanosine(34) transglycosylase Tgt
BBUZS7_RS05455312-1.588602murein biosynthesis integral membrane protein
BBUZS7_RS05460210-1.372080HEAT repeat domain-containing protein
BBUZS7_RS05465311-1.987991bifunctional phosphopantothenoylcysteine
BBUZS7_RS07235115-1.828702DUF997 family protein
BBUZS7_RS05470014-1.563177sodium/pantothenate symporter
BBUZS7_RS05475014-1.069529RluA family pseudouridine synthase
BBUZS7_RS05480-113-1.038948hypothetical protein
BBUZS7_RS05485-113-1.176760UDP-N-acetylmuramate--L-alanine ligase
BBUZS7_RS05490014-0.363189YicC family protein
BBUZS7_RS05495-112-0.157914AAA family ATPase
BBUZS7_RS05500314-2.052474DNA-directed RNA polymerase subunit omega
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BBUZS7_RS05410TCRTETOQM764e-16 Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family

signature.
Length = 639

Score = 75.7 bits (186), Expect = 4e-16
Identities = 41/144 (28%), Positives = 60/144 (41%), Gaps = 22/144 (15%)

Query: 385 ITIMGHVDHGKTKLLSVL------------------QNIDINQTESGGITQHIGAYTIVY 426
I ++ HVD GKT L L + + GIT G + +
Sbjct: 6 IGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITSFQW 65

Query: 427 NDREITFLDTPGHEAFTMMRSRGAQVTDIIVLVVSAIDGVMPQTIEAINHAKEANVPIIV 486
+ ++ +DTPGH F R V D +L++SA DGV QT + ++ +P I
Sbjct: 66 ENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGIPTIF 125

Query: 487 AINKIDLPDSNPDK----IKHQLS 506
INKID + IK +LS
Sbjct: 126 FINKIDQNGIDLSTVYQDIKEKLS 149


8BBUZS7_RS02215BBUZS7_RS02245N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
BBUZS7_RS022153102.454035MoxR family ATPase
BBUZS7_RS02220392.14626016S rRNA (guanine(527)-N(7))-methyltransferase
BBUZS7_RS022253122.576984tRNA uridine-5-carboxymethylaminomethyl(34)
BBUZS7_RS022304191.341819tRNA uridine-5-carboxymethylaminomethyl(34)
BBUZS7_RS022353261.110379FlbF protein
BBUZS7_RS022402221.423762flagellar hook-associated protein FlgK
BBUZS7_RS022450150.497742flagellar hook-associated protein 3
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BBUZS7_RS02215HTHFIS391e-05 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 39.4 bits (92), Expect = 1e-05
Identities = 36/143 (25%), Positives = 55/143 (38%), Gaps = 15/143 (10%)

Query: 33 KEMIDAILMGLLTDGHVLLEGVPGLAKTL---AIQTVSDVLDLEFKRIQ---FTPDLLPS 86
+E+ + + TD +++ G G K L A+ + F I DL+ S
Sbjct: 147 QEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIES 206

Query: 87 DLTGNMVYKSA-TGTFKVRKGPVFS----NVILADEINRAPAKVQSALLEAMGERQVT-L 140
+L G K A TG G F + DEI P Q+ LL + + + T +
Sbjct: 207 ELFG--HEKGAFTGAQTRSTG-RFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTV 263

Query: 141 GDETHKLPDPFFVLATQNPIEQE 163
G T D V AT ++Q
Sbjct: 264 GGRTPIRSDVRIVAATNKDLKQS 286


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BBUZS7_RS02230TCRTETOQM340.002 Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family

signature.
Length = 639

Score = 33.7 bits (77), Expect = 0.002
Identities = 34/112 (30%), Positives = 48/112 (42%), Gaps = 14/112 (12%)

Query: 229 GSVNAGKSSLFNLFLKKDRSIVSSYPGTTRDYIEASFELDGILFNLFDTAGLRDADNFVE 288
GSV+ G + N L++ R I T + I SF+ + N+ DT G D V
Sbjct: 34 GSVDKGTTRTDNTLLERQRGI------TIQTGI-TSFQWENTKVNIIDTPGHMDFLAEVY 86

Query: 289 RLGIEKSNSLIKEASLVIYVIDVSSNLTKDDFLFIDSNKSNSKILFVLNKID 340
R S S++ A L+I D T+ LF K +F +NKID
Sbjct: 87 R-----SLSVLDGAILLISAKDGVQAQTR--ILFHALRKMGIPTIFFINKID 131


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BBUZS7_RS02240FLGHOOKAP15190.0 Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 519 bits (1338), Expect = 0.0
Identities = 137/633 (21%), Positives = 259/633 (40%), Gaps = 105/633 (16%)

Query: 6 SGIEIGKRSLFAHKDAMNTVGHNLSNATKPGYSRQRVTMKTEIPLYAPQLNRAKKQGQLG 65
S I L A + A+NT +N+S+ GY+RQ M A + G +G
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIM-------AQANSTLGAGGWVG 54

Query: 66 QGIVVQSIDRVKDELLNTRIIEESHRLGYWTSQDKFISILEDVYNEPEDQSIRKRLNDFW 125
G+ V + R D + ++ + T++ + +S ++++ + S+ ++ DF+
Sbjct: 55 NGVYVSGVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLST-STSSLATQMQDFF 113

Query: 126 ESWHDLANQPQGLAERKIILERGKSFCEGIRNRFHSLERIYIMANDEIKI----TTDEAN 181
S L + + A R+ ++ + EG+ N+F + ++ + ++ I + D+ N
Sbjct: 114 TSLQTLVSNAEDPAARQALIGKS----EGLVNQFKTTDQYLRDQDKQVNIAIGASVDQIN 169

Query: 182 NYIRNIANLNKQISKSQAMK--DNPNDLMDARDLMVEKLGNIISVSIENKQDPNEFLIHA 239
NY + IA+LN QIS+ + +PN+L+D RD +V +L I+ V + + + A
Sbjct: 170 NYAKQIASLNDQISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMA 229

Query: 240 EGRHLVQGSIANEF-KLEATNGPTRTRWNIL--WTNN---DKAYLKTGKLGSLLNIRDEE 293
G LVQGS A + + ++ P+RT + N + L TG LG +L R ++
Sbjct: 230 NGYSLVQGSTARQLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQD 289

Query: 294 IKNEINELNNIAANIIEIVNEIHEAGRGMDKKNGRSFFSQELKLTDDRGRYDTNGNGQFD 353
+ N L +A E N H+AG + G FF+ + N + D
Sbjct: 290 LDQTRNTLGQLALAFAEAFNTQHKAGFDANGDAGEDFFAIG------KPAVLQNTKNKGD 343

Query: 354 -SVHIFKINSTNEIFPEEKLGFYGTLKFEATNSNEIVEIPYNAPDTVQDVINRINNSNAQ 412
++ +++ + + K+ F +++ T A +T V N A
Sbjct: 344 VAIGATVTDASAVLATDYKISFDNN-QWQVTRL---------ASNTTFTVTPDANGKVAF 393

Query: 413 VTARINSEGKLEIKAVKEQEDENITFKIKHIEDSGSFLTKYTGILNASGPEGAYDYKNID 472
+ G AV + +F +K + D I+N ++
Sbjct: 394 DGLELTFTGTP---AVND------SFTLKPVSD---------AIVNM----------DVL 425

Query: 473 TTD--KLAPKSTYSISPLKNPAAWIKVADIIDSDPSKIASGIKNPTNEISIGDNQAALRI 530
TD K+A S + A DSD + + +N ++G ++
Sbjct: 426 ITDEAKIAMAS-------EEDAG--------DSDNRNGQALLDLQSNSKTVGGAKSF--N 468

Query: 531 SSFGNSQIMIGKNLTLNDYFANTASNIAIKGQISEITKESQSQILKDLTDLRMSISGVNK 590
++ + IG + T N+ +++++ + Q SISGVN
Sbjct: 469 DAYASLVSDIGNKTATLKTSSATQGNV-----VTQLSNQQQ------------SISGVNL 511

Query: 591 DEELANMIEFQQAFIAASKFITVSVELIDTVIN 623
DEE N+ FQQ ++A ++ + + + D +IN
Sbjct: 512 DEEYGNLQRFQQYYLANAQVLQTANAIFDALIN 544


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BBUZS7_RS02245FLAGELLIN578e-11 Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 56.6 bits (136), Expect = 8e-11
Identities = 37/133 (27%), Positives = 59/133 (44%), Gaps = 5/133 (3%)

Query: 10 TYENFKTSSAEQESKITKLLENLYKGGKRIVKLRNDPTGVTHAIRLDNDIFKLNVYIKNI 69
T N S + S I +L G RI ++D G A R ++I L +N
Sbjct: 13 TQNNLNKSQSSLSSAIERL-----SSGLRINSAKDDAAGQAIANRFTSNIKGLTQASRNA 67

Query: 70 DTSKSNLRYTEGYLRSLTNILTRAKEIAIQGASGTYESDDKKMISKEVNALLEDVVAIAN 129
+ S + TEG L + N L R +E+++Q +GT D K I E+ LE++ ++N
Sbjct: 68 NDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEIDRVSN 127

Query: 130 AKGPDGYSIFSGT 142
+G + S
Sbjct: 128 QTQFNGVKVLSQD 140



Score = 30.8 bits (69), Expect = 0.012
Identities = 33/358 (9%), Positives = 87/358 (24%), Gaps = 12/358 (3%)

Query: 61 KLNVYIKNIDTSKSNLRYTEGYLRSLTNILTRAKEIAIQGASGTYESDDKKMISKEVNAL 120
+ + ++ ID L + TY K +
Sbjct: 154 TITIDLQKIDVKSLGLDGFNVNGPKEATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGA 213

Query: 121 LEDVVAIANAKGPDGYSIFSGTKIDSEAFKVTRENKISKTSKDGAGPQIIKVEYNGNQAE 180
+ + +G +A T + T + + +
Sbjct: 214 VVTDTTAPTVPDKVYVNAANGQLTTDDAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGK 273

Query: 181 KKTEVYNDIHMSNNYPGNVIFFLQNQNIISSINTNGFAVKENTKIYIDNIEIGLTAGDTA 240
+ + + +S+ I + ++
Sbjct: 274 EGDTFDYK---GVTFTIDTKTGNDGNGKVSTTINGEKVTLTVADITAGAANVDAATLQSS 330

Query: 241 LDIVAKINESSAPVEASIDPVLNSLSIKTTTPHQIWITEEKESNVLQTLGILTKNNDTKL 300
++ + + LS + + + K+
Sbjct: 331 KNVYTSVVNGQFTFDDKTKNESAKLSDLEANN----AVKGESKITVNGAEYTANAAGDKV 386

Query: 301 PPYNLSSSTEVRSRSIFDALIELRDTLYNNKEELVGSRSLAEIDESLKRLLISVADLGAK 360
+ + + + + E + + S ID +L ++ + LGA
Sbjct: 387 TLAGKTMFIDKTASGVSTLINEDAAAAKKSTANPLAS-----IDSALSKVDAVRSSLGAI 441

Query: 361 ENRLDRSYERISKEAADMKEDMIQYTDLDVTKAITNLNMASLAYQVSIGISAKIMQTT 418
+NR D + + ++ + D D ++N++ A + Q + A+ Q
Sbjct: 442 QNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSNMSKAQILQQAGTSVLAQANQVP 499


9BBUZS7_RS02705BBUZS7_RS02855N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
BBUZS7_RS02705-1181.326392flagellar biosynthesis protein FlhF
BBUZS7_RS02710-1170.987367flagellar biosynthesis protein FlhA
BBUZS7_RS02715019-0.127520flagellar biosynthesis protein FlhB
BBUZS7_RS02720-1170.580219flagellar biosynthetic protein FliR
BBUZS7_RS027250171.696137flagellar biosynthesis protein FliQ
BBUZS7_RS027300162.269566flagellar type III secretion system pore protein
BBUZS7_RS027351173.126565flagellar biosynthesis protein FliZ
BBUZS7_RS027401184.130941flagellar motor switch protein FliN
BBUZS7_RS027451174.930465flagellar motor switch protein FliM
BBUZS7_RS027500194.530761flagellar basal body-associated protein FliL
BBUZS7_RS02755-2202.587759flagellar motor protein MotB
BBUZS7_RS02760-2201.994362motility protein A
BBUZS7_RS02765-2160.460954flagellar protein FlbD
BBUZS7_RS02770-2151.231390flagellar hook protein FlgE
BBUZS7_RS02775116-0.264266flagellar hook assembly protein FlgD
BBUZS7_RS027805160.740001flagellar hook-length control protein FliK
BBUZS7_RS027856173.013096flagellar protein
BBUZS7_RS027905183.269873flagellar protein FlbA
BBUZS7_RS027954194.263700flagellar protein export ATPase FliI
BBUZS7_RS028006193.992607flagellar assembly protein FliH
BBUZS7_RS028054193.599347flagellar motor switch protein FliG
BBUZS7_RS028102183.486817flagellar basal body M-ring protein FliF
BBUZS7_RS028151192.204966flagellar hook-basal body complex protein FliE
BBUZS7_RS028201181.529780flagellar basal body rod protein FlgC
BBUZS7_RS028251192.794980flagellar basal body rod protein FlgB
BBUZS7_RS028300183.837032HslU--HslV peptidase ATPase subunit
BBUZS7_RS02835-1193.278012ATP-dependent protease subunit HslV
BBUZS7_RS02840-2172.479007DNA-protecting protein DprA
BBUZS7_RS02845-1162.566078hypothetical protein
BBUZS7_RS02850-1152.485274cell division protein FtsZ
BBUZS7_RS028550130.737662cell division protein FtsA
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BBUZS7_RS02705PF05272310.011 Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.8 bits (69), Expect = 0.011
Identities = 8/23 (34%), Positives = 12/23 (52%)

Query: 176 VFILVGPTGVGKTTTIAKLAAIY 198
+L G G+GK+T I L +
Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLD 620


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BBUZS7_RS02715TYPE3IMSPROT339e-117 Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein

family signature.
Length = 354

Score = 339 bits (871), Expect = e-117
Identities = 101/345 (29%), Positives = 182/345 (52%), Gaps = 9/345 (2%)

Query: 25 RTELPTDQKKQKAREEGRVLKSTEINTAVSLLLLFALFFFMLSYFA---LDLIAVFKEQA 81
+TE PT +K + AR++G+V KS E+ + ++ L A+ + Y+ L+ + EQ+
Sbjct: 5 KTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIPAEQS 64

Query: 82 IKLPEVMRMSVYTMGFAYIRSIMGYVVLFFFASLAVNFFVNIIQVGFFITFKSLEPRWDK 141
LP +S + + +L A +A+ +++Q GF I+ ++++P K
Sbjct: 65 Y-LPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIA--SHVVQYGFLISGEAIKPDIKK 121

Query: 142 ISFNFSRWAKNSFFSAGAFFNLFKSLLKVVIICLIYYFIIENNIGKISKLSEYTLQSGIS 201
I N AK FS + KS+LKVV++ ++ + II+ N+ + +L ++
Sbjct: 122 I--NPIEGAKR-IFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITP 178

Query: 202 IVLVIAYKICFFSVMFLAIVGVFDYLFQRSQYIESLKMTKEEVKQERKEMEGDPLLRSRI 261
++ I ++ + ++ + DY F+ QYI+ LKM+K+E+K+E KEMEG P ++S+
Sbjct: 179 LLGQILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKR 238

Query: 262 KERMRVILSTNLRVAIPQADVVITNPEHFAVAIKWDSETMLAPKVLAKGQDEIALTIKKI 321
++ + I S N+R + ++ VV+ NP H A+ I + P V K D T++KI
Sbjct: 239 RQFHQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKI 298

Query: 322 ARENNVPLMENKLLARALYANVKVNEEIPREYWEIVSKILVRVYS 366
A E VP+++ LARALY + V+ IP E E +++L +
Sbjct: 299 AEEEGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLER 343


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BBUZS7_RS02720TYPE3IMRPROT1132e-32 Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein

family signature.
Length = 261

Score = 113 bits (285), Expect = 2e-32
Identities = 46/242 (19%), Positives = 107/242 (44%), Gaps = 4/242 (1%)

Query: 16 VLVRIFMFLKFSPFFSTIKI-GYFNFFFSLILSVIVVEKIKIIYPLDNMLSFALILLGEA 74
L+R+ + +P S + +++++ + + + + +
Sbjct: 19 PLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPANDVPVFSFFALWLAVQQI 78

Query: 75 ILGLIQAFFVNIIFNVFHLVGFFFSNQIGLAYANIFDVFSEEDSMIISQIFAYLFLLLFL 134
++G+ F + F G Q+GL++A D S + ++++I L LLLFL
Sbjct: 79 LIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHLNMPVLARIMDMLALLLFL 138

Query: 135 SSDFLLRFFVIGIHDSVLNIRVEHLVNMRNSGFVKLLLMSFGFLFEKALLISFPILSLLL 194
+ + L + + + D+ + + NS L + +F L+++ P+++LLL
Sbjct: 139 TFNGHL-WLISLLVDTFHTLPI--GGEPLNSNAFLALTKAGSLIFLNGLMLALPLITLLL 195

Query: 195 LFYLVLGILSKSSPQINLLIISFSTSLFLGLLILYIGFPSLAISSKRVIELSLDSLASFL 254
L LG+L++ +PQ+++ +I F +L +G+ ++ P +A + + + LA +
Sbjct: 196 TLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFCEHLFSEIFNLLADII 255

Query: 255 KL 256

Sbjct: 256 SE 257


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BBUZS7_RS02725TYPE3IMQPROT612e-16 Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein

family signature.
Length = 86

Score = 61.3 bits (149), Expect = 2e-16
Identities = 21/76 (27%), Positives = 43/76 (56%)

Query: 6 ILYLIRISIENIIILSAPMLIIALIVGLLISIFQAITSIQDQTLSFIPKIIVILLVIVIF 65
+++ ++ ++ILS I+A I+GLL+ +FQ +T +Q+QTL F K++ + L + +
Sbjct: 4 LVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLFLL 63

Query: 66 GPWILNKLMQFTYMIF 81
W L+ + +
Sbjct: 64 SGWYGEVLLSYGRQVI 79


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BBUZS7_RS02730FLGBIOSNFLIP2603e-90 Escherichia coli: Flagellar biosynthetic protein Fl...
		>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP

signature.
Length = 245

Score = 260 bits (667), Expect = 3e-90
Identities = 97/213 (45%), Positives = 138/213 (64%), Gaps = 3/213 (1%)

Query: 41 GGSEIAFSLQLLILLTIITLSPAFLVLMTSFLRISIVLDFIRRALSLQQSPPTQIVMGLA 100
GG + +Q L+ +T +T PA L++MTSF RI IV +R AL +PP Q+++GLA
Sbjct: 34 GGQSWSLPVQTLVFITSLTFIPAILLMMTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLA 93

Query: 101 LFLTIFTMWPTFNSIYEQAYLPLKESKINFNEFYNKGIAPLRIFMYKQMSDGRHEEIRLF 160
LFLT F M P + IY AY P E KI+ E KG PLR FM +Q R ++ LF
Sbjct: 94 LFLTFFIMSPVIDKIYVDAYQPFSEEKISMQEALEKGAQPLREFMLRQT---READLGLF 150

Query: 161 MSMSNYDRPKNFSEVPTHVLIAAFILHELKVAFKMGILIFLPFIVLDIIVASVLMAMGMI 220
++N + VP +L+ A++ ELK AF++G IF+PF+++D+++ASVLMA+GM+
Sbjct: 151 ARLANTGPLQGPEAVPMRILLPAYVTSELKTAFQIGFTIFIPFLIIDLVIASVLMALGMM 210

Query: 221 MLPPVMISLPFKLILFVMVDGWTLITSGLIKSF 253
M+PP I+LPFKL+LFV+VDGW L+ L +SF
Sbjct: 211 MVPPATIALPFKLMLFVLVDGWQLLVGSLAQSF 243


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BBUZS7_RS02740FLGMOTORFLIN1037e-32 Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 103 bits (258), Expect = 7e-32
Identities = 40/77 (51%), Positives = 62/77 (80%)

Query: 35 NFGLLMDVSMQLTVELGRTERKIKDILGMSEGTIITLDKLAGEPVDILVNGKIVAKGEVV 94
+ L+MD+ ++LTVELGRT IK++L +++G+++ LD LAGEP+DIL+NG ++A+GEVV
Sbjct: 53 DIDLIMDIPVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVV 112

Query: 95 VIDENFGVRITEIIKTK 111
V+ + +GVRIT+II
Sbjct: 113 VVADKYGVRITDIITPS 129


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BBUZS7_RS02745FLGMOTORFLIM458e-165 Flagellar motor switch protein FliM signature.
		>FLGMOTORFLIM#Flagellar motor switch protein FliM signature.

Length = 344

Score = 458 bits (1181), Expect = e-165
Identities = 195/345 (56%), Positives = 259/345 (75%), Gaps = 9/345 (2%)

Query: 8 LSQDDIDSLLESINSSESLSLDESLSNVISSPTGKKQKVKVYDFKRPDKFSKEQVRTVSS 67
LSQD+ID LL +I+S ++ D + P +K+ +YDF+RPDKFSKEQ+RT+S
Sbjct: 5 LSQDEIDQLLTAISSGDASIED-------ARPISDTRKITLYDFRRPDKFSKEQMRTLSL 57

Query: 68 FHEAFARYTTTSLSALLRKMVHVHVASVDQLTYEEFIRSIPNPTTLAIINMDPLKGSAIF 127
HE FAR TTTSLSA LR MVHVHVASVDQLTYEEFIRSIP P+TLA+I MDPLKG+A+
Sbjct: 58 MHETFARLTTTSLSAQLRSMVHVHVASVDQLTYEEFIRSIPTPSTLAVITMDPLKGNAVL 117

Query: 128 EVDPTIAFAIVDRLFGGDGDTIKDKSRDLTEIEQSVMESVIIRILANMREAWSQVVDLRP 187
EVDP+I F+I+DRLFGG G K + RDLT+IE SVME VI+RILAN+RE+W+QV+DLRP
Sbjct: 118 EVDPSITFSIIDRLFGGTGQAAKVQ-RDLTDIENSVMEGVIVRILANVRESWTQVIDLRP 176

Query: 188 RFGHIEVNPQFAQIVPPTEMIILVTLEVKIGKVEGLMNFCLPYITIEPIVSKLSTRYWHS 247
R G IE NPQFAQIVPP+EM++LVTLE K+G+ EG+MNFC+PYITIEPI+SKLS+++W S
Sbjct: 177 RLGQIETNPQFAQIVPPSEMVVLVTLETKVGEEEGMMNFCIPYITIEPIISKLSSQFWFS 236

Query: 248 LIGVGTTSENLDALREKLENTAMPLVAEIGEVKLKVREILSLDKGDVLNLESSLINKDLT 307
+ +T++ + LR+KL M +VAE+G ++L VR+IL L GD++ L + +
Sbjct: 237 SVRRSSTTQYMGVLRDKLSTVDMDVVAEVGSLRLSVRDILGLRVGDIIRLHDTHVGDPFV 296

Query: 308 LKVGTKEKFKCRMGLMGNKVSVQITEKIGDIKGFDLLKELTEEVE 352
L +G ++KF C+ G++G K++ QI E+I D +EL+ + E
Sbjct: 297 LSIGNRKKFLCQPGVVGKKIAAQILERIESTSQED-FEELSADEE 340


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BBUZS7_RS02755OMPADOMAIN558e-11 OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 54.6 bits (131), Expect = 8e-11
Identities = 28/132 (21%), Positives = 57/132 (43%), Gaps = 21/132 (15%)

Query: 124 ISLAADAFFDSASADVKLEENRDSIQKIASFIGFLSPRGYNFKIEGHTDNIDTDVNGPWK 183
+L +D F+ A +K E + ++ ++ S + L P+ + + G+TD I +D
Sbjct: 215 FTLKSDVLFNFNKATLK-PEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSD-----A 268

Query: 184 SNWELSAARSVNMLEHILNYLDQSDVKRIENNFEVSGFGGSRPIATDDT---------PE 234
N LS R+ + +++YL + + G G S P+ + +
Sbjct: 269 YNQGLSERRA----QSVVDYLISKGIPA--DKISARGMGESNPVTGNTCDNVKQRAALID 322

Query: 235 GRAYNRRIDILI 246
A +RR++I +
Sbjct: 323 CLAPDRRVEIEV 334


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BBUZS7_RS02770FLGHOOKAP1514e-09 Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 51.5 bits (123), Expect = 4e-09
Identities = 18/89 (20%), Positives = 33/89 (37%), Gaps = 13/89 (14%)

Query: 4 SLYSGVSGLQNHQTRMDVVGNNIANVNTIGFKKGRVNFQDMISQSISGASRPTDARGGTN 63
+ + +SGL Q ++ NNI++ N G+ + I + T GG
Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTT---------IMAQANSTLGAGGW- 52

Query: 64 PKQVGLGMNVASIDTIHTQGAFQSTQKAS 92
VG G+ V+ + + + A
Sbjct: 53 ---VGNGVYVSGVQREYDAFITNQLRAAQ 78



Score = 40.7 bits (95), Expect = 1e-05
Identities = 9/48 (18%), Positives = 28/48 (58%)

Query: 394 IRSGVLEMANVDLAEQFTDMIVTQRGFQANAKTITTSDQLLQELVRLK 441
+ + ++ V+L E++ ++ Q+ + ANA+ + T++ + L+ ++
Sbjct: 499 LSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR 546


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BBUZS7_RS02780FLGHOOKFLIK392e-05 Flagellar hook-length control protein signature.
		>FLGHOOKFLIK#Flagellar hook-length control protein signature.

Length = 375

Score = 39.4 bits (91), Expect = 2e-05
Identities = 24/82 (29%), Positives = 45/82 (54%), Gaps = 5/82 (6%)

Query: 281 KLVLKPKELGSIRINLNLDSNNNLLGKIVVDNQNVKMLFDQNMHSLNKMLGESGFNASLN 340
+L L P++LG ++I+L +D N + ++V +Q+V+ + + L L ESG +
Sbjct: 260 ELRLHPQDLGEVQISLKVDDNQAQI-QMVSPHQHVRAALEAALPVLRTQLAESGIQLGQS 318

Query: 341 LFLAGENLNSFTGDFKDDSKDQ 362
++GE SF+G + S+ Q
Sbjct: 319 -NISGE---SFSGQQQAASQQQ 336


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BBUZS7_RS02785TYPE4SSCAGX290.019 Type IV secretion system CagX conjugation protein si...
		>TYPE4SSCAGX#Type IV secretion system CagX conjugation protein

signature.
Length = 522

Score = 28.6 bits (63), Expect = 0.019
Identities = 34/143 (23%), Positives = 66/143 (46%), Gaps = 20/143 (13%)

Query: 39 RYFPEFVRTKLLGETSLVFDHNSNIILDEAR--LVKEREAIDIKNQQIEKLKEDLKLKED 96
R + EF++TK L+ D L+E + L KE+EA + + Q+ +K K + + +E
Sbjct: 120 RDYQEFLKTK-----KLIVDAPDPKELEEQKKALEKEKEAKE-QAQKAQKDKREKRKEER 173

Query: 97 SLNKLEFELKQKQKDLDLKQKVIDDIINKYNDEEANILQTAVYLMNMPPEDAVKRLEDLN 156
+ N+ E + + + N N L + D ++RLED+
Sbjct: 174 AKNRANLE------------NLTNAMSNPQNLSNNKNLSELIKQQRENELDQMERLEDMQ 221

Query: 157 PELAISYMRKIEELSKKEGRLSI 179
+ + +++IEEL+KK+ ++
Sbjct: 222 EQAQANALKQIEELNKKQAEEAV 244


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BBUZS7_RS02800FLGFLIH473e-08 Flagellar assembly protein FliH signature.
		>FLGFLIH#Flagellar assembly protein FliH signature.

Length = 228

Score = 46.7 bits (110), Expect = 3e-08
Identities = 43/197 (21%), Positives = 96/197 (48%), Gaps = 22/197 (11%)

Query: 115 KESIETESNAEIER-LAREYEEKLKTDLDIAIAKGREEGYSKGY--------ESGFEDFD 165
+E+I E+ +E+ LA+ + + IA+GR++G+ +GY E G +
Sbjct: 29 EETIIEEAEPSLEQQLAQLQMQAHEQGYQAGIAEGRQQGHKQGYQEGLAQGLEQGLAEAK 88

Query: 166 KVMRKLHAIIASLIAERKGILESSSGQIVSLVMQIAIKVIKRITDSQKDI----VLENVN 221
+HA + L++E + L++ I S +MQ+A++ +++ + +++ +
Sbjct: 89 SQQAPIHARMQQLVSEFQTTLDALDSVIASRLMQMALEAARQVIGQTPTVDNSALIKQIQ 148

Query: 222 EVLKR---VKDKTQITIRVNLDDLDIVRHKKSDFISRFDIIENLEIIEDPNIGKGGCIIE 278
++L++ K Q +RV+ DDL V +S + + DP + GGC +
Sbjct: 149 QLLQQEPLFSGKPQ--LRVHPDDLQRVDDMLGATLS----LHGWRLRGDPTLHPGGCKVS 202

Query: 279 TNFGEIDARISSQLDKI 295
+ G++DA ++++ ++
Sbjct: 203 ADEGDLDASVATRWQEL 219


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BBUZS7_RS02805FLGMOTORFLIG427e-153 Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 427 bits (1100), Expect = e-153
Identities = 344/344 (100%), Positives = 344/344 (100%)

Query: 1 MEEKKEKEILDVSALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITS 60
MEEKKEKEILDVSALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITS
Sbjct: 1 MEEKKEKEILDVSALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITS 60

Query: 61 ELKDNVLLEFKELMMAQEFIQKGGIDYARELLEKSLGTQKAVDIINNLGSALQSRPFEFV 120
ELKDNVLLEFKELMMAQEFIQKGGIDYARELLEKSLGTQKAVDIINNLGSALQSRPFEFV
Sbjct: 61 ELKDNVLLEFKELMMAQEFIQKGGIDYARELLEKSLGTQKAVDIINNLGSALQSRPFEFV 120

Query: 121 RRADPANILNFIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPE 180
RRADPANILNFIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPE
Sbjct: 121 RRADPANILNFIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPE 180

Query: 181 VVREVERVLEKKLASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEI 240
VVREVERVLEKKLASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEI
Sbjct: 181 VVREVERVLEKKLASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEI 240

Query: 241 KKKMFVFEDIVLLDDRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKED 300
KKKMFVFEDIVLLDDRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKED
Sbjct: 241 KKKMFVFEDIVLLDDRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKED 300

Query: 301 MEFLGPTRRKDVEESQQKIVSLIRKLEEQGEIVISRGGEEDVLV 344
MEFLGPTRRKDVEESQQKIVSLIRKLEEQGEIVISRGGEEDVLV
Sbjct: 301 MEFLGPTRRKDVEESQQKIVSLIRKLEEQGEIVISRGGEEDVLV 344


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BBUZS7_RS02810FLGMRINGFLIF1621e-45 Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 162 bits (412), Expect = 1e-45
Identities = 112/565 (19%), Positives = 222/565 (39%), Gaps = 55/565 (9%)

Query: 23 QKIALGLIIFFVILALVFLIGFSTKSQSIALF-GVEIKDQYLLDRISQRLDRENVKYFLS 81
+I L + + +V ++ ++ LF + +D I +L + N+ Y +
Sbjct: 23 PRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQD---GGAIVAQLTQMNIPYRFA 79

Query: 82 SDGRIYLDDEKLAKKMRAILVREELVPVHMDPWALFDIDRWTITDFERSINLRRSITRAV 141
+ ++R L ++ L + L D +++ I+ F +N +R++ +
Sbjct: 80 NGSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQFSEQVNYQRALEGEL 139

Query: 142 EQHIVALDDVDAVSVNLVMPEKALFKESQEPVKASVRITPRPGSDIITNRKKVEGLVKLI 201
+ I L V + V+L MP+ +LF Q+ ASV +T PG + + ++ +V L+
Sbjct: 140 ARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRAL--DEGQISAVVHLV 197

Query: 202 QYAIEGLESDNIAIVDNSGTILNDFSNLDGIDRIDLAEKERKLKLKYEAMLRGEIDSALS 261
A+ GL N+ +VD SG +L SN G DL + + K E+ ++ I++ LS
Sbjct: 198 SSAVAGLPPGNVTLVDQSGHLL-TQSNTSG---RDLNDAQLKFANDVESRIQRRIEAILS 253

Query: 262 KVLSVDRFMIARVNVKLDTSKETTESKEYAPIELQSQDPKASYNTRKVSDSTIISSQTQK 321
++ A+V +LD + + + Y+P KA+ +R+++ S + +
Sbjct: 254 PIVGNGNV-HAQVTAQLDFANKEQTEEHYSP---NGDASKATLRSRQLNISEQVGAGYPG 309

Query: 322 KEYQGQGYSPWGPPGQEGNTPPEYQDLSD-------------ITGKYNESQEIKNVALNE 368
P P TPP Q + + + E N ++
Sbjct: 310 GVPGALSNQPAPPNEAPIATPPTNQQNAQNTPQTSTSTNSNSAGPRSTQRNETSNYEVDR 369

Query: 369 KKSTSEKEPARIVGVSLGIFVDGIWNFVYDEKGDFVIENGMRKREYKPMALEEIKNIEDV 428
++ I +S+ + V+ + + P+ +++K IED+
Sbjct: 370 TIRHTKMNVGDIERLSVAVVVNY---------------KTLADGKPLPLTADQMKQIEDL 414

Query: 429 LQSSFEYKPERGDSITVRNISFDRMNEFREIDENYFASER--FKYFLFIASIVFSLLILV 486
+ + + +RGD++ V N F ++ E F ++ L + L++
Sbjct: 415 TREAMGFSDKRGDTLNVVNSPFSAVDN--TGGELPFWQQQSFIDQLLAAGRWLLVLVVAW 472

Query: 487 FTIFFAISRELERRRRLREEELAKQAHLRRQQALMDG-----GDDIGVDDVVGGIREGDE 541
A+ +L RR EE A Q + +Q + D + R G E
Sbjct: 473 ILWRKAVRPQLTRR---VEEAKAAQEQAQVRQETEEAVEVRLSKDEQLQQRRANQRLGAE 529

Query: 542 LQSNA-ELLAREKPEDVAKLIRTWL 565
+ S ++ P VA +IR W+
Sbjct: 530 VMSQRIREMSDNDPRVVALVIRQWM 554


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BBUZS7_RS02815FLGHOOKFLIE862e-25 Flagellar hook-basal body complex protein FliE signa...
		>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE

signature.
Length = 103

Score = 85.9 bits (212), Expect = 2e-25
Identities = 15/71 (21%), Positives = 38/71 (53%)

Query: 40 TFKDVLINSITDVNKSQLNVSKVTEQAILKPSSIDVHDVVIAMSKANMNLSILKAVVERG 99
+F L ++ ++ +Q E+ L + ++DV+ M KA++++ + V +
Sbjct: 32 SFAGQLHAALDRISDTQTAARTQAEKFTLGEPGVALNDVMTDMQKASVSMQMGIQVRNKL 91

Query: 100 VKAYQDIINIR 110
V AYQ++++++
Sbjct: 92 VAAYQEVMSMQ 102


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BBUZS7_RS02820FLGHOOKAP1483e-09 Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 47.6 bits (113), Expect = 3e-09
Identities = 21/80 (26%), Positives = 35/80 (43%), Gaps = 15/80 (18%)

Query: 5 SSINVASTGLTAQRLRIDVISNNIANVSTSRTPDGGPYRRQRIIFAPRVNNPYWKGPFIP 64
S IN A +GL A + ++ SNNI++ + + Y RQ I A N
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVA------GYTRQTTIMAQ--ANSTLGA---- 49

Query: 65 DYLDNGIGQGVRVASIEKDK 84
+G GV V+ ++++
Sbjct: 50 ---GGWVGNGVYVSGVQREY 66



Score = 44.2 bits (104), Expect = 5e-08
Identities = 11/46 (23%), Positives = 23/46 (50%)

Query: 104 KKGYVELPNVNLVEEMVDMISASRAYEANSTVINSSKSMFRSALAI 149
+ VNL EE ++ + Y AN+ V+ ++ ++F + + I
Sbjct: 500 SNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BBUZS7_RS02830HTHFIS310.010 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 31.0 bits (70), Expect = 0.010
Identities = 13/70 (18%), Positives = 30/70 (42%), Gaps = 15/70 (21%)

Query: 20 KYIIGQDEAKKLVSIALVNRYIRSRLPKEIKDEVMPKNIIMIGSTGIGKTEIAR---RLS 76
++G+ A + + ++ R +++ L +++ G +G GK +AR
Sbjct: 137 MPLVGRSAAMQEI-YRVLARLMQTDLT-----------LMITGESGTGKELVARALHDYG 184

Query: 77 KLIKAPFIKV 86
K PF+ +
Sbjct: 185 KRRNGPFVAI 194


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BBUZS7_RS02845SYCDCHAPRONE290.012 Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD

chaperone signature.
Length = 168

Score = 28.7 bits (64), Expect = 0.012
Identities = 16/91 (17%), Positives = 34/91 (37%), Gaps = 2/91 (2%)

Query: 69 ARFFNLIGLEFFKLGQYGPAIEYFAKNLEINPNNYLSHFYIGVASYNLAKNLRVKDEVEK 128
+RFF +G +GQY AI ++ ++ F+ + + +
Sbjct: 70 SRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFL 129

Query: 129 YI-ILAENSFLKSLSIR-DDFKDSLFAISNM 157
++A+ + K LS R +++ M
Sbjct: 130 AQELIADKTEFKELSTRVSSMLEAIKLKKEM 160


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BBUZS7_RS02855SHAPEPROTEIN568e-11 Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein

signature.
Length = 347

Score = 55.9 bits (135), Expect = 8e-11
Identities = 50/226 (22%), Positives = 87/226 (38%), Gaps = 24/226 (10%)

Query: 160 TGSSSSSQNLVR-CVNRAGFAVDEVVLGSLASSYATLSKEEREMGVLFIDMGKGTTDIIL 218
G++ + +R AG ++ +A++ G + +D+G GTT++ +
Sbjct: 116 VGATQVERRAIRESAQGAGAREVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAV 175

Query: 219 YIDGSPYYTGVIPIGVNRVTLDIAQVWK------VPEDVAENIKITAGIAHPSILESQME 272
Y+ + IG +R I + + E AE IK G A+P ++E
Sbjct: 176 ISLNGVVYSSSVRIGGDRFDEAIINYVRRNYGSLIGEATAERIKHEIGSAYPGDEVREIE 235

Query: 273 TVIIPNLGTRPPQ--EKSRKELSVIINSRLREIFEMMKAEI------LKRGLYNKINGGI 324
V NL P+ + E+ + L I + + L + + G+
Sbjct: 236 -VRGRNLAEGVPRGFTLNSNEILEALQEPLTGIVSAVMVALEQCPPELASDISER---GM 291

Query: 325 VLTGGGALFPGISNLIEEVFNYPARIGL-PMSINGIGE----EHID 365
VLTGGGAL + L+ E P + P++ G E ID
Sbjct: 292 VLTGGGALLRNLDRLLMEETGIPVVVAEDPLTCVARGGGKALEMID 337


10BBUZS7_RS03225BBUZS7_RS03280N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
BBUZS7_RS03225-1110.914342S-ribosylhomocysteine lyase
BBUZS7_RS03230-1151.431533hypothetical protein
BBUZS7_RS032350181.815651HIT family protein
BBUZS7_RS032400172.282228magnesium transporter
BBUZS7_RS032450172.451014hypothetical protein
BBUZS7_RS03250-1183.723715BMP family protein
BBUZS7_RS032550204.098516BMP family protein
BBUZS7_RS032601204.458934BMP family protein
BBUZS7_RS032651195.024038BMP family protein
BBUZS7_RS032702194.95726230S ribosomal protein S7
BBUZS7_RS032751194.77942830S ribosomal protein S12
BBUZS7_RS032800194.756924DNA-directed RNA polymerase subunit beta'
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BBUZS7_RS03225LUXSPROTEIN1883e-64 Bacterial autoinducer-2 (AI-2) production protein Lu...
		>LUXSPROTEIN#Bacterial autoinducer-2 (AI-2) production protein LuxS

signature.
Length = 171

Score = 188 bits (479), Expect = 3e-64
Identities = 52/162 (32%), Positives = 86/162 (53%), Gaps = 11/162 (6%)

Query: 4 ITSFTIDHTKLN-PGIYVSR-KDTFENVIFTTIDIRIKAPNIEPIIENAAIHTIEHIGAT 61
+ SFT+DHT++N P + V++ T + T D+R APN + I+ IHT+EH+ A
Sbjct: 3 LDSFTVDHTRMNAPAVRVAKTMQTPKGDTITVFDLRFTAPN-KDILSEKGIHTLEHLYAG 61

Query: 62 LLRNN-EVWTEKIVYFGPMGCRTGFYLIIFGDYESKDLVDLVSWLFSE----IVNFSEPI 116
+RN+ + +I+ PMGCRTGFY+ + G + + D +W+ + V I
Sbjct: 62 FMRNHLNGDSVEIIDISPMGCRTGFYMSLIGTPSEQQVAD--AWIAAMEDVLKVENQNKI 119

Query: 117 PGASDKECGNYKEHNLDMAKYESSKYLQI-LNNIKEENLKYP 157
P ++ +CG H+LD AK + L++ + K + L P
Sbjct: 120 PELNEYQCGTAAMHSLDEAKQIAKNILEVGVAVNKNDELALP 161


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BBUZS7_RS03250LIPPROTEIN48732e-16 Mycoplasma P48 major surface lipoprotein signature.
		>LIPPROTEIN48#Mycoplasma P48 major surface lipoprotein signature.

Length = 428

Score = 72.7 bits (178), Expect = 2e-16
Identities = 72/286 (25%), Positives = 111/286 (38%), Gaps = 34/286 (11%)

Query: 13 TSCFSRNGIESSS-KKIKISMLVD-GVLDDKSFNASANEALLRLKKDFPENIEEVFSCAI 70
T+ + ++++ K+K ++ D G +DDKSFN SA EAL + K IE
Sbjct: 46 TNANGKQVVKNAELLKLKPVLITDEGKIDDKSFNQSAFEALKAINKQ--TGIEINNVEPS 103

Query: 71 SGVYSSYVSDLDNLKRNGSDLIWLVGYMLTDASL--LVSSENPKISYGIIDPIYGDDVQI 128
S S+Y S L + IW++ S+ + + ++ I I G D I
Sbjct: 104 SNFESAYNSALSAGHK-----IWVLNGFKHQQSIKQYIDAHREELERNQIK-IIGIDFDI 157

Query: 129 PEN---LIAVVFRVEQGAFLAGYIAAKKSFSGK------IGFIGGMKGNIVDAFRYGYES 179
++ F +++ AF GY A S + + GG V F G+
Sbjct: 158 ETEYKWFYSLQFNIKESAFTTGY-AIASWLSEQDESKRVVASFGGGAFPGVTTFNEGFAK 216

Query: 180 GAKYANKDIEIISEYSNSFSDVDIGRT-----------IASKMYSKGIDVIHFAAGLAGI 228
G Y N+ + Y S +D G T + S + H +AG
Sbjct: 217 GILYYNQKHKSSKIYHTSPVKLDSGFTAGEKMNTVINNVLSSTPADVKYNPHVILSVAGP 276

Query: 229 GVIEAAKNLGDGYYVIGADQDQSY-LAPKNFITSVIKNIGDALYLI 273
E + G YVIG D DQ +TSV+K+I A+Y
Sbjct: 277 ATFETVRLANKGQYVIGVDSDQGMIQDKDRILTSVLKHIKQAVYET 322


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BBUZS7_RS03255LIPPROTEIN48703e-15 Mycoplasma P48 major surface lipoprotein signature.
		>LIPPROTEIN48#Mycoplasma P48 major surface lipoprotein signature.

Length = 428

Score = 69.7 bits (170), Expect = 3e-15
Identities = 69/261 (26%), Positives = 105/261 (40%), Gaps = 29/261 (11%)

Query: 30 KVSLIID-GTFDDKSFNESALNGVKKVKEEFKIELVLKESSSNSYLSDLEGLKDAGSDLI 88
K LI D G DDKSFN+SA +K + ++ IE+ E SSN + S AG +
Sbjct: 63 KPVLITDEGKIDDKSFNQSAFEALKAINKQTGIEINNVEPSSN-FESAYNSALSAGHKIW 121

Query: 89 WLIGYRFS------DMAKVAALQNPDMKYAIID-PIYSNDPIPANLVGMTFRAQEGAFLT 141
L G++ A L+ +K ID I + + F +E AF T
Sbjct: 122 VLNGFKHQQSIKQYIDAHREELERNQIKIIGIDFDIETEYKW---FYSLQFNIKESAFTT 178

Query: 142 GY-IAARLSKTGK----IGFLGGIEGEIVDAFRYGYEAGAKYANKD-----------IKI 185
GY IA+ LS+ + + GG V F G+ G Y N+ +K+
Sbjct: 179 GYAIASWLSEQDESKRVVASFGGGAFPGVTTFNEGFAKGILYYNQKHKSSKIYHTSPVKL 238

Query: 186 STQYIGSFADLEAGRSVATRMYSDEIDIIHHAAGLGGIGAIEVAKELGSGHYIIGVDEDQ 245
+ + +V + +D H + G E + G Y+IGVD DQ
Sbjct: 239 DSGFTAGEKMNTVINNVLSSTPADVKYNPHVILSVAGPATFETVRLANKGQYVIGVDSDQ 298

Query: 246 AY-LAPDNVITSTTKDVGRAL 265
D ++TS K + +A+
Sbjct: 299 GMIQDKDRILTSVLKHIKQAV 319


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BBUZS7_RS03260LIPPROTEIN48505e-09 Mycoplasma P48 major surface lipoprotein signature.
		>LIPPROTEIN48#Mycoplasma P48 major surface lipoprotein signature.

Length = 428

Score = 50.4 bits (120), Expect = 5e-09
Identities = 59/296 (19%), Positives = 103/296 (34%), Gaps = 50/296 (16%)

Query: 18 FKSNKKSIKSDKV----VVGVLAHGSFYDKGYNQSVHDGVVKLRDNFGIKLITKSLRPYP 73
+ K+ +K+ ++ V + G DK +NQS + + + GI++
Sbjct: 47 NANGKQVVKNAELLKLKPVLITDEGKIDDKSFNQSAFEALKAINKQTGIEINNVE----- 101

Query: 74 IEGKRLLTVDEAMTEDAYEVQKNPLNLFW-LIGYRFSDLSVKL------SYERPDIYYGI 126
+ E AY + + W L G++ + ER I
Sbjct: 102 ---------PSSNFESAYNSALSAGHKIWVLNGFKHQQSIKQYIDAHREELERNQI---K 149

Query: 127 IDAFDYGDIQVPKNSLAIKFRNEEAAFLAGYIAA-----KMSRKEKIGFLTGPMSEHVKD 181
I D+ K +++F +E+AF GY A + K + G V
Sbjct: 150 IIGIDFDIETEYKWFYSLQFNIKESAFTTGYAIASWLSEQDESKRVVASFGGGAFPGVTT 209

Query: 182 FKFGFKAGIFYAN---PKLRLVSK---KAPSLFDKEKGKAMAL-------FMYKEDKVGV 228
F GF GI Y N ++ K S F + + + V
Sbjct: 210 FNEGFAKGILYYNQKHKSSKIYHTSPVKLDSGFTAGEKMNTVINNVLSSTPADVKYNPHV 269

Query: 229 IFPIAGITGLGVYDAAKELGPKYYVIGLNQDQSYI-APQNVITSIIKDIGKVIYSI 283
I +AG ++ + YVIG++ DQ I ++TS++K I + +Y
Sbjct: 270 ILSVAGPA---TFETVRLANKGQYVIGVDSDQGMIQDKDRILTSVLKHIKQAVYET 322


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BBUZS7_RS03265LIPPROTEIN48603e-12 Mycoplasma P48 major surface lipoprotein signature.
		>LIPPROTEIN48#Mycoplasma P48 major surface lipoprotein signature.

Length = 428

Score = 60.0 bits (145), Expect = 3e-12
Identities = 55/272 (20%), Positives = 97/272 (35%), Gaps = 31/272 (11%)

Query: 28 KTVSLIVDGAFDDKGFNESSSKAIRKLKADLNINIIEKASTGNSYLGDIANLEDGNSNLI 87
K V + +G DDK FN+S+ +A++ + I I +++ + +
Sbjct: 63 KPVLITDEGKIDDKSFNQSAFEALKAINKQTGIE-INNVEPSSNFESAYNSALSAGHKIW 121

Query: 88 WGIGFRLSDILFQ---RASENVSVNYAIIEGV-YDEIQIPKNLLNISFRSEEVAFLAGY- 142
GF+ + Q E + N I G+ +D K ++ F +E AF GY
Sbjct: 122 VLNGFKHQQSIKQYIDAHREELERNQIKIIGIDFDIETEYKWFYSLQFNIKESAFTTGYA 181

Query: 143 ----FASKASKTGKIGFVGGVRGKVLESFMYGYEAGAKYANSNIKVVSQYVGTFGDFGLG 198
+ + + GG + +F G+ G Y N K Y + G
Sbjct: 182 IASWLSEQDESKRVVASFGGGAFPGVTTFNEGFAKGILYYNQKHKSSKIYHTSPVKLDSG 241

Query: 199 ---------------RSTASNMYRDGVDIIFAAAGLSGIGVIEAAKELGPDHYIIGVDQD 243
ST +++ + I+ A + E + Y+IGVD D
Sbjct: 242 FTAGEKMNTVINNVLSSTPADVKYNPHVILSVAGPATF----ETVRLANKGQYVIGVDSD 297

Query: 244 QSY-LAPNNVIVSAVKKVDSLMYS-LTKKYLE 273
Q + ++ S +K + +Y L LE
Sbjct: 298 QGMIQDKDRILTSVLKHIKQAVYETLLDLILE 329


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BBUZS7_RS03280RTXTOXIND310.037 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 31.0 bits (70), Expect = 0.037
Identities = 7/17 (41%), Positives = 14/17 (82%)

Query: 1192 KHLLVRDGDVVKAGDML 1208
K ++V++G+ V+ GD+L
Sbjct: 108 KEIIVKEGESVRKGDVL 124


11BBUZS7_RS05255BBUZS7_RS05275N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
BBUZS7_RS052551220.612882muramidase
BBUZS7_RS052602171.254344flagellar P-ring protein
BBUZS7_RS052651132.469658hypothetical protein
BBUZS7_RS052700123.349793flagellar basal-body rod protein FlgG
BBUZS7_RS052753152.697866flagellar basal-body rod protein FlgF
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BBUZS7_RS05255FLGFLGJ486e-10 Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 48.2 bits (114), Expect = 6e-10
Identities = 21/69 (30%), Positives = 39/69 (56%), Gaps = 2/69 (2%)

Query: 37 DLRKASLEFEAMFIKQMLESMKKTLNKDQNLLNGGQVEEIFEDMLCEQRAKQMAQAQSFG 96
++R + + E MF++ ML+SM+ L KD L + ++ M +Q A+QM + G
Sbjct: 32 NIRPVARQVEGMFVQMMLKSMRDALPKDG--LFSSEHTRLYTSMYDQQIAQQMTAGKGLG 89

Query: 97 LADLIYNQL 105
LA+++ Q+
Sbjct: 90 LAEMMVKQM 98


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BBUZS7_RS05260FLGPRINGFLGI2577e-86 Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 257 bits (658), Expect = 7e-86
Identities = 83/352 (23%), Positives = 153/352 (43%), Gaps = 60/352 (17%)

Query: 35 SLSESVKLKEIADIYPTNTNFLTGIGIVAGLAGKGDSIKQKDL----IIKILEENNIINE 90
+ +++ ++K+IA + N L G G+V GL G GDS++ + +L+ I +
Sbjct: 24 AQADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDSLRSSPFTEQSMRAMLQNLGITTQ 83

Query: 91 IGSNNIESKNIALVNVSLQVKGNTIKGSKHKACVASILDSKDLTNGILLKTNLKNKEGEI 150
G +N +KNIA V V+ + GS+ V+S+ D+ L G L+ T+L +G+I
Sbjct: 84 GGQSN--AKNIAAVMVTANLPPFASPGSRVDVTVSSLGDATSLRGGNLIMTSLSGADGQI 141

Query: 151 IAIASGITQPNN-KLKGSGYTI-----------DSVIINEN--QNINHSYNIILKKGN-- 194
A+A G N +G T+ + II S N++L+ N
Sbjct: 142 YAVAQGALIVNGFSAQGDAATLTQGVTTSARVPNGAIIERELPSKFKDSVNLVLQLRNPD 201

Query: 195 YTLINRIHKILTS---KKINNKI---KSDSTIEIEAKNIS----LLEEIENIKIETN--P 242
++ R+ ++ + + + I + I ++ ++ L+ EIEN+ +ET+
Sbjct: 202 FSTAVRVADVVNAFARARYGDPIAEPRDSQEIAVQKPRVADLTRLMAEIENLTVETDTPA 261

Query: 243 KILIDKKNGIILASENAKI-------GTFTFSIEKDNQNI----FLSKNNKTTIQVNSMK 291
K++I+++ G I+ + +I GT T + + Q I F Q + M
Sbjct: 262 KVVINERTGTIVIGADVRISRVAVSYGTLTVQVTESPQVIQPAPFSRGQTAVQPQTDIMA 321

Query: 292 LNE----FILK-----------NSNNLSNKELIQIIQAAQKINKLNGELILE 328
+ E I++ NS L +I I+Q + L EL+L+
Sbjct: 322 MQEGSKVAIVEGPDLRTLVAGLNSIGLKADGIIAILQGIKSAGALQAELVLQ 373


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BBUZS7_RS05270FLGHOOKAP1452e-07 Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 44.9 bits (106), Expect = 2e-07
Identities = 10/44 (22%), Positives = 23/44 (52%)

Query: 220 ILEMSNVSIAEEMVTMIVAQRAYEINSKAIQTSDNMLGIANNLK 263
+S V++ EE + Q+ Y N++ +QT++ + N++
Sbjct: 503 QQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR 546



Score = 39.6 bits (92), Expect = 1e-05
Identities = 19/79 (24%), Positives = 34/79 (43%), Gaps = 14/79 (17%)

Query: 5 LWTAASGMTAQQYNVDTIANNLSNVNTTGFKKIRAEFEDLIYQTHNRAGTPATENTLRPL 64
+ A SG+ A Q ++T +NN+S+ N G+ + A N+
Sbjct: 4 INNAMSGLNAAQAALNTASNNISSYNVAGYTRQTT--------------IMAQANSTLGA 49

Query: 65 GNQVGHGTKIAATQRIFEQ 83
G VG+G ++ QR ++
Sbjct: 50 GGWVGNGVYVSGVQREYDA 68


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BBUZS7_RS05275FLGHOOKAP1439e-07 Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 43.0 bits (101), Expect = 9e-07
Identities = 14/64 (21%), Positives = 28/64 (43%)

Query: 214 DTKTSGKAQEIDISLRPKIETETLEASNVNAVKEMVLMIEINRAYEANQKTIQTEDSLLG 273
T T + ++ ++ + S VN +E + + Y AN + +QT +++
Sbjct: 481 KTATLKTSSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFD 540

Query: 274 KLIN 277
LIN
Sbjct: 541 ALIN 544



Score = 38.4 bits (89), Expect = 2e-05
Identities = 13/39 (33%), Positives = 23/39 (58%)

Query: 4 GIYTAASGMMAERRKLDTVSNNLANIDLIGYKKDLSIQK 42
I A SG+ A + L+T SNN+++ ++ GY + +I
Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMA 41



 
Contact Sachin Pundhir for Bugs/Comments.
For best view 1024 x 768 resolution & IE 6.0 or above recommended.