PredictBias

identification of genomic and pathogenicity islands in prokaryotic genome
Home | Help | Analyzed genomes
 
A) Input parameters
GenomeB31.gbffThreshold dinucleotide bias2
Threshold codon bias4Threshold %GC bias3
E-value (RPSBlast)0.05Genome (non-pathogenic)
 
B) Compare a potential GI or PAI in related non-pathogenic sp. (phylogenetic tree)
Potential GI or PAI start    end  
Select Organism     
 
C) Potential GIs and PAIs in NC_001318 (download)
S.NoStartEndBiasVirulenceInsertion elementsPrediction
1BB_RS00335BB_RS00385Y        NNGenomic Island
LocusTagDNBiasCDNBias%GCBiasProduct
BB_RS00335234-2.216951Cof-type HAD-IIB family hydrolase
BB_RS00340333-1.490541aminopeptidase
BB_RS00345233-2.732785divergent PAP2 family protein
BB_RS00350333-3.294957hypothetical protein
BB_RS00355433-3.317092membrane protein
BB_RS00360427-3.652551hypothetical protein
BB_RS00365327-3.227380peptide chain release factor 2
BB_RS00370229-5.143499hypothetical protein
BB_RS00375020-4.447441signal recognition particle-docking protein
BB_RS00380017-4.207114hypothetical protein
BB_RS00385-115-3.485055ABC transporter ATP-binding protein
2BB_RS00855BB_RS00910Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
BB_RS00855292.639210MoxR family ATPase
BB_RS00860292.23141116S rRNA (guanine(527)-N(7))-methyltransferase
BB_RS008652122.665107tRNA uridine-5-carboxymethylaminomethyl(34)
BB_RS008704191.422062tRNA uridine-5-carboxymethylaminomethyl(34)
BB_RS008753261.102717FlbF protein
BB_RS008802221.374999flagellar hook-associated protein FlgK
BB_RS008850140.335902flagellar hook-associated protein 3
BB_RS00890110-1.459835flagellar assembly protein FliW
BB_RS00895214-1.668894carbon storage regulator CsrA
BB_RS00900117-0.093332tRNA
BB_RS00905212-1.159342tRNA
BB_RS00910215-2.985017hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BB_RS00855HTHFIS414e-06 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 41.0 bits (96), Expect = 4e-06
Identities = 36/143 (25%), Positives = 55/143 (38%), Gaps = 15/143 (10%)

Query: 33 KEMIDAILMGLLTDGHVLLEGVPGLAKTL---AIQTVSDVLDLEFKRIQ---FTPDLLPS 86
+E+ + + TD +++ G G K L A+ + F I DL+ S
Sbjct: 147 QEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIES 206

Query: 87 DLTGNMVYKSA-TGTFKVRKGPVFS----NVILADEINRAPAKVQSALLEAMGERQVT-L 140
+L G K A TG G F + DEI P Q+ LL + + + T +
Sbjct: 207 ELFG--HEKGAFTGAQTRSTG-RFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTV 263

Query: 141 GDETHRLPDPFFVLATQNPIEQE 163
G T D V AT ++Q
Sbjct: 264 GGRTPIRSDVRIVAATNKDLKQS 286


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BB_RS00870TCRTETOQM340.002 Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family

signature.
Length = 639

Score = 33.7 bits (77), Expect = 0.002
Identities = 34/112 (30%), Positives = 48/112 (42%), Gaps = 14/112 (12%)

Query: 229 GSVNAGKSSLFNLFLKKDRSIVSSYPGTTRDYIEASFELDGILFNLFDTAGLRDADNFVE 288
GSV+ G + N L++ R I T + I SF+ + N+ DT G D V
Sbjct: 34 GSVDKGTTRTDNTLLERQRGI------TIQTGI-TSFQWENTKVNIIDTPGHMDFLAEVY 86

Query: 289 RLGIEKSNSLIKEASLVIYVIDVSSNLTKDDFLFIDSNKSNSKILFVLNKID 340
R S S++ A L+I D T+ LF K +F +NKID
Sbjct: 87 R-----SLSVLDGAILLISAKDGVQAQTR--ILFHALRKMGIPTIFFINKID 131


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BB_RS00880FLGHOOKAP15210.0 Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 521 bits (1344), Expect = 0.0
Identities = 138/633 (21%), Positives = 260/633 (41%), Gaps = 105/633 (16%)

Query: 6 SGIEIGKRSLFAHKDAMNTVGHNLSNATKPGYSRQRVTMKTEIPLYAPQLNRAKKQGQLG 65
S I L A + A+NT +N+S+ GY+RQ M A + G +G
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIM-------AQANSTLGAGGWVG 54

Query: 66 QGIVVQSIDRVKDELLNTRIIEESHRLGYWTSQDKFISILEDVYNEPEDQSIRKRLNDFW 125
G+ V + R D + ++ + T++ + +S ++++ + S+ ++ DF+
Sbjct: 55 NGVYVSGVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLST-STSSLATQMQDFF 113

Query: 126 ESWHDLANQPQGLAERKIILERGKSFCEGIRNRFHSLERIYIMANDEIKI----TTDEAN 181
S L + + A R+ ++ + EG+ N+F + ++ + ++ I + D+ N
Sbjct: 114 TSLQTLVSNAEDPAARQALIGKS----EGLVNQFKTTDQYLRDQDKQVNIAIGASVDQIN 169

Query: 182 NYIRNIANLNKQISKSQAMK--DNPNDLMDARDLMVEKLGNIISVSIENKQDPNEFLIHA 239
NY + IA+LN QIS+ + +PN+L+D RD +V +L I+ V + + + A
Sbjct: 170 NYAKQIASLNDQISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMA 229

Query: 240 EGRHLVQGSIANEF-KLEATNGPTRTRWNIL--WANN---DKAYLKTGKLGSLLNIRDEE 293
G LVQGS A + + ++ P+RT + A N + L TG LG +L R ++
Sbjct: 230 NGYSLVQGSTARQLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQD 289

Query: 294 IKNEINELNNIAANIIEIVNEIHEAGRGMDKKNGRSFFSQELKLTDDRGRYDTNGNGQFD 353
+ N L +A E N H+AG + G FF+ + N + D
Sbjct: 290 LDQTRNTLGQLALAFAEAFNTQHKAGFDANGDAGEDFFAIG------KPAVLQNTKNKGD 343

Query: 354 -SVHIFKINSTNEIFPEEKLGFYGTLKFEATNSNEIVEIPYNAPDTVQDVINRINNSNAQ 412
++ +++ + + K+ F +++ T A +T V N A
Sbjct: 344 VAIGATVTDASAVLATDYKISFDNN-QWQVTRL---------ASNTTFTVTPDANGKVAF 393

Query: 413 VTARINSEGKLEIKAVKEQEDENITFKIKHIEDSGSFLTKYTGILNASGPEGAYDYKNID 472
+ G AV + +F +K + D I+N ++
Sbjct: 394 DGLELTFTGTP---AVND------SFTLKPVSD---------AIVNM----------DVL 425

Query: 473 TTD--KLAPKSTYSISPLKNPAAWIKVADIIDSDPSKIASGIKNPTNEISIGDNQAALRI 530
TD K+A S + A DSD + + +N ++G ++
Sbjct: 426 ITDEAKIAMAS-------EEDAG--------DSDNRNGQALLDLQSNSKTVGGAKSF--N 468

Query: 531 SSFGNSQIMIGKNLTLNDYFANTASNIAIKGQISEITKESQSQILKDLTDLRMSISGVNK 590
++ + IG + T N+ +++++ + Q SISGVN
Sbjct: 469 DAYASLVSDIGNKTATLKTSSATQGNV-----VTQLSNQQQ------------SISGVNL 511

Query: 591 DEELANMIEFQQAFIAASKFITVSVELIDTVIN 623
DEE N+ FQQ ++A ++ + + + D +IN
Sbjct: 512 DEEYGNLQRFQQYYLANAQVLQTANAIFDALIN 544


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BB_RS00885FLAGELLIN584e-11 Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 57.7 bits (139), Expect = 4e-11
Identities = 37/133 (27%), Positives = 59/133 (44%), Gaps = 5/133 (3%)

Query: 10 TYENFKTSSAEQESKITKLLENLYKGGKRIVKLRNDPTGVTHAIRLDNDIFKLNVYIKNI 69
T N S + S I +L G RI ++D G A R ++I L +N
Sbjct: 13 TQNNLNKSQSSLSSAIERL-----SSGLRINSAKDDAAGQAIANRFTSNIKGLTQASRNA 67

Query: 70 DTSKSNLRYTEGYLQSLTNILTRAKEIAIQGASGTYESDDKKMISKEVNALLEDVVAIAN 129
+ S + TEG L + N L R +E+++Q +GT D K I E+ LE++ ++N
Sbjct: 68 NDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEIDRVSN 127

Query: 130 AKGPDGYSIFSGT 142
+G + S
Sbjct: 128 QTQFNGVKVLSQD 140



Score = 31.2 bits (70), Expect = 0.008
Identities = 33/358 (9%), Positives = 87/358 (24%), Gaps = 12/358 (3%)

Query: 61 KLNVYIKNIDTSKSNLRYTEGYLQSLTNILTRAKEIAIQGASGTYESDDKKMISKEVNAL 120
+ + ++ ID L + TY K +
Sbjct: 154 TITIDLQKIDVKSLGLDGFNVNGPKEATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGA 213

Query: 121 LEDVVAIANAKGPDGYSIFSGTKIDSEAFKVTRENKISKTSKDGAGPQIIKVEYNGNQAE 180
+ + +G +A T + T + + +
Sbjct: 214 VVTDTTAPTVPDKVYVNAANGQLTTDDAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGK 273

Query: 181 KKTEVYNDIHMSNNYPGNVIFFLQNQNIISSINTNGFAVKENTKIYIDNIEIGLTAGDTA 240
+ + + +S+ I + ++
Sbjct: 274 EGDTFDYK---GVTFTIDTKTGNDGNGKVSTTINGEKVTLTVADITAGAANVDAATLQSS 330

Query: 241 LDIVAKINESSAPVEASIDPVLNSLSIKTTTPHQIWITEEKESNVLQTLGILTKNNDTKL 300
++ + + LS + + + K+
Sbjct: 331 KNVYTSVVNGQFTFDDKTKNESAKLSDLEANN----AVKGESKITVNGAEYTANAAGDKV 386

Query: 301 PPYNLSSSTEVRSRSIFDALIELRDTLYNNKEELVGSRSLAEIDESLKRLLISVADLGAK 360
+ + + + + E + + S ID +L ++ + LGA
Sbjct: 387 TLAGKTMFIDKTASGVSTLINEDAAAAKKSTANPLAS-----IDSALSKVDAVRSSLGAI 441

Query: 361 ENRLDRSYERISKEAADMKEDMIQYTDLDVTKAITNLNMASLAYQVSIGISAKIMQTT 418
+NR D + + ++ + D D ++N++ A + Q + A+ Q
Sbjct: 442 QNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSNMSKAQILQQAGTSVLAQANQVP 499


3BB_RS01370BB_RS01565Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
BB_RS013701173.167318flagella biosynthesis regulatory protein FliZ
BB_RS013751184.123279flagellar motor switch protein FliN
BB_RS013801174.901598flagellar motor switch protein FliM
BB_RS013850194.498714flagellar basal body-associated protein FliL
BB_RS01390-2192.559014flagellar motor protein MotB
BB_RS01395-2191.943013motility protein A
BB_RS01400-1150.405977flagellar protein FlbD
BB_RS01405-2151.186085flagellar hook protein FlgE
BB_RS01410116-0.292315flagellar hook assembly protein FlgD
BB_RS014155170.714144flagellar hook-length control protein FliK
BB_RS014206172.988842flagellar protein
BB_RS014255183.262211flagellar protein FlbA
BB_RS014305194.256038flagellar protein export ATPase FliI
BB_RS014356193.943844flagellar assembly protein FliH
BB_RS014404193.553892flagellar motor switch protein FliG
BB_RS014452193.437541flagellar basal body M-ring protein FliF
BB_RS014502192.147811flagellar hook-basal body complex protein FliE
BB_RS014551181.522118flagellar basal body rod protein FlgC
BB_RS014601192.806766flagellar basal body rod protein FlgB
BB_RS014650193.879571HslU--HslV peptidase ATPase subunit
BB_RS01470-1193.326185ATP-dependent protease subunit HslV
BB_RS01475-2172.505134DNA-protecting protein DprA
BB_RS01480-1162.608167hypothetical protein
BB_RS01485-1152.508221cell division protein FtsZ
BB_RS014900130.749144cell division protein FtsA
BB_RS01495113-0.626675cell division protein FtsQ/DivIB
BB_RS01500314-0.756865putative lipid II flippase FtsW
BB_RS01505115-2.272683phospho-N-acetylmuramoyl-pentapeptide-
BB_RS01510-116-3.095256UDP-N-acetylmuramoyl-tripeptide--D-alanyl-D-
BB_RS01515017-3.418486hypothetical protein
BB_RS01520016-2.79611816S rRNA (cytosine(1402)-N(4))-methyltransferase
BB_RS01525117-3.180282hypothetical protein
BB_RS01530-113-3.099629hypothetical protein
BB_RS01535012-2.370655hypothetical protein
BB_RS01540-113-2.698255NAD(+)/NADH kinase
BB_RS01545-113-3.250685chemotaxis protein CheW
BB_RS01550010-3.249426RlmE family RNA methyltransferase
BB_RS01555010-3.103625polyprenyl synthetase family protein
BB_RS01560010-3.127578hypothetical protein
BB_RS01565115-3.193593ABC transporter permease
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BB_RS01375FLGMOTORFLIN1037e-32 Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 103 bits (258), Expect = 7e-32
Identities = 40/77 (51%), Positives = 62/77 (80%)

Query: 35 NFGLLMDVSMQLTVELGRTERKIKDILGMSEGTIITLDKLAGEPVDILVNGKIVAKGEVV 94
+ L+MD+ ++LTVELGRT IK++L +++G+++ LD LAGEP+DIL+NG ++A+GEVV
Sbjct: 53 DIDLIMDIPVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVV 112

Query: 95 VIDENFGVRITEIIKTK 111
V+ + +GVRIT+II
Sbjct: 113 VVADKYGVRITDIITPS 129


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BB_RS01380FLGMOTORFLIM458e-165 Flagellar motor switch protein FliM signature.
		>FLGMOTORFLIM#Flagellar motor switch protein FliM signature.

Length = 344

Score = 458 bits (1181), Expect = e-165
Identities = 195/345 (56%), Positives = 259/345 (75%), Gaps = 9/345 (2%)

Query: 8 LSQDDIDSLLESINSSESLSLDESLSNVISSPTGKKQKVKVYDFKRPDKFSKEQVRTVSS 67
LSQD+ID LL +I+S ++ D + P +K+ +YDF+RPDKFSKEQ+RT+S
Sbjct: 5 LSQDEIDQLLTAISSGDASIED-------ARPISDTRKITLYDFRRPDKFSKEQMRTLSL 57

Query: 68 FHEAFARYTTTSLSALLRKMVHVHVASVDQLTYEEFIRSIPNPTTLAIINMDPLKGSAIF 127
HE FAR TTTSLSA LR MVHVHVASVDQLTYEEFIRSIP P+TLA+I MDPLKG+A+
Sbjct: 58 MHETFARLTTTSLSAQLRSMVHVHVASVDQLTYEEFIRSIPTPSTLAVITMDPLKGNAVL 117

Query: 128 EVDPTIAFAIVDRLFGGDGDTIKDKSRDLTEIEQSVMESVIIRILANMREAWSQVVDLRP 187
EVDP+I F+I+DRLFGG G K + RDLT+IE SVME VI+RILAN+RE+W+QV+DLRP
Sbjct: 118 EVDPSITFSIIDRLFGGTGQAAKVQ-RDLTDIENSVMEGVIVRILANVRESWTQVIDLRP 176

Query: 188 RFGHIEVNPQFAQIVPPTEMIILVTLEVKIGKVEGLMNFCLPYITIEPIVSKLSTRYWHS 247
R G IE NPQFAQIVPP+EM++LVTLE K+G+ EG+MNFC+PYITIEPI+SKLS+++W S
Sbjct: 177 RLGQIETNPQFAQIVPPSEMVVLVTLETKVGEEEGMMNFCIPYITIEPIISKLSSQFWFS 236

Query: 248 LIGVGTTSENLDALREKLENTAMPLVAEIGEVKLKVREILSLDKGDVLNLESSLINKDLT 307
+ +T++ + LR+KL M +VAE+G ++L VR+IL L GD++ L + +
Sbjct: 237 SVRRSSTTQYMGVLRDKLSTVDMDVVAEVGSLRLSVRDILGLRVGDIIRLHDTHVGDPFV 296

Query: 308 LKVGTKEKFKCRMGLMGNKVSVQITEKIGDIKGFDLLKELTEEVE 352
L +G ++KF C+ G++G K++ QI E+I D +EL+ + E
Sbjct: 297 LSIGNRKKFLCQPGVVGKKIAAQILERIESTSQED-FEELSADEE 340


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BB_RS01390OMPADOMAIN558e-11 OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 54.6 bits (131), Expect = 8e-11
Identities = 28/132 (21%), Positives = 57/132 (43%), Gaps = 21/132 (15%)

Query: 124 ISLAADAFFDSASADVKLEENRDSIQKIASFIGFLSPRGYNFKIEGHTDNIDTDVNGPWK 183
+L +D F+ A +K E + ++ ++ S + L P+ + + G+TD I +D
Sbjct: 215 FTLKSDVLFNFNKATLK-PEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSD-----A 268

Query: 184 SNWELSAARSVNMLEHILNYLDQSDVKRIENNFEVSGFGGSRPIATDDT---------PE 234
N LS R+ + +++YL + + G G S P+ + +
Sbjct: 269 YNQGLSERRA----QSVVDYLISKGIPA--DKISARGMGESNPVTGNTCDNVKQRAALID 322

Query: 235 GRAYNRRIDILI 246
A +RR++I +
Sbjct: 323 CLAPDRRVEIEV 334


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BB_RS01405FLGHOOKAP1514e-09 Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 51.5 bits (123), Expect = 4e-09
Identities = 18/89 (20%), Positives = 33/89 (37%), Gaps = 13/89 (14%)

Query: 4 SLYSGVSGLQNHQTRMDVVGNNIANVNTIGFKKGRVNFQDMISQSISGASRPTDARGGTN 63
+ + +SGL Q ++ NNI++ N G+ + I + T GG
Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTT---------IMAQANSTLGAGGW- 52

Query: 64 PKQVGLGMNVASIDTIHTQGAFQSTQKAS 92
VG G+ V+ + + + A
Sbjct: 53 ---VGNGVYVSGVQREYDAFITNQLRAAQ 78



Score = 40.7 bits (95), Expect = 1e-05
Identities = 9/48 (18%), Positives = 28/48 (58%)

Query: 394 IRSGVLEMANVDLAEQFTDMIVTQRGFQANAKTITTSDQLLQELVRLK 441
+ + ++ V+L E++ ++ Q+ + ANA+ + T++ + L+ ++
Sbjct: 499 LSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR 546


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BB_RS01415FLGHOOKFLIK401e-05 Flagellar hook-length control protein signature.
		>FLGHOOKFLIK#Flagellar hook-length control protein signature.

Length = 375

Score = 39.8 bits (92), Expect = 1e-05
Identities = 24/82 (29%), Positives = 45/82 (54%), Gaps = 5/82 (6%)

Query: 281 KLVLKPKELGSIRINLNLDSNNNLLGKIVVDNQNVKMLFDQNMHSLNKMLGESGFNASLN 340
+L L P++LG ++I+L +D N + ++V +Q+V+ + + L L ESG +
Sbjct: 260 ELRLHPQDLGEVQISLKVDDNQAQI-QMVSPHQHVRAALEAALPVLRTQLAESGIQLGQS 318

Query: 341 LFLAGENLNSFTGNFKDDSKDQ 362
++GE SF+G + S+ Q
Sbjct: 319 -NISGE---SFSGQQQAASQQQ 336


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BB_RS01420TYPE4SSCAGX290.016 Type IV secretion system CagX conjugation protein si...
		>TYPE4SSCAGX#Type IV secretion system CagX conjugation protein

signature.
Length = 522

Score = 29.0 bits (64), Expect = 0.016
Identities = 34/143 (23%), Positives = 66/143 (46%), Gaps = 20/143 (13%)

Query: 39 RYFPEFVRTKLLGETSLVFDHNSNIILDEAR--LVKEREAIDIKNQQIEKLKEDLKLKED 96
R + EF++TK L+ D L+E + L KE+EA + + Q+ +K K + + +E
Sbjct: 120 RDYQEFLKTK-----KLIVDAPDPKELEEQKKALEKEKEAKE-QAQKAQKDKREKRKEER 173

Query: 97 SLNKLEFELKQKQKDLDLKQKIIDDIINKYNDEEANILQTAVYLMNMPPEDAVKRLEDLN 156
+ N+ E + + + N N L + D ++RLED+
Sbjct: 174 AKNRANLE------------NLTNAMSNPQNLSNNKNLSELIKQQRENELDQMERLEDMQ 221

Query: 157 PELAISYMRKIEELSKKEGRLSI 179
+ + +++IEEL+KK+ ++
Sbjct: 222 EQAQANALKQIEELNKKQAEEAV 244


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BB_RS01435FLGFLIH468e-08 Flagellar assembly protein FliH signature.
		>FLGFLIH#Flagellar assembly protein FliH signature.

Length = 228

Score = 45.6 bits (107), Expect = 8e-08
Identities = 42/197 (21%), Positives = 96/197 (48%), Gaps = 22/197 (11%)

Query: 115 KESIETESNAEIER-LAREYEEKLKTDLEIAIAKGREEGYSKGY--------ESGFEDFD 165
+E+I E+ +E+ LA+ + + + IA+GR++G+ +GY E G +
Sbjct: 29 EETIIEEAEPSLEQQLAQLQMQAHEQGYQAGIAEGRQQGHKQGYQEGLAQGLEQGLAEAK 88

Query: 166 KVMRKLHVIIASLIAERKGILESSSGQIVSLVMQIAIKVIKRITDSQKDI----VLENVN 221
+H + L++E + L++ I S +MQ+A++ +++ + +++ +
Sbjct: 89 SQQAPIHARMQQLVSEFQTTLDALDSVIASRLMQMALEAARQVIGQTPTVDNSALIKQIQ 148

Query: 222 EVLKR---VKDKTQITIRVNLDDLDIVRHKKSDFISRFDIIENLEIIEDPNIGKGGCIIE 278
++L++ K Q+ RV+ DDL V +S + + DP + GGC +
Sbjct: 149 QLLQQEPLFSGKPQL--RVHPDDLQRVDDMLGATLS----LHGWRLRGDPTLHPGGCKVS 202

Query: 279 TNFGEIDARISSQLDKI 295
+ G++DA ++++ ++
Sbjct: 203 ADEGDLDASVATRWQEL 219


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BB_RS01440FLGMOTORFLIG427e-153 Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 427 bits (1100), Expect = e-153
Identities = 344/344 (100%), Positives = 344/344 (100%)

Query: 1 MEEKKEKEILDVSALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITS 60
MEEKKEKEILDVSALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITS
Sbjct: 1 MEEKKEKEILDVSALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITS 60

Query: 61 ELKDNVLLEFKELMMAQEFIQKGGIDYARELLEKSLGTQKAVDIINNLGSALQSRPFEFV 120
ELKDNVLLEFKELMMAQEFIQKGGIDYARELLEKSLGTQKAVDIINNLGSALQSRPFEFV
Sbjct: 61 ELKDNVLLEFKELMMAQEFIQKGGIDYARELLEKSLGTQKAVDIINNLGSALQSRPFEFV 120

Query: 121 RRADPANILNFIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPE 180
RRADPANILNFIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPE
Sbjct: 121 RRADPANILNFIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPE 180

Query: 181 VVREVERVLEKKLASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEI 240
VVREVERVLEKKLASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEI
Sbjct: 181 VVREVERVLEKKLASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEI 240

Query: 241 KKKMFVFEDIVLLDDRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKED 300
KKKMFVFEDIVLLDDRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKED
Sbjct: 241 KKKMFVFEDIVLLDDRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKED 300

Query: 301 MEFLGPTRRKDVEESQQKIVSLIRKLEEQGEIVISRGGEEDVLV 344
MEFLGPTRRKDVEESQQKIVSLIRKLEEQGEIVISRGGEEDVLV
Sbjct: 301 MEFLGPTRRKDVEESQQKIVSLIRKLEEQGEIVISRGGEEDVLV 344


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BB_RS01445FLGMRINGFLIF1621e-45 Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 162 bits (412), Expect = 1e-45
Identities = 112/565 (19%), Positives = 222/565 (39%), Gaps = 55/565 (9%)

Query: 23 QKIALGLIIFFVILALVFLIGFSTKSQSIALF-GVEIKDQYLLDRISQRLDRENVKYFLS 81
+I L + + +V ++ ++ LF + +D I +L + N+ Y +
Sbjct: 23 PRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQD---GGAIVAQLTQMNIPYRFA 79

Query: 82 SDGRIYLDDEKLAKKMRAILVREELVPVHMDPWALFDIDRWTITDFERSINLRRSITRAV 141
+ ++R L ++ L + L D +++ I+ F +N +R++ +
Sbjct: 80 NGSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQFSEQVNYQRALEGEL 139

Query: 142 EQHIVALDDVDAVSVNLVMPEKALFKESQEPVKASVRITPRPGSDIITNRKKVEGLVKLI 201
+ I L V + V+L MP+ +LF Q+ ASV +T PG + + ++ +V L+
Sbjct: 140 ARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRAL--DEGQISAVVHLV 197

Query: 202 QYAIEGLESDNIAIVDNSGTILNDFSNLDGIDRIDLAEKERKLKLKYEAMLRGEIDSALS 261
A+ GL N+ +VD SG +L SN G DL + + K E+ ++ I++ LS
Sbjct: 198 SSAVAGLPPGNVTLVDQSGHLL-TQSNTSG---RDLNDAQLKFANDVESRIQRRIEAILS 253

Query: 262 KVLSVDRFMIARVNVKLDTSKETTESKEYAPIELQSQDPKASYNTRKVSDSTIISSQTQK 321
++ A+V +LD + + + Y+P KA+ +R+++ S + +
Sbjct: 254 PIVGNGNV-HAQVTAQLDFANKEQTEEHYSP---NGDASKATLRSRQLNISEQVGAGYPG 309

Query: 322 KEYQGQGYSPWGPPGQEGNTPPEYQDLSD-------------ITGKYNESQEIKNVALNE 368
P P TPP Q + + + E N ++
Sbjct: 310 GVPGALSNQPAPPNEAPIATPPTNQQNAQNTPQTSTSTNSNSAGPRSTQRNETSNYEVDR 369

Query: 369 KKSTSEKEPARIVGVSLGIFVDGIWNFVYDEKGDFVIENGMRKREYKPMALEEIKNIEDV 428
++ I +S+ + V+ + + P+ +++K IED+
Sbjct: 370 TIRHTKMNVGDIERLSVAVVVNY---------------KTLADGKPLPLTADQMKQIEDL 414

Query: 429 LQSSFEYKPERGDSITVRNISFDRMNEFREIDENYFASER--FKYFLFIASIVFSLLILV 486
+ + + +RGD++ V N F ++ E F ++ L + L++
Sbjct: 415 TREAMGFSDKRGDTLNVVNSPFSAVDN--TGGELPFWQQQSFIDQLLAAGRWLLVLVVAW 472

Query: 487 FTIFFAISRELERRRRLREEELAKQAHLRRQQALMDG-----GDDIGVDDVVGGIREGDE 541
A+ +L RR EE A Q + +Q + D + R G E
Sbjct: 473 ILWRKAVRPQLTRR---VEEAKAAQEQAQVRQETEEAVEVRLSKDEQLQQRRANQRLGAE 529

Query: 542 LQSNA-ELLAREKPEDVAKLIRTWL 565
+ S ++ P VA +IR W+
Sbjct: 530 VMSQRIREMSDNDPRVVALVIRQWM 554


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BB_RS01450FLGHOOKFLIE862e-25 Flagellar hook-basal body complex protein FliE signa...
		>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE

signature.
Length = 103

Score = 85.9 bits (212), Expect = 2e-25
Identities = 15/71 (21%), Positives = 38/71 (53%)

Query: 40 TFKDVLINSITDVNKSQLNVSKVTEQAILKPSSIDVHDVVIAMSKANMNLSILKAVVERG 99
+F L ++ ++ +Q E+ L + ++DV+ M KA++++ + V +
Sbjct: 32 SFAGQLHAALDRISDTQTAARTQAEKFTLGEPGVALNDVMTDMQKASVSMQMGIQVRNKL 91

Query: 100 VKAYQDIINIR 110
V AYQ++++++
Sbjct: 92 VAAYQEVMSMQ 102


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BB_RS01455FLGHOOKAP1483e-09 Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 47.6 bits (113), Expect = 3e-09
Identities = 21/80 (26%), Positives = 35/80 (43%), Gaps = 15/80 (18%)

Query: 5 SSINVASTGLTAQRLRIDVISNNIANVSTSRTPDGGPYRRQRIIFAPRVNNPYWKGPFIP 64
S IN A +GL A + ++ SNNI++ + + Y RQ I A N
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVA------GYTRQTTIMAQ--ANSTLGA---- 49

Query: 65 DYLDNGIGQGVRVASIEKDK 84
+G GV V+ ++++
Sbjct: 50 ---GGWVGNGVYVSGVQREY 66



Score = 44.2 bits (104), Expect = 5e-08
Identities = 11/46 (23%), Positives = 23/46 (50%)

Query: 104 KKGYVELPNVNLVEEMVDMISASRAYEANSTVINSSKSMFRSALAI 149
+ VNL EE ++ + Y AN+ V+ ++ ++F + + I
Sbjct: 500 SNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BB_RS01465HTHFIS310.011 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 31.0 bits (70), Expect = 0.011
Identities = 13/70 (18%), Positives = 30/70 (42%), Gaps = 15/70 (21%)

Query: 20 KYIIGQDEAKKLVSIALVNRYIRSRLPKEIKDEVMPKNIIMIGSTGIGKTEIAR---RLS 76
++G+ A + + ++ R +++ L +++ G +G GK +AR
Sbjct: 137 MPLVGRSAAMQEI-YRVLARLMQTDLT-----------LMITGESGTGKELVARALHDYG 184

Query: 77 KLIKAPFIKV 86
K PF+ +
Sbjct: 185 KRRNGPFVAI 194


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BB_RS01480SYCDCHAPRONE290.012 Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD

chaperone signature.
Length = 168

Score = 28.7 bits (64), Expect = 0.012
Identities = 16/91 (17%), Positives = 34/91 (37%), Gaps = 2/91 (2%)

Query: 69 ARFFNLIGLEFFKLGQYGPAIEYFAKNLEINPNNYLSHFYIGVASYNLAKNLRVKDEVEK 128
+RFF +G +GQY AI ++ ++ F+ + + +
Sbjct: 70 SRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFL 129

Query: 129 YI-ILAENSFLKSLSIR-DDFKDSLFAISNM 157
++A+ + K LS R +++ M
Sbjct: 130 AQELIADKTEFKELSTRVSSMLEAIKLKKEM 160


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BB_RS01490SHAPEPROTEIN568e-11 Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein

signature.
Length = 347

Score = 55.9 bits (135), Expect = 8e-11
Identities = 50/226 (22%), Positives = 87/226 (38%), Gaps = 24/226 (10%)

Query: 160 TGSSSSSQNLVR-CVNRAGFAVDEVVLGSLASSYATLSKEEREMGVLFIDMGKGTTDIIL 218
G++ + +R AG ++ +A++ G + +D+G GTT++ +
Sbjct: 116 VGATQVERRAIRESAQGAGAREVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAV 175

Query: 219 YIDGSPYYTGVIPIGVNRVTLDIAQVWK------VPEDVAENIKITAGIAHPSILESQME 272
Y+ + IG +R I + + E AE IK G A+P ++E
Sbjct: 176 ISLNGVVYSSSVRIGGDRFDEAIINYVRRNYGSLIGEATAERIKHEIGSAYPGDEVREIE 235

Query: 273 TVIIPNLGTRPPQ--EKSRKELSVIINSRLREIFEMMKAEI------LKRGLYNKINGGI 324
V NL P+ + E+ + L I + + L + + G+
Sbjct: 236 -VRGRNLAEGVPRGFTLNSNEILEALQEPLTGIVSAVMVALEQCPPELASDISER---GM 291

Query: 325 VLTGGGALFPGISNLIEEVFNYPARIGL-PMSINGIGE----EHID 365
VLTGGGAL + L+ E P + P++ G E ID
Sbjct: 292 VLTGGGALLRNLDRLLMEETGIPVVVAEDPLTCVARGGGKALEMID 337


4BB_RS01715BB_RS01740Y        NNGenomic Island
LocusTagDNBiasCDNBias%GCBiasProduct
BB_RS01715-110-3.036517pyruvate kinase
BB_RS01720010-5.159762AmmeMemoRadiSam system protein B
BB_RS01725013-4.65066750S ribosomal protein L28
BB_RS01730013-4.990113hypothetical protein
BB_RS01735013-5.150408hypothetical protein
BB_RS01740016-3.525977hypothetical protein
5BB_RS01985BB_RS02020Y        NNGenomic Island
LocusTagDNBiasCDNBias%GCBiasProduct
BB_RS01985211-0.211648dicarboxylate/amino acid:cation symporter
BB_RS019901110.910539proline--tRNA ligase
BB_RS019952122.015644DUF2259 domain-containing protein
BB_RS020004112.448600hypothetical protein
BB_RS020053112.421926DUF3996 domain-containing protein
BB_RS020104101.006005DUF3996 domain-containing protein
BB_RS02015390.520384mannose-6-phosphate isomerase, class I
BB_RS02020212-0.212586PTS transporter subunit EIIA
6BB_RS03185BB_RS03250Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
BB_RS03185224-2.917560PTS transporter subunit EIIA
BB_RS03190224-3.7214771-phosphofructokinase
BB_RS03195221-3.779450hypothetical protein
BB_RS03205218-3.403349*exodeoxyribonuclease V subunit alpha
BB_RS03210212-2.611606exodeoxyribonuclease V subunit beta
BB_RS03215110-1.643585nicotinate phosphoribosyltransferase
BB_RS03220190.139224glucose-6-phosphate dehydrogenase
BB_RS03225280.066012Na+/H+ antiporter NhaC family protein
BB_RS032303130.620210Na+/H+ antiporter NhaC family protein
BB_RS03235317-0.132793ABC transporter substrate-binding protein
BB_RS03240422-0.380735ABC transporter permease
BB_RS032452151.738407ABC transporter permease
BB_RS032502132.132251ABC transporter ATP-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BB_RS03205MYCMG045300.020 Hypothetical mycoplasma lipoprotein (MG045) signature.
		>MYCMG045#Hypothetical mycoplasma lipoprotein (MG045) signature.

Length = 483

Score = 30.4 bits (68), Expect = 0.020
Identities = 30/113 (26%), Positives = 58/113 (51%), Gaps = 12/113 (10%)

Query: 76 LLAKDIQNTIIFTKDNLEKTNKSYNKLIKILKGLETFGNLETIKNIVLLLK--KNNILME 133
L+ +D+ + I +++ NL+K++ S +K+ + F +++IK I K KNN L+
Sbjct: 84 LIERDLLSPIDWSQFNLKKSSSSSDKVNNASDAKDLF--IDSIKEISQQTKDSKNNELLH 141

Query: 134 FNKLKITTPLILENNIYIYTQKNYREEEE---LIKQIIKRLENHKSELNDNKI 183
+ P L+N +++Y + E E+ +IK + HK NDN++
Sbjct: 142 W-----AVPYFLQNLVFVYRGEKISELEQENVSWTDVIKAIVKHKDRFNDNRL 189


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BB_RS03240MYCMG045346e-04 Hypothetical mycoplasma lipoprotein (MG045) signature.
		>MYCMG045#Hypothetical mycoplasma lipoprotein (MG045) signature.

Length = 483

Score = 34.3 bits (78), Expect = 6e-04
Identities = 25/120 (20%), Positives = 59/120 (49%), Gaps = 4/120 (3%)

Query: 1 MKKIFILIVILTTFACTNKDTITLNVFNWAEYIDKTLLDQFEKENNIKINYEIFHNNEEM 60
+K F + + + ++ + T + N+ YI LL++ ++++ + + + +NE++
Sbjct: 5 LKYCFFSLFVSLSSILSSCGSTTFVLANFESYISPLLLERVQEKH--PLTFLTYPSNEKL 62

Query: 61 MAKFNNTKNYYDIIVPSEYLIQELIDEGKIEKLDYSKLPNVTKNITQNLTNLEHDPGNLY 120
+ F N N Y + V S Y + ELI+ + +D+S+ + + + N D +L+
Sbjct: 63 INGFAN--NTYSVAVASTYAVSELIERDLLSPIDWSQFNLKKSSSSSDKVNNASDAKDLF 120


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BB_RS03255PF05272310.008 Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.8 bits (69), Expect = 0.008
Identities = 10/20 (50%), Positives = 12/20 (60%)

Query: 36 ITLLGPSGCGKTTLIKILGG 55
+ L G G GK+TLI L G
Sbjct: 599 VVLEGTGGIGKSTLINTLVG 618


7BB_RS03515BB_RS03605Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
BB_RS035152202.837875signal recognition particle protein
BB_RS035201191.46496430S ribosomal protein S16
BB_RS035251200.698156KH domain-containing protein
BB_RS035301200.18037016S rRNA processing protein RimM
BB_RS035352220.528589tRNA (guanosine(37)-N1)-methyltransferase TrmD
BB_RS035400250.32229050S ribosomal protein L19
BB_RS03545-215-2.412922hypothetical protein
BB_RS03550-110-2.365293pantetheine-phosphate adenylyltransferase
BB_RS03555010-2.88891450S ribosomal protein L32
BB_RS03560-19-2.706975acyl carrier protein
BB_RS03565-210-2.559368ribonuclease III
BB_RS0357019-1.667061CCA tRNA nucleotidyltransferase
BB_RS03575215-0.638981hypothetical protein
BB_RS03580313-0.495616hypothetical protein
BB_RS035952140.910272**endolytic transglycosylase MltG
BB_RS036003141.024304RNA polymerase sigma factor RpoD
BB_RS036053130.950488hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BB_RS03550LPSBIOSNTHSS1984e-68 Lipopolysaccharide core biosynthesis protein signat...
		>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein

signature.
Length = 166

Score = 198 bits (505), Expect = 4e-68
Identities = 57/157 (36%), Positives = 91/157 (57%), Gaps = 3/157 (1%)

Query: 4 AVFPGSFDPITWGHIDLIKRSLAIFDKVIVLVAKNKSKKYFLSDIERFSLTKDVISSLNF 63
A++PGSFDPIT+GH+D+I+R +FD+V V V +N +K+ S ER I+ L
Sbjct: 3 AIYPGSFDPITFGHLDIIERGCRLFDQVYVAVLRNPNKQPMFSVQERLEQIAKAIAHL-- 60

Query: 64 SNVLVDRYSGFIVDYALINSIKFIVRGIRAFNDFDIEFERYLVNNKLNFEIDTIFLPSSA 123
N VD + G V+YA I+RG+R +DF++E + N L +++T+FL +S
Sbjct: 61 PNAQVDSFEGLTVNYARQRQAGAILRGLRVLSDFELELQMANTNKTLASDLETVFLTTST 120

Query: 124 EHLYVRSDFVKELMLKKDVDLSNFVPELVFNRLKSKF 160
E+ ++ S VKE+ + ++ +FVP V L +F
Sbjct: 121 EYSFLSSSLVKEVA-RFGGNVEHFVPSHVAAALYDQF 156


8BB_RS04060BB_RS04160Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
BB_RS040602174.072688transcription termination/antitermination
BB_RS040650152.887049translation initiation factor IF-2
BB_RS04070-1150.17916030S ribosome-binding factor RbfA
BB_RS04075-1150.017977tRNA pseudouridine(55) synthase TruB
BB_RS040800140.56213030S ribosomal protein S15
BB_RS04085-1140.152388polyribonucleotide nucleotidyltransferase
BB_RS04090114-1.548149lipoprotein
BB_RS04095211-1.179810YjgP/YjgQ family permease
BB_RS04100211-0.488162YjgP/YjgQ family permease
BB_RS04105412-0.919425tRNA guanosine(34) transglycosylase Tgt
BB_RS04110312-1.667776murein biosynthesis integral membrane protein
BB_RS04115210-1.424881HEAT repeat domain-containing protein
BB_RS04120412-2.024346bifunctional phosphopantothenoylcysteine
BB_RS04125215-1.918927DUF997 family protein
BB_RS04130015-1.648401sodium/pantothenate symporter
BB_RS04135014-1.112250hypothetical protein
BB_RS04140013-1.067072UDP-N-acetylmuramate--L-alanine ligase
BB_RS04145013-1.207618YicC family protein
BB_RS04150014-0.486793AAA family ATPase
BB_RS04155-113-0.135867DNA-directed RNA polymerase subunit omega
BB_RS04160314-2.090085tRNA (adenosine(37)-N6)-dimethylallyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BB_RS04065TCRTETOQM762e-16 Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family

signature.
Length = 639

Score = 76.4 bits (188), Expect = 2e-16
Identities = 41/144 (28%), Positives = 60/144 (41%), Gaps = 22/144 (15%)

Query: 385 ITIMGHVDHGKTKLLSVL------------------QNIDINQTESGGITQHIGAYTIVY 426
I ++ HVD GKT L L + + GIT G + +
Sbjct: 6 IGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITSFQW 65

Query: 427 NDREITFLDTPGHEAFTMMRSRGAQVTDIVVLVVSAIDGVMPQTIEAINHAKEANVPIIV 486
+ ++ +DTPGH F R V D +L++SA DGV QT + ++ +P I
Sbjct: 66 ENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGIPTIF 125

Query: 487 AINKIDLPDSNPDK----IKHQLS 506
INKID + IK +LS
Sbjct: 126 FINKIDQNGIDLSTVYQDIKEKLS 149


9BB_RS00855BB_RS00885N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
BB_RS00855292.639210MoxR family ATPase
BB_RS00860292.23141116S rRNA (guanine(527)-N(7))-methyltransferase
BB_RS008652122.665107tRNA uridine-5-carboxymethylaminomethyl(34)
BB_RS008704191.422062tRNA uridine-5-carboxymethylaminomethyl(34)
BB_RS008753261.102717FlbF protein
BB_RS008802221.374999flagellar hook-associated protein FlgK
BB_RS008850140.335902flagellar hook-associated protein 3
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BB_RS00855HTHFIS414e-06 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 41.0 bits (96), Expect = 4e-06
Identities = 36/143 (25%), Positives = 55/143 (38%), Gaps = 15/143 (10%)

Query: 33 KEMIDAILMGLLTDGHVLLEGVPGLAKTL---AIQTVSDVLDLEFKRIQ---FTPDLLPS 86
+E+ + + TD +++ G G K L A+ + F I DL+ S
Sbjct: 147 QEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIES 206

Query: 87 DLTGNMVYKSA-TGTFKVRKGPVFS----NVILADEINRAPAKVQSALLEAMGERQVT-L 140
+L G K A TG G F + DEI P Q+ LL + + + T +
Sbjct: 207 ELFG--HEKGAFTGAQTRSTG-RFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTV 263

Query: 141 GDETHRLPDPFFVLATQNPIEQE 163
G T D V AT ++Q
Sbjct: 264 GGRTPIRSDVRIVAATNKDLKQS 286


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BB_RS00870TCRTETOQM340.002 Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family

signature.
Length = 639

Score = 33.7 bits (77), Expect = 0.002
Identities = 34/112 (30%), Positives = 48/112 (42%), Gaps = 14/112 (12%)

Query: 229 GSVNAGKSSLFNLFLKKDRSIVSSYPGTTRDYIEASFELDGILFNLFDTAGLRDADNFVE 288
GSV+ G + N L++ R I T + I SF+ + N+ DT G D V
Sbjct: 34 GSVDKGTTRTDNTLLERQRGI------TIQTGI-TSFQWENTKVNIIDTPGHMDFLAEVY 86

Query: 289 RLGIEKSNSLIKEASLVIYVIDVSSNLTKDDFLFIDSNKSNSKILFVLNKID 340
R S S++ A L+I D T+ LF K +F +NKID
Sbjct: 87 R-----SLSVLDGAILLISAKDGVQAQTR--ILFHALRKMGIPTIFFINKID 131


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BB_RS00880FLGHOOKAP15210.0 Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 521 bits (1344), Expect = 0.0
Identities = 138/633 (21%), Positives = 260/633 (41%), Gaps = 105/633 (16%)

Query: 6 SGIEIGKRSLFAHKDAMNTVGHNLSNATKPGYSRQRVTMKTEIPLYAPQLNRAKKQGQLG 65
S I L A + A+NT +N+S+ GY+RQ M A + G +G
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIM-------AQANSTLGAGGWVG 54

Query: 66 QGIVVQSIDRVKDELLNTRIIEESHRLGYWTSQDKFISILEDVYNEPEDQSIRKRLNDFW 125
G+ V + R D + ++ + T++ + +S ++++ + S+ ++ DF+
Sbjct: 55 NGVYVSGVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLST-STSSLATQMQDFF 113

Query: 126 ESWHDLANQPQGLAERKIILERGKSFCEGIRNRFHSLERIYIMANDEIKI----TTDEAN 181
S L + + A R+ ++ + EG+ N+F + ++ + ++ I + D+ N
Sbjct: 114 TSLQTLVSNAEDPAARQALIGKS----EGLVNQFKTTDQYLRDQDKQVNIAIGASVDQIN 169

Query: 182 NYIRNIANLNKQISKSQAMK--DNPNDLMDARDLMVEKLGNIISVSIENKQDPNEFLIHA 239
NY + IA+LN QIS+ + +PN+L+D RD +V +L I+ V + + + A
Sbjct: 170 NYAKQIASLNDQISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMA 229

Query: 240 EGRHLVQGSIANEF-KLEATNGPTRTRWNIL--WANN---DKAYLKTGKLGSLLNIRDEE 293
G LVQGS A + + ++ P+RT + A N + L TG LG +L R ++
Sbjct: 230 NGYSLVQGSTARQLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQD 289

Query: 294 IKNEINELNNIAANIIEIVNEIHEAGRGMDKKNGRSFFSQELKLTDDRGRYDTNGNGQFD 353
+ N L +A E N H+AG + G FF+ + N + D
Sbjct: 290 LDQTRNTLGQLALAFAEAFNTQHKAGFDANGDAGEDFFAIG------KPAVLQNTKNKGD 343

Query: 354 -SVHIFKINSTNEIFPEEKLGFYGTLKFEATNSNEIVEIPYNAPDTVQDVINRINNSNAQ 412
++ +++ + + K+ F +++ T A +T V N A
Sbjct: 344 VAIGATVTDASAVLATDYKISFDNN-QWQVTRL---------ASNTTFTVTPDANGKVAF 393

Query: 413 VTARINSEGKLEIKAVKEQEDENITFKIKHIEDSGSFLTKYTGILNASGPEGAYDYKNID 472
+ G AV + +F +K + D I+N ++
Sbjct: 394 DGLELTFTGTP---AVND------SFTLKPVSD---------AIVNM----------DVL 425

Query: 473 TTD--KLAPKSTYSISPLKNPAAWIKVADIIDSDPSKIASGIKNPTNEISIGDNQAALRI 530
TD K+A S + A DSD + + +N ++G ++
Sbjct: 426 ITDEAKIAMAS-------EEDAG--------DSDNRNGQALLDLQSNSKTVGGAKSF--N 468

Query: 531 SSFGNSQIMIGKNLTLNDYFANTASNIAIKGQISEITKESQSQILKDLTDLRMSISGVNK 590
++ + IG + T N+ +++++ + Q SISGVN
Sbjct: 469 DAYASLVSDIGNKTATLKTSSATQGNV-----VTQLSNQQQ------------SISGVNL 511

Query: 591 DEELANMIEFQQAFIAASKFITVSVELIDTVIN 623
DEE N+ FQQ ++A ++ + + + D +IN
Sbjct: 512 DEEYGNLQRFQQYYLANAQVLQTANAIFDALIN 544


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BB_RS00885FLAGELLIN584e-11 Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 57.7 bits (139), Expect = 4e-11
Identities = 37/133 (27%), Positives = 59/133 (44%), Gaps = 5/133 (3%)

Query: 10 TYENFKTSSAEQESKITKLLENLYKGGKRIVKLRNDPTGVTHAIRLDNDIFKLNVYIKNI 69
T N S + S I +L G RI ++D G A R ++I L +N
Sbjct: 13 TQNNLNKSQSSLSSAIERL-----SSGLRINSAKDDAAGQAIANRFTSNIKGLTQASRNA 67

Query: 70 DTSKSNLRYTEGYLQSLTNILTRAKEIAIQGASGTYESDDKKMISKEVNALLEDVVAIAN 129
+ S + TEG L + N L R +E+++Q +GT D K I E+ LE++ ++N
Sbjct: 68 NDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEIDRVSN 127

Query: 130 AKGPDGYSIFSGT 142
+G + S
Sbjct: 128 QTQFNGVKVLSQD 140



Score = 31.2 bits (70), Expect = 0.008
Identities = 33/358 (9%), Positives = 87/358 (24%), Gaps = 12/358 (3%)

Query: 61 KLNVYIKNIDTSKSNLRYTEGYLQSLTNILTRAKEIAIQGASGTYESDDKKMISKEVNAL 120
+ + ++ ID L + TY K +
Sbjct: 154 TITIDLQKIDVKSLGLDGFNVNGPKEATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGA 213

Query: 121 LEDVVAIANAKGPDGYSIFSGTKIDSEAFKVTRENKISKTSKDGAGPQIIKVEYNGNQAE 180
+ + +G +A T + T + + +
Sbjct: 214 VVTDTTAPTVPDKVYVNAANGQLTTDDAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGK 273

Query: 181 KKTEVYNDIHMSNNYPGNVIFFLQNQNIISSINTNGFAVKENTKIYIDNIEIGLTAGDTA 240
+ + + +S+ I + ++
Sbjct: 274 EGDTFDYK---GVTFTIDTKTGNDGNGKVSTTINGEKVTLTVADITAGAANVDAATLQSS 330

Query: 241 LDIVAKINESSAPVEASIDPVLNSLSIKTTTPHQIWITEEKESNVLQTLGILTKNNDTKL 300
++ + + LS + + + K+
Sbjct: 331 KNVYTSVVNGQFTFDDKTKNESAKLSDLEANN----AVKGESKITVNGAEYTANAAGDKV 386

Query: 301 PPYNLSSSTEVRSRSIFDALIELRDTLYNNKEELVGSRSLAEIDESLKRLLISVADLGAK 360
+ + + + + E + + S ID +L ++ + LGA
Sbjct: 387 TLAGKTMFIDKTASGVSTLINEDAAAAKKSTANPLAS-----IDSALSKVDAVRSSLGAI 441

Query: 361 ENRLDRSYERISKEAADMKEDMIQYTDLDVTKAITNLNMASLAYQVSIGISAKIMQTT 418
+NR D + + ++ + D D ++N++ A + Q + A+ Q
Sbjct: 442 QNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSNMSKAQILQQAGTSVLAQANQVP 499


10BB_RS01340BB_RS01490N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
BB_RS01340-1181.302588flagellar biosynthesis protein FlhF
BB_RS01345-1170.997388flagellar biosynthesis protein FlhA
BB_RS01350018-0.109581flagellar biosynthesis protein FlhB
BB_RS01355-1170.624559flagellar biosynthetic protein FliR
BB_RS013600171.744123flagellar biosynthesis protein FliQ
BB_RS013650162.310531flagellar type III secretion system pore protein
BB_RS013701173.167318flagella biosynthesis regulatory protein FliZ
BB_RS013751184.123279flagellar motor switch protein FliN
BB_RS013801174.901598flagellar motor switch protein FliM
BB_RS013850194.498714flagellar basal body-associated protein FliL
BB_RS01390-2192.559014flagellar motor protein MotB
BB_RS01395-2191.943013motility protein A
BB_RS01400-1150.405977flagellar protein FlbD
BB_RS01405-2151.186085flagellar hook protein FlgE
BB_RS01410116-0.292315flagellar hook assembly protein FlgD
BB_RS014155170.714144flagellar hook-length control protein FliK
BB_RS014206172.988842flagellar protein
BB_RS014255183.262211flagellar protein FlbA
BB_RS014305194.256038flagellar protein export ATPase FliI
BB_RS014356193.943844flagellar assembly protein FliH
BB_RS014404193.553892flagellar motor switch protein FliG
BB_RS014452193.437541flagellar basal body M-ring protein FliF
BB_RS014502192.147811flagellar hook-basal body complex protein FliE
BB_RS014551181.522118flagellar basal body rod protein FlgC
BB_RS014601192.806766flagellar basal body rod protein FlgB
BB_RS014650193.879571HslU--HslV peptidase ATPase subunit
BB_RS01470-1193.326185ATP-dependent protease subunit HslV
BB_RS01475-2172.505134DNA-protecting protein DprA
BB_RS01480-1162.608167hypothetical protein
BB_RS01485-1152.508221cell division protein FtsZ
BB_RS014900130.749144cell division protein FtsA
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BB_RS01340PF05272310.011 Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.8 bits (69), Expect = 0.011
Identities = 8/23 (34%), Positives = 12/23 (52%)

Query: 176 VFILVGPTGVGKTTTIAKLAAIY 198
+L G G+GK+T I L +
Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLD 620


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BB_RS01350TYPE3IMSPROT339e-117 Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein

family signature.
Length = 354

Score = 339 bits (871), Expect = e-117
Identities = 101/345 (29%), Positives = 182/345 (52%), Gaps = 9/345 (2%)

Query: 25 RTELPTDQKKQKAREEGRVLKSTEINTAVSLLLLFALFFFMLSYFA---LDLIAVFKEQA 81
+TE PT +K + AR++G+V KS E+ + ++ L A+ + Y+ L+ + EQ+
Sbjct: 5 KTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIPAEQS 64

Query: 82 IKLPEVMRMSVYTMGFAYIRSIMGYVVLFFFASLAVNFFVNIIQVGFFITFKSLEPRWDK 141
LP +S + + +L A +A+ +++Q GF I+ ++++P K
Sbjct: 65 Y-LPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIA--SHVVQYGFLISGEAIKPDIKK 121

Query: 142 ISFNFSRWAKNSFFSAGAFFNLFKSLLKVVIICLIYYFIIENNIGKISKLSEYTLQSGIS 201
I N AK FS + KS+LKVV++ ++ + II+ N+ + +L ++
Sbjct: 122 I--NPIEGAKR-IFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITP 178

Query: 202 IVLVIAYKICFFSVMFLAIVGVFDYLFQRSQYIESLKMTKEEVKQERKEMEGDPLLRSRI 261
++ I ++ + ++ + DY F+ QYI+ LKM+K+E+K+E KEMEG P ++S+
Sbjct: 179 LLGQILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKR 238

Query: 262 KERMRVILSTNLRVAIPQADVVITNPEHFAVAIKWDSETMLAPKVLAKGQDEIALTIKKI 321
++ + I S N+R + ++ VV+ NP H A+ I + P V K D T++KI
Sbjct: 239 RQFHQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKI 298

Query: 322 ARENNVPLMENKLLARALYANVKVNEEIPREYWEIVSKILVRVYS 366
A E VP+++ LARALY + V+ IP E E +++L +
Sbjct: 299 AEEEGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLER 343


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BB_RS01355TYPE3IMRPROT1132e-32 Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein

family signature.
Length = 261

Score = 113 bits (285), Expect = 2e-32
Identities = 46/242 (19%), Positives = 107/242 (44%), Gaps = 4/242 (1%)

Query: 16 VLVRIFMFLKFSPFFSTIKI-GYFNFFFSLILSVIVVEKIKIIYPLDNMLSFALILLGEA 74
L+R+ + +P S + +++++ + + + + +
Sbjct: 19 PLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPANDVPVFSFFALWLAVQQI 78

Query: 75 ILGLIQAFFVNIIFNVFHLVGFFFSNQIGLAYANIFDVFSEEDSMIISQIFAYLFLLLFL 134
++G+ F + F G Q+GL++A D S + ++++I L LLLFL
Sbjct: 79 LIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHLNMPVLARIMDMLALLLFL 138

Query: 135 SSDFLLRFFVIGIHDSVLNIRVEHLVNMRNSGFVKLLLMSFGFLFEKALLISFPILSLLL 194
+ + L + + + D+ + + NS L + +F L+++ P+++LLL
Sbjct: 139 TFNGHL-WLISLLVDTFHTLPI--GGEPLNSNAFLALTKAGSLIFLNGLMLALPLITLLL 195

Query: 195 LFYLVLGILSKSSPQINLLIISFSTSLFLGLLILYIGFPSLAISSKRVIELSLDSLASFL 254
L LG+L++ +PQ+++ +I F +L +G+ ++ P +A + + + LA +
Sbjct: 196 TLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFCEHLFSEIFNLLADII 255

Query: 255 KL 256

Sbjct: 256 SE 257


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BB_RS01360TYPE3IMQPROT612e-16 Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein

family signature.
Length = 86

Score = 61.3 bits (149), Expect = 2e-16
Identities = 21/76 (27%), Positives = 43/76 (56%)

Query: 6 ILYLIRISIENIIILSAPMLIIALIVGLLISIFQAITSIQDQTLSFIPKIIVILLVIVIF 65
+++ ++ ++ILS I+A I+GLL+ +FQ +T +Q+QTL F K++ + L + +
Sbjct: 4 LVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLFLL 63

Query: 66 GPWILNKLMQFTYMIF 81
W L+ + +
Sbjct: 64 SGWYGEVLLSYGRQVI 79


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BB_RS01365FLGBIOSNFLIP2603e-90 Escherichia coli: Flagellar biosynthetic protein Fl...
		>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP

signature.
Length = 245

Score = 260 bits (667), Expect = 3e-90
Identities = 97/213 (45%), Positives = 138/213 (64%), Gaps = 3/213 (1%)

Query: 41 GGSEIAFSLQLLILLTIITLSPAFLVLMTSFLRISIVLDFIRRALSLQQSPPTQIVMGLA 100
GG + +Q L+ +T +T PA L++MTSF RI IV +R AL +PP Q+++GLA
Sbjct: 34 GGQSWSLPVQTLVFITSLTFIPAILLMMTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLA 93

Query: 101 LFLTIFTMWPTFNSIYEQAYLPLKESKINFNEFYNKGIAPLRIFMYKQMSDGRHEEIRLF 160
LFLT F M P + IY AY P E KI+ E KG PLR FM +Q R ++ LF
Sbjct: 94 LFLTFFIMSPVIDKIYVDAYQPFSEEKISMQEALEKGAQPLREFMLRQT---READLGLF 150

Query: 161 MSMSNYDRPKNFSEVPTHVLIAAFILHELKVAFKMGILIFLPFIVLDIIVASVLMAMGMI 220
++N + VP +L+ A++ ELK AF++G IF+PF+++D+++ASVLMA+GM+
Sbjct: 151 ARLANTGPLQGPEAVPMRILLPAYVTSELKTAFQIGFTIFIPFLIIDLVIASVLMALGMM 210

Query: 221 MLPPVMISLPFKLILFVMVDGWTLITSGLIKSF 253
M+PP I+LPFKL+LFV+VDGW L+ L +SF
Sbjct: 211 MVPPATIALPFKLMLFVLVDGWQLLVGSLAQSF 243


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BB_RS01375FLGMOTORFLIN1037e-32 Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 103 bits (258), Expect = 7e-32
Identities = 40/77 (51%), Positives = 62/77 (80%)

Query: 35 NFGLLMDVSMQLTVELGRTERKIKDILGMSEGTIITLDKLAGEPVDILVNGKIVAKGEVV 94
+ L+MD+ ++LTVELGRT IK++L +++G+++ LD LAGEP+DIL+NG ++A+GEVV
Sbjct: 53 DIDLIMDIPVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVV 112

Query: 95 VIDENFGVRITEIIKTK 111
V+ + +GVRIT+II
Sbjct: 113 VVADKYGVRITDIITPS 129


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BB_RS01380FLGMOTORFLIM458e-165 Flagellar motor switch protein FliM signature.
		>FLGMOTORFLIM#Flagellar motor switch protein FliM signature.

Length = 344

Score = 458 bits (1181), Expect = e-165
Identities = 195/345 (56%), Positives = 259/345 (75%), Gaps = 9/345 (2%)

Query: 8 LSQDDIDSLLESINSSESLSLDESLSNVISSPTGKKQKVKVYDFKRPDKFSKEQVRTVSS 67
LSQD+ID LL +I+S ++ D + P +K+ +YDF+RPDKFSKEQ+RT+S
Sbjct: 5 LSQDEIDQLLTAISSGDASIED-------ARPISDTRKITLYDFRRPDKFSKEQMRTLSL 57

Query: 68 FHEAFARYTTTSLSALLRKMVHVHVASVDQLTYEEFIRSIPNPTTLAIINMDPLKGSAIF 127
HE FAR TTTSLSA LR MVHVHVASVDQLTYEEFIRSIP P+TLA+I MDPLKG+A+
Sbjct: 58 MHETFARLTTTSLSAQLRSMVHVHVASVDQLTYEEFIRSIPTPSTLAVITMDPLKGNAVL 117

Query: 128 EVDPTIAFAIVDRLFGGDGDTIKDKSRDLTEIEQSVMESVIIRILANMREAWSQVVDLRP 187
EVDP+I F+I+DRLFGG G K + RDLT+IE SVME VI+RILAN+RE+W+QV+DLRP
Sbjct: 118 EVDPSITFSIIDRLFGGTGQAAKVQ-RDLTDIENSVMEGVIVRILANVRESWTQVIDLRP 176

Query: 188 RFGHIEVNPQFAQIVPPTEMIILVTLEVKIGKVEGLMNFCLPYITIEPIVSKLSTRYWHS 247
R G IE NPQFAQIVPP+EM++LVTLE K+G+ EG+MNFC+PYITIEPI+SKLS+++W S
Sbjct: 177 RLGQIETNPQFAQIVPPSEMVVLVTLETKVGEEEGMMNFCIPYITIEPIISKLSSQFWFS 236

Query: 248 LIGVGTTSENLDALREKLENTAMPLVAEIGEVKLKVREILSLDKGDVLNLESSLINKDLT 307
+ +T++ + LR+KL M +VAE+G ++L VR+IL L GD++ L + +
Sbjct: 237 SVRRSSTTQYMGVLRDKLSTVDMDVVAEVGSLRLSVRDILGLRVGDIIRLHDTHVGDPFV 296

Query: 308 LKVGTKEKFKCRMGLMGNKVSVQITEKIGDIKGFDLLKELTEEVE 352
L +G ++KF C+ G++G K++ QI E+I D +EL+ + E
Sbjct: 297 LSIGNRKKFLCQPGVVGKKIAAQILERIESTSQED-FEELSADEE 340


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BB_RS01390OMPADOMAIN558e-11 OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 54.6 bits (131), Expect = 8e-11
Identities = 28/132 (21%), Positives = 57/132 (43%), Gaps = 21/132 (15%)

Query: 124 ISLAADAFFDSASADVKLEENRDSIQKIASFIGFLSPRGYNFKIEGHTDNIDTDVNGPWK 183
+L +D F+ A +K E + ++ ++ S + L P+ + + G+TD I +D
Sbjct: 215 FTLKSDVLFNFNKATLK-PEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSD-----A 268

Query: 184 SNWELSAARSVNMLEHILNYLDQSDVKRIENNFEVSGFGGSRPIATDDT---------PE 234
N LS R+ + +++YL + + G G S P+ + +
Sbjct: 269 YNQGLSERRA----QSVVDYLISKGIPA--DKISARGMGESNPVTGNTCDNVKQRAALID 322

Query: 235 GRAYNRRIDILI 246
A +RR++I +
Sbjct: 323 CLAPDRRVEIEV 334


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BB_RS01405FLGHOOKAP1514e-09 Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 51.5 bits (123), Expect = 4e-09
Identities = 18/89 (20%), Positives = 33/89 (37%), Gaps = 13/89 (14%)

Query: 4 SLYSGVSGLQNHQTRMDVVGNNIANVNTIGFKKGRVNFQDMISQSISGASRPTDARGGTN 63
+ + +SGL Q ++ NNI++ N G+ + I + T GG
Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTT---------IMAQANSTLGAGGW- 52

Query: 64 PKQVGLGMNVASIDTIHTQGAFQSTQKAS 92
VG G+ V+ + + + A
Sbjct: 53 ---VGNGVYVSGVQREYDAFITNQLRAAQ 78



Score = 40.7 bits (95), Expect = 1e-05
Identities = 9/48 (18%), Positives = 28/48 (58%)

Query: 394 IRSGVLEMANVDLAEQFTDMIVTQRGFQANAKTITTSDQLLQELVRLK 441
+ + ++ V+L E++ ++ Q+ + ANA+ + T++ + L+ ++
Sbjct: 499 LSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR 546


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BB_RS01415FLGHOOKFLIK401e-05 Flagellar hook-length control protein signature.
		>FLGHOOKFLIK#Flagellar hook-length control protein signature.

Length = 375

Score = 39.8 bits (92), Expect = 1e-05
Identities = 24/82 (29%), Positives = 45/82 (54%), Gaps = 5/82 (6%)

Query: 281 KLVLKPKELGSIRINLNLDSNNNLLGKIVVDNQNVKMLFDQNMHSLNKMLGESGFNASLN 340
+L L P++LG ++I+L +D N + ++V +Q+V+ + + L L ESG +
Sbjct: 260 ELRLHPQDLGEVQISLKVDDNQAQI-QMVSPHQHVRAALEAALPVLRTQLAESGIQLGQS 318

Query: 341 LFLAGENLNSFTGNFKDDSKDQ 362
++GE SF+G + S+ Q
Sbjct: 319 -NISGE---SFSGQQQAASQQQ 336


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BB_RS01420TYPE4SSCAGX290.016 Type IV secretion system CagX conjugation protein si...
		>TYPE4SSCAGX#Type IV secretion system CagX conjugation protein

signature.
Length = 522

Score = 29.0 bits (64), Expect = 0.016
Identities = 34/143 (23%), Positives = 66/143 (46%), Gaps = 20/143 (13%)

Query: 39 RYFPEFVRTKLLGETSLVFDHNSNIILDEAR--LVKEREAIDIKNQQIEKLKEDLKLKED 96
R + EF++TK L+ D L+E + L KE+EA + + Q+ +K K + + +E
Sbjct: 120 RDYQEFLKTK-----KLIVDAPDPKELEEQKKALEKEKEAKE-QAQKAQKDKREKRKEER 173

Query: 97 SLNKLEFELKQKQKDLDLKQKIIDDIINKYNDEEANILQTAVYLMNMPPEDAVKRLEDLN 156
+ N+ E + + + N N L + D ++RLED+
Sbjct: 174 AKNRANLE------------NLTNAMSNPQNLSNNKNLSELIKQQRENELDQMERLEDMQ 221

Query: 157 PELAISYMRKIEELSKKEGRLSI 179
+ + +++IEEL+KK+ ++
Sbjct: 222 EQAQANALKQIEELNKKQAEEAV 244


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BB_RS01435FLGFLIH468e-08 Flagellar assembly protein FliH signature.
		>FLGFLIH#Flagellar assembly protein FliH signature.

Length = 228

Score = 45.6 bits (107), Expect = 8e-08
Identities = 42/197 (21%), Positives = 96/197 (48%), Gaps = 22/197 (11%)

Query: 115 KESIETESNAEIER-LAREYEEKLKTDLEIAIAKGREEGYSKGY--------ESGFEDFD 165
+E+I E+ +E+ LA+ + + + IA+GR++G+ +GY E G +
Sbjct: 29 EETIIEEAEPSLEQQLAQLQMQAHEQGYQAGIAEGRQQGHKQGYQEGLAQGLEQGLAEAK 88

Query: 166 KVMRKLHVIIASLIAERKGILESSSGQIVSLVMQIAIKVIKRITDSQKDI----VLENVN 221
+H + L++E + L++ I S +MQ+A++ +++ + +++ +
Sbjct: 89 SQQAPIHARMQQLVSEFQTTLDALDSVIASRLMQMALEAARQVIGQTPTVDNSALIKQIQ 148

Query: 222 EVLKR---VKDKTQITIRVNLDDLDIVRHKKSDFISRFDIIENLEIIEDPNIGKGGCIIE 278
++L++ K Q+ RV+ DDL V +S + + DP + GGC +
Sbjct: 149 QLLQQEPLFSGKPQL--RVHPDDLQRVDDMLGATLS----LHGWRLRGDPTLHPGGCKVS 202

Query: 279 TNFGEIDARISSQLDKI 295
+ G++DA ++++ ++
Sbjct: 203 ADEGDLDASVATRWQEL 219


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BB_RS01440FLGMOTORFLIG427e-153 Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 427 bits (1100), Expect = e-153
Identities = 344/344 (100%), Positives = 344/344 (100%)

Query: 1 MEEKKEKEILDVSALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITS 60
MEEKKEKEILDVSALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITS
Sbjct: 1 MEEKKEKEILDVSALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITS 60

Query: 61 ELKDNVLLEFKELMMAQEFIQKGGIDYARELLEKSLGTQKAVDIINNLGSALQSRPFEFV 120
ELKDNVLLEFKELMMAQEFIQKGGIDYARELLEKSLGTQKAVDIINNLGSALQSRPFEFV
Sbjct: 61 ELKDNVLLEFKELMMAQEFIQKGGIDYARELLEKSLGTQKAVDIINNLGSALQSRPFEFV 120

Query: 121 RRADPANILNFIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPE 180
RRADPANILNFIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPE
Sbjct: 121 RRADPANILNFIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPE 180

Query: 181 VVREVERVLEKKLASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEI 240
VVREVERVLEKKLASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEI
Sbjct: 181 VVREVERVLEKKLASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEI 240

Query: 241 KKKMFVFEDIVLLDDRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKED 300
KKKMFVFEDIVLLDDRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKED
Sbjct: 241 KKKMFVFEDIVLLDDRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKED 300

Query: 301 MEFLGPTRRKDVEESQQKIVSLIRKLEEQGEIVISRGGEEDVLV 344
MEFLGPTRRKDVEESQQKIVSLIRKLEEQGEIVISRGGEEDVLV
Sbjct: 301 MEFLGPTRRKDVEESQQKIVSLIRKLEEQGEIVISRGGEEDVLV 344


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BB_RS01445FLGMRINGFLIF1621e-45 Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 162 bits (412), Expect = 1e-45
Identities = 112/565 (19%), Positives = 222/565 (39%), Gaps = 55/565 (9%)

Query: 23 QKIALGLIIFFVILALVFLIGFSTKSQSIALF-GVEIKDQYLLDRISQRLDRENVKYFLS 81
+I L + + +V ++ ++ LF + +D I +L + N+ Y +
Sbjct: 23 PRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQD---GGAIVAQLTQMNIPYRFA 79

Query: 82 SDGRIYLDDEKLAKKMRAILVREELVPVHMDPWALFDIDRWTITDFERSINLRRSITRAV 141
+ ++R L ++ L + L D +++ I+ F +N +R++ +
Sbjct: 80 NGSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQFSEQVNYQRALEGEL 139

Query: 142 EQHIVALDDVDAVSVNLVMPEKALFKESQEPVKASVRITPRPGSDIITNRKKVEGLVKLI 201
+ I L V + V+L MP+ +LF Q+ ASV +T PG + + ++ +V L+
Sbjct: 140 ARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRAL--DEGQISAVVHLV 197

Query: 202 QYAIEGLESDNIAIVDNSGTILNDFSNLDGIDRIDLAEKERKLKLKYEAMLRGEIDSALS 261
A+ GL N+ +VD SG +L SN G DL + + K E+ ++ I++ LS
Sbjct: 198 SSAVAGLPPGNVTLVDQSGHLL-TQSNTSG---RDLNDAQLKFANDVESRIQRRIEAILS 253

Query: 262 KVLSVDRFMIARVNVKLDTSKETTESKEYAPIELQSQDPKASYNTRKVSDSTIISSQTQK 321
++ A+V +LD + + + Y+P KA+ +R+++ S + +
Sbjct: 254 PIVGNGNV-HAQVTAQLDFANKEQTEEHYSP---NGDASKATLRSRQLNISEQVGAGYPG 309

Query: 322 KEYQGQGYSPWGPPGQEGNTPPEYQDLSD-------------ITGKYNESQEIKNVALNE 368
P P TPP Q + + + E N ++
Sbjct: 310 GVPGALSNQPAPPNEAPIATPPTNQQNAQNTPQTSTSTNSNSAGPRSTQRNETSNYEVDR 369

Query: 369 KKSTSEKEPARIVGVSLGIFVDGIWNFVYDEKGDFVIENGMRKREYKPMALEEIKNIEDV 428
++ I +S+ + V+ + + P+ +++K IED+
Sbjct: 370 TIRHTKMNVGDIERLSVAVVVNY---------------KTLADGKPLPLTADQMKQIEDL 414

Query: 429 LQSSFEYKPERGDSITVRNISFDRMNEFREIDENYFASER--FKYFLFIASIVFSLLILV 486
+ + + +RGD++ V N F ++ E F ++ L + L++
Sbjct: 415 TREAMGFSDKRGDTLNVVNSPFSAVDN--TGGELPFWQQQSFIDQLLAAGRWLLVLVVAW 472

Query: 487 FTIFFAISRELERRRRLREEELAKQAHLRRQQALMDG-----GDDIGVDDVVGGIREGDE 541
A+ +L RR EE A Q + +Q + D + R G E
Sbjct: 473 ILWRKAVRPQLTRR---VEEAKAAQEQAQVRQETEEAVEVRLSKDEQLQQRRANQRLGAE 529

Query: 542 LQSNA-ELLAREKPEDVAKLIRTWL 565
+ S ++ P VA +IR W+
Sbjct: 530 VMSQRIREMSDNDPRVVALVIRQWM 554


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BB_RS01450FLGHOOKFLIE862e-25 Flagellar hook-basal body complex protein FliE signa...
		>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE

signature.
Length = 103

Score = 85.9 bits (212), Expect = 2e-25
Identities = 15/71 (21%), Positives = 38/71 (53%)

Query: 40 TFKDVLINSITDVNKSQLNVSKVTEQAILKPSSIDVHDVVIAMSKANMNLSILKAVVERG 99
+F L ++ ++ +Q E+ L + ++DV+ M KA++++ + V +
Sbjct: 32 SFAGQLHAALDRISDTQTAARTQAEKFTLGEPGVALNDVMTDMQKASVSMQMGIQVRNKL 91

Query: 100 VKAYQDIINIR 110
V AYQ++++++
Sbjct: 92 VAAYQEVMSMQ 102


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BB_RS01455FLGHOOKAP1483e-09 Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 47.6 bits (113), Expect = 3e-09
Identities = 21/80 (26%), Positives = 35/80 (43%), Gaps = 15/80 (18%)

Query: 5 SSINVASTGLTAQRLRIDVISNNIANVSTSRTPDGGPYRRQRIIFAPRVNNPYWKGPFIP 64
S IN A +GL A + ++ SNNI++ + + Y RQ I A N
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVA------GYTRQTTIMAQ--ANSTLGA---- 49

Query: 65 DYLDNGIGQGVRVASIEKDK 84
+G GV V+ ++++
Sbjct: 50 ---GGWVGNGVYVSGVQREY 66



Score = 44.2 bits (104), Expect = 5e-08
Identities = 11/46 (23%), Positives = 23/46 (50%)

Query: 104 KKGYVELPNVNLVEEMVDMISASRAYEANSTVINSSKSMFRSALAI 149
+ VNL EE ++ + Y AN+ V+ ++ ++F + + I
Sbjct: 500 SNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BB_RS01465HTHFIS310.011 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 31.0 bits (70), Expect = 0.011
Identities = 13/70 (18%), Positives = 30/70 (42%), Gaps = 15/70 (21%)

Query: 20 KYIIGQDEAKKLVSIALVNRYIRSRLPKEIKDEVMPKNIIMIGSTGIGKTEIAR---RLS 76
++G+ A + + ++ R +++ L +++ G +G GK +AR
Sbjct: 137 MPLVGRSAAMQEI-YRVLARLMQTDLT-----------LMITGESGTGKELVARALHDYG 184

Query: 77 KLIKAPFIKV 86
K PF+ +
Sbjct: 185 KRRNGPFVAI 194


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BB_RS01480SYCDCHAPRONE290.012 Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD

chaperone signature.
Length = 168

Score = 28.7 bits (64), Expect = 0.012
Identities = 16/91 (17%), Positives = 34/91 (37%), Gaps = 2/91 (2%)

Query: 69 ARFFNLIGLEFFKLGQYGPAIEYFAKNLEINPNNYLSHFYIGVASYNLAKNLRVKDEVEK 128
+RFF +G +GQY AI ++ ++ F+ + + +
Sbjct: 70 SRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFL 129

Query: 129 YI-ILAENSFLKSLSIR-DDFKDSLFAISNM 157
++A+ + K LS R +++ M
Sbjct: 130 AQELIADKTEFKELSTRVSSMLEAIKLKKEM 160


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BB_RS01490SHAPEPROTEIN568e-11 Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein

signature.
Length = 347

Score = 55.9 bits (135), Expect = 8e-11
Identities = 50/226 (22%), Positives = 87/226 (38%), Gaps = 24/226 (10%)

Query: 160 TGSSSSSQNLVR-CVNRAGFAVDEVVLGSLASSYATLSKEEREMGVLFIDMGKGTTDIIL 218
G++ + +R AG ++ +A++ G + +D+G GTT++ +
Sbjct: 116 VGATQVERRAIRESAQGAGAREVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAV 175

Query: 219 YIDGSPYYTGVIPIGVNRVTLDIAQVWK------VPEDVAENIKITAGIAHPSILESQME 272
Y+ + IG +R I + + E AE IK G A+P ++E
Sbjct: 176 ISLNGVVYSSSVRIGGDRFDEAIINYVRRNYGSLIGEATAERIKHEIGSAYPGDEVREIE 235

Query: 273 TVIIPNLGTRPPQ--EKSRKELSVIINSRLREIFEMMKAEI------LKRGLYNKINGGI 324
V NL P+ + E+ + L I + + L + + G+
Sbjct: 236 -VRGRNLAEGVPRGFTLNSNEILEALQEPLTGIVSAVMVALEQCPPELASDISER---GM 291

Query: 325 VLTGGGALFPGISNLIEEVFNYPARIGL-PMSINGIGE----EHID 365
VLTGGGAL + L+ E P + P++ G E ID
Sbjct: 292 VLTGGGALLRNLDRLLMEETGIPVVVAEDPLTCVARGGGKALEMID 337


11BB_RS01855BB_RS01910N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
BB_RS01855-1110.815679S-ribosylhomocysteine lyase
BB_RS01860-1151.340997hypothetical protein
BB_RS018650181.730165HIT family protein
BB_RS018700172.203348magnesium transporter
BB_RS018750172.411239hypothetical protein
BB_RS01880-1183.716053BMP family protein
BB_RS01885-1204.165010BMP family protein
BB_RS018901204.489226BMP family protein
BB_RS018951195.036676BMP family protein
BB_RS019002204.96033530S ribosomal protein S7
BB_RS019051194.77176630S ribosomal protein S12
BB_RS019101194.738823DNA-directed RNA polymerase subunit beta'
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BB_RS01855LUXSPROTEIN1883e-64 Bacterial autoinducer-2 (AI-2) production protein Lu...
		>LUXSPROTEIN#Bacterial autoinducer-2 (AI-2) production protein LuxS

signature.
Length = 171

Score = 188 bits (479), Expect = 3e-64
Identities = 52/162 (32%), Positives = 86/162 (53%), Gaps = 11/162 (6%)

Query: 4 ITSFTIDHTKLN-PGIYVSR-KDTFENVIFTTIDIRIKAPNIEPIIENAAIHTIEHIGAT 61
+ SFT+DHT++N P + V++ T + T D+R APN + I+ IHT+EH+ A
Sbjct: 3 LDSFTVDHTRMNAPAVRVAKTMQTPKGDTITVFDLRFTAPN-KDILSEKGIHTLEHLYAG 61

Query: 62 LLRNN-EVWTEKIVYFGPMGCRTGFYLIIFGDYESKDLVDLVSWLFSE----IVNFSEPI 116
+RN+ + +I+ PMGCRTGFY+ + G + + D +W+ + V I
Sbjct: 62 FMRNHLNGDSVEIIDISPMGCRTGFYMSLIGTPSEQQVAD--AWIAAMEDVLKVENQNKI 119

Query: 117 PGASDKECGNYKEHNLDMAKYESSKYLQI-LNNIKEENLKYP 157
P ++ +CG H+LD AK + L++ + K + L P
Sbjct: 120 PELNEYQCGTAAMHSLDEAKQIAKNILEVGVAVNKNDELALP 161


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BB_RS01880LIPPROTEIN48748e-17 Mycoplasma P48 major surface lipoprotein signature.
		>LIPPROTEIN48#Mycoplasma P48 major surface lipoprotein signature.

Length = 428

Score = 74.3 bits (182), Expect = 8e-17
Identities = 83/340 (24%), Positives = 127/340 (37%), Gaps = 34/340 (10%)

Query: 7 IFGILLTSCFSRNGIESSS-KKIKISMLVD-GVLDDKSFNSSANEALLRLKKDFPENIEE 64
I T+ + ++++ K+K ++ D G +DDKSFN SA EAL + K IE
Sbjct: 40 ISKYTTTNANGKQVVKNAELLKLKPVLITDEGKIDDKSFNQSAFEALKAINKQ--TGIEI 97

Query: 65 VFSCAISGVYSSYVSDLDNLKRNGSDLIWLVGYMLTDASL--LVSSENPKISYGIIDPIY 122
S S+Y S L + IW++ S+ + + ++ I I
Sbjct: 98 NNVEPSSNFESAYNSALSAGHK-----IWVLNGFKHQQSIKQYIDAHREELERNQIK-II 151

Query: 123 GDDVQIPEN---LIAVVFRVEQGAFLAGYIAAKKSFSGK------IGFIGGMKGNIVDAF 173
G D I ++ F +++ AF GY A S + + GG V F
Sbjct: 152 GIDFDIETEYKWFYSLQFNIKESAFTTGY-AIASWLSEQDESKRVVASFGGGAFPGVTTF 210

Query: 174 RYGYESGAKYANKDIEIISEYSNSFSDVDIGRT-----------IASKMYSKGIDVIHFA 222
G+ G Y N+ + Y S +D G T + S + H
Sbjct: 211 NEGFAKGILYYNQKHKSSKIYHTSPVKLDSGFTAGEKMNTVINNVLSSTPADVKYNPHVI 270

Query: 223 AGLAGIGVIETAKNLGDGYYVIGADQDQSY-LAPKNFITSVIKNIGDALYLITGEYIKNN 281
+AG ET + G YVIG D DQ +TSV+K+I A+Y + I
Sbjct: 271 LSVAGPATFETVRLANKGQYVIGVDSDQGMIQDKDRILTSVLKHIKQAVYETLLDLILEK 330

Query: 282 NVWEGGKVVQMGLRDGVIGLPNANEFEYIKVLERKIINKE 321
VV+ D + ++I V E N E
Sbjct: 331 EEGYKPYVVKDKKADKKWSHFGTQKEKWIGVAENHFSNTE 370


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BB_RS01885LIPPROTEIN48702e-15 Mycoplasma P48 major surface lipoprotein signature.
		>LIPPROTEIN48#Mycoplasma P48 major surface lipoprotein signature.

Length = 428

Score = 69.7 bits (170), Expect = 2e-15
Identities = 66/261 (25%), Positives = 99/261 (37%), Gaps = 29/261 (11%)

Query: 30 KVSLIID-GTFDDKSFNESALNGVKKVKEEFKIELVLKESSSNSYLSDLEGLKDAGSDLI 88
K LI D G DDKSFN+SA +K + ++ IE+ E SSN + S AG +
Sbjct: 63 KPVLITDEGKIDDKSFNQSAFEALKAINKQTGIEINNVEPSSN-FESAYNSALSAGHKIW 121

Query: 89 WLIGYRFS------DVAKVAALQNPDMKYAIID-PIYSNDPIPANLVGMTFRAQEGAFLT 141
L G++ A L+ +K ID I + + F +E AF T
Sbjct: 122 VLNGFKHQQSIKQYIDAHREELERNQIKIIGIDFDIETEYKW---FYSLQFNIKESAFTT 178

Query: 142 GYIAAKLSKTGK-----IGFLGGIEGEIVDAFRYGYEAGAKYANKD-----------IKI 185
GY A + GG V F G+ G Y N+ +K+
Sbjct: 179 GYAIASWLSEQDESKRVVASFGGGAFPGVTTFNEGFAKGILYYNQKHKSSKIYHTSPVKL 238

Query: 186 STQYIGSFADLEAGRSVATRMYSDEIDIIHHAAGLGGIGAIEVAKELGSGHYIIGVDEDQ 245
+ + +V + +D H + G E + G Y+IGVD DQ
Sbjct: 239 DSGFTAGEKMNTVINNVLSSTPADVKYNPHVILSVAGPATFETVRLANKGQYVIGVDSDQ 298

Query: 246 AY-LAPDNVITSTTKDVGRAL 265
D ++TS K + +A+
Sbjct: 299 GMIQDKDRILTSVLKHIKQAV 319


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BB_RS01890LIPPROTEIN48505e-09 Mycoplasma P48 major surface lipoprotein signature.
		>LIPPROTEIN48#Mycoplasma P48 major surface lipoprotein signature.

Length = 428

Score = 50.4 bits (120), Expect = 5e-09
Identities = 59/296 (19%), Positives = 103/296 (34%), Gaps = 50/296 (16%)

Query: 18 FKSNKKSIKSDKV----VVGVLAHGSFYDKGYNQSVHDGVVKLRDNFGIKLITKSLRPYP 73
+ K+ +K+ ++ V + G DK +NQS + + + GI++
Sbjct: 47 NANGKQVVKNAELLKLKPVLITDEGKIDDKSFNQSAFEALKAINKQTGIEINNVE----- 101

Query: 74 IEGKRLLTVDEAMTEDAYEVQKNPLNLFW-LIGYRFSDLSVKL------SYERPDIYYGI 126
+ E AY + + W L G++ + ER I
Sbjct: 102 ---------PSSNFESAYNSALSAGHKIWVLNGFKHQQSIKQYIDAHREELERNQI---K 149

Query: 127 IDAFDYGDIQVPKNSLAIKFRNEEAAFLAGYIAA-----KMSRKEKIGFLTGPMSEHVKD 181
I D+ K +++F +E+AF GY A + K + G V
Sbjct: 150 IIGIDFDIETEYKWFYSLQFNIKESAFTTGYAIASWLSEQDESKRVVASFGGGAFPGVTT 209

Query: 182 FKFGFKAGIFYAN---PKLRLVSK---KAPSLFDKEKGKAMAL-------FMYKEDKVGV 228
F GF GI Y N ++ K S F + + + V
Sbjct: 210 FNEGFAKGILYYNQKHKSSKIYHTSPVKLDSGFTAGEKMNTVINNVLSSTPADVKYNPHV 269

Query: 229 IFPIAGITGLGVYDAAKELGPKYYVIGLNQDQSYI-APQNVITSIIKDIGKVIYSI 283
I +AG ++ + YVIG++ DQ I ++TS++K I + +Y
Sbjct: 270 ILSVAGPA---TFETVRLANKGQYVIGVDSDQGMIQDKDRILTSVLKHIKQAVYET 322


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BB_RS01895LIPPROTEIN48603e-12 Mycoplasma P48 major surface lipoprotein signature.
		>LIPPROTEIN48#Mycoplasma P48 major surface lipoprotein signature.

Length = 428

Score = 60.0 bits (145), Expect = 3e-12
Identities = 55/272 (20%), Positives = 97/272 (35%), Gaps = 31/272 (11%)

Query: 28 KTVSLIVDGAFDDKGFNESSSKAIRKLKADLNINIIEKASTGNSYLGDIANLEDGNSNLI 87
K V + +G DDK FN+S+ +A++ + I I +++ + +
Sbjct: 63 KPVLITDEGKIDDKSFNQSAFEALKAINKQTGIE-INNVEPSSNFESAYNSALSAGHKIW 121

Query: 88 WGIGFRLSDILFQ---RASENVSVNYAIIEGV-YDEIQIPKNLLNISFRSEEVAFLAGY- 142
GF+ + Q E + N I G+ +D K ++ F +E AF GY
Sbjct: 122 VLNGFKHQQSIKQYIDAHREELERNQIKIIGIDFDIETEYKWFYSLQFNIKESAFTTGYA 181

Query: 143 ----FASKASKTGKIGFVGGVRGKVLESFMYGYEAGAKYANSNIKVVSQYVGTFGDFGLG 198
+ + + GG + +F G+ G Y N K Y + G
Sbjct: 182 IASWLSEQDESKRVVASFGGGAFPGVTTFNEGFAKGILYYNQKHKSSKIYHTSPVKLDSG 241

Query: 199 ---------------RSTASNMYRDGVDIIFAAAGLSGIGVIEAAKELGPDHYIIGVDQD 243
ST +++ + I+ A + E + Y+IGVD D
Sbjct: 242 FTAGEKMNTVINNVLSSTPADVKYNPHVILSVAGPATF----ETVRLANKGQYVIGVDSD 297

Query: 244 QSY-LAPNNVIVSAVKKVDSLMYS-LTKKYLE 273
Q + ++ S +K + +Y L LE
Sbjct: 298 QGMIQDKDRILTSVLKHIKQAVYETLLDLILE 329


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BB_RS01910RTXTOXIND310.037 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 31.0 bits (70), Expect = 0.037
Identities = 7/17 (41%), Positives = 14/17 (82%)

Query: 1192 KHLLVRDGDVVKAGDML 1208
K ++V++G+ V+ GD+L
Sbjct: 108 KEIIVKEGESVRKGDVL 124


12BB_RS03910BB_RS03930N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
BB_RS039101220.747456muramidase
BB_RS039151171.366242flagellar P-ring protein
BB_RS039200132.605427hypothetical protein
BB_RS039250113.460755flagellar basal-body rod protein FlgG
BB_RS039303162.796587flagellar basal-body rod protein FlgF
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BB_RS03910FLGFLGJ487e-10 Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 47.8 bits (113), Expect = 7e-10
Identities = 21/69 (30%), Positives = 39/69 (56%), Gaps = 2/69 (2%)

Query: 34 DLRKASLEFEAMFIKQMLESMKKTLNKDQNLLNGGQVEEIFEDMLCEQRAKQMAQAQSFG 93
++R + + E MF++ ML+SM+ L KD L + ++ M +Q A+QM + G
Sbjct: 32 NIRPVARQVEGMFVQMMLKSMRDALPKDG--LFSSEHTRLYTSMYDQQIAQQMTAGKGLG 89

Query: 94 LADLIYNQL 102
LA+++ Q+
Sbjct: 90 LAEMMVKQM 98


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BB_RS03915FLGPRINGFLGI2579e-86 Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 257 bits (658), Expect = 9e-86
Identities = 83/352 (23%), Positives = 153/352 (43%), Gaps = 60/352 (17%)

Query: 35 SLSESVKLKEIADIYPTNTNFLTGIGIVAGLAGKGDSIKQKDL----IIKILEENNIINE 90
+ +++ ++K+IA + N L G G+V GL G GDS++ + +L+ I +
Sbjct: 24 AQADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDSLRSSPFTEQSMRAMLQNLGITTQ 83

Query: 91 IGSNNIESKNIALVNVSLQVKGNTIKGSKHKACVASILDSKDLTNGILLKTNLKNKEGEI 150
G +N +KNIA V V+ + GS+ V+S+ D+ L G L+ T+L +G+I
Sbjct: 84 GGQSN--AKNIAAVMVTANLPPFASPGSRVDVTVSSLGDATSLRGGNLIMTSLSGADGQI 141

Query: 151 IAIASGITQPNN-KLKGSGYTI-----------DSVIINEN--QNINHSYNIILKKGN-- 194
A+A G N +G T+ + II S N++L+ N
Sbjct: 142 YAVAQGALIVNGFSAQGDAATLTQGVTTSARVPNGAIIERELPSKFKDSVNLVLQLRNPD 201

Query: 195 YTLINRIHKILTS---KKINNKI---KSDSTIEIEAKNIS----LLEEIENIKIETN--P 242
++ R+ ++ + + + I + I ++ ++ L+ EIEN+ +ET+
Sbjct: 202 FSTAVRVADVVNAFARARYGDPIAEPRDSQEIAVQKPRVADLTRLMAEIENLTVETDTPA 261

Query: 243 KILIDKKNGIILASENAKI-------GTFTFSIEKDNQNI----FLSKNNKTTIQVNSMK 291
K++I+++ G I+ + +I GT T + + Q I F Q + M
Sbjct: 262 KVVINERTGTIVIGADVRISRVAVSYGTLTVQVTESPQVIQPAPFSRGQTAVQPQTDIMA 321

Query: 292 LNE----FILK-----------NSNNLSNKELIQIIQAAQKINKLNGELILE 328
+ E I++ NS L +I I+Q + L EL+L+
Sbjct: 322 MQEGSKVAIVEGPDLRTLVAGLNSIGLKADGIIAILQGIKSAGALQAELVLQ 373


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BB_RS03925FLGHOOKAP1452e-07 Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 44.9 bits (106), Expect = 2e-07
Identities = 10/44 (22%), Positives = 23/44 (52%)

Query: 220 ILEMSNVSIAEEMVTMIVAQRAYEINSKAIQTSDNMLGIANNLK 263
+S V++ EE + Q+ Y N++ +QT++ + N++
Sbjct: 503 QQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR 546



Score = 39.6 bits (92), Expect = 1e-05
Identities = 19/79 (24%), Positives = 34/79 (43%), Gaps = 14/79 (17%)

Query: 5 LWTAASGMTAQQYNVDTIANNLSNVNTTGFKKIRAEFEDLIYQTHNRAGTPATENTLRPL 64
+ A SG+ A Q ++T +NN+S+ N G+ + A N+
Sbjct: 4 INNAMSGLNAAQAALNTASNNISSYNVAGYTRQTT--------------IMAQANSTLGA 49

Query: 65 GNQVGHGTKIAATQRIFEQ 83
G VG+G ++ QR ++
Sbjct: 50 GGWVGNGVYVSGVQREYDA 68


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
BB_RS03930FLGHOOKAP1438e-07 Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 43.0 bits (101), Expect = 8e-07
Identities = 14/64 (21%), Positives = 28/64 (43%)

Query: 214 DTKTSGKAQEIDISLRPKIETETLEASNVNAVKEMVLMIEINRAYEANQKTIQTEDSLLG 273
T T + ++ ++ + S VN +E + + Y AN + +QT +++
Sbjct: 481 KTATLKTSSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFD 540

Query: 274 KLIN 277
LIN
Sbjct: 541 ALIN 544



Score = 38.4 bits (89), Expect = 2e-05
Identities = 13/39 (33%), Positives = 23/39 (58%)

Query: 4 GIYTAASGMMAERRKLDTVSNNLANIDLIGYKKDLSIQK 42
I A SG+ A + L+T SNN+++ ++ GY + +I
Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMA 41



 
Contact Sachin Pundhir for Bugs/Comments.
For best view 1024 x 768 resolution & IE 6.0 or above recommended.