PredictBias

Identification of genomic and pathogenicity islands in prokaryotic genomes
 
A) Input parameters
Genome: 358.gbk
Threshold dinucleotide bias: 2
Threshold codon bias: 4
Threshold %GC bias: 3
E-value (RPSBlast): 0.05
Genome (non-pathogenic): (none given)
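
The E-value (RPSBlast) parameter of 0.05 acts as a cutoff on the virulence-factor hits reported further down this page. The sketch below shows one way such a cutoff could be applied to hits copied from the tables on this page. The function names `parse_evalue` and `filter_hits` are hypothetical, and the "at or below the threshold" comparison is an assumption, not something the page states. The helper also handles the BLAST shorthand `e-130`, which Python's `float` does not accept directly.

```python
def parse_evalue(text):
    """Convert a BLAST-style E-value string to float.

    BLAST reports print very small values as 'e-130' (no leading digit);
    this is read here as 1e-130.
    """
    text = text.strip()
    if text.lower().startswith("e"):
        text = "1" + text
    return float(text)


# (locus tag, profile, E-value string) copied from hit tables on this page
HITS = [
    ("BBta_0082", "YERSSTKINASE", "0.005"),
    ("BBta_0101", "DNABINDINGHU", "2e-35"),
    ("BBta_0458", "HTHFIS", "e-130"),
    ("BBta_1014", "PF06057", "0.041"),
]


def filter_hits(hits, threshold=0.05):
    """Keep hits whose E-value is at or below the threshold (assumed rule)."""
    return [hit for hit in hits if parse_evalue(hit[2]) <= threshold]


# All four hits pass the page's 0.05 cutoff; a stricter cutoff keeps fewer.
print([hit[0] for hit in filter_hits(HITS)])
print([hit[0] for hit in filter_hits(HITS, threshold=1e-5)])
```

Lowering the threshold to 1e-5 in this sketch leaves only the two strong matches (BBta_0101 and BBta_0458), which illustrates how many of the listed hits sit close to the 0.05 limit.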
 
 
C) Potential GIs and PAIs in NC_009485
S.No  Start      End        Bias  Virulence  Insertion elements  Prediction
1     BBta_0082  BBta_0103  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BBta_0082  0       15       3.278329   2-octaprenylphenol hydroxylase
BBta_0083  0       14       3.192181   bifunctional
BBta_0084  1       14       3.562818   deoxyuridine 5'-triphosphate
BBta_0085  0       12       3.449551   sensor histidine kinase
BBta_0086  1       13       3.459791   hypothetical protein
BBta_0087  0       12       3.224246   hypothetical protein
BBta_0088  0       11       3.203437   nucleotidyl transferase family protein
BBta_0089  1       10       3.027492   DNA helicase/exodeoxyribonuclease V subunit B
BBta_0090  -1      9        1.905311   DNA helicase/exodeoxyribonuclease V subunit A
BBta_0091  0       11       1.145325   thioredoxin 1, redox factor
BBta_0092  -1      11       1.033402   ATP-dependent DNA ligase
BBta_0093  1       11       1.042726   hypothetical protein
BBta_0094  -1      12       0.960754   bifunctional folylpolyglutamate
BBta_0095  -1      13       0.683471   acetyl-CoA carboxylase subunit beta
BBta_0096  0       11       0.665365   tryptophan synthase subunit alpha
BBta_0097  2       14       -0.843005  tryptophan synthase subunit beta
BBta_0098  4       13       -0.390275  N-(5'-phosphoribosyl)anthranilate isomerase
BBta_0099  2       14       0.569000   hypothetical protein
BBta_0101  3       14       0.440337   integration host factor subunit beta
BBta_0102  3       15       0.188045   S49 family peptidase
BBta_0103  2       14       0.144375   30S ribosomal protein S1
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_0082  YERSSTKINASE  32     0.005    Yersinia serine/threonine protein kinase signature.
		>YERSSTKINASE#Yersinia serine/threonine protein kinase signature.

Length = 732

Score = 32.4 bits (73), Expect = 0.005
Identities = 15/44 (34%), Positives = 23/44 (52%), Gaps = 1/44 (2%)

Query: 278 HALRDGFFHADMHPGNLFLDK-EGRLVAVDFGIMGRLGMKERRF 320
H + G H D+ PGN+ D+ G V +D G+ R G + + F
Sbjct: 260 HLAKAGVVHNDIKPGNVVFDRASGEPVVIDLGLHSRSGEQPKGF 303
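
Each alignment block on this page carries a statistics line of the form `Identities = 15/44 (34%), Positives = 23/44 (52%), Gaps = 1/44 (2%)`. As a worked check of that arithmetic, the sketch below parses such a line and recomputes the percentages. The function names are hypothetical. The use of truncation rather than rounding is inferred from the values on this page (for example, 8/29 is about 27.6% but is printed as 27%), not from any tool documentation.

```python
import re

# Matches e.g. "Identities = 15/44 (34%)"; the Gaps field is optional in a line.
STATS = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_alignment_stats(line):
    """Return {name: (count, alignment_length, reported_percent)}."""
    return {
        name: (int(count), int(length), int(percent))
        for name, count, length, percent in STATS.findall(line)
    }


def truncated_percent(count, length):
    """Integer percentage, truncated toward zero (assumed reporting rule)."""
    return 100 * count // length


line = "Identities = 15/44 (34%), Positives = 23/44 (52%), Gaps = 1/44 (2%)"
stats = parse_alignment_stats(line)
for name, (count, length, reported) in stats.items():
    # Print the recomputed value next to the value reported on the page.
    print(name, truncated_percent(count, length), reported)
```

Low identity over a short span, as in this 44-residue match, is worth reading together with the E-value before treating a hit as evidence of virulence.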


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_0083  TONBPROTEIN   30     0.016    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 30.0 bits (67), Expect = 0.016
Identities = 8/29 (27%), Positives = 10/29 (34%)

Query: 41 EPADPSEPPSAPPTPAPPPEPMVPQPGVA 69
+ P P P P P P P P+
Sbjct: 59 QAVQPPPEPVVEPEPEPEPIPEPPKEAPV 87


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_0085  DNABINDINGHU  28     0.035    Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 28.1 bits (63), Expect = 0.035
Identities = 21/85 (24%), Positives = 32/85 (37%), Gaps = 17/85 (20%)

Query: 668 IDAGAMKLELGPVDAAKAIEAAAEGVQDRLAT-DRIRLKVQVDPNVGTFVGDERRVVQVL 726
I A EL D+A A++A V LA ++++L G F ER
Sbjct: 8 IAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQL-----IGFGNFEVRER------ 56

Query: 727 YNLLANAVGFSPQ-DSTVLVSARRT 750
A G +PQ + + A +
Sbjct: 57 ----AARKGRNPQTGEEIKIKASKV 77


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_0087  TACYTOLYSIN   26     0.019    Bacterial thiol-activated pore-forming cytolysin sig...
		>TACYTOLYSIN#Bacterial thiol-activated pore-forming cytolysin signature.

Length = 574

Score = 26.5 bits (58), Expect = 0.019
Identities = 12/31 (38%), Positives = 16/31 (51%)

Query: 65 GLAYRRCELAWVNGDQIGVTFLKQGKKKANK 95
L Y E+ NG+ I K+G KKA+K
Sbjct: 118 SLNYNELEVLAKNGETIENFVPKEGVKKADK 148


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_0101  DNABINDINGHU  110    2e-35    Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 110 bits (277), Expect = 2e-35
Identities = 33/88 (37%), Positives = 55/88 (62%), Gaps = 1/88 (1%)

Query: 3 KSELVQRIAEHNPHLYQRDVENIVNAILDEIVAALARGDRVELRGFGAFSVKHRPARAGR 62
K +L+ ++AE L ++D V+A+ + + LA+G++V+L GFG F V+ R AR GR
Sbjct: 4 KQDLIAKVAEAT-ELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62

Query: 63 NPRTGAHVPVDQKSVPFFKTGKEMRERL 90
NP+TG + + VP FK GK +++ +
Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAV 90


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_0103  SHIGARICIN    31     0.015    Ribosome inactivating protein family signature.
		>SHIGARICIN#Ribosome inactivating protein family signature.

Length = 289

Score = 30.6 bits (69), Expect = 0.015
Identities = 9/26 (34%), Positives = 9/26 (34%)

Query: 104 EESWGKLEKAFQNNEKVNGVIFNQVK 129
E SW L K Q NG V
Sbjct: 212 ENSWSALSKQIQIASTNNGQFETPVV 237


2     BBta_0141  BBta_0147  Y     N          N                   Genomic Island
LocusTag   DNBias  CDNBias  %GCBias    Product
BBta_0141  2       12       -1.320102  phosphoribosyl-ATP pyrophosphatase
BBta_0142  2       14       -0.958572  imidazole glycerol phosphate synthase subunit
BBta_0143  2       14       -1.789829  hypothetical protein
BBta_0144  2       13       -0.311662  hypothetical protein
BBta_0145  2       12       0.184179   1-(5-phosphoribosyl)-5-[(5-
BBta_0146  2       13       -0.500660  imidazole glycerol phosphate synthase subunit
BBta_0147  2       10       -0.479132  hypothetical protein
3     BBta_0275  BBta_0291  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BBta_0275  2       12       2.704174   hypothetical protein
BBta_0276  0       13       2.519992   cytochrome P450 hydroxylase
BBta_0277  1       11       2.598262   hypothetical protein
BBta_0278  1       14       2.179679   major facilitator superfamily permease
BBta_0279  1       12       -0.680261  hypothetical protein
BBta_0280  -2      13       1.710441   hypothetical protein
BBta_0282  -2      12       3.302089   polysaccharide deacetylase domain-containing
BBta_0283  -1      10       3.862610   hypothetical protein
BBta_0284  0       10       3.673838   hypothetical protein
BBta_0285  1       10       4.454754   hypothetical protein
BBta_0286  1       12       5.661925   *HemY domain-containing protein
BBta_0287  0       10       4.329655   hypothetical protein
BBta_0288  0       12       2.698065   uroporphyrinogen-III synthase
BBta_0289  -1      14       2.448709   DNA-binding/iron metalloprotein/AP endonuclease
BBta_0290  -1      17       2.018737   NAD(P)H-dependent glycerol-3-phosphate
BBta_0291  2       18       0.397307   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_0278  TCRTETA       58     5e-11    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 57.5 bits (139), Expect = 5e-11
Identities = 66/307 (21%), Positives = 110/307 (35%), Gaps = 29/307 (9%)

Query: 65 ALMLPVMFISMP-AGAIADMYDRRIVALVSLLVALGGAVTLTVLAWLGLVTPERLLALCF 123
AL + F P GA++D + RR V LVSL G AV ++A + +L +
Sbjct: 50 ALYALMQFACAPVLGALSDRFGRRPVLLVSLA---GAAVDYAIMATAPFLW---VLYIGR 103

Query: 124 VIGSGMALMGPAWQSSVSEQVPPETLPAAVALNGISYNIARSFGPAIGGVVVAAAGAVAA 183
++ G + +++ + + GP +GG++ +
Sbjct: 104 IVAGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPF 163

Query: 184 FAANALLYLPLLAALFLWRRVVEPSRLPREPLNRAIVSGVRYIIHSPSIRIVLIRTLVTG 243
FAA AL L L FL + R P ++ R+ + ++ +
Sbjct: 164 FAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQ 223

Query: 244 VIGGSISALMPLVARDLLHGGAQTYGIMLGAFG-MGAVVGALNIGEIRKRLSGEAAIRSC 302
++G +AL + D H A T GI L AFG + ++ A+ G + RL A+
Sbjct: 224 LVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALM-- 281

Query: 303 ALTLGAATAAVGLSGEPVLTAAALVVAGAVWMLAVALFNIGVQLSAPRWVAGRALAAFQA 362
LG G L A WM + + G + A QA
Sbjct: 282 ---LGMIADGTGY--------ILLAFATRGWMAFPIMVLLA--------SGGIGMPALQA 322

Query: 363 AISGGIA 369
+S +
Sbjct: 323 MLSRQVD 329


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_0283  PF05616       33     3e-04    Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 32.8 bits (74), Expect = 3e-04
Identities = 28/126 (22%), Positives = 42/126 (33%), Gaps = 5/126 (3%)

Query: 10 IALASATTAQAGGTRSLSLAPNDTAAPAPRPAYVQQAGEVTITPAP-VPAGAAGQPATQP 68
+ +A T G + P A R + +V + P P + G+A P QP
Sbjct: 268 VEVAPGTKVNMGPVTDRNGNPVQVVATFGRDSQGNTTVDVQVIPRPDLTPGSAEAPNAQP 327

Query: 69 VQPAPAAAAPVANSTPATQPAAAAAPTPD----PTGSVQKSSRRSTSAAKSARAGKPRGK 124
+ A P N P P P PD P + + T A +P G+
Sbjct: 328 LPEVSPAENPANNPAPNENPGTRPNPEPDPDLNPDANPDTDGQPGTRPDSPAVPDRPNGR 387

Query: 125 SWTEAR 130
E +
Sbjct: 388 HRKERK 393


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_0286  IGASERPTASE   43     4e-06    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 43.1 bits (101), Expect = 4e-06
Identities = 48/243 (19%), Positives = 73/243 (30%), Gaps = 47/243 (19%)

Query: 421 QTPVASLPSERNSVIESSEFADAVLAPPTRATIPEGLAEPP------------------- 461
Q V S+PS + E AP T + E +AE
Sbjct: 1004 QADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTA 1063

Query: 462 --REVARQSQLDVIAPTPQDNVPPRPAETAVSDPAV--DTPVQEAVDKPVDRPVDNPVEK 517
REVA++++ +V A T + V +ET + +T E +K K
Sbjct: 1064 QNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEK----------AK 1113

Query: 518 TEDNRLIEVSSATPAEPAPPLQASGPTDADSETPEAERDEA------------ESEAEAP 565
E + EV T P Q T P E D ++ E P
Sbjct: 1114 VETEKTQEVPKVT--SQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQP 1171

Query: 566 AKPADPVATAPTPLFRTRSDLGKPPETPLQTIVPIMRPPDDPGVDDEDQTRDEFAEQIAP 625
AK P T + E P T +P + ++ + R + + P
Sbjct: 1172 AKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVP 1231

Query: 626 KAQ 628

Sbjct: 1232 HNV 1234


4     BBta_0425  BBta_0477  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BBta_0425  2       16       -0.121603  hypothetical protein
BBta_0426  3       14       -0.765493  50S ribosomal protein L27
BBta_0427  2       14       -1.032977  50S ribosomal protein L21
BBta_0428  2       14       -1.392040  hypothetical protein
BBta_0429  3       17       -1.377802  31 kDa outer-membrane immunogenic protein
BBta_0430  1       15       0.262675   *resolvase
BBta_0431  2       14       1.140399   hypothetical protein
BBta_0432  1       15       1.189391   hypothetical protein
BBta_0433  1       15       1.720529   hypothetical protein
BBta_0434  2       15       1.702995   prophage CP4-57 regulatory protein
BBta_0435  1       13       0.947808   conjugal transfer relaxase TraA
BBta_0436  2       21       -3.129215  conjugual transfert protein, traC
BBta_0437  3       21       -2.470387  conjugal transfer protein traD
BBta_0438  4       21       -2.665539  transposase
BBta_0439  4       30       -5.154687  transposase
BBta_0440  3       29       -4.968951  transposase
BBta_0441  2       29       -5.361658  transposase
BBta_0442  1       27       -3.809375  transposase
BBta_0443  1       28       -4.498546  transposase
BBta_0444  0       27       -3.995996  HTH-type transcriptional regulator cbbR
BBta_0445  0       24       -3.605234  D-fructose 1,6-bisphosphatase
BBta_0446  0       21       -3.521099  phosphoribulokinase
BBta_0447  0       19       -2.965713  transketolase
BBta_0448  1       19       -4.009511  glyceraldehyde-3-phosphate dehydrogenase
BBta_0449  1       21       -3.913027  phosphoglycerate kinase
BBta_0450  2       19       -3.714705  fructose-1,6-bisphosphate aldolase
BBta_0451  3       20       -2.760281  ribulose bisophosphate carboxylase
BBta_0452  3       21       -2.493438  ribulose 1,5-bisphosphate carboxylase small
BBta_0453  3       23       -3.311721  CbbX-like protein
BBta_0454  4       25       -3.492028  ribulose-5-phosphate 3-epimerase
BBta_0455  4       24       -3.921757  fructose-bisphosphate aldolase
BBta_0456  4       26       -4.884476  phosphoglycolate phosphatase
BBta_0457  4       27       -5.280375  haloacid dehalogenase
BBta_0458  4       28       -5.481646  two-component response regulator protein
BBta_0460  5       27       -5.009294  uptake hydrogenase accessory protein hupU
BBta_0461  1       29       -4.932663  uptake hydrogenase large subunit
BBta_0462  3       32       -4.390985  hypothetical protein
BBta_0463  5       33       -4.300651  hypothetical protein
BBta_0464  3       29       -4.100498  hydrogenase maturation protease HyaD
BBta_0465  0       25       -2.984691  hypothetical protein
BBta_0466  0       24       -2.433528  hypothetical protein
BBta_0467  0       23       -1.961674  hypothetical protein
BBta_0468  -1      22       -2.216539  hypothetical protein
BBta_0469  -1      21       -2.715417  nitrogen-fixing protein NifU
BBta_0470  -1      20       -2.292107  hypothetical protein
BBta_0471  -1      21       -2.738711  carbamoyl phosphate phosphatase, (NiFe)
BBta_0472  -1      23       -3.867897  hydrogenase expression/formation protein HypC
BBta_0473  -1      20       -3.583901  phosphoheptose isomerase
BBta_0474  -1      19       -3.552638  hydrogenase expression/formation
BBta_0475  0       21       -1.814153  hydrogenase maturation
BBta_0476  1       23       -2.591812  hydrogenase formation/expression
BBta_0477  3       23       -2.516902  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_0429  OMPADOMAIN    48     1e-08    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 47.6 bits (113), Expect = 1e-08
Identities = 48/211 (22%), Positives = 72/211 (34%), Gaps = 34/211 (16%)

Query: 15 ATALAMTGARAADMPATTYKAPAYASAPAPFSWTGFYAGANVGGAFTGNDSVTNLAGGNG 74
A A+A+ A A A A +Y GA +G + + T NG
Sbjct: 5 AIAIAVALAGFA------------TVAQAAPKDNTWYTGAKLGWSQYHD---TGFINNNG 49

Query: 75 G-----KLSGVIGGMQAGYNYQVSPLFVVGIENDLDFTGLSRQGDLVNPAVSVPWLTTGR 129
+G GG YQV+P G E D+ G V
Sbjct: 50 PTHENQLGAGAFGG------YQVNPYV--GFEMGYDWLGRMPYKGSVENGAYKAQGVQLT 101

Query: 130 ARAGFTMLDQRLFLYGTAGLA-----AGELKDGPVHKVKMGWTAGGGAEWAFLPKWSAKL 184
A+ G+ + D L +Y G G H + GG E+A P+ + +L
Sbjct: 102 AKLGYPITDD-LDIYTRLGGMVWRADTKSNVYGKNHDTGVSPVFAGGVEYAITPEIATRL 160

Query: 185 EYLYTDLKHDALPDWRAAKIHSVRVGLNYHF 215
EY +T+ DA + +G++Y F
Sbjct: 161 EYQWTNNIGDAHTIGTRPDNGMLSLGVSYRF 191


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_0438  DPTHRIATOXIN  27     0.015    Diphtheria toxin signature.
		>DPTHRIATOXIN#Diphtheria toxin signature.

Length = 567

Score = 26.6 bits (58), Expect = 0.015
Identities = 20/59 (33%), Positives = 28/59 (47%), Gaps = 10/59 (16%)

Query: 33 HQRVVRHGHLPEREVMTGIGPV---------AVRQPRVRDREAAATDPDRTRFSPSILP 82
HQ + H L E + +TG PV AV +V D E A + ++T + SILP
Sbjct: 283 HQTALEHPELSELKTVTGTNPVFAGANYAAWAVNVAQVIDSE-TADNLEKTTAALSILP 340


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_0447  TCRTETB       31     0.022    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 30.6 bits (69), Expect = 0.022
Identities = 19/66 (28%), Positives = 27/66 (40%), Gaps = 1/66 (1%)

Query: 105 SRTAGHPEFGHAAGIETTTGPLGQGIATAV-GMALSERMLNARFGDDLVDHHTYVIAGDG 163
S + E G + T L +G A+ G LS +L+ R VD TY+ +
Sbjct: 374 SSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLLDQRLLPMEVDQSTYLYSNLL 433

Query: 164 CLMEGI 169
L GI
Sbjct: 434 LLFSGI 439


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_0458  HTHFIS        379    e-130    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 379 bits (975), Expect = e-130
Identities = 144/501 (28%), Positives = 225/501 (44%), Gaps = 53/501 (10%)

Query: 5 GTVLIVARRGGDWTDCLDVLS---ERYAYRVVLVGSVAEALTTMSGIHVDLAVAEDRKDE 61
T+L+ D VL+ R Y V + + A ++ DL V + +
Sbjct: 4 ATILVA----DDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPD 59

Query: 62 SIGLDFLTRLRVSHPEIMRVYVATGGSPLSYGTLTKAAIYQFLLTPLDATQLGLVVERAL 121
D L R++ + P++ + ++ + ++ ++ Y +L P D T+L ++ RAL
Sbjct: 60 ENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119

Query: 122 EARELARRHRILSREFKISGDVLRLGERRDLLFRPESHQFEKLVYASEKMAELCDLAKQA 181
+ +S LV S M E+ + +
Sbjct: 120 AEPKRRPSKL-----------------------EDDSQDGMPLVGRSAAMQEIYRVLARL 156

Query: 182 ATTELPILIQGETGTGKELLARAIHYNSPRRTSPLLIQNCGGMPDDLLQSELFGHKRGAF 241
T+L ++I GE+GTGKEL+ARA+H RR P + N +P DL++SELFGH++GAF
Sbjct: 157 MQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAF 216

Query: 242 TGAISDRLGLFRAADGGTVFLDEISEVSPSFQVSLLRFLQEGEVKPLGSDKVEHCSVRII 301
TGA + G F A+GGT+FLDEI ++ Q LLR LQ+GE +G VRI+
Sbjct: 217 TGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIV 276

Query: 302 AASNRSLKDMVARREFRQDLYFRLKGFELEVPPLRERRDDIAPLSEFFAAKHADGMGRKI 361
AA+N+ LK + + FR+DLY+RL L +PPLR+R +DI L F + A+ G +
Sbjct: 277 AATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQ-AEKEGLDV 335

Query: 362 LGITASTIEKLSACDFPGNVRELENEIRRMVALAKDGEY--------------------- 400
+E + A +PGNVRELEN +RR+ AL
Sbjct: 336 KRFDQEALELMKAHPWPGNVRELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKA 395

Query: 401 -ITTRNMSASLLASAAARSRSDGPRGFVPEGATLKDRVESLEKQIVGSALLRNHWNQSRT 459
+ ++S S R +P + +E ++ +AL NQ +
Sbjct: 396 AARSGSLSISQAVEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKA 455

Query: 460 AAELGLSRVGLSNKIKRYNLE 480
A LGL+R L KI+ +
Sbjct: 456 ADLLGLNRNTLRKKIRELGVS 476


5     BBta_0516  BBta_0525  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BBta_0516  2       13       -2.481752  multi-sensor hybrid histidine kinase
BBta_0517  4       16       -2.615255  response regulator receiver
BBta_0518  2       12       -0.821180  response regulator receiver
BBta_0519  3       12       -0.287308  CRP/FNR family transcriptional regulator
BBta_0521  3       13       -0.267598  hypothetical protein
BBta_0522  3       15       -0.794399  hypothetical protein
BBta_0523  1       11       -1.088198  flagellar biosynthesis repressor FlbT
BBta_0524  1       10       -1.021528  chemotaxis protein CheA
BBta_0525  1       16       -3.485866  CheW protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_0516  HTHFIS        79     2e-17    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 79.1 bits (195), Expect = 2e-17
Identities = 30/138 (21%), Positives = 53/138 (38%), Gaps = 3/138 (2%)

Query: 586 RVLVVDDNPTNRLVATKMLKDFDIQTDTACDGAEAVTAASRFNYDLILMDVRMPEMDGFQ 645
+LV DD+ R V + L + A + + DL++ DV MP+ + F
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 646 ATRTIRARGERRSNVPIIAFTANAFMEDIRACREAGMNDFVVKPARKKALVEAILRVLPA 705
I+ + R ++P++ +A E G D++ KP L+ I R L
Sbjct: 65 LLPRIK---KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 706 RTLAIETIASDAPPLAPV 723
+ D+ P+
Sbjct: 122 PKRRPSKLEDDSQDGMPL 139


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_0517  HTHFIS        63     8e-15    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 63.3 bits (154), Expect = 8e-15
Identities = 24/138 (17%), Positives = 55/138 (39%), Gaps = 5/138 (3%)

Query: 2 RNELLVIEDADVHLSILRKIATQAGFNTTGVSSVDAASTILRTRHFDCVTLDLSLGERSG 61
+LV +D ++L + ++AG++ S+ + D V D+ + + +
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 62 TEVLQRLAELKYRGPVLIISASENDRLDASVRIGNFLELNVCPPFSKPINLPLLRQTLKQ 121
++L R+ + + PVL++SA + ++ E KP +L L + +
Sbjct: 63 FDLLPRIKKARPDLPVLVMSA--QNTFMTAI---KASEKGAYDYLPKPFDLTELIGIIGR 117

Query: 122 IASETDRQKLVRRQAGRG 139
+E R+ +
Sbjct: 118 ALAEPKRRPSKLEDDSQD 135


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_0518  HTHFIS        83     1e-21    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 82.6 bits (204), Expect = 1e-21
Identities = 26/105 (24%), Positives = 51/105 (48%), Gaps = 3/105 (2%)

Query: 9 ILVVDDYATMIRIIRNLLKQLGFENVDDATDGSAALAKMQAKKYGLVISDWNMEPMTGYD 68
ILV DD A + ++ L + G++ V ++ + + A LV++D M +D
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYD-VRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 69 LLREVRASPELSKTPFIMITAESKTENVIAAKKAGVSNYIVKPFN 113
LL ++ P ++++A++ I A + G +Y+ KPF+
Sbjct: 65 LLPRIKK--ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFD 107


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_0522  FLAGELLIN     57     7e-11    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 57.4 bits (138), Expect = 7e-11
Identities = 80/498 (16%), Positives = 155/498 (31%), Gaps = 7/498 (1%)

Query: 25 LQSTAQLLATTQNNLSTGKKVNSALDNPTNFFTAQGLDNRASDISNLLDGIGNGVQVLQA 84
L + L++ LS+G ++NSA D+ A + ++ +G+ + Q
Sbjct: 17 LNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQASRNANDGISIAQT 76

Query: 85 ANTGITSLQKLVDSAKSIANQVLQSAVGYSTKSTVTSAALTGATATSLIGASTTAVTGSA 144
+ + + + ++ +Q+ G ++ S + S I + +
Sbjct: 77 TEGALNEINNNLQRVRELS---VQATNGTNSDSDLKSIQDEIQQRLEEIDRVSNQTQFNG 133

Query: 145 VLNDNTSTAVAITGSTKLSGTPGTSSNDLASSITTGDTLVVNGTTFTFIAGTSSSGTNIG 204
V + + I T + D VNG + SS N+
Sbjct: 134 VKVLSQDNQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKEATVGDLKSSFKNVT 193

Query: 205 VGDTVTNLLSTIQSATGVTSSITAGAITLTPPAAGLTLSGTSLAKLGLSAVGNSLSGQTL 264
DT + + + +T P + + L
Sbjct: 194 GYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDDAENNTAV-DLFKT 252

Query: 265 TIAATGGGTATSVTFGLGTGQVNSLNDLNAKLAANNLQATVASATGKISITTTNDAASST 324
T + G A ++ + G+ D + + ++TT + T
Sbjct: 253 TKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGK---VSTTINGEKVT 309

Query: 325 IGAIGGTAAASSQSFNGLTAAAPVADATAQSQRASLVAQYNNVLAQINTTAADASFNGIN 384
+ TA A++ L ++ V + Q N + A +A
Sbjct: 310 LTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEANNAVKGESK 369

Query: 385 LLNGDTLKLTFNETGKSTLSITGVTFNTGGLGLSTLTSGTDFLDNNSANKVIKVLNTASS 444
+ K TL+ + + G+STL + S + +++A S
Sbjct: 370 ITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAAKKSTANPLASIDSALS 429

Query: 445 TLRDEASTLGSNLSVVQVRQDFNKNLINVLQTGSSNLTLADTNEEAANSQALSTRQSIAV 504
+ S+LG+ + N + L + S + AD E +N Q
Sbjct: 430 KVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSNMSKAQILQQAGT 489

Query: 505 SALSLANQSQASVLQLLR 522
S L+ ANQ +VL LLR
Sbjct: 490 SVLAQANQVPQNVLSLLR 507


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_0524  HTHFIS        88     4e-20    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 88.0 bits (218), Expect = 4e-20
Identities = 38/134 (28%), Positives = 62/134 (46%), Gaps = 4/134 (2%)

Query: 795 QTQSVLLVDDSPFFRNMLAPVLKSAGYKVRTAASAIEGLATLRSGHTFDIVVTDIEMPEM 854
++L+ DD R +L L AGY VR ++A + +G D+VVTD+ MP+
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGD-GDLVVTDVVMPDE 60

Query: 855 NGFEFAEAIRSDQNLNQLPVIAVSSLVSPAAIERGRQAGLYDYIAK-FDRPGLIAALKEQ 913
N F+ I+ + LPV+ +S+ + + + G YDY+ K FD LI +
Sbjct: 61 NAFDLLPRIKKAR--PDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118

Query: 914 IEERARAEANRRAA 927
+ E R +
Sbjct: 119 LAEPKRRPSKLEDD 132


6     BBta_0579  BBta_0612  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BBta_0579  6       30       -3.370577  hypothetical protein
BBta_0580  7       35       -4.543490  *phage integrase
BBta_0581  8       41       -5.456822  hypothetical protein
BBta_0583  9       51       -7.329166  hypothetical protein
BBta_0584  10      57       -8.272735  hypothetical protein
BBta_0586  10      58       -8.552638  hypothetical protein
BBta_0587  10      58       -8.417076  hypothetical protein
BBta_0588  9       58       -8.118130  hypothetical protein
BBta_0589  9       56       -7.639863  hypothetical protein
BBta_0592  6       45       -6.742879  hypothetical protein
BBta_0593  6       30       -6.298116  hypothetical protein
BBta_0595  6       28       -5.830248  transposase
BBta_0596  6       30       -6.242837  hypothetical protein
BBta_0597  4       31       -5.555632  helicase
BBta_0598  5       34       -5.993236  hypothetical protein
BBta_0599  5       33       -6.235376  arylsulfatase
BBta_0600  4       37       -5.782118  hypothetical protein
BBta_0601  4       37       -5.731354  hypothetical protein
BBta_0602  3       35       -5.371764  hypothetical protein
BBta_0603  5       33       -5.273198  hypothetical protein
BBta_0604  5       31       -4.920672  hypothetical protein
BBta_0606  2       31       -2.722699  hypothetical protein
BBta_0607  2       31       -3.780837  hypothetical protein
BBta_0608  2       29       -3.616520  hypothetical protein
BBta_0609  2       27       -3.364283  transposase
BBta_0610  -1      29       -3.787241  hypothetical protein
BBta_0611  -1      25       -3.788088  transposase
BBta_0612  0       26       -3.484929  acyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_0589  VACCYTOTOXIN  42     2e-05    Helicobacter pylori vacuolating cytotoxin signature.
		>VACCYTOTOXIN#Helicobacter pylori vacuolating cytotoxin signature.

Length = 1291

Score = 41.9 bits (98), Expect = 2e-05
Identities = 38/157 (24%), Positives = 63/157 (40%), Gaps = 25/157 (15%)

Query: 717 EFGSVEDLRQLGDILSDLRQVKGVQSAQMLMLEAIVLRERADRGNLPV----SETEEILD 772
+FG++E + +L + +D+ + AQ L +L + D G + EI
Sbjct: 880 DFGTIESVFELANRSNDIDTLYANSGAQGRDLLQTLLIDSHDAGYARTMIDATSANEITK 939

Query: 773 RSRLLLTNARDQIAKKPRNPSRDQLL----ASILTSYATTLRRQMQVRIEQGNLLAAQTI 828
+ T + IA S Q L A IL S L R+ I+
Sbjct: 940 QLN-TATTTLNNIASLEHKTSGLQTLSLSNAMILNSRLVNLSRRHTNHID---------- 988

Query: 829 ATPALAAAQRAQALQDQ-WHPFDAAALVYYRLAQAWQ 864
+ A+R QAL+DQ + ++AA V Y+ A ++
Sbjct: 989 -----SFAKRLQALKDQRFASLESAAEVLYQFAPKYE 1020


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_0602  SYCDCHAPRONE  38     5e-05    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 38.0 bits (88), Expect = 5e-05
Identities = 22/127 (17%), Positives = 46/127 (36%), Gaps = 10/127 (7%)

Query: 668 NLADLYRQRGQDGEGEAVLRTALVASPRDAALFYSLGLTLTRLRRPDDALNALARAAELE 727
+LA Q G+ + V + V D+ F LG + + D A+++ + A ++
Sbjct: 41 SLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMD 100

Query: 728 PERQRYVYVYGIALHSSGRREAAVSVL---KDALRAHPNDSQI-------LQALISFSRM 777
+ R+ + L G A S L ++ + ++ L+A+ M
Sbjct: 101 IKEPRFPFHAAECLLQKGELAEAESGLFLAQELIADKTEFKELSTRVSSMLEAIKLKKEM 160

Query: 778 SGDATSA 784
+
Sbjct: 161 EHECVDN 167


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_0612  SACTRNSFRASE  36     1e-05    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 36.5 bits (84), Expect = 1e-05
Identities = 13/60 (21%), Positives = 27/60 (45%)

Query: 66 DDHLMMRNLAVLPAFQGRRLGHALIDFAEAEARRRNLPELRLYTNEKFPETMPFYKSRGF 125
+ + ++ ++AV ++ + +G AL+ A A+ + L L T + FY F
Sbjct: 87 NGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHF 146


7     BBta_0670  BBta_0685  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BBta_0670  2       15       2.174672   glutathione S-transferase
BBta_0671  2       13       2.247718   AraC family transcriptional regulator
BBta_0672  2       12       2.283139   cation-transporting ATPase
BBta_0673  2       11       0.919093   hypothetical protein
BBta_0674  1       11       0.913944   fatty-acid--CoA ligase
BBta_0675  3       10       1.622707   carboxylase
BBta_0676  2       10       1.100836   biotin carboxylase
BBta_0677  -1      10       0.238645   two component-response regulator receiver
BBta_0678  -1      11       0.341083   two-component sensor histidine kinase
BBta_0679  0       11       0.805131   cytochrome C
BBta_0680  0       11       1.917259   hypothetical protein
BBta_0681  1       11       1.855628   acyl-CoA synthetase
BBta_0682  1       10       1.978263   branched chain amino acid binding protein
BBta_0683  1       12       2.496337   MAPEG family protein
BBta_0684  2       11       2.483770   hypothetical protein
BBta_0685  2       10       2.265814   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_0674  OMPADOMAIN    30     0.028    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 29.9 bits (67), Expect = 0.028
Identities = 14/39 (35%), Positives = 20/39 (51%), Gaps = 3/39 (7%)

Query: 40 TD--GSRQVTFTELERDANRFANYLVAKGLKPGEKIATV 76
TD GS ER A +YL++KG+ P +KI+
Sbjct: 261 TDRIGSDAYNQGLSERRAQSVVDYLISKGI-PADKISAR 298


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_0676  RTXTOXIND     34     0.001    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 34.4 bits (79), Expect = 0.001
Identities = 11/49 (22%), Positives = 22/49 (44%)

Query: 566 DLTLAAPQSSAAAGGDGKVRAALNGRVVAVLVKLGDRVAVGQPVITLEA 614
++ A +G +++ N V ++VK G+ V G ++ L A
Sbjct: 81 EIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTA 129


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_0678  PF06580       31     0.012    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 30.6 bits (69), Expect = 0.012
Identities = 36/170 (21%), Positives = 59/170 (34%), Gaps = 22/170 (12%)

Query: 255 KRQTREQLRTLLAEVN-HRSKNLLSLVQAIARQMTRRGRPL--DLERFLQRLQAIASNQD 311
QL L A++N H N L+ ++A+ + + R + L + R SN
Sbjct: 156 SMAQEAQLMALKAQINPHFMFNALNNIRALILEDPTKAREMLTSLSELM-RYSLRYSNAR 214

Query: 312 LLIQNDWRFIPLG---GLVRSQLGAFSDLVGNRIAIDGPEIELTPEAAQA--IGMAVHEL 366
+ L +V S L S +R+ E ++ P M V L
Sbjct: 215 Q--------VSLADELTVVDSYLQLASIQFEDRLQF---ENQINPAIMDVQVPPMLVQTL 263

Query: 367 ATNAAKYG-ALSNDDGHVTLRWVRDGDDLEMIWRESGGPAVSPPTHSGFG 415
N K+G A G + L+ +D + + E+ G T G
Sbjct: 264 VENGIKHGIAQLPQGGKILLKGTKDNGTV-TLEVENTGSLALKNTKESTG 312


8     BBta_0972  BBta_0996  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BBta_0972  0       12       3.424458   acyl-CoA synthetase
BBta_0973  -1      12       4.667755   hypothetical protein
BBta_0974  -1      13       3.038734   hypothetical protein
BBta_0975  0       13       3.063561   hypothetical protein
BBta_0976  -1      14       2.994955   hypothetical protein
BBta_0977  -2      14       2.710526   hypothetical protein
BBta_0978  0       11       1.913429   hypothetical protein
BBta_0979  1       13       1.143931   NAD synthetase
BBta_0980  3       14       0.640152   extracellular metal-binding protein
BBta_0981  2       18       -0.392752  hypothetical protein
BBta_0982  2       19       -0.297418  hypothetical protein
BBta_0983  1       18       -1.293640  major facilitator superfamily permease
BBta_0984  1       22       -1.120139  hypothetical protein
BBta_0985  0       23       -0.700231  hypothetical protein
BBta_0986  -1      19       -0.455634  hypothetical protein
BBta_0987  -1      19       -0.309170  hypothetical protein
BBta_0988  -1      15       -0.397929  hypothetical protein
BBta_0989  1       16       -2.290990  hypothetical protein
BBta_0990  1       17       -3.401639  hypothetical protein
BBta_0991  1       22       -4.736674  hypothetical protein
BBta_0992  3       27       -5.658603  hypothetical protein
BBta_0993  2       26       -5.809425  hypothetical protein
BBta_0994  2       27       -6.184902  hypothetical protein
BBta_0995  0       25       -4.917419  hypothetical protein
BBta_0996  0       21       -4.051403  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_0973  IGASERPTASE   39     1e-05    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 38.5 bits (89), Expect = 1e-05
Identities = 21/145 (14%), Positives = 48/145 (33%), Gaps = 4/145 (2%)

Query: 40 ESTRLERAAPAAPSPSPTATSYLAATQAAATAPIKVTPAQAEPVPEPSAAPASAPAVTAQ 99
E + + +P + + I A+ + P P APA+ T
Sbjct: 984 EVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEI----ARVDEAPVPPPAPATPSETTET 1039

Query: 100 DAATENAPAQAQEKTAEKAPDPDLKAREAERKRAAEQRRAERHRQVAERRRQRREQDLRA 159
A ++ EK + A + + RE ++ + + + +VA+ + +E
Sbjct: 1040 VAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTE 1099

Query: 160 VEQAVRENSMPRAYAAQPVDMDAPR 184
++ +A + P+
Sbjct: 1100 TKETATVEKEEKAKVETEKTQEVPK 1124


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_0983  TCRTETA       52     2e-09    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 51.7 bits (124), Expect = 2e-09
Identities = 69/328 (21%), Positives = 110/328 (33%), Gaps = 16/328 (4%)

Query: 65 GTLCDRVGPRPVVLIGSILLAASLALASLASSLLMFQLLFGLLVGGATAAIFAPVMATVT 124
G L DR G RPV+L+ A A+ + A L + L G +V G T A A A +
Sbjct: 64 GALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWV--LYIGRIVAGITGATGAVAGAYIA 121

Query: 125 GWFDTH-RSLAVSLVSAGMGVAPMTMSPLAAWLVSHWDWRTSMQCIAAVVAAIMIPVSLL 183
D R+ +SA G M P+ L+ + AA+ + L
Sbjct: 122 DITDGDERARHFGFMSACFGFG-MVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFL 180

Query: 184 VRRPPEVERDTAAAPQHGPG----LPEMTRPTALRSPQFTILLATNFFCCATHSGPIIHT 239
+ + ER P A F I+ A
Sbjct: 181 LPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDR 240

Query: 240 VSYAVSCGIPLMAAVTIYSVEGLAGLFGRIAFGLLGDRFGAKRILVLGLLAQAFGALAYV 299
+ +++ + L L + G + R G +R L+LG++A G +
Sbjct: 241 FHWD-----ATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLA 295

Query: 300 FAQDLAAFYAVAAVFGFIYAGTMPLYAAIARENFPQAMMGTVMGGIAMAGSLGMATGPLA 359
FA + + + G MP A+ + G + G +A SL GPL
Sbjct: 296 FATRGWMAFPIMVLLASGGIG-MPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLL 354

Query: 360 GGLIYDSFASYTWLYVGSWLMGLGAVLI 387
IY + + + W+ G L+
Sbjct: 355 FTAIYAASITTWNGWA--WIAGAALYLL 380


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_0989  PRTACTNFAMLY  28     0.039    Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 27.7 bits (61), Expect = 0.039
Identities = 34/149 (22%), Positives = 50/149 (33%), Gaps = 15/149 (10%)

Query: 23 MAQTTGATKPAPAAAATEAGPDQPPPGGCMPIGITAAGDVVFPFACKDLIDLHRGKGQQS 82
A PA A A P PGG P G D + +D+ + +
Sbjct: 255 RATIRRGDAPAGGAVPGGAVPGGAVPGGFGPGGFGPVLDGWYG------VDVSGSSVELA 308

Query: 83 SATPEA-GQGAPAHDA----ATAAAGDAVKTTGSVAVQPQSR---PSAEGDGIKAAEGAA 134
+ EA GA T + G G+V +R P A I GA
Sbjct: 309 QSIVEAPELGAAIRVGRGARVTVSGGSLSAPHGNVIETGGARRFAPQAAPLSITLQAGAH 368

Query: 135 APAQPAVREKDAPEANRAEPSGSRASRGD 163
A + A+ + PE + +G ++GD
Sbjct: 369 AQGK-ALLYRVLPEPVKLTLTGGADAQGD 396


9  BBta_1010  BBta_1026  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias      Product
BBta_1010  0       13       -3.068444    WecB/TagA/CpsF glycosyl transferase
BBta_1011  0       13       -3.466235    hypothetical protein
BBta_1012  0       14       -4.194608    hypothetical protein
BBta_1013  0       14       -4.528029    hypothetical protein
BBta_1014  1       15       -4.581863    hypothetical protein
BBta_1015  2       15       -3.771150    hypothetical protein
BBta_1016  2       14       -3.039389    hypothetical protein
BBta_1017  0       13       -2.320169    hypothetical protein
BBta_1018  1       15       -1.814539    hypothetical protein
BBta_1019  1       16       -2.493310    hypothetical protein
BBta_1020  0       16       -2.239611    hypothetical protein
BBta_1021  0       17       -2.235055    hypothetical protein
BBta_1022  1       17       -1.897587    hypothetical protein
BBta_1023  1       16       -1.890236    hypothetical protein
BBta_1025  2       15       -2.318846    hypothetical protein
BBta_1026  2       14       -1.481263    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_1014  PF06057       28     0.041    Type IV secretory pathway VirJ component
		>PF06057#Type IV secretory pathway VirJ component

Length = 243

Score = 27.9 bits (62), Expect = 0.041
Identities = 6/16 (37%), Positives = 10/16 (62%)

Query: 98 FDGDYPTLFKRLQEFL 113
FD DY + K ++ +L
Sbjct: 225 FDDDYDKVVKLIKGWL 240
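
The `Identities` figure for this hit can be checked by hand from the two aligned strings. The sketch below is an illustration only: `count_identities` is a hypothetical helper, and truncating the percentage with integer division is an assumption that happens to reproduce the reported 37% for 6/16.

```python
def count_identities(query, sbjct):
    """Count aligned positions where both sequences carry the same residue."""
    if len(query) != len(sbjct):
        raise ValueError("aligned strings must have equal length")
    # Gap characters never count as an identity.
    return sum(1 for q, s in zip(query, sbjct) if q == s and q != "-")


# Aligned segments copied from the BBta_1014 / PF06057 hit above.
query = "FDGDYPTLFKRLQEFL"
sbjct = "FDDDYDKVVKLIKGWL"

identical = count_identities(query, sbjct)
percent = 100 * identical // len(query)  # assumption: percentage is truncated
print(f"Identities = {identical}/{len(query)} ({percent}%)")
```

For longer hits the aligned strings span several `Query:`/`Sbjct:` line pairs, so the segments would need to be concatenated before counting.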


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_1015  CHANLCOLICIN  33     0.002    Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 33.1 bits (75), Expect = 0.002
Identities = 33/147 (22%), Positives = 54/147 (36%), Gaps = 22/147 (14%)

Query: 180 ILNEDIRDRTSRASDTTKFLGREVERLQAESAALESKIAQAKETQGTPSAG-AIDPLAQM 238
I+NE +R SR T+ +QAE L A+ K + +A A Q
Sbjct: 97 IVNEALRHNASRTPSATELAHANNAAMQAEDERLRLAKAEEKARKEAEAAEKAFQEAEQR 156

Query: 239 RAEYAQKSAIYSDKHPMMKALK----------RQIEAAEKTLAPSNSNGVNLDA------ 282
R E ++ A + + +A + + +E A+K L+ + S V +D
Sbjct: 157 RKEIEREKAETERQLKLAEAEEKRLAALSEEAKAVEIAQKKLSAAQSEVVKMDGEIKTLN 216

Query: 283 -----LQSQREAIQKNLENASAKYAAA 304
R+A K L + A A
Sbjct: 217 SRLSSSIHARDAEMKTLAGKRNELAQA 243



Score = 29.3 bits (65), Expect = 0.031
Identities = 33/176 (18%), Positives = 65/176 (36%), Gaps = 36/176 (20%)

Query: 181 LNEDIRDRTSRASDTTKFLGREVERLQAESAALESKI---------AQAKETQGTPSAGA 231
L E R + S+ + VE Q + +A +S++ ++ + + A
Sbjct: 173 LAEAEEKRLAALSE----EAKAVEIAQKKLSAAQSEVVKMDGEIKTLNSRLSSSIHARDA 228

Query: 232 I-DPLAQMRAEYAQKSAIYSD-----------------KHPMMKALKRQIEAAEKTLAPS 273
LA R E AQ SA Y + P +A +R++ A +
Sbjct: 229 EMKTLAGKRNELAQASAKYKELDELVKKLSPRANDPLQNRPFFEATRRRVGAGKIREEKQ 288

Query: 274 N---SNGVNLDALQSQREAIQKNLENASAKYAA--AQLGEALEKNQQSEKFEVLEQ 324
++ ++ + + IQK + S A A++ EA E ++++ + Q
Sbjct: 289 KQVTASETRINRINADITQIQKAISQVSNNRNAGIARVHEAEENLKKAQNNLLNSQ 344


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_1017  V8PROTEASE    30     0.008    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 29.6 bits (66), Expect = 0.008
Identities = 10/44 (22%), Positives = 18/44 (40%), Gaps = 3/44 (6%)

Query: 129 GNSGSGIFDADAQCLLGIVSRKISQSFQPPAGSGQQPVIFDLAK 172
GNSGS +F+ + ++GI + F + + K
Sbjct: 235 GNSGSPVFNEKNE-VIGIHWGGVPNEF--NGAVFINENVRNFLK 275


10  BBta_1036  BBta_1068  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias      Product
BBta_1036  0       17       -3.217778    serine/threonine protein phosphatase
BBta_1037  0       18       -3.142062    glycosyltransferase group 1 family protein
BBta_1038  1       20       -3.422022    hypothetical protein
BBta_1039  0       18       -2.862460    hypothetical protein
BBta_1040  0       16       -2.338206    hypothetical protein
BBta_1041  -1      15       -2.567733    hypothetical protein
BBta_1042  -1      14       -2.373441    group 1 glycosyl transferase
BBta_1043  -2      14       -2.455120    hypothetical protein
BBta_1044  -3      12       -2.708000    hypothetical protein
BBta_1045  -1      14       -3.513792    glutamate-1-semialdehyde 2,1-aminomutase
BBta_1046  -2      14       -4.105457    sugar nucleotide epimerase/dehydratase
BBta_1048  -1      15       -4.230035    SAM-dependent methyltransferase
BBta_1049  -2      18       -4.034791    dTDP-4-dehydrorhamnose 3,5-epimerase
BBta_1050  -2      18       -4.326635    hypothetical protein
BBta_1051  -2      17       -4.027042    acyltransferase membrane protein
BBta_1052  -2      17       -3.941883    glucose-1-phosphate cytidylyltransferase
BBta_1054  -2      19       -3.520456    hypothetical protein
BBta_1055  -1      20       -3.245737    hypothetical protein
BBta_1056  -2      19       -2.414965    hypothetical protein
BBta_1057  -2      19       -1.799320    hypothetical protein
BBta_1059  0       22       -2.005084    hypothetical protein
BBta_1060  0       22       -2.202964    group 1 glycosyl transferase
BBta_1061  2       22       -2.417519    hypothetical protein
BBta_1062  5       23       -2.459483    hypothetical protein
BBta_1063  2       25       -5.022840    hypothetical protein
BBta_1064  2       27       -5.197036    hypothetical protein
BBta_1065  1       26       -5.015774    hypothetical protein
BBta_1066  0       24       -4.730222    hypothetical protein
BBta_1067  0       22       -3.511771    hypothetical protein
BBta_1068  0       19       -3.083217    glycosyl transferase family protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_1046  NUCEPIMERASE  85     7e-21    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 85.2 bits (211), Expect = 7e-21
Identities = 68/308 (22%), Positives = 110/308 (35%), Gaps = 63/308 (20%)

Query: 1 MKILITGNLGYVGPAVIKYLRQQRPDATLHGFD--NAYFAHCLTGAPRLPERDLDEQFYG 58
MK L+TG G++G V K L + + G D N Y+ L + L+
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLE-AGHQVV-GIDNLNDYYD------VSLKQARLELLAQP 52

Query: 59 DVRNVGLDLRSYDAVVQLAAISNDPMGNQFEAVT------------------FDINQGST 100
+ +DL + + L A FE V D N
Sbjct: 53 GFQFHKIDLADREGMTDLFAS------GHFERVFISPHRLAVRYSLENPHAYADSNLTGF 106

Query: 101 VSIAKAAAAAGVKNFVFASSCSVYGVAEGAPRKESDPLN-PITAYAKSKIGAER------ 153
++I + +++ ++ASS SVYG+ P D ++ P++ YA +K E
Sbjct: 107 LNILEGCRHNKIQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYS 166

Query: 154 ELASIDTNMVVTSLRFATACGMSDRLRLDLVLNDFVACALSQGKISVLSDGTPWRPLIDV 213
L + T LRF T G R D+ L F L I V + G R +
Sbjct: 167 HLYGLPA----TGLRFFTVYG--PWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYI 220

Query: 214 ADMARAI------------DWAIDRGAQTGGR--YLAVNVGSDDRNYQVKELAHAVAQSV 259
D+A AI W ++ G Y N+G+ ++ + A+ ++
Sbjct: 221 DDIAEAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSP-VELMDYIQALEDAL 279

Query: 260 PGTDVSIN 267
G + N
Sbjct: 280 -GIEAKKN 286


11  BBta_1148  BBta_1167  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias      Product
BBta_1148  1       16       3.690655     hypothetical protein
BBta_1149  0       16       4.839664     hypothetical protein
BBta_1150  1       17       4.482723     hypothetical protein
BBta_1151  -1      14       3.120298     N-acetyl-gamma-glutamyl-phosphate reductase
BBta_1152  2       24       -3.169851    short-chain dehydrogenase
BBta_1153  5       38       -7.315764    NADPH quinone oxidoreductase
BBta_1154  6       45       -9.431541    LysR family transcriptional regulator
BBta_1155  5       33       -6.861869    diaminopimelate decarboxylase
BBta_1156  5       31       -6.359355    T/G mismatch-specific endonuclease
BBta_1157  4       22       -4.864252    hypothetical protein
BBta_1158  2       14       -2.852424    hypothetical protein
BBta_1159  -1      8        0.760683     5-methylcytosine methyltransferase
BBta_1161  -2      14       2.718634     hypothetical protein
BBta_1162  -2      13       1.590393     RpiR family transcriptional regulator
BBta_1163  -2      13       1.317476     non-hemolytic phospholipase C
BBta_1164  3       12       1.804278     phosphodiesterase (yfcE)
BBta_1165  0       12       1.124434     transcription elongation factor regulatory
BBta_1166  1       12       1.765346     IclR family transcriptional regulator
BBta_1167  2       13       0.982222     dihydrodipicolinate synthase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_1152  DHBDHDRGNASE  77     5e-19    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 77.4 bits (190), Expect = 5e-19
Identities = 43/141 (30%), Positives = 63/141 (44%), Gaps = 4/141 (2%)

Query: 27 GAHPDLLALALDVTRESEAVAAAQAAIDR-FGRIDVLLNNAGFGLMGAVEETSQEEIEAV 85
H + A DV R+S A+ A I+R G ID+L+N AG G + S EE EA
Sbjct: 56 ARHAE--AFPADV-RDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEAT 112

Query: 86 FRTNVFGLLAVTRAILPHMRKARSGRILNISSIGGYRGSAGFGVYGATKFAVEALSEAMR 145
F N G+ +R++ +M RSG I+ + S Y ++K A ++ +
Sbjct: 113 FSVNSTGVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLG 172

Query: 146 AELEPLGIHVTAIEPGYFRTD 166
EL I + PG TD
Sbjct: 173 LELAEYNIRCNIVSPGSTETD 193


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_1161  PF07675       35     0.002    Cleaved Adhesin
		>PF07675#Cleaved Adhesin

Length = 1358

Score = 35.1 bits (80), Expect = 0.002
Identities = 48/196 (24%), Positives = 74/196 (37%), Gaps = 32/196 (16%)

Query: 112 GSGIAHRPATALSDTLSSRDPIAQALWQA---QRERTLASVKRIRAGTPRPRLSLHDPWA 168
G ++P + L+ T + + W A ++ VKRI G +++
Sbjct: 328 GEPSPYQPVSNLTATAQGQKVTLK--WDAPSAKKAEGSREVKRIGDGL---FVTIEPAND 382

Query: 169 VRALVIVLLVATYFAAGDERRIRIASAFDWN--GVMTPVTIRVDAWIKPPLYTAKPPIIL 226
VRA +++A GD + D N G + P T PL+T L
Sbjct: 383 VRANEAKVVLAADNVWGDNTGYQFLLDADHNTFGSVIPAT--------GPLFTGTASSNL 434

Query: 227 SAANKEGAVPAN-GPIVVPAGSTLIIRSSGGNLDVVTSGGV-------AEAAPGEIA-AG 277
+AN E PAN P+V + G +VV GGV E A G++ AG
Sbjct: 435 YSANFEYLTPANADPVVTTQNIIVT-----GQGEVVIPGGVYDYCITNPEPASGKMWIAG 489

Query: 278 EAAPKGATERHFTIKA 293
+ + A F +A
Sbjct: 490 DGGNQPARYDDFAFEA 505


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_1166  PF05616       29     0.031    Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 28.6 bits (63), Expect = 0.031
Identities = 15/40 (37%), Positives = 23/40 (57%), Gaps = 2/40 (5%)

Query: 189 RQVKAAGYALSNQ--ENAPGLCVLAAPVIDRDGVPLAAIS 226
+ +KA GY ++ E APG V PV DR+G P+ ++
Sbjct: 254 KYIKATGYPGYSEKVEVAPGTKVNMGPVTDRNGNPVQVVA 293


12  BBta_1210  BBta_1244  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias      Product
BBta_1210  3       22       -4.480987    3-oxoacyl-ACP reductase
BBta_1211  4       27       -5.873270    hypothetical protein
BBta_1212  4       30       -6.843264    3-oxoacyl-ACP reductase
BBta_1213  5       34       -7.675208    alcohol dehydrogenase
BBta_1214  6       42       -9.528903    NAD-dependent aldehyde dehydrogenase
BBta_1215  8       55       -11.907675*  hypothetical protein
BBta_1217  7       39       -9.099173    dienelactone hydrolase domain-containing
BBta_1218  3       35       -5.983960    hypothetical protein
BBta_1219  2       28       -3.990069    HTH-type transcriptional regulator
BBta_1220  2       26       -2.901199    hypothetical protein
BBta_1221  4       26       -2.487115    hypothetical protein
BBta_1222  4       31       -3.923925    hypothetical protein
BBta_1223  6       34       -4.197636    IS66 family transposase
BBta_1224  6       39       -5.630765    transposase
BBta_1225  7       44       -7.390002    IS3/IS911 family transposase
BBta_1226  8       48       -9.238262    hypothetical protein
BBta_1227  7       52       -9.495495    hypothetical protein
BBta_1228  9       57       -10.363207   hypothetical protein
BBta_1229  9       58       -11.019842   hypothetical protein
BBta_1230  9       54       -10.803046   hypothetical protein
BBta_1231  7       48       -9.440051    hypothetical protein
BBta_1232  6       48       -9.274454    hypothetical protein
BBta_1233  6       36       -8.442100    hypothetical protein
BBta_1234  5       33       -8.309879    hypothetical protein
BBta_1235  6       35       -8.750443    transposase
BBta_1236  6       37       -8.028651    insertion element protein
BBta_1237  6       38       -8.091067    hypothetical protein
BBta_1238  7       38       -8.103783    hypothetical protein
BBta_1239  9       46       -8.813354    hypothetical protein
BBta_1240  9       45       -8.510775    hypothetical protein
BBta_1241  8       46       -8.291933    hypothetical protein
BBta_1242  9       48       -9.029335    hypothetical protein
BBta_1243  7       40       -7.565963    hypothetical protein
BBta_1244  5       31       -5.504774    phage / plasmid primase P4
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_1210  DHBDHDRGNASE  79     2e-19    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 78.9 bits (194), Expect = 2e-19
Identities = 68/261 (26%), Positives = 111/261 (42%), Gaps = 11/261 (4%)

Query: 4 GIKGRRAIVCASSKGLGRACAIALANEGVHVTITARGAEALAK--TAADIRAANPDITVT 61
GI+G+ A + +++G+G A A LA++G H+ E L K ++ A + +
Sbjct: 5 GIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPA 64

Query: 62 EVAGDITTPEGRAAVLKACPDPDILINNAGGPPPGDFRNWTRDDWIKALDANMLTPIELI 121
+V E A + + DIL+N AG PG + + ++W N
Sbjct: 65 DVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNAS 124

Query: 122 KATVDGMMARKFGRIVNITSAAVKAPIDILGLSNGARAGLTGFVAGLSRKTVINNVTINA 181
++ MM R+ G IV + S P + ++A F L + N+ N
Sbjct: 125 RSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNI 184

Query: 182 LLPGPFETDRLIGTAKIESERRGIPPEQIL---AERAKTN-PAGRFGDPEEFGLACAFLC 237
+ PG ETD + ++ G EQ++ E KT P + P + A FL
Sbjct: 185 VSPGSTETDM---QWSLWADENG--AEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLV 239

Query: 238 GAKAGYITGQNILLDGGAFPG 258
+AG+IT N+ +DGGA G
Sbjct: 240 SGQAGHITMHNLCVDGGATLG 260


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_1212  DHBDHDRGNASE  111    8e-32    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 111 bits (279), Expect = 8e-32
Identities = 80/252 (31%), Positives = 131/252 (51%), Gaps = 10/252 (3%)

Query: 10 EVILITGASQGLGRQFARVLSAHGAAVALAARQTGKLKALEDEIKAKGGRAVAVEMDVTS 69
++ ITGA+QG+G AR L++ GA +A KL+ + +KA+ A A DV
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 70 AASIAKGLDAAESALGPITVLINNAGIAVEKLAVEQSEADWDAVIGANLKGAYFAATEVA 129
+A+I + E +GPI +L+N AG+ L S+ +W+A N G + A+ V+
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 130 RRMIARKGGGNIINIASVLGFSVVKFIAPYAISKAGVVQATKALALELAASDIRVNALAP 189
+ M+ R+ G+I+ + S +A YA SKA V TK L LELA +IR N ++P
Sbjct: 129 KYMMDRR-SGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187

Query: 190 GYIDTDINHDVWETPAGEKLIKR---------IPQRRVGRESDLDGAIMLLASPASRYMT 240
G +TD+ +W G + + + IP +++ + SD+ A++ L S + ++T
Sbjct: 188 GSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHIT 247

Query: 241 GSVVTVDGGFLL 252
+ VDGG L
Sbjct: 248 MHNLCVDGGATL 259


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_1220  PF07675       27     0.021    Cleaved Adhesin
		>PF07675#Cleaved Adhesin

Length = 1358

Score = 26.6 bits (58), Expect = 0.021
Identities = 11/25 (44%), Positives = 16/25 (64%)

Query: 77 YFDANFEFGSPDPADPTVTQRAITL 101
+ ANFE+ +P ADP VT + I +
Sbjct: 434 LYSANFEYLTPANADPVVTTQNIIV 458


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_1238  HTHFIS        32     0.012    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 32.1 bits (73), Expect = 0.012
Identities = 21/72 (29%), Positives = 31/72 (43%)

Query: 331 QHLANYQRKERKPLDTVELNEQIVDRLFQIEAIKRVAERFEGKHRKALVVQATGTGKTRV 390
+ LA +R+ K D + +V R ++ I RV R ++ +GTGK V
Sbjct: 117 RALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELV 176

Query: 391 AIALTDLLIRAG 402
A AL D R
Sbjct: 177 ARALHDYGKRRN 188


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_1240  IGASERPTASE   33     0.002    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 33.1 bits (75), Expect = 0.002
Identities = 31/188 (16%), Positives = 62/188 (32%), Gaps = 20/188 (10%)

Query: 2 AAVEKARHAVDQLVAAKLREGRQAIAAEEAIKAKAAA-------ADDLQLKQKEVS---- 50
+ + A A E K ++ A + + +EV+
Sbjct: 1015 EEIARVDEAPVPPPAPAT-PSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAK 1073

Query: 51 -----DLQVL-LAQRDEKLKEAQKLQAEVMRKQREIDEAKRELDLTIEKRVDASIAEIRQ 104
+ Q +AQ + KE Q + + + ++AK E + T E S +Q
Sbjct: 1074 SNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQ 1133

Query: 105 KAKAEAEEGERLKVAEKDNLISSLQRQIEDLKRKA-EQGSQQLQGEVLELELESTLRASF 163
++E + + E D ++ + Q + EQ +++ V + EST +
Sbjct: 1134 -EQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTG 1192

Query: 164 PHDTIEPV 171
P
Sbjct: 1193 NSVVENPE 1200


13  BBta_1389  BBta_1419  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias      Product
BBta_1389  -1      18       -3.005162    inositol monophosphatase family protein
BBta_1390  -1      23       -3.231602    N-formylglutamate amidohydrolase
BBta_1391  1       30       -4.133443    response regulator receiver
BBta_1392  1       35       -4.333355*   phage related integrase
BBta_1394  2       34       -2.339217    recombinase
BBta_1395  3       33       -2.528192    hypothetical protein
BBta_1396  4       33       -2.994973    hypothetical protein
BBta_1397  4       31       -3.597334    hypothetical protein
BBta_1398  4       30       -3.917462    hypothetical protein
BBta_1399  4       28       -5.796380    hypothetical protein
BBta_1400  2       29       -6.550388    hypothetical protein
BBta_1401  4       29       -6.765194    hypothetical protein
BBta_1402  3       26       -6.181287    transposase
BBta_1403  4       31       -7.101619    hypothetical protein
BBta_1404  4       29       -6.610580    hypothetical protein
BBta_1405  3       24       -4.140812    thioredoxin-like protein
BBta_1406  4       22       -3.911306    hypothetical protein
BBta_1408  2       23       -3.126104    hypothetical protein
BBta_1409  1       24       -2.854953    hypothetical protein
BBta_1411  0       21       -2.281602    acetoacetate decarboxylase
BBta_1412  0       21       -2.049586    patatin-like phospholipase
BBta_1413  2       29       -2.855786    hypothetical protein
BBta_1414  2       31       -3.098995    hypothetical protein
BBta_1415  6       32       -4.787810    hypothetical protein
BBta_1416  3       26       -3.728396    hypothetical protein
BBta_1417  3       24       -2.398841    hypothetical protein
BBta_1419  2       25       -1.932470    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_1391  HTHFIS        79     1e-20    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 79.1 bits (195), Expect = 1e-20
Identities = 30/117 (25%), Positives = 57/117 (48%)

Query: 3 KILLAEDDNDMRRFLVKALENAGFQVSPHDNGMSAYQRLREEPFEMLLTDIVMPEMDGIE 62
IL+A+DD +R L +AL AG+ V N + ++ + ++++TD+VMP+ + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 63 LARRASELDPDIKIMFITGFAAVALNSDSEAPKNAKVLSKPVHLRELVSEVNKMLAA 119
L R + PD+ ++ ++ + L KP L EL+ + + LA
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121


14  BBta_1430  BBta_1466  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias      Product
BBta_1430  2       22       -2.380884    hypothetical protein
BBta_1432  2       21       -2.095067    hypothetical protein
BBta_1433  0       19       -1.644257    membrane-bound ATP synthase, F1 sector subunit
BBta_1434  0       25       -1.957204    ATP synthase subunit b
BBta_1435  0       27       -3.336646    F0F1 ATP synthase subunit C
BBta_1436  0       27       -3.340599    ATP synthase A chain (ATPase protein 6)
BBta_1438  0       25       -3.275743    H(+)-transporting ATP synthase, gene 1
BBta_1439  1       24       -3.243681    F0F1 ATP synthase subunit epsilon
BBta_1440  1       22       -2.894758    F0F1 ATP synthase subunit beta
BBta_1442  1       17       -3.576004    DNA polymerase IV
BBta_1443  2       18       -3.042134    hypothetical protein
BBta_1444  3       21       -4.714497    hypothetical protein
BBta_1446  3       24       -5.120953    hypothetical protein
BBta_1447  2       24       -5.478786    hypothetical protein
BBta_1448  4       26       -5.960003    membrane transport protein
BBta_1449  4       24       -5.656482    hypothetical protein
BBta_1451  3       23       -5.720228    P-type ATPase, Mg2+ ATPase transport protein
BBta_1452  2       21       -4.709413    hypothetical protein
BBta_1453  2       18       -4.408964    NreA protein
BBta_1454  2       18       -4.323333    hypothetical protein
BBta_1455  0       14       -3.318188    ABC transporter permease
BBta_1456  0       16       -2.467908    ABC transporter ATP-binding protein
BBta_1457  0       15       -1.836921    hypothetical protein
BBta_1458  1       14       -0.509706    transposase
BBta_1459  -1      15       -1.024972    ABC transporter ATP-binding protein
BBta_1460  0       19       -2.060777    hypothetical protein
BBta_1461  1       22       -2.760722    hypothetical protein
BBta_1462  2       23       -3.485803    insertion sequence ATP-binding protein y4pL
BBta_1463  2       24       -3.944346    transposase
BBta_1464  1       28       -4.705544    hypothetical protein
BBta_1466  1       26       -3.430615    recombinase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_1434  RTXTOXIND     28     0.038    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 27.9 bits (62), Expect = 0.038
Identities = 23/142 (16%), Positives = 43/142 (30%), Gaps = 10/142 (7%)

Query: 37 ARKAEVRRQFDTVRDFEAKANAELA---AVEAERAGIAAEREAALKAAAAQ-AQEMAEAR 92
R + R + + E K E E E + + + Q Q+
Sbjct: 151 TRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLD 210

Query: 93 RAQAERD-AQALMDSTRKTLASERESALDEARRLALDLGADLAQRLLAEVPMQYRAEAWI 151
+ +AER A ++ E+ LD+ L A +L + A
Sbjct: 211 KKRAERLTVLARINRYENLSRVEKSR-LDDFSSLL-HKQAIAKHAVLEQENKYVEAVN-- 266

Query: 152 ERIEQHLKAMPQAERDALVRQL 173
+ + + Q E + L +
Sbjct: 267 -ELRVYKSQLEQIESEILSAKE 287


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_1451  ACRIFLAVINRP  33     0.009    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 32.5 bits (74), Expect = 0.009
Identities = 13/73 (17%), Positives = 27/73 (36%), Gaps = 7/73 (9%)

Query: 69 RLKKLGPNL-------VARERKPTIPEEIWNRARNPLNALLLTLAVVSFLLGDVRAAVVI 121
+L +L P + P + I + A++L V+ L ++RA ++
Sbjct: 309 KLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNMRATLIP 368

Query: 122 AAMVVLAITTAFI 134
V + + F
Sbjct: 369 TIAVPVVLLGTFA 381


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_1452  HTHFIS        24     0.043    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 24.0 bits (52), Expect = 0.043
Identities = 11/26 (42%), Positives = 13/26 (50%)

Query: 32 EKFAGSGLTRWHEGAFLLALGGTLLI 57
EK A +G G F A GGTL +
Sbjct: 212 EKGAFTGAQTRSTGRFEQAEGGTLFL 237


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_1462  PF05272       28     0.031    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.5 bits (63), Expect = 0.031
Identities = 38/191 (19%), Positives = 66/191 (34%), Gaps = 14/191 (7%)

Query: 42 LLLEREAS----LRHDKRLATRLRYAKLRQQA-CVEDIDYRTPRGLDRPLFAKLVEGRWI 96
LL R A+ LR LA + + +LR+Q V +R G + ++
Sbjct: 447 LLKPRRAALIEALRSAPALAGCVAFDELREQPVAVRAFPWRKAPGPLEDADVLRL-ADYV 505

Query: 97 DDHVNLLICGPAGVGKSWLASALGHK--ACRDN-RSVLYQRVPRL---FDDLALARGDGR 150
+ ++ +A ++ RD ++ + VPRL + D
Sbjct: 506 ETTYGTGEASAQTTEQAINVAADMNRVHPFRDWVKAQQWDEVPRLEKWLVHVLGKTPDDY 565

Query: 151 HPRLLRGLGRVDLLILDDWGLEPLDAGARHDLLEILEDRYGHRSTIVTSQLPVDQWHL-- 208
PR LR L V IL ++ G + D +LE G + + + L +
Sbjct: 566 KPRRLRYLQLVGKYILMGHVARVMEPGCKFDYSVVLEGTGGIGKSTLINTLVGLDFFSDT 625

Query: 209 LIGDPTYADAV 219
T D+
Sbjct: 626 HFDIGTGKDSY 636


15  BBta_1477  BBta_1587  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias      Product
BBta_1477  -1      28       -3.380884    insertion sequence ATP-binding protein y4pL
BBta_1480  -1      29       -4.538152    hypothetical protein
BBta_1481  2       30       -5.318188    hypothetical protein
BBta_1482  3       31       -4.197025    hypothetical protein
BBta_1483  5       37       -4.574944    hypothetical protein
BBta_1484  4       34       -4.427679    hypothetical protein
BBta_1485  3       35       -4.338361    XRE family transcriptional regulator
BBta_1486  4       38       -3.792182    hypothetical protein
BBta_1487  2       35       -1.804390    hypothetical protein
BBta_1488  2       33       -3.097225    hypothetical protein
BBta_1489  1       32       -3.119496    hypothetical protein
BBta_1490  1       32       -2.958963    hypothetical protein
BBta_1491  2       35       -3.450591    lytic transglycosylase
BBta_1492  3       35       -3.923149    hypothetical protein
BBta_1493  3       37       -5.731263    conjugal transfer coupling protein TraG
BBta_1494  3       43       -6.169955    CopG family protein
BBta_1495  3       42       -5.496972    hypothetical protein
BBta_1496  2       42       -5.682683    hypothetical protein
BBta_1497  2       40       -5.292672    hypothetical protein
BBta_1498  2       33       -4.938430    hypothetical protein
BBta_1499  0       31       -4.806958    hypothetical protein
BBta_1500  1       29       -4.189575    conjugal transfer protein TrbB
BBta_1501  2       28       -4.279049    conjugal transfer protein TrbC
BBta_1502  3       27       -3.696870    conjugal transfer protein TrbD
BBta_1503  3       30       -4.091308    conjugal transfer ATPase TrbE
BBta_1504  5       30       -3.883506    conjugal transfer protein TrbJ
BBta_1505  4       35       -4.554769    conjugal transfer protein TrbL
BBta_1506  5       34       -4.974086    conjugal transfer protein TrbF
BBta_1507  5       34       -5.126051    conjugal transfer protein TrbG
BBta_1508  3       39       -5.605528    conjugal transfer protein TrbI
BBta_1509  3       36       -5.754550    hypothetical protein
BBta_1510  2       33       -5.796159    LysR family transcriptional regulator
BBta_1511  3       30       -5.089948    hypothetical protein
BBta_1512  1       32       -5.148826    metal dependent phosphohydrolase
BBta_1513  2       35       -5.423562    hypothetical protein
BBta_1514  2       34       -5.470500    response regulator receiver
BBta_1515  2       34       -5.757105    secretory protein kinase, cpaF
BBta_1516  4       34       -5.254016    Type II secretion system protein
BBta_1517  4       35       -5.466611    hypothetical protein
BBta_1519  6       36       -5.811602    pilus assembly protein CpaC
BBta_1520  8       30       -5.523498    hypothetical protein
BBta_1521  7       28       -5.927628    hypothetical protein
BBta_1522  6       29       -5.480573    hypothetical protein
BBta_1523  7       30       -5.909273    hypothetical protein
BBta_1525  5       30       -5.983268    hypothetical protein
BBta_1526  5       28       -6.561218    hypothetical protein
BBta_1527  2       36       -7.277695    Flp/Fap pilin component
BBta_1528  4       38       -8.623165    hypothetical protein
BBta_1529  3       38       -9.196364    hypothetical protein
BBta_1530  3       32       -7.918143    hypothetical protein
BBta_1531  3       32       -7.752627    hypothetical protein
BBta_1532  3       34       -7.906226    hypothetical protein
BBta_1533  3       33       -7.835915    chemotaxis protein methyltransferase cheR
BBta_1534  2       23       -5.749383    chemotaxis protein CheY
BBta_1536  4       25       -5.799150    methyl-accepting chemotaxis protein
BBta_1537  7       34       -4.782871    hypothetical protein
BBta_1538  6       31       -5.262445    transposase
BBta_1539  7       27       -4.000768    transposase
BBta_1540  6       26       -4.149155    transposase
BBta_1541  6       25       -4.885298    hypothetical protein
BBta_1542  5       25       -4.184418    hypothetical protein
BBta_1543  6       26       -4.572819    outer-membrane protein
BBta_1544  5       25       -4.845928    outer membrane lipoprotein (NodT-like), RND
BBta_1545  4       27       -5.876569    HlyD family heavy metal efflux pump
BBta_1546  5       29       -6.013759    cation efflux system protein
BBta_1547  7       28       -5.755845    hypothetical protein
BBta_1548  5       21       -5.027725    hypothetical protein
BBta_1549  5       19       -4.265511    LuxR transcriptional regulator
BBta_1550  5       19       -3.519579    ATP-binding region, ATPase-like protein
BBta_1551  3       14       -2.101370    hypothetical protein
BBta_1552  3       15       -2.089436    OmpA-like transmembrane domain-containing
BBta_1553  1       18       -1.626127    manganese transport protein
BBta_1554  4       30       -3.220727    hypothetical protein
BBta_1555  4       29       -3.992485    manganese transport regulator MntR
BBta_1556  3       31       -4.627306    manganese/divalent cation transport protein
BBta_1557  7       40       -5.244415    MgtC-magnesium transport family protein
BBta_1558  7       41       -5.502096    hypothetical protein
BBta_1559  6       40       -5.509311    hypothetical protein
BBta_1560  5       39       -5.338246    transposase
BBta_1561  0       26       -2.819650    hypothetical protein
BBta_1562  -1      21       -1.202293    hypothetical protein
BBta_1563  -1      15       -1.029471    hypothetical protein
BBta_1564  -1      13       -1.476648    hypothetical protein
BBta_1565  0       13       -1.176253    hypothetical protein
BBta_1566  -1      13       -1.027158    hypothetical protein
BBta_1567  0       15       -2.600720    hypothetical protein
BBta_1568  0       17       -3.542808    cation transport ATPase
BBta_1569  2       27       -5.367201    hypothetical protein
BBta_1570  3       35       -5.776861    hypothetical protein
BBta_1572  3       38       -6.291570    hypothetical protein
BBta_1573  3       37       -5.884668    transposase
BBta_1574  4       38       -7.139794    hypothetical protein
BBta_1575  5       41       -6.747076    hypothetical protein
BBta_1576  3       41       -6.504715    hypothetical protein
BBta_1577  3       37       -5.525709    hypothetical protein
BBta_1578  1       29       -3.963634    hypothetical protein
BBta_1580  1       26       -3.430318    2-amino-3-ketobutyrate coenzyme A ligase
BBta_1581  1       21       -1.539721    hypothetical protein
BBta_1582  1       21       -1.696428    hypothetical protein
BBta_1583  2       19       -1.570472    hypothetical protein
BBta_1584  1       18       -1.556518    hypothetical protein
BBta_1585  2       23       -2.708919    transposase
BBta_1586  0       25       -3.031234    phosphoglycerate mutase
BBta_1587  2       20       -0.793763    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_1477  PF05272       28     0.031    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.5 bits (63), Expect = 0.031
Identities = 38/191 (19%), Positives = 66/191 (34%), Gaps = 14/191 (7%)

Query: 42 LLLEREAS----LRHDKRLATRLRYAKLRQQA-CVEDIDYRTPRGLDRPLFAKLVEGRWI 96
LL R A+ LR LA + + +LR+Q V +R G + ++
Sbjct: 447 LLKPRRAALIEALRSAPALAGCVAFDELREQPVAVRAFPWRKAPGPLEDADVLRL-ADYV 505

Query: 97 DDHVNLLICGPAGVGKSWLASALGHK--ACRDN-RSVLYQRVPRL---FDDLALARGDGR 150
+ ++ +A ++ RD ++ + VPRL + D
Sbjct: 506 ETTYGTGEASAQTTEQAINVAADMNRVHPFRDWVKAQQWDEVPRLEKWLVHVLGKTPDDY 565

Query: 151 HPRLLRGLGRVDLLILDDWGLEPLDAGARHDLLEILEDRYGHRSTIVTSQLPVDQWHL-- 208
PR LR L V IL ++ G + D +LE G + + + L +
Sbjct: 566 KPRRLRYLQLVGKYILMGHVARVMEPGCKFDYSVVLEGTGGIGKSTLINTLVGLDFFSDT 625

Query: 209 LIGDPTYADAV 219
T D+
Sbjct: 626 HFDIGTGKDSY 636


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_1496  TCRTETB       55     3e-10    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 55.3 bits (133), Expect = 3e-10
Identities = 37/176 (21%), Positives = 66/176 (37%), Gaps = 19/176 (10%)

Query: 216 LFRTINAAI----SPALASEFGLDAAETGLLASVYFLVFGVAQIPVGIFLDRFGPRRVQG 271
F +N + P +A++F A T + + + L F + G D+ G +R
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKR--- 80

Query: 272 VLLVIAAGGATLFG--------NASSLPELLFARGMIGLGVAGSLMAGLKSIVTWFPRER 323
LL+ G + S L+ AR + G G A + + + P+E
Sbjct: 81 -LLLF---GIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKEN 136

Query: 324 VALANGWMIMLGSLGAVTATAPTDWLLNYVGWRSLFEILTIGTFAVSGLIYMVVPE 379
A G + + ++G A + +Y+ W L I I V L+ ++ E
Sbjct: 137 RGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKE 192


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_1506  PF04335       51     3e-10    VirB8 type IV secretion protein
		>PF04335#VirB8 type IV secretion protein

Length = 227

Score = 51.4 bits (123), Expect = 3e-10
Identities = 42/218 (19%), Positives = 79/218 (36%), Gaps = 12/218 (5%)

Query: 19 YQKAAQVWD-ERIGSARVQARNWRLMAFGCLLLSGGLAAALVWESTQGTITPWVVQVD-H 76
Y + A W+ +++ +A + ++A L+ A+ + T+ P+V+ VD +
Sbjct: 13 YFEEAASWERDKLAAAERSKKLAWVVAGVAGALATAGVVAVAALTPLKTVEPYVITVDRN 72

Query: 77 LGQAQAVAPASSSYQ-PTDPQIAFH-LARFIEDVRGLPTDAIVLRQDWLRAYDFTTDRGA 134
G+A A D + + LA ++ G A + + +
Sbjct: 73 TGEASIAAKLHGDATITYDEAVRKYFLATYVRYREGW--IAAAREEYFDAVMVMSARPEQ 130

Query: 135 AALNDYARTN---DPFVKLG-NTQVAVEISSVIRASPQSFRVAWTERRYDSGQLAATERW 190
+ + +T+ P L T V VEI V +V +T+ +
Sbjct: 131 DRWSRFYKTDNPQSPQNILANRTDVFVEIKRVSFLGGNVAQVYFTKESVTGSN-STKTDA 189

Query: 191 TAILSVIIETPRDTDRLR-KNPLGVYVNSINWSKELGQ 227
A + ++ + R KNPLG V S E+ Q
Sbjct: 190 VATIKYKVDGTPSKEVDRFKNPLGYQVESYRADVEVPQ 227


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_1515  PF07675       30     0.015    Cleaved Adhesin
		>PF07675#Cleaved Adhesin

Length = 1358

Score = 30.4 bits (68), Expect = 0.015
Identities = 15/57 (26%), Positives = 29/57 (50%), Gaps = 1/57 (1%)

Query: 3 ASTRPTVSEAKDFIRDQIFLRIEPLVAVRISQQDLMVSVNKLVAEIATGRKILLNQD 59
++ + S I D +F+ IEP VR ++ ++++ + V TG + LL+ D
Sbjct: 356 SAKKAEGSREVKRIGDGLFVTIEPANDVRANEAKVVLAADN-VWGDNTGYQFLLDAD 411


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_1516  BCTERIALGSPF  34     7e-04    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 34.0 bits (78), Expect = 7e-04
Identities = 28/120 (23%), Positives = 50/120 (41%), Gaps = 7/120 (5%)

Query: 164 ARMLRAGLPITVAMRTVAVDGSPP-VSRVFGLIADELRIGVPLEEALDTNSREIGLPDFR 222
A ++ A +P+ A+ VA P +S++ + ++ G L +A+ F
Sbjct: 78 ATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSLADAMKCFP-----GSFE 132

Query: 223 FFAVAMTLQFATGGNLTATLESLSDIIRKRRAARLKA-KAATGEIRLTAYTLGAIPILTT 281
AM T G+L A L L+D +R+ R + +A LT + + IL +
Sbjct: 133 RLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLTVVAIAVVSILLS 192


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_1519  BCTERIALGSPD  150    3e-41    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 150 bits (380), Expect = 3e-41
Identities = 72/277 (25%), Positives = 118/277 (42%), Gaps = 9/277 (3%)

Query: 172 VINAMRVAASQQVMLRVRFIEVSRQAEREIGVNWFGANASGTRGINTGTGAISQAGPTAT 231
VI + + QV++ EV +G+ W NA T+ N+G +
Sbjct: 336 VIAQLDIRR-PQVLVEAIIAEVQDADGLNLGIQWANKNAGMTQFTNSGLPISTAI----- 389

Query: 232 SAGVPVFNTIGTFAGSTLSAPFGVGLFNLANKGGSVDVLITALEKKGLARRLAEPDLVAL 291
AG +N GT + S SA G+ +L+TAL LA P +V L
Sbjct: 390 -AGANQYNKDGTVSSSLASALSSFNGIAAGFYQGNWAMLLTALSSSTKNDILATPSIVTL 448

Query: 292 SGDTASFLAGGEYPVPS-VQSSSGTTPVITVLYKPFGVQLTFVPTVLASGIINLRLTPSV 350
A+F G E PV + Q++SG TV K G++L P + + L + V
Sbjct: 449 DNMEATFNVGQEVPVLTGSQTTSGDNIFNTVERKTVGIKLKVKPQINEGDSVLLEIEQEV 508

Query: 351 SELDYTNAVAISGTLVPALSKREARTAIELRDGQSFAIAGLLQSDNLRDVGQLPWLGSVP 410
S + A + S L + R A+ + G++ + GLL ++P LG +P
Sbjct: 509 SSVAD-AASSTSSDLGATFNTRTVNNAVLVGSGETVVVGGLLDKSVSDTADKVPLLGDIP 567

Query: 411 VLGTLFRSTSYQQKETDLVVIVTPHLVAPAAPGQALA 447
V+G LFRSTS + + +L++ + P ++ + +
Sbjct: 568 VIGALFRSTSKKVSKRNLMLFIRPTVIRDRDEYRQAS 604


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_1521  SYCDCHAPRONE  45     8e-09    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 44.5 bits (105), Expect = 8e-09
Identities = 21/93 (22%), Positives = 37/93 (39%)

Query: 5 YFNRGIYGTAEKYFQSAVEKAPKDVSAWIGLAASYDRLGRFDLADHAYGQAIKLGGETTQ 64
+ G Y A K FQ+ D ++GL A +G++DLA H+Y + + +
Sbjct: 46 QYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPR 105

Query: 65 ILNNLGYSYMLRGKLTAARTKFMEAYRREPDNP 97
+ + +G+L A + A D
Sbjct: 106 FPFHAAECLLQKGELAEAESGLFLAQELIADKT 138


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_1528  PREPILNPTASE  28     0.025    Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 27.8 bits (62), Expect = 0.025
Identities = 18/80 (22%), Positives = 35/80 (43%), Gaps = 7/80 (8%)

Query: 18 VLGISAAIDLKDRVIPNELVVAIAVIGFAQGLALRPGLVWFSLLVAIIVFFGLWIVAHVK 77
VL IDL ++P++L + + G L + +++ A+ + LW +
Sbjct: 143 VLVALTFIDLDKMLLPDQLTLPLLWGGLLFNLLGGFVSLGDAVIGAMAGYLVLWSLYWAF 202

Query: 78 I-------IGGGDLKLISAV 90
+G GD KL++A+
Sbjct: 203 KLLTGKEGMGYGDFKLLAAL 222


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
BBta_1534  HTHFIS  57     8e-13    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 56.8 bits (137), Expect = 8e-13
Identities = 18/109 (16%), Positives = 42/109 (38%), Gaps = 2/109 (1%)

Query: 2 VVDDSSVGRKIARRILEELEFRIIEAKDGEKAVEARKSDLPDVVLLDWDTQVTDGCELFG 61
V DD + R + + L + + + + D+V+ D + +L
Sbjct: 8 VADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDLLP 67

Query: 62 NLLRLSGGDQPIVVICTTESNIDHISDALHAGSDEYVVKPFNRDAVVAR 110
+ + D P V++ + ++ A G+ +Y+ KPF+ ++
Sbjct: 68 RIKKA-RPDLP-VLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGI 114


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BBta_1536  PF07201  33     0.003    Hypersensitivity response secretion protein HrpJ
		>PF07201#Hypersensitivity response secretion protein HrpJ

Length = 293

Score = 32.9 bits (75), Expect = 0.003
Identities = 33/152 (21%), Positives = 65/152 (42%), Gaps = 9/152 (5%)

Query: 453 IIETVSSASNDLESSASTLSSTATRSQQLATAVAAASEEASTNVQSVASAAEELT---SS 509
+ +S + L + ++S+ +Q L + + S +QS+A AEE+T S
Sbjct: 3 TLHNLSYGNTPLHNERPEIASSQIVNQTLGQFRGESVQIVSGTLQSIADMAEEVTFVFSE 62

Query: 510 VHEIS---RQVQESARIASGAVEQARTTNERVNELSKAAARIGDVVELINTIAGQTNLLA 566
E+S R++ +S S EQ +V EL + + +++ L++ + +L
Sbjct: 63 RKELSLDKRKLSDSQARVSDVEEQVNQYLSKVPELEQ-KQNVSELLSLLS-NSPNISLSQ 120

Query: 567 LNATIEAARAGEAGRGFAVVASEVKALAEQTA 598
L A +E + E F ++ AL +
Sbjct: 121 LKAYLEGK-SEEPSEQFKMLCGLRDALKGRPE 151


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_1538  BCTERIALGSPF  25     0.041    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 24.8 bits (54), Expect = 0.041
Identities = 13/37 (35%), Positives = 14/37 (37%)

Query: 15 KYHVVFIPKYRRKVLYGELRRHLGDVFRRLALQKECR 51
+ F Y V GE HL V RLA E R
Sbjct: 126 CFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQR 162


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
BBta_1545  RTXTOXIND  47     9e-08    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 46.7 bits (111), Expect = 9e-08
Identities = 33/242 (13%), Positives = 80/242 (33%), Gaps = 45/242 (18%)

Query: 140 DLLSAASSLIATSGVLQLTTRALNRLKTLYESRAVAQ---KDVEQAISDQQTAEGAHKAA 196
+ L+ + + + ++ L+ +L +A+A+ + E + +K+
Sbjct: 215 ERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQ 274

Query: 197 RDAVR------------IFGKTDAEIDAIIKERRADST---------------LVVKSPI 229
+ + + EI +++ + V+++P+
Sbjct: 275 LEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPV 334

Query: 230 NGRITARNA-APGLFVQPGSAPAPFSVADTSTMWMIANVAESDVSAIHVGQHVKVSVMSY 288
+ ++ G V V + T+ + A V D+ I+VGQ+ + V ++
Sbjct: 335 SVKVQQLKVHTEGGVVTTAE-TLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAF 393

Query: 289 PGKIF---EGHISTISSN--VDPNTH------RMLVRSEIEDPDH--ELRSGMFARFSIV 335
P + G + I+ + D + + + + L SGM I
Sbjct: 394 PYTRYGYLVGKVKNINLDAIEDQRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIK 453

Query: 336 IG 337
G
Sbjct: 454 TG 455



Score = 34.0 bits (78), Expect = 0.001
Identities = 23/124 (18%), Positives = 43/124 (34%), Gaps = 21/124 (16%)

Query: 114 GRILETFAKVGDEVKKGQVLFTIDSPDLLSAASSLIATSGVLQLTTRALNRLKTLYES-- 171
+ E K G+ V+KG VL + L + A +L S +LQ R + L S
Sbjct: 105 SIVKEIIVKEGESVRKGDVLLKLT--ALGAEADTLKTQSSLLQARLEQT-RYQILSRSIE 161

Query: 172 ---------------RAVAQKDVEQAISDQQTAEGAHKAARDAVRI-FGKTDAEIDAIIK 215
+ V++++V + S + + + + K AE ++
Sbjct: 162 LNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLA 221

Query: 216 ERRA 219

Sbjct: 222 RINR 225


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_1546  ACRIFLAVINRP  562    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 562 bits (1449), Expect = 0.0
Identities = 218/1082 (20%), Positives = 412/1082 (38%), Gaps = 87/1082 (8%)

Query: 8 FGLTRRAIILLGVLVFICGGLIAFRNLNIEAYPNPAPVILEITAQAPGLSAEEMERYYTI 67
F + R + ++ + G +A L + YP AP + ++A PG A+ ++ T
Sbjct: 4 FFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQ 63

Query: 68 PMEVGLAVTPGVDVIRSTSF-YGLSFVRVTFKYGVEFYFAYTQAALSLQQ-RVNLPNNTQ 125
+E + + + STS G + +TF+ G + A Q LQ LP Q
Sbjct: 64 VIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQ 123

Query: 126 PNIQQSSQAGEILRYQLA---GPPHFGLTNLRTVQDWIVQRRLLTVPGVVQVNSWGGTTK 182
++ P ++ V+ L + GV V +G
Sbjct: 124 QQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQ-Y 182

Query: 183 QYDVEVDLHKLDAYNLTLQQVTTALSNSNINVGGREI-----AVGQQ-SVNIRGVGLFDS 236
+ +D L+ Y LT V L N + ++ GQQ + +I F +
Sbjct: 183 AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKN 242

Query: 237 GGEKDLTQGYKVSDIENVVL-TQMNGVPVQVKDVAKVSVGFVPRLGIAGRDSNDDVVMAI 295
+ V L +G V++KDVA+V +G IA + + I
Sbjct: 243 -----------PEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGI 291

Query: 296 VVMGRTYHTNEVLPRVEAEIAKMNSDGTLPPGVKLVPFYDRGTLISVTTRTVLHNLIFGC 355
+ + + ++A++A++ P G+K++ YD + ++ V+ L
Sbjct: 292 KLATGA-NALDTAKAIKAKLAELQP--FFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAI 348

Query: 356 ALVFLIQWLFLGDLRSAIIVGVNIPFALFFSVIVLVLLGQDANLLSVG--AVDFGIIVDS 413
LVFL+ +LFL ++R+ +I + +P L + +L G N L++ + G++VD
Sbjct: 349 MLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDD 408

Query: 414 AVILVENIFRNFQASDKHKQETLGYLSEGQWGPDPTRPKPGSPNLWTDRLRLILASALQV 473
A+++VEN+ R + E + P S Q+
Sbjct: 409 AIVVVENVER--------------VMMEDKLPP----------------KEATEKSMSQI 438

Query: 474 DKAVFFSAAITVAAFVPLFTMQGVEGQIFNPMARTYGYALVGALISTFTISPVLGSFLL- 532
A+ A + A F+P+ G G I+ + T A+ +++ ++P L + LL
Sbjct: 439 QGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLK 498

Query: 533 ---PEHVTETETIVVRALRAV------YAPALRWALGHRKLVASLGLAMLGVTGLLMMRL 583
EH Y ++ LG + ++ +L +RL
Sbjct: 499 PVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRL 558

Query: 584 GSEFLPHLEEGNLWIRATMPPTIGLMSGEPVAHKAREILLRH--PEITTVVTQHGRPDDG 641
S FLP ++G +P + V + + L++ + +V T +G G
Sbjct: 559 PSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNEKANVESVFTVNGFSFSG 618

Query: 642 SDAAGFNNLELFAPLKPFDQWP-AGLTKDKLTKQLQQEFADEAPGVVFNFSQYIQDNFEE 700
N F LKP+++ + + + + + E G V F+
Sbjct: 619 Q---AQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDGFVIPFNMPAIVELGT 675

Query: 701 QLSGVKGANSAKIVGPDLVTLEELARQVRHEMAQVRGIEDLDVF--WVRGQPNLNIKVDR 758
+ G L + Q+ AQ + V + ++VD+
Sbjct: 676 --ATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPA-SLVSVRPNGLEDTAQFKLEVDQ 732

Query: 759 ERAARYGLNTGDVNTLVQAALGGAHATALLEADRQFNVVVRLPAEYRESLEAVRNIKVGI 818
E+A G++ D+N + ALGG + ++ R + V+ A++R E V + V
Sbjct: 733 EKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRMLPEDVDKLYV-- 790

Query: 819 NTPAAANAYIPLSELADITLDTGSSYIYHESRERYIPVKFSVRDRDLGGAVAEAQERIAE 878
+A +P S GS + + + ++ G E +A
Sbjct: 791 --RSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAMALMENLAS 848

Query: 879 NVKLPPGYRVLWAGEFESLQLAKKRLEVIVPISLAMILVLLYGLFNSLRDSLLALAGIPF 938
KLP G W G +L+ + +V IS ++ + L L+ S + + +P
Sbjct: 849 --KLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPL 906

Query: 939 AVAGGIIALYVTGLDFSISAAIGFVSLFGVSVMDGILMITYYNQARQGVADSV-EAMFQA 997
+ G ++A + + +G ++ G+S + IL++ + + V EA A
Sbjct: 907 GIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMA 966

Query: 998 STQRMRPMLMTAMSACIGLFPAALSEGIGAQVQRPLATVVVGGMLIGPIMLLVVVPALRV 1057
R+RP+LMT+++ +G+ P A+S G G+ Q + V+GGM+ ++ + VP V
Sbjct: 967 VRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFV 1026

Query: 1058 ML 1059
++
Sbjct: 1027 VI 1028



Score = 71.8 bits (176), Expect = 1e-14
Identities = 65/350 (18%), Positives = 137/350 (39%), Gaps = 21/350 (6%)

Query: 724 LARQVRHEMAQVRGIEDLDVFWVRGQPNLNIKVDRERAARYGLNTGDVNTLVQAAL---- 779
+A V+ ++++ G+ D+ +F Q + I +D + +Y L DV ++
Sbjct: 158 VASNVKDTLSRLNGVGDVQLF--GAQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIA 215

Query: 780 GGAHATALLEADRQFNVVVRLPAEYRESLEAVRNIKVGINTPAAANAYIPLSELADITLD 839
G +Q N + ++ + E + + +N+ + + L ++A + L
Sbjct: 216 AGQLGGTPALPGQQLNASIIAQTRFK-NPEEFGKVTLRVNSDGSV---VRLKDVARVELG 271

Query: 840 TGSSYIYHESRERYIPVKFSVRDRDLGGAVAEAQ---ERIAE-NVKLPPGYRVLWAGEFE 895
G +Y ++ A+ A+ ++AE P G +VL+ ++
Sbjct: 272 -GENYNVIARINGKPAAGLGIKLATGANALDTAKAIKAKLAELQPFFPQGMKVLYP--YD 328

Query: 896 SLQLAKKRLE-VIVPISLAMILVLL--YGLFNSLRDSLLALAGIPFAVAGGIIALYVTGL 952
+ + + V+ + A++LV L Y ++R +L+ +P + G L G
Sbjct: 329 TTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGY 388

Query: 953 DFSISAAIGFVSLFGVSVMDGILMI-TYYNQARQGVADSVEAMFQASTQRMRPMLMTAMS 1011
+ G V G+ V D I+++ + EA ++ +Q ++ AM
Sbjct: 389 SINTLTMFGMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMV 448

Query: 1012 ACIGLFPAALSEGIGAQVQRPLATVVVGGMLIGPIMLLVVVPALRVMLLG 1061
P A G + R + +V M + ++ L++ PAL LL
Sbjct: 449 LSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLK 498


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
BBta_1549  HTHFIS  75     3e-18    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 75.3 bits (185), Expect = 3e-18
Identities = 34/123 (27%), Positives = 57/123 (46%), Gaps = 3/123 (2%)

Query: 7 KILCIEDDRETAALIVEELTERGFDVTLAYDGGEGFAAIFRTMPDLVLCDINMRVMSGFE 66
IL +DD ++ + L+ G+DV + + + I DLV+ D+ M + F+
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 67 VLEHLTKIAPRFNNMPFIFLTALTDRRNELKGRQLGADDYVTKPIDFDILVSIINARLAH 126
+L + K P +P + ++A +K + GA DY+ KP D L+ II LA
Sbjct: 65 LLPRIKKARPD---LPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 127 VAR 129
R
Sbjct: 122 PKR 124


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
BBta_1552  OMPADOMAIN  31     0.009    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 31.4 bits (71), Expect = 0.009
Identities = 20/96 (20%), Positives = 32/96 (33%), Gaps = 12/96 (12%)

Query: 80 IYATAGLAYSQGRLTEDPFDPATQKKIGFRAGWVAGAGVEAPVDGNWTARIEYLY-SNFG 138
IY G + D K V GVE + R+EY + +N G
Sbjct: 114 IYTRLGGMVWRA----DTKSNVYGKNHDTGVSPVFAGGVEYAITPEIATRLEYQWTNNIG 169

Query: 139 SLNVMLPSGTSYGSTFDLHTVRLGLNRKLGGSAKEP 174
+ + G+ D + LG++ + G P
Sbjct: 170 DAH-------TIGTRPDNGMLSLGVSYRFGQGEAAP 198


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
BBta_1584  GPOSANCHOR  33     0.002    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 32.7 bits (74), Expect = 0.002
Identities = 28/88 (31%), Positives = 38/88 (43%), Gaps = 3/88 (3%)

Query: 19 LQAEREARLRAEAVAASARAELSDNEVLIAHLELRI---EKLKRELHGQRSERTARLIEQ 75
L A REA+ + E A ++L+ E L LE EK K EL + L E+
Sbjct: 388 LDASREAKKQVEKALEEANSKLAALEKLNKELEESKKLTEKEKAELQAKLEAEAKALKEK 447

Query: 76 LELELEELATTASEDELAAQVAAAKTQN 103
L + EELA + +Q AK N
Sbjct: 448 LAKQAEELAKLRAGKASDSQTPDAKPGN 475


16  BBta_1873  BBta_1899  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
BBta_1873  3       14       -0.038437   flagellar motor switch protein
BBta_1874  2       13       0.637561    flagellar assembly protein H
BBta_1875  1       12       0.492940    flagellar motor switch protein G
BBta_1876  1       11       1.341277    flagellar MS-ring protein
BBta_1878  1       10       1.573318    flagellar basal-body rod modification protein
BBta_1880  0       13       1.476514    flagellar hook length determination protein
BBta_1881  -1      16       -1.632815   tRNA-specific 2-thiouridylase MnmA
BBta_1882  2       22       -3.899087   phosphatidyl-N-methylethanolamine
BBta_1883  4       30       -5.387592   hypothetical protein
BBta_1884  4       33       -7.829627*  hypothetical protein
BBta_1886  2       31       -5.206187   hypothetical protein
BBta_1888  2       27       -4.368733   hypothetical protein
BBta_1889  2       26       -3.581514   non-heme chloroperoxidase
BBta_1890  2       26       -3.256161   hypothetical protein
BBta_1891  2       25       -2.602199   hypothetical protein
BBta_1892  2       24       -2.084849   transcriptional regulator
BBta_1894  3       21       -2.520521   twin-arginine translocation pathway signal
BBta_1895  4       27       -3.036193   cytochrome C
BBta_1896  2       13       -3.653291   hypothetical protein
BBta_1897  -1      15       -3.790831   hypothetical protein
BBta_1899  -1      14       -3.531412   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_1873  FLGMOTORFLIN  92     2e-27    Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 92.3 bits (229), Expect = 2e-27
Identities = 35/76 (46%), Positives = 55/76 (72%)

Query: 36 ADLEAVFDVPVQVSAVLGRSKMDVGELLKLGPGTVLELDRKVGEAIDIYVNNRLVARGEV 95
D++ + D+PV+++ LGR++M + ELL+L G+V+ LD GE +DI +N L+A+GEV
Sbjct: 52 QDIDLIMDIPVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEV 111

Query: 96 VLVEDKLGVTMTEIIK 111
V+V DK GV +T+II
Sbjct: 112 VVVADKYGVRITDIIT 127


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_1875  FLGMOTORFLIG  302    e-103    Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 302 bits (776), Expect = e-103
Identities = 111/340 (32%), Positives = 196/340 (57%), Gaps = 2/340 (0%)

Query: 24 RQAQREKTEPLSGPRRAAVMMLALGEQYGGKIWQQLDDDEVRELSLAMSTLGTVEADVVE 83
++ + L+G ++AA++++++G + K+++ L +E+ L+ ++ L T+ +++ +
Sbjct: 5 KEKEILDVSALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITSELKD 64

Query: 84 DLMLEFVSRMSASGALM-GNFDATERLLQQYLPPERVNGIMDEIRGPAGRNMWEKLSNVQ 142
+++LEF M A + G D LL++ L ++ I++ + +E +
Sbjct: 65 NVLLEFKELMMAQEFIQKGGIDYARELLEKSLGTQKAVDIINNLGSALQSRPFEFVRRAD 124

Query: 143 EEVLANYLKNEYPQTIAVVLSKLKPEHAARVLAILPEDMALDVIGRMLRMEAVQKEVIER 202
+ N+++ E+PQTIA++LS L P+ A+ +L+ LP ++ +V R+ M+ EV+
Sbjct: 125 PANILNFIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEVVRE 184

Query: 203 VEQTLRVEFMSNLSQTRR-RDAHEVMAEIFNNFDRQTETRFITSLEEENRESAERIKALM 261
VE+ L + S S+ + + EI N DR+TE I SLEEE+ E AE IK M
Sbjct: 185 VERVLEKKLASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEIKKKM 244

Query: 262 FTFDDLIKLDSASAQTLLRNVDKDKLGVALKSANEEVRNFFFGNMSSRAAKMLQDDMAAM 321
F F+D++ LD S Q +LR +D +L ALKS + V+ F NMS RAA ML++DM +
Sbjct: 245 FVFEDIVLLDDRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKEDMEFL 304

Query: 322 GPVRLRDVDEAQALLVNLAKDLAAKGEIMLSKNRADDELV 361
GP R +DV+E+Q +V+L + L +GEI++S+ +D LV
Sbjct: 305 GPTRRKDVEESQQKIVSLIRKLEEQGEIVISRGGEEDVLV 344


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_1876  FLGMRINGFLIF  340    e-113    Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 340 bits (873), Expect = e-113
Identities = 169/557 (30%), Positives = 262/557 (47%), Gaps = 47/557 (8%)

Query: 5 LDFLKGLGAARLTAMIAVTAALIGFFGFVIMRVTAPQMTTLFTDLGMDDSSSIIKDLERQ 64
L++L L A +I +A + +++ P TLF++L D +I+ L +
Sbjct: 13 LEWLNRLRANPRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQDGGAIVAQLTQM 72

Query: 65 GIPFEIRNEGSVILVPKDKVTRLRMKLAEGGLPKGGGVGYEIFDKSDALGTTSFVQNINH 124
IP+ N I VP DKV LR++LA+ GLPKGG VG+E+ D+ G + F + +N+
Sbjct: 73 NIPYRFANGSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQE-KFGISQFSEQVNY 131

Query: 125 LRALEGELARTIRAIDRIQAARVHLVLPERPLFSRETPEPSASIVVRVRG--ALDAAQIR 182
RALEGELARTI + +++ARVHL +P+ LF RE PSAS+ V + ALD QI
Sbjct: 132 QRALEGELARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRALDEGQIS 191

Query: 183 AIRHLVASAVNGLKPQRVSIVDEAGQLLA---DGTQTDIDQQVGDERRNTFEKRMRKQVE 239
A+ HLV+SAV GL P V++VD++G LL + D Q+ E R+++++E
Sbjct: 192 AVVHLVSSAVAGLPPGNVTLVDQSGHLLTQSNTSGRDLNDAQLKFAND--VESRIQRRIE 249

Query: 240 DIVSSVVGAGRARVQLSADFDFNRITQTSDRYDPEGRVLRSSQTREEQSASSE------- 292
I+S +VG G Q++A DF QT + Y P G +++ + + S +
Sbjct: 250 AILSPIVGNGNVHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNISEQVGAGYPG 309

Query: 293 ------SNGQVTVNNE----LPGNQQNQQQ---------QQPPRDQSKKTEETNNYEISR 333
SN N P NQQN Q +S + ET+NYE+ R
Sbjct: 310 GVPGALSNQPAPPNEAPIATPPTNQQNAQNTPQTSTSTNSNSAGPRSTQRNETSNYEVDR 369

Query: 334 TTKTEVTEAGRVNRISVAVLVDGAYSKNEKGELVYKERSKEELDRIAALVRSAIGFDQKR 393
T + G + R+SVAV+V+ + K + +++ +I L R A+GF KR
Sbjct: 370 TIRHTKMNVGDIERLSVAVVVNYKTLADGKPL----PLTADQMKQIEDLTREAMGFSDKR 425

Query: 394 GDQVEVVNLKFAD-APTVPQINEPSGFLGMLQFTKDDVMYVIELAVMMLLGIVVVFMVVR 452
GD + VVN F+ T ++ F F + L V+++ I+ VR
Sbjct: 426 GDTLNVVNSPFSAVDNTGGELP----FWQQQSFIDQLLAAGRWLLVLVVAWILWRK-AVR 480

Query: 453 PLVKKIIASDEVAAALKSAVPALTDETAQAQAHAQQTATLIDVAQVQGQVHAQSVHRVGE 512
P + + E A A + + + + L Q R+ E
Sbjct: 481 PQLTRR---VEEAKAAQEQAQVRQETEEAVEVRLSKDEQLQQRRANQRLGAEVMSQRIRE 537

Query: 513 LADRNPGEAAAIIRQWL 529
++D +P A +IRQW+
Sbjct: 538 MSDNDPRVVALVIRQWM 554


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
BBta_1880  FLGHOOKFLIK  32     0.006    Flagellar hook-length control protein signature.
		>FLGHOOKFLIK#Flagellar hook-length control protein signature.

Length = 375

Score = 32.1 bits (72), Expect = 0.006
Identities = 44/180 (24%), Positives = 71/180 (39%), Gaps = 5/180 (2%)

Query: 353 AQAANQANAPASDPTAQLATALQPQLTTPTSQSGQITSANLTATAATATAVPLHGLAVEI 412
A Q+ A + + A P +T +Q +A + +A + L+ I
Sbjct: 190 LVAEAQSKAEVISTPSPVTAAASPLITPHQTQPLPTVAAPVL-SAPLGSHEWQQSLSQHI 248

Query: 413 AASALNGKSRFEIRLDPAELGRIDVRIDVDRNGQVTSHLRVEKPETLAMLQQTAPQLQQA 472
+ G+ E+RL P +LG + + + VD N Q + A L+ P L+
Sbjct: 249 SLFTRQGQQSAELRLHPQDLGEVQISLKVDDN-QAQIQMVSPHQHVRAALEAALPVLRTQ 307

Query: 473 LQDAGL---KSNNSGLQFSLRDQNSSGQNGGDNQQNGNAQRLIVTEDETVPAQLAGRSYG 529
L ++G+ +SN SG FS + Q +S Q N + VP L GR G
Sbjct: 308 LAESGIQLGQSNISGESFSGQQQAASQQQQSQRTANHEPLAGEDDDTLPVPVSLQGRVTG 367


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
BBta_1897  SALSPVBPROT  30     7e-04    Salmonella virulence plasmid 65kDa B protein signature.
		>SALSPVBPROT#Salmonella virulence plasmid 65kDa B protein signature.

Length = 591

Score = 29.7 bits (66), Expect = 7e-04
Identities = 18/55 (32%), Positives = 22/55 (40%), Gaps = 7/55 (12%)

Query: 3 PSGLASIPQPVP---GRGEV----LVKVAASGVNPLGIKIRAGVAAHARHPFHAV 50
P GLASI P+P RG L + G P G+ + AR H V
Sbjct: 32 PDGLASITLPLPISAERGFAPALALHYSSGGGNGPFGVGWSCATMSIARSTSHGV 86


17  BBta_1954  BBta_1963  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
BBta_1954  2       14       -1.716819   branched chain amino acid ABC transporter
BBta_1955  1       12       -1.374129   branched-chain amino acid ABC transporter
BBta_1956  0       13       -0.588617   branched chain amino acid ABC transporter
BBta_1957  1       12       0.026086    branched-chain amino acid ABC transporter
BBta_1958  1       12       0.449344    acylamide amidohydrolase
BBta_1959  2       12       2.121976    urease accessory protein UreE
BBta_1960  2       10       2.588575    urease accessory protein UreF
BBta_1961  3       10       2.277108    bifunctional urease subunit gamma/beta
BBta_1962  2       9        2.525694    urease subunit alpha
BBta_1963  2       8        3.575068    urease accessory protein UreG
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BBta_1957  PF05272  29     0.020    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.9 bits (64), Expect = 0.020
Identities = 17/54 (31%), Positives = 22/54 (40%), Gaps = 13/54 (24%)

Query: 34 GRNGTGKTTLLKTLMGLTDRMDGQIRLGDQ-------------EIGREPTFRRA 74
G G GK+TL+ TL+GL D +G E+ FRRA
Sbjct: 603 GTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQIAGIVAYELSEMTAFRRA 656


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
BBta_1962  UREASE  872    0.0      Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 872 bits (2255), Expect = 0.0
Identities = 315/570 (55%), Positives = 397/570 (69%), Gaps = 8/570 (1%)

Query: 3 TIDRRDYAALYGPTIGDAVRLGDTSLFAVVEKDHAVYGDECLHGGGKTLRDGIGMAGLTS 62
+ R YA ++GPT+GD VRL DT LF VEKD +G+E GGGK +RDG+G + +T
Sbjct: 4 RMSRAAYANMFGPTVGDKVRLADTELFIEVEKDFTTHGEEVKFGGGKVIRDGMGQSQVTR 63

Query: 63 AEGALDFLLCNVTVIDPVIGIVKGDLGIRNGRIVGIGKAGNPAIMDGVDPRLIVSTGTTV 122
GA+D ++ N ++D GIVK D+G+++GRI IGKAGNP + GV +IV GT V
Sbjct: 64 EGGAVDTVITNALILDHW-GIVKADIGLKDGRIAAIGKAGNPDMQPGVT--IIVGPGTEV 120

Query: 123 RDCEGLIATPGAIDVHVHFDSAGLVEHALASGITTMIGGSLGPIT---VGIDSGGPFNTG 179
EG I T G +D H+HF +E AL SG+T M+GG GP + GP++
Sbjct: 121 IAGEGKIVTAGGMDSHIHFICPQQIEEALMSGLTCMLGGGTGPAHGTLATTCTPGPWHIA 180

Query: 180 KMLQAAEAWPMNFGFLGRGNSHREAPLIEQLQTGVMGLKLHEDWGTMPAAIDTCLRVADA 239
+M++AA+A+PMN F G+GN+ L+E + G LKLHEDWGT PAAID CL VAD
Sbjct: 181 RMIEAADAFPMNLAFAGKGNASLPGALVEMVLGGATSLKLHEDWGTTPAAIDCCLSVADE 240

Query: 240 QDFQVQIHTDTLNESGFVENTLEAIGGRTIHMYHTEGAGGGHAPDIIRVAGEMHCLPSST 299
D QV IHTDTLNESGFVE+T+ AI GRTIH YHTEGAGGGHAPDIIR+ G+ + +PSST
Sbjct: 241 YDVQVMIHTDTLNESGFVEDTIAAIKGRTIHAYHTEGAGGGHAPDIIRICGQPNVIPSST 300

Query: 300 NPTNPYTVNTFDEHLDMTMVCHHLNPAIPEDVAFAESRIRAQTIAAEDVLHDIGAISMLG 359
NPT PYTVNT EHLDM MVCHHL+P IPED+AFAESRIR +TIAAED+LHDIGA S++
Sbjct: 301 NPTRPYTVNTLAEHLDMLMVCHHLSPTIPEDIAFAESRIRKETIAAEDILHDIGAFSIIS 360

Query: 360 SDSQGMGRIHEVICRTWQLASKMKDQRGALPEDRPGFGDNARIRRYIAKYTINAAKTFGI 419
SDSQ MGR+ EV RTWQ A KMK QRG L E+ G DN R++RYIAKYTIN A G+
Sbjct: 361 SDSQAMGRVGEVAIRTWQTADKMKRQRGRLKEE-TGDNDNFRVKRYIAKYTINPAIAHGL 419

Query: 420 ADHIGSLEDGKIADIVVWRPAFFGIKPELVIKSGFIAWGAMGDSAASLMTCEPMLLRPQW 479
+ IGSLE GK AD+V+W PAFFG+KP++V+ G IA MGD AS+ T +P+ RP +
Sbjct: 420 SHEIGSLEVGKRADLVLWNPAFFGVKPDMVLLGGTIAAAPMGDPNASIPTPQPVHYRPMF 479

Query: 480 GAFGRAPAALSACFVHPLAIARGLAAELGLSKQLLPVKGTR-GLTKRDMVWNDTCPTIRV 538
GA+GR+ S FV ++ GLA LG++K+L+ V+ TR G+ K M+ N P I V
Sbjct: 480 GAYGRSRTNSSVTFVSQASLDAGLAGRLGVAKELVAVQNTRGGIGKASMIHNSLTPHIEV 539

Query: 539 DPETFDVFVNGELATCEPARELPLARRYML 568
DPET++V +GEL TCEPA LP+A+RY L
Sbjct: 540 DPETYEVRADGELLTCEPATVLPMAQRYFL 569


18  BBta_1998  BBta_2008  Y  N  N  Genomic Island
LocusTag   DNBias  CDNBias  %GCBias     Product
BBta_1998  2       14       -0.107694   Ni/Fe-hydrogenase, 1 b-type cytochrome subunit
BBta_1999  4       15       1.233441    hydrogenase maturation
BBta_2000  2       18       1.222999    hydrogenase expression/formation
BBta_2001  1       18       1.077991    hydrogenase expression/formation
BBta_2002  3       16       -0.391654   hydrogenase expression/formation
BBta_2003  5       16       0.941481    rubredoxin
BBta_2004  5       16       0.904444    hydrogenase expression/formation
BBta_2005  5       14       0.565170    hydrogenase expression/formation
BBta_2006  6       12       0.524840    hydrogenase expression/formation
BBta_2007  4       10       0.301159    hydrogenase nickel incorporation
BBta_2008  2       9        0.695842    carbamoyl phosphate phosphatase, (NiFe)
19  BBta_2100  BBta_2105  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
BBta_2100  2       13       1.846389    hypothetical protein
BBta_2101  2       12       1.451017    Atrazine chlorohydrolase
BBta_2102  2       13       1.181526    uracil phosphoribosyltransferase
BBta_2103  2       12       1.045841    hypothetical protein
BBta_2104  2       13       0.902831    hypothetical protein
BBta_2105  2       12       1.364571    N-isopropylammelide isopropylaminohydrolase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
BBta_2105  UREASE  44     1e-06    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 43.6 bits (103), Expect = 1e-06
Identities = 23/72 (31%), Positives = 34/72 (47%), Gaps = 12/72 (16%)

Query: 1 MDLIIRNAVLAQQGELRCADIGISKGRIAAIA----PSLQ--------ADGEARDAENCL 48
+D +I NA++ + ADIG+ GRIAAI P +Q E E +
Sbjct: 68 VDTVITNALILDHWGIVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAGEGKI 127

Query: 49 VVPGLIETHIHL 60
V G +++HIH
Sbjct: 128 VTAGGMDSHIHF 139


20  BBta_2628  BBta_2648  Y  N  N  Genomic Island
LocusTag   DNBias  CDNBias  %GCBias     Product
BBta_2628  -1      22       -3.309316   hypothetical protein
BBta_2629  0       24       -3.435947   AraC family transcriptional regulator
BBta_2630  0       24       -3.116517   enoyl-CoA hydratase
BBta_2631  1       25       -3.090448   formyl-coenzyme A transferase L-carnitine
BBta_2632  1       28       -3.458417   fumarylacetoacetate (FAA) hydrolase
BBta_2633  2       26       -4.012420   carbon-monoxide dehydrogenase small subunit
BBta_2634  2       23       -3.471653   carbon-monoxide dehydrogenase large subunit
BBta_2635  -1      16       -3.320339   carbon monoxide dehydrogenase medium subunit
BBta_2637  -1      14       -3.267459   enoyl-CoA hydratase
BBta_2638  -1      12       -1.128550   hypothetical protein
BBta_2639  -1      11       0.841628    medium-chain-fatty-acid--CoA ligase
BBta_2640  1       12       2.571606    RuBisCO operon transcriptional regulator
BBta_2641  2       11       2.858529    ribulose bisophosphate carboxylase
BBta_2642  5       12       4.675840    ribulose 1,5-bisphosphate carboxylase small
BBta_2643  6       13       5.217328    carboxysome structural peptide CsoS2
BBta_2644  7       17       4.372946    carboxysome structural peptide
BBta_2645  3       14       4.317156    carboxysome peptide A
BBta_2646  3       13       3.424458    carboxysome peptide B
BBta_2647  2       12       3.235379    major carboxysome shell protein 1C
BBta_2648  1       11       3.157156    major carboxysome shell protein 1A
21  BBta_2658  BBta_2667  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
BBta_2658  2       29       -3.092366   hypothetical protein
BBta_2659  1       28       -3.269086   LysR family transcriptional regulator
BBta_2660  1       28       -3.148098   transposase
BBta_2661  -1      25       -2.941357   transposase
BBta_2662  0       27       -2.912352   LysR family transcriptional regulator
BBta_2663  1       21       -3.023552   NADH-azoreductase, FMN-dependent
BBta_2664  1       23       -3.291853   pirin-like protein
BBta_2665  2       20       -2.908831   hypothetical protein
BBta_2666  2       18       -2.691067   iron-regulated membrane protein
BBta_2667  4       14       -2.539133   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_2665  NUCEPIMERASE  34     4e-04    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 34.4 bits (79), Expect = 4e-04
Identities = 25/125 (20%), Positives = 40/125 (32%), Gaps = 28/125 (22%)

Query: 2 YAVTGATGHLGRKVVARLREFVPSSEIVAV----------VRDARKA--ADLGVGIRTAD 49
Y VTGA G +G V RL E ++V + ++ AR A G D
Sbjct: 3 YLVTGAAGFIGFHVSKRLLE--AGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 50 YADPRALE--------------GALAGIERLLLISSSSFGTRQAEHANVIAAAKTAGVGH 95
AD + + L + + N++ + + H
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120

Query: 96 LVYTS 100
L+Y S
Sbjct: 121 LLYAS 125


22  BBta_3043  BBta_3049  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
BBta_3043  2       10       1.035871    sufE-like protein
BBta_3044  1       12       1.164571    hypothetical protein
BBta_3045  2       11       1.100917    multi-sensor signal transduction histidine
BBta_3046  2       9        0.916823    hypothetical protein
BBta_3047  3       10       0.058168    hypothetical protein
BBta_3048  3       10       0.026686    hypothetical protein
BBta_3049  2       11       0.067277    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BBta_3045  PF06580  44     8e-07    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 44.5 bits (105), Expect = 8e-07
Identities = 22/105 (20%), Positives = 38/105 (36%), Gaps = 26/105 (24%)

Query: 456 LVSNAIKF----TERGGRVMVSAAIEGPQLLLRITDTGVGIAADDLKRIGDPFFQAGKTY 511
LV N IK +GG++++ + + L + +TG
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLAL------------------ 304

Query: 512 QRRHEGTGLGLSIVKS-LVNLHGGE--MSIESKLDEGTTVSIALP 553
+ E TG GL V+ L L+G E + + K + +P
Sbjct: 305 KNTKESTGTGLQNVRERLQMLYGTEAQIKLSEK-QGKVNAMVLIP 348


23  BBta_3119  BBta_3185  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
BBta_3119  3       23       -0.849724   glutathione S-transferase
BBta_3123  1       18       -0.500431   hypothetical protein
BBta_3124  0       12       2.083319    integrase/recombinase
BBta_3125  2       14       1.801919    hypothetical protein
BBta_3126  1       15       1.616242    hypothetical protein
BBta_3127  -1      12       0.514617    hypothetical protein
BBta_3128  -1      10       1.509458    sulfopyruvate decarboxylase subunit alpha
BBta_3129  -1      9        1.883256    D-3-phosphoglycerate dehydrogenase (PGDH)
BBta_3130  -2      9        1.744912    hypothetical protein
BBta_3131  -2      8        2.585698    phosphinothricin N-acetyltransferase
BBta_3132  -1      8        2.560887    ornithine decarboxylase
BBta_3133  4       11       3.539421    nicotinate-nucleotide--dimethylbenzimidazole
BBta_3134  2       11       2.847886    alpha-ribazole phosphatase
BBta_3135  3       12       2.510617    cobalamin-5'-phosphate synthase
BBta_3136  2       14       2.456855    aminotransferase
BBta_3137  2       13       1.701483    adenosylcobinamide-phosphate synthase
BBta_3138  1       11       1.296846    cobyric acid synthase
BBta_3139  1       12       2.368274    cob(I)yrinic acid a,c-diamide
BBta_3140  2       14       2.689900    adenosylcobinamide kinase
BBta_3141  2       14       2.586830    hypothetical protein
BBta_3142  2       15       2.291064    hypothetical protein
BBta_3143  2       15       2.507023    cobalamin synthesis protein cobW
BBta_3144  3       15       2.933439    cobaltochelatase subunit CobN
BBta_3145  3       16       3.002130    precorrin-3B synthase (cobG)
BBta_3146  3       16       2.870607    precorrin-8X methylmutase
BBta_3147  2       16       2.866336    cobalt-factor II
BBta_3148  3       13       3.845430    precorrin-3 methyltransferase
BBta_3149  3       13       3.945300    precorrin-6A reductase
BBta_3150  1       14       3.517086    precorrin-6Y C5,15-methyltransferase
BBta_3151  1       12       2.858135    CobE protein
BBta_3152  0       13       2.156334    precorrin-4 C(11)-methyltransferase
BBta_3153  0       12       1.928707    cobalt-precorrin-6A synthase
BBta_3154  0       13       0.860075    cobyrinic acid a,c-diamide synthase
BBta_3155  3       11       -1.217580   cob(II)yrinic acid a,c-diamide reductase
BBta_3156  3       10       -1.137889   hypothetical protein
BBta_3157  4       12       -1.202854   hypothetical protein
BBta_3158  7       12       -2.131894   RNA polymerase sigma factor
BBta_3159  6       10       -2.817558   flagellin
BBta_3160  5       11       -2.797981   flagellar hook-associated protein FliD
BBta_3161  2       17       -2.081597   flagellar protein fliS
BBta_3162  4       12       -2.935022   hypothetical protein
BBta_3164  4       11       -3.122493   basal-body rod modification protein flgD
BBta_3165  1       14       -1.645136   flagellar hook protein FlgE
BBta_3166  0       14       -2.123218   hypothetical protein
BBta_3167  1       14       -2.134540   hypothetical protein
BBta_3168  1       13       -1.962151   flagellar hook-associated protein FlgK
BBta_3169  3       13       -1.110688   flagellar hook-associated protein FlgL
BBta_3170  3       8        -0.454134   short-chain dehydrogenase
BBta_3171  3       7        -0.604715   cytochrome o ubiquinol oxidase subunit I
BBta_3172  4       12       1.579327    hypothetical protein
BBta_3174  2       20       -3.109817   hypothetical protein
BBta_3175  2       18       -3.437539   hypothetical protein
BBta_3176  3       20       -3.601998   O-acetylhomoserine
BBta_3177  4       22       -4.278721   NADH dehydrogenase/NAD(P)H nitroreductase
BBta_3178  2       19       -3.731369   ABC transporter substrate-binding protein
BBta_3179  1       20       -3.811073   hypothetical protein
BBta_3181  0       11       1.226682    hypothetical protein
BBta_3182  0       12       1.662976    TetR family transcriptional regulator
BBta_3183  2       13       2.036981    O-methyltransferase
BBta_3184  2       14       2.599215    polysaccharide deacetylase
BBta_3185  3       13       3.009103    ABC transporter permease
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
BBta_3125  RTXTOXINA  29     0.004    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 29.2 bits (65), Expect = 0.004
Identities = 19/57 (33%), Positives = 24/57 (42%), Gaps = 7/57 (12%)

Query: 8 SLVAASLLASGAAYAQSTTVE-------GAANGAAAGGAVAGPVGAMVGGTVGAAVG 57
SL+AA +GA A TT+ + AA V PV A+VG G G
Sbjct: 352 SLLAAFHKETGAIDASLTTISTVLASVSSGISAAATTSLVGAPVSALVGAVTGIISG 408



Score = 28.8 bits (64), Expect = 0.006
Identities = 6/27 (22%), Positives = 11/27 (40%)

Query: 30 AANGAAAGGAVAGPVGAMVGGTVGAAV 56
A AA G + + ++ V A+
Sbjct: 292 IAQRAAQGLSTSAAAAGLIASAVTLAI 318


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_3131  SACTRNSFRASE  32     7e-04    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 32.2 bits (73), Expect = 7e-04
Identities = 10/64 (15%), Positives = 27/64 (42%)

Query: 93 RFAVKHSIYVHHEHLGRGVGRLLMQALIDASAAAGYRQMIGYIDADNTASLGIHERFGFV 152
+A+ I V ++ +GVG L+ I+ + + ++ N ++ + + F+
Sbjct: 88 GYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFI 147

Query: 153 RVGL 156
+
Sbjct: 148 IGAV 151


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_3138  BACINVASINB  30  0.028  Salmonella/Shigella invasin protein B signature.
		>BACINVASINB#Salmonella/Shigella invasin protein B signature.

Length = 593

Score = 29.7 bits (66), Expect = 0.028
Identities = 29/133 (21%), Positives = 51/133 (38%), Gaps = 4/133 (3%)

Query: 84 SEIGSQVVVQGRVIGNAKASAY--QAMKPQLMKAVLDSFHHLIADTDIALVEGAGSASEI 141
+ +G V+V ++ A ++ QA+ P +M+ VL LI +EG G +
Sbjct: 344 AAVGLAVMVADEIVKAATGVSFIQQALNP-IMEHVLKPLMELIGKAITKALEGLGVDKKT 402

Query: 142 NLRAGDIAN-MGFAQATQIPVVLIGDIDRGGVIASLVGTQAVLAPDDAALIAGFLVNKFR 200
AG I + A A +V++ + +G ++ L+ L +
Sbjct: 403 AEMAGSIVGAIVAAIAMVAVIVVVAVVGKGAAAKLGNALSKMMGETIKKLVPNVLKQLAQ 462

Query: 201 GDPALFASGMSEI 213
LF GM I
Sbjct: 463 NGSKLFTQGMQRI 475


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_3159  FLAGELLIN  89  3e-22  Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 88.9 bits (220), Expect = 3e-22
Identities = 74/270 (27%), Positives = 127/270 (47%), Gaps = 12/270 (4%)

Query: 4 ISTNIAANSAVRYLNINSNQETSSLSKLSSGSRITSASDDAAGLAISTRISSDITTLQQA 63
I+TN + LN + + +S++ +LSSG RI SA DDAAG AI+ R +S+I L QA
Sbjct: 4 INTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQA 63

Query: 64 ATNAAQATSILQTADGGASNISDILARMKSLASESASGTTVGTSRTYIQSEFSQLISEIS 123
+ NA SI QT +G + I++ L R++ L+ ++ +GT + IQ E Q + EI
Sbjct: 64 SRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEID 123

Query: 124 SIATGTRYSGQSLLDGTSTFSAGVSVLVGSSATDTIDIKLSDLTASTLGVSSLDVSTQSG 183
++ T+++G +L + + + VG++ +TI I L + +LG+ +V+
Sbjct: 124 RVSNQTQFNGVKVLSQDNQ----MKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKE 179

Query: 184 AT--------AALTTLDTAIDTVSKARADIGAQESRFNFSADSISTQTQNLQSANSAIKD 235
AT +T DT +K R D+ + + +A ++ + + D
Sbjct: 180 ATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTD 239

Query: 236 VDIASEQAKLSSAQVKTQAAVSAEAAANQI 265
+ L T A+A A I
Sbjct: 240 DAENNTAVDLFKTTKSTAGTAEAKAIAGAI 269



Score = 57.4 bits (138), Expect = 1e-11
Identities = 45/215 (20%), Positives = 81/215 (37%), Gaps = 2/215 (0%)

Query: 61 QQAATNAAQATSILQTADGGASNISDILARMKSLASESASGTTVGTSRTYIQSEFSQLIS 120
T + ++I+ A + + +S+ +
Sbjct: 292 GNDGNGKVSTTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNE 351

Query: 121 EISSIATGTRYSGQSLLDGTSTFSAGVSVLVGSSATDTIDIKLSDLTASTLGV--SSLDV 178
+ + T + + G T D TAS + +
Sbjct: 352 SAKLSDLEANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAA 411

Query: 179 STQSGATAALTTLDTAIDTVSKARADIGAQESRFNFSADSISTQTQNLQSANSAIKDVDI 238
+ + L ++D+A+ V R+ +GA ++RF+ + ++ NL SA S I+D D
Sbjct: 412 AAKKSTANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADY 471

Query: 239 ASEQAKLSSAQVKTQAAVSAEAAANQIPQYLLKLL 273
A+E + +S AQ+ QA S A ANQ+PQ +L LL
Sbjct: 472 ATEVSNMSKAQILQQAGTSVLAQANQVPQNVLSLL 506


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_3165  FLGHOOKAP1  38  6e-05  Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 38.0 bits (88), Expect = 6e-05
Identities = 16/65 (24%), Positives = 29/65 (44%)

Query: 345 TATSGNATLQASGENGAGTIYGSELESSTTDTTGQFSNMISAQQAYSAASQVISAVNKMY 404
T+ T A+ N + + S + ++ N+ QQ Y A +QV+ N ++
Sbjct: 480 NKTATLKTSSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIF 539

Query: 405 DTLIS 409
D LI+
Sbjct: 540 DALIN 544



Score = 30.7 bits (69), Expect = 0.010
Identities = 21/66 (31%), Positives = 34/66 (51%), Gaps = 9/66 (13%)

Query: 4 SGALSSAISALNAQSSSLAMISDNISNADTTGYKTTSGLFEQLVTASSNSKAYSSG---- 59
S +++A+S LNA ++L S+NIS+ + GY + + A +NS + G
Sbjct: 1 SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQT-----TIMAQANSTLGAGGWVGN 55

Query: 60 GVSVSS 65
GV VS
Sbjct: 56 GVYVSG 61


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_3168  FLGHOOKAP1  113  1e-28  Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 113 bits (283), Expect = 1e-28
Identities = 123/593 (20%), Positives = 235/593 (39%), Gaps = 77/593 (12%)

Query: 9 VAFSGISATELQISVASSNISNADTKGYTEKS---ANQVATVTGG--VGTGVSITGISSN 63
A SG++A + ++ AS+NIS+ + GYT ++ A +T+ G VG GV ++G+
Sbjct: 6 NAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVSGVQRE 65

Query: 64 VDKLLLKSLIGANSELGAADTTNSYLEQLQQLFGSASTSGTSTTGTSLANALASLESALS 123
D + L A ++ + ++ + ++++S LA + ++L
Sbjct: 66 YDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSS--------LATQMQDFFTSLQ 117

Query: 124 SLASSPSSVSLQSAAVSALDDFASQLRSTSSGVQSLRANSDKDIASSVKSVNDDLQQIAD 183
+L S+ + + A + + +Q ++T ++ + I +SV +N+ +QIA
Sbjct: 118 TLVSNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIAS 177

Query: 184 LNVQIRKMAA--AGQSTADLEDQRNSALQDLSSYMNVSYYTASNGDLQVYTSSGRALVD- 240
LN QI ++ AG S +L DQR+ + +L+ + V G + ++G +LV
Sbjct: 178 LNDQISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQG 237

Query: 241 SSAHTLSYTASANVSASS--SYSSGGFSGIMVDGVDVTSQITSGKIGALITLRDQTLPAT 298
S+A L+ S+ + + +Y G I + + +G +G ++T R Q L T
Sbjct: 238 STARQLAAVPSSADPSRTTVAYVDGTAGNIEIPEKL----LNTGSLGGILTFRSQDLDQT 293

Query: 299 QTQLDQLATELKSALNAITNGASAVPPPTSLTGSASVSSTTALAASGTVRIAVADQSGNL 358
+ L QLA A N A+G
Sbjct: 294 RNTLGQLALAFAEAFNTQ--------------------HKAGFDANGDAGEDFFAIGKPA 333

Query: 359 VSYK---DLDLSSYATVGDLVTALNGISGVSASLD------SSGYLSISATSSSNGIAIN 409
V D++ ATV D L +S + + + + T +NG
Sbjct: 334 VLQNTKNKGDVAIGATVTDASAVLATDYKISFDNNQWQVTRLASNTTFTVTPDANGKVAF 393

Query: 410 DMT--SSVGGGGFSEYFGMNDLVTGTGAANFAV-DSSILSGAAGLPTGTLDNSATLTTGS 466
D + G ++ F + + + + D + ++ A+ G DN
Sbjct: 394 DGLELTFTGTPAVNDSFTLKPVSDAIVNMDVLITDEAKIAMASEEDAGDSDNR------- 446

Query: 467 QVLTSGSATIINQLYDKLTGSTSFSSAGGLSATTGSFADYAAAIVANVASKATQASSNYT 526
+G A + Q K G SF D A++V+++ +K ++
Sbjct: 447 ----NGQALLDLQSNSKTVGGAK------------SFNDAYASLVSDIGNKTATLKTSSA 490

Query: 527 AKSTAQASYASSLSSQSGVNIDEETARVSSLQNKYAAASQLISVVNSMFSSLL 579
+ ++ S SGVN+DEE + Q Y A +Q++ N++F +L+
Sbjct: 491 TQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALI 543


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_3170  DHBDHDRGNASE  74  1e-16  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 73.5 bits (180), Expect = 1e-16
Identities = 51/190 (26%), Positives = 78/190 (41%), Gaps = 1/190 (0%)

Query: 211 RDLRGTRVVVTGASSGIGRATALALAREGASVVLAARRENVLKDVALECETLGGRAIAVA 270
+ + G +TGA+ GIG A A LA +GA + L+ V + A A
Sbjct: 4 KGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFP 63

Query: 271 TDVTDADAVKRLAEQAVRTFGGVDVWINNAGTGVFGPYQDADMALHRKTVEVNLLGTMNG 330
DV D+ A+ + + R G +D+ +N AG G T VN G N
Sbjct: 64 ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNA 123

Query: 331 AYAVLPIFLRQRRGTLINNISLGGWAPTPFAAAYTASKFGLRGFSASLRQELTAHKDVHV 390
+ +V + +R G+++ S P AAY +SK F+ L EL A ++
Sbjct: 124 SRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLEL-AEYNIRC 182

Query: 391 CSVFPAMVDT 400
V P +T
Sbjct: 183 NIVSPGSTET 192


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_3182  HTHTETR  63  1e-14  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 63.5 bits (154), Expect = 1e-14
Identities = 29/183 (15%), Positives = 71/183 (38%), Gaps = 10/183 (5%)

Query: 14 QPQQARSTDLVAAILEAAVQVLTTEGAQRFTTARVAEKAGVSVGSLYQYFPNKAAILFRL 73
+ + + + IL+ A+++ + +G + +A+ AGV+ G++Y +F +K+ + +
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 74 QSDEWQQTTRLLHGILEDTSRPPLVRLRALVHAFLQ---SECDEAAIRGALDDAAPLYRD 130
L PL LR ++ L+ +E + + +
Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGE 122

Query: 131 APEARQATASGRRALLAFMREVLPDAGEA-------DRARAADLIKATLSQVGKHFSETP 183
+QA + + + L EA RAA +++ +S + +++ P
Sbjct: 123 MAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAP 182

Query: 184 RNR 186
++
Sbjct: 183 QSF 185


24  BBta_3214  BBta_3235  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
BBta_3214  3  14  3.451473  hypothetical protein
BBta_3216  4  13  3.705867  hypothetical protein
BBta_3217  4  12  3.804262  hypothetical protein
BBta_3218  3  12  4.066992  hypothetical protein
BBta_3219  2  9  3.278702  two component transcriptional regulator
BBta_3220  -1  7  2.021102  two-component sensor histidine kinase
BBta_3221  -1  8  0.920594  hypothetical protein
BBta_3222  -2  8  -0.259529  hypothetical protein
BBta_3223  -1  10  -0.253869  carbohydrate kinase
BBta_3224  -1  11  -0.862665  transcriptional regulator
BBta_3225  1  11  -0.059454  sugar ABC transporter periplasmic substrate
BBta_3226  3  12  0.999033  sugar ABC transporter permease
BBta_3227  3  12  1.226471  sugar ABC transporter permease
BBta_3228  3  12  1.797215  sugar ABC transporter ATP-binding protein
BBta_3229  1  12  1.480663  sugar ABC transporter ATP-binding protein
BBta_3230  0  11  -0.198244  carbohydrate kinase
BBta_3231  -1  11  -1.578981  D-3-phosphoglycerate dehydrogenase
BBta_3232  -1  12  -1.935505  L-fuculose phosphate aldolase
BBta_3233  -1  13  -1.759139  glycerol-3-phosphate dehydrogenase
BBta_3234  -1  14  -2.805832  vanillate O-demethylase oxidoreductase
BBta_3235  -1  12  -3.476531  toluate 1,2-dioxygenase subunit alpha
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_3218  PERTACTIN  29  0.005  Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 28.9 bits (64), Expect = 0.005
Identities = 21/73 (28%), Positives = 29/73 (39%), Gaps = 2/73 (2%)

Query: 15 ADPERDQLLPAGPVPPPCQAQVPPEQLPEPPKLVVPRRPVTAPLRVQALASLAVAAAAAA 74
A P GP P P Q P P+PP+ P + Q A ++AAA A
Sbjct: 567 APPAPKPAPQPGPQPGPQPPQPPQP--PQPPQPPQPPQRQPEAPAPQPPAGRELSAAANA 624

Query: 75 TNNKVAISLFTSL 87
N + L ++L
Sbjct: 625 AVNTGGVGLASTL 637


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_3219  HTHFIS  89  1e-22  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 89.1 bits (221), Expect = 1e-22
Identities = 39/163 (23%), Positives = 79/163 (48%), Gaps = 6/163 (3%)

Query: 6 ATLLMVEDDPEISRLVRDFMRREGFEIEVAENAAAMDAVLRRLRPDLIILDLMLPGEDGL 65
AT+L+ +DD I ++ + R G+++ + NAA + + DL++ D+++P E+
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 66 SICRRLRQSD-DIPILMLSAKSDEIDRVVGLELGADDYMVKPFGPRELLARVRALLRRAQ 124
+ R++++ D+P+L++SA++ + + E GA DY+ KPF EL+ + L +
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 125 ASPRREASRRFAFDRFVLDVDA-----RSIETVSGGDAPIQLT 162
P + V A R + + D + +T
Sbjct: 124 RRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMIT 166


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_3234  OMPADOMAIN  29  0.028  OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 28.7 bits (64), Expect = 0.028
Identities = 16/64 (25%), Positives = 26/64 (40%), Gaps = 18/64 (28%)

Query: 198 EAVKAAALTKGVPPDRIHFELF----------------RAETPS--SPDRPFEVELRSTG 239
++V ++KG+P D+I RA +PDR E+E++
Sbjct: 279 QSVVDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIEVKGIK 338

Query: 240 QVVT 243
VVT
Sbjct: 339 DVVT 342


25  BBta_3306  BBta_3323  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
BBta_3306  2  16  -1.916762  alkanesulfonate ABC transporter ATP binding
BBta_3307  5  19  -2.588873  acyl-CoA dehydrogenase
BBta_3308  7  25  -4.119257  sulfonate monooxygenase
BBta_3309  9  32  -6.391561  alkanesulfonate ABC transporter substrate
BBta_3310  9  37  -7.490324  hypothetical protein
BBta_3311  9  37  -7.535332  XRE family transcriptional regulator
BBta_3312  8  38  -8.741938  hypothetical protein
BBta_3313  8  37  -8.505579  hypothetical protein
BBta_3314  8  30  -6.894501  hypothetical protein
BBta_3315  5  23  -3.297372  hypothetical protein
BBta_3316  6  20  -1.224021  hypothetical protein
BBta_3318  5  20  0.203032  hypothetical protein
BBta_3319  6  17  -1.239998  hypothetical protein
BBta_3320  5  21  -2.902627  hypothetical protein
BBta_3321  5  24  -3.095457  hypothetical protein
BBta_3322  5  23  -3.229856  hypothetical protein
BBta_3323  1  22  -3.156850  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_3306  PF05272  29  0.020  Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.3 bits (65), Expect = 0.020
Identities = 16/36 (44%), Positives = 23/36 (63%), Gaps = 4/36 (11%)

Query: 39 VSLVVEPG---EFVALL-GPSGCGKSTLLRLIAGLD 70
V+ V+EPG ++ +L G G GKSTL+ + GLD
Sbjct: 585 VARVMEPGCKFDYSVVLEGTGGIGKSTLINTLVGLD 620


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_3322  IGASERPTASE  32  0.003  IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 31.6 bits (71), Expect = 0.003
Identities = 23/130 (17%), Positives = 40/130 (30%), Gaps = 15/130 (11%)

Query: 42 AATETEDPDVIA--ATMAKPG-VVLKRPAGSTGRFAEQSEL-----PDLDDEESIPKKKA 93
+ + P V + +A+ + PA +T +E +E ++ A
Sbjct: 1001 NNIQADVPSVPSNNEEIARVDEAPVPPPAPATP--SETTETVAENSKQESKTVEKNEQDA 1058

Query: 94 GRPEAKRR-----AAPTVSVEDSRKAAAQFEREQRRRDAQRRKEEAIREKERARREKLIA 148
A+ R A V AQ E + KE A EKE + +
Sbjct: 1059 TETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEK 1118

Query: 149 KAQAALEAAQ 158
+ +Q
Sbjct: 1119 TQEVPKVTSQ 1128



Score = 28.5 bits (63), Expect = 0.024
Identities = 25/168 (14%), Positives = 50/168 (29%), Gaps = 19/168 (11%)

Query: 45 ETEDPDVIAATMAKPGVVLKRPAGSTGRFAEQSELPDLDDEESIPKKKAGR-PEAKRRAA 103
+T D I V P+ + A E P + P + E ++ +
Sbjct: 990 QTVDTTNITTPNNIQADVPSVPSNNEEI-ARVDEAPVPPPAPATPSETTETVAENSKQES 1048

Query: 104 PTVSVEDSRKAAAQFEREQRRRDAQRRKEEAIR---------EKERARREKLIAKAQAAL 154
TV + + +EA E ++ E +
Sbjct: 1049 KTVEKNEQDATETT-------AQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETK 1101

Query: 155 EAAQQEHNERLGNLEAERRDLEKRIEAEDRRWEAEREKLRDALRRVRE 202
E A E E+ +E E+ ++ ++ + + E ++ RE
Sbjct: 1102 ETATVEKEEK-AKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARE 1148


26  BBta_3427  BBta_3447  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
BBta_3427  2  15  1.217439  cation efflux system protein cusB
BBta_3428  2  20  -0.180721  hypothetical protein
BBta_3429  -1  22  -1.792678  hypothetical protein
BBta_3430  1  24  -2.473828  hypothetical protein
BBta_3431  1  24  -3.571732  hypothetical protein
BBta_3432  0  26  -4.416587  hypothetical protein
BBta_3433  0  30  -4.528029  hypothetical protein
BBta_3434  1  31  -4.608712  hypothetical protein
BBta_3435  1  32  -4.866898  UspA stress protein
BBta_3436  1  34  -4.144739  ABC transporter ATP-binding protein
BBta_3437  2  35  -4.191423  hypothetical protein
BBta_3438  3  42  -5.267332  secretion protein HlyD
BBta_3439  4  43  -5.890103  hypothetical protein
BBta_3441  3  42  -5.424841  hypothetical protein
BBta_3442  2  40  -5.030330  cation-transporting ATPase
BBta_3443  4  43  -7.432142  hypothetical protein
BBta_3444  1  28  -5.976425  hypothetical protein
BBta_3445  0  22  -4.294938  hypothetical protein
BBta_3446  -1  22  -4.250693  hypothetical protein
BBta_3447  -1  17  -3.624701  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_3427  RTXTOXIND  51  8e-09  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 50.6 bits (121), Expect = 8e-09
Identities = 31/123 (25%), Positives = 45/123 (36%), Gaps = 15/123 (12%)

Query: 267 APRDGIVLERNA-VEGMRANPGDVLFRIA-DISLVWALVDVAERDLGSIAVGQPVTIRAR 324
AP V + EG + L I + + V +D+G I VGQ I+
Sbjct: 332 APVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVE 391

Query: 325 SFPGRTF---TGSIAVIYPQVNKDTRTA---RVRIEL-------ANSDLALLPDMYVDAE 371
+FP + G + I +D R V I + N ++ L M V AE
Sbjct: 392 AFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVTAE 451

Query: 372 IDT 374
I T
Sbjct: 452 IKT 454



Score = 30.6 bits (69), Expect = 0.014
Identities = 10/51 (19%), Positives = 22/51 (43%), Gaps = 2/51 (3%)

Query: 155 TLHVAVKAPGTIQLDERRVSVIAMRAESFVQKVADVTTGTRVKAGQPLMEI 205
+ + A G + R + + S V+++ V G V+ G L+++
Sbjct: 79 QVEIVATANGKLTHSGRSKEIKPI-ENSIVKEII-VKEGESVRKGDVLLKL 127


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_3436  PF05272  32  0.002  Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 32.0 bits (72), Expect = 0.002
Identities = 15/22 (68%), Positives = 16/22 (72%)

Query: 48 VVLLGPSGSGKSTLLNILGGLD 69
VVL G G GKSTL+N L GLD
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLD 620


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_3438  RTXTOXIND  45  3e-07  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 44.8 bits (106), Expect = 3e-07
Identities = 44/253 (17%), Positives = 79/253 (31%), Gaps = 31/253 (12%)

Query: 99 RTRREVEEMLGTGEANLERAKAVVERARALSDQANTDLARTRTLAQQGAATAQAFERAEL 158
+ + E L A A + R LS + L +L + A A E
Sbjct: 200 NQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQEN 259

Query: 159 AARLAERDLRAAEFQDHAAEHEISQLRALLARYSND------------------------ 194
A +LR + Q E EI + +
Sbjct: 260 KYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELA 319

Query: 195 -GGGQPEAWNVASPVAGVVLKVAQESE-TIVQPGTPLLDI-GDASDIEVVVDVLSTNAVE 251
+ +A + +PV+ V ++ +E +V L+ I + +EV V + +
Sbjct: 320 KNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGF 379

Query: 252 IRAGADVTID----NWGGEGKLKGRVRRVEPAAFTKISTLGVEEQRVNVLVDILSPSEQW 307
I G + I + G L G+V+ + A V +++ + LS +
Sbjct: 380 INVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNVIISIEENCLSTGNKN 439

Query: 308 ARLGDAYQVDVQI 320
L V +I
Sbjct: 440 IPLSSGMAVTAEI 452


27  BBta_3611  BBta_3619  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias  Product
BBta_3611  2  15  0.982719  NADP-dependent oxidoreductase
BBta_3612  3  15  -0.733683  hypothetical protein
BBta_3613  3  18  -1.905605  SUN-family protein, RNA methyltransferase
BBta_3614  4  25  -4.160507  GMP synthase
BBta_3615  4  41  -6.085429  hypothetical protein
BBta_3618  5  48  -5.272963  hypothetical protein
BBta_3619  3  41  -4.450087  hypothetical protein
28  BBta_3729  BBta_3747  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
BBta_3729  2  15  1.263532  hypothetical protein
BBta_3730  2  16  1.179089  carbon monoxide dehydrogenase medium subunit,
BBta_3731  2  15  1.002401  carbon-monoxide dehydrogenase small subunit,
BBta_3732  2  16  1.164571  xanthine dehydrogenase, molybdenum binding
BBta_3733  2  15  1.761140  branched-chain amino acid ABC transporter
BBta_3734  3  16  2.308985  branched-chain amino acid ABC transporter
BBta_3735  3  13  2.209377  branched-chain amino acid ABC transporter
BBta_3736  3  12  2.916498  branched-chain amino acid ABC transporter
BBta_3737  0  12  2.576439  branched-chain amino acid ABC transporter
BBta_3738  1  11  1.993243  ethanolamine ammonia-lyase light chain
BBta_3739  0  10  2.087974  ethanolamine ammonia-lyase heavy chain
BBta_3740  0  11  2.316609  hypothetical protein
BBta_3741  -1  10  2.837164  hypothetical protein
BBta_3742  -1  10  2.837811  ribonuclease Z
BBta_3743  0  11  3.102095  glycerophosphoryl diester phosphodiesterase
BBta_3744  0  12  3.488809  protein serine-threonine phosphatase
BBta_3745  -1  14  2.741547  hypothetical protein
BBta_3746  -1  18  3.004536  hypothetical protein
BBta_3747  2  16  2.234907  **hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_3729  TYPE3IMRPROT  24  0.049  Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature.

Length = 261

Score = 23.6 bits (51), Expect = 0.049
Identities = 6/26 (23%), Positives = 9/26 (34%)

Query: 5 SLGWLALFRWLPARLAPCFFALAVFG 30
L WL L+ W R+ +
Sbjct: 9 WLSWLNLYFWPLLRVLALISTAPILS 34


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_3736  PF05272  28  0.037  Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.1 bits (62), Expect = 0.037
Identities = 9/17 (52%), Positives = 9/17 (52%)

Query: 38 GPNGAGKSTFFNILSGT 54
G G GKST N L G
Sbjct: 603 GTGGIGKSTLINTLVGL 619


29  BBta_3890  BBta_3910  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
BBta_3890  2  11  -1.148933  hypothetical protein
BBta_3891  2  13  -1.242975  GNAT family acetyltransferase
BBta_3892  0  11  -1.166432  hypothetical protein
BBta_3893  1  12  -1.311759  hypothetical protein
BBta_3894  1  15  -2.126635  sensor histidine kinase
BBta_3895  1  20  -2.355638  hypothetical protein
BBta_3896  3  20  -2.983577  hypothetical protein
BBta_3897  2  20  -3.024723  oxidoreductase NAD(P)-binding subunit
BBta_3899  2  19  -3.436124  hypothetical protein
BBta_3900  1  19  -3.016220  hypothetical protein
BBta_3901  2  19  -2.626043  hypothetical protein
BBta_3902  3  21  -3.272385  hypothetical protein
BBta_3904  2  26  -4.641881  acyltransferase
BBta_3905  2  33  -6.963438  hypothetical protein
BBta_3906  3  30  -6.034853  hypothetical protein
BBta_3907  4  33  -6.713798  hypothetical protein
BBta_3908  4  33  -6.721493  hypothetical protein
BBta_3909  3  31  -6.572619  acyltransferase
BBta_3910  2  9  0.117780  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_3897  DHBDHDRGNASE  85  1e-21  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 85.5 bits (211), Expect = 1e-21
Identities = 67/253 (26%), Positives = 110/253 (43%), Gaps = 16/253 (6%)

Query: 54 LAGRKALITGGDSGMGRAAAIAYAREGADV-AINYYPTEEADAQEVIGLIT----ALPGD 108
+ G+ A ITG G+G A A A +GA + A++Y P + + A P D
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 109 LRDERFCQQLVQQAVEALGGLDIIVSNAARQQARQSILDVSSEDFDATMKTNIYAPFWII 168
+RD ++ + +G +DI+V N A I +S E+++AT N F
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILV-NVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNAS 124

Query: 169 KAALPHLKP--GSAIIGTSSEQAYDPSPDLYDYAQTKAATMNYVKSLAKQLGPKGIRVNA 226
++ ++ +I+ S A P + YA +KAA + + K L +L IR N
Sbjct: 125 RSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNI 184

Query: 227 VAPGPIWTPLQVS------GGATMEK--LEKFGGHTPLGRPGQPAELASIYVQLAAADAS 278
V+PG T +Q S G + K LE F PL + +P+++A + L + A
Sbjct: 185 VSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAG 244

Query: 279 YANGQVYGASGGS 291
+ GG+
Sbjct: 245 HITMHNLCVDGGA 257


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_3899  TCRTETOQM  24  0.019  Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 24.0 bits (52), Expect = 0.019
Identities = 8/32 (25%), Positives = 16/32 (50%)

Query: 3 SADVVSRQLIAVYPELRSQNYARLVTNEEGDE 34
SA++V +Q + +YP + N+ + E
Sbjct: 149 SAEIVIKQKVELYPNMCVTNFTESEQWDTVIE 180


30  BBta_4174  BBta_4181  Y  N  Y  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias  Product
BBta_4174  6  24  -4.402369  acyltransferase
BBta_4176  10  38  -7.678082  *hypothetical protein
BBta_4177  10  39  -6.693477  hypothetical protein
BBta_4178  9  41  -6.274938  hypothetical protein
BBta_4179  8  40  -5.335807  hypothetical protein
BBta_4180  1  17  -1.827184  hypothetical protein
BBta_4181  2  14  -2.345158  hypothetical protein
31  BBta_4311  BBta_4316  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
BBta_4311  2  14  -1.334161  L-glutamine synthetase
BBta_4312  2  13  -0.965393  glutamine synthetase
BBta_4313  2  13  -1.163691  protein tyrosine/serine phosphatase
BBta_4314  3  14  -1.276507  hypothetical protein
BBta_4315  3  13  -1.529286  hypothetical protein
BBta_4316  3  13  -1.531049  sensor histidine kinase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_4316  HTHFIS  76  4e-16  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 76.0 bits (187), Expect = 4e-16
Identities = 32/115 (27%), Positives = 55/115 (47%), Gaps = 3/115 (2%)

Query: 1576 TVLLVDDDARNIFALSSVLERRGMKVLTATTGAEAIDLVQSTPAISIVLMDIMMPQMDGY 1635
T+L+ DDDA L+ L R G V + A + + +V+ D++MP + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGD-GDLVVTDVVMPDENAF 63

Query: 1636 QTIGVIRQNPAFARLPIIALTAKAMKGDREKCLEAGASDYLAKPVNTEQLLLAIR 1690
+ I++ A LP++ ++A+ K E GA DYL KP + +L+ I
Sbjct: 64 DLLPRIKK--ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116



Score = 66.0 bits (161), Expect = 7e-13
Identities = 24/130 (18%), Positives = 50/130 (38%), Gaps = 7/130 (5%)

Query: 1307 TTLLIVEDDPHYARVLIDLARDKGFKILVATRGAEALDLAKQYQPAAISLDVFLPDMLGW 1366
T+L+ +DD VL G+ + + + A + DV +PD +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 1367 TVLSQLK-HNPLTRHIPVQIITLD---EDRQHALARGAFSFVNKPTTTEGVSAALSQIKE 1422
+L ++K P +PV +++ A +GA+ ++ KP + + +
Sbjct: 64 DLLPRIKKARP---DLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 1423 YAQPRRKRLL 1432
+ R +L
Sbjct: 121 EPKRRPSKLE 130



Score = 65.2 bits (159), Expect = 1e-12
Identities = 19/83 (22%), Positives = 38/83 (45%), Gaps = 2/83 (2%)

Query: 1428 RKRLLIVEDNAAEQLSIRQLLDHDDIEILAADTGAGALEALRGTPCDCVVLDLRLPDMSG 1487
+L+ +D+AA + + Q L ++ A + D VV D+ +PD +
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 1488 FDVLDQLRSDETLSGIPVVVFTG 1510
FD+L +++ +PV+V +
Sbjct: 63 FDLLPRIKKAR--PDLPVLVMSA 83


32  BBta_4373  BBta_4380  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
BBta_4373  10  19  -0.108244  single-stranded DNA-binding protein
BBta_4374  4  19  0.182866  outer-membrane protein
BBta_4375  4  18  0.317655  outer-membrane protein
BBta_4376  4  19  0.338124  outer-membrane protein
BBta_4377  3  17  0.108604  outer-membrane protein
BBta_4378  1  15  0.774012  outer-membrane protein
BBta_4379  1  14  0.614175  excinuclease ABC subunit A
BBta_4380  3  13  1.077387  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_4374  OMPADOMAIN  38  3e-05  OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 37.6 bits (87), Expect = 3e-05
Identities = 49/223 (21%), Positives = 76/223 (34%), Gaps = 34/223 (15%)

Query: 22 AADLAARPYTKAPMAAPAPLPTWTGFYIGLQGGGGWGRSDETFFNAPNGFGFAGTQRYDI 81
A +A A +A AP +Y G GW + +T F NG T +
Sbjct: 5 AIAIAVALAGFATVAQAAPKDN--TWYTG--AKLGWSQYHDTGFINNNG----PTHENQL 56

Query: 82 NGGFAGGVIGYNWQVDNIVFGLEGDYHWADINGRSGVITAGLGDSYFTKLRGFGDIKGRL 141
G GG +QV N G E Y W GR G ++ K +G + +L
Sbjct: 57 GAGAFGG-----YQV-NPYVGFEMGYDWL---GRMPY--KGSVENGAYKAQG-VQLTAKL 104

Query: 142 GWAAGPAL-FFVSGGAAVGDLQHRYDNPAFSTIQNDWRWGWTIGAGAEYMFAPNWSAKVE 200
G+ L + G V + + + +D G EY P + ++E
Sbjct: 105 GYPITDDLDIYTRLGGMVWRADTKSNVYGKN---HDTGVSPVFAGGVEYAITPEIATRLE 161

Query: 201 YNYLDFGKSTLQYNNPLVASNRSEWSDTVHTVKAGISYHFGGP 243
Y Q+ N + ++ + G+SY FG
Sbjct: 162 Y----------QWTNNIGDAHTIGTRPDNGMLSLGVSYRFGQG 194


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_4376  OMPADOMAIN  37  5e-05  OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 37.2 bits (86), Expect = 5e-05
Identities = 50/262 (19%), Positives = 73/262 (27%), Gaps = 69/262 (26%)

Query: 1 MKSTLFVTAGVAALGLAPASAADFAARPYGKAPPPAYVTPLPSWAGFYLGANGGADWSRN 60
MK T +A A+ A A P +Y GA G +
Sbjct: 1 MKKTA---IAIAVALAGFATVAQAA----------------PKDNTWYTGAKLGWSQYHD 41

Query: 61 CWTLNRVNGVPVVPTQSEGCHN--ATSGLIGGQIGYRWQAASWVFGLEAQGNWTDLKSSN 118
+N H +G GG +Q + G E +W
Sbjct: 42 TGFINNNGP----------THENQLGAGAFGG-----YQVNPY-VGFEMGYDWL------ 79

Query: 119 ASSAAFAAGITNNTKTDAIGL-FTGQIGYAWGNVL-WYVKGGAAVAHNKYTGTANAAAPV 176
G N A G+ T ++GY + L Y + G V
Sbjct: 80 --GRMPYKGSVENGAYKAQGVQLTAKLGYPITDDLDIYTRLGGMVWRADTKSNVY----- 132

Query: 177 AVGTLLDSASETRWGGTVGTGVEFGFAPNWSVAVEYDHLFMGSRDITFPATAIVNARVDT 236
+T GVE+ P + +EY I +A
Sbjct: 133 ------GKNHDTGVSPVFAGGVEYAITPEIATRLEYQWT-----------NNIGDAHTIG 175

Query: 237 IKQDIDMATVRVNYRFGGPAAA 258
+ D M ++ V+YRFG AA
Sbjct: 176 TRPDNGMLSLGVSYRFGQGEAA 197


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_4377  OMPADOMAIN  43  4e-07  OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 43.0 bits (101), Expect = 4e-07
Identities = 49/245 (20%), Positives = 73/245 (29%), Gaps = 48/245 (19%)

Query: 1 MKKLLVITTALVGIAAAMPASAADLAARPYTKAPPVVVPILSWSGIYAGIQGGGGWGTSK 60
MKK I A+ A A AA + Y G + G
Sbjct: 1 MKKTA-IAIAVALAGFATVAQAAPKD-----------------NTWYTGAKLGWSQ---- 38

Query: 61 ETFIGRFNAPGFLGTQNYNTNGGFVGGVIGYNWQFDNLVVGLEGDYHWSDINGRSAVINA 120
++ GF+ G G +Q N VG E Y W GR +
Sbjct: 39 ------YHDTGFINNNGPTHENQLGAGAFG-GYQV-NPYVGFEMGYDWL---GRMPYKGS 87

Query: 121 GVGDTYFTKLTSFGDIKGRLGYAVGPAL-FFVSGGAAVGELQHRYDRAAGVFFGQNTTRW 179
Y + +LGY + L + G V R D + V+ + T
Sbjct: 88 VENGAYKAQGVQLT---AKLGYPITDDLDIYTRLGGMVW----RADTKSNVYGKNHDTGV 140

Query: 180 GYTVGAGAEYMFAPNWSAKLEYNYLDFGKSTLQYVGVPGRSEWKDSVHTVKAGLNYHFGG 239
G EY P + +LEY + + +G + + G++Y FG
Sbjct: 141 SPVFAGGVEYAITPEIATRLEYQWTN-------NIGDAHTIGTRPDNGMLSLGVSYRFGQ 193

Query: 240 PVIAK 244
A
Sbjct: 194 GEAAP 198


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_4378  OMPADOMAIN  31  0.002  OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 31.4 bits (71), Expect = 0.002
Identities = 45/221 (20%), Positives = 76/221 (34%), Gaps = 39/221 (17%)

Query: 1 MKKILLATVALAALAAPAAAADLAARPTYTKAPVLAPVQTWTGFYIGAF---GGYANEDA 57
MKK +A A A A A YT A + W+ ++ F G +E+
Sbjct: 1 MKKTAIAIAVALAGFATVAQAAPKDNTWYTGAKL-----GWSQYHDTGFINNNGPTHENQ 55

Query: 58 STAALKGGFA-----GGTVGYNWQQGPLVFGLEADAAWADINATVGIPGVFGLTDRIEST 112
A GG+ G +GY+W G + A+ + + +TD ++
Sbjct: 56 LGAGAFGGYQVNPYVGFEMGYDWLGRMPYKGSVENGAYKAQGVQLTAKLGYPITDDLDIY 115

Query: 113 GTVRGRIGYAFDTVLLYGTGGYAWGNNKLSATVGGVAGSETKFLSGWAAGAGVEWMFAPK 172
+ GG W + S G + + GVE+ P+
Sbjct: 116 TRL----------------GGMVWRADTKSNVYGKNHDTGVSPV----FAGGVEYAITPE 155

Query: 173 WSLKGEYLYKSLESSTYFGGAVPLGT-LNLHTFQVGVNYHF 212
+ + EY + + G A +GT + +GV+Y F
Sbjct: 156 IATRLEYQW-----TNNIGDAHTIGTRPDNGMLSLGVSYRF 191


33  BBta_4437  BBta_4444  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
BBta_4437  2  12  1.843112  oxidoreductase
BBta_4438  4  12  2.026175  exodeoxyribonuclease III
BBta_4439  5  13  2.960429  beta-lactamase
BBta_4440  6  15  2.751872  urease accessory protein ureD
BBta_4441  8  18  1.967784  urease accessory protein UreG
BBta_4442  4  17  1.463524  urease subunit alpha
BBta_4443  2  15  1.371610  bifunctional urease subunit gamma/beta
BBta_4444  2  16  1.096031  urease accessory protein UreF
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_4439  BLACTAMASEA  325  e-114  Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 325 bits (834), Expect = e-114
Identities = 128/280 (45%), Positives = 176/280 (62%), Gaps = 6/280 (2%)

Query: 8 QLFPLALLPLFARPARAA----DLQQTIAGIEADCGGRLGVALLDSASGA-LSGHRLDER 62
+ L ++ L A A + I E+ GR+G+ +D ASG L+ R DER
Sbjct: 2 RYIRLCIISLLATLPLAVHASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRADER 61

Query: 63 FPMCSTFKALLAAAILTKVDAGAEQLSRRIPIAQADILSYAPVTKQHVGTSGLSVGELCE 122
FPM STFK +L A+L +VDAG EQL R+I Q D++ Y+PV+++H+ G++VGELC
Sbjct: 62 FPMMSTFKVVLCGAVLARVDAGDEQLERKIHYRQQDLVDYSPVSEKHLAD-GMTVGELCA 120

Query: 123 ATVTLSDNTAANLLLATLDGPAGLTRTIRGFGDAITRLDRIEPGLNESIPGDPRDTTTPA 182
A +T+SDN+AANLLLAT+ GPAGLT +R GD +TRLDR E LNE++PGD RDTTTPA
Sbjct: 121 AAITMSDNSAANLLLATVGGPAGLTAFLRQIGDNVTRLDRWETELNEALPGDARDTTTPA 180

Query: 183 AMAQTLAKLTVANTLSAASRDVMNGWLIGCKTGAAKLRAGLPAEWRVGDKTGAGDHGSSN 242
+MA TL KL + LSA S+ + W++ + +R+ LPA W + DKTGAG+ G+
Sbjct: 181 SMAATLRKLLTSQRLSARSQRQLLQWMVDDRVAGPLIRSVLPAGWFIADKTGAGERGARG 240

Query: 243 DVAVIWPAGRGPVIVTSYLTETKASDDRRNAAHAAVGRAV 282
VA++ P + IV YL +T AS RN A +G A+
Sbjct: 241 IVALLGPNNKAERIVVIYLRDTPASMAERNQQIAGIGAAL 280


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_4442  UREASE  717  0.0  Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 717 bits (1852), Expect = 0.0
Identities = 283/571 (49%), Positives = 378/571 (66%), Gaps = 11/571 (1%)

Query: 3 TLTRRAYAELYGPTKGDLVRLADTSLLAEIEHDYTTYGHELLVGAGKNLRDGEAVAAHRT 62
++R AYA ++GPT GD VRLADT L E+E D+TT+G E+ G GK +RDG + T
Sbjct: 4 RMSRAAYANMFGPTVGDKVRLADTELFIEVEKDFTTHGEEVKFGGGKVIRDGMGQSQ-VT 62

Query: 63 SQHKALDVVVKNATIVDAVIGIVKADIGIRDGRIVGIGKAGNSDVMPDVHPDMVVGHTTA 122
+ A+D V+ NA I+D GIVKADIG++DGRI IGKAGN D+ P V ++VG T
Sbjct: 63 REGGAVDTVITNALILDHW-GIVKADIGLKDGRIAAIGKAGNPDMQPGV--TIIVGPGTE 119

Query: 123 PIAGGPFIVTAGAIESHAHLISPEQSDHALSGGTTTMIGNGSGPVFDVGSGS---GP-NF 178
IAG IVTAG ++SH H I P+Q + AL G T M+G G+GP + + GP +
Sbjct: 120 VIAGEGKIVTAGGMDSHIHFICPQQIEEALMSGLTCMLGGGTGPAHGTLATTCTPGPWHI 179

Query: 179 GHFLKSIEFSPLNYALFGRG-GSNPEAVEEAVAAGGMSVKIHEDFGAAPDVIDKTLIAAD 237
+++ + P+N A G+G S P A+ E V G S+K+HED+G P ID L AD
Sbjct: 180 ARMIEAADAFPMNLAFAGKGNASLPGALVEMVLGGATSLKLHEDWGTTPAAIDCCLSVAD 239

Query: 238 RHDFAVHLHTDSINEYGFCEDTMAAVDGRTIHMYHVEGAGGGHAPDLLKVVSWPNVIPSS 297
+D V +HTD++NE GF EDT+AA+ GRTIH YH EGAGGGHAPD++++ PNVIPSS
Sbjct: 240 EYDVQVMIHTDTLNESGFVEDTIAAIKGRTIHAYHTEGAGGGHAPDIIRICGQPNVIPSS 299

Query: 298 TNPTNPYTSYGMEEGVPMTMICHQLNYNAPEDVMFGEARVRAQSMAAEDFLHDMGAISIF 357
TNPT PYT + E + M M+CH L+ PED+ F E+R+R +++AAED LHD+GA SI
Sbjct: 300 TNPTRPYTVNTLAEHLDMLMVCHHLSPTIPEDIAFAESRIRKETIAAEDILHDIGAFSII 359

Query: 358 GTDTQGMGRLAENVAKCWQLASVMKDRTGRLPEETTARADNERIKRYIAKLTINPAIAVG 417
+D+Q MGR+ E + WQ A MK + GRL EE T DN R+KRYIAK TINPAIA G
Sbjct: 360 SSDSQAMGRVGEVAIRTWQTADKMKRQRGRLKEE-TGDNDNFRVKRYIAKYTINPAIAHG 418

Query: 418 IDHVVGSIEVGKMADLVLWPRASFGLKPYMVIKNGFPVWAAMGDGNGSLGLSEPMIQKRM 477
+ H +GS+EVGK ADLVLW A FG+KP MV+ G A MGD N S+ +P+ + M
Sbjct: 419 LSHEIGSLEVGKRADLVLWNPAFFGVKPDMVLLGGTIAAAPMGDPNASIPTPQPVHYRPM 478

Query: 478 WGALGAAPQRLGVNFMSKLAVDADIRGRLGLGRATVQIKNVR-RLRKTDMIRNAAMPHVE 536
+GA G + V F+S+ ++DA + GRLG+ + V ++N R + K MI N+ PH+E
Sbjct: 479 FGAYGRSRTNSSVTFVSQASLDAGLAGRLGVAKELVAVQNTRGGIGKASMIHNSLTPHIE 538

Query: 537 VDPQTFEVRADGKLLMCPPATTVPLARRFML 567
VDP+T+EVRADG+LL C PAT +P+A+R+ L
Sbjct: 539 VDPETYEVRADGELLTCEPATVLPMAQRYFL 569


34  BBta_4503  BBta_4509  Y  N  N  Genomic Island
LocusTag   DNBias  CDNBias  %GCBias    Product
BBta_4503  2       11       0.932283   carboxymethylenebutenolidase
BBta_4504  1       9        -0.001232  lipid-A-disaccharide synthase
BBta_4505  3       11       -1.181216  hypothetical protein
BBta_4506  3       11       -1.131909  UDP-N-acetylglucosamine acyltransferase
BBta_4507  3       11       -0.690312  (3R)-hydroxymyristoyl-ACP dehydratase
BBta_4508  2       11       -0.146217  UDP-3-O-[3-hydroxymyristoyl] glucosamine
BBta_4509  2       12       -0.267008  surface antigen domain-containing protein
35  BBta_4565  BBta_4571  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BBta_4565  2       11       -0.333557  *peptidase T
BBta_4566  2       13       -0.547225  S-adenosyl-L-methionine (SAM)-dependent
BBta_4567  3       14       -1.447911  hypothetical protein
BBta_4568  4       14       -2.059023  hypothetical protein
BBta_4569  6       16       -2.544372  cyclic nucleotide binding protein
BBta_4570  4       15       -1.089186  *ATP-dependent protease La
BBta_4571  2       12       0.001581   ATP-dependent protease ATP-binding subunit ClpX
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BBta_4570  PF02370  35     3e-04    M protein repeat
		>PF02370#M protein repeat

Length = 168

Score = 35.5 bits (81), Expect = 3e-04
Identities = 25/123 (20%), Positives = 56/123 (45%)

Query: 215 QVEKRIRSRVKRQMEKTQREYYLNEQMKAIQKELGDEDGRDELADLEERINKTKLSKEAR 274
+ + + R+ + + +RE ++++ ++KE ++ R E + ER ++ K +E +
Sbjct: 45 ENDPQYRALMGENQDLRKREGQYQDKIEELEKERKEKQERPERREKFERQHQDKHYQEQQ 104

Query: 275 EKAQHELKKLRQMSPMSAEATVVRNYLDWLLSIPWNKKSKVKKDLEAAQAVLDADHYGLE 334
+K Q E ++L A+ + + L+ KK+LE L +H L+
Sbjct: 105 KKHQQEQQQLEAEKQKLAKEKQISDASRQGLNRDLEASRAAKKELEPKHQKLGTEHQKLK 164

Query: 335 KVK 337
+ K
Sbjct: 165 EEK 167


36  BBta_4692  BBta_4708  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BBta_4692  2       14       -0.360031  SsrA-binding protein
BBta_4693  2       13       -0.538087  large-conductance mechanosensitive channel
BBta_4694  1       10       -0.371760  dihydrodipicolinate synthase
BBta_4695  3       15       -0.460313  lytic transglycosylase
BBta_4696  5       19       -1.686998  porin domain-containing protein
BBta_4698  5       18       -1.456754  porin domain-containing protein
BBta_4699  1       16       -1.007290  hypothetical protein
BBta_4700  2       17       -0.708312  hypothetical protein
BBta_4701  4       17       -0.569533  porin domain-containing protein
BBta_4702  -1      26       -0.448886  *hypothetical protein
BBta_4703  -1      25       -1.577595  TetR family transcriptional regulator
BBta_4704  -3      13       -1.136125  drug resistance transporter
BBta_4705  -1      11       -1.521891  hypothetical protein
BBta_4706  0       12       -0.201950  hypothetical protein
BBta_4707  1       12       -0.551116  hypothetical protein
BBta_4708  2       14       -0.567674  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
BBta_4693  MECHCHANNEL  142    7e-47    Bacterial mechano-sensitive ion channel signature.
		>MECHCHANNEL#Bacterial mechano-sensitive ion channel signature.

Length = 136

Score = 142 bits (359), Expect = 7e-47
Identities = 72/137 (52%), Positives = 97/137 (70%), Gaps = 8/137 (5%)

Query: 1 MLKEFREFAMKGNVVDLAVGVIIGAAFGAIVTSLVSDVIMPIIGAITGGLDFSNYFIPLS 60
++KEFREFAM+GNVVDLAVGVIIGAAFG IV+SLV+D+IMP +G + GG+DF + + L
Sbjct: 3 IIKEFREFAMRGNVVDLAVGVIIGAAFGKIVSSLVADIIMPPLGLLIGGIDFKQFAVTLR 62

Query: 61 KSVTAGNLADAKKQGAVLAYGQFLTLTLNFFIIAFVLFLIIRGMNRLKRLQETQPAAAPK 120
A V+ YG F+ +F I+AF +F+ I+ +N+L R +E +PAAAP
Sbjct: 63 -------DAQGDIPAVVMHYGVFIQNVFDFLIVAFAIFMAIKLINKLNRKKE-EPAAAPA 114

Query: 121 PSREVELLTEIRDLLKK 137
P++E LLTEIRDLLK+
Sbjct: 115 PTKEEVLLTEIRDLLKE 131


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_4694  CARBMTKINASE  29     0.019    Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 29.0 bits (65), Expect = 0.019
Identities = 19/65 (29%), Positives = 27/65 (41%), Gaps = 11/65 (16%)

Query: 68 IAEAKGRVPVIAGAGSNSTREAV-----ELAEHAEKAGADAVLVVTP------YYNKPTQ 116
IA G VPVI G EAV + AE+ AD +++T YY +
Sbjct: 190 IASGGGGVPVILEDGEIKGVEAVIDKDLAGEKLAEEVNADIFMILTDVNGAALYYGTEKE 249

Query: 117 EGLYQ 121
+ L +
Sbjct: 250 QWLRE 254


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
BBta_4695  IGASERPTASE  30     0.043    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 30.0 bits (67), Expect = 0.043
Identities = 17/100 (17%), Positives = 33/100 (33%), Gaps = 5/100 (5%)

Query: 60 TAAKEGAESKGKADARAEAKPDSKSGPKGASEAPRAAAPRDIMPGTAGLPAPRQHAALPA 119
T KE A + + A+ E + + + +P+ + P PA +
Sbjct: 1098 TETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQP--QAEPARENDPTVNI 1155

Query: 120 ARKPVVPAAVAATSSTSQSDKDALENVIELVRKRKPDDAT 159
+ T+ T Q K+ NV + V + +
Sbjct: 1156 KEP---QSQTNTTADTEQPAKETSSNVEQPVTESTTVNTG 1192


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BBta_4703  HTHTETR  42     2e-07    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 42.3 bits (99), Expect = 2e-07
Identities = 20/77 (25%), Positives = 35/77 (45%), Gaps = 2/77 (2%)

Query: 3 GYDAASLNEIAGMVGIRKASLYSHVASKDELFLLVLEDAARIERDFVMATLAAPPSAGEP 62
G + SL EIA G+ + ++Y H K +LF + E + + + A P G+P
Sbjct: 28 GVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELEYQAKFP--GDP 85

Query: 63 GGAYIEVLAARYDASVH 79
E+L +++V
Sbjct: 86 LSVLREILIHVLESTVT 102


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BBta_4704  TCRTETB  59     1e-11    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 58.7 bits (142), Expect = 1e-11
Identities = 39/160 (24%), Positives = 69/160 (43%), Gaps = 1/160 (0%)

Query: 9 VAVALGLITLLAPVSVDMYLPSLPVMAEEMNTTYPAMQLTLMVFLLAMGAGQIVFGPVID 68
+ + L +++ + ++ + SLP +A + N + F+L G V+G + D
Sbjct: 15 ILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSD 74

Query: 69 AYGRRRPLLVALAIFVVASLCAAVAHS-VEALLLARLLQGLAASLAIVTAMSTVRDIASG 127
G +R LL + I S+ V HS L++AR +QG A+ M V
Sbjct: 75 QLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPK 134

Query: 128 VAAVQIFALLMTIQGLGPVMAPVLGGMIGAGLGWRAVFYF 167
+ F L+ +I +G + P +GGMI + W +
Sbjct: 135 ENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLI 174


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_4706  ISCHRISMTASE  25     0.038    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 25.4 bits (55), Expect = 0.038
Identities = 8/38 (21%), Positives = 13/38 (34%)

Query: 35 DGIVDLIAAHKWFNLAALKGRVDAVAMRREVAEQMSDA 72
D + D L GR M + +Q+ +A
Sbjct: 176 DAVADFSLEKHQMALEYAAGRCAFTVMTDSLLDQLQNA 213


37  BBta_4809  BBta_4814  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BBta_4809  4       13       1.856196   oxidoreductase subunit
BBta_4810  5       12       1.856554   iron-sulfur-binding protein subunit of an
BBta_4811  3       12       2.103905   oxidoreductase subunit
BBta_4812  4       14       1.925127   hypothetical protein
BBta_4813  3       14       1.772178   hypothetical protein
BBta_4814  3       13       1.619013   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
BBta_4814  IGASERPTASE  48     2e-07    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 48.1 bits (114), Expect = 2e-07
Identities = 53/330 (16%), Positives = 119/330 (36%), Gaps = 37/330 (11%)

Query: 195 PSNRFDAEGLTRLMEDIESKRKLRNDIEQDSMIKIRTRNLEAEREALDIERESETARLDQ 254
P+ +E + E+ + + K EQD+ A+ +++ ++T + Q
Sbjct: 1028 PAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQ 1087

Query: 255 ERDIEMRRALQRTEVARERALRETEAEQAQIMARETIEKARIANELTIAEARIAAERDTR 314
T+ + +ET + + EKA++ E T ++ ++ +
Sbjct: 1088 SG--------SETKETQTTETKETATVEKE-------EKAKVETEKTQEVPKVTSQVSPK 1132

Query: 315 QR--EIERTRAVEERELLAREDIEKARIANQR--AIDAARIESEREVRQRDIERTRTIEE 370
Q E + +A RE +I++ + + E+ V Q E T
Sbjct: 1133 QEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTG 1192

Query: 371 AEIAAREAVEKARIQQDRVVSDARIANDEETRRREIERTRAIEQAEIAAREATEKARIAQ 430
+ Q V S++ +N + R R R+ E A + +++ +A
Sbjct: 1193 NSVVENPENTTPATTQPTVNSES--SNKPKNRHRRSVRSVP-HNVEPATTSSNDRSTVAL 1249

Query: 431 TMIVNLERIS--SDERTRATEIA--------QVRAIQEAEIEAQQAVEAARIAREQALSA 480
+ + + SD R +A +A Q + E E Q V + + + S+
Sbjct: 1250 CDLTSTNTNAVLSDARAKAQFVALNVGKAVSQHISQLEMNNEGQYNVWVSNTSMNKNYSS 1309

Query: 481 E---RIAAEHATRKLEIER--NQGIEIAGI 505
R +++ +L ++ + +++ G+
Sbjct: 1310 SQYRRFSSKSTQTQLGWDQTISNNVQLGGV 1339



Score = 47.4 bits (112), Expect = 3e-07
Identities = 45/281 (16%), Positives = 82/281 (29%), Gaps = 22/281 (7%)

Query: 394 RIANDEETRRREIERTRAIEQAEIAAREAT-------EKARIAQTMIVNLERISSDERTR 446
+ N E +R + T I + E AR+ + + + E T
Sbjct: 979 DLYNPEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTE 1038

Query: 447 ATEIAQVRAIQEAEIEAQQAVEAARIAREQALSAERIAAEHATRKLEIERNQGIEIAGIA 506
+ + E Q A E RE A A+ + T+ E+ + G E
Sbjct: 1039 TVAENSKQESKTVEKNEQDATETTAQNREVAKEAKS-NVKANTQTNEVAQ-SGSETKETQ 1096

Query: 507 AREATEASRIAQEERVRALEIARIRTIEEVDIASREAIEAARIAQELAVAARRIESEKTT 566
E E + + +EE+ + +T E + S+ + + + A E++ T
Sbjct: 1097 TTETKETATVEKEEKAKVE---TEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTV 1153

Query: 567 RSLEIERTEAVEAADLKRREAVERRRIEVELALEAERIASSRTREVLNIDQKKAIELADE 626
E + A + + VE + ++ V N +
Sbjct: 1154 NIKEPQSQTNTTADT---EQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPAT-TQP 1209

Query: 627 VRVIELAAKRAERIDADRQVKEAEIIARKQVETADVSREQA 667
E + K R + VE A S
Sbjct: 1210 TVNSESSNKPKNRHRRSVRSVPH------NVEPATTSSNDR 1244


38  BBta_4860  BBta_4868  Y  N  N  Genomic Island
LocusTag   DNBias  CDNBias  %GCBias    Product
BBta_4860  3       14       1.996135   luciferase-like protein alkanesulfonate
BBta_4861  5       12       2.024928   ABC transporter permease
BBta_4862  3       12       1.605556   ABC transporter permease
BBta_4863  4       11       0.751226   ABC transporter ATP-binding protein
BBta_4864  3       14       0.456144   oligopeptide ABC transporter ATP-binding
BBta_4865  3       22       -1.418275  hypothetical protein
BBta_4866  1       12       2.648691   hypothetical protein
BBta_4867  1       13       2.411529   hypothetical protein
BBta_4868  3       13       2.277802   hypothetical protein
39  BBta_4932  BBta_5006  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BBta_4932  5       18       0.411715   deoxyribodipyrimidine photo-lyase type I
BBta_4933  4       23       -2.539133  histone-like DNA-binding protein
BBta_4934  3       23       -1.807317  hypothetical protein
BBta_4935  4       25       -2.802401  phage-like integrase
BBta_4937  5       30       -5.281422  hypothetical protein
BBta_4938  5       30       -4.822667  hypothetical protein
BBta_4941  5       26       -2.808837  hypothetical protein
BBta_4942  6       27       -1.823784  DNA-invertase
BBta_4943  5       26       -1.929753  hypothetical protein
BBta_4944  4       25       -0.613207  hypothetical protein
BBta_4945  6       26       -2.488693  hypothetical protein
BBta_4946  5       25       -3.319075  hypothetical protein
BBta_4947  6       31       -5.568478  hypothetical protein
BBta_4948  6       34       -5.734955  hypothetical protein
BBta_4949  6       38       -7.005126  hypothetical protein
BBta_4950  6       38       -6.825802  hypothetical protein
BBta_4951  2       30       -3.091382  *prophage integrase
BBta_4952  1       34       -1.991248  hypothetical protein
BBta_4953  0       29       -1.501019  hypothetical protein
BBta_4954  -1      28       -1.758903  hypothetical protein
BBta_4956  3       33       -4.760350  hypothetical protein
BBta_4957  5       35       -6.045270  hypothetical protein
BBta_4958  6       35       -6.359338  protein-L-isoaspartate(D-aspartate)
BBta_4959  6       37       -6.589148  hypothetical protein
BBta_4960  6       39       -7.168763  insertion sequence transposase protein
BBta_4962  7       39       -7.215110  hypothetical protein
BBta_4963  6       39       -6.344492  hypothetical protein
BBta_4964  5       37       -5.277219  hypothetical protein
BBta_4965  4       32       -5.121868  hypothetical protein
BBta_4966  2       29       -5.343880  hypothetical protein
BBta_4969  2       25       -3.937957  hypothetical protein
BBta_4972  2       24       -4.375659  arylsulfatase regulatory protein
BBta_4973  4       27       -4.386405  hypothetical protein
BBta_4975  4       31       -4.938351  integrase core subunit
BBta_4977  5       40       -7.789471  hypothetical protein
BBta_4978  3       27       -5.124010  IS66 family insertion sequence transposase
BBta_4979  3       29       -5.930373  hypothetical protein
BBta_4980  2       26       -5.319220  transposase
BBta_4981  2       25       -5.367984  hypothetical protein
BBta_4983  1       21       -4.137412  hypothetical protein
BBta_4984  1       16       -1.538841  transposase
BBta_4987  0       26       -3.716531  insertion sequence transposase protein
BBta_4988  1       24       -4.790665  acyltransferase family 3 protein
BBta_4989  3       27       -5.027559  hypothetical protein
BBta_4990  2       25       -4.257116  hypothetical protein
BBta_4991  1       24       -3.342232  hypothetical protein
BBta_4992  1       23       -1.907243  hypothetical protein
BBta_4993  2       22       -1.876230  hypothetical protein
BBta_4994  2       23       -0.978560  hypothetical protein
BBta_4995  -2      12       0.529455   hypothetical protein
BBta_4996  -1      11       1.183772   hypothetical protein
BBta_4997  -1      11       1.290493   hypothetical protein
BBta_4998  -1      11       1.067247   glyoxalase
BBta_4999  -1      12       1.360295   hypothetical protein
BBta_5000  -2      14       1.118909   multi-sensor signal transduction histidine
BBta_5001  2       17       0.827068   hypothetical protein
BBta_5002  1       17       0.388876   dihydrodipicolinate synthase
BBta_5003  1       15       -0.259717  short-chain dehydrogenase/reductase (SDR)
BBta_5004  2       13       -1.294446  dehydratase
BBta_5005  1       13       -1.372177  hypothetical protein
BBta_5006  2       13       -0.905823  acetyl-CoA acetyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BBta_4941  PF05272  216    2e-61    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 216 bits (550), Expect = 2e-61
Identities = 98/386 (25%), Positives = 151/386 (39%), Gaps = 37/386 (9%)

Query: 356 QQLAGELSDDAIIALRRIASDAFGANFGS-EVMLDAVRSIAIDHQYDPVCDMLDEAEAAW 414
++ G L D ++ L +G S + A+ A ++ P D + W
Sbjct: 487 RKAPGPLEDADVLRLADYVETTYGTGEASAQTTEQAINVAADMNRVHPFRDWVKAQ--QW 544

Query: 415 DGEPRLDRMAVDYLNAEDTLVN-------RAFIRKTMIAAVRRARHPGCKFDNITVLESP 467
D PRL++ V L + + ++ V R PGCKFD VLE
Sbjct: 545 DEVPRLEKWLVHVLGKTPDDYKPRRLRYLQLVGKYILMGHVARVMEPGCKFDYSVVLEGT 604

Query: 468 EGWNKSGFWRVIAGDQFYSDESIIGRQSREIQEQLSTVWIHENAELAGMKKQEVETVKAY 527
G KS + G F+SD ++ EQ++ + +E +E+ ++ + E VKA+
Sbjct: 605 GGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQIAGIVAYELSEMTAFRRADAEAVKAF 664

Query: 528 ASRQEDIARPAYGRVVKRQPRHSIDVGTTNADTYLQSQTGNRRFWPIKVLAPIDLDKLKR 587
S ++D R AYGR V+ PR + TTN YL TGNRRFWP+ V +L L++
Sbjct: 665 FSSRKDRYRGAYGRYVQDHPRQVVIWCTTNKRQYLFDITGNRRFWPVLVPGRANLVWLQK 724

Query: 588 DRIQLLGEAAAYEKRGEAITLDP----SLWSAAGVEQDARRIRDPWEDILDDMPTHAYID 643
R QL EA GE P + EQ+ R + +
Sbjct: 725 FRGQLFAEALHLYLAGERYFPSPEDEEIYFRP---EQELRLVETGVQG------------ 769

Query: 644 QNGHAEHIRVRPGEQEPVGAVRIIHVVGDRQCVATATLLQHVLQVPRERQTQTITMRLST 703
+ R G GA + + V T L L + + + ++
Sbjct: 770 ---RLWALLTREGAPAAEGAAQKGYSVNTTFV--TIADLVQALGADPGKSSPMLEGQVRD 824

Query: 704 TMKQVGWERHHDKLVIDGQRVRGYWR 729
+ + GWE + GQR RGY R
Sbjct: 825 WLNENGWEYLRET---SGQRRRGYMR 847


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BBta_4943  PF05272  28     0.042    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.1 bits (62), Expect = 0.042
Identities = 19/89 (21%), Positives = 35/89 (39%), Gaps = 23/89 (25%)

Query: 15 VMDILQSLGMDVSKWADMKGGATRAASNPKYCYNWSFLQPGEFVVACLWYESLKQRSGEL 74
+ D++Q+LG D K + M G R N +E L++ SG+
Sbjct: 800 IADLVQALGADPGKSSPMLEGQVRDWLNE------------------NGWEYLRETSGQ- 840

Query: 75 YYEINRGKWIKPRTEPGTGSSNKRANDFD 103
R +++P+ P + +K A+
Sbjct: 841 ----RRRGYMRPQVWPPVIAEDKEADQAH 865


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_4957  FIMREGULATRY  29     0.006    Escherichia coli: P pili regulatory PapB protein si...
		>FIMREGULATRY#Escherichia coli: P pili regulatory PapB protein signature.

Length = 104

Score = 28.8 bits (64), Expect = 0.006
Identities = 22/83 (26%), Positives = 34/83 (40%), Gaps = 8/83 (9%)

Query: 140 GQHSVPSDPVDELADTERVAVAVAGL--AEVFRLFAG------DRVVLQIIDGLANGLTA 191
H V S + R +V + G F L G DRV+L + D L G +
Sbjct: 2 AHHEVISRSGNAFLLNIRESVLLPGSMSEMHFFLLIGISSIHSDRVILAMKDYLVGGHSR 61

Query: 192 REICRAYGISAIDYDTARRRMRR 214
+E+C Y ++ + T R+ R
Sbjct: 62 KEVCEKYQMNNGYFSTTLGRLIR 84


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
BBta_4964  V8PROTEASE  47     7e-08    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 46.9 bits (111), Expect = 7e-08
Identities = 42/244 (17%), Positives = 82/244 (33%), Gaps = 57/244 (23%)

Query: 143 RTVCRIDYADLSPPGFGTGFLVGPDLVLTNWHVVERVERAPENVRHDVANQLRFRFDLLE 202
V I + +G +VG D +LTN HVV+ H + L+ +
Sbjct: 88 APVTYIQVEAPTGTFIASGVVVGKDTLLTNKHVVDA--------THGDPHALKAFPSAIN 139

Query: 203 RAAAEDGRGRVALAQVSDGSPLLRTSPAGGMEVRGRSGEPSMTELDYALIRLTEDVGN-- 260
+ +G ++ SGE + + ++ + +G
Sbjct: 140 QDNYPNGGFTAE-------------------QITKYSGEGDLAIVKFSPNEQNKHIGEVV 180

Query: 261 DPV-----ASTALGETRGYIQLRPNMPLPTVNSALMALQHPMRGELQFAIGIALGPNQTG 315
P A T + + + P+ T +G++ + G A+
Sbjct: 181 KPATMSNNAETQVNQNITVTGYPGDKPVAT--------MWESKGKITYLKGEAM------ 226

Query: 316 SRAKHTVATQEGSSGSPVLDEYLSPIAMHNGTRFGTARERQAYNTAVPLSHIVADLRNSG 375
++ ++T G+SGSPV +E I +H +N AV ++ V +
Sbjct: 227 ---QYDLSTTGGNSGSPVFNEKNEVIGIH------WGGVPNEFNGAVFINENVRNFLKQN 277

Query: 376 ITEM 379
I ++
Sbjct: 278 IEDI 281


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_5003  DHBDHDRGNASE  87     3e-22    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 87.0 bits (215), Expect = 3e-22
Identities = 74/259 (28%), Positives = 108/259 (41%), Gaps = 27/259 (10%)

Query: 4 LDGKVALITGAGGGLGEAYARLFAREGAAVVVNDLGGPRDGSGSDLSMAGQVAAAITAEG 63
++GK+A ITGA G+GEA AR A +GA + D + +V +++ AE
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEK---------LEKVVSSLKAEA 56

Query: 64 GRAVANGADISTMAGGQSVFDDAIRHFGRADILVNNAGILRDQTFAKSSEADWDKVIQVH 123
A A AD+ A + R G DILVN AG+LR S+ +W+ V+
Sbjct: 57 RHAEAFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVN 116

Query: 124 LKGTFCCTLPVFRWMRDNGGGVIVNTSSTSGLIGNFGQSNYGAAKGGIWGLSNVLAVEGR 183
G F + V ++M D G IV S + + Y ++K + L +E
Sbjct: 117 STGVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELA 176

Query: 184 KYNIR----------------VWTLAPGALTRMTADLPRYKENP--GAALTPEGIAPAVL 225
+YNIR +W GA + L +K P IA AVL
Sbjct: 177 EYNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVL 236

Query: 226 YMVSHLSGDQTGKVLGVSG 244
++VS +G T L V G
Sbjct: 237 FLVSGQAGHITMHNLCVDG 255


40  BBta_5154  BBta_5167  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BBta_5154  2       14       0.633554   branched chain amino acid ABC transporter
BBta_5155  4       13       2.383745   *rRNA methylase
BBta_5156  5       15       1.911285   hypothetical protein
BBta_5157  0       15       2.011781   hypothetical protein
BBta_5159  -2      15       1.849929   hypothetical protein
BBta_5160  -1      16       1.542968   *hypothetical protein
BBta_5161  5       17       0.332070   hypothetical protein
BBta_5162  7       24       -1.034898  TetR family transcriptional regulator
BBta_5163  10      26       -1.596534  hypothetical protein
BBta_5165  13      28       -2.305866  hypothetical protein
BBta_5166  12      24       -2.256182  hypothetical protein
BBta_5167  7       17       -1.189927  outer-membrane immunogenic protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BBta_5162  HTHTETR  48     2e-09    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 48.5 bits (115), Expect = 2e-09
Identities = 14/68 (20%), Positives = 33/68 (48%)

Query: 6 RSERSRKAALTAALTIIARDGPGRLTLDAIARESGLSKGGLMHQFPNKEAVLKALLEQQF 65
++ +R+ L AL + ++ G +L IA+ +G+++G + F +K + + E
Sbjct: 8 EAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSE 67

Query: 66 AHFDEFAR 73
++ E
Sbjct: 68 SNIGELEL 75


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
BBta_5167  OMPADOMAIN  56     2e-11    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 56.1 bits (135), Expect = 2e-11
Identities = 52/196 (26%), Positives = 73/196 (37%), Gaps = 34/196 (17%)

Query: 46 FYIGGHLGGAFAGNNSLEGNNGR-----FLGGVQGGFDYQFAPNWVLGIEAQYSWMDRNT 100
+Y G LG + + NNG G GG YQ P +G E Y W+ R
Sbjct: 28 WYTGAKLGWSQYHDTGFINNNGPTHENQLGAGAFGG--YQVNPY--VGFEMGYDWLGRMP 83

Query: 101 TNFGFPGATVVSSTGANQLGSVTGRLGYTWGPAL-LYAK-GGYAWRDGNALGVNVAGVPA 158
+V + Q +T +LGY L +Y + GG WR NV G
Sbjct: 84 YK-----GSVENGAYKAQGVQLTAKLGYPITDDLDIYTRLGGMVWR--ADTKSNVYG--- 133

Query: 159 GFTTSGNTKDGYTVGGGLEYMFAPNWSAKAEYQY-YNFGNTTFTGGPAPVVGSRFNSDEH 217
+ +T GG+EY P + + EYQ+ N G+ G D
Sbjct: 134 ---KNHDTGVSPVFAGGVEYAITPEIATRLEYQWTNNIGDAHTIG---------TRPDNG 181

Query: 218 TVKVGVNYRFGWSGPT 233
+ +GV+YRFG
Sbjct: 182 MLSLGVSYRFGQGEAA 197


41  BBta_5219  BBta_5242  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BBta_5219  2       14       1.423255   hypothetical protein
BBta_5220  2       17       1.123821   hypothetical protein
BBta_5221  3       15       0.352553   carboxymethylenebutenolidase
BBta_5222  3       17       0.609015   hypothetical protein
BBta_5223  3       17       0.451982   hypothetical protein
BBta_5224  1       16       -0.217543  hypothetical protein
BBta_5225  1       13       -1.796787  hypothetical protein
BBta_5226  0       11       -1.084305  hypothetical protein
BBta_5227  0       11       -0.233202  hypothetical protein
BBta_5228  0       12       0.827470   hypothetical protein
BBta_5230  -1      13       0.793958   hypothetical protein
BBta_5231  1       10       2.384520   serine O-acetyltransferase
BBta_5233  1       11       3.135687   hypothetical protein
BBta_5234  -1      11       3.191206   hydrolase
BBta_5235  -2      11       2.399139   hypothetical protein
BBta_5236  -2      13       0.717274   hypothetical protein
BBta_5237  -1      14       0.431380   salicylate 1-monooxygenase
BBta_5238  -1      17       -1.294446  hypothetical protein
BBta_5240  -1      19       -2.296904  *amidase
BBta_5241  0       23       -3.515672  ABC transporter permease
BBta_5242  1       25       -3.320416  ABC transporter permease
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
BBta_5220  RTXTOXIND  29     0.013    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 28.6 bits (64), Expect = 0.013
Identities = 13/94 (13%), Positives = 28/94 (29%), Gaps = 6/94 (6%)

Query: 55 ETERAVALTKQIQARTVQASEQLVEKTKGLEASQQESIDQLQSMQEELQSMRRLLAAQQA 114
T + K++ +A + A + + + L LL Q
Sbjct: 196 STWQNQKYQKELNLDKKRAERL------TVLARINRYENLSRVEKSRLDDFSSLLHKQAI 249

Query: 115 DTKRLTDQVSTLKDTVDGLRQSFASTQASEQASA 148
+ +Q + + V+ LR + + E
Sbjct: 250 AKHAVLEQENKYVEAVNELRVYKSQLEQIESEIL 283


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_5223  OUTRMMBRANEA  26     0.032    Outer membrane protein A signature.
		>OUTRMMBRANEA#Outer membrane protein A signature.

Length = 346

Score = 25.7 bits (56), Expect = 0.032
Identities = 16/63 (25%), Positives = 21/63 (33%), Gaps = 5/63 (7%)

Query: 25 GFHHFHHGGFYHGFGPGF--ALGFGFGSPYYYGPTYYYPYTYEDLGGCYV---ARKRVRT 79
G+ +H GF + GP LG G Y P + Y+ LG
Sbjct: 35 GWSQYHDTGFINNNGPTHENQLGAGAFGGYQVNPYVGFEMGYDWLGRMPYKGSVENGAYK 94

Query: 80 AHG 82
A G
Sbjct: 95 AQG 97


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BBta_5224  cloacin  30     0.003    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 30.5 bits (68), Expect = 0.003
Identities = 30/106 (28%), Positives = 40/106 (37%), Gaps = 9/106 (8%)

Query: 27 GGGGGGAGGGASAGGAASGGGAGGGAVSGTTTSGS---------AAGAAAGTAGGANAAT 77
GG G G GA + GG G V G + GS G+ +G G +
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 78 TGIGASGIGTNGASSGAALSQPGAPGTNGTNSLGTAQSSGSGGSGS 123
G +G G+ +G LS AP G +L T + G S S
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSIS 108



Score = 30.5 bits (68), Expect = 0.003
Identities = 28/79 (35%), Positives = 32/79 (40%), Gaps = 3/79 (3%)

Query: 25 GGGGGGGGAGGGASAGGAASGGGAGGGAVSGTTTSGSAAGAAAGTAGGANAATTGIGASG 84
G GGG G G S+ GGG+G G G GS G G + TG S
Sbjct: 27 LGVGGGASDGSGWSSENNPWGGGSGSGIHWG---GGSGHGNGGGNGNSGGGSGTGGNLSA 83

Query: 85 IGTNGASSGAALSQPGAPG 103
+ A ALS PGA G
Sbjct: 84 VAAPVAFGFPALSTPGAGG 102



Score = 29.3 bits (65), Expect = 0.008
Identities = 25/74 (33%), Positives = 30/74 (40%)

Query: 26 GGGGGGGAGGGASAGGAASGGGAGGGAVSGTTTSGSAAGAAAGTAGGANAATTGIGASGI 85
G GGG+G G G SGGG+G G + A G A + GA I A +
Sbjct: 53 GIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGAL 112

Query: 86 GTNGASSGAALSQP 99
A AAL P
Sbjct: 113 SAAIADIMAALKGP 126


42  BBta_5254  BBta_5265  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BBta_5254  2       16       1.035572   ABC transporter substrate-binding protein
BBta_5255  4       16       1.684976   carboxypeptidase
BBta_5256  3       17       2.139153   ABC transporter permease
BBta_5257  3       16       2.371147   ABC transporter permease
BBta_5258  2       15       2.763732   ABC transporter ATP-binding protein
BBta_5259  1       14       2.394639   hypothetical protein
BBta_5260  0       13       1.719837   amidohydrolase family protein
BBta_5261  -1      12       1.164571   amidase
BBta_5262  1       10       0.384686   enoyl-CoA hydratase
BBta_5263  2       10       0.507963   oxidoreductase
BBta_5264  3       11       0.209632   hypothetical protein
BBta_5265  2       12       0.374751   glutathione transferase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
BBta_5258  HTHFIS  33     0.004    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 32.9 bits (75), Expect = 0.004
Identities = 20/139 (14%), Positives = 45/139 (32%), Gaps = 23/139 (16%)

Query: 198 LSLIRELQ-RDHGTAVLFITHDMGVVAEIADRVSVMRQGRLVETGPLDKILRSPEMDYTR 256
L+ ++ VL ++ + +G D + + ++
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQ----NTFMTAIKASEKG------AYDYLPKPFDLTELI 112

Query: 257 NLLAAVPSLIPRPPRPETSEPVVLETNELGKVYRERSFIGKAREVVAA-QNVTLTLRKGR 315
++ + R P + +G++ + + + ++
Sbjct: 113 GIIGRALAEPKRRPSKLEDDS-----------QDGMPLVGRSAAMQEIYRVLARLMQTDL 161

Query: 316 TLGIVGESGSGKSTVARCI 334
TL I GESG+GK VAR +
Sbjct: 162 TLMITGESGTGKELVARAL 180


43  BBta_5363  BBta_5381  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BBta_5363  2       12       2.519355   hypothetical protein
BBta_5364  1       11       1.981682   amidohydrolase family protein
BBta_5365  3       14       2.663689   hypothetical protein
BBta_5366  3       13       2.758774   inner membrane transport protein
BBta_5367  2       16       2.136388   alcohol dehydrogenase
BBta_5368  3       12       1.888524   hypothetical protein
BBta_5369  2       10       3.002026   hypothetical protein
BBta_5371  1       10       3.528270   hypothetical protein
BBta_5372  2       10       3.329284   hypothetical protein
BBta_5373  2       9        3.054688   nuclease-like protein
BBta_5374  2       8        3.438606   D-alanyl-alanine synthetase A
BBta_5375  3       9        3.394225   HlyD family secretion protein
BBta_5376  2       10       3.280973   hypothetical protein
BBta_5377  2       11       3.277619   ABC transporter ATP-binding protein
BBta_5378  3       12       3.964152   hypothetical protein
BBta_5379  4       12       3.758965   esterase
BBta_5380  1       15       2.751872   hypothetical protein
BBta_5381  1       14       3.114649   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
BBta_5375  RTXTOXIND  43     1e-06    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 43.3 bits (102), Expect = 1e-06
Identities = 26/195 (13%), Positives = 53/195 (27%), Gaps = 32/195 (16%)

Query: 1 MRPVLRRVVIA--SIALAVAAAVGWMAMPGPIPVETAVVTKGRFVATVEDDGKTRVRQRY 58
PV RR + I + A + VE G+ +
Sbjct: 50 ETPVSRRPRLVAYFIMGFLVIAFILSVL---GQVEIVATANGKLTH---------SGRSK 97

Query: 59 VVAAPLAGRLGRLRFKAGDQVKADDVVATITPAPAPLLDPRARREVEERLGAAEANVERA 118
+ + + K G+ V+ DV+ +T L A ++++ +A
Sbjct: 98 EIKPIENSIVKEIIVKEGESVRKGDVLLKLTA-----LGAEA------DTLKTQSSLLQA 146

Query: 119 KAVVARAQAQSAQAAADLARTMALAASGAATIQAKERAELTMRVAERDLRAAEYVDHAAQ 178
+ R Q S + L + + L ++ Q
Sbjct: 147 RLEQTRYQILSRSIELN-----KLPELKLPDEPYFQNVSEEEVLRLTSLIKEQF--STWQ 199

Query: 179 HELGQLRAMLARYGS 193
++ Q L + +
Sbjct: 200 NQKYQKELNLDKKRA 214



Score = 37.5 bits (87), Expect = 7e-05
Identities = 32/166 (19%), Positives = 55/166 (33%), Gaps = 18/166 (10%)

Query: 125 AQAQSAQAAADLARTMALAASGAATIQAKERAELTMRVAERDLRAAEYVDHA-AQHELGQ 183
+ + +A +L + A+ ++ + + +G
Sbjct: 257 QENKYVEAVNELRVY---KSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGL 313

Query: 184 LRAMLARYGSNTDADVPADSWNVTAPVGGLV--LKVTQESETMVQPGTALLDL-GDPHDL 240
L LA+ A V + APV V LKV E +V L+ + + L
Sbjct: 314 LTLELAKNEERQQASV------IRAPVSVKVQQLKVHTEG-GVVTTAETLMVIVPEDDTL 366

Query: 241 EVVVDVLSTDAVEIAPGADVTID----RWGGAGRLPGRVRRVEPAA 282
EV V + D I G + I + G L G+V+ + A
Sbjct: 367 EVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDA 412


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BBta_5377  PF05272  32     0.003    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 31.6 bits (71), Expect = 0.003
Identities = 15/22 (68%), Positives = 16/22 (72%)

Query: 56 VVLLGPSGSGKSTLLNILGGLD 77
VVL G G GKSTL+N L GLD
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLD 620


44  BBta_5521  BBta_5539  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BBta_5521  3       15       -3.123594  chemotactic signal-response protein CheL
BBta_5522  6       16       -2.706397  hypothetical protein
BBta_5524  5       15       -3.424801  hypothetical protein
BBta_5525  7       17       -2.962747  hypothetical protein
BBta_5526  5       15       -1.430938  flagellar biosynthesis regulatory protein FlaF
BBta_5527  3       13       -0.588458  flagellin protein, C-terminus
BBta_5528  2       13       0.665530   flagellin protein, C-terminus
BBta_5529  0       10       0.910226   flagellar biosynthesis repressor FlbT
BBta_5530  0       10       1.152887   flagellin protein, C-terminus
BBta_5531  -1      9        1.153264   pyridoxal-dependent decarboxylase
BBta_5532  0       11       0.464428   hypothetical protein
BBta_5533  2       12       0.407533   hypothetical protein
BBta_5534  2       14       0.655850   hypothetical protein
BBta_5535  2       14       0.989066   hypothetical protein
BBta_5536  2       14       0.919585   flagellar hook-associated protein FlgK
BBta_5537  1       14       1.694573   flagellar hook protein flgE
BBta_5538  0       13       3.156502   methionine sulfoxide reductase B
BBta_5539  1       13       3.371396   FAD dependent oxidoreductase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BBta_5521  FLGFLGJ  35     3e-05    Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 35.5 bits (81), Expect = 3e-05
Identities = 23/97 (23%), Positives = 41/97 (42%), Gaps = 5/97 (5%)

Query: 33 ADALTKVSPKA----QAKAKATATDFEAMFLNSMFAQMTSGVKGDGPFGDTPSTGVWRSM 88
A +L ++ KA A + A E MF+ M M + DG F + T ++ SM
Sbjct: 15 AQSLNELKAKAGEDPAANIRPVARQVEGMFVQMMLKSMRDALPKDGLF-SSEHTRLYTSM 73

Query: 89 LMEQYSKNFAKAGGVGLSNDVFRTLILQQAKSSGSGA 125
+Q ++ G+GL+ + + + +Q S
Sbjct: 74 YDQQIAQQMTAGKGLGLAEMMVKQMTPEQPLPEESTP 110


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_5527  FLAGELLIN  56  2e-10  Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 56.2 bits (135), Expect = 2e-10
Identities = 81/507 (15%), Positives = 160/507 (31%), Gaps = 16/507 (3%)

Query: 17 LQNTASLLATTQNNLATGNKVNTALDNPTEFFTAQSLNNRASDIANLLDSIGNGVQVLQA 76
L + S L++ L++G ++N+A D+ A + + + +G+ + Q
Sbjct: 17 LNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQASRNANDGISIAQT 76

Query: 77 ANTGLTSLQKLVDSAKSIASQVLQAPTGYTTKSSITSAVIPGATANNLLGSSSNNFVTGS 136
L + + + ++ Q + SI + + + + + +
Sbjct: 77 TEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEIDRVSNQT----QFN 132

Query: 137 TVNNDNLSSAVAITGSTRLSGTPSSTSNDLASSITTGDTLVVNGVVFTFVAGSVSAGTNI 196
V + + + I T + + D VNG V S+ N+
Sbjct: 133 GVKVLSQDNQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKEATVGDLKSSFKNV 192

Query: 197 GVGDTVSNLLAAIDSVTGATATPSSVTGGKIALATGTAQDLTVSGTALAKLGLTAATTTR 256
DT + + A + TA + A G
Sbjct: 193 TGYDTYAVGANKYRVDVNSGAV----------VTDTTAPTVPDKVYVNAANGQLTTDDAE 242

Query: 257 NAPALSGQTLTIASTGGGVATNITFGTGASQISTLAQLNTALASNNLQASLSTTGQLTIL 316
N A+ T ++ G A I + + + + G+++
Sbjct: 243 NNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVSTT 302

Query: 317 TTNEAASSTIGAVAGSSTASSMAFNGVTASTPVADTNSQTTRAGLIAQYNNVLAQINTTA 376
E + T+ + + A + + + N Q T + L+ + A
Sbjct: 303 INGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDL--EA 360

Query: 377 QDASFNGINLLNGDTLKLVFNETGRSTLNITGVTFNSTGLGLSALVVGTDFLDSNSANKV 436
+A + + TL + + T G+S L+ S
Sbjct: 361 NNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAAKKSTANP 420

Query: 437 LSTLNSASTAIRSEASSLGSNLSIVQIRQDFNKNLINVLQTGSSNLTLADTNEEAANSQA 496
L++++SA + + + SSLG+ + N + L + S + AD E +N
Sbjct: 421 LASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSNMSK 480

Query: 497 LSTRQSIAVSALALANQSQASVLQLLR 523
Q S LA ANQ +VL LLR
Sbjct: 481 AQILQQAGTSVLAQANQVPQNVLSLLR 507


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_5528  FLAGELLIN  55  3e-10  Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 55.4 bits (133), Expect = 3e-10
Identities = 80/498 (16%), Positives = 154/498 (30%), Gaps = 7/498 (1%)

Query: 17 LQSTAQLLATTQNNLATGKKVNSALDNPTNFFTAQGLDNRASDISNLLDGIGNGVQVLQA 76
L + L++ L++G ++NSA D+ A + ++ +G+ + Q
Sbjct: 17 LNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQASRNANDGISIAQT 76

Query: 77 ANTGITSLQKLVDSAKSIANQVLQSSVGYSTKSNVTSAALAGATASSLIGASTTAVTGSV 136
+ + + + ++ Q + S ++ + T V
Sbjct: 77 TEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEIDRVSNQTQFNGVKV 136

Query: 137 VLNDNTSSAVAITGTTKLSGTPGTSSNDLASSITTGDTLVVNGTTFTFIAGTSSSGTNIG 196
+ DN I T + D VNG + SS N+
Sbjct: 137 LSQDNQMK---IQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKEATVGDLKSSFKNVT 193

Query: 197 VGDTVTNLLSTIQSATGVTSSITAGAITLTPPAAGLTLSGTSLAKLGLSAVGNSLSGQTL 256
DT + + + +T P + + L
Sbjct: 194 GYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDDAENNTAV-DLFKT 252

Query: 257 TIAATGGGTATSITFGLGTGQVNSLNDLNTKLAANNLQASFDTSSGKISITTTNDAASAT 316
T + G A +I + G+ D + + D + ++TT + T
Sbjct: 253 TKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGK---VSTTINGEKVT 309

Query: 317 IGAIGGTAAASSQSFNGLTAAAPVADATAQSQRSSLVAQYNNVLQQINTTAADASFNGVN 376
+ TA A++ L ++ V + Q + N + + A +A
Sbjct: 310 LTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEANNAVKGESK 369

Query: 377 LLNGDTLKLTFNETGKSSLSITGVTFNIAGLGLSNLTAGTDFLDNNSANKVLNVLNTASS 436
+ K +L+ + + G+S L S L +++A S
Sbjct: 370 ITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAAKKSTANPLASIDSALS 429

Query: 437 TLRSEASTLGSNLSVVQIRQDFNKNLINVLQTGSSNLTLADTNEEAANSQALSTRQSIAV 496
+ + S+LG+ + N + L + S + AD E +N Q
Sbjct: 430 KVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSNMSKAQILQQAGT 489

Query: 497 SALSLANQSQASVLQLLR 514
S L+ ANQ +VL LLR
Sbjct: 490 SVLAQANQVPQNVLSLLR 507


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_5530  FLAGELLIN  57  7e-11  Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 57.4 bits (138), Expect = 7e-11
Identities = 82/498 (16%), Positives = 159/498 (31%), Gaps = 7/498 (1%)

Query: 19 LQSTAQLLATTQNNLSTGKKVNSALDNPTNFFTAQGLDNRASDISNLLDGIGNGVQVLQS 78
L + L++ LS+G ++NSA D+ A + ++ +G+ + Q+
Sbjct: 17 LNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQASRNANDGISIAQT 76

Query: 79 ANTGITSLQKLVDSAKSIANQVLQSAVGYSTKSNVTSAALTGATTTSLIGASSTAVTGSV 138
+ + + + ++ +Q+ G ++ S++ S I S +
Sbjct: 77 TEGALNEINNNLQRVRELS---VQATNGTNSDSDLKSIQDEIQQRLEEIDRVSNQTQFNG 133

Query: 139 VLNDNTSTAVAITGSTKLSGTPSTSSNDLASSITTGDTLVVNGTTFTFIAGTSSSGTNIG 198
V + + I T + + D VNG + SS N+
Sbjct: 134 VKVLSQDNQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKEATVGDLKSSFKNVT 193

Query: 199 VGDSVTNLLSTIQSATGVTSSITAGAITLTPPAAGLTLSGTSLAKLGLSAVGNSLSGQTL 258
D+ + + + +T P + + L
Sbjct: 194 GYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDDAENNTAV-DLFKT 252

Query: 259 TIAATGGGTATSVTFGLGTGQVNSLNDLNAKLAANNLQASFDTATSKITISTTNDAASAT 318
T + G A ++ + G+ D + + D +STT + T
Sbjct: 253 TKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGK---VSTTINGEKVT 309

Query: 319 IGAIGGTAAASSQSFNGLTAAAPVADATAQSQRSSLVAQYNNVLAQINTTAADASFNGVN 378
+ TA A++ L ++ V + Q + N + A +A
Sbjct: 310 LTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEANNAVKGESK 369

Query: 379 LLNGDTLKLTFNENGKSTLSITGVTFNTGGLGLSTLTAGTDFLDNNSANKVIGVLNTASS 438
+ K TL+ + + G+STL S + +++A S
Sbjct: 370 ITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAAKKSTANPLASIDSALS 429

Query: 439 TLRNEASTLGSNLSVVQIRQDFNKNLINVLQTGSSNLTLADTNEEAANSQALSTRQSIAV 498
+ S+LG+ + N + L + S + AD E +N Q
Sbjct: 430 KVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSNMSKAQILQQAGT 489

Query: 499 SALSLANQSQASVLQLLR 516
S L+ ANQ +VL LLR
Sbjct: 490 SVLAQANQVPQNVLSLLR 507


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_5536  FLGHOOKAP1  99  5e-24  Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 99.3 bits (247), Expect = 5e-24
Identities = 86/324 (26%), Positives = 149/324 (45%), Gaps = 24/324 (7%)

Query: 1 MSGLRSTQAALSIISSNVANANTPGY----VAQNPNQIEVASGG-FGSTVMTTGVNRQLD 55
MSGL + QAAL+ S+N+++ N GY + +GG G+ V +GV R+ D
Sbjct: 8 MSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVSGVQREYD 67

Query: 56 LFVQNQLRTETSGGAYADQIANILKQLQSVYGTPGGSGTLETSLNNFTTALQSLSNNPSN 115
F+ NQLR + + + ++ ++ T + +L T + +F T+LQ+L +N +
Sbjct: 68 AFITNQLRAAQTQSSGLTARYEQMSKIDNMLST--STSSLATQMQDFFTSLQTLVSNAED 125

Query: 116 QSAQSVAMSAAQGLAQQLNATTKGIQTLRSNVEQDIGNSVTQANAAISQIATLNTQ---L 172
+A+ + ++GL Q T + ++ V IG SV Q N QIA+LN Q L
Sbjct: 126 PAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLNDQISRL 185

Query: 173 QGLNTSDPLSATLQDQRDNAINTLSKYVDIRVVSDGSNGVSVFTNSGVQLVGAGLSSQFT 232
G+ L DQRD ++ L++ V + V ++ +G LV + Q
Sbjct: 186 TGVGAGAS-PNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTARQLA 244

Query: 233 FSSPGTLDATSQYNSNPTKSGVGQLNIKLSNGVSLDVVSNNLLNSGQIAADLKLRDQTLV 292
+ ++ ++P+++ V ++ N + LLN+G + L R Q L
Sbjct: 245 -----AVPSS----ADPSRTTVAYVDGTAGNIEIPE----KLLNTGSLGGILTFRSQDLD 291

Query: 293 QAQTQVDQLAATMASALSDKTTAG 316
Q + + QLA A A + + AG
Sbjct: 292 QTRNTLGQLALAFAEAFNTQHKAG 315



Score = 47.3 bits (112), Expect = 1e-07
Identities = 22/79 (27%), Positives = 38/79 (48%)

Query: 530 TVSGYLQQVVSQQGSASTLATQLSQGQSVVVSTLKEKFNSTAGVNMDSEMSNLIQVQNTY 589
+ + +VS G+ + S Q VV+ L + S +GVN+D E NL + Q Y
Sbjct: 466 SFNDAYASLVSDIGNKTATLKTSSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYY 525

Query: 590 SANAHIMSVVQSMMQSLLQ 608
ANA ++ ++ +L+
Sbjct: 526 LANAQVLQTANAIFDALIN 544


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_5537  FLGHOOKAP1  38  1e-04  Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 37.6 bits (87), Expect = 1e-04
Identities = 15/47 (31%), Positives = 26/47 (55%)

Query: 553 ISGGSLEGSNTDIADEFTKLIVTQQAYSANTKVITTANSMVQDLLNV 599
+S S ++ +E+ L QQ Y AN +V+ TAN++ L+N+
Sbjct: 499 LSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545



Score = 29.5 bits (66), Expect = 0.046
Identities = 12/33 (36%), Positives = 19/33 (57%)

Query: 5 DAMNTSVSGLSAQSFALQNISGNIANASTVGYK 37
+N ++SGL+A AL S NI++ + GY
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVAGYT 34


45  BBta_5623  BBta_5630  Y  N  N  Genomic Island
LocusTag   DNBias  CDNBias    %GCBias  Product
BBta_5623      -2       16  -3.675453  hypothetical protein
BBta_5624      -1       19  -4.015532  asparagine synthetase
BBta_5625       1       21  -5.509680  hypothetical protein
BBta_5626       1       19  -5.047612  hydrolase domain-containing protein
BBta_5627       1       17  -4.682887  hypothetical protein
BBta_5628       1       14  -4.614941  alginate O-acetyltransferase
BBta_5629       1       13  -3.730070  hypothetical protein
BBta_5630       1       11  -3.384727  hypothetical protein
46  BBta_5738  BBta_5798  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias    %GCBias  Product
BBta_5738      -1       13  -3.006793  acylglycerophosphoethanolamine acyltransferase
BBta_5739      -1       22  -5.601445  hypothetical protein
BBta_5740      -1       21  -5.603538  dimethyl sulfoxide reductase
BBta_5741      -1       21  -6.089735  GntR family transcriptional regulator
BBta_5742      -1       21  -6.194168  TRAP dicarboxylate family transporter subunit
BBta_5743      -1       26  -5.300854  TRAP C4-dicarboxylate transport system subunit
BBta_5744      -1       26  -5.174028  TRAP-type C4-dicarboxylate transport system
BBta_5745       1       30  -4.791542  transcriptional regulator
BBta_5746       1       33  -4.807652  Crp/FNR family transcriptional regulator
BBta_5747       0       31  -4.265183  hypothetical protein
BBta_5748       1       28  -4.002790  ABC transporter substrate-binding protein
BBta_5749       0       28  -3.624250  ABC transporter ATP-binding protein
BBta_5750      -1       27  -3.495379  ABC transporter permease
BBta_5751      -1       25  -3.483048  hypothetical protein
BBta_5752       0       25  -3.703075  hypothetical protein
BBta_5753       1       25  -3.848236  hypothetical protein
BBta_5755       0       24  -3.441275  alcohol dehydrogenase
BBta_5756       0       22  -3.702544  hypothetical protein
BBta_5757       1       22  -3.892624  poly-beta-hydroxybutyrate polymerase
BBta_5758       1       22  -3.805438  hypothetical protein
BBta_5760       1       22  -3.301564  hypothetical protein
BBta_5761       0       24  -2.363410  formyl-coenzyme A transferase
BBta_5762       0       33  -5.064040  acyl-CoA N-acyltransferases (Nat)
BBta_5763       1       33  -4.217199  hypothetical protein
BBta_5764       0       29  -2.356556  hypothetical protein
BBta_5765       1       26  -1.538132  hypothetical protein
BBta_5766       2       30  -3.163127  hypothetical protein
BBta_5767       2       30  -3.362442  phage tail Collar domain
BBta_5770       2       24  -1.790252  hypothetical protein
BBta_5771      -1       21  -2.336913  hypothetical protein
BBta_5772       0       26  -3.176515  hypothetical protein
BBta_5774       1       30  -2.967127  hypothetical protein
BBta_5775       1       25  -1.456349  hypothetical protein
BBta_5776       0       22  -1.613207  hypothetical protein
BBta_5777       0       22  -1.974245  hypothetical protein
BBta_5778       1       27  -1.630553  hypothetical protein
BBta_5779       2       23  -1.439094  hypothetical protein
BBta_5780       3       14  -2.263441  hypothetical protein
BBta_5781       2       17  -2.908220  hypothetical protein
BBta_5782       1       18  -3.738196  hypothetical protein
BBta_5783       2       25  -4.418123  hypothetical protein
BBta_5784       2       26  -5.035155  hypothetical protein
BBta_5785       2       25  -5.089152  phage major head protein
BBta_5786      -1       24  -4.176418  hypothetical protein
BBta_5787      -1       25  -3.779694  hypothetical protein
BBta_5788      -1       26  -3.586116  hypothetical protein
BBta_5789      -2       21  -2.651521  hypothetical protein
BBta_5790      -2       19  -1.445714  hypothetical protein
BBta_5791       0       23  -2.070219  hypothetical protein
BBta_5792       5       33  -4.371762  hypothetical protein
BBta_5793       3       32  -4.666333  hypothetical protein
BBta_5794       3       34  -3.990625  hypothetical protein
BBta_5796       3       35  -4.914006  hypothetical protein
BBta_5797       4       41  -5.239370  hypothetical protein
BBta_5798       2       28  -3.544716  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_5738  TCRTETA  34  0.004  Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 33.6 bits (77), Expect = 0.004
Identities = 88/401 (21%), Positives = 141/401 (35%), Gaps = 60/401 (14%)

Query: 39 GAVAAHGDALVTVAGAVFIFPFFILSGLGGQLADRYVKGVVARRLKFAEIFAAGFAALGF 98
V AH L+ A++ F + + G L+DR+ + V L + AA A+
Sbjct: 39 NDVTAHYGILL----ALYALMQFACAPVLGALSDRFGRRPV---LLVSLAGAAVDYAIMA 91

Query: 99 FLHSIPLLFVALGMFGVIAALFGPVKFAMLPDQLTVGELATGNALVEGATFMAILIGTIA 158
+ +L++ + G+ A G V A + D E A + ++ G +
Sbjct: 92 TAPFLWVLYIGRIVAGITGAT-GAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVL 150

Query: 159 GGQLVSGSSHMGWVALAVI-GLAVISWASASRIPHTAPSAPDLRITANPWTSTSHLLKSL 217
GG + S H + A A + GL ++ H P R NP S
Sbjct: 151 GGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASF------- 203

Query: 218 YAAPRLWDGMVIVSWF-------WLVGAVVLSLLPALVKDVVGGTEGVVTLCLAIFAIGI 270
R GM +V+ LVG V +L +D + + LA F I
Sbjct: 204 ----RWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGI-- 257

Query: 271 AIGSLFAAHLSHVRPNLA--LVPIGAILMGVLGLDLSWAIATTETAKGLTPLGFIGSWAG 328
+ SL A ++ +A L A+++G++ G L F G
Sbjct: 258 -LHSLAQAMIT---GPVAARLGERRALMLGMIA-----------DGTGYILLAFATR--G 300

Query: 329 ARMLIDFALFAFGGGLFVVPAFAAVQAWSEPDERARIIAAGNV-LQAAFMVVGSLAVAGL 387
L A GG +PA A+ + +ER + L + +VG L +
Sbjct: 301 WMAFPIMVLLASGG--IGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAI 358

Query: 388 QAAGVSI----AWIFFGLGLASFGAVWFVLNK--WGKEGVR 422
AA ++ AWI G A + L + W G R
Sbjct: 359 YAASITTWNGWAWI---AGAALYLLCLPALRRGLWSGAGQR 396


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_5743  ACRIFLAVINRP  29  0.011  Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 29.0 bits (65), Expect = 0.011
Identities = 9/42 (21%), Positives = 20/42 (47%), Gaps = 2/42 (4%)

Query: 28 SAVIPWAVFTRYVLNSAA--SWPEPMAVLLTILLTFIGAAAG 67
A++ + ++ +A SW P++V+L + L +G
Sbjct: 873 PALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLA 914


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_5748  BINARYTOXINB  31  0.009  Binary toxin B family signature.
		>BINARYTOXINB#Binary toxin B family signature.

Length = 764

Score = 30.8 bits (69), Expect = 0.009
Identities = 23/122 (18%), Positives = 41/122 (33%), Gaps = 15/122 (12%)

Query: 155 ENAGPLKDTI--ANLRTFSDGLARNTGKLDSIVSGLEKMTGGGAPAQKVTYDLRAPRDFG 212
+ + ++ A RT+++ + NT + + + + G AP V
Sbjct: 358 SSTVAIDHSLSLAGERTWAETMGLNTADTARLNANIRYVNTGTAPIYNVLPTTSLVLGKN 417

Query: 213 PIAKTIKGQLAIPEPTAVAMLQTQRILFSPAKDYPGFGDALWADSIPKLLQARLIDAFEN 272
TIK + Q +IL +P YP A A + + I N
Sbjct: 418 QTLATIKAKEN----------QLSQIL-APNNYYPSKNLAPIALNAQDDFSSTPITM--N 464

Query: 273 YD 274
Y+
Sbjct: 465 YN 466


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_5753  SACTRNSFRASE  34  0.001  Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 34.1 bits (78), Expect = 0.001
Identities = 25/113 (22%), Positives = 44/113 (38%), Gaps = 10/113 (8%)

Query: 771 LRFFAPMKEFTHEFIARLTQLDYSRAMAFVALDETTNELVGVVRIHSDSVYESGEYA--- 827
RF P + + ++ ++ AF+ E N +G ++I S+ YA
Sbjct: 40 ERFSKPYFKQYEDDDMDVSYVEEEGKAAFLYYLE--NNCIGRIKIRSNW----NGYALIE 93

Query: 828 -ILLRSDLKGRGLGWALMQLIIEYAKAEDLKMISGDVLQENIVMLEMCRNLGF 879
I + D + +G+G AL+ IE+AK + + NI F
Sbjct: 94 DIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHF 146


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_5779  cloacin  35  3e-04  Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 35.5 bits (81), Expect = 3e-04
Identities = 27/88 (30%), Positives = 36/88 (40%)

Query: 191 NGGAGGSGSGGATSNGGVASGGDVNQTGSYGAASSGAGSGPQAALIGGAGAGSAFGGGTP 250
+GG G + GA S G +GG GA+ S GG+G+G +GGG+
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 251 GGWPNAGGLSATSFGAGGGGAGGPAAAA 278
G G S G GG + A A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVA 89



Score = 33.9 bits (77), Expect = 8e-04
Identities = 38/121 (31%), Positives = 43/121 (35%), Gaps = 21/121 (17%)

Query: 155 MVGGGGGGGGSANTSANAGAGGAGGNTTFGTSFLAANGGA--------------GGSGSG 200
M GG G G N GA GN G + L GGA GGSGSG
Sbjct: 1 MSGGDGRG-------HNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSG 53

Query: 201 GATSNGGVASGGDVNQTGSYGAASSGAGSGPQAALIGGAGAGSAFGGGTPGGWPNAGGLS 260
G G N G+ + G S A + G A S G G +AG LS
Sbjct: 54 IHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALS 113

Query: 261 A 261
A
Sbjct: 114 A 114



Score = 32.4 bits (73), Expect = 0.002
Identities = 34/133 (25%), Positives = 47/133 (35%), Gaps = 20/133 (15%)

Query: 198 GSGGATSNGGVASGGDVNQTGSYGAASSGAGSGPQAALIGGAGAGSAFGGGTPGGWPNAG 257
G G + G ++ G++N + GA G G + + GG G + G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGS------GWSSENNPWGGGSGSGIHWG 57

Query: 258 GLSATSFGAGGGGAGGPAAAAGYSGGGGASGGYCEKLIASPAASYAFAVGAGGTAGTAGS 317
G S G G G +GG G GG S + AA AF A T G G
Sbjct: 58 GGSGHGNGGGNGNSGG-----GSGTGGNLS---------AVAAPVAFGFPALSTPGAGGL 103

Query: 318 GGTTGSSGGSGVI 330
+ + S I
Sbjct: 104 AVSISAGALSAAI 116


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_5780  TCRTETA  29  0.008  Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 28.6 bits (64), Expect = 0.008
Identities = 9/45 (20%), Positives = 18/45 (40%), Gaps = 3/45 (6%)

Query: 64 IGTITGGQAAGTFRFIGGSDPSWTAAGGSIGPFQYAVLYNATSAT 108
+ + G + GS + T+ +GP + +Y A+ T
Sbjct: 324 LSRQVDEERQG---QLQGSLAALTSLTSIVGPLLFTAIYAASITT 365


47  BBta_5811  BBta_5847  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias    %GCBias  Product
BBta_5811       0       20  -3.601727  arginase
BBta_5813       2       25  -4.384517  hypothetical protein
BBta_5814       1       23  -1.996349  hypothetical protein
BBta_5815      -2       21  -2.838190  hypothetical protein
BBta_5816      -2       18  -2.692179  cytochrome C-552
BBta_5817      -1       19  -3.049989  hypothetical protein
BBta_5818      -3       18  -2.789076  hypothetical protein
BBta_5819      -2       17  -2.809879  transcriptional regulator
BBta_5820      -2       21  -4.090946  hypothetical protein
BBta_5821       0       21  -3.586023  major facilitator superfamily permease
BBta_5822       1       16  -2.761860  hypothetical protein
BBta_5823       1       16  -2.443220  hypothetical protein
BBta_5824       1       16  -2.522065  hypothetical protein
BBta_5825       1       15  -2.190601  hypothetical protein
BBta_5826       1       15  -1.719324  hypothetical protein
BBta_5827       2       15  -1.281913  protective surface antigen
BBta_5828       1       22  -1.415625  hypothetical protein
BBta_5829       1       18  -0.977448  hypothetical protein
BBta_5830       2       17  -0.918763  hypothetical protein
BBta_5831       0       17  -1.341199  hypothetical protein
BBta_5832      -1       15  -1.945806  LexA repressor
BBta_5833      -1       20  -1.927575  hypothetical protein
BBta_5834      -1       18  -1.809091  quinone oxidoreductase
BBta_5835       2       22  -1.428369  hypothetical protein
BBta_5837       2       15  -0.201950  quaternary ammonium compound-resistance protein
BBta_5838       2       15  -0.017146  hypothetical protein
BBta_5839       5       13   0.850788  hypothetical protein
BBta_5840       4       14   0.457122  hypothetical protein
BBta_5841       2       13   1.013656  ATP-binding component, PhnN protein
BBta_5842       1       14   1.050675  metal-dependent hydrolase
BBta_5843       1       17   0.770093  phosphonate ABC transporter ATP-binding protein,
BBta_5844       1       15   1.645445  phosphonate C-P lyase system protein PhnK
BBta_5845       1       15   1.207833  phosphonate metabolism protein PhnJ
BBta_5846       3       17   1.852931  phosphonate metabolism protein PhnI
BBta_5847       2       16   1.343302  carbon-phosphorus lyase complex subunit
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_5821  TCRTETB  40  2e-05  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 39.9 bits (93), Expect = 2e-05
Identities = 25/132 (18%), Positives = 49/132 (37%), Gaps = 1/132 (0%)

Query: 53 DFGTDASGVAMLASSYFWGYTLMQIPAGLLVDRYGVKRVVLCSMAASVVGSLAFALSPTL 112
DF + + +++ +++ G L D+ G+KR++L + + GS+ + +
Sbjct: 43 DFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSF 102

Query: 113 FDVFI-ARVIVACGDALVFTALLKLVAQSFTDERFGLMSGISQVSGYAGGVIATTPLAAA 171
F + I AR I G A ++ +VA+ E G G+ G +
Sbjct: 103 FSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMI 162

Query: 172 VSGFGWRACFFF 183
W
Sbjct: 163 AHYIHWSYLLLI 174


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_5843  PF05272  30  0.010  Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.7 bits (66), Expect = 0.010
Identities = 9/23 (39%), Positives = 13/23 (56%)

Query: 44 CVVLSGPSGAGKSSILKMIFGNY 66
VVL G G GKS+++ + G
Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLD 620


48  BBta_5957  BBta_5967  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias    %GCBias  Product
BBta_5957      -1       26  -3.581747  ABC transporter ATP-binding protein
BBta_5958       4       33  -6.182368  hypothetical protein
BBta_5959       6       18  -4.438878  hypothetical protein
BBta_5960       6       18  -4.402440  hypothetical protein
BBta_5961       2       18  -4.190527  coenzyme PQQ synthesis protein PqqA
BBta_5962       1       19  -4.593631  hypothetical protein
BBta_5963       2       14  -3.988006  hypothetical protein
BBta_5964       2       13  -3.958614  quinoprotein ethanol dehydrogenase
BBta_5965       1       15  -4.020216  cytochrome C
BBta_5966       1       14  -3.767314  ABC transporter substrate-binding protein
BBta_5967       3       13  -3.536045  AraC family transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_5957  PF05272  30  0.007  Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.4 bits (68), Expect = 0.007
Identities = 11/21 (52%), Positives = 14/21 (66%)

Query: 37 VVVGPSGCGKSTMLRILAGLD 57
V+ G G GKST++ L GLD
Sbjct: 600 VLEGTGGIGKSTLINTLVGLD 620


49  BBta_5980  BBta_5986  Y  N  N  Genomic Island
LocusTag   DNBias  CDNBias    %GCBias  Product
BBta_5980       2       12   1.787812  fumarylacetoacetate hydrolase family protein
BBta_5981       4       12   2.823780  ABC transporter substrate-binding protein
BBta_5982       5       16   3.484618  GntR family transcriptional regulator
BBta_5983       4       18   3.119834  ABC transporter ATP-binding protein
BBta_5984       3       15   2.748621  ABC transporter permease
BBta_5985       2       14   2.509742  ABC transporter permease
BBta_5986       2       13   2.073908  fumarate reductase/succinate dehydrogenase
50  BBta_6047  BBta_6053  Y  N  N  Genomic Island
LocusTag   DNBias  CDNBias    %GCBias  Product
BBta_6047       2       12   1.992728  alpha/beta family hydrolase
BBta_6048       2       10   1.482283  hypothetical protein
BBta_6049       2       11   1.493789  beta-lactamase
BBta_6050       3       12   1.583760  sensor histidine kinase
BBta_6052       3       11   0.939978  TonB-dependent siderophore receptor
BBta_6053       4       12   1.028294  hypothetical protein
51  BBta_6349  BBta_6368  Y  N  Y  Genomic Island
LocusTag   DNBias  CDNBias    %GCBias  Product
BBta_6349       2       13   0.280938  molybdopterin-guanine dinucleotide biosynthesis
BBta_6350       2       13  -0.404996  cation efflux transporter
BBta_6351       2       14  -0.880419  branched-chain amino acid ABC transporter
BBta_6352       3       16  -1.428932  hypothetical protein
BBta_6353       3       21  -2.625477  *P4-like integrase
BBta_6354       3       23  -2.785833  formate dehydrogenase subunit alpha
BBta_6355       3       30  -6.440220  phosphonate ABC transporter ATP-binding protein
BBta_6356       4       32  -6.009080  phosphonate ABC transporter permease
BBta_6357       5       35  -6.435429  phosphate/phosphonate ABC transporter
BBta_6358       6       39  -6.004324  sulfonate ABC transporter substrate-binding
BBta_6359       5       36  -4.679721  nitrate ABC transporter ATP-binding protein
BBta_6360       4       31  -3.818887  sulfonate ABC transporter permease
BBta_6361       4       27  -3.355894  transposase
BBta_6362       6       29  -3.918763  hypothetical protein
BBta_6363       6       29  -4.097317  transposase
BBta_6364       6       29  -4.837214  phosphonate ABC transporter ATP-binding protein
BBta_6365       6       31  -5.126156  phosphonate ABC transporter permease
BBta_6366       3       20  -3.053587  phosphate/phosphonate ABC transporter
BBta_6367       2       19  -2.664508  hypothetical protein
BBta_6368       2       19  -2.509822  dioxygenase
52  BBta_6436  BBta_6449  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias    %GCBias  Product
BBta_6436       1       11   3.029074  bacteriochlorophyllide reductase subunit
BBta_6437       2       10   2.835714  bacteriochlorophyllide reductase subunit
BBta_6438       3       11   3.449311  2-desacetyl-2-hydroxyethyl
BBta_6439       4       10   3.696428  hydroxyneurosporene-O-methyltransferase
BBta_6440       2        9   3.036523  farnesyl-diphosphate synthase
BBta_6441       2        9   2.775708  hydroxyneurosporene and rhodopin dehydrogenase
BBta_6442       1       10   2.021909  hydroxyneurosporene synthase
BBta_6443       3       11   3.098533  hypothetical protein
BBta_6444       3       11   3.041727  phytoene synthase
BBta_6445       3       12   2.548698  phytoene dehydrogenase
BBta_6446       3       11   2.876899  alpha/beta hydrolase protoporphyrin IX magnesium
BBta_6447       2       11   2.714674  alpha/beta hydrolase
BBta_6448       3       12   2.947164  magnesium chelatase subunit D
BBta_6449       3       11   2.114689  protoporphyrin IX magnesium-chelatase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_6449  HTHFIS  35  4e-04  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 35.2 bits (81), Expect = 4e-04
Identities = 49/233 (21%), Positives = 83/233 (35%), Gaps = 29/233 (12%)

Query: 128 GEKSFEPGLLARANRGFLYIDEVNLLEDHLVDLLLDVAASGENVVERDG--LSIRHPARL 185
G ++ G +A G L++DE+ + LL V GE G IR R+
Sbjct: 218 GAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGE--YTTVGGRTPIRSDVRI 275

Query: 186 VLVGTGNPE----EGELRPQLLDRFGLSVEVKTPADLPTRIEVIKRRDAFETDPAAFVAH 241
V + + +G R L R + V ++ P L R E D V H
Sbjct: 276 VAATNKDLKQSINQGLFREDLYYRLNV-VPLRLPP-LRDRAE----------DIPDLVRH 323

Query: 242 WEKEEAKQRKRILA-AREHIASVKVAD-----RELENAARLCMALGTDGLRGELTL---M 292
+ ++ K+ + +E + +K RELEN R AL + + +
Sbjct: 324 FVQQAEKEGLDVKRFDQEALELMKAHPWPGNVRELENLVRRLTALYPQDVITREIIENEL 383

Query: 293 RAARAAAALDGAKAVDDGHLKTVAPMALRHRLRRNPLDDSGSSVRVERALNEL 345
R+ + ++ A A + A + + D S +R L E+
Sbjct: 384 RSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLYDRVLAEM 436


53  BBta_6508  BBta_6516  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias    %GCBias  Product
BBta_6508       2       12   1.717267  endoribonuclease L-PSP
BBta_6509       2       13   1.618428  gamma-glutamyltranspeptidase
BBta_6510       3       11   0.878448  hydrolase
BBta_6511       3       10   0.706381  tryptophan 2,3-dioxygenase
BBta_6512       4       10   1.952140  kynureninase
BBta_6513       3       11   2.604149  hypothetical protein
BBta_6514       3       11   2.221689  TRAP-type transporter domain-containing protein
BBta_6515       4       13   2.805479  major facilitator superfamily permease
BBta_6516       2       14   2.932424  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_6515  TCRTETB  35  4e-04  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 35.2 bits (81), Expect = 4e-04
Identities = 64/380 (16%), Positives = 120/380 (31%), Gaps = 58/380 (15%)

Query: 44 VVLPTVQAEFAASRGAVSLAYTLTMFGFGLGGVIVGRITDRFGIVTAMALSIACTAAAYL 103
V LP + +F + + T M F +G + G+++D+ GI + I +
Sbjct: 35 VSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSV 94

Query: 104 LAGASVTLWQFQ-AVYFLVGLGSSVTFAPLMAEASHWFVR-YRGLAVTIVAS-------- 153
+ + + F+ G G++ A +M + + + RG A ++ S
Sbjct: 95 IGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGV 154

Query: 154 GNYVAGTI-----WPPLVNWGVQQI-------------GWRASHVALGIFSAVTMSALLL 195
G + G I W L+ + I H + +++ +
Sbjct: 155 GPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVFF 214

Query: 196 LLRARMGGAG------------AQDHAHAAPPQLALPISTN---------ALTVLLSVAA 234
+L + P + + N + +VA
Sbjct: 215 MLFTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAG 274

Query: 235 IACCVAMAMPQVHIVAYCGDLGYGVAAGAQMLSLMMALGILSRIGSGFLADRVGGMRTLL 294
V M VH L + M++ I IG G L DR G + L
Sbjct: 275 FVSMVPYMMKDVH------QLSTAEIGSVIIFPGTMSVIIFGYIG-GILVDRRGPLYVLN 327

Query: 295 IGSLAQGFALLFYLFFDGLTSLYLISGMFGLFQG--GIVPSYAIIVREAMPAREAATRVG 352
IG + L F TS ++ + + G + IV ++ +EA +
Sbjct: 328 IGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMS 387

Query: 353 IVIFASVFGMSFGGWVSGVI 372
++ F S G + G +
Sbjct: 388 LLNFTSFLSEGTGIAIVGGL 407


54  BBta_6560  BBta_6631  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias    %GCBias  Product
BBta_6560       2       13   2.950656  hypothetical protein
BBta_6561       4       14   3.033940  CysQ protein
BBta_6562       5       15   1.737040  hypothetical protein
BBta_6564       5       17   1.907670  hypothetical protein
BBta_6565       6       18   0.581004  hypothetical protein
BBta_6567       9       25  -1.256451  *hypothetical protein
BBta_6568       8       24  -1.343608  hypothetical protein
BBta_6569       8       25  -1.378385  hypothetical protein
BBta_6571       9       27  -0.714451  hypothetical protein
BBta_6573       9       29  -0.748427  hypothetical protein
BBta_6574      10       33  -0.951085  hypothetical protein
BBta_6575      11       31  -0.820093  hypothetical protein
BBta_6576      12       28  -1.507117  hypothetical protein
BBta_6577      11       29  -1.653052  hypothetical protein
BBta_6578      11       29  -1.404874  hypothetical protein
BBta_6579      12       28  -2.005215  hypothetical protein
BBta_6580      12       27  -2.133934  hypothetical protein
BBta_6581      11       26  -1.875783  hypothetical protein
BBta_6582      10       28  -0.818432  hypothetical protein
BBta_6583      11       25  -1.374967  bacteriophage protein
BBta_6584      10       24  -2.014168  bacteriophage protein, GP46-like protein
BBta_6585      10       24  -2.145503  bacteriophage protein, gp45-like protein
BBta_6586      10       21  -1.757804  Mu-like prophage tail protein
BBta_6587      10       22  -1.440674  Mu-like prophage DNA circulation protein
BBta_6588      10       22  -2.027120  hypothetical protein
BBta_6590      10       23  -0.869603  hypothetical protein
BBta_6591      10       22  -0.898666  hypothetical protein
BBta_6592       8       20  -0.076860  Mu-like prophage FluMu tail sheath protein
BBta_6593       7       19  -0.435871  hypothetical protein
BBta_6594       6       19  -0.922256  hypothetical protein
BBta_6595       6       18  -1.157912  hypothetical protein
BBta_6596       8       21  -1.579332  hypothetical protein
BBta_6597       8       21  -1.282297  peptidase S14, ClpP
BBta_6598       9       23  -1.896654  lambda family phage portal protein
BBta_6599       9       26  -1.499316  hypothetical protein
BBta_6600      10       27  -0.950233  hypothetical protein
BBta_6601       9       24  -0.920783  phage terminase large subunit
BBta_6602       8       24  -0.353907  hypothetical protein
BBta_6603       8       25  -0.475090  hypothetical protein
BBta_6604       8       24  -0.163525  hypothetical protein
BBta_6605       8       24  -0.447436  hypothetical protein
BBta_6606       7       23  -0.473042  phage / plasmid primase P4
BBta_6607       8       24   0.081174  DNA primase
BBta_6608       7       23  -0.527639  hypothetical protein
BBta_6609       7       23  -0.747998  hypothetical protein
BBta_6610       9       25  -0.596812  hypothetical protein
BBta_6611       9       23   0.275682  adenine-specific DNA-methyltransferase
BBta_6612      11       26   1.217313  hypothetical protein
BBta_6613      11       26  -0.812830  single-stranded DNA-binding protein
BBta_6614      10       26   0.359827  hypothetical protein
BBta_6615      10       26   0.180561  hypothetical protein
BBta_6616      11       28  -1.106209  hypothetical protein
BBta_6617       8       28  -2.049387  hypothetical protein
BBta_6618       8       28  -3.243552  hypothetical protein
BBta_6619       8       30  -1.010733  hypothetical protein
BBta_6620       8       32  -2.207978  hypothetical protein
BBta_6621       5       30  -3.846615  hypothetical protein
BBta_6622       4       31  -3.418160  hypothetical protein
BBta_6623       6       32  -3.991967  hypothetical protein
BBta_6624       7       34  -4.643510  hypothetical protein
BBta_6625       7       33  -4.694015  hypothetical protein
BBta_6626       9       38  -6.993083  prophage integrase
BBta_6627       8       38  -6.765254  hypothetical protein
BBta_6628       4       33  -6.077833  hypothetical protein
BBta_6629       3       21  -4.280456  phage-related transcriptional regulator
BBta_6630       2       15  -3.542219  hypothetical protein
BBta_6631       0       11  -3.358576  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_6564  GPOSANCHOR  33  6e-04  Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 33.1 bits (75), Expect = 6e-04
Identities = 12/39 (30%), Positives = 14/39 (35%), Gaps = 1/39 (2%)

Query: 7 AQQAPPPAAPPYQAAPPYQQP-APPYQQPAPPPRPAPSP 44
A+QA A A Q P A P + P AP
Sbjct: 449 AKQAEELAKLRAGKASDSQTPDAKPGNKAVPGKGQAPQA 487


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_6565  GPOSANCHOR  46  2e-07  Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 46.2 bits (109), Expect = 2e-07
Identities = 50/249 (20%), Positives = 88/249 (35%), Gaps = 14/249 (5%)

Query: 81 QLADLGKKTDAINRMKIELGEKNAAIFALEAREKALKDQLRATEEEFAAKTEALRAAEQA 140
+ ADL K + K + A +A A K L E + A A +
Sbjct: 121 RKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKT 180

Query: 141 LKDKQAELVRLTTELNDKSLLADSRQVELVSVRAQVEELRTRVAEAEKEFAATQARLALE 200
L+ ++A L EL A + + A+++ L A A + L
Sbjct: 181 LEAEKAALEARQAELEKALEGAMN---FSTADSAKIKTLEAEKAALAARKADLEKALEGA 237

Query: 201 RDESDTATKALNDARARVENLSQRVTELDRQLIVQVKEAEMLATRVADLEGRLATQGKLL 260
+ S + + A L R EL++ L + + + ++ LE A
Sbjct: 238 MNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEK 297

Query: 261 AEREFENNQLREANAAADRALKELRDEIAGFGSGKSAALERLKSEKAALEEQLQAARDER 320
A+ E ++ L + R L R+ L++E LEEQ + + R
Sbjct: 298 ADLEHQSQVLNANRQSLRRDLDASREAKKQ-----------LEAEHQKLEEQNKISEASR 346

Query: 321 TKLQREMNA 329
L+R+++A
Sbjct: 347 QSLRRDLDA 355



Score = 40.4 bits (94), Expect = 1e-05
Identities = 60/354 (16%), Positives = 122/354 (34%), Gaps = 11/354 (3%)

Query: 47 AEIQADKDQLRAEFAMSARRLELSVDALKNKATSQLADLGKKTDAINRMKIELGEKNAAI 106
A +AD ++ + + L+ + + A + A+ +A I
Sbjct: 154 AARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKI 213

Query: 107 FALEAREKALKDQLRATEEEFAAKTEALRAAEQALKDKQAELVRLTTELNDKSLLADSRQ 166
LEA + AL + E+ A +K +AE L + +
Sbjct: 214 KTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAM 273

Query: 167 VELVSVRAQVEELRTRVAEAEKEFAATQARLALERDESDTATKALNDARARVENLSQRVT 226
+ A+++ L A E E A + + + + + L+ +R + L
Sbjct: 274 NFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQ 333

Query: 227 ELDRQLIVQVKEAEMLATRVADLEGRLATQGKLLAEREFENNQLREANAAADRALKELRD 286
+L+ Q ++ EA L L + + E E+ +L E N ++ + + LR
Sbjct: 334 KLEEQN--KISEASR-----QSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRR 386

Query: 287 EIAGFGSGKSAALERLKSEKAALEEQLQAARDERTKLQREMNAIQQQAESTWAQERMENA 346
++ A ++++ +L A +L+ +++ A+ E
Sbjct: 387 DLDA----SREAKKQVEKALEEANSKLAALEKLNKELEESKKLTEKEKAELQAKLEAEAK 442

Query: 347 LLRERINDIAAEVAKLAMQIEGPNSTIEAMLAAEPVSAKPANANGAPAPAAAGA 400
L+E++ A E+AKL + T +A + V K P A
Sbjct: 443 ALKEKLAKQAEELAKLRAGKASDSQTPDAKPGNKAVPGKGQAPQAGTKPNQNKA 496



Score = 39.7 bits (92), Expect = 2e-05
Identities = 39/263 (14%), Positives = 88/263 (33%)

Query: 74 LKNKATSQLADLGKKTDAINRMKIELGEKNAAIFALEAREKALKDQLRATEEEFAAKTEA 133
+K + L K ++ L + N + + K + + E A+K +
Sbjct: 58 RADKFEIENNTLKLKNSDLSFNNKALKDHNDELTEELSNAKEKLRKNDKSLSEKASKIQE 117

Query: 134 LRAAEQALKDKQAELVRLTTELNDKSLLADSRQVELVSVRAQVEELRTRVAEAEKEFAAT 193
L A + L+ + +T + K ++ + L + +A +E+ +A
Sbjct: 118 LEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAK 177

Query: 194 QARLALERDESDTATKALNDARARVENLSQRVTELDRQLIVQVKEAEMLATRVADLEGRL 253
L E+ + L A N S + + L + +
Sbjct: 178 IKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGA 237

Query: 254 ATQGKLLAEREFENNQLREANAAADRALKELRDEIAGFGSGKSAALERLKSEKAALEEQL 313
+ + + A A L++ + F + SA ++ L++EKAALE +
Sbjct: 238 MNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEK 297

Query: 314 QAARDERTKLQREMNAIQQQAES 336
+ L ++++ ++
Sbjct: 298 ADLEHQSQVLNANRQSLRRDLDA 320
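
Each alignment in this report is preceded by a "Score = ... Expect = ..." line and an "Identities = ..." line. The following is a minimal Python sketch of how those two lines could be read into numbers; the function names and the dictionary layout are illustrative assumptions, not part of PredictBias or BLAST. It also handles the legacy shorthand in which the leading digit of the E-value is omitted (for example "e-138"). The percentages printed in this report appear to be truncated rather than rounded (60/354 is shown as 16%), so the sketch stores the printed value instead of recomputing it.

```python
import re

# Patterns for the two statistics lines that precede each alignment.
SCORE_RE = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)"
)
FRACTION_RE = re.compile(
    r"(Identities|Positives|Gaps)\s*=\s*(\d+)/(\d+)\s*\((\d+)%\)"
)


def parse_expect(text):
    """Convert an Expect string to float.

    Legacy output may omit the leading digit (e.g. 'e-138'),
    which float() rejects, so a '1' is prepended in that case.
    """
    if text.lower().startswith("e"):
        text = "1" + text
    return float(text)


def parse_hsp_stats(score_line, fraction_line):
    """Return bit score, raw score, E-value and the count fractions."""
    match = SCORE_RE.search(score_line)
    if match is None:
        raise ValueError("not a Score/Expect line: %r" % score_line)
    stats = {
        "bits": float(match.group(1)),
        "raw_score": int(match.group(2)),
        "expect": parse_expect(match.group(3)),
    }
    # Gaps is optional: ungapped alignments list only Identities and Positives.
    for name, num, den, pct in FRACTION_RE.findall(fraction_line):
        stats[name.lower()] = (int(num), int(den), int(pct))
    return stats


# Values copied from the second alignment of BBta_6565 above.
stats = parse_hsp_stats(
    "Score = 40.4 bits (94), Expect = 1e-05",
    "Identities = 60/354 (16%), Positives = 122/354 (34%), Gaps = 11/354 (3%)",
)
```

With the example lines, `stats["identities"]` holds the tuple (60, 354, 16), and integer division `100 * 60 // 354` reproduces the printed 16%.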


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_6579  PF07201  29  0.034  Hypersensitivity response secretion protein HrpJ
		>PF07201#Hypersensitivity response secretion protein HrpJ

Length = 293

Score = 29.4 bits (66), Expect = 0.034
Identities = 13/80 (16%), Positives = 29/80 (36%), Gaps = 6/80 (7%)

Query: 102 PLGTNTQQIATMAALQAALANLVSSAPATLDTLQELANALGNDPNYATTVTTALSNRLRV 161
P Q ++ + +L L +S +L L+ +P+ + L + L+
Sbjct: 95 PELEQKQNVSELLSL------LSNSPNISLSQLKAYLEGKSEEPSEQFKMLCGLRDALKG 148

Query: 162 DAAQGLTAAQKAQAIANLAL 181
+ QA+ ++A
Sbjct: 149 RPELAHLSHLVEQALVSMAE 168


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_6587  RTXTOXINA  30  0.034  Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 29.5 bits (66), Expect = 0.034
Identities = 32/164 (19%), Positives = 65/164 (39%), Gaps = 34/164 (20%)

Query: 124 RQGAASGLISIPLLSNV-----AYLAAENAAASIAQTLPTAISTAGQPAAAV-------- 170
+ AA ++ +L NV Y+ A+ AA ++ + A A A+
Sbjct: 267 TKAAAGVELTTKVLGNVGKGISQYIIAQRAAQGLSTSAAAAGLIASAVTLAISPLSFLSI 326

Query: 171 ------AAVTDTLSQAATAL---------DLVRQSYPIDPAVSA--TVRDAIAPLVASFA 213
A + SQ L +++ ID +++ TV +++ +++ A
Sbjct: 327 ADKFKRANKIEEYSQRFKKLGYDGDSLLAAFHKETGAIDASLTTISTVLASVSSGISAAA 386

Query: 214 TSITNEAAPGADASAAVSSLVGLVRTTMDAMPAASATRAALELA 257
T+ + GA SA V ++ G++ ++A A A ++A
Sbjct: 387 TT----SLVGAPVSALVGAVTGIISGILEASKQAMFEHVASKMA 426


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_6597  adhesinmafb  35  8e-04  Neisseria meningitidis: adhesin MafB signature.
		>adhesinmafb#Neisseria meningitidis: adhesin MafB signature.

Length = 467

Score = 35.4 bits (81), Expect = 8e-04
Identities = 32/137 (23%), Positives = 51/137 (37%), Gaps = 8/137 (5%)

Query: 159 LTAGEAVTQGFATKQIDEPATAMAAYDYRVYASAPEGLPVRVRDESNAAIAATKRESKTM 218
++AGEA+ G A + + EG + + A K + +
Sbjct: 240 ISAGEALGIGDILYGTRYAIDKAAMRN--IAPLPAEGKFAVIGGLGSVA-GFEKNTREAV 296

Query: 219 TPEEIAAANAAANATAAVVVPGTAAAPAAPVAAAAPAAEAGKSWAAGFYASAGNSGLTLA 278
NAA A A AA VA A AA+ GK+ +G +A + L L+
Sbjct: 297 DRWIQENPNAAETVEAVF-----NVAAAAKVAKLAKAAKPGKAAVSGDFADSYKKKLALS 351

Query: 279 DLNTIVAASASHEQAKD 295
D + +A + +A D
Sbjct: 352 DSARQLYQNAKYREALD 368


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_6601  FLGFLIH  29  0.035  Flagellar assembly protein FliH signature.
		>FLGFLIH#Flagellar assembly protein FliH signature.

Length = 228

Score = 29.4 bits (65), Expect = 0.035
Identities = 17/47 (36%), Positives = 22/47 (46%), Gaps = 4/47 (8%)

Query: 565 GWRLRGNQDNHFLDCRIYNMAIADHLGLSRMTADEWKILARDRAPAI 611
GWRLRG+ H C++ AD L A W+ L R AP +
Sbjct: 185 GWRLRGDPTLHPGGCKVS----ADEGDLDASVATRWQELCRLAAPGV 227


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_6624  BCTERIALGSPD  27  0.011  Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 27.2 bits (60), Expect = 0.011
Identities = 14/67 (20%), Positives = 26/67 (38%), Gaps = 13/67 (19%)

Query: 30 ENTMSLLAARKAERDAILADLLNALEVLRRQRGVAIAIEAGAEHLAELDAEEAAYFARAR 89
T +L+ + L ++ L++ R Q + +EA +AE+ +
Sbjct: 316 GQTNALIVTAAPDVMNDLERVIAQLDIRRPQ----VLVEA---IIAEVQDAD------GL 362

Query: 90 ALGAAWA 96
LG WA
Sbjct: 363 NLGIQWA 369


55  BBta_6701  BBta_6709  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
BBta_67013102.602094diguanylate cyclase
BBta_67024112.741314cobalamin synthesis protein cobW
BBta_6703382.760183D-glutamate deacylase
BBta_6704383.446989gamma-glutamyltranspeptidase
BBta_6705382.787781choline dehydrogenase
BBta_6706392.586501hypothetical protein
BBta_67072102.290599diguanylate cyclase
BBta_67082112.085018glucose 1-dehydrogenase
BBta_67093122.094885xylulose kinase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_6703  UREASE  39  5e-05  Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 38.6 bits (90), Expect = 5e-05
Identities = 23/72 (31%), Positives = 31/72 (43%), Gaps = 14/72 (19%)

Query: 5 DLIIRGGRVLDPETKLDATADVAVKDGAIAAVGD------------IAGVADQVIDAAGL 52
D +I +LD + AD+ +KDG IAA+G I G +VI G
Sbjct: 69 DTVITNALILDHWGIV--KADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAGEGK 126

Query: 53 AVVPGFIDLHAH 64
V G +D H H
Sbjct: 127 IVTAGGMDSHIH 138


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_6708  DHBDHDRGNASE  125  1e-36  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 125 bits (314), Expect = 1e-36
Identities = 81/267 (30%), Positives = 124/267 (46%), Gaps = 16/267 (5%)

Query: 9 VRAPRLAGHYALVTGAAQGIGRAIAVRFAEEGAHVAINFGGPSPSGDETLALVQAASAAH 68
+ A + G A +TGAAQGIG A+A A +GAH+A + ++ +V + A
Sbjct: 1 MNAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIA----AVDYNPEKLEKVVSSLKAE- 55

Query: 69 GHGARDHFTVKADIGVEADIAAMFETVLKRWPQLDCLVNNAGFQRESPSEALDVATYRAI 128
AR AD+ A I + + + +D LVN AG R +L + A
Sbjct: 56 ---ARHAEAFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEAT 112

Query: 129 LEVNLNGAVLCARHALAHFVARGGGNIINTSSVHQIIPKPGYLAYSISKSAMAGLTRTLA 188
VN G +R + + R G+I+ S +P+ AY+ SK+A T+ L
Sbjct: 113 FSVNSTGVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLG 172

Query: 189 LEFAGRGIRVNSVGPGAVDTPIN-AAWTGD----PVKRGVVESH---IPMGRVASPEEIA 240
LE A IR N V PG+ +T + + W + V +G +E+ IP+ ++A P +IA
Sbjct: 173 LELAEYNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIA 232

Query: 241 GVFAFLASSEASYITGQTIYACGGITL 267
FL S +A +IT + GG TL
Sbjct: 233 DAVLFLVSGQAGHITMHNLCVDGGATL 259
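
The input parameters at the top of this page list an RPSBlast E-value of 0.05. A minimal Python sketch of applying such a cutoff to hit records follows, using the two hits reported for this island. The tuple layout and the function name are hypothetical, and treating the cutoff as inclusive is an assumption; the source does not say how PredictBias compares against the threshold.

```python
# E-value listed under "Input parameters" on this page.
E_VALUE_CUTOFF = 0.05

# (locus tag, profile, bit score, E-value), copied from the two hits above.
hits = [
    ("BBta_6703", "UREASE", 38.6, 5e-05),
    ("BBta_6708", "DHBDHDRGNASE", 125.0, 1e-36),
]


def significant_hits(records, cutoff=E_VALUE_CUTOFF):
    """Keep records whose E-value is at or below the cutoff (assumed inclusive),
    sorted so the most significant hit (smallest E-value) comes first."""
    kept = [record for record in records if record[3] <= cutoff]
    return sorted(kept, key=lambda record: record[3])


ranked = significant_hits(hits)
```

Both hits pass the 0.05 cutoff; a stricter, purely illustrative cutoff such as `significant_hits(hits, cutoff=1e-10)` would keep only the DHBDHDRGNASE match.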


56  BBta_6734  BBta_6777  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
BBta_6734210-0.384431methyl-accepting chemotaxis receptor
BBta_6735211-0.880297methyl-accepting chemotaxis receptor
BBta_6736117-2.492063CheW protein
BBta_6737227-4.882119chemotaxis protein CheA
BBta_6738644-8.924450response regulator receiver, CheY1
BBta_6741845-8.398648transcriptional regulator
BBta_6743946-7.977012hypothetical protein
BBta_6744943-7.639193phage integrase
BBta_67451040-6.842878hypothetical protein
BBta_67461036-4.511556hypothetical protein
BBta_67471336-3.591374resolvase
BBta_67491238-3.852851hypothetical protein
BBta_67501236-4.060528hypothetical protein
BBta_67511336-3.810807hypothetical protein
BBta_67521236-3.885934hypothetical protein
BBta_6754928-2.333156hypothetical protein
BBta_6755519-0.723879hypothetical protein
BBta_6756417-0.296891hypothetical protein
BBta_67572140.019211lambda integrase-like/ DNA breaking-rejoining
BBta_6758-1121.188603*hypothetical protein
BBta_6759-2131.164571ribosomal large subunit pseudouridine synthase
BBta_6760-2110.650621RNA polymerase factor sigma-32
BBta_67610111.193085hypothetical protein
BBta_67620120.732372diguanylate cyclase
BBta_67632141.332243hypothetical protein
BBta_67643141.884445two-component response regulator
BBta_67662141.986056short-chain dehydrogenase
BBta_6767-2131.554891hypothetical protein
BBta_67681140.750294hypothetical protein
BBta_6769733-8.191806hypothetical protein
BBta_6770723-6.772656hypothetical protein
BBta_6771618-4.764283hypothetical protein
BBta_6772617-4.747289hypothetical protein
BBta_6773515-4.435608hypothetical protein
BBta_6774416-4.122319molecular chaperone
BBta_6775-2121.376043RNA polymerase sigma factor RpoD
BBta_67760122.325078DNA primase
BBta_67772132.001296hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_6737  PF06580  39  5e-05  Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 39.1 bits (91), Expect = 5e-05
Identities = 13/48 (27%), Positives = 17/48 (35%), Gaps = 8/48 (16%)

Query: 404 LIRNAIDHGIEDAGRRAAAGKPAQGRIELSAVHAGAQVLVTVSDNGGG 451
L+ N I HGI P G+I L V + V + G
Sbjct: 263 LVENGIKHGIA--------QLPQGGKILLKGTKDNGTVTLEVENTGSL 302


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_6738  HTHFIS  85  8e-23  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 85.3 bits (211), Expect = 8e-23
Identities = 30/116 (25%), Positives = 57/116 (49%), Gaps = 2/116 (1%)

Query: 2 AIVLTVDDSPSIRQMVKVTLEPAGHQVIEAGDGVQGLAKAQTSRPDIVITDLNMPVMNGL 61
A +L DD +IR ++ L AG+ V + D+V+TD+ MP N
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 ELIRALRKERTLTGMPILFLTTESNDAVKQEAKSAGATGWITKPFKPEQLLTVVGK 117
+L+ ++K R +P+L ++ ++ +A GA ++ KPF +L+ ++G+
Sbjct: 64 DLLPRIKKAR--PDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_6746  V8PROTEASE  35  3e-04  V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 34.6 bits (79), Expect = 3e-04
Identities = 26/187 (13%), Positives = 56/187 (29%), Gaps = 52/187 (27%)

Query: 50 IITNRHVFQGAIQS----DFHYTMADAQGRSTGRHERFSIYDFGKAWIDHPDPDVDLALL 105
++TN+HV + + G I + + DLA++
Sbjct: 114 LLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKY--------SGEGDLAIV 165

Query: 106 PTQPLFEQLEKVGRRPFYINLSQGIIASPEMLSTLDAI-EEITMIGYPNG-----LWDDV 159
K ++ + + + + + + IT+ GYP +W+
Sbjct: 166 ----------KFSPNEQNKHIGEVVKPATMSNNAETQVNQNITVTGYPGDKPVATMWESK 215

Query: 160 NNLPIVR-RGITATSAAREYRGKKQFLIDAACFPGSSGSPVF--------IYNSGSFSGL 210
+ ++ + D + G+SGSPVF I+ G +
Sbjct: 216 GKITYLKGEAMQ---------------YDLSTTGGNSGSPVFNEKNEVIGIHWGGVPNEF 260

Query: 211 HLGTRLT 217
+ +
Sbjct: 261 NGAVFIN 267


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_6764  HTHFIS  58  6e-13  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 57.9 bits (140), Expect = 6e-13
Identities = 23/129 (17%), Positives = 48/129 (37%), Gaps = 8/129 (6%)

Query: 8 RATVLIVEDDPTQREMIALLLEESDYDVIACESAEAAELVLNKPGHRIVLMMTDVNLAGR 67
AT+L+ +DD R ++ L + YDV +A + +V+ TDV +
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVV--TDVVMPD- 59

Query: 68 MSGVELAHIARARHPHINVVVTSGRPLSQPLPGGTK-----FWSKPWAPLDVLREAEVTL 122
+ +L + P + V+V S + ++ + KP+ +++ L
Sbjct: 60 ENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119

Query: 123 ERAPEMARR 131
+
Sbjct: 120 AEPKRRPSK 128


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_6766  DHBDHDRGNASE  82  2e-20  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 81.6 bits (201), Expect = 2e-20
Identities = 61/194 (31%), Positives = 95/194 (48%), Gaps = 2/194 (1%)

Query: 3 AISGAAAAVTGASSGIGRALAQELAARGCDLALADRDEAGLASVASELATTGRKVTTHRL 62
I G A +TGA+ GIG A+A+ LA++G +A D + L V S L R
Sbjct: 5 GIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPA 64

Query: 63 DVSDGQAIGQFAEDATRAHPSLNIVINNAGVALFGGFNEIDQAEMEWLFNINFWGVVHGT 122
DV D AI + R ++I++N AGV G + + E E F++N GV + +
Sbjct: 65 DVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNAS 124

Query: 123 RAFLPHLARQRAAHIVNLSSIFGIIAPPGQSAYAAAKFAVRGFSESLRHELATANSPIRL 182
R+ ++ +R+ IV + S + +AYA++K A F++ L ELA N IR
Sbjct: 125 RSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYN--IRC 182

Query: 183 SVVHPGGVATAIAR 196
++V PG T +
Sbjct: 183 NIVSPGSTETDMQW 196


57  BBta_6804  BBta_6813  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
BBta_6804-122-3.237484LysR family transcriptional regulator
BBta_6805-127-3.981836dihydrodipicolinate synthase
BBta_6806133-1.749876hypothetical protein
BBta_68072360.584668ATPase, ParA type
BBta_68082381.252339hypothetical protein
BBta_68102381.312428two-component regulatory protein
BBta_68112391.457957two component regulatory protein
BBta_68122381.462945arthrofactin synthetase/syringopeptin synthetase
BBta_68132361.863125arthrofactin synthetase/syringopeptin synthetase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_6810  HTHFIS  31  0.006  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 30.6 bits (69), Expect = 0.006
Identities = 20/86 (23%), Positives = 31/86 (36%), Gaps = 6/86 (6%)

Query: 61 DGIALIDHWRGASIGHPVLAAGTSCTKQQIVDAIDHGADDYISLPCGLMELATRLSAMLR 120
+ L+ + A PVL T + A + GA DY+ P L T L ++
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDL----TELIGIIG 116

Query: 121 RCRSARKLIRTIDHVSVDEDQMQVIV 146
R + K R + D +V
Sbjct: 117 RALAEPK--RRPSKLEDDSQDGMPLV 140


58  BBta_6997  BBta_7007  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
BBta_6997225-2.584051hypothetical protein
BBta_6998325-2.627412hypothetical protein
BBta_6999020-2.148682hypothetical protein
BBta_7000018-2.408571hypothetical protein
BBta_7001018-1.793876two-component response regulator
BBta_7002116-1.286410hypothetical protein
BBta_7003014-1.250888ECF subfamily RNA polymerase sigma-24 factor
BBta_7004211-1.334100signal transduction histidine kinase
BBta_7005310-0.802715urease accessory protein UreG
BBta_7006210-0.452167urease accessory protein UreF
BBta_7007313-0.384575urease accessory protein UreE
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_7000  PF03544  47  6e-09  Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 46.5 bits (110), Expect = 6e-09
Identities = 20/104 (19%), Positives = 41/104 (39%), Gaps = 2/104 (1%)

Query: 21 PPFAAAENSKELLKGTWAQEIVARLRAERRYPLQA--SGQGGHARVMFHLDGAGHLISVA 78
P + A + + A A R + +YP +A G +V F + G + +V
Sbjct: 137 PTSSTATAATSKPVTSVASGPRALSRNQPQYPARAQALRIEGQVKVKFDVTPDGRVDNVQ 196

Query: 79 LMESTGDALLDREALAMVERAQPFPSPPGVLVDDDLTFVLPVVF 122
++ + + +RE + R + P PG + ++ F +
Sbjct: 197 ILSAKPANMFEREVKNAMRRWRYEPGKPGSGIVVNILFKINGTT 240


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_7001  HTHFIS  44  4e-07  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 43.7 bits (103), Expect = 4e-07
Identities = 19/133 (14%), Positives = 49/133 (36%), Gaps = 5/133 (3%)

Query: 124 ATDVLIIEDETFIAMDLESLIKNLGHNVIGVARTHADAVALAKNKKPGLILADIQLADGS 183
+L+ +D+ I L + G++V + A L++ D+ + D
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNA-ATLWRWIAAGDGDLVVTDVVMPDE- 60

Query: 184 SGLDAVNELL-RTFEVPVVFITAY--PERFLTGERPEPAFLISKPFQPAMVSAVASQALF 240
+ D + + ++PV+ ++A + + KPF + + +AL
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 241 FQRNSRNRMPKPA 253
+ +++ +
Sbjct: 121 EPKRRPSKLEDDS 133


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_7002  BACSURFANTGN  25  0.038  Yersinia/Haemophilus virulence surface antigen sign...
		>BACSURFANTGN#Yersinia/Haemophilus virulence surface antigen signature.

Length = 322

Score = 25.1 bits (54), Expect = 0.038
Identities = 15/82 (18%), Positives = 24/82 (29%), Gaps = 15/82 (18%)

Query: 3 DVKSQASKNAAPRKPGGLNAEIQSRIGH--QLRAMYDD-------------VVRQGVPDR 47
+ + K L ++ + I H R+M D + R
Sbjct: 30 VIGAHRVKVETALSHSNLQKKLSATIKHNQSGRSMLDRKLTSDGKANQRSSFTFSMIMYR 89

Query: 48 FTDLIRKLDVPAAQSHVENGGG 69
+ VPA + V N GG
Sbjct: 90 MIHFVLSTRVPAVRESVANYGG 111


59  BBta_7156  BBta_7170  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
BBta_7156271.929535hypothetical protein
BBta_71571101.565967ABC transporter ATP-binding protein
BBta_71581111.462190hypothetical protein
BBta_71591100.553987alpha/beta family hydrolase
BBta_7160011-0.495991hypothetical protein
BBta_7161113-0.899839hypothetical protein
BBta_7164013-0.553378peptidase M29, aminopeptidase II
BBta_7165115-0.119676hypothetical protein
BBta_71662130.662563hypothetical protein
BBta_7167391.397021glyoxylase
BBta_71684102.077522hypothetical protein
BBta_71692131.420325short chain dehydrogenase
BBta_71702140.945169peptide ABC transporter ATP-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_7169  DHBDHDRGNASE  42  9e-07  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 42.0 bits (98), Expect = 9e-07
Identities = 48/202 (23%), Positives = 81/202 (40%), Gaps = 23/202 (11%)

Query: 2 SASIKDSVVVITGATGAIATALIAELLARGAAKIYAA----------ARDVSGLAATGKI 51
+ I+ + ITGA I A +A LA A I A + A +
Sbjct: 3 AKGIEGKIAFITGAAQGIGEA-VARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEA 61

Query: 52 VPLKLDVTSDADAAAAALQASDATL--LINNAGVNHNTAFMLAPDLAFAKEEIE----IN 105
P + ++ D A ++ + L+N AGV + + EE E +N
Sbjct: 62 FPADVRDSAAIDEITARIEREMGPIDILVNVAGVLR-----PGLIHSLSDEEWEATFSVN 116

Query: 106 YLAPLRLTRAFAPVLI-RNHGAVLNILTILARVNLPLMGSYCASKAAALSLTQGLRGELT 164
+R+ + ++ R G+++ + + A V M +Y +SKAAA+ T+ L EL
Sbjct: 117 STGVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELA 176

Query: 165 PKGVRIVGALPGAVDTRMTAGL 186
+R PG+ +T M L
Sbjct: 177 EYNIRCNIVSPGSTETDMQWSL 198


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_7170  HTHFIS  30  0.011  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 29.8 bits (67), Expect = 0.011
Identities = 10/37 (27%), Positives = 19/37 (51%)

Query: 26 PVMRDVIKSVDLAVPRGAVLGLVGESGSGKTTLGRLL 62
M+++ + + + L + GESG+GK + R L
Sbjct: 144 AAMQEIYRVLARLMQTDLTLMITGESGTGKELVARAL 180


60  BBta_7244  BBta_7319  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
BBta_7244218-2.587810flagellar basal-body rod modification protein
BBta_7245117-2.613087short-chain dehydrogenase
BBta_7246218-2.491801rhodanese-related sulfurtransferase
BBta_7247317-3.170520ABC transporter permease
BBta_7248518-3.420147ABC transporter ATP-binding protein
BBta_7249523-4.678028monooxygenase
BBta_7250631-5.771846nitrate/sulfonate/bicarbonate ABC transporter
BBta_7251839-6.6724203-mercaptopyruvate sulfurtransferase
BBta_72521146-9.277196transposase
BBta_72531048-8.678275hypothetical protein
BBta_7254637-5.883048flagellar basal-body rod protein flgF
BBta_7255532-5.418708hypothetical protein
BBta_7256532-5.347057hypothetical protein
BBta_7257531-5.374399hypothetical protein
BBta_7258527-5.028581MgtC family protein
BBta_7259527-5.131423lead, cadmium, zinc and mercury-transporting
BBta_7260527-6.148599opgC protein
BBta_7261529-5.853792hypothetical protein
BBta_7262529-5.753962hypothetical protein
BBta_7263529-5.739787RND divalent metal cation efflux transporter
BBta_7264435-7.004324nitrogen regulatory protein P-II
BBta_7265636-7.206228cobalt-zinc-cadmium resistance efflux pump CzcB
BBta_7266842-7.982716outer membrane cobalt-zinc-cadmium resistance
BBta_7267844-9.313629hypothetical protein
BBta_7268840-8.506025cation efflux system protein czcD
BBta_7269841-8.523358hypothetical protein
BBta_7271932-6.475987hypothetical protein
BBta_7274832-6.306694hypothetical protein
BBta_7275624-5.076205hypothetical protein
BBta_7276519-3.225545manganese transport protein
BBta_7277417-3.196109hypothetical protein
BBta_7278419-3.511941transposase
BBta_7280637-7.384361integral membrane protein
BBta_7281543-8.062362hypothetical protein
BBta_7286639-6.688312hypothetical protein
BBta_7287329-4.917680hypothetical protein
BBta_7288024-3.415975hypothetical protein
BBta_7289022-3.042837hypothetical protein
BBta_7290-121-2.136083hypothetical protein
BBta_7291-217-0.600064universal stress protein UspA
BBta_7292-315-0.454325poly-beta-hydroxyalkanoate synthase
BBta_7293-318-0.674919bifunctional enoyl-CoA hydratase/phosphate
BBta_7294-121-1.243806acetate kinase
BBta_7295123-1.986944hypothetical protein
BBta_7296022-1.822927cadmium-exporting ATPase
BBta_7297332-4.137894hypothetical protein
BBta_7300430-4.447583transposase
BBta_7301530-4.409140hypothetical protein
BBta_7302527-4.615669hypothetical protein
BBta_7303424-3.604767hypothetical protein
BBta_7304326-4.111722hypothetical protein
BBta_7305225-4.024523hypothetical protein
BBta_7306021-2.620972hypothetical protein
BBta_7309-119-1.934818hypothetical protein
BBta_7310-221-1.893412ParB-like nuclease
BBta_7311-222-2.409087hypothetical protein
BBta_7312-123-1.010620hypothetical protein
BBta_7313-123-0.979749phage integrase family protein
BBta_7314024-1.117309hypothetical protein
BBta_7315225-1.315872transposase
BBta_7316427-1.885075ParB-like nuclease
BBta_7317222-1.097602ATP-dependent exoDNAse
BBta_7318323-2.482772hypothetical protein
BBta_7319221-2.003377hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_7245  DHBDHDRGNASE  117  8e-34  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 117 bits (294), Expect = 8e-34
Identities = 79/267 (29%), Positives = 127/267 (47%), Gaps = 20/267 (7%)

Query: 8 AVKDKKALVTGGASGIGLAIAEGLAENGAVVAIVDRNKEALEREIARLISRDLRVSGVHG 67
++ K A +TG A GIG A+A LA GA +A VD N E LE+ ++ L +
Sbjct: 5 GIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPA 64

Query: 68 DVA-ADGFDATIEAAIAALGGVDIVFANAGISGGYGPGVGGNDAGLLQNIDLKSWNHTIG 126
DV + D +G +DI+ AG+ GL+ ++ + W T
Sbjct: 65 DVRDSAAIDEITARIEREMGPIDILVNVAGVL----------RPGLIHSLSDEEWEATFS 114

Query: 127 VNLTGIVSTLKATIPTLKEQRSGKIVVTASIAGLRANPSIGYSYTASKAALVLLIKELAL 186
VN TG+ + ++ + ++RSG IV S S+ +Y +SKAA V+ K L L
Sbjct: 115 VNSTGVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMA-AYASSKAAAVMFTKCLGL 173

Query: 187 ELAPFGVQVNGLAPGPFKTNINGGRFFDPDNAAREAA--------TVPLGRLAQPHEIKG 238
ELA + ++ N ++PG +T++ + D + A + +PL +LA+P +I
Sbjct: 174 ELAEYNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIAD 233

Query: 239 LALLLSSDASSYITGAVIPIDGGKTAG 265
L L S + +IT + +DGG T G
Sbjct: 234 AVLFLVSGQAGHITMHNLCVDGGATLG 260


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_7254  FLGHOOKAP1  29  0.016  Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 29.2 bits (65), Expect = 0.016
Identities = 10/28 (35%), Positives = 17/28 (60%)

Query: 8 ISLSRLMALQRRLDVAANNVANSQTTGY 35
++S L A Q L+ A+NN+++ GY
Sbjct: 6 NAMSGLNAAQAALNTASNNISSYNVAGY 33


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_7263  ACRIFLAVINRP  760  0.0  Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 760 bits (1963), Expect = 0.0
Identities = 227/1055 (21%), Positives = 425/1055 (40%), Gaps = 67/1055 (6%)

Query: 6 VMAAFGAWNFTRLPIDAVPDITNVQIQINSRAPGYSPLEVEQRITFPIETAMGGLPHLDS 65
++ GA +LP+ P I + +++ PG V+ +T IE M G+ +L
Sbjct: 18 ILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQVIEQNMNGIDNLMY 77

Query: 66 TRSQS-RYGLSQVTIIFKDGTDIYFARQLVNERIQQVKDQLPPSIETAMGPISTGLGEIY 124
S S G +T+ F+ GTD A+ V ++Q LP ++ +
Sbjct: 78 MSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQQQGISVEKSSSSYL 137

Query: 125 LFTVEAKPEARKTGGDEYTPSDLRTIQDWIIKPQLRNVPGVIEVNTIGGFERQFHVLPDP 184
+ + T D+ +K L + GV +V G + + D
Sbjct: 138 MVA------GFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA-QYAMRIWLDA 190

Query: 185 AQLMAYKLSFREVMTALAANNANVGAGYI------ERNGEQYLVRTPGQVANVEEIKQIV 238
L YKL+ +V+ L N + AG + + + N EE ++
Sbjct: 191 DLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKNPEEFGKVT 250

Query: 239 I-GSRHGVPVRVMDVADVKEGKDLRTGAATVDGKEVVLGTAMLLIGDNSRTVAQRVAAKL 297
+ + G VR+ DVA V+ G + A ++GK L G N+ A+ + AKL
Sbjct: 251 LRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALDTAKAIKAKL 310

Query: 298 KEIGRSLPEGVIARAVYDRTKLVEATVATVEKNLVEGALLVIVILFLILGNFKAAIATAF 357
E+ P+G+ YD T V+ ++ V K L E +LV ++++L L N +A +
Sbjct: 311 AELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNMRATLIPTI 370

Query: 358 VIPLSMLFTITGMVENKVSANLMSLG--AIDFGIIIDGAVIIVENCLRLLAEEQTRQGRV 415
+P+ +L T + S N +++ + G+++D A+++VEN R++ E++
Sbjct: 371 AVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMMEDKLP---- 426

Query: 416 LDRQERFETILTASKEVIGPSLFGTLIIGVVYLPILTLTGVEGKMFTPMALTVLMALTGA 475
E + ++ G + +++ V++P+ G G ++ ++T++ A+ +
Sbjct: 427 -----PKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALS 481

Query: 476 SILSLTFVPAAVALLVTGKVSEHE-------NWFMRGARYV---YTPLLAASIRNRWGVA 525
+++L PA A L+ +EH WF + YT + + +
Sbjct: 482 VLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYL 541

Query: 526 GIAVLLMAVCGIAASRMGGEFIPSLDEGDVALQAIRIPGTSLTQSLEMQAMLERRLLKIP 585
I L++A + R+ F+P D+G G + ++ ++ + LK
Sbjct: 542 LIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNE 601

Query: 586 EVREVFARTGTAEVATDLMPPSTSDGYVMLKPRKEWPDPEKPKASVVSEIQEAAEEIP-G 644
+ V + + + +V LKP +E E +V+ + +I G
Sbjct: 602 KA-NVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDG 660

Query: 645 NVYEISQPIQQRFNELISGVRSDVG-VKIFGDDLDVLVQSAAKVQAVLQGVQGA-ADVKT 702
V + P EL + D + G D L Q+ ++ + + V+
Sbjct: 661 FVIPFNMP---AIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRP 717

Query: 703 EQASGLPVLTVKLNRQALSRYGISVGDVQNLVEIAVGGKNSGMVFEGDRRFNIVVRLPEH 762
++++++ G+S+ D+ + A+GG + R + V+
Sbjct: 718 NGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAK 777

Query: 763 LRSDISALKALPVPLPPLESQAKPVTALWSNSPLSQIRYVPLSELAQIDVAPGPNQISRE 822
R + L V E VP S G ++ R
Sbjct: 778 FRMLPEDVDKLYVRSANGEM-------------------VPFSAFTTSHWVYGSPRLERY 818

Query: 823 NGKRRIVVTANVRGRDLGSFVADAQEQVE-AKVKLPAGYWIGWGGQFEQLVSATQRLTIV 881
NG + + G+ DA +E KLPAG W G Q + + +
Sbjct: 819 NGLPSMEIQGEAAP---GTSSGDAMALMENLASKLPAGIGYDWTGMSYQERLSGNQAPAL 875

Query: 882 VPVALLLIFLLLFMSLGSMPDALLVFSGVPLALTGGIIALLLRGIPLSISAGIGFIALSG 941
V ++ +++FL L S + V VPL + G ++A L + +G + G
Sbjct: 876 VAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIG 935

Query: 942 VAVLNGLVIITFI-ERLRSEGKNVIDAVREGSLTRLRPVLMTALVASLGFVPMAIATGAG 1000
++ N ++I+ F + + EGK V++A RLRP+LMT+L LG +P+AI+ GAG
Sbjct: 936 LSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAG 995

Query: 1001 AEVQRPLATVVIGGIISSTILTLLVLPALYVLFRR 1035
+ Q + V+GG++S+T+L + +P +V+ RR
Sbjct: 996 SGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVIRR 1030



Score = 78.0 bits (192), Expect = 1e-16
Identities = 102/536 (19%), Positives = 206/536 (38%), Gaps = 57/536 (10%)

Query: 517 SIRNRWGVAGIAVLLMAVCGIAASRMGGEFIPSLDEGDVALQAIRIPGTSLTQSLEMQ-- 574
IR +A++LM +A ++ P++ V++ A PG Q+++
Sbjct: 5 FIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSA-NYPGAD-AQTVQDTVT 62

Query: 575 AMLERRLLKIPEVREVFARTGTAEVATDLMPPSTSDGYVMLKPR-KEWPDPEKPKASVVS 633
++E+ + I + + + S S G V + + DP+ + V +
Sbjct: 63 QVIEQNMNGIDNLMYMSST-------------SDSAGSVTITLTFQSGTDPDIAQVQVQN 109

Query: 634 EIQEAAEEIPGNVYEISQPIQQRFNE----LISGVRSDVGVKIFGDDLDVLVQSAAKVQA 689
++Q A +P V Q I + +++G SD DD+ V S K
Sbjct: 110 KLQLATPLLPQEVQ--QQGISVEKSSSSYLMVAGFVSDNP-GTTQDDISDYVASNVKDT- 165

Query: 690 VLQGVQGAADVKTEQASGLPVLTVKLNRQALSRYGISVGDVQNLVEIA----VGGKNSGM 745
L + G DV + + + L+ L++Y ++ DV N +++ G+ G
Sbjct: 166 -LSRLNGVGDV--QLFGAQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGT 222

Query: 746 VFEGDRRFNIVVRLPEHLRSDISALKALPVPLPPLESQAKPVTALWSNSPLSQIRYVPLS 805
++ N + ++ P E + S V L
Sbjct: 223 PALPGQQLNASIIAQTRFKN-------------PEEFGKVTLRVNSDGSV------VRLK 263

Query: 806 ELAQI-DVAPGPNQISRENGKRRIVVTANVRGRDLGSFVADAQEQVEAKVK--LPAGYWI 862
++A++ N I+R NGK + + A A + A+++ P G +
Sbjct: 264 DVARVELGGENYNVIARINGKPAAGLGIKLATGANALDTAKAIKAKLAELQPFFPQGMKV 323

Query: 863 GWGGQFEQLVSATQRLTIVVPV-ALLLIFLLLFMSLGSMPDALLVFSGVPLALTGGIIAL 921
+ V + + A++L+FL++++ L +M L+ VP+ L G L
Sbjct: 324 LYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAIL 383

Query: 922 LLRGIPLSISAGIGFIALSGVAVLNGLVIITFIERLRSEGK-NVIDAVREGSLTRLRPVL 980
G ++ G + G+ V + +V++ +ER+ E K +A + ++
Sbjct: 384 AAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALV 443

Query: 981 MTALVASLGFVPMAIATGAGAEVQRPLATVVIGGIISSTILTLLVLPALYVLFRRE 1036
A+V S F+PMA G+ + R + ++ + S ++ L++ PAL +
Sbjct: 444 GIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKP 499


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_7268  SECYTRNLCASE  28  0.049  Preprotein translocase SecY subunit signature.
		>SECYTRNLCASE#Preprotein translocase SecY subunit signature.

Length = 437

Score = 28.2 bits (63), Expect = 0.049
Identities = 15/78 (19%), Positives = 28/78 (35%), Gaps = 2/78 (2%)

Query: 122 ILGTPMLAVAAVGLIVNLVSMKLLSAGSSESLNVQGAYFEVLSDMLGSLGVIAAALIIMF 181
L + + GL+ S L S V M+ + + A ++M+
Sbjct: 119 YLTVALAILQGTGLVATARSAPLFGRCSVGGQIVPDQSIFTTITMV--ICMTAGTCVVMW 176

Query: 182 TGWTLADPIIGAGIGLFI 199
G + D IG G+ + +
Sbjct: 177 LGELITDRGIGNGMSILM 194


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_7271  cloacin  32  3e-04  Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 32.0 bits (72), Expect = 3e-04
Identities = 15/33 (45%), Positives = 17/33 (51%), Gaps = 2/33 (6%)

Query: 27 GGDHHGGGHFGGGYFGGHFGGGHHGGFGGHHGG 59
GG G H+GGG GH GG +G GG G
Sbjct: 47 GGGSGSGIHWGGG--SGHGNGGGNGNSGGGSGT 77



Score = 27.8 bits (61), Expect = 0.008
Identities = 11/31 (35%), Positives = 11/31 (35%)

Query: 25 HHGGDHHGGGHFGGGYFGGHFGGGHHGGFGG 55
G H GGG G G GG G G
Sbjct: 50 SGSGIHWGGGSGHGNGGGNGNSGGGSGTGGN 80



Score = 27.0 bits (59), Expect = 0.015
Identities = 16/51 (31%), Positives = 20/51 (39%), Gaps = 9/51 (17%)

Query: 25 HHGGDHHGGGHFGGGYFGGHFGGGHHGG---------FGGHHGGHYGFYGG 66
H+ G H G+ GG G GGG G +GG G + GG
Sbjct: 9 HNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGG 59



Score = 26.2 bits (57), Expect = 0.028
Identities = 13/30 (43%), Positives = 15/30 (50%)

Query: 37 GGGYFGGHFGGGHHGGFGGHHGGHYGFYGG 66
GG G H+GGG G GG +G G G
Sbjct: 48 GGSGSGIHWGGGSGHGNGGGNGNSGGGSGT 77


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_7275  SYCDCHAPRONE  50  2e-09  Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 49.5 bits (118), Expect = 2e-09
Identities = 19/103 (18%), Positives = 33/103 (32%)

Query: 59 LIPEFPNLYLYRGVIWGDKGEYQRALQDFLTVSRLTPTDPLAFNNLGNVYDRLGDLDQAI 118
+ + G+Y+ A + F + L D F LG +G D AI
Sbjct: 31 ISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAI 90

Query: 119 VNFDRAIGLRADYAQAYYNRAHTYALKQERERAIADYDQAISL 161
++ + + ++ A K E A + A L
Sbjct: 91 HSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQEL 133



Score = 46.5 bits (110), Expect = 3e-08
Identities = 29/139 (20%), Positives = 48/139 (34%), Gaps = 10/139 (7%)

Query: 17 NALKQGILEDKQRGFLLYSRGASYESLGLRDRALADFDAAIVLIPEFPNLYLYRGVIWGD 76
N + LE LYS + G + A F A VL +L G
Sbjct: 29 NEISSDTLEQ------LYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQA 82

Query: 77 KGEYQRALQDFLTVSRLTPTDPLAFNNLGNVYDRLGDLDQAIVNFDRAIGLRADYA--QA 134
G+Y A+ + + + +P + + G+L +A A L AD +
Sbjct: 83 MGQYDLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQELIADKTEFKE 142

Query: 135 YYNRAHTY--ALKQERERA 151
R + A+K ++E
Sbjct: 143 LSTRVSSMLEAIKLKKEME 161



Score = 43.0 bits (101), Expect = 4e-07
Identities = 23/94 (24%), Positives = 35/94 (37%)

Query: 195 INPKDVTALTNRATINLTMERYENALTDFDSALLLHPGNAAIFLGRGRVHLIAGALDSAI 254
I+ + L + A +YE+A F + +L ++ FLG G G D AI
Sbjct: 31 ISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAI 90

Query: 255 ADFKTAARLRPNNPYPIIWAHIARVHKGEDDREE 288
+ A + P A + KGE E
Sbjct: 91 HSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAE 124


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_7294  ACETATEKNASE  395    e-138    Acetate kinase family signature.
		>ACETATEKNASE#Acetate kinase family signature.

Length = 400

Score = 395 bits (1017), Expect = e-138
Identities = 170/406 (41%), Positives = 246/406 (60%), Gaps = 23/406 (5%)

Query: 1 MKAIFSLNSGSSSIKFALFTLDGANNPMLSAGGKIERIGIAPSLRVRSADGD-IVLERDW 59
MK I +N GSSS+K+ L + + +L A G ERIGI SL +A+G+ I +++D
Sbjct: 1 MK-ILVINCGSSSLKYQLI--ESKDGNVL-AKGLAERIGINDSLLTHNANGEKIKIKKDM 56

Query: 60 PDGASLSHAELLKDIF-AWAAERLGD----RQVIAIGHRVVHGGTEFAAPRLVDDAVLDA 114
D H + +K + A G ++ A+GHRVVHGG F + L+ D VL A
Sbjct: 57 KD-----HKDAIKLVLDALVNSDYGVIKDMSEIDAVGHRVVHGGEYFTSSVLITDDVLKA 111

Query: 115 LEKLNPLAPLHQPHNLAAIRAIAQLRPGLPQVACFDTAFHHDKPPVAARLPLPR-AFHEQ 173
+ LAPLH P N+ I+A Q+ P +P VA FDTAFH P A P+P + +
Sbjct: 112 ITDCIELAPLHNPANIEGIKACTQIMPDVPMVAVFDTAFHQTMPDYAYLYPIPYEYYTKY 171

Query: 174 GIRRYGFHGLSYEYIARRLREI-DPVLAAGRMIAAHLGNGASLCAMRDGKSIDTTMGFTA 232
IR+YGFHG S++Y+++R EI + + + ++I HLGNG+S+ A+++GKSIDT+MGFT
Sbjct: 172 KIRKYGFHGTSHKYVSQRAAEILNKPIESLKIITCHLGNGSSIAAVKNGKSIDTSMGFTP 231

Query: 233 LDGLMMGTRCGAIDPGVVLHLQTQLGMSAADVEDLLYRKSGLLGVSGISSDMRTLSD--- 289
L+GL MGTR G+IDP ++ +L + +SA +V ++L +KSG+ G+SGISSD R L D
Sbjct: 232 LEGLAMGTRSGSIDPSIISYLMEKENISAEEVVNILNKKSGVYGISGISSDFRDLEDAAF 291

Query: 290 -NSGPEAEEAIELFCWRVAREAGGLIASLGGLDAFVFTAGIGENHADVRRRICERLAWSG 348
N A+ A+ +F +RV + G A++GG+D VFTAGIGEN ++R I + L + G
Sbjct: 292 KNGDKRAQLALNVFAYRVKKTIGSYAAAMGGVDVIVFTAGIGENGPEIREFILDGLEFLG 351

Query: 349 LWIDESANL--GSALRICEANSPVAVLVIPTNEERMIALHTLNVVR 392
+D+ N G I A+S V V+V+PTNEE MIA T +V
Sbjct: 352 FKLDKEKNKVRGEEAIISTADSKVNVMVVPTNEEYMIAKDTEKIVE 397


61  BBta_7384  BBta_7396  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias     %GCBias  Product
BBta_7384       2       17   -1.177444  hypothetical protein
BBta_7385       2       16   -0.890087  hypothetical protein
BBta_7386       0       17   -1.234557  SARP family transcriptional regulator
BBta_7387       1       22   -2.907140  DNA repair protein RadC
BBta_7388       1       25   -3.365264  integrase
BBta_7389       1       23   -2.381529  hypothetical protein
BBta_7390       0       22   -1.949263  DNA binding protein
BBta_7391       1       23   -2.421821  hypothetical protein
BBta_7392       0       25   -3.112799  hypothetical protein
BBta_7393       0       21   -3.455434  hypothetical protein
BBta_7394       0       19   -2.650458  ardC antirestriction protein
BBta_7395      -1       19   -3.546281  hypothetical protein
BBta_7396       0       19   -3.255561  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_7393  FLGMOTORFLIM  29     0.007    Flagellar motor switch protein FliM signature.
		>FLGMOTORFLIM#Flagellar motor switch protein FliM signature.

Length = 344

Score = 28.7 bits (64), Expect = 0.007
Identities = 11/45 (24%), Positives = 20/45 (44%), Gaps = 5/45 (11%)

Query: 8 LRALMRFYEAEARYSASGLPADRAALLETLHPDIVLHQPDSLPYG 52
+R L +E AR + + L +A L ++ + + D L Y
Sbjct: 52 MRTLSLMHETFARLTTTSL----SAQLRSM-VHVHVASVDQLTYE 91


62  BBta_7409  BBta_7438  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias     %GCBias  Product
BBta_7409       2       18   -0.040099  hypothetical protein
BBta_7410       1       17    0.088000  hypothetical protein
BBta_7411       1       17   -0.407009  hypothetical protein
BBta_7412       1       16   -0.243519  hypothetical protein
BBta_7413       1       16   -0.445242  methylase/helicase
BBta_7414       1       18   -1.496566  hypothetical protein
BBta_7415       2       21   -2.450112  hypothetical protein
BBta_7416       2       26   -2.866140  hypothetical protein
BBta_7418       1       26   -2.460540  hypothetical protein
BBta_7420       2       29   -3.236242  hypothetical protein
BBta_7421       2       32   -3.211593  lead, cadmium, zinc and mercury-transporting
BBta_7423       1       41   -4.712622  hypothetical protein
BBta_7424       2       43   -3.623391  hypothetical protein
BBta_7425       4       41   -3.279874  response regulator receiver
BBta_7426       5       37   -2.807983  response regulator
BBta_7427       4       34   -0.989174  hypothetical protein
BBta_7428       3       33   -0.572565  hypothetical protein
BBta_7429       5       33   -0.727981  hypothetical protein
BBta_7430       3       31   -0.393062  hypothetical protein
BBta_7431       2       29   -0.530345  replication protein A
BBta_7432       2       25    0.739641  partition protein
BBta_7433       3       22   -0.788834  hypothetical protein
BBta_7434       3       23   -0.969734  hypothetical protein
BBta_7435       2       20   -0.001044  conjugal transfer protein
BBta_7436       3       20   -0.066915  soluble lytic murein transglycosylase
BBta_7437       2       19    0.073981  hypothetical protein
BBta_7438       2       16   -0.344152  conjugal transfer coupling protein TraG
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_7409  PHPHTRNFRASE  31     0.016    Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 30.9 bits (70), Expect = 0.016
Identities = 14/48 (29%), Positives = 23/48 (47%), Gaps = 6/48 (12%)

Query: 157 LRTEFDQLQAQYEDADELPDEVDQRLSEIETALEAFESRPHVFDPADI 204
RTEF Y D D+LP E +++ + ++ + +P V DI
Sbjct: 296 YRTEF-----LYMDRDQLPTE-EEQFEAYKEVVQRMDGKPVVIRTLDI 337


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BBta_7414  PF05272  31     0.008    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 31.2 bits (70), Expect = 0.008
Identities = 38/142 (26%), Positives = 53/142 (37%), Gaps = 19/142 (13%)

Query: 3 PRDASDLARRLARDAETVCRHYLSNGRREGRYWQVGDARNTPGRSMFVRLKESSKGPAGK 62
P + + LA L A+ + +L G G ++ G G S V + GK
Sbjct: 8 PINFTSLADALLTRAKDLLPEWLPGGVLVGHEYECGSLAGGKGDSCKVNVT------TGK 61

Query: 63 WTDAATGEHG-DLLDIIRESCGLVDFHDVADEARR---------FLNLPRSEQEPSPNSA 112
W D +TGE G DLLD+ E GL A AR + P P P
Sbjct: 62 WCDFSTGESGRDLLDLYAEIHGLKVSKAAAQVAREEGLESVAGIVMGAPAGAPAPKPPRP 121

Query: 113 RPPAQTGSPEAARRLFAMSQPM 134
PP + P + + QP+
Sbjct: 122 EPPPR---PVVEKECWETIQPV 140


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
BBta_7425  HTHFIS  66     1e-15    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 65.6 bits (160), Expect = 1e-15
Identities = 28/117 (23%), Positives = 54/117 (46%), Gaps = 11/117 (9%)

Query: 11 KPHVLVVEDEVLMRALIADELRTAGCAVIEANSADQALDYLVAGGKVDLMFSDIQMPGSL 70
+LV +D+ +R ++ L AG V ++A ++ A G DL+ +D+ MP
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWI-AAGDGDLVVTDVVMPD-E 60

Query: 71 NGLQLAERVRADFPTVPVILTSGNDHLNSETMPG-------RFIAKPYDITQTVALV 120
N L R++ P +PV++ S T ++ KP+D+T+ + ++
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSA--QNTFMTAIKASEKGAYDYLPKPFDLTELIGII 115


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
BBta_7426  HTHFIS  59     2e-13    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 58.7 bits (142), Expect = 2e-13
Identities = 33/127 (25%), Positives = 55/127 (43%), Gaps = 5/127 (3%)

Query: 1 MSDTRILIVDSNVLVRTPLAEYLRECGYQVLEATNSVEAKEVLNNTARPIDVVLIDVGAE 60
M+ IL+ D + +RT L + L GY V +N+ + A D+V+ DV
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWI--AAGDGDLVVTDVVMP 58

Query: 61 DQSGFALAHWIRDRAPGPRVILAGTITGSVEKAADLCQDGPA--VSKPYDHRLVLDRIQR 118
D++ F L I+ P V++ + + A + G + KP+D ++ I R
Sbjct: 59 DENAFDLLPRIKKARPDLPVLVM-SAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117

Query: 119 LLAAKNR 125
LA R
Sbjct: 118 ALAEPKR 124


63  BBta_7448  BBta_7466  Y  N  N  Genomic Island
LocusTag   DNBias  CDNBias     %GCBias  Product
BBta_7448       3       26   -2.922937  conjugal transfer protein TrbG
BBta_7449       3       27   -3.538987  conjugal transfer protein TrbI
BBta_7450       4       29   -3.710613  hypothetical protein
BBta_7451       3       29   -4.118737  transcriptional regulator
BBta_7452       3       28   -4.487375  Na+/phosphate symporter
BBta_7453       4       28   -5.124153  OmpA-like transmembrane domain-containing
BBta_7455       1       31   -5.594186  manganese transport transmembrane protein
BBta_7457       2       37   -6.421907  hypothetical protein
BBta_7458       3       38   -6.451261  sensor histidine kinase with ATP-binding region
BBta_7459       3       39   -6.765445  nodulation protein W, two-component
BBta_7460       3       39   -6.640062  hypothetical protein
BBta_7462       3       32   -4.840778  ABC transporter permease
BBta_7463       2       29   -4.179559  ABC transporter ATP-binding protein
BBta_7465       1       25   -3.597334  hypothetical protein
BBta_7466       0       21   -3.803939  hypothetical protein
64  BBta_7683  BBta_7700  Y  N  Y  Genomic Island
LocusTag   DNBias  CDNBias     %GCBias  Product
BBta_7683       3       10   -0.285317  branched-chain amino acid ABC transporter
BBta_7684      -1        8   -1.724892  branched-chain amino acid ABC transporter
BBta_7685      -1       11   -3.318671  branched-chain amino acid ABC transporter
BBta_7686      -1       13   -3.549234  branched-chain amino acid ABC transporter
BBta_7687       0       16   -3.989589  PAS/PAC sensor-containing diguanylate
BBta_7688       4       23   -5.088714  hypothetical protein
BBta_7689       4       24   -4.984297  *hypothetical protein
BBta_7690       4       24   -4.638173  type II restriction enzyme, methylase subunit
BBta_7691       4       19   -2.311488  integrase catalytic subunit
BBta_7692       2       16   -1.985982  insertion element protein
BBta_7693       2       17   -1.460491  ACP synthase
BBta_7694       2       15   -2.065973  hypothetical protein
BBta_7695       1       16   -1.720297  hypothetical protein
BBta_7696       1       18   -0.524180  hypothetical protein
BBta_7697       1       16    0.386458  hypothetical protein
BBta_7699       4       18    0.553968  transcriptional regulator
BBta_7700       3       18    0.591739  hypothetical protein
65  BBta_7709  BBta_7819  Y  N  Y  Genomic Island
LocusTag   DNBias  CDNBias     %GCBias  Product
BBta_7709       3       18   -0.154557  hypothetical protein
BBta_7710       3       17    0.089302  hypothetical protein
BBta_7711       3       16    1.450694  hypothetical protein
BBta_7712       2       18    2.752644  hypothetical protein
BBta_7713       1       17    3.075853  hypothetical protein
BBta_7714       1       18    3.238462  hypothetical protein
BBta_7715       0       18    3.180645  hypothetical protein
BBta_7716       0       19    3.426767  hypothetical protein
BBta_7717       0       19    3.586394  methylase/helicase
BBta_7718      -2       20    3.827564  hypothetical protein
BBta_7719       2       23    2.748364  hypothetical protein
BBta_7720       2       24    1.849837  plasmid stability protein
BBta_7721       1       26    1.751081  plasmid stability protein
BBta_7722       2       30    1.113130  hypothetical protein
BBta_7723       5       34    0.051507  hypothetical protein
BBta_7724       6       31    0.666232  hypothetical protein
BBta_7725       5       23    1.854226  hypothetical protein
BBta_7726       4       20    3.059389  hypothetical protein
BBta_7727       3       20    3.216448  hypothetical protein
BBta_7729       3       16    3.350363  hypothetical protein
BBta_7730       3       17    3.711315  hypothetical protein
BBta_7731       2       18    3.220232  plasmid replication protein A
BBta_7732       1       19    3.290838  plasmid partition protein
BBta_7733       2       20    3.789812  hypothetical protein
BBta_7734       2       17    2.881382  hypothetical protein
BBta_7735       3       17    2.946919  conjugal transfer protein
BBta_7736       2       18    3.205062  hypothetical protein
BBta_7737       2       17    3.006718  soluble lytic murein transglycosylase
BBta_7738       2       17    2.974000  hypothetical protein
BBta_7739       2       17    2.014683  conjugal transfer coupling protein TraG
BBta_7740       1       16    2.021609  hypothetical protein
BBta_7741       1       16    2.146052  conjugal transfer protein TrbB
BBta_7742       2       15    2.049066  conjugal transfer protein TrbC
BBta_7743       2       15    2.084564  conjugal transfer protein TrbD
BBta_7744       0       14    1.966668  conjugal transfer ATPase TrbE
BBta_7745      -1       15    2.118041  conjugal transfer protein TrbJ
BBta_7746      -1       16    2.184979  hypothetical protein
BBta_7747      -1       14    1.337731  conjugal transfer protein TrbL
BBta_7748       2       16   -0.238391  conjugal transfer protein TrbF
BBta_7749       1       19   -0.630462  conjugal transfer protein TrbG
BBta_7750       3       20   -1.585310  conjugal transfer protein TrbI
BBta_7751       5       29   -3.895464  hypothetical protein
BBta_7752       5       39   -6.304101  insertion element ISR1-like protein
BBta_7753       5       41   -6.456870  transposase
BBta_7754       5       42   -7.140541  transposase
BBta_7755       5       47   -8.850720  hypothetical protein
BBta_7759       5       50   -9.398062  hypothetical protein
BBta_7760       4       49   -9.660724  Acetyl-CoA synthetase
BBta_7762       3       43   -9.375499  hypothetical protein
BBta_7764       3       45   -9.130660  3-ketoacyl-ACP reductase
BBta_7765       3       49   -9.061193  osmotically inducible sensory protein
BBta_7766       3       53   -9.265366  nitrogen fixation regulation protein fixK
BBta_7767       4       54   -9.558621  nitrogen fixation regulation protein fixK
BBta_7769       4       51   -9.846761  adenylate kinase
BBta_7770       5       51  -10.221568  nitrogen fixation regulation protein fixK
BBta_7774       5       52  -10.602474  hypothetical protein
BBta_7776       4       52  -10.744105  hypothetical protein
BBta_7777       4       49  -11.077721  NADPH-dependent FMN reductase
BBta_7778       5       47  -10.221036  poly(3-hydroxyalkanoate) polymerase family
BBta_7779       5       49  -10.134864  UspA protein
BBta_7781       3       41   -8.675528  DNA ligase-like protein
BBta_7782       4       37   -8.055859  ATP-dependent DNA ligase
BBta_7783       4       36   -7.970715  osmotically-inducible protein Y
BBta_7784       4       40   -8.005518  osmotically inducible protein Y
BBta_7788       4       41   -8.307401  heat shock protein (HSP-70 cofactor), grpE
BBta_7789       4       35   -7.105033  chaperone clpB
BBta_7790       4       39   -7.637898  chaperone protein dnaK
BBta_7791       4       41   -7.802018  hypothetical protein
BBta_7792       4       40   -7.379055  glutathione-regulated potassium-efflux system
BBta_7793       4       34   -6.816933  multiple antibiotic resistance (MarC)-like
BBta_7794       2       32   -6.725829  DNA-binding ATP-dependent protease La
BBta_7795       4       38   -8.522553  sigma 54 modulation protein/ribosomal protein
BBta_7796       4       35   -8.682466  hypothetical protein
BBta_7797       4       30   -7.304010  DNA protection during starvation protein
BBta_7798       4       29   -7.288588  thioredoxin
BBta_7799       6       29   -6.471088  hypothetical protein
BBta_7800       7       30   -6.383670  hypothetical protein
BBta_7801       9       30   -5.800299  hypothetical protein
BBta_7802       8       29   -5.000114  cell division protein
BBta_7803       9       33   -5.202471  hypothetical protein
BBta_7804       8       30   -4.435045  heat shock protein- DnaJ-like protein
BBta_7805       6       37   -6.520980  hypothetical protein
BBta_7806       6       42   -7.221054  small heat-shock protein molecular chaperone
BBta_7807       5       45   -8.392404  Hsp20 family heat-shock protein
BBta_7808       5       47   -8.742672  Hsp20 family heat-shock protein
BBta_7809       4       46   -8.627596  serine protease do-like
BBta_7810       4       52  -10.697641  zinc protease
BBta_7811       4       48  -10.131341  Zn-dependent protease
BBta_7812       8       42   -9.286474  hypothetical protein
BBta_7814       6       36   -7.691256  hypothetical protein
BBta_7815       2       26   -5.248862  transposase
BBta_7817       3       20   -4.462868  transposase
BBta_7818       2       16   -3.775017  transposase
BBta_7819       0       14   -3.895393  transposase
66  BBta_0082  BBta_0087  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias     %GCBias  Product
BBta_0082       0       15    3.278329  2-octaprenylphenol hydroxylase
BBta_0083       0       14    3.192181  bifunctional
BBta_0084       1       14    3.562818  deoxyuridine 5'-triphosphate
BBta_0085       0       12    3.449551  sensor histidine kinase
BBta_0086       1       13    3.459791  hypothetical protein
BBta_0087       0       12    3.224246  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_0082  YERSSTKINASE  32     0.005    Yersinia serine/threonine protein kinase signature.
		>YERSSTKINASE#Yersinia serine/threonine protein kinase signature.

Length = 732

Score = 32.4 bits (73), Expect = 0.005
Identities = 15/44 (34%), Positives = 23/44 (52%), Gaps = 1/44 (2%)

Query: 278 HALRDGFFHADMHPGNLFLDK-EGRLVAVDFGIMGRLGMKERRF 320
H + G H D+ PGN+ D+ G V +D G+ R G + + F
Sbjct: 260 HLAKAGVVHNDIKPGNVVFDRASGEPVVIDLGLHSRSGEQPKGF 303


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
BBta_0083  TONBPROTEIN  30     0.016    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 30.0 bits (67), Expect = 0.016
Identities = 8/29 (27%), Positives = 10/29 (34%)

Query: 41 EPADPSEPPSAPPTPAPPPEPMVPQPGVA 69
+ P P P P P P P P+
Sbjct: 59 QAVQPPPEPVVEPEPEPEPIPEPPKEAPV 87


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_0085  DNABINDINGHU  28     0.035    Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 28.1 bits (63), Expect = 0.035
Identities = 21/85 (24%), Positives = 32/85 (37%), Gaps = 17/85 (20%)

Query: 668 IDAGAMKLELGPVDAAKAIEAAAEGVQDRLAT-DRIRLKVQVDPNVGTFVGDERRVVQVL 726
I A EL D+A A++A V LA ++++L G F ER
Sbjct: 8 IAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQL-----IGFGNFEVRER------ 56

Query: 727 YNLLANAVGFSPQ-DSTVLVSARRT 750
A G +PQ + + A +
Sbjct: 57 ----AARKGRNPQTGEEIKIKASKV 77


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
BBta_0087  TACYTOLYSIN  26     0.019    Bacterial thiol-activated pore-forming cytolysin sig...
		>TACYTOLYSIN#Bacterial thiol-activated pore-forming cytolysin signature.

Length = 574

Score = 26.5 bits (58), Expect = 0.019
Identities = 12/31 (38%), Positives = 16/31 (51%)

Query: 65 GLAYRRCELAWVNGDQIGVTFLKQGKKKANK 95
L Y E+ NG+ I K+G KKA+K
Sbjct: 118 SLNYNELEVLAKNGETIENFVPKEGVKKADK 148


67  BBta_0415  BBta_0422  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias     %GCBias  Product
BBta_0415       0       12    1.186286  hypothetical protein
BBta_0416       0       14    0.682711  rRNA large subunit methyltransferase
BBta_0417      -1       13    1.834107  hypothetical protein
BBta_0418      -2       14    2.100900  nicotinic acid mononucleotide
BBta_0419      -1       14    1.747661  gamma-glutamyl phosphate reductase
BBta_0420      -2       15    1.344629  hypothetical protein
BBta_0421      -3       15    1.800613  hypothetical protein
BBta_0422      -3       16    2.126109  gamma-glutamyl kinase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_0415  CHANLCOLICIN  30     0.016    Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 30.4 bits (68), Expect = 0.016
Identities = 27/191 (14%), Positives = 63/191 (32%), Gaps = 4/191 (2%)

Query: 46 PEQLKQRAQELEAARAAQKAAAEAQERLKAEIATLGEDRGKLNQQLIDVASQVRSVEVRV 105
EQ ++ + +A Q AEA+E+ A ++ + ++L S+V ++ +
Sbjct: 153 AEQRRKEIEREKAETERQLKLAEAEEKRLAALSEEAKAVEIAQKKLSAAQSEVVKMDGEI 212

Query: 106 GEAETRLRGLDGREQDLRNSLEARRTEIIEVLAALQRAGRRTPPALLVRPEDALQS---L 162
+RL +L +R E+ + A + L R D LQ+
Sbjct: 213 KTLNSRLSSSIHARDAEMKTLAGKRNELAQASAKYKELDELV-KKLSPRANDPLQNRPFF 271

Query: 163 RTAMLLGSVVPEMRGRAEKLSTDLGELVALRKTIATERDKLAADRDKLRTDQTRLAALVE 222
+ ++++ + + I + ++ + R+ E
Sbjct: 272 EATRRRVGAGKIREEKQKQVTASETRINRINADITQIQKAISQVSNNRNAGIARVHEAEE 331

Query: 223 ERQRKQATAEK 233
++ Q
Sbjct: 332 NLKKAQNNLLN 342


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
BBta_0417  TYPE4SSCAGA  27     0.044    Type IV secretion system CagA exotoxin signature.
		>TYPE4SSCAGA#Type IV secretion system CagA exotoxin signature.

Length = 1147

Score = 26.6 bits (58), Expect = 0.044
Identities = 12/33 (36%), Positives = 17/33 (51%)

Query: 10 KATTSKSATAKARKTPTHDAALQAQPDADKTLR 42
K A A A+ T +D +AQ D +K+LR
Sbjct: 587 KTLNFNKAVADAKNTGNYDEVKKAQKDLEKSLR 619


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_0418  LPSBIOSNTHSS  32     5e-04    Lipopolysaccharide core biosynthesis protein signat...
		>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein signature.

Length = 166

Score = 32.5 bits (74), Expect = 5e-04
Identities = 37/174 (21%), Positives = 64/174 (36%), Gaps = 38/174 (21%)

Query: 8 GSFNPPHQAH-----RAISLFALKRLQLDRVWWLVTPGNPLKDNGGLHALAERAAAARKV 62
GSF+P H R LF D+V+ V NP K + ++ ER K
Sbjct: 7 GSFDPITFGHLDIIERGCRLF-------DQVYVAVL-RNPNKQ--PMFSVQERLEQIAKA 56

Query: 63 AAD-PRIEISCLESVIGTRYTADTIDYLRRRASRLRFVWIMGADNLAQF----HRWQKWQ 117
A P ++ E + T++Y R+R + + G L+ F +
Sbjct: 57 IAHLPNAQVDSFEGL--------TVNYARQRQAG---AILRGLRVLSDFELELQMANTNK 105

Query: 118 HIAAQVPMAVVDRPPRSFRALNAPAARALARYR------LPEADAGRLADRAAP 165
+A+ + V + L++ + +AR+ +P A L D+ P
Sbjct: 106 TLASDLE-TVFLTTSTEYSFLSSSLVKEVARFGGNVEHFVPSHVAAALYDQFHP 158


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_0422  CARBMTKINASE  42     2e-06    Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 42.1 bits (99), Expect = 2e-06
Identities = 25/107 (23%), Positives = 44/107 (41%), Gaps = 8/107 (7%)

Query: 136 VPVINENDTVATAEIRYGDNDRLAARVATMASADLLVLLSDIDGLYTAPPAQDPNARLIP 195
VPVI E+ + E D D ++A +AD+ ++L+D++G + +
Sbjct: 197 VPVILEDGEIKGVEAVI-DKDLAGEKLAEEVNADIFMILTDVNGAALY--YGTEKEQWLR 253

Query: 196 VVESITADIEAMAGSAASEFSRGGMRTKIEAA-KIATTGGTHMLIAS 241
V+ ++ F G M K+ AA + GG +IA
Sbjct: 254 EVK--VEELRKYY--EEGHFKAGSMGPKVLAAIRFIEWGGERAIIAH 296


68  BBta_0516  BBta_0539  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias     %GCBias  Product
BBta_0516       2       13   -2.481752  multi-sensor hybrid histidine kinase
BBta_0517       4       16   -2.615255  response regulator receiver
BBta_0518       2       12   -0.821180  response regulator receiver
BBta_0519       3       12   -0.287308  CRP/FNR family transcriptional regulator
BBta_0521       3       13   -0.267598  hypothetical protein
BBta_0522       3       15   -0.794399  hypothetical protein
BBta_0523       1       11   -1.088198  flagellar biosynthesis repressor FlbT
BBta_0524       1       10   -1.021528  chemotaxis protein CheA
BBta_0525       1       16   -3.485866  CheW protein
BBta_0526       1       13   -1.798040  chemotaxis protein CheY
BBta_0527       1       12   -1.556839  chemotaxis protein CheR
BBta_0528       0       12   -0.815498  hypothetical protein
BBta_0529       0       10   -0.308493  response regulator receiver
BBta_0530       0        9    0.041785  two component LuxR family transcriptional
BBta_0531      -1        8    0.026070  PAS/PAC sensor hybrid histidine kinase
BBta_0532      -1       10   -0.098472  response regulator receiver
BBta_0533      -1        8   -0.235663  methyl-accepting chemotaxis protein
BBta_0534      -1        8   -0.134841  methyl-accepting chemotaxis protein
BBta_0535       0       11   -0.070633  GntR family transcriptional regulator
BBta_0536       0       13    0.328800  ABC transporter substrate-binding protein
BBta_0537      -1       12    0.922499  ABC transporter ATP-binding protein
BBta_0538      -3       10    1.031459  ABC transporter permease
BBta_0539      -3       11    1.571466  dihydroxy-acid dehydratase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
BBta_0516  HTHFIS  79     2e-17    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 79.1 bits (195), Expect = 2e-17
Identities = 30/138 (21%), Positives = 53/138 (38%), Gaps = 3/138 (2%)

Query: 586 RVLVVDDNPTNRLVATKMLKDFDIQTDTACDGAEAVTAASRFNYDLILMDVRMPEMDGFQ 645
+LV DD+ R V + L + A + + DL++ DV MP+ + F
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 646 ATRTIRARGERRSNVPIIAFTANAFMEDIRACREAGMNDFVVKPARKKALVEAILRVLPA 705
I+ + R ++P++ +A E G D++ KP L+ I R L
Sbjct: 65 LLPRIK---KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 706 RTLAIETIASDAPPLAPV 723
+ D+ P+
Sbjct: 122 PKRRPSKLEDDSQDGMPL 139


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
BBta_0517  HTHFIS  63     8e-15    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 63.3 bits (154), Expect = 8e-15
Identities = 24/138 (17%), Positives = 55/138 (39%), Gaps = 5/138 (3%)

Query: 2 RNELLVIEDADVHLSILRKIATQAGFNTTGVSSVDAASTILRTRHFDCVTLDLSLGERSG 61
+LV +D ++L + ++AG++ S+ + D V D+ + + +
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 62 TEVLQRLAELKYRGPVLIISASENDRLDASVRIGNFLELNVCPPFSKPINLPLLRQTLKQ 121
++L R+ + + PVL++SA + ++ E KP +L L + +
Sbjct: 63 FDLLPRIKKARPDLPVLVMSA--QNTFMTAI---KASEKGAYDYLPKPFDLTELIGIIGR 117

Query: 122 IASETDRQKLVRRQAGRG 139
+E R+ +
Sbjct: 118 ALAEPKRRPSKLEDDSQD 135


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
BBta_0518  HTHFIS  83     1e-21    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 82.6 bits (204), Expect = 1e-21
Identities = 26/105 (24%), Positives = 51/105 (48%), Gaps = 3/105 (2%)

Query: 9 ILVVDDYATMIRIIRNLLKQLGFENVDDATDGSAALAKMQAKKYGLVISDWNMEPMTGYD 68
ILV DD A + ++ L + G++ V ++ + + A LV++D M +D
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYD-VRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 69 LLREVRASPELSKTPFIMITAESKTENVIAAKKAGVSNYIVKPFN 113
LL ++ P ++++A++ I A + G +Y+ KPF+
Sbjct: 65 LLPRIKK--ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFD 107


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
BBta_0522  FLAGELLIN  57     7e-11    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 57.4 bits (138), Expect = 7e-11
Identities = 80/498 (16%), Positives = 155/498 (31%), Gaps = 7/498 (1%)

Query: 25 LQSTAQLLATTQNNLSTGKKVNSALDNPTNFFTAQGLDNRASDISNLLDGIGNGVQVLQA 84
L + L++ LS+G ++NSA D+ A + ++ +G+ + Q
Sbjct: 17 LNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQASRNANDGISIAQT 76

Query: 85 ANTGITSLQKLVDSAKSIANQVLQSAVGYSTKSTVTSAALTGATATSLIGASTTAVTGSA 144
+ + + + ++ +Q+ G ++ S + S I + +
Sbjct: 77 TEGALNEINNNLQRVRELS---VQATNGTNSDSDLKSIQDEIQQRLEEIDRVSNQTQFNG 133

Query: 145 VLNDNTSTAVAITGSTKLSGTPGTSSNDLASSITTGDTLVVNGTTFTFIAGTSSSGTNIG 204
V + + I T + D VNG + SS N+
Sbjct: 134 VKVLSQDNQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKEATVGDLKSSFKNVT 193

Query: 205 VGDTVTNLLSTIQSATGVTSSITAGAITLTPPAAGLTLSGTSLAKLGLSAVGNSLSGQTL 264
DT + + + +T P + + L
Sbjct: 194 GYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDDAENNTAV-DLFKT 252

Query: 265 TIAATGGGTATSVTFGLGTGQVNSLNDLNAKLAANNLQATVASATGKISITTTNDAASST 324
T + G A ++ + G+ D + + ++TT + T
Sbjct: 253 TKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGK---VSTTINGEKVT 309

Query: 325 IGAIGGTAAASSQSFNGLTAAAPVADATAQSQRASLVAQYNNVLAQINTTAADASFNGIN 384
+ TA A++ L ++ V + Q N + A +A
Sbjct: 310 LTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEANNAVKGESK 369

Query: 385 LLNGDTLKLTFNETGKSTLSITGVTFNTGGLGLSTLTSGTDFLDNNSANKVIKVLNTASS 444
+ K TL+ + + G+STL + S + +++A S
Sbjct: 370 ITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAAKKSTANPLASIDSALS 429

Query: 445 TLRDEASTLGSNLSVVQVRQDFNKNLINVLQTGSSNLTLADTNEEAANSQALSTRQSIAV 504
+ S+LG+ + N + L + S + AD E +N Q
Sbjct: 430 KVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSNMSKAQILQQAGT 489

Query: 505 SALSLANQSQASVLQLLR 522
S L+ ANQ +VL LLR
Sbjct: 490 SVLAQANQVPQNVLSLLR 507


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
BBta_0524  HTHFIS  88     4e-20    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 88.0 bits (218), Expect = 4e-20
Identities = 38/134 (28%), Positives = 62/134 (46%), Gaps = 4/134 (2%)

Query: 795 QTQSVLLVDDSPFFRNMLAPVLKSAGYKVRTAASAIEGLATLRSGHTFDIVVTDIEMPEM 854
++L+ DD R +L L AGY VR ++A + +G D+VVTD+ MP+
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGD-GDLVVTDVVMPDE 60

Query: 855 NGFEFAEAIRSDQNLNQLPVIAVSSLVSPAAIERGRQAGLYDYIAK-FDRPGLIAALKEQ 913
N F+ I+ + LPV+ +S+ + + + G YDY+ K FD LI +
Sbjct: 61 NAFDLLPRIKKAR--PDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118

Query: 914 IEERARAEANRRAA 927
+ E R +
Sbjct: 119 LAEPKRRPSKLEDD 132


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
BBta_0526  HTHFIS  81     4e-21    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 81.0 bits (200), Expect = 4e-21
Identities = 24/103 (23%), Positives = 44/103 (42%), Gaps = 2/103 (1%)

Query: 5 LVVDDSSVVRKIARRILEELGFSVVEAEDGEQALELCTKSLPEAILLDWNMPVMDGYDFL 64
LV DD + +R + + L G+ V + + ++ D MP + +D L
Sbjct: 7 LVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDLL 66

Query: 65 GRLRRLPGGEGPKVVFCTTENDIDHISRALNAGANEYIMKPFD 107
R+++ V+ + +N +A GA +Y+ KPFD
Sbjct: 67 PRIKKA--RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFD 107


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
BBta_0529  HTHFIS  67     1e-15    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 66.8 bits (163), Expect = 1e-15
Identities = 31/107 (28%), Positives = 52/107 (48%), Gaps = 4/107 (3%)

Query: 26 LLVVDDDASQRTLISLAAKQAGHAVTVVASCAEAIREIGHKRFDCVTLDLMLEDGDGTEV 85
+LV DDDA+ RT+++ A +AG+ V + ++ A R I D V D+++ D + ++
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 86 LKAIADSRFAGSVIVISGMDAARRIAARLYARSLGIDLQSLPKPVDL 132
L I +R V+V+S + + LPKP DL
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGA----YDYLPKPFDL 108


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
BBta_0530  HTHFIS  112    3e-31    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 112 bits (281), Expect = 3e-31
Identities = 39/155 (25%), Positives = 69/155 (44%)

Query: 7 SRGEIFVVDDDPAVRDTLSMVLSAGGYEVICFADGAALLAVARSRTPACILLDVHIPGKS 66
+ I V DDD A+R L+ LS GY+V ++ A L + ++ DV +P ++
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 67 GLDILRELHGEDYPAPIFMISGQGDIPMAVNAIKYGALDFIEKPFRGSEIVARLNEAIGA 126
D+L + P+ ++S Q A+ A + GA D++ KPF +E++ + A+
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 127 FARRQKENASPKIGSLHFPGREPLTRREREVLEQF 161
RR + + GR + VL +
Sbjct: 122 PKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARL 156


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
BBta_0531  HTHFIS  59     8e-11    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 58.7 bits (142), Expect = 8e-11
Identities = 41/122 (33%), Positives = 54/122 (44%), Gaps = 10/122 (8%)

Query: 1074 TILVVEDDRLVRGYVLAQLHALGYATLEAANAAEALAIANGGERFDLLFTDIIMPGTMNG 1133
TILV +DD +R + L GY +NAA G+ DL+ TD++MP N
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGD-GDLVVTDVVMPD-ENA 62

Query: 1134 RQLASEIQRLRPGQRVLFTSGYT--ENAI--VHHGRLDEGLLLLPKPYRKSELAKMIRTA 1189
L I++ RP VL S AI G D LPKP+ +EL +I A
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYD----YLPKPFDLTELIGIIGRA 118

Query: 1190 LT 1191
L
Sbjct: 119 LA 120


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
BBta_0532  HTHFIS  94     2e-25    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 93.7 bits (233), Expect = 2e-25
Identities = 35/125 (28%), Positives = 56/125 (44%), Gaps = 7/125 (5%)

Query: 2 AKILVVDDDPVMQMTIQRLLQQAGHAVTVADDGQKAIARFQGESFDLVVMDIFMPGMDGL 61
A ILV DDD ++ + + L +AG+ V + + DLVV D+ MP +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 EAMRLILKHAPSTPILMTSGRPNTPNSISEPDYLTMATKLGAVSALPKPFKPAALLAMVS 121
+ + I K P P+L+ S + +I A++ GA LPKPF L+ ++
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIK-------ASEKGAYDYLPKPFDLTELIGIIG 116

Query: 122 DCLER 126
L
Sbjct: 117 RALAE 121


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
BBta_0534  IGASERPTASE  33     0.003    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 33.5 bits (76), Expect = 0.003
Identities = 44/281 (15%), Positives = 80/281 (28%), Gaps = 30/281 (10%)

Query: 403 AQQQASSNYVREAEAREAMNRNMEEAVEAFRAKSTELLSEVDDNVGVMRSTAESLTGIAG 462
++Q++ + E +A E +N E A EA S V N T E
Sbjct: 1044 SKQESKTVEKNEQDATETTAQNREVAKEAK--------SNVKAN----TQTNE------- 1084

Query: 463 HATEQAASAAGASEKTAVNVQTVAAAAEQLASSIVEIGRQIELSNTTVRNANATTARSEA 522
+ + TV + VE + E+ T + +SE
Sbjct: 1085 -VAQSGSETKETQTTETKETATVEKEEKAK----VETEKTQEVPKVTS-QVSPKQEQSET 1138

Query: 523 EIESLAQAAQSISTVVDLIQAIAAQTNLLALNATIEAARAGEAGRGFAVVAQEVKSLAEQ 582
A ++ TV I+ +QTN A + A + V +
Sbjct: 1139 VQPQAEPARENDPTV--NIKEPQSQTNTTA-DTEQPAKETSSNVEQPVTESTTVNTGNSV 1195

Query: 583 TAKATQEIGQHVQGIQNSTSNAVASVKEVSTAMRQMADVTTAIASAVEQQGAATREITEN 642
Q NS S+ + + +V A S+ ++ A ++T
Sbjct: 1196 VENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTST 1255

Query: 643 VQMAASGSKMLASNISTVSGAIEETSQSAASVRD--ASNNV 681
A + ++ + + + NV
Sbjct: 1256 NTNAVLSDARAKAQFVALNVGKAVSQHISQLEMNNEGQYNV 1296


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BBta_0537  PF05272  28     0.040    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.1 bits (62), Expect = 0.040
Identities = 13/42 (30%), Positives = 16/42 (38%), Gaps = 2/42 (4%)

Query: 15 KTSSAVITATERVGFSVDRSDRFVLLGPSGCGKSTLLKAVGG 56
+ G D S VL G G GKSTL+ + G
Sbjct: 579 YILMGHVARVMEPGCKFDYS--VVLEGTGGIGKSTLINTLVG 618


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BBta_0539  PF00577  32     0.007    Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 32.1 bits (73), Expect = 0.007
Identities = 9/36 (25%), Positives = 19/36 (52%), Gaps = 1/36 (2%)

Query: 508 ITLDVAARTIDLDVPEDELARRRAEWQPPAPRFERG 543
LDV + ++L +P+ ++ R + PP ++ G
Sbjct: 147 AQLDVGQQRLNLTIPQAFMSNRARGYIPPE-LWDPG 181


69  BBta_0549  BBta_0557  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BBta_0549  -1      10       2.334824   hypothetical protein
BBta_0551  1       10       0.899600   hypothetical protein
BBta_0553  0       9        1.141661   Zn-dependent hydrolase
BBta_0554  0       9        1.094616   caspase-like domain-containing protein
BBta_0555  -1      7        0.901931   hypothetical protein
BBta_0556  -1      7        0.790244   HlyD family secretion protein
BBta_0557  0       7        0.682711   AcrB/AcrD/AcrF family multidrug efflux protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
BBta_0549  RTXTOXIND  33     0.002    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 32.9 bits (75), Expect = 0.002
Identities = 19/131 (14%), Positives = 44/131 (33%), Gaps = 9/131 (6%)

Query: 41 GAAVSVLKAAKACFDNTVEVSGMVLARDET-AVRPDRPGLKVAEVLVDAGDTVTAGQNLA 99
++ + + + +G + + ++P + V E++V G++V G L
Sbjct: 67 FLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSI-VKEIIVKEGESVRKGDVLL 125

Query: 100 RLTPPEGGTIMVQAPVAGTILTSTATIGAFASGRGEALFSIIARNEYDLVGLVPSAQINR 159
+LT + A S+ R + L I N+ + L
Sbjct: 126 KLTA-------LGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQN 178

Query: 160 LAVNQTARIRI 170
++ + R+
Sbjct: 179 VSEEEVLRLTS 189


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BBta_0554  PF03544  50     5e-09    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 50.4 bits (120), Expect = 5e-09
Identities = 24/97 (24%), Positives = 30/97 (30%), Gaps = 2/97 (2%)

Query: 258 PGAGGVRPTPPVESSTAPVAAPTPTPTPTQVVVAPPLPPPPPPPPPPVQKADAPPAPPVV 317
P A P P VE P P P P VV P P P P PV+K + P
Sbjct: 63 PQAVQPPPEPVVEPE--PEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKP 120

Query: 318 PPPPVPTPSPVIATTAPTTTAPATNVTAAPPVQDPFA 354
+P A PT++ +
Sbjct: 121 VESRPASPFENTAPARPTSSTATAATSKPVTSVASGP 157



Score = 43.0 bits (101), Expect = 1e-06
Identities = 20/90 (22%), Positives = 23/90 (25%)

Query: 265 PTPPVESSTAPVAAPTPTPTPTQVVVAPPLPPPPPPPPPPVQKADAPPAPPVVPPPPVPT 324
P V+ PV P P P P P P P K P
Sbjct: 62 PPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPV 121

Query: 325 PSPVIATTAPTTTAPATNVTAAPPVQDPFA 354
S + T A T+ TA P
Sbjct: 122 ESRPASPFENTAPARPTSSTATAATSKPVT 151



Score = 42.7 bits (100), Expect = 1e-06
Identities = 27/128 (21%), Positives = 32/128 (25%), Gaps = 11/128 (8%)

Query: 264 RPTPPVESSTAPVA--------APTPTPTPTQVVVAPPLPPPPPPPPPPVQKADAPPAPP 315
P P S VA A P P P P P P PP PV P P
Sbjct: 43 LPAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPK 102

Query: 316 VVPPPPVPTPSPVIATTAPTTTAPATNVTAAPPVQDPFAEDPTIKSLTAKIAANPEDAGA 375
P P P + + AP + + + A
Sbjct: 103 PKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAAT---SKPVTSVASGPRA 159

Query: 376 LYRRGQVY 383
L R Y
Sbjct: 160 LSRNQPQY 167



Score = 31.5 bits (71), Expect = 0.006
Identities = 17/79 (21%), Positives = 21/79 (26%)

Query: 286 TQVVVAPPLPPPPPPPPPPVQKADAPPAPPVVPPPPVPTPSPVIATTAPTTTAPATNVTA 345
T V LP P P + P V PPP P P V
Sbjct: 35 TSVHQVIELPAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVI 94

Query: 346 APPVQDPFAEDPTIKSLTA 364
P P + +K +
Sbjct: 95 EKPKPKPKPKPKPVKKVEQ 113


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
BBta_0555  OMPADOMAIN  80     5e-20    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 80.4 bits (198), Expect = 5e-20
Identities = 35/105 (33%), Positives = 52/105 (49%), Gaps = 1/105 (0%)

Query: 80 TAEREEIAAVAKSKPNIDLEITFDYNSANISQKSLASVQALGRALTSPDLKGSTFVVAGH 139
V + ++ F++N A + + A++ L L++ D K + VV G+
Sbjct: 201 APAPAPAPEVQTKHFTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGY 260

Query: 140 TDAAGGDAYNQDLSERRADSIKRYLVEKFGIAGADLVTVGYGKSK 184
TD G DAYNQ LSERRA S+ YL+ K GI + G G+S
Sbjct: 261 TDRIGSDAYNQGLSERRAQSVVDYLISK-GIPADKISARGMGESN 304


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
BBta_0556  RTXTOXIND  39     2e-05    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 38.7 bits (90), Expect = 2e-05
Identities = 12/50 (24%), Positives = 22/50 (44%)

Query: 57 VRVTGFVVPRREAVVGADQEGSRVTDVLVKEGDTVTENQELARLTPPPPQ 106
G + + E S V +++VKEG++V + L +LT +
Sbjct: 84 ATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAE 133



Score = 30.2 bits (68), Expect = 0.011
Identities = 26/127 (20%), Positives = 50/127 (39%), Gaps = 25/127 (19%)

Query: 120 LRAPAAGVVTE--VRTIAGAPASPQAGPMFRIA-VNNELELDAEVPSVHLLKLSPGLPAR 176
+RAP + V + V T G + + + I ++ LE+ A V + + ++ G A
Sbjct: 330 IRAPVSVKVQQLKVHTEGGVVTTAE--TLMVIVPEDDTLEVTALVQNKDIGFINVGQNAI 387

Query: 177 ITRDNLP-----DIIGRLRVVAPQVDRST--QLGH---ARITLTSSA--------TLKPG 218
I + P ++G+++ + D +LG I++ + L G
Sbjct: 388 IKVEAFPYTRYGYLVGKVKNINL--DAIEDQRLGLVFNVIISIEENCLSTGNKNIPLSSG 445

Query: 219 MFARANI 225
M A I
Sbjct: 446 MAVTAEI 452


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_0557  ACRIFLAVINRP  621    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 621 bits (1604), Expect = 0.0
Identities = 263/1038 (25%), Positives = 469/1038 (45%), Gaps = 49/1038 (4%)

Query: 5 ISAWSIRNPLPSILFSLILLILGWMSFSKLAVTRLPNADIPVISVAVAQFGAAPSELESQ 64
++ + IR P+ + + ++IL++ G ++ +L V + P P +SV+ GA ++
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 65 VTKIIEDGVSGVEGARHIQSLIT-DGLSVTTITFALETNTDRAINDVKDAVTRVRSDLPQ 123
VT++IE ++G++ ++ S G T+TF T+ D A V++ + LPQ
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 124 NVTEPLISRVDKIGLPIVTYAAISPGK--TPEQLSFFVDDVVKRELQGVRGVSQVERIGG 181
V + IS ++ +S T + +S +V VK L + GV V+ G
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 182 VEREILVSLDPDRLQAMGLTAVNVSQSLRGTNVDVAGGRAEIGKN------DQAIRTLAG 235
+ + + LD D L LT V+V L+ N +A G+ + +I
Sbjct: 181 -QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239

Query: 236 AKTLNELSNTMIPLFG-GGEVRLADLGTVTDTIADRRTFARFNGEPVVALGIKRAKGASD 294
K E + + G VRL D+ V + AR NG+P LGIK A GA+
Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299

Query: 295 VVVAAAVQKRIDEVKAAHP-DVDLRLIDTSVEFTKGNYEAAISTLFEGAILAVIIVLLFL 353
+ A A++ ++ E++ P + + + F + + + TLFE +L +++ LFL
Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359

Query: 354 RDFRATVIAAISLPLSIFPAFWAMDILGFSLNMVSFLAITLSTGILVDDAIVEIENIVRH 413
++ RAT+I I++P+ + F + G+S+N ++ + L+ G+LVDDAIV +EN+ R
Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419

Query: 414 MRMGKS-PYRAALDAADEIGLAVIAISLTIIAIFAPASFMSGIAGQFFKQFGITVSVQVF 472
M K P A + +I A++ I++ + A+F P +F G G ++QF IT+ +
Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479

Query: 473 FSLLAARFVTPMLAAYFMK--HHDHEDPPPGF----------ILRAYQRIVTWSVQHYFV 520
S+L A +TP L A +K +H + GF + Y V +
Sbjct: 480 LSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539

Query: 521 TVLLGIGIFAASIWSTTLLSQGFLPAQDAARSLLAIELPPGAQLAYTEKVTEEIVARLRK 580
+L+ I A + L FLP +D L I+LP GA T+KV +++ K
Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599

Query: 581 --RPEVRSVFVDGGRLQGAMEVRRAGMIVNYTPKTERKITQKELELAITKDLETIPDIRF 638
+ V SVF G V+ P ER + E I + + IR
Sbjct: 600 NEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRD 659

Query: 639 WFTDENGLRPIKL--VLTGSDPKLVDNVAS----------ELASQMKRIPI-ISNVVSDT 685
F + I TG D +L+D +L + P + +V +
Sbjct: 660 GFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNG 719

Query: 686 SLDRPELRVQPRADLAARLGVSTESLSQTIRVATIGDVGPALAKYDAGGRLVPIRVQLED 745
D + +++ + A LGVS ++QTI A G + + GR+ + VQ +
Sbjct: 720 LEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTY---VNDFIDRGRVKKLYVQADA 776

Query: 746 AARGDVKILEQLRVPLGQYGEKGGVPLAVVADIKLDQGPTSISRYDRGRQATVSADLVGL 805
R + +++L V GE VP + G + RY+ + +
Sbjct: 777 KFRMLPEDVDKLYVRS-ANGEM--VPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPG 833

Query: 806 AALGDATKQIDDLPVRKSMPASVKLIPSDDAESLNELASGFITAITAGLMIVYAVLVLLF 865
+ GDA +++L +PA + + + + + ++V+ L L+
Sbjct: 834 TSSGDAMALMENL--ASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALY 891

Query: 866 GTFLQPITILFSLPLSIGGAIGALMLTGKQLTIPVYIGMMMLMGIVTKNAIMLVEFAVEA 925
++ P++++ +PL I G + A L ++ + +G++ +G+ KNAI++VEFA +
Sbjct: 892 ESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDL 951

Query: 926 -IHAGTRRDEAIIDAGMKRARPIVMTTIAMVAGMMPSALAVGAGGEFRSPMALAVIGGLV 984
G EA + A R RPI+MT++A + G++P A++ GAG ++ + + V+GG+V
Sbjct: 952 MEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMV 1011

Query: 985 FSTVLSLLFVPAMFLVMD 1002
+T+L++ FVP F+V+
Sbjct: 1012 SATLLAIFFVPVFFVVIR 1029


70  BBta_0650  BBta_0657  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BBta_0650  -1      9        -1.177647  serine protease
BBta_0651  0       11       -1.888864  flavin-containing monooxygenase
BBta_0652  1       12       -1.778025  two component LuxR family transcriptional
BBta_0653  1       13       -2.134040  sensor histidine kinase
BBta_0654  2       20       -2.974642  branched-chain amino acid ABC transporter
BBta_0655  1       18       -2.532578  branched chain amino acid ABC transporter
BBta_0656  0       16       -2.309325  branched chain amino acid ABC transporter
BBta_0657  1       16       -1.688183  branched-chain amino acid ABC transporter
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
BBta_0650  V8PROTEASE  51     1e-09    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 51.2 bits (122), Expect = 1e-09
Identities = 35/206 (16%), Positives = 64/206 (31%), Gaps = 35/206 (16%)

Query: 29 TDGVGRSVVTI---VGSRGNFCSGALIAPKLVLTAGHCVQPGVDYRIVEYDRERKPALKT 85
T+G V I + SG ++ +LT H V D A +
Sbjct: 83 TNGHYAPVTYIQVEAPTGTFIASGVVVGKDTLLTNKHVVDATHG------DPHALKAFPS 136

Query: 86 VRRVAVHPG--FSMQSILAHRATADVALLELAAPAPER----VAAVLGVPQS-PLAAGNI 138
+P F+ + I + D+A+++ + + V + +
Sbjct: 137 AINQDNYPNGGFTAEQITKYSGEGDLAIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQN 196

Query: 139 FTVAGIGVAIPGDGKSGGVVRAAPLVATGRPGTLQIRLVDPATQGAKPGLGGCTGDSGGP 198
TV G PGD + + + + +Q D +T G G+SG P
Sbjct: 197 ITVTG----YPGDKPVATMWESKGKITYLKGEAMQY---DLSTTG---------GNSGSP 240

Query: 199 AFETQAQGPVLIGVVSWSTGPNLGDG 224
F + + +IG+
Sbjct: 241 VFNEKNE---VIGIHWGGVPNEFNGA 263


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
BBta_0652  HTHFIS  86     5e-21    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 86.0 bits (213), Expect = 5e-21
Identities = 35/136 (25%), Positives = 59/136 (43%), Gaps = 2/136 (1%)

Query: 12 LVVDDSPETLRLLTDALDGAGMTVMVALDGAAAMRIVDQITPDIILLDAVMPGIDGFETC 71
LV DD +L AL AG V + + A R + D+++ D VMP + F+
Sbjct: 7 LVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDLL 66

Query: 72 RKLKREAGLSNVPVIFMTGLADTEHIVQGLEAGGVDYVTKPIAVEEMLARIRVHLANARM 131
++K+ ++PV+ M+ ++ E G DY+ KP + E++ I LA +
Sbjct: 67 PRIKKA--RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 132 TQSARAALDVSGRFLL 147
S G L+
Sbjct: 125 RPSKLEDDSQDGMPLV 140


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
BBta_0653  HTHFIS  76     3e-16    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 76.0 bits (187), Expect = 3e-16
Identities = 27/153 (17%), Positives = 60/153 (39%), Gaps = 1/153 (0%)

Query: 916 PRKTILITDDDPVHRDLLREVLTPLGFILLSAPDGPSCLALAQHCQPDLFILDISMPGMD 975
TIL+ DDD R +L + L+ G+ + + + DL + D+ MP +
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 976 GWTVAETLRSSGHHQARILMTSASALEAHGRPLAQPFHDSYLMKPIDIPRLLEAIRQLLK 1035
+ + ++ + ++M++ + + + +D YL KP D+ L+ I + L
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYD-YLPKPFDLTELIGIIGRALA 120

Query: 1036 LDWSYQTDIPVAQPRWRPETGSRPPPRYVEELM 1068
+ + P G + + ++
Sbjct: 121 EPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVL 153


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BBta_0657  PF05272  35     3e-04    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 34.7 bits (79), Expect = 3e-04
Identities = 27/157 (17%), Positives = 48/157 (30%), Gaps = 45/157 (28%)

Query: 41 VIIGPNGAGKTTVLDLICGK----------TKATSGSIQFRGK------ELTKLRENEI- 83
V+ G G GK+T+++ + G Q G E+T R +
Sbjct: 600 VLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQIAGIVAYELSEMTAFRRADAE 659

Query: 84 -VKAGVGR---KFQTPSVFEDLSVFENLEISFPRGRTVF-GSL---TFQRDAT------- 128
VKA +++ + PR + V + + D T
Sbjct: 660 AVKAFFSSRKDRYRGA--------YGRYVQDHPR-QVVIWCTTNKRQYLFDITGNRRFWP 710

Query: 129 --VQDRVEEVAEMIFLKDRLKTSAAELSH-GQKQWLE 162
V R V + + +L A L G++ +
Sbjct: 711 VLVPGRANLVW-LQKFRGQLFAEALHLYLAGERYFPS 746


71  BBta_0808  BBta_0812  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BBta_0808  0       10       2.749422   RND family multidrug efflux protein
BBta_0809  -1      11       2.464199   hypothetical protein
BBta_0810  -1      12       1.903870   transcriptional regulator
BBta_0811  -1      13       1.959943   hypothetical protein
BBta_0812  -1      15       1.071634   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_0808  ACRIFLAVINRP  438    e-139    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 438 bits (1128), Expect = e-139
Identities = 223/1045 (21%), Positives = 426/1045 (40%), Gaps = 62/1045 (5%)

Query: 16 LSAWALRQSSLVIFMMIVVLLAGAWSYTKLTRNEDPPFTIKTMVVSVIWPGATVADTTNL 75
++ + +R+ + I++++AGA + +L + P + VS +PGA +
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 76 VTDRIEQKLEETPYLDRLDSYT-RAGESVIMVNLRDDTPPQAVSDVWYQVRKKVGDIAPM 134
VT IEQ + L + S + AG I + + T P QV+ K+ P+
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQV---QVQNKLQLATPL 117

Query: 135 LPSGVRGP-FFNDEFGDTFGIIYGFTAEG--FSDRELRDRLD-TIRAEVLRVKDVGKVQL 190
LP V+ ++ ++ ++ GF ++ + ++ D + ++ + R+ VG VQL
Sbjct: 118 LPQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQL 177

Query: 191 LGVQEEQIAVEFSPRKLAAFGLNAQQVMNALAAQNAVQPAG------VTRTGEEKIALRV 244
G + + + L + L V+N L QN AG + ++
Sbjct: 178 FG-AQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIA 236

Query: 245 SGAFASEASLRAVTLHVAQ--RYVPLTDIATISRIPVDPPAAAFRVNSEKALGLAISMAP 302
F + VTL V V L D+A + + + R+N + A GL I +A
Sbjct: 237 QTRFKNPEEFGKVTLRVNSDGSVVRLKDVARV-ELGGENYNVIARINGKPAAGLGIKLAT 295

Query: 303 TGNLSEFGAAVRERMTEIGARLPHGIDMVAVADQSSVVKDAIFGFLKVLVEAIAIVLAVS 362
N + A++ ++ E+ P G+ ++ D + V+ +I +K L EAI +V V
Sbjct: 296 GANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVM 355

Query: 363 FLSLGA-RAGLVVTFSIPLVLAMTFVGMELAGIGLQRISLGALIIALGLLVDDAMITVEA 421
+L L RA L+ T ++P+VL TF + G + +++ +++A+GLLVDDA++ VE
Sbjct: 356 YLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVEN 415

Query: 422 MVSRLEAGWDRARAASY-AYDSTAFPMLTGTLVMIAGFIPVGFAASSAGEYCFSLFMVVL 480
+ + + A+ + ++ +V+ A FIP+ F S G + ++
Sbjct: 416 VERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIV 475

Query: 481 ISLSASWIVAVLFSPIIGTWILPKSMAAHGHEDGRIGRAYNR-----------LLELVLR 529
+++ S +VA++ +P + +L A H G +N + +L
Sbjct: 476 SAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILG 535

Query: 530 RPAATIIVSLTALVVSGLGAMRLEQQFFPPSDRPELLVSLTLPQNASFEATDVQAKRLEA 589
+++ + + +RL F P D+ L + LP A+ E T ++
Sbjct: 536 STGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTD 595

Query: 590 MLKTDPDIDRFSTYVGAGAIRFYLPMDVLLSNDNVTQMVVVARDLDARDRVKARLDA--- 646
+ + S + G N V + + R+ + +A
Sbjct: 596 YYLKNEKANVESVFTVNG-------FSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIH 648

Query: 647 AFRTEFADLITRANPLELGPP------VGWPLKYRVT---GPDVGKVREIAMQLANIVAG 697
+ E I + P + + G + + QL + A
Sbjct: 649 RAKMELGK-IRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQ 707

Query: 698 N-AETRDVNLSAGEPQKSVEVKVRQTEARAVGLSSQDVAAALATIFSGSPVTTVRDANRL 756
+ A V + E +++V Q +A+A+G+S D+ ++T G+ V D R+
Sbjct: 708 HPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRV 767

Query: 757 VDVVVRSAGLERNDLATVANLQITGPDGAAIPLRQIADVAYGVEEPIIWRRQRLPIITVQ 816
+ V++ R V L + +G +P + P + R LP + +Q
Sbjct: 768 KKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQ 827

Query: 817 ADVDAAVQPATVSEALSPAVAKFAAGLPDGYRIAEGGVVEEAAKGNASIFAVIPLMLLVM 876
+ P T S + A+ LP G G+ + A++ + +V+
Sbjct: 828 GE----AAPGTSSGDAMALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVV 883

Query: 877 VSLLMVQLRSFSRTGIALAMAPFGLIGVVGAMLPTGTAMGFVAQLGVIALAGMIIRNAVI 936
L S+S + + P G++GV+ A +G++ G+ +NA++
Sbjct: 884 FLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAIL 943

Query: 937 LIEEA-DINVAGGMATDDAIKAAARHRARPILLTALAAILGMIPIAPQVFWG-----PMA 990
++E A D+ G +A A R R RPIL+T+LA ILG++P+A G +
Sbjct: 944 IVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVG 1003

Query: 991 FAIIGGLAAATLLTLTLLPVILSLL 1015
++GG+ +ATLL + +PV ++
Sbjct: 1004 IGVMGGMVSATLLAIFFVPVFFVVI 1028



Score = 94.1 bits (234), Expect = 1e-21
Identities = 73/530 (13%), Positives = 172/530 (32%), Gaps = 47/530 (8%)

Query: 525 ELVLRRPAATIIVSLTALVVSGLGAMRLEQQFFPPSDRPELLVSLTLPQNASFEATDVQA 584
+RRP ++++ ++ L ++L +P P + VS P + D
Sbjct: 3 NFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVT 62

Query: 585 KRLEAMLKTDPDIDRF-STYVGAGAIRFYLPMDVLLSNDNVTQMVVVARDLDARDRVKAR 643
+ +E + ++ ST AG++ L D V +
Sbjct: 63 QVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQV-----QNKLQLATPL 117

Query: 644 LDAAFRTEFADLITRANPLELGPPVGWPLKYRVTGPDVGK-------VREIAMQLANI-- 694
L + + I+ + P + + L+ +
Sbjct: 118 LPQEVQQQ---GISVEKSS---SSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNG 171

Query: 695 VAGNAETRDVNLSAGEPQKSVEVKVRQTEARAVGLSSQDVAAALATIFSGSPVTTVRDAN 754
V DV L Q ++ + + L+ DV L +
Sbjct: 172 VG------DVQLFGA--QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTP 223

Query: 755 RLVDVVVRSAGLERNDLATVANLQ----ITGPDGAAIPLRQIADVAYGVEE-PIIWRRQR 809
L + ++ + + DG+ + L+ +A V G E +I R
Sbjct: 224 ALPGQQLNASIIAQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARING 283

Query: 810 LPIITVQADVDAAVQPATVSEALSPAVAKFAAGLPDGYRIAE----GGVVEEAAKGNASI 865
P + + ++A+ +A+ P G ++ V+ + + +
Sbjct: 284 KPAAGLGIKLATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSI--HEVV 341

Query: 866 FAVIPLMLLVMVSLLMVQLRSFSRTGIALAMAPFGLIGVVGAMLPTGTAMGFVAQLGVIA 925
+ ++LV + + + L++ T I P L+G + G ++ + G++
Sbjct: 342 KTLFEAIMLVFLVMYLF-LQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVL 400

Query: 926 LAGMIIRNAVILIEEA-DINVAGGMATDDAIKAAARHRARPILLTALAAILGMIPIA--- 981
G+++ +A++++E + + + +A + + ++ A+ IP+A
Sbjct: 401 AIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFG 460

Query: 982 --PQVFWGPMAFAIIGGLAAATLLTLTLLPVILSLLFAAEARRQEGADRA 1029
+ + I+ +A + L+ L L P + + L +
Sbjct: 461 GSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGG 510


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
BBta_0809  RTXTOXIND  42     3e-06    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 41.7 bits (98), Expect = 3e-06
Identities = 31/185 (16%), Positives = 65/185 (35%), Gaps = 21/185 (11%)

Query: 97 VANELRAAEADLASATAAESLAKTTLERQQILLDKQIVAQVRVDEADANWRSAKARLDAA 156
NELR ++ L + AK + L +I+ ++R +
Sbjct: 264 AVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLR---------QTTDNIGLL 314

Query: 157 ASALANAKARLSYTRLLAPETGVITAIGANA-GQVVPAGQ---MVVRLASSLERDAVFNV 212
LA + R + + AP + + + + G VV + ++V +LE A V
Sbjct: 315 TLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTA--LV 372

Query: 213 AESVINNAPPDIEVKVTLVSDPSV---VLSGRVRDVSPTADPSTRT---YRVRVALPDTP 266
I + + + P L G+V++++ A R + V +++ +
Sbjct: 373 QNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNVIISIEENC 432

Query: 267 PLSYG 271
+
Sbjct: 433 LSTGN 437



Score = 39.8 bits (93), Expect = 1e-05
Identities = 20/99 (20%), Positives = 34/99 (34%), Gaps = 7/99 (7%)

Query: 70 GRVTSRQVEIGASVQKGELLATLDDTNVANELRAAEADLASATAAESLAKTTLERQQILL 129
V V+ G SV+KG++L L AEAD ++ A+ R QIL
Sbjct: 105 SIVKEIIVKEGESVRKGDVLLKLTALG-------AEADTLKTQSSLLQARLEQTRYQILS 157

Query: 130 DKQIVAQVRVDEADANWRSAKARLDAAASALANAKARLS 168
+ ++ + + + K + S
Sbjct: 158 RSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFS 196


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BBta_0810  HTHTETR  63     2e-14    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 63.1 bits (153), Expect = 2e-14
Identities = 40/211 (18%), Positives = 78/211 (36%), Gaps = 12/211 (5%)

Query: 1 MARPALTPDELDSTRRRILAETAAIVASEGYAALSMRRIASAIGLTAGALYRYFPTKQHV 60
MAR T E TR+ IL + + +G ++ S+ IA A G+T GA+Y +F K +
Sbjct: 1 MARK--TKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDL 58

Query: 61 LMALWSDAIAELDARIVA-MDAAHDHDVDAIAGILRAYAEFALADPARFR----LMFLEN 115
+W + + + + + + IL E + + R +
Sbjct: 59 FSEIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCE 118

Query: 116 DLGQFDELAREQDFFASYLR--LQARVEQAIRAGALQAM-PAEAATQLLWGSVHGIVTLG 172
+G+ + + Q ++ ++ I A L A A ++ G + G +
Sbjct: 119 FVGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISG--LME 176

Query: 173 VTVNQIDFGDLLRLAATAADTMLRGLSVPPS 203
+ DL + A +L + P+
Sbjct: 177 NWLFAPQSFDLKKEARDYVAILLEMYLLCPT 207


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
BBta_0812  IGASERPTASE  50     6e-08    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 50.1 bits (119), Expect = 6e-08
Identities = 54/318 (16%), Positives = 102/318 (32%), Gaps = 30/318 (9%)

Query: 597 YRPELPNPATEQPSAQIVSPAANQAAFAAAPRDFHAAA----APAAPPPPIPPKAIAEIL 652
Y PE+ + I +P QA + P + A AP PP P P E +
Sbjct: 981 YNPEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETV 1040

Query: 653 EPHAAQAARPTPIVPELPPDHPLEPGTRPLGRTASPAERIAASEDALSELPAGKREPVST 712
++ Q ++ + + + R A E + + V+
Sbjct: 1041 AENSKQESKTVEKNEQDATE-------------TTAQNREVAKEAKSNVKANTQTNEVAQ 1087

Query: 713 SSFIAAARRAAQAAAAAQPVDAKKADKAKKDKPKAKEPPKEAVKDLPKE--------AAK 764
S + + Q + +K +KAK + K +E PK + PK+ A+
Sbjct: 1088 S---GSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAE 1144

Query: 765 AKIKDKPTVALTAEGDEEVSTIGSKIRSLLVGASVVVIVLGTFKFAMSLLDSGSTPPPIT 824
++ PTV + E + +T + +S V + + P T
Sbjct: 1145 PARENDPTVNI-KEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTT 1203

Query: 825 PMESQGEAPRAQMPSATPSLSPRPATPDQSVPSLTSPTPMDRQSYNRSMPTEAETIAMLA 884
P +Q ++ + + R + + DR + T T A+L+
Sbjct: 1204 PATTQPTV-NSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTNAVLS 1262

Query: 885 IPQDTDKQAAAAAAPDVT 902
+ + A V+
Sbjct: 1263 DARAKAQFVALNVGKAVS 1280



Score = 35.4 bits (81), Expect = 0.002
Identities = 46/249 (18%), Positives = 83/249 (33%), Gaps = 25/249 (10%)

Query: 550 IEGDLRNVRATSSQPAAILAPPIEAPMPAARAPAPQTYAEPARPEPAYRPELPNPATEQ- 608
I+ D+ +V + + + A + P+ P PA + +T AE ++ E + ATE
Sbjct: 1003 IQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETT 1062

Query: 609 -------PSAQIVSPAANQAAFAAAPRDFHAAAAPAAPPPPIPP----KAIAEILE---- 653
A+ A Q A KA E +
Sbjct: 1063 AQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEV 1122

Query: 654 -PHAAQAA----RPTPIVPELPPDHPLEPGTRPLGRTASPAERIAASEDALSELPAGKRE 708
+Q + + + P+ P +P T + S A +E E + +
Sbjct: 1123 PKVTSQVSPKQEQSETVQPQAEPARENDP-TVNIKEPQSQTNTTADTEQPAKETSSNVEQ 1181

Query: 709 PVSTSSFIAAARRAAQAAAAAQPVDAKKADKAKKDKPKAKEPPKEAVKDLPK--EAAKAK 766
PV+ S+ + + P + ++ K K + +V+ +P E A
Sbjct: 1182 PVTESTTVNTGNSVVENPENTTPATTQPTVNSESSN-KPKNRHRRSVRSVPHNVEPATTS 1240

Query: 767 IKDKPTVAL 775
D+ TVAL
Sbjct: 1241 SNDRSTVAL 1249


72  BBta_0879  BBta_0885  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BBta_0879  -2      12       1.193443   3-oxoacyl-ACP reductase
BBta_0880  -1      12       1.550206   histidine kinase
BBta_0881  -2      12       1.771133   metal ion transporter
BBta_0882  -2      10       1.120998   ABC transporter substrate-binding protein
BBta_0883  -1      10       0.675402   hypothetical protein
BBta_0884  0       9        0.088265   ribitol 2-dehydrogenase (RDH)
BBta_0885  -1      10       0.020003   methyl-accepting chemotaxis protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_0879  DHBDHDRGNASE  91     2e-24    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 91.3 bits (226), Expect = 2e-24
Identities = 77/259 (29%), Positives = 107/259 (41%), Gaps = 31/259 (11%)

Query: 3 EQRVALVTAGGSGMGAAAARRLAADGFQVG----------ILSSSGKGEALATELGGLGV 52
E ++A +T G+G A AR LA+ G + + SS K EA E V
Sbjct: 7 EGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADV 66

Query: 53 TGSNKSNDDLKRLVDGAMRRWGRIDVLVNSAGHGPRAPILELSDEQWHTGLDTYLLNVIR 112
S ++ R+ R G ID+LVN AG I LSDE+W V
Sbjct: 67 RDSAAIDEITARI----EREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFN 122

Query: 113 PSRLVTPIMQTQKNGAIINISTAWTFEPSAMFPTSAVFRAGLASFTKIFADSYAADNIRM 172
SR V+ M +++G+I+ + + P A +A FTK A NIR
Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 173 NNVLPG----------WIDSLPAT-------EERRSSVPMARYGKAEEIAATIAFLASDG 215
N V PG W D A E ++ +P+ + K +IA + FL S
Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242

Query: 216 AAYITGQNIRVDGGLTRSV 234
A +IT N+ VDGG T V
Sbjct: 243 AGHITMHNLCVDGGATLGV 261


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
BBta_0880  HTHFIS  81     2e-18    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 81.0 bits (200), Expect = 2e-18
Identities = 32/125 (25%), Positives = 55/125 (44%), Gaps = 5/125 (4%)

Query: 404 MDRTGSERILIVEDRADVAELARSILEDFGYRTEIAANAPTALELLDSEIRFDLLFTDLV 463
M IL+ +D A + + L GY I +NA T + + DL+ TD+V
Sbjct: 1 MTGA---TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAG-DGDLVVTDVV 56

Query: 464 MPGGMNGTVLAREARKRQPKLKVLLTTGYSDAAVERADANIGEFEIINKPYRRTDLVRRV 523
MP N L +K +P L VL+ + + + G ++ + KP+ T+L+ +
Sbjct: 57 MPD-ENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGII 115

Query: 524 RQLLD 528
+ L
Sbjct: 116 GRALA 120


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_0884  DHBDHDRGNASE  91     3e-24    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 91.3 bits (226), Expect = 3e-24
Identities = 68/251 (27%), Positives = 113/251 (45%), Gaps = 23/251 (9%)

Query: 5 LEGKVAAVTGAASGIGLASSEAMLAAGARVVMIDRDASALARLRE------RHGEAVIPV 58
+EGK+A +TGAA GIG A + + + GA + +D + L ++ RH EA
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAF--- 62

Query: 59 VIDLLDSTDCATLLPRILDAAGRLDILHANAGSYIGGDLVDARTDAIDRMLNLNVNVVMK 118
D+ DS + RI G +DIL AG G + + + ++N V
Sbjct: 63 PADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFN 122

Query: 119 NVRDALPHMIDRGSGDIIVTSSLAAHYPTPWEPVYASSKWAIDCFVQTVRRQVFKHGIRV 178
R +M+DR SG I+ S A P YASSK A F + + ++ ++ IR
Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 179 GAISPGPVVTALIAD-WPAEKLKEARESGS------------LLEPAEVANVIMFMLTRP 225
+SPG T + W E E GS L +P+++A+ ++F+++
Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242

Query: 226 RG-MTIRDVVM 235
G +T+ ++ +
Sbjct: 243 AGHITMHNLCV 253


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_0885  PYOCINKILLER  31     0.024    Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 30.5 bits (68), Expect = 0.024
Identities = 37/193 (19%), Positives = 71/193 (36%), Gaps = 3/193 (1%)

Query: 356 GQMAAGNDDVVVPTSDRDDEIAAMAGSLQTFKEALLEKKRAEQAAAAEAQAKIERGQRVE 415
G AA N + + + EA K EQAAA + E+ ++
Sbjct: 183 GLTAAYNVKLFTEAISSLQIRMNTLTAAKASIEAAAANKAREQAAAEAKRKAEEQARQQA 242

Query: 416 RFTREFEMAIGEVVGVVSSASADLERSAASLTTTAGRSLELATVVTAASEEASTNVHSVA 475
A+ VV++A+ A + +++ A V ++ +V +V
Sbjct: 243 AIRAANTYAMPANGSVVATAAGRGLIQVAQGAASLAQAISDAIAVLGRVLASAPSVMAVG 302

Query: 476 AAAEEMSSSVNEISRQVQDSARIASEALAQAR---KTNDNVAELAKAAARIGDVVELINA 532
A+ SS E + + + + A+ + N+ +AKA+ + + L N
Sbjct: 303 FASLTYSSRTAEQWQDQTPDSVRYALGMDAAKLGLPPSVNLNAVAKASGTVDLPMRLTNE 362

Query: 533 IAGQTNLLALNAT 545
G T L++ +T
Sbjct: 363 ARGNTTTLSVVST 375


73  BBta_1166  BBta_1174  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BBta_1166  1       12       1.765346   IclR family transcriptional regulator
BBta_1167  2       13       0.982222   dihydrodipicolinate synthase
BBta_1168  1       13       0.798717   ABC transporter ATP-binding protein
BBta_1169  -1      11       0.725712   ABC transporter permease
BBta_1170  -1      9        0.316236   ABC transporter substrate-binding protein
BBta_1171  -1      9        1.291635   SDR family dehydrogenase
BBta_1172  -1      9        0.530053   hypothetical protein
BBta_1174  -2      13       0.728391   two-component response regulator
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BBta_1166  PF05616  29     0.031    Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 28.6 bits (63), Expect = 0.031
Identities = 15/40 (37%), Positives = 23/40 (57%), Gaps = 2/40 (5%)

Query: 189 RQVKAAGYALSNQ--ENAPGLCVLAAPVIDRDGVPLAAIS 226
+ +KA GY ++ E APG V PV DR+G P+ ++
Sbjct: 254 KYIKATGYPGYSEKVEVAPGTKVNMGPVTDRNGNPVQVVA 293


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BBta_1168  PF05272  33     0.001    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 33.1 bits (75), Expect = 0.001
Identities = 17/51 (33%), Positives = 28/51 (54%), Gaps = 7/51 (13%)

Query: 39 VHALSRISLDIKPG---EFVCV-VGPSGCGKSTLLRLLAGLDGYSDGRLLL 85
+ ++R+ ++PG ++ V G G GKSTL+ L GLD +SD +
Sbjct: 582 MGHVARV---MEPGCKFDYSVVLEGTGGIGKSTLINTLVGLDFFSDTHFDI 629


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_1171  DHBDHDRGNASE  92     1e-24    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 91.7 bits (227), Expect = 1e-24
Identities = 59/248 (23%), Positives = 105/248 (42%), Gaps = 30/248 (12%)

Query: 1 MTGTSSGIGRAIAERLLAGGWHVTGIDRALPDWTHA--------GFVGQQ--ADLAEPHA 50
+TG + GIG A+A L + G H+ +D P+ + AD+ + A
Sbjct: 13 ITGAAQGIGEAVARTLASQGAHIAAVD-YNPEKLEKVVSSLKAEARHAEAFPADVRDSAA 71

Query: 51 LIAQ----LADLGTVTALVHAAGYMRVGRLGELMPDSGEGMWQVHVGAAEALANALAPDM 106
+ ++G + LV+ AG +R G + L + E + V+ + +++ M
Sbjct: 72 IDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSKYM 131

Query: 107 --GEGGRIVLIGSRTANGA-AGRSQYAATKAALTGLARSWAIELAPRRITVNVIAPAATD 163
G IV +GS A + YA++KAA + +ELA I N+++P +T+
Sbjct: 132 MDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPGSTE 191

Query: 164 TPFLRD--PSRAGTAPVLP----------PMGRFVDPAEVAALTAFLLSPEASAITGQTI 211
T G V+ P+ + P+++A FL+S +A IT +
Sbjct: 192 TDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHITMHNL 251

Query: 212 TICAGASL 219
+ GA+L
Sbjct: 252 CVDGGATL 259


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
BBta_1174  HTHFIS  79     1e-20    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 79.5 bits (196), Expect = 1e-20
Identities = 30/119 (25%), Positives = 61/119 (51%), Gaps = 2/119 (1%)

Query: 2 SRVLIADDEESMRTLVARAIAMDGHEIVTAQDGAEALDILTRENGAFDLLLTDIQMPIMD 61
+ +L+ADD+ ++RT++ +A++ G+++ + A + G DL++TD+ MP +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAA--GDGDLVVTDVVMPDEN 61

Query: 62 GIALALSAARDYPDLVILLMTGFAHQRERASNLQAIAHDVITKPFSVADIRTAVADALA 120
L + PDL +L+M+ + A+D + KPF + ++ + ALA
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120


74  BBta_1273  BBta_1282  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BBta_1273  -2      9        0.190058   hypothetical protein
BBta_1274  0       9        0.431842   hypothetical protein
BBta_1275  0       9        0.467012   phosphoesterase family protein
BBta_1276  2       12       1.875399   hypothetical protein
BBta_1277  3       12       1.800230   general secretion pathway protein D
BBta_1278  1       13       1.935049   general secretion pathway protein E
BBta_1279  -1      16       1.921609   general secretion pathway protein F
BBta_1280  -1      21       1.580320   general secretion pathway protein G
BBta_1281  -1      20       2.026640   general secretion pathway protein H
BBta_1282  0       21       2.533787   general secretion pathway protein I, GspI
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
BBta_1273  IGASERPTASE  30     0.009    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 29.6 bits (66), Expect = 0.009
Identities = 22/123 (17%), Positives = 38/123 (30%), Gaps = 12/123 (9%)

Query: 80 PSPDMLDLPRRTEAPAKRKPQKTSNAQPVDDKSARKAARTDQKEQRRR--------EKEW 131
P+P T A ++ KT D R KE + E
Sbjct: 1028 PAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQ 1087

Query: 132 QKQQAAAAKQREHQQKAVAAAEA---ALERAQKEHESIASEIAADRARIEQ-RAQAEERR 187
+ + E ++ A E +E + S+++ + + E + QAE R
Sbjct: 1088 SGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPAR 1147

Query: 188 WQD 190
D
Sbjct: 1148 END 1150


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_1277  BCTERIALGSPD  207    9e-60    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 207 bits (529), Expect = 9e-60
Identities = 126/505 (24%), Positives = 225/505 (44%), Gaps = 42/505 (8%)

Query: 249 VFPVSNSAPEPLVAELEKIMDTGENGLSQNLVKLQVVSRLNAIMVVTRKPALLQTAATWI 308
V P++N A L L ++ D G V + ++++T + A+++ T +
Sbjct: 131 VVPLTNVAARDLAPLLRQLNDNAGVGSV-------VHYEPSNVLLMTGRAAVIKRLLTIV 183

Query: 309 RRLDQADSGRTSVHVYRIRYGDARQLAKVLTDMFGGASSSS--------------TDSTD 354
R+D A G SV + + A + K++T++ S S+ T++
Sbjct: 184 ERVDNA--GDRSVVTVPLSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVL 241

Query: 355 NQAAPGSDGTTTSVADRLSFNTNAAGSANGGATTSSRLQGAGGLSGMQSSSASSASSSTP 414
P S ++ +L G+ + A L +
Sbjct: 242 VSGEPNSRQRIIAMIKQLDRQQATQGNTK---VIYLKYAKASDLV---------EVLTGI 289

Query: 415 SASGLEPRSGGSGGQALMPNVRITPDTVNNSLLIYADRESYRTISSTLQQLDQPVLQVGI 474
S++ + AL N+ I N+L++ A + + + QLD QV +
Sbjct: 290 SSTMQSEKQAAKPVAALDKNIIIKAHGQTNALIVTAAPDVMNDLERVIAQLDIRRPQVLV 349

Query: 475 DATIAEVTLTNELSYGVQAYLSSKVLGLGTDKGSITNTQTTSVATATTAAATSALINRAL 534
+A IAEV + L+ G+Q + + T+ G +T S+ + AL
Sbjct: 350 EAIIAEVQDADGLNLGIQWANKNAGMTQFTNSGLPISTAIAGANQYNKDGTVSSSLASAL 409

Query: 535 PGFNFLIGHEASPN--MILDALHTVTSVKVLSNPSLVVINNQTATLQVGDVVPVSTGSAT 592
FN + N M+L AL + T +L+ PS+V ++N AT VG VPV TGS T
Sbjct: 410 SSFNGIAAGFYQGNWAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVPVLTGSQT 469

Query: 593 VLSSSNTVVNTIDYRNTGIILRVVPRIAANGNVRLEVEQEISNV---AAQSAASLTPTVS 649
+S + + NT++ + GI L+V P+I +V LE+EQE+S+V A+ +++ L T +
Sbjct: 470 --TSGDNIFNTVERKTVGIKLKVKPQINEGDSVLLEIEQEVSSVADAASSTSSDLGATFN 527

Query: 650 QRKVKSAISVANGQTVLLAGLISEQQNGNRNGIPGFDEIPILGDTFSHQDKKGTRTELII 709
R V +A+ V +G+TV++ GL+ + + + +P +IP++G F KK ++ L++
Sbjct: 528 TRTVNNAVLVGSGETVVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRNLML 587

Query: 710 FIRPQIIRDGSDAHQVAEELRSKLR 734
FIRP +IRD + Q + +
Sbjct: 588 FIRPTVIRDRDEYRQASSGQYTAFN 612



Score = 93.1 bits (231), Expect = 1e-21
Identities = 74/310 (23%), Positives = 127/310 (40%), Gaps = 25/310 (8%)

Query: 80 ADGKGYDLNFENTPIALVAKVVIGDILGAGYSIDPRVQGSVSLVSARPVPKSDILFVLES 139
A + + +F+ T I V L IDP V+G++++ S + + S
Sbjct: 25 AAAEEFSASFKGTDIQEFINTV-SKNLNKTVIIDPSVRGTITVRSYDMLNEEQYYQFFLS 83

Query: 140 ALRLSGVVLVREGGG-YKLTPLGDAIGAGRVDGEAGRAEPGYG----VSVVPLQYVGAQT 194
L + G ++ G K+ DA A A PG G VVPL V A+
Sbjct: 84 VLDVYGFAVINMNNGVLKVVRSKDAKTAAVPVA--SDAAPGIGDEVVTRVVPLTNVAARD 141

Query: 195 ILKLMDSFATRA--GSVRADSTRNLLLIQGTGAERRSAIETALSF--DVDWMRGQSVGVF 250
+ L+ A GSV N+LL+ G R + I+ L+ VD +SV
Sbjct: 142 LAPLLRQLNDNAGVGSVVHYEPSNVLLMTG----RAAVIKRLLTIVERVDNAGDRSVVTV 197

Query: 251 PVSNSAPEPLVAELEKI--MDTGENGLSQNLVKLQVVSRLNAIMVVTRKPALLQTAATWI 308
P+S ++ +V + ++ + + + R NA+ +V+ +P Q I
Sbjct: 198 PLSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAV-LVSGEPNSRQRIIAMI 256

Query: 309 RRLDQADSGRTSVHVYRIRYGDARQLAKVLTDMFGGASSSSTDSTDNQAAPGSDGTTTSV 368
++LD+ + + + V ++Y A L +VLT + SST ++ QAA ++
Sbjct: 257 KQLDRQQATQGNTKVIYLKYAKASDLVEVLTGI------SSTMQSEKQAAKPVAALDKNI 310

Query: 369 ADRLSFNTNA 378
+ TNA
Sbjct: 311 IIKAHGQTNA 320


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_1279  BCTERIALGSPF  238    6e-77    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 238 bits (608), Expect = 6e-77
Identities = 115/406 (28%), Positives = 189/406 (46%), Gaps = 4/406 (0%)

Query: 1 MPSYRYRALTQSGEIVVGSLVAPSRSEVERRISYLQLLPIETIEEKT-SGSGAASGFAFG 59
M Y Y+AL G+ G+ A S + + + L+P+ E + ++G +
Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLR 60

Query: 60 AP---SAAEVTTFTRDLALLLKAGARLDDALELLSGDSEVGRLRPVVARLRTAIMAGESF 116
S +++ TR LA L+ A L++AL+ ++ SE L ++A +R+ +M G S
Sbjct: 61 RKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSL 120

Query: 117 ADAAAAQPRLFSPMYVALMRVGEASGTLDHVLTALAGERERSEATRRKLTDAMQYPAFVF 176
ADA P F +Y A++ GE SG LD VL LA E+ + R ++ AM YP +
Sbjct: 121 ADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLT 180

Query: 177 LAAIGVMLFFLVAVLPNFSAVLRDFGGRADTALGFFMSLSDIVRANAAAITLGCAMLIAA 236
+ AI V+ L V+P + M +SD VR + L A
Sbjct: 181 VVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMA 240

Query: 237 VWWLLRQPAVRAALTSALAVIPGIGDILMFYRTSLFCRNLGLLLGCGVTLSAALRILVDV 296
+LRQ R + L +P IG I T+ + R L +L V L A+RI DV
Sbjct: 241 FRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDV 300

Query: 297 MAVTSSRAPWSAAADRVRHGGKLSQALSADNMLPPMALRMLRLGEETGQLPTLSTRIAEF 356
M+ +R S A D VR G L +AL + PPM M+ GE +G+L ++ R A+
Sbjct: 301 MSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADN 360

Query: 357 YESKLQRGLDRVVGIVGPAAILLISVVVGGLIVSIMTALLSVTQLV 402
+ + + +G+ P ++ ++ VV ++++I+ +L + L+
Sbjct: 361 QDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLM 406


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_1280  BCTERIALGSPG  135    1e-43    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 135 bits (342), Expect = 1e-43
Identities = 47/136 (34%), Positives = 77/136 (56%), Gaps = 6/136 (4%)

Query: 10 VRRSGRENERGFTLVEMLVVITIIGLIMGLIGPRVLNYLSESKTKAARIQLQSFSAALDL 69
+R + ++ RGFTL+E++VVI IIG++ L+ P ++ ++ + A + + ALD+
Sbjct: 1 MRATDKQ--RGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDM 58

Query: 70 FYLDVGRYPSTSEGLAALAQRP---PGLGTWNGPYLKGGAVPKDPWNTAYVYRAPGDHGP 126
+ LD YP+T++GL +L + P P +N +P DPW YV PG+HG
Sbjct: 59 YKLDNHHYPTTNQGLESLVEAPTLPPLAANYNKEGYI-KRLPADPWGNDYVLVNPGEHGA 117

Query: 127 FDILSYGADGQEGGSG 142
+D+LS G DG+ G
Sbjct: 118 YDLLSAGPDGEMGTED 133


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_1281  BCTERIALGSPG  39     1e-06    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 39.5 bits (92), Expect = 1e-06
Identities = 25/83 (30%), Positives = 37/83 (44%), Gaps = 11/83 (13%)

Query: 4 DSQRGFTLLEMVCVLAIVALLASVAWPYLPRQTSRPRLQAYALQAVTLLKSDRAASMRNG 63
D QRGFTLLE++ V+ I+ +LAS+ P L + Q V L + A M
Sbjct: 5 DKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVAL---ENALDM--- 58

Query: 64 LRVDTRLDTSRRLISSGSGGAAL 86
+LD ++ G +L
Sbjct: 59 ----YKLDNH-HYPTTNQGLESL 76


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_1282  BCTERIALGSPH  27     0.014    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 27.2 bits (60), Expect = 0.014
Identities = 9/23 (39%), Positives = 16/23 (69%)

Query: 6 QAGFTLIEVLVALAVVSVSIVAI 28
Q GFTL+E+++ L ++ VS +
Sbjct: 3 QRGFTLLEMMLILLLMGVSAGMV 25


75  BBta_1297  BBta_1314  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BBta_1297  -2      8        1.895863   multidrug ABC transporter
BBta_1298  -2      13       1.655552   multidrug resistant protein
BBta_1299  -2      14       1.369909   hypothetical protein
BBta_1300  0       12       -0.243087  hypothetical protein
BBta_1301  0       12       -1.182544  hypothetical protein
BBta_1303  0       13       -1.128905  hypothetical protein
BBta_1304  -1      11       -0.900150  cytochrome B561
BBta_1305  -1      12       -0.760586  hypothetical protein
BBta_1306  -2      11       -0.525513  hypothetical protein
BBta_1307  0       12       -0.732012  energy transducer TonB
BBta_1308  -1      12       0.285598   biopolymer transport exbD protein
BBta_1309  -2      12       0.743040   biopolymer transport protein
BBta_1310  -3      8        1.262982   hypothetical protein
BBta_1313  -2      8        1.274946   hydroxylase
BBta_1314  -2      8        1.615211   TonB-dependent receptor protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_1297  ACRIFLAVINRP  594    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 594 bits (1532), Expect = 0.0
Identities = 245/1046 (23%), Positives = 440/1046 (42%), Gaps = 44/1046 (4%)

Query: 18 FAIRFRGIVLALACTILGYGLFSLGEAKYGVFPEFAPPQVSIQTEAPGLSPEQVEVLVTQ 77
F IR LA ++ G ++ + +P APP VS+ PG + V+ VTQ
Sbjct: 4 FFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQ 63

Query: 78 PIETSINGLAGVESMRSSSIQ-GLSVLSVIFQPGTDVFRARQLVAERLTAIVNGLPQGVR 136
IE ++NG+ + M S+S G +++ FQ GTD A+ V +L LPQ V+
Sbjct: 64 VIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQ 123

Query: 137 APSMTPLVPAAGTVLVIGLTSE--QRSLMDLRTIADWTVSRRLLAVAGVAQVTTYGRDIR 194
++ ++ ++V G S+ + D+ V L + GV V +G
Sbjct: 124 QQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQ-Y 182

Query: 195 SLQVQVRADDLVRFGIGMNDVVAAARKATGVRGAGFVDTA------NQRVILQTQGQSLT 248
++++ + AD L ++ + DV+ + AG + + Q +
Sbjct: 183 AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKN 242

Query: 249 PEQLARTVLL-HQSGASVVLGDVATVVSAPEPPIGGALINGKPGIMMMISQQYGANTRNV 307
PE+ + L + G+ V L DVA V E A INGKP + I GAN +
Sbjct: 243 PEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALDT 302

Query: 308 AARVEAALQELRPAL-DGEKVQLHADLFRPATFIDSAIQNVLFALLIGGALVIVVLFLFL 366
A ++A L EL+P G KV + F+ +I V+ L LV +V++LFL
Sbjct: 303 AKAIKAKLAELQPFFPQGMKVLY---PYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359

Query: 367 SDWRTSVISCTAIPLSLFSAVLALQWMGQSLDTMTLGGLAIAIGEVVDDAVIGVENVVRR 426
+ R ++I A+P+ L L G S++T+T+ G+ +AIG +VDDA++ VENV R
Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419

Query: 427 LRENRALPSPRAAARVVYDAMLEVRSAVAYATFAVLLVFVPILALPGLAGRLFGPLGIAY 486
+ E++ P+ A +M +++ A+ + VF+P+ G G ++ I
Sbjct: 420 MMEDKL--PPKEA---TEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITI 474

Query: 487 ILAVLASLIVALTVTPALAMMLLAGHKPRGAPREPPLARW-------SRHYYERLLRRLG 539
+ A+ S++VAL +TPAL LL + W S ++Y + ++
Sbjct: 475 VSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKIL 534

Query: 540 RFPKLVMATALLITLGGAGLVPWFGGTFLPDLKEGHLILHVSMLPGTSLDESLRLGTMIA 599
+ LI G L +FLP+ +G + + + G + + + ++ +
Sbjct: 535 GSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVT 594

Query: 600 EAL--RAVPEVRSVAQHAGRAEAGIDTAGTHSSEIEVDFQP-----GLSGNAQELAERRI 652
+ V SV G + +G ++ V +P G +A+ + R
Sbjct: 595 DYYLKNEKANVESVFTVNGFSFSG---QAQNAGMAFVSLKPWEERNGDENSAEAVI-HRA 650

Query: 653 RAALANFAGINLSIKTYLTERIEETLSGFNAAVVINVFGPDLDDLETAARDIARELGEVW 712
+ L + T +GF+ +I+ G D L A + +
Sbjct: 651 KMELGKIRDGFVIPFNMPAIVELGTATGFDFE-LIDQAGLGHDALTQARNQLLGMAAQHP 709

Query: 713 GA-IDIQQQSPPGMPQVNVTLRPTDLQSWGLDAVEVLELIRTAYQGDVVGQAYEGNVVFN 771
+ + ++ Q + + Q+ G+ ++ + I TA G V + V
Sbjct: 710 ASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKK 769

Query: 772 VIVKLDPEASARPIQIGDMPIRTPSGAYVLLKQIADVYGTSGRYVVLHRNGQRVQTVTAN 831
+ V+ D + P + + +R+ +G V + G + NG +
Sbjct: 770 LYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGE 829

Query: 832 VAERDLESFVAAAKRKIAKEVKLPPGAHVEFTGAAEAQAKSRRDLVVNASLATLGIVLLL 891
A A +A KLP G ++TG + + S +++ + + L L
Sbjct: 830 AAPGTSSGDAMALMENLAS--KLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCL 887

Query: 892 ALVTKHWRNLALVLINLPFAFVGGVVAVALTGGVLSLGSMVGFVTLFGIALRNSILMISH 951
A + + W V++ +P VG ++A L + MVG +T G++ +N+IL++
Sbjct: 888 AALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEF 947

Query: 952 YEHLVAVEGRSWSLDLAIQGAADRLVPILMTSLVTGLGLLPLALGAGEPGREIEGPMAIV 1011
+ L+ EG+ ++ + RL PILMTSL LG+LPLA+ G G + + I
Sbjct: 948 AKDLMEKEGKG-VVEATLMAVRMRLRPILMTSLAFILGVLPLAISNG-AGSGAQNAVGIG 1005

Query: 1012 ILGGLLTSMALNLLVLPTLAVRFGRF 1037
++GG++++ L + +P V R
Sbjct: 1006 VMGGMVSATLLAIFFVPVFFVVIRRC 1031


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
BBta_1298  RTXTOXIND  29     0.023    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 29.0 bits (65), Expect = 0.023
Identities = 17/84 (20%), Positives = 29/84 (34%), Gaps = 11/84 (13%)

Query: 50 GAVLDVARLTDLTNSYANAQAQLQTAQAKLEVARLAFDRARDLVQSAAMPKKDAEAAEGT 109
G VL +LT L + Q QA+LE R Q + + + E
Sbjct: 121 GDVL--LKLTALGAEADTLKTQSSLLQARLEQTRY---------QILSRSIELNKLPELK 169

Query: 110 FRTDQAALAAAESQVKTLMATARQ 133
+ +E +V L + ++
Sbjct: 170 LPDEPYFQNVSEEEVLRLTSLIKE 193


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
BBta_1299  IGASERPTASE  39     2e-04    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 38.9 bits (90), Expect = 2e-04
Identities = 51/311 (16%), Positives = 93/311 (29%), Gaps = 35/311 (11%)

Query: 1291 AKRRAAQPATAPHTLEEEIATVAKYLSLARRSGAELPAVARTRIGAPVPHIRAERGADDR 1350
KR T T A V + E+ V + P P +E +
Sbjct: 986 EKRNQTVDTTNITTPNNIQADVPS----VPSNNEEIARVDEAPVPPPAPATPSETT--ET 1039

Query: 1351 SFDATFAKEPVAKESANAARKPSPDVRIFRRRSAAAVPVEP-------SSSAALEPSIVV 1403
+ + + +++ A + + R + + + V S S E
Sbjct: 1040 VAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTE 1099

Query: 1404 ATDPPVPPAPEQLPVPPPLRDETPPVAAAADPEPSGADTQPNAAPPA-----------PV 1452
+ E+ V E P V + P+ ++T A PA P
Sbjct: 1100 TKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQ 1159

Query: 1453 AQAALESTIDTPVSTEAAEQPQQHAASSESRIHRSLVEMAFQATESATAPRRRLPRRRVM 1512
+Q + + P ++ Q S+ S+VE T + T P
Sbjct: 1160 SQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKP 1219

Query: 1513 PIAAVGLLAVIGAALSWERAVDYLNAPSTVAAVDPAPPAEAPPRTAEVVQAAPDAKPATA 1572
+ R+V + P+T ++ D + A + DA+ A A
Sbjct: 1220 KNRHRRSV----------RSVPHNVEPATTSSNDRSTVALCDLTSTNTNAVLSDAR-AKA 1268

Query: 1573 NEERVTVGQSV 1583
+ VG++V
Sbjct: 1269 QFVALNVGKAV 1279


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
BBta_1300  TONBPROTEIN  32     0.003    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 32.3 bits (73), Expect = 0.003
Identities = 18/81 (22%), Positives = 24/81 (29%), Gaps = 11/81 (13%)

Query: 206 TMDAAAPSTQAPSSQR--------TPSAPPVAMRQPQAADPLPLPEGPSAGNPKPVVLAD 257
TM A + Q P P+ +A + P+ PKPV
Sbjct: 48 TMVTPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVK--- 104

Query: 258 AASAQPKLRVPETVAIPGGTF 278
QPK V + P F
Sbjct: 105 KVQEQPKRDVKPVESRPASPF 125


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
BBta_1306  PERTACTIN  30     0.036    Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 30.1 bits (67), Expect = 0.036
Identities = 16/53 (30%), Positives = 21/53 (39%)

Query: 190 GVLPRNLPPIPRQDAQPIPPVLADKPRMKLPPVNKGKPQEPQPQPPPMMDEDP 242
++ PP P+ QP P P+ PP PQ PQ QP + P
Sbjct: 561 SLVGAKAPPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPP 613


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BBta_1307  PF03544  60     8e-13    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 60.0 bits (145), Expect = 8e-13
Identities = 37/224 (16%), Positives = 74/224 (33%), Gaps = 22/224 (9%)

Query: 43 GGLGANGAEFALEMASPQVEDNDLPAGPD--SDAAEAAPEQMQQTAE-VKESDAVKDKPT 99
G G + ++ P + A P+ +Q E V E + +
Sbjct: 25 HGAVVAGLLYTSVHQVIELPAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIP 84

Query: 100 ETEEEADRVVTMNDPKKPEKEEQQQAVQQAE-----ASVASAAQEESARKALDEAAPPAE 154
E +EA V+ PK K + + V+Q + A+ E+ A ++
Sbjct: 85 EPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATA 144

Query: 155 TAKAPNPGMGKDKQKLTNDWGRKISAYFELHKRYPEGKKRN---GTVKVALVLSRAGKVM 211
P + + L+ +YP + G VKV ++ G+V
Sbjct: 145 ATSKPVTSVASGPRALSR-----------NQPQYPARAQALRIEGQVKVKFDVTPDGRVD 193

Query: 212 SASVMQSSGDSVFDQAALSMIHRSDPVPAPPSGLTDDQFSFSID 255
+ ++ + ++F++ + + R P P F I+
Sbjct: 194 NVQILSAKPANMFEREVKNAMRRWRYEPGKPGSGIVVNILFKIN 237


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
BBta_1310  RTXTOXINA  30     0.002    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 29.5 bits (66), Expect = 0.002
Identities = 19/98 (19%), Positives = 34/98 (34%), Gaps = 24/98 (24%)

Query: 5 AAAACAVGCVGVVIACALFMGLA--AVAGVADAFEAAG--------------------AA 42
A +IA A+ + ++ + +AD F+ A AA
Sbjct: 297 AQGLSTSAAAAGLIASAVTLAISPLSFLSIADKFKRANKIEEYSQRFKKLGYDGDSLLAA 356

Query: 43 FAAGDGT--AAIGAVCASLAAAGAGLVAEVCGATVAWP 78
F G A++ + LA+ +G+ A + V P
Sbjct: 357 FHKETGAIDASLTTISTVLASVSSGISAAATTSLVGAP 394



Score = 26.1 bits (57), Expect = 0.036
Identities = 8/42 (19%), Positives = 22/42 (52%)

Query: 27 AAVAGVADAFEAAGAAFAAGDGTAAIGAVCASLAAAGAGLVA 68
A++ ++ + + +A T+ +GA ++L A G+++
Sbjct: 366 ASLTTISTVLASVSSGISAAATTSLVGAPVSALVGAVTGIIS 407


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BBta_1314  PF03544  31     0.017    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 30.7 bits (69), Expect = 0.017
Identities = 19/101 (18%), Positives = 37/101 (36%), Gaps = 7/101 (6%)

Query: 19 DPSSDISGVKV--SAVASLIAVASFSSAEAQQAPLPSVTV----EAPVARPKPA-ASRPT 71
P+ IS V + + AV + P P EAPV KP +P
Sbjct: 45 APAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPK 104

Query: 72 AEQVRARDAMRRAARQQQQQQVAAPKSNLPPDRNPYSSPAS 112
+ V+ + +R + + + + ++ P ++ A+
Sbjct: 105 PKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAA 145


76  BBta_1341  BBta_1349  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BBta_1341  0       14       -0.199605  ABC transporter substrate-binding protein
BBta_1342  0       14       0.578306   cobalamin synthesis protein/P47K family protein
BBta_1343  -1      16       1.348959   hypothetical protein
BBta_1344  -1      16       1.091790   hypothetical protein
BBta_1345  -1      20       0.910118   homospermidine synthase
BBta_1346  -1      16       0.733705   hypothetical protein
BBta_1348  -2      15       1.366693   acetyltransferase (GNAT) family protein
BBta_1349  -2      14       0.714771   ornithine decarboxylase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits      Score  E-value  Comments
BBta_1341  adhesinb  270    1e-92    Adhesin B signature.
		>adhesinb#Adhesin B signature.

Length = 310

Score = 270 bits (693), Expect = 1e-92
Identities = 84/310 (27%), Positives = 160/310 (51%), Gaps = 19/310 (6%)

Query: 1 MKR---LLPALSLFLALL-----PSVPASAQTRPNVVTSFSILADFARRVGGDRISVTSL 52
MK+ L+ L F+ L S + ++ NVV + SI+AD + + GD+I++ S+
Sbjct: 1 MKKCRFLVLLLLAFVGLAACSSQKSSTETGSSKLNVVATNSIIADITKNIAGDKINLHSI 60

Query: 53 VGPDSDAHVYTPTPHDAKDVGAARLLIVNGLGLE----GWLPRLQQASGSK--APIIVAT 106
V D H Y P P D K A L+ NG+ LE W +L + + K +
Sbjct: 61 VPVGQDPHEYEPLPEDVKKTSQADLIFYNGINLETGGNAWFTKLVENAKKKENKDYYAVS 120

Query: 107 QGITP-----RKRGADADPHAWQSVGNARVYVRNIRDALVAADPADAAVFQANAERYLAE 161
+G+ + DPHAW ++ N +Y +NI L DPA+ ++ N + Y+ +
Sbjct: 121 EGVDVIYLEGQSEKGKEDPHAWLNLENGIIYAQNIAKRLSEKDPANKETYEKNLKAYVEK 180

Query: 162 LDALDQEVRAEIGKIPPERRKVISTHDAFGYFADAYGIQFIAPLGVSTETEPSARDVAEI 221
L ALD+E + + IP E++ ++++ F YF+ AY + ++TE E + + +
Sbjct: 181 LSALDKEAKEKFNNIPGEKKMIVTSEGCFKYFSKAYNVPSAYIWEINTEEEGTPDQIKTL 240

Query: 222 IVQVRKDKIPAVFLENFNDDRLVGRIAAETGAKIGGTLYSDALSEENGPAPTYIAMVRHN 281
+ ++RK K+P++F+E+ DDR + ++ +T I +++D+++E+ +Y +M+++N
Sbjct: 241 VEKLRKTKVPSLFVESSVDDRPMKTVSKDTNIPIYAKIFTDSVAEKGEEGDSYYSMMKYN 300

Query: 282 IRALTSALGR 291
+ + L +
Sbjct: 301 LEKIAEGLSK 310


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BBta_1344  PF05616  27     0.014    Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 27.0 bits (59), Expect = 0.014
Identities = 22/59 (37%), Positives = 28/59 (47%), Gaps = 13/59 (22%)

Query: 44 AQATPE------PSATPAESKPGGTRPTTPAPEP-----ARPDVEVQKEGAKPALPPAP 91
AQ PE P+ PA ++ GTRP P P+P A PD + Q G +P P P
Sbjct: 325 AQPLPEVSPAENPANNPAPNENPGTRP-NPEPDPDLNPDANPDTDGQP-GTRPDSPAVP 381


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_1348  SACTRNSFRASE  35     7e-05    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 35.3 bits (81), Expect = 7e-05
Identities = 16/74 (21%), Positives = 33/74 (44%), Gaps = 9/74 (12%)

Query: 71 LVGTLRLWHVSAGGRDALMLGPLAVAAEARSLGVGAALMNAALMIAASRGHGAVVL---- 126
+G +++ + ++ +AVA + R GVG AL++ A+ A ++L
Sbjct: 76 CIGRIKI---RSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQD 132

Query: 127 --LGDAPYYARFGF 138
+ +YA+ F
Sbjct: 133 INISACHFYAKHHF 146


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
BBta_1349  ALARACEMASE  34     8e-04    Alanine racemase signature.
		>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 34.0 bits (78), Expect = 8e-04
Identities = 21/110 (19%), Positives = 42/110 (38%), Gaps = 4/110 (3%)

Query: 24 VVDLDVVRDNYMNFAKALPDSRVFYAVKANPAPEVLSLLASLGSSFDTATVAEIEMALA- 82
+DL ++ N +A +RV+ VKAN + + S + D + +E A+
Sbjct: 8 SLDLQALKQNLSIVRQAATHARVWSVVKANAYGHGIERIWSAIGATDGFALLNLEEAITL 67

Query: 83 --AGAT-PDRISYGNTIKKERDIARAYALGIRLFAVDCAAEVEKISRAAP 129
G P + G ++ +I + L + + ++ AP
Sbjct: 68 RERGWKGPILMLEGFFHAQDLEIYDQHRLTTCVHSNWQLKALQNARLKAP 117


77  BBta_1420  BBta_1425  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BBta_1420  1       16       -2.017185  hypothetical protein
BBta_1421  0       17       -1.807347  TetR family transcriptional regulator
BBta_1422  -1      15       -1.572632  hypothetical protein
BBta_1423  -1      17       -1.751731  RND family multidrug efflux protein
BBta_1424  -1      18       -2.476751  RND family multidrug efflux protein
BBta_1425  -2      16       -2.592888  RND family multidrug efflux protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BBta_1420  HTHTETR  37     3e-06    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 37.3 bits (86), Expect = 3e-06
Identities = 12/80 (15%), Positives = 30/80 (37%)

Query: 8 HAKSARTVRRIVQAALHLYREIGHKKTTVADIARISSMSSANIYRFLEARQAVEDFVVKQ 67
++ T + I+ AL L+ + G T++ +IA+ + ++ IY + + + + +
Sbjct: 6 KQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWEL 65

Query: 68 LLEETANAATETARNGGSAL 87
E
Sbjct: 66 SESNIGELELEYQAKFPGDP 85


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BBta_1421  HTHTETR  48     2e-09    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 48.5 bits (115), Expect = 2e-09
Identities = 24/161 (14%), Positives = 54/161 (33%), Gaps = 9/161 (5%)

Query: 1 MFLEKGYRSASIDDISEMAPASKPTIYAHFPGKEALFAAVVARTISGLTDFEGFTPEGRT 60
+F ++G S S+ +I++ A ++ IY HF K LF+ + + S + + E
Sbjct: 23 LFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELEYQAKFP 82

Query: 61 IEDKLMSLGTVIIERVFEESLGMVRATIAEAPRFPELSRNVHDAARDRSLTAVSQLLNDA 120
+ + + E + ++ +T+ E R + H + V Q +
Sbjct: 83 GDP---------LSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRNL 133

Query: 121 TQKLARAPKGPFSAKRSRTTAQIFLDLILLPMLFRSLLGET 161
+ + L ++ R +
Sbjct: 134 CLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGL 174


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
BBta_1423  RTXTOXIND  42     3e-06    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 41.7 bits (98), Expect = 3e-06
Identities = 23/118 (19%), Positives = 36/118 (30%), Gaps = 9/118 (7%)

Query: 79 SGRVLARYADVGSHVRAGEVLAVLDPAEQQADVDAATAAVTSAEA-QLRVAMATFERQR- 136
+ V G VR G+VL L +AD +++ A Q R + + +
Sbjct: 104 NSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELN 163

Query: 137 SLIASGFTTRPAYDQAQEGLR-----LAESTLEAARAQLGTSKEALGYTALRAEADGV 189
L P + E L + + Q + L RAE V
Sbjct: 164 KLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNL--DKKRAERLTV 219



Score = 39.8 bits (93), Expect = 1e-05
Identities = 20/119 (16%), Positives = 40/119 (33%), Gaps = 6/119 (5%)

Query: 99 LAVLDPAEQQADVDAATAAVTSAEAQLRVAMATFERQRSLIASGFTTRPAYDQAQEGLRL 158
AVL+ + + S Q+ + + + + L+ F Q +
Sbjct: 252 HAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNI 311

Query: 159 AESTLEAARAQLGTSKEALGYTALRAEADG-VITARNLEAGQVVPAAQPVFSLARDGER 216
TLE A+ +E + +RA V + G VV A+ + + + +
Sbjct: 312 GLLTLELAKN-----EERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDT 365


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
BBta_1424  RTXTOXIND  37     1e-04    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 36.7 bits (85), Expect = 1e-04
Identities = 19/122 (15%), Positives = 39/122 (31%), Gaps = 13/122 (10%)

Query: 96 LAVRSARAQLAKAQAQLATAQATENRQRTLINSDATTKQTLDNAEQ--------ARAGAE 147
AV + +A +L ++ + + I + K+ Q
Sbjct: 252 HAVLEQENKYVEAVNELRVYKSQLEQIESEI---LSAKEEYQLVTQLFKNEILDKLRQTT 308

Query: 148 GTVAQEQANLTKAIEQLGYAQIKADFGGVVIAVGA-EVGQVVSPGQSVVTVARPEIREAV 206
+ L K E+ + I+A V + G VV+ ++++ + PE
Sbjct: 309 DNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIV-PEDDTLE 367

Query: 207 VD 208
V
Sbjct: 368 VT 369



Score = 33.3 bits (76), Expect = 0.001
Identities = 15/91 (16%), Positives = 33/91 (36%), Gaps = 10/91 (10%)

Query: 77 GDLVSEGQIVGAIDPTALDLAVRSARAQLAKAQAQLATAQATENRQRTL---INSDATTK 133
G+ V +G ++ + A A K Q+ L A+ + R + L I + +
Sbjct: 115 GESVRKGDVLLKLTAL-------GAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPE 167

Query: 134 QTLDNAEQARAGAEGTVAQEQANLTKAIEQL 164
L + + +E V + + + +
Sbjct: 168 LKLPDEPYFQNVSEEEVLRLTSLIKEQFSTW 198


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_1425  ACRIFLAVINRP  485    e-156    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 485 bits (1250), Expect = e-156
Identities = 243/1041 (23%), Positives = 440/1041 (42%), Gaps = 56/1041 (5%)

Query: 64 LSDWALGHRSLVWYFMIAFMVAGLFAYLQLGRQEDPDFTIKTMVIQAQWPGASPEEMTRQ 123
++++ + W I M+AG A LQL + P + + A +PGA + +
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 124 VTDRIEKKLEELESLDYTKSVTV-AGQTTVFVYLRDSTKAIDVKPTWVRIRNMIADIKGD 182
VT IE+ + +++L Y S + AG T+ + + T D V+++N +
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGT---DPDIAQVQVQNKLQLATPL 117

Query: 183 FPQGVIGPG-FNDRFGDVFGNVYAFTSDG--LSQRQLRDQVED-IRAKVLTVPAVGKVDI 238
PQ V G ++ + V F SD +Q + D V ++ + + VG V +
Sbjct: 118 LPQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQL 177

Query: 239 LGAQDEV-IYLEFSTRKIAALGLDVHAIMNSLQGQNAVAPSGLFQEGPE------RISVR 291
GAQ + I+L+ + L ++N L+ QN +G P S+
Sbjct: 178 FGAQYAMRIWLDAD--LLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASII 235

Query: 292 VNGQFTSEASLKAVNLRINDRFFP--LTDVATITRGYLDPARTLFRYNGQPAIALAIGMK 349
+F + V LR+N L DVA + G + + R NG+PA L I +
Sbjct: 236 AQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELG-GENYNVIARINGKPAAGLGIKLA 294

Query: 350 SGANLLQFGKALKEQVNKVIADLPIGIGVHLVADQPVVVEHAVSGFTEALFEAVIIVLGI 409
+GAN L KA+K ++ ++ P G+ V D V+ ++ + LFEA+++V +
Sbjct: 295 TGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLV 354

Query: 410 SFLSLG-LRAGLVVAIAIPLVLAITFVVMAYSGISLQRISLGALIIALGLLVDDAMIAVE 468
+L L +RA L+ IA+P+VL TF ++A G S+ +++ +++A+GLLVDDA++ VE
Sbjct: 355 MYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVE 414

Query: 469 MMV-ARLEIGDPLEKAATHVYTSTAFPMLTGTLVTVAGFIPIGLNSSNAGEFTFTLFVVI 527
+ +E P ++A + ++ +V A FIP+ + G + I
Sbjct: 415 NVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITI 474

Query: 528 AVSLIVSWIVAVLFTPLLGVTILPSQMKGHHEDKGRLARMFSRLLLFCMHH--------- 578
++ +S +VA++ TP L T+L HHE+KG F+ ++H
Sbjct: 475 VSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKIL 534

Query: 579 --RWSTIGVTVAAFLLAVVGLQFVQQQFFPSSDRAELVIDWNLPQNASITDTNAQMARFE 636
+ + VV + F P D+ + LP A+ T + +
Sbjct: 535 GSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVT 594

Query: 637 REQLQAN-DSVDHWSTYVGTG----APRFVLSF-DLQTTNTWFGQQVIVTKGGIAARDRL 690
L+ +V+ T G A ++F L+ G + + + R ++
Sbjct: 595 DYYLKNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDE--NSAEAVIHRAKM 652

Query: 691 KSQFENYLRTTFPGTDTYVKLLEVGPPVGRPVQYRLSGPDIAKVRDLSQKLAGIVRSSP- 749
+ P + L + + +G + +L G+ P
Sbjct: 653 ELGKI-RDGFVIPFNMPAIVELGTATGFDFELIDQ-AGLGHDALTQARNQLLGMAAQHPA 710

Query: 750 DLGNVVFDWMEPARVVKVDVLQDKARQLGVTSEDIATTLNSVFEGSPITQVRDSIYLVNV 809
L +V + +E K++V Q+KA+ LGV+ DI T+++ G+ + D + +
Sbjct: 711 SLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKL 770

Query: 810 MGRATAPERASIDTLRDLQLVGLGGQSVPLGAIANLRYELEQPTIWRRARIPTITLKAAV 869
+A A R + + L + G+ VP A + P + R +P++ +
Sbjct: 771 YVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSME----I 826

Query: 870 VGNVQPKTVVDQLAPKVAEFTKGLPAGYSVRIGGSVEESAKSQAPIIAVVPLMLFVMATV 929
G P T + LPAG G + S A+V + V+
Sbjct: 827 QGEAAPGTSSGDAMALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLC 886

Query: 930 LMVQLQSFARLFLVFAVAPLAVIGVVMAMLPSGAPLGFVAILGVLALIGILVRNSVILIV 989
L +S++ V V PL ++GV++A ++G+L IG+ +N+++++
Sbjct: 887 LAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVE 946

Query: 990 QIEDLK-KEGRPAWEAVVEATEHRMRPILLTAAAASLALIPIA------REIFWGPMAYA 1042
+DL KEG+ EA + A R+RPIL+T+ A L ++P+A +
Sbjct: 947 FAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQN-AVGIG 1005

Query: 1043 MMGGIIVGTLLTLLFLPALYV 1063
+MGG++ TLL + F+P +V
Sbjct: 1006 VMGGMVSATLLAIFFVPVFFV 1026



Score = 87.6 bits (217), Expect = 2e-19
Identities = 83/518 (16%), Positives = 179/518 (34%), Gaps = 41/518 (7%)

Query: 574 FCMHHRWSTIGVTVAAFLLAVVGLQFVQQQFFPSSDRAELVIDWNLPQNASITDTNAQMA 633
F + + + + + + + +P+ + + N P A +
Sbjct: 4 FFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYP-GADAQTVQDTVT 62

Query: 634 RFEREQLQANDSVDH-WSTYVGTGAPRFVLSFDLQTTNTWFGQQVIVTKGGIAARDRLKS 692
+ + + D++ + ST G+ L+F T QV V A L
Sbjct: 63 QVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDI--AQVQVQNKLQLATPLLPQ 120

Query: 693 QFENYLRTTFPGTDTYVKLLEVGPPVGRPVQYRLSGPDIAKVRDLSQKLAGIVRSSPDLG 752
+ + + + +Y+ + Q +S + V+D +L G+ G
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGV-------G 173

Query: 753 NVVFDWMEPARVVKVDVLQDKARQLGVTSEDIATTLNSVFEGSPITQVRDSIYLVNVMGR 812
+V + A + +D D + +T D+ L + Q+ + L
Sbjct: 174 DVQLFGAQYAMRIWLD--ADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLN 231

Query: 813 ATAPERASIDTLRDLQ----LVGLGGQSVPLGAIANLRYELEQPTIWRRAR-IPTITLKA 867
A+ + + V G V L +A + E + R P L
Sbjct: 232 ASIIAQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGI 291

Query: 868 AVVGNVQPKTVVDQLAPKVAEFTKGLPAG--------YSVRIGGSVEESAKSQAPIIAVV 919
+ + K+AE P G + + S+ E K+ I +V
Sbjct: 292 KLATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLV 351

Query: 920 PLMLFVMATVLMVQLQSFARLFLVFAVA-PLAVIGVVMAMLPSGAPLGFVAILGVLALIG 978
L++++ LQ+ R L+ +A P+ ++G + G + + + G++ IG
Sbjct: 352 FLVMYLF-------LQNM-RATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIG 403

Query: 979 ILVRNSVILIVQIED-LKKEGRPAWEAVVEATEHRMRPILLTAAAASLALIPIA-----R 1032
+LV ++++++ +E + ++ P EA ++ ++ A S IP+A
Sbjct: 404 LLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGST 463

Query: 1033 EIFWGPMAYAMMGGIIVGTLLTLLFLPALYVAWFKIHP 1070
+ + ++ + + L+ L+ PAL K
Sbjct: 464 GAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVS 501


78  BBta_1515  BBta_1521  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BBta_1515  2       34       -5.757105  secretory protein kinase, cpaF
BBta_1516  4       34       -5.254016  Type II secretion system protein
BBta_1517  4       35       -5.466611  hypothetical protein
BBta_1519  6       36       -5.811602  pilus assembly protein CpaC
BBta_1520  8       30       -5.523498  hypothetical protein
BBta_1521  7       28       -5.927628  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BBta_1515  PF07675  30     0.015    Cleaved Adhesin
		>PF07675#Cleaved Adhesin

Length = 1358

Score = 30.4 bits (68), Expect = 0.015
Identities = 15/57 (26%), Positives = 29/57 (50%), Gaps = 1/57 (1%)

Query: 3 ASTRPTVSEAKDFIRDQIFLRIEPLVAVRISQQDLMVSVNKLVAEIATGRKILLNQD 59
++ + S I D +F+ IEP VR ++ ++++ + V TG + LL+ D
Sbjct: 356 SAKKAEGSREVKRIGDGLFVTIEPANDVRANEAKVVLAADN-VWGDNTGYQFLLDAD 411
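
Each hit in this report carries a "Score = ..., Expect = ..." line followed by an "Identities = ..." line. The following is a minimal sketch, not part of PredictBias itself, of how those two lines could be read back from the report text and compared with the 0.05 RPSBlast E-value listed under the input parameters. The function names, the dictionary keys and the "less than or equal" comparison are assumptions made for illustration.

```python
import re

# Patterns for the two statistics lines that precede each alignment block.
SCORE_RE = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)"
)
IDENT_RE = re.compile(r"Identities\s*=\s*(\d+)/(\d+)")


def parse_expect(text):
    """Convert an Expect string to float.

    Very small values can be printed without a leading digit
    (for example 'e-147'), which float() rejects, so a '1' is prepended.
    """
    text = text.rstrip(",")
    if text.startswith("e"):
        text = "1" + text
    return float(text)


def parse_hsps(report_text):
    """Return one dict per Score line, with the identity fraction attached."""
    hsps = []
    current = None
    for line in report_text.splitlines():
        match = SCORE_RE.search(line)
        if match:
            current = {
                "bits": float(match.group(1)),
                "raw": int(match.group(2)),
                "evalue": parse_expect(match.group(3)),
            }
            hsps.append(current)
            continue
        match = IDENT_RE.search(line)
        if match and current is not None:
            current["identity"] = int(match.group(1)) / int(match.group(2))
    return hsps


def passes_threshold(hsp, max_evalue=0.05):
    """Assumed rule: keep a hit when its E-value does not exceed the threshold."""
    return hsp["evalue"] <= max_evalue
```

Calling parse_hsps on the text of one hit and filtering the result with passes_threshold gives the alignments that would survive the stated E-value cutoff under that assumed rule.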


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_1516  BCTERIALGSPF  34     7e-04    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F

signature.
Length = 408

Score = 34.0 bits (78), Expect = 7e-04
Identities = 28/120 (23%), Positives = 50/120 (41%), Gaps = 7/120 (5%)

Query: 164 ARMLRAGLPITVAMRTVAVDGSPP-VSRVFGLIADELRIGVPLEEALDTNSREIGLPDFR 222
A ++ A +P+ A+ VA P +S++ + ++ G L +A+ F
Sbjct: 78 ATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSLADAMKCFP-----GSFE 132

Query: 223 FFAVAMTLQFATGGNLTATLESLSDIIRKRRAARLKA-KAATGEIRLTAYTLGAIPILTT 281
AM T G+L A L L+D +R+ R + +A LT + + IL +
Sbjct: 133 RLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLTVVAIAVVSILLS 192


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_1519  BCTERIALGSPD  150    3e-41    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D

signature.
Length = 660

Score = 150 bits (380), Expect = 3e-41
Identities = 72/277 (25%), Positives = 118/277 (42%), Gaps = 9/277 (3%)

Query: 172 VINAMRVAASQQVMLRVRFIEVSRQAEREIGVNWFGANASGTRGINTGTGAISQAGPTAT 231
VI + + QV++ EV +G+ W NA T+ N+G +
Sbjct: 336 VIAQLDIRR-PQVLVEAIIAEVQDADGLNLGIQWANKNAGMTQFTNSGLPISTAI----- 389

Query: 232 SAGVPVFNTIGTFAGSTLSAPFGVGLFNLANKGGSVDVLITALEKKGLARRLAEPDLVAL 291
AG +N GT + S SA G+ +L+TAL LA P +V L
Sbjct: 390 -AGANQYNKDGTVSSSLASALSSFNGIAAGFYQGNWAMLLTALSSSTKNDILATPSIVTL 448

Query: 292 SGDTASFLAGGEYPVPS-VQSSSGTTPVITVLYKPFGVQLTFVPTVLASGIINLRLTPSV 350
A+F G E PV + Q++SG TV K G++L P + + L + V
Sbjct: 449 DNMEATFNVGQEVPVLTGSQTTSGDNIFNTVERKTVGIKLKVKPQINEGDSVLLEIEQEV 508

Query: 351 SELDYTNAVAISGTLVPALSKREARTAIELRDGQSFAIAGLLQSDNLRDVGQLPWLGSVP 410
S + A + S L + R A+ + G++ + GLL ++P LG +P
Sbjct: 509 SSVAD-AASSTSSDLGATFNTRTVNNAVLVGSGETVVVGGLLDKSVSDTADKVPLLGDIP 567

Query: 411 VLGTLFRSTSYQQKETDLVVIVTPHLVAPAAPGQALA 447
V+G LFRSTS + + +L++ + P ++ + +
Sbjct: 568 VIGALFRSTSKKVSKRNLMLFIRPTVIRDRDEYRQAS 604


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_1521  SYCDCHAPRONE  45     8e-09    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD

chaperone signature.
Length = 168

Score = 44.5 bits (105), Expect = 8e-09
Identities = 21/93 (22%), Positives = 37/93 (39%)

Query: 5 YFNRGIYGTAEKYFQSAVEKAPKDVSAWIGLAASYDRLGRFDLADHAYGQAIKLGGETTQ 64
+ G Y A K FQ+ D ++GL A +G++DLA H+Y + + +
Sbjct: 46 QYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPR 105

Query: 65 ILNNLGYSYMLRGKLTAARTKFMEAYRREPDNP 97
+ + +G+L A + A D
Sbjct: 106 FPFHAAECLLQKGELAEAESGLFLAQELIADKT 138


79  BBta_1545  BBta_1552  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BBta_1545  4       27       -5.876569  HlyD family heavy metal efflux pump
BBta_1546  5       29       -6.013759  cation efflux system protein
BBta_1547  7       28       -5.755845  hypothetical protein
BBta_1548  5       21       -5.027725  hypothetical protein
BBta_1549  5       19       -4.265511  LuxR transcriptional regulator
BBta_1550  5       19       -3.519579  ATP-binding region, ATPase-like protein
BBta_1551  3       14       -2.101370  hypothetical protein
BBta_1552  3       15       -2.089436  OmpA-like transmembrane domain-containing
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
BBta_1545  RTXTOXIND  47     9e-08    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 46.7 bits (111), Expect = 9e-08
Identities = 33/242 (13%), Positives = 80/242 (33%), Gaps = 45/242 (18%)

Query: 140 DLLSAASSLIATSGVLQLTTRALNRLKTLYESRAVAQ---KDVEQAISDQQTAEGAHKAA 196
+ L+ + + + ++ L+ +L +A+A+ + E + +K+
Sbjct: 215 ERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQ 274

Query: 197 RDAVR------------IFGKTDAEIDAIIKERRADST---------------LVVKSPI 229
+ + + EI +++ + V+++P+
Sbjct: 275 LEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPV 334

Query: 230 NGRITARNA-APGLFVQPGSAPAPFSVADTSTMWMIANVAESDVSAIHVGQHVKVSVMSY 288
+ ++ G V V + T+ + A V D+ I+VGQ+ + V ++
Sbjct: 335 SVKVQQLKVHTEGGVVTTAE-TLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAF 393

Query: 289 PGKIF---EGHISTISSN--VDPNTH------RMLVRSEIEDPDH--ELRSGMFARFSIV 335
P + G + I+ + D + + + + L SGM I
Sbjct: 394 PYTRYGYLVGKVKNINLDAIEDQRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIK 453

Query: 336 IG 337
G
Sbjct: 454 TG 455



Score = 34.0 bits (78), Expect = 0.001
Identities = 23/124 (18%), Positives = 43/124 (34%), Gaps = 21/124 (16%)

Query: 114 GRILETFAKVGDEVKKGQVLFTIDSPDLLSAASSLIATSGVLQLTTRALNRLKTLYES-- 171
+ E K G+ V+KG VL + L + A +L S +LQ R + L S
Sbjct: 105 SIVKEIIVKEGESVRKGDVLLKLT--ALGAEADTLKTQSSLLQARLEQT-RYQILSRSIE 161

Query: 172 ---------------RAVAQKDVEQAISDQQTAEGAHKAARDAVRI-FGKTDAEIDAIIK 215
+ V++++V + S + + + + K AE ++
Sbjct: 162 LNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLA 221

Query: 216 ERRA 219

Sbjct: 222 RINR 225


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_1546  ACRIFLAVINRP  562    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 562 bits (1449), Expect = 0.0
Identities = 218/1082 (20%), Positives = 412/1082 (38%), Gaps = 87/1082 (8%)

Query: 8 FGLTRRAIILLGVLVFICGGLIAFRNLNIEAYPNPAPVILEITAQAPGLSAEEMERYYTI 67
F + R + ++ + G +A L + YP AP + ++A PG A+ ++ T
Sbjct: 4 FFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQ 63

Query: 68 PMEVGLAVTPGVDVIRSTSF-YGLSFVRVTFKYGVEFYFAYTQAALSLQQ-RVNLPNNTQ 125
+E + + + STS G + +TF+ G + A Q LQ LP Q
Sbjct: 64 VIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQ 123

Query: 126 PNIQQSSQAGEILRYQLA---GPPHFGLTNLRTVQDWIVQRRLLTVPGVVQVNSWGGTTK 182
++ P ++ V+ L + GV V +G
Sbjct: 124 QQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQ-Y 182

Query: 183 QYDVEVDLHKLDAYNLTLQQVTTALSNSNINVGGREI-----AVGQQ-SVNIRGVGLFDS 236
+ +D L+ Y LT V L N + ++ GQQ + +I F +
Sbjct: 183 AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKN 242

Query: 237 GGEKDLTQGYKVSDIENVVL-TQMNGVPVQVKDVAKVSVGFVPRLGIAGRDSNDDVVMAI 295
+ V L +G V++KDVA+V +G IA + + I
Sbjct: 243 -----------PEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGI 291

Query: 296 VVMGRTYHTNEVLPRVEAEIAKMNSDGTLPPGVKLVPFYDRGTLISVTTRTVLHNLIFGC 355
+ + + ++A++A++ P G+K++ YD + ++ V+ L
Sbjct: 292 KLATGA-NALDTAKAIKAKLAELQP--FFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAI 348

Query: 356 ALVFLIQWLFLGDLRSAIIVGVNIPFALFFSVIVLVLLGQDANLLSVG--AVDFGIIVDS 413
LVFL+ +LFL ++R+ +I + +P L + +L G N L++ + G++VD
Sbjct: 349 MLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDD 408

Query: 414 AVILVENIFRNFQASDKHKQETLGYLSEGQWGPDPTRPKPGSPNLWTDRLRLILASALQV 473
A+++VEN+ R + E + P S Q+
Sbjct: 409 AIVVVENVER--------------VMMEDKLPP----------------KEATEKSMSQI 438

Query: 474 DKAVFFSAAITVAAFVPLFTMQGVEGQIFNPMARTYGYALVGALISTFTISPVLGSFLL- 532
A+ A + A F+P+ G G I+ + T A+ +++ ++P L + LL
Sbjct: 439 QGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLK 498

Query: 533 ---PEHVTETETIVVRALRAV------YAPALRWALGHRKLVASLGLAMLGVTGLLMMRL 583
EH Y ++ LG + ++ +L +RL
Sbjct: 499 PVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRL 558

Query: 584 GSEFLPHLEEGNLWIRATMPPTIGLMSGEPVAHKAREILLRH--PEITTVVTQHGRPDDG 641
S FLP ++G +P + V + + L++ + +V T +G G
Sbjct: 559 PSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNEKANVESVFTVNGFSFSG 618

Query: 642 SDAAGFNNLELFAPLKPFDQWP-AGLTKDKLTKQLQQEFADEAPGVVFNFSQYIQDNFEE 700
N F LKP+++ + + + + + E G V F+
Sbjct: 619 Q---AQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDGFVIPFNMPAIVELGT 675

Query: 701 QLSGVKGANSAKIVGPDLVTLEELARQVRHEMAQVRGIEDLDVF--WVRGQPNLNIKVDR 758
+ G L + Q+ AQ + V + ++VD+
Sbjct: 676 --ATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPA-SLVSVRPNGLEDTAQFKLEVDQ 732

Query: 759 ERAARYGLNTGDVNTLVQAALGGAHATALLEADRQFNVVVRLPAEYRESLEAVRNIKVGI 818
E+A G++ D+N + ALGG + ++ R + V+ A++R E V + V
Sbjct: 733 EKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRMLPEDVDKLYV-- 790

Query: 819 NTPAAANAYIPLSELADITLDTGSSYIYHESRERYIPVKFSVRDRDLGGAVAEAQERIAE 878
+A +P S GS + + + ++ G E +A
Sbjct: 791 --RSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAMALMENLAS 848

Query: 879 NVKLPPGYRVLWAGEFESLQLAKKRLEVIVPISLAMILVLLYGLFNSLRDSLLALAGIPF 938
KLP G W G +L+ + +V IS ++ + L L+ S + + +P
Sbjct: 849 --KLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPL 906

Query: 939 AVAGGIIALYVTGLDFSISAAIGFVSLFGVSVMDGILMITYYNQARQGVADSV-EAMFQA 997
+ G ++A + + +G ++ G+S + IL++ + + V EA A
Sbjct: 907 GIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMA 966

Query: 998 STQRMRPMLMTAMSACIGLFPAALSEGIGAQVQRPLATVVVGGMLIGPIMLLVVVPALRV 1057
R+RP+LMT+++ +G+ P A+S G G+ Q + V+GGM+ ++ + VP V
Sbjct: 967 VRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFV 1026

Query: 1058 ML 1059
++
Sbjct: 1027 VI 1028



Score = 71.8 bits (176), Expect = 1e-14
Identities = 65/350 (18%), Positives = 137/350 (39%), Gaps = 21/350 (6%)

Query: 724 LARQVRHEMAQVRGIEDLDVFWVRGQPNLNIKVDRERAARYGLNTGDVNTLVQAAL---- 779
+A V+ ++++ G+ D+ +F Q + I +D + +Y L DV ++
Sbjct: 158 VASNVKDTLSRLNGVGDVQLF--GAQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIA 215

Query: 780 GGAHATALLEADRQFNVVVRLPAEYRESLEAVRNIKVGINTPAAANAYIPLSELADITLD 839
G +Q N + ++ + E + + +N+ + + L ++A + L
Sbjct: 216 AGQLGGTPALPGQQLNASIIAQTRFK-NPEEFGKVTLRVNSDGSV---VRLKDVARVELG 271

Query: 840 TGSSYIYHESRERYIPVKFSVRDRDLGGAVAEAQ---ERIAE-NVKLPPGYRVLWAGEFE 895
G +Y ++ A+ A+ ++AE P G +VL+ ++
Sbjct: 272 -GENYNVIARINGKPAAGLGIKLATGANALDTAKAIKAKLAELQPFFPQGMKVLYP--YD 328

Query: 896 SLQLAKKRLE-VIVPISLAMILVLL--YGLFNSLRDSLLALAGIPFAVAGGIIALYVTGL 952
+ + + V+ + A++LV L Y ++R +L+ +P + G L G
Sbjct: 329 TTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGY 388

Query: 953 DFSISAAIGFVSLFGVSVMDGILMI-TYYNQARQGVADSVEAMFQASTQRMRPMLMTAMS 1011
+ G V G+ V D I+++ + EA ++ +Q ++ AM
Sbjct: 389 SINTLTMFGMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMV 448

Query: 1012 ACIGLFPAALSEGIGAQVQRPLATVVVGGMLIGPIMLLVVVPALRVMLLG 1061
P A G + R + +V M + ++ L++ PAL LL
Sbjct: 449 LSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLK 498


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
BBta_1549  HTHFIS  75     3e-18    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 75.3 bits (185), Expect = 3e-18
Identities = 34/123 (27%), Positives = 57/123 (46%), Gaps = 3/123 (2%)

Query: 7 KILCIEDDRETAALIVEELTERGFDVTLAYDGGEGFAAIFRTMPDLVLCDINMRVMSGFE 66
IL +DD ++ + L+ G+DV + + + I DLV+ D+ M + F+
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 67 VLEHLTKIAPRFNNMPFIFLTALTDRRNELKGRQLGADDYVTKPIDFDILVSIINARLAH 126
+L + K P +P + ++A +K + GA DY+ KP D L+ II LA
Sbjct: 65 LLPRIKKARPD---LPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 127 VAR 129
R
Sbjct: 122 PKR 124


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
BBta_1552  OMPADOMAIN  31     0.009    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 31.4 bits (71), Expect = 0.009
Identities = 20/96 (20%), Positives = 32/96 (33%), Gaps = 12/96 (12%)

Query: 80 IYATAGLAYSQGRLTEDPFDPATQKKIGFRAGWVAGAGVEAPVDGNWTARIEYLY-SNFG 138
IY G + D K V GVE + R+EY + +N G
Sbjct: 114 IYTRLGGMVWRA----DTKSNVYGKNHDTGVSPVFAGGVEYAITPEIATRLEYQWTNNIG 169

Query: 139 SLNVMLPSGTSYGSTFDLHTVRLGLNRKLGGSAKEP 174
+ + G+ D + LG++ + G P
Sbjct: 170 DAH-------TIGTRPDNGMLSLGVSYRFGQGEAAP 198


80  BBta_1618  BBta_1628  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BBta_1618  -1      13       1.707459   AcrB/AcrD/AcrF family mulitdrug efflux protein
BBta_1619  0       11       2.417596   AcrB/AcrD/AcrF family mulitdrug efflux protein
BBta_1620  2       12       1.995867   TetR family transcriptional regulator
BBta_1621  0       9        1.091182   hypothetical protein
BBta_1623  -1      10       0.894592   transport protein permease
BBta_1624  -2      12       -0.014422  HlyD family secretion protein
BBta_1626  -2      13       -1.799948  UDP-galactose 4-epimerase
BBta_1627  -3      14       -1.810412  hypothetical protein
BBta_1628  -2      15       -1.040065  Signal transduction histidine kinase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_1618  ACRIFLAVINRP  1068   0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 1068 bits (2763), Expect = 0.0
Identities = 419/1042 (40%), Positives = 654/1042 (62%), Gaps = 16/1042 (1%)

Query: 3 LSKFFIDRPIFAGVLSTLIFLGGLIALFAMPISEYPDVVPPSVVVRATYPGANPKVIAET 62
++ FFI RPIFA VL+ ++ + G +A+ +P+++YP + PP+V V A YPGA+ + + +T
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 63 VATPIEEQINGVENMLYMSSQATTDGAMTLTVTFKLGTDPDKATQLVQNRVQQAEPRLPN 122
V IE+ +NG++N++YMSS + + G++T+T+TF+ GTDPD A VQN++Q A P LP
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 123 VVRQLGVITKKSSPDLTMVVHLISPNGRYDTTYLRNYAVLNVKDRLARIDGVGDVQLFGA 182
V+Q G+ +KSS MV +S N + +Y NVKD L+R++GVGDVQLFG
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFG- 179

Query: 183 GDYSMRVWVDPQKAAEHGLSASDIVKAIQAQNVEAAAGVVGASPSVKGLDLQLSVNAEGR 242
Y+MR+W+D ++ L+ D++ ++ QN + AAG +G +P++ G L S+ A+ R
Sbjct: 180 AQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239

Query: 243 LSNEDQFADIVVKTGAHGEITRLRDVARIELGASEYGLRSLLDNKQAVAIPIFQAPGSNA 302
N ++F + ++ + G + RL+DVAR+ELG Y + + ++ K A + I A G+NA
Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299

Query: 303 LQISDNVRATMAEIAKTMPEGVEYHIVYDPTQFVRSSIEAVVHTLFEAIVLVVIVVILFL 362
L + ++A +AE+ P+G++ YD T FV+ SI VV TLFEAI+LV +V+ LFL
Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359

Query: 363 QTWRASIIPLLAVPVSIVGTFAVMHLFGFSINALSLFGLVLAIGIVVDDAIVVVENVER- 421
Q RA++IP +AVPV ++GTFA++ FG+SIN L++FG+VLAIG++VDDAIVVVENVER
Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419

Query: 422 NIESGLSPRDATYQAMREVSGPIIAIALVLVAVFVPLAFISGLTGQFYKQFALTIAISTV 481
+E L P++AT ++M ++ G ++ IA+VL AVF+P+AF G TG Y+QF++TI +
Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479

Query: 482 ISAVNSLTLSPALSALLLKGHHDPKDRLTRFLDRALGWFFRGFNRTFVRASDNYSGSVSK 541
+S + +L L+PAL A LLK ++ G FF FN TF + ++Y+ SV K
Sbjct: 480 LSVLVALILTPALCATLLKP-------VSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGK 532

Query: 542 VITGKAAVMILYVLLIGATGLLFKQVPGGFVPGQDKQYLVGFARLPDGATLDRSEEVIRK 601
++ +++Y L++ +LF ++P F+P +D+ + +LP GAT +R+++V+ +
Sbjct: 533 ILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQ 592

Query: 602 MSDIALT--QPGVESSIAFPGLSISGFTNSSNAGIVFSGLKPFDERKDPSLSGGAIALQL 659
++D L + VES G S SG + NAG+ F LKP++ER S A+ +
Sbjct: 593 VTDYYLKNEKANVESVFTVNGFSFSG--QAQNAGMAFVSLKPWEERNGDENSAEAVIHRA 650

Query: 660 NKKYAGIQDAFIAMFPPPPVNGLGTIGGFKLQIEDRAGRGYEALNDVTAAFMGALQKSP- 718
+ I+D F+ F P + LGT GF ++ D+AG G++AL +G + P
Sbjct: 651 KMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPA 710

Query: 719 EIAGAFSSFQVNVPQLFADIDRTKALQLGVPVTEVFNTLQIYLGSYYVNDFNKFGRTYSV 778
+ + + Q ++D+ KA LGV ++++ T+ LG YVNDF GR +
Sbjct: 711 SLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKL 770

Query: 779 RVQADAPFRARADDIRQLKVRSSSGEMVPLSALLNVRQSAGPERAIRYNGFLSSDINAAA 838
VQADA FR +D+ +L VRS++GEMVP SA G R RYNG S +I A
Sbjct: 771 YVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEA 830

Query: 839 APGFSSGQAQAAAEKIAAEVLPPGFAFEWTDLTYQEFIAGNSGLWVFPLAILLVFLVLAA 898
APG SSG A A E +A++ LP G ++WT ++YQE ++GN + ++ ++VFL LAA
Sbjct: 831 APGTSSGDAMALMENLASK-LPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAA 889

Query: 899 LYESLTLPLAILMIVPMALLAAMAGVYISKGDNNIFTQIGLIVLVGLSAKNAILIVEFAR 958
LYES ++P++++++VP+ ++ + + N+++ +GL+ +GLSAKNAILIVEFA+
Sbjct: 890 LYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAK 949

Query: 959 EL-EFAGRSPLRAAIEASRLRLRPILMTSMAFIMGVLPLVLSTGAGSEMRRAMGVAVFSG 1017
+L E G+ + A + A R+RLRPILMTS+AFI+GVLPL +S GAGS + A+G+ V G
Sbjct: 950 DLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGG 1009

Query: 1018 MIGVTVFGLFLTPVFYVLLRTL 1039
M+ T+ +F PVF+V++R
Sbjct: 1010 MVSATLLAIFFVPVFFVVIRRC 1031



Score = 79.1 bits (195), Expect = 7e-17
Identities = 67/326 (20%), Positives = 122/326 (37%), Gaps = 16/326 (4%)

Query: 738 IDRTKALQLGVPVTEVFNTL-----QIYLGSYYVNDFNKFGRTYSVRVQADAPFRARADD 792
+D + + +V N L QI G G+ + + A F+ ++
Sbjct: 188 LDADLLNKYKLTPVDVINQLKVQNDQIAAGQL-GGTPALPGQQLNASIIAQTRFKN-PEE 245

Query: 793 IRQLKVRSS-SGEMVPLSALLNVRQSAGPER-AIRYNGFLSSDINAAAAPGFSSGQ-AQA 849
++ +R + G +V L + V R NG ++ + A G ++ A+A
Sbjct: 246 FGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALDTAKA 305

Query: 850 AAEKIA--AEVLPPGFAFEWTDLTYQEFIAGNSGLWVFPL--AILLVFLVLAALYESLTL 905
K+A P G + F+ + V L AI+LVFLV+ +++
Sbjct: 306 IKAKLAELQPFFPQGMKVLYP-YDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNMRA 364

Query: 906 PLAILMIVPMALLAAMAGVYISKGDNNIFTQIGLIVLVGLSAKNAILIVE-FARELEFAG 964
L + VP+ LL A + N T G+++ +GL +AI++VE R +
Sbjct: 365 TLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMMEDK 424

Query: 965 RSPLRAAIEASRLRLRPILMTSMAFIMGVLPLVLSTGAGSEMRRAMGVAVFSGMIGVTVF 1024
P A ++ ++ +M +P+ G+ + R + + S M +
Sbjct: 425 LPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLV 484

Query: 1025 GLFLTPVFYVLLRTLTGMKPLTQHGG 1050
L LTP L + GG
Sbjct: 485 ALILTPALCATLLKPVSAEHHENKGG 510


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
BBta_1619  RTXTOXIND  46     2e-07    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 45.6 bits (108), Expect = 2e-07
Identities = 23/97 (23%), Positives = 39/97 (40%), Gaps = 10/97 (10%)

Query: 96 AEVDRAQAQLEAARARAAFAANELERGAQLVGNSIVTKRDYDQRDNGNREAIANVKAAEA 155
E+ ++QLE + A E + QL N I+ K R+ N+
Sbjct: 266 NELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKL---------RQTTDNIGLLTL 316

Query: 156 SLQTAKLNLDYTQVRAPVDGRVGRIEV-TVGNLVAAG 191
L + + +RAPV +V +++V T G +V
Sbjct: 317 ELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTA 353



Score = 39.8 bits (93), Expect = 1e-05
Identities = 25/129 (19%), Positives = 49/129 (37%), Gaps = 5/129 (3%)

Query: 58 RVEVRPRVAGAILSANFTEGSLVKAGDVLFKIDPAPYAAEVDRAQ-----AQLEAARARA 112
E++P + EG V+ GDVL K+ A+ + Q A+LE R +
Sbjct: 96 SKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQI 155

Query: 113 AFAANELERGAQLVGNSIVTKRDYDQRDNGNREAIANVKAAEASLQTAKLNLDYTQVRAP 172
+ EL + +L ++ + + ++ + + Q + L+ + RA
Sbjct: 156 LSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAE 215

Query: 173 VDGRVGRIE 181
+ RI
Sbjct: 216 RLTVLARIN 224


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BBta_1620  HTHTETR  60     3e-13    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 59.6 bits (144), Expect = 3e-13
Identities = 25/200 (12%), Positives = 55/200 (27%), Gaps = 5/200 (2%)

Query: 10 DRDTALDQAMEVFWRHGYEGATIAQLTEAMGINPPSLYAAFGSKEALLKAALDRYTARRE 69
R LD A+ +F + G ++ ++ +A G+ ++Y F K L + +
Sbjct: 12 TRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIG 71

Query: 70 AWMEEVLSAPTAREVTARMLMGVAEKQTDPSNPPGCLLVQGGLACGSGSANVPFELAARR 129
E A + + + + L+ + + +
Sbjct: 72 ELELE-YQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQ 130

Query: 130 AQTEEQLRHRF----ERAKAEGDLAEAADPAALARYVSAVITGMSVMASSGADREALAQV 185
+ R + L A + I+G+ L +
Sbjct: 131 RNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDLKKE 190

Query: 186 AEVAIRSVEEQSVRAPIAAN 205
A + + E + P N
Sbjct: 191 ARDYVAILLEMYLLCPTLRN 210


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BBta_1623  TCRTETB  53     2e-09    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 52.6 bits (126), Expect = 2e-09
Identities = 64/408 (15%), Positives = 151/408 (37%), Gaps = 26/408 (6%)

Query: 35 SFLANFDSRLTSVGLPDLRGAFSLGFDEGAWLSTA-----AIGSQILIAPAVAWLATVFG 89
SF + + + +V LPD+ F+ W++TA +IG+ + L+ G
Sbjct: 23 SFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGK-----LSDQLG 77

Query: 90 LRRVLGVPSLVYAALSLIIPFVRDYPTLIVLAILHGMLLGTFVPATLMIIF-RNLPIRWW 148
++R+L ++ S+I + +L+++A PA +M++ R +P
Sbjct: 78 IKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENR 137

Query: 149 LPAISIYAIRVGFALDTSTSLVGFYVEHLGWQWLYWQSVVLAPLMALMVYLGTPAEPVNR 208
A + V ++ G ++ W +L ++ + ++ L +
Sbjct: 138 GKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVRIKG 197

Query: 209 TLLRNADWGGMLLLGSAVSMIYAGLDQGNRLDWLQSGTVVALLGGGAVLFVLFLVNEALS 268
D G++L+ + + + S + F++F+ +
Sbjct: 198 HF----DIKGIILMSVGIVFFMLFTTSYSISFLIVS----------VLSFLIFVKHIRKV 243

Query: 269 PQPWAHFNVLFSRNIGLSLAVILLYTLTSLSNASLVPNFLATITQLRPEQSGSLLLVYGA 328
P+ + + + + + T S+VP + + QL + GS+++ G
Sbjct: 244 TDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGT 303

Query: 329 LPLIVLVPLSIWMLRHLDPRIVVVLGLGAFAAANLLGTQLTHAWAREDFIVIVLLQSFGQ 388
+ +I+ + ++ P V+ +G+ F + + L +I++ G
Sbjct: 304 MSVIIFGYIGGILVDRRGPLYVLNIGV-TFLSVSFLTASFLLETTSWFMTIIIVFVLGGL 362

Query: 389 AFTLLPIIILALSNSDPARATSFAAYIQIMRLGGAEIGVALMGTWLRV 436
+FT I + S+ A + + + G+A++G L +
Sbjct: 363 SFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSI 410


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
BBta_1624  RTXTOXIND  103    2e-26    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 103 bits (258), Expect = 2e-26
Identities = 76/450 (16%), Positives = 140/450 (31%), Gaps = 93/450 (20%)

Query: 6 SNVDAASRTASTPPSSAAPTAGLWARLAI--------PLLAVIVALAFVALATLRWNAWT 57
S + TP L A L + P L + F+ +A + +
Sbjct: 20 SETWKIRKQLDTPVREKDENEFLPAHLELIETPVSRRPRLVAYFIMGFLVIAFI-LSVLG 78

Query: 58 GTATTQTTNDAYVRADLTQ-LSSRVAGEVLKVAVSDFQRVKAGDLLIQIDP----ADY-- 110
T N + ++ + V ++ V + + V+ GD+L+++ AD
Sbjct: 79 QVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLK 138

Query: 111 ------------------------------------------EAQVSQAEATVEAAQAAL 128
E +V + + ++ +
Sbjct: 139 TQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTW 198

Query: 129 DNLSNQIELQYAT-IAQAEAQQVSAGAAEVQARQEEERQ---QSLSQSEAGTRQRLEQAT 184
N Q EL A+ E +R E+ R SL +A + + +
Sbjct: 199 QNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQE 258

Query: 185 AAYAKAQADVRASRAVIAAQRHQL----------------EVLTGTKKQRGADLAGAKAA 228
Y +A ++R ++ + ++ E+L +Q ++
Sbjct: 259 NKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILD-KLRQTTDNIGLLTLE 317

Query: 229 LAAARLKLGYTRITAPFDGVVGQRQVQA-GDYVNVGSSLIAVVPLPQVFIV-ANYKETQL 286
LA + + I AP V Q +V G V +L+ +VP V A + +
Sbjct: 318 LAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDI 377

Query: 287 THVAPGQPVDITVDTFPGE---QMHGRVERIAPASGSQFALLPPDNATGNFTKVVQRIPV 343
+ GQ I V+ FP + G+V+ I + D G V+ I
Sbjct: 378 GFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDA-------IEDQRLGLVFNVIISIEE 430

Query: 344 RIALDPNQPLLARLLPGMSVVTHIHTAGRT 373
N+ + L GM+V I T R+
Sbjct: 431 NCLSTGNKNI--PLSSGMAVTAEIKTGMRS 458


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_1626  NUCEPIMERASE  148    5e-44    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 148 bits (374), Expect = 5e-44
Identities = 77/345 (22%), Positives = 122/345 (35%), Gaps = 72/345 (20%)

Query: 7 ILIAGGAGYIGAHCSKAVADAGFTPICYDNLT--------------LGHRSFVQWGPLVV 52
L+ G AG+IG H SK + +AG + DNL L F
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQ----FHK 58

Query: 53 GDIADSIKVASTIRQYDVQAVMHFAASSAVGESVADPQKYYLNNVAGTLGLLQGMREAGC 112
D+AD + + V AV S+ +P Y +N+ G L +L+G R
Sbjct: 59 IDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKI 118

Query: 113 TRLVFSSTGAVYGNAGREPIPESAAGPT---VNPYGRSKYMIEQILSDYRAAYGFSAIAL 169
L+++S+ +VYG +P S V+ Y +K E + Y YG A L
Sbjct: 119 QHLLYASSSSVYG--LNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGL 176

Query: 170 RYFNACGADASGAIGELRDPETHLIPRALMAILGHVPDFAIFG-----TDYETP----DG 220
R+F G P PD A+F + ++ G
Sbjct: 177 RFFTVYG------------------PWGR-------PDMALFKFTKAMLEGKSIDVYNYG 211

Query: 221 TAVRDYIHVDDLAAAHIAAIGRLMEGHRGGA---------------YNLGTGSGYSVREI 265
RD+ ++DD+A A I + YN+G S + +
Sbjct: 212 KMKRDFTYIDDIAEAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDY 271

Query: 266 VDAIRNETGEQVPLVYRERRPGDPPVLVADPRRAEQELGFKPRRS 310
+ A+ + G + +PGD AD + + +GF P +
Sbjct: 272 IQALEDALGIEAKKNMLPLQPGDVLETSADTKALYEVIGFTPETT 316


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BBta_1628  PF06580  46     1e-07    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 46.0 bits (109), Expect = 1e-07
Identities = 40/242 (16%), Positives = 83/242 (34%), Gaps = 40/242 (16%)

Query: 174 FLTRAADIVGIAIERLHEEARLREALDHQAMLTREMS--------HRVKNSLASVVAMLR 225
+ G + +++A + + ++ H + N+L ++ A++
Sbjct: 128 TFMWSLLYFGWHFFKNYKQAEIDQWKMASMAQEAQLMALKAQINPHFMFNALNNIRALIL 187

Query: 226 VQAFGAGSVEAK------QALGEAGSRVAAIAQVHDHLSRASRIGSIDVDVFLTDFCKRL 279
A + +L + +R ++A + ++ SI F RL
Sbjct: 188 EDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQ-------FEDRL 240

Query: 280 QLVAGAHVLRCDADPIRLPADLAVPLGLLINELVSNAVKHAYPG--QDGPIDISARDVGG 337
Q + + D+ VP +L+ LV N +KH Q G I + G
Sbjct: 241 QFE-----NQINPAI----MDVQVP-PMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNG 290

Query: 338 CLQLSVADQGVGLPAGFDIDQTRTSLGFRMINGMVQQLHGC---LKVSANQPTGTRVLFE 394
+ L V + G + T G + + +Q L+G +K+S + +
Sbjct: 291 TVTLEVENTG---SLALKNTKESTGTGLQNVRERLQMLYGTEAQIKLS-EKQGKVNAMVL 346

Query: 395 LP 396
+P
Sbjct: 347 IP 348


81  BBta_1799  BBta_1806  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias   Product
BBta_1799  -1      9        0.536575  glutathione S-transferase
BBta_1800  -2      9        0.697717  hypothetical protein
BBta_1801  -3      10       0.781429  aspartate aminotransferase
BBta_1802  -1      8        1.926286  hypothetical protein
BBta_1803  0       9        2.265719  hypothetical protein
BBta_1805  -2      9        1.579164  elongation factor G
BBta_1806  -1      10       1.065561  two-component sensor histidine kinase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_1799  FIMREGULATRY  28     0.012    Escherichia coli: P pili regulatory PapB protein si...
		>FIMREGULATRY#Escherichia coli: P pili regulatory PapB protein

signature.
Length = 104

Score = 28.0 bits (62), Expect = 0.012
Identities = 22/90 (24%), Positives = 41/90 (45%), Gaps = 17/90 (18%)

Query: 99 QHALEP-NIGAAYFWLLL----VKGGRDLQTHALEDWMERGYAALQVMENHLKTNDYF-- 151
+ L P ++ +F+LL+ + R + A++D++ G++ +V E + N YF
Sbjct: 20 ESVLLPGSMSEMHFFLLIGISSIHSDRVIL--AMKDYLVGGHSRKEVCEKYQMNNGYFST 77

Query: 152 AARQL-----TVADIALYGYTHVADRCDFD 176
+L A +A Y YT + FD
Sbjct: 78 TLGRLIRLNALAARLAPY-YTDESS--AFD 104


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_1801  ARGDEIMINASE  29     0.035    Bacterial arginine deiminase signature.
		>ARGDEIMINASE#Bacterial arginine deiminase signature.

Length = 409

Score = 29.0 bits (65), Expect = 0.035
Identities = 14/56 (25%), Positives = 28/56 (50%), Gaps = 2/56 (3%)

Query: 185 AELKALTEVLVKHP--HVWIMTDDMYEHLVYDDFVFTTPAQVEPSLFDRTLTVNGV 238
+E+ L +VL+ P + +T + ++ ++DD + A+ E +F L N V
Sbjct: 12 SEIGRLKKVLLHRPGEELENLTPFIMKNFLFDDIPYLEVARQEHEVFASILKNNLV 67


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
BBta_1805  TCRTETOQM  308    3e-98    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family

signature.
Length = 639

Score = 308 bits (790), Expect = 3e-98
Identities = 129/641 (20%), Positives = 254/641 (39%), Gaps = 45/641 (7%)

Query: 2 EAILARTGAIPRAGSVDAGTSVGDASAEARHHKMSVAMTAATTTFMGDSYTFLDCPGSIE 61
E++L +GAI GSVD GT+ D + R +++ + + +D PG ++
Sbjct: 21 ESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITSFQWENTKVNIIDTPGHMD 80

Query: 62 FAHDMRAALPAVDAAIVVCEADEKKLPQLQIILRELEELRIPRFLFLNKIDRANARIRET 121
F ++ +L +D AI++ A + Q +I+ L ++ IP F+NKID+ +
Sbjct: 81 FLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGIPTIFFINKIDQNGIDL--- 137

Query: 122 LATLQPASRVPLVLRQIPIWNGELIEGFVDLALERAFIYREHKASEVIALEGGNLDREKE 181
V + I E + + + ++ +Y + E
Sbjct: 138 ----------STVYQDI----KEKLSAEI-VIKQKVELYPNMCVTNFTESE--------- 173

Query: 182 ARFSMLEKLADHDDSLMEQLLEDIPPPRDAVFDDLARELRDGLICPVLLGSALRENGVLR 241
+ + + +D L+E+ + + + + + + PV GSA G+
Sbjct: 174 ----QWDTVIEGNDDLLEKYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNNIGIDN 229

Query: 242 LMKALRHEAPGIAETAKRLGVTETKEALGYVFKTLHLQHGGKLSLTRVLSGHLDDGATLH 301
L++ + ++ + E G VFK + + +L+ R+ SG L ++
Sbjct: 230 LIEVITNKFYSSTHRGQ-------SELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVR 282

Query: 302 AASGEAARVSGILAATGAYESKRAAAEAGDTVALGKLDVVKTGDTIATGKTAPQALVKID 361
+ E +++ + + K A +G+ V L + + +K + K PQ +I+
Sbjct: 283 ISEKEKIKITEMYTSINGELCKIDKAYSGEIVIL-QNEFLKLNSVLGDTKLLPQR-ERIE 340

Query: 362 PCPPVLALSIAALDRKDDVKLGQALQRLHEEDPSLTMVQNPRTHDTVLWGQGEMHLRVAL 421
P+L ++ + L AL + + DP L + TH+ +L G++ + V
Sbjct: 341 NPLPLLQTTVEPSKPQQREMLLDALLEISDSDPLLRYYVDSATHEIILSFLGKVQMEVTC 400

Query: 422 ERLRDRFGVHVKSQAPAIGYQETIRKPITQRGRHKKQSGGHGQFGDVVLDIKPLPRGEGF 481
L++++ V ++ + P + Y E K + + + + L + PLP G G
Sbjct: 401 ALLQEKYHVEIEIKEPTVIYMERPLK--KAEYTIHIEVPPNPFWASIGLSVSPLPLGSGM 458

Query: 482 QFTEKVVGGAVPRNYIPAVEEGVVDALARGPLGFPVIDVAVTLTDGSYHSVDSSDLAFRT 541
Q+ V G + +++ AV EG+ +G G+ V D + G Y+S S+ FR
Sbjct: 459 QYESSVSLGYLNQSFQNAVMEGIRYGCEQGLYGWNVTDCKICFKYGLYYSPVSTPADFRM 518

Query: 542 AARVGVSEGLPQCQPVLLEPIHTVEIVCPTEATARINAILSARRGQILSFDTRDGWPGWD 601
A + + + L + LLEP + +I P E +R I+ ++
Sbjct: 519 LAPIVLEQVLKKAGTELLEPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQLKNNEV--- 575

Query: 602 CVRAMMPEAEIGELIVELRSATAGAGSFTRQFDHMAEVTGR 642
+ +P I E +L T G + TG
Sbjct: 576 ILSGEIPARCIQEYRSDLTFFTNGRSVCLTELKGYHVTTGE 616


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BBta_1806  PF06580  39     1e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 39.5 bits (92), Expect = 1e-05
Identities = 51/329 (15%), Positives = 104/329 (31%), Gaps = 46/329 (13%)

Query: 25 ALLGYAIAIGATFIAFVLRLALVDTLPDGFPYLTFFPAVILTAFFCGFGPGVLCAVLSGL 84
+L+G + + +F+ R + L G L PA ++ + +L+ +
Sbjct: 49 SLMGLVLTHA--YRSFIKRQGWL-KLNMGQIILRVLPACVVIGMVWFVANTSIWRLLAFI 105

Query: 85 LAWSVFIPGSGYQTALALGFYAIIVAVDITL----IHIMHEAAQRLRQERAVSESLYDNQ 140
V AL++ F ++V +L H Q + ++ + Q
Sbjct: 106 NTKPVAF---TLPLALSIIFNVVVVTFMWSLLYFGWHFFKNYKQAEIDQWKMASMAQEAQ 162

Query: 141 RVLFQELQHRVANNIQFIAGLLMMQKRQAIADPSRAIGILDEAQARLQTISRIHRMLHDP 200
L+ ++ N F+ L + + DP++A +L L + R +
Sbjct: 163 ---LMALKAQI--NPHFMFNALNNIRALILEDPTKAREMLT----SLSELMR-----YSL 208

Query: 201 GRMDVDIGPYLQEI--CNDVLDSSGAR-----EVTCVVDFVPAKLDLTRLTALSLLVVEL 253
+ E+ + L + + + ++ + + +LV L
Sbjct: 209 RYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINP-----AIMDVQVPPMLVQTL 263

Query: 254 VTNALKHAFGP-GQAGTIRVNMRPLDATHYALTISDDGQGMSADADPGAGDSLG-----W 307
V N +KH Q G I + D L + + G + G L
Sbjct: 264 VENGIKHGIAQLPQGGKILLKGT-KDNGTVTLEVENTGSLALKNTKESTGTGLQNVRERL 322

Query: 308 RICQGLAAQLNGRLTYSSDGGTTVRLEFP 336
++ G AQ+ G + P
Sbjct: 323 QMLYGTEAQIK---LSEKQGKVNAMVLIP 348


82  BBta_1872  BBta_1880  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BBta_1872  0       16       -0.613672  two component, sigma54 specific, Fis family
BBta_1873  3       14       -0.038437  flagellar motor switch protein
BBta_1874  2       13       0.637561   flagellar assembly protein H
BBta_1875  1       12       0.492940   flagellar motor switch protein G
BBta_1876  1       11       1.341277   flagellar MS-ring protein
BBta_1878  1       10       1.573318   flagellar basal-body rod modification protein
BBta_1880  0       13       1.476514   flagellar hook length determination protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_1872  HTHFIS        423    e-147    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 423 bits (1088), Expect = e-147
Identities = 161/471 (34%), Positives = 243/471 (51%), Gaps = 37/471 (7%)

Query: 2 RLLIVGTLKGQLTAATKIAMENGATVTHAEDNEQAMRVLRGGKGADLLLVDVAL---DIR 58
+L+ T + G V + R + G G DL++ DV + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDG-DLVVTDVVMPDENAF 63

Query: 59 DLVMRLDAEHIHVPIVACGITSDARAAVAAIHAGAKEYIPLPPDPELIAAV--------- 109
DL+ R+ +P++ + A+ A GA +Y+P P D + +
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 110 -----LAAVANDSRDLVYRDEAMAKVIKLAQQIAGSDASVMITGESGTGKEVLARYVHSR 164
L + D LV R AM ++ ++ ++ +D ++MITGESGTGKE++AR +H
Sbjct: 124 RRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDY 183

Query: 165 SLRAKRSFISINCAAIPEHLLESELFGHEKGAFTGAVARRVGKFEEANGGTLLLDEISEM 224
R F++IN AAIP L+ESELFGHEKGAFTGA R G+FE+A GGTL LDEI +M
Sbjct: 184 GKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDM 243

Query: 225 DVRLQSKLLRAIQERVIDRVGGTRPVPVDIRIIATSNRNLADAVREGTFREDLLFRLNVV 284
+ Q++LLR +Q+ VGG P+ D+RI+A +N++L ++ +G FREDL +RLNVV
Sbjct: 244 PMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVV 303

Query: 285 NLKIPPLRERPADILELAQHFARKYSEANAVPLRPISAEAKRVLTTNRWPGNVRELENTL 344
L++PPLR+R DI +L +HF ++ + R EA ++ + WPGNVRELEN +
Sbjct: 304 PLRLPPLRDRAEDIPDLVRHFVQQAEKEGLDVKR-FDQEALELMKAHPWPGNVRELENLV 362

Query: 345 HRAVLMAQGDEIGPDAILTPDGARLDIGKTQPAVA-----HATMAAEQVTR--------- 390
R + D I + I + + + A A + A E+ R
Sbjct: 363 RRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDA 422

Query: 391 ----ALVGRTVADVERDLILETLKHCLGNRTHAANILGISIRTLRNKLNEY 437
L R +A++E LIL L GN+ AA++LG++ TLR K+ E
Sbjct: 423 LPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIREL 473


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_1873  FLGMOTORFLIN  92     2e-27    Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 92.3 bits (229), Expect = 2e-27
Identities = 35/76 (46%), Positives = 55/76 (72%)

Query: 36 ADLEAVFDVPVQVSAVLGRSKMDVGELLKLGPGTVLELDRKVGEAIDIYVNNRLVARGEV 95
D++ + D+PV+++ LGR++M + ELL+L G+V+ LD GE +DI +N L+A+GEV
Sbjct: 52 QDIDLIMDIPVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEV 111

Query: 96 VLVEDKLGVTMTEIIK 111
V+V DK GV +T+II
Sbjct: 112 VVVADKYGVRITDIIT 127


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_1875  FLGMOTORFLIG  302    e-103    Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 302 bits (776), Expect = e-103
Identities = 111/340 (32%), Positives = 196/340 (57%), Gaps = 2/340 (0%)

Query: 24 RQAQREKTEPLSGPRRAAVMMLALGEQYGGKIWQQLDDDEVRELSLAMSTLGTVEADVVE 83
++ + L+G ++AA++++++G + K+++ L +E+ L+ ++ L T+ +++ +
Sbjct: 5 KEKEILDVSALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITSELKD 64

Query: 84 DLMLEFVSRMSASGALM-GNFDATERLLQQYLPPERVNGIMDEIRGPAGRNMWEKLSNVQ 142
+++LEF M A + G D LL++ L ++ I++ + +E +
Sbjct: 65 NVLLEFKELMMAQEFIQKGGIDYARELLEKSLGTQKAVDIINNLGSALQSRPFEFVRRAD 124

Query: 143 EEVLANYLKNEYPQTIAVVLSKLKPEHAARVLAILPEDMALDVIGRMLRMEAVQKEVIER 202
+ N+++ E+PQTIA++LS L P+ A+ +L+ LP ++ +V R+ M+ EV+
Sbjct: 125 PANILNFIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEVVRE 184

Query: 203 VEQTLRVEFMSNLSQTRR-RDAHEVMAEIFNNFDRQTETRFITSLEEENRESAERIKALM 261
VE+ L + S S+ + + EI N DR+TE I SLEEE+ E AE IK M
Sbjct: 185 VERVLEKKLASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEIKKKM 244

Query: 262 FTFDDLIKLDSASAQTLLRNVDKDKLGVALKSANEEVRNFFFGNMSSRAAKMLQDDMAAM 321
F F+D++ LD S Q +LR +D +L ALKS + V+ F NMS RAA ML++DM +
Sbjct: 245 FVFEDIVLLDDRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKEDMEFL 304

Query: 322 GPVRLRDVDEAQALLVNLAKDLAAKGEIMLSKNRADDELV 361
GP R +DV+E+Q +V+L + L +GEI++S+ +D LV
Sbjct: 305 GPTRRKDVEESQQKIVSLIRKLEEQGEIVISRGGEEDVLV 344


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_1876  FLGMRINGFLIF  340    e-113    Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 340 bits (873), Expect = e-113
Identities = 169/557 (30%), Positives = 262/557 (47%), Gaps = 47/557 (8%)

Query: 5 LDFLKGLGAARLTAMIAVTAALIGFFGFVIMRVTAPQMTTLFTDLGMDDSSSIIKDLERQ 64
L++L L A +I +A + +++ P TLF++L D +I+ L +
Sbjct: 13 LEWLNRLRANPRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQDGGAIVAQLTQM 72

Query: 65 GIPFEIRNEGSVILVPKDKVTRLRMKLAEGGLPKGGGVGYEIFDKSDALGTTSFVQNINH 124
IP+ N I VP DKV LR++LA+ GLPKGG VG+E+ D+ G + F + +N+
Sbjct: 73 NIPYRFANGSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQE-KFGISQFSEQVNY 131

Query: 125 LRALEGELARTIRAIDRIQAARVHLVLPERPLFSRETPEPSASIVVRVRG--ALDAAQIR 182
RALEGELARTI + +++ARVHL +P+ LF RE PSAS+ V + ALD QI
Sbjct: 132 QRALEGELARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRALDEGQIS 191

Query: 183 AIRHLVASAVNGLKPQRVSIVDEAGQLLA---DGTQTDIDQQVGDERRNTFEKRMRKQVE 239
A+ HLV+SAV GL P V++VD++G LL + D Q+ E R+++++E
Sbjct: 192 AVVHLVSSAVAGLPPGNVTLVDQSGHLLTQSNTSGRDLNDAQLKFAND--VESRIQRRIE 249

Query: 240 DIVSSVVGAGRARVQLSADFDFNRITQTSDRYDPEGRVLRSSQTREEQSASSE------- 292
I+S +VG G Q++A DF QT + Y P G +++ + + S +
Sbjct: 250 AILSPIVGNGNVHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNISEQVGAGYPG 309

Query: 293 ------SNGQVTVNNE----LPGNQQNQQQ---------QQPPRDQSKKTEETNNYEISR 333
SN N P NQQN Q +S + ET+NYE+ R
Sbjct: 310 GVPGALSNQPAPPNEAPIATPPTNQQNAQNTPQTSTSTNSNSAGPRSTQRNETSNYEVDR 369

Query: 334 TTKTEVTEAGRVNRISVAVLVDGAYSKNEKGELVYKERSKEELDRIAALVRSAIGFDQKR 393
T + G + R+SVAV+V+ + K + +++ +I L R A+GF KR
Sbjct: 370 TIRHTKMNVGDIERLSVAVVVNYKTLADGKPL----PLTADQMKQIEDLTREAMGFSDKR 425

Query: 394 GDQVEVVNLKFAD-APTVPQINEPSGFLGMLQFTKDDVMYVIELAVMMLLGIVVVFMVVR 452
GD + VVN F+ T ++ F F + L V+++ I+ VR
Sbjct: 426 GDTLNVVNSPFSAVDNTGGELP----FWQQQSFIDQLLAAGRWLLVLVVAWILWRK-AVR 480

Query: 453 PLVKKIIASDEVAAALKSAVPALTDETAQAQAHAQQTATLIDVAQVQGQVHAQSVHRVGE 512
P + + E A A + + + + L Q R+ E
Sbjct: 481 PQLTRR---VEEAKAAQEQAQVRQETEEAVEVRLSKDEQLQQRRANQRLGAEVMSQRIRE 537

Query: 513 LADRNPGEAAAIIRQWL 529
++D +P A +IRQW+
Sbjct: 538 MSDNDPRVVALVIRQWM 554


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_1880  FLGHOOKFLIK   32     0.006    Flagellar hook-length control protein signature.
		>FLGHOOKFLIK#Flagellar hook-length control protein signature.

Length = 375

Score = 32.1 bits (72), Expect = 0.006
Identities = 44/180 (24%), Positives = 71/180 (39%), Gaps = 5/180 (2%)

Query: 353 AQAANQANAPASDPTAQLATALQPQLTTPTSQSGQITSANLTATAATATAVPLHGLAVEI 412
A Q+ A + + A P +T +Q +A + +A + L+ I
Sbjct: 190 LVAEAQSKAEVISTPSPVTAAASPLITPHQTQPLPTVAAPVL-SAPLGSHEWQQSLSQHI 248

Query: 413 AASALNGKSRFEIRLDPAELGRIDVRIDVDRNGQVTSHLRVEKPETLAMLQQTAPQLQQA 472
+ G+ E+RL P +LG + + + VD N Q + A L+ P L+
Sbjct: 249 SLFTRQGQQSAELRLHPQDLGEVQISLKVDDN-QAQIQMVSPHQHVRAALEAALPVLRTQ 307

Query: 473 LQDAGL---KSNNSGLQFSLRDQNSSGQNGGDNQQNGNAQRLIVTEDETVPAQLAGRSYG 529
L ++G+ +SN SG FS + Q +S Q N + VP L GR G
Sbjct: 308 LAESGIQLGQSNISGESFSGQQQAASQQQQSQRTANHEPLAGEDDDTLPVPVSLQGRVTG 367


83  BBta_2036  BBta_2043  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BBta_2036  2       13       1.164571   photosynthetic apparatus regulatory protein
BBta_2038  2       12       1.244731   hypothetical protein
BBta_2039  1       11       0.664233   RND family mulitdrug efflux protein
BBta_2040  0       11       0.645427   RND family mulitdrug efflux protein
BBta_2041  0       11       0.315056   RND family mulitdrug efflux protein
BBta_2043  0       10       1.317616   TetR family transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_2036  HTHFIS        111    2e-31    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 111 bits (280), Expect = 2e-31
Identities = 21/128 (16%), Positives = 55/128 (42%)

Query: 6 SLLIVEDDDGFARTLKRSFERRGYEAVVAASLEEVDEALKEKTFGYAVVDLKLGGASGLV 65
++L+ +DD L ++ R GY+ + ++ + + V D+ + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 66 CVEKLHAHDPEMLIVVLTGFASIATAVEAIKLGACHYLAKPSNTDDIEEAFRKAEGNAQI 125
+ ++ P++ ++V++ + TA++A + GA YL KP + ++ +A +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 126 ALPERSTS 133
+
Sbjct: 125 RPSKLEDD 132



Score = 51.0 bits (122), Expect = 4e-10
Identities = 20/71 (28%), Positives = 33/71 (46%), Gaps = 3/71 (4%)

Query: 106 PSNTDDIEEAFRKAEGNAQIALPE---RSTSIKTLEWERIHQTLIETDFNISEAARRLGM 162
S + +EE R+ + ALP + +E+ I L T N +AA LG+
Sbjct: 402 LSISQAVEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGL 461

Query: 163 HRRTLARKLEK 173
+R TL +K+ +
Sbjct: 462 NRNTLRKKIRE 472


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_2038  RTXTOXIND     44     6e-07    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 44.4 bits (105), Expect = 6e-07
Identities = 23/183 (12%), Positives = 67/183 (36%), Gaps = 10/183 (5%)

Query: 41 EADFGRREAQLKAAQEQIAQAKAAIDQQVAAKL-------KAERAAIAEAEAAKARQAVA 93
EAD + ++ L A+ + + + KL + ++E E + +
Sbjct: 133 EADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIK 192

Query: 94 DEMGNRDRQLAELHEAILSKDAKLADAQKAQAEMMRKQRELDEARRELDLTVEKKVQESL 153
++ Q + + K A+ A+ ++++R + ++ K +
Sbjct: 193 EQFSTWQNQKYQKELNLDKKRAERLTVL-ARINRYENLSRVEKSRLDDFSSLLHKQAIAK 251

Query: 154 VAVRDKARLEAEEGLKAKVAEKETQIAGMQRQIEELRRKAEQGSQQLQGEALEIELESQL 213
AV ++ E ++ ++Q+ ++ +I + + + +Q + E L+ ++
Sbjct: 252 HAVLEQENKYVE--AVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTD 309

Query: 214 RAR 216

Sbjct: 310 NIG 312


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_2039  ACRIFLAVINRP  488    e-158    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 488 bits (1257), Expect = e-158
Identities = 244/1047 (23%), Positives = 443/1047 (42%), Gaps = 69/1047 (6%)

Query: 6 LSDWALEHRSLVWYFMIAFMAAGLYSYLHLGREEDPSFTIKTMLIQAKWPGASAEEMTRQ 65
++++ + W I M AG + L L + P+ + + A +PGA A+ +
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 66 VTERIEKKLEELPALDYTKSLTVP-GQTTVFVNLRDTTKARDVVPTWLQVRNLINDIKGD 124
VT+ IE+ + + L Y S + G T+ + + T D +QV+N +
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGT---DPDIAQVQVQNKLQLATPL 117

Query: 125 FPSGVVGPG-FNDRFGDVFGNIYAFTSDG--LSQRQLRDRVEE-VRAKVLTVPNVGRVDI 180
P V G ++ + + F SD +Q + D V V+ + + VG V +
Sbjct: 118 LPQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQL 177

Query: 181 VGAQNEV-IYLEFSPRKVAALGIDQRTILNALQAQNAVSPSGVLQAGPE------RVAVR 233
GAQ + I+L+ + + ++N L+ QN +G L P ++
Sbjct: 178 FGAQYAMRIWLD--ADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASII 235

Query: 234 VSGQFTSEESLKAINLRINDRFFP--LTDVATISRGYEDPPTSLFRFNGQPAIGLAIGMK 291
+F + E + LR+N L DVA + G E+ + R NG+PA GL I +
Sbjct: 236 AQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENY-NVIARINGKPAAGLGIKLA 294

Query: 292 AGANLLDFGEALKEEMTKVVADLPVGVGVHLVSDQPKIVEEAVGGFTKALYEAVIIVLAI 351
GAN LD +A+K ++ ++ P G+ V D V+ ++ K L+EA+++V +
Sbjct: 295 TGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLV 354

Query: 352 SFVSLG-VRAGLVVAISIPLVLAITFLVMAYTGISLQRISLGALIIALGLLVDDAMIAVE 410
++ L +RA L+ I++P+VL TF ++A G S+ +++ +++A+GLLVDDA++ VE
Sbjct: 355 MYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVE 414

Query: 411 MMVARLEVGDSLKKAATYVYTS-TAFPMLTGTLVTVAGFIPIGLNNSAAGEFTFTLFVVI 469
+ + K AT S ++ +V A FIP+ + G + I
Sbjct: 415 NVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITI 474

Query: 470 AVSLIVSWIVAVLFTPLLGVTILPASLPSHHAEAGRLTRMFRAMLDACM----------- 518
++ +S +VA++ TP L T+L HH G F D +
Sbjct: 475 VSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKIL 534

Query: 519 RHRWTTIIVTVLIFALSVFGMRFVQQQFFPSSDRNELVIDFNLPQNSSIAETNAQMARFE 578
+++ LI A V + F P D+ + LP ++ T + +
Sbjct: 535 GSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVT 594

Query: 579 QEALKGD-PDIDHWSTYVGSGAQRFVLSFDVQPADVSFGQIIVVTKSLEAR--------- 628
LK + +++ T G F + G V K E R
Sbjct: 595 DYYLKNEKANVESVFTVNG---------FSFSGQAQNAGMAFVSLKPWEERNGDENSAEA 645

Query: 629 --DRARAKL-QTYLTKTFPGTDAYVHLLDIGPPVGRPVQYRVSGPDVAKVREIAQQLAGV 685
RA+ +L + P + L + + +G + + QL G+
Sbjct: 646 VIHRAKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQ-AGLGHDALTQARNQLLGM 704

Query: 686 MRAN-SHLGAVIFDWMEPARVVKVDVLQDKARQLGVTSSDIANALNSVLDGSSITQVRDD 744
+ + L +V + +E K++V Q+KA+ LGV+ SDI +++ L G+ + D
Sbjct: 705 AAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDR 764

Query: 745 IYLIDVRGRAQAKERQSLETLRDLQLSGSNGQSIPLAALATLRYEIEQPEIWRRSRQPTL 804
+ + +A AK R E + L + +NG+ +P +A T + P + R + P++
Sbjct: 765 GRVKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSM 824

Query: 805 TLKASVIDQVQPNTVVDQLKPAITAFNAKLPAGFKVVTGGAVEESAKSQAPIIAVVPIML 864
++ P T + +KLPAG G + S A+V I
Sbjct: 825 EIQGE----AAPGTSSGDAMALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISF 880

Query: 865 FTMATILMLQLQSFQRLFLVFAVAPLALIGVVAALLPSGAPLGFVAILGVLALIGILIRN 924
+ L +S+ V V PL ++GV+ A ++G+L IG+ +N
Sbjct: 881 VVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKN 940

Query: 925 SVILIVQIEHLRD-EGRPPWEAVMEATEHRMRPIMLTAAAASLALIPIA------REVFW 977
+++++ + L + EG+ EA + A R+RPI++T+ A L ++P+A
Sbjct: 941 AILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQN 1000

Query: 978 GPMAYAMMGGIIVGTVLTLLFLPALYV 1004
+ +MGG++ T+L + F+P +V
Sbjct: 1001 -AVGIGVMGGMVSATLLAIFFVPVFFV 1026



Score = 97.2 bits (242), Expect = 1e-22
Identities = 83/515 (16%), Positives = 188/515 (36%), Gaps = 34/515 (6%)

Query: 519 RHRWTTIIVTVLIFALSVFGMRFVQQQFFPSSDRNELVIDFNLPQNSSIAETNAQMARFE 578
R ++ +++ + + +P+ + + N P + + +
Sbjct: 7 RRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYP-GADAQTVQDTVTQVI 65

Query: 579 QEALKGDPDIDH-WSTYVGSGAQRFVLSFDVQPADVSFGQIIVVTKSLEARDRARAKLQT 637
++ + G ++ + ST +G+ L+F D Q+ V K A ++Q
Sbjct: 66 EQNMNGIDNLMYMSSTSDSAGSVTITLTFQSG-TDPDIAQVQVQNKLQLATPLLPQEVQQ 124

Query: 638 YLTKTFPGTDAYVHLLDIGPPVGRPVQYRVSGPDVAKVREIAQQLAGVMRANSHLGAVIF 697
+ +Y+ + Q +S + V++ +L GV G V
Sbjct: 125 QGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGV-------GDVQL 177

Query: 698 DWMEPARVVKVDVLQDKARQLGVTSSDIANALNSVLDGSSITQVRDDIYLIDVRGRAQAK 757
+ A + +D D + +T D+ N L D + Q+ L + A
Sbjct: 178 FGAQYAMRIWLD--ADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASII 235

Query: 758 ER---QSLETLRDLQLSGS-NGQSIPLAALATLRYEIEQPEIWRRSR-QPTLTLKASVID 812
+ ++ E + L + +G + L +A + E + R +P L +
Sbjct: 236 AQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLAT 295

Query: 813 QVQPNTVVDQLKPAITAFNAKLPAGFKVV----TGGAVEESAK--SQAPIIAVVPIMLFT 866
+K + P G KV+ T V+ S + A++ + L
Sbjct: 296 GANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFL-- 353

Query: 867 MATILMLQLQSFQRLFLVFAVAPLALIGVVAALLPSGAPLGFVAILGVLALIGILIRNSV 926
++ L LQ+ + + P+ L+G A L G + + + G++ IG+L+ +++
Sbjct: 354 ---VMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAI 410

Query: 927 ILIVQIE-HLRDEGRPPWEAVMEATEHRMRPIMLTAAAASLALIPIA-----REVFWGPM 980
+++ +E + ++ PP EA ++ ++ A S IP+A +
Sbjct: 411 VVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQF 470

Query: 981 AYAMMGGIIVGTVLTLLFLPALYVAWFRIKTPENS 1015
+ ++ + + ++ L+ PAL + + E+
Sbjct: 471 SITIVSAMALSVLVALILTPALCATLLKPVSAEHH 505


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_2040  RTXTOXIND     34     0.001    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 33.6 bits (77), Expect = 0.001
Identities = 26/181 (14%), Positives = 53/181 (29%), Gaps = 18/181 (9%)

Query: 95 LAVRASVADVSRSQAQLTNATGTEGRQRTLLETDATTKATLESAEQS--RAAAQASVIKA 152
L + + + + + L E ++
Sbjct: 255 LEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLL 314

Query: 153 QANLAKAREQLGYAQLKADFAGIVTAVSA-EVGQVVSPG---VSVVTVARPDIREAVIDV 208
LAK E+ + ++A + V + G VV+ + +V A++
Sbjct: 315 TLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQN 374

Query: 209 GDDLASGLQAGTPFTVRLQVDPSVSA---SGKVREIAPRADATTRTR-----RVRIALDN 260
D + G ++++ P GKV+ I DA R V I+++
Sbjct: 375 KD--IGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINL--DAIEDQRLGLVFNVIISIEE 430

Query: 261 P 261

Sbjct: 431 N 431



Score = 32.1 bits (73), Expect = 0.003
Identities = 17/122 (13%), Positives = 45/122 (36%), Gaps = 19/122 (15%)

Query: 76 GDIVKKDQVVAAIDPAALELAVRASVADVSRSQAQLTNATGTEGRQRTL---LETDATTK 132
G+ V+K V+ + E AD ++Q+ L A + R + L +E + +
Sbjct: 115 GESVRKGDVLLKLTALGAE-------ADTLKTQSSLLQARLEQTRYQILSRSIELNKLPE 167

Query: 133 ATLESAEQSRAAAQASVIKAQANLAKARE---------QLGYAQLKADFAGIVTAVSAEV 183
L + ++ V++ + + + +L + +A+ ++ ++
Sbjct: 168 LKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYE 227

Query: 184 GQ 185

Sbjct: 228 NL 229


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_2041  RTXTOXIND     48     2e-08    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 48.3 bits (115), Expect = 2e-08
Identities = 24/145 (16%), Positives = 40/145 (27%), Gaps = 8/145 (5%)

Query: 80 VSGRVTERLVEVGAHVTAGQVLARIDPAEQQADLDSAEAAVSAA--ESQVKVATSNFNRQ 137
+ V E +V+ G V G VL ++ +AD ++++ A E S
Sbjct: 103 ENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIEL 162

Query: 138 QTLLSSGFTTRATFDQAQETLRTAEGSLEAAK---AQRDTAKDALTYTELRADADG---T 191
L F E SL + Q + L + RA+
Sbjct: 163 NKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLAR 222

Query: 192 ITARNIEAGQVAQAAQPVFTLARDG 216
I + +L
Sbjct: 223 INRYENLSRVEKSRLDDFSSLLHKQ 247



Score = 43.7 bits (103), Expect = 9e-07
Identities = 39/223 (17%), Positives = 78/223 (34%), Gaps = 25/223 (11%)

Query: 99 QVLARIDPAEQQADLDSAEAAVSAAESQVKVATSNFNRQQTLLSSGFTTRATFDQAQETL 158
Q +A+ EQ+ A + +SQ++ S T+ ++ + L
Sbjct: 247 QAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESE--ILSAKEEYQLVTQLFKNEILDKL 304

Query: 159 RTAEGSLEAAKAQRDTAKDALTYTELRADADGTITARNIEA-GQVAQAAQPVFTLARDGD 217
R ++ + ++ + +RA + + G V A+ + + + D
Sbjct: 305 RQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDD 364

Query: 218 RDAVIDVYEAALSLKFDDTKVALALLSDPSVTA---------KGRVREVSP--TIDSKSG 266
V A + K D + + + V A G+V+ ++ D + G
Sbjct: 365 TLEV----TALVQNK-DIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLG 419

Query: 267 TV-RVKIAIENPP-----PEMILGSAVSATGFIKPEQRIVLPW 303
V V I+IE + L S ++ T IK R V+ +
Sbjct: 420 LVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGMRSVISY 462


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_2043  HTHTETR       63     2e-14    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 63.1 bits (153), Expect = 2e-14
Identities = 25/130 (19%), Positives = 52/130 (40%), Gaps = 3/130 (2%)

Query: 20 PRQARGRERREQLLDAAAALIAESGLAAVSMHAAAQRAGASIGSAYHFFRDKDQMLDALA 79
+ +E R+ +LD A L ++ G+++ S+ A+ AG + G+ Y F+DK + +
Sbjct: 4 KTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIW 63

Query: 80 ERHDAELRSAFDRVLQRTDAEWAALSPAEIIEQLIGWAIRYFVRHPDALATLDLHDKAMH 139
E ++ + + + + EI+ ++ + R L + H
Sbjct: 64 ELSESNIGELELEYQAKFPGDPLS-VLREILIHVLESTVTEERRR--LLMEIIFHKCEFV 120

Query: 140 GEFKAVIDRI 149
GE V
Sbjct: 121 GEMAVVQQAQ 130


84  BBta_2105  BBta_2110  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BBta_2105  2       12       1.364571   N-isopropylammelide isopropylaminohydrolase
BBta_2106  1       14       1.993934   N-carbamoyl-D-amino acid hydrolase
BBta_2107  1       13       2.119727   Hydantoin racemase HyuA
BBta_2108  -2      12       1.010896   dihydropyrimidinase
BBta_2109  -2      13       0.911594   UDP-glucose 4-epimerase
BBta_2110  -1      12       0.786299   short chain dehydrogenase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_2105  UREASE        44     1e-06    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 43.6 bits (103), Expect = 1e-06
Identities = 23/72 (31%), Positives = 34/72 (47%), Gaps = 12/72 (16%)

Query: 1 MDLIIRNAVLAQQGELRCADIGISKGRIAAIA----PSLQ--------ADGEARDAENCL 48
+D +I NA++ + ADIG+ GRIAAI P +Q E E +
Sbjct: 68 VDTVITNALILDHWGIVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAGEGKI 127

Query: 49 VVPGLIETHIHL 60
V G +++HIH
Sbjct: 128 VTAGGMDSHIHF 139


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_2108  UREASE        38     7e-05    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 38.2 bits (89), Expect = 7e-05
Identities = 28/100 (28%), Positives = 37/100 (37%), Gaps = 22/100 (22%)

Query: 7 DLIIRGGRVATTTDTFEADVAISGETIAAIGHG------------LGPAKREIDARGKYV 54
D +I + +AD+ + IAAIG +GP I GK V
Sbjct: 69 DTVITNALILDHWGIVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAGEGKIV 128

Query: 55 LPGGVDSHCHI---EQLSAAGIVNADTFESATTSAAFGGT 91
GG+DSH H +Q+ A S T GGT
Sbjct: 129 TAGGMDSHIHFICPQQIEEA-------LMSGLTCMLGGGT 161


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_2109  NUCEPIMERASE  95     2e-24    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 94.8 bits (236), Expect = 2e-24
Identities = 52/190 (27%), Positives = 80/190 (42%), Gaps = 9/190 (4%)

Query: 1 MHVLIFGGAGFVGLNVAQGLLERGHAVTMFDA------APLPNAAQRAFADHGDRLHVIT 54
M L+ G AGF+G +V++ LLE GH V D L A A G
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPG--FQFHK 58

Query: 55 GDVTDERAVASAIAGGCDAVVMGAAITAGPERDAADPERILAVNLLAQVPILQTARSSGV 114
D+ D + A G V + +P NL + IL+ R + +
Sbjct: 59 IDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKI 118

Query: 115 RRVINLSSAASYGASGQRFAVLEETMPCEPVSLYAITKFASERVASRLSDLWQTDVISVR 174
+ ++ SS++ YG + ++ + PVSLYA TK A+E +A S L+ +R
Sbjct: 119 QHLLYASSSSVYGLN-RKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLR 177

Query: 175 LSAVFGPFER 184
V+GP+ R
Sbjct: 178 FFTVYGPWGR 187


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_2110  DHBDHDRGNASE  104    5e-29    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 104 bits (260), Expect = 5e-29
Identities = 71/251 (28%), Positives = 116/251 (46%), Gaps = 4/251 (1%)

Query: 3 LAGKVAVITGGGSGIGRASAVMFAREGAFLALVDRDAAGAEETRALMRDAGGDGSVHIGD 62
+ GK+A ITG GIG A A A +GA +A VD + E+ + ++ D
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 63 VGDADFAKTTIENVAAARGRVDVLMTAAGFSCGGTVVTTDPADWDAVFRANVGGTWLWAK 122
V D+ + G +D+L+ AG G + + +W+A F N G + ++
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 123 VAVPIMQVQKSGSIITVASQLAVAGGKGNSAYIAAKGAIISLTRTMAVDFATDGIRVNAL 182
M ++SGSI+TV S A +AY ++K A + T+ + ++ A IR N +
Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185

Query: 183 APGAIDTPLLRRSFARHADPEPV----REASRLRHAMKRFGEADEVAAAALFLASDEASF 238
+PG+ +T + +A E V E + +K+ + ++A A LFL S +A
Sbjct: 186 SPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGH 245

Query: 239 TTGIVLPVDGG 249
T L VDGG
Sbjct: 246 ITMHNLCVDGG 256


85  BBta_2142  BBta_2148  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BBta_2142  0       8        -1.530528  chemotaxis protein CheY
BBta_2143  0       8        -1.185685  chemotaxis protein CheA
BBta_2144  0       9        -1.828063  chemotaxis protein CheW
BBta_2145  0       10       -2.111511  methyl-accepting chemotaxis protein
BBta_2146  0       14       -2.539133  chemotaxis protein CheR
BBta_2147  0       14       -1.434366  MCP proteins methylation stimulator CheD
BBta_2148  0       14       -0.629338  chemotaxis response regulator protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_2142  HTHFIS        92     3e-25    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 92.2 bits (229), Expect = 3e-25
Identities = 31/119 (26%), Positives = 58/119 (48%), Gaps = 3/119 (2%)

Query: 1 MTK-RILAVDDSKTMRDMVSFTLKKAGFDVAEAEDGKAALNVLTGGKFDLIITDLNMPNM 59
MT IL DD +R +++ L +AG+DV + + G DL++TD+ MP+
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 60 DGISLIKSVRAGSQHRTVPILILTTESDGAKKADGKAAGATGWLVKPFNPDQLIATVNR 118
+ L+ ++ +P+L+++ ++ GA +L KPF+ +LI + R
Sbjct: 61 NAFDLLPRIKKARPD--LPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_2143  PF06580       40     2e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 39.8 bits (93), Expect = 2e-05
Identities = 12/52 (23%), Positives = 22/52 (42%), Gaps = 8/52 (15%)

Query: 390 IRNALDHGIEPPAERLAAGKPRQGTINLSAAQRSGRIVIEVSDDGRGINRDK 441
+ N + HGI P+ G I L + +G + +EV + G ++
Sbjct: 264 VENGIKHGIAQ--------LPQGGKILLKGTKDNGTVTLEVENTGSLALKNT 307


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_2146  CHANNELTSX    30     0.006    Nucleoside-specific channel-forming protein Tsx signa...
		>CHANNELTSX#Nucleoside-specific channel-forming protein Tsx signature.

Length = 294

Score = 30.4 bits (68), Expect = 0.006
Identities = 17/57 (29%), Positives = 23/57 (40%), Gaps = 14/57 (24%)

Query: 71 INAVTTNHTSF--------------FREAHHFDFLSRVIIPEFLSGNTTRLRIWSAG 113
+N V + HT F F + FDF + P F GN+T IW+ G
Sbjct: 39 VNVVGSYHTRFGPQIRNDTYLEYEAFAKKDWFDFYGYIDAPVFFGGNSTAKGIWNKG 95


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_2148  HTHFIS        75     6e-17    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 74.9 bits (184), Expect = 6e-17
Identities = 44/190 (23%), Positives = 74/190 (38%), Gaps = 25/190 (13%)

Query: 1 MSAIRVLIVDDSAFVRQMLTDLLSSDPGIEVLGAAPNPIVARDMIKSLNPDVLTLDIEMP 60
M+ +L+ DD A +R +L LS G +V + N I + + D++ D+ MP
Sbjct: 1 MTGATILVADDDAAIRTVLNQALS-RAGYDVRITS-NAATLWRWIAAGDGDLVVTDVVMP 58

Query: 61 RMDGLAFLDKIMTLRP-MPVLMISSLTQKGADTAVRALEMGAFDCVAKPVIGLVEGLPAL 119
+ L +I RP +PVL++S+ TA++A E GA+D + KP
Sbjct: 59 DENAFDLLPRIKKARPDLPVLVMSAQN--TFMTAIKASEKGAYDYLPKPF---------- 106

Query: 120 RDEIVTKVKAAAAAKIRPRSGDEVRPLHRPGVSYSSSEKIIAVGASTGGVEALQELLMAF 179
++ + A P+ VG S E + L
Sbjct: 107 --DLTELIGIIGRALAEPKRRPSKLEDDSQDGMP-------LVGRSAAMQEIYRVLARLM 157

Query: 180 PSDAPAVVIT 189
+D ++IT
Sbjct: 158 QTDLT-LMIT 166


86  BBta_2207  BBta_2217  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BBta_2207  -2      13       1.305985   hypothetical protein
BBta_2208  -1      11       1.608522   hypothetical protein
BBta_2209  -3      10       0.683570   hypothetical protein
BBta_2210  -3      9        0.269977   TetR family transcriptional regulator
BBta_2211  -3      10       -0.262348  hydrolase
BBta_2213  -2      10       -0.615231  MFS family transporter
BBta_2214  -1      10       -0.718206  component of multidrug efflux system
BBta_2215  0       10       -1.063364  RND efflux transporter
BBta_2216  0       11       -0.793438  TetR family transcriptional regulator
BBta_2217  0       12       0.146111   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_2207  OMPADOMAIN    82     2e-20    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 81.5 bits (201), Expect = 2e-20
Identities = 34/104 (32%), Positives = 57/104 (54%), Gaps = 2/104 (1%)

Query: 104 EIQFDYNSAAISKTSIQAVQELGKALSDPNLKGSTFMVAGHTDGIGSDGFNQDLSERRAD 163
++ F++N A + A+ +L LS+ + K + +V G+TD IGSD +NQ LSERRA
Sbjct: 220 DVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGLSERRAQ 279

Query: 164 TLKRYLVEKFGLAGQDLVTVGYGKTKLKDAANPADPVNRRVQVV 207
++ YL+ K G+ + G G++ N D V +R ++
Sbjct: 280 SVVDYLISK-GIPADKISARGMGESNPV-TGNTCDNVKQRAALI 321


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_2210  HTHTETR       60     2e-13    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 59.6 bits (144), Expect = 2e-13
Identities = 30/158 (18%), Positives = 54/158 (34%), Gaps = 2/158 (1%)

Query: 5 LSRALVVFSERGYHAASLAELRAAMKLATGSLYKAFEDKRAIFVATLDYYIARRDRQLRE 64
L AL +FS++G + SL E+ A + G++Y F+DK +F + + E
Sbjct: 17 LDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELE 76

Query: 65 RL-DAEQSGRDKLRALLQFYAESASDSEGRRGCLVVTSAAELASLDAEIAVKVRAAL-RR 122
LR +L ES E RR + + + + + + L
Sbjct: 77 YQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRNLCLE 136

Query: 123 VETALCDLVRLGQADGSIRTGRDPDAVAGALFCLIQGL 160
+ ++ + A + I GL
Sbjct: 137 SYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGL 174


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_2214  RTXTOXIND     35     4e-04    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 35.2 bits (81), Expect = 4e-04
Identities = 12/91 (13%), Positives = 27/91 (29%), Gaps = 3/91 (3%)

Query: 77 GKVAKRLVEVGQTVEIGQPLATLDEVDLKLQAEQAEAEFRAA--TGVLAQASAAETRAKE 134
V + +V+ G++V G L L + + + ++ A Q + +
Sbjct: 105 SIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNK 164

Query: 135 LRVKGWTTDAQMDQAKASADEARARLNRAER 165
L + + R E+
Sbjct: 165 LPELKLPDEPYFQNVSE-EEVLRLTSLIKEQ 194


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_2215  ACRIFLAVINRP  473    e-152    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 473 bits (1220), Expect = e-152
Identities = 238/1053 (22%), Positives = 434/1053 (41%), Gaps = 66/1053 (6%)

Query: 6 LSAWAVSHPPLVLFLIVALGLFGFISYERLGRAEDPSFTVKVVNVSMIWPGATATEMQSQ 65
++ + + P L + L + G ++ +L A+ P+ V+VS +PGA A +Q
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 66 VADPIEKKLQELPYFEKVETYSKPAFAAM-QITFRDSTPPKEVPYLFYLVRRKLADVQAS 124
V IE+ + + + + S A + +TF+ T P V+ KL
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIA---QVQVQNKLQLATPL 117

Query: 125 LPSGVLGPFVNDEFSDVDSILFMM-TGDGADYAQLKK---VAEGLRQRLLKVDGVTKVNL 180
LP V ++ E S ++ D Q VA ++ L +++GV V L
Sbjct: 118 LPQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQL 177

Query: 181 YGTQDE-RIYVEFSHAKLATLGITPQALFDSLAKQNNVTPAGTVETS------SQRVPLR 233
+G Q RI+++ L +TP + + L QN+ AG + + +
Sbjct: 178 FGAQYAMRIWLDA--DLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASII 235

Query: 234 VTGALDGAKAVAETPVESN--GRVFRLGDIATVTHGFVDPPTYKVHQEGKPALGLGIVTA 291
+ + + N G V RL D+A V G + GKPA GLGI A
Sbjct: 236 AQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENY-NVIARINGKPAAGLGIKLA 294

Query: 292 KGANILELGKDVHAASDDFMKAVPQGIELKQIADQPKVVETAVGEFVHSFVEALVIVLFV 351
GAN L+ K + A + PQG+++ D V+ ++ E V + EA+++V V
Sbjct: 295 TGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLV 354

Query: 352 SFLALG-WRTGIVVALSVPLVLGIVFIVMSAMGLDLHRISLGALIIALGLLVDDAIIAVE 410
+L L R ++ ++VP+VL F +++A G ++ +++ +++A+GLLVDDAI+ VE
Sbjct: 355 MYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVE 414

Query: 411 MMV-VKMEQGWDRVRAASFAWESTAFPMLTGTLVTAAGFLPIGFASSAVGEYTGTIFWIV 469
+ V ME A + ++ +V +A F+P+ F + G +
Sbjct: 415 NVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITI 474

Query: 470 AISLVASWFVAVIFTPYIGVKLLPDMKAHHGHDENA-------VYDTRMYRGLRAVVSWC 522
++ S VA+I TP + LL + A H ++ +D V
Sbjct: 475 VSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFD-HSVNHYTNSVGKI 533

Query: 523 VAHRITTVAATVGVFVASIVAFGHVQQQFFPLSERPELFLQLRLPEGTAFNVTEKAARK- 581
+ + + +V F + F P ++ ++LP G T+K +
Sbjct: 534 LGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQV 593

Query: 582 AEGFLKDDKD-IETFTSYVGQGSPRFWLGLNPQLPNEAFAEIVIVAKDVAARERIKARIE 640
+ +LK++K +E+ + G + Q N A + + + + A
Sbjct: 594 TDYYLKNEKANVESVFTVNG-------FSFSGQAQNAGMAFVSLKPWEERNGDENSAEAV 646

Query: 641 AAAAN---GEINEARVRVDRFNFGPP--------VGFPVQFRVI-GPDTAKVREIAYQVR 688
A G+I + V F P GF + G + + Q+
Sbjct: 647 IHRAKMELGKIRDGFV----IPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLL 702

Query: 689 DVMRQN-ANVRDPQFDWNEQSPYLKLVVDQDRARAIGLTPQDVSQSLAMLISGVQVTTIR 747
+ Q+ A++ + + E + KL VDQ++A+A+G++ D++Q+++ + G V
Sbjct: 703 GMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFI 762

Query: 748 DGIEKVGVIARAVPSERLDLAGVGDLTITSRNGVAVPLQQIAKIEYSHEEPILRRRNRDM 807
D + +A R+ V L + S NG VP + + P L R N
Sbjct: 763 DRGRVKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLP 822

Query: 808 AITVRSDVADGVQAPDVTNQIWPKLADIRSHLDPAYRIEMGGAIEESAKGNASIFVLFPL 867
++ ++ + A G + D + ++ S L + G + L +
Sbjct: 823 SMEIQGEAAPGTSSGDAMALM----ENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAI 878

Query: 868 MVIVMLTLLMIQLQSFSRLVLVFLTAPLGIIGASFGLNVANAPFGFVALLGLIALAGMIM 927
+V+ L +S+S V V L PLGI+G + N ++GL+ G+
Sbjct: 879 SFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSA 938

Query: 928 RNTVILVDQI-ETDVASGLTRREAIIEATVRRARPVVLTALAAILAMIPLSRSAFWG--- 983
+N +++V+ + G EA + A R RP+++T+LA IL ++PL+ S G
Sbjct: 939 KNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGA 998

Query: 984 --PMAITIMGGLFVATFLTLLYLPGLYALWFRK 1014
+ I +MGG+ AT L + ++P + + R
Sbjct: 999 QNAVGIGVMGGMVSATLLAIFFVPVFFVVIRRC 1031


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BBta_2216  HTHTETR  63     2e-14    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 62.7 bits (152), Expect = 2e-14
Identities = 26/88 (29%), Positives = 39/88 (44%), Gaps = 1/88 (1%)

Query: 12 DTRERILVVAERLFRQIGYQKTTVADIAKELRMSPANVYRFFDSKKAIHEAVARTLMGQV 71
+TR+ IL VA RLF Q G T++ +IAK ++ +Y F K + + +
Sbjct: 11 ETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNI 70

Query: 72 -EAAALVIADRPGPAAPRLREMLGTIHR 98
E A PG LRE+L +
Sbjct: 71 GELELEYQAKFPGDPLSVLREILIHVLE 98
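
Each hit report above carries its statistics on two fixed-pattern lines ("Score = ... bits (...), Expect = ..." and "Identities = ..., Positives = ..., Gaps = ..."). The following is a minimal sketch, assuming Python, of how those lines could be pulled into numbers for filtering or tabulation. The function name `parse_hit_stats` and the dictionary keys are hypothetical and are not part of PredictBias or BLAST.

```python
import re

# Hypothetical helper, not part of PredictBias: it only reads the two
# statistics lines printed before each alignment in this report.
SCORE_RE = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*([\d.eE+-]+)"
)
RATIO_RE = re.compile(r"(Identities|Positives|Gaps)\s*=\s*(\d+)/(\d+)")


def parse_hit_stats(text):
    """Return bit score, raw score, E-value and match ratios found in text."""
    stats = {}
    m = SCORE_RE.search(text)
    if m:
        expect = m.group(3)
        # Assumption: some BLAST versions print values such as "e-100"
        # without a leading digit, so normalise them before conversion.
        if expect[0] in "eE":
            expect = "1" + expect
        stats["bits"] = float(m.group(1))
        stats["raw_score"] = int(m.group(2))
        stats["expect"] = float(expect)
    # The Gaps field is omitted for ungapped alignments, so each ratio
    # is stored only when it is actually present.
    for name, num, den in RATIO_RE.findall(text):
        stats[name.lower()] = (int(num), int(den))
    return stats


sample = (
    "Score = 62.7 bits (152), Expect = 2e-14\n"
    "Identities = 26/88 (29%), Positives = 39/88 (44%), Gaps = 1/88 (1%)"
)
stats = parse_hit_stats(sample)
print(stats["bits"], stats["expect"], stats["identities"])
```

Ratios are kept as (matches, alignment length) pairs rather than the rounded percentages printed in the report, so a caller can recompute percent identity at whatever precision is needed.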


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
BBta_2217  MICOLLPTASE  27     0.046    Microbial collagenase metalloprotease (M9) signature.
		>MICOLLPTASE#Microbial collagenase metalloprotease (M9) signature.

Length = 1104

Score = 27.4 bits (60), Expect = 0.046
Identities = 15/69 (21%), Positives = 28/69 (40%), Gaps = 7/69 (10%)

Query: 32 GWISSRDPDGHVNLAPYSFFNAFNYTPPI---IGFSSTNMKDSAANIRDTGEFVWNLVTR 88
G + + D HVN P + + + + + I F T KD I+ + W+
Sbjct: 760 GMNTDTNTDVHVNKEPKAVIKS-DSSVIVEEEINFDGTESKDEDGEIK---AYEWDFGDG 815

Query: 89 DLATQMNAT 97
+ + + AT
Sbjct: 816 EKSNEAKAT 824


87  BBta_2360  BBta_2364  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BBta_2360  -2      9        1.625348   pyruvate phosphate dikinase
BBta_2361  -2      10       1.602314   hypothetical protein
BBta_2362  -2      12       1.117445   L-aspartate oxidase
BBta_2363  -1      13       0.498214   MFS permease
BBta_2364  -2      12       1.848721   dehydrogenase NAD(P)-binding domain-containing
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_2360  PHPHTRNFRASE  312    3e-97    Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 312 bits (800), Expect = 3e-97
Identities = 119/486 (24%), Positives = 187/486 (38%), Gaps = 97/486 (19%)

Query: 521 EATKLQADGRKVILVRIETSPEDIHGMHA--AEGILTTRGGMTSHAAVVARGMGKPCVSG 578
E L + +++ + +P D ++ +G T GG TSH+A+++R + P V G
Sbjct: 146 ETGSLATIAEETVIIAEDLTPSDTAQLNKQFVKGFATDIGGRTSHSAIMSRSLEIPAVVG 205

Query: 579 CGTIRVDYGRGTMSIGGRTFKTGDVITIDGSVGQVLAGRMPMI----EPELSGEFGTLMG 634
+ + GD++ +DG G V+ E + +
Sbjct: 206 TKEVTEK------------IQHGDMVIVDGIEGIVIVNPTEEEVKAYEEKRAAFEKQKQE 253

Query: 635 WADAV---------RETGVRVNADTPEDARTAIKFGAEGIGLCRTEHMFFEETRIRTVRE 685
WA V + N TP+D + G EGIGL RTE ++ + ++
Sbjct: 254 WAKLVGEPSTTKDGAHVELAANIGTPKDVDGVLANGGEGIGLYRTEFLYMDRDQL----- 308

Query: 686 MILSEEEQERRAALAKLLPMQRADFVELFEIMKGLPVTIRLLDPPLHEFLPHTQAEVEEV 745
+EEEQ + E+ + M G PV IR LD + L + Q E
Sbjct: 309 --PTEEEQ-------------FEAYKEVVQRMDGKPVVIRTLDIGGDKELSYLQLPKEL- 352

Query: 746 ARAMNTDPRRLADRARELSEFNPMLGFRGCRLAIAYPEIAEMQARAIFEAAVEAEKRTGE 805
NP LGFR RL + +I Q RA+ A+
Sbjct: 353 ---------------------NPFLGFRAIRLCLEKQDIFRTQLRALLRASTYGN----- 386

Query: 806 AVGLEVMVPLIATKAEFDLVKARIDAMAQAVIRETGAKLE-YQVGTMIELPRAALMAGQI 864
L+VM P+IAT E KA + ++ E + +VG M+E+P A+ A
Sbjct: 387 ---LKVMFPMIATLEELRQAKAIMQEEKDKLLSEGVDVSDSIEVGIMVEIPSTAVAANLF 443

Query: 865 AEAAEFFSFGTNDLTQTTYGISRDDA-GSFLGAYVAKGILEIDPFISIDRDGVGELVKIG 923
A+ +FFS GTNDL Q T R + S+L P+ V ++K
Sbjct: 444 AKEVDFFSIGTNDLIQYTMAADRMNERVSYL----------YQPYHPAILRLVDMVIKAA 493

Query: 924 VERGRATRPNLKVGICGEHGGDPASVAFCHKIGMNYVSCSPYRVPIARLAAAQSALG--K 981
G+ VG+CGE GD ++ +G++ S S + AR + + K
Sbjct: 494 HSEGK------WVGMCGEMAGDEVAIPLLLGLGLDEFSMSATSILPARSQLLKLSKEELK 547

Query: 982 EIASQA 987
A +A
Sbjct: 548 PFAQKA 553


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
BBta_2361  IGASERPTASE  37     2e-04    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 37.0 bits (85), Expect = 2e-04
Identities = 23/129 (17%), Positives = 46/129 (35%), Gaps = 10/129 (7%)

Query: 104 QVPPRYQASDFPKVDRTLKGDRLVVRPPSQPQPSSVPTEAAEPAPAPPEPAPAMSPAPTD 163
Q P+ + PK ++ V+P ++P + PT + + +
Sbjct: 1120 QEVPKVTSQVSPKQEQ-----SETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKE 1174

Query: 164 SNSSVFGAKTTEAPVAPLPAPRAALDPELQEALSAPPLAQYESA-KPEIQPSAELK--PA 220
++S+V T V + +PE + P ES+ KP+ + ++ P
Sbjct: 1175 TSSNVEQPVTESTTVNTGNSV--VENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPH 1232

Query: 221 TDAAAVDSA 229
A S+
Sbjct: 1233 NVEPATTSS 1241


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BBta_2363  TCRTETB  44     8e-07    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 43.7 bits (103), Expect = 8e-07
Identities = 30/143 (20%), Positives = 62/143 (43%), Gaps = 2/143 (1%)

Query: 67 LLWGLGQPLAGAIADRFGVMRVICVGALLYAGGLLMMRYSTSPLSLNIGAGVLVGFGLSG 126
L + +G + G ++D+ G+ R++ G ++ G ++ S SL I A + G G +
Sbjct: 60 LTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAG-AA 118

Query: 127 CSFNLILSAFSKLLPPERRGVALGAGTAAGSLGQFLFAPFGVALIDQFGWQSALMVFATL 186
L++ ++ +P E RG A G + ++G+ + G + W S L++ +
Sbjct: 119 AFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHW-SYLLLIPMI 177

Query: 187 MLFIVPLSLAVATPPAAEAKPTD 209
+ VP + + D
Sbjct: 178 TIITVPFLMKLLKKEVRIKGHFD 200


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_2364  NUCEPIMERASE  32     0.004    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 31.7 bits (72), Expect = 0.004
Identities = 10/27 (37%), Positives = 16/27 (59%)

Query: 153 VVVTGAAGGVGSVATAVLAKLGYHVIA 179
+VTGAAG +G + L + G+ V+
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAGHQVVG 29


88  BBta_2604  BBta_2616  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BBta_2604  -2      11       0.667176   hypothetical protein
BBta_2605  -1      10       0.483968   hypothetical protein
BBta_2608  -1      13       -0.170303  hypothetical protein
BBta_2609  1       13       0.919021   general secretion pathway protein D
BBta_2610  0       15       1.297993   hypothetical protein
BBta_2611  2       14       1.506890   general secretion pathway protein E
BBta_2612  0       14       1.262972   general secretion pathway protein F
BBta_2613  1       16       0.989264   general secretion pathway protein G
BBta_2614  1       18       0.548200   general secretion pathway protein H
BBta_2615  1       20       0.769681   general secretion pathway protein I
BBta_2616  1       20       0.254207   general secretion pathway protein J
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BBta_2604  PF03544  31     0.009    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 30.7 bits (69), Expect = 0.009
Identities = 19/114 (16%), Positives = 32/114 (28%)

Query: 157 PMWSRAAAGLGALLVAAAGSWFMLAREPIPAPPAQPQAGEASVPSSGPASTDPAVRSVTM 216
A + +VA A A +P P P +P+ +P + +
Sbjct: 41 IELPAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPK 100

Query: 217 EAVSAATAPASAPADPPSEALAIAGPAPQVAAAVALAADLTAQAASTTPTPSPE 270
+ + +P A A TA AA++ P S
Sbjct: 101 PKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVA 154


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_2609  BCTERIALGSPD  260    1e-78    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 260 bits (665), Expect = 1e-78
Identities = 170/669 (25%), Positives = 286/669 (42%), Gaps = 92/669 (13%)

Query: 111 AIGNGFELNFENSPIATVAKVVLGDVLGVGYTIDPRIQGTVSLSSVRPVPKSDVVYVLEN 170
A F +F+ + I V + L IDP ++GT+++ S + + +
Sbjct: 25 AAAEEFSASFKGTDIQEFINTVSKN-LNKTVIIDPSVRGTITVRSYDMLNEEQYYQFFLS 83

Query: 171 ALRLAGTALV-RDNSGYRLIPLGEAVGAGNVDAPGAAAEPGYGIT--VLPLQYVSAPTLL 227
L + G A++ +N +++ +A A A AA G + V+PL V+A L
Sbjct: 84 VLDVYGFAVINMNNGVLKVVRSKDAKTAAVPVASDAAPGIGDEVVTRVVPLTNVAARDLA 143

Query: 228 KLLDSF--ALKPGTVRADPGRNMLLIQGTGSERRTAVDAALSF--DADWMKGQSVGIFPI 283
LL G+V N+LL+ G R + L+ D +SV P+
Sbjct: 144 PLLRQLNDNAGVGSVVHYEPSNVLLMTG----RAAVIKRLLTIVERVDNAGDRSVVTVPL 199

Query: 284 ANGNPEPIVAELEKI--MDTGDSGLGQNMVKFQTISRMNAVMVISRKPALLRTAETWIKR 341
+ + +V + ++ + + G + R NAV+V + R IK+
Sbjct: 200 SWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGEPNSRQRIIAM-IKQ 258

Query: 342 LDAGNTLRNSVHVYRVKYGEARQMARVLNEMFTGNSASADASSTDLAPGSGAASSSSSER 401
LD + + V +KY +A + VL + S+ + AA ++
Sbjct: 259 LDRQQATQGNTKVIYLKYAKASDLVEVLTGI-----------SSTMQSEKQAAKPVAALD 307

Query: 402 LSAAGTASTGSLSGQQPGNNGLVANRGFGGAPPATQTSLDSAPESRASAGGSKPLMDGVR 461
+ A N L+ V
Sbjct: 308 KNIIIKA--------HGQTNALI-----------------------------------VT 324

Query: 462 ITPDVVNNTLLIYADRENYRIIEATLRQCDRPQLQVAIDATIAEVTLNDNLNYGVQAYLT 521
PDV+N+ +E + Q D + QV ++A IAEV D LN G+Q
Sbjct: 325 AAPDVMND-------------LERVIAQLDIRRPQVLVEAIIAEVQDADGLNLGIQWA-- 369

Query: 522 SKNLGLRPDQGSLLNTTSTSAPATVASAAGTITNAFLNRAFPGFNFLIGSESQPS--AIL 579
+KN G+ S L ++ A A + GT++++ + A FN + Q + +L
Sbjct: 370 NKNAGMTQFTNSGLPISTAIAGANQYNKDGTVSSSLAS-ALSSFNGIAAGFYQGNWAMLL 428

Query: 580 SALHTVTDVKVLSNPSLVVIDNQAATLQVGDQVPVSTGSATVLTTNNTVVNTIDYRNTGI 639
+AL + T +L+ PS+V +DN AT VG +VPV TGS T T+ + + NT++ + GI
Sbjct: 429 TALSSSTKNDILATPSIVTLDNMEATFNVGQEVPVLTGSQT--TSGDNIFNTVERKTVGI 486

Query: 640 ILRVAPRVNENGNVRLEIEQEISNV---SPQTANSLTPTVAQRKVKSAVAVASGQTVLLA 696
L+V P++NE +V LEIEQE+S+V + T++ L T R V +AV V SG+TV++
Sbjct: 487 KLKVKPQINEGDSVLLEIEQEVSSVADAASSTSSDLGATFNTRTVNNAVLVGSGETVVVG 546

Query: 697 GLISETHQGTRNGVPGVDQIPVLGDLFSNNDRTRGRTELIIFIRPQIIRDGTDAHYVAEE 756
GL+ ++ T + VP + IPV+G LF + + + L++FIRP +IRD + +
Sbjct: 547 GLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRNLMLFIRPTVIRDRDEYRQASSG 606

Query: 757 LRSKLRGTI 765
+
Sbjct: 607 QYTAFNDAQ 615


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_2612  BCTERIALGSPF  204    3e-64    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 204 bits (521), Expect = 3e-64
Identities = 100/389 (25%), Positives = 176/389 (45%), Gaps = 3/389 (0%)

Query: 3 GTISAATAKDVIDRIEYLRLLPIETVEEKTASPVTRDMLAWFGGP---TQADVTTFTRDL 59
GT A +A+ + L+P+ E + + + +D+ TR L
Sbjct: 18 GTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLRRKIRLSTSDLALLTRQL 77

Query: 60 ALLLKAGARLEDSLELLAGDADIGRLRPIVNRLRSSILAGESLADALAHHPLQFPAMYLA 119
A L+ A LE++L+ +A ++ L ++ +RS ++ G SLADA+ P F +Y A
Sbjct: 78 ATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSLADAMKCFPGSFERLYCA 137

Query: 120 LVRVGEASGSLDHVLEVLGAERARADAMRRKLSDAMQYPLFVLAAAALVMLFFFLVVLPQ 179
+V GE SG LD VL L + MR ++ AM YP + A V+ VV+P+
Sbjct: 138 MVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLTVVAIAVVSILLSVVVPK 197

Query: 180 FSAVLRDFGAKTDTTITTFLDISDFVRGHGAALASAAAAIVAAGGLLFGRAGVRAALIAQ 239
+ + +SD VR G + A A A ++ + R + +
Sbjct: 198 VVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMAFRVMLRQEKRRVSFHRR 257

Query: 240 LCRLPGLAKVFGSYRTSVFCRNLAILLGSGLTLTATLRILVDIMSAIGNGAEWGAAADRV 299
L LP + ++ T+ + R L+IL S + L +RI D+MS A D V
Sbjct: 258 LLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDVMSNDYARHRLSLATDAV 317

Query: 300 RHGGKLSEALAATSILPAMATRMIRIGEETGQVPVLAGRIAEFYEAKLQRSLDRIVGIVG 359
R G L +AL T++ P M MI GE +G++ + R A+ + + + +G+
Sbjct: 318 REGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADNQDREFSSQMTLALGLFE 377

Query: 360 PAAIILISTVVGGLIVSVMSALLSVTQLV 388
P ++ ++ VV ++++++ +L + L+
Sbjct: 378 PLLVVSMAAVVLFIVLAILQPILQLNTLM 406


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_2613  BCTERIALGSPG  133    8e-43    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 133 bits (336), Expect = 8e-43
Identities = 49/139 (35%), Positives = 80/139 (57%), Gaps = 7/139 (5%)

Query: 21 RHEDQRGFTLVEMLVVIAIIGMIMGLIGPRVLSSLNESRVKTARIQIQSFSSALDLFYLD 80
+ QRGFTL+E++VVI IIG++ L+ P ++ + ++ + A I + +ALD++ LD
Sbjct: 3 ATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLD 62

Query: 81 TGRYPTTSEGLAALVRPVATIPG---WAGPYLKGGNVPSDPWSHAYVYRSPGQQVPYEIL 137
YPTT++GL +LV P + +P+DPW + YV +PG+ Y++L
Sbjct: 63 NHHYPTTNQGLESLVEAPTLPPLAANYNKEGYI-KRLPADPWGNDYVLVNPGEHGAYDLL 121

Query: 138 SYGADGQEGGTDLAADISS 156
S G DG+ G D DI++
Sbjct: 122 SAGPDGEMGTED---DITN 137


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_2616  BCTERIALGSPG  31     0.003    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 30.6 bits (69), Expect = 0.003
Identities = 16/59 (27%), Positives = 31/59 (52%), Gaps = 5/59 (8%)

Query: 4 RADHPQAGFTLIEALAAMALMGLLVSALAAIASHWLPNWNRGLNRIQRSESVSV--ALD 60
RA Q GFTL+E + + ++G+L + + + + N + + S+ V++ ALD
Sbjct: 2 RATDKQRGFTLLEIMVVIVIIGVL---ASLVVPNLMGNKEKADKQKAVSDIVALENALD 57


89  BBta_2737  BBta_2740  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BBta_2737  0       10       0.044788   AcrB/AcrD/AcrF family mulitdrug efflux protein
BBta_2738  -1      11       0.555539   RND efflux membrane fusion protein
BBta_2739  0       10       0.754096   outer membrane efflux protein
BBta_2740  2       10       0.719136   TetR family transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_2737  ACRIFLAVINRP  754    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 754 bits (1948), Expect = 0.0
Identities = 315/1029 (30%), Positives = 522/1029 (50%), Gaps = 31/1029 (3%)

Query: 5 DIFIRRPVLALVVSALILVLGLRSMTTLSILQYPRTQNAIVTVSTAYAGADSDIIAGFIT 64
+ FIRRP+ A V++ ++++ G ++ L + QYP V+VS Y GAD+ + +T
Sbjct: 3 NFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVT 62

Query: 65 TPLENAIAQANGIDYMTSTSV-TGSSTITVNLRLNYDSGKAMTEINAKVNSVLNQLPTGT 123
+E + + + YM+STS GS TIT+ + D A ++ K+ LP
Sbjct: 63 QVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEV 122

Query: 124 QQPTLTLKVGQTIDAMYIGFASDVLAPNQ--ITDYLTRVVQPKLQAVSGVQTAELIGAKK 181
QQ ++++ + M GF SD Q I+DY+ V+ L ++GV +L GA+
Sbjct: 123 QQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQY 182

Query: 182 FALRAWLDPDKLAAYGLTATDVSTALSGND------YISGLGTTKGRLVQVDLAAATNLH 235
A+R WLD D L Y LT DV L + + G G+ + + A T
Sbjct: 183 -AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFK 241

Query: 236 SLEQYRNLVVK-QVNGAIIRLRDVASVTLGNDDYDTATRFNGKTAVYIGIQVAPTANLLD 294
+ E++ + ++ +G+++RL+DVA V LG ++Y+ R NGK A +GI++A AN LD
Sbjct: 242 NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALD 301

Query: 295 VINGVKAVFPDLQQQLPNGIRSEIIYDSTDFVNSSIDEVEHTLVEAIGIVIVVVFMFLGS 354
+KA +LQ P G++ YD+T FV SI EV TL EAI +V +V+++FL +
Sbjct: 302 TAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQN 361

Query: 355 WRSVLIPVIAIPLSLIGTFGLMLLMGFSINLLTLLALVLAIGLVVDDAIIVVEAVHR-LI 413
R+ LIP IA+P+ L+GTF ++ G+SIN LT+ +VLAIGL+VDDAI+VVE V R ++
Sbjct: 362 MRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMM 421

Query: 414 DEGVSPVEAAIQSARALASPIIAMTVVLIAVYVPVGFQGGLTGALFTEFAFTLAGAVTVS 473
++ + P EA +S + ++ + +VL AV++P+ F GG TGA++ +F+ T+ A+ +S
Sbjct: 422 EDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALS 481

Query: 474 AVIALTLSPMNCAWLLRRTERGSTTLEARLVRAIDHVLERLAHAYRRCLESVFSTKPVVV 533
++AL L+P CA LL+ + + + + Y + + + +
Sbjct: 482 VLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYL 541

Query: 534 VFAFLLLAMIPWLYVSSTSELAPDEDQGVVLTLTNSAPSATLEQRQFYAQQLHD--LIAD 591
+ L++A + L++ S P+EDQGV LT+ AT E+ Q Q+ D L +
Sbjct: 542 LIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNE 601

Query: 592 RPETRTVFQI-------DASTQVIGGWVLKPWDQRKATTKTLQPEFQQMLNGIAGARSVA 644
+ +VF + A + LKPW++R + + + + R
Sbjct: 602 KANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDGF 661

Query: 645 FQPSPLPGSNGLPIQ--FVIGSTGTFNQLNDVSRDLMNKALA-----SGKFIFLDSDLKI 697
P +P L F +D N+ L + + +
Sbjct: 662 VIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGLE 721

Query: 698 DKPQTTVVFDRDKAADLGLKMSDLGGALATMLGGNYLDYFSLEGRSYKVIPQVRQNDRLT 757
D Q + D++KA LG+ +SD+ ++T LGG Y++ F GR K+ Q R+
Sbjct: 722 DTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRML 781

Query: 758 DAQLLDYSIRAGSSTMVPLATVAHLEHKTIPESLNHFQQINAATISGVAMPGVSMGDALA 817
+ +R+ + MVP + L + + + I G A PG S GDA+A
Sbjct: 782 PEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAMA 841

Query: 818 TLKELGKSLPSGYSIDYGGASRQLEQESGGFVTTFVFALIIIFLALAALFNSFVDPIIIL 877
++ L LP+G D+ G S Q + +++FL LAAL+ S+ P+ ++
Sbjct: 842 LMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSVM 901

Query: 878 VSVPMSLAGALIFISLGIGGASINIYTQVGLVTLMGLVSKHGILMVEVANELRE-QGKDR 936
+ VP+ + G L+ + ++Y VGL+T +GL +K+ IL+VE A +L E +GK
Sbjct: 902 LVVPLGIVGVLLA--ATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGV 959

Query: 937 TEAIIEAATIRLRPILMTTAAMVLGVIPLITASGAGAVSRFNMGLVIATGLSIGTLFTLF 996
EA + A +RLRPILMT+ A +LGV+PL ++GAG+ ++ +G+ + G+ TL +F
Sbjct: 960 VEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIF 1019

Query: 997 VVPVAYALI 1005
VPV + +I
Sbjct: 1020 FVPVFFVVI 1028


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
BBta_2738  RTXTOXIND  46     1e-07    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 46.4 bits (110), Expect = 1e-07
Identities = 32/144 (22%), Positives = 57/144 (39%), Gaps = 19/144 (13%)

Query: 152 ALVTQQKAIVEQK----TLKAPFAGR-LGIRAVDLGQYLGAGTTIVTL-QSLDPIFVDFY 205
L+T + A E++ ++AP + + ++ G + T++ + D + V
Sbjct: 312 GLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTAL 371

Query: 206 LPQQSLAEIRIGQDVGVAVDAFPGERF---TGQISAINAK--VDSTSRNV------QVRA 254
+ + + I +GQ+ + V+AFP R+ G++ IN D V
Sbjct: 372 VQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNVIISIEEN 431

Query: 255 TL--RNNQARLLPGMFAKVEIATG 276
L N L GM EI TG
Sbjct: 432 CLSTGNKNIPLSSGMAVTAEIKTG 455



Score = 42.5 bits (100), Expect = 2e-06
Identities = 18/92 (19%), Positives = 37/92 (40%), Gaps = 3/92 (3%)

Query: 75 QVAGIVQSIQFNSGDSVEAGQPLLQLVATDDVAKLQSLQATAENYAIVVKRDE---EQIK 131
IV+ I G+SV G LL+L A A Q++ + R + I+
Sbjct: 102 IENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIE 161

Query: 132 FNAVSQATLDSDRANLKNAQALVTQQKAIVEQ 163
N + + L + ++ V + +++++
Sbjct: 162 LNKLPELKLPDEPYFQNVSEEEVLRLTSLIKE 193


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
BBta_2739  RTXTOXIND  51     1e-08    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 50.6 bits (121), Expect = 1e-08
Identities = 20/101 (19%), Positives = 37/101 (36%)

Query: 175 RQVEALQAQADYQAFRMEATYLSLTANVVTAAITDASLRDQIAATEEIIKAETDQLERIQ 234
+V L + Q + N+ ++ +I E + + E +L+
Sbjct: 182 EEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFS 241

Query: 235 RQFDVGAISKSDVLSQEATLAQAKATLAPLQKQLAQQRNQL 275
AI+K VL QE +A L + QL Q +++
Sbjct: 242 SLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEI 282


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BBta_2740  HTHTETR  75     3e-18    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 74.7 bits (183), Expect = 3e-18
Identities = 35/193 (18%), Positives = 62/193 (32%), Gaps = 12/193 (6%)

Query: 33 AGRTERKHTQILESARTLFLESGFDTTSVDAIARHAGVSKATLYAHFTDKDALLLALVEE 92
+ IL+ A LF + G +TS+ IA+ AGV++ +Y HF DK L + E
Sbjct: 6 KQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWEL 65

Query: 93 DCERRRDPLWMPHDRTIDLERDLRE-IGQRFLSIFLDDQALAMHRLIMSCAARY-PAIAE 150
+ + + D + + + + RL+M + E
Sbjct: 66 SESNIGE---LELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGE 122

Query: 151 AFMRAGPDRCDAE-----VAAFLRAAKAEGLIDVP-NVRLAATQFLMLIQGRLPLTWALS 204
+ R + L+ ++ R AA I G L W +
Sbjct: 123 MAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISG-LMENWLFA 181

Query: 205 LQAPSAAESRAQL 217
Q+ +
Sbjct: 182 PQSFDLKKEARDY 194


90  BBta_2785  BBta_2789  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BBta_2785  -3      9        0.867118   major facilitator superfamily permease
BBta_2786  -2      11       -0.173222  transcriptional regulator FixK
BBta_2787  -1      10       -1.033029  response regulator receiver protein
BBta_2788  0       11       -1.220112  response regulator FixJ
BBta_2789  1       12       -0.718763  PAS/PAC sensor signal transduction histidine
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BBta_2785  TCRTETA  47     7e-08    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 47.1 bits (112), Expect = 7e-08
Identities = 35/139 (25%), Positives = 62/139 (44%), Gaps = 9/139 (6%)

Query: 303 LTSPSVGAVAQQYGRRPLLMLGFSALAIRGALFAVVHDPYILVAVQIFDGMTAAVFAVMI 362
+P +GA++ ++GRRP+L++ + A+ A+ A ++L +I G+T A AV
Sbjct: 58 ACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAV-A 116

Query: 363 PLTVADVAFGSG---HFNLAQGIVGTATGIGASLSTVVAGFVSDRFGSATAFLGLSGVAL 419
+AD+ G HF G + G G V+ G + F F + +
Sbjct: 117 GAYIADITDGDERARHF----GFMSACFGFGMVAGPVLGGLMGG-FSPHAPFFAAAALNG 171

Query: 420 TGLVLIALLMPETGQHARR 438
+ L+PE+ + RR
Sbjct: 172 LNFLTGCFLLPESHKGERR 190



Score = 29.4 bits (66), Expect = 0.027
Identities = 32/146 (21%), Positives = 54/146 (36%), Gaps = 11/146 (7%)

Query: 50 FIFFLADVQTGFGPFIAVYLTTQK--WTQAQIGLVLSIGGIVGLIGQMPGGAIIDAAKSE 107
+FF+ + + V + W IG+ L+ GI+ + Q + A E
Sbjct: 217 AVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGE 276

Query: 108 RRV--AGLAVAAIGCCALAYAV--WPIFPVVAGAATLHAAASCVLGPAIAAISLGLVGPR 163
RR G+ G LA+A W FP++ + A+ + PA+ A+ V
Sbjct: 277 RRALMLGMIADGTGYILLAFATRGWMAFPIM-----VLLASGGIGMPALQAMLSRQVDEE 331

Query: 164 GMAERLGRNARFASLGNGVAAAIMGT 189
+ G A SL + V +
Sbjct: 332 RQGQLQGSLAALTSLTSIVGPLLFTA 357


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
BBta_2787  HTHFIS  86     5e-23    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 86.4 bits (214), Expect = 5e-23
Identities = 35/128 (27%), Positives = 55/128 (42%)

Query: 5 TKPKVYVADDDADVLGSLRFLLEADGFDVRTFRSGIALLNASDHAPVDCFVVDYKMPELN 64
T + VADDDA + L L G+DVR + L D V D MP+ N
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 65 GVELAERLRKQGATAPVVLITGHYDDRLEARAAAVGIQDFLLKPLLDDNLVKRIRKAIAN 124
+L R++K PV++++ +A+ G D+L KP L+ I +A+A
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 125 EHSTTAQR 132
++
Sbjct: 122 PKRRPSKL 129


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
BBta_2788  HTHFIS  126    2e-36    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 126 bits (319), Expect = 2e-36
Identities = 41/149 (27%), Positives = 71/149 (47%)

Query: 4 RAKVYVIDDDEAMRDSLNFLLDSSGFEVVLFEHAQSFLEHLPSLSFGCVVSDVRMPGIDG 63
A + V DDD A+R LN L +G++V + +A + + + VV+DV MP +
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 64 IELLKRMKAAGSSFPILVMTGHGDVPLAVEAMKLGAVDFLEKPFEDERLVAMIETAIRQA 123
+LL R+K A P+LVM+ A++A + GA D+L KPF+ L+ +I A+ +
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122

Query: 124 EPAAKNESITQDILGRIASLSPRERQVMD 152
+ + S +++
Sbjct: 123 KRRPSKLEDDSQDGMPLVGRSAAMQEIYR 151


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
BBta_2789  INVEPROTEIN  30     0.014    Salmonella/Shigella invasion protein E (InvE) signat...
		>INVEPROTEIN#Salmonella/Shigella invasion protein E (InvE) signature.

Length = 372

Score = 30.5 bits (68), Expect = 0.014
Identities = 27/131 (20%), Positives = 50/131 (38%), Gaps = 17/131 (12%)

Query: 247 HDLTEHQQTQARLRELQSELVHVSRLSAMGEMASALAHELNQ-PLAAISNYMKGS-RRLL 304
H + + Q + + + EM++ALA N+ S+ + S R+L
Sbjct: 27 HTDAQQAEIQQAAEDSSPGAEVQKFVQSTDEMSAALAQFRNRRDYEKKSSNLSNSFERVL 86

Query: 305 AGSSDPNTGKI-----------ESAMDRAAEQAIRAGQIIRRLRDFVSRGESE----KRV 349
+ P +I E + +A ++ LR+ + R + E K++
Sbjct: 87 EDEALPKAKQILKLISVHGGALEDFLRQARSLFPDPSDLVLVLRELLRRKDLEEIVRKKL 146

Query: 350 ESLSKLIEEAG 360
ESL K +EE
Sbjct: 147 ESLLKHVEEQT 157


91  BBta_2842  BBta_2849  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BBta_2842  -1      8        -1.265194  hypothetical protein
BBta_2843  -1      9        -1.186561  histidine kinase, HAMP region
BBta_2844  -1      13       -0.248813  cytochrome P450
BBta_2845  0       14       -0.547010  TetR family transcriptional regulator
BBta_2846  0       17       -0.011615  RND efflux membrane fusion protein
BBta_2847  0       17       0.713052   RND family multidrug efflux protein
BBta_2848  0       16       1.476514   ABC transporter permease
BBta_2849  0       16       1.324338   ABC transporter permease
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_2842  NUCEPIMERASE  47     5e-08    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 46.7 bits (111), Expect = 5e-08
Identities = 28/121 (23%), Positives = 43/121 (35%), Gaps = 19/121 (15%)

Query: 3 GATGTIGRATVRALVARGHEVVCF----------IRPHR-DVVEVPGAALRIGDVTDPVS 51
GA G IG + L+ GH+VV ++ R +++ PG D+ D
Sbjct: 7 GAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDLAD-RE 65

Query: 52 LARDGFRGEPFDAVVSCMASRTGV---PGDAQAIDDQ---AHVNVLDAACRAGITHFVLL 105
D F F+ V R V + A D +N+L+ I H +
Sbjct: 66 GMTDLFASGHFERVFI-SPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQHLLYA 124

Query: 106 S 106
S
Sbjct: 125 S 125


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BBta_2845  HTHTETR  62     1e-13    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 61.6 bits (149), Expect = 1e-13
Identities = 29/149 (19%), Positives = 56/149 (37%), Gaps = 8/149 (5%)

Query: 8 SDLRRQLILGAAKRCFARHGYVGTTTKSVAAAAAISEALLFKHFSSKAALYAEILAEECE 67
+ RQ IL A R F++ G T+ +A AA ++ ++ HF K+ L++EI
Sbjct: 9 AQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSES 68

Query: 68 ADPDFELLLGQEPSTVTLVKLMAGMVRHFLSTADGANEEEEQRLRLMITSHLDDGEFARL 127
+ EL + L L ++ + EE +RL + I H +
Sbjct: 69 NIGELELEYQAKFPGDPLSVLREILIHVL----ESTVTEERRRLLMEIIFHKCEFVGEMA 124

Query: 128 LYTKIDDLIG----AKFVDSLDSAVASGD 152
+ + + + +L + +
Sbjct: 125 VVQQAQRNLCLESYDRIEQTLKHCIEAKM 153


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
BBta_2846  RTXTOXIND  65     1e-13    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 64.9 bits (158), Expect = 1e-13
Identities = 36/180 (20%), Positives = 61/180 (33%), Gaps = 17/180 (9%)

Query: 129 QGDLANFKAQVTVAQLSLDRAKQLASRQFGPQATVDQAQAAYDQAQAGIAKTEALIAQKL 188
+ L ++++ A+ QL + + Q +AK E +
Sbjct: 272 KSQLEQIESEILSAKEEYQLVTQLFKNEILDKLR--QTTDNIGLLTLELAKNEERQQASV 329

Query: 189 VRAPFDGDLGVRKV-EVGQYLSAGTQIVSLT-DLSELWVNFTVTEKDSGQLKVGQTVRVG 246
+RAP + KV G ++ ++ + + L V V KD G + VGQ +
Sbjct: 330 IRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIK 389

Query: 247 VDAYPGK---VFEGKITTIEPQIATD---------TRNIRVQATIANPDHI-LKPGMFAT 293
V+A+P GK+ I D +I +I L GM T
Sbjct: 390 VEAFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVT 449



Score = 37.1 bits (86), Expect = 9e-05
Identities = 26/181 (14%), Positives = 63/181 (34%), Gaps = 33/181 (18%)

Query: 14 IDDKPRKRTRPVLWFTIVAILLAALVGGFVWFNMFRDKMIAQFFANMKPPPINVSMATAT 73
I+ +R R V +F ++ LV F+ + +ATA
Sbjct: 49 IETPVSRRPRLVAYF-----IMGFLVIAFILS----------VLG-----QVE-IVATAN 87

Query: 74 AEVVPNLLTAVGDLAAVHQVNVTSDASGRITDIMFTPGTTVKQGTPLVQLYDAPEQGDLA 133
++ + + + + +I+ G +V++G L++L + D
Sbjct: 88 GKLTH----------SGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTL 137

Query: 134 NFKAQVTVAQLSLDRAKQLASRQFGPQATVDQAQAAYDQAQAGIAKTEALIAQKLVRAPF 193
K Q ++ Q L++ + + + + + + +++ E L L++ F
Sbjct: 138 --KTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQF 195

Query: 194 D 194

Sbjct: 196 S 196


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_2847  ACRIFLAVINRP  811    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 811 bits (2096), Expect = 0.0
Identities = 315/1029 (30%), Positives = 535/1029 (51%), Gaps = 30/1029 (2%)

Query: 5 DIFIKRPVLSLVVSMLILLLGLRAAFELPIRQYPKLSNTVVNVTTVYPGASADLIQGFIT 64
+ FI+RP+ + V+++++++ G A +LP+ QYP ++ V+V+ YPGA A +Q +T
Sbjct: 3 NFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVT 62

Query: 65 TPLEQAVASAEGVDYITSSSVQ-GTSTIQVFIKLNFDPNQALTEVLAKVNSVRYLIPRES 123
+EQ + + + Y++S+S G+ TI + + DP+ A +V K+ L+P+E
Sbjct: 63 QVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEV 122

Query: 124 NDPIVTKSTGQTTSVMYLGFSSE--ELSGSAISDYLTRVVQPVLSTVDGVAAANILGGQT 181
++ ++ +M GF S+ + ISDY+ V+ LS ++GV + G Q
Sbjct: 123 QQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQ- 181

Query: 182 FAMRIWLNPEKMAGRNVSPADVAAAISANNFQSAAGQAKGYFIVS------NVTTNTGLT 235
+AMRIWL+ + + ++P DV + N Q AAGQ G + ++ T
Sbjct: 182 YAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFK 241

Query: 236 DVNQFKRMIVKA-KDGGFVRMEDIATVELAAQSTDASVAFNGERAIFIGVDATPQGNPLN 294
+ +F ++ ++ DG VR++D+A VEL ++ + NG+ A +G+ N L+
Sbjct: 242 NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALD 301

Query: 295 IVKGVRALFPDLERNLPPSMKMKVAYDSTKFIQSSIDEVEHTLGEAVLIVIVVIFLFLAS 354
K ++A +L+ P MK+ YD+T F+Q SI EV TL EA+++V +V++LFL +
Sbjct: 302 TAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQN 361

Query: 355 LRSVIIPVVTIPLSLIGVCTMMLALGFSINLLTLLAMVLAIGLVVDDAIVVVENIHRHL- 413
+R+ +IP + +P+ L+G ++ A G+SIN LT+ MVLAIGL+VDDAIVVVEN+ R +
Sbjct: 362 MRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMM 421

Query: 414 EEGRAPVQAALQGAREIVGPVISMTITLAAVYAPIGFLGGLTGSLFREFAFTLAGSVIVS 473
E+ P +A + +I G ++ + + L+AV+ P+ F GG TG+++R+F+ T+ ++ +S
Sbjct: 422 EDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALS 481

Query: 474 GVIALTLSPMMCSVLLR------SAEEGRFARIVNRVFGAMTRWYGRQLDRSLDYRPVTA 527
++AL L+P +C+ LL+ +G F N F Y + + L
Sbjct: 482 VLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYL 541

Query: 528 LFAVTILGLVGFLYMHTSKELAPEEDQGIVFSITKAPKYANIDYINFYTDKLDKVFQKFP 587
L I+ + L++ PEEDQG+ ++ + P A + D++ + K
Sbjct: 542 LIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNE 601

Query: 588 ETDLRFVLN------GINGPQGGFAGMLLKPWDERTR---SSIKLKPLVQAELSKIEGIN 638
+ ++ V G A + LKPW+ER S+ + + EL KI
Sbjct: 602 KANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDGF 661

Query: 639 AFAFNLPA--LPGGPGGLPVQMVISSTSGFQAVYEQMAKLKDAARKSGIFIVS-DSDLEF 695
FN+PA G G +++ + G A+ + +L A + +VS +
Sbjct: 662 VIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGLE 721

Query: 696 NQPVVKVWINRSKASDLGITMQSIGNTLAVLLGGNYINRFNLEGRSYQVIPQVPRDKRLS 755
+ K+ +++ KA LG+++ I T++ LGG Y+N F GR ++ Q R+
Sbjct: 722 DTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRML 781

Query: 756 PDSLSGYYVATAAGQQVPLSTVVTIETATDPNSLTHFNQLNSATFSAVPMPGVTVGQAVD 815
P+ + YV +A G+ VP S T L +N L S PG + G A+
Sbjct: 782 PEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAMA 841

Query: 816 FLEKQAKNLPSGFSHDYLADARQYVQEGNQLAITFGFALIIIFLVLAAQFESLRDPLVIM 875
+E A LP+G +D+ + Q GNQ + +++FL LAA +ES P+ +M
Sbjct: 842 LMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSVM 901

Query: 876 ISVPMAIVGALIPLFFGLATMNIYTQVGLLTLVGLITKHGILMVEFANELQLKEGVDRRT 935
+ VP+ IVG L+ ++Y VGLLT +GL K+ IL+VEFA +L KEG
Sbjct: 902 LVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVVE 961

Query: 936 AIETSARIRLRPILMTTAAMVTGLIPLLTASGAGAASRFSIGLVVVAGMSIGTLFTLFVL 995
A + R+RLRPILMT+ A + G++PL ++GAG+ ++ ++G+ V+ GM TL +F +
Sbjct: 962 ATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFV 1021

Query: 996 PAVYTVIAT 1004
P + VI
Sbjct: 1022 PVFFVVIRR 1030


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_2849  SHAPEPROTEIN  28     0.044    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 28.2 bits (63), Expect = 0.044
Identities = 23/89 (25%), Positives = 38/89 (42%), Gaps = 6/89 (6%)

Query: 38 MAGEERDPAVIEQIRKQYRLDQPLPVQYGYWIKGVLSGDF--GESLRIKMPVRDLILQKL 95
+ G+ D A+I +R+ Y + IK + + E I++ R+L + +
Sbjct: 189 IGGDRFDEAIINYVRRNYGSL--IGEATAERIKHEIGSAYPGDEVREIEVRGRNLA-EGV 245

Query: 96 PVTLQLASMAIVIAFLIGIPAGIISAVKR 124
P L S I+ A L GI+SAV
Sbjct: 246 PRGFTLNSNEILEA-LQEPLTGIVSAVMV 273
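
Each hit in this report is summarised by a "Score = ..., Expect = ..." line and an "Identities = ..." line. As a rough sketch of how those two lines could be read programmatically, the Python below extracts the values and compares the E-value with the 0.05 RPSBlast cutoff listed under the input parameters. The function name `parse_hit`, the dictionary keys, and the use of `<=` for the cutoff are assumptions made for illustration; they are not part of PredictBias or BLAST.

```python
import re

# Hypothetical helper for illustration; not part of PredictBias or BLAST.
SCORE_RE = re.compile(r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)")
IDENT_RE = re.compile(r"Identities\s*=\s*(\d+)/(\d+)")

# RPSBlast E-value from the report's input parameters; "<=" is an assumption.
E_VALUE_CUTOFF = 0.05


def parse_hit(score_line, identities_line):
    """Extract scores, E-value and identity counts from the two summary lines."""
    score = SCORE_RE.search(score_line)
    ident = IDENT_RE.search(identities_line)
    if score is None or ident is None:
        raise ValueError("unrecognised summary line")
    expect_text = score.group(3)
    # Some BLAST output may print E-values such as "e-100" without a leading digit.
    if expect_text.startswith("e"):
        expect_text = "1" + expect_text
    expect = float(expect_text)
    matches, length = int(ident.group(1)), int(ident.group(2))
    return {
        "bits": float(score.group(1)),
        "raw_score": int(score.group(2)),
        "expect": expect,
        # Integer percentage, truncated rather than rounded.
        "identity_pct": 100 * matches // length,
        "significant": expect <= E_VALUE_CUTOFF,
    }


hit = parse_hit(
    "Score = 28.2 bits (63), Expect = 0.044",
    "Identities = 23/89 (25%), Positives = 38/89 (42%), Gaps = 6/89 (6%)",
)
print(hit)
```

The truncated percentage mirrors the figures printed in this report (23/89 is shown as 25%), which is why integer division is used instead of rounding.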


92  BBta_2988  BBta_2992  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BBta_2988  0       15       0.394120   hypothetical protein
BBta_2989  1       16       0.783890   cation/heavy metal efflux system protein
BBta_2990  1       14       1.080182   ABC transporter ATP-binding protein
BBta_2991  2       15       1.133316   branched chain amino acid ABC transporter
BBta_2992  1       13       0.151824   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_2988  RTXTOXIND     49     1e-08    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 49.4 bits (118), Expect = 1e-08
Identities = 44/248 (17%), Positives = 83/248 (33%), Gaps = 40/248 (16%)

Query: 131 ADTVQAQNDFITAITSMNKAKSALDLADIQYKRAKDLFEGRAIPLKDFQQAEATSVQAQN 190
+ + + + +T + +N+ ++ + + L +AI + E V+A N
Sbjct: 207 LNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVN 266

Query: 191 DMRSSQTALEAARN-----KLRILGFTD--------------AAIADFQNKGTIDRE--- 228
++R ++ LE + K T I + + E
Sbjct: 267 ELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQ 326

Query: 229 -ITIYAPIGGTVVQRKV-GPGQYVNAGASDPVFVI-GDLSTVWLTAFVRESDAALVDVGE 285
I AP+ V Q KV G V + + VI + T+ +TA V+ D ++VG+
Sbjct: 327 ASVIRAPVSVKVQQLKVHTEGGVVTTA--ETLMVIVPEDDTLEVTALVQNKDIGFINVGQ 384

Query: 286 EILVNVLALPGR---PLKASINYVA--TSIDPAT------RRLLVRATI--DNKDGLLKP 332
++ V A P L + + D + + NK+ L
Sbjct: 385 NAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNVIISIEENCLSTGNKNIPLSS 444

Query: 333 EMFANVTI 340
M I
Sbjct: 445 GMAVTAEI 452



Score = 43.7 bits (103), Expect = 1e-06
Identities = 20/125 (16%), Positives = 38/125 (30%), Gaps = 4/125 (3%)

Query: 85 ITEGKVAVDEDRSTPVFSPYAGRVTKLLARPGETVRQGQPLFVIEAADTVQAQNDFITAI 144
GK+ RS + V +++ + GE+VR+G L + A +A +
Sbjct: 85 TANGKLTHS-GRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGA-EADTLKTQSS 142

Query: 145 TSMNKAKSALDLADIQYKRAKDLFEGRAIPLKDFQQAEATSVQAQNDMRSSQTALEAARN 204
+ + + L E + FQ V + Q +N
Sbjct: 143 LLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQ--FSTWQN 200

Query: 205 KLRIL 209
+
Sbjct: 201 QKYQK 205


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_2989  ACRIFLAVINRP  572    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 572 bits (1477), Expect = 0.0
Identities = 227/1063 (21%), Positives = 426/1063 (40%), Gaps = 61/1063 (5%)

Query: 4 LVAIAVQRRFLMVGMFVAVLIGGLIAFRQLNIEAYPDPTPPMVDIVTQSPGLSAEEIERY 63
+ ++R + + +++ G +A QL + YP PP V + PG A+ ++
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 64 ITIPIETQVAGIKNLRTIRTISL-YGLSDVKLQFSFDYTYDEALQQVLNRLSQLSP-LPG 121
+T IE + GI NL + + S G + L F D A QV N+L +P LP
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 122 NVTPQISPL--SAVGEIYRYRLRGP-PGYSVLDLKTLQDWVLQRRFRAVPGVIDVTGWGG 178
V Q + S+ + PG + D+ ++ + GV DV +G
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 179 KTKTYEIQVDFNKLVANGLTLPQLLQAVGNSNINVGGNTV------EIGTQSAVVRGVGL 232
+ I +D + L LT ++ + N + + +A +
Sbjct: 181 Q-YAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239

Query: 233 IRSMDDLAN-TMVSQSGGNPVLVKDVANVTVAEKPRLGIAGLNNNDDIVQGIVLMRRGEQ 291
++ ++ T+ S G+ V +KDVA V + + IA +N + + G
Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAG-LGIKLATGAN 298

Query: 292 SSPTIARVEQLVDQINNSTILPPGVRIERIYDRKDLIDTTTHTVLHNMVVGIGLIVLLQW 351
+ T ++ + ++ P G+++ YD + + H V+ + I L+ L+ +
Sbjct: 299 ALDTAKAIKAKLAELQ--PFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMY 356

Query: 352 VFLGNLRSALIVGATIPFALFFAVIILVLRGESANLLSVG--AIDFGLIVDATVIMVEAI 409
+FL N+R+ LI +P L IL G S N L++ + GL+VD +++VE +
Sbjct: 357 LFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENV 416

Query: 410 FRRLSQTTPLSAAEESHISPETMMGMKSHAILSAAADVSRSIFFAAAIIIAAFLPLFTLS 469
R + E + P+ A + + + ++ A ++ A F+P+
Sbjct: 417 ERVM---------MEDKLPPK-------EATEKSMSQIQGALVGIAMVLSAVFIPMAFFG 460

Query: 470 GVEGNIFGPMARTYAYALAGGLLATFTVTPALSAIILPAHVEETETRLMRL--------- 520
G G I+ + T A+A +L +TPAL A +L E
Sbjct: 461 GSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFD 520

Query: 521 -LHAIYAPVLRWAVGNRNLVITGAIGLVLLTVVVGRLLGLEFLPKLEEGNLWIRATLPPT 579
Y + +G+ + +V VV+ L FLP+ ++G LP
Sbjct: 521 HSVNHYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAG 580

Query: 580 ISLQEGNAYVNEMRK--MIRARPEVEAVVSQHGRPDDGTDAAGFFNAEFFAPLKPVKDWP 637
+ + ++++ + + VE+V + +G G F LKP ++
Sbjct: 581 ATQERTQKVLDQVTDYYLKNEKANVESVFTVNGFSFSGQAQN---AGMAFVSLKPWEERN 637

Query: 638 GTK-DKDQLTAELLKQLDDRFPGVEFNFSQYLQDNVSEAVSGVKGENSIKLYGNDLQALT 696
G + + + +L G F+ + V + I G ALT
Sbjct: 638 GDENSAEAVIHRAKMELGKIRDGFVIPFN--MPAIVELGTATGFDFELIDQAGLGHDALT 695

Query: 697 DTANKIKQVLSTVQG-VTDLAVFTSLGQPTVQIDIDRAKAARYGLAPGDINATIKVAIGG 755
N++ + + + + ++++D+ KA G++ DIN TI A+GG
Sbjct: 696 QARNQLLGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGG 755

Query: 756 DTAGDLYEPGSDRHFPIIVRLAPEYRKSAEAIENLRIGAQGPNGVTQIPLSEVATIKLVS 815
D + G R + V+ ++R E ++ L + + NG +P S T V
Sbjct: 756 TYVNDFIDRG--RVKKLYVQADAKFRMLPEDVDKLYVRS--ANGEM-VPFSAFTTSHWVY 810

Query: 816 GAAYIYREQQERYLPIKFSVRERDLGSAIQEAQQKVAEQVQLPAGSRVEWVGEFGNLQDA 875
G+ + R + I+ + +A + LPAG +W G + +
Sbjct: 811 GSPRLERYNGLPSMEIQGEAAPGTSSGDAMALMENLASK--LPAGIGYDWTGMSYQERLS 868

Query: 876 IKRLSIVVPISLALIGVLLWFNFGSMADTLLAMSVIPMAIFGGVVGLVLSGIPFSVSAAI 935
+ +V IS ++ + L + S + + M V+P+ I G ++ L V +
Sbjct: 869 GNQAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMV 928

Query: 936 GFIALFGIAVMDGIIILSQFNQLID-EGVDRIEAVIRTGELQLRPVLMTCVVAGIGLLPA 994
G + G++ + I+I+ L++ EG +EA + ++LRP+LMT + +G+LP
Sbjct: 929 GLLTTIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPL 988

Query: 995 ALSTGIGSQVQKPLAIVVVTGMMLAPVVILITLPVLISLFSRR 1037
A+S G GS Q + I V+ GM+ A ++ + +PV + R
Sbjct: 989 AISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVIRRC 1031



Score = 84.9 bits (210), Expect = 1e-18
Identities = 72/364 (19%), Positives = 147/364 (40%), Gaps = 19/364 (5%)

Query: 688 YGNDLQALTDTANK-IKQVLSTVQGVTDLAVFTSLGQPTVQIDIDRAKAARYGLAPGDIN 746
G ++D +K LS + GV D+ +F Q ++I +D +Y L P D+
Sbjct: 147 PGTTQDDISDYVASNVKDTLSRLNGVGDVQLFG--AQYAMRIWLDADLLNKYKLTPVDVI 204

Query: 747 ATIKVA----IGGDTAGDLYEPGSDRHFPIIVRLAPEYRKSAEAIENLRIGAQGPNGVTQ 802
+KV G G PG + II + K+ E + + +G +
Sbjct: 205 NQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQT---RFKNPEEFGKVTLRVN-SDG-SV 259

Query: 803 IPLSEVATIKLVSGAAYIYREQQERYLPIKFSVRERDLGSAIQEAQ---QKVAE-QVQLP 858
+ L +VA ++L G Y + ++ +A+ A+ K+AE Q P
Sbjct: 260 VRLKDVARVEL-GGENYNVIARINGKPAAGLGIKLATGANALDTAKAIKAKLAELQPFFP 318

Query: 859 AGSRVEWVGEFGNL-QDAIKRLSIVVPISLALIGVLLWFNFGSMADTLLAMSVIPMAIFG 917
G +V + + Q +I + + ++ L+ ++++ +M TL+ +P+ + G
Sbjct: 319 QGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLG 378

Query: 918 GVVGLVLSGIPFSVSAAIGFIALFGIAVMDGIIILSQFNQ-LIDEGVDRIEAVIRTGELQ 976
L G + G + G+ V D I+++ + ++++ + EA ++
Sbjct: 379 TFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQI 438

Query: 977 LRPVLMTCVVAGIGLLPAALSTGIGSQVQKPLAIVVVTGMMLAPVVILITLPVLISLFSR 1036
++ +V +P A G + + +I +V+ M L+ +V LI P L + +
Sbjct: 439 QGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLK 498

Query: 1037 RRVR 1040

Sbjct: 499 PVSA 502


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_2991  TCRTETA       32     0.009    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 31.7 bits (72), Expect = 0.009
Identities = 23/91 (25%), Positives = 33/91 (36%), Gaps = 6/91 (6%)

Query: 206 VLVLSAIVSALAGFFYAGFGAVAAPENASFTFGTNLVIMVALGGRGTVIGPVIGALCIEV 265
VL + IV+ + G A GA A + M A G G V GPV+G L
Sbjct: 98 VLYIGRIVAGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGF 157

Query: 266 ASAYLANSLP-YVWELIVGLALIIVILAFPD 295
+ P + + GL + P+
Sbjct: 158 SPH-----APFFAAAALNGLNFLTGCFLLPE 183


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_2992  ACETATEKNASE  29     0.030    Acetate kinase family signature.
		>ACETATEKNASE#Acetate kinase family signature.

Length = 400

Score = 28.6 bits (64), Expect = 0.030
Identities = 20/63 (31%), Positives = 30/63 (47%), Gaps = 5/63 (7%)

Query: 159 GLSTKAVIMNEMLAQGLGVDSSRVRLITFALGSGLAALSGALITPLSSVDPNMGVPWLIG 218
G S K V ++ A+ L +++IT LG+G S A + S+D +MG L G
Sbjct: 180 GTSHKYV--SQRAAEILNKPIESLKIITCHLGNGS---SIAAVKNGKSIDTSMGFTPLEG 234

Query: 219 AFM 221
M
Sbjct: 235 LAM 237


93  BBta_3077  BBta_3085  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BBta_3077  -2      10       1.521349   serine protease do-like
BBta_3078  -1      11       0.902791   two component transcriptional regulator
BBta_3079  -1      13       0.683544   two component LuxR family transcriptional
BBta_3080  -2      12       0.474050   bacteriophytochrome
BBta_3081  -3      12       0.333387   light harvesting protein B-800-850
BBta_3082  -3      11       0.442396   light harvesting protein B-800-850 subunit
BBta_3083  -2      11       0.938356   chlorophyll major facilitator superfamily (MFS)
BBta_3084  -3      10       0.941531   hypothetical protein
BBta_3085  -3      12       1.235022   two-component sensor histidine kinase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_3077  V8PROTEASE    64     3e-13    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 64.3 bits (156), Expect = 3e-13
Identities = 29/160 (18%), Positives = 50/160 (31%), Gaps = 26/160 (16%)

Query: 143 TGQGSGFFISADGYAVTNNHVVDGADKVEVTTD------------DGKTYSAKVIGTDQR 190
T SG + +TN HVVD +G + ++
Sbjct: 101 TFIASGVVV-GKDTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGE 159

Query: 191 TDLALIKVEG-------GSNFPFAKLA-DGKPRIGDWVLAVGNPFGLGGTVTAGIVSAVG 242
DLA++K G A ++ + + ++ + G P +
Sbjct: 160 GDLAIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQNITVTGYPGDKPVATMWESKGKIT 219

Query: 243 RDIGNGPYDDFIQIDAPVNKGNSGGPAFDVDGNVVGVNTA 282
G +Q D GNSG P F+ V+G++
Sbjct: 220 YLKGEA-----MQYDLSTTGGNSGSPVFNEKNEVIGIHWG 254


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_3078  HTHFIS        87     8e-22    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 86.8 bits (215), Expect = 8e-22
Identities = 34/159 (21%), Positives = 64/159 (40%), Gaps = 11/159 (6%)

Query: 7 SAMRLLIIEDDRESADYLVKAFREVGHVADLAGDGEEGLAMAESGDYDVLVVDRMLPKRD 66
+ +L+ +DD L +A G+ + + +GD D++V D ++P +
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 67 GLSLIGALRDKGNRTPVLILSALGQVDDRIKGLRAGGDDYLPKPYAFAELLARVE----- 121
L+ ++ PVL++SA IK G DYLPKP+ EL+ +
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 122 --VLSRRHGGPAEETTYKVGD----LELDRLSHRVARGK 154
+ +++ VG E+ R+ R+ +
Sbjct: 122 PKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTD 160


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_3079  HTHFIS        57     1e-11    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 57.1 bits (138), Expect = 1e-11
Identities = 21/105 (20%), Positives = 41/105 (39%), Gaps = 4/105 (3%)

Query: 18 RIRVFFADDHPIVLDGI-KALIAEDAQLDLVGSALDGPTALRRAIDLRPDVVVLDLSMPG 76
+ ADD + + +AL + + +A + D+VV D+ MP
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAG---DGDLVVTDVVMPD 59

Query: 77 MSGIEVARKLLAACPSSRVLLLTVHEDAAYFRQALEIGIVGYVLK 121
+ ++ ++ A P VL+++ +A E G Y+ K
Sbjct: 60 ENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPK 104


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_3083  TCRTETB       31     0.009    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 31.4 bits (71), Expect = 0.009
Identities = 19/70 (27%), Positives = 30/70 (42%)

Query: 343 AVIAAAPLSSALLFGLGTLLIGFGAGLFGHGTLTATMNLAPQDQAGLALGAWGAVQASAA 402
+VI S L + + G GA F + P++ G A G G++ A
Sbjct: 93 SVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGE 152

Query: 403 GVAIALGGIM 412
GV A+GG++
Sbjct: 153 GVGPAIGGMI 162


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_3085  PF06580       38     6e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 37.9 bits (88), Expect = 6e-05
Identities = 23/111 (20%), Positives = 44/111 (39%), Gaps = 27/111 (24%)

Query: 359 LVENAIKYGKPAALPLDPNAAAATREILIEARREGGQVLLSVTDHGEGIPEAERKHAVER 418
LVEN IK+G + IL++ ++ G V L V + G + ++
Sbjct: 263 LVENGIKHG------IAQLPQGGK--ILLKGTKDNGTVTLEVENTGSLALKNTKE----- 309

Query: 419 FVRLEASRTLPGSGLGLSLASA-VATLHGGE--LRLGDARPGLIATLVLPA 466
+G GL + L+G E ++L + + + A +++P
Sbjct: 310 -----------STGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIPG 349


94  BBta_3419  BBta_3427  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BBta_3419  0       21       -1.905605  amino acid permease
BBta_3420  0       14       -1.912821  cytochrome B561
BBta_3421  1       9        -1.377224  hypothetical protein
BBta_3422  0       10       -1.004104  hypothetical protein
BBta_3423  0       12       -0.854675  hypothetical protein
BBta_3424  0       11       -0.742481  hypothetical protein
BBta_3425  1       11       -0.376855  copper tolerance protein
BBta_3426  1       12       0.280611   cation efflux system protein cusA
BBta_3427  2       15       1.217439   cation efflux system protein cusB
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_3419  RTXTOXINA     31     0.013    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 30.7 bits (69), Expect = 0.013
Identities = 39/154 (25%), Positives = 55/154 (35%), Gaps = 33/154 (21%)

Query: 14 TAQSVAAPADEPRLVRSLTLTHAVLYGMGVTIGAGIYVLV-GIAAGRSGMHAPLAFIGAA 72
T AA + LT VL +G I Y++ A G S A I +A
Sbjct: 265 TRTKAAAGVE---------LTTKVLGNVGKGISQ--YIIAQRAAQGLSTSAAAAGLIASA 313

Query: 73 LVMSFSAASFAELGTRMPVSASEAAYVQ-------------AAFQRKWLSLGTGLLVVLT 119
+ ++ S SF + + + Y Q AAF + TG +
Sbjct: 314 VTLAISPLSFLSIADKFKRANKIEEYSQRFKKLGYDGDSLLAAFHK-----ETGAIDASL 368

Query: 120 ATVSA--ATVSRG-SAGYVAVFVSAPAPLIVCGV 150
T+S A+VS G SA V AP +V V
Sbjct: 369 TTISTVLASVSSGISAAATTSLVGAPVSALVGAV 402


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_3421  TCRTETA       29     0.021    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 29.4 bits (66), Expect = 0.021
Identities = 21/96 (21%), Positives = 34/96 (35%), Gaps = 16/96 (16%)

Query: 255 VLFGLTGGLIPCPAAITVLLLCIQLKQFSLGFVLVLCFGIGLAITMVSAGVLAALSLRHV 314
++ G+TG A + + GF + CFG G+ V G++ S
Sbjct: 104 IVAGITGATGAVAGAYIADITDGDERARHFGF-MSACFGFGMVAGPVLGGLMGGFS---- 158

Query: 315 ERHWSGFSRFAHRAPYVSAALIVLVGLYTGWLGLHE 350
AP+ +AA + + TG L E
Sbjct: 159 -----------PHAPFFAAAALNGLNFLTGCFLLPE 183


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_3424  PF03944       27     0.024    delta endotoxin
		>PF03944#delta endotoxin

Length = 633

Score = 27.3 bits (60), Expect = 0.024
Identities = 20/75 (26%), Positives = 35/75 (46%), Gaps = 3/75 (4%)

Query: 51 KMLGELRSTMMGNTGSETLLDRLTAQEKWLSARL--DATRSLRTTLGNLVATFSDEQKTA 108
++L ELR+ + + + + D L E++L+ RL D + L L A +E
Sbjct: 72 RILSELRNLIFPSGSTNLMQDILRETERFLNQRLNTDTVARVNAELTGLQANV-EEFNRQ 130

Query: 109 ADELLAPNLGIMPMA 123
D L PN +P++
Sbjct: 131 VDNFLNPNRNAVPLS 145


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_3426  ACRIFLAVINRP  693    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 693 bits (1789), Expect = 0.0
Identities = 219/1050 (20%), Positives = 435/1050 (41%), Gaps = 52/1050 (4%)

Query: 11 RNLLLVLFGTGFAAAAGFYALIHLPLDAIPDLSDTQVIVYTEYSGQAPQVIEDQVTYPLT 70
R + AG A++ LP+ P ++ V V Y G Q ++D VT +
Sbjct: 7 RRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQVIE 66

Query: 71 TAMLTVPKSKVVRGFSF-FGVSFVYVIFEDGTDIYWARSRVMEFLNSAASRLPAGV-TPT 128
M + + S G + + F+ GTD A+ +V L A LP V
Sbjct: 67 QNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQQQG 126

Query: 129 IGPDATGVGWVYQYAVMS--KELNLADTRTLQDWNLKFALAKAEGVAEVASIGGFVKQYN 186
I + + ++ +S D N+K L++ GV +V G
Sbjct: 127 ISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA-QYAMR 185

Query: 187 VVLDPQRMRDRGISMQKIRDAIRASNADVGGRTVELS------EFEYVIRGKGYIKSIDD 240
+ LD + ++ + + ++ N + + + + I + K+ ++
Sbjct: 186 IWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKNPEE 245

Query: 241 LGNIVLK-NSGGTPVLLRDVAHVELGPDERRGIAELNGEGEVASGIVLQRFGVNALDVIE 299
G + L+ NS G+ V L+DVA VELG + IA +NG+ A + G NALD +
Sbjct: 246 FGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGK-PAAGLGIKLATGANALDTAK 304

Query: 300 NVKKRFKDIASSLPKSVEIVPVYDRSNLIYAAIDTLKHTLLEESLVVAFVCIVFLLHVRS 359
+K + ++ P+ ++++ YD + + +I + TL E ++V V +FL ++R+
Sbjct: 305 AIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNMRA 364

Query: 360 ALVAILMLPVGVLMAFGAMKLLGLGSNIMSLGGIAIAIGAMVDAAIVMIENAHKHLERAK 419
L+ + +PV +L F + G N +++ G+ +AIG +VD AIV++EN + + K
Sbjct: 365 TLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMMEDK 424

Query: 420 PDQSRVQILIDAAAEVGPALFFSLLIITVSFMPIFTLESQEGRLFSPLAFTKTFSMAAAA 479
+ + +++ AL ++++ F+P+ G ++ + T +MA +
Sbjct: 425 --LPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSV 482

Query: 480 LLSVTLVPALMVIFVRGRIVPEQRNPINRFLIW----------IYRPVIKSVLRAKMLVI 529
L+++ L PAL ++ N F W Y + +L + +
Sbjct: 483 LVALILTPALCATLLKPVSAEHHENK-GGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYL 541

Query: 530 LAAVAVLAISVWPARQLGTEFMPSLNEGTLLYMPTTLPGISVTKAAELM-QMQDRIIKS- 587
L ++A V +L + F+P ++G L M G + + +++ Q+ D +K+
Sbjct: 542 LIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNE 601

Query: 588 FPEVASVYGKAGRAATATDPAPTEMFETVVNLKPKEQW-RAGVTIDSLIAEMDKALQFPG 646
V SV+ G + + F V+LKP E+ + +++I L
Sbjct: 602 KANVESVFTVNGFSFSGQAQNAGMAF---VSLKPWEERNGDENSAEAVIHRAKMELGKIR 658

Query: 647 VSNAWTMPIKARIDMLSTGIRTPIGVKVMGTDLTEIDRLAKQIERVIKTVPGT-SSAYAE 705
+ A +++ + + G + + Q+ + P + S
Sbjct: 659 DGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPN 718

Query: 706 RGIGGYYLEITPDREALARYGLLIQDVQDTIAAALGGQTVTTTVEGRQRFTVNMRYPRDL 765
++ D+E G+ + D+ TI+ ALGG V ++ + + ++
Sbjct: 719 GLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKF 778

Query: 766 RDNPKAIASDVLVPMPSGGAVPLGEVAKIEPARGPTSIRTENGQLATYIYVDIRDRDIGS 825
R P+ + + V +G VP G + NG + I + G+
Sbjct: 779 RMLPEDVDK-LYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAP---GT 834

Query: 826 YVGDAQRAVTESI--QFPAGYYVVWSGQYEYLQRAAARLKIVIPVTLTIIFLLLYLNFRA 883
GDA A+ E++ + PAG W+G + + + ++ ++ ++FL L + +
Sbjct: 835 SSGDAM-ALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYES 893

Query: 884 MTETLIVMLSLPFALVGGIWMMWWLGFNLSVAVAVGFIALAGVAAETGVVMLIYLDHALA 943
+ + VML +P +VG + V VG + G++A+ ++++ +
Sbjct: 894 WSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEF------ 947

Query: 944 EMRARHAAENRTLTRQDLQDAIMEGAVERVRPKMMTVVAIMAGLLPIIWSTGTGSEIMQR 1003
A+ E + +A + R+RP +MT +A + G+LP+ S G GS
Sbjct: 948 ---AKDLMEKEGKG---VVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNA 1001

Query: 1004 IAVPMIGGMISSTLLTLIVIPAIFGLVKGR 1033
+ + ++GGM+S+TLL + +P F +++
Sbjct: 1002 VGIGVMGGMVSATLLAIFFVPVFFVVIRRC 1031


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_3427  RTXTOXIND     51     8e-09    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 50.6 bits (121), Expect = 8e-09
Identities = 31/123 (25%), Positives = 45/123 (36%), Gaps = 15/123 (12%)

Query: 267 APRDGIVLERNA-VEGMRANPGDVLFRIA-DISLVWALVDVAERDLGSIAVGQPVTIRAR 324
AP V + EG + L I + + V +D+G I VGQ I+
Sbjct: 332 APVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVE 391

Query: 325 SFPGRTF---TGSIAVIYPQVNKDTRTA---RVRIEL-------ANSDLALLPDMYVDAE 371
+FP + G + I +D R V I + N ++ L M V AE
Sbjct: 392 AFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVTAE 451

Query: 372 IDT 374
I T
Sbjct: 452 IKT 454



Score = 30.6 bits (69), Expect = 0.014
Identities = 10/51 (19%), Positives = 22/51 (43%), Gaps = 2/51 (3%)

Query: 155 TLHVAVKAPGTIQLDERRVSVIAMRAESFVQKVADVTTGTRVKAGQPLMEI 205
+ + A G + R + + S V+++ V G V+ G L+++
Sbjct: 79 QVEIVATANGKLTHSGRSKEIKPI-ENSIVKEII-VKEGESVRKGDVLLKL 127


95  BBta_3533  BBta_3537  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BBta_3533  -2      10       0.644986   SPINDLY family O-linked N-acetylglucosamine
BBta_3534  -1      12       0.853625   *hypothetical protein
BBta_3535  -2      13       -0.139599  arylsulfatase
BBta_3536  -1      13       -0.860466  component of multidrug efflux pump, mdtB-like
BBta_3537  -1      11       -0.635867  component of multidrug efflux pump, mdtA-like
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_3533  SYCDCHAPRONE  48     2e-08    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 48.0 bits (114), Expect = 2e-08
Identities = 22/115 (19%), Positives = 39/115 (33%), Gaps = 3/115 (2%)

Query: 47 QQILQDLPSHFGALHLLGVSERDSGRFDEAVLVLTRAIESDPRSAEAQSDLGLALFRLGR 106
+ + L+ L ++ SG++++A V D + LG +G+
Sbjct: 26 AMLNEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQ 85

Query: 107 YEEARARYERAIALRPNFPAALTHLGNTLMNLFRFEEAISAHDRAIAL---KPDY 158
Y+ A Y + P H L+ EA S A L K ++
Sbjct: 86 YDLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQELIADKTEF 140



Score = 37.6 bits (87), Expect = 6e-05
Identities = 15/92 (16%), Positives = 30/92 (32%)

Query: 97 LGLALFRLGRYEEARARYERAIALRPNFPAALTHLGNTLMNLFRFEEAISAHDRAIALKP 156
L ++ G+YE+A ++ L LG + +++ AI ++ +
Sbjct: 42 LAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDI 101

Query: 157 DYGEAHANRGMALMFTSRNGEAAESFDRALSL 188
+ L+ EA A L
Sbjct: 102 KEPRFPFHAAECLLQKGELAEAESGLFLAQEL 133


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_3534  HTHTETR       49     3e-09    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 48.9 bits (116), Expect = 3e-09
Identities = 32/181 (17%), Positives = 60/181 (33%), Gaps = 4/181 (2%)

Query: 16 SRQQRSRETTQALLAAGAELLCSRTLAELSISELCVQIGATVGAFYSRFDSKEAYFNALL 75
+Q ++ET Q +L L + ++ S+ E+ G T GA Y F K F+ +
Sbjct: 4 KTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIW 63

Query: 76 ALTLRDGREQLLKLPPAPPG--PGGADDQCLQLVRGIVVWMRRHKGVLRAALMRSESGAN 133
L+ + E L+ PG + + ++ V RR +L +
Sbjct: 64 ELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRR--LLMEIIFHKCEFVG 121

Query: 134 NWTGFKELAQALSERAAMVLLPLLHGARAPGRRAATAQERRTVAFGIQVVLGTLVNAILN 193
++ + L + + L A RR + G + N +
Sbjct: 122 EMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFA 181

Query: 194 D 194

Sbjct: 182 P 182


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_3536  ACRIFLAVINRP  780    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 780 bits (2015), Expect = 0.0
Identities = 275/1040 (26%), Positives = 501/1040 (48%), Gaps = 37/1040 (3%)

Query: 6 ISAPFIKYPIGTSLLMAGILFIGLVAYPLLPVAPLPQVDFPTIQVTANLPGGSPETMATS 65
++ FI+ PI +L ++ G +A LPVA P + P + V+AN PG +T+ +
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 66 VAQPLERQFAQIPGIAQMTSTS-YLGTAAITIQFDLNRNIDGAANDVQGAINAASGQLPK 124
V Q +E+ I + M+STS G+ IT+ F + D A VQ + A+ LP+
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 125 NLPSPPTYRKVNPADSPILLLSATSETLPLT--TVSDAVDAGLAQQISQISGVAQVIIGG 182
+ + S +++ S+ T +SD V + + +S+++GV V + G
Sbjct: 121 EVQQQGISV-EKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFG 179

Query: 183 QQKPSIRIQIDPAKLVAKGLSLEDVRSQIAITTVDAPKGNI------DGERRAYTIYAND 236
Q ++RI +D L L+ DV +Q+ + G + G++ +I A
Sbjct: 180 AQY-AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQT 238

Query: 237 QLTDSKDWNDVII-AYRNGGPLRIRDIGQAVSAAEDAKQAAWANGQRGVFLVIFKQPGAN 295
+ + +++ V + +G +R++D+ + E+ A NG+ L I GAN
Sbjct: 239 RFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGAN 298

Query: 296 VIETVDRIKGLLPRLVAAIPPAIKIDVISDRTTTIRAAVEDVQFTLILTIFLVVMVIFIF 355
++T IK L L P +K+ D T ++ ++ +V TL I LV +V+++F
Sbjct: 299 ALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLF 358

Query: 356 LRSFWATVIPTITVPLALLGACAMMWGVGYTLDNLSLMALTIAVGFVVDDAIVMLENISR 415
L++ AT+IPTI VP+ LLG A++ GY+++ L++ + +A+G +VDDAIV++EN+ R
Sbjct: 359 LQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVER 418

Query: 416 Y-VEEGETPMAAAYKGAKEIGFTIVSISISLVAVLIPLLLMGGIIGRLFREFAVVLAMTI 474
+E+ P A K +I +V I++ L AV IP+ GG G ++R+F++ + +
Sbjct: 419 VMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAM 478

Query: 475 FVSMFVSLTLTPMMASRFLR---NEHAAQHGKFYQWSEAAFDAMLRAYEKGLDLALRWRF 531
+S+ V+L LTP + + L+ EH G F+ W FD + Y + L
Sbjct: 479 ALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTG 538

Query: 532 ATLMVFFATLGLSVYLFILIPKGFFPQQDVGLITAT----SEASQDISFKAMQSRQEALA 587
L+++ + V LF+ +P F P++D G+ + A+Q+ + K + +
Sbjct: 539 RYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYL 598

Query: 588 KIVMADHDVASLAMNIGGSGRAGNNGNMFITLKPREERDA---TAQQIIARLRPQLEKVE 644
K A+ + SG+A N G F++LKP EER+ +A+ +I R + +L K+
Sbjct: 599 KNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIR 658

Query: 645 GARLYMQAAQDIRLGGRPSRTQFEFTLQD---PNLAELNEWAPKILEKMRSLP-QLRDVA 700
+ I G + T F+F L D L + ++L P L V
Sbjct: 659 DGFVIPFNMPAIVELG--TATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVR 716

Query: 701 TDQQADGTTVQLAINRDTASRYGITPQLIDDTLYDAFGQRQVAQYFTQLNSYRVILEILP 760
+ D +L ++++ A G++ I+ T+ A G V + + ++ ++
Sbjct: 717 PNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADA 776

Query: 761 ELQGSLDTLNKLYVKSPTTGDQVPLSTFAS--WTTAPVRPLSISHQGQFPSITISFNLAQ 818
+ + + ++KLYV+S G+ VP S F + W + PS+ I A
Sbjct: 777 KFRMLPEDVDKLYVRSAN-GEMVPFSAFTTSHWVYGSP---RLERYNGLPSMEIQGEAAP 832

Query: 819 GVALGQATDAVQRAMVELGAPSTLNSSFQGTAQAFQQSLSTVPLLILAALVVVYLILGIL 878
G + G A ++ +L P+ + + G + + S + P L+ + VVV+L L L
Sbjct: 833 GTSSGDAMALMENLASKL--PAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAAL 890

Query: 879 YESYIHPITILSTLPSAGVGALAILMAFGYDFSLIALIGVILLIGIVKKNGIMMVDFAIA 938
YES+ P++++ +P VG L F + ++G++ IG+ KN I++V+FA
Sbjct: 891 YESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKD 950

Query: 939 AERDQHLPPEQSIRQAALLRFRPIMMTTMAALLGGVPLMLGNGTGSEIRQPLGYAMVGGL 998
+ ++ A +R RPI+MT++A +LG +PL + NG GS + +G ++GG+
Sbjct: 951 LMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGM 1010

Query: 999 IVSQMLTLFTTPVVYLYLDK 1018
+ + +L +F PV ++ + +
Sbjct: 1011 VSATLLAIFFVPVFFVVIRR 1030



Score = 104 bits (262), Expect = 6e-25
Identities = 70/508 (13%), Positives = 167/508 (32%), Gaps = 33/508 (6%)

Query: 4 GGISAPFIKYPIGTSLLMAGILFIGLVAYPLLPVAPLPQVDFPTIQVTANLPGGSPETMA 63
+ L+ A I+ +V + LP + LP+ D LP G+ +
Sbjct: 527 TNSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERT 586

Query: 64 TSVAQPLERQF--AQIPGIAQMTSTSYLGTAA-------ITIQFDLNRNIDGAANDVQGA 114
V + + + + + + + + + +G N +
Sbjct: 587 QKVLDQVTDYYLKNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAV 646

Query: 115 INAASGQLPK---------NLPSPPTYRKVNPADSPILLLSATSETLPLTTVSDAVDAGL 165
I+ A +L K N+P+ D ++ + ++ A + L
Sbjct: 647 IHRAKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLG----HDALTQARNQLL 702

Query: 166 AQQISQISGVAQVIIGGQQ-KPSIRIQIDPAKLVAKGLSLEDVRSQIAITTVDAPKGNID 224
+ + V G + ++++D K A G+SL D+ I+ +
Sbjct: 703 GMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFI 762

Query: 225 GERRAYTIYA---NDQLTDSKDWNDVIIAYRNGGPLRIRDIGQAVSAAEDAKQAAWANGQ 281
R +Y +D + + + NG + + + NG
Sbjct: 763 DRGRVKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTT-SHWVYGSPRLERYNGL 821

Query: 282 RGVFLVIFKQPGANVIETVDRIKGLLPRLVAAIPPAIKIDVISDRTTTIRAAVEDVQFTL 341
+ + PG + + + ++ L + +P I D + + R + +
Sbjct: 822 PSMEIQGEAAPGTSSGDAMALME----NLASKLPAGIGYDW-TGMSYQERLSGNQAPALV 876

Query: 342 ILTIFLVVMVIFIFLRSFWATVIPTITVPLALLGACAMMWGVGYTLDNLSLMALTIAVGF 401
++ +V + + S+ V + VPL ++G D ++ L +G
Sbjct: 877 AISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGL 936

Query: 402 VVDDAIVMLENI-SRYVEEGETPMAAAYKGAKEIGFTIVSISISLVAVLIPLLLMGGIIG 460
+AI+++E +EG+ + A + I+ S++ + ++PL + G
Sbjct: 937 SAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGS 996

Query: 461 RLFREFAVVLAMTIFVSMFVSLTLTPMM 488
+ + + + +++ P+
Sbjct: 997 GAQNAVGIGVMGGMVSATLLAIFFVPVF 1024


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_3537  RTXTOXIND     40     1e-05    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 40.2 bits (94), Expect = 1e-05
Identities = 17/108 (15%), Positives = 43/108 (39%), Gaps = 2/108 (1%)

Query: 73 NTVQVRTRVDGQIDKIGFTEGQLVKEGDLLVEIDPRPFQAALDQAKAK--RAQDEANLNN 130
+ +++ + + +I EG+ V++GD+L+++ +A + ++ +A+ E
Sbjct: 95 RSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQ 154

Query: 131 ANLDLQRYTKLGEFATRQQTDTQRATVAQLTAQISADEAAIANAQTQL 178
KL E + Q + ++ S + + Q Q
Sbjct: 155 ILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQK 202



Score = 32.1 bits (73), Expect = 0.003
Identities = 13/96 (13%), Positives = 33/96 (34%), Gaps = 3/96 (3%)

Query: 111 QAALDQAKAKRAQDEANLNNANLDLQRY-TKLGEFATRQQTDTQRATVAQLTAQISADEA 169
+ +A + ++ L ++ + + + + Q T I
Sbjct: 258 ENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEI-LDKLRQTTDNIGLLTL 316

Query: 170 AIANAQTQLSYTQVKAPISG-VAGLRQVDIGNIVNA 204
+A + + + ++AP+S V L+ G +V
Sbjct: 317 ELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTT 352


96  BBta_3574  BBta_3584  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BBta_3574  -2      11       1.592902   MFS family transporter
BBta_3575  1       9        0.990299   acetyl-CoA acetyltransferase
BBta_3576  0       11       0.748943   enoyl-CoA hydratase
BBta_3577  0       10       1.021201   TetR family transcriptional regulator
BBta_3578  -1      12       0.443770   methylmalonate-semialdehyde dehydrogenase
BBta_3579  -1      13       0.633939   acyl-CoA dehydrogenase
BBta_3580  -2      12       1.356233   3-hydroxyisobutyryl-CoA hydrolase
BBta_3581  -3      11       1.236140   3-hydroxyisobutyrate dehydrogenase
BBta_3582  -3      12       0.110895   acetyl-CoA synthetase
BBta_3583  -2      15       -0.561508  nodN-like protein
BBta_3584  -2      13       0.001104   3-oxo-acyl-ACP reductase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_3574  TCRTETB       31     0.013    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 30.6 bits (69), Expect = 0.013
Identities = 19/129 (14%), Positives = 47/129 (36%)

Query: 46 LQQEFGWSTAEISSALSIRFVLFGLMAPFAAALLNRYGLRNITLLAQVIVVSALLASLAM 105
+ +F A + + + F + L ++ G++ + L +I +
Sbjct: 40 IANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVG 99

Query: 106 TQIWHLVLLWGFVIGIGTGMTALVLGATIATRWFVARRGLVVGVLTASTATGQLVFLPLL 165
+ L+++ F+ G G ++ +A RG G++ + A G+ V +
Sbjct: 100 HSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIG 159

Query: 166 ASLTERLGW 174
+ + W
Sbjct: 160 GMIAHYIHW 168



Score = 30.2 bits (68), Expect = 0.014
Identities = 38/159 (23%), Positives = 62/159 (38%), Gaps = 5/159 (3%)

Query: 242 VFWMLFATFFICGASTNGLIQVHLIPMCLDFGIPQVQAASLLAAMGIFDFFGTIVSGWLS 301
+ W+ +FF ++ V L + DF P + A + GT V G LS
Sbjct: 16 LIWLCILSFF--SVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLS 73

Query: 302 DRYDNRWLLFWYYGLRGLSLLFLPFTDFSFYGLSLFAMFYGLDWIATVPPTVRLTAQRFG 361
D+ + LL + + + + F SF+ L + A F A P V + R+
Sbjct: 74 DQLGIKRLLLFGIIINCFGSV-IGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYI 132

Query: 362 T-ERANLVFGWVFAAHQLGAGTA-AFGAGLSRTLLQSYL 398
E FG + + +G G A G ++ + SYL
Sbjct: 133 PKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYL 171


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BBta_3577  HTHTETR  57     4e-12    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 56.9 bits (137), Expect = 4e-12
Identities = 30/180 (16%), Positives = 57/180 (31%), Gaps = 13/180 (7%)

Query: 1 MRYSPEHKAETHARIVRKASVRLREKGAHGVGVADLMKEAGLTHGGFYAHFESREALVVE 60
R + + ET I+ A ++G + ++ K AG+T G Y HF+ + L E
Sbjct: 2 ARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61

Query: 61 AFGYAMDRSVARWRKLLDELP--PQKRLAAIVDGYLSRQHRDDLGHGCAVPAL-GAEIAR 117
+ + + + P P L I+ L ++ E
Sbjct: 62 IWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVG 121

Query: 118 ESAKTRKAFAAKQEDLIQLFAGYIV----------DVPPKIARKRASAMLATLMGTLVMA 167
E A ++A + + D+ + A ++ LM + A
Sbjct: 122 EMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFA 181


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits      Score  E-value  Comments
BBta_3579  cdtoxinb  30     0.015    Cytolethal distending toxin B signature.
		>cdtoxinb#Cytolethal distending toxin B signature.

Length = 269

Score = 30.0 bits (67), Expect = 0.015
Identities = 17/71 (23%), Positives = 29/71 (40%), Gaps = 2/71 (2%)

Query: 153 KQFISGAGATDILVAMVRTGADGPGGISTLVIDAKTPGVSFGANERKMGWNAQPTRAVIF 212
+Q ISG A DIL V+ P +PG+ + N++P + I+
Sbjct: 46 RQLISGENAVDIL--AVQEAGSPPSTAVDTGTLIPSPGIPVRELIWNLSTNSRPQQVYIY 103

Query: 213 ENARVPVANRL 223
+A + R+
Sbjct: 104 FSAVDALGGRV 114


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BBta_3580  PF00577  31     0.007    Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 31.4 bits (71), Expect = 0.007
Identities = 25/135 (18%), Positives = 42/135 (31%), Gaps = 16/135 (11%)

Query: 134 GLGFFPDVGGTWLLSRAPGELGAYFGLTGQTMNGPDAIHARFADAVVPTERWPALREVLT 193
G+ + T +L +APG A N AV+P T
Sbjct: 707 GVTLGQPLNDTVVLVKAPGAKDAKVE------NQTGVRTDWRGYAVLPY---------AT 751

Query: 194 KVRPGAVSLEIDQLIAGFATGETAGPVAAQQ-AKIDAWFAHDRMHDIFVALEADGSELAL 252
+ R V+L+ + L V + A + A F + + L + L
Sbjct: 752 EYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKARVGIKLLMTLTHNNKPLPF 811

Query: 253 ATLKTLKEKSPRGMV 267
+ T + G+V
Sbjct: 812 GAMVTSESSQSSGIV 826


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_3584  DHBDHDRGNASE  83     1e-20    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 82.8 bits (204), Expect = 1e-20
Identities = 58/201 (28%), Positives = 90/201 (44%), Gaps = 10/201 (4%)

Query: 6 DGRVAIVTGAGNGLGRAHALGLASRGARVVVNDFGGARDGTGGSLTAAETVVEEIRKAGG 65
+G++A +TGA G+G A A LAS+GA + D+ + E VV ++
Sbjct: 7 EGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEK---------LEKVVSSLKAEAR 57

Query: 66 TAMADGADVSNFEQVTAMVERATKEWGSVDIMCANAGILRDKSFGKMEVADFAKVLDVHL 125
A A ADV + + + R +E G +DI+ AG+LR + ++ V+
Sbjct: 58 HAEAFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNS 117

Query: 126 VGTFYCCKAVWNGMRERNYGRIVMTTSSSGLFGNFGQANYGAAKSGMVGLMNVLAEEGRK 185
G F ++V M +R G IV S+ A Y ++K+ V L E +
Sbjct: 118 TGVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAE 177

Query: 186 TNIRVNIISP-TAATRMTEEL 205
NIR NI+SP + T M L
Sbjct: 178 YNIRCNIVSPGSTETDMQWSL 198


97  BBta_3659  BBta_3668  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BBta_3659  -1      11       -0.307320  ABC transporter permease
BBta_3660  0       12       0.125170   ABC transporter permease ATP-binding protein
BBta_3661  -1      11       -0.435702  HlyD family secretion protein
BBta_3662  -1      12       -0.416602  TetR family transcriptional regulator
BBta_3664  -2      10       0.213215   oxidoreductase
BBta_3665  0       11       1.826472   YeeE/YedE family membrane protein
BBta_3666  2       11       0.588755   ArsR family transcriptional regulator
BBta_3667  1       10       0.483034   thioredoxin
BBta_3668  1       11       -0.187631  transmembrane cytochrome C biogenesis protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_3659  ABC2TRNSPORT  48     2e-08    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 48.0 bits (114), Expect = 2e-08
Identities = 39/165 (23%), Positives = 63/165 (38%), Gaps = 4/165 (2%)

Query: 195 AALIREREHGTIEHLLVMPVTPSEIMLAKI-WSMGLVVLAASTFALGVVIHGLLAVPIEG 253
AA R T E +L + +I+L ++ W+ LA + + G
Sbjct: 89 AAFGRMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGAGIGVVAAALGYTQWL--- 145

Query: 254 SWLLFLAGTALYLFATTSLGIFLATQAGTMPQFGLLLMLVLLPLQLLSGTMTPRESMPEL 313
S L L AL A SLG+ + A + F LV+ P+ LSG + P + +P +
Sbjct: 146 SLLYALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPIV 205

Query: 314 IRDIMLAAPNTHFVMMAQSILFRGAGFDVVWRQFASLFVIGTILF 358
+ P +H + + + I+ DV A I F
Sbjct: 206 FQTAARFLPLSHSIDLIRPIMLGHPVVDVCQHVGALCIYIVIPFF 250


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BBta_3660  PF05272  32     0.020    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 31.6 bits (71), Expect = 0.020
Identities = 11/42 (26%), Positives = 20/42 (47%), Gaps = 2/42 (4%)

Query: 43 CMVGLLGPDGVGKSSLLSLIAGARVIQEGRVEVLGGDMADAA 84
V L G G+GKS+L++ + G + ++ G D+
Sbjct: 597 YSVVLEGTGGIGKSTLINTLVGLDFFSDTHFDI--GTGKDSY 636


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
BBta_3661  RTXTOXIND  73     3e-16    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 73.3 bits (180), Expect = 3e-16
Identities = 65/412 (15%), Positives = 134/412 (32%), Gaps = 85/412 (20%)

Query: 30 TKRPPPRTSWRKVLLLLLLLGAGGGYYAWQKLSHPGLPPGLASGNGRIEATEIDVSTKIA 89
+ P R ++ L ++ G + +GR + +
Sbjct: 49 IETPVSRRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKE----IKPIEN 104

Query: 90 GRIKEILVNEGDFISAGQVLVRMDTEQLEARRRQAEAELRRAVIGVETAKSLIRQREAEG 149
+KEI+V EG+ + G VL+++ EA + ++ L +A + + L R E
Sbjct: 105 SIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNK 164

Query: 150 VAAD--------ATVAQQDAK---------FEQAEKKRARSEQLIT-------------- 178
+ V++++ F + ++ + E +
Sbjct: 165 LPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARIN 224

Query: 179 ----TAAVSQQVLDDDR---ANSLATKAAVSAAKAHAAATEAALSAAKAQVVDAEAAVDA 231
+ V + LDD K AV + L K+Q+ E+ + +
Sbjct: 225 RYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILS 284

Query: 232 ARAAIETISADI-----------------------------NDSTLRSPRDGRV-QYRVA 261
A+ + ++ S +R+P +V Q +V
Sbjct: 285 AKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVH 344

Query: 262 QLGEVLAAGGRILNLVDLGDVY-MTFFLPTAEAGRVAMGSDVRLVLDAAPQYI---VPAK 317
G V+ ++ +V D +T + + G + +G + + ++A P + K
Sbjct: 345 TEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGK 404

Query: 318 VTFVADVAQFTPKTVETEEERQKLMFRIKARISPELLRKNIRNVKTGLPGMA 369
V + A E++R L+F + I L +N+ GMA
Sbjct: 405 VKNINLDA--------IEDQRLGLVFNVIISIEENCLSTGNKNIPLS-SGMA 447


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BBta_3662  HTHTETR  56     5e-12    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 55.8 bits (134), Expect = 5e-12
Identities = 30/182 (16%), Positives = 62/182 (34%), Gaps = 12/182 (6%)

Query: 6 TSRPKTSSRERILKAAVKRFSQNSYERTSLRDIAGDVGIDVSYVHRCFGSKEKLFTEAI- 64
T + +R+ IL A++ FSQ TSL +IA G+ ++ F K LF+E
Sbjct: 5 TKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWE 64

Query: 65 RDAADMTDLIAGTPEEIAASMAQRAVSSRPPKGQALGIFIHSISSPEALPILRQFVLETA 124
+++ +L + + L + S + E +L + +
Sbjct: 65 LSESNIGELELEYQAKFPGDPLSVL-------REILIHVLESTVTEERRRLLMEIIFHKC 117

Query: 125 IEPLSGRIGDASARCATLAM-ALLTGVTLFRNVIRVKPFVES-DEDELRALIAGVLGAVL 182
+ + R L + ++ I K ++ G + ++
Sbjct: 118 EFVGEMAVVQQAQRNLCLESYDRIEQ--TLKHCIEAKMLPADLMTRRAAIIMRGYISGLM 175

Query: 183 EH 184
E+
Sbjct: 176 EN 177


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_3668  ACRIFLAVINRP  29     0.023    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 28.7 bits (64), Expect = 0.023
Identities = 16/80 (20%), Positives = 33/80 (41%), Gaps = 10/80 (12%)

Query: 36 GVSAAETVIEAAPVPRRRTVLTALCFVAGFSLVFVGLGATASVFGRFIAGHLSQLGLVAG 95
G E + A + R ++T+L F+ G + + G AG +Q + G
Sbjct: 956 GKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNG----------AGSGAQNAVGIG 1005

Query: 96 LVIIMMGLHFLGLLRIPLLY 115
++ M+ L + +P+ +
Sbjct: 1006 VMGGMVSATLLAIFFVPVFF 1025


98  BBta_3708  BBta_3715  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias   Product
BBta_3708  -1      10       0.460637  ABC transporter ATP-binding protein
BBta_3709  -2      9        1.471366  hypothetical protein
BBta_3710  0       10       1.592132  phosphopantetheinyl transferase
BBta_3711  1       10       1.624737  GntR family transcriptional regulator
BBta_3712  1       10       1.218097  acetylornithine
BBta_3713  2       11       0.735649  ABC transporter substrate binding protein
BBta_3714  2       8        0.896343  nickel ABC transporter permease NikB
BBta_3715  1       8        0.922323  peptide ABC transporter membrane protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits  Score  E-value  Comments
BBta_3708  SECA  31     0.014    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 31.0 bits (70), Expect = 0.014
Identities = 14/57 (24%), Positives = 19/57 (33%), Gaps = 11/57 (19%)

Query: 441 RDTDPGRV-------DDLLRRFGLDRKVKFVDGAFSTLDLSTGQRKRLAMIGALLEN 490
R D G D L+R F DR V G L + G+ + + N
Sbjct: 577 RQGDAGSSRFYLSMEDALMRIFASDR----VSGMMRKLGMKPGEAIEHPWVTKAIAN 629


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BBta_3709  PF03544  38     3e-05    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 38.4 bits (89), Expect = 3e-05
Identities = 19/102 (18%), Positives = 26/102 (25%), Gaps = 2/102 (1%)

Query: 307 AAQPKPADAAQSTAPAQPAASAPASSSASPSPSTGP--QSAVSPQPAAPLQPASPPQSAA 364
P PA T A P + P P P + P+P
Sbjct: 41 IELPAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPK 100

Query: 365 PAQAPPQTLLYPQATAPSPPASRQFGVPRAQTGRGQPSPWSA 406
P P Q P + P T +P+ +A
Sbjct: 101 PKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTA 142


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_3710  ENTSNTHTASED  98     4e-27    Enterobactin synthetase component D signature.
		>ENTSNTHTASED#Enterobactin synthetase component D signature.

Length = 234

Score = 97.8 bits (243), Expect = 4e-27
Identities = 50/173 (28%), Positives = 75/173 (43%), Gaps = 11/173 (6%)

Query: 28 ASEICHGDEELFPEELAHVRAAVAKRRAEFATARMLARRALASLGATPISLVPGADRAPV 87
AS D L+ +R+A KR+AE R+ A AL +G + G R P+
Sbjct: 22 ASSFREHDL-LWLPHHDRLRSAGRKRKAEHLAGRIAAVHALREVGVRTVP-GMGDKRQPL 79

Query: 88 WPSGYTGSISHCADYCAVVVARSRDVRALGLDIEDVRELEP--SMHDLVLTSGERAWLRR 145
WP G GSISHCA V++R + +G+DIE + + ++ S ER L+
Sbjct: 80 WPDGLFGSISHCATTALAVISR----QRIGIDIEKIMSQHTATELAPSIIDSDERQILQA 135

Query: 146 QPQELQPILPILFFSAKEAYYKCQYPITRGFLEFSDVELTIDWPAGAFEARVL 198
L + FSAKE+ YK + F+ ++T A +L
Sbjct: 136 SLLPFPLALTLA-FSAKESVYKA-FSDRVTLPGFNSAKVTS-LTATHISLHLL 185


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
BBta_3715  RTXTOXINA  31     0.006    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 31.1 bits (70), Expect = 0.006
Identities = 24/94 (25%), Positives = 37/94 (39%), Gaps = 8/94 (8%)

Query: 71 LGADQFGRDVLSRLLSGARATVPMALAATLAGSAIGAVIGIGSAYLGGKSDEAIMRTNDA 130
L G D +S +LS A+ ++ A + A + + + LG + I + A
Sbjct: 235 LDNIGAGLDTVSGILSAISASFILSNADADTRTKAAAGVELTTKVLGNVG-KGISQYIIA 293

Query: 131 VMAIPGLLMALLLVSTLGNGAGNAVVAIALAFAP 164
A GL ST AG A+ LA +P
Sbjct: 294 QRAAQGL-------STSAAAAGLIASAVTLAISP 320


99  BBta_3805  BBta_3811  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BBta_3805  1       19       -3.944833  phosphatase
BBta_3806  2       18       -3.036626  hypothetical protein
BBta_3807  -3      17       1.805959   alanine racemase (alr)
BBta_3808  -3      13       1.579755   replicative DNA helicase
BBta_3809  -2      13       2.325827   hypothetical protein
BBta_3810  -2      13       2.127962   cyclopropane-fatty-acyl-phospholipid synthase
BBta_3811  0       12       3.258812   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_3805  BACYPHPHTASE  30     0.011    Salmonella/Yersinia modular tyrosine phosphatase si...
		>BACYPHPHTASE#Salmonella/Yersinia modular tyrosine phosphatase signature.

Length = 468

Score = 29.8 bits (66), Expect = 0.011
Identities = 36/128 (28%), Positives = 53/128 (41%), Gaps = 18/128 (14%)

Query: 117 GLVSSAASWMVEKVLARLDLKHAFDVVITQEDVVRH-----KPDPEAYWLALSGLGVDAT 171
GL ++ + EKV A+ L H +VV+TQED + K + Y L G G
Sbjct: 41 GLTIASGARESEKVFAQTVLSHVANVVLTQEDTAKLLQSTVKHNLNNYDLRSVGNGNSVL 100

Query: 172 TTL-----VFEDSLAGLKAA--KAAGCRCVAIRHSFNIRHDFSAADRE-IRSFDELAAAS 223
+L +D+ L+AA + +G R HS + H RE +RS
Sbjct: 101 VSLRSDQMTLQDAKVLLEAALRQESGARGHVSSHSHSALHAPGTPVREGLRSH-----LD 155

Query: 224 SYAPPVAP 231
PP+ P
Sbjct: 156 PRTPPLPP 163


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
BBta_3806  GPOSANCHOR  37     6e-04    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 36.6 bits (84), Expect = 6e-04
Identities = 59/355 (16%), Positives = 115/355 (32%), Gaps = 52/355 (14%)

Query: 540 SAQTYRAKLDAQLAQVNQLNRDMEVAEKQQSAVTQPADERRKLLMDV-QKLSDGVTESQA 598
A + A++ + + + + A + L +A
Sbjct: 131 GAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEA 190

Query: 599 RLDAISEKVTERERELKAESTKARQREIKQELERLRPALTAAEQDLNRRSLERDEAYSIA 658
R + + + A+ A+ + ++ E L E+ L +
Sbjct: 191 RQAELEKALEGAMNFSTAD--SAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKI 248

Query: 659 QQRGTDIAKALARSESIRAAKEGAESCARQIRAGIDALEQKVDEIRQQQR--EEAANAVR 716
+ + A AR + A EGA + + A I LE + + ++ E + +
Sbjct: 249 KTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLN 308

Query: 717 VNAENTLSDLSEFAQKNASLVPLEVGPLVAALKATLS-AKDPSKTSDALSALQRRLEEIP 775
N ++ DL + L A L S+ S +L+R L+
Sbjct: 309 ANRQSLRRDLDASREAKKQL---------EAEHQKLEEQNKISEAS--RQSLRRDLD--- 354

Query: 776 EFKRFRTSREEARQESAR----AELDQLSDTART-ISDFSESY--AKRNITSDN------ 822
SRE +Q A E +++S+ +R + ++ AK+ +
Sbjct: 355 ------ASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQVEKALEEANSK 408

Query: 823 VQPLIKLKARLSEVL-------------VRPETNALKQAISESERELAQLHLEQA 864
+ L KL L E + E ALK+ +++ ELA+L +A
Sbjct: 409 LAALEKLNKELEESKKLTEKEKAELQAKLEAEAKALKEKLAKQAEELAKLRAGKA 463


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
BBta_3807  ALARACEMASE  251    6e-83    Alanine racemase signature.
		>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 251 bits (643), Expect = 6e-83
Identities = 114/365 (31%), Positives = 175/365 (47%), Gaps = 25/365 (6%)

Query: 29 TIDLDALAANWRKLEKTAVPAECAGVIKADAYGCGIPQVARTLAAAGCKTFFVATLSEAK 88
++DL AL N + + A A V+KA+AYG GI ++ + A F + L EA
Sbjct: 8 SLDLQALKQNLSIVRQAATHARVWSVVKANAYGHGIERIWSAIGATDG--FALLNLEEAI 65

Query: 89 VARTALPDSAALYVLDGFFQNTG-EDYAAINCRPVIGDLNELAEWDMFCRRTGWKGGAAI 147
R + +L+GFF E Y + +L + K I
Sbjct: 66 TLRER-GWKGPILMLEGFFHAQDLEIYDQHRLTTCVHSNWQLKA----LQNARLKAPLDI 120

Query: 148 H--VDTGMNRLGFTPTDAQGMIPRIQ-SGDHGITLIMSHLASAEQLNSPSNARQLSLFRE 204
+ V++GMNRLGF P + +++ + G +MSH A AE + S ++ +
Sbjct: 121 YLKVNSGMNRLGFQPDRVLTVWQQLRAMANVGEMTLMSHFAEAEHPDGISG--AMARIEQ 178

Query: 205 VASVFTGVPAALAASSGIFLGAAFQFDMVRPGAALYGINPTPE----ADNPMQPVAEIKA 260
A +L+ S+ FD VRPG LYG +P+ + A+ ++PV + +
Sbjct: 179 AAE-GLECRRSLSNSAATLWHPEAHFDWVRPGIILYGASPSGQWRDIANTGLRPVMTLSS 237

Query: 261 RIVQLRDLARGDTVGYGGTWTARRPTKIAILSAGYADGYFRAASSNDGTRGADVVIAGQR 320
I+ ++ L G+ VGYGG +TAR +I I++AGYADGY R A + G V++ G R
Sbjct: 238 EIIGVQTLKAGERVGYGGRYTARDEQRIGIVAAGYADGYPRHAPT-----GTPVLVDGVR 292

Query: 321 CPVAGRVSMDLIAVDVTDLPPKAARRGHFATLLGEGITVDELAHHFGTIGYEVMTNLGRR 380
G VSMD++AVD+T P A G L G+ I +D++A GT+GYE+M L R
Sbjct: 293 TMTVGTVSMDMLAVDLTPCP--QAGIGTPVELWGKEIKIDDVAAAAGTVGYELMCALALR 350

Query: 381 YHRIY 385
+
Sbjct: 351 VPVVT 355


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
BBta_3811  IGASERPTASE  33     0.002    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 32.7 bits (74), Expect = 0.002
Identities = 27/192 (14%), Positives = 55/192 (28%), Gaps = 20/192 (10%)

Query: 91 DARSMGHSGKDGKDGKETRTERRARSAVSAEPSEAKPDDAGGPAEPRRRSGHHKRSERSA 150
+ G + KET+T +A + +AK E + K + + +
Sbjct: 1078 ANTQTNEVAQSGSETKETQTTETKETATVEKEEKAK-------VETEKTQEVPKVTSQVS 1130

Query: 151 PAEGKPEAVPEAGTAPADGDGQSSAKGRHGKRKKPAPQDGAAADDPRSEPTGATRPAAAR 210
P + + E V + D + K + A + A + + T
Sbjct: 1131 PKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVN 1190

Query: 211 DDNATRSEPPRVAPATS-------------SEAAPALRADPVPPVTPAPATTQAVPTAAS 257
N+ P PAT+ + ++R+ P ++ A
Sbjct: 1191 TGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALC 1250

Query: 258 SADDASAPSSPS 269
+ + S
Sbjct: 1251 DLTSTNTNAVLS 1262


100  BBta_3818  BBta_3824  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BBta_3818  0       14       -0.690606  3-oxoacyl-ACP reductase
BBta_3819  -1      13       1.454691   acyl carrier protein
BBta_3820  -2      11       1.531020   3-oxoacyl-ACP synthase
BBta_3821  -2      9        2.241693   hypothetical protein
BBta_3822  -2      8        2.691854   hypothetical protein
BBta_3823  -2      8        2.194740   guanylate kinase
BBta_3824  -2      7        1.290703   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_3818  DHBDHDRGNASE  125    2e-37    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 125 bits (316), Expect = 2e-37
Identities = 82/251 (32%), Positives = 123/251 (49%), Gaps = 13/251 (5%)

Query: 4 LSGRTALVTGATGGIGNAIAKAMHAQGATVAISGTRREVLETLAGEL---GSRVHVLPCN 60
+ G+ A +TGA GIG A+A+ + +QGA +A E LE + L P +
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 61 LSDSAEVEALVPAAEQAMGQVDILVANAGITRDNLFVQLRDEDWDEVIKVNLTATFRLSR 120
+ DSA ++ + E+ MG +DILV AG+ R L L DE+W+ VN T F SR
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 121 AATKLMMRKRFGRIIAITSIVGVTGNPGQGNYTASKAGIIGMIKTLGAEYAKRGVTANCI 180
+ +K MM +R G I+ + S Y +SKA + K LG E A+ + N +
Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185

Query: 181 APGFIKTPM-----TDALNDKQR-----ETILAKVPAARLGTPEDIAAAAVYLASNEAAY 230
+PG +T M D +Q ET +P +L P DIA A ++L S +A +
Sbjct: 186 SPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGH 245

Query: 231 VTGQTIHVNGG 241
+T + V+GG
Sbjct: 246 ITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_3819  ACRIFLAVINRP  26     0.021    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 25.6 bits (56), Expect = 0.021
Identities = 12/42 (28%), Positives = 19/42 (45%), Gaps = 2/42 (4%)

Query: 34 GADSLDTVELVMAFEEEFGCEIPDDAAETILTVGDATKFLEK 75
GA++LDT + + A E P +L D T F++
Sbjct: 296 GANALDTAKAIKAKLAELQPFFPQGM--KVLYPYDTTPFVQL 335


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
BBta_3821  IGASERPTASE  33     0.003    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 32.7 bits (74), Expect = 0.003
Identities = 17/86 (19%), Positives = 34/86 (39%), Gaps = 1/86 (1%)

Query: 321 ETYDLHQKNVAKLRAMERQTQNDTVEPEEAPAATAAGAVDPAAAAAAPRAPTGAKKPPAN 380
ET + ++ AK+ E+ + V + +P + V P A A PT K P +
Sbjct: 1102 ETATVEKEEKAKVET-EKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQS 1160

Query: 381 RASGAPAPAAGGRQGAAQSSPPVVQQ 406
+ + ++ ++ PV +
Sbjct: 1161 QTNTTADTEQPAKETSSNVEQPVTES 1186


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
BBta_3824  IGASERPTASE  33     0.004    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 33.5 bits (76), Expect = 0.004
Identities = 30/251 (11%), Positives = 78/251 (31%), Gaps = 20/251 (7%)

Query: 249 QMAARTRRSRAKPRTKAKAPAFGQRSARNGSGAGDGFRTQETASALDLASARLSAATQDQ 308
+ A + +K K + A + +A+N A + + + + ++ + +
Sbjct: 1038 ETVAENSKQESKTVEKNEQDA-TETTAQNREVAKEAKSNVKA----NTQTNEVAQSGSET 1092

Query: 309 AGARPRAAAPAPYQDHSNDVPSRHDRDQAAELPKLQPAEAVWQAADDLAARAAGRHSAMM 368
+ + ++ + ++ E+PK+ + Q + + +
Sbjct: 1093 KETQTTETKETATVEKEEK--AKVETEKTQEVPKVTSQVSPKQEQSE-TVQPQAEPAREN 1149

Query: 369 AQAVLSQGVQTQGSMAQGSMTSADMPIPSAQIAPAVPPATAPAFAAPPAAAAKASASSAE 428
V + Q+Q + T+AD P+ + + V + S E
Sbjct: 1150 DPTVNIKEPQSQTN------TTADTEQPAKETSSNVEQPVTES------TTVNTGNSVVE 1197

Query: 429 MALQGWSAQTRPAATWPGAQAPDAQIAAPASAAPSQTGAPAAPAPPPAPVDVAEIVSALR 488
A T+P + P + + P + + V + ++ S
Sbjct: 1198 NPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNT 1257

Query: 489 KARTGNAASQT 499
A +A ++
Sbjct: 1258 NAVLSDARAKA 1268



Score = 32.7 bits (74), Expect = 0.007
Identities = 28/132 (21%), Positives = 41/132 (31%), Gaps = 13/132 (9%)

Query: 73 APPRTPQQPAPWDFAQPAPPKLTAKRTDGPAPRANAEALAEAP----QPRVRTDLAIRGT 128
P+ Q A+PA P + N A E P V + T
Sbjct: 1129 VSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTT 1188

Query: 129 PKPQN-VAITPTTLTASDANLPRSLDASPWP--------SSPPPQTEPAISDSQPAAAIA 179
N V P T + + ++S P S P EPA + S + +A
Sbjct: 1189 VNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVA 1248

Query: 180 ADDNVELDSNAA 191
D ++NA
Sbjct: 1249 LCDLTSTNTNAV 1260


101  BBta_3932  BBta_3940  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BBta_3932  0       11       0.894099   multidrug resistance protein mdtA
BBta_3933  0       12       0.521111   multidrug resistance protein mdtB
BBta_3934  0       10       -0.076564  multidrug ABC transporter
BBta_3935  -2      12       -1.718181  hypothetical protein
BBta_3936  -2      14       -2.712498  *hypothetical protein
BBta_3937  -1      12       -2.614009  alcohol dehydrogenase
BBta_3938  -1      23       -2.992473  hypothetical protein
BBta_3939  -1      21       -1.986826  two-component response regulatory protein,
BBta_3940  -1      18       -0.911041  two-component response regulatory protein,
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
BBta_3932  RTXTOXIND  39     4e-05    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 38.7 bits (90), Expect = 4e-05
Identities = 23/171 (13%), Positives = 56/171 (32%), Gaps = 19/171 (11%)

Query: 88 NTVTVRSQVDGKLIAVNFVEGQDVKQGDVLAEIDPAIYQAQYDQAVAKKAQDEAQLANQK 147
+ ++ + + + EG+ V++GDVL ++ +A + ++ L +
Sbjct: 95 RSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKT-------QSSLLQAR 147

Query: 148 LDLARYEQLAASSAGSKQQADTQRAVVAQQQALIKADQAAIDNAAATLSYTKIVAPISGR 207
L+ RY+ L+ S +K Q + S +
Sbjct: 148 LEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQ-------- 199

Query: 208 AGLRQVDQGNI-IRASDATGLVVITQLQPIAVQFSLPQQQIMRVNAAAAKG 257
Q Q + + A L V+ ++ + + ++ ++ K
Sbjct: 200 ---NQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQ 247



Score = 32.5 bits (74), Expect = 0.004
Identities = 19/129 (14%), Positives = 49/129 (37%), Gaps = 11/129 (8%)

Query: 125 YQAQYDQAVAKKAQDEAQLANQKLDLARYEQLAASSAGSKQQADTQRAVVAQQQALIKAD 184
+ ++ Q E+++ + K + QL + + T + L K +
Sbjct: 264 AVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEI-LDKLRQTTDNIGLLTLELAKNE 322

Query: 185 QAAIDNAAATLSYTKIVAPISGR-AGLRQVDQGNIIRASDATGLVVITQLQPIAVQFSLP 243
+ + I AP+S + L+ +G ++ ++ T +V++ + + V +
Sbjct: 323 ER--------QQASVIRAPVSVKVQQLKVHTEGGVVTTAE-TLMVIVPEDDTLEVTALVQ 373

Query: 244 QQQIMRVNA 252
+ I +N
Sbjct: 374 NKDIGFINV 382


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_3933  ACRIFLAVINRP  765    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 765 bits (1976), Expect = 0.0
Identities = 277/1031 (26%), Positives = 495/1031 (48%), Gaps = 26/1031 (2%)

Query: 7 FIRRPIATSLLGVALLIGGLLGYLALPVSALPQVDFPTVQVTTQLPGASPDVIAALITAP 66
FIRRPI +L + L++ G L L LPV+ P + P V V+ PGA + +T
Sbjct: 5 FIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQV 64

Query: 67 LERQLGQIPSLAAMNSTS-SFGVSQISLQFDLGRDIDGATQDVQAAINAAAGVLPKSLPY 125
+E+ + I +L M+STS S G I+L F G D D A VQ + A +LP+ +
Sbjct: 65 IEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQ- 123

Query: 126 PPTYAKVNPADAPVITLALTSDT--VSLRAMSDMADTILAQRLSQVSGVGRVSVLGGLKP 183
+ + + ++ SD + +SD + + LS+++GVG V + G +
Sbjct: 124 QQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA-QY 182

Query: 184 AVRVQADLARLAAYGISMEDLRAAIAGANVSGPKGSLDGAQQA------YLITANDQIAA 237
A+R+ D L Y ++ D+ + N G L G I A +
Sbjct: 183 AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKN 242

Query: 238 ADAYRPIII-AYRNGSPVTIGDVATIVDGLENDRTGGWYQGVPAVILDIQRQPGANVIEV 296
+ + + + +GS V + DVA + G EN G PA L I+ GAN ++
Sbjct: 243 PEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALDT 302

Query: 297 VRQIRAEIPKLQRTIPAGVKLAIVSDRTVTIRASVHDVQFTLVLSVVLVTLVVLLFLRSL 356
+ I+A++ +LQ P G+K+ D T ++ S+H+V TL +++LV LV+ LFL+++
Sbjct: 303 AKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNM 362

Query: 357 RATIIAGVALPLSLITSFGVMYFAGFSLDNLSLMALTIGTGFVVDDAIVMIENIVRHM-E 415
RAT+I +A+P+ L+ +F ++ G+S++ L++ + + G +VDDAIV++EN+ R M E
Sbjct: 363 RATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMME 422

Query: 416 NGESAMNASLRGAREIGFTVISLTVSLIAVFIPLLFMSGLVGRMFREFALTLTIAVVTSA 475
+ A+ + +I ++ + + L AVFIP+ F G G ++R+F++T+ A+ S
Sbjct: 423 DKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSV 482

Query: 476 VVSLTLTPMMCSRLLKHASDEWVLPG---LNAISRFIDRTVAVYHRSLLWVLDRQTETLV 532
+V+L LTP +C+ LLK S E + D +V Y S+ +L L+
Sbjct: 483 LVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYLL 542

Query: 533 VTFATLAATLALYVVAPKGFLPLQDTGSITAVTEAGPDVSFAEMQRRQQEVAAAIQAD-- 590
+ +A + L++ P FLP +D G + + + Q+ +V +
Sbjct: 543 IYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNEK 602

Query: 591 PDVVGVVSVIGAGSVNPTTNVGRLVMTLKPLGERQDG---VAAVIDRLKQKTAGVPGMTV 647
+V V +V G N G ++LKP ER AVI R K + + V
Sbjct: 603 ANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDGFV 662

Query: 648 YFQPVQDVQISTQASRSQYQYTLTGADAGDVAKWAGQLVAEMRRD--PLFRDVSSEAQEG 705
+ + A+ ++ D A + M V E
Sbjct: 663 IPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGLED 722

Query: 706 GLRAELTIDRQRAGQLGVSLQGVTDTLNDAFAQRQISTIYGQANQYRVVLEAMPMYQRDP 765
+ +L +D+++A LGVSL + T++ A ++ + ++ ++A ++ P
Sbjct: 723 TAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRMLP 782

Query: 766 SILSKLYLPGAGGQ-VPLSAVASLKRTTAPLAISHQAQFPAVQLSFNLAPGVALSEAVDA 824
+ KLY+ A G+ VP SA + + P++++ APG + +A+
Sbjct: 783 EDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAMAL 842

Query: 825 LHATENRIGMPGSIIGVYSGDAAEFAKSLAGQPWLLLAAVITIYIVLGVLYESYIHPITI 884
+ ++ +P I ++G + + S P L+ + + +++ L LYES+ P+++
Sbjct: 843 MENLASK--LPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSV 900

Query: 885 LSTLPSAGVGAILALMLFGQDLSVIGLIGIILLMGIVKKNAIMMIDFALEAERHENRTPY 944
+ +P VG +LA LF Q V ++G++ +G+ KNAI++++FA + E +
Sbjct: 901 MLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVV 960

Query: 945 DAIVQACLLRFRPIMMTTLAALFGALPLAIESGTGSELRFPLGISIIGGLLLSQLLTLYT 1004
+A + A +R RPI+MT+LA + G LPLAI +G GS + +GI ++GG++ + LL ++
Sbjct: 961 EATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFF 1020

Query: 1005 TPVIYLALDRL 1015
PV ++ + R
Sbjct: 1021 VPVFFVVIRRC 1031



Score = 89.1 bits (221), Expect = 5e-20
Identities = 77/503 (15%), Positives = 173/503 (34%), Gaps = 33/503 (6%)

Query: 6 PFIRRPIATSLLGVALLIGGLLGYLALPVSALPQVDFPTVQVTTQLP-GASPDVIAALIT 64
+ L+ ++ G ++ +L LP S LP+ D QLP GA+ + ++
Sbjct: 532 KILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLD 591

Query: 65 APLERQL----------GQIPSLAAMNSTSSFGVSQISLQFDLGRDIDGATQD-VQAAIN 113
+ L + + + G++ +SL+ R+ D + + V
Sbjct: 592 QVTDYYLKNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAK 651

Query: 114 AAAGVLPKSLPYPPTYAKVNPADAPVITLALTSDTVSLRA-----MSDMADTILAQRLSQ 168
G + P + + + ++ + +L
Sbjct: 652 MELGKIRDGFVIPF---NMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQH 708

Query: 169 VSGVGRVSVLG-GLKPAVRVQADLARLAAYGISMEDLRAAIAGANVSGPKGS---LDGAQ 224
+ + V G +++ D + A G+S+ D+ I + G + G
Sbjct: 709 PASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTI-STALGGTYVNDFIDRGRV 767

Query: 225 QAYLITANDQIA-AADAYRPIIIAYRNGSPVTIGDVATIVDGLENDRTGGWYQGVPAVIL 283
+ + A+ + + + + NG V T + R Y G+P++ +
Sbjct: 768 KKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLER-YNGLPSMEI 826

Query: 284 DIQRQPGANVIEVVRQIRAEIPKLQRTIPAGVKLAIVSDRTVTIRASVHDVQFTLVLSVV 343
+ PG A + L +PAG+ + + R S + + +S V
Sbjct: 827 QGEAAPGT----SSGDAMALMENLASKLPAGIGYD-WTGMSYQERLSGNQAPALVAISFV 881

Query: 344 LVTLVVLLFLRSLRATIIAGVALPLSLITSFGVMYFAGFSLDNLSLMALTIGTGFVVDDA 403
+V L + S + + +PL ++ D ++ L G +A
Sbjct: 882 VVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNA 941

Query: 404 IVMIENIVRHME-NGESAMNASLRGAREIGFTVISLTVSLIAVFIPLLFMSGLVGRMFRE 462
I+++E ME G+ + A+L R ++ +++ I +PL +G
Sbjct: 942 ILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNA 1001

Query: 463 FALTLTIAVVTSAVVSLTLTPMM 485
+ + +V++ ++++ P+
Sbjct: 1002 VGIGVMGGMVSATLLAIFFVPVF 1024


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_3934  ACRIFLAVINRP  731    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 731 bits (1888), Expect = 0.0
Identities = 278/1040 (26%), Positives = 495/1040 (47%), Gaps = 32/1040 (3%)

Query: 5 ISEPFIRRPVGTTLLSIGLFLVGIVAYIFLPVAPVPNVDFPSIFVTATRPGADPSVMAGT 64
++ FIRRP+ +L+I L + G +A + LPVA P + P++ V+A PGAD + T
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 65 VAAPLERRLGEIAGVDQITSTS-SLGTTNIQIQFSIGRDIDKAARDVQAAINASLSDLPS 123
V +E+ + I + ++STS S G+ I + F G D D A VQ + + LP
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 124 DLPALPRFRKANTAAAPVFVLALTS--KTLTPSAMYDVADTVLAQRLFQVPGVGNVTVSG 181
++ ++++ + V S T + D + + L ++ GVG+V + G
Sbjct: 121 EVQQ-QGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFG 179

Query: 182 ADQPAVRIALNPVSLANAGISTDDVRTAIINANPIGPVGIFEGGRQSE------TIAINR 235
A Q A+RI L+ L ++ DV + N G G +I
Sbjct: 180 A-QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQT 238

Query: 236 QMRTAAEFRDILIKSS-NGSFVRLSDVADVEDSVRNVRSIAWFNKQPAVLIQIAKQGDAN 294
+ + EF + ++ + +GS VRL DVA VE N IA N +PA + I AN
Sbjct: 239 RFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGAN 298

Query: 295 VIETVDRVKALIPELKQWLPAGIEISTLIDRTGTIRASVEDMQWTLLATAVLVMVVVFLF 354
++T +KA + EL+ + P G+++ D T ++ S+ ++ TL +LV +V++LF
Sbjct: 299 ALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLF 358

Query: 355 LRRLVPTVAAGVSVPLALAGTCAAMWLAGFSIDNLSLMALAISVGFVVDDAIVMIENMYR 414
L+ + T+ ++VP+ L GT A + G+SI+ L++ + +++G +VDDAIV++EN+ R
Sbjct: 359 LQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVER 418

Query: 415 -NLEHGLPPMRAALDGARQIGFTVLSISLSLIAAFVPLIFMDGVVGRLLREFSLTLTFAI 473
+E LPP A QI ++ I++ L A F+P+ F G G + R+FS+T+ A+
Sbjct: 419 VMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAM 478

Query: 474 VVSTLVSLTVTPMICAHYIK-------ETLSDRASWFDRIVEGSLSGMVRFYAWTLRGVL 526
+S LV+L +TP +CA +K E WF+ + S++ L
Sbjct: 479 ALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTG 538

Query: 527 VYPFLTLMVFFATIALTVTLYIKVPKGYFPTDDSGFIIGATRASADISFQSMLSLQQRLA 586
Y L+++ +A V L++++P + P +D G + + A + + + ++
Sbjct: 539 RY----LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVT 594

Query: 587 DIVMADPAVAGIGSTIG-GGGGPGGATSNRGTMFINLKPPAERDGV--STELVINRLRRS 643
D + + A + S G G N G F++LKP ER+G S E VI+R +
Sbjct: 595 DYYLKNEK-ANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKME 653

Query: 644 LAMVPGIRLFMFAAQDVRAGGRQSDSDYQYT-LSSPDLDLLQKWAPIVAKRLETV-EGIT 701
L + + F + G + D++ + D L + + +
Sbjct: 654 LGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLV 713

Query: 702 DISSDRDPGGLQLALRIDRQKAASLGVRVQDIDTALNNAFAQRQISIVYSQRNQYMVVLE 761
+ + Q L +D++KA +LGV + DI+ ++ A ++ + + ++
Sbjct: 714 SVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQ 773

Query: 762 IDPKFQTDPSNLDRIYVAGAGDAQIPLSALVKAERGLSPLAVFHSQSFPSTTVSFNLLPD 821
D KF+ P ++D++YV A +P SA + + PS + P
Sbjct: 774 ADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPG 833

Query: 822 VPLQAATANIQRAVDELHMPEGIRGGFDGNAGDFNKGSGRQPLLILSALVAMYIVLGVLY 881
A A ++ +L P GI + G + + P L+ + V +++ L LY
Sbjct: 834 TSSGDAMALMENLASKL--PAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALY 891

Query: 882 ESLAHPLTIISTLPSAGLGALLALQVTNTPLTVIAFVGIILLIGIVKKNGIMLVDFALEA 941
ES + P++++ +P +G LLA + N V VG++ IG+ KN I++V+FA +
Sbjct: 892 ESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDL 951

Query: 942 ERQRGMSSADAIFEACRVRFRPILMTTLAALFAGLPLVLATGPGTELRRPLGITIIGGLL 1001
+ G +A A R+R RPILMT+LA + LPL ++ G G+ + +GI ++GG++
Sbjct: 952 MEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMV 1011

Query: 1002 VSQILTLYTTPVIYLLIDRL 1021
+ +L ++ PV +++I R
Sbjct: 1012 SATLLAIFFVPVFFVVIRRC 1031


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
BBta_3939  HTHFIS  74     2e-18    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 73.7 bits (181), Expect = 2e-18
Identities = 28/127 (22%), Positives = 53/127 (41%), Gaps = 6/127 (4%)

Query: 2 TNILVVDDEVGVCCVIQHLLVREGYAVTALTDGRQALDVVARSDFAAALIDLNLADIDGN 61
ILV DD+ + V+ L R GY V ++ +A D + D+ + D +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 DVIRAARAARPNMPIVMMSGMVLESDHGMPESLGLSARVQRLHRLAKPFKPKDLAQLMRE 121
D++ + ARP++P+++MS + ++ L KPF +L ++
Sbjct: 64 DLLPRIKKARPDLPVLVMSA------QNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117

Query: 122 ILTPDEA 128
L +
Sbjct: 118 ALAEPKR 124


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
BBta_3940  HTHFIS  94     2e-25    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 93.7 bits (233), Expect = 2e-25
Identities = 38/146 (26%), Positives = 61/146 (41%), Gaps = 7/146 (4%)

Query: 2 PTVLIIDDDSATRAALETLLKKKKFCVFLAPDGPTGIRLIGSVSFDAVVIDMFMPGMDGI 61
T+L+ DDD+A R L L + + V + + T R I + D VV D+ MP +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 ATIRELIKIDPTVPFIAISGYAFTDKKQGAPDFLGMAIKLGATAALQKPFDLLDLLEAVD 121
+ + K P +P + +S A + GA L KPFDL +L+ +
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQ-------NTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116

Query: 122 RAIEVRQRLLADARGNLAPHRHLSGE 147
RA+ +R + + L G
Sbjct: 117 RALAEPKRRPSKLEDDSQDGMPLVGR 142


102  BBta_3954  BBta_3959  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BBta_3954  -1      7        0.536155   hypothetical protein
BBta_3955  -1      6        0.166023   metalloendopeptidase
BBta_3956  -1      8        0.197848   hypothetical protein
BBta_3957  -2      9        0.315848   hypothetical protein
BBta_3958  -1      10       -0.308510  outer membrane autotransporter barrel
BBta_3959  -1      10       0.753048   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BBta_3954  PF07675  30     0.018    Cleaved Adhesin
		>PF07675#Cleaved Adhesin

Length = 1358

Score = 30.1 bits (67), Expect = 0.018
Identities = 18/81 (22%), Positives = 32/81 (39%)

Query: 297 VPLSLPEDSTTASLPSAAGAAGKANGSPAAQRTALAAAGGSQDTVAHAGGGEPKMITVTV 356
+P SLP++ + S+ ++AG+ + T +A A G E V +
Sbjct: 254 LPASLPQNQASYSIQASAGSYVAISKDGVLYGTGVANASGVATVNMTKQITENGNYDVVI 313

Query: 357 ARGDSLWHISRRLLGGGTRYA 377
R + L I + G + Y
Sbjct: 314 TRSNYLPVIKQIQAGEPSPYQ 334


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
BBta_3955  TYPE4SSCAGA  35     6e-04    Type IV secretion system CagA exotoxin signature.
		>TYPE4SSCAGA#Type IV secretion system CagA exotoxin signature.

Length = 1147

Score = 35.1 bits (80), Expect = 6e-04
Identities = 20/73 (27%), Positives = 39/73 (53%), Gaps = 4/73 (5%)

Query: 192 DREARLESRSPVLANVQPNQATKLAGVDNVLTRLTRSLDQVEARQLAMLSSAEETLESKL 251
+R+AR + + L ++ + KL V+ L +S D+ + + S AEETL++
Sbjct: 664 NRDARAIAYAQNLKGIKRELSDKLENVNKNLKDFDKSFDEFKNGKNKDFSKAEETLKA-- 721

Query: 252 RRMRGVITDLGLD 264
++G + DLG++
Sbjct: 722 --LKGSVKDLGIN 732


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_3958  VACCYTOTOXIN  31     0.036    Helicobacter pylori vacuolating cytotoxin signature.
		>VACCYTOTOXIN#Helicobacter pylori vacuolating cytotoxin signature.

Length = 1291

Score = 30.8 bits (69), Expect = 0.036
Identities = 60/286 (20%), Positives = 95/286 (33%), Gaps = 33/286 (11%)

Query: 99 GAGGVRFISARNVDLTFKGAISTNGGAGIVAYSGGEPSF-DVRVSSTGNISTRGAGAAGI 157
G+G R S+ + L I++ A I Y G + V GN+ G
Sbjct: 208 GSGAGRKASSTVLTLQASEGITSRENAEISLYDGATLNLASNSVKLMGNVWMGRLQYVGA 267

Query: 158 TAGTSCCGGVTLDSTGDISTNGNSSNGL-VAQAGAFSSAVTSKGNVTTAGDSSAAIAAFS 216
S T TG+++ N + AQAG +S T G + + I A
Sbjct: 268 YLAPSYSTINTSKVTGEVNFNHLTVGDHNAAQAGIIASNKTHIGTLDLWQSAGLNIIAPP 327

Query: 217 SGSLARISSQGNLATSGSSAPGITASGRDIAINSIGNIVTSGSGSDGILARSTSFGQLSS 276
G + N S ++ ++ + N+ V + S Q +
Sbjct: 328 EGGY---KDKPNDKPSNTTQNNAKNDKQESSQNNSNTQVINPPNS----------AQKTE 374

Query: 277 ISSNGNITASGDGSYAIHAITSSIGNTTVTNLGGTISGGPGAGAGIYLQGGNANVLTNYG 336
I I DG +A T N TN GTI G G A++ TN
Sbjct: 375 IQPTQVI----DGPFAGGKNTVVNINRINTNADGTIRVG----------GFKASLTTNAA 420

Query: 337 TVTSAAGTAGSAILGDTGNIVVNNSGTIIGSVNLGRGTNSFNNLAG 382
+ G + +++V N + G++ + G NN G
Sbjct: 421 HLHIGKGGINLSNQASGRSLLVEN---LTGNITVD-GPLRVNNQVG 462


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BBta_3959  TCRTETB  31     0.008    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 31.4 bits (71), Expect = 0.008
Identities = 43/190 (22%), Positives = 82/190 (43%), Gaps = 22/190 (11%)

Query: 236 LRHHA--FWALFATFFFTAIGMYAIAPQVVAYLIDAGFAPLQAATAWGFSGVVLLF--GM 291
LRH+ W +FF + + V I F A+T W + +L F G
Sbjct: 10 LRHNQILIWLCILSFFSV---LNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGT 66

Query: 292 LGVSALDGLIGRRPSVLFSYAVSIAGILMLWALQVWPNILLLTGFVVCFGSMIGSRGPLL 351
L +G + +LF ++ G ++ + + ++L++ F+ G + P L
Sbjct: 67 AVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAG---AAAFPAL 123

Query: 352 SATAMKIF-----RGKRLGTIYGTITIGSGLGSALGSWSGGVIHDLTHGYDAVILLALVS 406
+ + RGK G I + +G G+G A+G G + H + Y ++L+ +++
Sbjct: 124 VMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIG---GMIAHYIHWSY--LLLIPMIT 178

Query: 407 VIIGMIPFLV 416
+I +PFL+
Sbjct: 179 II--TVPFLM 186


103  BBta_4054  BBta_4061  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BBta_4054  9       24       2.148738   hypothetical protein
BBta_4055  3       23       1.205876   hypothetical protein
BBta_4056  1       20       0.545136   hypothetical protein
BBta_4057  1       19       0.489807   hypothetical protein
BBta_4058  -1      22       -2.114118  hypothetical protein
BBta_4059  -2      25       -1.167391  hypothetical protein
BBta_4060  -2      17       -0.566132  globin family protein
BBta_4061  -1      13       1.327809   response regulator receiver
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
BBta_4054  IGASERPTASE  41     2e-07    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 41.2 bits (96), Expect = 2e-07
Identities = 21/118 (17%), Positives = 41/118 (34%), Gaps = 17/118 (14%)

Query: 13 QKAAQQARQGILQKFKAQPGPDDPEVVKRRQEREAAAARREQQRLE-------------R 59
+ + I + +A P P E A +++E + +E R
Sbjct: 1007 VPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNR 1066

Query: 60 EAAKAEQKRLEEEAKAAEAARLAREAEEAAARLAE----MEAEQKAKRDARYAARKAR 113
E AK + ++ + E A+ E +E + +E E+KAK + +
Sbjct: 1067 EVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPK 1124



Score = 33.5 bits (76), Expect = 1e-04
Identities = 18/107 (16%), Positives = 33/107 (30%), Gaps = 7/107 (6%)

Query: 12 RQKAAQQARQGILQKFKAQPG-PDDP---EVVKRRQEREAA---AARREQQRLEREAAKA 64
Q A + Q +A+ + EV + E + + + E AK
Sbjct: 1055 EQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKV 1114

Query: 65 EQKRLEEEAKAAEAARLAREAEEAAARLAEMEAEQKAKRDARYAARK 111
E ++ +E K +E E AE E + + +
Sbjct: 1115 ETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQ 1161


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
BBta_4057  PERTACTIN  29     0.021    Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 29.3 bits (65), Expect = 0.021
Identities = 19/43 (44%), Positives = 21/43 (48%)

Query: 54 PAAAVQPGGQPHAQPPRQGSGPVAGQPSGPGPRAAAAPQARPP 96
P A QPG QP QPP+ P QP P R AP +PP
Sbjct: 571 PKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPP 613


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
BBta_4058  MICOLLPTASE  28     0.014    Microbial collagenase metalloprotease (M9) signature.
		>MICOLLPTASE#Microbial collagenase metalloprotease (M9) signature.

Length = 1104

Score = 27.8 bits (61), Expect = 0.014
Identities = 16/64 (25%), Positives = 27/64 (42%), Gaps = 5/64 (7%)

Query: 23 DGRWMFPGGRKRARETAKDCLKREIKEELPKLKLGRLRLWKEVTA-----KNKRSGRKMS 77
D R + GGR + E + ++ + L +L +K VTA K +G +
Sbjct: 695 DMRGTYVGGRSQGEENDWKDMNSKLNDILKELSKKSWNGYKTVTAYFVNHKVDGNGNYVY 754

Query: 78 DAIF 81
D +F
Sbjct: 755 DVVF 758


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
BBta_4061  HTHFIS  61     5e-14    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 61.0 bits (148), Expect = 5e-14
Identities = 21/127 (16%), Positives = 50/127 (39%), Gaps = 7/127 (5%)

Query: 5 LLVIEDADVHLSILRKIAVQAGFATTGVNSVDGAINVLRRRNFECITLDLNLGERSGTEV 64
+LV +D ++L + +AG+ ++ + + + + D+ + + + ++
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 65 LQLLADMKSRTPVLIISGSDDQTRDV-TVRAGKILGLNVYPPFCKPVDLALLRQTLRQIA 123
L + + PVL++S + + G Y KP DL L + +
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKG------AYDYLPKPFDLTELIGIIGRAL 119

Query: 124 ADTDRQR 130
A+ R+
Sbjct: 120 AEPKRRP 126


104  BBta_4065  BBta_4077  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias   Product
BBta_4065  -2      8        2.056589  hydrolase
BBta_4066  -1      11       1.746417  hypothetical protein
BBta_4067  0       10       0.929083  2-C-methyl-D-erythritol 4-phosphate
BBta_4068  -1      11       0.503195  tRNA-dihydrouridine synthase
BBta_4069  -1      11       0.288037  signal transduction histidine kinase, nitrogen
BBta_4070  -1      10       0.434897  nitrogen metabolism transcriptional regulator
BBta_4071  0       8        0.335718  multi-sensor signal transduction histidine
BBta_4072  0       6        0.327175  two component, sigma54 specific, Fis family
BBta_4073  0       6        0.421175  RNA-binding protein Hfq
BBta_4074  0       6        0.707950  GTP-binding protein (hflX)
BBta_4075  0       7        0.410947  acetolactate synthase catalytic subunit
BBta_4076  0       9        0.387369  hypothetical protein
BBta_4077  2       10       0.035671  4-hydroxybenzoate transporter
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BBta_4065  PF06057  29     0.028    Type IV secretory pathway VirJ component
		>PF06057#Type IV secretory pathway VirJ component

Length = 243

Score = 28.7 bits (64), Expect = 0.028
Identities = 20/67 (29%), Positives = 25/67 (37%), Gaps = 1/67 (1%)

Query: 56 AKADESVPLGGLAITGENSLPGVKDCPIKGTKLPLIVFSHGRGGWFGQHHDLNEALADAG 115
A ADE GL + V TK PL++F G GGW + L G
Sbjct: 20 AFADEFADNLGLTLLPVEPSTQVNA-ASSHTKPPLVIFLSGDGGWATLDKAVGGILQQQG 78

Query: 116 FIVAAIS 122
+ V S
Sbjct: 79 WPVVGWS 85


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
BBta_4070  HTHFIS  589    0.0      FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 589 bits (1520), Expect = 0.0
Identities = 357/481 (74%), Positives = 411/481 (85%), Gaps = 4/481 (0%)

Query: 1 MPAGSILVADDDTAIRTVLNQALSRAGYEVRLTGNAATLWRWVSQGEGDLVITDVVMPDE 60
M +ILVADDD AIRTVLNQALSRAGY+VR+T NAATLWRW++ G+GDLV+TDVVMPDE
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 NAFDLLPRIKKMRPNLPVIVMSAQNTFMTAIRASERGAYEYLPKPFDLKELIAIVGRALA 120
NAFDLLPRIKK RP+LPV+VMSAQNTFMTAI+ASE+GAY+YLPKPFDL ELI I+GRALA
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 121 EPKERAASQPDDNEFESIPLVGRSPAMQEIYRVLARLMQTDLTVMISGESGTGKELVARA 180
EPK R + DD++ + +PLVGRS AMQEIYRVLARLMQTDLT+MI+GESGTGKELVARA
Sbjct: 121 EPKRRPSKLEDDSQ-DGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARA 179

Query: 181 LHDYGKRRNGPFVAVNMAAIPRDLIESELFGHERGAFTGANTRASGRFEQAEGGTLFLDE 240
LHDYGKRRNGPFVA+NMAAIPRDLIESELFGHE+GAFTGA TR++GRFEQAEGGTLFLDE
Sbjct: 180 LHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDE 239

Query: 241 IGDMPMEAQTRLLRVLQQGEYTTVGGRTPIKTDVRIVAASNKDLRILIQQGLFREDLFFR 300
IGDMPM+AQTRLLRVLQQGEYTTVGGRTPI++DVRIVAA+NKDL+ I QGLFREDL++R
Sbjct: 240 IGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYR 299

Query: 301 LNVVPLRLPPLRERIEDLPDLVRHFFALAEKDGLPPKKLDAAALERLKQHRWPGNVRELE 360
LNVVPLRLPPLR+R ED+PDLVRHF AEK+GL K+ D ALE +K H WPGNVRELE
Sbjct: 300 LNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLDVKRFDQEALELMKAHPWPGNVRELE 359

Query: 361 NLARRLAALYPQDVITASVIDGELA---PPAVTTGSSTPTSIDNLGGAVEAYLSSHFQGF 417
NL RRL ALYPQDVIT +I+ EL P + ++ + ++ AVE + +F F
Sbjct: 360 NLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASF 419

Query: 418 PNGVPPPGLYHRILKEIEIPLLTAALAATRGNQIRAADLLGLNRNTLRKKIRDLDIQVYR 477
+ +PP GLY R+L E+E PL+ AAL ATRGNQI+AADLLGLNRNTLRKKIR+L + VYR
Sbjct: 420 GDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVSVYR 479

Query: 478 S 478
S
Sbjct: 480 S 480


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BBta_4071  PF06580  46     3e-07    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 46.0 bits (109), Expect = 3e-07
Identities = 28/157 (17%), Positives = 52/157 (33%), Gaps = 25/157 (15%)

Query: 555 IVRQVDDIRRMVDEFSRFARMPKPVMEGEDVA-----DTVRQAVFLMKVAHPE-IDIETE 608
I+ R M+ S R V+ V + L + + + E +
Sbjct: 186 ILEDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQ 245

Query: 609 IKQDPLRAQFDRRLISQALTNIIKNATEAIEVVPEEELGKGRIDVVASREGDDVTIDVID 668
I + Q L+ + N IK+ + G+I + +++ VT++V +
Sbjct: 246 INPAIMDVQVPPMLVQTLVENGIKHGIAQLP-------QGGKILLKGTKDNGTVTLEVEN 298

Query: 669 NGIGLPKVARQRLLEPYVTTRAKGTGLGLAIVGRVLE 705
G K ++ TG GL V L+
Sbjct: 299 TGSLALKNTKE------------STGTGLQNVRERLQ 323


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
BBta_4072  HTHFIS  423    e-147    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 423 bits (1090), Expect = e-147
Identities = 167/481 (34%), Positives = 262/481 (54%), Gaps = 35/481 (7%)

Query: 2 ASDILIVDDEADIRELVAGILDDEGFTTRTARDSDSALAEIAGRRPNLVFLDIWLQGSKL 61
+ IL+ DD+A IR ++ L G+ R ++ + IA +LV D+ +
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPD--E 60

Query: 62 DGLQLLEQIKKDHPDLPVVMISGHGNIETAVAAIKRGAYDFIEKPFKSDRLILVATRALE 121
+ LL +IKK PDLPV+++S TA+ A ++GAYD++ KPF LI + RAL
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL- 119

Query: 122 TSRLKREVRELKQLAPTASTMTGRSPSMNQLRQTIERAAKANSRILIVGPPGSGKELAAR 181
+ KR +L+ + + GRS +M ++ + + R + + ++I G G+GKEL AR
Sbjct: 120 -AEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVAR 178

Query: 182 TLHAQSPRAEGPFVVINAAAITPERMEEELFGVEQS--NGEQSRKTGALEEAHGGTLFVD 239
LH R GPFV IN AAI + +E ELFG E+ G Q+R TG E+A GGTLF+D
Sbjct: 179 ALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLD 238

Query: 240 EIADMPRETQNKILRVLVDQTFQRAGGTTKVHVDVRIISSTARNLEEEIAAGRFREDLYH 299
EI DMP + Q ++LRVL + GG T + DVRI+++T ++L++ I G FREDLY+
Sbjct: 239 EIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYY 298

Query: 300 RLSVVPIRVPPLSERREDIPELIDYFMDQISVTTGLPKRQIGQDAMAVLQSHVWPGNVRQ 359
RL+VVP+R+PPL +R EDIP+L+ +F+ Q GL ++ Q+A+ ++++H WPGNVR+
Sbjct: 299 RLNVVPLRLPPLRDRAEDIPDLVRHFVQQAE-KEGLDVKRFDQEALELMKAHPWPGNVRE 357

Query: 360 LRNNVERVMILAAGGPEVIITADMLPPDVGSMVPAMPTSNNGEHIMGLPLR--------- 410
L N V R+ L +IT +++ ++ S +P P L +
Sbjct: 358 LENLVRRLTALYPQD---VITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQ 414

Query: 411 ----------------EAREVFERDYLIAQISRFSGNISRTAEFVGMERSALHRKLKALG 454
E ++A ++ GN + A+ +G+ R+ L +K++ LG
Sbjct: 415 YFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELG 474

Query: 455 V 455
V
Sbjct: 475 V 475


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BBta_4077  TCRTETB  65     3e-13    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 64.5 bits (157), Expect = 3e-13
Identities = 76/392 (19%), Positives = 143/392 (36%), Gaps = 27/392 (6%)

Query: 44 LCALVVGLDGFDAQALGFVAPALSKDLHLAPGALGPVFGASLFGVMIGSLVFGALADYLG 103
LC L + L P ++ D + P + V A + IG+ V+G L+D LG
Sbjct: 19 LCILSF-FSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLG 77

Query: 104 RKWLVIAGVLVFALGSL-ATTQATSVSDLVMIRFVTGIGLGGVLPNTIALTGEYSPQRRR 162
K L++ G+++ GS+ + S L+M RF+ G G + + Y P+ R
Sbjct: 78 IKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENR 137

Query: 163 TLLIMLMFMTVSLGSAIGGAVAARLITAYGWQVIFLIGGALPLILCPILIIWLPESLSLL 222
L+ V++G +G A+ + W + LI +I P L+ L + + +
Sbjct: 138 GKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMIT-IITVPFLMKLLKKEVRI- 195

Query: 223 ALDTSKSDRVRTLLMRIDPSAMLPADARFTI----------------VEESGKGFLLPQL 266
D +LM + + ++I + + F+ P L
Sbjct: 196 ---KGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGL 252

Query: 267 FTKRRLLPTLLLWIMFFMNMMDIYFLNSWLPTLTHGVGLDVQAAIAVGIAFQLGGMLGTI 326
+ +L + F + + ++ H + ++ + + G I
Sbjct: 253 GKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYI 312

Query: 327 GLGLLIERFGFDRVLFATYVAGFLSIVTIGIAGASLPILVPAVFIAGVAVIGGQIGCNAY 386
G G+L++R G L+ + V+ A L + I V V+GG
Sbjct: 313 G-GILVDRRG---PLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTV 368

Query: 387 AARIYPTYIRGTGIGWALGIGRFGSILGVTLG 418
+ I + ++ G + + F S L G
Sbjct: 369 ISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTG 400



Score = 32.2 bits (73), Expect = 0.005
Identities = 37/154 (24%), Positives = 60/154 (38%), Gaps = 3/154 (1%)

Query: 276 LLLWIMF--FMNMMDIYFLNSWLPTLTHGVGLDVQAAIAVGIAFQLGGMLGTIGLGLLIE 333
+L+W+ F ++++ LN LP + + + V AF L +GT G L +
Sbjct: 15 ILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSD 74

Query: 334 RFGFDRVLFATYVAGFLSIVTIGIAGASLPILVPAVFIAGVAVIGGQIGCNAYAARIYPT 393
+ G R+L + V + + +L+ A FI G AR P
Sbjct: 75 QLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPK 134

Query: 394 YIRGTGIGWALGIGRFGSILGVTLGGLMLAA-HW 426
RG G I G +G +GG++ HW
Sbjct: 135 ENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHW 168


105  BBta_4103  BBta_4110  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BBta_4103  -1      10       -0.601750  serine-type D-Ala-D-Ala carboxypeptidase
BBta_4104  0       14       -0.731881  epoxide hydrolase
BBta_4105  0       18       -1.642620  hypothetical protein
BBta_4106  0       19       -1.381848  hypothetical protein
BBta_4107  0       21       -1.997591  *AcrB/AcrD/AcrF family multidrug efflux protein
BBta_4108  0       23       -2.026173  secretion protein (HlyD)
BBta_4109  -1      20       -2.263935  glycosyltransferase
BBta_4110  -1      16       -2.442620  non-ribosomal peptide synthase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
BBta_4103  BLACTAMASEA  36     2e-04    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 36.3 bits (84), Expect = 2e-04
Identities = 52/272 (19%), Positives = 86/272 (31%), Gaps = 31/272 (11%)

Query: 3 YAANQSVQGARKEEGGFDGDAPTAILMEANSGSVLFEKNADELRAPSSMMKLMTAEVVFN 62
+A+ Q ++ + E G M+ SG L ADE S K++ V
Sbjct: 20 HASPQPLEQIKLSESQLSGRVGMI-EMDLASGRTLTAWRADERFPMMSTFKVVLCGAVLA 78

Query: 63 AVKEGTIKLNDEYRISENAWRRGGAPSGGSTMFAALNSKVSVSDLLHGAIIQSGNDACIA 122
V G +L + + L ++V +L AI S N A
Sbjct: 79 RVDAGDEQLERKIHYRQQD-----LVDYSPVSEKHLADGMTVGELCAAAITMSDNSAANL 133

Query: 123 LAEGMAGNEKIFAADYMTKRARELGLTKS------TFANSNGLPDPGNKMTVRELAMLAR 176
L + G + T R++G + T N D + T +A R
Sbjct: 134 LLATVGGPAGL------TAFLRQIGDNVTRLDRWETELNEALPGDARDTTTPASMAATLR 187

Query: 177 HIILDFPEFYKIFGEKE----FTWNKIRQQNRNPLLNAMEGAD---GLKTGFTKEGGYGM 229
++ + + W + PL+ ++ A KTG + G G+
Sbjct: 188 KLLTS-----QRLSARSQRQLLQWMV-DDRVAGPLIRSVLPAGWFIADKTGAGERGARGI 241

Query: 230 VGSAVQNGTRLIVVVNGLEDPEDRATEAKKML 261
V N +VV L D E + +
Sbjct: 242 VALLGPNNKAERIVVIYLRDTPASMAERNQQI 273


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_4107  ACRIFLAVINRP  596    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 596 bits (1537), Expect = 0.0
Identities = 270/1038 (26%), Positives = 471/1038 (45%), Gaps = 46/1038 (4%)

Query: 39 ISAWSIRKPIPSLVLFGVLIVLGAVSLKTLPITQMPNIDIPIVTVTIAQTGAAPSELETQ 98
++ + IR+PI + VL +L++ GA+++ LP+ Q P I P V+V+ GA ++
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 99 VTKNVENAVTGVVGAKHVTS-SISDGVSVTTIEFQLGTPADRAVNDVRNAMANIRSELPQ 157
VT+ +E + G+ +++S S S G T+ FQ GT D A V+N + LPQ
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 158 SIEEPSIQRVEVGGMAIVTYAVSF--PAWTAEQVSWFVDDVITGALQGVRGVAQVKRAGG 215
+++ I + ++ P T + +S +V + L + GV V + G
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDV-QLFG 179

Query: 216 ADREIRVSLQPDRLIALGITAADVNAQLRATNVDLSGGR------GEVGAGEQTIRMLAG 269
A +R+ L D L +T DV QL+ N ++ G+ +I
Sbjct: 180 AQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239

Query: 270 AATIDGLANTAI-VLPGGRRTALKEIATVIDGAADARSFARLDGRPVVTFGVFRAKGFSD 328
+ + V G LK++A V G + AR++G+P G+ A G +
Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299

Query: 329 VAVAEAVGKRLELLAKEHP-DLSVSEIDSTVRYTKSDYRATMQTLTEGAILAVVVVLIFL 387
+ A+A+ +L L P + V T + + ++TL E +L +V+ +FL
Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359

Query: 388 RDLRATAISVLAIPLSILPTFWVMDIIGFSLNAVSLLAITLVTGILVDDAIVEIENIVRH 447
+++RAT I +A+P+ +L TF ++ G+S+N +++ + L G+LVDDAIV +EN+ R
Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419

Query: 448 IRMGKSAYR-ASLQAADEIGLAVVATTMTIVAMFMPVSLMDGIAGQYFKQFGLTVAIAVT 506
+ K + A+ ++ +I A+V M + A+F+P++ G G ++QF +T+ A+
Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479

Query: 507 FSLLVARLITPLLAAYFLRA-PQRHSEGHG-----------VLMRHYLRMLEWSLRHRFV 554
S+LVA ++TP L A L+ H E G + HY + L
Sbjct: 480 LSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539

Query: 555 TLAFAGLVFLGSLMVAGALPLGFLPTDDLSRSVLLIELPPGSTIADTVAATDRITALLKK 614
L L+ G +++ LP FLP +D + +I+LP G+T T D++T K
Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599

Query: 615 --RPEVRSVYAVGGTAGAHGLSVTAGDVRKATIIVDLVARSNRSHGQKAFEHDMRAALGA 672
+ V SV+ V G + + G + AG + R+ + +A H + LG
Sbjct: 600 NEKANVESVFTVNGFSFS-GQAQNAGMA--FVSLKPWEERNGDENSAEAVIHRAKMELGK 656

Query: 673 IPDLRY---------SFGNGGGGREFTLILSGRDGAVVEQAAFAVERDARQNVSVLANVV 723
I D G G + +G + QA + A Q+ + L +V
Sbjct: 657 IRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVR 716

Query: 724 STVALARPEIRILPRLEEAADLGISGTQIAEVARIATIGDVSARLAKFSAVDRQVPIRVQ 783
+ ++ E+A LG+S + I + A G R + VQ
Sbjct: 717 PNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDR---GRVKKLYVQ 773

Query: 784 LDERARGDLSTLDMLRIKARNGT-VPLATVAEIGFGEGPMTIERYDGRRRVAIEADLVGS 842
D + R +D L +++ NG VP + + G +ERY+G + I+ +
Sbjct: 774 ADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPG 833

Query: 843 TPLGEAIAQVMALPSARNLPAGVTIARFGDSEIMDDVFSSFSFAIAAGVLMVLAVLVLLF 902
T G+A+A + L A LPAG+ G S + +A ++V L L+
Sbjct: 834 TSSGDAMALMENL--ASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALY 891

Query: 903 ADAMHPITIIFSMPLSIGGALLALLLAGHAINLSAIIGFLMLMGIVTKNAILLVDFAI-T 961
P++++ +PL I G LLA L ++ ++G L +G+ KNAIL+V+FA
Sbjct: 892 ESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDL 951

Query: 962 EVASGVERTTALLEAGRKRAQPVIMTTAAMTAGMVPSALGLGDGGAFRSPMAVALIGGLL 1021
G A L A R R +P++MT+ A G++P A+ G G ++ + + ++GG++
Sbjct: 952 MEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMV 1011

Query: 1022 ASTFLSLVFVPAAFTIMD 1039
++T L++ FVP F ++
Sbjct: 1012 SATLLAIFFVPVFFVVIR 1029



Score = 67.9 bits (166), Expect = 1e-13
Identities = 69/514 (13%), Positives = 161/514 (31%), Gaps = 27/514 (5%)

Query: 33 TEMALNISAWSIRKPIPSLVLFGVLIVLGAVSLKTLPITQMPNIDIPIVT--------VT 84
N + L+++ +++ V LP + +P D + T
Sbjct: 523 VNHYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGAT 582

Query: 85 IAQTGAAPSELETQVTKNVENAVTGVVGAKHVT-SSISDGVSVTTIEFQLGTPADRAVND 143
+T ++ KN + V V + S + + + + + N
Sbjct: 583 QERTQKVLDQVTDYYLKNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENS 642

Query: 144 VRNAMANIRSELPQ----SIEEPSIQRVEVGGMAIV--TYAVSFPAWTAEQVSWFVDDVI 197
+ + EL + + ++ + G A + + ++ + ++
Sbjct: 643 AEAVIHRAKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLL 702

Query: 198 TGALQGVRGVAQVKRAGGADR-EIRVSLQPDRLIALGITAADVNAQLRATNVDLSGGRGE 256
A Q + V+ G D + ++ + ++ ALG++ +D+N +
Sbjct: 703 GMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFI 762

Query: 257 VGAGEQTIRMLAGAA---TIDGLANTAIVLPGGRRTALKEIATVIDGAADARSFARLDGR 313
+ + + A A + + + G T + R +G
Sbjct: 763 DRGRVKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYG-SPRLERYNGL 821

Query: 314 PVVTFGVFRAKGFSDVAVAEAVGKRLELLAKEHPDLSVSEIDSTVRYTKSDYRATMQTLT 373
P + A G S + E LA + P + + L
Sbjct: 822 PSMEIQGEAAPGTSSGDAMALM----ENLASKLPAGIGYDWTGMSYQERLSGN-QAPALV 876

Query: 374 EGAILAVVVVLIFL-RDLRATAISVLAIPLSILPTFWVMDIIGFSLNAVSLLAITLVTGI 432
+ + V + L L +L +PL I+ + + ++ + G+
Sbjct: 877 AISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGL 936

Query: 433 LVDDAIVEIENIV-RHIRMGKSAYRASLQAADEIGLAVVATTMTIVAMFMPVSLMDGIAG 491
+AI+ +E + GK A+L A ++ T++ + +P+++ +G
Sbjct: 937 SAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGS 996

Query: 492 QYFKQFGLTVAIAVTFSLLVARLITPLLAAYFLR 525
G+ V + + L+A P+ R
Sbjct: 997 GAQNAVGIGVMGGMVSATLLAIFFVPVFFVVIRR 1030


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
BBta_4108  RTXTOXIND  46     1e-07    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 46.4 bits (110), Expect = 1e-07
Identities = 19/139 (13%), Positives = 41/139 (29%)

Query: 49 AVTGSLVAREENVVGAEVDGLRIVELLADVGDRVEAGQVLARLDGTMLRTQLAQNTAMIA 108
G L + ++ + E++ G+ V G VL +L + + +
Sbjct: 85 TANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLL 144

Query: 109 IAQASVAQMQASMAEMQANEAEAADALNRTLTLNSTGTISPAQLLARETQAKVAAAKSTA 168
A+ + Q ++ N+ + N + + Q +
Sbjct: 145 QARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQ 204

Query: 169 AAENLRMAKAEEAFAEAQR 187
NL +AE A+
Sbjct: 205 KELNLDKKRAERLTVLARI 223



Score = 45.2 bits (107), Expect = 3e-07
Identities = 40/310 (12%), Positives = 96/310 (30%), Gaps = 46/310 (14%)

Query: 34 TTLARAERQSITETLAVTGSLVAREENV-VGAEVDGLRIVELLADVGDR------VEAGQ 86
A A+ +L R + + E++ L ++L + + V
Sbjct: 129 ALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLT 188

Query: 87 VLARLDGTMLRTQLAQNTAMIAIAQASVAQMQASMAEMQANEAEAADALNRTLTLNSTGT 146
L + + + Q Q + +A + A + + L+ +L
Sbjct: 189 SLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQA 248

Query: 147 ISPAQLLARE------------TQAKVAAAKSTAA-----------------AENLRMAK 177
I+ +L +E ++++ +S + LR
Sbjct: 249 IAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTT 308

Query: 178 AEEAFAEAQRGEIELKIARTELKAPTAGIISHRAAR-LGAVTGTAGEPLFRLI-RNGQIE 235
+ + E + + ++AP + + G V T E L ++ + +E
Sbjct: 309 DNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVV-TTAETLMVIVPEDDTLE 367

Query: 236 FDAEVPETVLPQIEPAQDVEVWLPGVSQS----IRGRVRLVDPTVDKASRLG---RVAVA 288
A V + I Q+ + + + + G+V+ ++ + RLG V ++
Sbjct: 368 VTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNVIIS 427

Query: 289 LSPHAAMRAG 298
+ +
Sbjct: 428 IEENCLSTGN 437


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_4110  NUCEPIMERASE  40     6e-05    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 39.8 bits (93), Expect = 6e-05
Identities = 65/327 (19%), Positives = 112/327 (34%), Gaps = 84/327 (25%)

Query: 1133 RIFLTGATGFVGSHLLSALIQETDAQIVCHVRAADRQSGEARLRQALDKRKLSVSGDEHR 1192
+ +TGA GF+G H+ L++ V D L+ VS + R
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGH-----QVVGID----------NLNDYY-DVSLKQAR 45

Query: 1193 IEVLTGNLGHPALGLDERGIRIVRD-----ECEAIYHCGANV---DFLRHYAALKPANVD 1244
+E+L G +D + D E ++ + L + A +N+
Sbjct: 46 LELL-AQPGFQFHKIDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLT 104

Query: 1245 SVLTLLDWTASGRPKRLHYVSTLAVIDKFQAGPVSELTDLTSWQGLTSGYSQSKWVGDTL 1304
L +L+ + + L Y S+ +V + P S D + + S Y+ +K + +
Sbjct: 105 GFLNILEGCRHNKIQHLLYASSSSVYGLNRKMPFST--DDSVDHPV-SLYAATKKANELM 161

Query: 1305 VRQ-AQLRGLPLAIYRLSSVTG------------------DRANGICNETDLMWRIVRLY 1345
+ L GLP R +V G ++ + N M R
Sbjct: 162 AHTYSHLYGLPATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGK-MKR----- 215

Query: 1346 AELGAIPDLDFRLNMTPADDVARAIVRLAGS----EASWGS-------------VYHLMN 1388
DF T DD+A AI+RL + W VY++ N
Sbjct: 216 ---------DF----TYIDDIAEAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGN 262

Query: 1389 SQPVDVREVPRIFRR-LGVPIEIVPLD 1414
S PV++ + + LG+ + L
Sbjct: 263 SSPVELMDYIQALEDALGIEAKKNMLP 289


106  BBta_4215  BBta_4222  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias   Product
BBta_4215  -1      12       0.068434  biotin carboxyl carrier protein
BBta_4216  -2      11       0.567556  acetyl-CoA carboxylase biotin carboxylase
BBta_4217  -3      11       1.109565  multi-sensor signal transduction histidine
BBta_4218  -2      11       1.981564  two-component response regulator/receiver
BBta_4219  -2      11       2.906793  signal transduction histidine kinase
BBta_4220  -1      11       2.077547  leucyl/phenylalanyl-tRNA--protein transferase
BBta_4221  -1      12       1.797253  hypothetical protein
BBta_4222  0       14       1.468576  Levodione reductase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
BBta_4215  RTXTOXIND  33     4e-04    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 33.3 bits (76), Expect = 4e-04
Identities = 11/55 (20%), Positives = 21/55 (38%), Gaps = 3/55 (5%)

Query: 107 FVEVGSKVSVGQTLMIIEAMKTMNQIPSPRAGTVTQILVEDGQPVEFGEPLVIIE 161
+V + L K I V +I+V++G+ V G+ L+ +
Sbjct: 77 LGQVEIVATANGKLTHSGRSKE---IKPIENSIVKEIIVKEGESVRKGDVLLKLT 128


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BBta_4217  PF06580  39     3e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 38.7 bits (90), Expect = 3e-05
Identities = 17/102 (16%), Positives = 35/102 (34%), Gaps = 21/102 (20%)

Query: 398 LIDNAIKY-LKPGV-PGDIAIRARTKLGFAIFEITDNGRGIDPRDHQRIFDLFRRAGTQD 455
L++N IK+ + G I ++ G E+ + G +
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKE------------- 309

Query: 456 RPGQGIGLAHVRALVRRLGG---TMSVASELDQGSTFTITLP 494
G GL +VR ++ L G + ++ + + +P
Sbjct: 310 --STGTGLQNVRERLQMLYGTEAQIKLSEK-QGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
BBta_4218  HTHFIS  52     1e-10    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 51.8 bits (124), Expect = 1e-10
Identities = 24/126 (19%), Positives = 53/126 (42%), Gaps = 13/126 (10%)

Query: 6 TIIMIEDDEGHARLIERNIRRSGVNNEIMPFTNGTDAVKYLLGTDGTGLDHKGRALLILL 65
TI++ +DD ++ + + R+G ++ +N +++ DG L++
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGY--DVRITSNAATLWRWIAAGDGD---------LVVT 53

Query: 66 DLNLPDMTGIDILRMVKQNSFLKSAPVVILTTTDDSQEIKRCYELGCNVYITKPVNYESF 125
D+ +PD D+L +K PV++++ + + E G Y+ KP +
Sbjct: 54 DVVMPDENAFDLLPRIK--KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTEL 111

Query: 126 ANAIRQ 131
I +
Sbjct: 112 IGIIGR 117


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
BBta_4219  HTHFIS  86     2e-20    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 85.7 bits (212), Expect = 2e-20
Identities = 36/135 (26%), Positives = 60/135 (44%), Gaps = 3/135 (2%)

Query: 5 APTLLYIDDDATLARLVERGLKRLGFTVEHAPDGAAGLERIQHGGIDVVALDQYMPGLDG 64
T+L DDDA + ++ + L R G+ V + A I G D+V D MP +
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 65 LETLERIMALPEAPPVVFVTASQDSKIAVTALKAGAADYLVKDLQGEFVPLLQVAAEGAL 124
+ L RI PV+ ++A A+ A + GA DYL K + L AL
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFD---LTELIGIIGRAL 119

Query: 125 RQARLQKARDEAEAE 139
+ + + ++ E +++
Sbjct: 120 AEPKRRPSKLEDDSQ 134


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_4221  IGASERPTASE  33  0.001  IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 33.5 bits (76), Expect = 0.001
Identities = 43/232 (18%), Positives = 66/232 (28%), Gaps = 15/232 (6%)

Query: 79 PLAPPPGTAVPPTNAPVAVAPPPGQPGAAPPAGGQRQPPRGAPQNAAV-PPNGAVP---- 133
P V TN P A P+ A V PP A P
Sbjct: 983 PEVEKRNQTVDTTNIT-----TPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETT 1037

Query: 134 QTPATLQPGDEVVTEPPAQKIVNKKASFSGLDKITGRIINFDEDIGETVQFGALRVKTDA 193
+T A + E Q A + K + + E Q G+ +T
Sbjct: 1038 ETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQT 1097

Query: 194 CYTR-PATEAANTDAFVQVDEITLQGEVKRIFSGWMFAASPGLHGVEHPIYDIWLVDCKE 252
T+ AT A V+ ++ +V S + E + V+ KE
Sbjct: 1098 TETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKE 1157

Query: 253 PQTTVVSTAPDQKPAAQQPAQKRPP----QQRQAAPRPQAPPQQYQTQQMPP 300
PQ+ +TA ++PA + + P P+ P
Sbjct: 1158 PQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQP 1209



Score = 28.9 bits (64), Expect = 0.032
Identities = 14/60 (23%), Positives = 20/60 (33%), Gaps = 6/60 (10%)

Query: 251 KEPQTTVVSTAPDQKPAAQQPA-----QKRPPQQRQAAPRPQAPPQQYQTQQMPPPPPPP 305
KE QTT + + Q+ P Q +P+ Q + Q Q P P
Sbjct: 1093 KETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPK-QEQSETVQPQAEPARENDP 1151


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_4222  DHBDHDRGNASE  110  2e-31  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 110 bits (277), Expect = 2e-31
Identities = 78/254 (30%), Positives = 122/254 (48%), Gaps = 6/254 (2%)

Query: 7 LDGRVAVVTGAAGLIGAATMRLLAARGARIVAIDRREQDLNAAIAALPASAE-PLAIAAD 65
++G++A +TGAA IG A R LA++GA I A+D + L +++L A A A AD
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 66 VTDEAQVAAYVQRACDQFGTIDVFFNNAGIEGEIKSITDYPLEAFRRVLDVNVVGVFLGL 125
V D A + R + G ID+ N AG+ I E + VN GVF
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVL-RPGLIHSLSDEEWEATFSVNSTGVFNAS 124

Query: 126 KHVLPVMLQQNRGSIINTASIAGLIGSPQIAVYSASKHAVIGLTKSAAWECTGTNVRVNC 185
+ V M+ + GSI+ S + +A Y++SK A + TK E N+R N
Sbjct: 125 RSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNI 184

Query: 186 ICPGLIDSRMLSAIIEGRSGAPVPIDRIVDR----VPARRLGQGSEVAAIVAFLASDDAS 241
+ PG ++ M ++ +GA I ++ +P ++L + S++A V FL S A
Sbjct: 185 VSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAG 244

Query: 242 YVSGAAYTVDGGRT 255
+++ VDGG T
Sbjct: 245 HITMHNLCVDGGAT 258


107  BBta_4334  BBta_4346  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
BBta_4334010-0.670211hypothetical protein
BBta_433519-0.315345hypothetical protein
BBta_4336-18-0.229748toxin/protease secretion system
BBta_4337-18-0.157501toxin/protease secretion system
BBta_433808-0.146817hypothetical protein
BBta_434007-0.149919hypothetical protein
BBta_4341-180.099702hypothetical protein
BBta_4342-17-0.120570hypothetical protein
BBta_4343-110-0.214030hypothetical protein
BBta_4344117-0.199453methionyl-tRNA formyltransferase
BBta_4346119-0.575876hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_4334  GPOSANCHOR  45  6e-07  Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 45.4 bits (107), Expect = 6e-07
Identities = 16/90 (17%), Positives = 32/90 (35%), Gaps = 11/90 (12%)

Query: 196 EQSEKLEEAEAPPKKKPAPAARPPMREPMPAPRRMEEPPRVRAETRPAPGRMRAPAPTKG 255
E+ KL +A + P P + +P + + + + + P+ G
Sbjct: 453 EELAKLRAGKASDSQTPDA---KPGNKAVPGKGQAPQAGTKPNQNKAPMKETKRQLPSTG 509

Query: 256 NSRRRNPSFQANPFVVATAAAIVIGSAMFA 285
+ ANPF A A ++ + + A
Sbjct: 510 ET--------ANPFFTAAALTVMATAGVAA 531


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_4337  RTXTOXIND  341  e-113  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 341 bits (877), Expect = e-113
Identities = 99/422 (23%), Positives = 176/422 (41%), Gaps = 2/422 (0%)

Query: 242 LRLGLRVLLVAAVLGGGWLTLVPLAGAIVVPGNLVVQSNVKAIQHPTGGIIAEIKVDNGK 301
RL ++ V+ L + G L K I+ I+ EI V G+
Sbjct: 57 PRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGE 116

Query: 302 RVTAGDLLVRLDATQSQAQLQAITKQLNEQRAKIARLSAERDGLDQPEYP-TSLTSRPDD 360
V GD+L++L A ++A L + R + R ++ + P L P
Sbjct: 117 SVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYF 176

Query: 361 AHVRA-VIASENALFKARATTRKSQKELLQGRIVQLNNEISGMESQLDSKNKQIELIKGE 419
+V + +L K + +T ++QK + + + E + ++++ + K
Sbjct: 177 QNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSR 236

Query: 420 LAGVQELYDKRLVPLTRLTTLQREAARIDGERGQLVSSVAETRSKISEAELQTVKIDQDF 479
L L K+ + + + + E S + + S+I A+ + + Q F
Sbjct: 237 LDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLF 296

Query: 480 RSEVVKDLGDAQAKEGDLVEKGVAARDQLDRIEMRAPNSGTIHQLAVHTIGGVIKPGETI 539
++E++ L G L + ++ +RAP S + QL VHT GGV+ ET+
Sbjct: 297 KNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETL 356

Query: 540 MELVPDSDELQVEAHVQPKDIDHVHTGQNAMVRFNAFNQRTTPQLSGQVSFVAPDITNDP 599
M +VP+ D L+V A VQ KDI ++ GQNA+++ AF L G+V + D D
Sbjct: 357 MVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQ 416

Query: 600 RSGSTFYTVRITLSEEERQRLGGVNLMPGMQAEVYVQTGSRTMLSYLMKPITDQWRRAFV 659
R G F + + L GM ++TG R+++SYL+ P+ + +
Sbjct: 417 RLGLVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGMRSVISYLLSPLEESVTESLR 476

Query: 660 EQ 661
E+
Sbjct: 477 ER 478


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_4340  cloacin  33  0.008  Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 32.8 bits (74), Expect = 0.008
Identities = 18/73 (24%), Positives = 25/73 (34%)

Query: 1012 ANGTGGTAAAGTTNAGGAAAGAAGAAGGTTANGGAAAGGTSNAGGTANNGGGTANNGGTS 1071
A+ T G G T G + G+ + N G+ G + G NG +
Sbjct: 13 AHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSG 72

Query: 1072 TGGQTANNGSNNA 1084
G T N S A
Sbjct: 73 GGSGTGGNLSAVA 85



Score = 30.8 bits (69), Expect = 0.037
Identities = 31/101 (30%), Positives = 39/101 (38%), Gaps = 8/101 (7%)

Query: 435 MLAAAGQGFAADAMDVSGNNIAIGGATYVGSATTVSTATSVHGLAQGTIPVTATPNLANG 494
M G+G A SGN GG T +G A+ G + P
Sbjct: 1 MSGGDGRGHNTGAHSTSGN--INGGPTGLGVG---GGASDGSGWSSENNPWGGGSGSGIH 55

Query: 495 TGGSSTTPPSTGGGGAGGGTGTGGMHT---TPVNFDPAALA 532
GG S G G +GGG+GTGG + PV F AL+
Sbjct: 56 WGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALS 96


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_4341  cloacin  51  9e-09  Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 51.3 bits (122), Expect = 9e-09
Identities = 33/86 (38%), Positives = 40/86 (46%), Gaps = 10/86 (11%)

Query: 511 MAGGGGAA--GGGTASGGNAGGGTTGGGTAGGGATDGGNA------GGNAGGGNAGGGNA 562
M+GG G G ++ GN GG TG G GG + G + GG +G G GG +
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 563 GGGNAGGGANGGG--TTGGNNQTTTT 586
G GN GG N GG TGGN
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 45.9 bits (108), Expect = 4e-07
Identities = 25/71 (35%), Positives = 32/71 (45%), Gaps = 4/71 (5%)

Query: 507 GGGGMAGG---GGAAGGGTASGGNAGGG-TTGGGTAGGGATDGGNAGGNAGGGNAGGGNA 562
G G GG G G G + G G ++ GGG+ G + GG +G GN GG
Sbjct: 11 TGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGN 70

Query: 563 GGGNAGGGANG 573
GG +G G N
Sbjct: 71 SGGGSGTGGNL 81



Score = 44.7 bits (105), Expect = 8e-07
Identities = 36/86 (41%), Positives = 43/86 (50%), Gaps = 11/86 (12%)

Query: 481 GTGAPAGANPNIANGTGGTPASTTTPGGGGMAGGGGAAGGGTASGGNAGGGTTGGGTAGG 540
G G GA+ N GG P G G GGG + G G +S N GG +G G G
Sbjct: 6 GRGHNTGAHSTSGNINGG-------PTGLG-VGGGASDGSGWSSENNPWGGGSGSGIHWG 57

Query: 541 GATDGGNAGGNAGGGNAGGGNAGGGN 566
G + GN GGN GN+GGG+ GGN
Sbjct: 58 GGSGHGNGGGN---GNSGGGSGTGGN 80



Score = 36.6 bits (84), Expect = 3e-04
Identities = 28/85 (32%), Positives = 31/85 (36%), Gaps = 2/85 (2%)

Query: 483 GAPAGANPNIANGTGGTPASTTTPGGGGMAGGGGAAGGGTASGGNAGGGTTGGGTAGGGA 542
G P G G +S P GGG G G GG + GN GG GG +G G
Sbjct: 22 GGPTGLGVGGGASDGSGWSSENNPWGGG--SGSGIHWGGGSGHGNGGGNGNSGGGSGTGG 79

Query: 543 TDGGNAGGNAGGGNAGGGNAGGGNA 567
A A G A GG A
Sbjct: 80 NLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 34.3 bits (78), Expect = 0.002
Identities = 22/85 (25%), Positives = 31/85 (36%), Gaps = 6/85 (7%)

Query: 535 GGTAGGGATDGGNAGGNAGGGNAGGGNAGGGNAGGG------ANGGGTTGGNNQTTTTHH 588
GG G T + GN GG G G GG + G G GGG+ G + + H
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 589 CHSHHSHHSQDHHAAAAAANTAAAA 613
+ + +S + AA
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAP 87


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_4342  SYCDCHAPRONE  42  4e-06  Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 41.8 bits (98), Expect = 4e-06
Identities = 15/91 (16%), Positives = 27/91 (29%)

Query: 915 PQHLDAHFALGNLLYTAGKDIEAAKCYLKVLEFSPEHAETHNNIANVLLRQGHRERAIEH 974
L+ ++L Y +GK +A K + + + + G + AI
Sbjct: 33 SDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHS 92

Query: 975 YKRAIASRPDYGDAYGNLGNAYLELNRLEEA 1005
Y + L+ L EA
Sbjct: 93 YSYGAIMDIKEPRFPFHAAECLLQKGELAEA 123



Score = 39.1 bits (91), Expect = 4e-05
Identities = 17/95 (17%), Positives = 29/95 (30%)

Query: 877 LSLSPNHPGILYAFAMVRQNQGMSEEAMVLLRRAIENKPQHLDAHFALGNLLYTAGKDIE 936
+S + LY+ A + G E+A + + LG G+
Sbjct: 29 NEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDL 88

Query: 937 AAKCYLKVLEFSPEHAETHNNIANVLLRQGHRERA 971
A Y + + A LL++G A
Sbjct: 89 AIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEA 123



Score = 36.8 bits (85), Expect = 2e-04
Identities = 26/121 (21%), Positives = 43/121 (35%), Gaps = 6/121 (4%)

Query: 150 DDADAHQTLGFALQRLGQFERAMSHHEAALAARPQFAAAAASLGDACRQ-LGRHAEAIAH 208
D + +L F + G++E A +A + LG ACRQ +G++ AI
Sbjct: 34 DTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLG-ACRQAMGQYDLAIHS 92

Query: 209 YERALTLQPNAPAVLLNIGGCQQAIGQTEAAVRTYQRALVLSPHLAEAHYNLGNLHLEMN 268
Y + P + C G+ A + L L+ L L ++
Sbjct: 93 YSYGAIMDIKEPRFPFHAAECLLQKGELAEA----ESGLFLAQELIADKTEFKELSTRVS 148

Query: 269 S 269
S
Sbjct: 149 S 149



Score = 34.9 bits (80), Expect = 0.001
Identities = 18/109 (16%), Positives = 34/109 (31%)

Query: 944 VLEFSPEHAETHNNIANVLLRQGHRERAIEHYKRAIASRPDYGDAYGNLGNAYLELNRLE 1003
+ E S + E ++A + G E A + ++ + LG + + +
Sbjct: 28 LNEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYD 87

Query: 1004 EAIEQNLLALKLKPERFGSYNNLGVAYQALGRFEEATAAFEKALELAPD 1052
AI + + + G EA + A EL D
Sbjct: 88 LAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQELIAD 136



Score = 33.7 bits (77), Expect = 0.002
Identities = 22/116 (18%), Positives = 38/116 (32%), Gaps = 4/116 (3%)

Query: 285 PDFPEAHNNLANALQSRGRHEEALAHYDEALRRRPSYAIAHRNRADTLRNMKRFDEAIAG 344
D E +LA G++E+A + + + M ++D AI
Sbjct: 33 SDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHS 92

Query: 345 YHDALALEPADTTTLNHLAGVLMIVGRLDEAEQAYRSALAINPRNIGVHLNYGVVK 400
Y ++ + H A L+ G L EAE A + I + +
Sbjct: 93 YSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQEL----IADKTEFKELS 144



Score = 32.6 bits (74), Expect = 0.005
Identities = 18/98 (18%), Positives = 30/98 (30%)

Query: 124 LGSAFTNLGDPAGAVRHLELALAADADDADAHQTLGFALQRLGQFERAMSHHEAALAARP 183
L G A + + D D+ LG Q +GQ++ A+ +
Sbjct: 42 LAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDI 101

Query: 184 QFAAAAASLGDACRQLGRHAEAIAHYERALTLQPNAPA 221
+ + Q G AEA + A L +
Sbjct: 102 KEPRFPFHAAECLLQKGELAEAESGLFLAQELIADKTE 139



Score = 32.6 bits (74), Expect = 0.006
Identities = 12/80 (15%), Positives = 28/80 (35%)

Query: 983 PDYGDAYGNLGNAYLELNRLEEAIEQNLLALKLKPERFGSYNNLGVAYQALGRFEEATAA 1042
D + +L + + E+A + L + LG QA+G+++ A +
Sbjct: 33 SDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHS 92

Query: 1043 FEKALELAPDDAPIHLNLAN 1062
+ + + + A
Sbjct: 93 YSYGAIMDIKEPRFPFHAAE 112



Score = 31.1 bits (70), Expect = 0.016
Identities = 18/110 (16%), Positives = 32/110 (29%)

Query: 242 TYQRALVLSPHLAEAHYNLGNLHLEMNSWPIAVFHYERAIAERPDFPEAHNNLANALQSR 301
T +S E Y+L + + A ++ L Q+
Sbjct: 24 TIAMLNEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAM 83

Query: 302 GRHEEALAHYDEALRRRPSYAIAHRNRADTLRNMKRFDEAIAGYHDALAL 351
G+++ A+ Y + A+ L EA +G A L
Sbjct: 84 GQYDLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQEL 133



Score = 31.1 bits (70), Expect = 0.020
Identities = 14/89 (15%), Positives = 23/89 (25%)

Query: 796 GDNNDAEAIFRLILAGQPRQFDALVGLGMICSGSSRLDEAKDCFQRAVAVNAKSAEAHGS 855
G DA +F+ + +GLG + D A + ++ K
Sbjct: 50 GKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFH 109

Query: 856 IGAVEASAGRYDAAVGHYETALSLSPNHP 884
G A A L +
Sbjct: 110 AAECLLQKGELAEAESGLFLAQELIADKT 138



Score = 30.7 bits (69), Expect = 0.021
Identities = 17/92 (18%), Positives = 36/92 (39%), Gaps = 3/92 (3%)

Query: 829 SSRLDEAKDCFQRAVAVNAKSAEAHGSIGAVEASAGRYDAAVGHYETALSLSPNHPGILY 888
S + ++A FQ ++ + +GA + G+YD A+ Y + P +
Sbjct: 49 SGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPF 108

Query: 889 AFA---MVRQNQGMSEEAMVLLRRAIENKPQH 917
A + + +E + L + I +K +
Sbjct: 109 HAAECLLQKGELAEAESGLFLAQELIADKTEF 140



Score = 30.7 bits (69), Expect = 0.024
Identities = 15/92 (16%), Positives = 25/92 (27%)

Query: 862 SAGRYDAAVGHYETALSLSPNHPGILYAFAMVRQNQGMSEEAMVLLRRAIENKPQHLDAH 921
+G+Y+ A ++ L RQ G + A+ +
Sbjct: 48 QSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFP 107

Query: 922 FALGNLLYTAGKDIEAAKCYLKVLEFSPEHAE 953
F L G+ EA E + E
Sbjct: 108 FHAAECLLQKGELAEAESGLFLAQELIADKTE 139


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_4346  VACJLIPOPROT  27  0.014  VacJ lipoprotein signature.
		>VACJLIPOPROT#VacJ lipoprotein signature.

Length = 251

Score = 27.2 bits (60), Expect = 0.014
Identities = 9/30 (30%), Positives = 14/30 (46%)

Query: 1 MRLDRIGMAFAASILAGAATSPAFAQSPYD 30
M+L +A ++L G A+S Q D
Sbjct: 1 MKLRLSALALGTTLLVGCASSGTDQQGRSD 30


108  BBta_4369  BBta_4383  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
BBta_4369013-0.433976phosphopantetheine adenylyltransferase
BBta_4370-113-0.112833hypothetical protein
BBta_4371113-0.684624DNA gyrase subunit A
BBta_43731019-0.108244single-stranded DNA-binding protein
BBta_43744190.182866outer-membrane protein
BBta_43754180.317655outer-membrane protein
BBta_43764190.338124outer-membrane protein
BBta_43773170.108604outer-membrane protein
BBta_43781150.774012outer-membrane protein
BBta_43791140.614175excinuclease ABC subunit A
BBta_43803131.077387hypothetical protein
BBta_43821130.757446hypothetical protein
BBta_43830141.209194phosphoglycolate phosphatase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_4369  LPSBIOSNTHSS  169  1e-56  Lipopolysaccharide core biosynthesis protein signat...
		>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein signature.

Length = 166

Score = 169 bits (429), Expect = 1e-56
Identities = 60/161 (37%), Positives = 104/161 (64%), Gaps = 5/161 (3%)

Query: 4 IALYPGSFDPVTNGHLDVVRQAVHLCDRLIVAVGVHHGKKPLFSTEERLAMVHEVLEPVA 63
A+YPGSFDP+T GHLD++ + L D++ VAV + K+P+FS +ERL + + + +
Sbjct: 2 NAIYPGSFDPITFGHLDIIERGCRLFDQVYVAVLRNPNKQPMFSVQERLEQIAKAIAHLP 61

Query: 64 AAAGCGFEASTYDDLTVTAAQKAGAIMMIRGLRDGTDFDYEMQLAGMNQTMVPGIQTVFV 123
+ +++ LTV A++ A ++RGLR +DF+ E+Q+A N+T+ ++TVF+
Sbjct: 62 -----NAQVDSFEGLTVNYARQRQAGAILRGLRVLSDFELELQMANTNKTLASDLETVFL 116

Query: 124 PASVAVRPIAATLVRQIAAMGGDVSHFVPAAVAASLKAKFN 164
S ++++LV+++A GG+V HFVP+ VAA+L +F+
Sbjct: 117 TTSTEYSFLSSSLVKEVARFGGNVEHFVPSHVAAALYDQFH 157


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_4374  OMPADOMAIN  38  3e-05  OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 37.6 bits (87), Expect = 3e-05
Identities = 49/223 (21%), Positives = 76/223 (34%), Gaps = 34/223 (15%)

Query: 22 AADLAARPYTKAPMAAPAPLPTWTGFYIGLQGGGGWGRSDETFFNAPNGFGFAGTQRYDI 81
A +A A +A AP +Y G GW + +T F NG T +
Sbjct: 5 AIAIAVALAGFATVAQAAPKDN--TWYTG--AKLGWSQYHDTGFINNNG----PTHENQL 56

Query: 82 NGGFAGGVIGYNWQVDNIVFGLEGDYHWADINGRSGVITAGLGDSYFTKLRGFGDIKGRL 141
G GG +QV N G E Y W GR G ++ K +G + +L
Sbjct: 57 GAGAFGG-----YQV-NPYVGFEMGYDWL---GRMPY--KGSVENGAYKAQG-VQLTAKL 104

Query: 142 GWAAGPAL-FFVSGGAAVGDLQHRYDNPAFSTIQNDWRWGWTIGAGAEYMFAPNWSAKVE 200
G+ L + G V + + + +D G EY P + ++E
Sbjct: 105 GYPITDDLDIYTRLGGMVWRADTKSNVYGKN---HDTGVSPVFAGGVEYAITPEIATRLE 161

Query: 201 YNYLDFGKSTLQYNNPLVASNRSEWSDTVHTVKAGISYHFGGP 243
Y Q+ N + ++ + G+SY FG
Sbjct: 162 Y----------QWTNNIGDAHTIGTRPDNGMLSLGVSYRFGQG 194


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_4376  OMPADOMAIN  37  5e-05  OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 37.2 bits (86), Expect = 5e-05
Identities = 50/262 (19%), Positives = 73/262 (27%), Gaps = 69/262 (26%)

Query: 1 MKSTLFVTAGVAALGLAPASAADFAARPYGKAPPPAYVTPLPSWAGFYLGANGGADWSRN 60
MK T +A A+ A A P +Y GA G +
Sbjct: 1 MKKTA---IAIAVALAGFATVAQAA----------------PKDNTWYTGAKLGWSQYHD 41

Query: 61 CWTLNRVNGVPVVPTQSEGCHN--ATSGLIGGQIGYRWQAASWVFGLEAQGNWTDLKSSN 118
+N H +G GG +Q + G E +W
Sbjct: 42 TGFINNNGP----------THENQLGAGAFGG-----YQVNPY-VGFEMGYDWL------ 79

Query: 119 ASSAAFAAGITNNTKTDAIGL-FTGQIGYAWGNVL-WYVKGGAAVAHNKYTGTANAAAPV 176
G N A G+ T ++GY + L Y + G V
Sbjct: 80 --GRMPYKGSVENGAYKAQGVQLTAKLGYPITDDLDIYTRLGGMVWRADTKSNVY----- 132

Query: 177 AVGTLLDSASETRWGGTVGTGVEFGFAPNWSVAVEYDHLFMGSRDITFPATAIVNARVDT 236
+T GVE+ P + +EY I +A
Sbjct: 133 ------GKNHDTGVSPVFAGGVEYAITPEIATRLEYQWT-----------NNIGDAHTIG 175

Query: 237 IKQDIDMATVRVNYRFGGPAAA 258
+ D M ++ V+YRFG AA
Sbjct: 176 TRPDNGMLSLGVSYRFGQGEAA 197


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_4377  OMPADOMAIN  43  4e-07  OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 43.0 bits (101), Expect = 4e-07
Identities = 49/245 (20%), Positives = 73/245 (29%), Gaps = 48/245 (19%)

Query: 1 MKKLLVITTALVGIAAAMPASAADLAARPYTKAPPVVVPILSWSGIYAGIQGGGGWGTSK 60
MKK I A+ A A AA + Y G + G
Sbjct: 1 MKKTA-IAIAVALAGFATVAQAAPKD-----------------NTWYTGAKLGWSQ---- 38

Query: 61 ETFIGRFNAPGFLGTQNYNTNGGFVGGVIGYNWQFDNLVVGLEGDYHWSDINGRSAVINA 120
++ GF+ G G +Q N VG E Y W GR +
Sbjct: 39 ------YHDTGFINNNGPTHENQLGAGAFG-GYQV-NPYVGFEMGYDWL---GRMPYKGS 87

Query: 121 GVGDTYFTKLTSFGDIKGRLGYAVGPAL-FFVSGGAAVGELQHRYDRAAGVFFGQNTTRW 179
Y + +LGY + L + G V R D + V+ + T
Sbjct: 88 VENGAYKAQGVQLT---AKLGYPITDDLDIYTRLGGMVW----RADTKSNVYGKNHDTGV 140

Query: 180 GYTVGAGAEYMFAPNWSAKLEYNYLDFGKSTLQYVGVPGRSEWKDSVHTVKAGLNYHFGG 239
G EY P + +LEY + + +G + + G++Y FG
Sbjct: 141 SPVFAGGVEYAITPEIATRLEYQWTN-------NIGDAHTIGTRPDNGMLSLGVSYRFGQ 193

Query: 240 PVIAK 244
A
Sbjct: 194 GEAAP 198


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_4378  OMPADOMAIN  31  0.002  OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 31.4 bits (71), Expect = 0.002
Identities = 45/221 (20%), Positives = 76/221 (34%), Gaps = 39/221 (17%)

Query: 1 MKKILLATVALAALAAPAAAADLAARPTYTKAPVLAPVQTWTGFYIGAF---GGYANEDA 57
MKK +A A A A A YT A + W+ ++ F G +E+
Sbjct: 1 MKKTAIAIAVALAGFATVAQAAPKDNTWYTGAKL-----GWSQYHDTGFINNNGPTHENQ 55

Query: 58 STAALKGGFA-----GGTVGYNWQQGPLVFGLEADAAWADINATVGIPGVFGLTDRIEST 112
A GG+ G +GY+W G + A+ + + +TD ++
Sbjct: 56 LGAGAFGGYQVNPYVGFEMGYDWLGRMPYKGSVENGAYKAQGVQLTAKLGYPITDDLDIY 115

Query: 113 GTVRGRIGYAFDTVLLYGTGGYAWGNNKLSATVGGVAGSETKFLSGWAAGAGVEWMFAPK 172
+ GG W + S G + + GVE+ P+
Sbjct: 116 TRL----------------GGMVWRADTKSNVYGKNHDTGVSPV----FAGGVEYAITPE 155

Query: 173 WSLKGEYLYKSLESSTYFGGAVPLGT-LNLHTFQVGVNYHF 212
+ + EY + + G A +GT + +GV+Y F
Sbjct: 156 IATRLEYQW-----TNNIGDAHTIGTRPDNGMLSLGVSYRF 191


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_4383  BCTERIALGSPF  28  0.034  Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 27.9 bits (62), Expect = 0.034
Identities = 22/111 (19%), Positives = 43/111 (38%), Gaps = 21/111 (18%)

Query: 3 YTLIIFDLDGTLADSFPWFRLHVNAVAARYGFRQVKDEDVERLRHASTREILDFLAVPSW 62
T ++ + + PW + + +A FR + + E+ R + R +L
Sbjct: 212 STRVLMGMSDAVRTFGPW--MLLALLAGFMAFRVMLRQ--EKRRVSFHRRLL-------- 259

Query: 63 KLPFI---------ARHMRRLKTAHAASIPLFAGVGPMLDTLAAHGHQLAL 104
LP I AR+ R L +A+++PL + D ++ + L
Sbjct: 260 HLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDVMSNDYARHRL 310


109  BBta_4516  BBta_4525  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
BBta_4516-2102.638255elongation factor Ts
BBta_4517-292.37557230S ribosomal protein S2
BBta_4518-292.474660hypothetical protein
BBta_4519-192.436259hypothetical protein
BBta_4520-1102.132667hypothetical protein
BBta_45210112.233022caspase-like domain-containing protein
BBta_4522-290.456237DNA polymerase III subunit alpha
BBta_4523012-0.045475outer-membrane protein
BBta_4524-210-0.816428hypothetical protein
BBta_4525-37-0.988137outer-membrane protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_4516  BCTERIALGSPF  28  0.049  Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 27.9 bits (62), Expect = 0.049
Identities = 23/85 (27%), Positives = 35/85 (41%), Gaps = 5/85 (5%)

Query: 131 RRAAALEVSEGVVSSYVHGAVIEGAGKLGVIVALESPGKTD--ELAALGRQLAMHVAAAN 188
R+A L G+V V + ++L + +LA L RQLA VAA+
Sbjct: 26 RQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLRRKIRLSTSDLALLTRQLATLVAASM 85

Query: 189 P--QAIDAAGLDPEVVKREKDVLAD 211
P +A+DA E ++A
Sbjct: 86 PLEEALDAVAKQSE-KPHLSQLMAA 109


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_4520  V8PROTEASE  55  3e-10  V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 54.6 bits (131), Expect = 3e-10
Identities = 32/184 (17%), Positives = 58/184 (31%), Gaps = 28/184 (15%)

Query: 63 AGRQTSTGSGFLVSADGLAITNYHVVSDAALEPKTYRLEYTGADGTQGG------VTLLA 116
A T SG +V +TN HVV +P + + + +
Sbjct: 97 APTGTFIASGVVV-GKDTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITK 155

Query: 117 VDLPNDLALVRVDKHD--------APFFTFDKAALEGSLPKGERLYSLGNPLDLGFTIIE 168
DLA+V+ ++ T A + + G P D +
Sbjct: 156 YSGEGDLAIVKFSPNEQNKHIGEVVKPATMSNNA---ETQVNQNITVTGYPGDKPVATM- 211

Query: 169 GTYNGLVEHSYNDHIHFTGALNPGMSGGPAVNAQGQVVGV---------NVATRRGGQLI 219
G + + + + + + G SG P N + +V+G+ N A +
Sbjct: 212 WESKGKITYLKGEAMQYDLSTTGGNSGSPVFNEKNEVIGIHWGGVPNEFNGAVFINENVR 271

Query: 220 SFLV 223
+FL
Sbjct: 272 NFLK 275


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_4521  cloacin  35  0.001  Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 34.7 bits (79), Expect = 0.001
Identities = 23/66 (34%), Positives = 29/66 (43%), Gaps = 4/66 (6%)

Query: 535 GAGAGPGAGLGAGAGPGGHGPVPNSTQSGVAGTPPTLPGRPGPVAPAANALPMPGQGAPA 594
G+G+G G G+G G GG N G +GT L PVA AL PG G A
Sbjct: 49 GSGSGIHWGGGSGHGNGGG----NGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104

Query: 595 IPPTGN 600
+ +
Sbjct: 105 VSISAG 110


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_4523  OMPADOMAIN  54  1e-10  OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 53.8 bits (129), Expect = 1e-10
Identities = 47/240 (19%), Positives = 79/240 (32%), Gaps = 48/240 (20%)

Query: 1 MKKLLLT-TTALVVLASPALAADLAARPYTKAPPPVMAAIYDWSGFYIGVNGGGGWTHNT 59
MKK + AL A+ A AA + +Y G G H+T
Sbjct: 1 MKKTAIAIAVALAGFATVAQAAPKD------------------NTWYTGAKLGWSQYHDT 42

Query: 60 WDVVGGGREGSHDSSGGTIGGQVGYRWQMGQFVFGVEAQGNW---ADFAGDNQSALFGTR 116
+ G + G GG Y+ G E +W + G ++ + +
Sbjct: 43 GFINNNGPTHENQLGAGAFGG---YQVNPY---VGFEMGYDWLGRMPYKGSVENGAYKAQ 96

Query: 117 NRTKTDAFGLFTGQVGYAFNNVL-VYAKGGAAITSNTYTITNAATGAFLGSNDNTRWGGV 175
T ++GY + L +Y + G + G N +T V
Sbjct: 97 GVQ-------LTAKLGYPITDDLDIYTRLGGMVWRADTK------SNVYGKNHDTGVSPV 143

Query: 176 VGAGLEYGFAPNWSLGVEYDHLFMDRQTVSFGALGSDSIKQDADLFTARLNYRFGGPVAT 235
G+EY P + +EY + T + G + + D + + ++YRFG A
Sbjct: 144 FAGGVEYAITPEIATRLEY------QWTNNIGDAHTIGTRPDNGMLSLGVSYRFGQGEAA 197


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_4524  PRTACTNFAMLY  26  0.034  Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 26.2 bits (57), Expect = 0.034
Identities = 16/45 (35%), Positives = 20/45 (44%)

Query: 61 LACIANPYFAGVSDDPRATPYPYPNRQRIKPRPPRPNPYAPLPNP 105
LA N ++ V P P P P+PP+P P AP P P
Sbjct: 556 LAANGNGQWSLVGAKAPPAPKPAPQPGPQPPQPPQPQPEAPAPQP 600


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_4525  OMPADOMAIN  34  5e-04  OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 33.8 bits (77), Expect = 5e-04
Identities = 33/161 (20%), Positives = 52/161 (32%), Gaps = 27/161 (16%)

Query: 50 FYVGFNAGGGSSRDCWNLVQNANVVVNPALNREGCSGGGGAAVGGQVGYRYQMANWVFGV 109
+Y G G D +N G + G GY+ N G
Sbjct: 28 WYTGAKLGWSQYHD------------TGFINNNGPTHENQLGAGAFGGYQ---VNPYVGF 72

Query: 110 EAQGDWANFKGTNRNLILPNVSDRSRTNGFGLFTGQVGYAF-DNILGYIKGGAAVVGDRY 168
E DW R +V + + T ++GY D++ Y + G V R
Sbjct: 73 EMGYDW-----LGRMPYKGSVENGAYKAQGVQLTAKLGYPITDDLDIYTRLGGMVW--RA 125

Query: 169 DTFTTNTNVMLDRANTTRWGGTIGAGLEYGFAPNWSVGVEY 209
DT + + + + T G+EY P + +EY
Sbjct: 126 DT----KSNVYGKNHDTGVSPVFAGGVEYAITPEIATRLEY 162


110  BBta_4611  BBta_4618  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
BBta_4611013-0.108388hypothetical protein
BBta_46120140.099453hypothetical protein
BBta_46130110.108233acetamidase/formamidase family protein
BBta_4614213-0.015625recombinase
BBta_4615-113-0.805411hypothetical protein
BBta_4616-312-0.094241hypothetical protein
BBta_4617-1140.781673hypothetical protein
BBta_4618-1150.389851short-chain dehydrogenase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_4611  SYCDCHAPRONE  42  2e-06  Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 41.8 bits (98), Expect = 2e-06
Identities = 25/92 (27%), Positives = 38/92 (41%)

Query: 92 IDYLTSLGFTLKQMGRLDDALAVFDKAIQLKPDDAELWKHLGGVLLALDRGAEALLSYQH 151
++ L SL F Q G+ +DA VF L D+ + LG A+ + A+ SY +
Sbjct: 36 LEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSY 95

Query: 152 ALSIDPAHKEAAFQSGLLLHQQQRDAEAVEAF 183
+D F + L Q+ AEA
Sbjct: 96 GAIMDIKEPRFPFHAAECLLQKGELAEAESGL 127


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_4612  PF03309  31  0.003  Bvg accessory factor
		>PF03309#Bvg accessory factor

Length = 271

Score = 31.3 bits (71), Expect = 0.003
Identities = 14/49 (28%), Positives = 24/49 (48%), Gaps = 1/49 (2%)

Query: 83 IVLGGPVDPDLSIMYGMPVTVAGADRLIQAAALARRYPNARIVFTGGSA 131
+++ V + ++ P V GADR++ A +Y A IV GS+
Sbjct: 88 VLIEPGVRTGIPLLVDNPKEV-GADRIVNCLAAYHKYGTAAIVVDFGSS 135


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_4617  HTHTETR  27  0.030  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 26.9 bits (59), Expect = 0.030
Identities = 16/100 (16%), Positives = 33/100 (33%), Gaps = 9/100 (9%)

Query: 1 MPAERQTQPTWTEERL-----RALKQHFEAGLTCREIAAELGVSRNAV----IGKISRLA 51
M + + + T + + R Q + + EIA GV+R A+ K +
Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60

Query: 52 LTRDNGGDTRRVVRAENARDGARRPVPKLRRRILRAVSND 91
+ + E P+ LR ++ + +
Sbjct: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLEST 100


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_4618  DHBDHDRGNASE  56  3e-11  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 56.2 bits (135), Expect = 3e-11
Identities = 47/187 (25%), Positives = 80/187 (42%), Gaps = 12/187 (6%)

Query: 1 MGRELARQLVAEGCNVAMCDVSAEAMAETRRLCEAETLPQGLRVTTHVADVSIEAQLQRF 60
+G +AR L ++G ++A D + E + + + ADV A +
Sbjct: 20 IGEAVARTLASQGAHIAAVDYNPEKL----EKVVSSLKAEARHAEAFPADVRDSAAIDEI 75

Query: 61 RDELAEQQATDKIHLLFNNAGIGGGGSLFTNSREQWERTFNICWGGVYLGVRTFLPMLLK 120
+ + I +L N AG+ G + + S E+WE TF++ GV+ R+ ++
Sbjct: 76 TARIERE--MGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSKYMMD 133

Query: 121 AAEGHIVNTSSVNGFWASVGMGVSHTAYSAAKFAVKGFTEALINDLRLNAPHIKCSVVMP 180
G IV S M AY+++K A FT+ L L L +I+C++V P
Sbjct: 134 RRSGSIVTVGSNPAGVPRTSMA----AYASSKAAAVMFTKCL--GLELAEYNIRCNIVSP 187

Query: 181 GHIGTSI 187
G T +
Sbjct: 188 GSTETDM 194


111  BBta_4889  BBta_4896  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
BBta_4889-1112.207828ABC transporter ATP-binding protein
BBta_4890-2112.020491ABC transporter permease
BBta_4891-2102.019272*hypothetical protein
BBta_4892-2112.463776lipoate-protein ligase B
BBta_4893-1102.203739hypothetical protein
BBta_4894-2102.241637LysR family transcriptional regulator
BBta_4895-192.034136AraC family transcriptional regulator
BBta_48960102.240534hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_4889  PF05272  29  0.016  Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.3 bits (65), Expect = 0.016
Identities = 9/20 (45%), Positives = 14/20 (70%)

Query: 37 LVLLGPSGCGKSTILKSIAG 56
+VL G G GKST++ ++ G
Sbjct: 599 VVLEGTGGIGKSTLINTLVG 618


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_4891  FLGMOTORFLIN  51  7e-12  Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 51.4 bits (123), Expect = 7e-12
Identities = 25/87 (28%), Positives = 45/87 (51%), Gaps = 7/87 (8%)

Query: 11 RDFGGPAVSTLDK-------VTVDLMVVLGTCSMPIHQVLRLSRGAIIELDATEADDVKV 63
+ GG VS + + V L V LG M I ++LRL++G+++ LD + + +
Sbjct: 40 QQLGGGDVSGAMQDIDLIMDIPVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDI 99

Query: 64 LANNLPIANGVVLVDRNRIAVEVKEML 90
L N IA G V+V ++ V + +++
Sbjct: 100 LINGYLIAQGEVVVVADKYGVRITDII 126


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_4895  ISCHRISMTASE  32  0.005  Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 31.5 bits (71), Expect = 0.005
Identities = 20/73 (27%), Positives = 28/73 (38%), Gaps = 6/73 (8%)

Query: 96 ADFIRVQIAMTGRAVS--CARGEATDVTDQQIAVAPAGVPWQMACQGGHRRLTLRLEPQA 153
ADF + M + CA TD Q+ APA V A G T
Sbjct: 179 ADFSLEKHQMALEYAAGRCAFTVMTDSLLDQLQNAPADVQKTSANTGKKNVFTCE----N 234

Query: 154 LRQRLAALVGVQP 166
+R+++A L+ P
Sbjct: 235 IRKQIAELLQETP 247


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BBta_4896  NUCEPIMERASE  44  1e-06  Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 43.6 bits (103), Expect = 1e-06
Identities = 26/148 (17%), Positives = 45/148 (30%), Gaps = 29/148 (19%)

Query: 183 TVLVTGATGFVGTRLVAGLAASGHHVIAL---------VRDPAKARALPPPLTLITSLD- 232
LVTGA GF+G + L +GH V+ + A+ L P +D
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 233 -------QIASDTRIDAIVNLAGEPIGNAAWTAAKREKILQSRLATTEAVVAL--IARLA 283
+ + + + A R + + I
Sbjct: 62 ADREGMTDLFASGHFERVFISPHR--------LAVRYSLENPHAYADSNLTGFLNILEGC 113

Query: 284 RKPQV--LVNGSAIGWYGLWQDQPLTES 309
R ++ L+ S+ YGL + P +
Sbjct: 114 RHNKIQHLLYASSSSVYGLNRKMPFSTD 141


112  BBta_4902  BBta_4914  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias    %GCBias  Product
BBta_4902       1       10   1.245169  acyl-CoA carboxylase biotin-carrying subunit
BBta_4903       1        9   1.666742  hypothetical protein
BBta_4904       1        9   1.932325  hypothetical protein
BBta_4905       2       10   2.482838  PAS/PAC sensor hybrid histidine kinase
BBta_4906       2       10   2.853460  two-component system sensor protein
BBta_4907       1        9   2.224150  response regulator in two-component reguatory
BBta_4908      -2       12   1.452202  hypothetical protein
BBta_4909      -3       12   1.239376  hypothetical protein
BBta_4910      -3       10   1.438044  major facilitator transporter
BBta_4911      -2       12   0.728841  arsenite permease
BBta_4912      -1       12   0.187054  hypothetical protein
BBta_4913      -1       11   1.877664  hypothetical protein
BBta_4914      -1        8   2.002287  serine protease
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
BBta_4902  RTXTOXIND  31     0.022    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 30.6 bits (69), Expect = 0.022
Identities = 19/87 (21%), Positives = 33/87 (37%), Gaps = 7/87 (8%)

Query: 555 AEIAVQTRPIPNGVRLAHQGVEAPVYVYTEAEAAAARLMPVVTAG-----DSGKKLLCPM 609
A + + P+ RL + V + ++ V TA K + P+
Sbjct: 44 AHLELIETPVSRRPRLVAYFIMG-FLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPI 102

Query: 610 PGLVVS-IAVTEGQEVKAGETLAVIEA 635
+V I V EG+ V+ G+ L + A
Sbjct: 103 ENSIVKEIIVKEGESVRKGDVLLKLTA 129


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
BBta_4905  HTHFIS  70     1e-14    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 70.2 bits (172), Expect = 1e-14
Identities = 32/119 (26%), Positives = 58/119 (48%), Gaps = 5/119 (4%)

Query: 701 RVLVVEDNDEVGQFSTELLEDLGYVTRRVANAHDALAILETDEFAVDLVFSDVIMPGING 760
+LV +D+ + + L GY R +NA + + DLV +DV+MP N
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDG--DLVVTDVVMPDENA 62

Query: 761 LELAGIIRDRYPGLPVVLTSGYSHVLA--ENAHHG-FELIKKPYSVESLSRILRKAMAE 816
+L I+ P LPV++ S + + + + G ++ + KP+ + L I+ +A+AE
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BBta_4906  PF06580  39     3e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 39.1 bits (91), Expect = 3e-05
Identities = 20/85 (23%), Positives = 36/85 (42%), Gaps = 13/85 (15%)

Query: 394 LQRALDNLILNAIQNTPAGGEIVVAAEIHGGRLLFRVTDSGPGVDAGIRDRLFEPFVSTR 453
+Q ++N I + I P GG+I++ G + V ++G +
Sbjct: 260 VQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTK----------- 308

Query: 454 ADGTGLGLAIVRE-IARAHQGEARL 477
+ TG GL VRE + + EA++
Sbjct: 309 -ESTGTGLQNVRERLQMLYGTEAQI 332


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
BBta_4907  HTHFIS  473    e-167    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 473 bits (1219), Expect = e-167
Identities = 178/472 (37%), Positives = 257/472 (54%), Gaps = 39/472 (8%)

Query: 2 ATILIVDDDAALRDGLAETVTDLGHRAVTAASGREAIGMLSSDT-DAVLLDLRMPGIDGI 60
ATIL+ DDDAA+R L + ++ G+ ++ +++ D V+ D+ MP +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 61 ETLRRIRAGGDAPPVIVLTAYASPENTIEAMRLGAFDHLLKPIGRDALRELIER---LPP 117
+ L RI+ PV+V++A + I+A GA+D+L KP L +I R P
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 118 RRRQRRDSGRVGDSGLIGTSEAMRRVQKTIGLAADADATVLIRGETGTGKELVARALHVH 177
RR + + L+G S AM+ + + + D T++I GE+GTGKELVARALH +
Sbjct: 124 RRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDY 183

Query: 178 SRRSTAPFVALNCAAIPQDLLESELFGHIKGSFTGAASDRSGAFRDAGAGTLFLDEIGDM 237
+R PFVA+N AAIP+DL+ESELFGH KG+FTGA + +G F A GTLFLDEIGDM
Sbjct: 184 GKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDM 243

Query: 238 PLSMQAKILRALQERMITPVGG-KPIKVAARVVAATHRDLARQVASGAFREDLYYRLNVI 296
P+ Q ++LR LQ+ T VGG PI+ R+VAAT++DL + + G FREDLYYRLNV+
Sbjct: 244 PMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVV 303

Query: 297 PIELPPLRERTQDILPLADHFLAQ-CSDGRPQPQLSDRAIDKLVHAPWPGNVRELRNAIQ 355
P+ LPPLR+R +DI L HF+ Q +G + A++ + PWPGNVREL N ++
Sbjct: 304 PLRLPPLRDRAEDIPDLVRHFVQQAEKEGLDVKRFDQEALELMKAHPWPGNVRELENLVR 363

Query: 356 RACVLTRGALIDADDI------------------DIGAVPANDEAE-------------- 383
R L +I + I G++ + E
Sbjct: 364 RLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDAL 423

Query: 384 -PATDLPAAVARLEEGMIRRALAACGGNRTEAARQLNINRQLLYTKMQRYGL 434
P+ +A +E +I AL A GN+ +AA L +NR L K++ G+
Sbjct: 424 PPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGV 475


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
BBta_4909  TONBPROTEIN  28     0.017    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 28.4 bits (63), Expect = 0.017
Identities = 14/33 (42%), Positives = 15/33 (45%), Gaps = 2/33 (6%)

Query: 188 KPGPKHEHGPKPKHGPKDGPKHGPKPKHGPKHD 220
K P PKPK PK PK K + PK D
Sbjct: 83 KEAPVVIEKPKPKPKPK--PKPVKKVQEQPKRD 113


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BBta_4910  TCRTETB  38     5e-05    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 38.3 bits (89), Expect = 5e-05
Identities = 34/161 (21%), Positives = 68/161 (42%), Gaps = 3/161 (1%)

Query: 226 LLIFAICVALFHLANAAMLPLVGQKLALQDKNLGTSLMSACIAAAQLVMVPMALLVGARA 285
+LI+ ++ F + N +L + +A D N + + A L + G +
Sbjct: 15 ILIWLCILSFFSVLNEMVLNVSLPDIA-NDFNKPPASTNWVNTAFMLTFSIGTAVYGKLS 73

Query: 286 DRWGHKRFFLAALLILPLRAALYTLSDDKA-WLVGVQLLDGVGAGIFGAIMPVIVADLMR 344
D+ G KR L ++I + + + L+ + + G GA F A++ V+VA +
Sbjct: 74 DQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIP 133

Query: 345 GTGRFNVAQGAVITAQSIGAALSTALAGLVVVEAGYSAAFL 385
R A G + + ++G + A+ G++ +S L
Sbjct: 134 KENR-GKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLL 173



Score = 29.1 bits (65), Expect = 0.033
Identities = 33/159 (20%), Positives = 58/159 (36%), Gaps = 21/159 (13%)

Query: 43 WDEAAIGLVMSLATIAGIVAQTPAGAMVDATRAKRLIMIAAALIVTAASLLLPLMASFWP 102
W A L S+ T G + D KRL++ + S++ + SF+
Sbjct: 53 WVNTAFMLTFSIGTA-------VYGKLSDQLGIKRLLLFGIIINC-FGSVIGFVGHSFFS 104

Query: 103 VAISQSIAHAAGVIFAPAIAAVSLGIFGVSAFTARIGRNETFN------HAGNAVGAVIA 156
+ I AG A A A+ + + V+ + + R + F G VG I
Sbjct: 105 LLIMARFIQGAG---AAAFPALVMVV--VARYIPKENRGKAFGLIGSIVAMGEGVGPAIG 159

Query: 157 GAAAYALGPSAVFYLMALMSVG--SLISVLAIPARAIDH 193
G A+ + S + + + + L+ +L R H
Sbjct: 160 GMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVRIKGH 198


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
BBta_4914  V8PROTEASE  68     5e-15    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 68.5 bits (167), Expect = 5e-15
Identities = 32/166 (19%), Positives = 60/166 (36%), Gaps = 33/166 (19%)

Query: 90 SGTGSGFVWDDLGHVVTNYHVIEGATEALVSLT------------DGRSFRAALVGANPE 137
+ SG V ++TN HV++ +L +G + + E
Sbjct: 101 TFIASGVVVGK-DTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGE 159

Query: 138 NDLAVLLIGVGTDRP------KPLPIGTSADLKVGQKVFAIGNPFGLS-STL--TTGIVS 188
DLA++ KP + +A+ +V Q + G P +T+ + G ++
Sbjct: 160 GDLAIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQNITVTGYPGDKPVATMWESKGKIT 219

Query: 189 ALNRNLQVTQERTLNGLIQTDAAINPGNSGGPLLDSAGRLIGVNTA 234
L +Q D + GNSG P+ + +IG++
Sbjct: 220 YLKGEA-----------MQYDLSTTGGNSGSPVFNEKNEVIGIHWG 254


113  BBta_5196  BBta_5202  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias    %GCBias  Product
BBta_5196       0       21  -1.196540  autoinducer synthesis protein
BBta_5197       0       18  -0.320421  hypothetical protein
BBta_5198       0       17   0.009142  outer efflux pump domain-containing protein
BBta_5199       0       13  -0.331908  ABC transporter permease
BBta_5200      -1       11   0.710523  ABC transporter ATP-binding protein
BBta_5201       0       11   1.357063  hypothetical protein
BBta_5202      -2       10   0.804988  inner membrane transport protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_5196  AUTOINDCRSYN  124    7e-38    Autoinducer synthesis protein signature.
		>AUTOINDCRSYN#Autoinducer synthesis protein signature.

Length = 216

Score = 124 bits (313), Expect = 7e-38
Identities = 43/167 (25%), Positives = 66/167 (39%), Gaps = 6/167 (3%)

Query: 1 MIEILTRADEPRVPAWFDQMFRGRAKVFHERLRWRVVVRDGREMDRYDEHERTIYLMAID 60
M+EI ++F R + F +RL W V DG E D+YD + T YL I
Sbjct: 1 MLEIFDVNHTLLSETKSGELFTLRKETFKDRLNWAVQCTDGMEFDQYD-NNNTTYLFGIK 59

Query: 61 SAGQVVGSLRLLRAVGETMLDNEFRDFFSPPIVIRSAEILECTRFCVHGDQTSSKPN--- 117
V+ SLR + M+ F +F I LE +RF V +
Sbjct: 60 D-NTVICSLRFIETKYPNMITGTFFPYFKEIN-IPEGNYLESSRFFVDKSRAKDILGNEY 117

Query: 118 AVSSELMIGLCEFALANGIREVIGLYTSGMTRIYRRVGWEPREIAVA 164
+SS L + + ++ G + + + M I +R GW R +
Sbjct: 118 PISSMLFLSMINYSKDKGYDGIYTIVSHPMLTILKRSGWGIRVVEQG 164


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
BBta_5197  RTXTOXIND  82     6e-19    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 81.8 bits (202), Expect = 6e-19
Identities = 47/306 (15%), Positives = 99/306 (32%), Gaps = 59/306 (19%)

Query: 79 DTVAERDVLIVMNAEDIQSRGREARSATLKAAIALDALTHWEAGPEVTRARRGVEAARTV 138
V+E +VL + + Q + + + + + E + +
Sbjct: 177 QNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLD-------KKRAERLTVLARINRYENL 229

Query: 139 LARLERKVNDTRQLYDRGIVSRVEFEAAEQERDNQAFTVATAEQDLTATLARGGADSRRL 198
+ +++D L + +++ E + A +L ++ +
Sbjct: 230 SRVEKSRLDDFSSLLHKQAIAKHAVLEQENKY-------VEAVNELRVYKSQLEQIESEI 282

Query: 199 AELELE-----------------NAKARLADVQRQADG-------TIVRAPVAGVIVKPP 234
+ E + + + +++RAPV+ + +
Sbjct: 283 LSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQ-L 341

Query: 235 TVNASGAVPQTVETGARVQRGQTLFSVA-DLSSLIVDGKVDEIDVNQIRIGQSVAISSDA 293
V+ G V V +TL + + +L V V D+ I +GQ+ I +A
Sbjct: 342 KVHTEGGV---------VTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEA 392

Query: 294 FPG---LPLRGHVVGVSSEATQDSANQTAMFNVRVLID------DTKKDDIRIGMSARMT 344
FP L G V ++ +A +D FNV + I+ K + GM+
Sbjct: 393 FPYTRYGYLVGKVKNINLDAIEDQRLGLV-FNVIISIEENCLSTGNKNIPLSSGMAVTAE 451

Query: 345 IELDAR 350
I+ R
Sbjct: 452 IKTGMR 457


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
BBta_5201  OMPADOMAIN  26     0.038    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 26.0 bits (57), Expect = 0.038
Identities = 11/23 (47%), Positives = 16/23 (69%)

Query: 1 MKKTILSLAAALFCTSALAQNAP 23
MKKT +++A AL + +AQ AP
Sbjct: 1 MKKTAIAIAVALAGFATVAQAAP 23


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BBta_5202  TCRTETA  37     1e-04    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 36.7 bits (85), Expect = 1e-04
Identities = 66/344 (19%), Positives = 120/344 (34%), Gaps = 12/344 (3%)

Query: 48 GLLITFGAVVLCICSPLTAWLTSRFDRRLLLAATLLVLTLGNLGSAFVPDYAGLLALRLL 107
G+L+ A++ C+P+ L+ RF RR +L +L + A P L R++
Sbjct: 46 GILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIV 105

Query: 108 MLAVGALYTPQAAGTAALISPVETRGSTIAYVFLGWSLAAAVGLPLITFIGSHSGWRAAY 167
GA A A I+ + R ++ + G L +G S A +
Sbjct: 106 AGITGATG-AVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSP-HAPF 163

Query: 168 LAVGAAGAISLLLLTWRLP-------SGLKGAPVALKTWTELARNRMVVLLLLIT--TLQ 218
A A ++ L + LP L+ + AR VV L+ +Q
Sbjct: 164 FAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQ 223

Query: 219 MSGQFVIFTFMGPLLNRLTGADANQVGLVFAIYGACGFLGVVIATRIVDGWGAYRTSLLF 278
+ GQ ++ +R DA +G+ A +G L + T V R +L+
Sbjct: 224 LVGQVPAALWVIFGEDRF-HWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALML 282

Query: 279 SCLLFAGIAVWTLGAGSYSLMAVAVAIWGLGFASTNSMQQVRLVAAAPPLASASVSLNTS 338
+ + A + + + G ++Q + +
Sbjct: 283 GMIADGTGYILLAFATRGWMAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAA 342

Query: 339 VLYIGQAIGSALGGSLFARGWLDATGYAAMGFAMLALLAILASR 382
+ + +G L +++A G+A + A L LL + A R
Sbjct: 343 LTSLTSIVGPLLFTAIYAASITTWNGWAWIAGAALYLLCLPALR 386


114  BBta_5344  BBta_5357  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias    %GCBias  Product
BBta_5344      -1       11   1.385854  hydrolase
BBta_5345       1       12   2.172073  TetR family transcriptional regulator
BBta_5346       1       11   1.959244  hypothetical protein
BBta_5347       1       11   1.472555  hypothetical protein
BBta_5348       0       13   2.111657  hypothetical protein
BBta_5349      -2       11   0.860296  two-component sensor histidine kinase
BBta_5350      -2       11  -0.641926  two-component response regulator
BBta_5351      -2       12  -0.100342  TetR family transcriptional regulator
BBta_5352      -2       13   0.208678  hypothetical protein
BBta_5353      -1       13   0.044846  adenylosuccinate lyase
BBta_5354      -2       12   0.547048  hypothetical protein
BBta_5356       0       12   1.603167  cytochrome B561
BBta_5357       0       12   2.057029  cell division protein FtsH
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_5344  TYPE3OMBPROT  28     0.039    Type III secretion system outer membrane B protein ...
		>TYPE3OMBPROT#Type III secretion system outer membrane B protein family signature.

Length = 538

Score = 28.5 bits (63), Expect = 0.039
Identities = 18/69 (26%), Positives = 28/69 (40%), Gaps = 7/69 (10%)

Query: 207 ELSADPKKIDEATRRHYAKLYARPGNMHYAFEQFAAFNQDAKDNKVFAEKKLPMPILALG 266
+ S K+ +R ++ + GNM + N NKV KKLP+ L L
Sbjct: 465 QFSQLNSKLSSEEKRLFSTILMNSGNM-----EIQEMNTGVPGNKVM--KKLPLSSLELS 517

Query: 267 AEKSFGDQE 275
+ GD +
Sbjct: 518 YSERIGDSK 526


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BBta_5345  HTHTETR  49     1e-09    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 49.2 bits (117), Expect = 1e-09
Identities = 35/172 (20%), Positives = 66/172 (38%), Gaps = 3/172 (1%)

Query: 2 PTGRPREFDTERALELATSLFWQKGYEGTSLSDLTETLGITRPSLYAAFGNKEALFRLVL 61
T + + + L++A LF Q+G TSL ++ + G+TR ++Y F +K LF +
Sbjct: 4 KTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIW 63

Query: 62 QRYEAKAGAY--RTKALKAPTALEVARQLLEGAAELHGDKANPVGCLGVHGALA-CSDEA 118
+ E+ G +A L V R++L E + + + E
Sbjct: 64 ELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEM 123

Query: 119 AMIRRDMSAHRIAGEAAIRRRLTRAKAEGDLPPNASPSDLARYLSVVIYGMA 170
A++++ + I + L LP + A + I G+
Sbjct: 124 AVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLM 175


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BBta_5349  PF06580  45     3e-07    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 45.2 bits (107), Expect = 3e-07
Identities = 32/173 (18%), Positives = 68/173 (39%), Gaps = 22/173 (12%)

Query: 289 MAQDARMITQSVTHLMACLRGTLSRLRR---PLAEELGLEAGLVALTQNYQHAARPAIRL 345
+ +D + +T L +R +L LA+EL + + L + Q R
Sbjct: 186 ILEDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQL-ASIQFEDRLQFEN 244

Query: 346 DLHGDLADIQGPVAVTAYRMA-QECLTNALRH-----TDASEVSLRVERRPGPDSALLIS 399
++ + D+Q P M Q + N ++H ++ L+ + + + +
Sbjct: 245 QINPAIMDVQVP------PMLVQTLVENGIKHGIAQLPQGGKILLKGTKD---NGTVTLE 295

Query: 400 IEDDGGGDATQLAQSSGFGLTGIRERIAAVGG---SLSIAPAARGLSVSATIP 449
+E+ G +S+G GL +RER+ + G + ++ ++ IP
Sbjct: 296 VENTGSLALKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
BBta_5350  HTHFIS  76     3e-18    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 75.6 bits (186), Expect = 3e-18
Identities = 40/182 (21%), Positives = 69/182 (37%), Gaps = 10/182 (5%)

Query: 5 TAISILLVDDHPVVRQGYRRVLESQDGFRVIAEADSAAAAYAAFKVHAPDVVVLDISMKG 64
T +IL+ DD +R + L + G+ V +AA + D+VV D+ M
Sbjct: 2 TGATILVADDDAAIRTVLNQALS-RAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPD 59

Query: 65 ASGLEAIRNIRARDNRACILVFSMHGEAPLVKAAFAAGASGFVTKSSEPSALV----RAI 120
+ + + I+ +LV S A GA ++ K + + L+ RA+
Sbjct: 60 ENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119

Query: 121 RMVISGERALSDDVAHVLAADSLDP-MQTVL---DRLGEREIEILRQLASGLTTEQIATN 176
L DD + MQ + RL + ++ ++ SG E +A
Sbjct: 120 AEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARA 179

Query: 177 LH 178
LH
Sbjct: 180 LH 181


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BBta_5351  HTHTETR  61     1e-13    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 60.8 bits (147), Expect = 1e-13
Identities = 24/198 (12%), Positives = 61/198 (30%), Gaps = 14/198 (7%)

Query: 8 TSERILEVTEDVLRRYGLAKATVVDVARALDVSHGSVYRHFPSKASLREAVAKRWLERVS 67
T + IL+V + + G++ ++ ++A+A V+ G++Y HF K+ L + + +
Sbjct: 12 TRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIG 71

Query: 68 KPLAQIAAEPGPAPAKLERWLRELFGAKHKRVCEDPEMFATYLTLAREA---------CS 118
+ + A+ P + LRE+ + + + +
Sbjct: 72 ELELEYQAKFPGDPLSV---LREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQ 128

Query: 119 VVKAHKQDLIAQVEGIIAEGVAQGAF-EVADPKTAAAAIFDATRSFHHPAHADEWGECTC 177
+ + ++E + + + AA +
Sbjct: 129 AQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAP-QSFDL 187

Query: 178 SARVDAVLTLLLRGLQAR 195
+ +LL
Sbjct: 188 KKEARDYVAILLEMYLLC 205


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BBta_5357  PF05272  32     0.009    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 32.0 bits (72), Expect = 0.009
Identities = 18/72 (25%), Positives = 27/72 (37%), Gaps = 14/72 (19%)

Query: 195 VLLVGPPGTGKTLLAKAVAGEAGVPFFSISGSEFVEMFVGVGAARVRDLFEQARAKAPAI 254
V+L G G GK+ L + G FFS + + +D +EQ
Sbjct: 599 VVLEGTGGIGKSTLINTLVGL---DFFSDTHFDIGTG---------KDSYEQIAGI--VA 644

Query: 255 IFIDELDALGRA 266
+ E+ A RA
Sbjct: 645 YELSEMTAFRRA 656


115  BBta_5491  BBta_5504  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias    %GCBias  Product
BBta_5491       0       15  -0.557711  cell cycle control histidine kinase CckA
BBta_5492       1       15  -2.014358  flagellar biosynthesis protein FlhB
BBta_5493      -1       14   1.347721  flagellar biosynthesis protein FliR
BBta_5494      -1       17   1.444683  flagellar biosynthesis protein FliQ
BBta_5495      -2       12   2.001172  flagellar hook-basal body protein FliE
BBta_5496      -2       10   0.790039  flagellar basal body rod protein FlgC
BBta_5497      -2       11   1.293770  flagellar basal body rod protein FlgB
BBta_5498      -1       13   2.054982  hypothetical protein
BBta_5499      -1       14   1.105594  flagellar biosynthesis protein FliP
BBta_5500      -1       13   1.237272  prolyl aminopeptidase
BBta_5502      -1       13   0.733335  alcohol dehydrogenase
BBta_5503       2       12   1.106901  hypothetical protein
BBta_5504       1       12   0.789437  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
BBta_5491  HTHFIS  82     3e-18    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 82.2 bits (203), Expect = 3e-18
Identities = 35/116 (30%), Positives = 59/116 (50%), Gaps = 4/116 (3%)

Query: 772 GTILLVEDEEGLRALNARGLRSRGYSVIEASNGIEAMEALEECNGALDLVVSDVVMPEMD 831
TIL+ +D+ +R + + L GY V SN + G DLVV+DVVMP+ +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAA--GDGDLVVTDVVMPDEN 61

Query: 832 GPTLLKVMREKNPDIKIIFVSGYAE-DAFEKSLPENQQFAFLPKPFTLSQLVAAVK 886
LL +++ PD+ ++ +S K+ E + +LPKPF L++L+ +
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKAS-EKGAYDYLPKPFDLTELIGIIG 116


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_5492  TYPE3IMSPROT  319    e-110    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 319 bits (818), Expect = e-110
Identities = 102/347 (29%), Positives = 181/347 (52%), Gaps = 11/347 (3%)

Query: 7 SEDKTEDPTQKRLDQALERGDVVKSQELNTWFVIAA---ATLVMSTFSGSIGTAVLVPMR 63
S +KTE PT K++ A ++G V KS+E+ + +I A + +S + + +++
Sbjct: 2 SGEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIPA 61

Query: 64 NLIANSWMIHTDGAGLLALARSLSFAVIAAIGVPIL-MVMLAAIAGNMMQHRLVWSGEPL 122
+ + L+ + P+L + L AIA +++Q+ + SGE +
Sbjct: 62 EQS------YLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAI 115

Query: 123 KPNFGKISPLAGAKRLFGKQAAANFAKGIFKLVLLGTVMVMILWPERLRLESLLHMDVSE 182
KP+ KI+P+ GAKR+F ++ F K I K+VLL ++ +I+ + L L +
Sbjct: 116 KPDIKKINPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIEC 175

Query: 183 LLGVTISLTKHLMGSVVALLALVAIGDYLFQYRQWYERQKMSVQEMKEEFKQSEGDPHVK 242
+ + + + LM +++I DY F+Y Q+ + KMS E+K E+K+ EG P +K
Sbjct: 176 ITPLLGQILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIK 235

Query: 243 GRIRQIRQQRMKKRMMAAVPKASVIITNPTHYSVALAYERG-MSAPVCVAKGVDNIAFKI 301
+ RQ Q+ + M V ++SV++ NPTH ++ + Y+RG P+ K D +
Sbjct: 236 SKRRQFHQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTV 295

Query: 302 REIAKAHDIPIVENVPLARALYATVEIDAEIPVEHYHAVAEIISYVM 348
R+IA+ +PI++ +PLARALY +D IP E A AE++ ++
Sbjct: 296 RKIAEEEGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLE 342


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_5493  TYPE3IMRPROT  123    2e-36    Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature.

Length = 261

Score = 123 bits (311), Expect = 2e-36
Identities = 61/239 (25%), Positives = 113/239 (47%), Gaps = 2/239 (0%)

Query: 11 LAAVFMLAFARIGAMVMLLPGLGEVNIPVRIKLATALLLTMIVLPLHRQAYQVDLQALSP 70
++ R+ A++ P L E ++P R+KL A+++T + P +
Sbjct: 12 WLNLYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPANDVPVFSFFAL 71

Query: 71 LLVMMVHEIVIGIVLGATARVTLSALQVAGSVIAQQMGLGFVTAVDPTQGQQGVLIGNFL 130
L + +I+IGI LG T + +A++ AG +I QMGL F T VDP ++ +
Sbjct: 72 WLAV--QQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHLNMPVLARIM 129

Query: 131 TMLGVTLLFSTDSHHLVIAALSDSYKIFAPGEVIPSGDVASLATRAFAAAFKIGLQLSAP 190
ML + L + + H +I+ L D++ G + + T+A + F GL L+ P
Sbjct: 130 DMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLIFLNGLMLALP 189

Query: 191 FLVFGLVFNIGLGVLARLMPQMQVYFVGAPLSILIGFLIFGLVLAAMMGTFLGYFEGVI 249
+ L N+ LG+L R+ PQ+ ++ +G PL++ +G + ++ + F +
Sbjct: 190 LITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFCEHLFSEIF 248


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_5494  TYPE3IMQPROT  59     2e-15    Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature.

Length = 86

Score = 59.0 bits (143), Expect = 2e-15
Identities = 21/76 (27%), Positives = 45/76 (59%)

Query: 5 ETLDVARDAIWTIVLVSSPLMVVGLVVGVVVSLFQALTQIQEQTLVFVPKIIAIFVTLLL 64
+ + A++ ++++S +V ++G++V LFQ +TQ+QEQTL F K++ + + L L
Sbjct: 3 DLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLFL 62

Query: 65 ALPFMADALHSHMMRI 80
+ + L S+ ++
Sbjct: 63 LSGWYGEVLLSYGRQV 78


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
BBta_5495  FLGHOOKFLIE  47     5e-10    Flagellar hook-basal body complex protein FliE signa...
		>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE signature.

Length = 103

Score = 46.6 bits (110), Expect = 5e-10
Identities = 19/78 (24%), Positives = 41/78 (52%), Gaps = 2/78 (2%)

Query: 43 ADKGGPSFSAVLKDAIGSVMETGRKSDSQAVAMAAGKA--NVMDVVTAVAETDVAVSTLV 100
+ SF+ L A+ + +T + +QA G+ + DV+T + + V++ +
Sbjct: 26 LPQPTISFAGQLHAALDRISDTQTAARTQAEKFTLGEPGVALNDVMTDMQKASVSMQMGI 85

Query: 101 SVRDRVIQAYEDVMKMTI 118
VR++++ AY++VM M +
Sbjct: 86 QVRNKLVAAYQEVMSMQV 103


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
BBta_5496  FLGHOOKAP1  30     0.003    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 29.9 bits (67), Expect = 0.003
Identities = 8/38 (21%), Positives = 17/38 (44%)

Query: 101 NVNPLVEMTDMRDAQRSYEANLNIIGATRRMIQRTLDI 138
VN E +++ Q+ Y AN ++ + ++I
Sbjct: 508 GVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
BBta_5498  IGASERPTASE  55     3e-10    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 55.1 bits (132), Expect = 3e-10
Identities = 47/266 (17%), Positives = 84/266 (31%), Gaps = 20/266 (7%)

Query: 77 DIVVESNIVRAMPAREQLPQRPGLSAEPQAARIAPLPEPAPWPDAEPKSEPVELPEPQMP 136
V +NI + +P P + E AP+P PAP +E E + +
Sbjct: 990 QTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESK 1049

Query: 137 ELPPRPARPSFIDEVRRTAPAAAERREPAAFAPEPPPLARRGEPRAEPRPEPRSEETRVE 196
+ + + + A E + + +A+ G E + E VE
Sbjct: 1050 TVEKNEQDAT--ETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVE 1107

Query: 197 PRPERLPRSEAPREPLIPRPLRPAESPQQSEAARQPEAPAKLPPLRRPMRPEPPAA---- 252
E + E + +P+ Q+ QP+A P R P
Sbjct: 1108 K--EEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQA--------EPARENDPTVNIKE 1157

Query: 253 PPAPAPAPVAQMPAAPPPPPPAPPPSQQSAEAN----LAEMAQRLEAALRRPSADAPAAP 308
P + A P +S N + E + A +P+ ++ ++
Sbjct: 1158 PQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSN 1217

Query: 309 AEGTPLRQAARAEPAPTPPAKTSFEN 334
R++ R+ P PA TS +
Sbjct: 1218 KPKNRHRRSVRSVPHNVEPATTSSND 1243



Score = 35.0 bits (80), Expect = 5e-04
Identities = 22/127 (17%), Positives = 33/127 (25%), Gaps = 6/127 (4%)

Query: 114 EPAPWPDAEPKSEPVELPEPQMPELPPRPARPSF-IDEVRRTAPAAAERREPAAFAPE-- 170
P PK E E +PQ P R P+ I E + A+ +PA
Sbjct: 1122 VPKVTSQVSPKQEQSETVQPQAE--PARENDPTVNIKEPQSQTNTTADTEQPAKETSSNV 1179

Query: 171 PPPLARRGEPRAEPRPEPRSE-ETRVEPRPERLPRSEAPREPLIPRPLRPAESPQQSEAA 229
P+ E T +P S + R +R +
Sbjct: 1180 EQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATT 1239

Query: 230 RQPEAPA 236
+
Sbjct: 1240 SSNDRST 1246


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_5499  FLGBIOSNFLIP  273    2e-95    Escherichia coli: Flagellar biosynthetic protein Fl...
		>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature.

Length = 245

Score = 273 bits (699), Expect = 2e-95
Identities = 117/228 (51%), Positives = 159/228 (69%), Gaps = 2/228 (0%)

Query: 7 AGSAAAQDISINLGPGNGGVTE--RAVQLIALLTVLSIAPSILIMMTSFTRIVVVLSLLR 64
A AQ I P GG VQ + +T L+ P+IL+MMTSFTRI++V LLR
Sbjct: 16 TPLAFAQLPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILLMMTSFTRIIIVFGLLR 75

Query: 65 TAMGTATAPPNAVIIALAMFLTAFVMGPVLQKSYDDGVRPLIENQIGVEDALQRASVPLR 124
A+GT +APPN V++ LA+FLT F+M PV+ K Y D +P E +I +++AL++ + PLR
Sbjct: 76 NALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEEKISMQEALEKGAQPLR 135

Query: 125 GFMQKNVREKDLKLFMDLSGEAPPATPDELSLRILVPAFMISELKRAFEIGFLLFLPFLI 184
FM + RE DL LF L+ P P+ + +RIL+PA++ SELK AF+IGF +F+PFLI
Sbjct: 136 EFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSELKTAFQIGFTIFIPFLI 195

Query: 185 IDLVVASVLMSMGMMMLPPVVVSLPFKLIFFVLVDGWSLVAGSLVQSY 232
IDLV+ASVLM++GMMM+PP ++LPFKL+ FVLVDGW L+ GSL QS+
Sbjct: 196 IDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLAQSF 243


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
BBta_5504  IGASERPTASE  44     3e-06    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 44.3 bits (104), Expect = 3e-06
Identities = 31/172 (18%), Positives = 54/172 (31%), Gaps = 15/172 (8%)

Query: 281 KVAAVSSDTLPGANAAAAEKPNGDKPNGEKPKQATLPEIVPPTSETIAKEMKAEAKPAPA 340
+VA S+T E +K K + E+ TS+ K+ ++E
Sbjct: 1084 EVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSET--VQP 1141

Query: 341 MATPATETAAAPAMAEKAEPARPEAAKSEPPQAEAARAEPKLEAAK-----------PET 389
A PA E + E A +P + ++ E + + PE
Sbjct: 1142 QAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPEN 1201

Query: 390 MKSETTKSETAKSEAAKPDIAKAEPAKPVEKMAEKPVETTANGAATLEALRD 441
TT+ + KP + V E ++ + + AL D
Sbjct: 1202 TTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRST--VALCD 1251



Score = 43.1 bits (101), Expect = 8e-06
Identities = 36/150 (24%), Positives = 57/150 (38%), Gaps = 20/150 (13%)

Query: 288 DTLPGANAAAAEKPNGDKPNGEKPKQATLPEIVPPTSETIAKEMKA-EAKPAPAMATPAT 346
T+ N D P+ VP +E IA+ +A PAPA + T
Sbjct: 990 QTVDTTNITTPNNIQADVPS------------VPSNNEEIARVDEAPVPPPAPATPSETT 1037

Query: 347 ETAA--APAMAEKAEPARPEAAKSEPPQAEAARAEPKLEAAKPETMKSETTKSETAKSEA 404
ET A + ++ E +A ++ E A+ A +T + + SET +++
Sbjct: 1038 ETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQT 1097

Query: 405 A----KPDIAKAEPAK-PVEKMAEKPVETT 429
+ K E AK EK E P T+
Sbjct: 1098 TETKETATVEKEEKAKVETEKTQEVPKVTS 1127



Score = 39.7 bits (92), Expect = 8e-05
Identities = 32/165 (19%), Positives = 54/165 (32%), Gaps = 13/165 (7%)

Query: 278 PDKKVAAVSSDTLPGANAAAAEKPNGDKPNGEKPKQATLPEIV----PPTSETIAKEMKA 333
P D P A A + E KQ + + +E+
Sbjct: 1011 PSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAK 1070

Query: 334 EAKPAPAMATPATETAAAPAMAEKA------EPARPEAAKSEPPQAEAARAEPKLEA-AK 386
EAK T E A + + ++ E A E + + E + PK+ +
Sbjct: 1071 EAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVS 1130

Query: 387 PETMKSETTKSETAKSEAAKPDIAKAEPAKPVEKMA--EKPVETT 429
P+ +SET + + + P + EP A E+P + T
Sbjct: 1131 PKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKET 1175



Score = 39.3 bits (91), Expect = 1e-04
Identities = 25/113 (22%), Positives = 41/113 (36%), Gaps = 3/113 (2%)

Query: 330 EMKAEAKPAPAMATPATETAAAPAMAEK-AEPARPEAAKSEPPQAEAARAEPKLEAAKPE 388
E + + + TP A P++ E AR + A PP A A +E A+
Sbjct: 986 EKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPP-APATPSETTETVAENS 1044

Query: 389 TMKSETTKSETAKSEAAKPDIA-KAEPAKPVEKMAEKPVETTANGAATLEALR 440
+S+T + + A+ AK K + E +G+ T E
Sbjct: 1045 KQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQT 1097


116  BBta_5507  BBta_5528  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias    %GCBias  Product
BBta_5507       0       17  -0.188371  flagellar motor switch protein FliM
BBta_5508       0       17   0.588755  flagellar basal body-associated protein FliL
BBta_5509       0       14   1.988799  flagellar basal body rod protein FlgF
BBta_5510       0       11   2.383340  flagellar basal body rod protein FlgG
BBta_5511      -2       10   2.168294  flagellar basal body P-ring biosynthesis protein
BBta_5512       0        9   1.723913  flagellar basal body L-ring protein
BBta_5513      -1        9   1.879394  hydratase/decarboxylase
BBta_5514       0        9   1.853109  dihydrokaempferol 4-reductase
BBta_5515      -2        8   0.688380  hypothetical protein
BBta_5516       1       11  -0.310889  DnaK suppressor protein
BBta_5517      -1       11   0.753411  hypothetical protein
BBta_5518      -1       11   0.032495  transcriptional regulator
BBta_5519      -1       15  -0.694283  flagellar assembly regulator FliX
BBta_5520       0       16  -2.211372  flagellar basal body P-ring protein
BBta_5521       3       15  -3.123594  chemotactic signal-response protein CheL
BBta_5522       6       16  -2.706397  hypothetical protein
BBta_5524       5       15  -3.424801  hypothetical protein
BBta_5525       7       17  -2.962747  hypothetical protein
BBta_5526       5       15  -1.430938  flagellar biosynthesis regulatory protein FlaF
BBta_5527       3       13  -0.588458  flagellin protein, C-terminus
BBta_5528       2       13   0.665530  flagellin protein, C-terminus
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_5507  FLGMOTORFLIM  280    3e-94    Flagellar motor switch protein FliM signature.
		>FLGMOTORFLIM#Flagellar motor switch protein FliM signature.

Length = 344

Score = 280 bits (718), Expect = 3e-94
Identities = 80/322 (24%), Positives = 157/322 (48%), Gaps = 15/322 (4%)

Query: 66 VLSQEEIDNLLGF-TAGEVNLDDHSGIRAIIDSAMVSY--------ERLPMLEIVFDRLV 116
VLSQ+EID LL ++G+ +++D I + + E++ L ++ +
Sbjct: 4 VLSQDEIDQLLTAISSGDASIEDARPISDTRKITLYDFRRPDKFSKEQMRTLSLMHETFA 63

Query: 117 RLMTTSLRNFTSDNVEVSLDRITSVRFGDYMNSIPLPAVLSVFKAEEWENFGLATVDSTL 176
RL TTSL V V + + + + +++ SIP P+ L+V + + + VD ++
Sbjct: 64 RLTTTSLSAQLRSMVHVHVASVDQLTYEEFIRSIPTPSTLAVITMDPLKGNAVLEVDPSI 123

Query: 177 IYSIIDVLLGGRRGNTQLRVEGRPYTTIETNLVKRLIEVVLSDAEQAFRPLSPVRFTIDR 236
+SIID L GG +++ R T IE ++++ +I +L++ +++ + +R + +
Sbjct: 124 TFSIIDRLFGGTGQAAKVQ---RDLTDIENSVMEGVIVRILANVRESWTQVIDLRPRLGQ 180

Query: 237 LETNPRFAAISRPANAAILVKLRIDMEDRGGNIELLLPYATIEPIRPVLLQMFMGEKFGR 296
+ETNP+FA I P+ +LV L + + G + +PY TIEPI L F R
Sbjct: 181 IETNPQFAQIVPPSEMVVLVTLETKVGEEEGMMNFCIPYITIEPIISKLSSQFWFSSVRR 240

Query: 297 DPV--WEGHFATEIAQAEVAVDAVLYEADVPLKQLMRLKVGDTLPL-DIRPDALVAVRCG 353
+ G +++ ++ V A + + ++ ++ L+VGD + L D + G
Sbjct: 241 SSTTQYMGVLRDKLSTVDMDVVAEVGSLRLSVRDILGLRVGDIIRLHDTHVGDPFVLSIG 300

Query: 354 STMLTEGRMGRVGDRVAIRVTK 375
+ + G VG ++A ++ +
Sbjct: 301 NRKKFLCQPGVVGKKIAAQILE 322


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
BBta_5510   FLGHOOKAP1     39      1e-05     Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 38.8 bits (90), Expect = 1e-05
Identities = 9/48 (18%), Positives = 21/48 (43%)

Query: 213 SITQGSLEQANVDVVSEMSELIAAQRAYEMNAKVISAADQMMQSTTAL 260
++ + V++ E L Q+ Y NA+V+ A+ + + +
Sbjct: 498 QLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545



Score = 32.2 bits (73), Expect = 0.002
Identities = 10/34 (29%), Positives = 20/34 (58%)

Query: 4 LHTAATGMAAQELNVQVISNNIANLRTTGFKKQT 37
++ A +G+ A + + SNNI++ G+ +QT
Sbjct: 4 INNAMSGLNAAQAALNTASNNISSYNVAGYTRQT 37


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
BBta_5512   FLGLRINGFLGH   185     1e-60     Flagellar L-ring protein signature.
		>FLGLRINGFLGH#Flagellar L-ring protein signature.

Length = 232

Score = 185 bits (471), Expect = 1e-60
Identities = 60/197 (30%), Positives = 96/197 (48%), Gaps = 10/197 (5%)

Query: 68 PKPEVVSYAPNSLWRN------GSRAFFKDQRARQVGDLLTVTVSISDKANIANETQRSR 121
P P A S++++ G + F+D+R R +GD LT+ + + A+ ++ SR
Sbjct: 39 PVPGPTPVANGSIFQSAQPINYGYQPLFEDRRPRNIGDTLTIVLQENVSASKSSSANASR 98

Query: 122 TNKEDSGITDFVGSKTLGATGKNILPGRILTADGTSSSDGKGTIQRSESLTTSVAAVVTQ 181
K + G D V G G + A G ++ +GKG S + + ++ V Q
Sbjct: 99 DGKTNFGF-DTVPRYLQGLFGNARAD---VEASGGNTFNGKGGANASNTFSGTLTVTVDQ 154

Query: 182 VLPNGNLVVEGKQEIRVNYEIRELIVAGIVRPEDIQSDNTIDSTKIAQARISYGGRGQIT 241
VL NGNL V G+++I +N + +G+V P I NT+ ST++A ARI Y G G I
Sbjct: 155 VLVNGNLHVVGEKQIAINQGTEFIRFSGVVNPRTISGSNTVPSTQVADARIEYVGNGYIN 214

Query: 242 DVQQPRYGQQVMDILLP 258
+ Q + Q+ L P
Sbjct: 215 EAQNMGWLQRFFLNLSP 231


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
BBta_5514   NUCEPIMERASE   76      8e-18     Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 76.4 bits (188), Expect = 8e-18
Identities = 79/340 (23%), Positives = 122/340 (35%), Gaps = 50/340 (14%)

Query: 4 VLVTGGSGFIGTHVILQLLAAGHRVR-----ATLRTPARQSEVLAMLQRGGATDTGPLSF 58
LVTG +GFIG HV +LL AGH+V + + L +L + G F
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPG------FQF 56

Query: 59 YAADLTRDDGWAQ--AATGCDYVLHIA-------SPLPAHVPKDENELIVPAREGTLRVL 109
+ DL +G A+ + V S H D N G L +L
Sbjct: 57 HKIDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLT------GFLNIL 110

Query: 110 RAARDAGVKRAVVTSSFAAIGYGHVRRSHPFDETDWSELDGPAVQPYPKSKTLAERAAWH 169
R ++ + SS + G R PF D +D P V Y +K E A
Sbjct: 111 EGCRHNKIQHLLYASSSSVYGLN---RKMPFSTDD--SVDHP-VSLYAATKKANELMA-- 162

Query: 170 FVAQEGGGLELATVNPVAVLGPVLGPDVSTSIAMVQALLAGRIPAV-----PRISFGLVD 224
GL + V GP PD++ +A+L G+ V + F +D
Sbjct: 163 HTYSHLYGLPATGLRFFTVYGPWGRPDMALFK-FTKAMLEGKSIDVYNYGKMKRDFTYID 221

Query: 225 VRDVADLHLRAMTAPAAKGERFLAIAGPPMSLREIAELLRVRLGASGRRAPRYELPDWLA 284
D+A+ +R ++ G P + + + + + L
Sbjct: 222 --DIAEAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNI---GNSSPVELMDYIQALE 276

Query: 285 RALALAVPQLRAVLPL-VGRYCETSAD--KAKRLLGWSPR 321
AL + + +LPL G ETSAD ++G++P
Sbjct: 277 D--ALGIEAKKNMLPLQPGDVLETSADTKALYEVIGFTPE 314


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
BBta_5517   NUCEPIMERASE   48      5e-09     Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 48.2 bits (115), Expect = 5e-09
Identities = 25/78 (32%), Positives = 32/78 (41%), Gaps = 15/78 (19%)

Query: 1 MKIAVAGASGRAGSRITAELAGRGHQVTGI-------------ARNPDKILAHPNVTAVK 47
MK V GA+G G ++ L GHQV GI AR ++LA P K
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARL--ELLAQPGFQFHK 58

Query: 48 GDANDRAELARLWAGHDV 65
D DR + L+A
Sbjct: 59 IDLADREGMTDLFASGHF 76


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
BBta_5520   FLGPRINGFLGI   390     e-137     Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 390 bits (1002), Expect = e-137
Identities = 181/372 (48%), Positives = 256/372 (68%), Gaps = 11/372 (2%)

Query: 8 RLFQVACAAIVALA----SSAMSAHATSRIKDLANIEGVRQNQLIGYGLVVGLNGTGDTL 63
R+ ++ AA+V A S+ + TSRIKD+A+++ R NQLIGYGLVVGL GTGD+L
Sbjct: 2 RVLRIIAAALVFSALPFLSTPPAQADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDSL 61

Query: 64 NNIPFTKQSLQAMLERMGVNIRGATIRTGNVAAVMVTGNLPPFATQGTRMDVTVSALGDA 123
+ PFT+QS++AML+ +G+ +G N+AAVMVT NLPPFA+ G+R+DVTVS+LGDA
Sbjct: 62 RSSPFTEQSMRAMLQNLGITTQGGQSNAKNIAAVMVTANLPPFASPGSRVDVTVSSLGDA 121

Query: 124 KNLQGGTLLVTPLLGADGNVYAVAQGSLAISGFQAEGEAAKIVRGVPTVGRIANGAIIER 183
+L+GG L++T L GADG +YAVAQG+L ++GF A+G+AA + +GV T R+ NGAIIER
Sbjct: 122 TSLRGGNLIMTSLSGADGQIYAVAQGALIVNGFSAQGDAATLTQGVTTSARVPNGAIIER 181

Query: 184 EIEFALNRLPNVRLALRNADFTTAKRIAAAINDF----LGVKTAEPIDPSTVQLSIPPEF 239
E+ N+ L LRN DF+TA R+A +N F G AEP D + + P
Sbjct: 182 ELPSKFKDSVNLVLQLRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQEIAVQKPRVA 241

Query: 240 KGNVVAFLTEIEQLQVDPDLAAKIVIDERSGIIVMGRDVRVATVAVAQGNLTVTISESPQ 299
++ + EIE L V+ D AK+VI+ER+G IV+G DVR++ VAV+ G LTV ++ESPQ
Sbjct: 242 --DLTRLMAEIENLTVETDTPAKVVINERTGTIVIGADVRISRVAVSYGTLTVQVTESPQ 299

Query: 300 VSQPNPLSQGRTVVTPRTSVGVTEDGKKFAVVKDGVSLQQLVDGLNGLGIGPRDLISILQ 359
V QP P S+G+T V P+T + ++G K A+V+ G L+ LV GLN +G+ +I+ILQ
Sbjct: 300 VIQPAPFSRGQTAVQPQTDIMAMQEGSKVAIVE-GPDLRTLVAGLNSIGLKADGIIAILQ 358

Query: 360 AIKAAGAIEADI 371
IK+AGA++A++
Sbjct: 359 GIKSAGALQAEL 370


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
BBta_5521   FLGFLGJ        35      3e-05     Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 35.5 bits (81), Expect = 3e-05
Identities = 23/97 (23%), Positives = 41/97 (42%), Gaps = 5/97 (5%)

Query: 33 ADALTKVSPKA----QAKAKATATDFEAMFLNSMFAQMTSGVKGDGPFGDTPSTGVWRSM 88
A +L ++ KA A + A E MF+ M M + DG F + T ++ SM
Sbjct: 15 AQSLNELKAKAGEDPAANIRPVARQVEGMFVQMMLKSMRDALPKDGLF-SSEHTRLYTSM 73

Query: 89 LMEQYSKNFAKAGGVGLSNDVFRTLILQQAKSSGSGA 125
+Q ++ G+GL+ + + + +Q S
Sbjct: 74 YDQQIAQQMTAGKGLGLAEMMVKQMTPEQPLPEESTP 110


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
BBta_5527   FLAGELLIN      56      2e-10     Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 56.2 bits (135), Expect = 2e-10
Identities = 81/507 (15%), Positives = 160/507 (31%), Gaps = 16/507 (3%)

Query: 17 LQNTASLLATTQNNLATGNKVNTALDNPTEFFTAQSLNNRASDIANLLDSIGNGVQVLQA 76
L + S L++ L++G ++N+A D+ A + + + +G+ + Q
Sbjct: 17 LNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQASRNANDGISIAQT 76

Query: 77 ANTGLTSLQKLVDSAKSIASQVLQAPTGYTTKSSITSAVIPGATANNLLGSSSNNFVTGS 136
L + + + ++ Q + SI + + + + + +
Sbjct: 77 TEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEIDRVSNQT----QFN 132

Query: 137 TVNNDNLSSAVAITGSTRLSGTPSSTSNDLASSITTGDTLVVNGVVFTFVAGSVSAGTNI 196
V + + + I T + + D VNG V S+ N+
Sbjct: 133 GVKVLSQDNQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKEATVGDLKSSFKNV 192

Query: 197 GVGDTVSNLLAAIDSVTGATATPSSVTGGKIALATGTAQDLTVSGTALAKLGLTAATTTR 256
DT + + A + TA + A G
Sbjct: 193 TGYDTYAVGANKYRVDVNSGAV----------VTDTTAPTVPDKVYVNAANGQLTTDDAE 242

Query: 257 NAPALSGQTLTIASTGGGVATNITFGTGASQISTLAQLNTALASNNLQASLSTTGQLTIL 316
N A+ T ++ G A I + + + + G+++
Sbjct: 243 NNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVSTT 302

Query: 317 TTNEAASSTIGAVAGSSTASSMAFNGVTASTPVADTNSQTTRAGLIAQYNNVLAQINTTA 376
E + T+ + + A + + + N Q T + L+ + A
Sbjct: 303 INGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDL--EA 360

Query: 377 QDASFNGINLLNGDTLKLVFNETGRSTLNITGVTFNSTGLGLSALVVGTDFLDSNSANKV 436
+A + + TL + + T G+S L+ S
Sbjct: 361 NNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAAKKSTANP 420

Query: 437 LSTLNSASTAIRSEASSLGSNLSIVQIRQDFNKNLINVLQTGSSNLTLADTNEEAANSQA 496
L++++SA + + + SSLG+ + N + L + S + AD E +N
Sbjct: 421 LASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSNMSK 480

Query: 497 LSTRQSIAVSALALANQSQASVLQLLR 523
Q S LA ANQ +VL LLR
Sbjct: 481 AQILQQAGTSVLAQANQVPQNVLSLLR 507


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
BBta_5528   FLAGELLIN      55      3e-10     Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 55.4 bits (133), Expect = 3e-10
Identities = 80/498 (16%), Positives = 154/498 (30%), Gaps = 7/498 (1%)

Query: 17 LQSTAQLLATTQNNLATGKKVNSALDNPTNFFTAQGLDNRASDISNLLDGIGNGVQVLQA 76
L + L++ L++G ++NSA D+ A + ++ +G+ + Q
Sbjct: 17 LNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQASRNANDGISIAQT 76

Query: 77 ANTGITSLQKLVDSAKSIANQVLQSSVGYSTKSNVTSAALAGATASSLIGASTTAVTGSV 136
+ + + + ++ Q + S ++ + T V
Sbjct: 77 TEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEIDRVSNQTQFNGVKV 136

Query: 137 VLNDNTSSAVAITGTTKLSGTPGTSSNDLASSITTGDTLVVNGTTFTFIAGTSSSGTNIG 196
+ DN I T + D VNG + SS N+
Sbjct: 137 LSQDNQMK---IQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKEATVGDLKSSFKNVT 193

Query: 197 VGDTVTNLLSTIQSATGVTSSITAGAITLTPPAAGLTLSGTSLAKLGLSAVGNSLSGQTL 256
DT + + + +T P + + L
Sbjct: 194 GYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDDAENNTAV-DLFKT 252

Query: 257 TIAATGGGTATSITFGLGTGQVNSLNDLNTKLAANNLQASFDTSSGKISITTTNDAASAT 316
T + G A +I + G+ D + + D + ++TT + T
Sbjct: 253 TKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGK---VSTTINGEKVT 309

Query: 317 IGAIGGTAAASSQSFNGLTAAAPVADATAQSQRSSLVAQYNNVLQQINTTAADASFNGVN 376
+ TA A++ L ++ V + Q + N + + A +A
Sbjct: 310 LTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEANNAVKGESK 369

Query: 377 LLNGDTLKLTFNETGKSSLSITGVTFNIAGLGLSNLTAGTDFLDNNSANKVLNVLNTASS 436
+ K +L+ + + G+S L S L +++A S
Sbjct: 370 ITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAAKKSTANPLASIDSALS 429

Query: 437 TLRSEASTLGSNLSVVQIRQDFNKNLINVLQTGSSNLTLADTNEEAANSQALSTRQSIAV 496
+ + S+LG+ + N + L + S + AD E +N Q
Sbjct: 430 KVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSNMSKAQILQQAGT 489

Query: 497 SALSLANQSQASVLQLLR 514
S L+ ANQ +VL LLR
Sbjct: 490 SVLAQANQVPQNVLSLLR 507
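
Note: the per-hit statistics lines in this report (`Score = ... bits (...), Expect = ...` and `Identities = a/b (p%), ...`) follow the usual BLAST pairwise text layout, so they can be pulled out with a couple of regular expressions. The following is a minimal sketch only, not part of PredictBias; the function names `parse_expect` and `parse_hsp` are hypothetical. It assumes that an abbreviated E-value such as `e-137` means `1e-137`, and that the printed percentages are truncated rather than rounded, which is consistent with the figures shown in this report (for example 154/498 is printed as 30%).

```python
import re

# Patterns for the two statistics lines that precede each alignment block.
SCORE_RE = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)"
)
COUNT_RE = re.compile(r"(Identities|Positives|Gaps)\s*=\s*(\d+)/(\d+)")


def parse_expect(text):
    """Convert an E-value string to float.

    Assumption: an abbreviated value like "e-137" stands for 1e-137.
    """
    return float("1" + text if text.startswith("e") else text)


def parse_hsp(score_line, counts_line):
    """Return a dict with bit score, raw score, E-value and match counts."""
    match = SCORE_RE.search(score_line)
    if match is None:
        raise ValueError("not a Score/Expect line: %r" % score_line)
    hsp = {
        "bits": float(match.group(1)),
        "raw_score": int(match.group(2)),
        "expect": parse_expect(match.group(3)),
    }
    # Each entry becomes (matches, alignment length, truncated percentage).
    for name, num, den in COUNT_RE.findall(counts_line):
        hsp[name.lower()] = (int(num), int(den), 100 * int(num) // int(den))
    return hsp


if __name__ == "__main__":
    hsp = parse_hsp(
        "Score = 55.4 bits (133), Expect = 3e-10",
        "Identities = 80/498 (16%), Positives = 154/498 (30%), Gaps = 7/498 (1%)",
    )
    print(hsp)
```

Hits without a `Gaps` field (ungapped alignments) simply produce no `gaps` key in the returned dict.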


117   BBta_5617   BBta_5621   N   Y   N   Pathogenicity Island (unbiased-composition)
LocusTag    DNBias   CDNBias   %GCBias     Product
BBta_5617   -1       12        0.692236    SPINDLY family O-linked N-acetylglucosamine
BBta_5618   -2       11        0.298058    SPINDLY family O-linked N-acetylglucosamine
BBta_5619   -1       12        0.104580    SPINDLY family O-linked N-acetylglucosamine
BBta_5620   -2       13        -1.292546   mannose-1-phosphate guanylyltransferase (GDP)
BBta_5621   -2       15        -2.295345   nucleotide sugar epimerase
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
BBta_5617   SYCDCHAPRONE   53      5e-10     Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 52.6 bits (126), Expect = 5e-10
Identities = 24/120 (20%), Positives = 37/120 (30%)

Query: 92 EALSNLGLALFNRKRYEEARKCQELAVALKPNLVVALTGLGNTLMRLGRPEEAIVAHDRA 151
E L +L + +YE+A K + L GLG +G+ + AI ++
Sbjct: 37 EQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYG 96

Query: 152 IALKPDYADAYCNRGMALLPLNRNAEANQNFDRALSLNPRHMEAMFGKGLASIILRHFDD 211
+ + LL AEA A L E S +L
Sbjct: 97 AIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQELIADKTEFKELSTRVSSMLEAIKL 156



Score = 42.6 bits (100), Expect = 1e-06
Identities = 16/114 (14%), Positives = 33/114 (28%), Gaps = 3/114 (2%)

Query: 48 QILALLPEHFDALHLLGVVALDSGQLDMAEQALTKAVEAEPRHAEALSNLGLALFNRKRY 107
+ + + + L+ L SG+ + A + + + LG +Y
Sbjct: 27 MLNEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQY 86

Query: 108 EEARKCQELAVALKPNLVVALTGLGNTLMRLGRPEEAIVAHDRAIAL---KPDY 158
+ A + L++ G EA A L K ++
Sbjct: 87 DLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQELIADKTEF 140



Score = 33.7 bits (77), Expect = 0.001
Identities = 18/97 (18%), Positives = 33/97 (34%)

Query: 24 DELLPRAVAAYRAGRPAEAQAICGQILALLPEHFDALHLLGVVALDSGQLDMAEQALTKA 83
++L A Y++G+ +A + + L LG GQ D+A + +
Sbjct: 37 EQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYG 96

Query: 84 VEAEPRHAEALSNLGLALFNRKRYEEARKCQELAVAL 120
+ + + L + EA LA L
Sbjct: 97 AIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQEL 133



Score = 29.5 bits (66), Expect = 0.027
Identities = 15/103 (14%), Positives = 33/103 (32%), Gaps = 7/103 (6%)

Query: 153 ALKPDYADAYCNRGMALLPLNRNAEANQNFDRALSLNPRHMEAMFGKGLASIILRHFDDA 212
+ D + + + +A++ F L+ G G + +D A
Sbjct: 30 EISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLA 89

Query: 213 LAAFDAALAIRPR-------AAQVLAQRGRLHQQAGRFDPAMA 248
+ ++ + + AA+ L Q+G L + A
Sbjct: 90 IHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQE 132


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
BBta_5618   SYCDCHAPRONE   48      1e-08     Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 48.4 bits (115), Expect = 1e-08
Identities = 22/109 (20%), Positives = 37/109 (33%)

Query: 80 LAKAVEVDPRHAEALSNLGLALFSRKRFEEARKCQERAVTLKPNLVVAQTGLGNTLMRLG 139
+A E+ E L +L + ++E+A K + L GLG +G
Sbjct: 25 IAMLNEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMG 84

Query: 140 RPDEAVAAHDRAIALKPDYADAYCNRGMALLTLNRNAEANQSFDRALSL 188
+ D A+ ++ + + LL AEA A L
Sbjct: 85 QYDLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQEL 133



Score = 46.1 bits (109), Expect = 8e-08
Identities = 16/114 (14%), Positives = 36/114 (31%), Gaps = 3/114 (2%)

Query: 48 QILALLPDHVDALHLLGVTALDGGQLDLAEQALAKAVEVDPRHAEALSNLGLALFSRKRF 107
+ + D ++ L+ L G+ + A + +D + LG + ++
Sbjct: 27 MLNEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQY 86

Query: 108 EEARKCQERAVTLKPNLVVAQTGLGNTLMRLGRPDEAVAAHDRAIAL---KPDY 158
+ A + L++ G EA + A L K ++
Sbjct: 87 DLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQELIADKTEF 140


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
BBta_5619   SYCDCHAPRONE   45      2e-07     Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 44.9 bits (106), Expect = 2e-07
Identities = 16/97 (16%), Positives = 26/97 (26%)

Query: 96 NLGLVLSSLKRYEEARAAQERAIALKPNFATALTGLGNTLMNMRLFAQAIEAHDRAIALK 155
+L +YE+A + L + GLG M + AI ++ +
Sbjct: 41 SLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMD 100

Query: 156 PDFADAYCNRGMAQLLLLRNEEARQSFERALALAPRH 192
+ L EA A L
Sbjct: 101 IKEPRFPFHAAECLLQKGELAEAESGLFLAQELIADK 137



Score = 40.3 bits (94), Expect = 8e-06
Identities = 16/101 (15%), Positives = 33/101 (32%)

Query: 153 ALKPDFADAYCNRGMAQLLLLRNEEARQSFERALALAPRHMQATFGKGLVSVNLRHFDQA 212
+ D + + Q + E+A + F+ L + G G + +D A
Sbjct: 30 EISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLA 89

Query: 213 LVAFNAALALKPGAAAVIAQRGRLYVQMGRFKEAETDFDTA 253
+ +++ + +Q G EAE+ A
Sbjct: 90 IHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLA 130



Score = 40.3 bits (94), Expect = 8e-06
Identities = 16/114 (14%), Positives = 38/114 (33%), Gaps = 3/114 (2%)

Query: 48 QILALVPDHFDALHLLGASALDNGRLDLAEQALTRAVAVEPRNAEAQANLGLVLSSLKRY 107
+ + D + L+ L + +G+ + A + ++ ++ LG ++ +Y
Sbjct: 27 MLNEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQY 86

Query: 108 EEARAAQERAIALKPNFATALTGLGNTLMNMRLFAQAIEAHDRAIAL---KPDF 158
+ A + + L+ A+A A L K +F
Sbjct: 87 DLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQELIADKTEF 140



Score = 34.1 bits (78), Expect = 8e-04
Identities = 22/104 (21%), Positives = 37/104 (35%), Gaps = 3/104 (2%)

Query: 24 DGLLTQAVAAYRAGRHADAQAVCGQILALVPDHFDALHLLGASALDNGRLDLAEQALTRA 83
+ L + A Y++G++ DA V + L LGA G+ DLA + +
Sbjct: 37 EQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYG 96

Query: 84 VAVEPRNAEAQANLGLVLSSLKRYEEARAAQERAIAL---KPNF 124
++ + + L EA + A L K F
Sbjct: 97 AIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQELIADKTEF 140


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
BBta_5621   NUCEPIMERASE   525     0.0       Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 525 bits (1355), Expect = 0.0
Identities = 203/333 (60%), Positives = 248/333 (74%), Gaps = 1/333 (0%)

Query: 6 ILVTGAAGFIGFHLAQRLLAEGRQVIGIDNINAYYDPKLKQARLDRLAAQPGFIFHKLDL 65
LVTGAAGFIGFH+++RLL G QV+GIDN+N YYD LKQARL+ LA QPGF FHK+DL
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLA-QPGFQFHKIDL 61

Query: 66 VDRAGVKALFAAHHFPAVVHLAAQAGVRYSLDNPHAYVDANLEGFINILEGCRHHGCAHL 125
DR G+ LFA+ HF V + VRYSL+NPHAY D+NL GF+NILEGCRH+ HL
Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQHL 121

Query: 126 LFASSSSVYGANTKLPFSVKDNVDHPISLYAASKKANELMAHSYSHLYRLPATGLRFFTV 185
L+ASSSSVYG N K+PFS D+VDHP+SLYAA+KKANELMAH+YSHLY LPATGLRFFTV
Sbjct: 122 LYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFFTV 181

Query: 186 YGPWGRPDMAMFIFAKAILAGQPVRLFNHGQMRRDFTYIDDIVQAIHRLIGRPPQGNPDW 245
YGPWGRPDMA+F F KA+L G+ + ++N+G+M+RDFTYIDDI +AI RL P + W
Sbjct: 182 YGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIPHADTQW 241

Query: 246 DGTRPDPSSSRAPWRIYNIGNNHPEQLMDVITLLEKEFGRPAIKEMLPMQPGDVEATYAD 305
P++S AP+R+YNIGN+ P +LMD I LE G A K MLP+QPGDV T AD
Sbjct: 242 TVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNMLPLQPGDVLETSAD 301

Query: 306 VSDLERDIGFRPATPIADGIARFARWYREYHQI 338
L IGF P T + DG+ F WYR+++++
Sbjct: 302 TKALYEVIGFTPETTVKDGVKNFVNWYRDFYKV 334


118   BBta_5933   BBta_5941   N   Y   N   Pathogenicity Island (unbiased-composition)
LocusTag    DNBias   CDNBias   %GCBias     Product
BBta_5933   0        10        0.322602    NifA subfamily transcriptional regulator
BBta_5935   2        12        1.872786    FixR protein
BBta_5936   2        10        1.613672    hypothetical protein
BBta_5937   0        10        1.461968    hypothetical protein
BBta_5938   0        10        1.535973    oxidoreductase NAD(P)-binding subunit
BBta_5939   0        11        2.125716    hypothetical protein
BBta_5940   0        12        1.669895    signal transduction histidine kinase,
BBta_5941   -1       13        1.224884    two component LuxR family transcriptional
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
BBta_5933   HTHFIS         446     e-155     FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 446 bits (1150), Expect = e-155
Identities = 138/386 (35%), Positives = 199/386 (51%), Gaps = 12/386 (3%)

Query: 195 DRERLMAESHRLQKELSELKPASANRKKVLVDGIVGDSPALRALLDKINVVAKSNAPVLL 254
D L+ R E + +P+ +VG S A++ + + + +++ +++
Sbjct: 107 DLTELIGIIGRALAEP-KRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMI 165

Query: 255 RGESGTGKELVAKAIHELSNRAKRPFIKINCAALPETVLESELFGHEKGAFTGAINSRKG 314
GESGTGKELVA+A+H+ R PF+ IN AA+P ++ESELFGHEKGAFTGA G
Sbjct: 166 TGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTG 225

Query: 315 RFELADKGTLFLDEIGEISPAFQAKLLRVLQEQEFERVGGNQTIKVDVRIVTATNRNLEE 374
RFE A+ GTLFLDEIG++ Q +LLRVLQ+ E+ VGG I+ DVRIV ATN++L++
Sbjct: 226 RFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQ 285

Query: 375 AVSRNEFRADLYYRISVVPIKLPPLRERRSDIPQLAHEFLRRFNCENERDLTFDVSAIEV 434
++++ FR DLYYR++VVP++LPPLR+R DIP L F+++ E FD A+E+
Sbjct: 286 SINQGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLDVKRFDQEALEL 345

Query: 435 LMHCGFPGNVRELENCVQRTATLAAGSAIGQHDFACSRNECMSAILWKGNTVTPPPPRGQ 494
+ +PGNVRELEN V+R L I + E S I
Sbjct: 346 MKAHPWPGNVRELENLVRRLTALYPQDVITREIIEN---ELRSEIPDSPIEKAAARSGSL 402

Query: 495 PIVPLPILPRSSAPIEAPAESPAAVDIDEGEYSESMDGGAVPERERIVQAMERSGWVQAK 554
I + A M E I+ A+ + Q K
Sbjct: 403 SISQAV--EENMRQYFASFGDALPPSGLYDRVLAEM------EYPLILAALTATRGNQIK 454

Query: 555 AARLLGLTPRQIGYALKKYNIEVKRF 580
AA LLGL + +++ + V R
Sbjct: 455 AADLLGLNRNTLRKKIRELGVSVYRS 480


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
BBta_5935   DHBDHDRGNASE   100     3e-27     2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 100 bits (250), Expect = 3e-27
Identities = 69/263 (26%), Positives = 111/263 (42%), Gaps = 29/263 (11%)

Query: 39 EQKTLVLTGGSRGIGHATAKLFSDAGWRILTCSRQPFDGGRCPWDAGA----ANHVQVDL 94
E K +TG ++GIG A A+ + G I P + A A D+
Sbjct: 7 EGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADV 66

Query: 95 SDRQAIPRAIAEIKAKLDGAPLHALINNAAI-SPKATDGGRLDSLSTSVETWMTVFHVNF 153
D AI A I+ ++ P+ L+N A + P S S E W F VN
Sbjct: 67 RDSAAIDEITARIEREM--GPIDILVNVAGVLRPGLIH-------SLSDEEWEATFSVNS 117

Query: 154 LGPILLARGLFEELK-RGQGAVVNVTSIVGTRVHPFAGTAYATSKAALACLTREMAHDFA 212
G +R + + + R G++V V S V + AYA+SKAA T+ + + A
Sbjct: 118 TGVFNASRSVSKYMMDRRSGSIVTVGS-NPAGVPRTSMAAYASSKAAAVMFTKCLGLELA 176

Query: 213 PHGIRVNAIAPGEIKTEM-----VSPETEAR--------FSPLIPMRRIGAPEEVAKVLF 259
+ IR N ++PG +T+M + F IP++++ P ++A +
Sbjct: 177 EYNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVL 236

Query: 260 FLCSDAASYVTGEEIQINGGQHL 282
FL S A ++T + ++GG L
Sbjct: 237 FLVSGQAGHITMHNLCVDGGATL 259


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
BBta_5936   RTXTOXIND      35      2e-04     Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 35.2 bits (81), Expect = 2e-04
Identities = 15/64 (23%), Positives = 24/64 (37%), Gaps = 4/64 (6%)

Query: 202 VTAPYDGILRGI-VRDGCDVAAGVALAEIDPRGRHAQWTGIDAQGQALAQATLAAIRLHA 260
+ + I++ I V++G V G L ++ G A +L QA L R
Sbjct: 99 IKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEAD---TLKTQSSLLQARLEQTRYQI 155

Query: 261 ADRI 264
R
Sbjct: 156 LSRS 159


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
BBta_5938   DHBDHDRGNASE   85      2e-22     2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 85.1 bits (210), Expect = 2e-22
Identities = 57/194 (29%), Positives = 92/194 (47%), Gaps = 12/194 (6%)

Query: 20 APIQDSAHCRAVIQRAVDELGGIDILVNNAAHQATFSEIGDISDDEWEMTFRTNIHAMFY 79
A ++DSA + R E+G IDILVN A I +SD+EWE TF N +F
Sbjct: 64 ADVRDSAAIDEITARIEREMGPIDILVN-VAGVLRPGLIHSLSDEEWEATFSVNSTGVFN 122

Query: 80 LTKAAVPHMKP--GSAIVNTASVNSDMPNPSLLAYATTKGAIQNFTGGLAQMLADKGIRA 137
+++ +M +IV S + +P S+ AYA++K A FT L LA+ IR
Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 138 NAVAPGPI-----WTPLIPSTMPEEKVTSFGQQ----VPMKRAGQPAELATAYVMLADPL 188
N V+PG W+ E+ + + +P+K+ +P+++A A + L
Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242

Query: 189 SSYTSGTTVAVTGG 202
+ + + + V GG
Sbjct: 243 AGHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
BBta_5941   HTHFIS         57      6e-12     FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 57.1 bits (138), Expect = 6e-12
Identities = 28/141 (19%), Positives = 53/141 (37%), Gaps = 1/141 (0%)

Query: 2 RVLIVDDHPIVASGCRALLSTDPDLELLEASDAESGEASFVARRPDVSVIDINLPTVSGF 61
+L+ DD + + L + ++ S+A + A D+ V D+ +P + F
Sbjct: 5 TILVADDDAAIRTVLNQAL-SRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 ELARRILAHEASARLIMFSMNDDPVFAARAIDIGAKGYVSKSGDPNDLAEAIREVARGGS 121
+L RI +++ S + + A +A + GA Y+ K D +L I
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 122 YLPPAIARSIAFAGATVAQSP 142
P + V +S
Sbjct: 124 RRPSKLEDDSQDGMPLVGRSA 144


119   BBta_6143   BBta_6152   N   Y   N   Pathogenicity Island (unbiased-composition)
LocusTag    DNBias   CDNBias   %GCBias     Product
BBta_6143   0        7         0.605791    hypothetical protein
BBta_6144   -1       8         -0.017896   hypothetical protein
BBta_6145   -1       11        -0.399457   NAD dependent epimerase/dehydratase family
BBta_6146   0        14        -0.366961   hypothetical protein
BBta_6147   -1       14        -0.548724   hypothetical protein
BBta_6148   -1       15        -0.812830   hypothetical protein
BBta_6149   0        11        -1.056407   ABC transporter ATP-binding protein
BBta_6150   -1       11        -1.246856   ABC-2 type transporter permease
BBta_6151   -1       10        -0.413082   hypothetical protein
BBta_6152   -1       9         -0.146704   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
BBta_6143   OMPADOMAIN     67      2e-15     OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 67.3 bits (164), Expect = 2e-15
Identities = 24/112 (21%), Positives = 51/112 (45%), Gaps = 2/112 (1%)

Query: 63 GRNEPPPQKRPPIAPELMNLP-ALNVDIQFDQDTPIVRPDSYQTVGRIADAMVHSELLPY 121
G P P APE+ L D+ F+ + ++P+ + ++ + + +
Sbjct: 194 GEAAPVVAPAPAPAPEVQTKHFTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDG 253

Query: 122 TFLIVGHVEANGRREPNVILSQRRADAIRDILANTFKISVKRLQSIGLGEEQ 173
+ +++G+ + G N LS+RRA ++ D L + I ++ + G+GE
Sbjct: 254 SVVVLGYTDRIGSDAYNQGLSERRAQSVVDYLI-SKGIPADKISARGMGESN 304


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
BBta_6145   NUCEPIMERASE   74      8e-17     Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 73.7 bits (181), Expect = 8e-17
Identities = 27/129 (20%), Positives = 53/129 (41%), Gaps = 16/129 (12%)

Query: 1 MKVLVTGGSGFIGHHLVSALAARGTEVRVLDIRCPTHMI------------DEVEYIEGS 48
MK LVTG +GFIG H+ L G +V +D + + ++ +
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 49 VLDAGLVRN--AVAGVDQVYHLAGLPGM--WMPDREDFYRVNCVGTENVLAAARASRIRR 104
+ D + + A ++V+ + + + + N G N+L R ++I+
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120

Query: 105 FLHCSTESI 113
L+ S+ S+
Sbjct: 121 LLYASSSSV 129


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
BBta_6146   VACCYTOTOXIN   30      0.008     Helicobacter pylori vacuolating cytotoxin signature.
		>VACCYTOTOXIN#Helicobacter pylori vacuolating cytotoxin signature.

Length = 1291

Score = 30.0 bits (67), Expect = 0.008
Identities = 20/48 (41%), Positives = 22/48 (45%), Gaps = 9/48 (18%)

Query: 144 LILGMIATGIAVGTVSAVLGAILISLGMAWKAKMEEVFLAQELGPDYV 191
I+G IATG AVGTVS +LG W K E PD V
Sbjct: 43 AIVGGIATGAAVGTVSGLLG---------WGLKQAEEANKTPDKPDKV 81


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
BBta_6148   PF06057        34      5e-04     Type IV secretory pathway VirJ component
		>PF06057#Type IV secretory pathway VirJ component

Length = 243

Score = 33.7 bits (77), Expect = 5e-04
Identities = 36/130 (27%), Positives = 50/130 (38%), Gaps = 35/130 (26%)

Query: 30 ALAVALALPLGAGSVHAAPSPVAAPAASTTHVYLLRGVLNIFSLG------LD-DIAAKL 82
A A A LG + PS A+S T + L IF G LD + L
Sbjct: 20 AFADEFADNLGLTLLPVEPSTQVNAASSHT-----KPPLVIFLSGDGGWATLDKAVGGIL 74

Query: 83 DAQGIPNTVANFVSWSSL---------------ADEAAAAYRA--GRIKTIILVGHSSGA 125
QG P V WSSL Y+A G + +IL+G+S GA
Sbjct: 75 QQQGWP-----VVGWSSLKYYWKQKDPKDVTQDTLAIIDKYQAEFGT-QKVILIGYSFGA 128

Query: 126 TALPDMINKL 135
+P ++N++
Sbjct: 129 EVIPFVLNEM 138


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
BBta_6150   ABC2TRNSPORT   68      9e-16     ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 68.4 bits (167), Expect = 9e-16
Identities = 55/219 (25%), Positives = 93/219 (42%), Gaps = 1/219 (0%)

Query: 19 RTILQSIVSPVVSTSLYFVVFGAAIGSRISEVQGVSYGTFIVPGLVMLSVLTQSIANASF 78
+ L S++ + +Y GA +G + V GVSY F+ G+V S +T + +
Sbjct: 29 KAALASLLGHLAEPLIYLFGLGAGLGVMVGRVGGVSYTAFLAAGMVATSAMTAATFETIY 88

Query: 79 GIYFPKFI-GTIYEILSAPISHFEIVLGYVGAAATKSIVLGLIILATAGLFVPLHIQHPV 137
+ T +L + +IVLG + AATK+ + G I A +
Sbjct: 89 AAFGRMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGAGIGVVAAALGYTQWLSLL 148

Query: 138 WMLTFLVLTAITFSLFGFIIGIWADGFEKLQMIPMLVVTPLTFLGGSFYSVNMLPPTWHT 197
+ L + LT + F+ G ++ A ++ LV+TP+ FL G+ + V+ LP + T
Sbjct: 149 YALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPIVFQT 208

Query: 198 IALLNPVVYLISGFRWSFYEIADVRVEISVAMTLGFLVI 236
A P+ + I R V V V ++VI
Sbjct: 209 AARFLPLSHSIDLIRPIMLGHPVVDVCQHVGALCIYIVI 247


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
BBta_6152   PRTACTNFAMLY   30      0.009     Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 30.0 bits (67), Expect = 0.009
Identities = 22/107 (20%), Positives = 32/107 (29%), Gaps = 12/107 (11%)

Query: 125 LVQKDGVLNTTVTLT--------GSAEVALARNPRRGTAVARREAPQQYDDNGEPVVLTP 176
LVQ T TL G+ LA N ++ +AP +P P
Sbjct: 527 LVQTPLGSAATFTLANKDGKVDIGTYRYRLAANGNGQWSLVGAKAPPAPKPAPQPGPQPP 586

Query: 177 QPQLLPGRPIAPQQRPDDGYIYPADGSDNGVRYPAPRGTSRRVYDAQ 223
QP APQ + N G + ++ A+
Sbjct: 587 QPPQPQPEAPAPQPPAGR----ELSAAANAAVNTGGVGLASTLWYAE 629


120   BBta_6187   BBta_6201   N   Y   N   Pathogenicity Island (unbiased-composition)
LocusTag    DNBias   CDNBias   %GCBias     Product
BBta_6187   0        11        0.208930    multidrug resistance protein B
BBta_6188   -1       10        1.023924    multidrug resistance protein A
BBta_6189   1        11        1.000726    TetR family transcriptional regulator
BBta_6191   0        11        1.325699    CopG family transcriptional regulator
BBta_6192   0        12        2.078856    outer membrane receptor
BBta_6193   1        13        3.359073    hypothetical protein
BBta_6194   2        13        3.016423    hypothetical protein
BBta_6195   0        12        3.084291    hypothetical protein
BBta_6196   0        10        2.413074    hypothetical protein
BBta_6197   1        11        2.358291    hypothetical protein
BBta_6198   1        9         2.060284    two component sensor histidine kinase
BBta_6199   1        10        1.523242    OmpR family two component response
BBta_6200   0        10        1.659397    multidrug efflux system outer membrane subunit
BBta_6201   1        10        1.240645    accessory protein to ABC-type macrolide
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
BBta_6187   TCRTETB        101     6e-25     Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 101 bits (252), Expect = 6e-25
Identities = 78/401 (19%), Positives = 150/401 (37%), Gaps = 17/401 (4%)

Query: 37 FMSILDIQIVSASLSEIQAGLSASSSEVSWVQTAYLIAEVIAIPLSGFLSRALGTRLLFA 96
F S+L+ +++ SL +I + + +WV TA+++ I + G LS LG + L
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 97 ISAAGFTISSFLCGFA-SSIEQMILWRAIQGFLGAGMIPTVFASAYTVFPRSKFHIVGPI 155
S + S +I+ R IQG A V P+ +
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143

Query: 156 IGLVATLAPTIGPTVGGYITDAMSWHWLFFINIPPGVMITIGVLALVDFDEPHFELLERF 215
IG + + +GP +GG I + HW + + IP ++ I V L+ + + F
Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHYI--HWSYLLLIP--MITIITVPFLMKLLKKEVRIKGHF 199

Query: 216 DWWGLLFMAGFLGSLEYVLEEGPQHEWLQETSVAVFAVVCAVSAVCFFWRVLTVDEPIVD 275
D G++ M+ + + F +V +S + F + V +P VD
Sbjct: 200 DIKGIILMSVGIVFFML----------FTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVD 249

Query: 276 LGAFTDRNFAVGCLLQFCVGIGLYGLTYIYPRYLAEVRGYSALMIGET-MFVSGVTMFLM 334
G + F +G L + + G + P + +V S IG +F +++ +
Sbjct: 250 PGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIF 309

Query: 335 APVVGRLMVKIDMRLIIAFGLIIFAIGSYQMTGITRDYDFWELFLPQVLRGVGMMCAMVP 394
+ G L+ + ++ G+ ++ + W + + V G+
Sbjct: 310 GYIGGILVDRRGPLYVLNIGVTFLSVSFLTA-SFLLETTSWFMTIIIVFVLGGLSFTKTV 368

Query: 395 TNNIALGTLAPDRVKNASGLFNLMRNLGGAVGLAVINQVLN 435
+ I +L L N L G+A++ +L+
Sbjct: 369 ISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLS 409


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
BBta_6188   RTXTOXIND      110     2e-28     Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 110 bits (277), Expect = 2e-28
Identities = 56/361 (15%), Positives = 111/361 (30%), Gaps = 81/361 (22%)

Query: 146 VSGHIQEIVPRDNSVVRKGDVIFRIDDGDYKIAVDAARSKIATQEATIARIG-------- 197
+ ++EI+ ++ VRKGDV+ ++ + +S + R
Sbjct: 103 ENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIEL 162

Query: 198 -------------------------------------RQVAAQQSAVEQAQANLTSSEAA 220
Q ++ +++ +A + A
Sbjct: 163 NKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLAR 222

Query: 221 M--KRAGLDYERQQ-----ALSSKGFASHATFEQSEAARDQGMAAVRAARAAFDAARDNV 273
+ E+ + +L K + + E + + +R ++ + +
Sbjct: 223 INRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEI 282

Query: 274 EVTKAQQV---------------EAQAQLAELKTSLAKAERDLEFATVRAPVDGTFSNRL 318
K + + + L LAK E + + +RAPV
Sbjct: 283 LSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLK 342

Query: 319 VNT-GDFVAVGQRLGNVVPLDDVF-IDANFKETQLKRIRPGQPVTIKVDAYGAR---GFK 373
V+T G V + L +VP DD + A + + I GQ IKV+A+
Sbjct: 343 VHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLV 402

Query: 374 GVVDSISPAAGSVFTLLPPDNATGNFTKIVQRLPVRIRVPKEVARQNLLRAGMSVYVTVD 433
G V +I+ A D G ++ + + L +GM+V +
Sbjct: 403 GKVKNINLDA-------IEDQRLGLVFNVIISIEENCLSTGN--KNIPLSSGMAVTAEIK 453

Query: 434 T 434
T
Sbjct: 454 T 454


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
BBta_6189   HTHTETR        74      2e-18     TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 73.9 bits (181), Expect = 2e-18
Identities = 42/160 (26%), Positives = 67/160 (41%), Gaps = 8/160 (5%)

Query: 13 QDEDTSKRRQILDGAREVFMELGFDGASMGEIARAAGVSKGTLYVYFTDKTALFEAIVEE 72
+ E R+ ILD A +F + G S+GEIA+AAGV++G +Y +F DK+ LF I E
Sbjct: 6 KQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWEL 65

Query: 73 ETRTAV-LFRFDPDRDIATNLTGFGEAYIALVCRPGGGSAIRTVMAIAER-MPDVG---- 126
L + L+ E I ++ R +M I VG
Sbjct: 66 SESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAV 125

Query: 127 -RRYYDRVLEGTINCLTTYLRGRVEAGQLAID-DCQLAAS 164
++ + + + + L+ +EA L D + AA
Sbjct: 126 VQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAI 165


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
BBta_6192  TONBPROTEIN  38     6e-05    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 38.4 bits (89), Expect = 6e-05
Identities = 20/105 (19%), Positives = 34/105 (32%), Gaps = 20/105 (19%)

Query: 1 MVLMAAFVQQAQA-----------HPERPPAELPTVDVDA--------PKRTAKPKPQVR 41
++ A ++ QA PE P P + PK KP +V+
Sbjct: 48 TMVTPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQ 107

Query: 42 HEAQR-PKPGAPRSPAPATPADTAAAPAAPSGAPAVGSGPARPQG 85
+ +R KP R +P A ++ + A + G
Sbjct: 108 EQPKRDVKPVESRPASPFENTAPARLTSSTATAATSKPVTSVASG 152


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BBta_6197  PF03544  40     3e-06    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 40.3 bits (94), Expect = 3e-06
Identities = 22/105 (20%), Positives = 26/105 (24%), Gaps = 1/105 (0%)

Query: 64 DDAPASSTAAAAPAAAAPATAAPSTPAPVVNPPVVNAPAPVTAAPVAT-APAPLAPPAAV 122
A S APA P A P PVV P P P P P
Sbjct: 45 APAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPK 104

Query: 123 QMPAAPVTSNAAPPAPAAPVASPAPAATTVAVAPAAGQPAPAAAA 167
P V P + T A ++ A +
Sbjct: 105 PKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKP 149



Score = 36.1 bits (83), Expect = 9e-05
Identities = 20/111 (18%), Positives = 28/111 (25%), Gaps = 3/111 (2%)

Query: 62 DDDDAPASSTAAAAPAAAAPATAAPSTPAPVVNPPVVNAPAPVTAAPVATAPAPLAPPAA 121
D + P + P P P P V+ P P P
Sbjct: 58 ADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKP---KPKPKPKPVKKVEQP 114

Query: 122 VQMPAAPVTSNAAPPAPAAPVASPAPAATTVAVAPAAGQPAPAAAADPNSP 172
+ + A+P AP + AT P + A N P
Sbjct: 115 KRDVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVASGPRALSRNQP 165


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BBta_6198  PF06580  37     1e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 37.2 bits (86), Expect = 1e-04
Identities = 27/127 (21%), Positives = 45/127 (35%), Gaps = 17/127 (13%)

Query: 272 RAAMLRDIDQVSRMLDETLDYLRDDTKAEQPARIELSSLLETI-------CCDFADVGHA 324
RA +L D + ML + +R + ++ L+ L + F D
Sbjct: 183 RALILEDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQ- 241

Query: 325 VTYQGPQRLVCEGRPRALSRAVTNVVENAVKHG-----SEVVVSLSVATD-GMIAIEVAD 378
Q ++ P L V +VEN +KHG + L D G + +EV +
Sbjct: 242 FENQINPAIMDVQVPPML---VQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVEN 298

Query: 379 DGPGIPQ 385
G +
Sbjct: 299 TGSLALK 305


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
BBta_6199  HTHFIS  84     1e-20    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 84.1 bits (208), Expect = 1e-20
Identities = 40/151 (26%), Positives = 74/151 (49%), Gaps = 4/151 (2%)

Query: 18 AARILCVEDDREIAALLTELLQEHGFAPRFVASAAAMNDVLQREQVDLIMLDLMLPDEDG 77
A IL +DD I +L + L G+ R ++AA + + DL++ D+++PDE+
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 78 MSICRRLRQV-STVPIIMVTAKGEDIDRILGLELGADDYVVKPFNPRELVARI-RALLRR 135
+ R+++ +P+++++A+ + I E GA DY+ KPF+ EL+ I RAL
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122

Query: 136 SQLHPAVIAARQAPMTFDGWRIEPATRTLFD 166
+ + Q M G A + ++
Sbjct: 123 KRRPSKLEDDSQDGMPLVG--RSAAMQEIYR 151


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
BBta_6201  RTXTOXIND  61     3e-12    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 61.0 bits (148), Expect = 3e-12
Identities = 19/152 (12%), Positives = 47/152 (30%), Gaps = 12/152 (7%)

Query: 116 DSLTQQNTLRTNEAALRNVRAQRDEKIATLALAEANMARQQVTLAQKASSRADYDSAEAT 175
++ + E + + L E+ + + +
Sbjct: 246 KQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLV---TQLFK----- 297

Query: 176 VKQTQAQIAQLDAQIAEAEVAIETARVNLAYTRITAPIDGTVLSI-VTQQGQTVNAVQSA 234
+ ++ Q I + + + I AP+ V + V +G V +
Sbjct: 298 -NEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAE-- 354

Query: 235 PTIVVLGQVETMTVRVEISEADVVKVKPGQNV 266
+V++ + +T+ V + D+ + GQN
Sbjct: 355 TLMVIVPEDDTLEVTALVQNKDIGFINVGQNA 386



Score = 54.1 bits (130), Expect = 5e-10
Identities = 25/182 (13%), Positives = 61/182 (33%), Gaps = 16/182 (8%)

Query: 25 KPQRRWSRLILGAAVALAVVALLVARYSRNPNANLVTAPVTIGDVEQTVLATGTLKPV-K 83
P R RL+ + V+A +++ +G VE A G L +
Sbjct: 51 TPVSRRPRLVAYFIMGFLVIAFILS---------------VLGQVEIVATANGKLTHSGR 95

Query: 84 LVAVGAQASGRLVTLNVELGQKIKAGDLIGEIDSLTQQNTLRTNEAALRNVRAQRDEKIA 143
+ + + + V+ G+ ++ GD++ ++ +L + +++L R ++
Sbjct: 96 SKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQI 155

Query: 144 TLALAEANMARQQVTLAQKASSRADYDSAEATVKQTQAQIAQLDAQIAEAEVAIETARVN 203
E N + + + + Q + Q + E+ ++ R
Sbjct: 156 LSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAE 215

Query: 204 LA 205

Sbjct: 216 RL 217


121  BBta_6239  BBta_6246  N        Y        N        Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BBta_6239  0       11       -0.041333  cation efflux membrane fusion protein
BBta_6240  -1      11       -0.157015  cation efflux system protein
BBta_6241  2       19       -1.828834  hypothetical protein
BBta_6242  2       15       -0.454578  bacterioferritin
BBta_6243  2       12       0.143120   bacterioferritin-associated ferredoxin
BBta_6244  1       11       -0.483526  hypothetical protein
BBta_6245  1       12       -0.598476  hypothetical protein
BBta_6246  -1      12       -0.311444  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
BBta_6239  RTXTOXIND  42     2e-06    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 42.1 bits (99), Expect = 2e-06
Identities = 22/187 (11%), Positives = 56/187 (29%), Gaps = 12/187 (6%)

Query: 57 SPTDKTLPYPAQIVIPTPQLWVVSAPVAGMVTNLAVGRGDRVTTGQPLLTLESPSFVSLQ 116
+ ++ + + + +V + V G+ V G LL L + +
Sbjct: 78 GQVEIVATANGKLT-HSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADT 136

Query: 117 REYLHALAQEVLAAQQLKRNADLFDGKAVPQRVLESSQAEARQASIVVAERRQMLHLSGL 176
+ +L Q L + + + + +P+ L + V ++
Sbjct: 137 LKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIK---- 192

Query: 177 SDDAIARLTNEAAISATLTVNAPQAASVVEIAVSPGQRMEQSAPLV--KLARLSPLWVEI 234
+ + +N + + ++ R E + + +L S L +
Sbjct: 193 -----EQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQ 247

Query: 235 AIPATNI 241
AI +
Sbjct: 248 AIAKHAV 254



Score = 34.0 bits (78), Expect = 8e-04
Identities = 37/225 (16%), Positives = 71/225 (31%), Gaps = 51/225 (22%)

Query: 124 AQEVLAAQQLKRNADLFDGKAVPQ-RVLE---------------SSQAEARQASIVVAER 167
+ +L + L +A+ + VLE SQ E ++ I+ A+
Sbjct: 228 NLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKE 287

Query: 168 RQMLHLSGLSDDAIARLTNEAAISATLT--------------VNAPQAASVVEIAV-SPG 212
L ++ + +L LT + AP + V ++ V + G
Sbjct: 288 EYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEG 347

Query: 213 QRMEQSAPLVKLARLS-PLWVEIAIPATNIGAIKIGAKVEI--DGYP------TPGRVIL 263
+ + L+ + L V + +IG I +G I + +P G+V
Sbjct: 348 GVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKN 407

Query: 264 VS------ETTDAATQTI--LVRAEIPNNG---ELHSGQTAAARI 297
++ + I + + L SG A I
Sbjct: 408 INLDAIEDQRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVTAEI 452


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_6240  ACRIFLAVINRP  691    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 691 bits (1784), Expect = 0.0
Identities = 231/1037 (22%), Positives = 447/1037 (43%), Gaps = 40/1037 (3%)

Query: 6 LVAFALSQRLFVLLSVLLLIGAAAVFLPSLPIDAFPDVSPVQVKIIMKAPGLTPEEVEQR 65
+ F + + +F + ++L+ A A+ + LP+ +P ++P V + PG + V+
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 66 VTVPIELELLGLPNKRILRSTTKYA-LADITVDFEEGTDIYWARNQVSERLSNISRDFPD 124
VT IE + G+ N + ST+ A IT+ F+ GTD A+ QV +L + P
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 125 GVSGGLAPITSPLGEMFM---FTIDSPELSLAERRSLLDWVIRPALRTVPGVADVNALGG 181
V + M F D+P + + + ++ L + GV DV G
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 182 QVRAFEIVPRTDALAARGISYELFRKAIEANSRNDGAGRVNQGEDSALVRIEGSIRGIDD 241
Q A I D L ++ ++ + AG++ ++ SI
Sbjct: 181 Q-YAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239

Query: 242 IKA-------IVVDTRDGIPIRVSDVARVRIGALTRYGAVTADGKGETVEGLVLGLRGAN 294
K + DG +R+ DVARV +G +GK + GAN
Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGK-PAAGLGIKLATGAN 298

Query: 295 AGQLVRDVRDRLEELKPSLPASVSINVFYDRSKLVNRAVGTVVRALGEATVLVVVLLLLF 354
A + ++ +L EL+P P + + YD + V ++ VV+ L EA +LV +++ LF
Sbjct: 299 ALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLF 358

Query: 355 LGNWRASLVIALSLPLAIAMALLVMRAVGMSANLMSLGGLAIAIGMLIDALVVVVENIVG 414
L N RA+L+ +++P+ + ++ A G S N +++ G+ +AIG+L+D +VVVEN+
Sbjct: 359 LQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVE- 417

Query: 415 NLGKHDQSRNAPLVHIVFRSVCEVLQPVASGVLIIIIVFVPLLTLQGLEGKLFIPVALAI 474
+ P +S+ ++ + +++ VF+P+ G G ++ ++ I
Sbjct: 418 ---RVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITI 474

Query: 475 IFALAASLLLALTVIPVATSFALT--SASHHDPL---------LIRAAQRVYAPALGWAL 523
+ A+A S+L+AL + P + L SA HH+ + Y ++G L
Sbjct: 475 VSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKIL 534

Query: 524 GNERKVILAAMLCLVAAGYGYTRLGKTFMPTMDEGDVIVSVETLPSVNLDESLAINARLQ 583
G+ + +L L + + RL +F+P D+G + ++ + + + ++
Sbjct: 535 GSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVT 594

Query: 584 AALMKVPDVAGIIARTGSDELGLDPMGPNQTDTFLVLKPAEQR--QSENREALLQKLRDV 641
+K A + + + N F+ LKP E+R + EA++ + +
Sbjct: 595 DYYLKNE-KANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAK-- 651

Query: 642 LSDFPGISLSFTQPIDM-RVQEMISGVRGDVA-VKIFGPDIAQLNDIAARLSTILS-GID 698
+ I F P +M + E+ + D + G L +L + +
Sbjct: 652 -MELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPA 710

Query: 699 GAEDVYTTLNEGAQYYTVTVNRLEAGRLGLTVDSIATSLRTQIEGRTIGTALESGRRTPI 758
V E + + V++ +A LG+++ I ++ T + G + ++ GR +
Sbjct: 711 SLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKL 770

Query: 759 LVRGSEATREAPTLLAMLPLTLAAGQHVALSQVARIQRVDGPVKIDREDGNRMSVVRSNV 818
V+ R P + L + A G+ V S V G +++R +G ++
Sbjct: 771 YVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEA 830

Query: 819 RGRDMVGFVQAAQQKVAAELQLPAGYRLTWGGQFENQQRAAARLSVVVPVAIGLIFVLLF 878
G A + +A+ +LPAG W G ++ + + +V ++ ++F+ L
Sbjct: 831 APGTSSGDAMALMENLAS--KLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLA 888

Query: 879 TTFGSVRQALLVLINIPFALIGGVFALVSTGEYLSVPASVGFIALLGIAVLNGVVLVSHF 938
+ S + V++ +P ++G + A + V VG + +G++ N +++V
Sbjct: 889 ALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFA 948

Query: 939 NQL-RTQGLAEHLIVVEGAKRRLRPVLMTASITALGLVPLLFASGPGSEVQRPLAIVVIG 997
L +G + + RLRP+LMT+ LG++PL ++G GS Q + I V+G
Sbjct: 949 KDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMG 1008

Query: 998 GLLSSTLLTLILLPILY 1014
G++S+TLL + +P+ +
Sbjct: 1009 GMVSATLLAIFFVPVFF 1025



Score = 90.7 bits (225), Expect = 2e-20
Identities = 83/513 (16%), Positives = 160/513 (31%), Gaps = 43/513 (8%)

Query: 5 RLVAFALSQRLFVLLSVLLLIGAAAVFLPSLPIDAFPDVSPVQVKIIMKAPGLTPEEVEQ 64
V L LL L++ V LP P+ +++ P +E Q
Sbjct: 528 NSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQ 587

Query: 65 RVTVPIELELLGLPNKRILRSTTKYAL---------ADITVDFEEGTDIYWARNQVSERL 115
+V + L + T V + + N +
Sbjct: 588 KVLDQVTDYYLKNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVI 647

Query: 116 SNISRDFPDGVSGGLAPITSP----LGEMFMFTIDSPELSLAERRSLLDW---VIRPALR 168
+ G + P P LG F + + + +L ++ A +
Sbjct: 648 HRAKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQ 707

Query: 169 TVPGVADVNALGGQVRA--FEIVPRTDALAARGISYELFRKAIEANSRNDGAGRVNQGED 226
+ V G F++ + A G+S + I
Sbjct: 708 HPASLVSVR-PNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGR 766

Query: 227 SALVRIEGSIR---GIDDIKAIVVDTRDGIPIRVSDVARVRIGA----LTRY-GAVTADG 278
+ ++ + +D+ + V + +G + S L RY G + +
Sbjct: 767 VKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEI 826

Query: 279 KGETVEGLVLGLRGANAGQLVRDVRDRLEELKPSLPASVSINVFYDRSKLVNRAVGTVVR 338
+GE G G D +E L LPA + + + S +
Sbjct: 827 QGEAAPGTSSG-----------DAMALMENLASKLPAGIGYD-WTGMSYQERLSGNQAPA 874

Query: 339 ALGEATVLVVVLLLLFLGNWRASLVIALSLPLAIAMALLVMRAVGMSANLMSLGGLAIAI 398
+ + V+V + L +W + + L +PL I LL ++ + GL I
Sbjct: 875 LVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTI 934

Query: 399 GMLIDALVVVVENIVGNLGKHDQSRNAPLVHIVFRSVCEVLQPVASGVLIIIIVFVPLLT 458
G+ +++VE + K + +V +V L+P+ L I+ +PL
Sbjct: 935 GLSAKNAILIVEFAKDLMEKEGKG----VVEATLMAVRMRLRPILMTSLAFILGVLPLAI 990

Query: 459 LQGLEGKLFIPVALAIIFALAASLLLALTVIPV 491
G V + ++ + ++ LLA+ +PV
Sbjct: 991 SNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPV 1023


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
BBta_6242  HELNAPAPROT  29     0.007    Helicobacter neutrophil-activating protein A family ...
		>HELNAPAPROT#Helicobacter neutrophil-activating protein A family signature.

Length = 153

Score = 28.7 bits (64), Expect = 0.007
Identities = 16/94 (17%), Positives = 34/94 (36%), Gaps = 8/94 (8%)

Query: 53 EHADKFTDRILFLDGFPNMQV--------LDPLRIGQNVKEIIECDLAAEIAARTLYQEA 104
E D +R+L + G P V + + E+++ + + +
Sbjct: 59 ETVDTIAERLLAIGGQPVATVKEYTEHASITDGGNETSASEMVQALVNDYKQISSESKFV 118

Query: 105 ATYCHGVKDYVSRDLFEQLMKDEEHHIDFLETQL 138
+D + DLF L+++ E + L + L
Sbjct: 119 IGLAEENQDNATADLFVGLIEEVEKQVWMLSSYL 152


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BBta_6244  cloacin  29     0.002    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 28.9 bits (64), Expect = 0.002
Identities = 20/45 (44%), Positives = 24/45 (53%), Gaps = 1/45 (2%)

Query: 45 GVSDSGTDGSGLSSWFFNTSTDAMGNPIDGGGDSGGGDGGGDGGG 89
GV +DGSG SS N G+ I GG SG G+GGG+G
Sbjct: 28 GVGGGASDGSGWSSEN-NPWGGGSGSGIHWGGGSGHGNGGGNGNS 71



Score = 28.1 bits (62), Expect = 0.004
Identities = 14/37 (37%), Positives = 18/37 (48%)

Query: 49 SGTDGSGLSSWFFNTSTDAMGNPIDGGGDSGGGDGGG 85
SG DG G ++ +TS + G P G G DG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSG 38


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
BBta_6245  OMPADOMAIN  30     0.002    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 29.5 bits (66), Expect = 0.002
Identities = 22/87 (25%), Positives = 30/87 (34%), Gaps = 4/87 (4%)

Query: 1 MKKTLAALAAVATLSTAALAPAPAEARNGRVAAGV-AAGLIGGALIGGAIAAGSNPYYYG 59
MKKT A+ AVA A +A A + A + + I N G
Sbjct: 1 MKKTAIAI-AVALAGFATVAQAAPKDNTWYTGAKLGWSQYHDTGFINNNGPTHENQLGAG 59

Query: 60 P--GYTYSPGYSYGPGYSYYGGPAYVA 84
GY +P + GY + G Y
Sbjct: 60 AFGGYQVNPYVGFEMGYDWLGRMPYKG 86


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BBta_6246  TCRTETA  26     0.023    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 25.9 bits (57), Expect = 0.023
Identities = 14/49 (28%), Positives = 20/49 (40%)

Query: 6 TALVAVATIAGSLSVTAPAAKAGDVGAGVAAGLIGGALIGGAIASSRPA 54
T VA A IA A G + A G++ G ++GG + P
Sbjct: 112 TGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPH 160


122  BBta_6255  BBta_6267  N        Y        N        Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BBta_6255  -1      8        -0.331579  hypothetical protein
BBta_6256  -1      9        -1.705017  3-mercaptopyruvate sulfurtransferase
BBta_6257  0       10       -1.179884  hypothetical protein
BBta_6258  0       10       0.261927   hypothetical protein
BBta_6259  -2      10       0.843064   hypothetical protein
BBta_6260  -3      10       0.822381   ATPase
BBta_6261  -3      10       0.685587   response regulator receiver
BBta_6262  -3      10       0.168382   sensor histidine kinase
BBta_6263  -2      11       0.286800   hypothetical protein
BBta_6264  -1      11       -0.647542  non-hemolytic phospholipase C
BBta_6265  -3      10       -0.429162  sensor histidine kinase
BBta_6267  -3      10       -0.117481  transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
BBta_6255  IGASERPTASE  42     4e-06    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 41.6 bits (97), Expect = 4e-06
Identities = 43/259 (16%), Positives = 72/259 (27%), Gaps = 31/259 (11%)

Query: 76 QTPPATVTDTPATAVLPAEQPSVPP-AAETVEVTTTAESPALPTPSLPAETVDTTAATVL 134
QT T TP A+ PSVP E V P P P+ P+ET +T A
Sbjct: 990 QTVDTTNITTPNNI--QADVPSVPSNNEEIARVDEAPVPP--PAPATPSETTETVAENSK 1045

Query: 135 PKTLTTPVDRIESAAP---DRGMADDAARTVAAVEKLADKPRL-----DTTPQAAPAVPT 186
++ T + ++ +R +A +A V A + + + +T T
Sbjct: 1046 QESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETAT 1105

Query: 187 DA--------VAAPTSSPQVLASADPAITTPAV--PAPAPATPADDPIAAKLAALGEQPT 236
P+V + P P PA D + K T
Sbjct: 1106 VEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTT 1165

Query: 237 RQERPAAGEHRASVK--VASVQLRRS------IMKKRQAREKQMAARRAKQRRIAEHARA 288
A E ++V+ V + + Q + R
Sbjct: 1166 ADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRR 1225

Query: 289 ARQAAAQQAQFPDPFQQQP 307
+ ++ +
Sbjct: 1226 SVRSVPHNVEPATTSSNDR 1244



Score = 40.8 bits (95), Expect = 7e-06
Identities = 35/156 (22%), Positives = 51/156 (32%), Gaps = 18/156 (11%)

Query: 173 RLDTTPQAAPAVPTDAVAAPTSSPQVLASADPAITTPAVPAPAPATPADDPIAAKLAALG 232
R T P + A S P P VP PAPATP++ +
Sbjct: 988 RNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAP-VPPPAPATPSETTETVAENSKQ 1046

Query: 233 EQ------------PTRQERPAAGEHRASVKVASVQLR----RSIMKKRQAREKQMAARR 276
E T Q R A E +++VK + S K+ Q E + A
Sbjct: 1047 ESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATV 1106

Query: 277 AKQRRIAEHARAARQAAAQQAQFPDPFQQQPGFGQP 312
K+ + ++ +Q P Q+Q QP
Sbjct: 1107 EKEEKAKVETEKTQEVPKVTSQVS-PKQEQSETVQP 1141


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
BBta_6259  BACINVASINB  25     0.037    Salmonella/Shigella invasin protein B signature.
		>BACINVASINB#Salmonella/Shigella invasin protein B signature.

Length = 593

Score = 25.1 bits (54), Expect = 0.037
Identities = 23/76 (30%), Positives = 38/76 (50%), Gaps = 2/76 (2%)

Query: 7 LIIGAIAGWLAGLIVRGAGFGLIGNIVVGIIGA--LVAGYVLPALHITLASGTLGAILDA 64
LI AI L GL V + G+IV I+ A +VA V+ A+ A+ LG L
Sbjct: 384 LIGKAITKALEGLGVDKKTAEMAGSIVGAIVAAIAMVAVIVVVAVVGKGAAAKLGNALSK 443

Query: 65 TIGAVIVLVILSLIRR 80
+G I ++ +++++
Sbjct: 444 MMGETIKKLVPNVLKQ 459


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
BBta_6261  HTHFIS  28     0.012    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 27.9 bits (62), Expect = 0.012
Identities = 22/117 (18%), Positives = 37/117 (31%), Gaps = 13/117 (11%)

Query: 11 ILVVEDEYMLADELRCEIGDAGATVLGPVGTVADALALAKREARIDGAVLDVNLRGEMAF 70
ILV +D+ + L + AG V A D V DV + E AF
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRI-TSNAATLWRWI-AAGDGDLVVTDVVMPDENAF 63

Query: 71 PVADLLIQR--GVPFIFTTGYDESVIPDRFANILRC------EKPIAVDGIIRALGR 119
+ + + +P + + + KP + +I +GR
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNT---FMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
BBta_6262  HTHFIS  102    1e-24    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 102 bits (256), Expect = 1e-24
Identities = 39/136 (28%), Positives = 67/136 (49%), Gaps = 5/136 (3%)

Query: 611 RARILLADDNADMRDYVRRLLEAR-YEVETVGDGEAALAAMARAKPDLVLSDVMMPRIDG 669
A IL+ADD+A +R + + L Y+V + +A DLV++DV+MP +
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 670 MQLLARVRA-DPKVSTVPIILLSARAGEESRVEGMQAGADDYLIKPFSASELLARVEAHL 728
LL R++ P +P++++SA+ + ++ + GA DYL KPF +EL+ + L
Sbjct: 63 FDLLPRIKKARPD---LPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119

Query: 729 KMAQLRAEATESLRAS 744
+ R E
Sbjct: 120 AEPKRRPSKLEDDSQD 135


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
BBta_6265  HTHFIS  56     2e-10    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 55.6 bits (134), Expect = 2e-10
Identities = 41/123 (33%), Positives = 52/123 (42%), Gaps = 10/123 (8%)

Query: 372 TILVVEDDALVRDYVVAQVRRLGYRTLSASNAVEGLALIDNPERIDLLFTDVIIPGGMNG 431
TILV +DDA +R + + R GY SNA I DL+ TDV++P N
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIA-AGDGDLVVTDVVMP-DENA 62

Query: 432 RQLALEAEKRRPGLKVLYTSGYT--ENAI--VHHGRLDADVLLLAKPYLSADLARMIRTA 487
L +K RP L VL S AI G D L KP+ +L +I A
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYD----YLPKPFDLTELIGIIGRA 118

Query: 488 LEA 490
L
Sbjct: 119 LAE 121


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BBta_6267  PF04605  30     0.007    Virulence-associated protein D (VapD)
		>PF04605#Virulence-associated protein D (VapD)

Length = 125

Score = 29.8 bits (67), Expect = 0.007
Identities = 9/60 (15%), Positives = 20/60 (33%), Gaps = 7/60 (11%)

Query: 89 EVATYQLDKRYLRQDGSV------VWARVIVNPIQDAADNVRWFCAVVEDITATRILEEQ 142
+ + L+ + + S + R ++ + W V++ T I EQ
Sbjct: 30 LIKKFMLENGFEHRQYSGYTSKEPINERRVIRIVNKLTKKFTWLGECVKEFDITEI-GEQ 88


123  BBta_6380  BBta_6387  N        Y        N        Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BBta_6380  -3      14       -0.119425  two-component sensor histidine kinase
BBta_6381  -3      12       0.077494   two-component LuxR family transcriptional
BBta_6382  -3      8        0.078948   hypothetical protein
BBta_6384  -2      8        0.624841   hypothetical protein
BBta_6385  -1      9        0.936917   peptidase
BBta_6387  0       12       0.128557   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
BBta_6380  HTHFIS  56     2e-10    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 56.0 bits (135), Expect = 2e-10
Identities = 28/118 (23%), Positives = 46/118 (38%), Gaps = 6/118 (5%)

Query: 472 DQPGVLVIDDDATTRDSLGRLLTNWGYECAPFEGGAPALRFLGQAAARKRWLVLLDYRLA 531
+LV DDDA R L + L+ GY+ A R++ LV+ D +
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGD---LVVTDVVMP 58

Query: 532 GTETGLNIADKIIAAYADRTRLVLMTGDIGPEIAEGARQRGVL-LLRKPVQPIRLSAL 588
E ++ +I A D +++M+ A A ++G L KP L +
Sbjct: 59 D-ENAFDLLPRIKKARPD-LPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGI 114


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
BBta_6381  HTHFIS  70     2e-16    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 69.9 bits (171), Expect = 2e-16
Identities = 25/102 (24%), Positives = 43/102 (42%), Gaps = 2/102 (1%)

Query: 6 SLLFVDDHPIYRDAVRRTLELEIEDLSVATAENCSATLALLAAHEVDLCLSDYRLPDGDG 65
++L DD R + + L V N + +AA + DL ++D +PD +
Sbjct: 5 TILVADDDAAIRTVLNQALS--RAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 66 LSLLKEVRTRYPLIAVGLLCADLSPALAEGAASLGAVVCLSK 107
LL ++ P + V ++ A + A A+ GA L K
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPK 104


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
BBta_6385  SUBTILISIN  143    2e-39    Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 143 bits (361), Expect = 2e-39
Identities = 65/254 (25%), Positives = 105/254 (41%), Gaps = 23/254 (9%)

Query: 139 QNKSWAVSRLRLQDAWKLSDELGRPSRGAGVAICQIDTGV-ISHPELAGVVRAGSYNVIG 197
V ++ W +RG GV + +DTG HP+L + G
Sbjct: 20 NEIPRGVEMIQAPAVWN-------QTRGRGVKVAVLDTGCDADHPDLKARIIGGRNFTDD 72

Query: 198 DGTTPEDPTDPLNYAGNPGHGTATAS-VAISPESLDVVGAAPKARHIPIRAIENVVRISQ 256
D PE D GHGT A +A + VVG AP+A + I+ +
Sbjct: 73 DEGDPEIFKD------YNGHGTHVAGTIAATENENGVVGVAPEADLLIIKVLNKQGSGQY 126

Query: 257 TSVAEAMDRAVALGADVISLSLGGIWSW-ALQRALERAVAADVIVVAAAGNC------VG 309
+ + + A+ D+IS+SLGG L A+++AVA+ ++V+ AAGN
Sbjct: 127 DWIIQGIYYAIEQKVDIISMSLGGPEDVPELHEAVKKAVASQILVMCAAGNEGDGDDRTD 186

Query: 310 LVVWPARFDDCIAVAGTDFHDKPWIGSCRGPTVAISAPGQNVYRAAATTGRSGQGQGTSF 369
+ +P +++ I+V +F S V + APG+++ + G+ GTS
Sbjct: 187 ELGYPGCYNEVISVGAINFDRHASEFSNSNNEVDLVAPGEDIL-STVPGGKYATFSGTSM 245

Query: 370 AVALIAGVAACWLA 383
A +AG A
Sbjct: 246 ATPHVAGALALIKQ 259


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BBta_6387  cloacin  38     7e-06    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 37.8 bits (87), Expect = 7e-06
Identities = 29/83 (34%), Positives = 32/83 (38%), Gaps = 8/83 (9%)

Query: 19 GASAPASSAPMTGLSAAAKPDAAASGTVAVRYGGRGFGGGWHGGGWHGGGWRGGGWGWGG 78
G AS G S+ P SG G GG HG G G GGG G GG
Sbjct: 28 GVGGGASDGS--GWSSENNPWGGGSG-----SGIHWGGGSGHGNGGGNGN-SGGGSGTGG 79

Query: 79 GALAAGALLGAGIAATTTPFAWG 101
A A + G A +TP A G
Sbjct: 80 NLSAVAAPVAFGFPALSTPGAGG 102


124  BBta_6556  BBta_6565  N        Y        N        Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias   Product
BBta_6556  0       11       1.339008  chemotaxis-specific methylesterase
BBta_6557  0       12       1.588974  chemotaxis protein CheY
BBta_6558  -1      13       2.282457  CheW protein
BBta_6559  0       14       2.551595  chemotaxis protein CheA
BBta_6560  2       13       2.950656  hypothetical protein
BBta_6561  4       14       3.033940  CysQ protein
BBta_6562  5       15       1.737040  hypothetical protein
BBta_6564  5       17       1.907670  hypothetical protein
BBta_6565  6       18       0.581004  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
BBta_6556  HTHFIS  60     5e-12    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 60.2 bits (146), Expect = 5e-12
Identities = 27/105 (25%), Positives = 48/105 (45%), Gaps = 4/105 (3%)

Query: 20 RVMVVDDSVVIRGMISRWIASEPDMVVAASLRTGLEAVNQIDRVNPDVAVLDIEMPELDG 79
++V DD IR ++++ ++ V S I + D+ V D+ MP+ +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITS--NAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 80 IAALPKLLAKKKNLAVIMASTLTRRNAEISFKALSLGAADYIPKP 124
LP++ + +L V++ S + KA GA DY+PKP
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQN--TFMTAIKASEKGAYDYLPKP 105


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
BBta_6557  HTHFIS  84     1e-22    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 84.5 bits (209), Expect = 1e-22
Identities = 27/105 (25%), Positives = 45/105 (42%), Gaps = 2/105 (1%)

Query: 5 LVVDDSSVIRKVARRILEGLDFQIVEAEDGEQALEICKRGLPDAVLLDWNMPVMDGYEFL 64
LV DD + IR V + L + + + G D V+ D MP + ++ L
Sbjct: 7 LVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDLL 66

Query: 65 GNLRRMPGGDAPKVVFCTTENDVAHIARALHAGANEYIMKPFDKD 109
+++ D P V+ + +N +A GA +Y+ KPFD
Sbjct: 67 PRIKKA-RPDLP-VLVMSAQNTFMTAIKASEKGAYDYLPKPFDLT 109


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
BBta_6559  HTHFIS  87     9e-20    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 86.8 bits (215), Expect = 9e-20
Identities = 38/134 (28%), Positives = 62/134 (46%), Gaps = 4/134 (2%)

Query: 795 QTQSVLLVDDSPFFRNMLAPVLKSAGYKVRTAASAIEGLATLRSGHTFDIVVTDIEMPEM 854
++L+ DD R +L L AGY VR ++A + +G D+VVTD+ MP+
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGD-GDLVVTDVVMPDE 60

Query: 855 NGFEFAEAIRADQNLHQLPVIAVSSLVSPAAIERGRQAGLYDYIAK-FDRPGLIAALKEQ 913
N F+ I+ + LPV+ +S+ + + + G YDY+ K FD LI +
Sbjct: 61 NAFDLLPRIKKAR--PDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118

Query: 914 IEERARADAAKRAA 927
+ E R +
Sbjct: 119 LAEPKRRPSKLEDD 132


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
BBta_6564  GPOSANCHOR  33     6e-04    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 33.1 bits (75), Expect = 6e-04
Identities = 12/39 (30%), Positives = 14/39 (35%), Gaps = 1/39 (2%)

Query: 7 AQQAPPPAAPPYQAAPPYQQP-APPYQQPAPPPRPAPSP 44
A+QA A A Q P A P + P AP
Sbjct: 449 AKQAEELAKLRAGKASDSQTPDAKPGNKAVPGKGQAPQA 487


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
BBta_6565  GPOSANCHOR  46     2e-07    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 46.2 bits (109), Expect = 2e-07
Identities = 50/249 (20%), Positives = 88/249 (35%), Gaps = 14/249 (5%)

Query: 81 QLADLGKKTDAINRMKIELGEKNAAIFALEAREKALKDQLRATEEEFAAKTEALRAAEQA 140
+ ADL K + K + A +A A K L E + A A +
Sbjct: 121 RKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKT 180

Query: 141 LKDKQAELVRLTTELNDKSLLADSRQVELVSVRAQVEELRTRVAEAEKEFAATQARLALE 200
L+ ++A L EL A + + A+++ L A A + L
Sbjct: 181 LEAEKAALEARQAELEKALEGAMN---FSTADSAKIKTLEAEKAALAARKADLEKALEGA 237

Query: 201 RDESDTATKALNDARARVENLSQRVTELDRQLIVQVKEAEMLATRVADLEGRLATQGKLL 260
+ S + + A L R EL++ L + + + ++ LE A
Sbjct: 238 MNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEK 297

Query: 261 AEREFENNQLREANAAADRALKELRDEIAGFGSGKSAALERLKSEKAALEEQLQAARDER 320
A+ E ++ L + R L R+ L++E LEEQ + + R
Sbjct: 298 ADLEHQSQVLNANRQSLRRDLDASREAKKQ-----------LEAEHQKLEEQNKISEASR 346

Query: 321 TKLQREMNA 329
L+R+++A
Sbjct: 347 QSLRRDLDA 355



Score = 40.4 bits (94), Expect = 1e-05
Identities = 60/354 (16%), Positives = 122/354 (34%), Gaps = 11/354 (3%)

Query: 47 AEIQADKDQLRAEFAMSARRLELSVDALKNKATSQLADLGKKTDAINRMKIELGEKNAAI 106
A +AD ++ + + L+ + + A + A+ +A I
Sbjct: 154 AARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKI 213

Query: 107 FALEAREKALKDQLRATEEEFAAKTEALRAAEQALKDKQAELVRLTTELNDKSLLADSRQ 166
LEA + AL + E+ A +K +AE L + +
Sbjct: 214 KTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAM 273

Query: 167 VELVSVRAQVEELRTRVAEAEKEFAATQARLALERDESDTATKALNDARARVENLSQRVT 226
+ A+++ L A E E A + + + + + L+ +R + L
Sbjct: 274 NFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQ 333

Query: 227 ELDRQLIVQVKEAEMLATRVADLEGRLATQGKLLAEREFENNQLREANAAADRALKELRD 286
+L+ Q ++ EA L L + + E E+ +L E N ++ + + LR
Sbjct: 334 KLEEQN--KISEASR-----QSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRR 386

Query: 287 EIAGFGSGKSAALERLKSEKAALEEQLQAARDERTKLQREMNAIQQQAESTWAQERMENA 346
++ A ++++ +L A +L+ +++ A+ E
Sbjct: 387 DLDA----SREAKKQVEKALEEANSKLAALEKLNKELEESKKLTEKEKAELQAKLEAEAK 442

Query: 347 LLRERINDIAAEVAKLAMQIEGPNSTIEAMLAAEPVSAKPANANGAPAPAAAGA 400
L+E++ A E+AKL + T +A + V K P A
Sbjct: 443 ALKEKLAKQAEELAKLRAGKASDSQTPDAKPGNKAVPGKGQAPQAGTKPNQNKA 496



Score = 39.7 bits (92), Expect = 2e-05
Identities = 39/263 (14%), Positives = 88/263 (33%)

Query: 74 LKNKATSQLADLGKKTDAINRMKIELGEKNAAIFALEAREKALKDQLRATEEEFAAKTEA 133
+K + L K ++ L + N + + K + + E A+K +
Sbjct: 58 RADKFEIENNTLKLKNSDLSFNNKALKDHNDELTEELSNAKEKLRKNDKSLSEKASKIQE 117

Query: 134 LRAAEQALKDKQAELVRLTTELNDKSLLADSRQVELVSVRAQVEELRTRVAEAEKEFAAT 193
L A + L+ + +T + K ++ + L + +A +E+ +A
Sbjct: 118 LEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAK 177

Query: 194 QARLALERDESDTATKALNDARARVENLSQRVTELDRQLIVQVKEAEMLATRVADLEGRL 253
L E+ + L A N S + + L + +
Sbjct: 178 IKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGA 237

Query: 254 ATQGKLLAEREFENNQLREANAAADRALKELRDEIAGFGSGKSAALERLKSEKAALEEQL 313
+ + + A A L++ + F + SA ++ L++EKAALE +
Sbjct: 238 MNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEK 297

Query: 314 QAARDERTKLQREMNAIQQQAES 336
+ L ++++ ++
Sbjct: 298 ADLEHQSQVLNANRQSLRRDLDA 320


125  BBta_6668  BBta_6676  N        Y        N        Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias   Product
BBta_6668  0       10       1.932974  hypothetical protein
BBta_6669  -2      9        1.564635  hypothetical protein
BBta_6670  -1      10       0.958725  hypothetical protein
BBta_6672  -1      9        0.654085  short-chain dehydrogenase/reductase
BBta_6673  -1      9        0.518343  hemolysin III
BBta_6674  -2      9        0.704127  hypothetical protein
BBta_6675  -2      8        0.418920  hypothetical protein
BBta_6676  -1      10       0.381446  glycosyl transferase family protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
BBta_6668   PYOCINKILLER   29      0.031     Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 29.4 bits (65), Expect = 0.031
Identities = 47/207 (22%), Positives = 71/207 (34%), Gaps = 7/207 (3%)

Query: 64 AQDAAAKAAAAAAAKAAQDAAAKSAAAAAAKAAQDAAAKQAAKA-AADAAAKAAANAAAK 122
AA + AAAA A++ AA A A + A+ AA +AA A A A AA +
Sbjct: 206 TLTAAKASIEAAAANKAREQAAAEAKRKAEEQARQQAAIRAANTYAMPANGSVVATAAGR 265

Query: 123 AATNAAASAASKAATTAATAANAAKTTVAVATTGTAGTTNTTTVTAKNGAATATATATGN 182
A AAS A + A + +A+ + ++T + A T +
Sbjct: 266 GLIQVAQGAASLAQAISDAIAVLGRV---LASAPSVMAVGFASLTYSSRTAEQWQDQTPD 322

Query: 183 GTGTGFGTSTAAAKPAAA---TTATATTGTATKGAATTGTAAATTTKTAAATTTSTTAAA 239
G A + +GT T A TT + +T +
Sbjct: 323 SVRYALGMDAAKLGLPPSVNLNAVAKASGTVDLPMRLTNEARGNTTTLSVVSTDGVSVPK 382

Query: 240 AKPATTSSTTTTTVAVAKTTAAPAAAA 266
A P ++ TT T + A A
Sbjct: 383 AVPVRMAAYNATTGLYEVTVPSTTAEA 409


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
BBta_6669   IGASERPTASE    32      0.001     IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 32.0 bits (72), Expect = 0.001
Identities = 25/146 (17%), Positives = 46/146 (31%), Gaps = 5/146 (3%)

Query: 6 TQAAQAAATQNQKTVEAALNQKIGALDAAVEAAEQKVNADRQALEQARQAAQQAKQDAQE 65
T Q + N++I +D A A E A+ +KQ+++
Sbjct: 995 TNITTPNNIQADVPSVPSNNEEIARVDEAPVPPP----APATPSETTETVAENSKQESKT 1050

Query: 66 AAEQVERANNAANQAALAAANAKAQTTANAFNQANATLNADNQALTQARESGEQQEQALR 125
+ + A Q A AK+ AN + TQ E+ E
Sbjct: 1051 VEKNEQDATETTAQNREVAKEAKSNVKANT-QTNEVAQSGSETKETQTTETKETATVEKE 1109

Query: 126 AAAQQQIQAGYTAAQQATQAAWAEAQ 151
A+ + + + +Q + + Q
Sbjct: 1110 EKAKVETEKTQEVPKVTSQVSPKQEQ 1135


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
BBta_6672   DHBDHDRGNASE   75      6e-18     2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 74.7 bits (183), Expect = 6e-18
Identities = 53/202 (26%), Positives = 89/202 (44%), Gaps = 17/202 (8%)

Query: 3 VTGKIVVVTGGARGIGKALCEAFARAGAAKVIVADLDETAAKAVAESVGGAA-----FKC 57
+ GKI +TG A+GIG+A+ A G A + D + + V S+ A F
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQG-AHIAAVDYNPEKLEKVVSSLKAEARHAEAFPA 64

Query: 58 DVSQETDISRVIEETERQFGPIALFCSNAGIGGGFDPLAVNVGGT---TDEPWARSWAIH 114
DV I + ER+ GPI + + AG+ + G +DE W +++++
Sbjct: 65 DVRDSAAIDEITARIEREMGPIDILVNVAGV--------LRPGLIHSLSDEEWEATFSVN 116

Query: 115 VMAHVYAARHLIPRYKARGGGYFLNTISAAGLLSQVGSAPYSTTKHAAVGFAENLAISHK 174
A+R + R G + S + + A Y+++K AAV F + L +
Sbjct: 117 STGVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELA 176

Query: 175 ADNIKVSILCPQGVDTDMLRSI 196
NI+ +I+ P +TDM S+
Sbjct: 177 EYNIRCNIVSPGSTETDMQWSL 198


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
BBta_6676   PF04335        31      0.009     VirB8 type IV secretion protein
		>PF04335#VirB8 type IV secretion protein

Length = 227

Score = 31.0 bits (70), Expect = 0.009
Identities = 11/35 (31%), Positives = 14/35 (40%)

Query: 131 LVRLVESDDGRWWLVAGLFGGLALLSKFTVVMLLP 165
+ E W+VAG+ G LA V L P
Sbjct: 24 KLAAAERSKKLAWVVAGVAGALATAGVVAVAALTP 58
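
Note: each alignment on this page is preceded by a "Score = ..., Expect = ..." line and an "Identities = ..." line. The sketch below shows one way to read those two lines into numbers and to compare a hit with the 0.05 E-value threshold listed under the input parameters. It is an illustrative helper only, not part of PredictBias or BLAST; the function names (parse_hsp_stats, percent, passes_threshold) are hypothetical, and the integer truncation in percent() is an assumption that happens to agree with the percentages printed on this page.

```python
import re

# Patterns for the two statistics lines printed above each alignment block.
SCORE_RE = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)"
)
FRACTION_RE = re.compile(r"(Identities|Positives|Gaps)\s*=\s*(\d+)/(\d+)")


def parse_hsp_stats(score_line, fraction_line):
    """Return bit score, raw score, E-value and match fractions as a dict."""
    match = SCORE_RE.search(score_line)
    if match is None:
        raise ValueError("not a Score line: %r" % score_line)
    expect = match.group(3)
    # BLAST sometimes prints very small E-values as "e-100"; make that parseable.
    if expect.startswith("e"):
        expect = "1" + expect
    stats = {
        "bits": float(match.group(1)),
        "raw_score": int(match.group(2)),
        "evalue": float(expect),
    }
    # Gaps are optional: the field is simply absent for ungapped alignments.
    for name, num, den in FRACTION_RE.findall(fraction_line):
        stats[name.lower()] = (int(num), int(den))
    return stats


def percent(fraction):
    """Integer percentage of a (numerator, denominator) pair, truncated."""
    num, den = fraction
    return 100 * num // den


def passes_threshold(stats, max_evalue=0.05):
    """True when the hit's E-value is at or below the chosen cutoff."""
    return stats["evalue"] <= max_evalue


if __name__ == "__main__":
    hit = parse_hsp_stats(
        "Score = 31.0 bits (70), Expect = 0.009",
        "Identities = 11/35 (31%), Positives = 14/35 (40%)",
    )
    print(hit, percent(hit["identities"]), passes_threshold(hit))
```

The two lines used in the example are the statistics of the BBta_6676 / PF04335 hit directly above; any other pair of statistics lines on the page can be passed in the same way.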


126   BBta_6688   BBta_6695   N   Y   N   Pathogenicity Island (unbiased-composition)
LocusTag    DNBias   CDNBias   %GCBias     Product
BBta_6688   -1       19        0.170782    hypothetical protein
BBta_6689   -1       15        0.433001    hypothetical protein
BBta_6690   0        12        0.130445    ABC transporter periplasmic substrate-binding
BBta_6691   0        8         0.637629    ABC transporter
BBta_6692   -1       7         0.933278    ABC transporter ATP-binding protein
BBta_6693   -1       7         0.870532    two component transcriptional regulator
BBta_6694   -1       8         0.682582    two-component system histidine kinase
BBta_6695   -3       9         0.354113    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
BBta_6688   cloacin        31      0.006     Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 31.2 bits (70), Expect = 0.006
Identities = 16/56 (28%), Positives = 18/56 (32%), Gaps = 9/56 (16%)

Query: 247 AGMNTPGSPRRGGWTSGSSGG---------WSSGSSSDSGSFSGGGGSFGGGGASG 293
N G P G G+S G W GS S G G GGG +
Sbjct: 16 TSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNS 71



Score = 29.7 bits (66), Expect = 0.013
Identities = 18/52 (34%), Positives = 25/52 (48%)

Query: 243 ADSFAGMNTPGSPRRGGWTSGSSGGWSSGSSSDSGSFSGGGGSFGGGGASGS 294
A +G ++ +P GG SG G SG + G+ + GGGS GG S
Sbjct: 33 ASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAV 84


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
BBta_6692   PF05272        28      0.038     Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.1 bits (62), Expect = 0.038
Identities = 10/19 (52%), Positives = 12/19 (63%)

Query: 47 VLLGPSGCGKSTLLKAIGG 65
VL G G GKSTL+ + G
Sbjct: 600 VLEGTGGIGKSTLINTLVG 618


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
BBta_6693   HTHFIS         94      2e-24     FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 93.7 bits (233), Expect = 2e-24
Identities = 42/141 (29%), Positives = 63/141 (44%)

Query: 2 RILLVEDTPEIGAAVTSRFERIGYAVDWEKDGRTASELIEVQTYDLIVLDVMLPNMDGFA 61
IL+ +D I + R GY V + T I DL+V DV++P+ + F
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 VLKHLRKRGLRTPVLVLTARSAVDDRIGALDLGADDYLIKPFDYRELEARARALLRRAAG 121
+L ++K PVLV++A++ I A + GA DYL KPFD EL L
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 122 QSDNLLTLGPLVIDRAGRTAS 142
+ L + GR+A+
Sbjct: 125 RPSKLEDDSQDGMPLVGRSAA 145


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
BBta_6694   PF06580        30      0.025     Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 29.8 bits (67), Expect = 0.025
Identities = 15/100 (15%), Positives = 33/100 (33%), Gaps = 21/100 (21%)

Query: 364 LIHNALAHGARTR-----LMVRVERAAAHVAIVVWDDGPGIAAEAQQHLVSPFQKGDGSH 418
L+ N + HG ++++ + V + V + G ++
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKE------------- 309

Query: 419 GSGLGLAIAAE-VAQAHGG--SLRFAGGAGDFSVRFEIPA 455
+G GL E + +G ++ + G + IP
Sbjct: 310 STGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIPG 349


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
BBta_6695   SYCDCHAPRONE   36      8e-05     Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 36.1 bits (83), Expect = 8e-05
Identities = 18/115 (15%), Positives = 35/115 (30%), Gaps = 3/115 (2%)

Query: 59 ALLPTDEELLLSRSSTALRLGDGTLGRSLIEKVLTLNPSNPDAMWRVGAGYEDAGDYRTA 118
+ E L S + + G + + + L+ + +GA + G Y A
Sbjct: 30 EISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLA 89

Query: 119 IRFYAMALTIAPRNQYALLFSSTSNQGLRRFDEA---LKTADALVGLGPDEINLK 170
I Y+ + + ++ EA L A L+ + L
Sbjct: 90 IHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQELIADKTEFKELS 144



Score = 31.1 bits (70), Expect = 0.004
Identities = 18/123 (14%), Positives = 38/123 (30%), Gaps = 17/123 (13%)

Query: 88 IEKVLTLNPSNPDAMWRVGAGYEDAGDYRTAIRFYAMALTIAPRNQYALLFSSTSNQGLR 147
I + ++ + ++ + +G Y A + + + + L Q +
Sbjct: 25 IAMLNEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMG 84

Query: 148 RFDEALKTADALVGLGPDEINLKPYFDYQGNRLDFYFIALKNRARIHWMLGHLDRAEQDL 207
++D A+ + + E P F + + A G L AE L
Sbjct: 85 QYDLAIHSYSYGAIMDIKE----PRFPF-------------HAAECLLQKGELAEAESGL 127

Query: 208 NAA 210
A
Sbjct: 128 FLA 130


127   BBta_6716   BBta_6722   N   Y   N   Pathogenicity Island (unbiased-composition)
LocusTag    DNBias   CDNBias   %GCBias     Product
BBta_6716   -2       10        1.912602    Fis family transcriptional regulator
BBta_6717   -1       11        1.634238    methyl-accepting chemotaxis sensory transducer
BBta_6718   0        11        1.018158    hypothetical protein
BBta_6719   1        14        0.996539    catalase
BBta_6720   1        12        -0.714042   hypothetical protein
BBta_6721   -1       10        -1.175935   cytochrome P450
BBta_6722   1        13        -1.389193   signal transduction histidine kinase
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
BBta_6716   HTHFIS         58      2e-11     FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 57.9 bits (140), Expect = 2e-11
Identities = 20/98 (20%), Positives = 38/98 (38%), Gaps = 6/98 (6%)

Query: 251 SEREINSLLAVDRNDLVIGATRSARRALGLSASALSRPVPAGDLIKGARAEGDDIDAAER 310
SE + + + +++ + ++ +P L + E
Sbjct: 385 SEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLYD------RVLAEMEY 438

Query: 311 AVLKRALARAGGNVSKAAKELDLSRATLHRKMKRLGLE 348
++ AL GN KAA L L+R TL +K++ LG+
Sbjct: 439 PLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVS 476


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
BBta_6718   OMPADOMAIN     35      5e-04     OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 35.3 bits (81), Expect = 5e-04
Identities = 22/111 (19%), Positives = 36/111 (32%), Gaps = 18/111 (16%)

Query: 432 TATVRAGYLVTPEVLLYARGGAAWTRTSVTAVNAANGQSASAAFNRSGWTVGAGVEWMFA 491
T + GY +T ++ +Y R G R + + GVE+
Sbjct: 99 QLTAKLGYPITDDLDIYTRLGGMVWRADTKSNVYGKNHDTGVSP-----VFAGGVEYAIT 153

Query: 492 RNWSAFAEYDYADFGTVTGRLPGAAAVTGGPDVVSLNTKIHTALVGVNYRF 542
+ EY + T + A + PD +GV+YRF
Sbjct: 154 PEIATRLEYQW------TNNIGDAHTIGTRPD-------NGMLSLGVSYRF 191


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
BBta_6720   IGASERPTASE    33      4e-04     IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 33.5 bits (76), Expect = 4e-04
Identities = 20/93 (21%), Positives = 33/93 (35%), Gaps = 15/93 (16%)

Query: 68 GEPAIKGRLISAEEARLRELPPPPPDPEREKMERLRKLRLEKEEAEKLAAAARGGSAAHA 127
P++ + E AR+ E P PPP P E E +A ++ S
Sbjct: 1006 DVPSVPSN--NEEIARVDEAPVPPPAPA-----------TPSETTETVAENSKQESKT-V 1051

Query: 128 ATRARGAADTTSRTGRAGAAKPARAAAAAPKTL 160
+ A +TT++ A + A +T
Sbjct: 1052 EKNEQDATETTAQNREV-AKEAKSNVKANTQTN 1083


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
BBta_6722   PF06580        37      9e-05     Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 36.8 bits (85), Expect = 9e-05
Identities = 16/61 (26%), Positives = 26/61 (42%), Gaps = 3/61 (4%)

Query: 252 LIVTELVINSLKHAFPEDAANGLIVVSYESCGAGWTLSVSDNGTGKAQVNGRVKGGLGTG 311
++V LV N +KH + G I++ TL V + G+ + K GTG
Sbjct: 258 MLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALK---NTKESTGTG 314

Query: 312 I 312
+
Sbjct: 315 L 315


128   BBta_6931   BBta_6940   N   Y   N   Pathogenicity Island (unbiased-composition)
LocusTag    DNBias   CDNBias   %GCBias     Product
BBta_6931   -2       13        1.011724    beta-lactamase
BBta_6932   -2       15        0.847110    inositol monophosphatase
BBta_6933   -3       14        0.778023    hypothetical protein
BBta_6934   -2       14        1.005257    AcrB/AcrD/AcrF family mulitdrug efflux protein
BBta_6935   -3       13        2.026324    AcrB/AcrD/AcrF family mulitdrug efflux protein
BBta_6936   -1       14        1.474237    cyclopropane-fatty-acyl-phospholipid synthase
BBta_6937   -2       14        2.491922    hypothetical protein
BBta_6938   -2       15        2.004730    hypothetical protein
BBta_6939   -2       13        1.935419    hypothetical protein
BBta_6940   -2       13        2.289497    acetyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
BBta_6931   PF03544        36      1e-04     Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 36.1 bits (83), Expect = 1e-04
Identities = 24/90 (26%), Positives = 33/90 (36%)

Query: 2 TLLRLTYMLAVIAAATTAAHAQISLTPPTLQQPPADKPKPAERPTEKAADKPKPPAVTKK 61
TLL + AV+A + Q+ P Q PA+ +A P P V +
Sbjct: 18 TLLSVCIHGAVVAGLLYTSVHQVIELPAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPE 77

Query: 62 PAPAPAPTAKPAAPAPVASPQPPQPAATDP 91
P P P P AP + P+P P
Sbjct: 78 PEPEPIPEPPKEAPVVIEKPKPKPKPKPKP 107



Score = 35.7 bits (82), Expect = 2e-04
Identities = 18/71 (25%), Positives = 25/71 (35%)

Query: 22 AQISLTPPTLQQPPADKPKPAERPTEKAADKPKPPAVTKKPAPAPAPTAKPAAPAPVASP 81
+ + P +P + PK A EK KPKP K P KP P +
Sbjct: 70 PEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVESRPASPF 129

Query: 82 QPPQPAATDPN 92
+ PA +
Sbjct: 130 ENTAPARPTSS 140



Score = 35.3 bits (81), Expect = 2e-04
Identities = 22/89 (24%), Positives = 30/89 (33%), Gaps = 9/89 (10%)

Query: 12 VIAAATTAAHAQISLTPPT-LQQPPADKPKPA-----ERPTEKAADKPKPPAV---TKKP 62
VI A +++ P L+ P A +P P E E + PK V KP
Sbjct: 40 VIELPAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKP 99

Query: 63 APAPAPTAKPAAPAPVASPQPPQPAATDP 91
P P P P +P + P
Sbjct: 100 KPKPKPKPVKKVEQPKRDVKPVESRPASP 128


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
BBta_6933   ENTEROVIROMP   27      0.037     Enterobacterial virulence outer membrane protein si...
		>ENTEROVIROMP#Enterobacterial virulence outer membrane protein signature.

Length = 171

Score = 27.2 bits (60), Expect = 0.037
Identities = 8/41 (19%), Positives = 18/41 (43%)

Query: 149 GRSTSTPAYAINGGVEFKPTSNISLSLGFSYAGQSSDRIDS 189
TS ++ G++F P N++L + + S + +
Sbjct: 122 KHDTSDYGFSYGAGLQFNPMENVALDFSYEQSRIRSVDVGT 162


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
BBta_6934   ACRIFLAVINRP   1051    0.0       Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 1051 bits (2719), Expect = 0.0
Identities = 520/1034 (50%), Positives = 696/1034 (67%), Gaps = 5/1034 (0%)

Query: 1 MPSFFIDRPIFAWVVALFICLIGAIAIPLLAVAQYPIIAPPSISISTSYPGASPENLYNS 60
M +FFI RPIFAWV+A+ + + GA+AI L VAQYP IAPP++S+S +YPGA + + ++
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 61 VTRLIEEELNGASGILNFESTSDSLGQVEITANFVPGTSTNDASVEVQNRLKRVEARLPR 120
VT++IE+ +NG ++ STSDS G V IT F GT + A V+VQN+L+ LP+
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 121 AVIQQGILVEEASAAVLQIITLQSTDGSLDEIGLGDFMIRNVLGEIRRIPGVGRATLYST 180
V QQGI VE++S++ L + S + + + D++ NV + R+ GVG L+
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 181 ERSLRVWLDPNKLIGYGLTADDVTKAIGAQNAQVASGSIGAEPATTVQRTSALVLVKGQL 240
+ ++R+WLD + L Y LT DV + QN Q+A+G +G PA Q+ +A ++ + +
Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240

Query: 241 DSPDEFGSIMLRANPDGSTVRLRDVARIEIGGLSYQFNTRLNGKPTAGLSVLLSPTGNAL 300
+P+EFG + LR N DGS VRL+DVAR+E+GG +Y R+NGKP AGL + L+ NAL
Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300

Query: 301 ATASAIEAKMKELSRFFPSNISYEIPYDITPVVKASIKRVLMTLVEAVVLVFVVMFLFLQ 360
TA AI+AK+ EL FFP + PYD TP V+ SI V+ TL EA++LVF+VM+LFLQ
Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360

Query: 361 NIRYTIIPTIVVPVALLGTCATLMLVGYSINMLTMFGMVLAVGILVDDAIVVVENVERIM 420
N+R T+IPTI VPV LLGT A L GYSIN LTMFGMVLA+G+LVDDAIVVVENVER+M
Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420

Query: 421 AEEGLSPKEATRKAMGQITGAIIGITLVLMAVFVPMAFFPGSVGIIYRQFSVTMVAAIGF 480
E+ L PKEAT K+M QI GA++GI +VL AVF+PMAFF GS G IYRQFS+T+V+A+
Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480

Query: 481 SALLALSLTPALCATLLKPVQAGHGHARTGLFGWFNRFMDGSRNRYTSVVGGALTRTGRL 540
S L+AL LTPALCATLLKPV A H + G FGWFN D S N YT+ VG L TGR
Sbjct: 481 SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540

Query: 541 MLIYAIFLVGLTYAFIQLPGGFLPVDDQGFITTDVQTPADSSYARTQAAVEAVEKYLAKR 600
+LIYA+ + G+ F++LP FLP +DQG T +Q PA ++ RTQ ++ V Y K
Sbjct: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600

Query: 601 E--GIEDVTFLTGFSYAGQGVNTAQAFISLKDWSER-GKQDSAAALVADINRDLASLRDA 657
E +E V + GFS++GQ N AF+SLK W ER G ++SA A++ +L +RD
Sbjct: 601 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDG 660

Query: 658 KITALQPPPIDNLGNSSGFSFRLQDRGQKGYAALTAAADQLIAAANASPV-LQKVYIEGL 716
+ P I LG ++GF F L D+ G+ ALT A +QL+ A P L V GL
Sbjct: 661 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL 720

Query: 717 PQGPQVNLMIDREKAGAFGVTFEDINNTISTNLGSTYVNDFPNRGRMQRVVVQADRISRM 776
Q L +D+EKA A GV+ DIN TIST LG TYVNDF +RGR++++ VQAD RM
Sbjct: 721 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRM 780

Query: 777 NADDILNYSVKNARGQPVPFSSFATIQWAKGPSQIAGFNYYPAIRISGEARAGYTSGDAL 836
+D+ V++A G+ VPFS+F T W G ++ +N P++ I GEA G +SGDA+
Sbjct: 781 LPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAM 840

Query: 837 NEMERLAANLPRGFGYEWTGQSLQEKLSGSQAPLLLGLSALVVFLCLAALYESWTIPLAV 896
ME LA+ LP G GY+WTG S QE+LSG+QAP L+ +S +VVFLCLAALYESW+IP++V
Sbjct: 841 ALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSV 900

Query: 897 LLTVPLGILGAVVAANLRGLSNDVYFTVALITIIGLAAKDAILIIEFAKDL-RAHGKPLV 955
+L VPLGI+G ++AA L NDVYF V L+T IGL+AK+AILI+EFAKDL GK +V
Sbjct: 901 MLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVV 960

Query: 956 EATIEACSLRFRPIIMTGLAFVCGVLPMSMATGAGGASQQALGTNVMGGMIAVVVLALLM 1015
EAT+ A +R RPI+MT LAF+ GVLP++++ GAG +Q A+G VMGGM++ +LA+
Sbjct: 961 EATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFF 1020

Query: 1016 VPVFFVSVQRVLAG 1029
VPVFFV ++R G
Sbjct: 1021 VPVFFVVIRRCFKG 1034



Score = 93.0 bits (231), Expect = 3e-21
Identities = 87/513 (16%), Positives = 182/513 (35%), Gaps = 34/513 (6%)

Query: 540 LMLIYAIFLVGLTYAFIQLPGGFLPVDDQGFITTDVQTPADSSYARTQAAVEAVEKYLAK 599
+L + + G A +QLP P ++ P + + +E+ +
Sbjct: 13 WVLAIILMMAGA-LAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQVIEQNMNG 71

Query: 600 REGIEDVTFLTGFSYAGQGVNTAQAFISLKDWSERGKQDSAAALVADINRDLASLRDAKI 659
+ + ++ + I+L + G D A V + L
Sbjct: 72 IDNLMYMS--------STSDSAGSVTITLT--FQSGT-DPDIAQV-QVQNKLQLATPLLP 119

Query: 660 TALQPPPIDNLGNSSGFSFRLQDRGQKGYAALTAAADQLIAAANASPVLQKVYIEGLPQ- 718
+Q I +SS + +D A+N L ++ G Q
Sbjct: 120 QEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDY--VASNVKDTLSRLNGVGDVQL 177

Query: 719 -GPQVNLMI--DREKAGAFGVTFEDINNTISTN---LGSTYVNDFPNRGRMQRVVVQADR 772
G Q + I D + + +T D+ N + + + + P Q +
Sbjct: 178 FGAQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQ 237

Query: 773 ISRMNADDILNYSVK-NARGQPVPFSSFATIQW-AKGPSQIAGFNYYPA----IRISGEA 826
N ++ +++ N+ G V A ++ + + IA N PA I+++ A
Sbjct: 238 TRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGA 297

Query: 827 RAGYTSGDALNEMERLAANLPRGFGYEW---TGQSLQEKLSGSQAPLLLGLSALVVFLCL 883
A T+ ++ L P+G + T +Q + L ++VFL +
Sbjct: 298 NALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEA--IMLVFLVM 355

Query: 884 AALYESWTIPLAVLLTVPLGILGAVVAANLRGLSNDVYFTVALITIIGLAAKDAILIIE- 942
++ L + VP+ +LG G S + ++ IGL DAI+++E
Sbjct: 356 YLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVEN 415

Query: 943 FAKDLRAHGKPLVEATIEACSLRFRPIIMTGLAFVCGVLPMSMATGAGGASQQALGTNVM 1002
+ + P EAT ++ S ++ + +PM+ G+ GA + ++
Sbjct: 416 VERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIV 475

Query: 1003 GGMIAVVVLALLMVPVFFVSVQRVLAGDREKPQ 1035
M V++AL++ P ++ + ++ + + +
Sbjct: 476 SAMALSVLVALILTPALCATLLKPVSAEHHENK 508


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
BBta_6935   RTXTOXIND      51      5e-09     Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 50.6 bits (121), Expect = 5e-09
Identities = 36/217 (16%), Positives = 68/217 (31%), Gaps = 23/217 (10%)

Query: 40 PGRIAP-TRVSDVRPRVSGIVVERLFRQGSEVRAGDPLYRIDPKPFEVEIMSGRAALAKA 98
G++ R +++P + IV E + ++G VR GD L ++ E + + +++L +A
Sbjct: 87 NGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQA 146

Query: 99 EAVHERAVQQARRINLLFKERAAPEVEHEKAIASEREAAADVEARKADLA-----RAQLN 153
R +R I L E SE E K + + Q
Sbjct: 147 RLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKE 206

Query: 154 LDYATVRAPIDGIVGAAQVSEGAIAVQNETS-----------------LATIQQLDPIYA 196
L+ RA ++ E V+ L +
Sbjct: 207 LNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVN 266

Query: 197 DFTQSVSELNRLRRAFESGDLDQIAPDAIKVRLVLDD 233
+ S+L ++ S + + +LD
Sbjct: 267 ELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDK 303



Score = 34.8 bits (80), Expect = 5e-04
Identities = 38/210 (18%), Positives = 72/210 (34%), Gaps = 34/210 (16%)

Query: 87 EIMSGRAALAKAEAVHERAVQQARRINLLFKERAAPEVEHEKAIASEREAAADVEARKAD 146
E+ ++ L + E+ A ++ + + LFK ++ R+ ++ +
Sbjct: 267 ELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKL---------RQTTDNIGLLTLE 317

Query: 147 LARAQLNLDYATVRAPIDGIVGAAQV-SEGAIAVQNETSLATIQQLDPIYADFTQSVSEL 205
LA+ + + +RAP+ V +V +EG + ET + + + D +
Sbjct: 318 LAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNK-- 375

Query: 206 NRLRRAFESGDLDQIAP-DAIKVRLVLDDGTSY-SIPGK--LLFSDAKVDAATGQVTL-- 259
D+ I +++ T Y + GK + DA D G V
Sbjct: 376 ----------DIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNVI 425

Query: 260 ------RGEFANPKRELLPGMYVRVRIEQG 283
N L GM V I+ G
Sbjct: 426 ISIEENCLSTGNKNIPLSSGMAVTAEIKTG 455


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
BBta_6939   OMPADOMAIN     82      1e-19     OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 81.9 bits (202), Expect = 1e-19
Identities = 48/135 (35%), Positives = 72/135 (53%), Gaps = 20/135 (14%)

Query: 215 VVGDRFVFQSEVFFDTGQATLLPEGQQELNSVATALVDLDKQIPAEISWVLRVDGHTDVR 274
V F +S+V F+ +ATL PEGQ L+ + + L +LD + + + V G+TD
Sbjct: 210 VQTKHFTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGS-----VVVLGYTD-- 262

Query: 275 PINSPVFKSNWELSSARAISVVQYLISLGVPAQRLVAAGFAEFQPLDPGNTEDAFRR--- 331
I S + N LS RA SVV YLIS G+PA ++ A G E P+ GNT D ++
Sbjct: 263 RIGSDAY--NQGLSERRAQSVVDYLISKGIPADKISARGMGESNPV-TGNTCDNVKQRAA 319

Query: 332 -------NRRIELKL 339
+RR+E+++
Sbjct: 320 LIDCLAPDRRVEIEV 334


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
BBta_6940   SACTRNSFRASE   47      3e-09     Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 46.9 bits (111), Expect = 3e-09
Identities = 18/93 (19%), Positives = 31/93 (33%), Gaps = 8/93 (8%)

Query: 58 VAETDGALAGFVTIDAT----GYLDQLVVAPQHWGSPLATRLVEA----AKARSPGGITL 109
+ + G + I + ++ + VA + + T L+ AK G+ L
Sbjct: 69 LYYLENNCIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLML 128

Query: 110 LVNTDNARAIRFYKRSGFVEAGADVNPTSGRPV 142
N A FY + F+ D S P
Sbjct: 129 ETQDINISACHFYAKHHFIIGAVDTMLYSNFPT 161


129   BBta_7029   BBta_7038   N   Y   N   Pathogenicity Island (unbiased-composition)
LocusTag    DNBias   CDNBias   %GCBias     Product
BBta_7029   0        12        -0.152302   type IV prepilin peptidase, cpaA
BBta_7030   -1       13        -0.367996   pilus assembly protein CpaB
BBta_7031   -1       12        -0.894689   pilus assembly protein CpaC
BBta_7032   -1       12        -0.017136   pilus assembly protein CpaD
BBta_7033   -2       10        0.076804    pilus assembly protein CpaE
BBta_7034   -1       10        0.544278    secretory protein kinase, cpaF-like gene
BBta_7035   0        10        1.101169    pilus assembly protein
BBta_7036   0        10        1.447480    pilus assembly protein
BBta_7037   -1       11        1.474255    allantoate amidohydrolase
BBta_7038   1        12        0.849294    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
BBta_7029   PREPILNPTASE   40      2e-06     Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 39.8 bits (93), Expect = 2e-06
Identities = 34/138 (24%), Positives = 57/138 (41%), Gaps = 5/138 (3%)

Query: 2 TLDLARLLLFPALMAFAAASDLFTMTISNRVSFALLAGFLVLAPLSGM-GMQDMLSHVGA 60
LL ++ DL M + ++++ LL G L+ L G + D + A
Sbjct: 131 GWGTLAALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNLLGGFVSLGDAVIGAMA 190

Query: 61 GALLLVVAFACFAF----GWIGGGDAKVASAAALWFGFAHLMNYLLYASIFGGVLTLLLM 116
G L+L + F +G GD K+ +A W G+ L LL +S+ G + + L+
Sbjct: 191 GYLVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSLVGAFMGIGLI 250

Query: 117 QFRQWPLPYMLAGQPWLA 134
R + P+LA
Sbjct: 251 LLRNHHQSKPIPFGPYLA 268


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
BBta_7031   BCTERIALGSPD   117     3e-30     Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 117 bits (295), Expect = 3e-30
Identities = 69/273 (25%), Positives = 116/273 (42%), Gaps = 28/273 (10%)

Query: 188 KVVNSITVRGRDQVMLKVTVAEVQRSVVKQLGIDLSGQ-----------LNYGTAV---- 232
+V+ + +R R QV+++ +AEVQ + LGI + + L TA+
Sbjct: 335 RVIAQLDIR-RPQVLVEAIIAEVQDADGLNLGIQWANKNAGMTQFTNSGLPISTAIAGAN 393

Query: 233 VKFANSNPFTAYGSNLVSNNAINASFGS-SVQATLRAMENAGVIRTLAEPNLTAISGESA 291
+ ++ S L S N I A F + L A+ ++ LA P++ + A
Sbjct: 394 QYNKDGTVSSSLASALSSFNGIAAGFYQGNWAMLLTALSSSTKNDILATPSIVTLDNMEA 453

Query: 292 TFIAGGEFPVPAGYSCDPTTHVCTTQISFKKFGISLNFTPVVLAEGRISLRVMTEVSELS 351
TF G E PV G ++ T + K GI L P + + L + EVS
Sbjct: 454 TFNVGQEVPVLTGSQTTSGDNIFNT-VERKTVGIKLKVKPQINEGDSVLLEIEQEVS--- 509

Query: 352 NENSITLSQAVTSSTVNSLTVPSIKTRRAETTLEIPSGGAMAMAGLIQQQTKQAISGMPG 411
S+ + + TSS + + TR + + SG + + GL+ + +P
Sbjct: 510 ---SVADAASSTSSDLG----ATFNTRTVNNAVLVGSGETVVVGGLLDKSVSDTADKVPL 562

Query: 412 LMQLPVLGTLFRSRDYVNNQTELMVLVTPFVVR 444
L +PV+G LFRS ++ LM+ + P V+R
Sbjct: 563 LGDIPVIGALFRSTSKKVSKRNLMLFIRPTVIR 595


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
BBta_7033   HTHFIS         36      3e-04     FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 36.0 bits (83), Expect = 3e-04
Identities = 21/138 (15%), Positives = 42/138 (30%), Gaps = 12/138 (8%)

Query: 64 GGMAAAIEAYRSAPTPNVIILETDARSDILAGLDQLATV--CDPGTRVVVIGRVNDVTLY 121
A + ++++ TD D L + P V+V+ N
Sbjct: 34 SNAATLWRWIAAGD-GDLVV--TDVVMPDENAFDLLPRIKKARPDLPVLVMSAQNTFMTA 90

Query: 122 RELVRRGVSDYVLSPVTPIDVVRSICNLFSAPEAKAVGRIIAVVGAKGGVGASTIAHNVA 181
+ +G DY+ P +++ I + P+ + VG S
Sbjct: 91 IKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSA------ 144

Query: 182 WAIARDLSMDSVVADLDL 199
A+ + + + DL
Sbjct: 145 -AMQEIYRVLARLMQTDL 161


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
BBta_7035   BCTERIALGSPF   29      0.029     Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 29.0 bits (65), Expect = 0.029
Identities = 26/98 (26%), Positives = 43/98 (43%), Gaps = 7/98 (7%)

Query: 164 IKAGLPLFESIKVVAADAPEP-LRSEFLAIIETQAIGMPLGEACARLYERMPLPEANFFG 222
+ A +PL E++ VA + +P L A+ G L +A + P +
Sbjct: 81 VAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSLADA----MKCFPGSFERLYC 136

Query: 223 IVVAIQQKSGGNLSEALGNLSKVLRDRKKMAEKI-QAM 259
+VA + S G+L L L+ R++M +I QAM
Sbjct: 137 AMVAAGETS-GHLDAVLNRLADYTEQRQQMRSRIQQAM 173


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
BBta_7038   SYCDCHAPRONE   31      0.004     Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 30.7 bits (69), Expect = 0.004
Identities = 14/63 (22%), Positives = 22/63 (34%), Gaps = 2/63 (3%)

Query: 167 PDNPDWRLLSVQGTALDQMGRHDEARRYYESALKIVPGEPSVLSNLGLSYMLTRELPKAE 226
+ + L G MG++D A Y + EP + + EL +AE
Sbjct: 67 HYDSRFFL--GLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAE 124

Query: 227 EVL 229
L
Sbjct: 125 SGL 127


130   BBta_7229   BBta_7235   N   Y   N   Pathogenicity Island (unbiased-composition)
LocusTag    DNBias   CDNBias   %GCBias     Product
BBta_7229   2        29        -3.186408   two-component response regulator
BBta_7230   2        30        -4.133275   hypothetical protein
BBta_7231   4        24        -2.346375   hypothetical protein
BBta_7233   3        21        -1.823935   flagellar L-ring protein FlgH
BBta_7234   3        21        -2.197927   hypothetical protein
BBta_7235   1        20        -1.334079   outer membrane protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
BBta_7229   HTHFIS         79      3e-19     FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 79.1 bits (195), Expect = 3e-19
Identities = 36/142 (25%), Positives = 63/142 (44%)

Query: 5 SSVRVLVVEDDPQLGIWLRDALATAFGSSDIVTTLDEGRAAIAVRTFELVVIDRGLPDGD 64
+ +LV +DD + L AL+ A I + IA +LVV D +PD +
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 65 GLALLPDLRQQKPSPATVVLTALDDPADIARALDEGADDYVAKPFEPIELVARARAVLRR 124
LLP +++ +P +V++A + +A ++GA DY+ KPF+ EL+ L
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 125 LYLDRGAVVSIANLSYDIVNRA 146
+ + +V R+
Sbjct: 122 PKRRPSKLEDDSQDGMPLVGRS 143


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
BBta_7233   FLGLRINGFLGH   64      9e-15     Flagellar L-ring protein signature.
		>FLGLRINGFLGH#Flagellar L-ring protein signature.

Length = 232

Score = 64.2 bits (156), Expect = 9e-15
Identities = 37/187 (19%), Positives = 67/187 (35%), Gaps = 27/187 (14%)

Query: 47 PIQPLLPAA--AARVPAHQP---NRRLRSLSHAWSTKQ------RAPQIGDIVTMTVNIA 95
P PL+ A A VP P +S Q R IGD +T+ +
Sbjct: 26 PSTPLVQGATSAQPVPGPTPVANGSIFQSAQPINYGYQPLFEDRRPRNIGDTLTIVLQEN 85

Query: 96 ENPGDMSTQGGTNSNRHADELPDTGGAVAGRR-----------ADASSSRDTAGRTTVGQ 144
+ S N++R D + G R +AS G+
Sbjct: 86 VSA---SKSSSANASR--DGKTNFGFDTVPRYLQGLFGNARADVEASGGNTFNGKGGANA 140

Query: 145 AELKRMSMAAVVTRVSPDGGLVVEGRRTMQIDGEIVDLEVSGVVPADSLRVDHSIDASKM 204
+ ++ V +V +G L V G + + I+ + SGVV ++ +++ ++++
Sbjct: 141 SNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRFSGVVNPRTISGSNTVPSTQV 200

Query: 205 KQMQISY 211
+I Y
Sbjct: 201 ADARIEY 207


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
BBta_7234   FLGFLIH        27      0.003     Flagellar assembly protein FliH signature.
		>FLGFLIH#Flagellar assembly protein FliH signature.

Length = 228

Score = 27.5 bits (60), Expect = 0.003
Identities = 11/28 (39%), Positives = 16/28 (57%)

Query: 35 YDPGYGDGYADGYVSGYKDGYSDGAWSG 62
++ GY G A+G G+K GY +G G
Sbjct: 52 HEQGYQAGIAEGRQQGHKQGYQEGLAQG 79


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
BBta_7235   OMPADOMAIN     55      2e-11     OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 55.3 bits (133), Expect = 2e-11
Identities = 49/188 (26%), Positives = 67/188 (35%), Gaps = 33/188 (17%)

Query: 14 YVGGHLG------GEFSGASGAFGRESRFLGGLQFGADYQFAPNWVLGAEGQYSWLAGNG 67
Y G LG F +G G FG YQ P +G E Y WL G
Sbjct: 29 YTGAKLGWSQYHDTGFINNNGPTHENQLGAGA--FGG-YQVNPY--VGFEMGYDWL---G 80

Query: 68 QTSSLGRRELTRDRNGIGGLTGRLGYTWGAGL-LYVKGGYAYQDRSYGVRSRAGAPVAFA 126
+ G E + LT +LGY L +Y + G R+ + V
Sbjct: 81 RMPYKGSVENGAYKAQGVQLTAKLGYPITDDLDIYTRLGGMV------WRADTKSNVYGK 134

Query: 127 VRDNKSGYTVGAGLEYMFAPNWSTKLEYQY-YDFGTARFVAPSPGRLVRNGTFDSTLHTV 185
D G+EY P +T+LEYQ+ + G A + P +
Sbjct: 135 NHDTGVSPVFAGGVEYAITPEIATRLEYQWTNNIGDAHTIGTRP-----------DNGML 183

Query: 186 SVGVNYRF 193
S+GV+YRF
Sbjct: 184 SLGVSYRF 191


131   BBta_7241   BBta_7245   N   Y   N   Pathogenicity Island (unbiased-composition)
LocusTag    DNBias   CDNBias   %GCBias     Product
BBta_7241   0        18        -2.155045   flagellar hook-associated protein
BBta_7242   1        18        -2.960764   flagellar hook protein FlgE
BBta_7243   1        19        -2.857497   hypothetical protein
BBta_7244   2        18        -2.587810   flagellar basal-body rod modification protein
BBta_7245   1        17        -2.613087   short-chain dehydrogenase
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
BBta_7241   FLGHOOKAP1     129     6e-34     Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 129 bits (325), Expect = 6e-34
Identities = 126/605 (20%), Positives = 220/605 (36%), Gaps = 72/605 (11%)

Query: 3 TNAFNTATAGLQTLQVAIGTVSQNVANSGVAGYTRRVVSTESAGPG-------NSGVAVA 55
++ N A +GL Q A+ T S N+++ VAGYTR+ A +GV V+
Sbjct: 1 SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVS 60

Query: 56 RIDRTFDEMALKQMRLESAHAAYASAKADILAQIDKLSGKPADSSALDARLNGLAKSLLA 115
+ R +D Q+R ++ +A+ + +++ID + +S+L ++ SL
Sbjct: 61 GVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLS--TSTSSLATQMQDFFTSLQT 118

Query: 116 LASNGASASSRSTVVDAASVLADKIRGMADALQAMRDGANKRLAAEVSAADGLLTSIADL 175
L SN ++R ++ + L ++ + L+ N + A V + IA L
Sbjct: 119 LVSNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASL 178

Query: 176 N---IKATTVTDDATRVGILDRRDQQITELARYMEVKSIRQRDGGVTLMTASGLTLVERG 232
N + T V A+ +LD+RDQ ++EL + + V+ Q G + A+G +LV+
Sbjct: 179 NDQISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGS 238

Query: 233 AATKLSFIDRSPATGGGAIVATLPGGVGLELDASAVSSGSIAANLEIRDAILPRTQRRLD 292
A +L+ + S + +E+ +++GS+ L R L +T+ L
Sbjct: 239 TARQLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLG 298

Query: 293 DLAFGLAQAFTNTTVTAGRRGAGFDLRLDDVAQMQPGNTITIAVGSGDTQRTIVLVASNL 352
LA A+AF NT A GFD D + VL +
Sbjct: 299 QLALAFAEAF-NTQHKA-----GFDANGD------------AGEDFFAIGKPAVLQNTKN 340

Query: 353 ASKSLDVAQPPGRVQTFAIPSAPATSQAYAAALSTAISAAAPGLTVTSGGSNSITVGGAG 412
DVA A+ + A+ TVT + + G
Sbjct: 341 KG---DVAIGATVTDASAVLATDYKISFDNNQWQVTRLASNTTFTVTPDANGKVAFDGLE 397

Query: 413 IQSVTASVTQPKTAGDLSGGYPKLAVFVDGAANALVNGAMDDGPQRAGLAARLAVNTALK 472
+ T D + + ++A+VN +
Sbjct: 398 LT-----FTGTPAVND--------SFTLKPVSDAIVNMDVL------------------I 426

Query: 473 ADTSALVAVGSSPSALSRPQALYQALTSEKQGFSSPNGSSNRSPAMTTVISFAQDVVAAA 532
D + + + S + L + Q S G + + +V+
Sbjct: 427 TDEAKIAMASEEDAGDSDNRNGQALL--DLQSNSKTVGGA------KSFNDAYASLVSDI 478

Query: 533 GSEAAASATVADQQNVAKANADAWFSKGASVNIDEEMSRLIALQTAYAANARVLTAAREM 592
G++ A T + Q + VN+DEE L Q Y ANA+VL A +
Sbjct: 479 GNKTATLKTSSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAI 538

Query: 593 IDLLL 597
D L+
Sbjct: 539 FDALI 543


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
BBta_7242  FLGHOOKAP1  32     0.007    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 31.9 bits (72), Expect = 0.007
Identities = 12/40 (30%), Positives = 20/40 (50%)

Query: 424 EASNSDVAGEFSKLIATQQAYSANVKVMTTSQQMMSDLLN 463
S ++ E+ L QQ Y AN +V+ T+ + L+N
Sbjct: 505 SISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALIN 544



Score = 29.5 bits (66), Expect = 0.035
Identities = 11/34 (32%), Positives = 17/34 (50%)

Query: 4 STALLTAFSGMKAQSYAMANISGNIANTQTPGYK 37
S+ + A SG+ A A+ S NI++ GY
Sbjct: 1 SSLINNAMSGLNAAQAALNTASNNISSYNVAGYT 34


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
BBta_7243  FLAGELLIN  48     9e-08    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 47.7 bits (113), Expect = 9e-08
Identities = 54/365 (14%), Positives = 94/365 (25%), Gaps = 6/365 (1%)

Query: 214 TNDGRLTFTSTLTGSDAKLTISGGGATNTVDVGFGTGALTAVAASGIDATDGSAKASVTG 273
NDG L + G + G + +G D A
Sbjct: 149 ANDGETITIDLQKIDVKSLGLDGFNVNGPKEATVGDLKSSFKNVTGYDTYAVGANKYRVD 208

Query: 274 AAIGALGTASNFDLTAGDASITVQLGNGLTKTINLNKTADAALGVATLKAQDIAAAINRQ 333
GA+ T + + G T N D + A AI
Sbjct: 209 VNSGAVVTDTTAPTVPDKVYVNAANGQLTTDDAENNTAVDLFKTTKSTAGTAEAKAIAGA 268

Query: 334 LNADTGISGKVIATYDNAAGTVSLRTTASGGDQKLTVTSAAASTKDIGFGTAGDATKART 393
+ T + + + DI G A
Sbjct: 269 IKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVSTTINGEKVTLTVADITAGAANVDAATLQ 328

Query: 394 AAGAGATAANGTNQRAALAQQFNELLNQITMAAQDARFQGANLLYRTGSDPKENTLHMTF 453
++ T+ + A + + +
Sbjct: 329 SSKNVYTSVVNGQFTFDDKTKNESAKLSDLEANNAVKGESKITVNGAEYTA------NAA 382

Query: 454 NEKDTSYLDIKGVKFDASGLGITQSSGNFATNDEVKTALSQLMNASSTLRSQASTFGSNL 513
+K T + ASG+ + A L+ + +A S + + S+ G+
Sbjct: 383 GDKVTLAGKTMFIDKTASGVSTLINEDAAAAKKSTANPLASIDSALSKVDAVRSSLGAIQ 442

Query: 514 TVIQNRQAFTKNMINILDVGANNLTIADLNEETANQNALSLRNSLGISALSLANQAQQGI 573
+ N + L+ + + AD E +N + + G S L+ ANQ Q +
Sbjct: 443 NRFDSAITNLGNTVTNLNSARSRIEDADYATEVSNMSKAQILQQAGTSVLAQANQVPQNV 502

Query: 574 LQLLR 578
L LLR
Sbjct: 503 LSLLR 507


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BBta_7245  DHBDHDRGNASE  117    8e-34    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 117 bits (294), Expect = 8e-34
Identities = 79/267 (29%), Positives = 127/267 (47%), Gaps = 20/267 (7%)

Query: 8 AVKDKKALVTGGASGIGLAIAEGLAENGAVVAIVDRNKEALEREIARLISRDLRVSGVHG 67
++ K A +TG A GIG A+A LA GA +A VD N E LE+ ++ L +
Sbjct: 5 GIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPA 64

Query: 68 DVA-ADGFDATIEAAIAALGGVDIVFANAGISGGYGPGVGGNDAGLLQNIDLKSWNHTIG 126
DV + D +G +DI+ AG+ GL+ ++ + W T
Sbjct: 65 DVRDSAAIDEITARIEREMGPIDILVNVAGVL----------RPGLIHSLSDEEWEATFS 114

Query: 127 VNLTGIVSTLKATIPTLKEQRSGKIVVTASIAGLRANPSIGYSYTASKAALVLLIKELAL 186
VN TG+ + ++ + ++RSG IV S S+ +Y +SKAA V+ K L L
Sbjct: 115 VNSTGVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMA-AYASSKAAAVMFTKCLGL 173

Query: 187 ELAPFGVQVNGLAPGPFKTNINGGRFFDPDNAAREAA--------TVPLGRLAQPHEIKG 238
ELA + ++ N ++PG +T++ + D + A + +PL +LA+P +I
Sbjct: 174 ELAEYNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIAD 233

Query: 239 LALLLSSDASSYITGAVIPIDGGKTAG 265
L L S + +IT + +DGG T G
Sbjct: 234 AVLFLVSGQAGHITMHNLCVDGGATLG 260
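
The percentages on each "Identities / Positives / Gaps" line can be recomputed from the counts printed beside them. The following is a minimal sketch, not part of PredictBias or BLAST: the names SUMMARY_RE, parse_summary and floor_percent are hypothetical, and the round-down rule is an assumption inferred from the figures in this report (for example 126/605 is reported as 20%).

```python
import re

# Matches fields such as "Identities = 126/605 (20%)".
# The Gaps field is optional in the report, so each field is matched on its own.
SUMMARY_RE = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_summary(line):
    """Return {field: (count, aligned_length, reported_percent)} for one summary line."""
    return {
        name: (int(count), int(length), int(pct))
        for name, count, length, pct in SUMMARY_RE.findall(line)
    }


def floor_percent(count, length):
    """Percentage rounded down (assumption: matches how this report prints it)."""
    return (100 * count) // length


# Summary line copied from the BBta_7241 / FLGHOOKAP1 alignment above.
line = "Identities = 126/605 (20%), Positives = 220/605 (36%), Gaps = 72/605 (11%)"

for name, (count, length, reported) in parse_summary(line).items():
    # Print the recomputed value next to the value shown in the report.
    print(name, floor_percent(count, length), reported)
```

Summary lines without a Gaps field, such as the two short BBta_7242 hits, yield only the Identities and Positives entries.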



 
Contact Sachin Pundhir for Bugs/Comments.