PredictBias

Identification of genomic and pathogenicity islands in prokaryotic genomes
 
A) Input parameters
Genome: CM003.gbff
Threshold dinucleotide bias: 2
Threshold codon bias: 4
Threshold %GC bias: 3
E-value (RPSBlast): 0.05
Genome (non-pathogenic):
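
The E-value (RPSBlast) cutoff of 0.05 can be checked against the "Score = ..., Expect = ..." lines reported for each hit in section C. The following is a minimal Python sketch, not part of PredictBias itself: the function names are hypothetical, and treating the cutoff as inclusive (`<=`) is an assumption.

```python
import re

# Matches a BLAST-style score line such as
# "Score = 29.9 bits (67), Expect = 0.015"
SCORE_LINE = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)"
)


def parse_score_line(line):
    """Return (bit_score, raw_score, e_value), or None if the line does not match."""
    match = SCORE_LINE.search(line)
    if match is None:
        return None
    bits, raw, expect = match.groups()
    # Very small E-values are printed without a mantissa ("e-104"),
    # which float() rejects, so prefix a "1".
    if expect.startswith("e"):
        expect = "1" + expect
    return float(bits), int(raw), float(expect)


def passes_cutoff(line, cutoff=0.05):
    """True if the line parses and its E-value is at or below the cutoff (assumed inclusive)."""
    parsed = parse_score_line(line)
    return parsed is not None and parsed[2] <= cutoff
```

For example, `passes_cutoff("Score = 29.0 bits (65), Expect = 0.046")` accepts the weakest hit in this report, and the `e-104` form used for the strongest hit is parsed as `1e-104`.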
 
 
C) Potential GIs and PAIs in CP040010
S.No  Start        End          Bias  Virulence  Insertion elements  Prediction
1     FBF30_01015  FBF30_01040  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias    %GCBias  Product
FBF30_01015       2       16   1.555928  lysine--tRNA ligase
FBF30_01020       5       19   2.913468  hypothetical protein
FBF30_01025       8       19   3.606348  undecaprenyl-diphosphate phosphatase
FBF30_01030       6       17   4.313596  ABC transporter ATP-binding protein
FBF30_01035       7       18   3.857572  ABC transporter permease
FBF30_01040       7       18   4.909219  collagen-like protein
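
Rows of the per-ORF table can be split into fields with a regular expression, whether the fields are run together (as in raw page text) or separated by whitespace. This is a minimal Python sketch under assumptions drawn only from the rows in this report, not from PredictBias documentation: the locus tag ends in five digits, CDNBias is always two digits, and %GCBias has one integer digit and six decimals. `parse_orf_row` is a hypothetical helper name.

```python
import re

# Field layout is inferred from this report (assumption):
# locus tag, DNBias (signed int), CDNBias (2 digits), %GCBias (d.dddddd), product.
ORF_ROW = re.compile(
    r"^(?P<locus>[A-Z0-9]+_\d{5})\s*"   # e.g. FBF30_01015
    r"(?P<dn>-?\d+?)\s*"                 # dinucleotide bias, as few digits as possible
    r"(?P<cdn>\d{2})\s*"                 # codon bias, assumed two digits
    r"(?P<gc>-?\d\.\d{6})\s*"            # %GC bias, assumed six decimals
    r"(?P<product>.*)$"                  # product name (may start with digits or '*')
)


def parse_orf_row(line):
    """Split one ORF row into typed fields; return None if it does not match."""
    match = ORF_ROW.match(line.strip())
    if match is None:
        return None
    return {
        "locus": match.group("locus"),
        "dn_bias": int(match.group("dn")),
        "cdn_bias": int(match.group("cdn")),
        "gc_bias": float(match.group("gc")),
        "product": match.group("product"),
    }
```

The fixed six-decimal width is what keeps product names that begin with digits, such as "50S ribosomal protein L10", from being absorbed into the %GCBias value.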
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
FBF30_01015  SECBCHAPRONE  30     0.015    Bacterial protein-transport SecB chaperone protein ...
		>SECBCHAPRONE#Bacterial protein-transport SecB chaperone protein

signature.
Length = 170

Score = 29.9 bits (67), Expect = 0.015
Identities = 8/79 (10%), Positives = 24/79 (30%), Gaps = 15/79 (18%)

Query: 368 KNIRKDVVGPVWLVNTPKFISPLAKSSIDDPNTVQRFQPIMLGSELGNGFSELNDPID-- 425
+ + + P + P A+ + F + L P++
Sbjct: 98 SGLEEMQMAHCLTSQCPNMLFPYARELVSSLVNRGTFPALNL------------SPVNFD 145

Query: 426 -QYGRFIEQQAMRDSGDDE 443
+ ++++Q + +E
Sbjct: 146 ALFMDYLQRQEQAEQTTEE 164


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
FBF30_01035  MICOLLPTASE   29     0.040    Microbial collagenase metalloprotease (M9) signature.
		>MICOLLPTASE#Microbial collagenase metalloprotease (M9) signature.

Length = 1104

Score = 29.3 bits (65), Expect = 0.040
Identities = 16/65 (24%), Positives = 27/65 (41%)

Query: 71 TDMQKGIGASVLPEYKPDTVTQYGREYATLSPDDLAKLRSRSDIDNVQPLYNLTPKYATF 130
TD + +G S+ P + E ++ DL +L +NV L+N TF
Sbjct: 72 TDNNRPLGPSIAPSRARNNKIYTFDELNRMNYSDLVELIKTISYENVPDLFNFNDGSYTF 131

Query: 131 GAAKD 135
+ +D
Sbjct: 132 FSNRD 136


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
FBF30_01040  cloacin       38     3e-05    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 38.2 bits (88), Expect = 3e-05
Identities = 19/37 (51%), Positives = 20/37 (54%)

Query: 219 IGRGGGGGGGGGGGGGGGGGGGGGGGGGVARLAQLAL 255
I GGG G G GGG G GGG G GG A A +A
Sbjct: 54 IHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAF 90



Score = 34.3 bits (78), Expect = 4e-04
Identities = 18/37 (48%), Positives = 20/37 (54%)

Query: 220 GRGGGGGGGGGGGGGGGGGGGGGGGGGVARLAQLALP 256
G G GGG G G GGG G GGG G L+ +A P
Sbjct: 51 GSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAP 87



Score = 32.4 bits (73), Expect = 0.002
Identities = 16/29 (55%), Positives = 16/29 (55%)

Query: 218 PIGRGGGGGGGGGGGGGGGGGGGGGGGGG 246
P G G G G GGG G G GGG G GG
Sbjct: 45 PWGGGSGSGIHWGGGSGHGNGGGNGNSGG 73



Score = 32.0 bits (72), Expect = 0.002
Identities = 16/32 (50%), Positives = 16/32 (50%)

Query: 215 PIGPIGRGGGGGGGGGGGGGGGGGGGGGGGGG 246
P G G GGG G G GGG G GGG G
Sbjct: 45 PWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSG 76


2     FBF30_01200  FBF30_01235  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias    %GCBias  Product
FBF30_01200       3       24   4.575501  hypothetical protein
FBF30_01205       4       28   6.786063  NUDIX hydrolase
FBF30_01210       4       26   2.631469  AbrB/MazE/SpoVT family DNA-binding
FBF30_01215       3       25   1.610523  type II toxin-antitoxin system death-on-curing
FBF30_01220       2       24   0.145094  histidine phosphatase family protein
FBF30_01225       2       28  -0.197883  SDR family oxidoreductase
FBF30_01230       0       26  -4.668429  hypothetical protein
FBF30_01235       1       25  -4.471924  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
FBF30_01220  FLGMOTORFLIG  30     0.005    Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 29.8 bits (67), Expect = 0.005
Identities = 18/71 (25%), Positives = 34/71 (47%), Gaps = 3/71 (4%)

Query: 29 RQQAQQLARRIRDRELVFDVVYASPLDRALET--AAIVATELGLAEPIVHDDLIERNFGI 86
++ +LA I+ + VF+ + DR+++ I EL A V + E+ F
Sbjct: 230 EEEDPELAEEIKKKMFVFEDIVLLD-DRSIQRVLREIDGQELAKALKSVDIPVQEKIFKN 288

Query: 87 MTGRPAADVEE 97
M+ R A+ ++E
Sbjct: 289 MSKRAASMLKE 299


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
FBF30_01225  NUCEPIMERASE  32     0.003    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 31.7 bits (72), Expect = 0.003
Identities = 11/29 (37%), Positives = 15/29 (51%)

Query: 6 TVLVAGATGYLGRFVVAELHRRGYKVRAI 34
LV GA G++G V L G++V I
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGI 30


3     FBF30_01320  FBF30_01500  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias    %GCBias  Product
FBF30_01320       9       22   2.259680  penicillin-binding protein 2
FBF30_01325      10       25   2.388295  phospho-N-acetylmuramoyl-pentapeptide-
FBF30_01330      10       27   2.007032  FtsW/RodA/SpoVE family cell cycle protein
FBF30_01335      11       30   2.416833  UDP-N-acetylglucosamine--N-acetylmuramyl-
FBF30_01340      10       32   1.983681  hypothetical protein
FBF30_01345       7       31   0.575480  BspA family leucine-rich repeat surface protein
FBF30_01350       2       24  -5.877783  NUDIX hydrolase
FBF30_01355       2       25  -3.704921  type II-A CRISPR-associated protein Csn2
FBF30_01360       2       26  -2.500990  CRISPR-associated endonuclease Cas2
FBF30_01365       2       26  -1.585982  type II CRISPR-associated endonuclease Cas1
FBF30_01370       2       26  -1.115680  type II CRISPR RNA-guided endonuclease Cas9
FBF30_01375       2       26   0.343948  type II CRISPR RNA-guided endonuclease Cas9
FBF30_01380       1       28   1.641874  ATP-binding protein
FBF30_01385       1       28   2.587572  hypothetical protein
FBF30_01390      -2       23   3.023403  type I DNA topoisomerase
FBF30_01395      -2       24   3.480995  hypothetical protein
FBF30_01400       0       21   3.929372  hypothetical protein
FBF30_01405       1       21   4.701530  CDP-alcohol phosphatidyltransferase family
FBF30_01410      -1       26   4.423568  DNA-protecting protein DprA
FBF30_01415      -2       17   4.624651  ZIP family metal transporter
FBF30_01420      -1       20   5.406101  aminoacyl-tRNA hydrolase
FBF30_01425       0       20   6.030364  hypothetical protein
FBF30_01430       1       19   5.615014  hypothetical protein
FBF30_01435       1       20   5.268907  ribosome biogenesis GTPase Der
FBF30_01440       1       20   5.452803  hypothetical protein
FBF30_01445       2       24   6.130434  hypothetical protein
FBF30_01450       1       24   4.947756  adenylosuccinate synthetase
FBF30_01455       0       21   4.296903  hypothetical protein
FBF30_01460       0       18   3.980803  hypothetical protein
FBF30_01465      -1       16   3.127416  rRNA pseudouridine synthase
FBF30_01470      -2       13   1.946552  response regulator
FBF30_01475      -3       14   0.844024  PAS domain-containing protein
FBF30_01480       0       16  -1.014583  protease HtpX
FBF30_01490       0       17  -1.840133  *hypothetical protein
FBF30_01495       1       22  -3.118068  hypothetical protein
FBF30_01500       0       25  -3.075603  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
FBF30_01380  IGASERPTASE   44     2e-06    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 44.3 bits (104), Expect = 2e-06
Identities = 42/195 (21%), Positives = 65/195 (33%), Gaps = 25/195 (12%)

Query: 753 APAFSAKTLNL----PPAQDDNTGRIIQHTRENYARNRQEIEEDISKRILPPENLVVKRP 808
APA ++T + + Q E A+NR+ +E S + V +
Sbjct: 1029 APATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQS 1088

Query: 809 G-----PAYTPPKSPEQREAERRAKEAAILAQGKTWPISNVTPDEVNIALKEARDQKKQP 863
G T K E E +AK Q S V+P K+ + + QP
Sbjct: 1089 GSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSP-------KQEQSETVQP 1141

Query: 864 STEPKT-------SDQPDAPINNSNNVTEKPKKKRTRTRKRKPSGSSDTQQEPNRPRIIR 916
EP +P + N+ TE+P K T + +P S T N
Sbjct: 1142 QAEPARENDPTVNIKEPQSQ-TNTTADTEQP-AKETSSNVEQPVTESTTVNTGNSVVENP 1199

Query: 917 ESSVNDGKPSSNNPE 931
E++ + N E
Sbjct: 1200 ENTTPATTQPTVNSE 1214



Score = 31.6 bits (71), Expect = 0.021
Identities = 26/150 (17%), Positives = 51/150 (34%), Gaps = 15/150 (10%)

Query: 794 SKRILPPENLVVKRPGPAYTPPKSPEQREAERRAKEAAILAQGKTWPISNVTPDEVNIAL 853
++ I + V P PA TP ++ E + + + + T +A
Sbjct: 1014 NEEIARVDEAPVPPPAPA-TPSETTETVAENSKQESKT--VEKNEQDATETTAQNREVAK 1070

Query: 854 KEARDQKKQPSTEPKTSDQPDAPINNSNNVTEKPKKKRTRTRKRKPSGSSDTQQEPNRPR 913
+ + K T + T + K+ T ++ K ++ QE P+
Sbjct: 1071 EAKSNVKANTQTNEVAQSGSET----KETQTTETKETATVEKEEKAKVETEKTQE--VPK 1124

Query: 914 IIRESSVNDGKPSSNNP------ENDPTIL 937
+ + S + + P ENDPT+
Sbjct: 1125 VTSQVSPKQEQSETVQPQAEPARENDPTVN 1154


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
FBF30_01470  HTHFIS        83     2e-19    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 83.0 bits (205), Expect = 2e-19
Identities = 29/138 (21%), Positives = 60/138 (43%), Gaps = 3/138 (2%)

Query: 2 TKILLVEDDKSLREIYGVRLLAEGYDIVSAGDGEEALAMAIKDRPDLILSDVMMPKISGF 61
IL+ +DD ++R + L GYD+ + DL+++DV+MP + F
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 DMLDILRSTTETKNIKVIMMTALSSEEQRQRGVALGADRYLVKSQVGIEDVVRTVHEVLS 121
D+L ++ ++ V++M+A ++ + GA YL K + +++ + L+
Sbjct: 64 DLLPRIKK--ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKP-FDLTELIGIIGRALA 120

Query: 122 DAPVSGQKPLVSPRPAAP 139
+ K + P
Sbjct: 121 EPKRRPSKLEDDSQDGMP 138


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
FBF30_01475  PF06580       43     3e-06    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 42.5 bits (100), Expect = 3e-06
Identities = 24/109 (22%), Positives = 40/109 (36%), Gaps = 27/109 (24%)

Query: 454 VVQNLVENAIKY-----TPEGEVSVDVTGDHSHIVISIADTGIGIPHEDQSHLFQKFYRV 508
+VQ LVEN IK+ G++ + T D+ + + + +TG +
Sbjct: 259 LVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKE--------- 309

Query: 509 DNSDTREIGGTGLGLY-LCRRLTETIGG--RIWVESEYKHGSTFFVEIP 554
TG GL + RL G +I + + + V IP
Sbjct: 310 ---------STGTGLQNVRERLQMLYGTEAQIKLSEK-QGKVNAMVLIP 348


4     FBF30_01780  FBF30_01880  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias    %GCBias  Product
FBF30_01780       2       25   0.976158  hypothetical protein
FBF30_01785       1       21   0.087567  hypothetical protein
FBF30_01790       0       22  -0.358798  recombination protein RecR
FBF30_01795       0       22  -0.593474  YbaB/EbfC family nucleoid-associated protein
FBF30_01800       0       22  -0.131058  glycosyltransferase family 39 protein
FBF30_01805       2       24   1.059013  replicative DNA helicase
FBF30_01810       1       24   1.695359  DNA polymerase III subunit gamma/tau
FBF30_01815       0       27   3.213812  thymidine kinase
FBF30_01820      -1       27   3.343811  hypothetical protein
FBF30_01825      -1       23   3.417321  50S ribosomal protein L7/L12
FBF30_01830      -2       22   2.716865  50S ribosomal protein L10
FBF30_01835      -1       19   2.102054  hypothetical protein
FBF30_01840       1       17   0.712044  hypothetical protein
FBF30_01845       0       17   0.311381  deoxycytidine triphosphate deaminase
FBF30_01850       0       15   1.119423  dihydrofolate reductase
FBF30_01855       1       18   0.959974  thymidylate synthase
FBF30_01860       0       21   1.536645  hypothetical protein
FBF30_01865       2       24   2.977626  hypothetical protein
FBF30_01870       2       20   3.629586  50S ribosomal protein L1
FBF30_01875       1       18   3.872689  LysM peptidoglycan-binding domain-containing
FBF30_01880      -1       22   3.683485  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
FBF30_01810  HTHFIS        29     0.046    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 29.0 bits (65), Expect = 0.046
Identities = 16/52 (30%), Positives = 25/52 (48%), Gaps = 8/52 (15%)

Query: 13 SLDEIVGQS----HITDMLKRAIASDNIAHAYLLTGPRGVGKTSIARILAHE 60
+VG+S I +L R + +D ++TG G GK +AR L H+
Sbjct: 135 DGMPLVGRSAAMQEIYRVLARLMQTDL---TLMITGESGTGKELVARAL-HD 182


5     FBF30_02295  FBF30_02340  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias    %GCBias  Product
FBF30_02295       2       16   0.046012  hypothetical protein
FBF30_02300       2       17  -0.421991  hypothetical protein
FBF30_02305       0       13  -0.405011  hypothetical protein
FBF30_02315       2       21   2.046932  hypothetical protein
FBF30_02320       2       25   2.150843  30S ribosomal protein S7
FBF30_02325       2       25   2.385303  30S ribosomal protein S12
FBF30_02330       3       23   2.226377  hypothetical protein
FBF30_02335       3       23   2.509873  hypothetical protein
FBF30_02340       3       24   2.486020  DNA-directed RNA polymerase subunit beta'
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
FBF30_02330  ACRIFLAVINRP  29     0.032    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 29.0 bits (65), Expect = 0.032
Identities = 13/40 (32%), Positives = 19/40 (47%), Gaps = 7/40 (17%)

Query: 51 RPVFVLILAIFMTALGAFVGIVVYKAYFERPATEAPVVAP 90
RP+F +LAI + GA A + P + P +AP
Sbjct: 8 RPIFAWVLAIILMMAGAL-------AILQLPVAQYPTIAP 40


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
FBF30_02335  PilS_PF08805  29     0.027    PilS N terminal
		>PilS_PF08805#PilS N terminal

Length = 185

Score = 28.7 bits (64), Expect = 0.027
Identities = 11/69 (15%), Positives = 27/69 (39%), Gaps = 4/69 (5%)

Query: 45 KHAYHPKRMMIVLSSIVAVSLIAVAIMLIYALSPTQQNKKTNGSDQSNKSTPQKDALTAK 104
+ + ++ ++ V +I V Y L Q+ + ++Q+N T +
Sbjct: 19 RRKKEQDKGATLMEVLLVVGVIVVLAASAYKLYSMVQSNIQSSNEQNNVLT----VIANM 74

Query: 105 QTVKNVAIY 113
+++K Y
Sbjct: 75 KSLKFQGRY 83


6     FBF30_02875  FBF30_02915  Y     N          Y                   Genomic Island
LocusTag     DNBias  CDNBias    %GCBias  Product
FBF30_02875      -1       16  -3.631586  basic amino acid ABC transporter
FBF30_02880      -1       17  -4.534959  hypothetical protein
FBF30_02885      -1       19  -6.280906  LD-carboxypeptidase
FBF30_02905       0       20  -6.923262  ***HpaII family restriction endonuclease
FBF30_02910       0       22  -7.098191  DNA cytosine methyltransferase
FBF30_02915       0       22  -7.346654  ATP-dependent endonuclease
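
In the island summary rows of this report, the Prediction text follows the Bias and Virulence flags: both "Y" gives a biased-composition pathogenicity island, Virulence "Y" alone gives an unbiased-composition one, and Bias "Y" alone gives a genomic island. The sketch below encodes only that observed pattern. It is an inference from these rows, not the documented PredictBias decision rule, and `predict_label` is a hypothetical name.

```python
def predict_label(bias, virulence):
    """Map the Bias and Virulence flags ('Y'/'N') to the Prediction text seen in this report.

    The Insertion elements flag is ignored because it does not change
    the label in any row shown here (an observation, not a documented rule).
    """
    if virulence == "Y":
        kind = "biased-composition" if bias == "Y" else "unbiased-composition"
        return f"Pathogenicity Island ({kind})"
    if bias == "Y":
        return "Genomic Island"
    # Neither flag set: no such row appears in this report.
    return None
```

Calling `predict_label("Y", "N")` reproduces the label of island 6, the only row with no virulence-factor hits.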
7     FBF30_03265  FBF30_03400  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias    %GCBias  Product
FBF30_03265       2       20   3.710650  phosphoglycerate kinase
FBF30_03270       3       25   4.471637  pyruvate kinase
FBF30_03275       4       29   4.537020  VIT family protein
FBF30_03280       3       31   4.150012  ABC transporter ATP-binding protein
FBF30_03285       2       29   4.917498  ABC transporter permease
FBF30_03290      -1       24   2.349075  ABC transporter permease
FBF30_03295       0       23   0.616110  ASCH domain-containing protein
FBF30_03300       1       21  -1.239623  ASCH domain-containing protein
FBF30_03305       1       21  -1.007666  hypothetical protein
FBF30_03310       2       29   0.733011  hypothetical protein
FBF30_03315       1       22  -2.766627  hypothetical protein
FBF30_03320       0       24  -1.503294  hypothetical protein
FBF30_03325       0       18  -3.755047  hypothetical protein
FBF30_03330       0       20  -4.559693  hypothetical protein
FBF30_03335       1       20  -1.476506  hypothetical protein
FBF30_03340       2       21  -1.246359  non-canonical purine NTP pyrophosphatase
FBF30_03345       2       20  -0.642793  hypothetical protein
FBF30_03350       0       17  -1.241917  GrpB family protein
FBF30_03355       0       18  -0.565527  HAMP domain-containing histidine kinase
FBF30_03360       0       16  -1.041185  response regulator transcription factor
FBF30_03365       3       21  -4.008053  CPBP family intramembrane metalloprotease
FBF30_03370       5       23  -6.276529  IS1595 family transposase
FBF30_03375       6       26  -6.998758  NUDIX hydrolase
FBF30_03380       7       29  -7.820139  hypothetical protein
FBF30_03385       6       29  -7.787766  hypothetical protein
FBF30_03390       6       26  -6.214512  CopG family transcriptional regulator
FBF30_03400       6       24  -1.922177  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
FBF30_03280  PF05272       31     0.004    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.8 bits (69), Expect = 0.004
Identities = 15/36 (41%), Positives = 19/36 (52%), Gaps = 1/36 (2%)

Query: 38 GSSGAGKSTLLGLLAGLDTPTDGQILF-DDQDIAEQ 72
G+ G GKSTL+ L GLD +D +D EQ
Sbjct: 603 GTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQ 638


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
FBF30_03360  PF06580       33     0.002    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 32.9 bits (75), Expect = 0.002
Identities = 43/220 (19%), Positives = 75/220 (34%), Gaps = 53/220 (24%)

Query: 235 LEHENKRITQLEKEKIAFLRAASHELKTPLAALRIMLENMQLN----------IGEYKNR 284
L ++ +I + AS + L AL+ Q+N I
Sbjct: 134 LYFGWHFFKNYKQAEIDQWKMASMAQEAQLMALK-----AQINPHFMFNALNNIRALILE 188

Query: 285 DQYLAESVAQVDRLAAMVNDVLRSGSVAEQALRQEKRLRIDKLLAEVVDDYMLLAKTR-- 342
D A + + L+ ++ LR + + +L E VVD Y+ LA +
Sbjct: 189 DPTKAREM--LTSLSELMRYSLRYSNARQVSLADEL---------TVVDSYLQLASIQFE 237

Query: 343 -GMTFEVNAEPTTIRANRDMMRHVISNLVSNAVRHG----DTGSVIAIT--CDQRELAIE 395
+ FE P + + ++ LV N ++HG G I + D + +E
Sbjct: 238 DRLQFENQINPAIMDVQ--VPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLE 295

Query: 396 NACKPLAKQQLQHVFDPFYRSSGSTKQRADSSGIGLYTVK 435
V + S + K +S+G GL V+
Sbjct: 296 -------------VENT---GSLALKNTKESTGTGLQNVR 319


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
FBF30_03365  HTHFIS        84     4e-21    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 84.1 bits (208), Expect = 4e-21
Identities = 31/121 (25%), Positives = 65/121 (53%), Gaps = 4/121 (3%)

Query: 2 IVEDEPALRSGTEQFLRQRGFTVVTATSGEEALKKFTEA--DVIILDIMLPGVSGIETLH 59
+ +D+ A+R+ Q L + G+ V ++ + D+++ D+++P + + L
Sbjct: 8 VADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDLLP 67

Query: 60 QIRQA-SDVPVLMLTALHDEPTQIASFDELADDYMSKPFSL-VILEKRIRALLRRQQSVK 117
+I++A D+PVL+++A + T I + ++ A DY+ KPF L ++ RAL ++
Sbjct: 68 RIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRRPS 127

Query: 118 K 118
K
Sbjct: 128 K 128


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
FBF30_03370  PF06580       30     0.008    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 30.2 bits (68), Expect = 0.008
Identities = 19/118 (16%), Positives = 38/118 (32%), Gaps = 15/118 (12%)

Query: 20 WFVAMAIAVAVTVPLQIAIGLPFELYALVTLAPFIAYLATIPLRHWRPSR-WQTVSAARW 78
W V + + T R + + W ++ +
Sbjct: 20 WGVYTLTGFGFASLYGS---PKLHSMIFNIAISLMGLVLTHAYRSFIKRQGWLKLNMGQI 76

Query: 79 AMSIISACLTIGVV--------GLLFVAIGYKP-HWQLPTAGASIGVFLLLQIFGAFT 127
+ ++ AC+ IG+V L I KP + LP A + I F ++ + ++
Sbjct: 77 ILRVLPACVVIGMVWFVANTSIWRLLAFINTKPVAFTLPLALSII--FNVVVVTFMWS 132


8     FBF30_03785  FBF30_03860  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias    %GCBias  Product
FBF30_03785       2       24  -1.469463  hypothetical protein
FBF30_03790       4       23  -0.671178  glycosyltransferase family 4 protein
FBF30_03795       4       21  -0.070728  glycosyltransferase family 4 protein
FBF30_03800       4       20   0.122454  NUDIX hydrolase
FBF30_03805       6       25   1.185593  nucleotide exchange factor GrpE
FBF30_03810       7       24   0.690210  transcriptional regulator
FBF30_03815       5       23  -2.872828  hypothetical protein
FBF30_03820       3       24  -4.732239  hypothetical protein
FBF30_03825       4       24  -2.972703  hypothetical protein
FBF30_03830       3       25  -2.109862  hypothetical protein
FBF30_03835       3       24  -3.588101  alpha/beta hydrolase
FBF30_03840       2       22  -4.913162  hypothetical protein
FBF30_03845       0       18  -2.397541  hypothetical protein
FBF30_03850       1       20   0.152816  GTP-binding protein LepA
FBF30_03855       0       17  -0.622845  hypothetical protein
FBF30_03860       2       17  -1.560001  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
FBF30_03845  PF04183       84     1e-19    IucA / IucC family
		>PF04183#IucA / IucC family

Length = 580

Score = 83.8 bits (207), Expect = 1e-19
Identities = 42/201 (20%), Positives = 69/201 (34%), Gaps = 23/201 (11%)

Query: 112 PTSSTRTLLTYDQPYTFMVKTDLEKRHYKFIRRLKGTSVEHSIAMSSELGSIC-KDEALS 170
S RTL + +K L + R + G + S L + D L
Sbjct: 254 AQQSLRTLTNASRRGGLDIKLPLTIYNTSCYRGIPGRYIAAGPLASRWLQQVFATDATLV 313

Query: 171 E--FAYLPESIGIIFGDE------------KTGAGVLFREIVPRPLVDDTRTLVPYFSLY 216
+ L E E + GV++RE R L D + V +L
Sbjct: 314 QSGAVILGEPAAGYVSHEGYAALARAPYRYQEMLGVIWRENPCRWLKPD-ESPVLMATLM 372

Query: 217 ANDIRNPEDRALLSQLIDLHSTKGQELAYFTEVILGKIIRNWTTLARDYGILPELHGQNT 276
D ++ L ID + + + ++ L YG+ HGQN
Sbjct: 373 ECD---ENNQPLAGAYIDRSGLDAET---WLTQLFRVVVVPLYHLLCRYGVALIAHGQNI 426

Query: 277 LLELNDNLEPERIVYRDFQGT 297
L + + + P+R++ +DFQG
Sbjct: 427 TLAMKEGV-PQRVLLKDFQGD 446


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
FBF30_03850  TCRTETOQM     45     2e-07    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family

signature.
Length = 639

Score = 45.2 bits (107), Expect = 2e-07
Identities = 17/80 (21%), Positives = 31/80 (38%), Gaps = 1/80 (1%)

Query: 123 EIREPWIDGEIVVPQDYIGAVIQLIVAKRGRQKNLSYIDERALISFTAPLANLLTDFYDQ 182
E+ EP++ +I PQ+Y+ + + ++S P A + ++
Sbjct: 534 ELLEPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQLKNNEVILSGEIP-ARCIQEYRSD 592

Query: 183 LKSVTSGYGSFNYELAGYQP 202
L T+G EL GY
Sbjct: 593 LTFFTNGRSVCLTELKGYHV 612



Score = 38.7 bits (90), Expect = 3e-05
Identities = 15/74 (20%), Positives = 36/74 (48%), Gaps = 7/74 (9%)

Query: 26 PVSNEDYNDLKEAIEKLSLSDSALQFE--PENSPVLGYGVRIGFLGLLHMDIIRERLERE 83
P + L +A+ ++S SD L++ ++ + FLG + M++ L+ +
Sbjct: 352 PSKPQQREMLLDALLEISDSDPLLRYYVDSATHEII-----LSFLGKVQMEVTCALLQEK 406

Query: 84 YNLDLIVTNPSTDY 97
Y++++ + P+ Y
Sbjct: 407 YHVEIEIKEPTVIY 420


9     FBF30_04055  FBF30_04140  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias    %GCBias  Product
FBF30_04055       0       19  -4.433112  hypothetical protein
FBF30_04060       0       21  -6.613666  hypothetical protein
FBF30_04065       0       21  -6.735116  DUF167 domain-containing protein
FBF30_04070       0       21  -6.404199  helix-turn-helix transcriptional regulator
FBF30_04075       1       21  -5.785508  type I restriction endonuclease subunit R
FBF30_04080       0       21  -6.083211  restriction endonuclease subunit S
FBF30_04085       1       22  -4.689485  hypothetical protein
FBF30_04090       3       25  -1.836636  virulence RhuM family protein
FBF30_04095       2       24  -1.593007  type I restriction-modification system subunit
FBF30_04100       1       25  -1.474547  DNA repair protein RadC
FBF30_04105      -1       19  -2.638445  HAMP domain-containing histidine kinase
FBF30_04110       0       20  -2.280885  response regulator transcription factor
FBF30_04115       0       19  -4.173867  LD-carboxypeptidase
FBF30_04120       0       19  -4.503352  hypothetical protein
FBF30_04125       0       19  -3.744201  TetR/AcrR family transcriptional regulator
FBF30_04130      -1       21  -3.239709  ferredoxin reductase
FBF30_04135      -1       21  -3.721880  winged helix-turn-helix transcriptional
FBF30_04140       0       18  -3.212740  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
FBF30_04105  PF06580       34     0.001    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 33.7 bits (77), Expect = 0.001
Identities = 20/108 (18%), Positives = 35/108 (32%), Gaps = 25/108 (23%)

Query: 227 ILAILVDNAVKY---VPSKVGKINLCVRSRKNTLEFIVKDNGPGIASADQKHIFERFYQA 283
++ LV+N +K+ + GKI L T+ V++ G ++
Sbjct: 259 LVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKE--------- 309

Query: 284 DTARTRTDVSGHGLGLA-IAKSLADRCG--YTIHVKSRLSAGAEFVLI 328
G GL + + L G I + + VLI
Sbjct: 310 ----------STGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLI 347


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
FBF30_04110  HTHFIS        91     2e-23    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 91.4 bits (227), Expect = 2e-23
Identities = 32/127 (25%), Positives = 64/127 (50%)

Query: 2 RILLVEDDVAIAQSLKEGLEDEAYAVDVAHDGDEGYRTATADDYDVIILDVMLPEMNGYE 61
IL+ +DD AI L + L Y V + + +R A D D+++ DV++P+ N ++
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 VCRALRQDGNQTPILMLTARDAERDIVEGLDMGADDYLAKPFSFEVLLARLRALLRRPNE 121
+ +++ P+L+++A++ ++ + GA DYL KPF L+ + L P
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 122 KLEEVLR 128
+ ++
Sbjct: 125 RPSKLED 131


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
FBF30_04125  HTHTETR       60     1e-13    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 60.0 bits (145), Expect = 1e-13
Identities = 21/85 (24%), Positives = 43/85 (50%), Gaps = 1/85 (1%)

Query: 1 MDKRQALKTAAYDVFSKKGYKETGISEIAKRAGVAVGSFYNYYDGKETIFLDVYIEENNR 60
+ RQ + A +FS++G T + EIAK AGV G+ Y ++ K +F +++ +
Sbjct: 10 QETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESN 69

Query: 61 VRQAMMDNIDWQ-QDLVELVRQIFE 84
+ + ++ D + ++R+I
Sbjct: 70 IGELELEYQAKFPGDPLSVLREILI 94


10    FBF30_00500  FBF30_00520  N     Y          N                   Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias    %GCBias  Product
FBF30_00500       0       21  -0.546699  type II secretion system F family protein
FBF30_00505       2       21  -0.710438  prepilin-type N-terminal cleavage/methylation
FBF30_00510       2       18  -1.140482  prepilin peptidase
FBF30_00515       1       17  -1.095182  type II secretion system protein
FBF30_00520       0       17  -2.006647  type II secretion system protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
FBF30_00500  BCTERIALGSPF  307    e-104    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F

signature.
Length = 408

Score = 307 bits (789), Expect = e-104
Identities = 125/403 (31%), Positives = 216/403 (53%), Gaps = 7/403 (1%)

Query: 1 MKKFTYEARDKSSNETVKSMVQADSESSAAKVLIEQGLMPLDIREINEDA------SFFN 54
M ++ Y+A D + + + +ADS A ++L E+GL+PL + E D
Sbjct: 1 MAQYHYQALD-AQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSL 59

Query: 55 RLTNRITTKDKVVFLRQMATLIGAGLPLAQSLHTVLEQTANKKMQQVVEEIIAEVEGGHT 114
R R++T D + RQ+ATL+ A +PL ++L V +Q+ + Q++ + ++V GH+
Sbjct: 60 RRKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHS 119

Query: 115 LSDSFGKHPDVFDKVVLALVAAGETSGTLDEALKRVAAQKEKDAAMMSKIRGAMVYPMIV 174
L+D+ P F+++ A+VAAGETSG LD L R+A E+ M S+I+ AM+YP ++
Sbjct: 120 LADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVL 179

Query: 175 LLVIVGVMIFMLIAVVPQVDKLYKDMHKTLPMLTQVMISVAGFLISYWWAVIIGLGIGGY 234
+V + V+ +L VVP+V + + M + LP+ T+V++ ++ + ++ +++ L G
Sbjct: 180 TVVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFM 239

Query: 235 FLRQYLKTEPGIKLKDTVKLNIPLFNGMFRKLYMARFTRTGQTLLSTGVAMLDMMRISSE 294
R L+ E L++PL + R L AR+ RT L ++ V +L MRIS +
Sbjct: 240 AFRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGD 299

Query: 295 SVNNTIISKSIDRAAEKVKGGKALSVALKPEDYILPMVPQMIKIGEQSGKIDEMMGKTAQ 354
++N + A + V+ G +L AL+ PM+ MI GE+SG++D M+ + A
Sbjct: 300 VMSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAAD 359

Query: 355 VYEDELDEEIKAISTAIEPVLMVVLAVFAGGMVGAILFPIYSL 397
+ E ++ EP+L+V +A +V AIL PI L
Sbjct: 360 NQDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQL 402


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
FBF30_00505  BCTERIALGSPH  35     3e-05    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H

signature.
Length = 170

Score = 35.3 bits (81), Expect = 3e-05
Identities = 14/46 (30%), Positives = 27/46 (58%), Gaps = 1/46 (2%)

Query: 6 LKQKGFTIIEVVLVLAIAALIFLMVFIALPALQRNQRDAARKQELQ 51
++Q+GFT++E++L+L + + MV +A PA R+ A +
Sbjct: 1 MRQRGFTLLEMMLILLLMGVSAGMVLLAFPA-SRDDSAAQTLARFE 45


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
FBF30_00510  PREPILNPTASE  109    3e-30    Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family

signature.
Length = 290

Score = 109 bits (274), Expect = 3e-30
Identities = 73/282 (25%), Positives = 126/282 (44%), Gaps = 16/282 (5%)

Query: 7 IVLVVLGSLFGSFACAQVWRLRARQLEVDRRDGEVVDESEYQRLKGLLRPVSRDRSECLY 66
++ + + GSF + RL + + + + + + RS C +
Sbjct: 17 SLVFLFSLMIGSFLNVVIHRLPIMLEREWQAEYRSYFNPDDEGVDEPPYNLMVPRSCCPH 76

Query: 67 CHHQLTWYDLLPILGWILVGGKCRYCRKPIGVAELLAEVGLAAAFVLSFFYWPYKFLTVA 126
C+H +T + +P+L W+ + G+CR C+ PI L E+ A V + T+A
Sbjct: 77 CNHPITALENIPLLSWLWLRGRCRGCQAPISARYPLVELLTALLSVAVAMTLAPGWGTLA 136

Query: 127 DVGLFSIWLIALVFMTILLIYDAKWSLLPFSLNISLIVVGAVFFCI---TSLQHGINIMS 183
+ L L+AL F D LLP L + L+ G +F + SL ++
Sbjct: 137 ALLLTW-VLVALTF------IDLDKMLLPDQLTLPLLWGGLLFNLLGGFVSLGDA--VIG 187

Query: 184 AGGGLLLLSGLYLLFSLF---GWVGVGDGILGFGLALFLGKWELAFLTLFLANVLGCCMM 240
A G L+L LY F L +G GD L L +LG W+ + L L++++G M
Sbjct: 188 AMAGYLVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLG-WQALPIVLLLSSLVGAFMG 246

Query: 241 IPLMAAKRIGRHARVPFGPFLIVATFIVMMWGNGVINWFFHT 282
I L+ + + +PFGP+L +A +I ++WG+ + W+
Sbjct: 247 IGLILLRNHHQSKPIPFGPYLAIAGWIALLWGDSITRWYLTN 288


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
FBF30_00515  BCTERIALGSPH  40     3e-06    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H

signature.
Length = 170

Score = 39.5 bits (92), Expect = 3e-06
Identities = 13/60 (21%), Positives = 26/60 (43%), Gaps = 3/60 (5%)

Query: 6 RGFTIIETMLVLAITGLVVAVVLVNIGTALRNEQYHTAVDQVHDYFQGQYSLTSAILNDR 65
RGFT++E ML+L + G+ +VL+ + + A Q ++ + +
Sbjct: 4 RGFTLLEMMLILLLMGVSAGMVLLAFPASRDD---SAAQTLARFEAQLRFVQQRGLQTGQ 60


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
FBF30_00520  BCTERIALGSPH  28     0.021    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H

signature.
Length = 170

Score = 27.6 bits (61), Expect = 0.021
Identities = 11/40 (27%), Positives = 21/40 (52%), Gaps = 2/40 (5%)

Query: 5 QRGDTIIEMMLAFAIFTLAAVGAMSILSSGVAITQRNLES 44
QRG T++EMML + ++A M +L+ + ++
Sbjct: 3 QRGFTLLEMMLILLLMGVSAG--MVLLAFPASRDDSAAQT 40


11    FBF30_01100  FBF30_01135  N     Y          N                   Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias    %GCBias  Product
FBF30_01100      -2       15   1.067818  TPM domain-containing protein
FBF30_01105       0       15   0.256254  cell division ATP-binding protein FtsE
FBF30_01110      -1       14   0.830015  ABC transporter permease
FBF30_01115      -2       11   1.282180  CHAP domain-containing protein
FBF30_01120      -1       12   0.850496  S41 family peptidase
FBF30_01125      -2       12   0.083300  response regulator
FBF30_01130      -2       13   0.055818  hypothetical protein
FBF30_01135      -2       12  -0.147236  50S ribosomal protein L27
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
FBF30_01100  cloacin       36     2e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 35.8 bits (82), Expect = 2e-04
Identities = 17/41 (41%), Positives = 21/41 (51%)

Query: 271 PSWWAGGTHFGGGSSSGGSSGGFGGGSFGGGGFSGGGASGS 311
W + +GGGS SG GG G GGG + GG SG+
Sbjct: 37 SGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGT 77



Score = 31.2 bits (70), Expect = 0.005
Identities = 13/38 (34%), Positives = 15/38 (39%)

Query: 274 WAGGTHFGGGSSSGGSSGGFGGGSFGGGGFSGGGASGS 311
W GG+ G G G GG GGG GG +
Sbjct: 46 WGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSA 83


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
FBF30_01115  GPOSANCHOR    50     8e-09    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 50.4 bits (120), Expect = 8e-09
Identities = 32/203 (15%), Positives = 72/203 (35%), Gaps = 6/203 (2%)

Query: 37 SQAFADRWDDQMRALNAQMQQYQSQASALNAQANTLQAQLDQITAQKNAILAQIDLSQKQ 96
+ F+ +++ L A+ ++ + L +A+ + A+ + +
Sbjct: 132 AMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEAR 191

Query: 97 YDALQKQIEETKQKIDDNKEALGRIIADMYVDGSITPLEMLASSKNIGDYVDQQEYRNSI 156
L+K +E + + + A+ + A + ++
Sbjct: 192 QAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTL 251

Query: 157 QNTLSDTIDQINSLKKKLESKQNEVKKVLDQQKDQKAQLAAKEAEQAELVAKTRNDEAAY 216
+ + + L+K LE N + K +A+ AA EAE+A+L +++ A
Sbjct: 252 EAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANR 311

Query: 217 QQMAND------AKAQVENAAAQ 233
Q + D AK Q+E +
Sbjct: 312 QSLRRDLDASREAKKQLEAEHQK 334



Score = 46.2 bits (109), Expect = 2e-07
Identities = 31/192 (16%), Positives = 72/192 (37%), Gaps = 24/192 (12%)

Query: 45 DDQMRALNAQMQQYQSQASALNAQANTLQAQLDQITAQKNAILAQIDLSQKQYDALQKQI 104
+ + L ++ + ++A +A+ TL+A+ + A+K + ++ + A +I
Sbjct: 189 EARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKI 248

Query: 105 EETKQKIDDNKEALGRIIADMYVDGSITPLEMLASSKN--IGDYVDQQEYRNSIQNTLSD 162
+ + + + LE + + + + L
Sbjct: 249 KTLEAEKAALEARQAE-------------LEKALEGAMNFSTADSAKIKTLEAEKAALEA 295

Query: 163 TIDQINSLKKKLESKQNEVKKVLDQQKDQKAQLAAKEAEQAELVAKTRNDEAAYQQMAND 222
+ + L + + +++ LD ++ K QL EAE +L + + EA+ Q + D
Sbjct: 296 EKADLEHQSQVLNANRQSLRRDLDASREAKKQL---EAEHQKLEEQNKISEASRQSLRRD 352

Query: 223 ------AKAQVE 228
AK Q+E
Sbjct: 353 LDASREAKKQLE 364



Score = 32.0 bits (72), Expect = 0.005
Identities = 31/167 (18%), Positives = 67/167 (40%), Gaps = 13/167 (7%)

Query: 45 DDQMRALNAQMQQYQSQASALNAQANTLQAQLDQITAQKNAILAQIDLSQKQYDA---LQ 101
+ + ++ L A+ L+A+ + Q + A ++ DA +
Sbjct: 266 EKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAK 325

Query: 102 KQIEETKQKIDDNKEALGRIIADMYVDGSITPLEMLASSKNIGDYVDQQEYRNSIQNTLS 161
KQ+E QK+++ + + D L +S+ ++ + + QN +S
Sbjct: 326 KQLEAEHQKLEEQNKISEASRQSLRRD--------LDASREAKKQLEAEHQKLEEQNKIS 377

Query: 162 DTIDQINSLKKKLESKQNEVKKVLDQQKDQKAQLAAKEAEQAELVAK 208
+ Q SL++ L++ + K+V ++ ++LAA E EL
Sbjct: 378 EASRQ--SLRRDLDASREAKKQVEKALEEANSKLAALEKLNKELEES 422


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
FBF30_01125  HTHFIS  78     6e-20    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 77.6 bits (191), Expect = 6e-20
Identities = 27/124 (21%), Positives = 57/124 (45%), Gaps = 3/124 (2%)

Query: 3 ITKKKILLVEDDIALAAVYRSRLELEGFEIHEVNNGEDALSAAVSFKPDLILLDAMMPKI 62
+T IL+ +DD A+ V L G+++ +N + DL++ D +MP
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 63 SGFDVLDILRNTPETTNIRVIMLTALSQPKDKERAEQLGVDDYLVKSQVVIGDVVARVKH 122
+ FD+L ++ ++ V++++A + +A + G DYL K + +++ +
Sbjct: 61 NAFDLLPRIKK--ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKP-FDLTELIGIIGR 117

Query: 123 HLGL 126
L
Sbjct: 118 ALAE 121


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
FBF30_01135  RTXTOXIND  28     0.004    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 28.3 bits (63), Expect = 0.004
Identities = 12/40 (30%), Positives = 19/40 (47%), Gaps = 5/40 (12%)

Query: 3 KVKAGGSSKNIHNNAGARLG---VKRFGGQKVSAGEVLVR 39
K+ G SK I + + VK G+ V G+VL++
Sbjct: 89 KLTHSGRSKEIKPIENSIVKEIIVKE--GESVRKGDVLLK 126


12  FBF30_02255  FBF30_02290  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
FBF30_02255   1      16        1.314477  elongation factor Tu
FBF30_02260   1      16        0.958763  NAD-dependent epimerase/dehydratase family
FBF30_02265   0      15        0.213015  hypothetical protein
FBF30_02270  -1      12        0.129676  glycosyltransferase
FBF30_02280   1      13       -0.395926  NUDIX domain-containing protein
FBF30_02285   1      15       -0.439031  type II secretion system protein
FBF30_02290   1      15       -0.361022  elongation factor G
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
FBF30_02265  TCRTETOQM  89     3e-21    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 88.8 bits (220), Expect = 3e-21
Identities = 53/189 (28%), Positives = 92/189 (48%), Gaps = 13/189 (6%)

Query: 11 INVGTMGHVDHGKTTLTAAI--SHVLSKKLPSDVNVPRDYDTIDNAPEEKARGITIASSH 68
IN+G + HVD GKTTLT ++ + +L S V + DN E+ RGITI +
Sbjct: 4 INIGVLAHVDAGKTTLTESLLYNSGAITELGS---VDKGTTRTDNTLLERQRGITIQTGI 60

Query: 69 IEYESANRHYAHVDMPGHADYVKNMITGAAQIDGAVLVVAANDGPLPQTREHVLLAKQVG 128
++ N +D PGH D++ + + +DGA+L+++A DG QTR +++G
Sbjct: 61 TSFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMG 120

Query: 129 VPKIVVFLNKMDLADPELVELVEMDVRELLTKNG------YDGDNAPIIKGSATKALEGD 182
+P I F+NK+D +L V D++E L+ N + + ++ +
Sbjct: 121 IPTI-FFINKIDQNGIDL-STVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTV 178

Query: 183 AAAEDAIMD 191
D +++
Sbjct: 179 IEGNDDLLE 187


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
FBF30_02270  NUCEPIMERASE  124    2e-35    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 124 bits (313), Expect = 2e-35
Identities = 72/330 (21%), Positives = 133/330 (40%), Gaps = 36/330 (10%)

Query: 1 MKVLIFGGAGYVGYELVKCFLGNGDTVAIYDAFCNG-DAS-AHDKVSALGS--VTVFEGT 56
MK L+ G AG++G+ + K L G V D + D S ++ L +
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 57 IADKKAVRRAIEEFCPDVVYNLAALHYIPYCIQHPDEVYETNYQGLQNIIQVLRDYPQTK 116
+AD++ + + V+ + Y +++P ++N G NI++ R
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120

Query: 117 FIFASSASVYGSPDQ-QCTLDTPVD-PNDIYGASKLAGEGLIKYQLSN-----FVIMRLF 169
++ASS+SVYG + + D VD P +Y A+K A E L+ + S+ +R F
Sbjct: 121 LLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANE-LMAHTYSHLYGLPATGLRFF 179

Query: 170 NVYGSLDPHPHLIPKVARAAVRGEDLDL-GAMEAKRDFVHVTDVAQAFF----------- 217
VYG + K +A + G+ +D+ + KRDF ++ D+A+A
Sbjct: 180 TVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIPHADT 239

Query: 218 --------VARNGCPGDTYIVATGETHSVKEVVDKIYQLSGSLGKVTYGTVGNMRAKDAS 269
A + P Y + + + + + G K ++ D
Sbjct: 240 QWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNM---LPLQPGDVL 296

Query: 270 CLSGDYSALR-ALGWAPMVQFDEGLRSAID 298
S D AL +G+ P +G+++ ++
Sbjct: 297 ETSADTKALYEVIGFTPETTVKDGVKNFVN 326


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
FBF30_02290  BCTERIALGSPH  41     4e-07    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 41.5 bits (97), Expect = 4e-07
Identities = 14/58 (24%), Positives = 30/58 (51%)

Query: 9 RQDGFTIVEIFITMAVIVILASVAIVGYNGLRDRTADADRATTAGQLKKLIQDAVVNN 66
RQ GFT++E+ + + ++ + A + ++ + RD +A A QL+ + Q +
Sbjct: 2 RQRGFTLLEMMLILLLMGVSAGMVLLAFPASRDDSAAQTLARFEAQLRFVQQRGLQTG 59


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
FBF30_02295  TCRTETOQM  554    0.0      Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 554 bits (1429), Expect = 0.0
Identities = 177/689 (25%), Positives = 303/689 (43%), Gaps = 72/689 (10%)

Query: 12 RNVGIIAHIDAGKTTTTEGILYRTGINHKIGEVKGDGDGATTDWMAQEKERGITITSAAV 71
N+G++AH+DAGKTT TE +LY +G ++G V D TD E++RGITI +
Sbjct: 4 INIGVLAHVDAGKTTLTESLLYNSGAITELGSV--DKGTTRTDNTLLERQRGITIQTGIT 61

Query: 72 TCFWKGHKINIIDTPGHIDFTAEVERSLRVLDGAVTVFDGKMGVEAQSETVWRQANKYGV 131
+ W+ K+NIIDTPGH+DF AEV RSL VLDGA+ + K GV+AQ+ ++ K G+
Sbjct: 62 SFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGI 121

Query: 132 PRICFVNKINQTGGDFYKSLESIRTRLSKQAFPIHLPIGFEKDICGVVDLIDMKAYTYDY 191
P I F+NKI+Q G D + I+ +LS + + ++C V + + + +
Sbjct: 122 PTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQ-KVELYPNMC-VTNFTESEQW---- 175

Query: 192 FTDHELKVGEIPADMLEKAKNARSLLVENAVEADEDLMMKFFDEGEESITVDELKSALRK 251
+ +E ++DL+ K+ +S+ EL+
Sbjct: 176 ---------------------------DTVIEGNDDLLEKYMS--GKSLEALELEQEESI 206

Query: 252 RVLAGDFYLVTGGDGRGVI-VEKVLDLITDYLPSPLDIDEIWGKNPKTGDEVSRKPDEKE 310
R + V G + I ++ ++++IT+ S +
Sbjct: 207 RFHNCSLFPVYHGSAKNNIGIDNLIEVITNKFYSSTH-------------------RGQS 247

Query: 311 PMAALAFKIAADPFVGKLIFIRVYSGVLTAGSYVLNTTTGEKERIGRIVRMHADKREDID 370
+ FKI +L +IR+YSGVL V + EK +I + + ID
Sbjct: 248 ELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEK-EKIKITEMYTSINGELCKID 306

Query: 371 KVGAGDIAAVVG--LK-NTFTGNTLAELAHPIALESIEFPDPPVSIAVEPKTKADQEKMG 427
K +G+I + LK N+ G+T E IE P P + VEP +E +
Sbjct: 307 KAYSGEIVILQNEFLKLNSVLGDTKLL----PQRERIENPLPLLQTTVEPSKPQQREMLL 362

Query: 428 IALQRLAEEDPTFRIHTDEETGQTIMSGMGELHLEILIDRMKREFNVEANVGEPQVAFRE 487
AL +++ DP R + D T + I+S +G++ +E+ ++ +++VE + EP V + E
Sbjct: 363 DALLEISDSDPLLRYYVDSATHEIILSFLGKVQMEVTCALLQEKYHVEIEIKEPTVIYME 422

Query: 488 TIKGMAEAQGKHAKQSGGRGQYGDVWVRFEPNETGKGFEFIDEIKGGVVPQEYRPAVQKG 547
AE + + + + P G G ++ + G + Q ++ AV +G
Sbjct: 423 RPLKKAEY--TIHIEVPPNPFWASIGLSVSPLPLGSGMQYESSVSLGYLNQSFQNAVMEG 480

Query: 548 IKEVLDGGVIAGYPVVDVKATLYDGSYHDVDSSELAFSLAGGLAAREGIKKATPVLLEPV 607
I+ + G + G+ V D K G Y+ S+ F + + + +KKA LLEP
Sbjct: 481 IRYGCEQG-LYGWNVTDCKICFKYGLYYSPVSTPADFRMLAPIVLEQVLKKAGTELLEPY 539

Query: 608 MHVEVTTPEEFMGDIIGDLNSRRGRIEAMEDLMGGAKLVKAIVPLANMFGYTSDIRSMSQ 667
+ ++ P+E++ D I + L ++ +P + Y SD+ +
Sbjct: 540 LSFKIYAPQEYLSRAYTDAPKYCANIVDTQ-LKNNEVILSGEIPARCIQEYRSDLTFFTN 598

Query: 668 GRAASTMELAHYEEVPPNVAQEIIEKRSK 696
GR+ EL Y + + + R
Sbjct: 599 GRSVCLTELKGYH---VTTGEPVCQPRRP 624


13  FBF30_03025  FBF30_03075  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
FBF30_03025  -2      15       -0.717197  hypothetical protein
FBF30_03030  -3      13       -0.939987  GNAT family N-acetyltransferase
FBF30_03040  -2      13       -1.127685  *alpha/beta hydrolase
FBF30_03055  -1      17       -1.031346  **peptide ABC transporter substrate-binding
FBF30_03060   0      16       -1.707300  preprotein translocase subunit SecG
FBF30_03065   0      16       -1.401612  phage holin family protein
FBF30_03070   0      15       -1.114957  M24 family metallopeptidase
FBF30_03075   0      17       -0.602420  prepilin-type N-terminal cleavage/methylation
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
FBF30_03025  PF06580  29     0.036    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 28.7 bits (64), Expect = 0.036
Identities = 18/129 (13%), Positives = 43/129 (33%), Gaps = 7/129 (5%)

Query: 219 GLAQNGFGSMLFWLFAIVVAVLVFYWATSTLIALVVVTLPGMYPLRALKASSDLVIGRRL 278
G+ + + ++F A S + ++ R+ +
Sbjct: 21 GVYTLTGFGFASLYGSPKLHSMIFNIAISLMGLVLTH------AYRSFIKRQGWLKLNMG 74

Query: 279 RILYRWLWAALIIILAWMIVMIPVILLDTVIKSALPAIQNVPIVPYVGAFMSSATVVWFA 338
+I+ R L A ++I + W + + L I + P +P+ + + T +W
Sbjct: 75 QIILRVLPACVVIGMVWFVANTSIWRLLAFINTK-PVAFTLPLALSIIFNVVVVTFMWSL 133

Query: 339 SYVYLLYRR 347
Y + +
Sbjct: 134 LYFGWHFFK 142


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
FBF30_03030  SACTRNSFRASE  38     2e-05    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 37.6 bits (87), Expect = 2e-05
Identities = 18/67 (26%), Positives = 31/67 (46%), Gaps = 1/67 (1%)

Query: 96 IGPLIVSEEYREKLGIGSALLEYAEEFARGLGASRVYCTVAKPNQRALIFFLRKGFCVAG 155
I + V+++YR+K G+G+ALL A E+A+ + N A F+ + F +
Sbjct: 92 IEDIAVAKDYRKK-GVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFIIGA 150

Query: 156 TAREQYK 162
Y
Sbjct: 151 VDTMLYS 157


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
FBF30_03060  SECGEXPORT  33     2e-05    Protein-export SecG membrane protein signature.
		>SECGEXPORT#Protein-export SecG membrane protein signature.

Length = 110

Score = 33.4 bits (76), Expect = 2e-05
Identities = 22/66 (33%), Positives = 35/66 (53%), Gaps = 2/66 (3%)

Query: 6 ILQIVSIVSAVLMIVAILLQQ-RGASLGAGFG-GSSELYTTRRGLDKNLFEVTIFLAVTF 63
L +V ++ A+ ++ I+LQQ +GA +GA FG G+S G + +T LA F
Sbjct: 4 ALLVVFLIVAIGLVGLIMLQQGKGADMGASFGAGASATLFGSSGSGNFMTRMTALLATLF 63

Query: 64 VLSILV 69
+ LV
Sbjct: 64 FIISLV 69


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
FBF30_03075  BCTERIALGSPG  47     9e-10    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 47.2 bits (112), Expect = 9e-10
Identities = 23/88 (26%), Positives = 44/88 (50%), Gaps = 9/88 (10%)

Query: 1 MKRRQGFTIVEVIVVIVIIASLALLTVFAFGAWRKRTAKTEMRQEIMTVVSSLKSYQAFQ 60
+++GFT++E++VVIVII LA L V +++ K + +I+ + ++L Y+
Sbjct: 4 TDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLDN 63

Query: 61 NKFPGT-PGGSAVSPRTISGLTYKPSAN 87
+ +P T G + L P+
Sbjct: 64 HHYPTTNQGLES--------LVEAPTLP 83


14  FBF30_03445  FBF30_03490  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
FBF30_03445  -1      11       -0.661852  response regulator
FBF30_03450   1      18        0.580029  bifunctional 5,10-methylenetetrahydrofolate
FBF30_03455   2      23        0.903820  hypothetical protein
FBF30_03470   2      23        1.655392  **UDP-N-acetylmuramate dehydrogenase
FBF30_03475   1      24        1.745866  prepilin-type N-terminal cleavage/methylation
FBF30_03480  -1      21        1.372695  prepilin-type N-terminal cleavage/methylation
FBF30_03485  -1      20        0.692876  hypothetical protein
FBF30_03490  -1      13       -1.553670  type II secretion system protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
FBF30_03445  HTHFIS  70     3e-17    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 69.9 bits (171), Expect = 3e-17
Identities = 22/104 (21%), Positives = 44/104 (42%), Gaps = 4/104 (3%)

Query: 2 TKIVIIEDDQVINQMYRMKFEAAGFDVATASDGQAGIKMAEKFKPEIILLDLQMPNMGGA 61
I++ +DD I + AG+DV S+ + ++++ D+ MP+
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 EALEIIRKSTWGAKIPVIILTNLGE-EEAPKLLRSLGIHSYIVK 104
+ L I+K +PV++++ A K G + Y+ K
Sbjct: 64 DLLPRIKK--ARPDLPVLVMSAQNTFMTAIKASE-KGAYDYLPK 104


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
FBF30_03475  BCTERIALGSPG  49     1e-09    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 48.7 bits (116), Expect = 1e-09
Identities = 17/54 (31%), Positives = 32/54 (59%)

Query: 1 MNARRGFTIVEIVIVMVIMAILIGLAVLNISSTQANARDNKRKTDVENIARGLE 54
+ +RGFT++EI++V+VI+ +L L V N+ + A K +D+ + L+
Sbjct: 4 TDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALD 57


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
FBF30_03485  BCTERIALGSPG  33     0.001    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 33.3 bits (76), Expect = 0.001
Identities = 15/51 (29%), Positives = 32/51 (62%), Gaps = 5/51 (9%)

Query: 11 IKRNRRHAGFTLVEMLAVAPIVLIVIGVLISAMVS-MIGDALVANARTVVA 60
++ + GFTL+E++ +V+++IGVL S +V ++G+ A+ + V+
Sbjct: 1 MRATDKQRGFTLLEIM----VVIVIIGVLASLVVPNLMGNKEKADKQKAVS 47


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
FBF30_03490  BCTERIALGSPG  53     1e-11    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 52.6 bits (126), Expect = 1e-11
Identities = 21/78 (26%), Positives = 37/78 (47%)

Query: 1 MTKQTKSSGFTIVELLIVIVVIAILAAITIVAYNGIQNRANDTAAKETASQFRTKIEAYN 60
M K GFT++E+++VIV+I +LA++ + G + +A+ A ++ Y
Sbjct: 1 MRATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYK 60

Query: 61 TIKSKYPATATTASALVT 78
YP T +LV
Sbjct: 61 LDNHHYPTTNQGLESLVE 78



 
Contact Sachin Pundhir for Bugs/Comments.