PredictBias

Identification of genomic and pathogenicity islands in prokaryotic genomes
 
A) Input parameters
Genome: CP027306.gbk
Threshold dinucleotide bias: 2
Threshold codon bias: 4
Threshold %GC bias: 3
E-value (RPSBlast): 0.05
Genome (non-pathogenic): (not specified)
 
C) Potential GIs and PAIs in CP027306
S.No  Start        End          Bias  Virulence  Insertion elements  Prediction
1     C5746_00010  C5746_00140  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_00010  2       14       -0.738694  hypothetical protein
C5746_00015  2       12       -0.425623  helicase
C5746_00020  3       11       -0.287740  hypothetical protein
C5746_00025  3       9        0.109097   cellulose 1,4-beta-cellobiosidase
C5746_00030  4       13       0.053601   integrase
C5746_00035  2       13       0.360102   XRE family transcriptional regulator
C5746_00045  3       11       -0.417924  IS6 family transposase
C5746_00050  3       12       -0.012888  alpha/beta hydrolase
C5746_00055  1       14       -0.187867  erythromycin esterase
C5746_00065  1       18       -3.268471  IS6 family transposase
C5746_00070  2       22       -3.196818  hypothetical protein
C5746_00075  0       24       -1.881208  hypothetical protein
C5746_00080  -1      23       -0.805968  hypothetical protein
C5746_00090  0       25       0.309398   IS5/IS1182 family transposase
C5746_00095  0       22       0.514857   hypothetical protein
C5746_00100  0       25       1.041766   hypothetical protein
C5746_00110  2       21       -0.210132  hypothetical protein
C5746_00115  3       12       -0.591226  ABC transporter
C5746_00120  3       11       -1.201923  transposase
C5746_00130  2       13       -0.962995  hypothetical protein
C5746_00140  2       13       -0.659477  hypothetical protein
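The per-gene rows above can be read back into structured records. The following is a minimal Python sketch, not part of PredictBias itself. It assumes a layout inferred from this report only: the locus tag is `C5746_` plus five digits, DNBias is a single digit with an optional minus sign, CDNBias is a non-negative integer, and %GCBias has one integer digit and six decimals. The names `GeneRow` and `parse_gene_row` are hypothetical. Whitespace between fields is optional, so rows whose fields run together are handled as well.

```python
import re
from typing import NamedTuple, Optional


class GeneRow(NamedTuple):
    locus_tag: str
    dn_bias: int      # dinucleotide bias (DNBias column)
    cdn_bias: int     # codon bias (CDNBias column)
    gc_bias: float    # %GC bias (%GCBias column)
    product: str


# Assumed field shapes (inferred from this report, not from documentation):
#   locus tag : "C5746_" + 5 digits
#   DNBias    : one digit, optional leading minus
#   CDNBias   : one or more digits
#   %GCBias   : optional minus, one digit, ".", six decimals
ROW_RE = re.compile(
    r"^(C5746_\d{5})\s*(-?\d)\s*(\d+)\s*(-?\d\.\d{6})\s*(.*)$"
)


def parse_gene_row(line: str) -> Optional[GeneRow]:
    """Return a GeneRow for a per-gene table line, or None if it does not match."""
    m = ROW_RE.match(line.strip())
    if m is None:
        return None
    tag, dn, cdn, gc, product = m.groups()
    return GeneRow(tag, int(dn), int(cdn), float(gc), product)


if __name__ == "__main__":
    for text in (
        "C5746_00025390.109097cellulose 1,4-beta-cellobiosidase",
        "C5746_00080  -1  23  -0.805968  hypothetical protein",
    ):
        print(parse_gene_row(text))
```

Because %GCBias is assumed to have exactly six decimals, a product name that begins with a digit (for example "3-oxoacyl-ACP reductase") is still split off correctly.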
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_00025  PF07675       34     0.002    Cleaved Adhesin
		>PF07675#Cleaved Adhesin

Length = 1358

Score = 34.3 bits (78), Expect = 0.002
Identities = 19/56 (33%), Positives = 28/56 (50%), Gaps = 9/56 (16%)

Query: 681 DQGATVSYNVYRSTSSSDVFTPEHRIATGVTALTYKDPGLAPRYYYYVVTAVAAEG 736
+ +Y +YR+ + +IA+GVT TY+DP LA +Y Y V V G
Sbjct: 1254 GNAPSYTYTIYRNNT---------QIASGVTETTYRDPDLATGFYTYGVKVVYPNG 1300
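
The percentages on the "Identities" line are simple ratios of counts over the alignment length. The sketch below (hypothetical helper `parse_alignment_stats`, written for illustration only) recomputes them from the fractions. For this alignment, 19/56 is about 33.9%, so the reported 33% appears to be truncated rather than rounded; that reading is an inference from the numbers in this report.

```python
import re


def parse_alignment_stats(line: str) -> dict:
    """Parse 'Name = a/b (p%)' items from a BLAST-style statistics line."""
    stats = {}
    for name, num, den, pct in re.findall(r"(\w+) = (\d+)/(\d+) \((\d+)%\)", line):
        count, length = int(num), int(den)
        stats[name] = {
            "count": count,
            "length": length,
            "reported_pct": int(pct),
            "computed_pct": 100.0 * count / length,
        }
    return stats


if __name__ == "__main__":
    line = "Identities = 19/56 (33%), Positives = 28/56 (50%), Gaps = 9/56 (16%)"
    for name, s in parse_alignment_stats(line).items():
        print(f"{name}: {s['count']}/{s['length']} = {s['computed_pct']:.1f}% "
              f"(reported {s['reported_pct']}%)")
```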


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_00060  BACYPHPHTASE  30     0.018    Salmonella/Yersinia modular tyrosine phosphatase si...
		>BACYPHPHTASE#Salmonella/Yersinia modular tyrosine phosphatase signature.

Length = 468

Score = 30.2 bits (67), Expect = 0.018
Identities = 18/44 (40%), Positives = 22/44 (50%)

Query: 185 GGFAGAELYDKVNAYAAAARPELAPRLTELYRGLRPATDAETYV 228
G A A V+ Y AR EL+ RLT L L PAT+ Y+
Sbjct: 175 AGEARATAPSTVSPYGPEARAELSSRLTTLRNTLAPATNDPRYL 218
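
Each hit is reported with a bit score and an Expect value, and the input parameters list an RPSBlast E-value of 0.05. The sketch below parses a "Score = ..., Expect = ..." line and compares the Expect value with that cutoff. The function names are hypothetical, and whether PredictBias treats the cutoff as inclusive is not stated in this report; the comparison with `<=` is an assumption.

```python
import re
from typing import Optional, Tuple

E_VALUE_CUTOFF = 0.05  # "E-value (RPSBlast)" from the input parameters

SCORE_RE = re.compile(r"Score = ([\d.]+) bits \((\d+)\), Expect = ([\d.eE+-]+)")


def parse_score_line(line: str) -> Optional[Tuple[float, int, float]]:
    """Return (bit score, raw score, E-value) or None if the line does not match."""
    m = SCORE_RE.search(line)
    if m is None:
        return None
    return float(m.group(1)), int(m.group(2)), float(m.group(3))


def passes_cutoff(e_value: float, cutoff: float = E_VALUE_CUTOFF) -> bool:
    """Assumption: a hit is kept when its E-value does not exceed the cutoff."""
    return e_value <= cutoff


if __name__ == "__main__":
    for text in ("Score = 30.2 bits (67), Expect = 0.018",
                 "Score = 45.7 bits (108), Expect = 2e-07"):
        bits, raw, e = parse_score_line(text)
        print(bits, raw, e, passes_cutoff(e))
```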


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_00140  TONBPROTEIN   46     2e-07    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 45.7 bits (108), Expect = 2e-07
Identities = 16/60 (26%), Positives = 18/60 (30%)

Query: 15 PQRQPPPPAIPPHQPLPDGTAPLPPAPPTVPPPPPLPPAPPTVPPPPPPHPPAAPPAGPA 74
P PP A+ P P P P P P+ P P P P P P
Sbjct: 52 PADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPK 111



Score = 34.2 bits (78), Expect = 0.001
Identities = 18/76 (23%), Positives = 20/76 (26%), Gaps = 4/76 (5%)

Query: 5 QPGSSADEPEPQRQP-PPPAIPPHQPLPDGTAPLPPAPPTVPPPPPLPPAPPTVPPPPPP 63
EPEP+ P PP P P P P P P P P
Sbjct: 65 PEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPVESRPASP 124

Query: 64 ---HPPAAPPAGPAGK 76
PA + A
Sbjct: 125 FENTAPARLTSSTATA 140
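
The summary rows in this report pair three Y/N flags (Bias, Virulence, Insertion elements) with a Prediction label. The sketch below records only the combinations that actually occur in this report as a lookup table. It is an observation about these rows, not PredictBias's documented decision rule, and the function name `label_island` is hypothetical.

```python
# Flag combinations (Bias, Virulence, Insertion elements) seen in this report
# and the Prediction label printed next to them. Other combinations do not
# occur here, so no label is guessed for them.
OBSERVED_LABELS = {
    ("Y", "Y", "Y"): "Pathogenicity Island (biased-composition)",
    ("Y", "Y", "N"): "Pathogenicity Island (biased-composition)",
    ("Y", "N", "N"): "Genomic Island",
}


def label_island(bias: str, virulence: str, insertion_elements: str) -> str:
    """Return the label observed for a flag triple, or a placeholder if unseen."""
    key = (bias, virulence, insertion_elements)
    return OBSERVED_LABELS.get(key, "not observed in this report")


if __name__ == "__main__":
    print(label_island("Y", "Y", "Y"))
    print(label_island("Y", "N", "N"))
```

In the rows shown, the label depends on the Virulence flag: islands with Virulence = Y are called pathogenicity islands, and those with Virulence = N are called genomic islands.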


2     C5746_00225  C5746_00450  Y     N          N                   Genomic Island
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_00225  2       15       -1.365822  DUF4262 domain-containing protein
C5746_00230  3       17       -1.754034  hypothetical protein
C5746_00235  3       19       -2.156854  hypothetical protein
C5746_00245  4       16       -1.695596  hypothetical protein
C5746_00250  4       12       -2.586320  hypothetical protein
C5746_00260  1       18       -2.420594  hypothetical protein
C5746_00265  -1      14       -2.884301  hypothetical protein
C5746_00270  -1      16       -2.618268  LuxR family transcriptional regulator
C5746_00275  0       11       -3.451108  hypothetical protein
C5746_00280  0       11       -3.369121  ABC transporter substrate-binding protein
C5746_00285  1       9        -3.850900  lactose ABC transporter permease
C5746_00290  1       9        -4.413628  carbohydrate ABC transporter permease
C5746_00295  1       13       -3.793860  alpha-galactosidase
C5746_00300  0       12       -4.034546  LacI family transcriptional regulator
C5746_00305  0       15       -3.265894  hypothetical protein
C5746_00310  -1      18       -3.231988  hypothetical protein
C5746_00320  -1      24       -2.232092  hypothetical protein
C5746_00325  1       20       -2.324841  hypothetical protein
C5746_00335  0       16       -1.988299  phage tail protein
C5746_00340  1       12       0.225947   hypothetical protein
C5746_00345  1       12       0.890452   hypothetical protein
C5746_00350  0       12       0.732417   hypothetical protein
C5746_00360  3       13       2.580229   hypothetical protein
C5746_00370  2       13       1.724759   hypothetical protein
C5746_00380  2       13       2.252181   hypothetical protein
C5746_00385  2       11       3.380342   baseplate assembly protein
C5746_00390  2       11       3.457422   hypothetical protein
C5746_00395  2       11       3.238212   hypothetical protein
C5746_00400  2       11       2.955693   putative baseplate assembly protein
C5746_00405  2       11       3.045805   putative baseplate assembly protein
C5746_00410  1       10       2.452210   hypothetical protein
C5746_00420  0       12       -0.377449  hypothetical protein
C5746_00425  1       11       -0.717499  hypothetical protein
C5746_00430  1       12       -0.339703  hypothetical protein
C5746_00435  2       12       -0.470486  hypothetical protein
C5746_00440  1       11       -0.235666  hypothetical protein
C5746_00445  2       12       0.486151   antibiotic ABC transporter ATP-binding protein
C5746_00450  2       14       -0.154147  ABC transporter
3     C5746_00535  C5746_00620  Y     N          N                   Genomic Island
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_00535  -2      15       -3.007317  methyltransferase type 11
C5746_00540  -3      15       -2.587565  hypothetical protein
C5746_00545  -2      13       -1.581049  hypothetical protein
C5746_00555  3       18       0.160647   cation transporter
C5746_00565  4       30       0.176288   transcriptional regulator
C5746_00570  3       29       -0.413879  hypothetical protein
C5746_00580  -1      30       0.370461   hypothetical protein
C5746_00585  0       27       -0.326750  hypothetical protein
C5746_00595  0       26       -0.946610  restriction endonuclease
C5746_00600  3       19       -0.066122  phytanoyl-CoA dioxygenase family protein
C5746_00610  3       13       -0.818096  hypothetical protein
C5746_00615  3       12       -0.617130  hypothetical protein
C5746_00620  3       11       -0.232867  hypothetical protein
4     C5746_00705  C5746_00790  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_00705  2       14       -0.739494  hypothetical protein
C5746_00710  2       16       -1.110755  hypothetical protein
C5746_00715  3       12       -1.673067  PmrA
C5746_00725  5       10       -0.177854  3-oxoacyl-ACP reductase
C5746_00730  4       13       0.623890   TetR family transcriptional regulator
C5746_00735  4       14       1.458753   transposase
C5746_00740  3       14       1.863395   IS200/IS605 family transposase
C5746_00745  5       14       0.831341   GNAT family N-acetyltransferase
C5746_00750  5       14       0.423380   hypothetical protein
C5746_00755  1       15       -1.934682  protein phosphatase
C5746_00760  1       17       -2.444860  alpha/beta hydrolase
C5746_00770  1       18       -4.125763  alpha/beta hydrolase
C5746_00780  3       21       -2.268811  TetR family transcriptional regulator
C5746_00790  2       13       0.062513   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_00760  DHBDHDRGNASE  113    2e-32    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 113 bits (283), Expect = 2e-32
Identities = 74/262 (28%), Positives = 119/262 (45%), Gaps = 28/262 (10%)

Query: 10 AVVTGASRGIGLATVQALTAEGVRVVAAARTITPELKETGAL--------AIPVDLLTPD 61
A +TGA++GIG A + L ++G + A K +L A P D+
Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDSA 70

Query: 62 APAQLIDRATTELGDLDLLVNNVGGGDGGEGQTGGFLSFTDQQWQQSLDLNFLAAVRTSR 121
A ++ R E+G +D+LVN G + G S +D++W+ + +N SR
Sbjct: 71 AIDEITARIEREMGPIDILVNV-----AGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 122 AALPSLLRRR-GALVNISSNGARMPHAGPVTYTTAKAALTAFGKALAEEFGPQGVRINTI 180
+ ++ RR G++V + SN A +P Y ++KAA F K L E +R N +
Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185

Query: 181 SPGPVRTAM----WESPDGYGAELARSMGVTQEQLLAQIPAAMGMTTGRLLEPSEVATAV 236
SPG T M W +G + S+ E IP +L +PS++A AV
Sbjct: 186 SPGSTETDMQWSLWADENGAEQVIKGSL----ETFKTGIP------LKKLAKPSDIADAV 235

Query: 237 AYLASPLAASMSGTDLLIDGGS 258
+L S A ++ +L +DGG+
Sbjct: 236 LFLVSGQAGHITMHNLCVDGGA 257


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_00765  HTHTETR       55     2e-11    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 54.6 bits (131), Expect = 2e-11
Identities = 36/199 (18%), Positives = 68/199 (34%), Gaps = 16/199 (8%)

Query: 6 RALRADTERTVRTILEAAERVLAADP--AATMEQIAAAAGVARTTVHRRFATREALVEAL 63
R + + + T + IL+ A R+ + + ++ +IA AAGV R ++ F + L +
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 64 ATWATERFHQAV-EAARPLTSPPLVALHQVTANVLQVKI------GWSFAMSRTAPSDPE 116
+ + E PL L ++ +VL+ + + E
Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGE 122

Query: 117 AARVHA-------DVLAQCDQLFRRAQEAGLVSADTDLEWARRVYYALIHEASEEGREVV 169
A V + + +Q + EA ++ AD A + I E
Sbjct: 123 MAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAP 182

Query: 170 DIGAVDQLAARVVDTLLHG 188
+ + A V LL
Sbjct: 183 QSFDLKKEARDYVAILLEM 201


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_00785  SACTRNSFRASE  36     4e-05    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 35.7 bits (82), Expect = 4e-05
Identities = 20/92 (21%), Positives = 33/92 (35%), Gaps = 7/92 (7%)

Query: 60 EQLRACDESFLGVRDELRLVGAVAWTRLPNGALDICRLVVHPVAHRRGVATALLDALDSI 119
+ ++ E +G + NG I + V ++GV TALL +I
Sbjct: 58 SYVEEEGKAAFLYYLENNCIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHK--AI 115

Query: 120 E-----PAELTIVSTGTANLPAVALYRRRGFI 146
E ++ T N+ A Y + FI
Sbjct: 116 EWAKENHFCGLMLETQDINISACHFYAKHHFI 147


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_00800  BACYPHPHTASE  28     0.019    Salmonella/Yersinia modular tyrosine phosphatase si...
		>BACYPHPHTASE#Salmonella/Yersinia modular tyrosine phosphatase signature.

Length = 468

Score = 27.8 bits (61), Expect = 0.019
Identities = 9/34 (26%), Positives = 18/34 (52%)

Query: 91 CGGGVGRTGTALSAICVFEGMDPKEAVKWVRQNY 124
C GVGRT + A+C+ + + + +V+ +
Sbjct: 403 CRAGVGRTAQLIGAMCMNDSRNSQLSVEDMVSQM 436


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_00815  CHLAMIDIAOM6  30     0.015    Chlamydia cysteine-rich outer membrane protein 6 si...
		>CHLAMIDIAOM6#Chlamydia cysteine-rich outer membrane protein 6 signature.

Length = 547

Score = 30.0 bits (67), Expect = 0.015
Identities = 16/54 (29%), Positives = 28/54 (51%), Gaps = 2/54 (3%)

Query: 255 IVGNALGFDAMPSTAEREEAFAAFDLRPLLDQDANGPMLVVNGTEDVHVPLDDT 308
I GN + FD++P +E + L+ + DA G ++ + T + VP+ DT
Sbjct: 490 ITGNTVVFDSLPRLGSKETVEFSVTLKAVSAGDARGEAILSSDT--LTVPVSDT 541


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_00820  HTHTETR       66     1e-15    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 66.2 bits (161), Expect = 1e-15
Identities = 29/205 (14%), Positives = 65/205 (31%), Gaps = 19/205 (9%)

Query: 11 RRPGGRAARVRQAVLAATMEVLAEEGIARLSIAEVAARAGVNETTVYRRWGSREKLVLDA 70
R+ A RQ +L + + +++G++ S+ E+A AGV +Y + + L +
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 71 M------LAGSDEGIPVPDTGTVRTDLAAFARALTEYLATPTGRAVARAASLSSDD---- 120
+ + G + L + E T R + +
Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGE 122

Query: 121 -PDLAEAWQTFWQSRLDQAGAIVSRAVERGELPTDTDAALALELLCSPLQTRSLLGHRPI 179
+ +A + D+ + +E LP D A ++ + L+ +
Sbjct: 123 MAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYIS--GLMENWLF 180

Query: 180 EPDLPERLT------DLVLDGLRGR 198
P + ++L+
Sbjct: 181 APQSFDLKKEARDYVAILLEMYLLC 205


5     C5746_01115  C5746_01185  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_01115  1       26       -3.140996  IS4 family transposase
C5746_01120  2       22       -2.666489  MFS transporter
C5746_01125  2       22       -2.610642  hypothetical protein
C5746_01130  2       23       -1.909144  transposase
C5746_01135  3       24       -1.213765  hypothetical protein
C5746_01140  3       15       -0.150625  hypothetical protein
C5746_01145  1       13       1.152313   hypothetical protein
C5746_01150  1       18       1.494943   hypothetical protein
C5746_01155  1       18       0.658879   2-polyprenyl-6-methoxyphenol hydroxylase
C5746_01160  0       22       -1.206350  DUF3103 domain-containing protein
C5746_01165  0       22       -1.255204  hypothetical protein
C5746_01170  1       26       -3.005915  hypothetical protein
C5746_01175  1       28       -4.172011  hypothetical protein
C5746_01180  0       21       -4.603513  hypothetical protein
C5746_01185  -1      20       -3.872818  LacI family transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_01165  TCRTETB       111    1e-28    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 111 bits (278), Expect = 1e-28
Identities = 87/405 (21%), Positives = 169/405 (41%), Gaps = 29/405 (7%)

Query: 27 FVVIMDTSIIGVALPKMQADLGFSQENLSWVFNAYVVAFGGLLLLGGRLSDLLGAKRLFS 86
F +++ ++ V+LP + D + +WV A+++ F + G+LSD LG KRL
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 87 AGWVVLAVGSLTAGLA-SQVWVELAGRALQGVGAALIAPSALTLLMTLFGARPQELGKAF 145
G ++ GS+ + S + + R +QG GAA AL +++ + GKAF
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFP--ALVMVVVARYIPKENRGKAF 141

Query: 146 ALYGAAAPAGGTAGVFLGGVITQYASWPWVFYINIPVAVIVLAATP-AVMPSGAGRRGSI 204
L G+ G G +GG+I Y W + + IP+ I+ ++ +G
Sbjct: 142 GLIGSIVAMGEGVGPAIGGMIAHYIHWS--YLLLIPMITIITVPFLMKLLKKEVRIKGHF 199

Query: 205 DLAGAVTVTAGLAAAVYAIVRAPETGWGSAETWLVLLGGVALIAAFVAIQSKRREPLMRL 264
D+ G + ++ G+ + + L+ V FV K +P +
Sbjct: 200 DIKGIILMSVGIVFFMLFTTSY---------SISFLIVSVLSFLIFVKHIRKVTDPFVDP 250

Query: 265 GIWR-APNLAGANIAQLLMA--AAWIPMWFFLNLYLQQVLGLDAFASGSA-LLPMTVAIM 320
G+ + P + G ++ A ++ M + ++ V L GS + P T++++
Sbjct: 251 GLGKNIPFMIGVLCGGIIFGTVAGFVSM---VPYMMKDVHQLSTAEIGSVIIFPGTMSVI 307

Query: 321 IMMIVLAPRLISRFGPKPLIVLGLLALGAGMFWLSLARPDGNFWVDVLPASLLSAVGMSL 380
I + L+ R GP ++ +G+ L S + W + ++ V L
Sbjct: 308 IFGYI-GGILVDRRGPLYVLNIGVTFLSVSFLTASFL-LETTSWFMTI---IIVFVLGGL 362

Query: 381 AFIPSLGT--ALSSARPEEGGLASGIVNTSYQVGSALGLAAMTAL 423
+F ++ + SS + +E G ++N + + G+A + L
Sbjct: 363 SFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGL 407


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_01250  HTHTETR       28     0.033    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 28.4 bits (63), Expect = 0.033
Identities = 11/66 (16%), Positives = 23/66 (34%), Gaps = 8/66 (12%)

Query: 28 RPFVAVTIREVASAAGVSVSTVSRAFTAPDQV--------QPTTRQRILDAATKLGYSPN 79
+ + ++ E+A AAGV+ + F + + + L+ K P
Sbjct: 27 QGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELEYQAKFPGDPL 86

Query: 80 PAARSL 85
R +
Sbjct: 87 SVLREI 92


6     C5746_01335  C5746_01405  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_01335  2       9        0.174652   hypothetical protein
C5746_01340  2       9        -0.138407  baseplate protein
C5746_01345  0       9        -1.385077  putative baseplate assembly protein
C5746_01350  2       10       -0.294474  phage tail protein I
C5746_01355  0       13       -0.588459  zinc ribbon domain-containing protein
C5746_01360  0       15       -0.407666  hypothetical protein
C5746_01365  -1      13       0.905341   AraC family transcriptional regulator
C5746_01375  0       12       0.946881   type I glyceraldehyde-3-phosphate dehydrogenase
C5746_01385  1       13       1.953000   TetR family transcriptional regulator
C5746_01395  0       15       3.145931   lipase
C5746_01400  0       16       3.025408   hypothetical protein
C5746_01405  2       17       3.235545   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_01415  PF03544       44     4e-07    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 43.8 bits (103), Expect = 4e-07
Identities = 23/101 (22%), Positives = 31/101 (30%)

Query: 74 LPDPPVEPTRSHTPEPAGPPPEEPAPAPEPPPRPAQPAAVQPEPPAQPAPVRPEPPAPDA 133
LP P + + PP+ P PEP P PEPP + V +P
Sbjct: 43 LPAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPK 102

Query: 134 PVAAVQPAKPVARRPVVRPVTADEEPDGPACPACGTPNLPG 174
P +R V + P PA T +
Sbjct: 103 PKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTAT 143



Score = 41.9 bits (98), Expect = 2e-06
Identities = 22/117 (18%), Positives = 36/117 (30%), Gaps = 2/117 (1%)

Query: 36 PVAPEPLGPSEQAGPAAVPAVAAAPETEGPASGAPPSPLPDPPVEPTRSHTPEPAGPPPE 95
P++ + P++ P AV P P P + PV + P P
Sbjct: 49 PISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPV 108

Query: 96 EPAPAPEPPPRPAQPAAVQPEPPAQPAPVRPEPPAPDAPVAAVQPAKPVARRPVVRP 152
+ P+ +P + P PA RP A + + R + R
Sbjct: 109 KKVEQPKRDVKPVESRPASPFENTAPA--RPTSSTATAATSKPVTSVASGPRALSRN 163


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_01455  HTHTETR       41     9e-07    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 40.8 bits (95), Expect = 9e-07
Identities = 23/182 (12%), Positives = 52/182 (28%), Gaps = 17/182 (9%)

Query: 10 TTERLVRAGAELADEIGFAQTTPAELARRFGVKTASLYSHVKNAHDLKTKIALLALEELA 69
T + ++ L + G + T+ E+A+ GV ++Y H K+ DL ++I L+ +
Sbjct: 12 TRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIG 71

Query: 70 DQVSTAVAGRAGKDALTAFANVYR--DYALEHPGRFAAAQFRLDPQTAAASAGVRHAQMT 127
+ A G + + + R + V
Sbjct: 72 ELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQR 131

Query: 128 RAILRGYNLAEPHQTHAVRLLGSVFSGYVGLETAGGFSHSTPDSQESWTEILNALDALLR 187
L Y+ + + ++ + + + L+
Sbjct: 132 NLCLESYDR---------------IEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLME 176

Query: 188 TW 189
W
Sbjct: 177 NW 178


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_01470  V8PROTEASE    33     0.001    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 33.1 bits (75), Expect = 0.001
Identities = 32/203 (15%), Positives = 62/203 (30%), Gaps = 34/203 (16%)

Query: 46 HPDRTP-ASVTATSFAGTVALS--------NCSGSLVRMPNSQATDPGLVMTNGHCLETG 96
+ DR T +A + SG +V ++TN H ++
Sbjct: 73 NNDRHQITDTTNGHYAPVTYIQVEAPTGTFIASGVVV--------GKDTLLTNKHVVD-- 122

Query: 97 FPSAGQVITNQSSTRSFTLLNASAGTAGTLRANKVVYATMTDTDVTLYQLTKTYAQIQQS 156
+ + ++F + + + D+ + + +
Sbjct: 123 -----ATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGEGDLAIVKFSPNEQNKHIG 177

Query: 157 TGIAPLTLSAAHPVQGHAIDVVSGYWKRVYSCFVDGFAYRLKE--GQWTWKDAVRYTPEC 214
+ P T+S Q + V+GY D + E G+ T+ +
Sbjct: 178 EVVKPATMSNNAETQVNQNITVTGY-------PGDKPVATMWESKGKITYLKGEAMQYDL 230

Query: 215 QVIGGTSGSPVIDTTTGQVTAIN 237
GG SGSPV + +V I+
Sbjct: 231 STTGGNSGSPVFN-EKNEVIGIH 252


7     C5746_01855  C5746_01895  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_01855  2       16       -1.980282  hypothetical protein
C5746_01860  2       17       -2.313825  cytochrome P450
C5746_01865  2       16       -1.870543  cell division protein FtsW
C5746_01870  1       17       -3.682834  hypothetical protein
C5746_01875  0       17       -4.165618  hypothetical protein
C5746_01880  -1      17       -4.078401  phosphotyrosine protein phosphatase
C5746_01885  -2      16       -4.083752  transcriptional regulator
C5746_01890  -1      15       -2.828311  arsenical-resistance protein
C5746_01895  -2      15       -3.054358  nicotinamidase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_01895  YERSSTKINASE  36     6e-04    Yersinia serine/threonine protein kinase signature.
		>YERSSTKINASE#Yersinia serine/threonine protein kinase signature.

Length = 732

Score = 36.3 bits (83), Expect = 6e-04
Identities = 41/159 (25%), Positives = 68/159 (42%), Gaps = 24/159 (15%)

Query: 637 RHPNISRT--IAIVEAA--PEWALIMELLDGPALSQVLE------ENGQLPPASVVQMGR 686
+HPN++ +A+V E AL+M+ +DG S L + G++ + +
Sbjct: 189 KHPNLANVHGMAVVPYGNRKEEALLMDEVDGWRCSDTLRTLADSWKQGKINSEAYWGTIK 248

Query: 687 CLGEALL----HLHSLNLCRIDLKPSNIIAHPIRG-PVVVDLGIARWTGVDENDRDTAVG 741
+ LL HL + D+KP N++ G PVV+DLG+ +G
Sbjct: 249 FIAHRLLDVTNHLAKAGVVHNDIKPGNVVFDRASGEPVVIDLGLHSRSGEQPKGF----- 303

Query: 742 SMVGTPAYMAPEQLRSATEADIRSDLYALGVVLYQCITG 780
T ++ APE A +SD++ + L CI G
Sbjct: 304 ----TESFKAPELGVGNLGASEKSDVFLVVSTLLHCIEG 338


8     C5746_01955  C5746_02015  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_01955  2       13       0.721009   hypothetical protein
C5746_01960  3       13       1.595990   sporulation domain-containing protein
C5746_01965  2       12       2.180564   RNA polymerase subunit sigma-70
C5746_01970  3       12       2.088092   oxidoreductase
C5746_01975  3       13       2.635871   LysR family transcriptional regulator
C5746_01980  2       12       2.640135   hypothetical protein
C5746_01985  -1      10       2.407145   hypothetical protein
C5746_01990  -2      10       2.435163   DNA topoisomerase (ATP-hydrolyzing) subunit B
C5746_01995  -2      10       1.833529   hypothetical protein
C5746_02000  -1      11       1.158197   PAS sensor protein
C5746_02005  1       10       1.008196   hypothetical protein
C5746_02015  2       17       -0.500409  MFS transporter
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_02075  TCRTETA       42     4e-06    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 41.7 bits (98), Expect = 4e-06
Identities = 62/308 (20%), Positives = 106/308 (34%), Gaps = 30/308 (9%)

Query: 45 VLP-LFAVLTLNAD-AGRLGVLRAVGQAPILLLSLFVGAWVDRWRARTVMVLTDVGRTLA 102
VLP L L + D G+L A+ + +GA DR+ R V++++ G +
Sbjct: 27 VLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVD 86

Query: 103 LGATAVAGLLGWLGLPALL---VVAFAVGALSVFFDVAYQTSLVRLMKRDQLVRGNSALE 159
A A L L + ++ A A + D+ R +
Sbjct: 87 YAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADITDGDERARHFGF---------MS 137

Query: 160 GSRSAAQIGGPALGGALVSL-LSAP--IAAASSALFFALSFLSIRRIRRIESIPECSERP 216
+ GP LGG + AP AAA + L F + + E P E
Sbjct: 138 ACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREAL 197

Query: 217 PGVWRRIHEGLCFVVSDTSLRTVCLASAAFQFSFAAMMTVYLLFLPRELHLSGTAVGLAL 276
+ + T + + Q ++++F H T +G++L
Sbjct: 198 NPL-----ASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISL 252

Query: 277 AATGP-GALLGSMLAASLPSRFGH-GAVLVSAAALGDGVFLCVPALHGSSAVTVPALLLV 334
AA G +L +M+ + +R G A+++ A G G L A G +
Sbjct: 253 AAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRG------WMAFPI 306

Query: 335 SFVFGTGG 342
+ +GG
Sbjct: 307 MVLLASGG 314


9     C5746_02070  C5746_02140  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_02070  2       27       -2.864195  adenylate kinase
C5746_02075  0       29       -2.325293  MerR family transcriptional regulator
C5746_02080  -2      19       0.181744   molybdate ABC transporter substrate-binding
C5746_02085  -1      19       0.516913   molybdate ABC transporter permease subunit
C5746_02090  -1      18       0.438272   S-adenosyl methyltransferase
C5746_02095  1       17       1.427120   hypothetical protein
C5746_02100  1       14       1.421755   MFS transporter
C5746_02110  2       18       -2.536697  alkaline phosphatase
C5746_02115  2       16       -2.237951  esterase
C5746_02120  3       15       -3.140754  DNA-binding response regulator
C5746_02125  4       17       -3.918570  IS5/IS1182 family transposase
C5746_02130  2       17       -3.668420  hypothetical protein
C5746_02135  2       16       -3.387316  hypothetical protein
C5746_02140  0       13       -3.324777  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_02220  HTHFIS        51     5e-10    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 51.4 bits (123), Expect = 5e-10
Identities = 23/112 (20%), Positives = 44/112 (39%), Gaps = 3/112 (2%)

Query: 2 IADDDEVTRSGLRTLLAAQPGISVVGEAADGVEAVEQARRLRADVVLMDVRMPRRNGIEA 61
+ADDD R+ L L+ + G V ++ D+V+ DV MP N +
Sbjct: 8 VADDDAAIRTVLNQALS-RAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 62 TRQLLAESAEPPKVVVITTFENDGYVTAALSAGASGFVLKRLPVRQIAEAVR 113
++ + P V+V++ A GA ++ K + ++ +
Sbjct: 66 LPRIKKARPDLP-VLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116


10    C5746_02350  C5746_02540  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_02350  -2      23       -3.911405  amino acid permease
C5746_02355  -2      23       -4.493477  protein phosphatase
C5746_02360  -2      21       -3.324777  hypothetical protein
C5746_02365  0       22       -2.912619  hypothetical protein
C5746_02370  0       20       -2.977621  hypothetical protein
C5746_02375  2       21       -3.030029  peptidoglycan-binding protein
C5746_02385  2       24       -3.245695  hypothetical protein
C5746_02390  3       28       -4.187284  hypothetical protein
C5746_02395  3       29       -5.518233  hypothetical protein
C5746_02405  -2      10       -3.030751  hypothetical protein
C5746_02410  -3      10       -1.258290  hypothetical protein
C5746_02415  -1      9        -2.879924  IS30 family transposase
C5746_02425  -1      8        -3.288053  hypothetical protein
C5746_02430  1       10       -2.757528  hypothetical protein
C5746_02435  0       9        -3.079792  hypothetical protein
C5746_02440  0       9        -3.105169  hypothetical protein
C5746_02450  -1      8        -1.328893  hypothetical protein
C5746_02455  0       9        0.102014   terminase
C5746_02460  -1      11       -0.346959  hypothetical protein
C5746_02465  0       20       -2.000790  GTP cyclohydrolase I FolE
C5746_02470  1       22       -1.961584  7-carboxy-7-deazaguanine synthase QueE
C5746_02480  2       21       -2.364430  IS21 family transposase
C5746_02485  1       34       -2.934072  ATP-binding protein
C5746_02490  1       36       -3.179081  TetR/AcrR family transcriptional regulator
C5746_02495  1       31       -2.379613  SAM-dependent methyltransferase
C5746_02505  0       29       -2.221700  hypothetical protein
C5746_02515  0       18       -2.184603  hypothetical protein
C5746_02520  0       17       -1.914016  hypothetical protein
C5746_02525  1       18       -0.619059  hypothetical protein
C5746_02530  1       20       -0.466001  hypothetical protein
C5746_02540  2       17       0.058511   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_02485  PHPHTRNFRASE  28     0.005    Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 28.2 bits (63), Expect = 0.005
Identities = 10/52 (19%), Positives = 20/52 (38%)

Query: 31 SEITQQEYEEAKAALDAEQAEHVAAIDAEAQERARQDYEALIAAGIPPETAA 82
+E + YEE +AA + ++ E + + + E G P +
Sbjct: 233 TEEEVKAYEEKRAAFEKQKQEWAKLVGEPSTTKDGAHVELAANIGTPKDVDG 284


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_02495  cloacin       34     3e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 34.3 bits (78), Expect = 3e-04
Identities = 33/116 (28%), Positives = 40/116 (34%), Gaps = 19/116 (16%)

Query: 107 GAGGGGNNAGSAGEASSFGGTVTAIGGNGGG-DGMTTGTTNTTSSGTSAPLAGKGQITMG 165
G G G+N G+ + + G T +G GG DG + N G S G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGS-----------G 51

Query: 166 GGPGEGAIRMGMRGLSGRGGDSHLGFGGYGRSTSGPGGASRGRGGGGGGAFSSDGA 221
G G G SG G G G G T G A G A S+ GA
Sbjct: 52 SGIHWG-------GGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGA 100


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_02520  HTHTETR       28     0.043    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 28.4 bits (63), Expect = 0.043
Identities = 16/78 (20%), Positives = 28/78 (35%), Gaps = 1/78 (1%)

Query: 94 QGLGVREIARRLGRDPSTISRELRRNAATRGGQLDYRAS-IAQWKAEPAARRPKTPKLVA 152
+ EIA+ G I + + + S I + + E A+ P P V
Sbjct: 30 SSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELEYQAKFPGDPLSVL 89

Query: 153 DERLREYVQDRLAGDVRR 170
E L ++ + + RR
Sbjct: 90 REILIHVLESTVTEERRR 107


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_02580  HTHTETR       78     6e-20    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 78.1 bits (192), Expect = 6e-20
Identities = 36/178 (20%), Positives = 62/178 (34%), Gaps = 14/178 (7%)

Query: 15 PDKRAERSRRTREKIVAAARELFVAQGYGATSLQEVADRAGVAVQTVYFVFRNKRALFKD 74
K + ++ TR+ I+ A LF QG +TSL E+A AGV +Y+ F++K LF +
Sbjct: 2 ARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61

Query: 75 VVDTS-------IAGDVEPVTTMDRDWFRAACAEPTAAGQLRAHVRGVRDILSRVAPIMP 127
+ + S R + R + +I+ +
Sbjct: 62 IWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVG 121

Query: 128 LIAAAAATDPEIAAQWPAGPDPRYSVQHAAAKALTSKPDARPDVDTARAADLLYGLLS 185
+A + + Y K D+ T RAA ++ G +S
Sbjct: 122 EMAVVQQAQRNLCLES-------YDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYIS 172


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_02600  PF05272       30     0.012    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.4 bits (68), Expect = 0.012
Identities = 19/60 (31%), Positives = 24/60 (40%)

Query: 238 QGGPAVSGQTVSAPTPAPVPAVPVSAPAPVPPVSPAAASQEPAVPDPLAKPDHVGPDPVG 297
+G +V+G + AP AP P P P P P V VP+ P P P G
Sbjct: 97 EGLESVAGIVMGAPAGAPAPKPPRPEPPPRPVVEKECWETIQPVPEHAVPPSFWHPAPKG 156


11    C5746_02605  C5746_02690  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_02605  0       33       -3.498993  hypothetical protein
C5746_02610  0       19       -2.161987  hypothetical protein
C5746_02615  0       22       -2.909495  IS630 family transposase
C5746_02625  -1      24       -2.170432  hypothetical protein
C5746_02630  -1      21       -2.001154  hypothetical protein
C5746_02640  0       23       -1.699669  HIT family protein
C5746_02645  -1      20       -1.639555  hypothetical protein
C5746_02655  2       13       -3.072634  hypothetical protein
C5746_02665  3       13       -3.499297  penicillin-binding protein
C5746_02670  2       13       -3.521600  hypothetical protein
C5746_02675  3       13       -4.230449  ATPase
C5746_02680  0       14       -4.223604  VWA containing CoxE family protein
C5746_02685  1       21       -4.152094  hypothetical protein
C5746_02690  1       15       -4.693513  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_02675  INTIMIN       35     8e-04    Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 35.0 bits (80), Expect = 8e-04
Identities = 47/311 (15%), Positives = 94/311 (30%), Gaps = 21/311 (6%)

Query: 89 GANPNSIAVTPDNAHVYVSNRGSNTVSVIDTTTNSVSTTITGFTGGLQGIAIAPDGLHAY 148
G+N + + + SN T++V+ +T FT + DG A
Sbjct: 521 GSNVYKVTARAYDRNGNSSNNVLLTITVLSNGQVVDQVGVTDFTA--DKTSAKADGTEA- 577

Query: 149 VITFSGLVYVIDTTTNTIVGSPITLAGGADPRFLAITPDSKFVY---VTELGLGQVQVID 205
IT++ V + S ++G A + + + GQV V
Sbjct: 578 -ITYTATVKKNGVAQANVPVSFNIVSGTAVLSANSANTNGSGKATVTLKSDKPGQVVVSA 636

Query: 206 TTSN---SITTTITGFTQPTGIAITPDGLDAYVADFGTNTVSVIDTTTNTIVGSPINVGL 262
T+ ++ F T +IT D A ++ T P++
Sbjct: 637 KTAEMTSALNANAVIFVDQTKASITEIKADKTTAV-ANGQDAITYTVKVMKGDKPVSNQE 695

Query: 263 APYGLAITPDGTQVYATNVNDGTVTAIDTSTNTPTATITIGAGSSNYKLAVTPDSTRVYV 322
+ + +G TST + ++ + P+
Sbjct: 696 VTFTTTLGKLS-NSTEKTDTNGYAKVTLTSTTPGKSLVSARVSDVAVDVK-APEVEFFTT 753

Query: 323 TDFTTSSVTVIDTTTNTVSTTITGITNPFGDAIAAN--------ASASLSIGKSHSGHFT 374
++ ++ T T+ + A+ +++ + SG T
Sbjct: 754 LTIDDGNIEIVGTGVKGKLPTVWLQYGQVNLKASGGNGKYTWRSANPAIASVDASSGQVT 813

Query: 375 QGQRGTYTITV 385
++GT TI+V
Sbjct: 814 LKEKGTTTISV 824


12    C5746_02740  C5746_02835  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_02740  6       14       -1.118316  GPP34 family phosphoprotein
C5746_02745  5       14       -1.438812  DUF1348 domain-containing protein
C5746_02750  5       16       -2.163487  serine/arginine repetitive matrix protein 1
C5746_02755  5       18       -3.608063  PadR family transcriptional regulator
C5746_02760  3       21       -4.104744  hypothetical protein
C5746_02765  1       23       -3.822821  Mini-circle protein
C5746_02770  1       24       -3.509757  hypothetical protein
C5746_02775  2       26       -3.620366  50S ribosomal protein L31
C5746_02785  0       21       -3.285492  50S ribosomal protein L33
C5746_02790  0       17       -3.505302  phosphatase
C5746_02795  1       16       -3.032174  hypothetical protein
C5746_02800  -1      18       -4.136040  phosphotransferase
C5746_02810  0       23       -2.987857  glycosyl hydrolase
C5746_02815  -1      23       -2.725008  sugar phosphate isomerase
C5746_02825  1       20       -2.122410  ABC transporter substrate-binding protein
C5746_02830  1       18       -1.159709  ABC transporter permease
C5746_02835  2       18       -0.154435  ABC transporter permease
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_02865  MICOLLPTASE   65     2e-12    Microbial collagenase metalloprotease (M9) signature.
		>MICOLLPTASE#Microbial collagenase metalloprotease (M9) signature.

Length = 1104

Score = 64.7 bits (157), Expect = 2e-12
Identities = 28/101 (27%), Positives = 44/101 (43%), Gaps = 5/101 (4%)

Query: 721 PVAKATASATDGPVPLKVDFSSKGSNDPDGDALSYAWDFDGDGTYDSTEADPSHTYTAKG 780
P A + ++ V +++F S D DG+ +Y WDF GDG S EA +H Y G
Sbjct: 775 PKAVIKSDSS-VIVEEEINFDGTESKDEDGEIKAYEWDF-GDGE-KSNEAKATHKYNKTG 831

Query: 781 DVVAQLKVTDSTG--KSGYANIPITAGNTTPKVTIDFPVSG 819
+ +L VTD+ G + I + + P +
Sbjct: 832 EYEVKLTVTDNNGGINTESKKIKVVEDKPVEVINESEPNND 872



Score = 42.0 bits (98), Expect = 1e-05
Identities = 46/308 (14%), Positives = 82/308 (26%), Gaps = 102/308 (33%)

Query: 638 AYFEGASFFYEWSRNYVKEVRFDQDQKLLKINDFLSS---------------QKFNKPMD 682
AYF + + + NYV +V F + ++ ++ N
Sbjct: 739 AYF--VNHKVDGNGNYVYDVVFHGMNTDTNTDVHVNKEPKAVIKSDSSVIVEEEINFDGT 796

Query: 683 MTFGPDGSLYVLEWGSSFGGGN-----------NDSGLYRIDYA----QGKRIPVAKATA 727
+ DG + EW FG G N +G Y + G +K
Sbjct: 797 ESKDEDGEIKAYEW--DFGDGEKSNEAKATHKYNKTGEYEVKLTVTDNNGGINTESKKIK 854

Query: 728 SATDGPVPL-----------------KVDFSSKGSNDPDGDALSYAWDF----------- 759
D PV + K + KG+ + + Y +D
Sbjct: 855 VVEDKPVEVINESEPNNDFEKANQIAKSNMLVKGTLSEEDYSDKYYFDVAKKGNVKITLN 914

Query: 760 -------------DGDGTYDSTEADPSHTYTAKGDVVAQ-----LKVTDSTGKSGYANIP 801
+GD A + KG+ + L V +SG +
Sbjct: 915 NLNSVGITWTLYKEGDLNNYVLYATGNDGTVLKGEKTLEPGRYYLSVYTYDNQSGTYTVN 974

Query: 802 ITAG----------------------NTTPKVTIDFPVSGKLISFGDKIPYKVTVTDPED 839
+ + KV + + G L + K Y + + +P D
Sbjct: 975 VKGNLKNEVKETAKDAIKEVENNNDFDKAMKVDSNSKIVGTLSNDDLKDIYSIDIQNPSD 1034

Query: 840 GPVDCSKV 847
+ +
Sbjct: 1035 LNIVVENL 1042
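
The "Score", "Expect" and "Identities" lines in the alignment blocks of this report can be pulled out programmatically. The following is a minimal sketch, assuming the lines keep exactly the layout shown above; the names `parse_hsp_stats`, `SCORE_RE` and `IDENT_RE` are hypothetical helpers for illustration and are not part of PredictBias or BLAST.

```python
import re

# Regular expressions for the per-alignment statistics lines as they
# appear in this report (assumed layout; hypothetical helper names).
SCORE_RE = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)"
)
IDENT_RE = re.compile(r"Identities\s*=\s*(\d+)/(\d+)")


def parse_hsp_stats(text):
    """Return one dict per alignment (HSP) found in a block of report text."""
    hsps = []
    for line in text.splitlines():
        m = SCORE_RE.search(line)
        if m:
            # A new "Score = ..." line starts a new alignment record.
            hsps.append({
                "bits": float(m.group(1)),
                "raw": int(m.group(2)),
                "evalue": float(m.group(3)),
            })
            continue
        m = IDENT_RE.search(line)
        if m and hsps:
            # Attach the identity fraction to the most recent alignment.
            matches, length = int(m.group(1)), int(m.group(2))
            hsps[-1]["identity"] = matches / length
    return hsps


# Statistics lines copied from the two MICOLLPTASE alignments above.
sample = """Score = 64.7 bits (157), Expect = 2e-12
Identities = 28/101 (27%), Positives = 44/101 (43%), Gaps = 5/101 (4%)
Score = 42.0 bits (98), Expect = 1e-05
Identities = 46/308 (14%), Positives = 82/308 (26%), Gaps = 102/308 (33%)"""

hsps = parse_hsp_stats(sample)
```

Each returned dictionary holds the bit score, raw score, E-value and identity fraction of one alignment, so hits could be filtered against the E-value threshold listed in the input parameters.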


13    C5746_02910    C5746_02945    Y    N    N    Genomic Island
LocusTag    DNBias    CDNBias    %GCBias    Product
C5746_02910217-0.840865GntR family transcriptional regulator
C5746_02915121-1.873646hypothetical protein
C5746_02920320-1.550053hypothetical protein
C5746_02925321-1.561925DUF397 domain-containing protein
C5746_02930424-2.093152acetyl-CoA acetyltransferase
C5746_02935324-1.721042lipid-transfer protein
C5746_02940323-2.051835DNA-binding protein
C5746_02945425-1.580538enoyl-CoA hydratase
14    C5746_03460    C5746_03495    Y    Y    N    Pathogenicity Island (biased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
C5746_034602122.972065MBL fold metallo-hydrolase
C5746_034651132.743432hypothetical protein
C5746_034701171.713848PucR family transcriptional regulator
C5746_034754171.118970carboxylesterase
C5746_034805150.416375acetylxylan esterase
C5746_03485313-0.327502dihydroxyacetone kinase subunit DhaK
C5746_03490312-0.724080dihydroxyacetone kinase subunit L
C5746_034952120.191706PTS fructose transporter subunit IIA
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
C5746_03545    PYOCINKILLER    32    0.001    Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 32.1 bits (72), Expect = 0.001
Identities = 28/106 (26%), Positives = 42/106 (39%), Gaps = 6/106 (5%)

Query: 11 MTAAADSVDREANHLTELDSAIGDADHGSNLHRGFAAVRAALDKELPQTPGAVLMLAGRQ 70
+TAA S++ A + +A R AA+RAA +P V AGR
Sbjct: 207 LTAAKASIEAAAANKAREQAAAEAKRKAEEQARQQAAIRAANTYAMPANGSVVATAAGRG 266

Query: 71 LIATVGGASGPLYGTLLRRTGKALGDAPRVARQQLAEALGVGVAAV 116
LI GA+ +L + A+ RV + VG A++
Sbjct: 267 LIQVAQGAA-----SLAQAISDAIAVLGRVLASA-PSVMAVGFASL 306


15    C5746_04075    C5746_04240    Y    Y    N    Pathogenicity Island (biased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
C5746_04075219-1.804217hypothetical protein
C5746_04080215-1.277676hypothetical protein
C5746_04085113-0.905553peptide synthetase
C5746_04095112-0.876566non-ribosomal peptide synthetase
C5746_04100015-1.184797hypothetical protein
C5746_04105019-1.1536122,3-diaminopropionate biosynthesis protein SbnB
C5746_04110-122-1.3758902,3-diaminopropionate biosynthesis protein SbnA
C5746_04115-216-2.450304hypothetical protein
C5746_04120-216-2.529451CBS domain-containing protein
C5746_04130016-3.081184oxidoreductase
C5746_04135-214-1.734236Xaa-Pro aminopeptidase
C5746_04140-215-1.796144transcriptional regulator
C5746_04145-121-1.678687winged helix DNA-binding domain-containing
C5746_04150-224-0.446598helicase
C5746_04155-216-3.073442hypothetical protein
C5746_04160-39-2.406237hypothetical protein
C5746_04165-29-2.725903ricin-type beta-trefoil lectin domain protein
C5746_04170-29-2.578002beta-galactosidase
C5746_04175-29-2.551554sugar ABC transporter substrate-binding protein
C5746_04180-210-2.947337ABC transporter permease
C5746_04185-19-2.130333sugar ABC transporter permease
C5746_04190-112-3.029419LacI family transcriptional regulator
C5746_04195-110-2.217551hypothetical protein
C5746_04200-211-2.200517sporulation protein
C5746_04210-170.128419Sec-independent protein translocase TatA
C5746_04215-170.736064hypothetical protein
C5746_04220081.112741PadR family transcriptional regulator
C5746_04230-270.653373hypothetical protein
C5746_04235010-0.437274hypothetical protein
C5746_04240212-1.186464ABC transporter
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
C5746_04260    MALTOSEBP    38    8e-05    Maltose binding protein signature.
		>MALTOSEBP#Maltose binding protein signature.

Length = 396

Score = 37.8 bits (87), Expect = 8e-05
Identities = 59/247 (23%), Positives = 100/247 (40%), Gaps = 20/247 (8%)

Query: 97 ISAKKGVPDVAQIEYYALSQYALTKSVADLKPYGAEKLADSYSPGPWNSVQAGDGIYGLP 156
++A PD+ + YA + +A++ P A D P W++V+ + P
Sbjct: 76 VAATGDGPDIIFWAHDRFGGYAQSGLLAEITPDKA--FQDKLYPFTWDAVRYNGKLIAYP 133

Query: 157 MDSGPMALFYNKRVFDKYEIAVPTTWDEYVEAARKLHKADPKAYITSDLGDAGLTTSLLW 216
+ ++L YNK + P TW+E + A K KA K+ + +L + T L+
Sbjct: 134 IAVEALSLIYNKDLLPN----PPKTWEE-IPALDKELKAKGKSALMFNLQEPYFTWPLIA 188

Query: 217 QAGSHPYKV-----DGAEVGIDFTDAGATKYTETWQKLIDEKLVSPIAGWSDDWYKGLGD 271
G + +K D +VG+D +AGA LI K ++ +S
Sbjct: 189 ADGGYAFKYENGKYDIKDVGVD--NAGAKAGLTFLVDLIKNKHMNADTDYSIA-EAAFNK 245

Query: 272 GTIATLPSGAWMPANFASGVKGASGDWRAAPLPQWTKGATGSSENGGSSLALPELGKNRE 331
G A +G W +N + + ++ LP + KG G S + N+E
Sbjct: 246 GETAMTINGPWAWSN----IDTSKVNYGVTVLPTF-KGQPSKPFVGVLSAGINAASPNKE 300

Query: 332 LAYAFVE 338
LA F+E
Sbjct: 301 LAKEFLE 307


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
C5746_04290    cloacin    34    7e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 34.3 bits (78), Expect = 7e-04
Identities = 25/78 (32%), Positives = 33/78 (42%), Gaps = 16/78 (20%)

Query: 261 PAGSSYGHGDPYAGNHSHGQGHGHYEERHHDGGHRSGPGMGTAVAA------------GA 308
P G G G + G HG G G+ + GG G +AVAA GA
Sbjct: 45 PWGGGSGSGIHWGGGSGHGNGGGN----GNSGGGSGTGGNLSAVAAPVAFGFPALSTPGA 100

Query: 309 AGLAVGVVGGMVAAEVVD 326
GLAV + G ++A + D
Sbjct: 101 GGLAVSISAGALSAAIAD 118


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
C5746_04295    TATBPROTEIN    37    3e-06    Bacterial sec-independent translocation TatB protein...
		>TATBPROTEIN#Bacterial sec-independent translocation TatB protein signature.

Length = 171

Score = 36.5 bits (84), Expect = 3e-06
Identities = 15/48 (31%), Positives = 28/48 (58%), Gaps = 2/48 (4%)

Query: 1 MFGI--SEIAIILIVVILVLGAKRLPDLARSAGKSARILKAEAKAMKE 46
MF I SE+ ++ I+ ++VLG +RLP ++ R L++ A ++
Sbjct: 1 MFDIGFSELLLVFIIGLVVLGPQRLPVAVKTVAGWIRALRSLATTVQN 48


16    C5746_04285    C5746_04455    Y    Y    Y    Pathogenicity Island (biased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
C5746_04285119-3.273711hypothetical protein
C5746_04290218-4.175684hypothetical protein
C5746_04295319-4.641557hypothetical protein
C5746_04300117-4.357641hypothetical protein
C5746_04305117-3.495718hydrolase
C5746_04310214-2.181418hypothetical protein
C5746_04315114-1.035332hypothetical protein
C5746_04320013-0.196723hypothetical protein
C5746_04325-1120.162787hypothetical protein
C5746_04330-2120.748334hypothetical protein
C5746_04335-1120.926923LacI family transcriptional regulator
C5746_04340-1170.660394ribokinase
C5746_04350017-1.482231nucleoside hydrolase
C5746_04355222-2.639419adenosine deaminase
C5746_04365221-4.095986hypothetical protein
C5746_04375-112-3.324777IS6 family transposase
C5746_04380-214-3.244262N-acetyltransferase
C5746_04385-113-2.676742IS1380 family transposase
C5746_04390014-2.334340alpha/beta hydrolase
C5746_044003110.210886transposase
C5746_04410011-0.369695IS6 family transposase
C5746_04415-111-0.357126hypothetical protein
C5746_04425-110-0.469385peptidoglycan-binding protein
C5746_04435216-0.301812hypothetical protein
C5746_04440220-0.491210hypothetical protein
C5746_04450222-0.678274histidinol phosphatase
C5746_04455218-0.965922acyl-CoA thioesterase
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
C5746_04410    YERSSTKINASE    25    0.040    Yersinia serine/threonine protein kinase signature.
		>YERSSTKINASE#Yersinia serine/threonine protein kinase signature.

Length = 732

Score = 24.7 bits (53), Expect = 0.040
Identities = 10/34 (29%), Positives = 20/34 (58%)

Query: 7 VQNRLSPTPDATPQAHDARLAHYLADAVDDDRDA 40
+ + L + D+ P +++ARL +L+D D+ A
Sbjct: 385 ITDILGVSADSRPDSNEARLHEFLSDGTIDEESA 418


17    C5746_04500    C5746_04545    Y    N    Y    Genomic Island
LocusTag    DNBias    CDNBias    %GCBias    Product
C5746_04500213-1.955971DDE endonuclease
C5746_04505313-1.868159transposase
C5746_04510313-1.977745hypothetical protein
C5746_04515213-2.149295hypothetical protein
C5746_04520218-1.385702hypothetical protein
C5746_04525119-1.265813iron-containing alcohol dehydrogenase
C5746_04530025-1.481256phosphoglycerate kinase
C5746_04535130-2.538856N-acetyl-gamma-glutamyl-phosphate reductase
C5746_04540129-2.5889803-phosphoglycerate dehydrogenase
C5746_04545331-2.501197phosphoenolpyruvate phosphomutase
18    C5746_04595    C5746_04705    Y    Y    Y    Pathogenicity Island (biased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
C5746_04595324-3.3931431-phosphofructokinase
C5746_04600326-3.269681fructose-bisphosphate aldolase
C5746_04605323-2.804331beta-galactosidase
C5746_04610323-2.613228hypothetical protein
C5746_04615524-2.519254hypothetical protein
C5746_04620422-0.855642LuxR family transcriptional regulator
C5746_04625121-1.456450hypothetical protein
C5746_04630118-1.875502hypothetical protein
C5746_04635019-1.724336hypothetical protein
C5746_04640022-3.465523hypothetical protein
C5746_04645026-4.257722ROK family transcriptional regulator
C5746_04650026-5.745537sugar ABC transporter substrate-binding protein
C5746_04655127-6.144886ABC transporter permease
C5746_04660127-6.617346ABC transporter permease
C5746_04665334-8.765876hypothetical protein
C5746_04670334-7.925093beta-galactosidase
C5746_04675428-6.702309peptidase M4
C5746_04680219-4.415844transposase
C5746_04685215-3.879580alpha/beta hydrolase
C5746_04695112-1.935362AraC family transcriptional regulator
C5746_04705-213-3.065662hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
C5746_04790    SUBTILISIN    219    7e-66    Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 219 bits (560), Expect = 7e-66
Identities = 106/326 (32%), Positives = 152/326 (46%), Gaps = 38/326 (11%)

Query: 191 SASKVSRLWYDGKVQASLDKSVPQIGAPEAWAKGVDGKGVKVAVLDTGVDLNNADIKGRL 250
+V + + + V I AP W + G+GVKVAVLDTG D ++ D+K R+
Sbjct: 8 IPYQVIKQEQQ---VNEIPRGVEMIQAPAVWNQTR-GRGVKVAVLDTGCDADHPDLKARI 63

Query: 251 TATRSFVDGAA----TVQDGHGHGTHVASTIVGSGANSGGKYKGVAPGADLLVGKVLTDG 306
R+F D +D +GHGTHVA TI + +G GVAP ADLL+ KVL
Sbjct: 64 IGGRNFTDDDEGDPEIFKDYNGHGTHVAGTIAATENENGVV--GVAPEADLLIIKVLNKQ 121

Query: 307 GGGLSSWIIDGMEWAAAQGADVVNMSLGGSASSPQDAKTEVVDRLSTTTGTLFVISAGND 366
G G WII G+ +A Q D+++MSLGG E V + + L + +AGN+
Sbjct: 122 GSGQYDWIIQGIYYAIEQKVDIISMSLGGPEDV--PELHEAVKKA-VASQILVMCAAGNE 178

Query: 367 GPGA---TTVLSPGTADSALTVGAVDKSDVLARFSSRGPRVGDSAIKPEITAPGVGIVAA 423
G G + PG + ++VGA++ + FS+ V ++ APG I++
Sbjct: 179 GDGDDRTDELGYPGCYNEVISVGAINFDRHASEFSNSNNEV-------DLVAPGEDILS- 230

Query: 424 RAAGTSMGTPVDANYTAADGTSMAAPHVAGAAALLAQR-----HPDWTGRRIKAALVTHA 478
T Y GTSMA PHVAGA AL+ Q D T + A L+
Sbjct: 231 --------TVPGGKYATFSGTSMATPHVAGALALIKQLANASFERDLTEPELYAQLIKRT 282

Query: 479 KPSNAYSVYQQGNGRVDVPAALDPAL 504
P S +GNG + + A + +
Sbjct: 283 IPLG-NSPKMEGNGLLYLTAVEELSR 307


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
C5746_04840    THERMOLYSIN    261    2e-80    Thermolysin metalloprotease (M4) family signature.
		>THERMOLYSIN#Thermolysin metalloprotease (M4) family signature.

Length = 544

Score = 261 bits (668), Expect = 2e-80
Identities = 165/504 (32%), Positives = 225/504 (44%), Gaps = 61/504 (12%)

Query: 74 LGLGGQEQLLVKSVLKDADGTVHARYERTFAGLPVLGGDLVVHTAANGSAKGV--TKATN 131
LG +E+L + D G R+E+ A +G LV H +G + T N
Sbjct: 68 LGGQARERLSLIGNKLDELGHTVMRFEQAIAASLCMGAVLVAHVN-DGELSSLSGTLIPN 126

Query: 132 ATITVASTTAARSAGSAKTFAVGRAKAKGVAKARAGT-----ARKVVWAASGTPTLAWET 186
T AA S A+ A A V K R R V++ TP LA+E
Sbjct: 127 LDKRTLKTEAAISIQQAEMIAKQD-VADRVTKERPAAEEGKPTRLVIYPDEETPRLAYEV 185

Query: 187 VVGGTQADGTPSELHVITDAKSGKKLFEYQGIE---------------TGIGNSKYSGQV 231
V P + DA GK L ++ ++ G+G Q
Sbjct: 186 NV--RFLTPVPGNWIYMIDAADGKVLNKWNQMDEAKPGGAQPVAGTSTVGVGRGVLGDQK 243

Query: 232 TIGTTPAN--GGYSMTDGTRGGH-KTYDLNHGNSGTGTLFTDPDDTWGDGTTNDPQTTAV 288
I TT ++ G Y + D TRG TYD + G+L+ D D+ + AV
Sbjct: 244 YINTTYSSYYGYYYLQDNTRGSGIFTYDGRNRTVLPGSLWADGDNQF----FASYDAAAV 299

Query: 289 DAAYGAQLTWDYYKDVHGRNGIKNDGVAAYTRVHYGNNYVNAFWDDNCFCMTYGDGAG-N 347
DA Y A + +DYYK+VHGR A + VHYG Y NAFW+ + M YGDG G
Sbjct: 300 DAHYYAGVVYDYYKNVHGRLSYDGSNAAIRSTVHYGRGYNNAFWNGSQ--MVYGDGDGQT 357

Query: 348 AKPLT-SIDIAAHEMTHGVTSNTAGLIYRGESGGLNEATSDIMAAAVEFWANNASDPGDY 406
P + ID+ HE+TH VT TAGL+Y+ ESG +NEA SDI VEF+AN D+
Sbjct: 358 FLPFSGGIDVVGHELTHAVTDYTAGLVYQNESGAINEAMSDIFGTLVEFYANRNP---DW 414

Query: 407 LNGDQI---DIHGDGTPLRYMDKPSKDGKSADAWYPGIESID---VHYSSGPANHWFYLA 460
G+ I + GD LR M P+K G + D VH +SG N YL
Sbjct: 415 EIGEDIYTPGVAGDA--LRSMSDPAKYGDPDHYSKRYTGTQDNGGVHTNSGIINKAAYLL 472

Query: 461 SEGSGAKTVGGVDYDSPTSDGLPVSAIGRDAAAKIWYKALTIYMTSSTDYAGARTATLQA 520
S+G G+ V+ IGRD KI+Y+AL Y+T +++++ R A +QA
Sbjct: 473 SQGG-------------VHYGVSVTGIGRDKMGKIFYRALVYYLTPTSNFSQLRAACVQA 519

Query: 521 AADLYGLGSDTYINAANAWAAINV 544
AADLYG S + A+ A+ V
Sbjct: 520 AADLYGSTSQEVNSVKQAFNAVGV 543


19    C5746_04900    C5746_05320    Y    Y    N    Pathogenicity Island (biased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
C5746_0490019-3.266756histone protein
C5746_04905011-2.699386cyclase
C5746_04910-114-4.002743hypothetical protein
C5746_04915-111-3.213481VOC family virulence protein
C5746_04920116-2.125737gas vesicle protein
C5746_04925214-1.968845gas vesicle protein
C5746_04930215-1.693143gas vesicle protein
C5746_04940013-1.850210gas vesicle protein
C5746_04945013-1.390446gas vesicle protein
C5746_049500120.087022gas vesicle protein
C5746_04955-1110.622042gas vesicle protein
C5746_04960-191.394398gas vesicle protein K
C5746_049651142.187712CsbD family protein
C5746_04970-1131.516600histidine phosphatase family protein
C5746_049751143.469751alpha-hydroxy-acid oxidizing enzyme
C5746_049800133.051909non-ribosomal peptide synthetase
C5746_049850142.756807cytochrome P450
C5746_049951132.489914LLM class F420-dependent oxidoreductase
C5746_050002142.977562hypothetical protein
C5746_05005120-0.999196hypothetical protein
C5746_05010133-4.232689cytochrome P450
C5746_05015024-3.351285nitric oxide synthase oxygenase
C5746_05020-122-3.538741cytochrome P450
C5746_05030224-3.456130ABC transporter
C5746_05040123-2.340729daunorubicin/doxorubicin resistance ABC
C5746_05045224-1.792210aminotransferase
C5746_05055027-2.309223MbtH family protein
C5746_05065030-2.887660thioesterase
C5746_05070731-3.400388hypothetical protein
C5746_050751230-5.762137cytochrome P450
C5746_050801232-6.830174alpha/beta hydrolase
C5746_050901125-5.523491streptomycin biosynthesis protein
C5746_050951223-5.365594TetR family transcriptional regulator
C5746_05100621-5.582718transcriptional regulator
C5746_05105220-3.567378pyruvate, phosphate dikinase
C5746_05110419-3.526553transcriptional regulator
C5746_05115319-4.075716GNAT family N-acetyltransferase
C5746_05120423-4.656446hypothetical protein
C5746_05125325-3.910143hypothetical protein
C5746_05135028-4.613527hypothetical protein
C5746_05145128-4.709445sporulation protein
C5746_05150129-4.136912hypothetical protein
C5746_05155329-5.211862DeoR family transcriptional regulator
C5746_05160226-3.993048Immediate-early protein 2
C5746_05165125-1.458373DUF2867 domain-containing protein
C5746_05170-1190.790449MxaD family protein
C5746_051753280.302867cysteine synthase
C5746_051803270.177022hypothetical protein
C5746_051853270.245998hypothetical protein
C5746_051903270.177334ABC transporter substrate-binding protein
C5746_051953260.266392peptide ABC transporter ATP-binding protein
C5746_052004250.123143hypothetical protein
C5746_052051140.373566TetR family transcriptional regulator
C5746_052100151.204027hypothetical protein
C5746_05220-2161.272924MFS transporter
C5746_05225-2160.928739hypothetical protein
C5746_05235-2111.088195hypothetical protein
C5746_052454242.069734DNA-binding response regulator
C5746_052504251.621600two-component sensor histidine kinase
C5746_052604251.522436TetR family transcriptional regulator
C5746_052704231.566443deazaflavin-dependent nitroreductase
C5746_05280-271.979383hyaluronate lyase
C5746_05285-191.544727dipeptidase
C5746_05295-2131.644590ATP/GTP-binding protein
C5746_05300-2142.150924hypothetical protein
C5746_053051121.583435hypothetical protein
C5746_05310182.189662hypothetical protein
C5746_05315172.276768group II intron reverse transcriptase/maturase
C5746_05320282.071255hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
C5746_05095    IGASERPTASE    54    2e-10    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 54.3 bits (130), Expect = 2e-10
Identities = 38/231 (16%), Positives = 74/231 (32%), Gaps = 19/231 (8%)

Query: 77 RKAVAAAADRGMSSLADALSDRTARLGE---KQDEGDEEEYEPEEGGAEYEEEEEPEEEE 133
+ +A + + A A T KQ+ E+ E + + E +E +
Sbjct: 1014 NEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAK 1073

Query: 134 PEAEYEEGQAE----EEEEEEEQPEEEYEEEEEQEERQPK----------RRRSSLQPAR 179
+ E E +E Q E E ++E + K + S + P +
Sbjct: 1074 SNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQ 1133

Query: 180 SRGGAARKKAAPARGEKPRTAKKTAARKTAPPKKT--AAKKTAPAKKTAARKTAPPKKTA 237
+ + +A PAR P K +T T AK+T+ + ++
Sbjct: 1134 EQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGN 1193

Query: 238 AKKTAPAKKTAARKTAPPKKTAAKKTAPAKKSAARKTTSSKRAASKRADRR 288
+ P T A ++ K + + R + A+ ++ R
Sbjct: 1194 SVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDR 1244



Score = 52.4 bits (125), Expect = 1e-09
Identities = 31/194 (15%), Positives = 72/194 (37%), Gaps = 7/194 (3%)

Query: 96 SDRTARLGEKQDEGDEEEYEPEEGGAEYEEEEEPEEEEPEAEYEEGQAEEEEEEEEQPEE 155
++ TA+ E E + + E +E + E E+EE+ + + E+
Sbjct: 1059 TETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEK 1118

Query: 156 EYEEEEEQEERQPKRRRSSLQPARSRGGAARKKAAPARGEKPRTAKKTAARKTAPPKKTA 215
E + + PK+ +S + + + AR+ ++P++ T A P K+T+
Sbjct: 1119 TQEVPKVTSQVSPKQEQS--ETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETS 1176

Query: 216 AKKTAPAKKTAARKTAP-----PKKTAAKKTAPAKKTAARKTAPPKKTAAKKTAPAKKSA 270
+ P ++ T P+ T T P + + + + ++ P
Sbjct: 1177 SNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEP 1236

Query: 271 ARKTTSSKRAASKR 284
A +++ + +
Sbjct: 1237 ATTSSNDRSTVALC 1250



Score = 43.5 bits (102), Expect = 8e-07
Identities = 34/231 (14%), Positives = 77/231 (33%), Gaps = 10/231 (4%)

Query: 62 AELQEQLKGEVFDAGRKAVAAAADRGMSSLADALS----DRTARLGEKQDEGDEEEYEPE 117
AE +Q V + A A + + +T + + E E +
Sbjct: 1041 AENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTET 1100

Query: 118 EGGAEYEEEEEPEEEEPEAEYEEGQAEEEEEEEEQPEEEYEEEEEQEERQPKRRRSSLQP 177
+ A E+EE+ + E + + + ++EQ E + E E P +++
Sbjct: 1101 KETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPT---VNIKE 1157

Query: 178 ARSRGGAARKKAAPARGEKPRTAKKTAARKTAPPKKTAAKKTAPAKKTAARKTAPPKKTA 237
+S+ PA+ + + + ++ + P T A T P +
Sbjct: 1158 PQSQTNTTADTEQPAK--ETSSNVEQPVTESTTVNTGNSVVENPENTTPA-TTQPTVNSE 1214

Query: 238 AKKTAPAKKTAARKTAPPKKTAAKKTAPAKKSAARKTTSSKRAASKRADRR 288
+ + + ++ P A ++ + + A +S + +D R
Sbjct: 1215 SSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTNAVLSDAR 1265


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
C5746_05125    SHAPEPROTEIN    29    0.005    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 29.0 bits (65), Expect = 0.005
Identities = 13/44 (29%), Positives = 26/44 (59%)

Query: 46 RVSAPRAMRYAAEQLQELLGRAPESVSAVKPTEGGWQADVEVLE 89
R +P+++ +++LGR P +++A++P + G AD V E
Sbjct: 45 RAGSPKSVAAVGHDAKQMLGRTPGNIAAIRPMKDGVIADFFVTE 88


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
C5746_05260    SECA    25    0.035    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 25.2 bits (55), Expect = 0.035
Identities = 5/17 (29%), Positives = 9/17 (52%)

Query: 41 EAERGACLEYIDQHWTD 57
E+G L+ +D W +
Sbjct: 761 HFEKGVMLQTLDSLWKE 777


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
C5746_05290    HTHTETR    66    1e-14    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 65.8 bits (160), Expect = 1e-14
Identities = 26/88 (29%), Positives = 42/88 (47%), Gaps = 1/88 (1%)

Query: 1 MVRLTRAQQQERTRAAVLAAAKAEFTERGYAAAKVDEIAERAELTRGAVYSNFPSKRALY 60
M R T+ Q+ + TR +L A F+++G ++ + EIA+ A +TRGA+Y +F K L+
Sbjct: 1 MARKTK-QEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLF 59

Query: 61 LAVLVDMVERAAAAEHSDSAKRSGTTEQ 88
+ E AK G
Sbjct: 60 SEIWELSESNIGELELEYQAKFPGDPLS 87


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
C5746_05300    PHPHTRNFRASE    83    6e-19    Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 83.3 bits (206), Expect = 6e-19
Identities = 56/247 (22%), Positives = 94/247 (38%), Gaps = 37/247 (14%)

Query: 257 VPFGEALFGRQGEDVVSGSSLTEPLSELADREPEVWTRLLSALTRLEENYRD-ACYVEFT 315
V +A + + +S+T+ +E+ +L +AL + +E R E +
Sbjct: 14 VAIAKAFIHLEPNVDIEKTSITDVSTEIE--------KLTAALEKSKEELRAIKDQTEAS 65

Query: 316 FEAGELWILQVRRGRFVGRAAVRVAVDLADAGTIGRDEALLRVSPQH---LTHVRTPRIT 372
A + I V + + + AL VS + +
Sbjct: 66 MGADKAEIFAAHLLVLDDPELVDGIKGKIENEQMNAEYALKEVSDMFVSMFESMDNEYMK 125

Query: 373 --AIEPRDLFTRGLGASPGVAVGRVATTADSAARLAAGGPVVLLRPETSPLDMHGL--AA 428
A + RD+ R LG GV G +AT A+ V++ + +P D L
Sbjct: 126 ERAADIRDVSKRVLGHLIGVETGSLATIAE---------ETVIIAEDLTPSDTAQLNKQF 176

Query: 429 AAGVVTARGGPTSHAAVVARSMGKPAVVGAANLTVDAADGCVRAGGRTLPEGTLIALDGT 488
G T GG TSH+A+++RS+ PAVVG +T + G ++ +DG
Sbjct: 177 VKGFATDIGGRTSHSAIMSRSLEIPAVVGTKEVTEK------------IQHGDMVIVDGI 224

Query: 489 GGEVVVG 495
G V+V
Sbjct: 225 EGIVIVN 231


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
C5746_05305    CHLAMIDIAOM6    29    0.021    Chlamydia cysteine-rich outer membrane protein 6 si...
		>CHLAMIDIAOM6#Chlamydia cysteine-rich outer membrane protein 6 signature.

Length = 547

Score = 28.5 bits (63), Expect = 0.021
Identities = 14/44 (31%), Positives = 25/44 (56%)

Query: 59 EERHGRHRYVRVADRRVVELIESLAALAPQGSARPRSLSASGRQ 102
+ GR V+V D R VE+ +++ A GS P ++A+G++
Sbjct: 86 DSCFGRMYTVKVNDDRNVEITQAVPEYATVGSPYPIEITATGKR 129


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
C5746_05310    SACTRNSFRASE    29    0.007    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 28.8 bits (64), Expect = 0.007
Identities = 11/49 (22%), Positives = 17/49 (34%), Gaps = 2/49 (4%)

Query: 81 PQLRGSGLGREILRQAEDEAHARGCRTAVLYTITFQAPG--FYHKQGWK 127
R G+G +L +A + A +L T FY K +
Sbjct: 99 KDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFI 147


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
C5746_05320    PF05844    27    0.027    YopD protein
		>PF05844#YopD protein

Length = 295

Score = 27.3 bits (60), Expect = 0.027
Identities = 17/52 (32%), Positives = 21/52 (40%), Gaps = 2/52 (3%)

Query: 52 LPALPAALPALPAAPSQPNPSYGYQQPAQQGYAPMQPAQLQHAPAPYIPQQP 103
L A AA+P+ P AP S G Q A + P PA P+Q
Sbjct: 8 LAATQAAIPSEPIAPGAAGRSVGTPQAAAEL--PQVPAARADRVELNAPRQV 57


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
C5746_05330    PRTACTNFAMLY    37    3e-04    Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 37.0 bits (85), Expect = 3e-04
Identities = 31/138 (22%), Positives = 39/138 (28%), Gaps = 12/138 (8%)

Query: 419 SPAEAGFYVSAEGRGTFHGCRVTGSEGYGFHVMDGCRTTLTRCRTERCARGGYEFAEGGT 478
S A A V T G +TG G M G L R R GG
Sbjct: 214 SGAPAAVSVLGASELTLDGGHITGGRAAGVAAMQGAVVHLQRATIRRGDAPAGGAVPGGA 273

Query: 479 AHGDG-------TGAGPVVE-----DCTSDESALRSPAAPAPTVLTATQSASGLLGAVPG 526
G G GPV++ D + L AP + A + G V G
Sbjct: 274 VPGGAVPGGFGPGGFGPVLDGWYGVDVSGSSVELAQSIVEAPELGAAIRVGRGARVTVSG 333

Query: 527 PRAAEPTPATVPAAEPVR 544
+ P + R
Sbjct: 334 GSLSAPHGNVIETGGARR 351


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
C5746_05350    NUCEPIMERASE    45    5e-07    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 44.8 bits (106), Expect = 5e-07
Identities = 23/123 (18%), Positives = 44/123 (35%), Gaps = 16/123 (13%)

Query: 13 LRCLVTGASGYIGGRLVPALLDAGHRVRAL--------ARTPRKLRDHPWADRVEVVEGD 64
++ LVTGA+G+IG + LL+AGH+V + + + + + D
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 65 VSDAASVRAAMRD--MDVAYYLVHALGSGPGFEETDREAAR-VFGAQ-----ARAAGVGR 116
++D + + + H L E A + G R +
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120

Query: 117 IVY 119
++Y
Sbjct: 121 LLY 123


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
C5746_05390    HTHTETR    66    1e-15    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 65.8 bits (160), Expect = 1e-15
Identities = 28/169 (16%), Positives = 57/169 (33%), Gaps = 13/169 (7%)

Query: 4 RAMSTPDRLIEATQELLWERGYVGTSPKAIQQQAGAGQGSMYHHFAGKPDLALTAIRRTA 63
A T +++ L ++G TS I + AG +G++Y HF K DL +
Sbjct: 8 EAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSE 67

Query: 64 SQMRETARQLF-DGPGTAYERISTYLLR---------ERDVLRGCPVGRLTMDPEVIASD 113
S + E + PG + L+ R +L + E+
Sbjct: 68 SNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQ 127

Query: 114 ELRAPVDETIGWLRGRLAEIIQEGLDQGEFTRLLVPEDVAATIVATVQG 162
+ + + R+ + ++ ++ L+ A + + G
Sbjct: 128 QAQRNLCLE---SYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISG 173


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
C5746_05400    TCRTETA    38    5e-05    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 38.3 bits (89), Expect = 5e-05
Identities = 62/283 (21%), Positives = 94/283 (33%), Gaps = 12/283 (4%)

Query: 54 HQTGASASTLGLALLGVSAGAVVTMMLTGRLCRRFGSHPVTVVCGVLLPLGIALPAQTHS 113
+ + G+ L + + G L RFG PV +V + A+ A
Sbjct: 36 VHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPF 95

Query: 114 ALALGLVLLVFGAAYGGMNVAMNSAAVDLVAALRRPVMPSF-HAAFSLGGMVGAGLGGLV 172
L + +V G VA A D+ R F A F G + G LGGL
Sbjct: 96 LWVLYIGRIVAGITGATGAVAGAYIA-DITDGDERARHFGFMSACFGFGMVAGPVLGGL- 153

Query: 173 AGGLSPATHLFVLTGIGLLVTAATGPVLLRQPAPKPASTADDAEKPRQPAGRARRMV--- 229
GG SP F + L TG LL + + R R +
Sbjct: 154 MGGFSPHAPFFAAAALNGL-NFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVV 212

Query: 230 -LLFGVIALCTAYGEGALADWGALHLEQDLHAHPGIAAAGYSLFAL--AMTAGRLSGTAL 286
L V + G+ A W + E H + F + ++ ++G +
Sbjct: 213 AALMAVFFIMQLVGQVPAALW-VIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGP-V 270

Query: 287 LERLGQTRTLVAGGATAAAGMLLGSLAPTTWLALLGFAVTGLG 329
RLG+ R L+ G G +L + A W+A + G
Sbjct: 271 AARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASG 313



Score = 32.1 bits (73), Expect = 0.004
Identities = 42/161 (26%), Positives = 55/161 (34%), Gaps = 33/161 (20%)

Query: 252 LHLEQDLHAHPGIAAAGYSL--FALAMTAGRLSGTALLERLGQTRTLVAGGATAAAGMLL 309
L D+ AH GI A Y+L FA A G LS +R G+ L+ A AA +
Sbjct: 35 LVHSNDVTAHYGILLALYALMQFACAPVLGALS-----DRFGRRPVLLVSLAGAAVDYAI 89

Query: 310 GSLAPTTWLALLGFAVTGLGLANIFPVAVGRAGELAGPGGVAAAST--------LGY--- 358
+ AP W+ +G V G+ A G A T G+
Sbjct: 90 MATAPFLWVLYIGRIVAGI-----------TGATGAVAGAYIADITDGDERARHFGFMSA 138

Query: 359 ---GGMLLGPPAIGFLADWFSLPLALTTVALLAAAAAALGY 396
GM+ GP G + FS A L G
Sbjct: 139 CFGFGMVAGPVLGGLMGG-FSPHAPFFAAAALNGLNFLTGC 178


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
C5746_05425    HTHFIS    52    6e-10    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 51.8 bits (124), Expect = 6e-10
Identities = 24/118 (20%), Positives = 45/118 (38%), Gaps = 6/118 (5%)

Query: 2 IRVLLADDQLLVRAGF-RALLDAQPDIEVAGEAADGEEAVRLVRELRPDTVLMDIRMPHL 60
+L+ADD +R +AL A D+ + + R + D V+ D+ MP
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITS---NAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 DGLAATRAITGDFELSGVKVVMLTTFELDEYVFEAIRSGASGFLVKDTEPEELLRAVR 118
+ I + V++++ +A GA +L K + EL+ +
Sbjct: 61 NAFDLLPRIKK--ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
C5746_05430PF06580290.049 Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 28.7 bits (64), Expect = 0.049
Identities = 57/362 (15%), Positives = 106/362 (29%), Gaps = 80/362 (22%)

Query: 85 FGSSAAAMIYLAAGYPYGPVFLAVAVGCFSAVVSG-----------HRR----------- 122
G + YG L + + + G R+
Sbjct: 18 IGWGVYTLTGFGFASLYGSPKLHSMIFNIAISLMGLVLTHAYRSFIKRQGWLKLNMGQII 77

Query: 123 -AAWTAV---GMVWLGHVLVAHWLYRWLPPSDDH-PAAWGQELG---VAAWVVAIIAAAE 174
A GMVW VA+ L + P A+ L + VV +
Sbjct: 78 LRVLPACVVIGMVWF----VANTSIWRLLAFINTKPVAFTLPLALSIIFNVVVVTFMWSL 133

Query: 175 FVRVRREQWADQRAEREAAEQRR-ADEERLRMARE------LHDVLAHSISVINVQSSVG 227
++AE + + A E +L + + + L ++I
Sbjct: 134 LYFGWHFFKNYKQAEIDQWKMASMAQEAQLMALKAQINPHFMFNAL-NNIR--------- 183

Query: 228 LALLDSDPEQARTALTTIKAASKEALGEVRQVLDTLRTPGDAPRTPAPGLDRLPELVEQA 287
AL+ DP +AR LT++ + +L +L +D +L
Sbjct: 184 -ALILEDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTV-------VDSYLQLASIQ 235

Query: 288 AGAGLTVTVETD-GVRGAVPPGADLAAFRIVQEALTNVVRHSGSRTAQ-----VRIGYGP 341
L + + + P +VQ + N ++H ++ Q ++
Sbjct: 236 FEDRLQFENQINPAIMDVQVP------PMLVQTLVENGIKHGIAQLPQGGKILLKGTKDN 289

Query: 342 ARIRLRIDDEGPATGDDAG-GSGNGLAGMRERAAALRG-----TIEAGPRADGGFRVRAE 395
+ L +++ G + +G GL +RER L G + G
Sbjct: 290 GTVTLEVENTGSLALKNTKESTGTGLQNVRERLQMLYGTEAQIKLSE---KQGKVNAMVL 346

Query: 396 LP 397
+P
Sbjct: 347 IP 348


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
C5746_05435    HTHTETR    47    2e-08    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 46.5 bits (110), Expect = 2e-08
Identities = 19/112 (16%), Positives = 42/112 (37%), Gaps = 4/112 (3%)

Query: 5 RGARERARIEVTAAIKGEARKQLAAEGAAKLSLRAVARELGMASSALYRYFPSRDELLTA 64
R ++ A+ E I A + + +G + SL +A+ G+ A+Y +F + +L +
Sbjct: 3 RKTKQEAQ-ETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61

Query: 65 LIVDAYDSVGESAEAAHRAARADAAPHITRWTVVAHAVRDWALAHPHEYALI 116
+ + ++GE D + V + + L+
Sbjct: 62 IWELSESNIGELELEYQAKFPGDPLS---VLREILIHVLESTVTEERRRLLM 110


20    C5746_05460    C5746_05535    Y    Y    N    Pathogenicity Island (biased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
C5746_05460320-1.116297threonine aldolase
C5746_05465522-2.550205short-chain dehydrogenase
C5746_05470419-2.513047hypothetical protein
C5746_05480416-2.766685glycerophosphodiester phosphodiesterase
C5746_05485517-2.304981GNAT family N-acetyltransferase
C5746_05490217-3.255477glyoxalase
C5746_05495218-2.695253hypothetical protein
C5746_05500318-2.353599DUF5134 domain-containing protein
C5746_05505322-2.693745hypothetical protein
C5746_05510222-3.701425hypothetical protein
C5746_05515127-5.140420hydrolase
C5746_05520116-4.178483hypothetical protein
C5746_05525-211-4.122813DUF305 domain-containing protein
C5746_0553007-5.315876NADP oxidoreductase
C5746_05535-18-4.015794transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
C5746_05625    DHBDHDRGNASE    57    8e-12    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 57.0 bits (137), Expect = 8e-12
Identities = 48/170 (28%), Positives = 74/170 (43%), Gaps = 3/170 (1%)

Query: 12 ALEGAVVAVAGAAGPAGRATLLRLAEAGATVVASDADATRLAEAVDAARYAHGGATVTGD 71
+EG + + GAA G A LA GA + A D + +L + V + +
Sbjct: 5 GIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAE--ARHAEAF 62

Query: 72 TVDLLDLAAAREWADKTEKEFGRIDGLVHLVGGWRGSATFAETDLADWNLLEKLLIRTVQ 131
D+ D AA E + E+E G ID LV++ G R + +D +W + V
Sbjct: 63 PADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSD-EEWEATFSVNSTGVF 121

Query: 132 HTSLAFQEGLQRSDRGRYVLISAAGASQPTAGNAAYAASKAAAEAWTLAL 181
+ S + + + G V + + A P AAYA+SKAAA +T L
Sbjct: 122 NASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCL 171


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_05660  BCTERIALGSPG  31     0.004    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G

signature.
Length = 145

Score = 30.6 bits (69), Expect = 0.004
Identities = 8/33 (24%), Positives = 21/33 (63%)

Query: 3 VSLALLLLGALAAVVTPRLMARADWPEREPVVA 35
+ + ++++G LA++V P LM + +++ V+
Sbjct: 15 IMVVIVIIGVLASLVVPNLMGNKEKADKQKAVS 47


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_05680  PF07201  32     0.002    Hypersensitivity response secretion protein HrpJ
		>PF07201#Hypersensitivity response secretion protein HrpJ

Length = 293

Score = 31.7 bits (72), Expect = 0.002
Identities = 20/152 (13%), Positives = 42/152 (27%), Gaps = 20/152 (13%)

Query: 41 DNRTKAQAGG---GPSVVAPGKPGEPARTLSAEEAAKAAGDDAPNSADFRYVQ----MMI 93
++ Q G G SV + AEE + S D R + +
Sbjct: 23 SSQIVNQTLGQFRGESVQIVSGTLQSIAD-MAEEVTFVFSERKELSLDKRKLSDSQARVS 81

Query: 94 QHHAQALELTGLVPARSESTAIKRLAERITAGQKPEIGAMEGWLKHNGGEK--------- 144
Q + VP + + L ++ + ++ +L+ E
Sbjct: 82 DVEEQVNQYLSKVPELEQKQNVSELLSLLSNSPNISLSQLKAYLEGKSEEPSEQFKMLCG 141

Query: 145 -RKSGHDHSAMPGMATPAQ--LDQLRTADGAA 173
R + + ++ + L + G
Sbjct: 142 LRDALKGRPELAHLSHLVEQALVSMAEEQGET 173


21  C5746_05790  C5746_05845  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_05790  2       14       2.133738   biphenyl 2,3-dioxygenase
C5746_05795  2       15       2.199355   TetR/AcrR family transcriptional regulator
C5746_05800  1       13       1.479211   FAA hydrolase family protein
C5746_05805  2       14       0.991590   transketolase
C5746_05810  2       14       1.231408   dienelactone hydrolase
C5746_05815  2       17       0.963173   hypothetical protein
C5746_05820  2       11       -0.424856  hypothetical protein
C5746_05830  2       11       0.498830   hypothetical protein
C5746_05840  2       10       -0.047103  hypothetical protein
C5746_05845  2       10       -0.215167  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_05950  HTHTETR  60     2e-13    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 59.6 bits (144), Expect = 2e-13
Identities = 42/203 (20%), Positives = 71/203 (34%), Gaps = 23/203 (11%)

Query: 1 MPRNRQQIPKAEREAVLVDSAWELFAVKGYKGTSLTEVGKAAGVAANAVRWYFPTKDDLF 60
M R +Q + R+ +L D A LF+ +G TSL E+ KAAGV A+ W+F K DLF
Sbjct: 1 MARKTKQEAQETRQHIL-DVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLF 59

Query: 61 AATLGRLFLRERERVEADPV-IGGDPRRQLVTFLADL--ESYRGLHREAYDRMDESPALA 117
+ E GDP L L + + R +
Sbjct: 60 SEIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEF 119

Query: 118 ------------DVYSRMREWLEGRLLAAVAS-RLSEGADVEPVADMAHVLFEGLLVSA- 163
++ + +E L + + L A + GL+ +
Sbjct: 120 VGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWL 179

Query: 164 -----RELDRPTEEFIDLLMEAL 181
+L + +++ +L+E
Sbjct: 180 FAPQSFDLKKEARDYVAILLEMY 202


22  C5746_06105  C5746_06180  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_06105  -1      19       -3.077253  protease inhibitor
C5746_06110  -1      22       -3.187706  alkaline phosphatase
C5746_06115  -1      25       -3.226630  alpha-L-rhamnosidase
C5746_06120  0       25       -3.556343  sugar ABC transporter substrate-binding protein
C5746_06130  3       33       -3.427263  sugar ABC transporter permease
C5746_06135  2       30       -1.320826  sugar ABC transporter permease
C5746_06140  1       21       0.194097   hypothetical protein
C5746_06150  1       20       0.951952   amino acid permease
C5746_06155  2       20       1.127277   glycoside hydrolase
C5746_06160  3       13       1.174200   beta-galactosidase
C5746_06165  1       12       0.991921   carbohydrate ABC transporter permease
C5746_06170  2       12       0.918576   lactose ABC transporter permease
C5746_06175  3       12       0.329884   ABC transporter substrate-binding protein
C5746_06180  2       12       0.249037   aminomethyl-transferring glycine dehydrogenase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_06260  SSBTLNINHBTR  64     5e-16    Streptomyces subtilisin inhibitor signature.
		>SSBTLNINHBTR#Streptomyces subtilisin inhibitor signature.

Length = 144

Score = 63.7 bits (154), Expect = 5e-16
Identities = 37/86 (43%), Positives = 45/86 (52%), Gaps = 1/86 (1%)

Query: 47 IRGVALFCPPAPDAHHPHAAAACAAIDWAEGDLDALPGSPR-LCIEGYDPVTATATGNRD 105
+R V L C P HP AAAACA + A GD AL +C Y PV T G
Sbjct: 59 LRAVTLTCAPTASGTHPAAAAACAELRAAHGDPSALAAEDSVMCTREYAPVVVTVDGVWQ 118

Query: 106 GRPVSWQRTFPNACVMDAVTGPVFRF 131
GR +S++RTF N CV +A + VF F
Sbjct: 119 GRRLSYERTFANECVKNAGSASVFTF 144


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
C5746_06285  MALTOSEBP  52     3e-09    Maltose binding protein signature.
		>MALTOSEBP#Maltose binding protein signature.

Length = 396

Score = 51.7 bits (123), Expect = 3e-09
Identities = 54/196 (27%), Positives = 85/196 (43%), Gaps = 21/196 (10%)

Query: 20 LRIGGGVAALAALPPAL-TACSDSSSGSGKLRIVGVADQQKPIEELVALYRKSHPDDEFS 78
++ G + AL+AL + +A + + GKL I D K L + +K D
Sbjct: 3 IKTGARILALSALTTMMFSASALAKIEEGKLVIWINGD--KGYNGLAEVGKKFEKDTGIK 60

Query: 79 TSFAPTDQVQTVVRTQLAGGNAPDV----HVLYPGSGSAMSMVELARAGLLADLS-DQAW 133
+ D+++ A G+ PD+ H + G A++GLLA+++ D+A+
Sbjct: 61 VTVEHPDKLEEKFPQVAATGDGPDIIFWAHDRFGG---------YAQSGLLAEITPDKAF 111

Query: 134 TKEVPANFHPAYRYKGKTYLYSAGSSAIGAIYNKKAFAEAGVEPPKTWSELLDFCAKLKA 193
++ A RY GK Y A+ IYNK PPKTW E+ +LKA
Sbjct: 112 QDKLYPFTWDAVRYNGKLIAYPIAVEALSLIYNKDLLPN----PPKTWEEIPALDKELKA 167

Query: 194 KGVIPIALGAQTPWVT 209
KG + Q P+ T
Sbjct: 168 KGKSALMFNLQEPYFT 183


23  C5746_06450  C5746_06585  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_06450  2       17       -4.571519  hypothetical protein
C5746_06455  2       22       -4.407920  IS3 family transposase
C5746_06460  3       20       -3.724128  ATP-dependent Clp protease proteolytic subunit
C5746_06465  1       17       -2.216796  hypothetical protein
C5746_06470  0       18       -2.311815  glycoside hydrolase
C5746_06475  2       17       -1.730454  lysine transporter LysE
C5746_06480  0       15       -1.243126  LysR family transcriptional regulator
C5746_06485  0       15       -0.992465  ATP-binding protein
C5746_06490  -1      17       -0.167196  transcriptional regulator
C5746_06495  0       20       -1.580822  DUF397 domain-containing protein
C5746_06500  1       18       -1.024743  8-amino-7-oxononanoate synthase
C5746_06505  2       22       -1.611329  biotin synthase BioB
C5746_06510  1       22       -1.822430  adenosylmethionine--8-amino-7-oxononanoate
C5746_06515  -1      22       -1.809626  dethiobiotin synthase
C5746_06520  -1      23       -2.581495  SAM-dependent methyltransferase
C5746_06525  1       24       -3.034922  fic family toxin-antitoxin system, toxin
C5746_06535  3       23       -2.929215  antitoxin
C5746_06540  1       20       -2.519420  histidine kinase
C5746_06545  1       20       -2.619683  GNAT family N-acetyltransferase
C5746_06550  1       19       -2.646311  hypothetical protein
C5746_06555  -2      16       -1.453432  hypothetical protein
C5746_06560  -2      15       -1.494249  lysine transporter LysE
C5746_06565  2       18       -2.244345  GDSL family lipase
C5746_06570  2       27       -3.443947  adenylosuccinate lyase
C5746_06575  1       25       -2.839488  G/U mismatch-specific DNA glycosylase
C5746_06580  0       19       -3.622290  hypothetical protein
C5746_06585  1       19       -3.582775  two-component sensor histidine kinase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_06730  SACTRNSFRASE  28     0.006    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 28.4 bits (63), Expect = 0.006
Identities = 21/103 (20%), Positives = 36/103 (34%), Gaps = 8/103 (7%)

Query: 33 DHDGVARLISRDPGALLLAERDGLLAGTVIAGFDGW--RCSVYRLAVHPDCRRRGIATAL 90
D D + + A L + G I W + +AV D R++G+ TAL
Sbjct: 52 DDDMDVSYVEEEGKAAFLYYLENNCIGR-IKIRSNWNGYALIEDIAVAKDYRKKGVGTAL 110

Query: 91 MAAAEQ--RFVSLGGRRV---DAMVLEANERAHHTWTAGGYHR 128
+ A + + G + D + + A H + G
Sbjct: 111 LHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFIIGAVDT 153


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_06735  SECYTRNLCASE  32     0.003    Preprotein translocase SecY subunit signature.
		>SECYTRNLCASE#Preprotein translocase SecY subunit signature.

Length = 437

Score = 32.4 bits (74), Expect = 0.003
Identities = 37/170 (21%), Positives = 63/170 (37%), Gaps = 14/170 (8%)

Query: 7 LLVAVLLSLACGAFVAAEF--SLTTVERGQLERAVEQGERGAASAMKAVRSLTFQLSGAQ 64
LL + +L ++ A L TV +LE ++G+ G A + R LT L+ Q
Sbjct: 69 LLQITIFALGIMPYITASIILQLLTVVIPRLEALKKEGQAGTAKITQYTRYLTVALAILQ 128

Query: 65 -LGITVTNLVVGMLSEPSIAKLIRGPVEAVGLSPSVASSLALVLGTALSTVFLMVVGELV 123
G+ T + S+ + S+ +++ +V+ T +M +GEL+
Sbjct: 129 GTGLVATARSAPLFGRCSVG-------GQIVPDQSIFTTITMVICMTAGTCVVMWLGELI 181

Query: 124 PKNWAI---SSPLAVAKAVATPQRAFTAVFRPFISHLNNTANRIVRRIGL 170
I S L AT A A+ + V +GL
Sbjct: 182 TD-RGIGNGMSILMFISIAATFPSALWAIKKQGTLAGGWIEFGTVIAVGL 230


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_06780  PF06580  33     0.003    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 32.5 bits (74), Expect = 0.003
Identities = 18/72 (25%), Positives = 33/72 (45%), Gaps = 9/72 (12%)

Query: 306 ILQESLTNVLRH------SGAVPVRVRISVTNGRLEMEVTN--PLTDSMGTPGGGSGLRG 357
++Q + N ++H G + ++ + NG + +EV N L G+GL+
Sbjct: 259 LVQTLVENGIKHGIAQLPQGGK-ILLKGTKDNGTVTLEVENTGSLALKNTKESTGTGLQN 317

Query: 358 IRERAALLGGKA 369
+RER +L G
Sbjct: 318 VRERLQMLYGTE 329


24  C5746_07295  C5746_07375  Y  N  N  Genomic Island
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_07295  -1      22       -3.500084  5-oxoprolinase
C5746_07305  -1      20       -4.872396  hypothetical protein
C5746_07310  0       22       -5.696560  hypothetical protein
C5746_07315  0       22       -5.521362  hypothetical protein
C5746_07320  -1      21       -4.104897  hypothetical protein
C5746_07325  -1      18       -3.697759  GntR family transcriptional regulator
C5746_07330  0       17       -2.247897  hypothetical protein
C5746_07350  0       18       -0.683235  glycerophosphodiester phosphodiesterase
C5746_07355  -1      15       0.105668   MFS transporter
C5746_07360  0       13       -0.042132  hypothetical protein
C5746_07365  1       12       -0.254035  hypothetical protein
C5746_07370  1       14       0.023917   dolichol-phosphate mannosyltransferase
C5746_07375  3       25       0.903035   amidohydrolase
25  C5746_07740  C5746_07860  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_07740  2       11       0.275155   DNA-directed RNA polymerase subunit omega
C5746_07745  1       12       -0.574817  guanylate kinase
C5746_07750  2       13       -0.904584  30S ribosomal protein S13
C5746_07755  2       11       -1.428700  orotidine-5'-phosphate decarboxylase
C5746_07760  1       11       -0.333734  dihydroorotate dehydrogenase (quinone)
C5746_07765  1       11       -0.392226  carbamoyl phosphate synthase large subunit
C5746_07770  -1      9        0.258697   carbamoyl phosphate synthase small subunit
C5746_07775  -1      8        1.211712   hypothetical protein
C5746_07780  -2      9        1.804784   dihydroorotase
C5746_07785  -1      11       4.077725   aspartate carbamoyltransferase
C5746_07790  -2      10       3.118041   bifunctional pyr operon transcriptional
C5746_07795  -2      9        3.178962   transcriptional regulator
C5746_07800  -1      11       2.855087   transcription antitermination factor NusB
C5746_07805  -1      9        2.447012   elongation factor P
C5746_07810  1       10       1.945656   peptidase M24 family protein
C5746_07815  1       17       -0.201172  hypothetical protein
C5746_07820  1       15       0.452695   type II 3-dehydroquinate dehydratase
C5746_07825  2       22       -0.652550  3-dehydroquinate synthase
C5746_07830  1       23       -0.383601  chorismate synthase
C5746_07835  1       24       -0.519632  shikimate dehydrogenase
C5746_07840  1       25       -0.241716  endolytic transglycosylase MltG
C5746_07845  1       27       -0.466957  Holliday junction resolvase RuvX
C5746_07850  2       27       -0.454266  alanine--tRNA ligase
C5746_07855  1       24       0.086987   hypothetical protein
C5746_07860  2       22       -0.180553  DUF948 domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
C5746_07865  UREASE  29     0.031    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 29.3 bits (66), Expect = 0.031
Identities = 24/84 (28%), Positives = 35/84 (41%), Gaps = 14/84 (16%)

Query: 4 ILIRGAKVLG----GEPQDVLVDGETIAAVGTGIEAGDATVVEAEGQILLPGLVDLHTHL 59
I ++ ++ G P + G TI VG G E V+ EG+I+ G +D H H
Sbjct: 88 IGLKDGRIAAIGKAGNPD--MQPGVTII-VGPGTE-----VIAGEGKIVTAGGMDSHIHF 139

Query: 60 REPGREDSETVLTGTKAAAVGGFT 83
P + E L +GG T
Sbjct: 140 ICP--QQIEEALMSGLTCMLGGGT 161


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
C5746_07900  PERTACTIN  28     0.045    Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 28.1 bits (62), Expect = 0.045
Identities = 22/81 (27%), Positives = 24/81 (29%)

Query: 10 PPPQGPGSGPVGWTHQAQHPGPPGPPPTTPPTPRGWSGPAPQHAPAPVSRETTGHIQLPP 69
G S A P P P P P+ P P P P R+ PP
Sbjct: 554 ANGNGQWSLVGAKAPPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPP 613

Query: 70 GGPVPLPAPPAEPGTGSATLA 90
G A A TG LA
Sbjct: 614 AGRELSAAANAAVNTGGVGLA 634


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_07940  VACJLIPOPROT  26     0.044    VacJ lipoprotein signature.
		>VACJLIPOPROT#VacJ lipoprotein signature.

Length = 251

Score = 26.0 bits (57), Expect = 0.044
Identities = 9/43 (20%), Positives = 18/43 (41%)

Query: 1 MFRRTFWFTAGAAAGVWATAKVNRKLKQLTPESLAAQAADKAI 43
++ W T + G W + + + L + L Q++D I
Sbjct: 169 LYPVLSWLTWPMSVGKWTLEGIETRAQLLDSDGLLRQSSDPYI 211


26  C5746_07980  C5746_08040  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_07980  2       17       -0.091828  pyridoxal 5'-phosphate synthase lyase subunit
C5746_07985  2       15       -0.252191  hypothetical protein
C5746_07990  1       16       -0.830544  alpha-(1-2)-phosphatidylinositol
C5746_07995  2       15       -0.981419  phosphatidylinositol mannoside acyltransferase
C5746_08000  2       16       -0.988573  CDP-diacylglycerol--glycerol-3-phosphate
C5746_08005  1       15       -1.203080  elongation factor G-like protein EF-G2
C5746_08010  2       15       -1.323696  hypothetical protein
C5746_08015  1       16       -0.955364  HIT family hydrolase
C5746_08020  1       15       -1.064241  ion transporter
C5746_08025  2       17       -0.924581  threonine--tRNA ligase
C5746_08030  1       18       -0.633403  hypothetical protein
C5746_08035  1       15       -0.461944  DUF4365 domain-containing protein
C5746_08040  2       10       -0.389357  DNA polymerase III subunit epsilon
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
C5746_08080  TCRTETOQM  329    e-105    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family

signature.
Length = 639

Score = 329 bits (844), Expect = e-105
Identities = 140/693 (20%), Positives = 270/693 (38%), Gaps = 96/693 (13%)

Query: 24 VRNVVLVGHSGSGKTTLVEALALTAGAVNRAGRVEDGATVSDYDEIEHRQQRSVQLSLVP 83
+ N+ ++ H +GKTTL E+L +GA+ G V+ G T +D +E ++ ++Q +
Sbjct: 3 IINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITS 62

Query: 84 VEWGGYKINLLDTPGYTDFVGELRAGLRAADAALFVVSAAQEADAVAGSTRAVWDECAAV 143
+W K+N++DTPG+ DF+ E+ L D A+ ++SA D V TR ++ +
Sbjct: 63 FQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAK---DGVQAQTRILFHALRKM 119

Query: 144 GMPRAIVVTHLDTARTSFDEMTRICGEIFGGDDPDAVLPLYLPVHGPEGPDGHAPLTGLT 203
G+P + +D + +I + V+
Sbjct: 120 GIPTIFFINKIDQNGIDLS---TVYQDIKEKLSAEIVI---------------------- 154

Query: 204 GLLTQRIFDYSSGDRQENPPADAQRTTLQEARNRLIEGIIAESEDETLMDRYLGGGEIDI 263
Q++ Y P T E + +IEG ++ L+++Y+ G ++
Sbjct: 155 ---KQKVELY--------PNMCVTNFTESEQWDTVIEG------NDDLLEKYMSGKSLEA 197

Query: 264 ETLIDDLERAVARGAFHPVLSAAPAAEGARQGIGTVELLDLITGGFPTPLERPLPEVTPL 323
L + + PV + A+ IG L+++IT F + R
Sbjct: 198 LELEQEESIRFHNCSLFPVYHGS-----AKNNIGIDNLIEVITNKFYSSTHRG------- 245

Query: 324 HGAARPVLTCDPHGPLVAEVVKTASDPYVGRVSLVRVFSGTLRPDETVHVFGHGLNDPCH 383
L +V K R++ +R++SG L ++V +
Sbjct: 246 ------------QSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRIS--------- 284

Query: 384 ETRPFHEAEVRVGALSSPFGKHQRPLTACIAGDLAC----VAKLGSAETGDTLSAQGQPL 439
+ ++++ + + + +G++ KL S GDT +
Sbjct: 285 -----EKEKIKITEMYTSINGELCKIDKAYSGEIVILQNEFLKLNSV-LGDTKLLPQRER 338

Query: 440 LMDPWTTPDPLLPLAIEAHSKADEDKLSQGLSRLVAEDPTMRLEQNQDTHQVVLWCLGEA 499
+ +P PLL +E + L L + DP +R + TH+++L LG+
Sbjct: 339 IENP----LPLLQTTVEPSKPQQREMLLDALLEISDSDPLLRYYVDSATHEIILSFLGKV 394

Query: 500 HQDVALDRLRSRYGVQVDAVPHKVSLRETFAGPSTGRGRHVKQSGGHGQYAICEIDVEPL 559
+V L+ +Y V+++ V E + + +A + V PL
Sbjct: 395 QMEVTCALLQEKYHVEIEIKEPTVIYMERPLK--KAEYTIHIEVPPNPFWASIGLSVSPL 452

Query: 560 PPGSGIEFVDKVVGGAVPRQFIPSVEKGVRAQAARGVAAGHPLVDIRITLRDGKSHSVDS 619
P GSG+++ V G + + F +V +G+R +G+ G + D +I + G +S S
Sbjct: 453 PLGSGMQYESSVSLGYLNQSFQNAVMEGIRYGCEQGLY-GWNVTDCKICFKYGLYYSPVS 511

Query: 620 SDAAFQTAGALALREAAADTRIQLLEPVAEIQVLVPDDYVGPVMSDLSGRRGRVIGTEQS 679
+ A F+ + L + +LLEP ++ P +Y+ +D ++ T+
Sbjct: 512 TPADFRMLAPIVLEQVLKKAGTELLEPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQLK 571

Query: 680 GAGRTLVRAEVPELEIGRYAIDLRSLSHGAGRF 712
++ E+P I Y DL ++G
Sbjct: 572 N-NEVILSGEIPARCIQEYRSDLTFFTNGRSVC 603


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_08085  TYPE3IMSPROT  34     0.002    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein

family signature.
Length = 354

Score = 34.0 bits (78), Expect = 0.002
Identities = 14/102 (13%), Positives = 30/102 (29%), Gaps = 7/102 (6%)

Query: 108 SATRLRKARD-------ARLTAVTALFGFLFLPGMLLWVIAFRLRDSLGNTKDKALAALG 160
+ ++R AR + + + + L + +++
Sbjct: 10 TPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIPAEQSYLPFS 69

Query: 161 NALLPVIGIVALLFLIKLPLTGFLALYVRAMIVAPVIGWLIA 202
AL V+ V L F +A + G+LI+
Sbjct: 70 QALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLIS 111


27  C5746_08105  C5746_08140  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias   Product
C5746_08105  4       25       2.505196  chloride transporter
C5746_08110  6       31       1.308056  *****zf-TFIIB domain containing protein
C5746_08115  5       30       1.423452  aminoglycoside phosphotransferase
C5746_08125  5       25       1.822948  hypothetical protein
C5746_08130  4       27       1.917450  serine/threonine protein kinase
C5746_08140  3       12       0.283470  rRNA methyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_08225  YERSSTKINASE  30     0.025    Yersinia serine/threonine protein kinase signature.
		>YERSSTKINASE#Yersinia serine/threonine protein kinase signature.

Length = 732

Score = 29.7 bits (66), Expect = 0.025
Identities = 24/78 (30%), Positives = 38/78 (48%), Gaps = 9/78 (11%)

Query: 132 EAGVVHRDLKPSNILL--SPKGPRIIDFGIAWATGASTLTHVGTAVGSPGFLAPE-QVRG 188
+AGVVH D+KP N++ + P +ID G+ +G + F APE V
Sbjct: 263 KAGVVHNDIKPGNVVFDRASGEPVVIDLGLHSRSGEQPKGF------TESFKAPELGVGN 316

Query: 189 ATVTPATDVFSLGATLAY 206
+ +DVF + +TL +
Sbjct: 317 LGASEKSDVFLVVSTLLH 334


28  C5746_08710  C5746_08735  Y  N  N  Genomic Island
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_08710  2       12       -4.248277  peptide-binding protein
C5746_08715  4       14       -4.847868  hydrolase
C5746_08720  2       14       -5.370250  methionine synthase
C5746_08725  1       14       -4.261107  IclR family transcriptional regulator
C5746_08730  1       16       -3.519196  aquaporin family protein
C5746_08735  2       23       -2.286397  glycerol kinase
29  C5746_08950  C5746_09000  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_08950  2       10       -0.489574  hypothetical protein
C5746_08955  1       8        -0.309161  hypothetical protein
C5746_08960  1       13       0.299327   hypothetical protein
C5746_08965  0       12       -0.038392  hypothetical protein
C5746_08970  -2      10       -1.129925  TetR family transcriptional regulator
C5746_08975  0       10       -1.347242  hypothetical protein
C5746_08980  4       10       -1.383944  SARP family transcriptional regulator
C5746_08985  2       9        -2.082656  bifunctional metallophosphatase/5'-nucleotidase
C5746_08990  2       11       -2.462870  GntR family transcriptional regulator
C5746_08995  3       11       -2.028589  FAD-binding dehydrogenase
C5746_09000  2       12       -1.195877  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_09010  HTHTETR  75     2e-18    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 74.7 bits (183), Expect = 2e-18
Identities = 34/168 (20%), Positives = 64/168 (38%), Gaps = 12/168 (7%)

Query: 40 MAK--QQRAIRTRRLILDSAATMIERNGYTATSIDGISTTAGTTKGAVYFHFPSKGDIAA 97
MA+ +Q A TR+ ILD A + + G ++TS+ I+ AG T+GA+Y+HF K D+ +
Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60

Query: 98 ALFDEAHDSLRAIAT-----RARGPEASPLQSLIDTTHLLVRQFNGDPVTRACIQLCLER 152
+++ + ++ + P + + LI V + I E
Sbjct: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTV-TEERRRLLMEIIFHKCEF 119

Query: 153 HSPAPQ----ARAYRLTWASTVRLLLESAAARREFREGVSAHAVAALV 196
R L + L+ + + A ++
Sbjct: 120 VGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIM 167


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_09015  HTHTETR  84     4e-22    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 83.9 bits (207), Expect = 4e-22
Identities = 42/205 (20%), Positives = 76/205 (37%), Gaps = 15/205 (7%)

Query: 11 QERALRTRRAILEAASIVFEKRGFVAAPLSEIIAASGVTKGALYFHFSSKEELARGVIEA 70
++ A TR+ IL+ A +F ++G + L EI A+GVT+GA+Y+HF K +L + E
Sbjct: 6 KQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWEL 65

Query: 71 QFSAMGQL------RLQGSPLQQAIDSTHAVAHALQHVPLVRAGVRLVIEQGSFTEPDPE 124
S +G+L + G PL + V + R + +I +
Sbjct: 66 SESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEER-RRLLMEIIFHKCEFVGEMA 124

Query: 125 PYLRWVEPAQ--------AFLEQARTEGELRPEVDVLAVAQLIVALFTGTQLMAQVLTQW 176
+ L+ L ++ A ++ +G Q
Sbjct: 125 VVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQS 184

Query: 177 EDLDERITAFWRVVLPGLAMPEALR 201
DL + + ++L + LR
Sbjct: 185 FDLKKEARDYVAILLEMYLLCPTLR 209


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_09025  56KDTSANTIGN  28     0.035    Rickettsia 56kDa type-specific antigen protein sign...
		>56KDTSANTIGN#Rickettsia 56kDa type-specific antigen protein

signature.
Length = 533

Score = 28.4 bits (63), Expect = 0.035
Identities = 17/35 (48%), Positives = 23/35 (65%), Gaps = 1/35 (2%)

Query: 129 SAAALEEALGLWRGP-ALEGVGGGVLCGSAAARLD 162
SA LE+ +GL GP A GV GG++ G+ + RLD
Sbjct: 21 SAIELEDEVGLECGPYAKVGVVGGMITGAESTRLD 55


30  C5746_09695  C5746_09800  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_09695  -1      19       3.400576   iron ABC transporter ATP-binding protein
C5746_09700  -1      19       2.970358   hypothetical protein
C5746_09705  0       16       2.713402   chaplin
C5746_09715  0       12       1.832990   DNA-binding response regulator
C5746_09720  0       20       1.193794   diguanylate cyclase
C5746_09725  0       14       0.893531   IS110 family transposase
C5746_09730  5       23       1.021578   lytic transglycosylase
C5746_09735  2       24       -0.050021  ABC transporter ATP-binding protein
C5746_09740  2       21       -0.107890  phosphoserine phosphatase SerB
C5746_09745  2       21       -0.366706  phosphohistidine phosphatase
C5746_09750  2       22       -1.176876  hypothetical protein
C5746_09755  3       30       -1.489420  MFS transporter
C5746_09775  -1      14       -3.110505  GntR family transcriptional regulator
C5746_09780  -1      13       -2.339437  hypothetical protein
C5746_09785  1       12       -2.362265  enoyl-[acyl-carrier-protein] reductase FabI
C5746_09790  2       15       -1.836006  3-oxoacyl-[acyl-carrier-protein] reductase
C5746_09795  1       8        -0.764322  PIN domain-containing protein
C5746_09800  2       15       -0.698515  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_09815  PF07520  33     0.002    Virulence protein SrfB
		>PF07520#Virulence protein SrfB

Length = 1041

Score = 32.6 bits (74), Expect = 0.002
Identities = 21/97 (21%), Positives = 35/97 (36%), Gaps = 8/97 (8%)

Query: 41 AGKTTLLNIASSYLFPSTGTAKILGEQLGGVGTDVFELRPRIG--IAGIAMAEKLPRRQT 98
AG + + S+ + P + Q GG +R G I G RRQ
Sbjct: 635 AGDDLVHRVISAIVLPRLQDSI---AQAGGQFVAER-MRELFGGDIGGQEQQTVQRRRQF 690

Query: 99 VLQTVLTAAYGMTATWHENYEAVDEERARAFLDRLGM 135
++ ++ A + + E+ E D D LG+
Sbjct: 691 SIRVLVPLAEAILSAC-EDAEEADRIDIP-VADVLGL 725


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
C5746_09830  HTHFIS  72     7e-17    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 71.8 bits (176), Expect = 7e-17
Identities = 27/115 (23%), Positives = 46/115 (40%), Gaps = 2/115 (1%)

Query: 6 IRVLLVDDHQVVRRGLRTFLEIQDDIEVVGEASDGAEGVARTEELRPDVVLMDIKMPGTD 65
+L+ DD +R L L V S+ A D+V+ D+ MP +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAG--YDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 66 GIEALRKLRELENPAKVLIVTSFTEQRTVVPALRAGASGYVYKDVDPDALAGAIR 120
+ L ++++ VL++++ T + A GA Y+ K D L G I
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
C5746_09845  IGASERPTASE  36     2e-04    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 35.8 bits (82), Expect = 2e-04
Identities = 22/121 (18%), Positives = 39/121 (32%), Gaps = 4/121 (3%)

Query: 37 VPANAEGKPEAQTPVSAAPVVLASVAGTPQV----KAVQASIIEQHSTAEQLVKAADLAR 92
VP+N E P T V K ++ + A +
Sbjct: 1010 VPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVA 1069

Query: 93 AEKAAAVKAKAEAAAKAKAAAAAEVKARAEAKARAAAQVKAKAAAERTETQAASRSEART 152
E + VKA + A++ + + E K A + + KA E +TQ + ++
Sbjct: 1070 KEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQV 1129

Query: 153 P 153

Sbjct: 1130 S 1130


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
C5746_09850  PERTACTIN  33     0.007    Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 32.8 bits (74), Expect = 0.007
Identities = 46/162 (28%), Positives = 63/162 (38%), Gaps = 16/162 (9%)

Query: 34 DLTIDDARVSWRHATISWGGHSWFIEDHGSTNGTYVQGRRIHQLEIGPGSAVHLGNATDG 93
DL + D V R A+ G H ++ + GS + G + ++ GSA A
Sbjct: 487 DLGLSDKLVVMRDAS---GQHRLWVRNSGSEPAS---GNTMLLVQTPRGSAATFTLANKD 540

Query: 94 PRLSIG------AAAGADVYSGQGAGAQQAPVQQPQQGGAGRQAPVPPQQQQPQQGWQQA 147
++ IG AA G +S GA A AP PQ G P P Q PQ
Sbjct: 541 GKVDIGTYRYRLAANGNGQWSLVGAKAPPAPKPAPQPG----PQPGPQPPQPPQPPQPPQ 596

Query: 148 PQVQPQQQPAQPQQPQVPHQQGMARTPGAGGPGGVAGAPPVY 189
P PQ+QP P ++ A A GGV A ++
Sbjct: 597 PPQPPQRQPEAPAPQPPAGRELSAAANAAVNTGGVGLASTLW 638


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_09870  TCRTETB  30     0.022    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 29.8 bits (67), Expect = 0.022
Identities = 23/122 (18%), Positives = 44/122 (36%), Gaps = 4/122 (3%)

Query: 72 EVQDGLHMSAGAAGVLTSVPPLCFAI-FGVMAPRLARRFGPGAVVCAGMIAITAG-LVIR 129
++D +S G + P I FG + L R GP V+ G+ ++ L
Sbjct: 282 MMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTAS 341

Query: 130 PLVGGTAGFLAASALALMG--IAVSNVLMPVIVKRWFPDRVGSMTGLYSMALALGTSLAA 187
L+ T+ F+ + ++G V+ ++ G+ L + L
Sbjct: 342 FLLETTSWFMTIIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGI 401

Query: 188 AV 189
A+
Sbjct: 402 AI 403


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_09885  DHBDHDRGNASE  49     7e-09    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 48.5 bits (115), Expect = 7e-09
Identities = 56/267 (20%), Positives = 99/267 (37%), Gaps = 38/267 (14%)

Query: 5 LDGKRILITGVLMESSIAFHTAKVAQEQGAEVILTAF----PRPTLTERIAKKLPKPAKV 60
++GK ITG I A+ QGA + A + K + A+
Sbjct: 6 IEGKIAFITGA--AQGIGEAVARTLASQGAHI--AAVDYNPEKLEKVVSSLKAEARHAEA 61

Query: 61 IELDVTNAEHLDRLAGLVRDELGSLDGVVHSIGF---APQDALGGNFLNTPFESVSTAMH 117
DV ++ +D + + E+G +D +V+ G +L E
Sbjct: 62 FPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSL-------SDEEWEATFS 114

Query: 118 VSAFSLKSLAMACKPLM--SEGGSIVGLTFDAQYAWPQYDW--MGPAKAALEATSRYLAR 173
V++ + + + + M GSIV + + P+ +KAA ++ L
Sbjct: 115 VNSTGVFNASRSVSKYMMDRRSGSIVTVGSNPAGV-PRTSMAAYASSKAAAVMFTKCLGL 173

Query: 174 DLGKDDIRCNLISAGPIGS-----------MAAKSIPGFSELADVWNSRSPLAWNMSDPE 222
+L + +IRCN++S G + A + I G E + + PL ++ P
Sbjct: 174 ELAEYNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLE---TFKTGIPLK-KLAKPS 229

Query: 223 PAGRGVVALLSDFFPKTTGEIIHVDGG 249
V+ L+S T + VDGG
Sbjct: 230 DIADAVLFLVSGQAGHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_09890  DHBDHDRGNASE  138    4e-42    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 138 bits (348), Expect = 4e-42
Identities = 78/252 (30%), Positives = 125/252 (49%), Gaps = 16/252 (6%)

Query: 3 RSVLVTGGNRGIGLAIARAFAENGDKVAITYRSGEPPQILTEAGVLAVR------CDITD 56
+ +TG +GIG A+AR A G +A + E + + + R D+ D
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 57 AEQVEQAYKEIEDKHGPVEVLVANAGITKDQLLMRMSEEDFTSVLDTNLTGTFRVVKRAN 116
+ +++ IE + GP+++LV AG+ + L+ +S+E++ + N TG F + +
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 117 RGMLRAKKGRVVLISSVVGLLGSAGQANYAASKAGLVGFARSLARELGSRNITFNVVAPG 176
+ M+ + G +V + S + A YA+SKA V F + L EL NI N+V+PG
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188

Query: 177 FVDTDMTQVL-----TEEQR-KGIVSQ----VPLGRYAQPEEIAAAVRFLASDDASYITG 226
+TDM L EQ KG + +PL + A+P +IA AV FL S A +IT
Sbjct: 189 STETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHITM 248

Query: 227 AVIPVDGGLGMG 238
+ VDGG +G
Sbjct: 249 HNLCVDGGATLG 260


31  C5746_09855  C5746_09930  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_09855  2       11       0.554929   DUF485 domain-containing protein
C5746_09860  5       11       0.347320   peptidase S8
C5746_09865  3       12       0.264210   hypothetical protein
C5746_09875  2       13       -1.565518  IMP dehydrogenase
C5746_09885  2       15       -2.595970  lysoplasmalogenase
C5746_09890  2       17       -2.457677  C-5 sterol desaturase
C5746_09895  1       19       -0.744209  ATP-binding protein
C5746_09900  0       21       0.787094   hypothetical protein
C5746_09905  -1      21       0.594330   amidohydrolase
C5746_09910  -2      19       -0.052535  amidohydrolase
C5746_09915  -1      20       0.191030   alcohol dehydrogenase
C5746_09920  -2      20       0.244561   acetoacetate decarboxylase
C5746_09925  0       12       0.245874   TetR family transcriptional regulator
C5746_09930  2       14       -0.968919  DNA polymerase III subunit epsilon
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
C5746_09965  SUBTILISIN  200    5e-62    Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 200 bits (509), Expect = 5e-62
Identities = 107/347 (30%), Positives = 140/347 (40%), Gaps = 62/347 (17%)

Query: 152 MEPLQWDLPAIKADQAHQKTLGSSKVTVAVIDTGVDDTHPDLAPNFNRGASASCVSGAPD 211
+ + + I+A +T G V VAV+DTG D HPDL G + + D
Sbjct: 19 VNEIPRGVEMIQAPAVWNQTRGR-GVKVAVLDTGCDADHPDLKARIIGGRNFT------D 71

Query: 212 TTDGAWRPKTGESDHGTHVAGTIAAAKNGVGVTGVAPGVKVAGIKVANPDGFFYTESIVC 271
+G + HGTHVAGTIAA +N GV GVAP + IKV N G + I+
Sbjct: 72 DDEGDPEIFKDYNGHGTHVAGTIAATENENGVVGVAPEADLLIIKVLNKQGSGQYDWIIQ 131

Query: 272 GFVWAAEHGVDVTNNSYYTDPWMFNCKNDLDQGALVEAVARATRYAEHKGTVNVASAGNS 331
G +A E VD+ + S D L EAV +A + + +AGN
Sbjct: 132 GIYYAIEQKVDIISMSL---------GGPEDVPELHEAVKKAVA----SQILVMCAAGN- 177

Query: 332 KFDLASGAIDDTTSPNDTTAATRTIDPRECLDIPAMLPGVVTVSATGAKGLKSSYSNYGN 391
G DD T L P V++V A S +SN N
Sbjct: 178 -----EGDGDDRTD---------------ELGYPGCYNEVISVGAINFDRHASEFSNSNN 217

Query: 392 GVIDVAAPGGDSTVYQTPAPPATSGLILSTLPGGKYGYKAGTSMASPHVAGVAALIKS-- 449
V D+ APG D ILST+PGGKY +GTSMA+PHVAG ALIK
Sbjct: 218 EV-DLVAPGED---------------ILSTVPGGKYATFSGTSMATPHVAGALALIKQLA 261

Query: 450 ---THPHASAAAVKALLTAEADATACGAPYDFNGDGKIDAVCEGGKN 493
+ + A L + NG + AV E +
Sbjct: 262 NASFERDLTEPELYAQLIKRTIPLGNSPKMEGNGLLYLTAVEELSRI 308


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_10010  DHBDHDRGNASE  64     3e-14    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 64.3 bits (156), Expect = 3e-14
Identities = 49/169 (28%), Positives = 80/169 (47%), Gaps = 4/169 (2%)

Query: 5 EGQVAVVTGAASGIGLAMARRFAAEGLKVVLADVEEGALDKAAGELRRDGAQVLARAVDV 64
EG++A +TGAA GIG A+AR A++G + D L+K L+ + A DV
Sbjct: 7 EGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADV 66

Query: 65 GERESVQALAEAAYDTFGAVHVLCNNAGVGSGAEGRMWEHEPNDWKWAFAVNVWGVFHGI 124
+ ++ + G + +L N AGV G + +W+ F+VN GVF+
Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLR--PGLIHSLSDEEWEATFSVNSTGVFNAS 124

Query: 125 QAFVPRMIAGGGPGHVVNTSSGDGGIAPLPTASVYAVTKSAVVTMTESL 173
++ M+ G +V S G+ P + + YA +K+A V T+ L
Sbjct: 125 RSVSKYMMDRRS-GSIVTVGSNPAGV-PRTSMAAYASSKAAAVMFTKCL 171


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_10020  TETREPRESSOR  62     7e-14    Tetracycline repressor protein signature.
		>TETREPRESSOR#Tetracycline repressor protein signature.

Length = 218

Score = 61.9 bits (150), Expect = 7e-14
Identities = 51/207 (24%), Positives = 83/207 (40%), Gaps = 15/207 (7%)

Query: 1 MARSSLTREQVLDTAGALVKRHGPQALTMRALAAELGTAVTSIYWHVGNRESLLDALVER 60
MAR L RE V+D A L+ G LT R LA +LG ++YWHV N+ +LLDAL
Sbjct: 1 MAR--LNRESVIDAALELLNETGIDGLTTRKLAQKLGIEQPTLYWHVKNKRALLDALAVE 58

Query: 61 TVQEMGT--LRPAGRTPAARIVSVARGLHRELRARPHLIAMVHERGLTERMFLPAQQALV 118
+ L AG + + + + A R L + E+ + + L
Sbjct: 59 ILARHHDYSLPAAGESWQSFLRNNAMSFRRALLRYRDGAKVHLGTRPDEKQYDTVETQL- 117

Query: 119 HEVHAAGLRGARAAAAVRAVQFQTVGFLLVERNRERSPVQSPGEGDLWTASTADDDPALA 178
+ G A+ AV T+G +L ++ + P + ++ P L
Sbjct: 118 RFMTENGFSLRDGLYAISAVSHFTLGAVLEQQEHTAALTDRPA-------APDENLPPLL 170

Query: 179 RA---LARPADPDRLFADSVRALVEGL 202
R + D ++ F + +L+ G
Sbjct: 171 REALQIMDSDDGEQAFLHGLESLIRGF 197


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_10025  PF07675  30     0.016    Cleaved Adhesin
		>PF07675#Cleaved Adhesin

Length = 1358

Score = 30.1 bits (67), Expect = 0.016
Identities = 16/68 (23%), Positives = 24/68 (35%), Gaps = 4/68 (5%)

Query: 9 TAAPWPTAYPQGYAVVDVETTGLARDDRIVSAAVYRLDAQGNVE----DHWYTLVNPERD 64
T + + E A D +V+ + QG V + Y + NPE
Sbjct: 422 TGPLFTGTASSNLYSANFEYLTPANADPVVTTQNIIVTGQGEVVIPGGVYDYCITNPEPA 481

Query: 65 PGPVWIHG 72
G +WI G
Sbjct: 482 SGKMWIAG 489


32  C5746_10050  C5746_10105  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_10050  2       22       3.008556   cobyric acid synthase
C5746_10055  3       17       2.199567   cobaltochelatase subunit CobN
C5746_10060  2       18       1.095900   magnesium chelatase
C5746_10065  3       13       0.065473   cob(I)yrinic acid a,c-diamide
C5746_10070  2       12       0.177332   cobyrinic acid a,c-diamide synthase
C5746_10075  1       13       0.495112   precorrin-2 C(20)-methyltransferase
C5746_10080  2       14       1.207801   permease
C5746_10085  1       15       1.576661   precorrin-4 C(11)-methyltransferase
C5746_10090  0       14       1.739357   bifunctional cobalt-precorrin-7
C5746_10095  0       10       2.248415   precorrin-3B C(17)-methyltransferase
C5746_10100  0       13       2.463646   precorrin-8X methylmutase
C5746_10105  0       14       3.395365   sirohydrochlorin chelatase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
C5746_10190  HTHFIS  29     0.033    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 28.6 bits (64), Expect = 0.033
Identities = 19/115 (16%), Positives = 38/115 (33%), Gaps = 13/115 (11%)

Query: 28 DFVAELGERHPELPVA--GGFIELSPPPLGDAVTELVERGVKRFAAVPLMLVSAGHAKGD 85
D + + + P+LPV A+ E+G + P L G
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFM-----TAIKAS-EKGAYDYLPKPFDLTELIGIIGR 117

Query: 86 IPAALAREKERHPGISYTYGRPLGPHPALLKVLERRVDAVLGDTDRSEVTVLLVG 140
A R + S +G A+ ++ L ++++T+++ G
Sbjct: 118 ALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRV-----LARLMQTDLTLMITG 167


33  C5746_10410  C5746_10485  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_10410  3       11       0.523337   cytochrome B
C5746_10415  3       20       -0.214073  hypothetical protein
C5746_10420  1       15       0.112792   protoheme IX farnesyltransferase
C5746_10425  2       13       -0.632953  transketolase
C5746_10430  1       12       -1.681223  transaldolase
C5746_10435  0       13       -2.186061  glucose-6-phosphate dehydrogenase
C5746_10440  1       14       -2.799843  glucose-6-phosphate dehydrogenase assembly
C5746_10445  1       14       -2.884733  6-phosphogluconolactonase
C5746_10450  3       18       -2.955433  hypothetical protein
C5746_10455  3       20       -3.676629  hypothetical protein
C5746_10460  2       19       -2.833032  glucose-6-phosphate isomerase
C5746_10465  3       20       -1.963853  RNA polymerase-binding protein RbpA
C5746_10470  2       17       -1.349745  preprotein translocase subunit SecG
C5746_10475  1       16       -0.768287  triose-phosphate isomerase
C5746_10480  0       15       -1.033827  phosphoglycerate kinase
C5746_10485  2       12       -0.378069  type I glyceraldehyde-3-phosphate dehydrogenase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
C5746_10565  SECGEXPORT  37     8e-07    Protein-export SecG membrane protein signature.
		>SECGEXPORT#Protein-export SecG membrane protein signature.

Length = 110

Score = 37.2 bits (86), Expect = 8e-07
Identities = 19/71 (26%), Positives = 39/71 (54%)

Query: 1 MILAFEIALIVFSLLLMLLVLMHKGKGGGLSDMFGGGMQSSVGGSSVAERNLDRITVFIG 60
M A + ++ ++ L+ L+++ +GKG + FG G +++ GSS + + R+T +
Sbjct: 1 MYEALLVVFLIVAIGLVGLIMLQQGKGADMGASFGAGASATLFGSSGSGNFMTRMTALLA 60

Query: 61 LAWFACIVVLG 71
+F +VLG
Sbjct: 61 TLFFIISLVLG 71


34  C5746_10895  C5746_10970  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_10895  2       48       -2.942971  aminopeptidase N
C5746_10900  3       41       -3.099552  MarR family transcriptional regulator
C5746_10905  3       37       -3.203565  hypothetical protein
C5746_10910  3       37       -2.703659  alpha/beta hydrolase
C5746_10915  3       35       -2.843360  TetR family transcriptional regulator
C5746_10920  2       13       -3.914168  chorismate mutase
C5746_10925  2       18       -3.471620  AraC family transcriptional regulator
C5746_10935  1       30       -2.383757  glycosyl hydrolase
C5746_10940  2       36       -1.297386  trypsin
C5746_10945  2       31       -1.128025  hypothetical protein
C5746_10950  0       24       -0.840231  short-chain dehydrogenase
C5746_10955  -1      40       0.695661   enoyl-CoA hydratase family protein
C5746_10960  0       30       1.071997   acyl CoA--acetate/3-ketoacid CoA transferase
C5746_10965  -2      18       0.393032   CoA-transferase
C5746_10970  3       14       1.440788   2-nitropropane dioxygenase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_10985  HTHTETR  47     5e-09    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 47.3 bits (112), Expect = 5e-09
Identities = 21/105 (20%), Positives = 45/105 (42%), Gaps = 2/105 (1%)

Query: 2 ARAALTTDAVVDVALLITDEKGPAALTLSAVAGRAGVATPSLYKHVRNLAELRSLVSARI 61
A T ++DVAL + ++G ++ +L +A AGV ++Y H ++ ++L S +
Sbjct: 7 QEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELS 66

Query: 62 MNEIADQVGEAVLGRSAD--EAIRAVMMAWRHYALRHPHRYSALI 104
+ I + E D +R +++ + R +
Sbjct: 67 ESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLME 111


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
C5746_11005  V8PROTEASE  51     1e-09    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 51.2 bits (122), Expect = 1e-09
Identities = 32/235 (13%), Positives = 62/235 (26%), Gaps = 46/235 (19%)

Query: 33 PTPVVGGTRAAQGEFPFMVRLSM-------GCGGALYAKNIVLTAAHCVDGSGNNTSITA 85
T G + + + + G + K+ +LT H VD + +
Sbjct: 73 NNDRHQITDTTNGHYAPVTYIQVEAPTGTFIASGVVVGKDTLLTNKHVVDATHGDPHALK 132

Query: 86 TAGVVDLQSSSAI-KVKSTKVLQAPGYNGKGKDWALIKLAKPID-------LPTLKIATN 137
Q + + ++ + G D A++K + + ++ N
Sbjct: 133 AFPSAINQDNYPNGGFTAEQITKYSGEG----DLAIVKFSPNEQNKHIGEVVKPATMSNN 188

Query: 138 TTYNNGTFA-IAGWGAATEGGSQQRYLLKATVPFVSDADCQGAYGSDLVPGDEICAGLLD 196
+ G+ + + S G + D
Sbjct: 189 AETQVNQNITVTGYPGDKPVATM----------WESKGKITYLKGEAM---------QYD 229

Query: 197 TGGVDTCQGDSGGPMFRKDNAGAWIQVGIVSWGQGCARPGYPGVYSEVSTFAANI 251
T G+SG P+F + N +GI G G + V F
Sbjct: 230 L---STTGGNSGSPVFNEKNE----VIGIHWGGVPNEFNGAVFINENVRNFLKQN 277


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_11015  DHBDHDRGNASE  73     7e-17    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 72.8 bits (178), Expect = 7e-17
Identities = 49/214 (22%), Positives = 87/214 (40%), Gaps = 16/214 (7%)

Query: 26 DGRVVIVTGAGRGLGRAHALAFAAEGAKVVVNDLGVGPGGDGGSAGPARQVVDEIVAAGG 85
+G++ +TGA +G+G A A A++GA + D + +VV + A
Sbjct: 7 EGKIAFITGAAQGIGEAVARTLASQGAHIAAVDY---------NPEKLEKVVSSLKAEAR 57

Query: 86 EAVAHGGDIATMEGAASLVAAALETFGRLDTLVNNAGFLRDRMLVNLDEDDWDAVMRVHL 145
A A D+ + A G +D LVN AG LR ++ +L +++W+A V+
Sbjct: 58 HAEAFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNS 117

Query: 146 KGHFLPLKHAAAHWRAEAKAGRAPVARVVNTSSGAGLLGSVGQGNYSAAKAGIVGLTLVA 205
G F + + + +V S + Y+++KA V T
Sbjct: 118 TGVFNASRSVSKYMMDRRSGS------IVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCL 171

Query: 206 AAEMGRYGVQVNAIAPAA-RTRMTEQTFAQTMAA 238
E+ Y ++ N ++P + T M +A A
Sbjct: 172 GLELAEYNIRCNIVSPGSTETDMQWSLWADENGA 205


35  C5746_11110  C5746_11165  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_11110  2       8        3.329422   tryptophan synthase subunit(beta)
C5746_11115  2       9        3.054182   indole-3-glycerol phosphate synthase TrpC
C5746_11120  2       10       2.742672   hypothetical protein
C5746_11125  3       10       1.929434   hypothetical protein
C5746_11130  -1      9        1.397445   TIGR02234 family membrane protein
C5746_11135  1       12       1.028524   anthranilate synthase component I
C5746_11140  0       13       0.671864   phosphoribosyl-AMP cyclohydrolase
C5746_11145  0       15       0.905444   TIGR03085 family protein
C5746_11150  2       19       0.526078   MFS transporter
C5746_11155  3       19       0.626521   transcriptional regulator
C5746_11160  3       21       1.031132   imidazole glycerol phosphate synthase subunit
C5746_11165  2       21       1.907781   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_11180  PF06580  29     0.009    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 28.7 bits (64), Expect = 0.009
Identities = 12/41 (29%), Positives = 20/41 (48%), Gaps = 1/41 (2%)

Query: 98 IFAVLWVVWMARAWRGRPMSIAAKPVVWWGTGAVLLIFSIV 138
+ ++W V WR I KPV + A+ +IF++V
Sbjct: 86 VIGMVWFVANTSIWRLLAF-INTKPVAFTLPLALSIIFNVV 125


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_11210  TCRTETA  51     5e-09    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 50.6 bits (121), Expect = 5e-09
Identities = 81/345 (23%), Positives = 129/345 (37%), Gaps = 14/345 (4%)

Query: 52 ATQTGLVLAVGSIPRALLMLGGGVLADRIGPRRVVIGSDAARCLVVLGLAGALLLTSPAV 111
G++LA+ ++ + G L+DR G R V++ S L + A++ T+P +
Sbjct: 42 TAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVS-----LAGAAVDYAIMATAPFL 96

Query: 112 WMLIAVALVFGVVDALFLPAVGALPPRITAAGQLARVQGLRGLASRTANVVGAPLGGVAV 171
W+L +V G+ A GA IT + AR G V G LGG+
Sbjct: 97 WVLYIGRIVAGITGATG-AVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMG 155

Query: 172 AFGGPRLAFAAAGVLFAVSLPLLLSLRISPSTAQETAEPSGTAWHDLTDGLRHIRRHPVL 231
F P F AA L L L + P + + P + R R V+
Sbjct: 156 GFS-PHAPFFAAAALNG--LNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVV 212

Query: 232 GPLMLVVALSELGFAGPLNLGLILLARERGWGASGMGWIVAAFGI-GAGASALLLAVRGR 290
LM V + +L P L +I W A+ +G +AAFGI + A A++
Sbjct: 213 AALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAA 272

Query: 291 VPRAGLVMCLTVLIGAVAIAALAHVPSVGLAVTVAVCIGLFAGLGGSLCGALIQTAADPA 350
+ L ++ LA +A + V + G+G A++ D
Sbjct: 273 RLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASG-GIGMPALQAMLSRQVDEE 331

Query: 351 YLGRVTSVSTLFTHA---IAPLSYPVTGAAVAAWGTGPVFVASAA 392
G++ T + PL + AA G ++A AA
Sbjct: 332 RQGQLQGSLAALTSLTSIVGPLLFTAIYAASITTWNGWAWIAGAA 376


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_11225  CHLAMIDIAOM6  28     0.014    Chlamydia cysteine-rich outer membrane protein 6 si...
		>CHLAMIDIAOM6#Chlamydia cysteine-rich outer membrane protein 6 signature.

Length = 547

Score = 27.7 bits (61), Expect = 0.014
Identities = 13/31 (41%), Positives = 18/31 (58%)

Query: 38 SVENGQISAGGPYEQTVTSFKVAFDALKQLG 68
S E +S GP + T+T V FD+L +LG
Sbjct: 474 SKELQPVSFSGPTKGTITGNTVVFDSLPRLG 504


36  C5746_11685  C5746_11750  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_11685  2       11       2.763047   AsnC family transcriptional regulator
C5746_11690  2       10       2.367077   cysteine desulfurase
C5746_11695  2       10       1.554140   anthranilate phosphoribosyltransferase
C5746_11700  2       11       0.714338   ubiquinol-cytochrome c reductase cytochrome b
C5746_11705  3       12       0.141256   ubiquinol-cytochrome C reductase
C5746_11710  0       11       0.608359   cystathionine beta-lyase
C5746_11715  -2      15       -1.884448  cytochrome B
C5746_11720  -1      15       -1.944011  hypothetical protein
C5746_11725  -2      16       -2.037155  hypothetical protein
C5746_11730  -2      15       -2.653227  cytochrome C oxidase subunit IV
C5746_11735  -1      15       -3.324777  cytochrome c oxidase subunit I
C5746_11740  -2      14       -3.861574  cytochrome c oxidase subunit II
C5746_11745  -1      12       -3.500679  cysteine desulfurase
C5746_11750  -2      12       -4.815603  carbohydrate kinase family protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_11745  TYPE3IMSPROT  29     0.041    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 28.6 bits (64), Expect = 0.041
Identities = 10/90 (11%), Positives = 33/90 (36%), Gaps = 1/90 (1%)

Query: 55 AVAFMFTLSMLATIGFIASYVIFPVDKIVYIWPFGHVSALNFSLGLTLGVALFAIGAGAV 114
++ + + + + + ++ I + + + +++
Sbjct: 149 LLSILIWIIIKGNLVTLLQLPTCGIECITPLLGQILRQLMVICTVGFVVISIADYAFEYY 208

Query: 115 HWARTL-MSDVEVADDRHAIEATPEVKAKV 143
+ + L MS E+ + +E +PE+K+K
Sbjct: 209 QYIKELKMSKDEIKREYKEMEGSPEIKSKR 238


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
C5746_11760  HTHFIS  31     0.001    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 31.0 bits (70), Expect = 0.001
Identities = 17/84 (20%), Positives = 29/84 (34%), Gaps = 6/84 (7%)

Query: 5 ATVLVYSDDANTREQVRLAAGRRPAADVPPVEFLECATLPAVLDALDNGGIDVCVLDGET 64
AT+LV DDA R + A R + + + + G D+ V D
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGY------DVRITSNAATLWRWIAAGDGDLVVTDVVM 57

Query: 65 APAGGMGVCRQIKDEIFHCPPVLL 88
+ +IK P +++
Sbjct: 58 PDENAFDLLPRIKKARPDLPVLVM 81


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_11785  PF01206  58     5e-13    SirA family protein
		>PF01206#SirA family protein

Length = 76

Score = 57.8 bits (140), Expect = 5e-13
Identities = 14/68 (20%), Positives = 33/68 (48%)

Query: 391 VDALGKRCPIPVIELAKVIGEVPLGGTVTVLSDDEAARLDIPAWCAMREQEYVGEEPADR 450
+DA G CP+P+++ K + + G + V++ D + D ++ E + ++ D
Sbjct: 8 LDATGLNCPLPILKAKKTLATMNAGEVLYVMATDPGSVKDFESFSKQTGHELLEQKEEDG 67

Query: 451 GSAYVVRR 458
+ ++R
Sbjct: 68 TYHFRLKR 75


37  C5746_12010  C5746_12050  Y  N  N  Genomic Island
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_12010  -2      11       -3.301451  XRE family transcriptional regulator
C5746_12015  -2      13       -3.843680  hypothetical protein
C5746_12020  -2      19       -2.878206  hypothetical protein
C5746_12030  2       28       -4.786983  arsenate reductase
C5746_12040  4       28       -3.974820  hypothetical protein
C5746_12045  4       25       -3.682296  oxidoreductase
C5746_12050  3       25       -4.061365  hypothetical protein
38  C5746_12230  C5746_12260  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_12230  2       11       -2.262941  hypothetical protein
C5746_12235  3       13       -2.635888  DUF305 domain-containing protein
C5746_12240  3       16       -3.232505  CBS domain-containing protein
C5746_12245  2       17       -3.276902  copper oxidase
C5746_12250  2       17       -3.124727  NAD+ synthase
C5746_12260  1       14       -3.364476  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_12300  PF07132  29     0.040    Harpin protein (HrpN)
		>PF07132#Harpin protein (HrpN)

Length = 356

Score = 29.3 bits (65), Expect = 0.040
Identities = 19/72 (26%), Positives = 26/72 (36%), Gaps = 2/72 (2%)

Query: 225 GVGGSTPDGVLDELSGGRGGMGHGMGHDGMGGKSDSSHAGHGGSPKSAKRRSGPSRIMKK 284
G GS+ G+ L GG G G G G + G GG+ + PS +M
Sbjct: 74 GGLGSSLGGLGGGLLGGGLGGGLGSSLGSGLGSALG--GGLGGALGAGMNAMNPSAMMGS 131

Query: 285 SFSELLDSHGGN 296
L+ G
Sbjct: 132 LLFSALEDLLGG 143


39  C5746_12315  C5746_12415  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_12315  2       9        -0.902774  type I methionyl aminopeptidase
C5746_12320  1       10       1.733751   biliverdin-producing heme oxygenase
C5746_12325  0       9        1.171601   transposase
C5746_12330  0       10       1.224861   phenazine biosynthesis protein PhzC/PhzF
C5746_12335  2       9        1.921124   hypothetical protein
C5746_12340  1       10       2.241745   Htaa domain protein
C5746_12345  1       10       3.003974   ABC transporter substrate-binding protein
C5746_12350  0       19       0.359200   heme ABC transporter permease
C5746_12355  0       18       0.836174   heme ABC transporter ATP-binding protein
C5746_12360  -3      20       -0.273860  peptidase M75
C5746_12365  -1      20       -1.916830  deferrochelatase/peroxidase EfeB
C5746_12370  -1      17       -2.666666  iron transporter
C5746_12375  1       11       -1.264713  hypothetical protein
C5746_12380  2       11       -0.189046  DNA primase
C5746_12385  2       10       0.251535   TetR family transcriptional regulator
C5746_12390  1       11       1.405133   amidinotransferase
C5746_12395  1       10       2.575193   rRNA methyltransferase
C5746_12400  3       12       1.699784   hypothetical protein
C5746_12405  2       11       1.699251   Bcr/CflA family drug resistance efflux
C5746_12410  1       12       0.826342   oxidoreductase
C5746_12415  2       13       0.479744   alkaline phosphatase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_12400  cloacin  33     0.004    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 32.8 bits (74), Expect = 0.004
Identities = 25/68 (36%), Positives = 32/68 (47%), Gaps = 4/68 (5%)

Query: 448 GGTGSTGSTGPAGGSTATGGGSVGGGSVGGSGSLAATGSDV----PAGALIAASGAVVAT 503
GG+GS G G GG GG G G+L+A + V PA + A G V+
Sbjct: 48 GGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSI 107

Query: 504 GAGAVIAA 511
AGA+ AA
Sbjct: 108 SAGALSAA 115


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_12420  PF05272  29     0.023    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.3 bits (65), Expect = 0.023
Identities = 9/20 (45%), Positives = 11/20 (55%)

Query: 54 VLALVGPNGAGKSTLLAALA 73
+ L G G GKSTL+ L
Sbjct: 598 SVVLEGTGGIGKSTLINTLV 617


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_12450  HTHTETR  79     3e-20    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 78.5 bits (193), Expect = 3e-20
Identities = 41/208 (19%), Positives = 68/208 (32%), Gaps = 18/208 (8%)

Query: 6 PPDPSRRSERSRRAIYDAALGLVGEVGYPRTTVEAIAARAGVGKQTIYRWWPSKAAVLLE 65
+ ++ +R+ I D AL L + G T++ IA AGV + IY + K+ + E
Sbjct: 2 ARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61

Query: 66 AFVALGDRVAEENGAGDARGLPDTGDLAADLKVVLRATVDELTDPAFEAPTRALAAEGIV 125
+ + E L D VLR + + + R L E I
Sbjct: 62 IWELSESNIGE-------LELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIF 114

Query: 126 DPQLGAEFVQKLLD-------PSLRLYVARLRAAQEAGQVRADIDPRIALELLVGPLT-- 176
+ + S L+ EA + AD+ R A ++ G ++
Sbjct: 115 HKCEFVGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGL 174

Query: 177 -HRWLLR-TLPLTHEYADAIVDYTLHGL 202
WL + A V L
Sbjct: 175 MENWLFAPQSFDLKKEARDYVAILLEMY 202


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_12455  ARGDEIMINASE  32     0.003    Bacterial arginine deiminase signature.
		>ARGDEIMINASE#Bacterial arginine deiminase signature.

Length = 409

Score = 31.7 bits (72), Expect = 0.003
Identities = 28/186 (15%), Positives = 56/186 (30%), Gaps = 34/186 (18%)

Query: 17 MDPSKPVDLPLAQTQWEDLRDRYRTLGHTVELLTPR--PELPDMVFAANGATVIDGRV-- 72
+ + ++ E+L++ +L V +P+++F + I V
Sbjct: 116 LTIDNMISKMISGVVTEELKNYTSSLDDLVNGANLFIIDPMPNVLFTRDPFASIGNGVTI 175

Query: 73 ------------LGARFAYQERYEEAGAHREWFRDNGFTAIHEPAHVNEGEGDFAVTASY 120
+ A + ++ W ++ EG GD V
Sbjct: 176 NKMFTKVRQRETIFAEYIFKYHPVYKENVPIWLNRWEEASL-------EG-GDELVLNKG 227

Query: 121 ILA-GRGFRSSPLSHNE-AQEFFG-----RPVVGLDLVDPR-YYHLDTALSVLDDAGDEI 172
+L G R+ S + A F ++ + R Y HLDT + +D
Sbjct: 228 LLVIGISERTEAKSVEKLAISLFKNKTSFDTILAFQIPKNRSYMHLDTVFTQIDY--SVF 285

Query: 173 MYYPGA 178
+
Sbjct: 286 TSFTSD 291


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_12470  TCRTETA  64     3e-13    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 64.1 bits (156), Expect = 3e-13
Identities = 84/361 (23%), Positives = 142/361 (39%), Gaps = 28/361 (7%)

Query: 57 TALPPLSMDMYLPALPAVTESLHASAATVQLTLTACLTGMALGQVVVGPM----SDRWGR 112
AL + + + +P LP + L S V L AL Q P+ SDR+GR
Sbjct: 14 VALDAVGIGLIMPVLPGLLRDLVHSN-DVTAHYGILLALYALMQFACAPVLGALSDRFGR 72

Query: 113 RRPLLLGMIIYVVATAICVFAPTTELLIGFRLLQGLAGAAGIVIARAVVRDMYDGVEMAR 172
R LL+ + V AI AP +L R++ G+ GA G V A + D+ DG E AR
Sbjct: 73 RPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAG-AYIADITDGDERAR 131

Query: 173 FFSTLMLISGVAPIVAPLIGGQVLRFTDWRGIFAVLTVVGVVLTLVVQKWLHETLPPQDR 232
F + G + P++GG + F+ F + + L L E+ + R
Sbjct: 132 HFGFMSACFGFGMVAGPVLGGLMGGFSP-HAPFFAAAALNGLNFLTGCFLLPESHKGERR 190

Query: 233 HTGGIGDALRTMRGLLADRVFTGYMIAGSLAFAALFSYVSASPFVVQEIYGASPQTFS-L 291
+AL + R T +A +A + V P + I+G +
Sbjct: 191 --PLRREALNPLASFRWARGMTV--VAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDAT 246

Query: 292 LFGINSVGLIVVGQINGKVLVGRISL----DKALAFGLSVIVLAAAALLLMTSGVFGHVG 347
GI+ ++ + ++ G ++ +AL G+ L T G
Sbjct: 247 TIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRG------ 300

Query: 348 LVPVAAGLFVLMSAMGLAMPNTNAQALMRTKHAAGSASALLGTSSFL--IGAVASPLVGI 405
+A + VL+++ G+ MP QA++ + L G+ + L + ++ PL+
Sbjct: 301 --WMAFPIMVLLASGGIGMPAL--QAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFT 356

Query: 406 A 406
A
Sbjct: 357 A 357


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
C5746_12475  RTXTOXINC  30     0.010    Gram-negative bacterial RTX toxin-activating protein C...
		>RTXTOXINC#Gram-negative bacterial RTX toxin-activating protein C signature.

Length = 170

Score = 29.9 bits (67), Expect = 0.010
Identities = 11/30 (36%), Positives = 17/30 (56%), Gaps = 2/30 (6%)

Query: 48 RFGIPRAYGSWADLAADDEVDVVYVATPHS 77
R P AY SWA+L+ ++E+ Y+ S
Sbjct: 49 RDDYPVAYCSWANLSLENEIK--YLNDVTS 76


40  C5746_12820  C5746_12885  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_12820  2       20       -0.028680  cobalt ABC transporter ATP-binding protein
C5746_12825  0       22       -0.851478  ECF transporter S component
C5746_12830  8       15       -3.666104  hypothetical protein
C5746_12835  7       16       -4.790475  hypothetical protein
C5746_12840  7       15       -4.790303  steroid 3-ketoacyl-CoA thiolase
C5746_12845  6       13       -4.197606  steroid C27-monooxygenase
C5746_12850  4       11       -3.737399  AEC family transporter
C5746_12855  4       10       -3.063661  hypothetical protein
C5746_12860  0       11       -1.416723  DUF2330 domain-containing protein
C5746_12865  1       14       2.743599   GNAT family N-acetyltransferase
C5746_12870  1       13       3.167312   EmrB/QacA family drug resistance transporter
C5746_12875  1       15       3.655791   hypothetical protein
C5746_12880  0       13       3.790575   hypothetical protein
C5746_12885  0       14       3.459047   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_12910  VACCYTOTOXIN  31     0.003    Helicobacter pylori vacuolating cytotoxin signature.
		>VACCYTOTOXIN#Helicobacter pylori vacuolating cytotoxin signature.

Length = 1291

Score = 30.8 bits (69), Expect = 0.003
Identities = 19/51 (37%), Positives = 26/51 (50%), Gaps = 4/51 (7%)

Query: 24 SIADMMGLNEREVRLARRTVGRGDARSLAQELLSRTQQATREAPAEPAPEI 74
S+++ M LN R V L+RR D S A+ L + Q R A E A E+
Sbjct: 965 SLSNAMILNSRLVNLSRRHTNHID--SFAKRLQALKDQ--RFASLESAAEV 1011


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_12940  SACTRNSFRASE  36     1e-05    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 36.5 bits (84), Expect = 1e-05
Identities = 15/60 (25%), Positives = 26/60 (43%)

Query: 80 DVAELTRVFVRPEHRGTGGGGLLLAAVESAARAFGISTVRLDTRNDLVEARGLYAKHGYR 139
A + + V ++R G G LL A+ + L+T++ + A YAKH +
Sbjct: 88 GYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFI 147


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_12945  TCRTETB  161    2e-45    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 161 bits (410), Expect = 2e-45
Identities = 95/406 (23%), Positives = 176/406 (43%), Gaps = 17/406 (4%)

Query: 33 LLAALDQTIVSTALPTIVSDLGGLEH-LSWVVTAYLLASTAATPLWGKLGDQYGRKKLFQ 91
+ L++ +++ +LP I +D +WV TA++L + T ++GKL DQ G K+L
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 92 TAIVIFLIGSALCGVAQNM-PQLIGFRALQGLGGGGLIVLSMAIVGDIVAPRERGKYQGL 150
I+I GS + V + LI R +QG G L M +V + RGK GL
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143

Query: 151 FGAVFGATSVLGPLLGGFFTQHLSWRWVFYINLPIGVVALLVIAAVLYIPVRRTQHTIDY 210
G++ +GP +GG ++ W + + +P+ + + L R + D
Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHYIHWS--YLLLIPMITIITVPFLMKLLKKEVRIKGHFDI 201

Query: 211 LGTFLIASVATCLVLVASLGGTTWAWGSAQVIALAALSVLLLIAFVHVERRAAEPVIPLK 270
G L++ + L T+++ +SVL + FV R+ +P +
Sbjct: 202 KGIILMSVGIVFFM----LFTTSYSIS------FLIVSVLSFLIFVKHIRKVTDPFVDPG 251

Query: 271 LFRIRTFSLVAVISFVVGFAMFGAMTYLPTFLQVVHSITPTMSG-VHMLPMVLGLLLTST 329
L + F + + ++ + G ++ +P ++ VH ++ G V + P + +++
Sbjct: 252 LGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGY 311

Query: 330 LSGQIVSRTGRWKVFPIAGTGITAIGLQLLHRLTETSSTLEMSIYFFVFGAGLGLVMQVL 389
+ G +V R G V I G ++ L ET+S I FV G GL V+
Sbjct: 312 IGGILVDRRGPLYVLNI-GVTFLSVSFLTASFLLETTSWFMTIIIVFVLG-GLSFTKTVI 369

Query: 390 VLVVQNAVTYEDLGVATSGATFFRSIGASFGVAVFGTIFTNRLTDK 435
+V +++ ++ G S F + G+A+ G + + L D+
Sbjct: 370 STIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLLDQ 415


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
C5746_12950  FLGHOOKFLIK  30     0.018    Flagellar hook-length control protein signature.
		>FLGHOOKFLIK#Flagellar hook-length control protein signature.

Length = 375

Score = 29.8 bits (66), Expect = 0.018
Identities = 20/80 (25%), Positives = 31/80 (38%)

Query: 196 LPSTVASAPAEETASASASASASAVPSGPASTSPSASAPTSTSPSASTSPSPSASAASQP 255
LP+ + + T+ +A P PA A + ++PSP +AAS
Sbjct: 155 LPTEKPTLFTKLTSEQLTTAQPDDAPGTPAQPLTPLVAEAQSKAEVISTPSPVTAAASPL 214

Query: 256 ATASPSASTPHASRPASSAP 275
T + P + P SAP
Sbjct: 215 ITPHQTQPLPTVAAPVLSAP 234


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits      Score  E-value  Comments
C5746_12960  cdtoxina  35     9e-05    Cytolethal distending toxin A signature.
		>cdtoxina#Cytolethal distending toxin A signature.

Length = 258

Score = 34.7 bits (79), Expect = 9e-05
Identities = 9/60 (15%), Positives = 20/60 (33%)

Query: 34 KVRNWQTGYVLGVAGGSTVGGAQIQYEVDTDNLAQKWAIDPVTSSTFLLRNLNSDMCVTA 93
+ RN G + G G + + + L++L++ +C+ A
Sbjct: 130 QFRNVDVGTCMTSFPGFKGGVQLSTAPCKFGPERFDFQPMATRNGNYQLKSLSTGLCIRA 189


41  C5746_13020  C5746_13080  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_13020  0       15       3.242014   hypothetical protein
C5746_13025  1       10       3.346193   DUF4383 domain-containing protein
C5746_13030  1       11       3.187433   FmdB family transcriptional regulator
C5746_13035  1       10       2.774972   SAM-dependent methyltransferase
C5746_13040  2       11       1.879350   HAD family hydrolase
C5746_13045  3       14       1.447055   phosphoribosyltransferase
C5746_13050  2       15       -0.773514  ATP/GTP-binding protein
C5746_13055  0       20       -1.902976  TetR family transcriptional regulator
C5746_13060  0       24       -2.656775  Tellurium resistance
C5746_13065  1       27       -3.789229  hypothetical protein
C5746_13070  0       27       -4.236521  chemical-damaging agent resistance protein C
C5746_13075  0       26       -5.359206  chemical-damaging agent resistance protein C
C5746_13080  0       22       -4.302772  peroxiredoxin
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
C5746_13115  IGASERPTASE  32     0.007    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 32.0 bits (72), Expect = 0.007
Identities = 62/355 (17%), Positives = 105/355 (29%), Gaps = 42/355 (11%)

Query: 119 EASHTYITAVDAHDLDRDDLEPAVAARARAELTTAKDELVRVKGELDRF---AQGLGPLL 175
E +H +T DA RD L ++ +L K +L V G D + + +
Sbjct: 934 EPNHNELTLFDASKAQRDHLNVSLVGN-TVDLGAWKYKLRNVNGRYDLYNPEVEKRNQTV 992

Query: 176 --GNAETQLARLA--PAVERARQALLGASNALDAVRASGLRADDLAARLAALAPELTKLN 231
N T A P+V + + A A ++ A +
Sbjct: 993 DTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSE--TTETVAENSKQESKT 1050

Query: 232 QGAGRHGVPETLQRADRVLRDA------EAVRAEAAQLPERAAEIDHRL------VSLRT 279
ET + V ++A E AQ E V
Sbjct: 1051 VEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEE 1110

Query: 280 RAQALTTRTGSVEPVLSELRRRFSAACWQDLQPVPEQAAANVRQAEEKLKEAATARDEQR 339
+A+ T +T V V S++ + + Q P + +E + T D ++
Sbjct: 1111 KAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQ 1170

Query: 340 WADATSRLSTVRALLNATDEAVSAAGDRLHRLNEVAKDPQQEIQRTRFAIRDAQRLAMAG 399
A TS + T N V ++P+ T +++
Sbjct: 1171 PAKETSSNVEQPVTESTTVNT----------GNSVVENPENTTPATTQPTVNSES----- 1215

Query: 400 RNTPDPRHARPLDDAVARLERAVAGLDGRHPDYWHFLTETDAVRRTAARVVSEIR 454
N P RH R + +E A + R + D V+S+ R
Sbjct: 1216 SNKPKNRHRRSVRSVPHNVEPATTSSNDRST-----VALCDLTSTNTNAVLSDAR 1265


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_13150  PF03544  34     4e-04    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 34.2 bits (78), Expect = 4e-04
Identities = 21/97 (21%), Positives = 30/97 (30%)

Query: 181 AEAAAEPDPAPSPVTVPSPVAAPAFPPPAFPPAPTPCPDSQATVQHQQPAYGYPQPAAAQ 240
E EP+P P P+ P A P P P P P + + +PA+
Sbjct: 70 PEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVESRPASPF 129

Query: 241 PAPQPAPQPAYGYPQPATAPPAYSYPQPAAAAAPAPD 277
PA + + P P A + P
Sbjct: 130 ENTAPARPTSSTATAATSKPVTSVASGPRALSRNQPQ 166



Score = 30.7 bits (69), Expect = 0.005
Identities = 21/100 (21%), Positives = 29/100 (29%), Gaps = 10/100 (10%)

Query: 184 AAEPDPAPSPVTVPSPVAAPAFPPPAFPPAPTPCPDSQATVQHQQPAYGYPQPAAAQPAP 243
A +P P P P P P P A P P + P+P P
Sbjct: 65 AVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPK----------PKPVKKVEQP 114

Query: 244 QPAPQPAYGYPQPATAPPAYSYPQPAAAAAPAPDPNFVLP 283
+ +P P A + P + A A P +
Sbjct: 115 KRDVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVA 154



Score = 30.3 bits (68), Expect = 0.006
Identities = 21/114 (18%), Positives = 32/114 (28%), Gaps = 1/114 (0%)

Query: 176 ISVDEAEAAAEPDPA-PSPVTVPSPVAAPAFPPPAFPPAPTPCPDSQATVQHQQPAYGYP 234
+ + P PA P VT+ +P PP P P+ + + P
Sbjct: 33 LYTSVHQVIELPAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPV 92

Query: 235 QPAAAQPAPQPAPQPAYGYPQPATAPPAYSYPQPAAAAAPAPDPNFVLPPQGPQ 288
+P P+P P+P QP + AP
Sbjct: 93 VIEKPKPKPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAAT 146


42  C5746_13675  C5746_13720  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_13675  3       12       0.357641   MBL fold metallo-hydrolase
C5746_13680  3       14       -0.462301  DUF3097 domain-containing protein
C5746_13685  2       12       0.135356   coproporphyrinogen III oxidase
C5746_13690  2       12       0.970026   protein phosphatase
C5746_13700  1       13       1.275256   long-chain fatty acid--CoA ligase
C5746_13705  0       14       1.610288   elongation factor 4
C5746_13710  1       14       2.913408   30S ribosomal protein S20
C5746_13715  2       12       2.553508   DNA polymerase III subunit delta
C5746_13720  2       13       2.160263   aldo/keto reductase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_13785  BCTERIALGSPH  30     0.027    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 29.5 bits (66), Expect = 0.027
Identities = 24/93 (25%), Positives = 36/93 (38%), Gaps = 6/93 (6%)

Query: 187 DVVKERVAAITADQLATLIYTSGTTGRPKGVRLPHDNWSYM---AKATVATGLITKDDVQ 243
D + +A A QL + TG+ GV + D W ++ A+
Sbjct: 35 DSAAQTLARFEA-QLRFVQQRGLQTGQFFGVSVHPDRWQFLVLEARDGADPAPADDGWSG 93

Query: 244 YLWLPLAHVFGKVLTSGQIEVGHVTAVDGRVDK 276
Y WLPL G+V TSG I G + + +
Sbjct: 94 YRWLPLRA--GRVATSGSIAGGKLNLAFAQGEA 124


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
C5746_13790  TCRTETOQM  148    4e-40    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 148 bits (376), Expect = 4e-40
Identities = 107/504 (21%), Positives = 189/504 (37%), Gaps = 111/504 (22%)

Query: 19 IRNFCIIAHIDHGKSTLADRMLQLTGV------VDQRQMRAQYLDRMDIERERGITIKSQ 72
I N ++AH+D GK+TL + +L +G VD+ R D +ER+RGITI++
Sbjct: 3 IINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRT---DNTLLERQRGITIQTG 59

Query: 73 AVRLPWAPNTGEDQGRTHVLNMIDTPGHVDFTYEVSRS----------LAACEG----TV 118
W T V N+IDTPGH+DF EV RS ++A +G T
Sbjct: 60 ITSFQW--------ENTKV-NIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTR 110

Query: 119 LLVDAAQGIEAQTLA-------------NLYLAMENDLT--IVP---------------- 147
+L A + + T+ +Y ++ L+ IV
Sbjct: 111 ILFHALRKMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFT 170

Query: 148 --------------VLNK-IDLPAAQPEKFSEEL-ANLIGCQPEDVLRVSAKTGVGVDAL 191
+L K + + + + +E C V SAK +G+D L
Sbjct: 171 ESEQWDTVIEGNDDLLEKYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNNIGIDNL 230

Query: 192 LDRVVKDVPAPIGRADAPARAMIFDSVYDSYRGVVTYVRVVDGQLNKRERIRMMSTGATH 251
++ + + R + +F Y R + Y+R+ G L+ R+ +R+
Sbjct: 231 IEVITNKFYSSTHRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKEKIK 290

Query: 252 ELLEIGVSSPEMTPADGLGVGEVGYI---ITGVKDVRQSKVGDTITSLNKGATEALGGYK 308
+ E+ D GE+ + + V +GDT + E
Sbjct: 291 ITEMYTSINGELCKIDKAYSGEIVILQNEFLKLNSV----LGDTKLLPQRERIE------ 340

Query: 309 DPKPMVFSGLYPLDGSDYPDLREALDKLQLNDAAL-VYEPETSAALGFGFRVGFLGLLHL 367
+P P++ + + P L +AL ++ +D L Y + + + FLG + +
Sbjct: 341 NPLPLLQTTVEPSKPQQREMLLDALLEISDSDPLLRYYVDSATHEI----ILSFLGKVQM 396

Query: 368 DVIRERLEREFGLELIATAPNVVYR---VEMEDGTEHIVTNPSEFPEGKIDKVHEPVVRA 424
+V L+ ++ +E+ P V+Y ++ + T HI P+ F I P+
Sbjct: 397 EVTCALLQEKYHVEIEIKEPTVIYMERPLKKAEYTIHIEVPPNPF-WASIGLSVSPLPLG 455

Query: 425 TVLA----------PSEFIGAIME 438
+ + F A+ME
Sbjct: 456 SGMQYESSVSLGYLNQSFQNAVME 479



Score = 35.6 bits (82), Expect = 6e-04
Identities = 14/81 (17%), Positives = 27/81 (33%), Gaps = 2/81 (2%)

Query: 417 VHEPVVRATVLAPSEFIGAIMELCQNRRGTLLGMDYLSEDRVEIRYTLPLAEIVFDFFDQ 476
+ EP + + AP E++ ++ L + V + +P I ++
Sbjct: 535 LLEPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQ-LKNNEVILSGEIPARCI-QEYRSD 592

Query: 477 LKSKTRGYASLDYEPTGEQSA 497
L T G + E G
Sbjct: 593 LTFFTNGRSVCLTELKGYHVT 613


43  C5746_14040  C5746_14095  Y  N  N  Genomic Island
LocusTag     DNBias  CDNBias  %GCBias     Product
C5746_14040  3       16       -0.661291   cation/H(+) antiporter
C5746_14045  2       20       -1.222675   hypothetical protein
C5746_14050  0       20       -1.181201   GNAT family N-acetyltransferase
C5746_14055  0       20       -1.272271   DNA glycosylase
C5746_14060  0       17       -1.953166   ribose-5-phosphate isomerase
C5746_14065  1       17       -2.228925   amino acid transporter
C5746_14070  0       17       -3.042025   FAD-linked oxidoreductase
C5746_14075  0       13       -3.760357   biotin transporter BioY
C5746_14080  3       14       -4.049415   amino acid transporter
C5746_14085  3       14       -4.202534   histidine phosphatase family protein
C5746_14090  2       12       -3.601173   carbohydrate kinase
C5746_14095  0       12       -3.260052   erythritol/threitol ABC transporter
44  C5746_14335  C5746_14375  Y  N  N  Genomic Island
LocusTag     DNBias  CDNBias  %GCBias     Product
C5746_14335  3       21       -0.368224   sugar ABC transporter permease
C5746_14340  1       18       0.478971    6-phospho-beta-glucosidase
C5746_14345  1       19       0.431091    ATPase
C5746_14350  0       14       2.042800    hypothetical protein
C5746_14355  0       16       3.488453    sugar-binding protein
C5746_14360  0       10       3.858035    hypothetical protein
C5746_14365  0       9        3.972483    serine/threonine protein kinase
C5746_14370  -1      9        3.398592    protein phosphatase
C5746_14375  -1      8        3.687127    hypothetical protein
45  C5746_14450  C5746_14590  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
C5746_14450  -1      15       3.002708    ATP-binding protein
C5746_14455  -1      14       3.274207    *DNA-binding response regulator
C5746_14460  0       14       3.700401    sensor histidine kinase
C5746_14465  0       11       3.363852    hypothetical protein
C5746_14470  0       10       3.429866    DUF1772 domain-containing protein
C5746_14475  -1      10       3.836421    VWA domain-containing protein
C5746_14485  -3      13       0.915670    ABC transporter permease
C5746_14490  -1      14       0.915246    ABC transporter ATP-binding protein
C5746_14495  1       11       0.399523    hypothetical protein
C5746_14500  2       12       0.979661    hypothetical protein
C5746_14505  1       10       0.939698    DUF4291 domain-containing protein
C5746_14510  1       10       1.055947    hypothetical protein
C5746_14515  0       13       2.302331    hypothetical protein
C5746_14520  -2      15       2.266820    hypothetical protein
C5746_14525  -1      10       2.451544    hypothetical protein
C5746_14530  -1      8        1.681264    ABC transporter ATP-binding protein
C5746_14540  2       10       1.287929    FkbM family methyltransferase
C5746_14545  0       10       1.381105    SARP-family transcriptional regulator
C5746_14550  0       11       1.495075    acyl-CoA desaturase
C5746_14555  2       11       1.582558    3-oxoacyl-ACP synthase
C5746_14560  0       12       1.052768    acyl carrier protein
C5746_14570  -2      11       0.744613    3-oxoacyl-ACP synthase
C5746_14575  0       11       -0.225869   VlmB-like protein
C5746_14580  1       17       -1.620517   flavin reductase
C5746_14590  2       18       -1.564943   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
C5746_14545  HTHFIS  53     2e-10    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 52.5 bits (126), Expect = 2e-10
Identities = 28/89 (31%), Positives = 46/89 (51%), Gaps = 5/89 (5%)

Query: 1 MDRSLRVVLAEDSVLLREGLIGLLTRFGHEVVAAVGDAEALTAAVAEHGPDIVVTDVRMP 60
M + +++A+D +R L L+R G++V +A L +A D+VVTDV MP
Sbjct: 1 MTGA-TILVADDDAAIRTVLNQALSRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMP 58

Query: 61 PGFQDEGLHAAVRLREKQPALPVLVLSQY 89
+ R+++ +P LPVLV+S
Sbjct: 59 ---DENAFDLLPRIKKARPDLPVLVMSAQ 84


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_14550  PF06580  36     2e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 36.0 bits (83), Expect = 2e-04
Identities = 16/70 (22%), Positives = 31/70 (44%), Gaps = 6/70 (8%)

Query: 358 RVAVSGGHAGGRMFLEIHDDGRGGA--STSGGGSGLTGLADRVSVL---DGRLSLSSPAG 412
++ + G G + LE+ + G + G+GL + +R+ +L + ++ LS G
Sbjct: 280 KILLKGTKDNGTVTLEVENTGSLALKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQG 339

Query: 413 GPTRLRVEIP 422
V IP
Sbjct: 340 KVN-AMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_14555  SURFACELAYER  29     0.048    Lactobacillus surface layer protein signature.
		>SURFACELAYER#Lactobacillus surface layer protein signature.

Length = 439

Score = 28.9 bits (64), Expect = 0.048
Identities = 27/122 (22%), Positives = 41/122 (33%), Gaps = 15/122 (12%)

Query: 12 LAIAVAAAVTTFPAVAQASTPAPVTASATPAAAVPAPTPRRDDFNGDGYPDVAFTAP--- 68
L I AAA A+T PV A+ T A ++ D P ++ A
Sbjct: 5 LRIVSAAAAALLAVAPIAATAMPVNAATTINADSAINANTNAKYDVDVTPSISAIAAVAK 64

Query: 69 -------GATVGGTAKAGYVGVVYGSKSGLKTSTKQVFTQDSPGI---PDTAEAGDAFGS 118
++ G+ A Y G Y + L + DS P EA A+
Sbjct: 65 SDTMPAIPGSLTGSISASYNGKSY--TANLPKDSGNATITDSNNNTVKPAELEADKAYTV 122

Query: 119 SM 120
++
Sbjct: 123 TV 124


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_14640  ISCHRISMTASE  26     0.019    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 25.7 bits (56), Expect = 0.019
Identities = 11/43 (25%), Positives = 19/43 (44%), Gaps = 3/43 (6%)

Query: 2 PSVLDRITEVLVSRFGVEPEEVTEDTTMRDLDLDS---LALVE 41
+ I + + PE++T+ + D LDS + LVE
Sbjct: 229 VFTCENIRKQIAELLQETPEDITDQEDLLDRGLDSVRIMTLVE 271


46  C5746_14885  C5746_14960  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
C5746_14885  2       14       1.130992    transcriptional regulator
C5746_14890  2       15       1.160071    DUF397 domain-containing protein
C5746_14895  2       15       0.688673    restriction endonuclease subunit S
C5746_14900  0       14       1.588559    SAM-dependent DNA methyltransferase
C5746_14905  2       12       1.644167    hypothetical protein
C5746_14910  0       9        0.156458    hypothetical protein
C5746_14915  -1      7        -0.282819   peptidase inhibitor
C5746_14920  -1      8        -0.482792   acyl-CoA dehydrogenase
C5746_14925  -2      11       -0.868927   acyl-CoA thioesterase II
C5746_14930  -2      15       -2.887949   phosphatase
C5746_14940  3       24       -4.702947   hypothetical protein
C5746_14945  2       23       -4.727375   sugar transferase
C5746_14950  3       20       -4.306044   TetR family transcriptional regulator
C5746_14955  3       17       -3.844872   DUF418 domain-containing protein
C5746_14960  3       11       -3.489590   methylcrotonoyl-CoA carboxylase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_15010  HTHTETR  71     2e-17    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 70.8 bits (173), Expect = 2e-17
Identities = 38/196 (19%), Positives = 69/196 (35%), Gaps = 17/196 (8%)

Query: 3 TRTDAPTRREQILKEAARLFAERGFHGVGVDEIGAAVGISGPGLYRHFPGKDAMLAELLV 62
T+ +A R+ IL A RLF+++G + EI A G++ +Y HF K + +E+
Sbjct: 5 TKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWE 64

Query: 63 GISERLLAGGQLRVSEDAACRDGSPHALLDALIEGHIDFAL--DDRPL---ITLHDRELD 117
+ E A G P ++L ++ ++ + + R L I H E
Sbjct: 65 LSESNIGE----LELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120

Query: 118 RLRDADRKRVRQLQRQYVEVWVEVVR------ELYPDLPEHEARAAVHAVFGLLNSTPHL 171
++ R L + + + ++ L DL A + L +
Sbjct: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLME--NW 178

Query: 172 GRPDAQPDRADTAALL 187
D A
Sbjct: 179 LFAPQSFDLKKEARDY 194


47  C5746_15325  C5746_15515  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
C5746_15325  2       9        -1.476693   2-aminoethylphosphonate ABC transport system
C5746_15330  0       11       -1.701332   phosphonatase-like hydrolase
C5746_15335  0       14       -2.679283   TIGR03364 family FAD-dependent oxidoreductase
C5746_15340  1       14       -4.101065   GntR family transcriptional regulator
C5746_15345  0       12       -4.505704   hypothetical protein
C5746_15350  -1      10       -3.817482   hypothetical protein
C5746_15355  -2      12       -1.741393   oxidoreductase
C5746_15360  -2      16       -0.246557   heme-degrading domain-containing protein
C5746_15365  -2      18       0.261721    2-hydroxyhepta-2,4-diene-1,7-dioate isomerase
C5746_15370  0       13       1.233310    hypothetical protein
C5746_15375  1       15       1.489265    hypothetical protein
C5746_15380  1       15       2.760279    sodium:solute symporter
C5746_15385  2       14       2.800721    hypothetical protein
C5746_15395  2       13       2.248699    class E sortase
C5746_15405  0       12       2.412928    preprotein translocase subunit SecA
C5746_15410  2       12       1.396585    GNAT family N-acetyltransferase
C5746_15415  1       14       0.015613    hypothetical protein
C5746_15420  0       14       0.034001    E family RNA polymerase sigma-70 factor
C5746_15430  1       15       1.377952    peptidase S8
C5746_15435  1       13       0.976298    hypothetical protein
C5746_15440  1       14       1.585105    endoribonuclease L-PSP
C5746_15445  3       14       2.902329    DUF3427 domain-containing protein
C5746_15450  4       11       3.113656    hypothetical protein
C5746_15455  2       12       2.693533    DUF262 domain-containing protein
C5746_15460  2       11       2.233519    divalent-cation tolerance protein CutA
C5746_15465  1       11       -1.328220   hypothetical protein
C5746_15470  0       13       -2.225696   ATP-binding protein
C5746_15475  0       15       -2.810361   IS5/IS1182 family transposase
C5746_15480  4       31       -9.134596   hypothetical protein
C5746_15485  4       32       -9.301310   molybdenum metabolism regulator
C5746_15490  6       38       -10.153290  aldo/keto reductase
C5746_15495  7       51       -11.282092  hypothetical protein
C5746_15500  7       51       -10.020416  hypothetical protein
C5746_15505  7       51       -10.035368  hypothetical protein
C5746_15510  8       45       -7.308962   hypothetical protein
C5746_15515  4       36       -5.722125   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
C5746_15425  60KDINNERMP  116    4e-32    60kDa inner membrane protein signature.
		>60KDINNERMP#60kDa inner membrane protein signature.

Length = 548

Score = 116 bits (293), Expect = 4e-32
Identities = 57/213 (26%), Positives = 98/213 (46%), Gaps = 36/213 (16%)

Query: 31 AIVLFTALVRLAVHPLSRAAARGQKARTRLQPQIAELRKKHGKNPERMQKALMELHKAEN 90
+I++ T +VR ++PL++A LQP+I +R++ G + +R+ + +M L+KAE
Sbjct: 357 SIIIITFIVRGIMYPLTKAQYTSMAKMRMLQPKIQAMRERLGDDKQRISQEMMALYKAEK 416

Query: 91 VSPLSGCLPSLLQMPAFFLLYHLFSSQTIDGHPNELLGHQLFGAPLGEHWHDALAHGGPF 150
V+PL GC P L+QMP F LY++ +L AP W L+ P+
Sbjct: 417 VNPLGGCFPLLIQMPIFLALYYMLMGSV-----------ELRQAPFA-LWIHDLSAQDPY 464

Query: 151 GAQGMVYLGLYAIVAVVATFNYRRTKAQMAANPVTPTTGPDGQPVPGMGMMTKLMPLMSF 210
+ I+ V F ++ ++PTT D P + MP++
Sbjct: 465 --------YILPILMGVTMFFIQK---------MSPTTVTD----PMQQKIMTFMPVI-- 501

Query: 211 ATLFTVSVVPLAAALYVVTSTTWTAVERAFLYR 243
T+F + P LY + S T +++ +YR
Sbjct: 502 FTVFFLW-FPSGLVLYYIVSNLVTIIQQQLIYR 533


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits  Score  E-value  Comments
C5746_15455  SECA  38     4e-05    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 38.3 bits (89), Expect = 4e-05
Identities = 18/55 (32%), Positives = 23/55 (41%), Gaps = 6/55 (10%)

Query: 278 EAFAASEATSPADPDLLPQYATTLAARGRAVPWP-PARSAPCWCGSGRTYRECHG 331
E A + S D D AA R+ PC CGSG+ Y++CHG
Sbjct: 849 ERLAQMQQLSHQDDDS-----AAAAALAAQTGERKVGRNDPCPCGSGKKYKQCHG 898


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
C5746_15475  SUBTILISIN  208    6e-62    Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 208 bits (530), Expect = 6e-62
Identities = 100/300 (33%), Positives = 141/300 (47%), Gaps = 33/300 (11%)

Query: 215 QVGAPQAWKSGWTGKGVKVAVLDTGIDSTHPDFSGSIGESADFTQGNGAISGAASDGDGH 274
+ AP W G+GVKVAVLDTG D+ HPD I +FT + D +GH
Sbjct: 28 MIQAPAVWNQT-RGRGVKVAVLDTGCDADHPDLKARIIGGRNFTDDDEGDPEIFKDYNGH 86

Query: 275 GTHVASTVAGDGTASDGRYRGMAPDAELLVGKVLDDNGGGQESWVLQGMEWAAA-RAPIV 333
GTHVA T+A + G+AP+A+LL+ KVL+ G GQ W++QG+ +A + I+
Sbjct: 87 GTHVAGTIAATENENGV--VGVAPEADLLIIKVLNKQGSGQYDWIIQGIYYAIEQKVDII 144

Query: 334 NISLSGAVTDGTDPLSQAVDNLSASHGTLFVAAAGNLGRPG----TVSTPGTADAALTVG 389
++SL G + L +AV + L + AAGN G + PG + ++VG
Sbjct: 145 SMSLGG--PEDVPELHEAVKKA-VASQILVMCAAGNEGDGDDRTDELGYPGCYNEVISVG 201

Query: 390 AVERDDALAGLSSQGPRLGDHAVKPDLTAPGIGIVAARAAGTDGDNTVNDRYTALSGTSM 449
A+ D + S+ + DL APG I++ +Y SGTSM
Sbjct: 202 AINFDRHASEFSNSNN-------EVDLVAPGEDILST---------VPGGKYATFSGTSM 245

Query: 450 ATPHVAGAAALLAQA-----HPDWKGPQLKAALTSSAKPVAGQSAYEQGAGRLDVARATA 504
ATPHVAGA AL+ Q D P+L A L P+ S +G G L +
Sbjct: 246 ATPHVAGALALIKQLANASFERDLTEPELYAQLIKRTIPLGN-SPKMEGNGLLYLTAVEE 304


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_15565  PF05616  33     0.002    Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 32.8 bits (74), Expect = 0.002
Identities = 20/64 (31%), Positives = 28/64 (43%), Gaps = 2/64 (3%)

Query: 111 PASPPPPSTQ-FPTVGAPTARPAGGYEPVEDHSDKTTPHPPSEADPAADPDPGPTAGPDP 169
P S P+ Q P V +P PA P E+ + P P + +P A+PD G P
Sbjct: 317 PGSAEAPNAQPLPEV-SPAENPANNPAPNENPGTRPNPEPDPDLNPDANPDTDGQPGTRP 375

Query: 170 EPTA 173
+ A
Sbjct: 376 DSPA 379



Score = 31.6 bits (71), Expect = 0.004
Identities = 18/63 (28%), Positives = 27/63 (42%), Gaps = 2/63 (3%)

Query: 115 PPPSTQFPTVGAPTARPAGGYEPVEDHSDKTTPHPPSEADPAADPDP--GPTAGPDPEPT 172
P P + AP A+P P E+ ++ P+ P +PDP P A PD +
Sbjct: 311 PRPDLTPGSAEAPNAQPLPEVSPAENPANNPAPNENPGTRPNPEPDPDLNPDANPDTDGQ 370

Query: 173 AGS 175
G+
Sbjct: 371 PGT 373


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
C5746_15580  GPOSANCHOR  37     6e-05    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 37.0 bits (85), Expect = 6e-05
Identities = 22/97 (22%), Positives = 34/97 (35%), Gaps = 15/97 (15%)

Query: 156 EPGKPGKPGKPSTPATPSAPGAPSGPAQPTAAPTSSAPEGTKPTSSTSSPSSPSSSPSTQ 215
+ GK TP A G Q GTKP + + +P +
Sbjct: 456 AKLRAGKASDSQTPDAKPGNKAVPGKGQA-------PQAGTKP--------NQNKAPMKE 500

Query: 216 SGGDLAETGSGAPVGLLSAAAAALVAAGGFLVIRRRK 252
+ L TG A +AA + AG V++R++
Sbjct: 501 TKRQLPSTGETANPFFTAAALTVMATAGVAAVVKRKE 537



Score = 36.2 bits (83), Expect = 9e-05
Identities = 28/104 (26%), Positives = 41/104 (39%), Gaps = 17/104 (16%)

Query: 153 KPDEPGKPGKPGKPSTPATPSAPGAPSGPAQPTAAPTSSAPE-GTKPTSSTSSPSSPSSS 211
K E + GK S TP A P AP+ GTKP + + +
Sbjct: 450 KQAEELAKLRAGKASDSQTPDAK-----PGNKAVPGKGQAPQAGTKP--------NQNKA 496

Query: 212 PSTQSGGDLAETGSGA-PVGLLSAAAAALVAAGGFLVIRRRKAQ 254
P ++ L TG A P +AAA ++A G + +RK +
Sbjct: 497 PMKETKRQLPSTGETANP--FFTAAALTVMATAGVAAVVKRKEE 538


48  C5746_15770  C5746_16065  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
C5746_15770  2       16       -0.086682   RDD family protein
C5746_15775  2       18       -1.734994   hypothetical protein
C5746_15780  2       27       -2.096842   SsgA family sporulation/cell division regulator
C5746_15785  2       25       -2.988694   FAD-binding oxidoreductase
C5746_15790  1       15       -1.895871   hypothetical protein
C5746_15795  0       13       -1.509702   4-hydroxyphenylpyruvate dioxygenase
C5746_15800  -1      10       -1.050062   AsnC family transcriptional regulator
C5746_15805  1       13       -1.154828   ABC transporter permease
C5746_15810  2       15       -1.180972   polyamine ABC transporter ATP-binding protein
C5746_15815  2       14       -1.156612   permease
C5746_15820  3       10       0.146048    glycine/betaine ABC transporter
C5746_15825  3       12       0.177700    transcriptional regulator
C5746_15835  -2      13       2.026977    hypothetical protein
C5746_15840  -2      12       1.267564    MFS transporter
C5746_15845  -2      11       1.533285    hypothetical protein
C5746_15850  -1      13       0.018791    IclR family transcriptional regulator
C5746_15855  -1      14       -0.454618   hypothetical protein
C5746_15860  -1      16       -0.447550   hypothetical protein
C5746_15865  1       14       -2.408718   hypothetical protein
C5746_15870  3       14       -1.995598   hypothetical protein
C5746_15875  2       14       -2.024932   death-on-curing protein
C5746_15880  0       14       -0.632324   hypothetical protein
C5746_15885  -1      13       -0.120834   MFS transporter
C5746_15890  -2      12       0.340371    *hypothetical protein
C5746_15895  0       12       1.781454    hypothetical protein
C5746_15900  1       10       1.837197    hypothetical protein
C5746_15905  2       10       1.668709    hypothetical protein
C5746_15910  1       12       0.657160    hypothetical protein
C5746_15915  0       13       0.525847    MerR family transcriptional regulator
C5746_15920  -1      13       0.076900    hypothetical protein
C5746_15925  -3      14       0.261614    hypothetical protein
C5746_15930  -1      16       -0.424451   hypothetical protein
C5746_15935  0       17       -1.125015   *hypothetical protein
C5746_15940  1       18       -3.909884   hypothetical protein
C5746_15945  1       19       -4.341251   hypothetical protein
C5746_15950  1       18       -4.069564   hypothetical protein
C5746_15960  3       17       -6.152832   hypothetical protein
C5746_15965  2       15       -6.320606   phosphotransferase
C5746_15970  1       16       -5.506960   DUF397 domain-containing protein
C5746_15975  -1      9        -3.242676   transcriptional regulator
C5746_15980  0       10       -2.807055   hypothetical protein
C5746_15985  -1      12       -2.640356   serine/threonine protein kinase
C5746_15990  0       14       -2.229777   hypothetical protein
C5746_15995  0       14       -2.067840   hypothetical protein
C5746_16000  0       14       -2.978105   type VII secretion-associated serine protease
C5746_16005  1       16       -5.386092   type VII secretion-associated serine protease
C5746_16010  0       18       -5.619351   hypothetical protein
C5746_16015  0       15       -5.821355   oxidoreductase
C5746_16025  -1      15       -5.325687   (2Fe-2S)-binding protein
C5746_16030  0       15       -5.275013   dehydrogenase
C5746_16035  -1      15       -5.577925   beta-N-acetylhexosaminidase
C5746_16040  0       25       -2.986177   DUF3039 domain-containing protein
C5746_16045  0       20       -0.886535   hypothetical protein
C5746_16050  0       19       -1.639273   UDP-N-acetylglucosamine
C5746_16055  0       19       -1.969764   integration host factor
C5746_16060  0       22       -2.224667   NAD-dependent malic enzyme
C5746_16065  2       26       -2.608855   helicase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
C5746_15840  TONBPROTEIN  34     3e-04    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 34.2 bits (78), Expect = 3e-04
Identities = 14/49 (28%), Positives = 18/49 (36%), Gaps = 2/49 (4%)

Query: 11 PPEEDPFLKKPQEPQGPREPQGPQGPQEPPSGSPYDSAPPPPPPPPPPP 59
P + + P EP EP+ P+ P P P P P P P
Sbjct: 56 EPPQAV--QPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKP 102



Score = 31.9 bits (72), Expect = 0.002
Identities = 14/59 (23%), Positives = 19/59 (32%)

Query: 6 PTSGQPPEEDPFLKKPQEPQGPREPQGPQGPQEPPSGSPYDSAPPPPPPPPPPPYDSGP 64
P +PP+ +P P P+ P+E P P P P P P
Sbjct: 52 PADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQP 110



Score = 29.2 bits (65), Expect = 0.011
Identities = 16/61 (26%), Positives = 17/61 (27%)

Query: 10 QPPEEDPFLKKPQEPQGPREPQGPQGPQEPPSGSPYDSAPPPPPPPPPPPYDSGPYGGGG 69
P P P PQ Q P EP + P P PP P P
Sbjct: 38 LPAPAQPISVTMVTPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPK 97

Query: 70 P 70
P
Sbjct: 98 P 98



Score = 28.0 bits (62), Expect = 0.028
Identities = 14/88 (15%), Positives = 26/88 (29%), Gaps = 9/88 (10%)

Query: 4 DQPTSGQPPEEDPFLKKPQEPQGPREPQGPQGPQEPPSGSPYDSAPPPPPP--------- 54
+ +PP+E P + + +P+ +P+ + QE P P P
Sbjct: 74 EPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPVESRPASPFENTAPARL 133

Query: 55 PPPPPYDSGPYGGGGPYGGVDPLAGMPP 82
+ G L+ P
Sbjct: 134 TSSTATAATSKPVTSVASGPRALSRNQP 161


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
C5746_15845  IGASERPTASE  36     5e-04    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 35.8 bits (82), Expect = 5e-04
Identities = 31/218 (14%), Positives = 58/218 (26%), Gaps = 16/218 (7%)

Query: 132 QSEQSRPPSGPESGRDPRTPADRAPADRVPADRAPAVREPVDRTPAVREPVDRAPADPRR 191
Q++ PS E PA P++ V E + E ++ +
Sbjct: 1004 QADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTA 1063

Query: 192 PASPDPTGGALPGMRDGRSQAPENTAAIRAVGRGGRPPAPRTTEQVPADG-----TMAIR 246
+ A ++ A + + + + T V + T +
Sbjct: 1064 QNR-EVAKEAKSNVKANTQTN--EVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQ 1120

Query: 247 AIAPGAGAQPPAQAQAQA----PAPAQAQAPAPAQNRAPAPAQTPAS--APAELNSPRTP 300
+ P Q Q++ PA+ P + T A PA+ S
Sbjct: 1121 EVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVE 1180

Query: 301 GP--GGGAASWAQQVHQLAQPDQAARPQPQQHQHQHQH 336
P + V + + A QP +
Sbjct: 1181 QPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNK 1218



Score = 30.4 bits (68), Expect = 0.023
Identities = 25/160 (15%), Positives = 44/160 (27%), Gaps = 30/160 (18%)

Query: 45 QQGEPMPATASGAAAEPQPQPSAPAPAPVDETGPVFLDEEPYADNRPEPATAWQADASRQ 104
+ E T+ + + Q + P P E P +EP + T
Sbjct: 1118 KTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADT--------- 1168

Query: 105 TGFGGERDRRVSWGGAGQPGGQGEPGRQSEQSRPPSGPESGRDPRTPADRAPADRVPADR 164
QP + + + + P + PA P
Sbjct: 1169 ----------------EQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVN 1212

Query: 165 APAVREPVDRTPAVREPVDRAPADPRRP--ASPDPTGGAL 202
+ + +P +R R V P + +S D + AL
Sbjct: 1213 SESSNKPKNRH---RRSVRSVPHNVEPATTSSNDRSTVAL 1249


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_15905  TCRTETB  33     0.002    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 33.3 bits (76), Expect = 0.002
Identities = 32/147 (21%), Positives = 62/147 (42%), Gaps = 14/147 (9%)

Query: 49 VVAFCATLPIVVSALIGGPVIDRVGRRRVSVASDLVCGAAVSAIPVLHYA---DALAFWM 105
V+ F T+ +++ IGG ++DR G V L G ++ L + + +++M
Sbjct: 297 VIIFPGTMSVIIFGYIGGILVDRRGPLYV-----LNIGVTFLSVSFLTASFLLETTSWFM 351

Query: 106 LCALMALSGLLHTPGNTARYVLVPDLAEHAGTTLARAASLFDAVSRGARMVGAALAGVLI 165
++ + G L ++ L + SL + S + G A+ G L+
Sbjct: 352 TIIIVFVLGGLSFTKTVISTIVSSSLKQ---QEAGAGMSLLNFTSFLSEGTGIAIVGGLL 408

Query: 166 ALVGAETVLLL---DAATFLTSALLIA 189
++ + LL D +T+L S LL+
Sbjct: 409 SIPLLDQRLLPMEVDQSTYLYSNLLLL 435


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_15920  INTIMIN  33     0.006    Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 32.7 bits (74), Expect = 0.006
Identities = 41/204 (20%), Positives = 59/204 (28%), Gaps = 16/204 (7%)

Query: 320 LDASGATLDDTAVYVKSTADGT--FSADFTVSDPAIAAIQVDEG---NDPATVLTTPFAV 374
+D G T D TA + ADGT + TV +A V VL+ A
Sbjct: 555 VDQVGVT-DFTADKTSAKADGTEAITYTATVKKNGVAQANVPVSFNIVSGTAVLSANSAN 613

Query: 375 TDASATLSAGAAKVKPGGTITLSGGDWPTGTTPAAALCAADGSACDSARISGSTLRINAD 434
T+ S + KPG + + T A A+ D + I A+
Sbjct: 614 TNGSGKATVTLKSDKPGQVVVSAKTAEMTSALNANAVIFVDQTKASITEIKADKTTAVAN 673

Query: 435 GTLAGTVTVAGSTPRAVYALQVTAGGAQALTPLTVAPTFVVLTPSAGPKGTAVTVLGQGF 494
G A T TV + + + + T + T G A L
Sbjct: 674 GQDAITYTV-----KVMKGDKPVSNQEVTFTTTLGKLS--NSTEKTDTNGYAKVTLTSTT 726

Query: 495 AKVATVSIVGLRADGSQTSDPVRT 518
+ VS R
Sbjct: 727 PGKSLVSA---RVSDVAVDVKAPE 747


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_15950  TCRTETB  72     1e-15    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 71.8 bits (176), Expect = 1e-15
Identities = 76/396 (19%), Positives = 144/396 (36%), Gaps = 35/396 (8%)

Query: 54 VVFLIAFEATAVGTAMPVAARELHGIP-LYAFAFSAYFTTSLFAMVLSGQWADRRGPLGP 112
+ F + ++P A + + P + +A+ T + G+ +D+ G
Sbjct: 22 LSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRL 81

Query: 113 LATGISAFGVGLLLSGTAGSMW-MFIAGRAVQGLGGGLVIVALYVVISRAYPEHLRPSIM 171
L GI G ++ S + + I R +QG G + VV++R P+ R
Sbjct: 82 LLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAF 141

Query: 172 AAFAASWVIPSVVGPLAAGSVTEHLGWRWVFIGIPVLVV-FPLALALPAIRRRASGPADP 230
+ + VGP G + ++ W ++ + + ++ P + L R G D
Sbjct: 142 GLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFD- 200

Query: 231 TAPVEPYDRRRILLALGISLGAGLLQYAGQERNWFALLPAVVGFGLLVPAVRG----LLP 286
I + +S+G + L+ +V+ F + V +R +
Sbjct: 201 -----------IKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVD 249

Query: 287 PGTGRAARGLPSVVLLRGVAAGSFIAAESFVPLMLVTQRGLSPTMAGL------SLAAGG 340
PG G+ VL G+ G+ S VP M+ LS G +++
Sbjct: 250 PGLGK-NIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVII 308

Query: 341 ATWALGSYVQSRPRLEPYRERLMVGGMVLVAAAIAAAPSVLIDWVPVW-TVAVAWAFGCF 399
+ G V R L ++ G+ ++ + A S L++ + T+ + + G
Sbjct: 309 FGYIGGILVDRRGPL-----YVLNIGVTFLSVSFLTA-SFLLETTSWFMTIIIVFVLGGL 362

Query: 400 GMGMVIASTSVLLLKLSAPEEAGANSAALQISDGLS 435
+ ST V +EAGA + L + LS
Sbjct: 363 SFTKTVISTIV--SSSLKQQEAGAGMSLLNFTSFLS 396


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
C5746_15985  INVEPROTEIN  28     0.033    Salmonella/Shigella invasion protein E (InvE) signat...
		>INVEPROTEIN#Salmonella/Shigella invasion protein E (InvE) signature.

Length = 372

Score = 28.2 bits (62), Expect = 0.033
Identities = 18/51 (35%), Positives = 26/51 (50%), Gaps = 4/51 (7%)

Query: 62 LREIRAVLDSQVDQVAVLREHHRRLLREHDRLETLVRTVERTIAELEEGKD 112
LR+ R++ D V VLRE LLR D E + + +E + +EE D
Sbjct: 112 LRQARSLFPDPSDLVLVLRE----LLRRKDLEEIVRKKLESLLKHVEEQTD 158


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
C5746_16000  BACINVASINB  32     0.007    Salmonella/Shigella invasin protein B signature.
		>BACINVASINB#Salmonella/Shigella invasin protein B signature.

Length = 593

Score = 32.4 bits (73), Expect = 0.007
Identities = 23/89 (25%), Positives = 43/89 (48%), Gaps = 1/89 (1%)

Query: 12 TAVTQWTEMIGKLTSLQTDASAMKTKADKSTWKGENATVTKEFVTKTAKEFSDAVTEAES 71
++ Q T ++GKL +L D S + ++ + W+ + KE + +KEF A+ EA+
Sbjct: 80 SSEGQLTLLLGKLMTLLGDVSLSQLESRLAVWQAMIES-QKEMGIQVSKEFQTALGEAQE 138

Query: 72 VRDLLKDAHALLKDAHALLKSAHDDLKDA 100
DL + + A ++ +A L A
Sbjct: 139 ATDLYEASIKKTDTAKSVYDAATKKLTQA 167


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_16075  YERSSTKINASE  30     0.027    Yersinia serine/threonine protein kinase signature.
		>YERSSTKINASE#Yersinia serine/threonine protein kinase signature.

Length = 732

Score = 30.5 bits (68), Expect = 0.027
Identities = 27/98 (27%), Positives = 41/98 (41%), Gaps = 9/98 (9%)

Query: 115 VRAVGAALCGALGQLHSSEVVHRDLKPSNVMLS-AYG-PKVIDFGIARALGDDRLTRTGT 172
++ + L L + VVH D+KP NV+ A G P VID G+ G+
Sbjct: 247 IKFIAHRLLDVTNHLAKAGVVHNDIKPGNVVFDRASGEPVVIDLGLHSRSGEQ------P 300

Query: 173 AAGTPAYMSPEQASGQ-EQTPAGDVFALAGILVFAATG 209
T ++ +PE G + DVF + L+ G
Sbjct: 301 KGFTESFKAPELGVGNLGASEKSDVFLVVSTLLHCIEG 338


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_16080  YERSSTKINASE  37     1e-04    Yersinia serine/threonine protein kinase signature.
		>YERSSTKINASE#Yersinia serine/threonine protein kinase signature.

Length = 732

Score = 36.6 bits (84), Expect = 1e-04
Identities = 29/100 (29%), Positives = 47/100 (47%), Gaps = 13/100 (13%)

Query: 114 VRALAADLARALGDIHAAGLVHRDVKPANIMM--TSDGPRVIDFGIARPEHGLTLTTTGE 171
++ +A L + AG+VH D+KP N++ S P VID G+ + +GE
Sbjct: 247 IKFIAHRLLDVTNHLAKAGVVHNDIKPGNVVFDRASGEPVVIDLGLH--------SRSGE 298

Query: 172 IP--VTPGYGAPEQVLGQR-VGPAADVFSLGAVLVYAATG 208
P T + APE +G +DVF + + L++ G
Sbjct: 299 QPKGFTESFKAPELGVGNLGASEKSDVFLVVSTLLHCIEG 338


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
C5746_16085  SUBTILISIN  131    1e-37    Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 131 bits (332), Expect = 1e-37
Identities = 79/284 (27%), Positives = 119/284 (41%), Gaps = 32/284 (11%)

Query: 1 MQAETMWRTSTGEGVTVAVIDTGV-REVPELAGQLLEGKDFTDGVTGEHDEA------GT 53
+QA +W + G GV VAV+DTG + P+L +++ G++FTD G+ + GT
Sbjct: 29 IQAPAVWNQTRGRGVKVAVLDTGCDADHPDLKARIIGGRNFTDDDEGDPEIFKDYNGHGT 88

Query: 54 TAAAVIAGTGKGAGGKESAYGLAPGAKILPLKVSDGSEAPQSGERSIAINSDLAPAIRYA 113
A IA T G G+AP A +L +KV + + Q + I YA
Sbjct: 89 HVAGTIAATENENGV----VGVAPEADLLIIKVLNKQGSGQY--------DWIIQGIYYA 136

Query: 114 ADSEAKVISISVTASLSIGGAVDEAVKCALSKGKLVFAAVG---DSEFPERPVQNPAAIP 170
+ + +IS+S+ + EAVK A++ LV A G D + + P
Sbjct: 137 IEQKVDIISMSLGGPED-VPELHEAVKKAVASQILVMCAAGNEGDGDDRTDELGYPGCYN 195

Query: 171 GVTGVAALGKDLAALKSSAVGPEVTLSAAGEDVLSACAEPEGLCTSSGSAVATAVAAASA 230
V V A+ D A + S EV L A GED+LS T SG+++AT A +
Sbjct: 196 EVISVGAINFDRHASEFSNSNNEVDLVAPGEDILSTVPG-GKYATFSGTSMATPHVAGAL 254

Query: 231 ALIWAKYP-----DWTNYQVLRVMVNTVGGPTSGAVRNNYIGYG 269
ALI D T ++ ++ G G
Sbjct: 255 ALIKQLANASFERDLTEPELYAQLIKRT---IPLGNSPKMEGNG 295


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
C5746_16090  SUBTILISIN  183    3e-56    Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 183 bits (467), Expect = 3e-56
Identities = 87/293 (29%), Positives = 133/293 (45%), Gaps = 26/293 (8%)

Query: 38 QWYLDAMQAEQMWKSSTGENVTVAVIDSGVDASIPDLRGRVLKGKDLAAASPGDE--HTD 95
++ +QA +W + G V VAV+D+G DA PDL+ R++ G++ GD D
Sbjct: 23 PRGVEMIQAPAVWNQTRGRGVKVAVLDTGCDADHPDLKARIIGGRNFTDDDEGDPEIFKD 82

Query: 96 YDNHGTGMASIIAGTGNVRGGGGSFGLAPGVKILPIRLRDTTGKVNGATGNKYLNEDLSV 155
Y+ HGT +A IA T N G G+AP +L I++ + G G + +
Sbjct: 83 YNGHGTHVAGTIAATEN---ENGVVGVAPEADLLIIKVLNKQGS--GQY------DWIIQ 131

Query: 156 AIRFAVDHGAKIINASVGDSIGGSQQLTDSVKYALDKGALIFAAVGN---SADEGNLIEY 212
I +A++ II+ S+G +L ++VK A+ L+ A GN D + + Y
Sbjct: 132 GIYYAIEQKVDIISMSLGGP-EDVPELHEAVKKAVASQILVMCAAGNEGDGDDRTDELGY 190

Query: 213 PAGTPGVVGVGAIGKDLHKADFSQWGPQVDLSAPGVDMVHGCSGGTKLCRTSGTSDAAAI 272
P V+ VGAI D H ++FS +VDL APG D++ GG K SGTS A
Sbjct: 191 PGCYNEVISVGAINFDRHASEFSNSNNEVDLVAPGEDILSTVPGG-KYATFSGTSMATPH 249

Query: 273 VSASAALIWSK-----HLDWTNNQVLRVLLNTVGAPTSGALRNDYVGYGVVRP 320
V+ + ALI D T ++ L+ G G++
Sbjct: 250 VAGALALIKQLANASFERDLTEPELYAQLIKRT---IPLGNSPKMEGNGLLYL 299


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_16105  PF05616  39     6e-05    Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 39.0 bits (90), Expect = 6e-05
Identities = 29/92 (31%), Positives = 41/92 (44%), Gaps = 19/92 (20%)

Query: 356 VDLAQVPAPDTVQAPDTVQAPDTVPAADTVPAPDTVPAADTPAEHPVEATPEHHPA---- 411
VD+ +P PD P + +AP+ P + PA + PA +P P +P
Sbjct: 305 VDVQVIPRPDL--TPGSAEAPNAQPLPEVSPA-------ENPANNP---APNENPGTRPN 352

Query: 412 -EPPAGTAPDSEPDT--APDTAPDAPPLDTPP 440
EP PD+ PDT P T PD+P + P
Sbjct: 353 PEPDPDLNPDANPDTDGQPGTRPDSPAVPDRP 384


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_16135  DNABINDINGHU  92     2e-28    Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 92.1 bits (229), Expect = 2e-28
Identities = 39/90 (43%), Positives = 54/90 (60%), Gaps = 1/90 (1%)

Query: 2 NRSELVAALADRAEVTRKDADAVLAALAETVGEIVAKGDEKVTIPGFLTFERTHRAARTA 61
N+ +L+A +A+ E+T+KD+ A + A+ V +AKG+ KV + GF FE RAAR
Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGE-KVQLIGFGNFEVRERAARKG 61

Query: 62 RNPQTGDPINIPAGYSVKVSAGSKLKEAAK 91
RNPQTG+ I I A AG LK+A K
Sbjct: 62 RNPQTGEEIKIKASKVPAFKAGKALKDAVK 91


49  C5746_16560  C5746_16625  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
C5746_16560  2       10       -0.207675   DNA-binding protein
C5746_16565  2       8        -1.299915   hypothetical protein
C5746_16570  1       8        -1.421865   acyl-CoA dehydrogenase
C5746_16575  1       9        -1.893443   UDP-glucose 6-dehydrogenase
C5746_16580  0       10       -2.172612   hypothetical protein
C5746_16585  0       10       -2.029776   glyoxalase
C5746_16590  -1      10       -0.694372   membrane dipeptidase
C5746_16595  -2      11       -0.067448   5-(carboxyamino)imidazole ribonucleotide mutase
C5746_16600  2       14       -0.675600   5-(carboxyamino)imidazole ribonucleotide
C5746_16605  3       13       -0.726608   hypothetical protein
C5746_16610  4       14       -0.931615   two-component sensor histidine kinase
C5746_16615  4       14       -0.694671   DNA-binding response regulator
C5746_16620  4       15       -1.505877   MFS transporter
C5746_16625  4       15       -1.211218   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
C5746_16650  TONBPROTEIN  31     0.006    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 30.7 bits (69), Expect = 0.006
Identities = 13/51 (25%), Positives = 16/51 (31%)

Query: 140 PAVLRPTPQPPPREPAAEPHPASSEAPQPPASQGTPEPRPALVFVPAPTAP 190
P P P EP EP P + P P+P+P P
Sbjct: 57 PPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQ 107


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_16680  ADHESNFAMILY  30     0.013    Adhesin family signature.
		>ADHESNFAMILY#Adhesin family signature.

Length = 309

Score = 30.2 bits (68), Expect = 0.013
Identities = 14/60 (23%), Positives = 27/60 (45%), Gaps = 2/60 (3%)

Query: 242 AMATFVPKFVLPAAVAWTLAADENMSAHGLHHLDTTAQAMKIHAAF--EAANPRPMATVA 299
A F + +P+A W + +E + + L + K+ + F + + RPM TV+
Sbjct: 207 AFKYFSKAYGVPSAYIWEINTEEEGTPEQIKTLVEKLRQTKVPSLFVESSVDDRPMKTVS 266


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_16700  PF06580  41     5e-06    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 41.4 bits (97), Expect = 5e-06
Identities = 20/81 (24%), Positives = 31/81 (38%), Gaps = 24/81 (29%)

Query: 321 LIENSLMHG------GGTVALRTRVTGNQAVIEVTDEGPGVPPDLGARIFERTISGRNST 374
L+EN + HG GG + L+ +EV + G + + ST
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALK-----------NTKEST 311

Query: 375 GIGLAVARDLAEADGGRLELL 395
G GL R+ RL++L
Sbjct: 312 GTGLQNVRE-------RLQML 325


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
C5746_16705  HTHFIS  98     5e-26    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 98.4 bits (245), Expect = 5e-26
Identities = 38/128 (29%), Positives = 64/128 (50%)

Query: 2 TRVLLAEDDASISEPLARALRREGYEVEVREDGPTALDAGLQGGIDLVVLDLGLPGMDGL 61
+L+A+DDA+I L +AL R GY+V + + T G DLVV D+ +P +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 EVARRLRAEGHTAPILVLTARADEVDTVVGLDAGADDYVTKPFRLAELLARVRALLRRGA 121
++ R++ P+LV++A+ + + + GA DY+ KPF L EL+ + L
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 122 TEPAPQPA 129
P+
Sbjct: 124 RRPSKLED 131


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_16715  cloacin  30     0.015    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 30.5 bits (68), Expect = 0.015
Identities = 16/64 (25%), Positives = 25/64 (39%)

Query: 169 GGGSTTAGGTTTGSTTGSTTAGGTTTAGGTTTGSTTAGGTTTAGGTTAGGTTAAGGTTSG 228
G +T+G G T G + +G ++ + GG+ + G GG
Sbjct: 11 TGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGN 70

Query: 229 SGGG 232
SGGG
Sbjct: 71 SGGG 74


50  C5746_17690  C5746_17750  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_17690  1       21       -4.228204  hypothetical protein
C5746_17695  1       20       -3.653922  glycosyl transferase
C5746_17700  0       18       -4.418217  hypothetical protein
C5746_17705  -1      20       -4.402792  hypothetical protein
C5746_17710  0       19       -4.240042  hypothetical protein
C5746_17715  -1      18       -3.806996  hypothetical protein
C5746_17720  -3      24       -2.516697  D-alanyl-D-alanine carboxypeptidase
C5746_17725  -1      22       -2.643578  hypothetical protein
C5746_17730  0       20       -2.372396  transcriptional regulator
C5746_17735  0       17       -2.614183  DUF397 domain-containing protein
C5746_17740  1       18       -2.564321  hypothetical protein
C5746_17745  2       21       -2.367840  short-chain dehydrogenase
C5746_17750  3       22       -2.780058  Na+:solute symporter
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_17810  PF06776  26     0.028    Invasion associated locus B
		>PF06776#Invasion associated locus B

Length = 214

Score = 25.7 bits (56), Expect = 0.028
Identities = 10/31 (32%), Positives = 13/31 (41%), Gaps = 1/31 (3%)

Query: 1 MSLTTDGEPPGPVRFYLACDRSGCRARAVFD 31
+ L D G F + C +GC A V D
Sbjct: 143 LGLKLDNVDVGRAGF-VRCLPNGCVAEVVMD 172


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
C5746_17820  TATBPROTEIN  31     0.015    Bacterial sec-independent translocation TatB protein...
		>TATBPROTEIN#Bacterial sec-independent translocation TatB protein signature.

Length = 171

Score = 30.8 bits (69), Expect = 0.015
Identities = 14/89 (15%), Positives = 27/89 (30%)

Query: 200 DAKAATSEDESAPDADADADTDAVAGGDASEPATPADGASDKATASNDADAADEPSDDAS 259
D +E D + + P + A+ + A +
Sbjct: 83 DELRQAAESMKRSYVANDPEKASDEAHTIHNPVVKDNEAAHEGVTPAAAQTQASSPEQKP 142

Query: 260 EDAEDSEDAAASEADAANTADADSKADKP 288
E + A++A+ A + S +DKP
Sbjct: 143 ETTPEPVVKPAADAEPKTAAPSPSSSDKP 171


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_17845  DHBDHDRGNASE  89     2e-23    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 89.3 bits (221), Expect = 2e-23
Identities = 70/259 (27%), Positives = 106/259 (40%), Gaps = 13/259 (5%)

Query: 3 LAGKTVIVSGVGAGLGHQVAATVVRDGGSAVLGARTAANLAKSASEIDPEGRHTAHLATD 62
+ GK ++G G+G VA T+ G L K S + E RH D
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 63 ITDEAQCEALAALALERFGRIDAVVHVAAWDSYFGGIEDADFATWRSVIDVNLLGTLRMT 122
+ D A + + A G ID +V+VA G I W + VN G +
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGV-LRPGLIHSLSDEEWEATFSVNSTGVFNAS 124

Query: 123 RACLPGLK-ERGGSVVVIGTQSSVAAPSQVQQAAYAASKGALTSAMYSMAREFGPHRIRV 181
R+ + R GS+V +G S+ A + AAYA+SK A + E + IR
Sbjct: 125 RSVSKYMMDRRSGSIVTVG--SNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 182 NTVLPGWMWGPPVQAYVRFTAHTEGVPEAEVLGRLTER----MALPDLATDGDVAEAAAF 237
N V PG + ++++ + +V+ E + L LA D+A+A F
Sbjct: 183 NIVSPG-----STETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLF 237

Query: 238 LASDRARAITGQSLLVNAG 256
L S +A IT +L V+ G
Sbjct: 238 LVSGQAGHITMHNLCVDGG 256


51  C5746_17890  C5746_17995  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_17890  3       16       0.608970   hypothetical protein
C5746_17895  2       16       0.191112   3'-phosphoesterase
C5746_17900  4       19       -1.058209  endonuclease
C5746_17905  4       19       -0.989159  penicillin amidase
C5746_17910  0       19       -1.958657  NADPH-dependent ferric siderophore reductase
C5746_17915  -2      15       -1.147531  amino acid transporter
C5746_17920  -1      13       -0.269898  aldehyde dehydrogenase
C5746_17925  -1      16       -0.199777  aminoacylase
C5746_17930  3       16       1.173868   LLM class flavin-dependent oxidoreductase
C5746_17935  1       13       0.787027   short-chain dehydrogenase
C5746_17940  0       13       0.539644   LLM class F420-dependent oxidoreductase
C5746_17945  3       13       -0.359755  hypothetical protein
C5746_17950  4       11       -0.552984  carboxyvinyl-carboxyphosphonate
C5746_17955  2       12       -0.141930  hypothetical protein
C5746_17960  3       9        -0.006303  hypothetical protein
C5746_17965  4       9        0.506640   peptidase M23
C5746_17970  5       9        0.933418   hypothetical protein
C5746_17975  5       10       1.642543   glycosyl transferase family 2
C5746_17980  4       11       2.541763   long-chain fatty acid--CoA ligase
C5746_17985  4       11       2.596810   3-oxoacyl-ACP reductase
C5746_17990  2       11       3.186597   hypothetical protein
C5746_17995  0       10       3.145178   TetR family transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
C5746_18035  UREASE  53     3e-09    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 52.8 bits (127), Expect = 3e-09
Identities = 32/134 (23%), Positives = 53/134 (39%), Gaps = 23/134 (17%)

Query: 2 LDHLIRGATVVDGTGGPSYPADIGIRDGRIAVIAE---PGTL---------AEEAVTTED 49
+D +I A ++D G ADIG++DGRIA I + P E + E
Sbjct: 68 VDTVITNALILDHWG--IVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAGE- 124

Query: 50 ATGLVLAPGFVDPHTHY-DAQLFWDPYATPSMNHGVTTVAGGNCGFTLAPLHPERPEDAD 108
G ++ G +D H H+ Q + ++ G+T + GG G L
Sbjct: 125 --GKIVTAGGMDSHIHFICPQQIEE-----ALMSGLTCMLGGGTGPAHGTLATTCTPGPW 177

Query: 109 YTRRMMSRVEGMAL 122
+ RM+ + +
Sbjct: 178 HIARMIEAADAFPM 191


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_18045  DHBDHDRGNASE  132    2e-39    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 132 bits (332), Expect = 2e-39
Identities = 80/258 (31%), Positives = 121/258 (46%), Gaps = 5/258 (1%)

Query: 4 LDGRVVLISGAARGQGEQEARLFAAEGARVVIADVLDEQGEALAKELGEGVARFVH---L 60
++G++ I+GAA+G GE AR A++GA + D E+ E + L + AR
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSL-KAEARHAEAFPA 64

Query: 61 DVSREGEWQAAVAAAKDAFGRIDGLVNNAGILRFNELVTTPLEEFQQVVQVNQVGAFLGI 120
DV A + G ID LVN AG+LR + + EE++ VN G F
Sbjct: 65 DVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNAS 124

Query: 121 KSVAPEIEAAGGGTIVNTASYTGLTGMAFVGAYAATKHAVLGLTKVAAVELAAKGIRVNA 180
+SV+ + G+IV S + AYA++K A + TK +ELA IR N
Sbjct: 125 RSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNI 184

Query: 181 VCPGAVDTAMTNPAALDPTADPEESGAAVAELYRKLVPLGRIGQPEEVAALALFLTSDDS 240
V PG+ +T M D E+ E ++ +PL ++ +P ++A LFL S +
Sbjct: 185 VSPGSTETDMQWSLWADENGA-EQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQA 243

Query: 241 SYITGQPFVIDGGWLAGV 258
+IT +DGG GV
Sbjct: 244 GHITMHNLCVDGGATLGV 261


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_18080  cloacin  30     0.013    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 29.7 bits (66), Expect = 0.013
Identities = 22/84 (26%), Positives = 30/84 (35%), Gaps = 4/84 (4%)

Query: 76 TGATGASGTNGADGATGPV--GPTGATGASGTNGADGATGPVGPTGATGASGTNGADGAT 133
+G G GA +G + GPTG G + G + P G G SG+ G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWG--GGSGSGIHWGGG 59

Query: 134 GPVGPTGATGAAGTNGADGATGAT 157
G G G +G G +
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSA 83



Score = 28.5 bits (63), Expect = 0.027
Identities = 24/92 (26%), Positives = 34/92 (36%), Gaps = 4/92 (4%)

Query: 97 TGATGASGTNGADGATGPV--GPTGATGASGTNGADGATGPVGPTGATGAAGTNGADGAT 154
+G G GA +G + GPTG G + G + P G G +G+ G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWG--GGSGSGIHWGGG 59

Query: 155 GATGPTGATGPAGPVGPSQQANSNVTSVPAGG 186
G G G +G + S V + A G
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_18100  DHBDHDRGNASE  86     4e-21    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 85.9 bits (212), Expect = 4e-21
Identities = 54/193 (27%), Positives = 83/193 (43%), Gaps = 7/193 (3%)

Query: 199 AAPLTGRTALVTGAARGIGASVASVLARDGAQVICLDIPRSADELKRTAERLGA---TAL 255
A + G+ A +TGAA+GIG +VA LA GA + +D E ++ + A A
Sbjct: 3 AKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAF 62

Query: 256 PLDITADDAADRIA---AASPDGLDILVHNAGITRDRRLANMPPDRWASVIEVNLGSVLR 312
P D+ A D I +DILV+ AG+ R + ++ + W + VN V
Sbjct: 63 PADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFN 122

Query: 313 TTDALLMSGTVNRGGRIVATASIAGIAGNNGQTNYAAGKAGIIGLVRSLAPRAAADHGVT 372
+ ++ R G IV S YA+ KA + + L A++ +
Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLG-LELAEYNIR 181

Query: 373 VNAVAPGFIETKM 385
N V+PG ET M
Sbjct: 182 CNIVSPGSTETDM 194


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_18110  HTHTETR  74     3e-18    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 73.5 bits (180), Expect = 3e-18
Identities = 31/204 (15%), Positives = 62/204 (30%), Gaps = 14/204 (6%)

Query: 1 MPRAVREQ------QMMDAAVRTFGQRGYRAASMDEIAELAGVSKPLVYLYLNSKEELFT 54
M R +++ ++D A+R F Q+G + S+ EIA+ AGV++ +Y + K +LF+
Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60

Query: 55 ACIQREAKALVAAVRSGVEPELPADRQLWAGLRAFFTHTAKNPDGW----AVLHRQARTH 110
+ + + + + + ++ +
Sbjct: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120

Query: 111 GEPFATEVMVMRDEIVAFVTGLIGAAAREAHRDPAL-PDRDVAGLAQALVGAAESL-AGW 168
GE V + + I + L D A + G L W
Sbjct: 121 GE--MAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENW 178

Query: 169 ANDTPGVSAKEAAATLMNFAWAGL 192
K+ A +
Sbjct: 179 LFAPQSFDLKKEARDYVAILLEMY 202


52  C5746_18040  C5746_18080  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_18040  2       14       -0.873135  4-hydroxybenzoate polyprenyltransferase
C5746_18045  0       15       -0.595119  menaquinone biosynthesis decarboxylase
C5746_18050  -1      9        -1.140690  urease accessory protein
C5746_18055  3       20       -2.246029  urease accessory protein UreG
C5746_18060  9       34       -2.355080  urease accessory protein UreF
C5746_18065  8       30       -2.649558  urease subunit alpha
C5746_18070  5       21       -1.751669  urease subunit beta
C5746_18075  3       18       -0.314306  hypothetical protein
C5746_18080  2       19       0.221322   tRNA-specific adenosine deaminase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
C5746_18185  UREASE  857    0.0      Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 857 bits (2215), Expect = 0.0
Identities = 325/625 (52%), Positives = 419/625 (67%), Gaps = 60/625 (9%)

Query: 20 LTRAAYADRFGPTVRDRIRLADTELRIEIEDDWAGGPGRSGNEMVFGGGKVIRESMGQSI 79
++RAAYA+ FGPTV D++RLADTEL IE+E D+ G E+ FGGGKVIR+ MGQS
Sbjct: 5 MSRAAYANMFGPTVGDKVRLADTELFIEVEKDFT----THGEEVKFGGGKVIRDGMGQSQ 60

Query: 80 APRDPAGADGPGGPGESDGGEVVMPPDTVITGAVVLDHWGIVKADVAIRDGRITALGKAY 139
R+ DTVIT A++LDHWGIVKAD+ ++DGRI A+GKA
Sbjct: 61 VTREGG------------------AVDTVITNALILDHWGIVKADIGLKDGRIAAIGKAG 102

Query: 140 NDETMDPLHEGAGNHNDVGPIPTDFVIGPDTEVIAGNGKILTAGGVDTHVHFICPEQVGE 199
N + + ++GP TEVIAG GKI+TAGG+D+H+HFICP+Q+ E
Sbjct: 103 NPDMQPGV---------------TIIVGPGTEVIAGEGKIVTAGGMDSHIHFICPQQIEE 147

Query: 200 ALSAGVTTLIGGGTGPAEGSTATTVTPGSWHLARTFEALDSFPVNVGLLGKGATMSEKSM 259
AL +G+T ++GGGTGPA G+ ATT TPG WH+AR EA D+FP+N+ GKG ++
Sbjct: 148 ALMSGLTCMLGGGTGPAHGTLATTCTPGPWHIARMIEAADAFPMNLAFAGKGNASLPGAL 207

Query: 260 YDQVDAGVIGFKIHEDWGATPAVIKKCLEVCRQTGVQLALHADSLNEAGFVDDTIRAIDG 319
+ V G K+HEDWG TPA I CL V + VQ+ +H D+LNE+GFV+DTI AI G
Sbjct: 208 VEMVLGGATSLKLHEDWGTTPAAIDCCLSVADEYDVQVMIHTDTLNESGFVEDTIAAIKG 267

Query: 320 HSIHVFHVEGAGGGHAPDMIKMVHEGNVLPASTNPTRPLTINTVKEHFDMVMVCHHLNPK 379
+IH +H EGAGGGHAPD+I++ + NV+P+STNPTRP T+NT+ EH DM+MVCHHL+P
Sbjct: 268 RTIHAYHTEGAGGGHAPDIIRICGQPNVIPSSTNPTRPYTVNTLAEHLDMLMVCHHLSPT 327

Query: 380 IKEDLAFADSRIRPSTMAAEDILHDLGAISIMSSDAQAMGRIGEMIMRTWQTAHVMKVRR 439
I ED+AFA+SRIR T+AAEDILHD+GA SI+SSD+QAMGR+GE+ +RTWQTA MK +R
Sbjct: 328 IPEDIAFAESRIRKETIAAEDILHDIGAFSIISSDSQAMGRVGEVAIRTWQTADKMKRQR 387

Query: 440 GFLEGDKLPEGEQPTGDKGSDNRRARRYVAKYTINPAIAQGVDAVVGSVEPGKLADLVLW 499
G L+ + +DN R +RY+AKYTINPAIA G+ +GS+E GK ADLVLW
Sbjct: 388 GRLKEET----------GDNDNFRVKRYIAKYTINPAIAHGLSHEIGSLEVGKRADLVLW 437

Query: 500 EPKFFGVKPHQVIKGGQIAYAQVGDANASIPTPQPVLPRAMWGATGLAPRATSFNFVTQR 559
P FFGVKP V+ GG IA A +GD NASIPTPQPV R M+GA G + +S FV+Q
Sbjct: 438 NPAFFGVKPDMVLLGGTIAAAPMGDPNASIPTPQPVHYRPMFGAYGRSRTNSSVTFVSQA 497

Query: 560 AIDNGLPERLALGKQFKAISSTR-DVRKADMKENDATPDVRIDPDTFEVIAGGAKVEDVT 618
++D GL RL + K+ A+ +TR + KA M N TP + +DP+T+EV A G ++
Sbjct: 498 SLDAGLAGRLGVAKELVAVQNTRGGIGKASMIHNSLTPHIEVDPETYEVRADG----ELL 553

Query: 619 TCIEGQIVERNYPRELPMAQRYFLF 643
TC LPMAQRYFLF
Sbjct: 554 TC--------EPATVLPMAQRYFLF 570


53  C5746_18695  C5746_18800  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_18695  2       16       1.928735   two-component sensor histidine kinase
C5746_18700  0       14       0.486691   hypothetical protein
C5746_18710  -1      15       -1.145428  NADH-quinone oxidoreductase subunit D
C5746_18715  2       14       -1.980799  hypothetical protein
C5746_18720  2       13       -2.352299  amino acid ABC transporter substrate-binding
C5746_18725  2       11       -2.802194  ABC transporter permease
C5746_18730  4       12       -2.849161  ABC transporter permease
C5746_18735  3       11       -3.331513  ABC transporter ATP-binding protein
C5746_18740  3       12       -3.835982  esterase
C5746_18745  1       23       -1.521693  hypothetical protein
C5746_18750  -1      13       -1.120428  dihydropteroate synthase
C5746_18755  -1      15       -1.097314  DUF4440 domain-containing protein
C5746_18760  0       15       0.983092   dihydroneopterin aldolase
C5746_18765  0       18       2.767477   2-amino-4-hydroxy-6-
C5746_18770  1       19       3.842293   DUF3180 domain-containing protein
C5746_18775  -1      14       3.155491   GTP cyclohydrolase I FolE
C5746_18780  -2      12       3.852029   cell division protein FtsH
C5746_18785  -2      10       3.790095   hypoxanthine phosphoribosyltransferase
C5746_18790  -1      13       3.242615   tRNA lysidine(34) synthetase TilS
C5746_18795  2       16       2.547845   coenzyme F420 biosynthesis-associated protein
C5746_18800  3       18       1.785286   D-alanyl-D-alanine
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
C5746_18905  HTHFIS  33     0.004    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 32.9 bits (75), Expect = 0.004
Identities = 23/82 (28%), Positives = 32/82 (39%), Gaps = 18/82 (21%)

Query: 209 VLLYGPPGTGKTLLARAV---AGEAGVPFYS-----ISGSDFVEMFVGV------GASRV 254
+++ G GTGK L+ARA+ PF + I G GA
Sbjct: 163 LMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTR 222

Query: 255 RD-LFEQAKANAPAIVFVDEID 275
FEQA+ +F+DEI
Sbjct: 223 STGRFEQAEG---GTLFLDEIG 241


54  C5746_19070  C5746_19150  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_19070  2       13       1.137086   Na+/H+ antiporter NhaA
C5746_19075  3       13       1.018020   phage holin family protein
C5746_19085  3       13       1.319873   alpha/beta hydrolase
C5746_19090  5       13       1.237518   hypothetical protein
C5746_19095  3       10       0.484210   serine protease
C5746_19100  2       11       -0.574870  coenzyme A pyrophosphatase
C5746_19105  -2      9        -1.025927  endonuclease III
C5746_19115  -2      9        -0.410552  Crp/Fnr family transcriptional regulator
C5746_19120  -1      10       0.024060   MBL fold metallo-hydrolase
C5746_19125  0       14       0.446598   NUDIX hydrolase
C5746_19130  1       15       1.328408   LysR family transcriptional regulator
C5746_19135  2       20       2.663722   DUF4177 domain-containing protein
C5746_19140  5       43       3.520931   ATPase
C5746_19145  5       39       3.154849   anion-transporting ATPase
C5746_19150  3       34       1.776849   WhiB family transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
C5746_19225  V8PROTEASE  50     4e-09    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 50.4 bits (120), Expect = 4e-09
Identities = 29/205 (14%), Positives = 65/205 (31%), Gaps = 36/205 (17%)

Query: 203 SIVKVVGTAPSCGKVLEGTGFVFSDRRVMTNAHVVGGVDEPTVQI-----------GGQG 251
+ + AP+ + +G V ++TN HVV + G
Sbjct: 89 PVTYIQVEAPTGTFI--ASGVVVGKDTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNG 146

Query: 252 RLYDAKVVLYDWQRDIAVLDVPDLD--------AKPLKFTDTDHDAGTGNSAIVAGFPEN 303
++ Y + D+A++ + KP ++ + + V G+P +
Sbjct: 147 GFTAEQITKYSGEGDLAIVKFSPNEQNKHIGEVVKPATMSNNA-ETQVNQNITVTGYPGD 205

Query: 304 GSYDVRAARVRGRI-DADGPDIYHRGTVRRDVYSLFATVRQGNSGGPLLTPDGKVYGVVF 362
+G+I G + + GNSG P+ +V G+ +
Sbjct: 206 KPVATMWES-KGKITYLKGEAM-----------QYDLSTTGGNSGSPVFNEKNEVIGIHW 253

Query: 363 AKSLDDPDTGYALTADEIRQDIAHG 387
++ + + + +R +
Sbjct: 254 GGVPNEFNGAVFIN-ENVRNFLKQN 277


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
C5746_19270  BACINVASINC  30     0.016    Salmonella/Shigella invasin protein C signature.
		>BACINVASINC#Salmonella/Shigella invasin protein C signature.

Length = 409

Score = 30.2 bits (67), Expect = 0.016
Identities = 31/139 (22%), Positives = 60/139 (43%), Gaps = 17/139 (12%)

Query: 274 ERDALREAAYFVERLAAEEMPLAGLVLNRVHGSDAARLSAELALAAAENLDGVGIVDQTA 333
ER AL+ A +++L E + N ++G ++ +L AE GV +
Sbjct: 202 ERGALKHNAAKIDKLTTESHSIK----NVLNGQNSVKLGAE----------GVDSLKSLN 247

Query: 334 G-KAGLRDPADWSVVSPEAVADPDPSEHEDTED--REPDPEPAPEVATRTETVTEDASVE 390
K G + + + ++ A +E ++ ++ PE ++ R E+V D +E
Sbjct: 248 MKKTGTDATKNLNDATLKSNAGTSATESLGIKNSNKQISPEHQAILSKRLESVESDIRLE 307

Query: 391 QLTAGLLRLHAERMQVVAR 409
Q T + R+ A +MQ+
Sbjct: 308 QNTMDMTRIDARKMQMTGD 326


55  C5746_19705  C5746_19800  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_19705  -1      13       -3.039267  alpha/beta hydrolase
C5746_19710  0       16       -2.402055  hypothetical protein
C5746_19715  1       17       -3.223048  LuxR family transcriptional regulator
C5746_19720  0       21       -3.229573  hypothetical protein
C5746_19725  2       30       -4.471310  ATP-binding protein
C5746_19730  2       31       -2.906502  dynein regulation protein LC7
C5746_19740  2       30       -1.444806  DUF742 domain-containing protein
C5746_19745  2       30       -2.080370  ATP-binding protein
C5746_19750  3       36       -2.271463  cytochrome
C5746_19755  3       29       -4.168659  cytochrome P450
C5746_19760  3       28       -4.936916  enoyl-CoA hydratase
C5746_19765  3       28       -5.518988  RNA polymerase subunit sigma
C5746_19770  3       33       -5.685888  hypothetical protein
C5746_19775  1       11       -1.164284  CAP domain-containing protein
C5746_19780  1       11       -1.135837  CAP domain-containing protein
C5746_19785  2       11       -0.360470  hydrolase
C5746_19790  2       8        -0.110728  DNA-binding response regulator
C5746_19795  2       8        0.168330   two-component sensor histidine kinase
C5746_19800  2       9        0.048144   multidrug ABC transporter ATP-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
C5746_19855  TONBPROTEIN  29     0.031    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 28.8 bits (64), Expect = 0.031
Identities = 35/123 (28%), Positives = 47/123 (38%), Gaps = 3/123 (2%)

Query: 280 SVSPYGGVRAVVLLPDELLTGDAPAPPTPLAAPGGHDADVSALPRRQAPTPPPTPLFPPA 339
SV +G V A +L + PAP P++ AD L QA PPP P+ P
Sbjct: 16 SVCIHGAVVAGLLYTSVHQVIELPAPAQPISVTMVTPAD---LEPPQAVQPPPEPVVEPE 72

Query: 340 AHPAPAPQPTGSFTTAGGLPKRRRKSPVSVVPSTEPGPVRSNEETASRLGAFQRGTRTGR 399
P P P+P PK + K V + P R + SR + T R
Sbjct: 73 PEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPVESRPASPFENTAPAR 132

Query: 400 DTT 402
T+
Sbjct: 133 LTS 135


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
C5746_19915  HTHFIS  62     3e-13    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 61.8 bits (150), Expect = 3e-13
Identities = 29/120 (24%), Positives = 48/120 (40%), Gaps = 5/120 (4%)

Query: 4 TATRIVVVDDHEVVRAGFAGLLDTQPDFTVVGTASDGAEAVRVCAQQRPDVVLMDVRMPG 63
T I+V DD +R L ++ + V S+ A R A D+V+ DV MP
Sbjct: 2 TGATILVADDDAAIRTVLNQAL-SRAGYDVR-ITSNAATLWRWIAAGDGDLVVTDVVMPD 59

Query: 64 TDGIEATAQIRAATPDGGGPRILILTTFDLDEHVYDALAAGAGGFLLKDVTAEHLFDAVR 123
+ + +I+ A PD +L+++ + A GA +L K L +
Sbjct: 60 ENAFDLLPRIKKARPD---LPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_19920  PF06580  35     5e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 34.8 bits (80), Expect = 5e-04
Identities = 18/91 (19%), Positives = 37/91 (40%), Gaps = 9/91 (9%)

Query: 306 IVQEALTNARRHA-----PGAAVDVELRYAPQTLDLRIRDNGPGPDRSGARASGHGLLGM 360
+VQ + N +H G + ++ T+ L + + G ++ ++G GL +
Sbjct: 259 LVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGTGLQNV 318

Query: 361 RERVAAVGGE---LSAGPAPGGGFLIEVRLP 388
RER+ + G + G + V +P
Sbjct: 319 RERLQMLYGTEAQIKLSEKQGKVNAM-VLIP 348


56  C5746_20200  C5746_20295  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_20200  1       14       3.234296   hypothetical protein
C5746_20205  1       13       3.384488   acyl-CoA dehydrogenase
C5746_20210  2       14       3.304955   hypothetical protein
C5746_20215  3       15       1.731217   M18 family aminopeptidase
C5746_20220  2       13       2.285092   hypothetical protein
C5746_20225  2       13       2.332461   AfsR family transcriptional regulator
C5746_20230  -1      13       1.385519   hypothetical protein
C5746_20235  0       11       0.614480   alkyl hydroperoxide reductase
C5746_20240  0       12       0.305104   hypothetical protein
C5746_20245  -2      13       0.661934   carbon-nitrogen hydrolase
C5746_20250  0       19       -1.599517  hypothetical protein
C5746_20255  2       26       -3.550766  MFS transporter
C5746_20260  2       28       -3.719892  GntR family transcriptional regulator
C5746_20270  0       13       -2.760174  D-alanyl-D-alanine carboxypeptidase
C5746_20280  3       11       -2.082947  hypothetical protein
C5746_20285  3       12       -1.495628  DUF4440 domain-containing protein
C5746_20290  1       14       -1.491631  DUF397 domain-containing protein
C5746_20295  2       21       -1.912079  DUF397 domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_20340  cloacin  32     0.018    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 31.6 bits (71), Expect = 0.018
Identities = 24/46 (52%), Positives = 26/46 (56%), Gaps = 9/46 (19%)

Query: 279 NGSGSGTGIERGHGGGSGHGNGTGIANSNGA-------LSVARPVA 317
G GSG+GI GGGSGHGNG G NS G +VA PVA
Sbjct: 46 WGGGSGSGI--HWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVA 89


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_20345  PF01540  31     0.005    Adhesin lipoprotein
		>PF01540#Adhesin lipoprotein

Length = 475

Score = 30.9 bits (69), Expect = 0.005
Identities = 16/47 (34%), Positives = 23/47 (48%)

Query: 9 TILAAATTAVLALALTACGGDDSGTKSAGPASDAAAAAASTDATDAK 55
T+ A TAVL +A +C D K+ +DAA A+ A + K
Sbjct: 10 TLCGIAATAVLPIATISCNDDKLAEKNGKEKADAALKQANALAEELK 56


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_20355  PF05272  28     0.023    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 27.7 bits (61), Expect = 0.023
Identities = 20/142 (14%), Positives = 39/142 (27%), Gaps = 23/142 (16%)

Query: 9 LFAVGDDYWIEDADGRKVFLVDGKAMRVRDTFELKDAQGRILVEIRQKLLSLRDTMLIER 68
L+ G+ Y+ D F + + V + + + R+ +
Sbjct: 736 LYLAGERYFPSPEDEEIYFRPEQELRLVETGVQGR----LWALLTREGAPAAEGA----- 786

Query: 69 DGEQLARIKRKRLSLLRNHYRVTLVDGTELDVS-------GKILDREFAIDYDGELLAQI 121
Q + + LV D G++ D ++
Sbjct: 787 --AQKGYSVNTTFVTIAD-----LVQALGADPGKSSPMLEGQVRDWLNENGWEYLRETSG 839

Query: 122 SRRWLTVRDTYGIDVVREDADA 143
RR +R V+ ED +A
Sbjct: 840 QRRRGYMRPQVWPPVIAEDKEA 861


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_20370  TCRTETA  42     3e-06    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 42.1 bits (99), Expect = 3e-06
Identities = 63/344 (18%), Positives = 104/344 (30%), Gaps = 33/344 (9%)

Query: 53 FHVNASALSTFSILQLLVYAGMQIPV----GLMVDRLGTKKVLTLGAVLFTLGQLGFALS 108
+ + + IL L +YA MQ G + DR G + VL + + A +
Sbjct: 35 LVHSNDVTAHYGIL-LALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATA 93

Query: 109 PSYGMALASRALLGCGDAMTFISVLRLGARWFPARRGPLIGQVAALFGMAGNLVSTLFIA 168
P + R + G A T A + A FG +A
Sbjct: 94 PFLWVLYIGRIVAGITGA-TGAVAGAYIADITD------GDERARHFGFMSACFGFGMVA 146

Query: 169 RALHG-----FGWTTTFVGTSAAGVLVLVPLLLFLKDHPEGHEPPPVEHAGAAYVRKQIA 223
+ G F F +A L + L + +G P A + A
Sbjct: 147 GPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWA 206

Query: 224 ASWREPGTRLGMWVHFTTQFPAMVFLLLWGLPFLVEAQGLSRSTAGELLTLVVLSNMAFG 283
M V F Q V LW + F + +T G L + +
Sbjct: 207 RGMT--VVAALMAVFFIMQLVGQVPAALWVI-FGEDRFHWDATTIGISLAAFGILHSLAQ 263

Query: 284 LVYGQIIARHHEARAPLALG-TVTVTALLWASTIFYPGDRAPMWLLIVLCVVLGVCGPAS 342
+ +A R L LG T + + R M I++ + G G +
Sbjct: 264 AMITGPVAARLGERRALMLGMIADGTGYILLAFA----TRGWMAFPIMVLLASGGIGMPA 319

Query: 343 MIGFDFARPANPPERQGTASG----IVNMGGFIASM--TTLFAV 380
+ + ERQG G + ++ + + T ++A
Sbjct: 320 LQAMLSRQVDE--ERQGQLQGSLAALTSLTSIVGPLLFTAIYAA 361


57  C5746_20585  C5746_20680  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_20585  2       22       -0.302215  transcriptional regulator
C5746_20590  2       19       0.058908   *DNA-binding protein
C5746_20595  0       15       -0.262928  serine/threonine protein kinase
C5746_20600  0       11       -0.750970  hypothetical protein
C5746_20605  -2      13       -1.084075  hypothetical protein
C5746_20610  0       14       -2.069786  hypothetical protein
C5746_20615  0       15       -2.885355  *hypothetical protein
C5746_20625  -2      14       -3.156884  DNA gyrase subunit A
C5746_20630  -1      13       -3.720191  DNA topoisomerase (ATP-hydrolyzing) subunit B
C5746_20635  -1      11       -3.218394  DUF721 domain-containing protein
C5746_20640  -2      10       -3.337927  DNA replication/repair protein RecF
C5746_20645  0       12       -3.074268  6-phosphogluconate dehydrogenase
C5746_20650  1       12       -2.382555  DNA polymerase III subunit beta
C5746_20655  2       13       -2.327045  chromosomal replication initiator protein DnaA
C5746_20660  1       13       -2.001622  50S ribosomal protein L34
C5746_20665  1       11       -1.929834  ribonuclease P protein component
C5746_20670  0       10       -2.664117  membrane protein insertion efficiency factor
C5746_20675  0       11       -1.620672  membrane protein insertase YidC
C5746_20680  2       12       -1.285056  single-stranded DNA-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_20715  PF03544  39     2e-05    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 39.2 bits (91), Expect = 2e-05
Identities = 25/127 (19%), Positives = 35/127 (27%)

Query: 289 PAGPIAAPRTASQDAQPVQTDQSAQQPYTQPPVPMSETGSFHLPPPPRQPTPLPHPAQPA 348
PA PI+ A D +P Q Q +P +P P P P P
Sbjct: 46 PAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKP 105

Query: 349 TPPTPVPAHAASPFPAPDPSQAPTAAVQHEAALTRAYTAGQPQVPAPGSGVSPHLSAAVP 408
P V P +P + TA + + LS P
Sbjct: 106 KPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVASGPRALSRNQP 165

Query: 409 VHAPRTR 415
+ R +
Sbjct: 166 QYPARAQ 172


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
C5746_20750  GPOSANCHOR  30     0.005    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 30.4 bits (68), Expect = 0.005
Identities = 15/61 (24%), Positives = 21/61 (34%), Gaps = 7/61 (11%)

Query: 12 EGYATGPLPGEREPAPGQASGPYHPPQAYPSPAGGTQGGAQGAAAAQAARR-PRTGARTT 70
G A+ + +P G PQA GT+ A + R+ P TG
Sbjct: 460 AGKASDSQTPDAKPGNKAVPGKGQAPQA------GTKPNQNKAPMKETKRQLPSTGETAN 513

Query: 71 P 71
P
Sbjct: 514 P 514


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
C5746_20790  HTHFIS  30     0.028    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 30.2 bits (68), Expect = 0.028
Identities = 13/48 (27%), Positives = 17/48 (35%), Gaps = 3/48 (6%)

Query: 268 VIGASNRFAHAAAVAVAEAPAKAYNPLFIYGESGLGKTHLLHAIGHYA 315
++G S V L I GESG GK + A+ H
Sbjct: 139 LVGRSAAMQEIYRVLARLMQTDL--TLMITGESGTGKELVARAL-HDY 183


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
C5746_20810  60KDINNERMP  236    1e-74    60kDa inner membrane protein signature.
		>60KDINNERMP#60kDa inner membrane protein signature.

Length = 548

Score = 236 bits (604), Expect = 1e-74
Identities = 75/250 (30%), Positives = 113/250 (45%), Gaps = 48/250 (19%)

Query: 6 SLFSFITWPVSWVIVQFHTLYGAIFGDDTGWAWGLSIVSLVVLIRICLIPLFVKQIKSTR 65
FI+ P+ ++ H+ G WG SI+ + ++R + PL Q S
Sbjct: 331 GWLWFISQPLFKLLKWIHSFVGN---------WGFSIIIITFIVRGIMYPLTKAQYTSMA 381

Query: 66 NMQVLQPKMKAIQERYKSDKQRQSEEMMKLYKETGTNPLSSCLPILAQSPFFFALYHVLS 125
M++LQPK++A++ER DKQR S+EMM LYK NPL C P+L Q P F ALY++L
Sbjct: 382 KMRMLQPKIQAMRERLGDDKQRISQEMMALYKAEKVNPLGGCFPLLIQMPIFLALYYMLM 441

Query: 126 AIASGKTIGVIDQPLLDSARQAHIFGAPLAAKFMDSEAKVQALGASLLDVRVVTAIMIVM 185
+ AP A D A+ I+ ++
Sbjct: 442 -------------------GSVELRQAPFALWIHDLSAQDPYY------------ILPIL 470

Query: 186 MSASQFFTQRQLMTKNVDLTVKTPYMQQQKMLMYIFPVIFAVMGINFPVGVLVYWLTTNV 245
M + FF Q K TV P Q+ +M PVIF V + FP G+++Y++ +N+
Sbjct: 471 MGVTMFFIQ-----KMSPTTVTDP---MQQKIMTFMPVIFTVFFLWFPSGLVLYYIVSNL 522

Query: 246 WTMGQQMYVI 255
T+ QQ +
Sbjct: 523 VTIIQQQLIY 532


58  C5746_20730  C5746_20985  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_20730  0       19       -3.801649  serine/threonine protein kinase
C5746_20735  -1      21       -3.324777  murein biosynthesis integral membrane protein
C5746_20740  -1      26       -3.419899  hypothetical protein
C5746_20750  -1      26       -3.223267  CCA tRNA nucleotidyltransferase
C5746_20755  -1      25       -3.324777  MFS transporter
C5746_20760  -1      25       -3.292818  inositol-3-phosphate synthase
C5746_20765  0       23       -2.436112  PadR family transcriptional regulator
C5746_20770  -1      19       -2.656666  penicillin-binding protein
C5746_20775  0       14       -2.445932  hypothetical protein
C5746_20780  1       18       -2.878768  alanine racemase
C5746_20785  3       16       -4.553446  peptidoglycan bridge formation protein FemAB
C5746_20790  2       15       -3.707660  hypothetical protein
C5746_20795  3       25       -4.514192  30S ribosomal protein S6
C5746_20800  1       25       -4.975172  single-stranded DNA-binding protein
C5746_20805  3       26       -4.755904  30S ribosomal protein S18
C5746_20810  4       27       -4.629372  50S ribosomal protein L9
C5746_20815  3       30       -3.715044  MATE family efflux transporter
C5746_20820  3       24       -3.657487  replicative DNA helicase
C5746_20825  2       22       -3.740949  transposase
C5746_20830  4       20       -2.896878  transposase
C5746_20835  2       22       -2.663027  AAA family ATPase
C5746_20840  -2      17       -1.977976  LysR family transcriptional regulator
C5746_20845  -2      14       -1.779674  hypothetical protein
C5746_20850  -2      13       -2.204329  hypothetical protein
C5746_20855  -1      12       -2.126486  hypothetical protein
C5746_20860  -2      10       -2.452597  hypothetical protein
C5746_20865  -2      10       -2.588745  transposase
C5746_20875  -1      10       -3.979047  deaminase
C5746_20885  -2      11       -3.582414  hypothetical protein
C5746_20890  -1      9        -3.473768  TetR family transcriptional regulator
C5746_20895  -2      10       -3.784151  deaminase
C5746_20900  -1      12       -2.233569  serine hydrolase
C5746_20905  -1      16       -2.630333  hypothetical protein
C5746_20910  0       20       -3.722787  GNAT family N-acetyltransferase
C5746_20915  1       20       -2.273448  MarR family transcriptional regulator
C5746_20920  2       18       -2.460748  GNAT family N-acetyltransferase
C5746_20925  1       14       -2.203282  DUF2269 domain-containing protein
C5746_20930  0       12       -2.194302  flavohemoprotein
C5746_20935  0       11       -2.266358  NUDIX hydrolase
C5746_20940  0       11       -1.881000  LysR family transcriptional regulator
C5746_20945  0       12       -2.264332  cystathionine gamma-lyase
C5746_20950  0       15       -2.387914  hypothetical protein
C5746_20960  4       20       -3.233703  cupin domain-containing protein
C5746_20965  3       21       -3.447326  hypothetical protein
C5746_20970  5       24       -5.033206  transcriptional regulator
C5746_20975  5       25       -3.420922  SsgA family sporulation/cell division regulator
C5746_20980  3       20       -2.653941  hypothetical protein
C5746_20985  2       21       -2.447065  phosphomethylpyrimidine synthase ThiC
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_20895  cloacin  41     2e-05    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 41.2 bits (96), Expect = 2e-05
Identities = 28/62 (45%), Positives = 33/62 (53%), Gaps = 9/62 (14%)

Query: 844 NNNGGTNNGAGNGGAAGGPGTSP--------SPTSSLPGGGNGNGNGNGNGNGNGNGNGT 895
N NGG GGA+ G G S S + GGG+G+GNG GNGN G G+GT
Sbjct: 19 NINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNS-GGGSGT 77

Query: 896 GG 897
GG
Sbjct: 78 GG 79


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
C5746_20905  ALARACEMASE  46     1e-07    Alanine racemase signature.
		>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 45.9 bits (109), Expect = 1e-07
Identities = 53/263 (20%), Positives = 88/263 (33%), Gaps = 28/263 (10%)

Query: 26 LVPVCKGNGYGFGHERLADEAIRFGSDTLAVGTTYEAARI-KDWFSGDLLVLTPFRRGEE 84
+ V K N YG G ER+ +D A+ EA + + + G +L+L F ++
Sbjct: 30 VWSVVKANAYGHGIERIWSAIG--ATDGFALLNLEEAITLRERGWKGPILMLEGFFHAQD 87

Query: 85 PVPLPD-RVIRSVSSVDGVHALVGAR------VVIECMSSMKRHGVKEEELG---QLHAA 134
R+ V S + AL AR + ++ S M R G + + + Q A
Sbjct: 88 LEIYDQHRLTTCVHSNWQLKALQNARLKAPLDIYLKVNSGMNRLGFQPDRVLTVWQQLRA 147

Query: 135 IEDVRLEGFALHLPLDRTDGSDAVEEVIGWMDRLRAARLPLHTMFVSHLRAEELGRLQQQ 194
+ +V H + D + M R+ A L A L +
Sbjct: 148 MANVGEMTLMSHFA--EAEHPDGISGA---MARIEQAAEGLECRRSLSNSAATLWHPEAH 202

Query: 195 FPQTRF---------RARIGTRLWLGDHEATEYRGAVLDVTRVVKGDRFGY-RQQKAASD 244
F R + G ++ V + G+R GY + A +
Sbjct: 203 FDWVRPGIILYGASPSGQWRDIANTGLRPVMTLSSEIIGVQTLKAGERVGYGGRYTARDE 262

Query: 245 GWLVVVAGGTSHGVGLEAPKALH 267
+ +VA G + G AP
Sbjct: 263 QRIGIVAAGYADGYPRHAPTGTP 285


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_20925  cloacin  47     1e-08    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 47.4 bits (112), Expect = 1e-08
Identities = 28/73 (38%), Positives = 34/73 (46%), Gaps = 1/73 (1%)

Query: 122 GRGGQGGQGGYGGGQQGGGNWGGGPGGGGQQGGGGAPADDPWATSAPAGGQQGGGQQGGG 181
GRG G G GG G G GGG G G + ++PW + +G GGG G
Sbjct: 6 GRGHNTGAHSTSGNINGGPT-GLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGN 64

Query: 182 GGWGGSSGGSGGG 194
GG G+SGG G
Sbjct: 65 GGGNGNSGGGSGT 77



Score = 43.2 bits (101), Expect = 3e-07
Identities = 30/75 (40%), Positives = 35/75 (46%), Gaps = 5/75 (6%)

Query: 125 GQGGQGGYGGGQQGGGNWGGGPGGGGQQGGGGAPADDPWATS-APAGGQQGGGQQGGGGG 183
G G+G G GN GGP G G GGGA W++ P GG G G GGG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLG--VGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 184 WGGSSG--GSGGGYS 196
G+ G G+ GG S
Sbjct: 61 GHGNGGGNGNSGGGS 75



Score = 37.8 bits (87), Expect = 2e-05
Identities = 31/95 (32%), Positives = 32/95 (33%), Gaps = 23/95 (24%)

Query: 112 NATAKVTKTTGRGGQGGQGGYGGGQQGGG------NWGGGPGGGGQQGGGGAPADDPWAT 165
N A T GG G G GG G G WGGG G G GGG
Sbjct: 10 NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS--------- 60

Query: 166 SAPAGGQQGGGQQGGGGGWGGSSGGSGGGYSDEPP 200
G G GG G GG SG G + P
Sbjct: 61 --------GHGNGGGNGNSGGGSGTGGNLSAVAAP 87



Score = 30.5 bits (68), Expect = 0.005
Identities = 19/55 (34%), Positives = 20/55 (36%)

Query: 122 GRGGQGGQGGYGGGQQGGGNWGGGPGGGGQQGGGGAPADDPWATSAPAGGQQGGG 176
G G G GG G G G GGG GG + P A PA G G
Sbjct: 47 GGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAG 101
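
When a hit reports several HSPs, as the cloacin hit for C5746_20925 does, the per-HSP header lines can be collected with a regular expression and the best-scoring HSP compared against the summary row. A minimal sketch in Python; the pattern and the helper name parse_hsp_scores are assumptions for illustration, written against the header text as it appears in this report:

```python
import re

# Matches header lines such as "Score = 47.4 bits (112), Expect = 1e-08".
HSP_SCORE = re.compile(r"Score = ([\d.]+) bits \((\d+)\), Expect = (\S+)")


def parse_hsp_scores(text):
    """Return a list of (bit_score, raw_score, e_value) tuples, one per HSP."""
    return [(float(bits), int(raw), float(expect))
            for bits, raw, expect in HSP_SCORE.findall(text)]


# Header lines copied from the four HSPs of the C5746_20925 cloacin hit.
report = """\
Score = 47.4 bits (112), Expect = 1e-08
Score = 43.2 bits (101), Expect = 3e-07
Score = 37.8 bits (87), Expect = 2e-05
Score = 30.5 bits (68), Expect = 0.005
"""

hsps = parse_hsp_scores(report)
best = max(hsps, key=lambda hsp: hsp[0])  # highest bit score
print(len(hsps), best)
```

The best HSP here (47.4 bits, E-value 1e-08) corresponds to the rounded Score and E-value shown in the summary row for this locus.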


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_20965  BCTERIALGSPF  30     0.046    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 29.8 bits (67), Expect = 0.046
Identities = 30/140 (21%), Positives = 57/140 (40%), Gaps = 20/140 (14%)

Query: 372 HLRHRLPTQIPRLPARTNTDITLRSRKIPSMFWNSWTVRLAPPEGT-FARILASALAASL 430
R L + +P + + + + + +RL+ + R LA+ +AAS+
Sbjct: 27 QARQLLRER-GLVPLSVDENRGDQQKSGSTGLSLRRKIRLSTSDLALLTRQLATLVAASM 85

Query: 431 LIVDS-----RIDFDTATRRL-----GNVIEGHDLSRVLQRLDNHPGWAD-LVTALVRLA 479
+ ++ + +L V+EGH L+ ++ PG + L A+V
Sbjct: 86 PLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSLADAMKC---FPGSFERLYCAMVAAG 142

Query: 480 D---HLDAV-DVPIDYASRR 495
+ HLDAV + DY +R
Sbjct: 143 ETSGHLDAVLNRLADYTEQR 162


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_20990  PF05272  27     0.043    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 27.3 bits (60), Expect = 0.043
Identities = 13/71 (18%), Positives = 21/71 (29%), Gaps = 4/71 (5%)

Query: 40 KGKVRRHIP----DYLLLTGQVPVVVDVKPLHRLSKAEVEFTFDWTRQTVESRGWKYDVW 95
KG D + G P ++ E +++ R+T R Y
Sbjct: 789 KGYSVNTTFVTIADLVQALGADPGKSSPMLEGQVRDWLNENGWEYLRETSGQRRRGYMRP 848

Query: 96 SEPPAVELENI 106
P V E+
Sbjct: 849 QVWPPVIAEDK 859


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_21020  HTHTETR  53     5e-11    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 53.1 bits (127), Expect = 5e-11
Identities = 21/121 (17%), Positives = 47/121 (38%), Gaps = 5/121 (4%)

Query: 6 ERRTALVDAAIEVLADEGARGLTFRAVDARAGVPVGTSSNYFANRDDLFMQAAARITVRM 65
E R ++D A+ + + +G + + AGV G +F ++ DLF + +
Sbjct: 11 ETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNI 70

Query: 66 TPDPAKVEEAMRPTPS---RELVADLMRWLVQRMEDDRTGYLAMLELRLEATRRPALRNR 122
+ + P RE++ ++ V E+ R + ++ + E A+ +
Sbjct: 71 GELELEYQAKFPGDPLSVLREILIHVLESTVT--EERRRLLMEIIFHKCEFVGEMAVVQQ 128

Query: 123 L 123

Sbjct: 129 A 129


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_21040  SACTRNSFRASE  39     1e-06    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 39.2 bits (91), Expect = 1e-06
Identities = 14/59 (23%), Positives = 27/59 (45%)

Query: 82 RSVIEGVRIHADERGSGLGTQLIEWAVDESRRQDCQLVQLTSDASRTDAHRFYERLGFT 140
++IE + + D R G+GT L+ A++ ++ + L + A FY + F
Sbjct: 89 YALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFI 147


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_21115  PF05272  30     0.041    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.7 bits (66), Expect = 0.041
Identities = 13/68 (19%), Positives = 28/68 (41%), Gaps = 1/68 (1%)

Query: 131 PGRPRLPRRSRDGQPVTQLAYARRGEITPEMEYVAIRENVEPEVVREEIAAGRAVLPANV 190
PG+ + + + + E + + +R V P V+ E+ A +A P +
Sbjct: 811 PGKSSPMLEGQVRDWLNENGWEYLRETSGQRRRGYMRPQVWPPVIAEDKEADQAHAPGDQ 870

Query: 191 NHPE-IEP 197
+ + +EP
Sbjct: 871 DQQQPVEP 878


59  C5746_21550  C5746_21625  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_21550  3       13       -1.070471  alpha/beta hydrolase
C5746_21555  4       14       -1.166003  hypothetical protein
C5746_21560  4       15       -2.461592  LysR family transcriptional regulator
C5746_21565  4       17       -2.765189  oxidoreductase
C5746_21570  4       15       -2.979354  1-aminocyclopropane-1-carboxylate deaminase
C5746_21575  5       17       -2.000748  *DNA polymerase III subunit gamma and tau
C5746_21580  0       13       -0.473015  phosphoribosylamine--glycine ligase
C5746_21585  0       17       0.042226   hypothetical protein
C5746_21590  0       17       1.449642   phosphoribosylamine--glycine ligase
C5746_21595  1       18       2.014918   phosphoribosylaminoimidazolesuccinocarboxamide
C5746_21605  0       17       1.281039   *DNA-binding response regulator
C5746_21610  1       18       1.720715   sensor histidine kinase
C5746_21620  4       14       1.044279   hypothetical protein
C5746_21625  4       14       1.114286   multidrug ABC transporter ATP-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_21720  PF03544  39     5e-05    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 38.8 bits (90), Expect = 5e-05
Identities = 21/116 (18%), Positives = 27/116 (23%), Gaps = 3/116 (2%)

Query: 411 VPVAPAVQPGAGPAAARAAVRGEAPAPAAPPAPVAAPPAPAQPVAPVAPQPPAEAPAPAG 470
VAPA P A P P P+ PP A V P P P
Sbjct: 53 TMVAPADLE---PPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVK 109

Query: 471 GQRPGAWPTAAAPESGRRPGGWPTASAPGQSPVPQTAPATAPAQAAPAAPVPAVEP 526
P + P S + A+ + +P
Sbjct: 110 KVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVASGPRALSRNQP 165



Score = 34.6 bits (79), Expect = 0.001
Identities = 23/101 (22%), Positives = 30/101 (29%), Gaps = 8/101 (7%)

Query: 431 RGEAPAPAAP-------PAPVAAPPAPAQPVAPVAPQPPAEAPAP-AGGQRPGAWPTAAA 482
E PAPA P PA + P A P PV P P P + P
Sbjct: 40 VIELPAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKP 99

Query: 483 PESGRRPGGWPTASAPGQSPVPQTAPATAPAQAAPAAPVPA 523
+ ++ PA+ APA P +
Sbjct: 100 KPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSS 140



Score = 31.5 bits (71), Expect = 0.009
Identities = 19/96 (19%), Positives = 32/96 (33%), Gaps = 3/96 (3%)

Query: 621 APPQTGGGRLSAVAPAPQRPAPAPQQSYEPRPAPAPHQQPGAAPAAEPASQSYSAPEPPR 680
PP+ P P+ P AP +P+P P P +P + P R
Sbjct: 68 PPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKP---VKKVEQPKRDVKPVESR 124

Query: 681 SVAPEDDTPEADDPDLVDSALSGHDLIVRELGATVV 716
+P ++T A +A + + G +
Sbjct: 125 PASPFENTAPARPTSSTATAATSKPVTSVASGPRAL 160



Score = 29.6 bits (66), Expect = 0.036
Identities = 17/107 (15%), Positives = 24/107 (22%), Gaps = 1/107 (0%)

Query: 400 VPGPEALAHAPVPVAPAVQPGAGPAAARAAVRGEAPAPAAPPAPVAAPPAP-AQPVAPVA 458
+ P+A+ P PV P P P P QP V
Sbjct: 60 LEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVK 119

Query: 459 PQPPAEAPAPAGGQRPGAWPTAAAPESGRRPGGWPTASAPGQSPVPQ 505
P A + A + + + PQ
Sbjct: 120 PVESRPASPFENTAPARPTSSTATAATSKPVTSVASGPRALSRNQPQ 166


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
C5746_21750  HTHFIS  64     2e-14    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 64.1 bits (156), Expect = 2e-14
Identities = 27/119 (22%), Positives = 48/119 (40%), Gaps = 2/119 (1%)

Query: 1 MTTVRVLLADDEHLIRGALAALLALEDDLVVVAEAATGREALAMARAHRPDVAVLDLEMP 60
MT +L+ADD+ IR L L+ V + A A D+ V D+ MP
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAA--TLWRWIAAGDGDLVVTDVVMP 58

Query: 61 GADGVNVATSLRTELPGCRTMIVTSHGRPGHLKRALAAGVRAFVPKTVSARQLAGIIRT 119
+ ++ ++ P +++++ +A G ++PK +L GII
Sbjct: 59 DENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_21755  PF06580  29     0.026    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 29.4 bits (66), Expect = 0.026
Identities = 18/61 (29%), Positives = 30/61 (49%), Gaps = 8/61 (13%)

Query: 329 NVLRHG---DPRHCTIRFRASADAAVLL--VENDGAAAATGPGGNSGPGGSGLAGLRERL 383
N ++HG P+ I + + D + VEN G+ A ++G +GL +RERL
Sbjct: 266 NGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTG---TGLQNVRERL 322

Query: 384 R 384
+
Sbjct: 323 Q 323


60  C5746_21845  C5746_21900  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_21845  2       11       -0.405627  tRNA (guanosine(46)-N7)-methyltransferase TrmB
C5746_21850  2       11       -0.524538  L-2-hydroxyglutarate oxidase
C5746_21860  2       12       0.085135   sporulation protein
C5746_21865  3       17       2.052819   asparagine synthase
C5746_21870  4       15       0.457729   AfsR family transcriptional regulator
C5746_21875  3       16       -0.057845  RNA polymerase subunit sigma-24
C5746_21895  3       16       -0.092019  TetR family transcriptional regulator
C5746_21900  2       14       -0.565477  FAD-dependent oxidoreductase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_22010  TYPE3OMGPROT  30     0.040    Type III secretion system outer membrane G protein ...
		>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature.

Length = 607

Score = 29.9 bits (67), Expect = 0.040
Identities = 19/88 (21%), Positives = 29/88 (32%), Gaps = 11/88 (12%)

Query: 577 LRRVLAGAGIHELPPGWGTPSLATSNATA----RTGLRAALPDLIALFDAPLLADAGLVE 632
+RVL G + W LR L D A +DA +V
Sbjct: 9 FKRVLTGTLLLLSSYSWAQELDWLPIPYVYVAKGESLRDLLTDFGANYDAT------VVV 62

Query: 633 ARVVRKALRAASEGEPLPLDGLAHLAAT 660
+ + + E + P D L H+A+
Sbjct: 63 SDKINDKVSGQFEHDN-PQDFLQHIASL 89


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
C5746_22020  TONBPROTEIN  53     8e-10    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 53.1 bits (127), Expect = 8e-10
Identities = 25/89 (28%), Positives = 29/89 (32%)

Query: 433 PAAVAPAVPTPAPPKPTPTPTPTPKPDAPAPPAPPAPSPTPPPKSTPMPTPTPTPTPTPT 492
P AV P P+P P P P P +AP P P P P PK P P
Sbjct: 58 PQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPV 117

Query: 493 PKPTPPPKPSTPPTPKPPAPAPTPPPSPP 521
P +T P + A P
Sbjct: 118 ESRPASPFENTAPARLTSSTATAATSKPV 146



Score = 46.9 bits (111), Expect = 9e-08
Identities = 32/96 (33%), Positives = 37/96 (38%), Gaps = 10/96 (10%)

Query: 434 AAVAPAVPTPAPPKPTP----TPTPTPKPDAPAPPAPPAPSPTPPPKSTPMPTPTPTPTP 489
+V + PAP +P TP P A PP P P P P+ P P P P
Sbjct: 30 TSVHQVIELPAPAQPISVTMVTPADLEPPQAVQPPPEPVVEPEPEPEPIPEP-PKEAPVV 88

Query: 490 TPTPKPTPPPKPSTPP----TPKP-PAPAPTPPPSP 520
PKP P PKP PK P + P SP
Sbjct: 89 IEKPKPKPKPKPKPVKKVQEQPKRDVKPVESRPASP 124



Score = 40.4 bits (94), Expect = 1e-05
Identities = 20/75 (26%), Positives = 25/75 (33%), Gaps = 5/75 (6%)

Query: 456 PKPDAPAPPAPPAPSPTPPPKSTPMPTPTPTPTPTPTPKPTPPPKPSTPPTPKPPAPAPT 515
P P P P+ PP++ P P+P P P P P P
Sbjct: 39 PAPAQPISVTMVTPADLEPPQAVQPPPEPVVE-----PEPEPEPIPEPPKEAPVVIEKPK 93

Query: 516 PPPSPPAPTVYRVSE 530
P P P V +V E
Sbjct: 94 PKPKPKPKPVKKVQE 108



Score = 35.3 bits (81), Expect = 5e-04
Identities = 18/96 (18%), Positives = 21/96 (21%), Gaps = 3/96 (3%)

Query: 422 QPKPEAGHAAKPAAVAPAVPTPAP---PKPTPTPTPTPKPDAPAPPAPPAPSPTPPPKST 478
P P PKP P P P P P P S
Sbjct: 65 PEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPVESRPASP 124

Query: 479 PMPTPTPTPTPTPTPKPTPPPKPSTPPTPKPPAPAP 514
T T + T P S P+ +
Sbjct: 125 FENTAPARLTSSTATAATSKPVTSVASGPRALSRNQ 160


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_22025  HTHTETR  58     3e-12    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 58.5 bits (141), Expect = 3e-12
Identities = 38/253 (15%), Positives = 69/253 (27%), Gaps = 61/253 (24%)

Query: 60 PLRVDAQRNLEHVLRAAREVFGELGY-GAPMEDVARRAKVGVGTVYRRFPSKDVLVRRIA 118
+ +AQ +H+L A +F + G + ++A+ A V G +Y F K L I
Sbjct: 4 KTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIW 63

Query: 119 EEETSRLTDQARTALGQEEEPWSALSRFLRTSVASGAGRLLPPQVLRVGVDTDADADDSN 178
E S + + + P VLR +
Sbjct: 64 ELSESNIGELELEYQAKFPG--------------------DPLSVLREILIH-------- 95

Query: 179 AAGESGSIAASVSDSGPGSARDETRVPQQRQGAGQADLRPADGRSTAETGIEDDGTGAGE 238
+ T ++R+ + + + E
Sbjct: 96 -------------------VLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRNLCLE 136

Query: 239 LLEIVGRLVDRARAAGELRRDVTVADV----------LLVIATAAPSLPDAAQQAAASSR 288
+ + + + A L D+ L+ AP D ++A
Sbjct: 137 SYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDLKKEARD--- 193

Query: 289 LLDILLEGLRSRP 301
+ ILLE P
Sbjct: 194 YVAILLEMYLLCP 206


61  C5746_22120  C5746_22165  Y  N  N  Genomic Island
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_22120  2       12       -0.854091  hypothetical protein
C5746_22130  2       11       0.422992   peptidase P60
C5746_22135  2       13       0.313868   hypothetical protein
C5746_22140  2       20       1.192029   hypothetical protein
C5746_22150  4       28       -0.247854  hypothetical protein
C5746_22155  6       25       -1.625431  hypothetical protein
C5746_22160  4       21       -1.714682  hypothetical protein
C5746_22165  2       16       -1.827008  metal-sensitive transcriptional regulator
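
In the island summary rows shown in this part of the report, the Prediction column appears to follow from the Bias and Virulence flags: rows with Bias = Y and Virulence = Y are labelled "Pathogenicity Island (biased-composition)", and rows with Bias = Y and Virulence = N are labelled "Genomic Island". The sketch below encodes only that observed pattern. It is an inference from these rows, not PredictBias's documented decision logic, and the function name classify_island is hypothetical:

```python
def classify_island(bias, virulence, insertion_elements):
    """Label an island from its Y/N flags, following the pattern seen in this report.

    insertion_elements is accepted for completeness; in the rows shown here
    it does not change the label.
    """
    if bias == "Y" and virulence == "Y":
        return "Pathogenicity Island (biased-composition)"
    if bias == "Y" and virulence == "N":
        return "Genomic Island"
    # Flag combinations that do not occur in this part of the report.
    return "undetermined"


# (S.No, Bias, Virulence, Insertion elements, Prediction) copied from the report.
rows = [
    (58, "Y", "Y", "Y", "Pathogenicity Island (biased-composition)"),
    (60, "Y", "Y", "N", "Pathogenicity Island (biased-composition)"),
    (61, "Y", "N", "N", "Genomic Island"),
]

for number, bias, virulence, insertion, reported in rows:
    predicted = classify_island(bias, virulence, insertion)
    print(number, predicted == reported)
```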
62  C5746_22210  C5746_22320  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_22210  2       19       -2.753435  RNA degradosome polyphosphate kinase
C5746_22215  2       21       -2.985994  ABC transporter permease
C5746_22220  2       19       -3.000331  ABC transporter ATP-binding protein
C5746_22225  1       8        -1.249550  GntR family transcriptional regulator
C5746_22235  1       11       0.960110   mycothiol synthase
C5746_22240  3       12       1.622812   HIT family protein
C5746_22245  2       11       1.926864   phospholipase
C5746_22250  0       12       0.823149   hypothetical protein
C5746_22255  3       18       -1.571955  two-component sensor histidine kinase
C5746_22260  2       17       -2.450966  DNA-binding response regulator
C5746_22265  4       15       -3.872450  hypothetical protein
C5746_22270  3       15       -3.698376  serine protease
C5746_22275  2       16       -3.609678  LacI family transcriptional regulator
C5746_22285  1       15       -0.753560  transcriptional regulator
C5746_22290  0       17       -0.434436  alpha/beta hydrolase
C5746_22295  1       17       0.264514   molybdopterin synthase sulfur carrier subunit
C5746_22300  0       16       1.108720   hypothetical protein
C5746_22305  1       15       2.032664   DUF2993 domain-containing protein
C5746_22310  1       15       2.135658   sulfurtransferase
C5746_22315  0       16       1.253046   DUF1416 domain-containing protein
C5746_22320  2       14       2.179406   DUF3099 domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_22330  BACSURFANTGN  27     0.023    Yersinia/Haemophilus virulence surface antigen sign...
		>BACSURFANTGN#Yersinia/Haemophilus virulence surface antigen signature.

Length = 322

Score = 27.0 bits (59), Expect = 0.023
Identities = 8/34 (23%), Positives = 12/34 (35%), Gaps = 1/34 (2%)

Query: 95 APLRAELADWARRARAAG-LEKDDVSALFTAVLD 127
+ E R G E + + L A+LD
Sbjct: 205 SERMIERHCLLRPVDVTGTTESEGLDQLLNAILD 238


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_22335  SACTRNSFRASE  36     5e-05    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 36.5 bits (84), Expect = 5e-05
Identities = 22/100 (22%), Positives = 31/100 (31%), Gaps = 12/100 (12%)

Query: 212 KGFFLAERDGELIGF-----HWTKVHADEHLGEVYVVGIRPDAQGGGLGKALTAIGLRHL 266
K FL + IG +W E + + D + G+G AL +
Sbjct: 65 KAAFLYYLENNCIGRIKIRSNWNGYALIED------IAVAKDYRKKGVGTALLHKAIEWA 118

Query: 267 AAEGLPTAMLYVDADNKAALAVYERMGFATHEVDLM-YRT 305
ML N +A Y + F VD M Y
Sbjct: 119 KENHFCGLMLETQDINISACHFYAKHHFIIGAVDTMLYSN 158


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
C5746_22365  HTHFIS  100    3e-26    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 99.5 bits (248), Expect = 3e-26
Identities = 35/117 (29%), Positives = 58/117 (49%)

Query: 10 RILIVDDEPAVREALQRSLAFEGYGTEVAVDGLDALARAESYAPDLIVLDIQMPRMDGLT 69
IL+ DD+ A+R L ++L+ GY + + + DL+V D+ MP +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 70 AARRLRSAGTTTPILMLTARDTVGDRVTGLDAGADDYLVKPFELDELFARIRALLRR 126
R++ A P+L+++A++T + + GA DYL KPF+L EL I L
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
C5746_22375  V8PROTEASE  51     4e-09    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 50.8 bits (121), Expect = 4e-09
Identities = 37/207 (17%), Positives = 61/207 (29%), Gaps = 61/207 (29%)

Query: 146 IVEVNATSTAGTSTGSGVVITSDGEVVTNNHVISGASSIKVTLST------------GKT 193
+ + + GT SGVV+ ++TN HV+ L G
Sbjct: 90 VTYIQVEAPTGTFIASGVVV-GKDTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGF 148

Query: 194 YNAKVVGTDADKDLALIKLQG-------ASGLKTATLGDSSAVSVGDQVVAIGSPEGLTG 246
++ + DLA++K +K AT+ +++ V + G P
Sbjct: 149 TAEQITKYSGEGDLAIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQNITVTGYPGDKP- 207

Query: 247 TVTSGIVSALDRDVTVAKDDSSGQGQGGWGGGGEQWPFEFGGRQFNGDTGNSTTTYKAIQ 306
V+ + +A+Q
Sbjct: 208 ------VATMWESKGKITYLKG----------------------------------EAMQ 227

Query: 307 TDASLNPGNSGGALINMNGEVIGINSA 333
D S GNSG + N EVIGI+
Sbjct: 228 YDLSTTGGNSGSPVFNEKNEVIGIHWG 254


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
C5746_22400  IGASERPTASE  30     0.034    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 29.6 bits (66), Expect = 0.034
Identities = 15/68 (22%), Positives = 20/68 (29%)

Query: 161 QPAIRPAQPAQPVQQPAPATAETGYLPPHPQPQAASGSYPLPPEAPVAPVAAEPVRAGAP 220
QP PA+ P ++T QP + S P V P
Sbjct: 1140 QPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENP 1199

Query: 221 ADAAPAAT 228
+ PA T
Sbjct: 1200 ENTTPATT 1207


63  C5746_23050  C5746_23145  Y  N  N  Genomic Island
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_23050  3       10       1.899603   haloacid dehalogenase
C5746_23055  0       15       0.803190   hypothetical protein
C5746_23060  3       11       -1.902458  transcriptional regulator
C5746_23065  4       12       -2.033229  DUF397 domain-containing protein
C5746_23075  1       13       -0.822186  alpha/beta hydrolase
C5746_23080  0       10       -0.610457  hypothetical protein
C5746_23085  -1      12       -0.197823  hypothetical protein
C5746_23090  0       13       1.089675   hypothetical protein
C5746_23095  1       10       2.381024   hypothetical protein
C5746_23100  1       13       2.987799   hypothetical protein
C5746_23105  1       12       2.749520   cold-shock protein
C5746_23110  2       11       3.193741   1,4-dihydroxy-6-naphthoate synthase
C5746_23115  2       11       3.018800   futalosine hydrolase
C5746_23120  2       10       3.679950   DUF2771 domain-containing protein
C5746_23125  3       14       2.376721   MFS transporter
C5746_23130  4       11       1.497445   DUF3027 domain-containing protein
C5746_23135  3       10       0.872602   molecular chaperone Hsp90
C5746_23140  4       12       0.349950   hypothetical protein
C5746_23145  2       19       -0.344583  calcium-binding protein
64  C5746_23320  C5746_23400  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_23320  2       13       1.253412   XRE family transcriptional regulator
C5746_23325  2       12       1.064536   hypothetical protein
C5746_23335  2       12       -1.032969  hypothetical protein
C5746_23340  2       16       -1.003304  hypothetical protein
C5746_23345  3       16       -1.353974  hypothetical protein
C5746_23350  1       16       -2.666489  phosphoserine transaminase
C5746_23355  2       24       -3.726384  cytochrome P450
C5746_23360  0       14       -2.278653  cytochrome
C5746_23365  1       12       -1.265575  ATP-binding protein
C5746_23370  2       14       -1.441392  DUF742 domain-containing protein
C5746_23375  2       14       -1.794805  dynein regulation protein LC7
C5746_23380  0       12       -1.365060  ATP-binding protein
C5746_23385  1       11       -1.686428  FAD-binding oxidoreductase
C5746_23390  1       14       -2.771855  hypothetical protein
C5746_23395  2       18       -3.134301  transcriptional regulator
C5746_23400  2       18       -2.857123  transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
C5746_23505  IGASERPTASE  36     5e-04    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 35.8 bits (82), Expect = 5e-04
Identities = 31/173 (17%), Positives = 53/173 (30%), Gaps = 5/173 (2%)

Query: 299 VQLRRAERAVSATNQDLTGLSGTRLGLAVV--GRLARKHGLNVSFRPSARGGTGALMMLP 356
A+ A S + + G K V A+ T +P
Sbjct: 1064 QNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVP 1123

Query: 357 QELLTRTPVPVRASAPQPASESRPEPEPTHETAATTAAEAVPAFSDSPASETTGNLPRFG 416
+ +P ++ QP +E E +PT + A ++ PA ET+ N+ +
Sbjct: 1124 KVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPV 1183

Query: 417 ESGLPKRPRGRTLAAAEARTNTNGATETPRPRTADPKEQAARFSSFRQAVRAN 469
+ E NT AT P + + R ++V N
Sbjct: 1184 TESTTVN---TGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHN 1233


65  C5746_24990  C5746_25030  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_24990  2       12       2.589783   sugar ABC transporter permease
C5746_24995  2       11       2.664445   sugar ABC transporter permease
C5746_25000  2       12       3.815330   ABC transporter substrate-binding protein
C5746_25005  2       11       4.044905   DNA-binding transcriptional regulator
C5746_25010  0       11       2.784705   hypothetical protein
C5746_25015  0       9        3.176486   short-chain dehydrogenase
C5746_25020  1       10       2.427911   MOSC domain-containing protein
C5746_25025  2       9        1.892369   LysR family transcriptional regulator
C5746_25030  2       10       2.011009   WhiB family transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
C5746_25145  MALTOSEBP  42     3e-06    Maltose binding protein signature.
		>MALTOSEBP#Maltose binding protein signature.

Length = 396

Score = 42.4 bits (99), Expect = 3e-06
Identities = 59/261 (22%), Positives = 97/261 (37%), Gaps = 18/261 (6%)

Query: 84 KISNAVKAGNAPDLVSIEYPQLPEYVSQGALQDI--GQYFTDDIKKKLLPQAVELTTLGG 141
K G+ PD++ + + Y G L +I + F D KL P + G
Sbjct: 72 KFPQVAATGDGPDIIFWAHDRFGGYAQSGLLAEITPDKAFQD----KLYPFTWDAVRYNG 127

Query: 142 KNWAVPFDASPQSFYYRKDLFEKYGVEVPKTWDEFRKAAEKIKKADKKARIGTFFPDDPT 201
K A P S Y KDL PKTW+E A +K KA K+ + +
Sbjct: 128 KLIAYPIAVEALSLIYNKDLLPN----PPKTWEEI-PALDKELKAKGKSALMFNLQEPYF 182

Query: 202 TFQAMAWQAGAQWYKPEG--DTWKVSTADAATNKVADYWQGLLDDDLIRANASFSPEWTN 259
T+ +A G + G D V +A + L+ + + A+ +S
Sbjct: 183 TWPLIAADGGYAFKYENGKYDIKDVGVDNAGAKAGLTFLVDLIKNKHMNADTDYSIA-EA 241

Query: 260 SLKNGGTVGYLGAAWGAGVLKGTLPEQSGKWAVAPMPSWDGKPASGMLGGSTFAVTKTSK 319
+ G T + W + + V +P++ G+P+ +G + + S
Sbjct: 242 AFNKGETAMTINGPWAWS----NIDTSKVNYGVTVLPTFKGQPSKPFVGVLSAGINAASP 297

Query: 320 KAEAAVEFATWMSTTEEGIKA 340
E A EF T+EG++A
Sbjct: 298 NKELAKEFLENYLLTDEGLEA 318


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_25160  DHBDHDRGNASE  85     6e-22    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 85.5 bits (211), Expect = 6e-22
Identities = 59/186 (31%), Positives = 85/186 (45%), Gaps = 8/186 (4%)

Query: 4 ALITGATAGIGAAFARRLAADGHNLVLVARNTERLR--EQATELHDRHGIEAEVLTADLS 61
A ITGA GIG A AR LA+ G ++ V N E+L + + RH AE AD+
Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARH---AEAFPADVR 67

Query: 62 EDKGIAAVEARLTDRHQSVDLLVNNAGFGNKGRYLDVPMADELNMLKVHCEAVLRLTSAA 121
+ I + AR+ +D+LVN AG G + + V+ V + +
Sbjct: 68 DSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127

Query: 122 AAGMRERGRGGVVNVASVAAFVPR---GTYGASKAWVVQFTQGAAKDLAGSGVRLMALCP 178
+ M +R G +V V S A VPR Y +SKA V FT+ +LA +R + P
Sbjct: 128 SKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187

Query: 179 GFVRTE 184
G T+
Sbjct: 188 GSTETD 193


66  C5746_26020  C5746_26100  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_26020  2       17       1.387787   polyprenyl diphosphate synthase
C5746_26025  3       16       1.032465   hypothetical protein
C5746_26030  2       15       0.837095   hypothetical protein
C5746_26035  1       16       1.894130   hypothetical protein
C5746_26040  2       13       0.978951   hypothetical protein
C5746_26045  2       13       -0.716082  hypothetical protein
C5746_26055  2       13       -1.812847  peptide ABC transporter ATP-binding protein
C5746_26060  1       13       -1.940844  hypothetical protein
C5746_26065  0       15       -2.111697  hypothetical protein
C5746_26070  0       15       1.337176   NADP-specific glutamate dehydrogenase
C5746_26075  1       15       2.099498   hypothetical protein
C5746_26080  2       17       2.346279   peptide-methionine (S)-S-oxide reductase
C5746_26085  2       17       2.364824   hypothetical protein
C5746_26090  2       17       2.437570   peptidase
C5746_26100  2       16       2.486588   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_26170  ABC2TRNSPORT  29     0.043    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 28.7 bits (64), Expect = 0.043
Identities = 22/91 (24%), Positives = 40/91 (43%), Gaps = 13/91 (14%)

Query: 320 LASGVGKWLSIV--VLTAAFLVAGLLTSSAVSR-----------RVREFGTLKALGWKSG 366
L +G+G + V V AFL AG++ +SA++ R+ T +A+ +
Sbjct: 49 LGAGLGVMVGRVGGVSYTAFLAAGMVATSAMTAATFETIYAAFGRMEGQRTWEAMLYTQL 108

Query: 367 RVTRQVVGEALVNGLLGGVLGISVGLAGAYV 397
R+ V+GE + G +G+ A +
Sbjct: 109 RLGDIVLGEMAWAATKAALAGAGIGVVAAAL 139


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_26175  PF05272  32     0.002    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 32.4 bits (73), Expect = 0.002
Identities = 12/31 (38%), Positives = 19/31 (61%)

Query: 34 LVIQGPTGGGKSTLLQMLGGLDRPTSGSVEL 64
+V++G G GKSTL+ L GLD + ++
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDI 629


67  C5746_26455  C5746_26590  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_26455  2       18       -2.510350  hypothetical protein
C5746_26460  3       17       -2.163076  LLM class F420-dependent oxidoreductase
C5746_26465  2       20       -2.184377  protein phosphatase
C5746_26470  0       21       -1.874052  3-methyl-2-oxobutanoate
C5746_26475  0       20       -2.208514  mini-circle protein
C5746_26480  -1      20       -2.324189  hypothetical protein
C5746_26485  -2      19       -3.094169  hypothetical protein
C5746_26490  0       17       -3.501261  hypothetical protein
C5746_26495  1       18       -4.645200  DUF3592 domain-containing protein
C5746_26500  1       22       -6.562873  hypothetical protein
C5746_26505  1       22       -5.608674  hypothetical protein
C5746_26510  1       21       -5.759789  hypothetical protein
C5746_26515  1       23       -5.263767  dehydrogenase
C5746_26520  2       27       -4.965010  helix-turn-helix domain-containing protein
C5746_26525  1       25       -4.220759  hypothetical protein
C5746_26530  2       23       -3.791892  hypothetical protein
C5746_26535  4       21       -3.742268  prepilin peptidase
C5746_26540  5       21       -2.995675  SAM-dependent methyltransferase
C5746_26545  3       17       -1.472925  hypothetical protein
C5746_26550  1       15       0.239164   hypothetical protein
C5746_26555  2       14       -0.343024  hypothetical protein
C5746_26560  1       15       0.046651   L-glyceraldehyde 3-phosphate reductase
C5746_26565  0       14       0.077285   isoprenyl transferase
C5746_26570  -1      15       0.501672   PhoH family protein
C5746_26575  0       12       0.284378   transglycosylase domain-containing protein
C5746_26580  2       12       -0.377058  AI-2E family transporter
C5746_26585  1       12       0.421568   alkyl hydroperoxide reductase
C5746_26590  2       11       0.441587   peroxiredoxin
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_26625  cloacin  40     4e-05    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 39.7 bits (92), Expect = 4e-05
Identities = 36/123 (29%), Positives = 48/123 (39%), Gaps = 7/123 (5%)

Query: 220 GGDVVAGDGGRSGGTESGGTESGGSGSGSGGPDSGGAVASGGTDSGGADTAGGSDAAGGS 279
GGD G G +G + G +GG G GGA G S GGS +
Sbjct: 3 GGD---GRGHNTGAHSTSGNINGGPTGLGVG---GGASDGSGWSSENNPWGGGSGSGIHW 56

Query: 280 HAAGGSDAAGGSNAAGGSNAAGGSDAAGGSNAAGGSNAAGGSDAAGGSGSGSSGGGDTAT 339
G GG+ +GG + GG+ +A + A G A + AGG S G +A
Sbjct: 57 GGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPAL-STPGAGGLAVSISAGALSAA 115

Query: 340 QVD 342
D
Sbjct: 116 IAD 118



Score = 37.0 bits (85), Expect = 3e-04
Identities = 27/101 (26%), Positives = 39/101 (38%), Gaps = 4/101 (3%)

Query: 241 SGGSGSGSGGPDSGGAVASGGTDSGGADTAGGSDAAGGSHA----AGGSDAAGGSNAAGG 296
SGG G G + G +G G SD +G S GGS + G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 297 SNAAGGSDAAGGSNAAGGSNAAGGSDAAGGSGSGSSGGGDT 337
GG+ +GG + GG+ +A + A G + S+ G
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 32.0 bits (72), Expect = 0.012
Identities = 19/78 (24%), Positives = 27/78 (34%), Gaps = 1/78 (1%)

Query: 277 GGSHAAGGSDAAGGSNAAGGSNAAGGSDAAGGSNAAGGSNAAGGSDAAGGSGSGSSGGGD 336
G+H+ G + GG G A N G + G GGSG G+ GG
Sbjct: 11 TGAHSTSG-NINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNG 69

Query: 337 TATQVDDGGDDMDTSGEP 354
+ G ++ P
Sbjct: 70 NSGGGSGTGGNLSAVAAP 87


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_26675  PREPILNPTASE  71     2e-16    Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 70.6 bits (173), Expect = 2e-16
Identities = 59/173 (34%), Positives = 82/173 (47%), Gaps = 9/173 (5%)

Query: 93 LVPVVTALACAVLAAATGLRPELAVWLLLAPVGVLLATIDRRVHRLPDRLTLP--AAGAV 150
LV ++TAL +A LLL V V L ID LPD+LTLP G +
Sbjct: 112 LVELLTALLSVAVAMTLAPGWGTLAALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLL 171

Query: 151 VVLLGVAALLPEHGGSWLSALLGGAALGAFYFLL-FLINPNGMGFGDVKLALSLGAALGW 209
LLG L + + + A+ G L + Y+ L GMG+GD KL +LGA LGW
Sbjct: 172 FNLLGGFVSLGD---AVIGAMAGYLVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGW 228

Query: 210 YGWAVVFAGGFAGFLLGAVYGFGLMVLKRAGRKTGIPFGPFMITGALLGILLG 262
+V L+GA G GL++L+ + IPFGP++ + +L G
Sbjct: 229 QALPIVL---LLSSLVGAFMGIGLILLRNHHQSKPIPFGPYLAIAGWIALLWG 278


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
C5746_26715  IGASERPTASE  32     0.003    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 31.6 bits (71), Expect = 0.003
Identities = 31/132 (23%), Positives = 52/132 (39%), Gaps = 11/132 (8%)

Query: 33 AVDDNNFEATAADTTLLADIPAGQQAQVQTASLTQQAD--AQASAADAAAKKSVE----- 85
A AT ++TT + Q+++ + + AQ AK +V+
Sbjct: 1023 APVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQT 1082

Query: 86 -EAARIQA-AKDAKSKKAAADDKLEQERKAKEAKEAEERASRSSVRSASSFAQQGSYTVA 143
E A+ + K+ ++ + +E+E KAK E E+ V S S Q+ S TV
Sbjct: 1083 NEVAQSGSETKETQTTETKETATVEKEEKAKV--ETEKTQEVPKVTSQVSPKQEQSETVQ 1140

Query: 144 EIKAIARQIVPA 155
AR+ P
Sbjct: 1141 PQAEPARENDPT 1152


68  C5746_26855  C5746_26915  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias    %GCBias  Product
C5746_26855       2       13  -0.592629  acetylxylan esterase
C5746_26860       4       17  -1.646573  LacI family transcriptional regulator
C5746_26865       3       15  -1.845487  hypothetical protein
C5746_26870       4       17  -1.497236  translational GTPase TypA
C5746_26875       3       17  -1.992413  hypothetical protein
C5746_26880       4       16  -1.174985  peptide ABC transporter substrate-binding
C5746_26885       6       13  -1.868333  ABC transporter permease
C5746_26890       4        8   0.206790  peptide ABC transporter permease
C5746_26895       4        9   0.854676  methionine ABC transporter ATP-binding protein
C5746_26900       4        9   0.658931  peptide ABC transporter ATP-binding protein
C5746_26905       3        9   0.808505  dipeptide/oligopeptide/nickel ABC transporter
C5746_26910       2       11   1.007672  peptide ABC transporter ATP-binding protein
C5746_26915       2       12   1.368795  ABC transporter
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
C5746_27015  TCRTETOQM  154    4e-42    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 154 bits (391), Expect = 4e-42
Identities = 89/463 (19%), Positives = 169/463 (36%), Gaps = 73/463 (15%)

Query: 7 IRNVAIVAHVDHGKTTLVDAMLRQAGAFAAHAAENLDERMMDSNDLEREKGITILAKNTA 66
I N+ ++AHVD GKTTL +++L +GA + + D+ LER++GITI T+
Sbjct: 3 IINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITS 62

Query: 67 VKYHPKDGGDPITINIIDTPGHADFGGEVERGLSMVDAVVLLVDASEGPLPQTRFVLRKA 126
++ +NIIDTPGH DF EV R LS++D +LL+ A +G QTR +
Sbjct: 63 FQWEN------TKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHAL 116

Query: 127 LAAKMPVILCINKTDR-------------------------------------PDSRIAE 149
+P I INK D+ +S +
Sbjct: 117 RKMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWD 176

Query: 150 VVDETYDLFLDLDADEDQIEFPIVYACARDGVASLTKPEDGTV-------PQDSENLEPF 202
V E D L+ +E + + ++ +++ ++
Sbjct: 177 TVIEGNDDLLEKYMSGKSLEALELEQEESIRFH------NCSLFPVYHGSAKNNIGIDNL 230

Query: 203 FSTILSHVPAPEYDDEAPLQAHVTNLDADNFLGRIALCRVEQGELRKGQTVTWIKRDGTM 262
I + + + ++ L V ++ R+A R+ G L +V +++
Sbjct: 231 IEVITNKFYSSTHRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKE--- 287

Query: 263 SNVRITELLMTEALTRKPAEKAGPGDICAIAGIPDIMIGETLADPENPIALPLITVDEPA 322
++ITE+ + +KA G+I + + + L D + I P
Sbjct: 288 -KIKITEMYTSINGELCKIDKAYSGEIVILQN-EFLKLNSVLGDTKLLPQRERIENPLPL 345

Query: 323 ISMTIGTNTSPLVGKGGKGHKVTARQVKDRLDRELIGNVSLRVLDTERPDAWEVQGRGEL 382
+ T+ + + D L + LR + G++
Sbjct: 346 LQTTVEPS-----------KPQQREMLLDALLEISDSDPLLRYYVDSATHEIILSFLGKV 394

Query: 383 ALAILVEQMRRE-GFELTVGKPEVVTKQVDGKTHEPIERMTID 424
+ + ++ + E+ + +P V+ + K E + +
Sbjct: 395 QMEVTCALLQEKYHVEIEIKEPTVIYMERPLKKAEYTIHIEVP 437



Score = 40.2 bits (94), Expect = 2e-05
Identities = 16/89 (17%), Positives = 33/89 (37%), Gaps = 1/89 (1%)

Query: 408 KQVDGKTHEPIERMTIDSPEEHLGAITQLMATRKGRMETMTNHGSGWVRMEWIVPSRGLI 467
K+ + EP I +P+E+L + V + +P+R +
Sbjct: 529 KKAGTELLEPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQLKN-NEVILSGEIPARCIQ 587

Query: 468 GFRTEFLTQTRGTGIAHSIFEGHEPWFGE 496
+R++ T G + + +G+ GE
Sbjct: 588 EYRSDLTFFTNGRSVCLTELKGYHVTTGE 616


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
C5746_27040  HTHFIS  29     0.040    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 28.6 bits (64), Expect = 0.040
Identities = 10/19 (52%), Positives = 15/19 (78%)

Query: 55 TLAVLGESGSGKSVTAQAI 73
TL + GESG+GK + A+A+
Sbjct: 162 TLMITGESGTGKELVARAL 180


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
C5746_27045  HTHFIS  29     0.027    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 29.4 bits (66), Expect = 0.027
Identities = 26/95 (27%), Positives = 33/95 (34%), Gaps = 11/95 (11%)

Query: 9 GSLDSTPNVTDVVEVEAADETAAVAAIEAPVERGEPILQVRNLVKHFPLSQGILFKRQIG 68
G+ D P D+ E+ A P + + LV Q I R +
Sbjct: 97 GAYDYLPKPFDLTELIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIY--RVLA 154

Query: 69 AVKAVDGVSFDLYQGETLGIVGESGCGKSTVARLL 103
+ D TL I GESG GK VAR L
Sbjct: 155 RLMQTDL---------TLMITGESGTGKELVARAL 180


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_27050  BCTERIALGSPD  30     0.024    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 29.5 bits (66), Expect = 0.024
Identities = 22/85 (25%), Positives = 28/85 (32%), Gaps = 8/85 (9%)

Query: 245 DLYGNPRHPYTRALLSAVPEATADEAPARERIRLAGDVPSPVNPPSGCRFRTRCWKATDK 304
D+YG +L V A A A V S P G TR T+
Sbjct: 86 DVYGFAVINMNNGVLKVVRSKDAKTA--------AVPVASDAAPGIGDEVVTRVVPLTNV 137

Query: 305 CASEAPPLVRVEGSREGHLTACHYP 329
A + PL+R G + HY
Sbjct: 138 AARDLAPLLRQLNDNAGVGSVVHYE 162


69  C5746_27010  C5746_27070  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias    %GCBias  Product
C5746_27010      -1       16  -3.028151  GTP-binding protein
C5746_27015      -2       18  -5.008279  hypothetical protein
C5746_27020      -2       16  -5.288449  GNAT family N-acetyltransferase
C5746_27025      -2       17  -4.143850  ferredoxin family protein
C5746_27030       1       15  -3.003182  succinyldiaminopimelate transaminase
C5746_27035       2       14  -2.506194  ATP-binding protein
C5746_27040       0       17  -2.854433  hypothetical protein
C5746_27045       0       17  -3.150010  succinyl-diaminopimelate desuccinylase
C5746_27050      -1       15  -3.877386  TIGR00730 family Rossman fold protein
C5746_27055      -1       15  -4.703308  dihydropteroate synthase
C5746_27060      -1       15  -4.617331  DNA-3-methyladenine glycosylase I
C5746_27065       0       15  -4.608404  enoyl-CoA hydratase
C5746_27070       1       12  -3.917370  DUF3117 domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
C5746_27170  TCRTETOQM  631    0.0      Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 631 bits (1630), Expect = 0.0
Identities = 221/667 (33%), Positives = 345/667 (51%), Gaps = 32/667 (4%)

Query: 1 MHTLNLGILAHVDAGKTSLTERLLHTAGVIDEIGSVDDGSTRTDSLALERQRGITIKSAV 60
M +N+G+LAHVDAGKT+LTE LL+ +G I E+GSVD G+TRTD+ LERQRGITI++ +
Sbjct: 1 MKIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGI 60

Query: 61 VSFAIDDITVNLIDTPGHPDFIAEVERVLNVLDGAVLVISAVEGVQAQTRVLMRTLQRLR 120
SF ++ VN+IDTPGH DF+AEV R L+VLDGA+L+ISA +GVQAQTR+L L+++
Sbjct: 61 TSFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMG 120

Query: 121 IPTLIFVNKVDRGGAQDESLLRSISEKLTPAVLAMGSV-DGPGGRDARCTPYTAADARFT 179
IPT+ F+NK+D+ G ++ + I EKL+ ++ V P + T +T ++
Sbjct: 121 IPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYP---NMCVTNFTESEQ--- 174

Query: 180 DRLTELLADHDDALLAAYVENAAPLPYSRLREALSTQTRQALVHPVFFGSAITGAGVDAL 239
+ + + +D LL Y+ + L + S + + PV+ GSA G+D L
Sbjct: 175 ---WDTVIEGNDDLLEKYMSGKSLEA-LELEQEESIRFHNCSLFPVYHGSAKNNIGIDNL 230

Query: 240 ISGVRELLPVGEGDADGPVSGTVFKVERGPAGEKIAYVRIFSGTVRTRDRLPFGRGEGRD 299
I + + G VFK+E +++AY+R++SG + RD + R ++
Sbjct: 231 IEVITNKFYSSTHRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSV---RISEKE 287

Query: 300 EGKVTAISVFDRGSDVREAAVGAGRIAKLRGLGGIRVGDAVGVSDTTAPGHW--FAPPTL 357
+ K+T + G + +G I L+ +++ +G + P L
Sbjct: 288 KIKITEMYTSINGELCKIDKAYSGEIVILQN-EFLKLNSVLGDTKLLPQRERIENPLPLL 346

Query: 358 ESVVVPCAPASRGELHFALAQLAEQDPLINLRQDDIRKEVSVSLYGEVQKEVIQATLADE 417
++ V P P R L AL ++++ DPL+ D E+ +S G+VQ EV A L ++
Sbjct: 347 QTTVEPSKPQQREMLLDALLEISDSDPLLRYYVDSATHEIILSFLGKVQMEVTCALLQEK 406

Query: 418 FGIDVTFRETTTICLERPNGSGAAYEVGDQDPNPFLATIGLRIDPAPIGSGIEYRLEVEL 477
+ +++ +E T I +ERP + PNPF A+IGL + P P+GSG++Y V L
Sbjct: 407 YHVEIEIKEPTVIYMERPLKKAEYTIHIEVPPNPFWASIGLSVSPLPLGSGMQYESSVSL 466

Query: 478 GSMPFSLMRAVEQTVGETLQQGIHGWQVTDCVVTMTHSGYWPRQSHSHAVFDKSMSSTAG 537
G + S AV + + +QG++GW VTDC + + Y+ S ST
Sbjct: 467 GYLNQSFQNAVMEGIRYGCEQGLYGWNVTDCKICFKYGLYY------------SPVSTPA 514

Query: 538 DFRNLTPLVLMSALKEAGTTVYEPMHRFRLELPADLLGPLLPVLAHLRAVPGTPAVQGAT 597
DFR L P+VL LK+AGT + EP F++ P + L A ++
Sbjct: 515 DFRMLAPIVLEQVLKKAGTELLEPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQLKNNE 574

Query: 598 CVLEGEIPAARVHELQQQLPALTRGEGVLESGFDRYRAVVGTPPGRPRTDRDPLNRKEYL 657
+L GEIPA + E + L T G V + Y G P +P R P +R + +
Sbjct: 575 VILSGEIPARCIQEYRSDLTFFTNGRSVCLTELKGYHVTTGEPVCQP---RRPNSRIDKV 631

Query: 658 LHTVRRI 664
+ +I
Sbjct: 632 RYMFNKI 638


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_27180  SACTRNSFRASE  47     2e-08    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 46.9 bits (111), Expect = 2e-08
Identities = 17/70 (24%), Positives = 31/70 (44%), Gaps = 3/70 (4%)

Query: 255 VGRCVVDGRWAGFMAVE---VGPEYRRRGLATAVMTALARKALDEGASAAWLQVETDNEG 311
+GR + W G+ +E V +YR++G+ TA++ A + L+ + N
Sbjct: 77 IGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINIS 136

Query: 312 ARALYERMGF 321
A Y + F
Sbjct: 137 ACHFYAKHHF 146


70  C5746_27285  C5746_27340  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias    %GCBias  Product
C5746_27285       2       12  -0.461063  peptidase
C5746_27290       0       11   0.036054  TIGR00374 family protein
C5746_27295       0       11  -0.210174  methylated-DNA-protein-cysteine
C5746_27300       0       12  -0.713044  DNA helicase UvrD
C5746_27305       1       11  -0.572672  hypothetical protein
C5746_27310       1       13  -0.353506  ATP-dependent DNA helicase
C5746_27315       1       13   0.619177  dipeptidase
C5746_27320       2       12   0.418629  NAD(+) diphosphatase
C5746_27325       2       12   0.330526  glutaredoxin-like protein
C5746_27330       2       12   1.149225  ATP-dependent DNA helicase
C5746_27335       2       13   1.276580  hypothetical protein
C5746_27340       2       14   0.667385  WhiB family transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_27430  PF05616  30     0.024    Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 30.1 bits (67), Expect = 0.024
Identities = 17/57 (29%), Positives = 25/57 (43%)

Query: 25 SESGTTDDPDGPPRSPIAANAAEASSSGELASQKLTWKPCPAPSPAQGGGKTPSPLP 81
S+ TT D PR + +AEA ++ L P P+P + G P+P P
Sbjct: 299 SQGNTTVDVQVIPRPDLTPGSAEAPNAQPLPEVSPAENPANNPAPNENPGTRPNPEP 355


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_27455  PF03544  34     0.003    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 33.8 bits (77), Expect = 0.003
Identities = 20/106 (18%), Positives = 32/106 (30%), Gaps = 4/106 (3%)

Query: 906 TEPPPTPAPTPHAPAPTPHVPAARTPQPAGEQRLTPEESRTLASWDRDLDALTGELRRAR 965
EP P P P P P P V P+P P+ + + RD+ +
Sbjct: 74 VEPEPEPEPIPEPPKEAPVVIEKPKPKPKP----KPKPVKKVEQPKRDVKPVESRPASPF 129

Query: 966 ATVRDVLVPASLSATQLLRLADDPDGFAQELARPMPRPPQPAARRG 1011
+S + + + L+R P+ P A
Sbjct: 130 ENTAPARPTSSTATAATSKPVTSVASGPRALSRNQPQYPARAQALR 175


71  C5746_27955  C5746_27985  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias    %GCBias  Product
C5746_27955       2       15   3.216197  lysozyme
C5746_27960       1       10   3.150387  MarR family transcriptional regulator
C5746_27965       2       12   2.920399  hypothetical protein
C5746_27970       0       10   3.998704  histidine kinase
C5746_27975       0       10   3.829499  hypothetical protein
C5746_27980       0       10   4.133206  DUF742 domain-containing protein
C5746_27985      -1       11   3.624122  ATP-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_28095  PF05616  41     2e-05    Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 41.3 bits (96), Expect = 2e-05
Identities = 26/73 (35%), Positives = 33/73 (45%), Gaps = 5/73 (6%)

Query: 820 RHERTADESPEPYAAPEPYTAPAPAPAPAPAPAQAPAPAPAPAPAPAPAPVA----DALP 875
R + T + P A P P +PA PA PAP + P P P P P P A D P
Sbjct: 312 RPDLTPGSAEAPNAQPLPEVSPAENPANNPAPNENPGTRPNPEPDPDLNPDANPDTDGQP 371

Query: 876 GPRQPEQHSQPQQ 888
G R P+ + P +
Sbjct: 372 GTR-PDSPAVPDR 383



Score = 35.5 bits (81), Expect = 0.001
Identities = 22/61 (36%), Positives = 25/61 (40%), Gaps = 3/61 (4%)

Query: 829 PEPYAAPEPYTAPAPAPAPAPAPAQAPA--PAPAPAPAPAPAPVADALPGP-RQPEQHSQ 885
P P P AP P P +PA+ PA PAP P P P D P P+ Q
Sbjct: 311 PRPDLTPGSAEAPNAQPLPEVSPAENPANNPAPNENPGTRPNPEPDPDLNPDANPDTDGQ 370

Query: 886 P 886
P
Sbjct: 371 P 371


72  C5746_28220  C5746_28280  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias    %GCBias  Product
C5746_28220       1       13  -3.012790  diaminopimelate decarboxylase
C5746_28225       1       13  -3.180101  homoserine dehydrogenase
C5746_28230       1       13  -1.517865  threonine synthase
C5746_28240      -1       11  -0.845155  homoserine kinase
C5746_28245      -1       12  -1.057434  transcription termination factor Rho
C5746_28250      -1       14  -0.679579  transcriptional regulator
C5746_28255      -1       14  -0.215053  50S ribosomal protein L31
C5746_28260      -1       11  -2.154253  peptide chain release factor 1
C5746_28265       0       22  -4.055565  peptide chain release factor N(5)-glutamine
C5746_28270       0       25  -3.865318  threonylcarbamoyl-AMP synthase
C5746_28275       2       29  -4.339270  protein-tyrosine-phosphatase
C5746_28280       4       28  -5.994017  serine hydroxymethyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
C5746_28370  IGASERPTASE  41     2e-05    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 40.8 bits (95), Expect = 2e-05
Identities = 36/206 (17%), Positives = 63/206 (30%), Gaps = 20/206 (9%)

Query: 2 SDTTDLMGVTADKNVDSAAPAEGAATGTTARRRRSGTGLDGMVLAELQQ---VASGLGIK 58
S+TT+ + + + + E AT TTA+ R V A Q SG K
Sbjct: 1034 SETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETK 1093

Query: 59 GTARMRKGQLIEVIKEAQAGSSAAPKAAAPAADAETKPKRRATSKARTGDDSAAAAPAEK 118
T + V KE +A P ++ PK+ ++ T A A
Sbjct: 1094 ETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQE---QSETVQPQAEPAREND 1150

Query: 119 AA--AQQQIDIPGQPASDDQPTGERRR------------RRATAQAGSPETKAESKPAAE 164
++ A +QP E + +PE +
Sbjct: 1151 PTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPT 1210

Query: 165 AKAQTQAEPKSDDRTDAKAEVAVDTA 190
+++ +PK+ R ++
Sbjct: 1211 VNSESSNKPKNRHRRSVRSVPHNVEP 1236



Score = 36.2 bits (83), Expect = 5e-04
Identities = 21/120 (17%), Positives = 39/120 (32%), Gaps = 5/120 (4%)

Query: 70 EVIKEAQAGSSAAPKAAAPAAD----AETKPKRRATSKARTGDDSAAAAPAEKAAAQQQI 125
E I P A P+ AE + T + D + A + A + +
Sbjct: 1015 EEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKS 1074

Query: 126 DIPGQP-ASDDQPTGERRRRRATAQAGSPETKAESKPAAEAKAQTQAEPKSDDRTDAKAE 184
++ ++ +G + T + T + + A +TQ PK + K E
Sbjct: 1075 NVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQE 1134


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_28375  PF06580  30     0.018    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 29.8 bits (67), Expect = 0.018
Identities = 18/72 (25%), Positives = 28/72 (38%), Gaps = 9/72 (12%)

Query: 245 RIQLQQAFIKALMEQVKSVGVFSNPKTLFDLANTATKAITTDSDLGSVEQLTGFANGLK- 303
Q+A + AL Q+ NP +F+ N I D + E LT + ++
Sbjct: 155 ASMAQEAQLMALKAQI-------NPHFMFNALNNIRALILEDPT-KAREMLTSLSELMRY 206

Query: 304 GLASKNVHMVTL 315
L N V+L
Sbjct: 207 SLRYSNARQVSL 218


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
C5746_28405  GPOSANCHOR  30     0.023    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 29.6 bits (66), Expect = 0.023
Identities = 16/39 (41%), Positives = 19/39 (48%)

Query: 275 QGGAQMHTVAAKAVAFGEAATPAFTAYAHRVVAHARVLA 313
Q A M + + GE A P FTA A V+A A V A
Sbjct: 493 QNKAPMKETKRQLPSTGETANPFFTAAALTVMATAGVAA 531


73  C5746_28510  C5746_28550  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias    %GCBias  Product
C5746_28510       2       12  -0.308692  Ser or Arg-related nuclear matrix protein
C5746_28515       2       11  -0.663346  hypothetical protein
C5746_28520       2       11  -0.875485  hypothetical protein
C5746_28525       3       13  -1.079122  hypothetical protein
C5746_28530       7       11   0.950208  thiamine-binding protein
C5746_28535       6       12   0.876903  transcriptional regulator
C5746_28540       6       14   0.725180  MFS transporter
C5746_28545       7       14   1.542269  MarR family transcriptional regulator
C5746_28550       4       13   2.220640  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_28605  PRTACTNFAMLY  28     0.035    Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 28.5 bits (63), Expect = 0.035
Identities = 21/72 (29%), Positives = 26/72 (36%), Gaps = 1/72 (1%)

Query: 177 VMGGLR-SLTGIGGTSGEEHQFEFVGAGTVLLQSTEILMPEQPTGATPAQAGVPGGAGQP 235
V+G +L G T G + V LQ I + P G VPGGA
Sbjct: 222 VLGASELTLDGGHITGGRAAGVAAMQGAVVHLQRATIRRGDAPAGGAVPGGAVPGGAVPG 281

Query: 236 GSAPRLPGQLGD 247
G P G + D
Sbjct: 282 GFGPGGFGPVLD 293


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_28635  PF03544  33     0.002    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 33.0 bits (75), Expect = 0.002
Identities = 11/87 (12%), Positives = 18/87 (20%), Gaps = 1/87 (1%)

Query: 6 PTPAGGPRDEAASSPHDPSAAPRAAHAGPAGPASASAPPTERADAIDPNPSPALRAAAPA 65
P A P E P +P P A P + +
Sbjct: 62 PPQAVQPPPEPVVEP-EPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKP 120

Query: 66 ADATPDGPAPGMSRGYRAVFAVREFRA 92
++ P P + +
Sbjct: 121 VESRPASPFENTAPARPTSSTATAATS 147


74  C5746_28605  C5746_28640  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias    %GCBias  Product
C5746_28605       2       16  -0.667773  acetate kinase
C5746_28615       4       13   1.032215  phosphate acetyltransferase
C5746_28620       5       13   0.756855  6-phosphofructokinase
C5746_28625       5       14   0.754937  hypothetical protein
C5746_28630       5       11   0.919254  transcriptional regulator
C5746_28635       2       12   0.499484  two-component system response regulator
C5746_28640       2       13  -0.900535  histidine kinase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_28695  ACETATEKNASE  485    e-174    Acetate kinase family signature.
		>ACETATEKNASE#Acetate kinase family signature.

Length = 400

Score = 485 bits (1251), Expect = e-174
Identities = 185/401 (46%), Positives = 255/401 (63%), Gaps = 9/401 (2%)

Query: 16 RVLVLNSGSSSVKYQLLDMSDRSRLAVGLVERIGEETSRLVHTPLAGSGAESRERIGPIA 75
++LV+N GSSS+KYQL++ D + LA GL ERIG S L H + E + +
Sbjct: 2 KILVINCGSSSLKYQLIESKDGNVLAKGLAERIGINDSLLTHN----ANGEKIKIKKDMK 57

Query: 76 DHEAALKAAAGELAADGLGLDSP--ALAAIGHRVVHGGLRFTEPVVIDDEVLKEIERLVP 133
DH+ A+K L G+ + A+GHRVVHGG FT V+I D+VLK I +
Sbjct: 58 DHKDAIKLVLDALVNSDYGVIKDMSEIDAVGHRVVHGGEYFTSSVLITDDVLKAITDCIE 117

Query: 134 VAPLHNPANIVGIRTAQALRPDLPQVAVFDTAFHTTMPEYAARYAIDVETADAHRIRRYG 193
+APLHNPANI GI+ + PD+P VAVFDTAFH TMP+YA Y I E ++IR+YG
Sbjct: 118 LAPLHNPANIEGIKACTQIMPDVPMVAVFDTAFHQTMPDYAYLYPIPYEYYTKYKIRKYG 177

Query: 194 FHGTSHAYVSRKAAELLGRTPEEVNVIVLHLGNGASASAVAGGRCVETSMGLTPLEGLVM 253
FHGTSH YVS++AAE+L + E + +I HLGNG+S +AV G+ ++TSMG TPLEGL M
Sbjct: 178 FHGTSHKYVSQRAAEILNKPIESLKIITCHLGNGSSIAAVKNGKSIDTSMGFTPLEGLAM 237

Query: 254 GTRSGDIDPAVTFHLKRVAGMSTDEIDVLLNKKSGLVGLCG-DNDMREIRRR-IDEGDER 311
GTRSG IDP++ +L +S +E+ +LNKKSG+ G+ G +D R++ GD+R
Sbjct: 238 GTRSGSIDPSIISYLMEKENISAEEVVNILNKKSGVYGISGISSDFRDLEDAAFKNGDKR 297

Query: 312 AALAFDIYVHRLKKYIGAYSAVLGRVDAVVFTAGVGENSAPVREAAIAGLEEFGLAVDAD 371
A LA +++ +R+KK IG+Y+A +G VD +VFTAG+GEN +RE + GLE G +D +
Sbjct: 298 AQLALNVFAYRVKKTIGSYAAAMGGVDVIVFTAGIGENGPEIREFILDGLEFLGFKLDKE 357

Query: 372 LNAARSGAPRLISPDHARVAVAVVPTDEELEIAVQTFALIG 412
N G +IS ++V V VVPT+EE IA T ++
Sbjct: 358 KN-KVRGEEAIISTADSKVNVMVVPTNEEYMIAKDTEKIVE 397


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_28710  PF05272  33     0.002    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 32.7 bits (74), Expect = 0.002
Identities = 22/46 (47%), Positives = 27/46 (58%), Gaps = 2/46 (4%)

Query: 94 VHGQRAATARA-VEDEGGTEEEAGEVPGAPAGE-SPEPPAGPPPAR 137
+HG + + A A V E G E AG V GAPAG +P+PP PP R
Sbjct: 81 IHGLKVSKAAAQVAREEGLESVAGIVMGAPAGAPAPKPPRPEPPPR 126


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
C5746_28720  HTHFIS  68     2e-15    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 68.3 bits (167), Expect = 2e-15
Identities = 33/179 (18%), Positives = 65/179 (36%), Gaps = 16/179 (8%)

Query: 4 VLVVEDDPVAADAHQLYVGRVPGFTVAAVAHSRAEAVRALDRTPVDLLLLDLYLPDGHGL 63
+LV +DD + R G+ V + + A R + DL++ D+ +PD +
Sbjct: 6 ILVADDDAAIRTVLNQALSRA-GYDVRITS-NAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 64 QLLRSLRAAGHSADVIAVTSARDLTVVREGVSLGVVQYVLKPFTFATLRDRLVRYAEFRA 123
LL ++ A V+ +++ + G Y+ KPF L + R A
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR-ALAEP 122

Query: 124 AAGEASGQDEVDRALGVLRAPQPTRLPKGLSGPTLDAVTRVLRAAPDGVT---SGATGV 179
+ +D+ + ++ G S + + R +T +G +G
Sbjct: 123 KRRPSKLEDDSQDGMPLV----------GRSAAMQEIYRVLARLMQTDLTLMITGESGT 171


75  C5746_28705  C5746_28775  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias    %GCBias  Product
C5746_28705       2       13   1.215696  GNAT family N-acetyltransferase
C5746_28710       1       15   1.461903  NIPSNAP family protein
C5746_28715       2       16   0.956748  helicase
C5746_28720       1       12  -0.185350  1,4-alpha-glucan branching enzyme
C5746_28725      -1       12  -0.096743  maltokinase
C5746_28730      -2       17  -2.036262  maltose alpha-D-glucosyltransferase
C5746_28735      -1       23  -2.019914  alpha-1,4-glucan--maltose-1-phosphate
C5746_28740      -1       24  -2.164284  DUF3417 domain-containing protein
C5746_28745       0       26  -2.701308  peptidase M4
C5746_28750      -1       22  -2.921812  GntR family transcriptional regulator
C5746_28755      -1       22  -4.119308  MFS transporter
C5746_28760      -1       22  -3.966861  alcohol dehydrogenase
C5746_28765      -1       21  -3.828512  hypothetical protein
C5746_28770       0       21  -4.094403  ABC transporter
C5746_28775       0       22  -3.279162  ABC transporter
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_28790  SACTRNSFRASE  33     4e-04    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 32.6 bits (74), Expect = 4e-04
Identities = 11/54 (20%), Positives = 20/54 (37%), Gaps = 1/54 (1%)

Query: 93 ISLRPAFRGRGLGVDVLQALCEYGFTVRGLQRLQIETLTDNAPMIAAATRVGFT 146
I++ +R +G+G +L E+ L +ET N + F
Sbjct: 95 IAVAKDYRKKGVGTALLHKAIEWAKE-NHFCGLMLETQDINISACHFYAKHHFI 147


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_28810  PF03544  33     0.003    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 33.4 bits (76), Expect = 0.003
Identities = 17/85 (20%), Positives = 24/85 (28%), Gaps = 7/85 (8%)

Query: 11 PAATEVPQVPLLTPAETAPPAAPAKAAT------PPSSEPAAPAKRKKPAGERAAKTGDG 64
PA E PQ P P + P E P + KP + +
Sbjct: 57 PADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKR 116

Query: 65 -AAPPRPRRGTGSQGVRQARPLGNG 88
P R + + ARP +
Sbjct: 117 DVKPVESRPASPFENTAPARPTSST 141


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
C5746_28835  THERMOLYSIN  258    6e-80    Thermolysin metalloprotease (M4) family signature.
		>THERMOLYSIN#Thermolysin metalloprotease (M4) family signature.

Length = 544

Score = 258 bits (659), Expect = 6e-80
Identities = 176/573 (30%), Positives = 251/573 (43%), Gaps = 76/573 (13%)

Query: 19 AALLAVGVQTGTATATPGSATAAATAGANR-----GALAKQLTPSQRAELIREADASKAA 73
A L A+G+ G G++ + N ++ L EL+ +
Sbjct: 5 AMLGAIGLAFGLMAWPFGASAKGKSMVWNEQWKTPSFVSGSLLGRCSQELVYRY-LDQEK 63

Query: 74 TAKELGLGSQEKLVVRDVIQDNDGTTHTRYERTFAGLPVLGGDLVVQETKAGATE-SVTK 132
+LG ++E+L + D G T R+E+ A +G LV + S T
Sbjct: 64 NTFQLGGQARERLSLIGNKLDELGHTVMRFEQAIAASLCMGAVLVAHVNDGELSSLSGTL 123

Query: 133 ASKVSSGQLKAVDTTADVAPAVAQKQALGLAKADGSKKTAADRAP-------RKVVWMAQ 185
+ LK A++ +QA +AK D + + +R R V++ +
Sbjct: 124 IPNLDKRTLK-------TEAAISIQQAEMIAKQDVADRVTKERPAAEEGKPTRLVIYPDE 176

Query: 186 GKPQLAYETVVGGLQEDGTPNELHVITDASTGAKLYEWQGVEN---------------GT 230
P+LAYE V L P + DA+ G L +W ++ G
Sbjct: 177 ETPRLAYEVNVRFLTPV--PGNWIYMIDAADGKVLNKWNQMDEAKPGGAQPVAGTSTVGV 234

Query: 231 GNTQYNGQVTLGTAPS-----YTLTDTGRGNH-KTYNLNHGSSGTGTLFTNSTDVWGNGN 284
G Q + T S Y L D RG+ TY+ + + G+L+ + + +
Sbjct: 235 GRGVLGDQKYINTTYSSYYGYYYLQDNTRGSGIFTYDGRNRTVLPGSLWADGDNQFFASY 294

Query: 285 PSNAETAAADAHYGAAETWDYYKNVHGRTGIRGDGVGAYSRVHYGNAYVNAFWQDSCFCM 344
+ AA DAHY A +DYYKNVHGR G S VHYG Y NAFW S M
Sbjct: 295 ----DAAAVDAHYYAGVVYDYYKNVHGRLSYDGSNAAIRSTVHYGRGYNNAFWNGSQ--M 348

Query: 345 TYGDGEG-NLKPLT-SLDVAAHEMSHGVTAATAKLVYSGESGGLNEATSDIFAAGVEFYS 402
YGDG+G P + +DV HE++H VT TA LVY ESG +NEA SDIF VEFY+
Sbjct: 349 VYGDGDGQTFLPFSGGIDVVGHELTHAVTDYTAGLVYQNESGAINEAMSDIFGTLVEFYA 408

Query: 403 NTAEDPGDYLVGEKI---DINGDGTPLRYMDKPSKDGASKDA--WYSGIG-NIDVHYSSG 456
N D+ +GE I + GD LR M P+K G Y+G N VH +SG
Sbjct: 409 NRNP---DWEIGEDIYTPGVAGDA--LRSMSDPAKYGDPDHYSKRYTGTQDNGGVHTNSG 463

Query: 457 PANHFFYLLSEGSGAKVINGVSYDSPTSDGLPVTGIGRAKAEQIWFKALATKFTSTTNYA 516
N YLLS+G G+ VTGIGR K +I+++AL T T+N++
Sbjct: 464 IINKAAYLLSQGG-------------VHYGVSVTGIGRDKMGKIFYRALVYYLTPTSNFS 510

Query: 517 AARTGTLAVAGELYGTTSAEYKAVGDAWAAINV 549
R + A +LYG+TS E +V A+ A+ V
Sbjct: 511 QLRAACVQAAADLYGSTSQEVNSVKQAFNAVGV 543


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_28840  TETREPRESSOR  66     4e-15    Tetracycline repressor protein signature.
		>TETREPRESSOR#Tetracycline repressor protein signature.

Length = 218

Score = 66.5 bits (162), Expect = 4e-15
Identities = 47/213 (22%), Positives = 77/213 (36%), Gaps = 22/213 (10%)

Query: 86 LSRGRIVRAAIELADAEGLPAVSMRRVATTLSTSTMALYRHVPGKAELVRLMSDEVFGER 145
L+R ++ AA+EL + G+ ++ R++A L LY HV K L+ ++ E+
Sbjct: 4 LNRESVIDAALELLNETGIDGLTTRKLAQKLGIEQPTLYWHVKNKRALLDALAVEILARH 63

Query: 146 PLGTVP---RDWRSGLEVAARWLRSVYGRHPWMAQATASFTRPTASPHAMRYTEWVLHAL 202
++P W+S L A R R+ A+ TRP E L +
Sbjct: 64 HDYSLPAAGESWQSFLRNNAMSFRRALLRYRDGAKVHLG-TRPD--EKQYDTVETQLRFM 120

Query: 203 RGTGLSPHTMLHIHLTLFAHVQGLAMGADSEAQARQDTGLSDVEWRVRNEPQFNAISASG 262
G S L+ ++ +H +GA E Q + + R A
Sbjct: 121 TENGFSLRDGLYA-ISAVSH---FTLGAVLEQQEHT----AALTDRP-------AAPDEN 165

Query: 263 DYPFLNSLFEHDEFELDLDSLFEFGLQRTLDGI 295
P L D + F GL+ + G
Sbjct: 166 LPPLLREAL-QIMDSDDGEQAFLHGLESLIRGF 197


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_28845  TCRTETB  170    1e-49    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 170 bits (432), Expect = 1e-49
Identities = 85/394 (21%), Positives = 158/394 (40%), Gaps = 14/394 (3%)

Query: 5 LFVSMDVSILFYALPAIGADLEPGSTQQLWILDIYGFVLAGLLITMGALGDRIGRRTVLI 64
F ++ +L +LP I D W+ + + G L D++G + +L+
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 65 TGTVLFAAASVAAAYAQSPGA-LIAARALLGVGGACLMPSTLALVRNLFHDPRQRARAVA 123
G ++ SV S + LI AR + G G A P+ + +V + R +A
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAA-AFPALVMVVVARYIPKENRGKAFG 142

Query: 124 LWTTVMATGISVGPVVSGALLEHFWWGAVFLVNLPAMALLLVLAPLLLPESRTPGEGRFD 183
L +++A G VGP + G + + W + L+ + + + L LL E R G FD
Sbjct: 143 LIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVRIKGH--FD 200

Query: 184 ILSAVLSLAALLPLIHGIKEVAKHGYQPLPALGITAGLALGFVFLRRQARLVHPMVDLAL 243
I +L ++ + + + + L +F++ ++ P VD L
Sbjct: 201 IKGIILMSVGIVFFMLFTTSYSI-SFLIVSVLSF-------LIFVKHIRKVTDPFVDPGL 252

Query: 244 LRRRAFGGPVLVNLLAMAATVGFAAFFSQYVQSVLGKSPFEAAMWSLVP-SLGVVVCAPA 302
+ F VL + GF + ++ V S E + P ++ V++
Sbjct: 253 GKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYI 312

Query: 303 GGALARRFDRGYVMGGGFLVSAAGFLSLTRIGTQSPLWMTLAGSAVYAGGLVSAMTLANE 362
GG L R YV+ G + FL+ + + + +MT+ V GGL T+ +
Sbjct: 313 GGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVL-GGLSFTKTVIST 371

Query: 363 LALGAAPPERAGSAAAVLESGQELGGALGMALLG 396
+ + + AG+ ++L L G+A++G
Sbjct: 372 IVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVG 405


76  C5746_30070  C5746_30130  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias    %GCBias  Product
C5746_30070       3       13   1.120973  hypothetical protein
C5746_30075       2       14   1.257002  ATP/GTP-binding protein
C5746_30080       2       14   1.007002  PucR family transcriptional regulator
C5746_30085       1       14   0.779327  aldehyde dehydrogenase
C5746_30090       0       10   0.301679  hypothetical protein
C5746_30095       0       12  -0.518125  GNAT family N-acetyltransferase
C5746_30100      -1       14  -0.896734  ArsR family transcriptional regulator
C5746_30105      -2       14  -1.574696  hypothetical protein
C5746_30110       0       15  -2.258369  hypothetical protein
C5746_30115       0       13  -2.730298  hypothetical protein
C5746_30120       0       11  -3.390073  serine/threonine protein phosphatase
C5746_30125       0        9  -3.020772  serine hydrolase
C5746_30130      -1       10  -3.276139  alkylhydroperoxidase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_30175  TCRTETB  33     0.004    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 33.3 bits (76), Expect = 0.004
Identities = 31/113 (27%), Positives = 50/113 (44%), Gaps = 14/113 (12%)

Query: 134 LDIYRLLITGGLVYGFGRLGNW--PAAFERCVAARGAAARAGGAAIGALLVWILVWNGTV 191
L I RLL+ G ++ FG + + + F + AR AG AA AL++ ++
Sbjct: 76 LGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQG-AGAAAFPALVMVVVARYIPK 134

Query: 192 PGMGLAFGLVPGSWLGPDQPMSPVV----------SYGLYVLLTLVIVWPFAR 234
G AFGL+ GS + + + P + SY L + + +I PF
Sbjct: 135 ENRGKAFGLI-GSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLM 186


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
C5746_30180  HTHFIS  33     0.003    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 32.9 bits (75), Expect = 0.003
Identities = 16/89 (17%), Positives = 31/89 (34%)

Query: 407 QALSVARRRGRALVEHEELATGSVLPLLADDAVRAFADGMLRALHEHDAKGRGDLVASLR 466
++ +A L+ + +F D + + + L
Sbjct: 384 RSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILA 443

Query: 467 AWLSHHGQWDAAAADLGVHRHTLRYRMRR 495
A + G AA LG++R+TLR ++R
Sbjct: 444 ALTATRGNQIKAADLLGLNRNTLRKKIRE 472


77  C5746_30210  C5746_30365  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias    %GCBias  Product
C5746_30210       3       20  -2.047738  translation initiation factor IF-2
C5746_30215       2       20  -1.683538  DUF503 domain-containing protein
C5746_30220       3       19  -2.035029  30S ribosome-binding factor RbfA
C5746_30225       2       23  -2.568864  tRNA pseudouridine(55) synthase TruB
C5746_30230       3       22  -2.805419  hypothetical protein
C5746_30235       1       19  -2.589606  bifunctional riboflavin kinase/FAD synthetase
C5746_30240      -1       20  -1.243898  XshC-Cox1 family protein
C5746_30245       0       17  -1.354217  peptide ABC transporter ATP-binding protein
C5746_30250       2       13  -1.233064  dipeptide/oligopeptide/nickel ABC transporter
C5746_30255       1       11  -1.179807  ABC transporter permease
C5746_30265       1       12  -1.226542  ABC transporter permease
C5746_30270       2       14  -2.068010  ABC transporter substrate-binding protein
C5746_30275       1       11  -1.760397  topoisomerase II
C5746_30280       1       10  -1.641626  hypothetical protein
C5746_30285       2       12  -1.349360  type VII secretion protein EccE
C5746_30290       2       10  -1.134433  type VII secretion protein EccB
C5746_30295       0       10   0.365260  type VII secretion-associated serine protease
C5746_30305       1        9   1.933916  hypothetical protein
C5746_30310       1       10   1.687754  WXG100 family type VII secretion target
C5746_30315       1       13   2.374147  DUF397 domain-containing protein
C5746_30320       1       15   1.783973  type VII secretion protein EccC
C5746_30325       1       15   0.984208  type VII secretion integral membrane protein
C5746_30330       2       17   0.471568  30S ribosomal protein S15
C5746_30335       3       17   0.500359  polyribonucleotide nucleotidyltransferase
C5746_30340      -1       13   2.854028  peptidase M16
C5746_30345       0       12   2.604454  4-hydroxy-tetrahydrodipicolinate reductase
C5746_30350       1       12   3.137808  hypothetical protein
C5746_30355       1       10   3.487174  hypothetical protein
C5746_30360       0       12   3.551100  hypothetical protein
C5746_30365       0       14   3.791187  thymidylate synthase (FAD)
ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
C5746_30345	TCRTETOQM	69	5e-14	Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 69.1 bits (169), Expect = 5e-14
Identities = 43/143 (30%), Positives = 62/143 (43%), Gaps = 22/143 (15%)

Query: 537 VMGHVDHGKTRLLDAIRKTNVVAGEAG------------------GITQHIGAYQVSSEV 578
V+ HVD GKT L +++ + E G GIT G +
Sbjct: 8 VLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGI----TSF 63

Query: 579 NGEDRRITFIDTPGHEAFTAMRARGAKSTDIAILVVAANDGVMPQTIEALNHAKAADVPI 638
E+ ++ IDTPGH F A R D AIL+++A DGV QT + + +P
Sbjct: 64 QWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGIPT 123

Query: 639 VVAVNKIDVEGADPTKVRGQLTE 661
+ +NKID G D + V + E
Sbjct: 124 IFFINKIDQNGIDLSTVYQDIKE 146


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
C5746_30390	BCTERIALGSPD	30	0.024	Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 29.5 bits (66), Expect = 0.024
Identities = 18/104 (17%), Positives = 45/104 (43%), Gaps = 7/104 (6%)

Query: 173 DPGLIIADEPTTALDVMIQAQILRLIEQLVSEQDL--GLIMISHDLAVLSDTCDRLAVMY 230
+I A T AL V ++ +E+++++ D+ +++ A++++ D +
Sbjct: 308 KNIIIKAHGQTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVE---AIIAEVQDADGLNL 364

Query: 231 AGRVVEEGPASEVYENAHHPYGKALSGA--FPRIGDPASRFAPR 272
+ + + N+ P A++GA + + G +S A
Sbjct: 365 GIQWANKNAGMTQFTNSGLPISTAIAGANQYNKDGTVSSSLASA 408


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
C5746_30410	PERTACTIN	36	7e-04	Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 36.2 bits (83), Expect = 7e-04
Identities = 23/68 (33%), Positives = 26/68 (38%), Gaps = 7/68 (10%)

Query: 338 PQGGYGFPQQPAPPAPQAPEQGGYGFPQPPQAPQPPHVPEQGGYGFPQPPQAPQPPQAPQ 397
G + APPAP+ Q G P P P P PQPPQ PQ
Sbjct: 556 GNGQWSLVGAKAPPAPKPAPQPGPQPGPQPPQPPQPPQP-------PQPPQPPQRQPEAP 608

Query: 398 TPQWPAQQ 405
PQ PA +
Sbjct: 609 APQPPAGR 616



Score = 32.8 bits (74), Expect = 0.007
Identities = 19/62 (30%), Positives = 20/62 (32%)

Query: 259 PPPNTPQDTPAPAMPTPWTPAPPQSGLPPLPPAFHPATPQPSAPRSAPQWPAQTPADQTP 318
N P PAP P P P PQP P PQ + PA Q P
Sbjct: 554 ANGNGQWSLVGAKAPPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPP 613

Query: 319 DG 320
G
Sbjct: 614 AG 615



Score = 31.6 bits (71), Expect = 0.015
Identities = 29/84 (34%), Positives = 35/84 (41%), Gaps = 7/84 (8%)

Query: 780 AQQQQGLWTSNGSNPPPQYAP-PMPGQQQYSPQQPHQPYPPQQQPYGGQQPPQSYPPQPG 838
A G W+ G+ PP P P PG Q QP QP P Q P QPPQ QP
Sbjct: 553 AANGNGQWSLVGAKAPPAPKPAPQPGPQ--PGPQPPQPPQPPQPP----QPPQPPQRQPE 606

Query: 839 PGVQQNGWQQSLPSQAQSAASPAG 862
Q + L + A +A + G
Sbjct: 607 APAPQPPAGRELSAAANAAVNTGG 630



Score = 31.6 bits (71), Expect = 0.015
Identities = 23/53 (43%), Positives = 24/53 (45%)

Query: 305 APQWPAQTPADQTPDGYGFPHAGAPQPPQAPQPPQGGYGFPQQPAPPAPQAPE 357
A PA PA Q G PQPPQ PQPPQ P+ PAP P E
Sbjct: 565 AKAPPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPPAGRE 617



Score = 30.5 bits (68), Expect = 0.043
Identities = 19/49 (38%), Positives = 19/49 (38%)

Query: 254 APQDVPPPNTPQDTPAPAMPTPWTPAPPQSGLPPLPPAFHPATPQPSAP 302
A P PQ P P P P PPQ PP PP P P P P
Sbjct: 565 AKAPPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPP 613


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
C5746_30420	PREPILNPTASE	31	0.013	Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 30.5 bits (69), Expect = 0.013
Identities = 22/79 (27%), Positives = 33/79 (41%), Gaps = 14/79 (17%)

Query: 29 RGSSRGARGPASARLNGRPGQLGSFRLQQLVLLETAVALLLAAWVIEP-LLLAPAVVVAA 87
RG RG + P SAR LV L TA+ + A + P A+++
Sbjct: 96 RGRCRGCQAPISARY-------------PLVELLTALLSVAVAMTLAPGWGTLAALLLTW 142

Query: 88 LLVLLAVVRRHRRSLPEWL 106
+LV L + + LP+ L
Sbjct: 143 VLVALTFIDLDKMLLPDQL 161


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
C5746_30430	SUBTILISIN	227	9e-74	Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 227 bits (580), Expect = 9e-74
Identities = 88/296 (29%), Positives = 146/296 (49%), Gaps = 12/296 (4%)

Query: 46 TFPMKQQYEGRPWSLQRVLLDELWQDTKGKGVRVAVIDTGVDNVNPQLKTAVDTSAGADY 105
+QQ P ++ + +W T+G+GV+VAV+DTG D +P LK +
Sbjct: 12 VIKQEQQVNEIPRGVEMIQAPAVWNQTRGRGVKVAVLDTGCDADHPDLKARIIGGRNFTD 71

Query: 106 LKGGKSDGTVDEVGHGTKVAGIIAARPRKGTGFVGLAPEATIIPIRQNDEKNSGKDTTMA 165
G + D GHGT VAG IAA G VG+APEA ++ I+ +++ SG+ +
Sbjct: 72 DDEGDPEIFKDYNGHGTHVAGTIAATE-NENGVVGVAPEADLLIIKVLNKQGSGQYDWII 130

Query: 166 TAIDHAIAKGADVINISQDTTKALTEASALGRAVARALAKDIVVVASAGNDGMDGKLKRT 225
I +AI + D+I++S + + L AV +A+A I+V+ +AGN+G
Sbjct: 131 QGIYYAIEQKVDIISMSLGGPE---DVPELHEAVKKAVASQILVMCAAGNEGDGDDRTDE 187

Query: 226 --YPAAFDGVLAVASSDRNNERAPFSQAGEFVGVAAPGVDIVSTVPGNGQCTDNGTSFSA 283
YP ++ V++V + + + + FS + V + APG DI+STVPG T +GTS +
Sbjct: 188 LGYPGCYNEVISVGAINFDRHASEFSNSNNEVDLVAPGEDILSTVPGGKYATFSGTSMAT 247

Query: 284 PYVAGVAALMRAKYP-----KWTAAQIVARIEQTAERSVTGHDDFVGWGVVDPVRA 334
P+VAG AL++ T ++ A++ + + G G++
Sbjct: 248 PHVAGALALIKQLANASFERDLTEPELYAQLIKRTI-PLGNSPKMEGNGLLYLTAV 302


78	C5746_30795	C5746_30860	Y	Y	N	Pathogenicity Island (biased-composition)
LocusTag	DNBias	CDNBias	%GCBias	Product
C5746_30795	-1	10	-3.625791	two-component system response regulator
C5746_30800	0	12	-3.260520	histidine kinase
C5746_30805	0	13	-3.297466	citrate synthase
C5746_30810	-1	18	-2.950688	citrate synthase/methylcitrate synthase
C5746_30815	0	18	-2.498331	cobalamin biosynthesis protein CobW
C5746_30820	1	18	-1.806592	DNA topoisomerase IV
C5746_30825	1	18	-1.472015	peptidase M16
C5746_30830	1	19	-1.478470	peptidase M16
C5746_30835	2	18	-0.923984	peptidase
C5746_30840	2	17	-0.424899	GntR family transcriptional regulator
C5746_30845	2	18	-0.460644	HPr family phosphocarrier protein
C5746_30850	1	18	-0.295939	acyl-CoA synthetase
C5746_30855	1	19	0.146297	phosphodiesterase
C5746_30860	2	19	-0.224922	alkaline phosphatase family protein
ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
C5746_30935	HTHFIS	84	6e-21	FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 84.1 bits (208), Expect = 6e-21
Identities = 37/172 (21%), Positives = 63/172 (36%), Gaps = 15/172 (8%)

Query: 4 VLVVDDDIRVAKVNAAYVSRVPGFRVAAQAHSAAEALATIEEQRVDLVLLDHYMPERNGL 63
+LV DDD + V +SR G+ V +AA I DLV+ D MP+ N
Sbjct: 6 ILVADDDAAIRTVLNQALSRA-GYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 64 TMVRELRRLGHHTDVIMVTAARDVATVHEAMRHGALQYLVKPFTYAGLRTKLEAYATLRH 123
++ +++ V++++A T +A GA YL KPF L +
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPF---DLTELIGIIGRALA 120

Query: 124 TLEGGGEAEQGEVDRLFGALWAAGEPDLPKGHSPTTAELVKQALRSAEGPLS 175
+ + + + G S E+ + R + L+
Sbjct: 121 EPKRRPSKLEDDSQDGMPLV----------GRSAAMQEIYRVLARLMQTDLT 162


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
C5746_30980	PYOCINKILLER	37	4e-05	Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 37.5 bits (86), Expect = 4e-05
Identities = 20/74 (27%), Positives = 33/74 (44%), Gaps = 1/74 (1%)

Query: 77 HQADVAAKAKAEAQRKAEAKKRAEAKKKAEAKAKAEAKRAAEARAEKARAARSAERTRLG 136
+ + +A + Q + A+A +A A KA + AAEA+ + AR R
Sbjct: 188 YNVKLFTEAISSLQIRMNTLTAAKASIEAAAANKAREQAAAEAKRKAEEQARQQAAIRAA 247

Query: 137 -SFQLPVAGSYVTT 149
++ +P GS V T
Sbjct: 248 NTYAMPANGSVVAT 261


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
C5746_30995	PF03544	30	0.029	Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 30.3 bits (68), Expect = 0.029
Identities = 12/68 (17%), Positives = 21/68 (30%)

Query: 628 PRPSAQSSAAATPPRAGAATTTGPGPATPSAPATGSGKATTADAGTDGGPGVRAGRIPAY 687
+ Q P + A+ +T + + GP + P Y
Sbjct: 108 VKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVASGPRALSRNQPQY 167

Query: 688 PAAERAVR 695
PA +A+R
Sbjct: 168 PARAQALR 175


79	C5746_31000	C5746_31065	Y	Y	N	Pathogenicity Island (biased-composition)
LocusTag	DNBias	CDNBias	%GCBias	Product
C5746_31000	2	7	0.137142	DUF4193 domain-containing protein
C5746_31005	2	7	0.031783	hypothetical protein
C5746_31010	2	8	-0.092740	DUF3093 domain-containing protein
C5746_31015	2	8	0.154221	thioesterase
C5746_31020	3	8	-0.516072	dUTP diphosphatase
C5746_31030	4	13	-3.371594	DUF3710 domain-containing protein
C5746_31035	4	12	-3.185502	K(+)-transporting ATPase subunit F
C5746_31040	2	10	-1.455089	potassium-transporting ATPase subunit KdpA
C5746_31045	1	8	-0.625655	K(+)-transporting ATPase subunit B
C5746_31050	1	13	0.987891	K(+)-transporting ATPase subunit C
C5746_31055	2	13	1.311864	metallopeptidase
C5746_31060	2	13	1.547644	aminopeptidase
C5746_31065	2	13	2.473871	hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
C5746_31175	RTXTOXINA	31	0.007	Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 31.1 bits (70), Expect = 0.007
Identities = 13/42 (30%), Positives = 21/42 (50%)

Query: 146 ADKLIAGASKATATVGAGIGAAAMMPVPPAMLAELAAEITGV 187
D + S A+V +GI AAA + A ++ L +TG+
Sbjct: 364 IDASLTTISTVLASVSSGISAAATTSLVGAPVSALVGAVTGI 405


80	C5746_31290	C5746_31330	Y	N	N	Genomic Island
LocusTag	DNBias	CDNBias	%GCBias	Product
C5746_31290	4	13	2.545536	hypothetical protein
C5746_31295	3	13	3.000738	serine/threonine protein kinase
C5746_31300	2	17	2.717519	hypothetical protein
C5746_31305	3	18	2.685887	DUF262 domain-containing protein
C5746_31310	2	18	2.464861	DUF3696 domain-containing protein
C5746_31320	1	20	1.891017	hypothetical protein
C5746_31325	2	16	0.833472	TIGR02677 family protein
C5746_31330	2	13	0.690680	TIGR02678 family protein
81	C5746_31455	C5746_31545	Y	Y	N	Pathogenicity Island (biased-composition)
LocusTag	DNBias	CDNBias	%GCBias	Product
C5746_31455	2	13	1.581519	arginine deiminase
C5746_31460	2	16	0.302249	ornithine carbamoyltransferase
C5746_31465	8	21	-0.727786	amino acid permease
C5746_31475	3	15	-2.222640	ATP-binding protein
C5746_31480	2	13	-2.460138	enoyl-CoA hydratase family protein
C5746_31485	1	13	-3.092651	bifunctional salicylyl-CoA
C5746_31490	1	13	-1.646798	2-aminobenzoate-CoA ligase
C5746_31495	0	12	-0.703587	acyl-CoA dehydrogenase
C5746_31500	0	10	1.920842	enamine deaminase RidA
C5746_31505	1	11	2.837429	Na+/H+ antiporter subunit A
C5746_31510	2	12	3.480452	Na(+)/H(+) antiporter subunit C
C5746_31515	1	12	3.467592	Na+/H+ antiporter subunit D
C5746_31520	1	12	3.166321	Na+/H+ antiporter subunit E
C5746_31525	-1	12	2.840283	hypothetical protein
C5746_31530	0	14	0.528313	Na+/H+ antiporter subunit G
C5746_31535	2	13	-0.589408	AraC family transcriptional regulator
C5746_31540	2	14	-0.879068	TIGR03086 family protein
C5746_31545	2	13	-0.905422	hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
C5746_31675	ARGDEIMINASE	411	e-144	Bacterial arginine deiminase signature.
		>ARGDEIMINASE#Bacterial arginine deiminase signature.

Length = 409

Score = 411 bits (1058), Expect = e-144
Identities = 150/414 (36%), Positives = 218/414 (52%), Gaps = 16/414 (3%)

Query: 2 GFHVDSEAGRLRRVILHRPDLELKRLTPSNKDALLFDDVLWVRRARQEHDGFADVLRDRG 61
++ SE GRL++V+LHRP EL+ LTP LFDD+ ++ ARQEH+ FA +L++
Sbjct: 7 PINIFSEIGRLKKVLLHRPGEELENLTPFIMKNFLFDDIPYLEVARQEHEVFASILKNNL 66

Query: 62 VEVHLFGDLLRESLEIPVA-RRLVLDRVFEEKEYG-PLATEHLRAAFEELPSAELAEALV 119
VE+ DL+ E L VA + + E E L+ F L + ++
Sbjct: 67 VEIEYIEDLISEVLVSSVALENKFISQFILEAEIKTDFTINLLKDYFSSLTIDNMISKMI 126

Query: 120 GGMTKREFLERHSEPTSVRFHVMDLDDFLLGPLPNHLFTRDTSAWIYDGVSINAMRWPAR 179
G+ E S + V + F++ P+PN LFTRD A I +GV+IN M R
Sbjct: 127 SGVVTEELKNYTSSLDDL---VNGANLFIIDPMPNVLFTRDPFASIGNGVTINKMFTKVR 183

Query: 180 QRETVHFEAIYRHHPLFTGPDAGVFHHWSEGQDDYPSTIEGGDVLVIGHGAVLIGMSERT 239
QRET+ E I+++HP++ W +++EGGD LV+ G ++IG+SERT
Sbjct: 184 QRETIFAEYIFKYHPVY----KENVPIWLN--RWEEASLEGGDELVLNKGLLVIGISERT 237

Query: 240 TPQAVEMLARGLF-DAGSAHTIVALDMPKRRAFMHLDTVMTMIDGDTFTKYAGL-GMLRS 297
++VE LA LF + S TI+A +PK R++MHLDTV T ID FT +
Sbjct: 238 EAKSVEKLAISLFKNKTSFDTILAFQIPKNRSYMHLDTVFTQIDYSVFTSFTSDDMYFSI 297

Query: 298 YTIEPG-EGPRDLKVTDHPPKHMHDAIAHALGLDSIRVLTATQDVHAAQREQWDDGCNVL 356
Y + + + + D ++ LG + A D+ REQW+DG NVL
Sbjct: 298 YVLTYNPSSSKIHIKKEKAR--IKDVLSFYLGRKIDIIKCAGGDLIHGAREQWNDGANVL 355

Query: 357 AVEPGVVVAYERNATTNTYLRKEGIEVIEIRGSELGRGRGGPRCMSCPVVRDPV 410
A+ PG ++AY RN TN + GI+V I SEL RGRGGPRCMS P++R+ +
Sbjct: 356 AIAPGEIIAYSRNHVTNKLFEENGIKVHRIPSSELSRGRGGPRCMSMPLIREDI 409


82	C5746_31760	C5746_31860	Y	N	Y	Genomic Island
LocusTag	DNBias	CDNBias	%GCBias	Product
C5746_31760	4	8	0.898196	hypothetical protein
C5746_31765	4	8	1.463805	DNA methylase
C5746_31770	3	8	1.109722	hypothetical protein
C5746_31775	5	12	0.375780	hypothetical protein
C5746_31780	3	15	0.539434	hypothetical protein
C5746_31785	3	16	0.134383	IS5/IS1182 family transposase
C5746_31790	2	18	-0.393148	hypothetical protein
C5746_31795	2	17	-1.062307	DNA (cytosine-5-)-methyltransferase
C5746_31805	4	19	-0.886778	very short patch repair endonuclease
C5746_31815	4	26	1.205311	hypothetical protein
C5746_31820	3	24	1.134168	ATP-binding protein
C5746_31825	2	25	0.693263	restriction endonuclease subunit R
C5746_31830	2	21	1.269553	hypothetical protein
C5746_31840	2	20	1.070639	type I restriction endonuclease subunit M
C5746_31845	1	20	0.822509	hypothetical protein
C5746_31850	1	19	0.717511	hypothetical protein
C5746_31855	1	17	0.195042	hypothetical protein
C5746_31860	2	18	-0.181578	hypothetical protein
83	C5746_32045	C5746_32125	Y	Y	N	Pathogenicity Island (biased-composition)
LocusTag	DNBias	CDNBias	%GCBias	Product
C5746_32045	0	22	-3.128173	ribonuclease D
C5746_32050	1	23	-3.324777	DNA-binding response regulator
C5746_32055	0	17	-4.603316	DUF3000 domain-containing protein
C5746_32060	1	18	-4.927497	uroporphyrinogen decarboxylase
C5746_32070	0	20	-4.666511	flavoprotein oxidoreductase
C5746_32075	-1	22	-5.013906	DUF4349 domain-containing protein
C5746_32080	-1	22	-4.986532	protoporphyrinogen oxidase
C5746_32085	0	25	-4.820384	hypothetical protein
C5746_32090	-1	27	-5.384901	DoxX family protein
C5746_32095	0	27	-5.307196	TIGR04222 domain-containing membrane protein
C5746_32100	-1	28	-5.922029	TIGR04222 domain-containing membrane protein
C5746_32105	1	27	-5.547000	endonuclease
C5746_32110	1	23	-4.912079	hypothetical protein
C5746_32115	1	20	-4.793514	peptidyl-tRNA hydrolase
C5746_32120	2	19	-3.640173	hypothetical protein
C5746_32125	-1	15	-3.275661	ABC transporter
ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
C5746_32310	HTHFIS	38	1e-05	FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 38.3 bits (89), Expect = 1e-05
Identities = 26/103 (25%), Positives = 41/103 (39%), Gaps = 5/103 (4%)

Query: 18 KPTAMVVVADPRVRSTVTRHLWALGVRDVIEASSIAEARPRVGN-PRDICVADVHLPDGS 76
T +V D +R+ + + L G DV S+ A + D+ V DV +PD +
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGY-DVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 77 GLTLLSETRAAGWPNG--LALSAADDIGAVRNALAGGVKGYVV 117
LL + A P+ L +SA + A G Y+
Sbjct: 62 AFDLLPRIKKA-RPDLPVLVMSAQNTFMTAIKASEKGAYDYLP 103


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
C5746_32345	ACRIFLAVINRP	28	0.014	Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 28.3 bits (63), Expect = 0.014
Identities = 12/54 (22%), Positives = 19/54 (35%), Gaps = 1/54 (1%)

Query: 52 TVYAAICGGSEFLGGMGLALGLFTPLAAAALIGVMI-NAMVTVTAAHGLWETQG 104
V I G ++ + IG+ NA++ V A L E +G
Sbjct: 903 VVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEG 956


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
C5746_32355	cloacin	37	9e-05	Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 37.0 bits (85), Expect = 9e-05
Identities = 26/77 (33%), Positives = 33/77 (42%), Gaps = 9/77 (11%)

Query: 278 GAGPGNGSCGSTNNGGSGGGGGGCSSGASCGSGSSCS---------SGSGCGGSSGSSCS 328
G G G+ ++ N G G G GAS GSG S SG GG SG
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 329 SSSGSSCGGGSSCGSSS 345
+G+S GG + G+ S
Sbjct: 66 GGNGNSGGGSGTGGNLS 82



Score = 34.3 bits (78), Expect = 7e-04
Identities = 18/47 (38%), Positives = 21/47 (44%)

Query: 275 WCAGAGPGNGSCGSTNNGGSGGGGGGCSSGASCGSGSSCSSGSGCGG 321
W G+G GNG + GGSG GG + A G S G GG
Sbjct: 56 WGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 33.5 bits (76), Expect = 0.001
Identities = 21/61 (34%), Positives = 26/61 (42%), Gaps = 6/61 (9%)

Query: 280 GPGNGSCGSTNNGGSGGG------GGGCSSGASCGSGSSCSSGSGCGGSSGSSCSSSSGS 333
GP G + GSG GGG SG G GS +G G G S G S + + S
Sbjct: 23 GPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLS 82

Query: 334 S 334
+
Sbjct: 83 A 83



Score = 32.8 bits (74), Expect = 0.002
Identities = 21/83 (25%), Positives = 34/83 (40%), Gaps = 7/83 (8%)

Query: 258 GSRSSSDSSGMFVAPVMWCAGAGPGNGSCGSTNNGGSGGGGGGCSSGASCGSGSSCSSGS 317
+SD SG W G+G G G + +G GG G + G+ S+ ++
Sbjct: 29 VGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPV 88

Query: 318 GCG-------GSSGSSCSSSSGS 333
G G+ G + S S+G+
Sbjct: 89 AFGFPALSTPGAGGLAVSISAGA 111



Score = 32.8 bits (74), Expect = 0.002
Identities = 25/79 (31%), Positives = 31/79 (39%), Gaps = 11/79 (13%)

Query: 278 GAGPGNGSCGSTNNGGSGGGG--GGCSSGASC---------GSGSSCSSGSGCGGSSGSS 326
G G S NGG G G GG S G+ GSGS G G G +G
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 327 CSSSSGSSCGGGSSCGSSS 345
+S G S GG+ ++
Sbjct: 68 NGNSGGGSGTGGNLSAVAA 86



Score = 31.6 bits (71), Expect = 0.005
Identities = 20/66 (30%), Positives = 26/66 (39%), Gaps = 5/66 (7%)

Query: 278 GAGPGNGSC-----GSTNNGGSGGGGGGCSSGASCGSGSSCSSGSGCGGSSGSSCSSSSG 332
G G G G+ S NN GG G G G G G+ +G+ GGS S+
Sbjct: 26 GLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVA 85

Query: 333 SSCGGG 338
+ G
Sbjct: 86 APVAFG 91



Score = 28.9 bits (64), Expect = 0.035
Identities = 20/52 (38%), Positives = 25/52 (48%), Gaps = 3/52 (5%)

Query: 294 SGGGGGGCSSGASCGSGSSCSSGSGCGGSSGSSCSSSSGSS---CGGGSSCG 342
SGG G G ++GA SG+ +G G G+S S S GGGS G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSG 53


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
C5746_32380	PF05272	29	0.025	Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.9 bits (64), Expect = 0.025
Identities = 12/36 (33%), Positives = 19/36 (52%)

Query: 17 IEGANGTGKSTLLRLLAGIDAPTEGRITDRRPRTAY 52
+EG G GKSTL+ L G+D ++ + +Y
Sbjct: 601 LEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSY 636


84	C5746_32180	C5746_32205	Y	Y	N	Pathogenicity Island (biased-composition)
LocusTag	DNBias	CDNBias	%GCBias	Product
C5746_32180	1	12	-3.907572	dihydrodipicolinate reductase
C5746_32185	0	12	-4.033009	carboxymuconolactone decarboxylase
C5746_32190	0	12	-3.413844	cholesterol oxidase
C5746_32195	0	14	-4.081143	hypothetical protein
C5746_32200	0	14	-4.271747	hypothetical protein
C5746_32205	0	12	-3.523702	hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
C5746_32450	PF07520	30	0.048	Virulence protein SrfB
		>PF07520#Virulence protein SrfB

Length = 1041

Score = 29.9 bits (67), Expect = 0.048
Identities = 37/162 (22%), Positives = 56/162 (34%), Gaps = 24/162 (14%)

Query: 79 RTVALGGPGADRFHAHAVHLPAGTGLPGDVPAITAWRSPNWASTTPD--GGTPGRMASLP 136
RTV L P + H H V + T L + + A D R+ S P
Sbjct: 131 RTVELPQPDPETGHTHRVQIALDTALSDQDQS-----AHYVAPERADSEKPREFRLVSDP 185

Query: 137 ASAAFGPEALNDFAVSR--------SPWLAAVFADLRRASEESSSTRVVLVERQSADVAR 188
+ ++ + L S WL +F D +RA S + AR
Sbjct: 186 GAMSWFLQRLEADEDGNAVDLQLWVSDWLKEMFLDFKRAERPGRSISEENLPHMFEHWAR 245

Query: 189 WIA----LAGAVLPPENARRLTFTTYT-RRPAQSPHRVVGVL 225
+++ + AV PP+ + F R A +P V VL
Sbjct: 246 YLSYLQVIQRAVAPPK----MRFANTVAPRDAVAPVEVDLVL 283


85	C5746_32585	C5746_32680	Y	Y	N	Pathogenicity Island (biased-composition)
LocusTag	DNBias	CDNBias	%GCBias	Product
C5746_32585	2	12	-2.508370	TIGR03086 family protein
C5746_32590	2	15	-3.446210	disulfide bond formation protein DsbA
C5746_32595	3	17	-3.300951	cobalamin biosynthesis protein CbiX
C5746_32600	0	23	-2.631536	sulfate ABC transporter permease
C5746_32605	1	21	-2.715640	sulfate ABC transporter ATP-binding protein
C5746_32615	-1	19	-3.235492	sulfate ABC transporter substrate-binding
C5746_32620	0	15	-1.321134	sulfate adenylyltransferase
C5746_32630	0	13	-1.838763	sulfate adenylyltransferase subunit CysD
C5746_32635	0	14	-1.061256	adenylyl-sulfate kinase
C5746_32640	0	14	-1.472925	phosphoadenylyl-sulfate reductase
C5746_32650	-3	16	-0.331406	hypothetical protein
C5746_32660	1	10	2.365944	sulfite reductase
C5746_32665	2	14	2.735829	GNAT family N-acetyltransferase
C5746_32670	3	13	2.734654	diguanylate cyclase
C5746_32675	2	16	3.024429	DNA alkylation response protein
C5746_32680	2	20	3.433214	hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
C5746_32870	TCRTETOQM	61	3e-12	Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 61.4 bits (149), Expect = 3e-12
Identities = 39/134 (29%), Positives = 65/134 (48%), Gaps = 14/134 (10%)

Query: 25 VDDGKSTLVGRLLHDSKSVLTDQLEAVEHASRSRGQEAPDLALLTDGLRAEREQGITIDV 84
VD GK+TL LL++S ++ +L +V +R TD ER++GITI
Sbjct: 12 VDAGKTTLTESLLYNSGAI--TELGSV-DKGTTR----------TDNTLLERQRGITIQT 58

Query: 85 AYRYFATPRRRFILADTPGHVQYTRNMVTGASTAELAVVLVDARNGVVEQTRRHAAVAAL 144
F + + DTPGH+ + + S + A++L+ A++GV QTR
Sbjct: 59 GITSFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRK 118

Query: 145 LRVPHVVLAVNKMD 158
+ +P + +NK+D
Sbjct: 119 MGIPTIFF-INKID 131


86	C5746_32905	C5746_33020	Y	Y	N	Pathogenicity Island (biased-composition)
LocusTag	DNBias	CDNBias	%GCBias	Product
C5746_32905	2	22	1.460776	GntR family transcriptional regulator
C5746_32910	1	18	1.667171	hypothetical protein
C5746_32915	2	12	1.756149	hypothetical protein
C5746_32920	3	11	3.179288	carbon monoxide dehydrogenase
C5746_32925	4	11	2.948693	(2Fe-2S)-binding protein
C5746_32930	2	10	3.299561	xanthine dehydrogenase subunit D
C5746_32935	0	16	2.203564	MFS transporter
C5746_32940	-1	17	2.002660	XshC-Cox1-family protein
C5746_32945	-1	15	1.918444	peptidase
C5746_32950	-1	15	1.468897	MFS transporter
C5746_32955	-2	15	1.415963	mini-circle protein
C5746_32960	-2	17	1.024223	SAM-dependent methyltransferase
C5746_32970	2	21	-0.265315	nuclear transport factor 2 family protein
C5746_32975	1	20	-0.667529	alcohol dehydrogenase
C5746_32980	0	19	-1.537836	antibiotic biosynthesis monooxygenase
C5746_32985	5	18	-1.018060	glycerol-3-phosphate responsive antiterminator
C5746_32990	6	20	-1.060984	FAD-binding oxidoreductase
C5746_32995	6	17	-0.787994	FAD-dependent oxidoreductase
C5746_33000	9	18	-0.551280	carbohydrate kinase
C5746_33005	8	22	-1.380809	MFS transporter
C5746_33010	9	21	-0.436274	hypothetical protein
C5746_33015	6	28	-1.545604	N-acetylmuramoyl-L-alanine amidase
C5746_33020	5	26	-0.847867	hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
C5746_33215	SUBTILISIN	225	7e-68	Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 225 bits (575), Expect = 7e-68
Identities = 114/318 (35%), Positives = 161/318 (50%), Gaps = 40/318 (12%)

Query: 224 VPQVNAPQAWAEGYDGKGSTVAVLDTGIDATHPDVKDRILETKSFVPGEE-----VLDKH 278
V + AP W + G+G VAVLDTG DA HPD+K RI+ ++F +E D +
Sbjct: 26 VEMIQAPAVWNQTR-GRGVKVAVLDTGCDADHPDLKARIIGGRNFTDDDEGDPEIFKDYN 84

Query: 279 GHGTHVASTIAGSGAASEGVNKGVAPGADLIIGKVLSNEGSGADSGIIEAMEWAKAEGAD 338
GHGTHVA TIA + + V GVAP ADL+I KVL+ +GSG II+ + +A + D
Sbjct: 85 GHGTHVAGTIAATENENGVV--GVAPEADLLIIKVLNKQGSGQYDWIIQGIYYAIEQKVD 142

Query: 339 VVSMSLGSSIPDDGSDPMSQAVDALSADGGPLFVIAAGNAYGAG----TIGSPGSAEKAL 394
++SMSLG + + +AV A L + AAGN +G PG + +
Sbjct: 143 IISMSLG---GPEDVPELHEAVKKAVASQI-LVMCAAGNEGDGDDRTDELGYPGCYNEVI 198

Query: 395 TVAAVDKQDNRAGFSSMGPLVRSYGLKPDLSAPGVDINAAASQAVPGVSGMYRTMSGTSM 454
+V A++ + + FS+ + DL APG DI + VPG G Y T SGTSM
Sbjct: 199 SVGAINFDRHASEFSNSNN-------EVDLVAPGEDILS----TVPG--GKYATFSGTSM 245

Query: 455 ATPHVAGAAAILKQR-----HPDWSGGRIKDALMSSSKKLDAYTPYEQGTGRLDVKAAVD 509
ATPHVAGA A++KQ D + + L+ + L +P +G G L + A +
Sbjct: 246 ATPHVAGALALIKQLANASFERDLTEPELYAQLIKRTIPL-GNSPKMEGNGLLYLTAVEE 304

Query: 510 -----TTIEATGSVAVAS 522
T G ++ AS
Sbjct: 305 LSRIFDTQRVAGILSTAS 322


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
C5746_33220	ACRIFLAVINRP	31	0.009	Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 31.0 bits (70), Expect = 0.009
Identities = 16/50 (32%), Positives = 24/50 (48%), Gaps = 2/50 (4%)

Query: 290 SWRTPNRVRLPVPAAGLGV--AMVVAGMSPNVYVLVAMVTAAGVFVAPAL 337
SW P V L VP +GV A + +VY +V ++T G+ A+
Sbjct: 893 SWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAI 942


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
C5746_33260	ISCHRISMTASE	30	0.026	Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 29.6 bits (66), Expect = 0.026
Identities = 12/55 (21%), Positives = 24/55 (43%), Gaps = 2/55 (3%)

Query: 383 TLEMAASWTDLPVIYDEVVDAIQSVPGTLAASAHQSHAYTDGACVYFSLRGDVAP 437
LE AA V+ D ++D +Q+ P + ++ + C ++R +A
Sbjct: 189 ALEYAAGRCAFTVMTDSLLDQLQNAPADVQKTSANTGKKNVFTCE--NIRKQIAE 241


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
C5746_33275	TCRTETA	46	3e-07	Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 45.6 bits (108), Expect = 3e-07
Identities = 27/82 (32%), Positives = 40/82 (48%), Gaps = 1/82 (1%)

Query: 68 ILTATAPALAIIFNPVGGWLATRIGRVPPLLIAKVFAIAGALLAAFAGDFTVVWLGRVLV 127
IL A + PV G L+ R GR P LL++ A + A A V+++GR++
Sbjct: 47 ILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVA 106

Query: 128 GVAYGIDFAVAMALLAEYTPAK 149
G+ G AVA A +A+ T
Sbjct: 107 GIT-GATGAVAGAYIADITDGD 127


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
C5746_33285	MICOLLPTASE	265	1e-77	Microbial collagenase metalloprotease (M9) signature.
		>MICOLLPTASE#Microbial collagenase metalloprotease (M9) signature.

Length = 1104

Score = 265 bits (679), Expect = 1e-77
Identities = 118/600 (19%), Positives = 214/600 (35%), Gaps = 80/600 (13%)

Query: 121 KPVKADKQSSAAAGDECGDISGVINATGSALVQQLKALPRITCTYPLFSLTGEDARKTF- 179
+P+ S A ++ + S LV+ +K + LF+ D TF
Sbjct: 76 RPLGPSIAPSRARNNKIYTFDELNRMNYSDLVELIKTI-SYENVPDLFNFN--DGSYTFF 132

Query: 180 -REAQMVTVANALRDASATYSGNNSAAIGQLVLFLRAGYYVQDNHANVVGDYGTALDSAA 238
++ + L D+ TY+ ++ I LV FLRAGYY+ + + L +
Sbjct: 133 SNRDRVQAIIYGLEDSGRTYTADDDKGIPTLVEFLRAGYYLGFYNKQLSYLNTPQLKNEC 192

Query: 239 LGALDAFFASPHSKDVTDANGEIFNEVVTLIDSAHAAGRYAGVVKWMLGSY------DGT 292
L A+ A + + + T A + + LI +A A ++L + G+
Sbjct: 193 LPAMKAIQYNSNFRLGTKAQDGVVEALGRLIGNASADPEVINNCIYVLSDFKDNIDKYGS 252

Query: 293 WPSQMN---LAMQHVEWVVEN------GFKAKNDDRGWRAALKADPTILNTWAGFITRNS 343
S+ N M+ +++ + G+ AKN + + + L + + +
Sbjct: 253 NYSKGNAVFNLMKGIDYYTNSVIYNTKGYDAKNTE--FYNRIDPYMERLESLCTIGDKLN 310

Query: 344 AQLNRLDVVSNVGRYLGYALDVPELKDRVRPLLKDLINRYPNVGPTAPITMNLGWYTRQY 403
+ +V+N Y G E + L+ + YP ++
Sbjct: 311 N--DNAWLVNNALYYTGRMGKFREDPSISQRALERAMKEYPY------LSYQYIEAANDL 362

Query: 404 DRNNCAAYAIC----------DLGDRVLPAILPIQHTCTPDLKI-RAQD-MSPGQLAGTC 451
D N + D ++ LP +T + +A D ++ ++
Sbjct: 363 DLNFGGKNSSGNDIDFNKIKADAREKYLPK----TYTFDDGKFVVKAGDKVTEEKIKRLY 418

Query: 452 TSLVNQDAYFHRVIGDKGA-IPGDVNTNLEVVVFDDYTQYSLYAWAIYNIDVDNGGMYEE 510
+ A F RV+ + A G+ + L VV+++ +Y L I DNGG+Y E
Sbjct: 419 WASKEVKAQFMRVVQNDKALEEGNPDDILTVVIYNSPEEYKLNR-IINGFSTDNGGIYIE 477

Query: 511 GNPAAAGNQARFIAHEAHWLRPDFQIWNLN----HEYTHYLDGRYN---MAGDFEASLTT 563
N F +E P+ I+ L HE+THYL GRY M G E
Sbjct: 478 -------NIGTFFTYER---TPEESIYTLEELFRHEFTHYLQGRYVVPGMWGQGEFYQEG 527

Query: 564 PTIWWVEGIAENISYGYRNE----RNADAIGEAGKKTYK--LSDLFDTVYNQDGDPEVNS 617
W+ EG AE + R + R + G A + + L + Y S
Sbjct: 528 VLTWYEEGTAEFFAGSTRTDGIKPRKSVTQGLAYDRNNRMSLYGVLHAKYG--------S 579

Query: 618 NRVYRWGFLAVRYMLQAHPADVETVLNKYRTGDWNGARTFLKQTIGTSY-DAGFATWLTT 676
Y +GF YM + + N + D +G + ++ + + ++ +
Sbjct: 580 WDFYNYGFALSNYMYNNNMGMFNKMTNYIKNNDVSGYKDYIASMSSDYGLNDKYQDYMDS 639



Score = 73.2 bits (179), Expect = 3e-15
Identities = 26/168 (15%), Positives = 51/168 (30%), Gaps = 28/168 (16%)

Query: 535 QIWNLNHEYTHYLDGRYNMAGDFEASLTTPTIWWVEGIAENISYGYRNERNADAIGEAGK 594
+ LN +Y Y+D N + + L + + A++I+ + + I +
Sbjct: 625 SDYGLNDKYQDYMDSLLNNIDNLDVPLVS-DEYVNGHEAKDINEITNDIKEVSNIKDL-- 681

Query: 595 KTYKLSDLFDTVYNQDGDPEVNSNRVYRWGFLAVRYMLQAHP-----ADVETVLNKYRTG 649
+ F T Y+ R ++ R + + + + +L +
Sbjct: 682 SSNVEKSQFFTTYD------------MRGTYVGGRSQGEENDWKDMNSKLNDILKELSKK 729

Query: 650 DWNGARTFLKQ--------TIGTSYDAGFATWLTTVCATNDCGPLPEA 689
WNG +T YD F T P+A
Sbjct: 730 SWNGYKTVTAYFVNHKVDGNGNYVYDVVFHGMNTDTNTDVHVNKEPKA 777


87	C5746_33280	C5746_33330	Y	Y	N	Pathogenicity Island (biased-composition)
LocusTag	DNBias	CDNBias	%GCBias	Product
C5746_33280	2	13	-0.869580	hypothetical protein
C5746_33285	1	10	-0.811243	DNA-binding response regulator
C5746_33290	1	12	-0.223625	two-component sensor histidine kinase
C5746_33295	2	11	-0.060229	ABC transporter substrate-binding protein
C5746_33300	2	10	-0.053749	ABC transporter ATP-binding protein
C5746_33305	2	11	-0.419749	IclR family transcriptional regulator
C5746_33310	2	9	-0.671951	allantoinase AllB
C5746_33315	1	13	-1.593176	allantoicase
C5746_33320	1	21	-1.276360	antibiotic biosynthesis monooxygenase
C5746_33325	1	17	-0.131500	aldo/keto reductase
C5746_33330	2	19	-0.009692	oxidoreductase
ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
C5746_33600	HTHFIS	55	3e-11	FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 55.2 bits (133), Expect = 3e-11
Identities = 27/118 (22%), Positives = 43/118 (36%), Gaps = 3/118 (2%)

Query: 5 IRLLLADDHPVVRAGLRAVLDSEPDFCVVAEAATAERAVELAATGEFDVVLMDLQFGAGM 64
+L+ADD +R L L V + A A G+ D+V+ D+
Sbjct: 4 ATILVADDDAAIRTVLNQALSRA--GYDVRITSNAATLWRWIAAGDGDLVVTDVVMP-DE 60

Query: 65 HGSRATAAITARPDGPRVLILTTYDSDADILAAVEAGAAGYLLKDAPPEELAAAVRTA 122
+ I VL+++ ++ + A E GA YL K EL + A
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
C5746_33625  UREASE  39     4e-05    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 39.0 bits (91), Expect = 4e-05
Identities = 36/168 (21%), Positives = 59/168 (35%), Gaps = 37/168 (22%)

Query: 6 VNLVLRSTRVVTPDGTRPAAVAVADGRI---------------DAVLPYGTEVPAGARSE 50
V+ V+ + ++ G A + + DGRI ++ GTEV AG
Sbjct: 68 VDTVITNALILDHWGIVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAGEGK- 126

Query: 51 DFGDDVLLPGLVDTHVHVNDPGRTEWEGFRTATRAAAAGGITTLLDM---PLNSLPPTTS 107
++ G +D+H+H P + E A G+T +L P + TT
Sbjct: 127 -----IVTAGGMDSHIHFICPQQIE---------EALMSGLTCMLGGGTGPAHGTLATTC 172

Query: 108 VA---HLRTKQQVAAPKAHIDTGFWGGAIPSNVKDLRPLYEAGVFGFK 152
H+ + AA ++ F G S L + G K
Sbjct: 173 TPGPWHIARMIE-AADAFPMNLAFAGKGNASLPGALVEMVLGGATSLK 219


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_33645  DHBDHDRGNASE  106    1e-29    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 106 bits (265), Expect = 1e-29
Identities = 79/252 (31%), Positives = 113/252 (44%), Gaps = 19/252 (7%)

Query: 14 RAALVTGGSRGIGAATALRLARDGADVALTYVRDEDGAQAVVKEIQSYGRRGIALRADSA 73
+ A +TG ++GIG A A LA GA +A E + VV +++ R A AD
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEK-LEKVVSSLKAEARHAEAFPADVR 67

Query: 74 DPDAAPAAVRRAADTLGRLDILVNNAGIGVLGPIESLTPADVDRVLAVNVRAVFLACRTA 133
D A R +G +DILVN AG+ G I SL+ + + +VN VF A R+
Sbjct: 68 DSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127

Query: 134 AGLMGD--GGRIISLGTALSRHAGGP--GSTLYTMSKSALAGLTKPLARELGPRGITVNL 189
+ M D G I+++G S AG P Y SK+A TK L EL I N+
Sbjct: 128 SKYMMDRRSGSIVTVG---SNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNI 184

Query: 190 VQPGPVDTDL-------NPADGPLAAGQLS----ATALDRFGSVDEVASLIAYLASDEAA 238
V PG +TD+ + G L L + ++A + +L S +A
Sbjct: 185 VSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAG 244

Query: 239 YITGTEVTVDGG 250
+IT + VDGG
Sbjct: 245 HITMHNLCVDGG 256


88  C5746_33510  C5746_33655  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_33510  -2      10       3.274574   5,10-methylene tetrahydromethanopterin
C5746_33515  -4      9        2.467949   oxidoreductase
C5746_33520  -3      10       1.705782   SsgA family sporulation/cell division regulator
C5746_33525  0       10       0.225610   4-carboxymuconolactone decarboxylase
C5746_33530  1       11       -0.622565  MBL fold metallo-hydrolase
C5746_33535  3       12       -1.773296  exodeoxyribonuclease III
C5746_33550  4       19       -3.643975  chloramphenicol efflux pump
C5746_33560  -2      12       -0.667200  alkaline phosphatase
C5746_33565  -2      14       0.505200   hypothetical protein
C5746_33570  0       14       0.746398   glutamate ABC transporter ATP-binding protein
C5746_33580  -1      11       1.941642   ABC transporter substrate-binding protein
C5746_33585  1       11       1.928746   amino acid ABC transporter permease
C5746_33595  2       9        2.241320   amino acid ABC transporter permease
C5746_33605  0       9        1.791130   hypothetical protein
C5746_33610  1       11       0.904774   peptidoglycan bridge formation protein FemAB
C5746_33615  1       11       0.614082   murein biosynthesis integral membrane protein
C5746_33620  0       11       0.938340   DNA-binding response regulator
C5746_33625  1       13       0.816999   two-component sensor histidine kinase
C5746_33630  1       12       0.927049   hypothetical protein
C5746_33635  1       12       1.735676   hypothetical protein
C5746_33640  3       12       2.184380   hypothetical protein
C5746_33645  3       10       1.617517   gamma-glutamyltransferase
C5746_33650  2       12       1.155666   hypothetical protein
C5746_33655  2       10       1.822528   type I methionyl aminopeptidase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_33875  TCRTETA  39     3e-05    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 38.7 bits (90), Expect = 3e-05
Identities = 61/275 (22%), Positives = 101/275 (36%), Gaps = 15/275 (5%)

Query: 41 GLLTSAFAIGMVVGAPLMALFSRNWPRRRALLFFLCVFSAVHVIGALTPSYGVLLATRVV 100
G+L + +A+ AP++ S + RR LL L + + I A P VL R+V
Sbjct: 46 GILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIV 105

Query: 101 GALANAGFWAVALVTAVSMVGPEARARATSVVVGGVTIACVAGVPAGAALGGHWGWRSAF 160
+ A AVA + + RAR + VAG P L G + + F
Sbjct: 106 AGITGATG-AVAGAYIADITDGDERARHFGFMSACFGFGMVAG-PVLGGLMGGFSPHAPF 163

Query: 161 WAVALVSVPAVVVIARTIPGARPKATPAPARDELGALARPR-------LLLTLLTSALVQ 213
+A A ++ + +P + R+ L LA R + + ++Q
Sbjct: 164 FAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQ 223

Query: 214 GATFCAFSYFEPLATHVTGFGAAWVPALLALFG-LGSFVGVTVAGRIIDARPVALTTAGL 272
+ + + A + LA FG L S + G + A + A +
Sbjct: 224 LVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPV--AARLGERRALM 281

Query: 273 LSLAVGWTGFALTAGSPFATVALVFVQGMLAFGTG 307
L + TG+ L A FAT + M+ +G
Sbjct: 282 LGMIADGTGYILLA---FATRGWMAFPIMVLLASG 313


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
C5746_33930  HTHFIS  91     3e-23    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 90.7 bits (225), Expect = 3e-23
Identities = 33/164 (20%), Positives = 68/164 (41%), Gaps = 6/164 (3%)

Query: 2 PRVLLIEDDPSVREGVELGLRRRGHELRSVETGEEGLAALGEFRPDLVLLDLMLPGMNGV 61
+L+ +DD ++R + L R G+++R + DLV+ D+++P N
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 QVCRLIRET-SQLPIIMLTARGDDFDVVVGLEAGADDYIVKPARTEVIEARIRAVL---R 117
+ I++ LP+++++A+ + E GA DY+ KP + I L +
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 118 RLAEPAGRGGIEFHGELAVDRAGLSVAKAGQRLLLAPSELKLLL 161
R + + A + + R L ++L L++
Sbjct: 124 RRPSKLEDDSQDGMPLVGRSAAMQEIYRVLAR--LMQTDLTLMI 165


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_33935  PF06580  42     3e-06    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 42.2 bits (99), Expect = 3e-06
Identities = 43/264 (16%), Positives = 91/264 (34%), Gaps = 60/264 (22%)

Query: 230 LAELAWTFNESSTKLQESVEELQRAEARARRFASDVSHELRTPLAGMLAVTEVLDEDAAQ 289
+ W+ ++ ++ + + + A + +L L AQ
Sbjct: 126 VVTFMWSLLYFGWHFFKNYKQAEIDQWKMASMAQEA--QLM-----ALK---------AQ 169

Query: 290 LNA----DTAAAVR-LISAETGKLATLVEDLMEISRFDAKAADLHL----DEVDVAETIR 340
+N + +R LI + K ++ L E+ R+ + ++ DE+ +
Sbjct: 170 INPHFMFNALNNIRALILEDPTKAREMLTSLSELMRYSLRYSNARQVSLADEL---TVVD 226

Query: 341 K--TLQGRHWEDRVRTEL--PDGIRAMLDPRRFDIVVANLVGNALRHGAQPV----TVRL 392
L +EDR++ E I + P ++V LV N ++HG + + L
Sbjct: 227 SYLQLASIQFEDRLQFENQINPAIMDVQVPP---MLVQTLVENGIKHGIAQLPQGGKILL 283

Query: 393 RTERRGAARWLVTEVADNGPGIDPAVLPHIFDRFYKADAARTRSAGSGLGLAITQENVR- 451
+ + + EV + G A + +G GL +E ++
Sbjct: 284 KGTKDNG--TVTLEVENTGSL-----------------ALKNTKESTGTGLQNVRERLQM 324

Query: 452 LHGGTVR-AVNGPAGGAVLTVELP 474
L+G + ++ G V +P
Sbjct: 325 LYGTEAQIKLSEKQGKVNAMVLIP 348


89  C5746_34070  C5746_34185  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_34070  2       15       -1.968661  MFS transporter
C5746_34075  2       14       -1.733299  SAM-dependent methyltransferase
C5746_34080  2       13       -1.560234  LysR family transcriptional regulator
C5746_34085  2       14       -1.619224  L-glyceraldehyde 3-phosphate reductase
C5746_34095  1       16       -2.251816  Mini-circle protein
C5746_34105  1       21       -2.388557  amidase
C5746_34110  -1      20       -2.043561  TetR family transcriptional regulator
C5746_34115  0       18       -2.071645  hydrolase
C5746_34120  0       21       -3.005186  hypothetical protein
C5746_34125  0       17       -1.249653  two-component sensor histidine kinase
C5746_34130  0       17       -0.678944  DNA-binding response regulator
C5746_34140  2       21       -0.940729  hypothetical protein
C5746_34145  2       23       -0.562258  peptidoglycan-binding protein
C5746_34150  2       25       -0.562128  ABC transporter ATP-binding protein
C5746_34160  0       14       0.995162   ABC transporter permease
C5746_34165  -1      13       1.359698   Mini-circle protein
C5746_34170  -2      16       2.418044   GntR family transcriptional regulator
C5746_34175  -3      17       2.703167   ABC transporter permease
C5746_34185  -1      12       3.054970   ABC transporter ATP-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_34385  TCRTETB  160    2e-44    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 160 bits (406), Expect = 2e-44
Identities = 104/424 (24%), Positives = 185/424 (43%), Gaps = 22/424 (5%)

Query: 51 ATPRHIRLVFLGLMLTLLLAALDQMIVATALPKIVGELHGLE-KMSWAVTAYLLASTIGL 109
+ RH +++ L + + L++M++ +LP I + + +W TA++L +IG
Sbjct: 8 SNLRHNQILIW-LCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGT 66

Query: 110 PIYGKLGDLFGRKGVFQFAIVVFVIGSALAGWSRTMDE-LIAFRAVQGIGGGGLMIGVQA 168
+YGKL D G K + F I++ GS + + LI R +QG G V
Sbjct: 67 AVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMV 126

Query: 169 IIADVVPPRERGRFMGLIGAAFGLASVAGPLLGGFFTDHASWRWCFYINVPFGLITLAVI 228
++A +P RG+ GLIG+ + GP +GG + W + + +P +IT+ +
Sbjct: 127 VVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHY--IHWSYLLLIP--MITIITV 182

Query: 229 TVVLKLPRPTVRPR--LDILGALLLAAASTCLVLLTSWGGTEYAWGSRTILGLAAGAAGT 286
++KL + VR + DI G +L++ +L T T Y+ + L+
Sbjct: 183 PFLMKLLKKEVRIKGHFDIKGIILMSVGIVFFMLFT----TSYSISFLIVSVLSFL---- 234

Query: 287 ALLFVVVEHRAAEPIIPLRLFRDSIFNVTALVGAVVGVALFGAASYLPTFLQMVDGATAT 346
+FV + +P + L ++ F + L G ++ + G S +P ++ V +
Sbjct: 235 --IFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTA 292

Query: 347 ESG-LLMLPMMGGIVGASVVSGQLISRTGRYRVYPILGGAVSVVGMWLLSRLETDTPRFE 405
E G +++ P ++ + G L+ R G V +G V S L T F
Sbjct: 293 EIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVL-NIGVTFLSVSFLTASFLLETTSWFM 351

Query: 406 YSIAQAVLGIGIGLVMPVLVLAVQNSVEPADLGAATSANNYFRQIGGSVGAAVFGTLFAG 465
I VLG G+ V+ V +S++ + GA S N+ + G A+ G L +
Sbjct: 352 TIIIVFVLG-GLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSI 410

Query: 466 RLAD 469
L D
Sbjct: 411 PLLD 414


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_34420  TETREPRESSOR  70     8e-17    Tetracycline repressor protein signature.
		>TETREPRESSOR#Tetracycline repressor protein signature.

Length = 218

Score = 69.6 bits (170), Expect = 8e-17
Identities = 46/204 (22%), Positives = 79/204 (38%), Gaps = 14/204 (6%)

Query: 17 VTLDAILDAATEIADDRGLDAVTFRVVADRLGVSPMAIHRTTGGIDALQHALVSRIVGE- 75
+ ++++DAA E+ ++ G+D +T R +A +LG+ ++ AL AL I+
Sbjct: 4 LNRESVIDAALELLNETGIDGLTTRKLAQKLGIEQPTLYWHVKNKRALLDALAVEILARH 63

Query: 76 VTRSVHWP-DDWCGVVRLFADTLHDLLMRHPVILEAH--RRASLVGPGADDVALRVVDAL 132
S+ + W +R A + L+R+ + H R + LR +
Sbjct: 64 HDYSLPAAGESWQSFLRNNAMSFRRALLRYRDGAKVHLGTRPDEKQYDTVETQLRF---M 120

Query: 133 RTAGLDEEGAVYAYGALHDFVTG-------HVAIRLGRGDPEQLRLPPERRAMSVFADHH 185
G +YA A+ F G H A R LPP R D
Sbjct: 121 TENGFSLRDGLYAISAVSHFTLGAVLEQQEHTAALTDRPAAPDENLPPLLREALQIMDSD 180

Query: 186 DYDRRFAYGLDLVIGGIAAAAAPV 209
D ++ F +GL+ +I G +
Sbjct: 181 DGEQAFLHGLESLIRGFEVQLTAL 204


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_34435  FLGFLIH  31     0.006    Flagellar assembly protein FliH signature.
		>FLGFLIH#Flagellar assembly protein FliH signature.

Length = 228

Score = 31.3 bits (70), Expect = 0.006
Identities = 42/157 (26%), Positives = 64/157 (40%), Gaps = 15/157 (9%)

Query: 232 ADALQASDAELRVQEAKARRFVADVSHELRTPLAAMTMVATVLEEDADQLPPDAARAARA 291
A L+ AE + Q+A + + E +T L A+ V A +L A AAR
Sbjct: 77 AQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTLDALDSVI------ASRLMQMALEAARQ 130

Query: 292 VGAET-----ARLSRLVEDLMEISRFDAKAVRLNAAETDLA---DTVRASLALRGWTDRV 343
V +T + L + ++ L++ + +L DL D + A+L+L GW R
Sbjct: 131 VIGQTPTVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRVDDMLGATLSLHGWRLRG 190

Query: 344 QTHLHE-GVRAVVDRRRIDVIVANLVGNALRHGAPPV 379
LH G + D +D VA R AP V
Sbjct: 191 DPTLHPGGCKVSADEGDLDASVATRWQELCRLAAPGV 227


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
C5746_34440  HTHFIS  98     9e-26    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 97.6 bits (243), Expect = 9e-26
Identities = 39/140 (27%), Positives = 63/140 (45%), Gaps = 5/140 (3%)

Query: 2 PHVLLIEDDASVRDGMELVLRRHGYGVDTAATGEQALALLAGERGSRVELAVLDLMLPGM 61
+L+ +DDA++R + L R GY V + +A G L V D+++P
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGD---LVVTDVVMPDE 60

Query: 62 DGFEVCRRIRARSATLPVIMLTARGDDSDIVTGLEAGADDYVVKPVTAPVLEARIRAAL- 120
+ F++ RI+ LPV++++A+ + E GA DY+ KP L I AL
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 121 -RRAEPSARQRSDADLAGLV 139
+ PS + D LV
Sbjct: 121 EPKRRPSKLEDDSQDGMPLV 140


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
C5746_34450  MICOLLPTASE  30     0.022    Microbial collagenase metalloprotease (M9) signature.
		>MICOLLPTASE#Microbial collagenase metalloprotease (M9) signature.

Length = 1104

Score = 29.7 bits (66), Expect = 0.022
Identities = 19/96 (19%), Positives = 33/96 (34%)

Query: 79 VTVAASEGKTLTMGQALYELNDKPVTLLYGPVPMFREMKAGDRGSDVLQLERNLRDLGYG 138
+TV + G T + + + DKPV ++ P KA + ++ L + Y
Sbjct: 837 LTVTDNNGGINTESKKIKVVEDKPVEVINESEPNNDFEKANQIAKSNMLVKGTLSEEDYS 896

Query: 139 ANLYVDARYDENTEAAVKQWQKSLNRETTGKVGKGD 174
Y D N + + T K G +
Sbjct: 897 DKYYFDVAKKGNVKITLNNLNSVGITWTLYKEGDLN 932


90  C5746_34410  C5746_34435  Y  N  N  Genomic Island
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_34410  3       10       1.891017   Tat pathway signal sequence domain protein
C5746_34415  3       11       2.411326   alpha-L-fucosidase
C5746_34420  2       14       1.825614   hypothetical protein
C5746_34425  2       13       1.918456   oxidoreductase
C5746_34430  3       15       1.745977   long-chain fatty acid--CoA ligase
C5746_34435  2       17       1.980217   LysR family transcriptional regulator
91  C5746_34965  C5746_35085  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_34965  3       23       -3.175802  SAM-dependent methyltransferase
C5746_34970  6       16       -3.829100  hypothetical protein
C5746_34975  5       17       -4.128735  hypothetical protein
C5746_34980  6       18       -4.369981  hypothetical protein
C5746_34985  5       18       -3.884538  hypothetical protein
C5746_34990  3       15       -2.697555  glyoxalase/bleomycin resistance/extradiol
C5746_34995  2       15       -2.537771  isoprenyl transferase
C5746_35000  2       16       -0.049774  hypothetical protein
C5746_35005  -1      15       0.171016   hypothetical protein
C5746_35010  -2      15       0.713085   hypothetical protein
C5746_35015  -2      15       0.746378   IS5/IS1182 family transposase
C5746_35025  1       6        -0.612265  hypothetical protein
C5746_35030  1       6        -0.876642  phosphotransferase
C5746_35035  2       9        -1.114833  hypothetical protein
C5746_35040  2       14       -1.189466  hypothetical protein
C5746_35045  2       15       -1.141371  hypothetical protein
C5746_35050  2       15       -1.270246  DUF4132 domain-containing protein
C5746_35055  1       25       -1.305313  hypothetical protein
C5746_35060  2       18       -1.914119  SAM-dependent methyltransferase
C5746_35065  2       12       -2.365181  hypothetical protein
C5746_35070  2       10       -3.268439  hypothetical protein
C5746_35075  2       10       -3.519813  transcriptional regulator
C5746_35080  3       11       -3.175301  DUF397 domain-containing protein
C5746_35085  3       11       -2.638671  methyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_35320  SHAPEPROTEIN  30     0.005    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 30.5 bits (69), Expect = 0.005
Identities = 24/101 (23%), Positives = 40/101 (39%), Gaps = 18/101 (17%)

Query: 89 GRPAEQLGPLIEIIEETVRDITAVGRPWEVQVIGSLDMLPGVSARVLKQAAAATAGRGGL 148
G A+Q+ + T +I A+ RP + VI + + +KQ + + R
Sbjct: 56 GHDAKQM------LGRTPGNIAAI-RPMKDGVIADFFVTEKMLQHFIKQVHSNSFMRPSP 108

Query: 149 KVDVAVGYGG----RREIVDAVKSAFKEHIAAGGDPAELVE 185
+V V V G RR I ++ + AG L+E
Sbjct: 109 RVLVCVPVGATQVERRAIRESAQG-------AGAREVFLIE 142


92  C5746_35225  C5746_35255  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_35225  2       23       -3.814299  ATP-dependent DNA ligase
C5746_35230  1       22       -3.743518  hypothetical protein
C5746_35235  1       24       -3.039314  glycosyl hydrolase
C5746_35240  3       28       -3.101563  transcriptional regulator
C5746_35245  4       28       -3.324777  copper oxidase
C5746_35255  4       32       -2.422522  helicase SNF2
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_35580  INTIMIN  31     0.027    Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 30.8 bits (69), Expect = 0.027
Identities = 45/263 (17%), Positives = 77/263 (29%), Gaps = 48/263 (18%)

Query: 215 APIVVNEVGTHTIRYRATDKAGNTAAEKSVGFAVAAPPTDDRTPPETSATVSGEKDDQGR 274
VV++VG T + + V P + VSG
Sbjct: 551 NGQVVDQVGVTDFTADKTSAKADGTEAITYTATVKKNGVAQANVPVSFNIVSGTAV---- 606

Query: 275 YLGMATVTVTASDTGSGVNTIEYATGTDGAWQPYTAPVMVHETGTHQVRYRATDKAGNAA 334
+A+ GSG T+ + G QV A +A
Sbjct: 607 -----LSANSANTNGSGKATVTLKSDKPG-----------------QVVVSAKTAEMTSA 644

Query: 335 AEKSVDFTVVAPPTQDKTPPETSAEVEGDKN---SDNAYITSAEVTVTATDAGSGVDKVE 391
+ + T+ + E++ DK ++ + V V D +V
Sbjct: 645 LN--ANAVIFVDQTK-----ASITEIKADKTTAVANGQDAITYTVKVMKGDKPVSNQEVT 697

Query: 392 YSLDGGPYLAYTTTVIVDRVGYHTVAHRATDKAGNASVAKQVSFTVAESGGVPAPNCPEF 451
++ G +T D GY V +T G + V+ +VS + V AP F
Sbjct: 698 FTTTLGKL--SNSTEKTDTNGYAKVTLTSTTP-GKSLVSARVS---DVAVDVKAPEVEFF 751

Query: 452 DE------RLTVIVGTVDTGVPN 468
+ ++ V +P
Sbjct: 752 TTLTIDDGNIEIVGTGVKGKLPT 774


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_35590  PF03544  35     4e-04    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 35.0 bits (80), Expect = 4e-04
Identities = 14/85 (16%), Positives = 25/85 (29%), Gaps = 3/85 (3%)

Query: 321 THEKPAQEKPAQEKPAQEKPAQEKPAQDKPAQDKPAQDKPAPDKTAPDKTAPDKTAPDK- 379
A + P + E + P K A +KP P K P +
Sbjct: 60 LEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAP--VVIEKPKPKPKPKPKPVKKVEQPKRD 117

Query: 380 TASGDTVPGKTAKESAAHDPAAHGS 404
++ P + +A P + +
Sbjct: 118 VKPVESRPASPFENTAPARPTSSTA 142


93  C5746_35435  C5746_35525  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_35435  3       15       -0.338545  UDP-glucose 4-epimerase GalE
C5746_35440  3       14       0.152441   transferase
C5746_35445  3       15       -0.583487  nucleotide sugar-1-phosphate transferase
C5746_35450  3       11       -0.891543  dehydrogenase
C5746_35455  2       15       -1.390254  transferase
C5746_35460  1       13       -2.035095  glycosyl transferase
C5746_35465  0       12       -2.195959  ABC transporter
C5746_35470  1       12       -3.038736  teichoic acid ABC transporter ATP-binding
C5746_35475  3       12       -3.406677  squalene synthase HpnC
C5746_35480  2       12       -3.655903  squalene synthase HpnD
C5746_35485  1       14       -2.994277  phytoene dehydrogenase
C5746_35490  0       17       -3.035956  dimethylallyltranstransferase
C5746_35495  1       18       -3.458111  squalene--hopene cyclase
C5746_35500  2       20       -3.129274  1-hydroxy-2-methyl-2-butenyl 4-diphosphate
C5746_35505  2       20       -3.094998  hopanoid biosynthesis associated radical SAM
C5746_35510  3       19       -2.651521  4-hydroxy-3-methylbut-2-en-1-yl diphosphate
C5746_35515  2       19       -2.486203  1-deoxy-D-xylulose-5-phosphate synthase
C5746_35520  2       17       -2.491444  aspartate aminotransferase family protein
C5746_35525  2       14       -2.004645  XRE family transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_35800  NUCEPIMERASE  162    1e-49    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 162 bits (411), Expect = 1e-49
Identities = 78/342 (22%), Positives = 142/342 (41%), Gaps = 38/342 (11%)

Query: 1 MTWLITGGAGYIGAHVVRSMVGAGERVVVLDDLST--------GVVERLP-ADVPLIRGS 51
M +L+TG AG+IG HV + ++ AG +VV +D+L+ +E L +
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 52 AADRALLDRVLAQYDVTGVVHLAAKKQVGESVEKPLLYYRENMAGLTVLLEAVVAAGVRR 111
ADR + + A V + V S+E P Y N+ G +LE ++
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120

Query: 112 FLFSSSAAVYGV-PDVELITEDTPCVPINPYGETKLAGEWLVRATGRAHSLSTGCLRYFN 170
L++SS++VYG+ + T+D+ P++ Y TK A E + + L LR+F
Sbjct: 121 LLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFFT 180

Query: 171 VAG-AATPELSDTGVFNVIPMFFDRLTRGEAPRIFGDDYATPDGTCIRDYIHVADLADAH 229
V G P++ + F + G++ ++ G RD+ ++ D+A+A
Sbjct: 181 VYGPWGRPDM-------ALFKFTKAMLEGKSIDVYN------YGKMKRDFTYIDDIAEAI 227

Query: 230 LAVARRLAEQDAGDLT--------------VNIGRGEGVSVRELAGLVGEVSGRPEKPEI 275
+ + + D NIG V + + + + G K +
Sbjct: 228 IRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNM 287

Query: 276 EARRPGDAAKAVASAVRMSEELGWTARRGVREMVESAWQGWR 317
+PGD + A + E +G+T V++ V++ +R
Sbjct: 288 LPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYR 329


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_35830  ABC2TRNSPORT  35     3e-04    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 34.5 bits (79), Expect = 3e-04
Identities = 22/107 (20%), Positives = 41/107 (38%), Gaps = 4/107 (3%)

Query: 168 VLAVVAVAFGSYPGPSWLLIIPALVMQFLFNTGLAMIMARLGSKTPDLAQLMPFVMRTWM 227
+ VVA A G S L +P + + L L M++ L V+ +
Sbjct: 131 GIGVVAAALGYTQWLSLLYALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPIL 190

Query: 228 YASGVMFSIPVMLKDKPDWIANVLQYNPAAIYMDLIRFALIDGYGSE 274
+ SG +F + + P ++ P + +DLIR ++ +
Sbjct: 191 FLSGAVFPVDQL----PIVFQTAARFLPLSHSIDLIRPIMLGHPVVD 233


94  C5746_35635  C5746_35660  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_35635  2       21       0.514746   hypothetical protein
C5746_35640  2       19       -0.118187  PPOX class F420-dependent enzyme
C5746_35645  4       15       -0.323835  MBL fold metallo-hydrolase
C5746_35650  3       14       -0.628699  hypothetical protein
C5746_35655  2       14       -0.618168  ATP-binding protein
C5746_35660  2       10       -0.140743  dynein regulation protein LC7
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_36020  PF05616  41     2e-05    Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 40.5 bits (94), Expect = 2e-05
Identities = 34/127 (26%), Positives = 45/127 (35%), Gaps = 21/127 (16%)

Query: 356 LGVDGDAPAEADAALVP----PPGPVHRPPAEDGTGSHRRPNPTLNPGPGAGPGPSPAAS 411
G D D ++P PG P A+ NP NP P PG P
Sbjct: 295 FGRDSQGNTTVDVQVIPRPDLTPGSAEAPNAQPLPEVSPAENPANNPAPNENPGTRPNPE 354

Query: 412 PAPQASANRQPYV----------PTQPLRLHQPHQQARREGEAP-------PLPLRAERL 454
P P + + P P P R + H++ R+EGE P L +RL
Sbjct: 355 PDPDLNPDANPDTDGQPGTRPDSPAVPDRPNGRHRKERKEGEDGGLLCKFFPDILACDRL 414

Query: 455 DRPTPAD 461
P PA+
Sbjct: 415 PEPNPAE 421


95  C5746_36345  C5746_36475  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_36345  2       14       -0.861389  wax ester/triacylglycerol synthase family
C5746_36350  1       12       -0.300975  aldo/keto reductase
C5746_36355  1       11       0.346034   L-rhamnose mutarotase
C5746_36360  -1      11       1.352483   amidohydrolase
C5746_36365  -1      10       1.658582   DNA-binding response regulator
C5746_36370  0       10       1.647861   hypothetical protein
C5746_36375  0       10       1.477668   cation acetate symporter
C5746_36380  1       10       2.065040   sensor histidine kinase
C5746_36385  2       10       1.623809   hypothetical protein
C5746_36390  1       9        0.625840   hypothetical protein
C5746_36395  1       9        1.165245   RNA polymerase subunit sigma
C5746_36400  0       11       1.546564   amino acid permease
C5746_36405  0       11       1.530616   universal stress protein UspA
C5746_36410  1       12       -0.012675  hypothetical protein
C5746_36415  1       12       -0.865561  hypothetical protein
C5746_36420  1       11       -0.701383  hypothetical protein
C5746_36425  2       12       -1.149725  hypothetical protein
C5746_36430  1       11       -1.233756  GGDEF domain-containing protein
C5746_36435  2       11       -0.999748  LysR family transcriptional regulator
C5746_36440  2       11       -0.708802  succinate dehydrogenase
C5746_36445  1       10       -1.415312  fumarate reductase/succinate dehydrogenase
C5746_36450  0       9        -1.812053  succinate dehydrogenase
C5746_36455  -1      9        -1.596509  hypothetical protein
C5746_36460  -2      10       -1.966366  ABC transporter
C5746_36465  0       13       -2.537517  polysaccharide pyruvyl transferase
C5746_36470  0       14       -2.546936  hypothetical protein
C5746_36475  4       11       -1.693299  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_36855  TYPE3IMSPROT  30     0.014    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 29.7 bits (67), Expect = 0.014
Identities = 16/64 (25%), Positives = 23/64 (35%), Gaps = 7/64 (10%)

Query: 249 YTAAPLALL----DRALRIKAVTEGHGVPLRAAALHYPLAHPAVAGVLVGTRSPDEVRDA 304
T PL + ++ + E GVP+ PLA LV P E +A
Sbjct: 277 ETPLPLVTFKYTDAQVQTVRKIAEEEGVPILQRI---PLARALYWDALVDHYIPAEQIEA 333

Query: 305 AALL 308
A +
Sbjct: 334 TAEV 337


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
C5746_36870  HTHFIS  49     9e-09    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 48.7 bits (116), Expect = 9e-09
Identities = 29/143 (20%), Positives = 49/143 (34%), Gaps = 15/143 (10%)

Query: 3 RVLAVDDEEPALEEL-LYLLRADPRIRSAEGATGATEALRRIGGAVDAGPDDPSAIDVVF 61
+L DD+ L L RA +R A + G D+V
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDG------------DLVV 52

Query: 62 LDIHMAGLTGLDVAQLLAGFAAPPLIVFVTAHEGF--AVHAFDLKAVDYVLKPVRRERLA 119
D+ M D+ + ++ ++A F A+ A + A DY+ KP L
Sbjct: 53 TDVVMPDENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELI 112

Query: 120 EAVRRVAEQVGDRSAPVLDTAND 142
+ R + R + + D + D
Sbjct: 113 GIIGRALAEPKRRPSKLEDDSQD 135


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_36875  TCRTETA  26     0.042    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 25.9 bits (57), Expect = 0.042
Identities = 22/73 (30%), Positives = 34/73 (46%), Gaps = 13/73 (17%)

Query: 35 TALGGAYVRSLMRSQLRAGLTAFTVLAAVVGTLPLVFEALHSAAL------VWAVLGFAA 88
A+ V + QL+ L A T L ++VG PL+F A+++A++ W + G A
Sbjct: 321 QAMLSRQVDEERQGQLQGSLAALTSLTSIVG--PLLFTAIYAASITTWNGWAW-IAGAAL 377

Query: 89 Y----PPLTLAAW 97
Y P L W
Sbjct: 378 YLLCLPALRRGLW 390


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_36885  PF06580  212    2e-67    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 212 bits (540), Expect = 2e-67
Identities = 70/229 (30%), Positives = 114/229 (49%), Gaps = 26/229 (11%)

Query: 175 QLELAELDRSR--TQLIEAEIRALRAQISPHFIFNSLAAIASFVRTDPEQARELLLEFAD 232
+ AE+D+ + + EA++ AL+AQI+PHF+FN+L I + + DP +ARE+L ++
Sbjct: 143 NYKQAEIDQWKMASMAQEAQLMALKAQINPHFMFNALNNIRALILEDPTKAREMLTSLSE 202

Query: 233 FTRYSFR-SHGDFTTLADELHSIDQYLALVRARFGERLAVTLQVAPEVLPVALPFLCLQP 291
RYS R S+ +LADEL +D YL L +F +RL Q+ P ++ V +P + +Q
Sbjct: 203 LMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPMLVQT 262

Query: 292 LVENAVKHGLEGAVTLPRAPGSVRAGETPTRITIRALDAGSEAEVVIEDDGTGMDPQRLR 351
LVEN +KHG+ +I ++ + +E+ G+
Sbjct: 263 LVENGIKHGIAQL-------------PQGGKILLKGTKDNGTVTLEVENTGSLALK---- 305

Query: 352 RILRGEGGKSTGIGLLNVDERLRQVYGDDYGLVIETGIGAGMKVTVRLP 400
+STG GL NV ERL+ +YG + + + G V +P
Sbjct: 306 -----NTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKV-NAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
C5746_36930  GPOSANCHOR  51     3e-10    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 51.2 bits (122), Expect = 3e-10
Identities = 13/61 (21%), Positives = 14/61 (22%), Gaps = 2/61 (3%)

Query: 58 AGPAPEPTPPPAPPPPPPPPPPPPPPPPPPPLPAPPPLAPPRPKPEPVPPPPSPTPSVRP 117
A A E A P P P P KP P T P
Sbjct: 449 AKQAEELAKLRAGKASDSQTPDAKPGNKAVPGKGQAP--QAGTKPNQNKAPMKETKRQLP 506

Query: 118 T 118
+
Sbjct: 507 S 507


96  C5746_36665  C5746_36705  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_36665  2       13       0.931104   indole acetimide hydrolase
C5746_36670  3       10       0.378926   TetR family transcriptional regulator
C5746_36675  1       11       -0.447079  hypothetical protein
C5746_36680  2       11       0.118632   hypothetical protein
C5746_36685  3       18       0.590116   threonylcarbamoyl-AMP synthase
C5746_36690  3       13       0.982339   AI-2E family transporter
C5746_36695  1       11       1.805709   hypothetical protein
C5746_36700  1       10       1.925108   proline hydroxylase
C5746_36705  2       9        1.862723   proline hydroxylase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_37185  HTHTETR  61     1e-13    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 60.8 bits (147), Expect = 1e-13
Identities = 26/174 (14%), Positives = 58/174 (33%), Gaps = 2/174 (1%)

Query: 2 PKQVDHADRRRRIAEAVCRLADERGLEGVTLRDVAACAQVSMGAVQRCFRTKEEMLVFAL 61
+ + + R+ I + RL ++G+ +L ++A A V+ GA+ F+ K ++
Sbjct: 4 KTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIW 63

Query: 62 GYIGERISERVQARLVRCPAQSAAT--ALGHAVTEVSLLQEEHRAEARVWLAFVAQAAVS 119
I E + P + + V E ++ +E R +
Sbjct: 64 ELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEM 123

Query: 120 EALARTLKANYAALQEAFTRLISEAGEGADCAVPLDPQREARTLLALADGLTAH 173
+ + + + + + E L +R A + GL +
Sbjct: 124 AVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMEN 177


97  C5746_36855  C5746_36915  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_36855  2       22       1.397972   transporter
C5746_36860  2       18       0.894632   transcriptional regulator
C5746_36865  1       18       1.246982   beta-N-acetylhexosaminidase
C5746_36870  1       14       1.071070   alcohol dehydrogenase
C5746_36875  2       11       0.956991   acyl-CoA dehydrogenase
C5746_36880  0       12       -0.565395  acyl-CoA dehydrogenase
C5746_36885  0       18       -1.317560  hypothetical protein
C5746_36890  2       15       -1.713318  ABC transporter permease
C5746_36895  0       12       -1.750208  ABC transporter permease
C5746_36900  0       13       -1.902013  ABC transporter ATP-binding protein
C5746_36905  1       22       -1.283434  ABC transporter ATP-binding protein
C5746_36910  2       21       -0.485630  hypothetical protein
C5746_36915  2       13       -1.147620  AAA family ATPase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_37465  SACTRNSFRASE  28     0.002    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 28.0 bits (62), Expect = 0.002
Identities = 10/50 (20%), Positives = 17/50 (34%)

Query: 18 VKAEYRGRGVSIAMKTFGMGFVRMCGARTIRTFHHPANTSAIAMNRTMGF 67
V +YR +GV A+ + + + + N SA F
Sbjct: 97 VAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHF 146


98  C5746_37885  C5746_37950  Y  N  N  Genomic Island
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_37885  3       29       -0.018059  hypothetical protein
C5746_37890  3       17       -0.398210  hypothetical protein
C5746_37895  1       16       1.422277   hypothetical protein
C5746_37900  2       14       1.102306   XRE family transcriptional regulator
C5746_37905  1       11       1.044081   inosine-5'-monophosphate dehydrogenase
C5746_37910  0       9        0.843478   hypothetical protein
C5746_37915  1       9        0.514366   hypothetical protein
C5746_37920  1       12       0.197150   MBL fold metallo-hydrolase
C5746_37925  0       12       -2.005996  sulfurtransferase
C5746_37930  1       14       -1.436268  integral membrane family protein
C5746_37935  2       19       -0.908235  transporter
C5746_37940  2       16       -1.406685  hypothetical protein
C5746_37945  1       15       -1.623763  CopY family transcriptional regulator
C5746_37950  2       13       -0.698066  glycoside hydrolase
99  C5746_38275  C5746_38375  Y  N  N  Genomic Island
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_38275  2       11       1.782161   cytochrome c oxidase assembly protein
C5746_38280  2       11       1.734152   hypothetical protein
C5746_38285  1       11       1.526712   hypothetical protein
C5746_38290  2       36       1.582900   hypothetical protein
C5746_38295  4       15       -0.842694  hypothetical protein
C5746_38300  0       15       -0.365642  hypothetical protein
C5746_38305  0       28       -0.873238  hypothetical protein
C5746_38310  3       20       -0.866744  hypothetical protein
C5746_38315  2       16       0.511720   hypothetical protein
C5746_38320  2       16       1.078720   MFS transporter
C5746_38325  2       14       1.017908   beta-glucosidase
C5746_38330  2       20       1.717239   GNAT family N-acetyltransferase
C5746_38335  4       21       0.649348   TetR family transcriptional regulator
C5746_38340  4       28       0.486238   hypothetical protein
C5746_38345  7       34       -0.164892  hypothetical protein
C5746_38350  8       34       -0.736255  hypothetical protein
C5746_38355  8       43       -0.927655  proline racemase
C5746_38360  9       46       -1.666113  hypothetical protein
C5746_38365  8       38       -2.367301  amino acid permease
C5746_38370  6       29       -1.196682  amidase
C5746_38375  3       14       -0.459648  TIGR00374 family protein
100  C5746_38470  C5746_38510  Y  N  N  Genomic Island
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_38470  1       11       3.022915   DNA-binding response regulator
C5746_38475  0       11       2.979923   hypothetical protein
C5746_38480  0       11       2.743119   beta-N-acetylglucosaminidase
C5746_38485  0       11       3.547286   hypothetical protein
C5746_38490  0       11       3.473753   esterase
C5746_38495  0       13       3.566651   Fe-S cluster assembly protein HesB
C5746_38500  0       12       3.660017   class II aldolase family protein
C5746_38505  -1      12       3.456989   hypothetical protein
C5746_38510  0       14       3.634835   hypothetical protein
101  C5746_38570  C5746_38690  Y  N  Y  Genomic Island
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_38570  2       15       -2.713588  hypothetical protein
C5746_38575  0       16       -1.566495  ATP-binding protein
C5746_38580  0       16       -1.446539  hypothetical protein
C5746_38585  0       20       -0.990322  phosphonoacetate hydrolase
C5746_38590  1       18       -0.644365  hypothetical protein
C5746_38595  3       16       0.777061   hypothetical protein
C5746_38600  3       23       0.637443   transcriptional regulator
C5746_38605  5       17       -0.264633  hypothetical protein
C5746_38610  -1      17       -1.491019  hypothetical protein
C5746_38615  -1      11       -2.352491  ATP-binding protein
C5746_38620  -1      11       1.406861   transcriptional regulator
C5746_38625  0       12       2.280642   hypothetical protein
C5746_38630  -1      12       2.381125   hypothetical protein
C5746_38635  -1      12       2.348623   MFS transporter
C5746_38640  0       12       2.982942   MerR family transcriptional regulator
C5746_38645  0       11       3.030497   hypothetical protein
C5746_38650  -1      12       2.537830   IS30 family transposase
C5746_38655  -2      12       1.431253   hypothetical protein
C5746_38660  -1      11       1.278687   hypothetical protein
C5746_38665  0       12       0.913656   hypothetical protein
C5746_38670  0       13       0.815654   RNA polymerase subunit sigma-70
C5746_38675  1       15       0.278826   alkylhydroperoxidase
C5746_38680  3       15       -0.118555  AAA family ATPase
C5746_38685  3       18       -0.187227  hypothetical protein
C5746_38690  3       19       -0.131205  hypothetical protein
102  C5746_38790  C5746_38875  Y  N  Y  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias  Product
C5746_38790222-1.546623hypothetical protein
C5746_38795322-2.797685thioesterase
C5746_38800318-3.062628hypothetical protein
C5746_38805218-1.818025hypothetical protein
C5746_38810219-1.439055hypothetical protein
C5746_38815119-1.825354hypothetical protein
C5746_38820217-1.600639hypothetical protein
C5746_38825015-1.035876GNAT family N-acetyltransferase
C5746_38830-117-0.852073hypothetical protein
C5746_38835-120-1.568485hypothetical protein
C5746_38840019-2.491027IS5/IS1182 family transposase
C5746_38845119-2.277818transposase
C5746_38850119-2.687927glyoxalase
C5746_38855122-2.645150DDE endonuclease
C5746_38860123-3.100562cupin
C5746_38865022-4.003827MarR family transcriptional regulator
C5746_38870-118-3.260961hypothetical protein
C5746_38875-215-3.266974hypothetical protein
103  C5746_38950  C5746_39035  Y  N  Y  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias  Product
C5746_38950-123-3.035836GlcNAc-PI de-N-acetylase
C5746_38955124-3.118961hypothetical protein
C5746_38960128-3.814504IS5/IS1182 family transposase
C5746_38965223-3.944556GNAT family N-acetyltransferase
C5746_38970-124-2.773702IS630 family transposase
C5746_38975118-2.883193hypothetical protein
C5746_38980015-1.373918IS5/IS1182 family transposase
C5746_38985-213-1.795841hypothetical protein
C5746_38990012-2.173811transposase
C5746_38995314-1.330555hypothetical protein
C5746_39000310-1.538686hypothetical protein
C5746_39005080.384126CHAT domain-containing protein
C5746_39010081.697298hypothetical protein
C5746_39015-182.559208hypothetical protein
C5746_39020-182.909319IS6 family transposase
C5746_39025093.037602alpha/beta hydrolase
C5746_390300133.134020hypothetical protein
C5746_390353123.159345esterase
104  C5746_40065  C5746_40090  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias  Product
C5746_400652104.390483amidohydrolase
C5746_400703105.079356alpha/beta hydrolase
C5746_40075195.015937hypothetical protein
C5746_40080195.027873hypothetical protein
C5746_400850104.638818hypothetical protein
C5746_400900154.415014hypothetical protein
105  C5746_40180  C5746_40265  Y  N  Y  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias  Product
C5746_40180-119-3.138673hypothetical protein
C5746_40185119-3.760058hydroxymethylbilane synthase
C5746_40190020-3.883550hydrolase
C5746_40195018-4.396035hypothetical protein
C5746_40200016-4.183793alpha/beta hydrolase
C5746_40205020-3.757334hypothetical protein
C5746_40210020-2.876400hypothetical protein
C5746_40215019-1.827131NUDIX hydrolase
C5746_40220020-1.873797MFS transporter
C5746_40225119-0.988655TetR family transcriptional regulator
C5746_40230119-0.901053short-chain dehydrogenase
C5746_40235119-1.402636nitrate ABC transporter substrate-binding
C5746_40240017-1.224866LysR family transcriptional regulator
C5746_402452170.122504IS630 family transposase
C5746_402501140.521959hypothetical protein
C5746_402551152.159888hypothetical protein
C5746_402601142.618344IclR family transcriptional regulator
C5746_402652153.0909764-hydroxybenzoate 3-monooxygenase
106  C5746_40345  C5746_40425  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias  Product
C5746_40345125-3.603884SRPBCC family protein
C5746_40350223-5.805740hypothetical protein
C5746_40355225-6.942542ArsR family transcriptional regulator
C5746_40360323-7.491444GNAT family N-acetyltransferase
C5746_40365320-7.293269SAM-dependent methyltransferase
C5746_40370215-7.168939TetR family transcriptional regulator
C5746_40375216-2.342652GNAT family N-acetyltransferase
C5746_40380117-1.648405DUF1203 domain-containing protein
C5746_40385217-1.418642MFS transporter
C5746_403907170.179745hypothetical protein
C5746_403957190.385769RNA polymerase subunit sigma-24
C5746_404006220.453668hypothetical protein
C5746_404058210.027763extradiol dioxygenase
C5746_40410818-0.176997polyketide synthase
C5746_40415517-0.824394ketoacyl synthase
C5746_40420122-3.240814hypothetical protein
C5746_40425315-4.000576hypothetical protein
107  C5746_40540  C5746_40625  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias  Product
C5746_40540327-2.474043hypothetical protein
C5746_40545324-3.065676hypothetical protein
C5746_40550220-2.979492stress protein
C5746_40555117-2.619253hypothetical protein
C5746_40560019-2.074277enoyl-[acyl-carrier-protein] reductase FabV
C5746_40565-120-3.338812hypothetical protein
C5746_40570-217-3.091197molecular chaperone DnaK
C5746_40575-315-2.711907nucleotide exchange factor GrpE
C5746_40580015-2.664711polysaccharide deacetylase family protein
C5746_40585014-2.700289short-chain dehydrogenase
C5746_40590214-3.223890galactosyldiacylglycerol synthase
C5746_40595117-3.628243hypothetical protein
C5746_40600317-3.810308hypothetical protein
C5746_40605420-3.882342exonuclease SbcC
C5746_40610218-4.758300cysteine desulfurase-like protein
C5746_40615218-4.563613hypothetical protein
C5746_40620221-4.2922362-nitropropane dioxygenase
C5746_40625321-3.598265hypothetical protein
108  C5746_41100  C5746_41135  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias  Product
C5746_41100223-2.245725hypothetical protein
C5746_41105323-2.678690hypothetical protein
C5746_41110329-3.606997transcriptional regulator
C5746_41115626-4.266397cation transporter
C5746_41120231-3.821616hypothetical protein
C5746_41125133-4.753349hypothetical protein
C5746_41130-122-4.186275methyltransferase type 11
C5746_41135-217-4.556088hypothetical protein
109  C5746_41190  C5746_41245  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias  Product
C5746_411902290.740678phenolic acid decarboxylase subunit B
C5746_411951280.293961AsnC family protein
C5746_41200021-0.033387amidase
C5746_412050190.008556recombinase
C5746_41210228-1.028859hypothetical protein
C5746_41215329-2.182347ABC transporter
C5746_41220724-3.075803antibiotic ABC transporter ATP-binding protein
C5746_41225327-3.501028hypothetical protein
C5746_41230225-3.876794hypothetical protein
C5746_41235221-2.831019hypothetical protein
C5746_41240215-2.235508hypothetical protein
C5746_41245213-1.923376hypothetical protein
110  C5746_41290  C5746_41600  Y  N  Y  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias  Product
C5746_41290219-2.031059hypothetical protein
C5746_41295321-1.067141hypothetical protein
C5746_41300319-0.845836hypothetical protein
C5746_41305118-1.542943hypothetical protein
C5746_41310115-1.683752phage tail protein
C5746_41315019-1.483429hypothetical protein
C5746_41320225-2.630120hypothetical protein
C5746_41325125-4.696039hypothetical protein
C5746_41330126-3.877798hypothetical protein
C5746_41335127-2.665189LacI family transcriptional regulator
C5746_41340230-1.898590alpha-galactosidase
C5746_41345229-1.997943carbohydrate ABC transporter permease
C5746_41350322-2.010187lactose ABC transporter permease
C5746_41355120-2.019685ABC transporter substrate-binding protein
C5746_41360123-2.258963hypothetical protein
C5746_41365021-1.936979hypothetical protein
C5746_41370224-2.075516LuxR family transcriptional regulator
C5746_413751020-0.578446hypothetical protein
C5746_41380822-0.345373hypothetical protein
C5746_41385724-0.094577hypothetical protein
C5746_413909190.006259hypothetical protein
C5746_41395817-0.688612hypothetical protein
C5746_41400615-1.081591hypothetical protein
C5746_41405116-2.348691DUF4262 domain-containing protein
C5746_41410116-1.915952transposase
C5746_41415217-2.191925transposase
C5746_41420118-1.682599heavy metal-responsive transcriptional
C5746_41425117-1.715119alkylmercury lyase
C5746_41430021-1.546568hypothetical protein
C5746_41435221-2.307233oxidoreductase
C5746_41440125-2.854189serine/threonine protein phosphatase
C5746_41445224-2.163402hypothetical protein
C5746_41450224-2.715853alpha/beta hydrolase
C5746_41455030-2.925576transcriptional regulator
C5746_41460034-3.345520DNA-binding protein
C5746_41465029-2.885410hypothetical protein
C5746_41470-126-2.600664hypothetical protein
C5746_41475-123-3.306114hypothetical protein
C5746_41480-222-3.008190transposase
C5746_41485-322-2.568429ABC transporter
C5746_41490-219-2.851206hypothetical protein
C5746_41495-219-2.805634hypothetical protein
C5746_41500-115-3.181920hypothetical protein
C5746_41505016-3.208701IS5/IS1182 family transposase
C5746_41510114-3.324777hypothetical protein
C5746_41515213-3.065037hypothetical protein
C5746_41520113-2.530554hypothetical protein
C5746_41525314-2.520180IS6 family transposase
C5746_41530315-2.357815erythromycin esterase
C5746_41535117-1.153769alpha/beta hydrolase
C5746_41540217-1.594008IS6 family transposase
C5746_41545516-1.474865XRE family transcriptional regulator
C5746_41550212-2.625477integrase
C5746_41555114-3.391411cellulose 1,4-beta-cellobiosidase
C5746_41560110-3.341511hypothetical protein
C5746_4156509-1.734671helicase
C5746_41570-110-1.197412hypothetical protein
C5746_41575-112-0.478062hypothetical protein
C5746_41580-2151.360349
C5746_41585-1173.409378
C5746_415900213.616022
C5746_41595-2233.206542
C5746_41600-3223.691864
111  C5746_41670  C5746_41820  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias  Product
C5746_41670225-2.504397
C5746_41675223-1.241444
C5746_41680124-2.513391
C5746_41685225-2.722270
C5746_41690123-2.647694
C5746_41695-219-1.414096
C5746_41700016-0.809970
C5746_41705016-1.277994
C5746_41710015-0.746508
C5746_41715113-0.538994
C5746_41720016-0.115668
C5746_41725313-0.781635
C5746_41730215-0.880985
C5746_41735215-1.386390
C5746_41740315-0.762646
C5746_41745315-1.097435
C5746_41750316-1.612632
C5746_41755315-1.456036
C5746_41760-114-1.047708
C5746_41765-116-1.025927
C5746_41770-115-1.648318
C5746_41775015-1.384048
C5746_41780-115-1.063883
C5746_41785-116-1.157692
C5746_41790022-2.359670
C5746_41795420-2.981778
C5746_41800518-1.529532
C5746_41805319-2.296093
C5746_41810320-2.697600
C5746_41815121-2.468231
C5746_41820219-2.764171
112  C5746_42110  C5746_42190  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias  Product
C5746_42110118-3.709886
C5746_42115017-3.460279
C5746_42120023-3.386165
C5746_42125222-4.460308
C5746_42130122-3.663343
C5746_42135021-3.324777
C5746_42140221-3.694919
C5746_42145322-3.281431
C5746_42150321-3.324777
C5746_42155321-2.752638
C5746_42160223-3.502992
C5746_42165323-2.954571
C5746_42170022-1.307041
C5746_42175120-1.443057
C5746_42180017-1.220898
C5746_42185113-0.673859
C5746_421902121.010423
113  C5746_42420  C5746_42510  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias  Product
C5746_424202131.011957
C5746_424252210.062513
C5746_42430321-1.690094
C5746_42435321-2.268811
C5746_42440118-4.454721
C5746_42445116-4.125763
C5746_42450117-3.209934
C5746_42455115-2.444860
C5746_42460214-1.934682
C5746_424655140.423380
C5746_424706140.831341
C5746_424754141.863395
C5746_424804131.458753
C5746_424854100.623890
C5746_42490513-0.177854
C5746_42495412-1.755042
C5746_42500314-1.673067
C5746_42505314-1.110755
C5746_42510214-0.739494
114  C5746_42595  C5746_42675  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias  Product
C5746_42595313-0.232867
C5746_42600314-0.617130
C5746_42605415-0.783876
C5746_42610319-0.400801
C5746_426153250.340957
C5746_42620028-0.089262
C5746_426250300.417507
C5746_42630-1270.370461
C5746_426352290.066203
C5746_42640329-0.413879
C5746_426454250.176288
C5746_42650318-0.130241
C5746_426553160.160647
C5746_42660112-0.766245
C5746_42665-215-1.581049
C5746_42670-315-2.587565
C5746_42675-220-3.007317
115  C5746_42760  C5746_42850  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias  Product
C5746_42760214-0.154147
C5746_427652120.486151
C5746_42770112-0.235666
C5746_42775212-0.470486
C5746_42780111-0.339703
C5746_42785112-0.717499
C5746_42790011-0.377449
C5746_427952100.195000
C5746_428001112.452210
C5746_428051113.045805
C5746_428101112.955693
C5746_428152113.238212
C5746_428202113.457422
C5746_428251133.380342
C5746_428302152.252181
C5746_428352130.917647
C5746_428402141.724759
C5746_428452132.489962
C5746_428503142.580229
116  C5746_42900  C5746_42985  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias  Product
C5746_42900-112-3.231988
C5746_42905112-3.265894
C5746_42910013-4.034546
C5746_42915110-3.793860
C5746_4292019-4.464926
C5746_4292519-3.962678
C5746_4293009-3.530270
C5746_42935011-3.605699
C5746_42940-19-2.925217
C5746_42945016-3.161128
C5746_42950113-2.420594
C5746_42955211-2.187255
C5746_42960416-2.586320
C5746_42965517-1.695596
C5746_42970418-1.651168
C5746_42975317-2.156854
C5746_42980315-1.754034
C5746_42985218-1.365822
117  C5746_00490  C5746_00505  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
C5746_00490010-0.508978multidrug transporter
C5746_004950100.140569sensor histidine kinase
C5746_00500-1100.504240DNA-binding response regulator
C5746_00505-2101.113134DNA-binding response regulator
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_00515  ACRIFLAVINRP  68  9e-14  Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 67.9 bits (166), Expect = 9e-14
Identities = 51/226 (22%), Positives = 86/226 (38%), Gaps = 21/226 (9%)

Query: 487 RIDLYPTADPQSQQARDLASGPVRAAV---AQHTPAGTTAHVGGTAAIYADISTAVDHDL 543
R + P+ + Q + A +SG A + A PAG G +
Sbjct: 817 RYNGLPSMEIQGEAAPGTSSGDAMALMENLASKLPAGIGYDWTGMSYQERLSGNQ----A 872

Query: 544 KIVFPVAAALIALILLLLLRSLLAPVILMLSVGLGFAATLGAATLLFQHVLGKPGVNFSL 603
+ ++ ++ L L L S PV +ML V LG L AATL +
Sbjct: 873 PALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLF--------NQKNDV 924

Query: 604 P-LVLFLFVVALGTDYNILISDRIREEMQRPG-PARAAVARA-VQHTTPAIATAGLVLAG 660
+V L + L ILI + ++ M++ G A A P + T+ + G
Sbjct: 925 YFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILG 984

Query: 661 SFAT-LATTPGNE-QIAFATT-LGILLSALVLSLVLVPALTALLGR 703
++ G+ Q A +G ++SA +L++ VP ++ R
Sbjct: 985 VLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVIRR 1030



Score = 53.3 bits (128), Expect = 3e-09
Identities = 56/298 (18%), Positives = 111/298 (37%), Gaps = 46/298 (15%)

Query: 204 MGLILLINVLVFRSVLAALLPLVAVAMIIGVAGGAVAGAAILTGRKLDAGTPDLISVVL- 262
+ L+ L+ L +++ A L+P +AV +++ +A G ++ T + +VL
Sbjct: 348 IMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILA----AFGYSINTLT--MFGMVLA 401

Query: 263 LGIGIDYLLFLL---FRFREQLRARPEQSAREAAGQVSGRVGTAITSAALTIVAGF---A 316
+G+ +D + ++ R + + P+++ ++ Q+ G A+ A+ + A F A
Sbjct: 402 IGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQG----ALVGIAMVLSAVFIPMA 457

Query: 317 TLGVATFGQFRSLGPAIAVAVLVMLLGSLTLLPALLAAAGRKMFWPSKALGHEPDEGRAA 376
G +T +R I A+ + +L +L L PAL A + P A HE G
Sbjct: 458 FFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCA----TLLKPVSAEHHENKGGFFG 513

Query: 377 RFG--------------ARVARRPMTMTFASVALLAALAAGLIGI------RMDYGQGNA 416
F ++ ++A + + + D G
Sbjct: 514 WFNTTFDHSVNHYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLT 573

Query: 417 AAKTPAAATATEISRAL-----PAGVSDPTSVFVTATDGGTLTAGRLDGLSRALTQVK 469
+ PA AT + L ++ +V T G +G+ A +K
Sbjct: 574 MIQLPAGATQERTQKVLDQVTDYYLKNEKANVESVFTVNGFSFSGQAQNAGMAFVSLK 631


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_00520  PF06580  32  0.004  Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 32.1 bits (73), Expect = 0.004
Identities = 18/88 (20%), Positives = 33/88 (37%), Gaps = 13/88 (14%)

Query: 311 NARKYA-----GPARASVRLTYRQDRVTVEVRDDGGSTPPQEGASSVGSGGYGLIGMRER 365
N K+ + ++ T VT+EV + G S+ G GL +RER
Sbjct: 266 NGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKEST----GTGLQNVRER 321

Query: 366 VALHGG---TLAVGPQADGGFAVVADLP 390
+ + G + + + G + +P
Sbjct: 322 LQMLYGTEAQIKLSEK-QGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_00525  HTHFIS  71  1e-16  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 71.4 bits (175), Expect = 1e-16
Identities = 28/118 (23%), Positives = 44/118 (37%), Gaps = 2/118 (1%)

Query: 9 PRIRVLIADDQPLVRRGLSLILSPDPSFEVVGEAEDGAQAVALARRLRPDVVVMDIRMPV 68
+L+ADD +R L+ LS ++V + A D+VV D+ MP
Sbjct: 2 TGATILVADDDAAIRTVLNQALS-RAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPD 59

Query: 69 LDGVGATGELAATVPGCRVLALSTFDMDEYVVGALRAGACGFLPKDSSPEDLSTAIRT 126
+ + P VL +S + + A GA +LPK +L I
Sbjct: 60 ENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_00530  HTHFIS  45  6e-08  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 45.2 bits (107), Expect = 6e-08
Identities = 14/85 (16%), Positives = 31/85 (36%), Gaps = 1/85 (1%)

Query: 18 AADGVEAVERARQLRPDVVLMDVRMPRLNGIEATWQLLAESAEPPKVVVITTFENDGYVT 77
++ D+V+ DV MP N + ++ + P V+V++
Sbjct: 33 TSNAATLWRWIAAGDGDLVVTDVVMPDENAFDLLPRIKKARPDLP-VLVMSAQNTFMTAI 91

Query: 78 AALSAGASGFVLKRLPVPQIAEAVR 102
A GA ++ K + ++ +
Sbjct: 92 KASEKGAYDYLPKPFDLTELIGIIG 116


118  C5746_00725  C5746_00805  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
C5746_00725510-0.1778543-oxoacyl-ACP reductase
C5746_007304130.623890TetR family transcriptional regulator
C5746_007354141.458753transposase
C5746_007403141.863395IS200/IS605 family transposase
C5746_007455140.831341GNAT family N-acetyltransferase
C5746_007505140.423380hypothetical protein
C5746_00755115-1.934682protein phosphatase
C5746_00760117-2.444860alpha/beta hydrolase
C5746_00770118-4.125763alpha/beta hydrolase
C5746_00780321-2.268811TetR family transcriptional regulator
C5746_007902130.062513hypothetical protein
C5746_007951131.011957hypothetical protein
C5746_008000150.879158MarR family transcriptional regulator
C5746_00805-1161.315746MFS transporter
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_00760  DHBDHDRGNASE  113  2e-32  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 113 bits (283), Expect = 2e-32
Identities = 74/262 (28%), Positives = 119/262 (45%), Gaps = 28/262 (10%)

Query: 10 AVVTGASRGIGLATVQALTAEGVRVVAAARTITPELKETGAL--------AIPVDLLTPD 61
A +TGA++GIG A + L ++G + A K +L A P D+
Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDSA 70

Query: 62 APAQLIDRATTELGDLDLLVNNVGGGDGGEGQTGGFLSFTDQQWQQSLDLNFLAAVRTSR 121
A ++ R E+G +D+LVN G + G S +D++W+ + +N SR
Sbjct: 71 AIDEITARIEREMGPIDILVNV-----AGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 122 AALPSLLRRR-GALVNISSNGARMPHAGPVTYTTAKAALTAFGKALAEEFGPQGVRINTI 180
+ ++ RR G++V + SN A +P Y ++KAA F K L E +R N +
Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185

Query: 181 SPGPVRTAM----WESPDGYGAELARSMGVTQEQLLAQIPAAMGMTTGRLLEPSEVATAV 236
SPG T M W +G + S+ E IP +L +PS++A AV
Sbjct: 186 SPGSTETDMQWSLWADENGAEQVIKGSL----ETFKTGIP------LKKLAKPSDIADAV 235

Query: 237 AYLASPLAASMSGTDLLIDGGS 258
+L S A ++ +L +DGG+
Sbjct: 236 LFLVSGQAGHITMHNLCVDGGA 257


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_00765  HTHTETR  55  2e-11  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 54.6 bits (131), Expect = 2e-11
Identities = 36/199 (18%), Positives = 68/199 (34%), Gaps = 16/199 (8%)

Query: 6 RALRADTERTVRTILEAAERVLAADP--AATMEQIAAAAGVARTTVHRRFATREALVEAL 63
R + + + T + IL+ A R+ + + ++ +IA AAGV R ++ F + L +
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 64 ATWATERFHQAV-EAARPLTSPPLVALHQVTANVLQVKI------GWSFAMSRTAPSDPE 116
+ + E PL L ++ +VL+ + + E
Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGE 122

Query: 117 AARVHA-------DVLAQCDQLFRRAQEAGLVSADTDLEWARRVYYALIHEASEEGREVV 169
A V + + +Q + EA ++ AD A + I E
Sbjct: 123 MAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAP 182

Query: 170 DIGAVDQLAARVVDTLLHG 188
+ + A V LL
Sbjct: 183 QSFDLKKEARDYVAILLEM 201


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_00785  SACTRNSFRASE  36  4e-05  Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 35.7 bits (82), Expect = 4e-05
Identities = 20/92 (21%), Positives = 33/92 (35%), Gaps = 7/92 (7%)

Query: 60 EQLRACDESFLGVRDELRLVGAVAWTRLPNGALDICRLVVHPVAHRRGVATALLDALDSI 119
+ ++ E +G + NG I + V ++GV TALL +I
Sbjct: 58 SYVEEEGKAAFLYYLENNCIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHK--AI 115

Query: 120 E-----PAELTIVSTGTANLPAVALYRRRGFI 146
E ++ T N+ A Y + FI
Sbjct: 116 EWAKENHFCGLMLETQDINISACHFYAKHHFI 147


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_00800  BACYPHPHTASE  28  0.019  Salmonella/Yersinia modular tyrosine phosphatase si...
		>BACYPHPHTASE#Salmonella/Yersinia modular tyrosine phosphatase signature.

Length = 468

Score = 27.8 bits (61), Expect = 0.019
Identities = 9/34 (26%), Positives = 18/34 (52%)

Query: 91 CGGGVGRTGTALSAICVFEGMDPKEAVKWVRQNY 124
C GVGRT + A+C+ + + + +V+ +
Sbjct: 403 CRAGVGRTAQLIGAMCMNDSRNSQLSVEDMVSQM 436


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_00815  CHLAMIDIAOM6  30  0.015  Chlamydia cysteine-rich outer membrane protein 6 si...
		>CHLAMIDIAOM6#Chlamydia cysteine-rich outer membrane protein 6 signature.

Length = 547

Score = 30.0 bits (67), Expect = 0.015
Identities = 16/54 (29%), Positives = 28/54 (51%), Gaps = 2/54 (3%)

Query: 255 IVGNALGFDAMPSTAEREEAFAAFDLRPLLDQDANGPMLVVNGTEDVHVPLDDT 308
I GN + FD++P +E + L+ + DA G ++ + T + VP+ DT
Sbjct: 490 ITGNTVVFDSLPRLGSKETVEFSVTLKAVSAGDARGEAILSSDT--LTVPVSDT 541


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_00820  HTHTETR  66  1e-15  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 66.2 bits (161), Expect = 1e-15
Identities = 29/205 (14%), Positives = 65/205 (31%), Gaps = 19/205 (9%)

Query: 11 RRPGGRAARVRQAVLAATMEVLAEEGIARLSIAEVAARAGVNETTVYRRWGSREKLVLDA 70
R+ A RQ +L + + +++G++ S+ E+A AGV +Y + + L +
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 71 M------LAGSDEGIPVPDTGTVRTDLAAFARALTEYLATPTGRAVARAASLSSDD---- 120
+ + G + L + E T R + +
Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGE 122

Query: 121 -PDLAEAWQTFWQSRLDQAGAIVSRAVERGELPTDTDAALALELLCSPLQTRSLLGHRPI 179
+ +A + D+ + +E LP D A ++ + L+ +
Sbjct: 123 MAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYIS--GLMENWLF 180

Query: 180 EPDLPERLT------DLVLDGLRGR 198
P + ++L+
Sbjct: 181 APQSFDLKKEARDYVAILLEMYLLC 205


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_00840  TCRTETA  62  1e-12  Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 62.1 bits (151), Expect = 1e-12
Identities = 93/390 (23%), Positives = 150/390 (38%), Gaps = 31/390 (7%)

Query: 3 LALLALAIGAFGIGTTEFVIMGVLPQVAGDFGVTIPAA---GWLVSGYALGVVFGAPLLT 59
+ L +A+ A GIG +IM VLP + D + G L++ YAL AP+L
Sbjct: 9 VILSTVALDAVGIG----LIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLG 64

Query: 60 VLGTKVSRKKMLMFLMTLFVIGNALSAIAPSFGVMLIGRIVSSLAHGAFFGIGSVVAAGL 119
L + R+ +L+ + + A+ A AP V+ IGRIV+ + G+ + A +
Sbjct: 65 ALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYI-ADI 123

Query: 120 VAPEKKASAISLMFMGLTVANIVGVPGGTYIGQAAGWRVTFVIVAALGVIGFLGVAKLVP 179
+++A M + G G +G F AAL + FL L+P
Sbjct: 124 TDGDERARHFGFMSACFGFGMVAGPVLGGLMGG-FSPHAPFFAAAALNGLNFLTGCFLLP 182

Query: 180 ET----GRPDSVTVRTEFAAF------RNVQVWLAMAMTVLGYGGVFAAITYITPMMTEV 229
E+ RP A+F V +A+ + G V AA+ I +
Sbjct: 183 ESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVI--FGEDR 240

Query: 230 AGYTEGAVTWLLVLFGI-GMFLGNLLGGKFADR--RLMPMLLTTLAALTAALLLFTATAH 286
+ + L FGI ++ G A R ++L +A T +LL AT
Sbjct: 241 FHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRG 300

Query: 287 NKVLAAITLSLIGALGFATVPPLQKWVLDQASAAPTLASAANIGAFNLGNALAAWLGGVV 346
+ L G +G +P LQ + Q G+ +L + +G ++
Sbjct: 301 WMAFPIMVLLASGGIG---MPALQAMLSRQVDEE---RQGQLQGSLAALTSLTSIVGPLL 354

Query: 347 IAAGLGYTSPNWVGAI-LSGTALLLALLAA 375
A + W G ++G AL L L A
Sbjct: 355 FTAIYAASITTWNGWAWIAGAALYLLCLPA 384


119  C5746_01005  C5746_01040  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
C5746_01005-1122.690146two-component sensor histidine kinase
C5746_01010-2132.437273DNA-binding response regulator
C5746_01015-1132.579517hypothetical protein
C5746_010202220.909113hypothetical protein
C5746_010253240.374447DNA-binding response regulator
C5746_01030226-1.062494two-component sensor histidine kinase
C5746_01035119-1.426524hypothetical protein
C5746_01040018-1.067595hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_01015  PF06580  43  1e-06  Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 42.9 bits (101), Expect = 1e-06
Identities = 18/83 (21%), Positives = 34/83 (40%), Gaps = 10/83 (12%)

Query: 320 IIQESVTNVVRHA---RADACRVTVDYRA--GDLAIEVADRGRG--DGPASGTGYGLAGM 372
++Q V N ++H ++ + G + +EV + G TG GL +
Sbjct: 259 LVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGTGLQNV 318

Query: 373 RERVALLHGE---FSAAPRPGGG 392
RER+ +L+G + + G
Sbjct: 319 RERLQMLYGTEAQIKLSEKQGKV 341


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_01020  HTHFIS  81  4e-20  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 81.4 bits (201), Expect = 4e-20
Identities = 28/114 (24%), Positives = 47/114 (41%), Gaps = 2/114 (1%)

Query: 4 RVALVDDQPLVRTALQMVITEAPDLEVVGQAGTGEEAVRLVAAVRPDLVVMDIRMPGMDG 63
+ + DD +RT L ++ A +V R +AA DLVV D+ MP +
Sbjct: 5 TILVADDDAAIRTVLNQALSRA-GYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 64 IEATRRITEGDDGPQVIVLTTFDDDDYVYGALRAGASGFLVKDMALDDILAAIR 117
+ RI + V+V++ + A GA +L K L +++ I
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_01040  HTHFIS  54  8e-11  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 54.1 bits (130), Expect = 8e-11
Identities = 26/113 (23%), Positives = 42/113 (37%), Gaps = 2/113 (1%)

Query: 4 ILVVDDDALVRLGLVDLLSTDPELTVVAEAADGLAAVEQASGHRVDVALVDVRMPRMDGI 63
ILV DDDA +R L LS + A + D+ + DV MP +
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAA--TLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 64 AATRRLRALPDPPRVITLTTFDLDEYVYDALAAGADGFLLKDTEPREIIRAVH 116
R++ V+ ++ + A GA +L K + E+I +
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_01060  FLGMRINGFLIF  30  0.005  Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 30.3 bits (68), Expect = 0.005
Identities = 19/74 (25%), Positives = 30/74 (40%), Gaps = 3/74 (4%)

Query: 106 GQFGSLISELREVRYAPVHFDAAEDLSYWRARVADKVSVGATALTGPTADPSRR---VQL 162
G+ I L V+ A VH + + R + + SV T G D + V L
Sbjct: 137 GELARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRALDEGQISAVVHL 196

Query: 163 VNAPGAEVGPGQIA 176
V++ A + PG +
Sbjct: 197 VSSAVAGLPPGNVT 210


120  C5746_01780  C5746_01830  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
C5746_01780-111-3.171808MFS transporter
C5746_01785-38-2.222683RNA 2',3'-cyclic phosphodiesterase
C5746_01790-18-0.886728peptidase S8
C5746_01795-28-0.737854serine/threonine protein phosphatase
C5746_01805070.371339hypothetical protein
C5746_01810070.284895hypothetical protein
C5746_01815-1100.418658hypothetical protein
C5746_01825-1110.435602hypothetical protein
C5746_01830-216-1.134056hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_01800  TCRTETA  47  9e-08  Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 46.7 bits (111), Expect = 9e-08
Identities = 52/240 (21%), Positives = 83/240 (34%), Gaps = 18/240 (7%)

Query: 66 LLAIFLGAWSDHQAHKRRLLILADLVRAAVLLSVPVAYLSGAVTLGQLYAVALLTGAAGV 125
A LGA SD RR ++L L A +V A ++ A L LY ++ G G
Sbjct: 58 ACAPVLGALSDR--FGRRPVLLVSLAGA----AVDYAIMATAPFLWVLYIGRIVAGITGA 111

Query: 126 LFNTAYPPFFVRLVPRASYVDANSKLSASRSASYVAGPAIGGALVQ-ALTAPVTLA--VD 182
A + + +SA VAGP +GG + + AP A ++
Sbjct: 112 TGAVA-GAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALN 170

Query: 183 ALTFLASAVLISKVSVNEPPAASGSTTAPSLLRRARTGLSFVVRHPVLRAALGCAATVNF 242
L FL L+ + V+ A + +
Sbjct: 171 GLNFLTGCFLL------PESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQL 224

Query: 243 FTFVAGSGLIVLFANRSLGLSAGVIGIAF-GIGATGALLGAVFAPKISRRFGVGRSIVVG 301
V + L V+F A IGI+ G +L A+ ++ R G R++++G
Sbjct: 225 VGQV-PAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLG 283


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_01810  SUBTILISIN  219  1e-65  Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 219 bits (559), Expect = 1e-65
Identities = 100/297 (33%), Positives = 142/297 (47%), Gaps = 34/297 (11%)

Query: 221 QIGTPAAWEAGLSGKGVKVAVLDTGVDPAHPDLKDRVSETKSFIEGQE-----VADRNGH 275
I PA W G+GVKVAVLDTG D HPDLK R+ ++F + E D NGH
Sbjct: 28 MIQAPAVWNQT-RGRGVKVAVLDTGCDADHPDLKARIIGGRNFTDDDEGDPEIFKDYNGH 86

Query: 276 GTHVSSTVGGSGAASDGKEKGVAPGATLAVGKVLSDQGFGSESQIIAGMEWAAKDVHAKI 335
GTHV+ T+ + + GVAP A L + KVL+ QG G II G+ +A + I
Sbjct: 87 GTHVAGTIAATENENGV--VGVAPEADLLIIKVLNKQGSGQYDWIIQGIYYAIEQ-KVDI 143

Query: 336 VSMSLGSSQGSDGTDPMAQAVNTLSKDTGALFVIAAGNAGAPG----TIGSPGAADSALT 391
+SMSLG G + + +AV L + AAGN G +G PG + ++
Sbjct: 144 ISMSLG---GPEDVPELHEAVKKAVAS-QILVMCAAGNEGDGDDRTDELGYPGCYNEVIS 199

Query: 392 IGAVDSADRRASFSSQGPRLGDNALKPDLSAPGVDILAARSQLVSGSGPYTSMSGTSMAT 451
+GA++ + FS+ + DL APG DIL+ G Y + SGTSMAT
Sbjct: 200 VGAINFDRHASEFSNSNN-------EVDLVAPGEDILSTVPG-----GKYATFSGTSMAT 247

Query: 452 PHVAGVAALLAEK-----HPDWSGQQLKDGLMSTSKQISGTSYEVGAGRVDVPSAIA 503
PHVAG AL+ + D + +L L+ + + + G G + + +
Sbjct: 248 PHVAGALALIKQLANASFERDLTEPELYAQLIKRTIPLGNSPKMEGNGLLYLTAVEE 304


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_01820  HTHFIS  34  8e-04  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 34.0 bits (78), Expect = 8e-04
Identities = 25/141 (17%), Positives = 54/141 (38%), Gaps = 8/141 (5%)

Query: 194 RGVRYRAVYAPEALEWPGVLDDIRELVRHGEQARVL---PGLGIKLAIADRRLALMPLSL 250
+ A+ +A WPG ++REL + L + ++ + R + +
Sbjct: 336 KRFDQEALELMKAHPWPG---NVRELENLVRRLTALYPQDVITREIIENELRSEIPDSPI 392

Query: 251 DLNDVRAAVIRPSTLLDALTGYWEMCWKQALPLNAPAEDPLGEEDRLVL--TLLVSGLKD 308
+ R+ + S ++ + + ALP + + L E + ++ L +
Sbjct: 393 EKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQ 452

Query: 309 EAIARQLGWSVRTMRRRISRL 329
A LG + T+R++I L
Sbjct: 453 IKAADLLGLNRNTLRKKIREL 473


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
C5746_01825  SUBTILISIN  178    2e-51    Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 178 bits (453), Expect = 2e-51
Identities = 86/227 (37%), Positives = 117/227 (51%), Gaps = 26/227 (11%)

Query: 183 VPLIGAPGVWQRKDPAGSRVTGKGTTVAILDSGVDYTHPDLGGGLGKGHKVVGGHDFVNG 242
V +I AP VW + G+G VA+LD+G D HPDL +++GG +F +
Sbjct: 26 VEMIQAPAVWNQ-------TRGRGVKVAVLDTGCDADHPDLKA------RIIGGRNFTDD 72

Query: 243 DE----DPKDDNGHGTHVAGIIAGRAAEKGGVTGVAPGANLLAYKVMDADGSGYTSDIIA 298
DE KD NGHGTHVAG IA E G V GVAP A+LL KV++ GSG II
Sbjct: 73 DEGDPEIFKDYNGHGTHVAGTIAATENENGVV-GVAPEADLLIIKVLNKQGSGQYDWIIQ 131

Query: 299 GVEAASDPANPHRADVINMSLGGPGDGTDPLGRAATAAVQAGVVVVAAAGNDGPG---TG 355
G+ A + D+I+MSLGGP D L A AV + ++V+ AAGN+G G T
Sbjct: 132 GI----YYAIEQKVDIISMSLGGPEDV-PELHEAVKKAVASQILVMCAAGNEGDGDDRTD 186

Query: 356 TVSSPATADGVVAVGASTSNLRLPSAYLAGKKPELIQTYRGILSANP 402
+ P + V++VGA + + + +L+ ILS P
Sbjct: 187 ELGYPGCYNEVISVGAINFDRHASEFSNSNNEVDLVAPGEDILSTVP 233



Score = 80.7 bits (199), Expect = 4e-18
Identities = 42/117 (35%), Positives = 58/117 (49%), Gaps = 16/117 (13%)

Query: 542 VTLRGTDTTDRIASFSSRGPAQDFGSKPDLVAPGVEIRSTVPKALYGPGQYRMSGTSMAA 601
+++ + + FS+ + DLVAPG +I STVP G SGTSMA
Sbjct: 198 ISVGAINFDRHASEFSNSNN------EVDLVAPGEDILSTVP----GGKYATFSGTSMAT 247

Query: 602 PHVAGAAALLRQL-----HPGRTPAEIKAELIGTAKPLSGTGPTTQGSGRLDVAAAA 653
PHVAGA AL++QL T E+ A+LI PL + P +G+G L + A
Sbjct: 248 PHVAGALALIKQLANASFERDLTEPELYAQLIKRTIPLGNS-PKMEGNGLLYLTAVE 303


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_01840  TCRTETB  37     8e-05    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 37.2 bits (86), Expect = 8e-05
Identities = 33/157 (21%), Positives = 62/157 (39%), Gaps = 4/157 (2%)

Query: 18 LRENWAQFTLLVIVTASVGSLVGLERTTVPLIGTDVFGLTSNLAVFSFIVAFGLAKALTN 77
LR N L ++ SV + + L ++P I D F + AF L ++
Sbjct: 10 LRHNQILIWLCILSFFSVLNEMVLN-VSLPDIAND-FNKPPASTNW-VNTAFMLTFSIGT 66

Query: 78 LAAGALTARFRRRQLLLAGWLIGVPVPFVLAWGPSWW-WVLAANVLLGVNQGMTWSMTVN 136
G L+ + ++LLL G +I + G S++ ++ A + G ++ +
Sbjct: 67 AVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMV 126

Query: 137 MKIDLVGPARRGLATGLNEAAGYVAVGTTALLTGYLA 173
+ + RG A GL + + G + G +A
Sbjct: 127 VVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIA 163


121  C5746_03375  C5746_03410  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_03375  -3      10       -0.155815  3-ketoacyl-ACP reductase
C5746_03380  -3      10       0.142160   NADP-dependent isocitrate dehydrogenase
C5746_03385  -2      10       0.113618   glutathione-dependent reductase
C5746_03390  -2      11       -1.895957  MFS transporter
C5746_03395  -1      11       -2.425396  MFS transporter
C5746_03400  -1      12       -2.425524  nitrite reductase small subunit
C5746_03405  -2      11       -2.503229  nitrite reductase (NAD(P)H)
C5746_03410  -2      10       -2.097541  NAD(P)/FAD-dependent oxidoreductase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_03420  DHBDHDRGNASE  122    7e-36    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 122 bits (306), Expect = 7e-36
Identities = 78/253 (30%), Positives = 122/253 (48%), Gaps = 13/253 (5%)

Query: 2 LTGRNAVVTGGSRGIGRAIVERLCRDGAHVVFNYATSDDAAEEVVRTVQDNGGKAWAIRL 61
+ G+ A +TG ++GIG A+ L GAH+ + + E+VV +++ A A
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIA-AVDYNPEKLEKVVSSLKAEARHAEAFPA 64

Query: 62 DLADPDAPEQLMEAAEAQLGGLDILVNNAALCLTPSLIADTDPAVFDKVMTVNTRTVFMT 121
D+ D A +++ E ++G +DILVN A + L P LI ++ +VN+ VF
Sbjct: 65 DVRDSAAIDEITARIEREMGPIDILVNVAGV-LRPGLIHSLSDEEWEATFSVNSTGVFNA 123

Query: 122 IRHAARRMRD--GGRIVNISTANTVRPGPGISAYAASKGAVEQLTTIAAHELGARGITVN 179
R ++ M D G IV + + P ++AYA+SK A T EL I N
Sbjct: 124 SRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCN 183

Query: 180 TVSPGATDTDLLRG--TNQPQALEAVAGM-------TPLGRMGEPSDVADVVAFLAGHDG 230
VSPG+T+TD+ ++ A + + G PL ++ +PSD+AD V FL
Sbjct: 184 IVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQA 243

Query: 231 RWITGQNVRATGG 243
IT N+ GG
Sbjct: 244 GHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_03435  TCRTETB  123    1e-32    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 123 bits (310), Expect = 1e-32
Identities = 93/421 (22%), Positives = 163/421 (38%), Gaps = 26/421 (6%)

Query: 27 RRWQALAVCLIAGFMTLLDVSIVNVALPSIREGLHTPESDLQWVLSGYSLAFGLFLIPAG 86
R Q L I F ++L+ ++NV+LP I + P + WV + + L F + G
Sbjct: 11 RHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYG 70

Query: 87 RLGDARGRRVVFMAGLALFTLASAACGAAQSSL-WLVVARLVQGLAGGLISPQISALIQQ 145
+L D G + + + G+ + S S L++AR +QG + ++ +
Sbjct: 71 KLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVAR 130

Query: 146 MFSGQERGRAFGMFGSVVAISTAVGPLSGGLLIQAAGAEEGWRWVFYVNLPLGAVCLLLA 205
+ RG+AFG+ GS+VA+ VGP GG++ W + + +P+ + +
Sbjct: 131 YIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIH------WSYLLLIPMITIITVPF 184

Query: 206 RRLLPDTPSAGRVRLRDLDPLGVLLLGTGVLALLLPFVQSQQWPGNLKWLLLVAAVTLLA 265
L R++ D G++L+ G++ +L F S L+ +V
Sbjct: 185 LMKLL--KKEVRIK-GHFDIKGIILMSVGIVFFML-FTTSY------SISFLIVSVLSFL 234

Query: 266 AFVGWESRCGRRGTRPVVDLSLFRVRSYWLGCLLSLLYFAGFTSIFFITTLYLQSGLHYT 325
FV + R+ T P VD L + + +G L + F + ++ +
Sbjct: 235 IFV----KHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLS 290

Query: 326 ALQAGLAITPFAVGAALTASP-GGRLVGRFGRPLVLTGLSTVAVGLAATALAAHLVPGRG 384
+ G I + + GG LV R G VL T L+ + L A +
Sbjct: 291 TAEIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTF---LSVSFLTASFLL-ET 346

Query: 385 AGWAMVAPLLLAGVGSGLVVAPNQTLTLSQVPVAGAGSAGGTLQTGQRVGSAIGIAAVGS 444
W M ++ G T+ S + AG+ L + GIA VG
Sbjct: 347 TSWFMTIIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGG 406

Query: 445 M 445
+
Sbjct: 407 L 407


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_03440  TCRTETB  33     0.004    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 32.5 bits (74), Expect = 0.004
Identities = 42/209 (20%), Positives = 64/209 (30%), Gaps = 28/209 (13%)

Query: 248 IGTFGSFIGFGFAFGQVLQV----QFHAQFDTPVKAACLTFLGPLLGSLSRPLGGLLADR 303
IG I FG G V V + Q T + + F G + + +GG+L DR
Sbjct: 260 IGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGILVDR 319

Query: 304 FGGARVTFWTFAAMA---LGAGVVLEASRQHSLALFLCGFVALFVLSGAGNGSTYKMIPA 360
G V ++ L A +LE + V + ++
Sbjct: 320 RGPLYVLNIGVTFLSVSFLTASFLLETTSW----FMTIIIVFVLGGLSFTKTVISTIVS- 374

Query: 361 IFRARARAEIAGGTSPAEAEQRSRRRTTALIGVAGAIGAFGGVLVNLAFRQSFLETHNGQ 420
+ + E G S T+ + I GG+L Q L Q
Sbjct: 375 --SSLKQQEAGAGMSLLN--------FTSFLSEGTGIAIVGGLLSIPLLDQRLLPMEVDQ 424

Query: 421 AAYL------AFLGYYGLCLVVTWAFYLR 443
+ YL F G + +VT Y
Sbjct: 425 STYLYSNLLLLFSGIIVISWLVTLNVYKH 453


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
C5746_03455  OMADHESIN  29     0.047    Yersinia outer membrane adhesin signature.
		>OMADHESIN#Yersinia outer membrane adhesin signature.

Length = 455

Score = 28.7 bits (63), Expect = 0.047
Identities = 21/61 (34%), Positives = 28/61 (45%), Gaps = 1/61 (1%)

Query: 102 PVLPPLRGLHAPDGLKAGVHAFRTLADCARLAEGAAGAERAVVIGGGVLGVSAARALAAL 161
PV PP+ G + G+H+ + A A+GAA A A I GV V+ AL
Sbjct: 52 PVRPPVPGAGGLNASAKGIHSI-AIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKAL 110

Query: 162 G 162
G
Sbjct: 111 G 111


122  C5746_03755  C5746_03775  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias   Product
C5746_03755  0       12       2.051567  GDP-fucose synthetase
C5746_03760  1       11       1.577643  GDP-mannose 4,6-dehydratase
C5746_03765  1       11       1.564112  hypothetical protein
C5746_03770  0       10       1.153821  hypothetical protein
C5746_03775  -1      11       0.718302  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_03850  NUCEPIMERASE  97     3e-25    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 97.2 bits (242), Expect = 3e-25
Identities = 62/343 (18%), Positives = 119/343 (34%), Gaps = 57/343 (16%)

Query: 22 RIFVAGHRGLVGSAVVRRLIAAGHEVI-------------TRDRDHL-----------DL 57
+ V G G +G V +RL+ AGH+V+ + R L DL
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 58 RDAARTEAFLREVRPDAVVLAAAKVGGIMANNTYPVQFLEDNLRIQLSVIAGAHAAGAER 117
D + V ++ + + + P + + NL L+++ G +
Sbjct: 62 ADREGMTDLFASGHFERVFISPHR-LAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120

Query: 118 LLFLGSSCIYPKRAPQPIPESSLLTGPLEPTNEAYALAKIAGIVQTQSYRRQFGASYISA 177
LL+ SS +Y P + P+ YA K A + +Y +G
Sbjct: 121 LLYASSSSVYGLNRKMPFSTDDSVDHPVSL----YAATKKANELMAHTYSHLYGLPATGL 176

Query: 178 MPTNLYGPGDNFDLETSHVLPALIRRFHEAGRDGSPVVTLWGSGSPRREFLHVDDLAAAC 237
+YGP D+ + +F +A +G + ++ G +R+F ++DD+A A
Sbjct: 177 RFFTVYGPWGRPDMA--------LFKFTKAMLEGKSID-VYNYGKMKRDFTYIDDIAEAI 227

Query: 238 LLLLERYDGDEP------------------VNVGCGEDLTIHELANIVGDVTEYQGCVEW 279
+ L + + N+G + + + + D +
Sbjct: 228 IRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNM 287

Query: 280 DTSKPDGTPRKLLDVSRL-TSLGFAPQIPLRDGIARTYAWWLE 321
+P D L +GF P+ ++DG+ W+ +
Sbjct: 288 LPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRD 330


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_03855  NUCEPIMERASE  100    3e-26    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 100 bits (250), Expect = 3e-26
Identities = 71/328 (21%), Positives = 119/328 (36%), Gaps = 34/328 (10%)

Query: 6 LITGVTGQDGSYLSELLLEKGYTVHGLIRRSSSFNTERIDHIYQGPEEENRSFVLHHADL 65
L+TG G G ++S+ LLE G+ V G+ + ++ + + F DL
Sbjct: 4 LVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFH--KIDL 61

Query: 66 ADGVALVNLLRDIRPDEVYNLGAQSHVRVSFDAPLYTGDITGLGTIRLLEAVRASGIDTR 125
AD + +L + V+ + VR S + P D G + +LE R + I
Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQHL 121

Query: 126 IYQASSSEMFGATPPPQNEKTPF-------HPRSPYSVAKVYSYWATVNYREAYGMFAVN 178
+Y ASSS ++G N K PF HP S Y+ K + Y YG+ A
Sbjct: 122 LY-ASSSSVYGL-----NRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATG 175

Query: 179 GILFNHESPRRGETFVTRKITRGVARIRAGLQTHLHLGNLDAVRDWGYAPEYVDAMWRML 238
F P K T+ + G ++ RD+ Y + +A+ R+
Sbjct: 176 LRFFTVYGPWGRPDMALFKFTK---AMLEGKSIDVY-NYGKMKRDFTYIDDIAEAIIRLQ 231

Query: 239 QCDNPDDYVVATGEGVSVRQFLEYAFAHAG-------LDW----REHVRYDAKYE----R 283
D G Y + G +D+ + + +AK +
Sbjct: 232 DVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNMLPLQ 291

Query: 284 PSEVDALIGDAAKAEELLGWKPAVKSRE 311
P +V D E++G+ P ++
Sbjct: 292 PGDVLETSADTKALYEVIGFTPETTVKD 319


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_03865  SURFACELAYER  31     0.017    Lactobacillus surface layer protein signature.
		>SURFACELAYER#Lactobacillus surface layer protein signature.

Length = 439

Score = 31.2 bits (70), Expect = 0.017
Identities = 40/196 (20%), Positives = 64/196 (32%), Gaps = 27/196 (13%)

Query: 16 ARAAAAALCLLAGALTAAASGGSPAAAL------TPPVSLTADDLTTWQTNGIVWSMAAS 69
A AA A+ + A A S + TP +S A + I S+ S
Sbjct: 19 APIAATAMPVNAATTINADSAINANTNAKYDVDVTPSISAIAAVAKSDTMPAIPGSLTGS 78

Query: 70 -----------------DAGVVYTGGTFSTVRPPDAAAGTSEQPAVNFAAFDAATGAPTG 112
T +TV+P + A + V +F+ + G
Sbjct: 79 ISASYNGKSYTANLPKDSGNATITDSNNNTVKPAELEADKAYTVTVPDVSFNFGS-ENAG 137

Query: 113 CSLSFTLSSGTATVRALALSPDQKTLYVG-GQFGAVSG--VGVSNIAAIDTATCTPRQGF 169
++ ++ T T+ V Q G V + N+ AIDT + +
Sbjct: 138 KEITIGSANPNVTFTEKTGDQPASTVKVTLDQDGVAKLSSVQIKNVYAIDTTYNSNVNFY 197

Query: 170 KVSVSATVRALAVTAD 185
V+ ATV AV+ D
Sbjct: 198 DVTTGATVTTGAVSID 213


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
C5746_03870  FLGHOOKAP1  33     0.001    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 32.6 bits (74), Expect = 0.001
Identities = 23/73 (31%), Positives = 32/73 (43%), Gaps = 16/73 (21%)

Query: 129 VTGEIADAKPAVEHLCTQHEELVREASQGYADGTYKDEQIRPGRYHDVTPTKNCTWQITG 188
+TG A A P +L Q ++LV E +Q V+ T+ IT
Sbjct: 185 LTGVGAGASP--NNLLDQRDQLVSELNQ-IVGVE-------------VSVQDGGTYNITM 228

Query: 189 ANGKNLVSGSSAS 201
ANG +LV GS+A
Sbjct: 229 ANGYSLVQGSTAR 241


123  C5746_05095  C5746_05145  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_05095  12      23       -5.365594  TetR family transcriptional regulator
C5746_05100  6       21       -5.582718  transcriptional regulator
C5746_05105  2       20       -3.567378  pyruvate, phosphate dikinase
C5746_05110  4       19       -3.526553  transcriptional regulator
C5746_05115  3       19       -4.075716  GNAT family N-acetyltransferase
C5746_05120  4       23       -4.656446  hypothetical protein
C5746_05125  3       25       -3.910143  hypothetical protein
C5746_05135  0       28       -4.613527  hypothetical protein
C5746_05145  1       28       -4.709445  sporulation protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_05290  HTHTETR  66     1e-14    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 65.8 bits (160), Expect = 1e-14
Identities = 26/88 (29%), Positives = 42/88 (47%), Gaps = 1/88 (1%)

Query: 1 MVRLTRAQQQERTRAAVLAAAKAEFTERGYAAAKVDEIAERAELTRGAVYSNFPSKRALY 60
M R T+ Q+ + TR +L A F+++G ++ + EIA+ A +TRGA+Y +F K L+
Sbjct: 1 MARKTK-QEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLF 59

Query: 61 LAVLVDMVERAAAAEHSDSAKRSGTTEQ 88
+ E AK G
Sbjct: 60 SEIWELSESNIGELELEYQAKFPGDPLS 87


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_05300  PHPHTRNFRASE  83     6e-19    Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 83.3 bits (206), Expect = 6e-19
Identities = 56/247 (22%), Positives = 94/247 (38%), Gaps = 37/247 (14%)

Query: 257 VPFGEALFGRQGEDVVSGSSLTEPLSELADREPEVWTRLLSALTRLEENYRD-ACYVEFT 315
V +A + + +S+T+ +E+ +L +AL + +E R E +
Sbjct: 14 VAIAKAFIHLEPNVDIEKTSITDVSTEIE--------KLTAALEKSKEELRAIKDQTEAS 65

Query: 316 FEAGELWILQVRRGRFVGRAAVRVAVDLADAGTIGRDEALLRVSPQH---LTHVRTPRIT 372
A + I V + + + AL VS + +
Sbjct: 66 MGADKAEIFAAHLLVLDDPELVDGIKGKIENEQMNAEYALKEVSDMFVSMFESMDNEYMK 125

Query: 373 --AIEPRDLFTRGLGASPGVAVGRVATTADSAARLAAGGPVVLLRPETSPLDMHGL--AA 428
A + RD+ R LG GV G +AT A+ V++ + +P D L
Sbjct: 126 ERAADIRDVSKRVLGHLIGVETGSLATIAE---------ETVIIAEDLTPSDTAQLNKQF 176

Query: 429 AAGVVTARGGPTSHAAVVARSMGKPAVVGAANLTVDAADGCVRAGGRTLPEGTLIALDGT 488
G T GG TSH+A+++RS+ PAVVG +T + G ++ +DG
Sbjct: 177 VKGFATDIGGRTSHSAIMSRSLEIPAVVGTKEVTEK------------IQHGDMVIVDGI 224

Query: 489 GGEVVVG 495
G V+V
Sbjct: 225 EGIVIVN 231


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_05305  CHLAMIDIAOM6  29     0.021    Chlamydia cysteine-rich outer membrane protein 6 si...
		>CHLAMIDIAOM6#Chlamydia cysteine-rich outer membrane protein 6 signature.

Length = 547

Score = 28.5 bits (63), Expect = 0.021
Identities = 14/44 (31%), Positives = 25/44 (56%)

Query: 59 EERHGRHRYVRVADRRVVELIESLAALAPQGSARPRSLSASGRQ 102
+ GR V+V D R VE+ +++ A GS P ++A+G++
Sbjct: 86 DSCFGRMYTVKVNDDRNVEITQAVPEYATVGSPYPIEITATGKR 129


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_05310  SACTRNSFRASE  29     0.007    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 28.8 bits (64), Expect = 0.007
Identities = 11/49 (22%), Positives = 17/49 (34%), Gaps = 2/49 (4%)

Query: 81 PQLRGSGLGREILRQAEDEAHARGCRTAVLYTITFQAPG--FYHKQGWK 127
R G+G +L +A + A +L T FY K +
Sbjct: 99 KDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFI 147


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_05320  PF05844  27     0.027    YopD protein
		>PF05844#YopD protein

Length = 295

Score = 27.3 bits (60), Expect = 0.027
Identities = 17/52 (32%), Positives = 21/52 (40%), Gaps = 2/52 (3%)

Query: 52 LPALPAALPALPAAPSQPNPSYGYQQPAQQGYAPMQPAQLQHAPAPYIPQQP 103
L A AA+P+ P AP S G Q A + P PA P+Q
Sbjct: 8 LAATQAAIPSEPIAPGAAGRSVGTPQAAAEL--PQVPAARADRVELNAPRQV 57


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_05330  PRTACTNFAMLY  37     3e-04    Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 37.0 bits (85), Expect = 3e-04
Identities = 31/138 (22%), Positives = 39/138 (28%), Gaps = 12/138 (8%)

Query: 419 SPAEAGFYVSAEGRGTFHGCRVTGSEGYGFHVMDGCRTTLTRCRTERCARGGYEFAEGGT 478
S A A V T G +TG G M G L R R GG
Sbjct: 214 SGAPAAVSVLGASELTLDGGHITGGRAAGVAAMQGAVVHLQRATIRRGDAPAGGAVPGGA 273

Query: 479 AHGDG-------TGAGPVVE-----DCTSDESALRSPAAPAPTVLTATQSASGLLGAVPG 526
G G GPV++ D + L AP + A + G V G
Sbjct: 274 VPGGAVPGGFGPGGFGPVLDGWYGVDVSGSSVELAQSIVEAPELGAAIRVGRGARVTVSG 333

Query: 527 PRAAEPTPATVPAAEPVR 544
+ P + R
Sbjct: 334 GSLSAPHGNVIETGGARR 351


124  C5746_05205  C5746_05260  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias   Product
C5746_05205  1       14       0.373566  TetR family transcriptional regulator
C5746_05210  0       15       1.204027  hypothetical protein
C5746_05220  -2      16       1.272924  MFS transporter
C5746_05225  -2      16       0.928739  hypothetical protein
C5746_05235  -2      11       1.088195  hypothetical protein
C5746_05245  4       24       2.069734  DNA-binding response regulator
C5746_05250  4       25       1.621600  two-component sensor histidine kinase
C5746_05260  4       25       1.522436  TetR family transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_05390  HTHTETR  66     1e-15    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 65.8 bits (160), Expect = 1e-15
Identities = 28/169 (16%), Positives = 57/169 (33%), Gaps = 13/169 (7%)

Query: 4 RAMSTPDRLIEATQELLWERGYVGTSPKAIQQQAGAGQGSMYHHFAGKPDLALTAIRRTA 63
A T +++ L ++G TS I + AG +G++Y HF K DL +
Sbjct: 8 EAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSE 67

Query: 64 SQMRETARQLF-DGPGTAYERISTYLLR---------ERDVLRGCPVGRLTMDPEVIASD 113
S + E + PG + L+ R +L + E+
Sbjct: 68 SNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQ 127

Query: 114 ELRAPVDETIGWLRGRLAEIIQEGLDQGEFTRLLVPEDVAATIVATVQG 162
+ + + R+ + ++ ++ L+ A + + G
Sbjct: 128 QAQRNLCLE---SYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISG 173


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_05400  TCRTETA  38     5e-05    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 38.3 bits (89), Expect = 5e-05
Identities = 62/283 (21%), Positives = 94/283 (33%), Gaps = 12/283 (4%)

Query: 54 HQTGASASTLGLALLGVSAGAVVTMMLTGRLCRRFGSHPVTVVCGVLLPLGIALPAQTHS 113
+ + G+ L + + G L RFG PV +V + A+ A
Sbjct: 36 VHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPF 95

Query: 114 ALALGLVLLVFGAAYGGMNVAMNSAAVDLVAALRRPVMPSF-HAAFSLGGMVGAGLGGLV 172
L + +V G VA A D+ R F A F G + G LGGL
Sbjct: 96 LWVLYIGRIVAGITGATGAVAGAYIA-DITDGDERARHFGFMSACFGFGMVAGPVLGGL- 153

Query: 173 AGGLSPATHLFVLTGIGLLVTAATGPVLLRQPAPKPASTADDAEKPRQPAGRARRMV--- 229
GG SP F + L TG LL + + R R +
Sbjct: 154 MGGFSPHAPFFAAAALNGL-NFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVV 212

Query: 230 -LLFGVIALCTAYGEGALADWGALHLEQDLHAHPGIAAAGYSLFAL--AMTAGRLSGTAL 286
L V + G+ A W + E H + F + ++ ++G +
Sbjct: 213 AALMAVFFIMQLVGQVPAALW-VIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGP-V 270

Query: 287 LERLGQTRTLVAGGATAAAGMLLGSLAPTTWLALLGFAVTGLG 329
RLG+ R L+ G G +L + A W+A + G
Sbjct: 271 AARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASG 313



Score = 32.1 bits (73), Expect = 0.004
Identities = 42/161 (26%), Positives = 55/161 (34%), Gaps = 33/161 (20%)

Query: 252 LHLEQDLHAHPGIAAAGYSL--FALAMTAGRLSGTALLERLGQTRTLVAGGATAAAGMLL 309
L D+ AH GI A Y+L FA A G LS +R G+ L+ A AA +
Sbjct: 35 LVHSNDVTAHYGILLALYALMQFACAPVLGALS-----DRFGRRPVLLVSLAGAAVDYAI 89

Query: 310 GSLAPTTWLALLGFAVTGLGLANIFPVAVGRAGELAGPGGVAAAST--------LGY--- 358
+ AP W+ +G V G+ A G A T G+
Sbjct: 90 MATAPFLWVLYIGRIVAGI-----------TGATGAVAGAYIADITDGDERARHFGFMSA 138

Query: 359 ---GGMLLGPPAIGFLADWFSLPLALTTVALLAAAAAALGY 396
GM+ GP G + FS A L G
Sbjct: 139 CFGFGMVAGPVLGGLMGG-FSPHAPFFAAAALNGLNFLTGC 178


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
C5746_05425  HTHFIS  52     6e-10    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 51.8 bits (124), Expect = 6e-10
Identities = 24/118 (20%), Positives = 45/118 (38%), Gaps = 6/118 (5%)

Query: 2 IRVLLADDQLLVRAGF-RALLDAQPDIEVAGEAADGEEAVRLVRELRPDTVLMDIRMPHL 60
+L+ADD +R +AL A D+ + + R + D V+ D+ MP
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITS---NAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 DGLAATRAITGDFELSGVKVVMLTTFELDEYVFEAIRSGASGFLVKDTEPEELLRAVR 118
+ I + V++++ +A GA +L K + EL+ +
Sbjct: 61 NAFDLLPRIKK--ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
C5746_05430PF06580290.049 Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 28.7 bits (64), Expect = 0.049
Identities = 57/362 (15%), Positives = 106/362 (29%), Gaps = 80/362 (22%)

Query: 85 FGSSAAAMIYLAAGYPYGPVFLAVAVGCFSAVVSG-----------HRR----------- 122
G + YG L + + + G R+
Sbjct: 18 IGWGVYTLTGFGFASLYGSPKLHSMIFNIAISLMGLVLTHAYRSFIKRQGWLKLNMGQII 77

Query: 123 -AAWTAV---GMVWLGHVLVAHWLYRWLPPSDDH-PAAWGQELG---VAAWVVAIIAAAE 174
A GMVW VA+ L + P A+ L + VV +
Sbjct: 78 LRVLPACVVIGMVWF----VANTSIWRLLAFINTKPVAFTLPLALSIIFNVVVVTFMWSL 133

Query: 175 FVRVRREQWADQRAEREAAEQRR-ADEERLRMARE------LHDVLAHSISVINVQSSVG 227
++AE + + A E +L + + + L ++I
Sbjct: 134 LYFGWHFFKNYKQAEIDQWKMASMAQEAQLMALKAQINPHFMFNAL-NNIR--------- 183

Query: 228 LALLDSDPEQARTALTTIKAASKEALGEVRQVLDTLRTPGDAPRTPAPGLDRLPELVEQA 287
AL+ DP +AR LT++ + +L +L +D +L
Sbjct: 184 -ALILEDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTV-------VDSYLQLASIQ 235

Query: 288 AGAGLTVTVETD-GVRGAVPPGADLAAFRIVQEALTNVVRHSGSRTAQ-----VRIGYGP 341
L + + + P +VQ + N ++H ++ Q ++
Sbjct: 236 FEDRLQFENQINPAIMDVQVP------PMLVQTLVENGIKHGIAQLPQGGKILLKGTKDN 289

Query: 342 ARIRLRIDDEGPATGDDAG-GSGNGLAGMRERAAALRG-----TIEAGPRADGGFRVRAE 395
+ L +++ G + +G GL +RER L G + G
Sbjct: 290 GTVTLEVENTGSLALKNTKESTGTGLQNVRERLQMLYGTEAQIKLSE---KQGKVNAMVL 346

Query: 396 LP 397
+P
Sbjct: 347 IP 348


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_05435  HTHTETR  47     2e-08    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 46.5 bits (110), Expect = 2e-08
Identities = 19/112 (16%), Positives = 42/112 (37%), Gaps = 4/112 (3%)

Query: 5 RGARERARIEVTAAIKGEARKQLAAEGAAKLSLRAVARELGMASSALYRYFPSRDELLTA 64
R ++ A+ E I A + + +G + SL +A+ G+ A+Y +F + +L +
Sbjct: 3 RKTKQEAQ-ETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61

Query: 65 LIVDAYDSVGESAEAAHRAARADAAPHITRWTVVAHAVRDWALAHPHEYALI 116
+ + ++GE D + V + + L+
Sbjct: 62 IWELSESNIGELELEYQAKFPGDPLS---VLREILIHVLESTVTEERRRLLM 110


125  C5746_05670  C5746_05710  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_05670  -3      13       -0.839570  serine/threonine protein kinase
C5746_05675  -2      12       -0.648840  hypothetical protein
C5746_05680  0       17       0.956832   peptidase M28
C5746_05685  -2      18       1.379273   XRE family transcriptional regulator
C5746_05690  -1      20       -0.930901  hypothetical protein
C5746_05695  1       21       -0.514816  hypothetical protein
C5746_05700  1       15       -0.048815  two-component sensor histidine kinase
C5746_05710  -1      16       0.048414   DNA-binding response regulator
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_05835  YERSSTKINASE  36     3e-04    Yersinia serine/threonine protein kinase signature.
		>YERSSTKINASE#Yersinia serine/threonine protein kinase signature.

Length = 732

Score = 35.9 bits (82), Expect = 3e-04
Identities = 50/220 (22%), Positives = 92/220 (41%), Gaps = 33/220 (15%)

Query: 61 LSHPHLVAVFDFGAWEDRFYLVMELVEGRSLGDLLRAEERLGAEQVARIAGQAAAGLAA- 119
L++ H +AV +G ++ L+M+ V+G D LR + + +I +A G
Sbjct: 193 LANVHGMAVVPYGNRKEE-ALLMDEVDGWRCSDTLRT--LADSWKQGKINSEAYWGTIKF 249

Query: 120 -AHR----------QGIVHRDIKPGNLMLD-AEGSVKIGDFGIAQFVDDPSAALTTTGQI 167
AHR G+VH DIKPGN++ D A G + D G+ + T +
Sbjct: 250 IAHRLLDVTNHLAKAGVVHNDIKPGNVVFDRASGEPVVIDLGLHSRSGEQPKGFTES--- 306

Query: 168 VGTSLYLAPERALGRT-AGAASDMYSLGCVIYQLLLG---EPPFRSDTA----TATLYQH 219
+ APE +G A SD++ + + + G P + + T+
Sbjct: 307 -----FKAPELGVGNLGASEKSDVFLVVSTLLHCIEGFEKNPEIKPNQGLRFITSEPAHV 361

Query: 220 VDTAPVPLRQRGV-DLSPAFDSYLLGLLAKQPEDRPSAQQ 258
+D P+ + G+ + A+ ++ +L + RP + +
Sbjct: 362 MDENGYPIHRPGIAGVETAYTRFITDILGVSADSRPDSNE 401


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
C5746_05845  THERMOLYSIN  142    1e-37    Thermolysin metalloprotease (M4) family signature.
		>THERMOLYSIN#Thermolysin metalloprotease (M4) family signature.

Length = 544

Score = 142 bits (360), Expect = 1e-37
Identities = 117/473 (24%), Positives = 193/473 (40%), Gaps = 58/473 (12%)

Query: 71 GMYSVAYQRTYRGLPVVGGDAVVVADSEG--RVRGTQSAVSRRINVPTTPTVSAKSAETT 128
G + +++ +G V + + GT + + T +S + AE
Sbjct: 87 GHTVMRFEQAIAASLCMGAVLVAHVNDGELSSLSGTLIPNLDKRTLKTEAAISIQQAEMI 146

Query: 129 SRKKLT--------TVTRVDSHRLVVRATGKTSRLAWETVLTGRSAKSPSRLHVFVDAGT 180
+++ + RLV+ +T RLA+E + P +DA
Sbjct: 147 AKQDVADRVTKERPAAEEGKPTRLVIYPDEETPRLAYEVNVR-FLTPVPGNWIYMIDAAD 205

Query: 181 GKVLDSYDDVR---------------AGTGNSQWNGPSPLSIDTTASGGSYSLRDPNR-P 224
GKVL+ ++ + G G ++ ++ G Y L+D R
Sbjct: 206 GKVLNKWNQMDEAKPGGAQPVAGTSTVGVGRGVLGDQKYINTTYSSYYGYYYLQDNTRGS 265

Query: 225 GLSCADYSTGSVFSKSSDSWGTGNASSKETGCA-DVMWAAQHEWNMLRDWLGRNGHDGNG 283
G+ D +V S + G + A D + A ++ ++ GR +DG+
Sbjct: 266 GIFTYDGRNRTVLPGSLWADGDNQFFASYDAAAVDAHYYAGVVYDYYKNVHGRLSYDGSN 325

Query: 284 RSWPVKV--GLNDVNAYWDGSSVSIGHNNANQWI---GAMDVVGHEFGHGIDQYTPG-GA 337
+ V G NA+W+GS + G + ++ G +DVVGHE H + YT G
Sbjct: 326 AAIRSTVHYGRGYNNAFWNGSQMVYGDGDGQTFLPFSGGIDVVGHELTHAVTDYTAGLVY 385

Query: 338 NNESG-LGEATGDIMGALTEAYTNEPAPYDDPDYTVGEKINLVG--NGPIRIMYNPSQTG 394
NESG + EA DI G L E Y N +PD+ +GE I G +R M +P++ G
Sbjct: 386 QNESGAINEAMSDIFGTLVEFYAN-----RNPDWEIGEDIYTPGVAGDALRSMSDPAKYG 440

Query: 395 DPNCYSSSIPNTEE----HAAAGPLNHWFYLLAEGSNPGGGKPSSPTCNSSSVTGVGIQS 450
DP+ YS T++ H +G +N YLL++G SVTG+G
Sbjct: 441 DPDHYSKRYTGTQDNGGVHTNSGIINKAAYLLSQGG----------VHYGVSVTGIGRDK 490

Query: 451 AGKVFYGGMLLK-TSGMTYKRYRTTTLTAARNL-DSGCVLFDRTKAAWDAIGV 501
GK+FY ++ T + + R + AA +L S + K A++A+GV
Sbjct: 491 MGKIFYRALVYYLTPTSNFSQLRAACVQAAADLYGSTSQEVNSVKQAFNAVGV 543


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_05870  PF06580  42     2e-06    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 41.8 bits (98), Expect = 2e-06
Identities = 16/82 (19%), Positives = 32/82 (39%), Gaps = 12/82 (14%)

Query: 292 VTNAAKH-----SRATRISVSVKGDGSGVSVSVRDNGTGGADPM----GSGLSGLRSRVD 342
V N KH + +I + D V++ V + G+ G+GL +R R+
Sbjct: 264 VENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGTGLQNVRERLQ 323

Query: 343 AL---DGRLYIDSPSGGPTTII 361
L + ++ + G ++
Sbjct: 324 MLYGTEAQIKLSEKQGKVNAMV 345


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
C5746_05875  HTHFIS  51     8e-10    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 51.0 bits (122), Expect = 8e-10
Identities = 25/95 (26%), Positives = 42/95 (44%), Gaps = 4/95 (4%)

Query: 2 LAEDSTLLREGLVRLLAEEGHEVLAAVGDGTALVRAVEADPPDLVVVDIRMPPTHTDEGL 61
+A+D +R L + L+ G++V + L R + A DLVV D+ MP +
Sbjct: 8 VADDDAAIRTVLNQALSRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDEN---AF 63

Query: 62 RAALEIRERWPLVGVLVLSQHVERNYAVRLLSANA 96
I++ P + VLV+S A++ A
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGA 98


126  C5746_06905  C5746_06940  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias   Product
C5746_06905  0       9        1.338552  TetR family transcriptional regulator
C5746_06910  1       10       1.689393  hypothetical protein
C5746_06915  0       10       2.061640  hypothetical protein
C5746_06920  -2      9        0.464306  uracil-DNA glycosylase
C5746_06925  -1      12       0.165789  peptide-binding protein
C5746_06930  0       11       1.221879  3-oxoacyl-ACP reductase
C5746_06940  1       10       1.065104  beta-ketoacyl-ACP reductase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_07080  HTHTETR  41     1e-06    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 41.2 bits (96), Expect = 1e-06
Identities = 11/65 (16%), Positives = 26/65 (40%)

Query: 3 TRTTGTSRADLIADAALALLAERGMRGLTHRAVDERAGLPQGSTSNYARTRQSLLEVTVQ 62
T+ I D AL L +++G+ + + + AG+ +G+ + + + L +
Sbjct: 5 TKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWE 64

Query: 63 RLAER 67

Sbjct: 65 LSESN 69


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_07100  PF07520  30     0.023    Virulence protein SrfB
		>PF07520#Virulence protein SrfB

Length = 1041

Score = 30.3 bits (68), Expect = 0.023
Identities = 10/30 (33%), Positives = 13/30 (43%)

Query: 322 LYSMVPKGIAGHTTSFFDAYGDPDKDKAKR 351
LYS + + G +F D G P D A
Sbjct: 561 LYSELTQKFDGRIDTFLDLKGQPRPDPAGG 590


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_07105  DHBDHDRGNASE  104    4e-29    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 104 bits (260), Expect = 4e-29
Identities = 71/250 (28%), Positives = 98/250 (39%), Gaps = 14/250 (5%)

Query: 7 GKVALITGASRGIGYGIAEALVARGDRVCITGRGEDALKEAVEQLGSDRVIGVAGKA--H 64
GK+A ITGA++GIG +A L ++G + + L++ V L ++ A A
Sbjct: 8 GKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVR 67

Query: 65 DEAHRAAAVERTMEAFGRVDFLINNAGTNPVFGPIAELDLNVARKVFETNVISALGFAQL 124
D A R G +D L+N AG G I L F N ++
Sbjct: 68 DSAAIDEITARIEREMGPIDILVNVAGVLRP-GLIHSLSDEEWEATFSVNSTGVFNASRS 126

Query: 125 TWKAWQKENGGAIVNIASVAGISASPFVGAYGMSKAAMVNLTLQLAHEFAPV-VRVNAIA 183
K G+IV + S + AY SKAA V T L E A +R N ++
Sbjct: 127 VSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVS 186

Query: 184 PAVVKTKFAKALYEGREAEAAA----------AYPLGRLGVPEDIGGAAAFLTSSQSDWI 233
P +T +L+ PL +L P DI A FL S Q+ I
Sbjct: 187 PGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHI 246

Query: 234 TGQTLVVDGG 243
T L VDGG
Sbjct: 247 TMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_07110  DHBDHDRGNASE  128    4e-38    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 128 bits (322), Expect = 4e-38
Identities = 90/254 (35%), Positives = 140/254 (55%), Gaps = 12/254 (4%)

Query: 5 EQRVAIVTGAARGIGAATAVRLAAEGRAVAVLDLDEAACKDTVEKITAAGGTALAVGCDV 64
E ++A +TGAA+GIG A A LA++G +A +D + + V + A A A DV
Sbjct: 7 EGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADV 66

Query: 65 SDSAQVEAAVARVAAEIGAPTILVNNAGVLRDNLLFKMSESDWDTVMNVHLKGAFLMAKA 124
DSA ++ AR+ E+G ILVN AGVLR L+ +S+ +W+ +V+ G F +++
Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRS 126

Query: 125 VQKHMVDAKFGRIVSLSSSSALGNR-GQANYSAVKAGLQGFTKTLAKELGKFGITANAVA 183
V K+M+D + G IV++ S+ A R A Y++ KA FTK L EL ++ I N V+
Sbjct: 127 VSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVS 186

Query: 184 PGFIVTEMTAQ------TAARVGMGF-EEFQAAAATQIPVQRVGRPEDIANAIAFFTGDE 236
PG T+M A +V G E F+ T IP++++ +P DIA+A+ F +
Sbjct: 187 PGSTETDMQWSLWADENGAEQVIKGSLETFK----TGIPLKKLAKPSDIADAVLFLVSGQ 242

Query: 237 AGFVSGQVMYVAGG 250
AG ++ + V GG
Sbjct: 243 AGHITMHNLCVDGG 256


127  C5746_07395  C5746_07450  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias   Product
C5746_07395  0       12       0.286886  acyl-CoA dehydrogenase
C5746_07400  -1      12       0.001936  chitinase
C5746_07405  1       16       1.856731  cytochrome P450
C5746_07410  1       18       2.257456  FAD-dependent oxidoreductase
C5746_07415  1       18       2.215763  ferredoxin
C5746_07420  -2      16       1.726575  sugar dehydrogenase
C5746_07425  -2      15       2.400158  glucoamylase
C5746_07430  -2      16       2.650136  RNA polymerase subunit sigma-24
C5746_07435  -1      13       1.726293  carboxymuconolactone decarboxylase family
C5746_07440  -2      13       1.181155  hypothetical protein
C5746_07445  -2      10       1.533653  hypothetical protein
C5746_07450  -2      9        3.047212  CbxX/CfqX
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_07530  STREPKINASE  30  0.016  Streptococcus streptokinase protein signature.
		>STREPKINASE#Streptococcus streptokinase protein signature.

Length = 440

Score = 30.1 bits (67), Expect = 0.016
Identities = 15/38 (39%), Positives = 20/38 (52%), Gaps = 1/38 (2%)

Query: 338 TIDAVQVLGGYGYTLDFP-VERLMREAKVLQIVEGTNQ 374
T+++VQ + G + LD P V V VEGTNQ
Sbjct: 20 TVNSVQAIAGPEWLLDRPSVNNSQLVVSVAGTVEGTNQ 57


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_07535  PF06776  30  0.013  Invasion associated locus B
		>PF06776#Invasion associated locus B

Length = 214

Score = 29.9 bits (67), Expect = 0.013
Identities = 11/65 (16%), Positives = 19/65 (29%), Gaps = 1/65 (1%)

Query: 7 PRARLRALAAAACTVALGATLLGAAGTASAGAALTATQAAAPAAAVAAPTAADDKVVGYF 66
P + + A + L + A A L A A + + A V
Sbjct: 22 PALKAIQMGPAELSPMLASCRRLARRN-GARLMLAGAMAIALSFGWSDRADAQGAVRSVH 80

Query: 67 TNWGV 71
+W +
Sbjct: 81 GDWQI 85


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_07555  DHBDHDRGNASE  135  8e-41  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 135 bits (342), Expect = 8e-41
Identities = 84/268 (31%), Positives = 127/268 (47%), Gaps = 17/268 (6%)

Query: 12 VSAQLLRGQKALVTGANSGIGMATAIALGRAGADVVVNYVAGADEAEKVVAQIKDFGVRA 71
++A+ + G+ A +TGA GIG A A L GA + ++ EKVV+ +K A
Sbjct: 1 MNAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAA-VDYNPEKLEKVVSSLKAEARHA 59

Query: 72 YAHEADVSDEAQVVAMVARMVEAFGTIDIMVANAGLQRDAAATEMTLAQWQKVIDVNLTG 131
A ADV D A + + AR+ G IDI+V AG+ R ++ +W+ VN TG
Sbjct: 60 EAFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTG 119

Query: 132 QFLCAREAAKEFLRRGVVEEVSRAAGKIICMSSVHQIIPWSGHVNYASSKGGVAMLMQTL 191
F +R +K + R +G I+ + S +P + YASSK M + L
Sbjct: 120 VFNASRSVSKYMMD--------RRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCL 171

Query: 192 AQELAPQRIRVNAVAPGAIRTPINRDAWSSPEAEADLLR--------LIPYRRVGDPEDI 243
ELA IR N V+PG+ T + W+ +++ IP +++ P DI
Sbjct: 172 GLELAEYNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDI 231

Query: 244 ANAVVALASDLLDYVVGATLYVDGGMTL 271
A+AV+ L S ++ L VDGG TL
Sbjct: 232 ADAVLFLVSGQAGHITMHNLCVDGGATL 259


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_07560  56KDTSANTIGN  31  0.012  Rickettsia 56kDa type-specific antigen protein sign...
		>56KDTSANTIGN#Rickettsia 56kDa type-specific antigen protein signature.

Length = 533

Score = 31.5 bits (71), Expect = 0.012
Identities = 16/60 (26%), Positives = 23/60 (38%), Gaps = 7/60 (11%)

Query: 325 ELPEEILDHFEGYRGSFPVRIGNGAADQLQLDIYGEAVYAVAQGGEIAQQTTYQGWRALA 384
+ EE+ D F+GY I N +Q+ L+ QG QQ A+A
Sbjct: 309 DTLEELRDSFDGY-------INNAFVNQIHLNFVMPPQAQQQQGQGQQQQAQATAQEAVA 361


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_07585  FIMBRILLIN  29  0.011  Porphyromonas gingivalis: fimbrillin protein signature.
		>FIMBRILLIN#Porphyromonas gingivalis: fimbrillin protein signature.

Length = 348

Score = 29.2 bits (65), Expect = 0.011
Identities = 12/39 (30%), Positives = 17/39 (43%)

Query: 83 AHYEPYDWTARTFAAPRSLEPAPVVLIEGVGAGRRALRP 121
A+Y DW R + P + P ++E A LRP
Sbjct: 198 ANYTHVDWLGRDYTEPSNNAPQGFYVLESTYAQNAGLRP 236


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_07590  SYCDCHAPRONE  31  0.006  Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 31.4 bits (71), Expect = 0.006
Identities = 12/41 (29%), Positives = 18/41 (43%)

Query: 22 RGVDAYTMGAYPQAEEEFRAAVRIDPGMADGWLGLHALRAD 62
+ Y G Y A + F+A +D + +LGL A R
Sbjct: 42 LAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQA 82


128  C5746_09695  C5746_09735  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias    %GCBias  Product
C5746_09695      -1       19   3.400576  iron ABC transporter ATP-binding protein
C5746_09700      -1       19   2.970358  hypothetical protein
C5746_09705       0       16   2.713402  chaplin
C5746_09715       0       12   1.832990  DNA-binding response regulator
C5746_09720       0       20   1.193794  diguanylate cyclase
C5746_09725       0       14   0.893531  IS110 family transposase
C5746_09730       5       23   1.021578  lytic transglycosylase
C5746_09735       2       24  -0.050021  ABC transporter ATP-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_09815  PF07520  33  0.002  Virulence protein SrfB
		>PF07520#Virulence protein SrfB

Length = 1041

Score = 32.6 bits (74), Expect = 0.002
Identities = 21/97 (21%), Positives = 35/97 (36%), Gaps = 8/97 (8%)

Query: 41 AGKTTLLNIASSYLFPSTGTAKILGEQLGGVGTDVFELRPRIG--IAGIAMAEKLPRRQT 98
AG + + S+ + P + Q GG +R G I G RRQ
Sbjct: 635 AGDDLVHRVISAIVLPRLQDSI---AQAGGQFVAER-MRELFGGDIGGQEQQTVQRRRQF 690

Query: 99 VLQTVLTAAYGMTATWHENYEAVDEERARAFLDRLGM 135
++ ++ A + + E+ E D D LG+
Sbjct: 691 SIRVLVPLAEAILSAC-EDAEEADRIDIP-VADVLGL 725


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_09830  HTHFIS  72  7e-17  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 71.8 bits (176), Expect = 7e-17
Identities = 27/115 (23%), Positives = 46/115 (40%), Gaps = 2/115 (1%)

Query: 6 IRVLLVDDHQVVRRGLRTFLEIQDDIEVVGEASDGAEGVARTEELRPDVVLMDIKMPGTD 65
+L+ DD +R L L V S+ A D+V+ D+ MP +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAG--YDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 66 GIEALRKLRELENPAKVLIVTSFTEQRTVVPALRAGASGYVYKDVDPDALAGAIR 120
+ L ++++ VL++++ T + A GA Y+ K D L G I
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_09845  IGASERPTASE  36  2e-04  IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 35.8 bits (82), Expect = 2e-04
Identities = 22/121 (18%), Positives = 39/121 (32%), Gaps = 4/121 (3%)

Query: 37 VPANAEGKPEAQTPVSAAPVVLASVAGTPQV----KAVQASIIEQHSTAEQLVKAADLAR 92
VP+N E P T V K ++ + A +
Sbjct: 1010 VPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVA 1069

Query: 93 AEKAAAVKAKAEAAAKAKAAAAAEVKARAEAKARAAAQVKAKAAAERTETQAASRSEART 152
E + VKA + A++ + + E K A + + KA E +TQ + ++
Sbjct: 1070 KEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQV 1129

Query: 153 P 153

Sbjct: 1130 S 1130


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_09850  PERTACTIN  33  0.007  Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 32.8 bits (74), Expect = 0.007
Identities = 46/162 (28%), Positives = 63/162 (38%), Gaps = 16/162 (9%)

Query: 34 DLTIDDARVSWRHATISWGGHSWFIEDHGSTNGTYVQGRRIHQLEIGPGSAVHLGNATDG 93
DL + D V R A+ G H ++ + GS + G + ++ GSA A
Sbjct: 487 DLGLSDKLVVMRDAS---GQHRLWVRNSGSEPAS---GNTMLLVQTPRGSAATFTLANKD 540

Query: 94 PRLSIG------AAAGADVYSGQGAGAQQAPVQQPQQGGAGRQAPVPPQQQQPQQGWQQA 147
++ IG AA G +S GA A AP PQ G P P Q PQ
Sbjct: 541 GKVDIGTYRYRLAANGNGQWSLVGAKAPPAPKPAPQPG----PQPGPQPPQPPQPPQPPQ 596

Query: 148 PQVQPQQQPAQPQQPQVPHQQGMARTPGAGGPGGVAGAPPVY 189
P PQ+QP P ++ A A GGV A ++
Sbjct: 597 PPQPPQRQPEAPAPQPPAGRELSAAANAAVNTGGVGLASTLW 638


129  C5746_09915  C5746_09975  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias    %GCBias  Product
C5746_09915      -1       20   0.191030  alcohol dehydrogenase
C5746_09920      -2       20   0.244561  acetoacetate decarboxylase
C5746_09925       0       12   0.245874  TetR family transcriptional regulator
C5746_09930       2       14  -0.968919  DNA polymerase III subunit epsilon
C5746_09935       1       15  -1.547677  hypothetical protein
C5746_09940      -2       14  -0.824296  hypothetical protein
C5746_09945      -3       13   0.528140  glucoamylase
C5746_09950      -2       14   0.890111  oxidoreductase
C5746_09955      -2       14   0.948344  alkaline shock response membrane anchor protein
C5746_09960       0       12   1.042109  hypothetical protein
C5746_09970       0       14   1.831052  hypothetical protein
C5746_09975       0       14   1.017899  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_10010  DHBDHDRGNASE  64  3e-14  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 64.3 bits (156), Expect = 3e-14
Identities = 49/169 (28%), Positives = 80/169 (47%), Gaps = 4/169 (2%)

Query: 5 EGQVAVVTGAASGIGLAMARRFAAEGLKVVLADVEEGALDKAAGELRRDGAQVLARAVDV 64
EG++A +TGAA GIG A+AR A++G + D L+K L+ + A DV
Sbjct: 7 EGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADV 66

Query: 65 GERESVQALAEAAYDTFGAVHVLCNNAGVGSGAEGRMWEHEPNDWKWAFAVNVWGVFHGI 124
+ ++ + G + +L N AGV G + +W+ F+VN GVF+
Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLR--PGLIHSLSDEEWEATFSVNSTGVFNAS 124

Query: 125 QAFVPRMIAGGGPGHVVNTSSGDGGIAPLPTASVYAVTKSAVVTMTESL 173
++ M+ G +V S G+ P + + YA +K+A V T+ L
Sbjct: 125 RSVSKYMMDRRS-GSIVTVGSNPAGV-PRTSMAAYASSKAAAVMFTKCL 171


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_10020  TETREPRESSOR  62  7e-14  Tetracycline repressor protein signature.
		>TETREPRESSOR#Tetracycline repressor protein signature.

Length = 218

Score = 61.9 bits (150), Expect = 7e-14
Identities = 51/207 (24%), Positives = 83/207 (40%), Gaps = 15/207 (7%)

Query: 1 MARSSLTREQVLDTAGALVKRHGPQALTMRALAAELGTAVTSIYWHVGNRESLLDALVER 60
MAR L RE V+D A L+ G LT R LA +LG ++YWHV N+ +LLDAL
Sbjct: 1 MAR--LNRESVIDAALELLNETGIDGLTTRKLAQKLGIEQPTLYWHVKNKRALLDALAVE 58

Query: 61 TVQEMGT--LRPAGRTPAARIVSVARGLHRELRARPHLIAMVHERGLTERMFLPAQQALV 118
+ L AG + + + + A R L + E+ + + L
Sbjct: 59 ILARHHDYSLPAAGESWQSFLRNNAMSFRRALLRYRDGAKVHLGTRPDEKQYDTVETQL- 117

Query: 119 HEVHAAGLRGARAAAAVRAVQFQTVGFLLVERNRERSPVQSPGEGDLWTASTADDDPALA 178
+ G A+ AV T+G +L ++ + P + ++ P L
Sbjct: 118 RFMTENGFSLRDGLYAISAVSHFTLGAVLEQQEHTAALTDRPA-------APDENLPPLL 170

Query: 179 RA---LARPADPDRLFADSVRALVEGL 202
R + D ++ F + +L+ G
Sbjct: 171 REALQIMDSDDGEQAFLHGLESLIRGF 197


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_10025  PF07675  30  0.016  Cleaved Adhesin
		>PF07675#Cleaved Adhesin

Length = 1358

Score = 30.1 bits (67), Expect = 0.016
Identities = 16/68 (23%), Positives = 24/68 (35%), Gaps = 4/68 (5%)

Query: 9 TAAPWPTAYPQGYAVVDVETTGLARDDRIVSAAVYRLDAQGNVE----DHWYTLVNPERD 64
T + + E A D +V+ + QG V + Y + NPE
Sbjct: 422 TGPLFTGTASSNLYSANFEYLTPANADPVVTTQNIIVTGQGEVVIPGGVYDYCITNPEPA 481

Query: 65 PGPVWIHG 72
G +WI G
Sbjct: 482 SGKMWIAG 489


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_10035  56KDTSANTIGN  32  0.003  Rickettsia 56kDa type-specific antigen protein sign...
		>56KDTSANTIGN#Rickettsia 56kDa type-specific antigen protein signature.

Length = 533

Score = 31.9 bits (72), Expect = 0.003
Identities = 23/99 (23%), Positives = 39/99 (39%), Gaps = 5/99 (5%)

Query: 133 DVPAVPKGEVTVTGRLKADETTSASGIKDISDLPDRQVMLINSAQEAERLSRPVLGGYIE 192
D+P +P+ + D+ +A+ I + + M+ + + PVL I
Sbjct: 154 DIPNIPQAQRQAAQPPLNDQKRAAARIAWLKNCAGIDYMVKDPNNPGHMMVNPVLLN-IP 212

Query: 193 QTAPEPVGGSPELIEAPDDSSIGPHMAYAVQWWLFAAGV 231
Q P PVG P+ P + +I H QW G+
Sbjct: 213 QGNPNPVGQPPQRANQPANFAIHNHE----QWRSLVVGL 247


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_10045  DHBDHDRGNASE  96  5e-26  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 96.3 bits (239), Expect = 5e-26
Identities = 65/257 (25%), Positives = 101/257 (39%), Gaps = 11/257 (4%)

Query: 4 GLKDRVYIVTGASRGLGNATARALAEDGARVI---ITGRDEKSVEAAAAELGPDAVGLVA 60
G++ ++ +TGA++G+G A AR LA GA + + V ++ A A
Sbjct: 5 GIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPA 64

Query: 61 DNADPASAQRLVDSAHERFGRLDGILISVGGPAPGFVADNTDEQWQSAFESVFLGAVRLA 120
D D A+ + G +D ++ G PG + +DE+W++ F G +
Sbjct: 65 DVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNAS 124

Query: 121 RAAAATL--GEGGVIGFVLSGSVYEPIAGLTISNGLRPGLAGFAKSLADELGPRGIRVVG 178
R+ + + G I V S P + + F K L EL IR
Sbjct: 125 RSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNI 184

Query: 179 VLPGRISTDRMRELDARSGDAEASRTTNEA-----GIPLRRYGTPEEFGRTAAFLLSPAA 233
V PG TD L A + GIPL++ P + FL+S A
Sbjct: 185 VSPGSTETDMQWSLWA-DENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQA 243

Query: 234 SYLTGVMLPVDGGSRHG 250
++T L VDGG+ G
Sbjct: 244 GHITMHNLCVDGGATLG 260


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_10050  FLGMRINGFLIF  27  0.045  Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 27.2 bits (60), Expect = 0.045
Identities = 20/58 (34%), Positives = 26/58 (44%), Gaps = 8/58 (13%)

Query: 108 RGRALEGVLADEAGTLDGVARAQV--------VLTGRRSSPRARIRLLMEPHAAPDEA 157
RALEG LA TL V A+V + + SP A + + +EP A DE
Sbjct: 131 YQRALEGELARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRALDEG 188


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_10065  TCRTETA  25  0.024  Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 24.8 bits (54), Expect = 0.024
Identities = 17/39 (43%), Positives = 19/39 (48%), Gaps = 1/39 (2%)

Query: 6 AGMVAGMALG-FAGYFGGFGAFLLVAALGAIGFVVGRFL 43
GMVAG LG G F F AAL + F+ G FL
Sbjct: 142 FGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFL 180


130  C5746_11535  C5746_11570  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias    %GCBias  Product
C5746_11535      -1       17   2.812317  bacterioferritin
C5746_11540      -1       14   2.916256  [Fe-S]-binding protein
C5746_11545       0       15   2.975397  3-deoxy-7-phosphoheptulonate synthase class II
C5746_11550       0       15   2.262024  phenazine-specific anthranilate synthase
C5746_11555       0       14   1.923297  hydroxyacid dehydrogenase
C5746_11560      -1       13   0.849343  6-phosphofructokinase
C5746_11565      -1       13  -0.120378  DNA-binding response regulator
C5746_11570      -2       15  -0.505228  sensor histidine kinase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_11590  HELNAPAPROT  27  0.023  Helicobacter neutrophil-activating protein A family ...
		>HELNAPAPROT#Helicobacter neutrophil-activating protein A family signature.

Length = 153

Score = 27.1 bits (60), Expect = 0.023
Identities = 30/141 (21%), Positives = 57/141 (40%), Gaps = 22/141 (15%)

Query: 6 EVLEFLNEQLTAELTAINQYFLHAKMQDNFGWTKLAKYTRS--ESFDEM-----KHAEIL 58
V LN QL N + L++K+ F W + + E F+E+ + + +
Sbjct: 12 LVENSLNTQL------SNWFLLYSKLH-RFHWYVKGPHFFTLHEKFEELYDHAAETVDTI 64

Query: 59 TDRILFLDGLPNY---QRLFHVRV-----GQTVTEMFQADRQVEVEAIDRLKRGIELMRG 110
+R+L + G P + H + + +EM QA + K I L
Sbjct: 65 AERLLAIGGQPVATVKEYTEHASITDGGNETSASEMVQALVNDYKQISSESKFVIGLAEE 124

Query: 111 KGDITSANIFESILEDEEQHI 131
D +A++F ++E+ E+ +
Sbjct: 125 NQDNATADLFVGLIEEVEKQV 145


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_11615  FbpA_PF05833  32  0.004  Fibronectin-binding protein
		>FbpA_PF05833#Fibronectin-binding protein

Length = 577

Score = 31.8 bits (72), Expect = 0.004
Identities = 18/76 (23%), Positives = 31/76 (40%), Gaps = 19/76 (25%)

Query: 139 GFDTAVGIATEAIDRLHTTAESHMRVLVVEVMGRHAGWIALHSGLAGGANVILIPEQRFD 198
D V I E+ D L + + L++E+MGRH +N+ LI ++
Sbjct: 95 NQDRIVVIDFESTDEL---GFNSIYSLIIEIMGRH-------------SNMTLIRKRDNI 138

Query: 199 IGQVCAWVT---SRFR 211
I +T + +R
Sbjct: 139 IMDSIKHITPDINTYR 154


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_11620  HTHFIS  75  9e-18  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 74.9 bits (184), Expect = 9e-18
Identities = 29/114 (25%), Positives = 49/114 (42%), Gaps = 1/114 (0%)

Query: 14 IRVMVVDDHPMWRDAVARDLSESGFEVVATAGDGPQAVRRAKAVTPDVLVLDLNLPGMPG 73
++V DD R + + LS +G++V + R A D++V D+ +P
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVR-ITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 74 VQVCKELVGSHPGLRVLVLSASGEHADVLEAVKSGATGYLLKSASTQELTEAVR 127
+ + + P L VLV+SA ++A + GA YL K EL +
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_11625  PHPHTRNFRASE  29  0.028  Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 29.4 bits (66), Expect = 0.028
Identities = 17/101 (16%), Positives = 35/101 (34%), Gaps = 4/101 (3%)

Query: 161 IAIGYVVEVARASERTLARALEIEAATRERERLARDIHDSVLQVLAMVQRRGTALGGEAA 220
+AI + I + E E+L + S ++ A+ + ++G + A
Sbjct: 14 VAIAKAFIHLEPNVD--IEKTSITDVSTEIEKLTAALEKSKEELRAIKDQTEASMGADKA 71

Query: 221 ELGRMAGEQEVALRTLVSSGLVPTTRVSEDAAEGAVVRSVE 261
E+ A V + G+ + AE A+ +
Sbjct: 72 EI--FAAHLLVLDDPELVDGIKGKIENEQMNAEYALKEVSD 110


131  C5746_11645  C5746_11670  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias    %GCBias  Product
C5746_11645       1       15   2.127036  hypothetical protein
C5746_11650      -1       20   1.946148  hypothetical protein
C5746_11655      -1       17   0.792157  hypothetical protein
C5746_11660      -2       15   1.255631  hypothetical protein
C5746_11670      -2       14   1.176879  RNA-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_11690  PF06580  29  0.040  Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 28.7 bits (64), Expect = 0.040
Identities = 19/118 (16%), Positives = 37/118 (31%), Gaps = 5/118 (4%)

Query: 237 FGWQGRVELHYGSMEFLGPHVPLVSTLALGLSVVAFGWLLVWRLRACDFRVNTPADAAFT 296
GW +G G P + ++ +++ G +L R+ R
Sbjct: 18 IGWGVYTLTGFGFASLYGS--PKLHSMIFNIAISLMGLVLTHAYRSFIKRQGWLKLNMGQ 75

Query: 297 AVLLFTTTSRVISPQYMLWLVGLAAVCLVFRDSRMGLPAVLVLLATGVTQLEFPLGFV 354
+L VI M+W V ++ + A + LA + + F+
Sbjct: 76 IILRVLPACVVIG---MVWFVANTSIWRLLAFINTKPVAFTLPLALSIIFNVVVVTFM 130


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_11700  GPOSANCHOR  35  5e-04  Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 34.7 bits (79), Expect = 5e-04
Identities = 28/180 (15%), Positives = 55/180 (30%), Gaps = 11/180 (6%)

Query: 22 SAAAATAAATLGAATANADPQDAPRTAKARVDRLYSEAERATEQYNKAGENADRLRGELK 81
+A+ A A LGA + + +++ E+ E+ +K + L+ +
Sbjct: 19 TASVAVALTVLGAGLVVNTNEVSAVATRSQT----DTLEKVQERADKFEIENNTLKLKNS 74

Query: 82 RTQDRAARDQERLNRMRTALGSAVSAQYRSGGLDPSLALLLSSDPDTYLDRAATLDRLTE 141
++ + L +S + +S R A L++ E
Sbjct: 75 DLSFNNKALKDHND----ELTEELSNAKEKLRKNDKSLSEKASKIQELEARKADLEKALE 130

Query: 142 HRATELRQLQQAQRSLAQTRAEAARALADLERNRAAVTRHKR---TVERKLARAKQLLDA 198
++L +A A ADLE+ + L K L+A
Sbjct: 131 GAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEA 190


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_11705  GPOSANCHOR  38  5e-05  Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 37.7 bits (87), Expect = 5e-05
Identities = 34/192 (17%), Positives = 70/192 (36%), Gaps = 5/192 (2%)

Query: 31 QAAHADPKPSKGEVKAEVEKLGHEAGEANEQYYGAKEKQQKLEKEVGALQDKVARGQQEI 90
A + K + A L A K + LE E AL+ + A ++ +
Sbjct: 140 SAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKAL 199

Query: 91 NTLRNGLGSLAAAQYRSGGIDPSVQLFLSS---NPDSFLDEASALDQLTAKQTETLEKIQ 147
N + +A ++ + + ++ ++A ++
Sbjct: 200 EGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALE 259

Query: 148 EKQRVLAQERKEAQDKLGDLADVRKTLGAKKKKLQDKLAEAQRLLNTMTEAERTKMRKDE 207
+Q L + + A + + KTL A+K L+ + A+ + + + A R +R+D
Sbjct: 260 ARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEH-QSQVLNANRQSLRRDL 318

Query: 208 QRASRSASDRVE 219
ASR A ++E
Sbjct: 319 D-ASREAKKQLE 329



Score = 35.0 bits (80), Expect = 4e-04
Identities = 29/182 (15%), Positives = 60/182 (32%), Gaps = 3/182 (1%)

Query: 43 EVKAEVEKLGHEAGEANEQYYGAKEKQQKLEKEVGALQDKVARGQQEINTLRNGLGSLAA 102
E++A L A K + LE E AL + A ++ + N + +A
Sbjct: 117 ELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSA 176

Query: 103 AQYRSGGIDPSVQLFLSSNPDSFLDEASALDQLTAKQT---ETLEKIQEKQRVLAQERKE 159
+++ + + + +AK + ++ L + +
Sbjct: 177 KIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEG 236

Query: 160 AQDKLGDLADVRKTLGAKKKKLQDKLAEAQRLLNTMTEAERTKMRKDEQRASRSASDRVE 219
A + + KTL A+K L+ + AE ++ L K + + A+ E
Sbjct: 237 AMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAE 296

Query: 220 LG 221

Sbjct: 297 KA 298



Score = 32.0 bits (72), Expect = 0.004
Identities = 30/200 (15%), Positives = 63/200 (31%), Gaps = 17/200 (8%)

Query: 30 SQAAHADPKPSKGEVKAEVEKLGHEAGEANEQYYGAKEKQQKLEKEVGALQDKVARGQQE 89
+A A+ + + + + ++ LEK + + +
Sbjct: 188 LEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAK 247

Query: 90 INTLRNGLGSLAAAQYRSGGIDPSVQLFLSSNPDSFLDEASALDQLTAKQTETLEKIQEK 149
I TL +L A Q F + + L+ A ++ +
Sbjct: 248 IKTLEAEKAALEARQAELEKALEGAMNFS----TADSAKIKTLEAEKAALEAEKADLEHQ 303

Query: 150 QRVLAQERKEAQDKLGDLADVRKTLGAKKKKLQDKLA-------------EAQRLLNTMT 196
+VL R+ + L + +K L A+ +KL+++ +A R
Sbjct: 304 SQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQL 363

Query: 197 EAERTKMRKDEQRASRSASD 216
EAE K+ + + + S
Sbjct: 364 EAEHQKLEEQNKISEASRQS 383


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_11710  GPOSANCHOR  38  7e-05  Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 38.1 bits (88), Expect = 7e-05
Identities = 30/123 (24%), Positives = 54/123 (43%), Gaps = 10/123 (8%)

Query: 140 QRADAERADEESRRELERLRDELAQARSQTKS---ETERLRGELDAARKEADSLQRKLRS 196
+ E L E A Q++ + LR +LDA+R+ L+ + +
Sbjct: 275 FSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQK 334

Query: 197 VANEVKRGEAALRRSRAETDAVRSEAAAQVSAAESESRRLKARLGEAEASVEAGRR---A 253
+ + K EA+ + R + DA R E+E ++L+ + +EAS ++ RR A
Sbjct: 335 LEEQNKISEASRQSLRRDLDASREAKK----QLEAEHQKLEEQNKISEASRQSLRRDLDA 390

Query: 254 ARE 256
+RE
Sbjct: 391 SRE 393



Score = 32.7 bits (74), Expect = 0.003
Identities = 34/122 (27%), Positives = 55/122 (45%), Gaps = 4/122 (3%)

Query: 138 EVQRADAERADEESRRELERLRDELAQARSQTK-SETER--LRGELDAARKEADSLQRKL 194
R R + SR ++L E + Q K SE R LR +LDA+R+ L+ +
Sbjct: 308 NANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQLEAEH 367

Query: 195 RSVANEVKRGEAALRRSRAETDAVRSEAAAQVSAAESESRRLKARLGEAEASVEAGRRAA 254
+ + + K EA+ + R + DA R EA QV A E+ A L + +E ++
Sbjct: 368 QKLEEQNKISEASRQSLRRDLDASR-EAKKQVEKALEEANSKLAALEKLNKELEESKKLT 426

Query: 255 RE 256
+
Sbjct: 427 EK 428



Score = 32.3 bits (73), Expect = 0.004
Identities = 23/116 (19%), Positives = 51/116 (43%), Gaps = 3/116 (2%)

Query: 137 EEVQRADAERADEESRRELERLRDELAQARSQTKSETERLRGELDAARKEADSLQRKLRS 196
E+ + +R+ L R D +A+ Q ++E ++L + + SL+R L +
Sbjct: 296 EKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDA 355

Query: 197 VANEVKRGEAALRRSRAE---TDAVRSEAAAQVSAAESESRRLKARLGEAEASVEA 249
K+ EA ++ + ++A R + A+ ++++ L EA + + A
Sbjct: 356 SREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQVEKALEEANSKLAA 411


132  C5746_11795  C5746_11830  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias    %GCBias  Product
C5746_11795       0       18   1.035628  hydrogenase expression protein
C5746_11800       0       11   1.625718  DNA-binding response regulator
C5746_11805       1        8   0.645359  two-component sensor histidine kinase
C5746_11810       0        8   0.777918  hypothetical protein
C5746_11815       0        8   1.235967  phage shock protein A
C5746_11820       0        8   1.408202  DUF3043 domain-containing protein
C5746_11825       1        8   1.003302  SAM-dependent methyltransferase
C5746_11830       3       14  -0.209926  signal protein PDZ
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_11830  ACRIFLAVINRP  606  0.0  Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 606 bits (1563), Expect = 0.0
Identities = 231/1043 (22%), Positives = 442/1043 (42%), Gaps = 50/1043 (4%)

Query: 4 LSRFSLAQRALIGLISIVALVFGAIAIPQLKQQLLPTIELPMVSVLAPYQGASPDVVEKQ 63
++ F + + +++I+ ++ GA+AI QL PTI P VSV A Y GA V+
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 64 VVEPLENSIKAVDGVTGTTSTA-SEGNAVIMASFDFGSEGTKQLVADIQQAVNRARAQLP 122
V + +E ++ +D + +ST+ S G+ I +F G++ V +Q + A LP
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQV-QVQNKLQLATPLLP 119

Query: 123 DDV-DPQVIAGSTDDIPAVVLAVTSDK---DQQALADQLDRTVVPALEDISGVGQVTVDG 178
+V + + +V SD Q ++D + V L ++GVG V + G
Sbjct: 120 QEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFG 179

Query: 179 VQDLQVSITPDDKKLAGAGLNAGTLSQALQAGGATVPAGSF------SESGKSRTVQVGG 232
+ I D L L + L+ + AG + ++
Sbjct: 180 -AQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQT 238

Query: 233 AFTSLKQIEDLRVTGQDPATGKPGEPVRVGDVATVKQEPSTAVSITRTNGKPSLAVMATM 292
F + ++ + + G VR+ DVA V+ I R NGKP+ + +
Sbjct: 239 RFKNPEEFGKVTL-----RVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKL 293

Query: 293 DKDGSAVAISDAVKDKLPDLRKDLGASAELTVVSDQGPAVSKAISGLTTEGALGLLFAVI 352
+A+ + A+K KL +L+ ++ D P V +I + ++ +
Sbjct: 294 ATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFL 353

Query: 353 VILVFLASLRSTLVTAVSIPLSVVLALIVLWTRDLSLNMLTLGALTIAIGRVVDDSIVVL 412
V+ +FL ++R+TL+ +++P+ ++ +L S+N LT+ + +AIG +VDD+IVV+
Sbjct: 354 VMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVV 413

Query: 413 ENIKRHL-GYGEERQSAIITAVKEVAGAVTSSTLTTVAVFLPIGLVGGMVGELFGSFSLT 471
EN++R + + A ++ ++ GA+ + AVF+P+ GG G ++ FS+T
Sbjct: 414 ENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSIT 473

Query: 472 VTAALLASLLVSLTVVPVLSYWFLRAPKGATENPDEARREAEEKEARSRLQKLYVPVLRF 531
+ +A+ S+LV+L + P L L+ P A + ++ Y +
Sbjct: 474 IVSAMALSVLVALILTPALCATLLK-PVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGK 532

Query: 532 ATRRRITSIVIAIVVFLATFGMAPLLKTNFFDQGEQEVLSIKQELAPGTSLAAADKAARK 591
++I ++ + L ++F + +Q V +L G + K +
Sbjct: 533 ILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQ 592

Query: 592 VEKVLTEDEGVKDYQVTVGSSGFMAAFGGGTGANQASYQVTLKD----SGDFDATQKRID 647
V ++E V +GF G N V+LK +GD ++ + I
Sbjct: 593 VTDYYLKNEKANVESV-FTVNGFSF---SGQAQNAGMAFVSLKPWEERNGDENSAEAVIH 648

Query: 648 TALGKLDGIGDTTIAA---------GDGFGSQDLSVVVKAADADVLKKASEEVRAEVAE- 697
A +L I D + G G + D L +A ++ A+
Sbjct: 649 RAKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQH 708

Query: 698 LKDVTDVQSDLAQSVPRISVKAN-AKAADAGFNQATLGAAVAGAVRGTPSGKAIMDDTER 756
+ V+ + + + ++ + KA G + + + ++ A+ GT I +
Sbjct: 709 PASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVK 768

Query: 757 DVVIKSAHPATTMAE-LKNLSL-----GPVELGRIADVELVPGPVSMTRIDGQRAATITA 810
+ +++ + E + L + V V G + R +G + I
Sbjct: 769 KLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQG 828

Query: 811 KPT-GDNTGAVSSSLQTKINALKLPDGATATIGGVSEDQNDAFLKLGLAMLAAIAIVFML 869
+ G ++G + ++ + KLP G G+S + + + + + +VF+
Sbjct: 829 EAAPGTSSGDAMALMENLAS--KLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLC 886

Query: 870 LVATFKSLIQPLILLVSIPFAATGAIGLLVITGTPMGVPAMIGMLMLIGIVVTNAIVLID 929
L A ++S P+ +++ +P G + + V M+G+L IG+ NAI++++
Sbjct: 887 LAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVE 946

Query: 930 LINQ-YRAQGMGVVEAVIEGGRHRLRPILMTALATIFALLPMALGVTGEGGFISQPLAVV 988
+G GVVEA + R RLRPILMT+LA I +LP+A+ G G + +
Sbjct: 947 FAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAIS-NGAGSGAQNAVGIG 1005

Query: 989 VIGGLVTSTLLTLLLVPTLYAMV 1011
V+GG+V++TLL + VP + ++
Sbjct: 1006 VMGGMVSATLLAIFFVPVFFVVI 1028



Score = 112 bits (282), Expect = 3e-27
Identities = 90/521 (17%), Positives = 185/521 (35%), Gaps = 47/521 (9%)

Query: 8 SLAQRALIGLISIVALVFGAIAIPQLKQQLLPTIE--LPMVSVLAPYQGASPDVVEK--- 62
L LI + + + +L LP + + + + P GA+ + +K
Sbjct: 533 ILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLP-AGATQERTQKVLD 591

Query: 63 QVVEPLENS----IKAVDGVTG-TTSTASEGNAVIMASFDFGSE--GTKQLVADIQQAVN 115
QV + + +++V V G + S ++ + S E G + +
Sbjct: 592 QVTDYYLKNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAK 651

Query: 116 RARAQLPDDV-----DPQVIAGSTDDIPAVVLAVTSDKDQQALADQLDRTVVPALEDISG 170
++ D P ++ T L + AL ++ + A + +
Sbjct: 652 MELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPAS 711

Query: 171 VGQVTVDGVQD-LQVSITPDDKKLAGAGLNAGTLSQALQAGGATVPAGSFSESGKSRTVQ 229
+ V +G++D Q + D +K G++ ++Q + F + G+ + +
Sbjct: 712 LVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLY 771

Query: 230 VGGAFTSLKQIEDLRVTGQDPATGKPGEPVRVGDVATVKQEPSTAVSITRTNGKPSLAVM 289
V ED+ + GE V T + + R NG PS+ +
Sbjct: 772 VQADAKFRMLPEDV---DKLYVRSANGEMVPFSAFTTSHWVYG-SPRLERYNGLPSMEIQ 827

Query: 290 ATMDKD---GSAVAISDAVKDKLPDLRKDLGASAELTVVSDQGPAVSKAISGLTTEGALG 346
G A+A+ + + KLP G + T +S Q +
Sbjct: 828 GEAAPGTSSGDAMALMENLASKLPA-----GIGYDWTGMSYQERL---------SGNQAP 873

Query: 347 LLFAVIVILVFLA------SLRSTLVTAVSIPLSVVLALIVLWTRDLSLNMLTLGALTIA 400
L A+ ++VFL S + + +PL +V L+ + ++ + L
Sbjct: 874 ALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTT 933

Query: 401 IGRVVDDSIVVLENIK-RHLGYGEERQSAIITAVKEVAGAVTSSTLTTVAVFLPIGLVGG 459
IG ++I+++E K G+ A + AV+ + ++L + LP+ + G
Sbjct: 934 IGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNG 993

Query: 460 MVGELFGSFSLTVTAALLASLLVSLTVVPVLSYWFLRAPKG 500
+ + V ++++ L+++ VPV R KG
Sbjct: 994 AGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVIRRCFKG 1034


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_11835  HTHFIS  59  3e-12  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 58.7 bits (142), Expect = 3e-12
Identities = 24/119 (20%), Positives = 49/119 (41%), Gaps = 4/119 (3%)

Query: 5 IKVLLVDDQALLRSAFRVLVDSEADMEVVGEAADGAQAVELARSTRADVVLMDIRMPGTD 64
+L+ DD A +R+ + S A +V ++ A + D+V+ D+ MP +
Sbjct: 4 ATILVADDDAAIRTVLNQAL-SRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 65 GLTATRMVSADPELADVRVVMLTTFEVDEYVVQSLRAGASGFLGKGAEPEELLNAIRVA 123
+ D+ V++++ +++ GA +L K + EL+ I A
Sbjct: 62 AFDLLPRIKK--ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_11840  PF06580  41  4e-06  Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 41.4 bits (97), Expect = 4e-06
Identities = 53/370 (14%), Positives = 106/370 (28%), Gaps = 94/370 (25%)

Query: 58 SGVVLMVLGAVVLI----WRRRKPMAVLAATAGLTVVELVRSDPPAPVVMSTVIALYTVA 113
+ + ++G V+ + +R+ L + V + VI +
Sbjct: 44 FNIAISLMGLVLTHAYRSFIKRQGWLKLNMGQIILRV----------LPACVVIGMVWFV 93

Query: 114 ARTDRPTTWRVGLLTMAALTAAAMSFGAAPWYSQENFGVFAWTGMAGAVGDAVRSRRAFV 173
A T + WR+ A + + F W+
Sbjct: 94 ANT---SIWRLLAFINTKPVAFTLPLALS-IIFNVVVVTFMWSL--------------LY 135

Query: 174 DAIRERAERAERTRDEEARRRVAEERLRIARDLHDVVAHHIALVNVQAGVAAHVMDKRPD 233
+ ++ ++ + + ++ L + H + N + A ++ + P
Sbjct: 136 FGWH-FFKNYKQAEIDQWKMASMAQEAQLMA-LKAQINPHF-MFNALNNIRALIL-EDPT 191

Query: 234 QAKEALAHVREASRSALNELRATVGLLRQSGDPEAPTEPAPGLAVLDELVDTVR------ 287
+A+E L + S L +R + LR S + L + + V
Sbjct: 192 KAREMLTSL-----SEL--MRYS---LRYSNARQVS---------LADELTVVDSYLQLA 232

Query: 288 --RAG--LPVEVACTDRRPPLPAAVDLAAYRV----IQEALTNVRKHA----GPGAKAEV 335
+ L E + +V +Q + N KH G K +
Sbjct: 233 SIQFEDRLQFENQINP---------AIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILL 283

Query: 336 SVVRVGATAEVTVIDNGSGGGVRNGDGGGHGLLGMRERVTALGGSLTAGPRYG------- 388
+ T + V + GS + G GL +RER+ L G
Sbjct: 284 KGTKDNGTVTLEVENTGSLALKNTKESTGTGLQNVRERLQMLY-----GTEAQIKLSEKQ 338

Query: 389 GGFRVHAILP 398
G ++P
Sbjct: 339 GKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_11865  V8PROTEASE  58  2e-11  V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 57.7 bits (139), Expect = 2e-11
Identities = 39/199 (19%), Positives = 68/199 (34%), Gaps = 36/199 (18%)

Query: 58 EYQDVIKNVLPSVVQIE----ASNSLGSGVIYDSKGHIVTNAHVVGNEKTFKVTV----- 108
+ D V I+ + SGV+ K ++TN HVV +
Sbjct: 78 QITDTTNGHYAPVTYIQVEAPTGTFIASGVVV-GKDTLLTNKHVVDATHGDPHALKAFPS 136

Query: 109 -----ATGEKVLRASLVAAYPEQ-DLAVIKLAGVPAG------LKPAKFGDSEKVEVGQI 156
A + Y + DLA++K + +KPA ++ + +V Q
Sbjct: 137 AINQDNYPNGGFTAEQITKYSGEGDLAIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQN 196

Query: 157 VLAMGSPLGLSSSVTQGIVSALGRTVSESSAGGGTGATIANMVQTSAAINPGNSGGALVN 216
+ G P V+ + + + + G +Q + GNSG + N
Sbjct: 197 ITVTGYPGDKP-------VATMWESKGKITYLKGEA------MQYDLSTTGGNSGSPVFN 243

Query: 217 LNSEVIGIPTLGAIDPQIG 235
+EVIGI G + +
Sbjct: 244 EKNEVIGIHW-GGVPNEFN 261


133  C5746_11910  C5746_11940  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
C5746_11910  0       17       0.949111    short-chain dehydrogenase
C5746_11915  1       17       1.147937    TetR family transcriptional regulator
C5746_11920  1       16       1.014177    TIGR01777 family protein
C5746_11925  1       15       1.415803    oxidoreductase
C5746_11930  2       11       0.859997    hypothetical protein
C5746_11935  2       12       1.032862    hypothetical protein
C5746_11940  0       10       2.113819    regulator
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_11950  DHBDHDRGNASE  109    5e-31    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 109 bits (274), Expect = 5e-31
Identities = 75/256 (29%), Positives = 109/256 (42%), Gaps = 11/256 (4%)

Query: 5 EGRRALITGGGSGIGQATVHRVLAEGGRVVAADVNEAGLKATYATAVADG-TADRLTTLV 63
EG+ A ITG GIG+A + ++G + A D N L+ ++ A+ A+
Sbjct: 7 EGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFP--- 63

Query: 64 LDISDEAAVQAGVASAVATLGGLDVLVNAAGILRSEHTHKTSLEFFNKILAVNLTGTFLM 123
D+ D AA+ A +G +D+LVN AG+LR H S E + +VN TG F
Sbjct: 64 ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNA 123

Query: 124 IREAIPALLAGDKPVVVNFSSTSASFAHPYMSAYAASKGGVQSMTHALASEYSKQGLRVV 183
R ++ +V S A M+AYA+SK T L E ++ +R
Sbjct: 124 SRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCN 183

Query: 184 AVAPGSISSDM-----TSGNGPGLPADADMSLFMKLSPALGEGFASPDTVAGVVAMLGSQ 238
V+PGS +DM NG + F K L + A P +A V L S
Sbjct: 184 IVSPGSTETDMQWSLWADENGAEQVIKGSLETF-KTGIPLKK-LAKPSDIADAVLFLVSG 241

Query: 239 DGAFITGTEIRIDGGT 254
IT + +DGG
Sbjct: 242 QAGHITMHNLCVDGGA 257


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_11955  HTHTETR       65     6e-15    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 64.6 bits (157), Expect = 6e-15
Identities = 33/183 (18%), Positives = 58/183 (31%), Gaps = 10/183 (5%)

Query: 8 PSLTERRKAATQLDIARAAAALFAERGPEGTTAEDIAHRAGVALRTFYRYFRSKQDAVGP 67
T++ T+ I A LF+++G T+ +IA AGV Y +F+ K D
Sbjct: 2 ARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61

Query: 68 LLSGGAERWRALLE--AAEPGDALAGVLEQAVTEALRVPGADAAEQLSWTRGLLRAAVED 125
+ L A+ VL + + L + +L ++ E
Sbjct: 62 IWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRL--LMEIIFHKCEF 119

Query: 126 PALRAVWYRVNQDSEERLLPVLTRL------AGDGADPLEVRLAAAAATDAVRVALEAWA 179
AV + ++ + + A L R AA + +E W
Sbjct: 120 VGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWL 179

Query: 180 ETD 182

Sbjct: 180 FAP 182


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_11965  NUCEPIMERASE  41     5e-06    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 40.5 bits (95), Expect = 5e-06
Identities = 46/244 (18%), Positives = 72/244 (29%), Gaps = 45/244 (18%)

Query: 5 RIAVTGSTGLIGAALVRSLRADGHEVV-----------RLVRHPARAGDEVEWDPKRGYV 53
+ VTG+ G IG + + L GH+VV L + + + + +
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 54 DGAGLV-------GCDAVVHLAG-AGVGDHRWTEEYKREIRDSRVLGTAAIAEAVASLGV 105
+ + V V R++ E DS + G I E +
Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAV---RYSLENPHAYADSNLTGFLNILEGCRHNKI 118

Query: 106 PPKVLLCGSAIGYYGDTGDRAIDESAPHGTGFLPSVCVDWEAAAAPAAAAGIRTVYAR-- 163
+ S++ YG AA A + Y+
Sbjct: 119 QHLLYASSSSV--YGLNRKMPFSTDDSVDHPVSLY-------AATKKANELMAHTYSHLY 169

Query: 164 ----TGLVVAREGGAWGR----LFPLFRAGLGGR----LGNGRQYWSFIALHDHIAALRH 211
TGL G WGR LF +A L G+ G+ F + D A+
Sbjct: 170 GLPATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIR 229

Query: 212 LLDT 215
L D
Sbjct: 230 LQDV 233


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_11985  cloacin       36     4e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 36.2 bits (83), Expect = 4e-04
Identities = 31/109 (28%), Positives = 37/109 (33%), Gaps = 12/109 (11%)

Query: 161 GGAGGARAAGGL--GRSGGASGAAGAPGAAEAGGSARPGGPGGARGIGAQGVHGSHWPGG 218
G GA + G G G GA + P G G GI G G H GG
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG-HGNGG 66

Query: 219 SGHGATGASGVSGRGAVRGPVPSRPSGPPAPPGSPAPSTAPASLGTGLS 267
+ G SG G + P G PA ST P + G +S
Sbjct: 67 GNGNSGGGSGTGGNLS--------AVAAPVAFGFPALST-PGAGGLAVS 106


134  C5746_12070  C5746_12105  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
C5746_12070  -1      15       1.591986    hypothetical protein
C5746_12075  -2      13       1.001822    sensor histidine kinase
C5746_12080  -1      12       1.252914    DNA-binding response regulator
C5746_12085  -2      13       1.061546    MarR family transcriptional regulator
C5746_12090  -2      14       0.507387    hypothetical protein
C5746_12095  -3      13       -0.052794   metal-dependent hydrolase
C5746_12100  -2      12       -0.206065   oxidoreductase
C5746_12105  -1      13       0.467749    TetR family transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_12110  ACRIFLAVINRP  62     6e-12    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 61.8 bits (150), Expect = 6e-12
Identities = 38/168 (22%), Positives = 77/168 (45%), Gaps = 23/168 (13%)

Query: 504 AFITAVTFLLMLFCFRSYVIAVTSILLNLLSVAAAYGVMVAVFQHGWGASLIGSEGVGAI 563
+ + L L R+ +I ++ + LL + ++ A +G S+
Sbjct: 348 IMLVFLVMYLFLQNMRATLIPTIAVPVVLL---GTFAILAA-----FGYSI--------- 390

Query: 564 EAWIPLFVLVVLFGLSMDYHVFVVSRI-REARDRGLHTRAAIDEGIRRTAGAVTGAAVIM 622
+ +F +V+ GL +D + VV + R + L + A ++ + + GA+ G A+++
Sbjct: 391 -NTLTMFGMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVL 449

Query: 623 VAVF---AVFGTLSMQDMQQMGVGLAVAVLLDATIVRMILLPSVMALL 667
AVF A FG + +Q + + A+ L + +V +IL P++ A L
Sbjct: 450 SAVFIPMAFFGGSTGAIYRQFSITIVSAMAL-SVLVALILTPALCATL 496



Score = 52.9 bits (127), Expect = 4e-09
Identities = 48/318 (15%), Positives = 106/318 (33%), Gaps = 30/318 (9%)

Query: 75 DVAREVSAAIGRTGEVANLAAPIPSEDGKDALITFDMKGDAATAPDRVQPVLDAVSAVRA 134
DVAR + GE N+ A +GK A A A D + + ++ ++
Sbjct: 264 DVAR-----VELGGENYNVIA---RINGKPAAGLGIKLATGANALDTAKAIKAKLAELQP 315

Query: 135 DHP-DVTIHQFGEASAGKWLGDLLSEDFKKAEFTAVPLALGILLVAFGAVVAALLPVGLA 193
P + + + + L + K F A+ L ++ + + A L+P
Sbjct: 316 FFPQGMKVLYPYDTTP---FVQLSIHEVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAV 372

Query: 194 LTACMAAFGLLSLASHQLHLFQTTYSVMFLMGFAVG----VDYCLFYLRR-ERDERAAGR 248
+ F +L F + + + + G + VD + + ER
Sbjct: 373 PVVLLGTFAIL-------AAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMMEDKL 425

Query: 249 DAETALRIAAATSGRAVLVSGLTVMVAMAGM-FLSGLM--LFKGFALATIIVVFIAMLGS 305
+ A + + A++ + + M F G +++ F++ + + +++L +
Sbjct: 426 PPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVA 485

Query: 306 VTVLPALLSWLGDRIDAGRVPFLNRRSSRRGNRRARASGGFAGTVLKPVLARPKIF--AA 363
+ + PAL + L + + N S + +L +
Sbjct: 486 LILTPALCATL-LKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYLLIY 544

Query: 364 ASVVVLLALAAPALGMKT 381
A +V + + L
Sbjct: 545 ALIVAGMVVLFLRLPSSF 562


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_12115  PF06580       34     8e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 34.5 bits (79), Expect = 8e-04
Identities = 11/70 (15%), Positives = 25/70 (35%), Gaps = 7/70 (10%)

Query: 355 RVWVDIHHADAMLRVSVTDNGRGGAAV---GSGSGLSGIERRLGTF---DGIMAVSSPAG 408
++ + + + + V + G +G+GL + RL + + +S G
Sbjct: 280 KILLKGTKDNGTVTLEVENTGSLALKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQG 339

Query: 409 GPTMVTMEIP 418
+ IP
Sbjct: 340 KVN-AMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_12120  HTHFIS        45     6e-08    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 45.2 bits (107), Expect = 6e-08
Identities = 38/163 (23%), Positives = 67/163 (41%), Gaps = 12/163 (7%)

Query: 2 RVVLAEDLFLLRDGLVRLLEAYDFEIAAAVETGPELTRALDELEPDVAVVDVRLPPSHTD 61
+++A+D +R L + L +++ L R + + D+ V DV +P +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPD---E 60

Query: 62 EGLQCALAARQARPGLPVLVLSQHVEQLYARELLADGNGGIGYLLKDRVFDAEQFIDAVR 121
++ARP LPVLV+S + A + A G YL K FD + I +
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIK--ASEKGAYDYLPKP--FDLTELIGIIG 116

Query: 122 RVAAGGTAMDPQVISQLLSRRSQDKPMGGLTPRELEVMELMAQ 164
R A + S+L P+ G + E+ ++A+
Sbjct: 117 RA----LAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLAR 155


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_12150  HTHTETR       66     2e-15    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 66.2 bits (161), Expect = 2e-15
Identities = 29/141 (20%), Positives = 55/141 (39%), Gaps = 9/141 (6%)

Query: 6 RRRMGVEERRQQLIGVALELFSHRSPDEVSIDEIAAAAGISRPLVYHYFPGKQSLYEAAL 65
+ + +E RQ ++ VAL LFS + S+ EIA AAG++R +Y +F K L+
Sbjct: 4 KTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIW 63

Query: 66 RRAADELAARFVE---PREGPLGARLLRVMGRFFD--FVEEHGPGF-SALMRGGPAVGSS 119
+ + +E G + L ++ + EE + VG
Sbjct: 64 ELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVG-- 121

Query: 120 TANAMIDGVRQAAYEQILAHL 140
A++ ++ + +
Sbjct: 122 -EMAVVQQAQRNLCLESYDRI 141


135  C5746_12385  C5746_12445  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
C5746_12385  2       10       0.251535    TetR family transcriptional regulator
C5746_12390  1       11       1.405133    amidinotransferase
C5746_12395  1       10       2.575193    rRNA methyltransferase
C5746_12400  3       12       1.699784    hypothetical protein
C5746_12405  2       11       1.699251    Bcr/CflA family drug resistance efflux
C5746_12410  1       12       0.826342    oxidoreductase
C5746_12415  2       13       0.479744    alkaline phosphatase
C5746_12420  1       13       -0.069042   3-oxoacyl-ACP reductase
C5746_12425  1       13       -0.677048   hypothetical protein
C5746_12430  -2      13       0.002947    phenazine biosynthesis protein PhzF
C5746_12435  -2      11       0.291109    epimerase
C5746_12440  -1      11       0.899792    ScbA protein
C5746_12445  -2      9        0.995250    TetR family transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_12450  HTHTETR       79     3e-20    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 78.5 bits (193), Expect = 3e-20
Identities = 41/208 (19%), Positives = 68/208 (32%), Gaps = 18/208 (8%)

Query: 6 PPDPSRRSERSRRAIYDAALGLVGEVGYPRTTVEAIAARAGVGKQTIYRWWPSKAAVLLE 65
+ ++ +R+ I D AL L + G T++ IA AGV + IY + K+ + E
Sbjct: 2 ARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61

Query: 66 AFVALGDRVAEENGAGDARGLPDTGDLAADLKVVLRATVDELTDPAFEAPTRALAAEGIV 125
+ + E L D VLR + + + R L E I
Sbjct: 62 IWELSESNIGE-------LELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIF 114

Query: 126 DPQLGAEFVQKLLD-------PSLRLYVARLRAAQEAGQVRADIDPRIALELLVGPLT-- 176
+ + S L+ EA + AD+ R A ++ G ++
Sbjct: 115 HKCEFVGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGL 174

Query: 177 -HRWLLR-TLPLTHEYADAIVDYTLHGL 202
WL + A V L
Sbjct: 175 MENWLFAPQSFDLKKEARDYVAILLEMY 202


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_12455  ARGDEIMINASE  32     0.003    Bacterial arginine deiminase signature.
		>ARGDEIMINASE#Bacterial arginine deiminase signature.

Length = 409

Score = 31.7 bits (72), Expect = 0.003
Identities = 28/186 (15%), Positives = 56/186 (30%), Gaps = 34/186 (18%)

Query: 17 MDPSKPVDLPLAQTQWEDLRDRYRTLGHTVELLTPR--PELPDMVFAANGATVIDGRV-- 72
+ + ++ E+L++ +L V +P+++F + I V
Sbjct: 116 LTIDNMISKMISGVVTEELKNYTSSLDDLVNGANLFIIDPMPNVLFTRDPFASIGNGVTI 175

Query: 73 ------------LGARFAYQERYEEAGAHREWFRDNGFTAIHEPAHVNEGEGDFAVTASY 120
+ A + ++ W ++ EG GD V
Sbjct: 176 NKMFTKVRQRETIFAEYIFKYHPVYKENVPIWLNRWEEASL-------EG-GDELVLNKG 227

Query: 121 ILA-GRGFRSSPLSHNE-AQEFFG-----RPVVGLDLVDPR-YYHLDTALSVLDDAGDEI 172
+L G R+ S + A F ++ + R Y HLDT + +D
Sbjct: 228 LLVIGISERTEAKSVEKLAISLFKNKTSFDTILAFQIPKNRSYMHLDTVFTQIDY--SVF 285

Query: 173 MYYPGA 178
+
Sbjct: 286 TSFTSD 291


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_12470  TCRTETA       64     3e-13    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 64.1 bits (156), Expect = 3e-13
Identities = 84/361 (23%), Positives = 142/361 (39%), Gaps = 28/361 (7%)

Query: 57 TALPPLSMDMYLPALPAVTESLHASAATVQLTLTACLTGMALGQVVVGPM----SDRWGR 112
AL + + + +P LP + L S V L AL Q P+ SDR+GR
Sbjct: 14 VALDAVGIGLIMPVLPGLLRDLVHSN-DVTAHYGILLALYALMQFACAPVLGALSDRFGR 72

Query: 113 RRPLLLGMIIYVVATAICVFAPTTELLIGFRLLQGLAGAAGIVIARAVVRDMYDGVEMAR 172
R LL+ + V AI AP +L R++ G+ GA G V A + D+ DG E AR
Sbjct: 73 RPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAG-AYIADITDGDERAR 131

Query: 173 FFSTLMLISGVAPIVAPLIGGQVLRFTDWRGIFAVLTVVGVVLTLVVQKWLHETLPPQDR 232
F + G + P++GG + F+ F + + L L E+ + R
Sbjct: 132 HFGFMSACFGFGMVAGPVLGGLMGGFSP-HAPFFAAAALNGLNFLTGCFLLPESHKGERR 190

Query: 233 HTGGIGDALRTMRGLLADRVFTGYMIAGSLAFAALFSYVSASPFVVQEIYGASPQTFS-L 291
+AL + R T +A +A + V P + I+G +
Sbjct: 191 --PLRREALNPLASFRWARGMTV--VAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDAT 246

Query: 292 LFGINSVGLIVVGQINGKVLVGRISL----DKALAFGLSVIVLAAAALLLMTSGVFGHVG 347
GI+ ++ + ++ G ++ +AL G+ L T G
Sbjct: 247 TIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRG------ 300

Query: 348 LVPVAAGLFVLMSAMGLAMPNTNAQALMRTKHAAGSASALLGTSSFL--IGAVASPLVGI 405
+A + VL+++ G+ MP QA++ + L G+ + L + ++ PL+
Sbjct: 301 --WMAFPIMVLLASGGIGMPAL--QAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFT 356

Query: 406 A 406
A
Sbjct: 357 A 357


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_12475  RTXTOXINC     30     0.010    Gram-negative bacterial RTX toxin-activating protein C...
		>RTXTOXINC#Gram-negative bacterial RTX toxin-activating protein C signature.

Length = 170

Score = 29.9 bits (67), Expect = 0.010
Identities = 11/30 (36%), Positives = 17/30 (56%), Gaps = 2/30 (6%)

Query: 48 RFGIPRAYGSWADLAADDEVDVVYVATPHS 77
R P AY SWA+L+ ++E+ Y+ S
Sbjct: 49 RDDYPVAYCSWANLSLENEIK--YLNDVTS 76


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_12485  DHBDHDRGNASE  105    3e-29    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 105 bits (262), Expect = 3e-29
Identities = 62/194 (31%), Positives = 87/194 (44%), Gaps = 5/194 (2%)

Query: 11 KTAVVTGAGSGIGRAVALALTGAGWSVALAGRRPEPLAETAALAGKNARVI-TVPTDVSR 69
K A +TGA GIG AVA L G +A PE L + + AR P DV
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 70 PEDVDALFSSARECFGRLDLLFNNAGAFGPRSVPVEDLSVEDWRSVVDVNVTGAFLCAQA 129
+D + + G +D+L N AG R + LS E+W + VN TG F +++
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVL--RPGLIHSLSDEEWEATFSVNSTGVFNASRS 126

Query: 130 AYRLMKAQDPQGGRIINNGSVSAHAPRPHSVAYTATKHAMTGLTKSLSLDGRPYRIACGQ 189
+ M + + G I+ GS A PR AY ++K A TK L L+ Y I C
Sbjct: 127 VSKYMMDR--RSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNI 184

Query: 190 IDVGNAATDMTERM 203
+ G+ TDM +
Sbjct: 185 VSPGSTETDMQWSL 198


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
C5746_12500NUCEPIMERASE671e-14 Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 66.7 bits (163), Expect = 1e-14
Identities = 67/362 (18%), Positives = 108/362 (29%), Gaps = 91/362 (25%)

Query: 3 LRILITGATGFIGSHVVAAARA-----------TPGVRLRLMTHRTALTATAGAGPGIET 51
++ L+TGA GFIG HV + L R L A PG +
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQ----PGFQF 56

Query: 52 VHGDLADPATLRG--SCEGVDAVIHCASRIG-----GDEPTARSVNDQGTRALAEEAVRC 104
DLAD + + + V R+ + N G + E
Sbjct: 57 HKIDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHN 116

Query: 105 GVTRIVHLSTASVYGRG---PFTRLRPGQAEPDPASVTSLTRAAAE------QHVLAAGG 155
+ +++ S++SVYG PF+ P S+ + T+ A E H+
Sbjct: 117 KIQHLLYASSSSVYGLNRKMPFSTDDS---VDHPVSLYAATKKANELMAHTYSHLYGLPA 173

Query: 156 VVVRPHIVYGAGDRWAVPGLVALVRQLSAVLTGCEARHSLIDVRTLGR------------ 203
+R VYG W P + AL + A+L G IDV G+
Sbjct: 174 TGLRFFTVYGP---WGRPDM-ALFKFTKAMLEG-----KSIDVYNYGKMKRDFTYIDDIA 224

Query: 204 -ALLGAALSPKEPAG-----------------VYHVNHPEPVSCSQLLSTVVDELRLPWG 245
A++ VY++ + PV + + D L +
Sbjct: 225 EAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIE-- 282

Query: 246 ATGIDVDTARERLADVPFALHHLGMLAVAHWFTD-ERVGQDFDWEPGEGFATAFARHAPW 304
+ + DV D + + + + P W
Sbjct: 283 ---AKKNMLPLQPGDVL------------ETSADTKALYEVIGFTPETTVKDGVKNFVNW 327

Query: 305 YR 306
YR
Sbjct: 328 YR 329


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_12510  HTHTETR       88     1e-23    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 87.8 bits (217), Expect = 1e-23
Identities = 40/169 (23%), Positives = 69/169 (40%), Gaps = 16/169 (9%)

Query: 4 QARAIQTRRSILVAAAAVFDERGYSSATISEILARAGVTKGALYFHFTSKEDLALGVMD- 62
+ A +TR+ IL A +F ++G SS ++ EI AGVT+GA+Y+HF K DL + +
Sbjct: 6 KQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWEL 65

Query: 63 ---------VQLDSDPLPAQLTKLQELVDQGMLLAHRLRHEPLVRASVGLAMDQA----- 108
++ + L+ L+E++ + L+ + +
Sbjct: 66 SESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAV 125

Query: 109 VEGLDRGTPFRAWIDRLEHLLTAAKNQGELLPHVNARETAEMLAGSFSG 157
V+ R DR+E L L + R A ++ G SG
Sbjct: 126 VQQAQRNLCLE-SYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISG 173


136  C5746_12570  C5746_12595  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
C5746_12570  0       9        2.048858    DNA-binding response regulator
C5746_12575  1       10       1.710277    hypothetical protein
C5746_12580  0       10       0.953386    serine protease
C5746_12585  0       11       1.181942    AAA family ATPase
C5746_12590  0       11       1.303054    hypothetical protein
C5746_12595  0       11       1.573670    MFS transporter
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_12630  HTHFIS        60     6e-13    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 60.2 bits (146), Expect = 6e-13
Identities = 27/131 (20%), Positives = 49/131 (37%), Gaps = 2/131 (1%)

Query: 12 RVLLAEDQGMMRGALALLLGLEPDIEVVAQVGAGDEIVAAALLSRPDVALLDIELPGRSG 71
+L+A+D +R L L +V + D+ + D+ +P +
Sbjct: 5 TILVADDDAAIRTVLNQALSRA-GYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 72 LDAAADLREEVPDCRVLILTTFGRPGYLRRAMEAGAAGFLVKDGPVEELAAAIRRVLSGE 131
D +++ PD VL+++ +A E GA +L K + EL I R L+
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122

Query: 132 TVIDPALAAAA 142
L +
Sbjct: 123 KRRPSKLEDDS 133


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_12640  V8PROTEASE    35     0.001    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 35.0 bits (80), Expect = 0.001
Identities = 36/174 (20%), Positives = 64/174 (36%), Gaps = 25/174 (14%)

Query: 15 PSRLVVTVRRAADGGTAGA-GFVLGIDTVLTCAHVVNDALGRPM-LEARPPQLEEILVEV 72
V ++ A GT A G V+G DT+LT HVV+ G P L+A P +
Sbjct: 86 HYAPVTYIQVEAPTGTFIASGVVVGKDTLLTNKHVVDATHGDPHALKAFPSA-------I 138

Query: 73 RGTSTE--QYFARLLHWVAPRAKDGSTVR---EGDSEWLGDLAVLRIDGPAGGLPAAPDR 127
+ + A + + D + V+ ++ +G++ PA A +
Sbjct: 139 NQDNYPNGGFTAEQITKYSGEG-DLAIVKFSPNEQNKHIGEVV-----KPATMSNNAETQ 192

Query: 128 AAMSTDQEVSAWHGGGRAATLARLTVASLHGSHGYLDGEATGMAVGPGYSGGPL 181
+ V+ + G AT+ + + G ++ G SG P+
Sbjct: 193 VNQNI--TVTGYPGDKPVATMWE-SKGKITYLKGE--AMQYDLSTTGGNSGSPV 241


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_12650  PF05616       36     0.001    Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 35.9 bits (82), Expect = 0.001
Identities = 29/109 (26%), Positives = 45/109 (41%), Gaps = 5/109 (4%)

Query: 9 RLAEALRVLSACGHDLDADQLLDVLWLARS--LPAGPGAPLHRERPETGQPSAGPG--PE 64
R ++V++ G D + +DV + R P AP + PE P+ P P
Sbjct: 284 RNGNPVQVVATFGRDSQGNTTVDVQVIPRPDLTPGSAEAPNAQPLPEV-SPAENPANNPA 342

Query: 65 PDRVSLRRPLPEPDDRDLPDLTAPSLYAAARQPPAPQSPLPRHGRDSRK 113
P+ RP PEPD PD + +P +P P +GR ++
Sbjct: 343 PNENPGTRPNPEPDPDLNPDANPDTDGQPGTRPDSPAVPDRPNGRHRKE 391


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_12655  TCRTETB       132    4e-36    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 132 bits (334), Expect = 4e-36
Identities = 85/400 (21%), Positives = 165/400 (41%), Gaps = 16/400 (4%)

Query: 26 CVGQFLVVLDVSVVNVALPSMRSDLAMTAAGLQWVLNAYSIAFAGFMLLGGRAADIYGRK 85
C+ F VL+ V+NV+LP + +D A WV A+ + F+ + G+ +D G K
Sbjct: 20 CILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIK 79

Query: 86 RMFLIGLGLFTAASLAGGLAQEGWQLL-AARAAQGLGAAVLAPATLTLLTTTVPEGPART 144
R+ L G+ + S+ G + + LL AR QG GAA PA + ++ R
Sbjct: 80 RLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAF-PALVMVVVARYIPKENRG 138

Query: 145 KAIGTWMAVGAGGGAAGGLIGGVLTDLLSWRWVLLINVPVGVLVLAGAAVWLAEGRAGDR 204
KA G ++ A G G IGG++ + W + L+ +P+ ++ + L + +
Sbjct: 139 KAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSY--LLLIPMITIITVPFLMKLLKKEVRIK 196

Query: 205 RRIDFLGAVLVTAGLAFVAYGIVQTEESGWTAEATLAPLLGGVALLIAFVVVEARTAEPL 264
D G +L++ G+ F T +++ L+ V + FV + +P
Sbjct: 197 GHFDIKGIILMSVGIVFFMLF---------TTSYSISFLIVSVLSFLIFVKHIRKVTDPF 247

Query: 265 MPLRVLGARAVTSANAAMFVIGSATFSMWYFMTVYAQTVLGYSPLQAGLALM-PTSLAVV 323
+ + +I + + V S + G ++ P +++V+
Sbjct: 248 VDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVI 307

Query: 324 IGSKCAPRVMARAGAKNLALIGTTVAAAGFGWQSTMGADGSYLTSVCLPGVLMMAGAGLA 383
I ++ R G + IG T + F S + S+ ++ + V ++ G
Sbjct: 308 IFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIII--VFVLGGLSFT 365

Query: 384 STPLASLAITGAAHGEAGLVSGLVNTSRTMGGALGLAVLS 423
T ++++ + EAG L+N + + G+A++
Sbjct: 366 KTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVG 405


137  C5746_12865  C5746_12885  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
C5746_12865  1       14       2.743599    GNAT family N-acetyltransferase
C5746_12870  1       13       3.167312    EmrB/QacA family drug resistance transporter
C5746_12875  1       15       3.655791    hypothetical protein
C5746_12880  0       13       3.790575    hypothetical protein
C5746_12885  0       14       3.459047    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_12940  SACTRNSFRASE  36     1e-05    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 36.5 bits (84), Expect = 1e-05
Identities = 15/60 (25%), Positives = 26/60 (43%)

Query: 80 DVAELTRVFVRPEHRGTGGGGLLLAAVESAARAFGISTVRLDTRNDLVEARGLYAKHGYR 139
A + + V ++R G G LL A+ + L+T++ + A YAKH +
Sbjct: 88 GYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFI 147


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_12945  TCRTETB       161    2e-45    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 161 bits (410), Expect = 2e-45
Identities = 95/406 (23%), Positives = 176/406 (43%), Gaps = 17/406 (4%)

Query: 33 LLAALDQTIVSTALPTIVSDLGGLEH-LSWVVTAYLLASTAATPLWGKLGDQYGRKKLFQ 91
+ L++ +++ +LP I +D +WV TA++L + T ++GKL DQ G K+L
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 92 TAIVIFLIGSALCGVAQNM-PQLIGFRALQGLGGGGLIVLSMAIVGDIVAPRERGKYQGL 150
I+I GS + V + LI R +QG G L M +V + RGK GL
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143

Query: 151 FGAVFGATSVLGPLLGGFFTQHLSWRWVFYINLPIGVVALLVIAAVLYIPVRRTQHTIDY 210
G++ +GP +GG ++ W + + +P+ + + L R + D
Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHYIHWS--YLLLIPMITIITVPFLMKLLKKEVRIKGHFDI 201

Query: 211 LGTFLIASVATCLVLVASLGGTTWAWGSAQVIALAALSVLLLIAFVHVERRAAEPVIPLK 270
G L++ + L T+++ +SVL + FV R+ +P +
Sbjct: 202 KGIILMSVGIVFFM----LFTTSYSIS------FLIVSVLSFLIFVKHIRKVTDPFVDPG 251

Query: 271 LFRIRTFSLVAVISFVVGFAMFGAMTYLPTFLQVVHSITPTMSG-VHMLPMVLGLLLTST 329
L + F + + ++ + G ++ +P ++ VH ++ G V + P + +++
Sbjct: 252 LGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGY 311

Query: 330 LSGQIVSRTGRWKVFPIAGTGITAIGLQLLHRLTETSSTLEMSIYFFVFGAGLGLVMQVL 389
+ G +V R G V I G ++ L ET+S I FV G GL V+
Sbjct: 312 IGGILVDRRGPLYVLNI-GVTFLSVSFLTASFLLETTSWFMTIIIVFVLG-GLSFTKTVI 369

Query: 390 VLVVQNAVTYEDLGVATSGATFFRSIGASFGVAVFGTIFTNRLTDK 435
+V +++ ++ G S F + G+A+ G + + L D+
Sbjct: 370 STIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLLDQ 415


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_12950  FLGHOOKFLIK   30     0.018    Flagellar hook-length control protein signature.
		>FLGHOOKFLIK#Flagellar hook-length control protein signature.

Length = 375

Score = 29.8 bits (66), Expect = 0.018
Identities = 20/80 (25%), Positives = 31/80 (38%)

Query: 196 LPSTVASAPAEETASASASASASAVPSGPASTSPSASAPTSTSPSASTSPSPSASAASQP 255
LP+ + + T+ +A P PA A + ++PSP +AAS
Sbjct: 155 LPTEKPTLFTKLTSEQLTTAQPDDAPGTPAQPLTPLVAEAQSKAEVISTPSPVTAAASPL 214

Query: 256 ATASPSASTPHASRPASSAP 275
T + P + P SAP
Sbjct: 215 ITPHQTQPLPTVAAPVLSAP 234


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_12960  cdtoxina      35     9e-05    Cytolethal distending toxin A signature.
		>cdtoxina#Cytolethal distending toxin A signature.

Length = 258

Score = 34.7 bits (79), Expect = 9e-05
Identities = 9/60 (15%), Positives = 20/60 (33%)

Query: 34 KVRNWQTGYVLGVAGGSTVGGAQIQYEVDTDNLAQKWAIDPVTSSTFLLRNLNSDMCVTA 93
+ RN G + G G + + + L++L++ +C+ A
Sbjct: 130 QFRNVDVGTCMTSFPGFKGGVQLSTAPCKFGPERFDFQPMATRNGNYQLKSLSTGLCIRA 189


138  C5746_12960  C5746_12990  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
C5746_12960  0       16       -1.275466   DNA-binding response regulator
C5746_12965  -2      21       0.370547    hypothetical protein
C5746_12970  -1      16       0.692767    MFS transporter
C5746_12975  0       16       1.541010    MerR family transcriptional regulator
C5746_12980  1       20       0.785952    hypothetical protein
C5746_12985  0       16       1.208346    flavoprotein
C5746_12990  -2      14       1.051903    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_13040  HTHFIS        68     3e-15    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 67.5 bits (165), Expect = 3e-15
Identities = 37/186 (19%), Positives = 67/186 (36%), Gaps = 11/186 (5%)

Query: 2 IRVLIADDEPMIRKGVGSVLSTDPEIDVVAEAADGHDAVELVRRHRPAVAVLDIRMPGMN 61
+L+ADD+ IR + L + DV ++ + + V D+ MP N
Sbjct: 4 ATILVADDDAAIRTVLNQAL-SRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 62 GIEAAAEIRRTVPETGVVMLTTFGEDDYILQALGGGAAGFLIKSGEPEELIAGVRAVADG 121
+ I++ P+ V++++ ++A GA +L K + ELI +
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA--- 118

Query: 122 AAYLSPKVAARVVAHLSATGAGALAGRRTA---ARERVRALTGRERDVLAFLGSGLSNGQ 178
L+ + L GR A + L + ++ SG
Sbjct: 119 ---LAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKEL 175

Query: 179 IARRLH 184
+AR LH
Sbjct: 176 VARALH 181


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_13045  TCRTETB       29     0.038    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 28.7 bits (64), Expect = 0.038
Identities = 22/107 (20%), Positives = 33/107 (30%), Gaps = 7/107 (6%)

Query: 16 LAACVVLGAGAAVVAGLAPGGLGRAGAVEVTGGASASAYRSLTGAVADGPSWTGPLLEAA 75
+ A + GAGAA L + R E G A + GP+ G +
Sbjct: 107 IMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYI 166

Query: 76 TEGTL-------VVLGLLLVWAWWGAVRRGDTRLSAGTVLTGIGTVL 115
L ++ L+ VR G +L +G V
Sbjct: 167 HWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVF 213


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_13050  TCRTETA       41     6e-06    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 41.0 bits (96), Expect = 6e-06
Identities = 42/150 (28%), Positives = 63/150 (42%), Gaps = 11/150 (7%)

Query: 31 AFVTILTEALPAG----VLPAMSGDLGVSEARA---GLLITVYALAAALTAIPMTAWTLI 83
T+ +A+ G VLP + DL S G+L+ +YAL A + A +
Sbjct: 10 ILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDR 69

Query: 84 LPRRTLLLALLLGFAATNMVTAMSTGFALTLAARVVSGVFAGLLWAMVPAYAARLAPQRS 143
RR +LL L G A + A + + R+V+G+ G A+ AY A +
Sbjct: 70 FGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGI-TGATGAVAGAYIADITDGDE 128

Query: 144 GKAIATALAGMTVGLSLGIPAGTALGGLIG 173
A M+ G+ AG LGGL+G
Sbjct: 129 R---ARHFGFMSACFGFGMVAGPVLGGLMG 155


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_13075  RTXTOXINA     34     8e-04    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 33.8 bits (77), Expect = 8e-04
Identities = 27/110 (24%), Positives = 45/110 (40%), Gaps = 8/110 (7%)

Query: 56 LTGAGATAVSSLF------SPAETTSASVG-IFGVVFLAISVLSFARAAQRLFEQTWELK 108
+ G +S T++A+ G I V LAIS LSF A + F++ +++
Sbjct: 279 VLGNVGKGISQYIIAQRAAQGLSTSAAAAGLIASAVTLAISPLSFLSIADK-FKRANKIE 337

Query: 109 PLSVRNTRNGLWWILTLGGYAAVTTLLSALLGGGPLGLAALACGVLVTAA 158
S R + G L + T + A L LA+++ G+ A
Sbjct: 338 EYSQRFKKLGYDGDSLLAAFHKETGAIDASLTTISTVLASVSSGISAAAT 387


139  C5746_14430  C5746_14465  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
C5746_14430  -1      15       1.748872    energy-dependent translational throttle protein
C5746_14435  -3      15       2.124153    peptidase
C5746_14440  -2      16       2.413846    single-stranded DNA-binding protein
C5746_14445  -1      18       1.962062    ATP-binding protein
C5746_14450  -1      15       3.002708    ATP-binding protein
C5746_14455  -1      14       3.274207    *DNA-binding response regulator
C5746_14460  0       14       3.700401    sensor histidine kinase
C5746_14465  0       11       3.363852    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_14510  PF05272       31     0.013    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 31.2 bits (70), Expect = 0.013
Identities = 11/32 (34%), Positives = 15/32 (46%), Gaps = 4/32 (12%)

Query: 28 FLPGAKIGVV----GPNGAGKSTVLKIMAGLE 55
PG K G G GKST++ + GL+
Sbjct: 589 MEPGCKFDYSVVLEGTGGIGKSTLINTLVGLD 620


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_14545  HTHFIS        53     2e-10    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 52.5 bits (126), Expect = 2e-10
Identities = 28/89 (31%), Positives = 46/89 (51%), Gaps = 5/89 (5%)

Query: 1 MDRSLRVVLAEDSVLLREGLIGLLTRFGHEVVAAVGDAEALTAAVAEHGPDIVVTDVRMP 60
M + +++A+D +R L L+R G++V +A L +A D+VVTDV MP
Sbjct: 1 MTGA-TILVADDDAAIRTVLNQALSRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMP 58

Query: 61 PGFQDEGLHAAVRLREKQPALPVLVLSQY 89
+ R+++ +P LPVLV+S
Sbjct: 59 ---DENAFDLLPRIKKARPDLPVLVMSAQ 84


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_14550  PF06580       36     2e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 36.0 bits (83), Expect = 2e-04
Identities = 16/70 (22%), Positives = 31/70 (44%), Gaps = 6/70 (8%)

Query: 358 RVAVSGGHAGGRMFLEIHDDGRGGA--STSGGGSGLTGLADRVSVL---DGRLSLSSPAG 412
++ + G G + LE+ + G + G+GL + +R+ +L + ++ LS G
Sbjct: 280 KILLKGTKDNGTVTLEVENTGSLALKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQG 339

Query: 413 GPTRLRVEIP 422
V IP
Sbjct: 340 KVN-AMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_14555  SURFACELAYER  29     0.048    Lactobacillus surface layer protein signature.
		>SURFACELAYER#Lactobacillus surface layer protein signature.

Length = 439

Score = 28.9 bits (64), Expect = 0.048
Identities = 27/122 (22%), Positives = 41/122 (33%), Gaps = 15/122 (12%)

Query: 12 LAIAVAAAVTTFPAVAQASTPAPVTASATPAAAVPAPTPRRDDFNGDGYPDVAFTAP--- 68
L I AAA A+T PV A+ T A ++ D P ++ A
Sbjct: 5 LRIVSAAAAALLAVAPIAATAMPVNAATTINADSAINANTNAKYDVDVTPSISAIAAVAK 64

Query: 69 -------GATVGGTAKAGYVGVVYGSKSGLKTSTKQVFTQDSPGI---PDTAEAGDAFGS 118
++ G+ A Y G Y + L + DS P EA A+
Sbjct: 65 SDTMPAIPGSLTGSISASYNGKSY--TANLPKDSGNATITDSNNNTVKPAELEADKAYTV 122

Query: 119 SM 120
++
Sbjct: 123 TV 124


140  C5746_15500  C5746_15535  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
C5746_15500  7       51       -10.020416  hypothetical protein
C5746_15505  7       51       -10.035368  hypothetical protein
C5746_15510  8       45       -7.308962   hypothetical protein
C5746_15515  4       36       -5.722125   hypothetical protein
C5746_15520  0       25       -2.353599   ArsR family transcriptional regulator
C5746_15525  1       23       -2.301913   serine/threonine protein kinase
C5746_15530  -1      18       -2.167853   hypothetical protein
C5746_15535  -1      14       -0.000907   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_15565  PF05616       33     0.002    Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 32.8 bits (74), Expect = 0.002
Identities = 20/64 (31%), Positives = 28/64 (43%), Gaps = 2/64 (3%)

Query: 111 PASPPPPSTQ-FPTVGAPTARPAGGYEPVEDHSDKTTPHPPSEADPAADPDPGPTAGPDP 169
P S P+ Q P V +P PA P E+ + P P + +P A+PD G P
Sbjct: 317 PGSAEAPNAQPLPEV-SPAENPANNPAPNENPGTRPNPEPDPDLNPDANPDTDGQPGTRP 375

Query: 170 EPTA 173
+ A
Sbjct: 376 DSPA 379



Score = 31.6 bits (71), Expect = 0.004
Identities = 18/63 (28%), Positives = 27/63 (42%), Gaps = 2/63 (3%)

Query: 115 PPPSTQFPTVGAPTARPAGGYEPVEDHSDKTTPHPPSEADPAADPDP--GPTAGPDPEPT 172
P P + AP A+P P E+ ++ P+ P +PDP P A PD +
Sbjct: 311 PRPDLTPGSAEAPNAQPLPEVSPAENPANNPAPNENPGTRPNPEPDPDLNPDANPDTDGQ 370

Query: 173 AGS 175
G+
Sbjct: 371 PGT 373


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_15580  GPOSANCHOR    37     6e-05    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 37.0 bits (85), Expect = 6e-05
Identities = 22/97 (22%), Positives = 34/97 (35%), Gaps = 15/97 (15%)

Query: 156 EPGKPGKPGKPSTPATPSAPGAPSGPAQPTAAPTSSAPEGTKPTSSTSSPSSPSSSPSTQ 215
+ GK TP A G Q GTKP + + +P +
Sbjct: 456 AKLRAGKASDSQTPDAKPGNKAVPGKGQA-------PQAGTKP--------NQNKAPMKE 500

Query: 216 SGGDLAETGSGAPVGLLSAAAAALVAAGGFLVIRRRK 252
+ L TG A +AA + AG V++R++
Sbjct: 501 TKRQLPSTGETANPFFTAAALTVMATAGVAAVVKRKE 537



Score = 36.2 bits (83), Expect = 9e-05
Identities = 28/104 (26%), Positives = 41/104 (39%), Gaps = 17/104 (16%)

Query: 153 KPDEPGKPGKPGKPSTPATPSAPGAPSGPAQPTAAPTSSAPE-GTKPTSSTSSPSSPSSS 211
K E + GK S TP A P AP+ GTKP + + +
Sbjct: 450 KQAEELAKLRAGKASDSQTPDAK-----PGNKAVPGKGQAPQAGTKP--------NQNKA 496

Query: 212 PSTQSGGDLAETGSGA-PVGLLSAAAAALVAAGGFLVIRRRKAQ 254
P ++ L TG A P +AAA ++A G + +RK +
Sbjct: 497 PMKETKRQLPSTGETANP--FFTAAALTVMATAGVAAVVKRKEE 538


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_15595  PF03544       38     8e-05    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 37.6 bits (87), Expect = 8e-05
Identities = 20/130 (15%), Positives = 29/130 (22%), Gaps = 4/130 (3%)

Query: 308 PAPVQTPPPPSSAPPAAYSPTTPVAPATPPPGYGPPATSQHAQHAQPTQAAQ-HAQPTQA 366
AP PP + PP P P P A + + + +
Sbjct: 55 VAPADLEPPQAVQPPPE-PVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQ 113

Query: 367 AQPHGPYGPPTPVQPQGPYGTPAQAQAQAQAQAHAYAPTQVPTHAQMYPGHPMPQAPQPA 426
+ P P + A A + PQ P A
Sbjct: 114 PKRDVKPVESRPASPFENTAPARPTSSTATAATSKPVTS--VASGPRALSRNQPQYPARA 171

Query: 427 KRKGNRGAVI 436
+ G V
Sbjct: 172 QALRIEGQVK 181


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_15605  IGASERPTASE   52     2e-09    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 51.6 bits (123), Expect = 2e-09
Identities = 36/195 (18%), Positives = 70/195 (35%), Gaps = 16/195 (8%)

Query: 23 PRLEQHQQTLEKELATVTERLE-------SVRTALTALRSLSTSPLSSST-TEAQEVKAE 74
P +E+ QT++ T ++ S + + P + +T +E E AE
Sbjct: 983 PEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAE 1042

Query: 75 DVKAEAVDVKAEAETESVAAPRIPQQAQDSVDSAAPAEVPQEPAAAAPARRTAR-KTSAP 133
+ K E+ K + E A Q + V A + V A+ + K +
Sbjct: 1043 NSKQES---KTVEKNEQDATETTAQNRE--VAKEAKSNVKANTQTNEVAQSGSETKETQT 1097

Query: 134 KGRKQAPAKPKTERGQAKKAVATKSAKPAKAAAP--AKATAPAKDAKPAKATAPAKDAKP 191
K+ K E+ + + + K +P ++ A+PA+ P + K
Sbjct: 1098 TETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKE 1157

Query: 192 AKARASKKAAPAAPA 206
+++ + A PA
Sbjct: 1158 PQSQTNTTADTEQPA 1172


141  C5746_15565  C5746_15600  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
C5746_15565  2       12       -1.608210   hypothetical protein
C5746_15570  1       11       -0.821484   MepB domain containing protein
C5746_15575  0       12       -0.736085   hypothetical protein
C5746_15580  1       12       -0.309642   TetR/AcrR family transcriptional regulator
C5746_15585  -1      12       -0.651821   MFS transporter
C5746_15590  -3      12       -0.281825   hypothetical protein
C5746_15595  0       13       -0.779743   DUF445 domain-containing protein
C5746_15600  -2      19       -2.562990   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_15640  CHANLCOLICIN  32     2e-04    Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 32.0 bits (72), Expect = 2e-04
Identities = 18/64 (28%), Positives = 24/64 (37%), Gaps = 14/64 (21%)

Query: 31 GTWVPALAFVLFPWAIIAGALAVTLGTAGIHYACHGTGRLWTAIAGTTLGIIGFVGTMTL 90
G W P L L A AG V L++ +AGTTLGI G +
Sbjct: 458 GDWKP-LFLTLEKKAADAGVSYVVAL-------------LFSLLAGTTLGIWGIAIVTGI 503

Query: 91 IWAL 94
+ +
Sbjct: 504 LCSY 507


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_15650  PYOCINKILLER  27     0.037    Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 27.5 bits (60), Expect = 0.037
Identities = 19/58 (32%), Positives = 25/58 (43%), Gaps = 2/58 (3%)

Query: 43 VDPSVNASVAAKAADDAVAAQSASDEASAE--AEAEAAAEAARNRPEAVRDAFAGLQA 98
V A + + + + A AS EA+A A +AAAEA R E R A A
Sbjct: 190 VKLFTEAISSLQIRMNTLTAAKASIEAAAANKAREQAAAEAKRKAEEQARQQAAIRAA 247


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_15655  HTHTETR       48     3e-09    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 47.7 bits (113), Expect = 3e-09
Identities = 23/129 (17%), Positives = 45/129 (34%), Gaps = 13/129 (10%)

Query: 2 LRAARRAFTQRPYAEVTIRGIAADAGVSPSLVVKHFGRKEELFNTVAD------------ 49
L A R F+Q+ + ++ IA AGV+ + HF K +LF+ + +
Sbjct: 17 LDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELE 76

Query: 50 -FGPAAAELFDAPLDMLGRHMVVTLVSRRRELQSDPLLRVVFSLGNRDERSLLRDRFHEQ 108
+ ++L + T+ RR L + + +G + +
Sbjct: 77 YQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRNLCLE 136

Query: 109 VTDVLAARL 117
D + L
Sbjct: 137 SYDRIEQTL 145


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_15660  TCRTETB       109    4e-28    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 109 bits (275), Expect = 4e-28
Identities = 78/408 (19%), Positives = 167/408 (40%), Gaps = 22/408 (5%)

Query: 24 VLAFCGVVVAVMQTIVVPLLPHIPALTGATPAAASWLVTVTLLTGAVFTPVLGRVGDMYG 83
+L+F V+ ++ + LP I PA+ +W+ T +LT ++ T V G++ D G
Sbjct: 21 ILSFFSVLNEMVLNVS---LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLG 77

Query: 84 KRRVLVASLVVLVVGSVLCAVS-SHIGVLITGRALQGAALAVVPLGISILRDEL-PPERV 141
+R+L+ +++ GSV+ V S +LI R +QGA A P + ++ P E
Sbjct: 78 IKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENR 137

Query: 142 LSAVALMSSTLGIGAAVGLPVAALVVENFDWHTMFWVSGVIGVIDIVLVLWCVPESPLRT 201
A L+ S + +G VG + ++ W + + +I +I + ++ + + R
Sbjct: 138 GKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIP-MITIITVPFLMKLLKKEV-RI 195

Query: 202 RGRFDAVGALGLSGALVCLLLAVTQGADWGWTSARTVGLLVAAVVVALVWGAYELRVPTP 261
+G FD G + +S +V +L T + L+ +V+ L++ + +V P
Sbjct: 196 KGHFDIKGIILMSVGIVFFMLFTTS-YSISF--------LIVSVLSFLIFVKHIRKVTDP 246

Query: 262 MVDLRVSARPAVLLTNVAALLIGFAFYANSLVTAQMVQEPKATGYGLGASLVVSGLCLLP 321
VD + ++ + +I + M++ + L ++ + + + P
Sbjct: 247 FVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMK----DVHQL-STAEIGSVIIFP 301

Query: 322 GGVMMVALSPVSARISAKYGPKASLALAAGVIAAG-YVVRYFTSHSLWLIIAGATVVASG 380
G + ++ + + + GP L + ++ + + W + V G
Sbjct: 302 GTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGG 361

Query: 381 TAIAYSALPALVMRGVPVSETGAANGLNTLMRSIGQAFCSATVAAVLA 428
+ + + +V + E GA L + + A V +L+
Sbjct: 362 LSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLS 409


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_15670  TYPE3OMOPROT  30     0.014    Type III secretion system outer membrane O protein ...
		>TYPE3OMOPROT#Type III secretion system outer membrane O protein family signature.

Length = 303

Score = 30.4 bits (68), Expect = 0.014
Identities = 17/51 (33%), Positives = 21/51 (41%), Gaps = 8/51 (15%)

Query: 94 WAKSAGVGGW-----PGFVAAAAEAGMVGALADWFAVTALFKRPLGLPIPH 139
W+ G W P AA AG + W A T +RP LP+PH
Sbjct: 50 WSAWIKPGDWLEHVSPALAGAAVSAGAEHLVVPWLAAT---ERPFELPVPH 97


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_15675  CARBMTKINASE  29     0.036    Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 29.0 bits (65), Expect = 0.036
Identities = 17/46 (36%), Positives = 25/46 (54%), Gaps = 3/46 (6%)

Query: 282 KTVVVALGINDVQQFPQETDPQRIADALRSMTQRA---HARGLRVV 324
K VV+ALG N +QQ Q+ + + D +R ++ ARG VV
Sbjct: 3 KRVVIALGGNALQQRGQKGSYEEMMDNVRKTARQIAEIIARGYEVV 48


142  C5746_15990  C5746_16025  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
C5746_15990  0       14       -2.229777   hypothetical protein
C5746_15995  0       14       -2.067840   hypothetical protein
C5746_16000  0       14       -2.978105   type VII secretion-associated serine protease
C5746_16005  1       16       -5.386092   type VII secretion-associated serine protease
C5746_16010  0       18       -5.619351   hypothetical protein
C5746_16015  0       15       -5.821355   oxidoreductase
C5746_16025  -1      15       -5.325687   (2Fe-2S)-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_16075  YERSSTKINASE  30     0.027    Yersinia serine/threonine protein kinase signature.
		>YERSSTKINASE#Yersinia serine/threonine protein kinase signature.

Length = 732

Score = 30.5 bits (68), Expect = 0.027
Identities = 27/98 (27%), Positives = 41/98 (41%), Gaps = 9/98 (9%)

Query: 115 VRAVGAALCGALGQLHSSEVVHRDLKPSNVMLS-AYG-PKVIDFGIARALGDDRLTRTGT 172
++ + L L + VVH D+KP NV+ A G P VID G+ G+
Sbjct: 247 IKFIAHRLLDVTNHLAKAGVVHNDIKPGNVVFDRASGEPVVIDLGLHSRSGEQ------P 300

Query: 173 AAGTPAYMSPEQASGQ-EQTPAGDVFALAGILVFAATG 209
T ++ +PE G + DVF + L+ G
Sbjct: 301 KGFTESFKAPELGVGNLGASEKSDVFLVVSTLLHCIEG 338


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_16080  YERSSTKINASE  37     1e-04    Yersinia serine/threonine protein kinase signature.
		>YERSSTKINASE#Yersinia serine/threonine protein kinase signature.

Length = 732

Score = 36.6 bits (84), Expect = 1e-04
Identities = 29/100 (29%), Positives = 47/100 (47%), Gaps = 13/100 (13%)

Query: 114 VRALAADLARALGDIHAAGLVHRDVKPANIMM--TSDGPRVIDFGIARPEHGLTLTTTGE 171
++ +A L + AG+VH D+KP N++ S P VID G+ + +GE
Sbjct: 247 IKFIAHRLLDVTNHLAKAGVVHNDIKPGNVVFDRASGEPVVIDLGLH--------SRSGE 298

Query: 172 IP--VTPGYGAPEQVLGQR-VGPAADVFSLGAVLVYAATG 208
P T + APE +G +DVF + + L++ G
Sbjct: 299 QPKGFTESFKAPELGVGNLGASEKSDVFLVVSTLLHCIEG 338


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_16085  SUBTILISIN    131    1e-37    Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 131 bits (332), Expect = 1e-37
Identities = 79/284 (27%), Positives = 119/284 (41%), Gaps = 32/284 (11%)

Query: 1 MQAETMWRTSTGEGVTVAVIDTGV-REVPELAGQLLEGKDFTDGVTGEHDEA------GT 53
+QA +W + G GV VAV+DTG + P+L +++ G++FTD G+ + GT
Sbjct: 29 IQAPAVWNQTRGRGVKVAVLDTGCDADHPDLKARIIGGRNFTDDDEGDPEIFKDYNGHGT 88

Query: 54 TAAAVIAGTGKGAGGKESAYGLAPGAKILPLKVSDGSEAPQSGERSIAINSDLAPAIRYA 113
A IA T G G+AP A +L +KV + + Q + I YA
Sbjct: 89 HVAGTIAATENENGV----VGVAPEADLLIIKVLNKQGSGQY--------DWIIQGIYYA 136

Query: 114 ADSEAKVISISVTASLSIGGAVDEAVKCALSKGKLVFAAVG---DSEFPERPVQNPAAIP 170
+ + +IS+S+ + EAVK A++ LV A G D + + P
Sbjct: 137 IEQKVDIISMSLGGPED-VPELHEAVKKAVASQILVMCAAGNEGDGDDRTDELGYPGCYN 195

Query: 171 GVTGVAALGKDLAALKSSAVGPEVTLSAAGEDVLSACAEPEGLCTSSGSAVATAVAAASA 230
V V A+ D A + S EV L A GED+LS T SG+++AT A +
Sbjct: 196 EVISVGAINFDRHASEFSNSNNEVDLVAPGEDILSTVPG-GKYATFSGTSMATPHVAGAL 254

Query: 231 ALIWAKYP-----DWTNYQVLRVMVNTVGGPTSGAVRNNYIGYG 269
ALI D T ++ ++ G G
Sbjct: 255 ALIKQLANASFERDLTEPELYAQLIKRT---IPLGNSPKMEGNG 295


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_16090  SUBTILISIN    183    3e-56    Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 183 bits (467), Expect = 3e-56
Identities = 87/293 (29%), Positives = 133/293 (45%), Gaps = 26/293 (8%)

Query: 38 QWYLDAMQAEQMWKSSTGENVTVAVIDSGVDASIPDLRGRVLKGKDLAAASPGDE--HTD 95
++ +QA +W + G V VAV+D+G DA PDL+ R++ G++ GD D
Sbjct: 23 PRGVEMIQAPAVWNQTRGRGVKVAVLDTGCDADHPDLKARIIGGRNFTDDDEGDPEIFKD 82

Query: 96 YDNHGTGMASIIAGTGNVRGGGGSFGLAPGVKILPIRLRDTTGKVNGATGNKYLNEDLSV 155
Y+ HGT +A IA T N G G+AP +L I++ + G G + +
Sbjct: 83 YNGHGTHVAGTIAATEN---ENGVVGVAPEADLLIIKVLNKQGS--GQY------DWIIQ 131

Query: 156 AIRFAVDHGAKIINASVGDSIGGSQQLTDSVKYALDKGALIFAAVGN---SADEGNLIEY 212
I +A++ II+ S+G +L ++VK A+ L+ A GN D + + Y
Sbjct: 132 GIYYAIEQKVDIISMSLGGP-EDVPELHEAVKKAVASQILVMCAAGNEGDGDDRTDELGY 190

Query: 213 PAGTPGVVGVGAIGKDLHKADFSQWGPQVDLSAPGVDMVHGCSGGTKLCRTSGTSDAAAI 272
P V+ VGAI D H ++FS +VDL APG D++ GG K SGTS A
Sbjct: 191 PGCYNEVISVGAINFDRHASEFSNSNNEVDLVAPGEDILSTVPGG-KYATFSGTSMATPH 249

Query: 273 VSASAALIWSK-----HLDWTNNQVLRVLLNTVGAPTSGALRNDYVGYGVVRP 320
V+ + ALI D T ++ L+ G G++
Sbjct: 250 VAGALALIKQLANASFERDLTEPELYAQLIKRT---IPLGNSPKMEGNGLLYL 299


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_16105  PF05616       39     6e-05    Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 39.0 bits (90), Expect = 6e-05
Identities = 29/92 (31%), Positives = 41/92 (44%), Gaps = 19/92 (20%)

Query: 356 VDLAQVPAPDTVQAPDTVQAPDTVPAADTVPAPDTVPAADTPAEHPVEATPEHHPA---- 411
VD+ +P PD P + +AP+ P + PA + PA +P P +P
Sbjct: 305 VDVQVIPRPDL--TPGSAEAPNAQPLPEVSPA-------ENPANNP---APNENPGTRPN 352

Query: 412 -EPPAGTAPDSEPDT--APDTAPDAPPLDTPP 440
EP PD+ PDT P T PD+P + P
Sbjct: 353 PEPDPDLNPDANPDTDGQPGTRPDSPAVPDRP 384


143  C5746_16310  C5746_16360  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
C5746_16310  -1      10       1.736828    DNA-binding response regulator
C5746_16315  0       13       2.008556    ribosome-associated translation inhibitor RaiA
C5746_16320  -1      12       0.113058    phosphoribosyltransferase
C5746_16325  -2      11       -0.087579   hypothetical protein
C5746_16330  -1      12       0.200571    two-component sensor histidine kinase
C5746_16335  0       12       0.446245    DNA-binding response regulator
C5746_16340  -1      12       0.015236    S-methyl-5-thioribose-1-phosphate isomerase
C5746_16345  1       15       -1.656612   hypothetical protein
C5746_16350  0       11       -1.746019   hypothetical protein
C5746_16355  -1      10       -1.156493   DUF4350 domain-containing protein
C5746_16360  0       12       -2.078373   AAA family ATPase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_16385  HTHFIS        69     2e-15    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 68.7 bits (168), Expect = 2e-15
Identities = 28/115 (24%), Positives = 44/115 (38%), Gaps = 2/115 (1%)

Query: 33 IRVLVVDDHALFRRGLEIVLAQEEDIQVVGEAGDGAEAVDKAADLLPDIVLMDVRMPKRG 92
+LV DD A R L L++ V + A A D+V+ DV MP
Sbjct: 4 ATILVADDDAAIRTVLNQALSRA-GYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 93 GIEACTSIKEVAPSAKIIMLTISDEEADLYDAIKAGATGYLLKEISTDEVATAIR 147
+ IK+ P +++++ + A + GA YL K E+ I
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_16405  PF06580       30     0.035    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 29.8 bits (67), Expect = 0.035
Identities = 14/76 (18%), Positives = 31/76 (40%), Gaps = 9/76 (11%)

Query: 381 RKGTRIRVVGDEQPVIAEADARRVER-VLRNLVVNAVEHG-----EGRDVVVRMAVAGGA 434
+ R++ P I + +V +++ LV N ++HG +G ++++ G
Sbjct: 235 QFEDRLQFENQINPAIMDV---QVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGT 291

Query: 435 VAVAVRDYGVGLKPGE 450
V + V + G
Sbjct: 292 VTLEVENTGSLALKNT 307


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_16410  HTHFIS        96     2e-25    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 96.4 bits (240), Expect = 2e-25
Identities = 31/141 (21%), Positives = 62/141 (43%), Gaps = 1/141 (0%)

Query: 2 KGRVLVVDDDTALAEMLGIVLRGEGFEPSFVADGDKALAAFREAKPDLVLLDLMLPGRDG 61
+LV DDD A+ +L L G++ ++ DLV+ D+++P +
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 62 IEVCRLIRAE-SGVPIVMLTAKSDTVDVVVGLESGADDYIVKPFKPKELVARIRARLRRS 120
++ I+ +P+++++A++ + + E GA DY+ KPF EL+ I L
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122

Query: 121 EEPAPEQLAIGDLVIDVAGHS 141
+ + + + G S
Sbjct: 123 KRRPSKLEDDSQDGMPLVGRS 143


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_16420  PRTACTNFAMLY  30     0.027    Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 29.6 bits (66), Expect = 0.027
Identities = 18/69 (26%), Positives = 20/69 (28%), Gaps = 4/69 (5%)

Query: 29 VDGSGPAGNW--SPTQPPTGQWSPPSTPGTGPGAPPPAPGWGGGPQGPGWGGPHGSGWRQ 86
DG G + GQWS P AP PAP G P P P +
Sbjct: 543 KDGKVDIGTYRYRLAANGNGQWSLVGAKA--PPAPKPAPQPGPQPPQPPQPQPEAPAPQP 600

Query: 87 PPVAAKPGV 95
P
Sbjct: 601 PAGRELSAA 609


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_16435  HTHFIS        36     3e-04    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 35.6 bits (82), Expect = 3e-04
Identities = 31/148 (20%), Positives = 56/148 (37%), Gaps = 17/148 (11%)

Query: 36 VVGQDPAVTGLV----VALLCRGHVLLEGVPGVAKTLLVRAL-AASLDLDTKRVQFTPDL 90
+VG+ A+ + + +++ G G K L+ RAL + V
Sbjct: 139 LVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAA 198

Query: 91 MPSDVTGSLVYDARTAEFS---------FQPGPVFTNLLLADEINRTPPKTQSSLLEAME 141
+P D+ S ++ F+ F+ T L DEI P Q+ LL ++
Sbjct: 199 IPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGT--LFLDEIGDMPMDAQTRLLRVLQ 256

Query: 142 ERQVTVDGTPRLLPEPF-LVAATQNPIE 168
+ + T G + +VAAT ++
Sbjct: 257 QGEYTTVGGRTPIRSDVRIVAATNKDLK 284


144  C5746_16590  C5746_16625  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
C5746_16590  -1      10       -0.694372   membrane dipeptidase
C5746_16595  -2      11       -0.067448   5-(carboxyamino)imidazole ribonucleotide mutase
C5746_16600  2       14       -0.675600   5-(carboxyamino)imidazole ribonucleotide
C5746_16605  3       13       -0.726608   hypothetical protein
C5746_16610  4       14       -0.931615   two-component sensor histidine kinase
C5746_16615  4       14       -0.694671   DNA-binding response regulator
C5746_16620  4       15       -1.505877   MFS transporter
C5746_16625  4       15       -1.211218   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_16680  ADHESNFAMILY  30     0.013    Adhesin family signature.
		>ADHESNFAMILY#Adhesin family signature.

Length = 309

Score = 30.2 bits (68), Expect = 0.013
Identities = 14/60 (23%), Positives = 27/60 (45%), Gaps = 2/60 (3%)

Query: 242 AMATFVPKFVLPAAVAWTLAADENMSAHGLHHLDTTAQAMKIHAAF--EAANPRPMATVA 299
A F + +P+A W + +E + + L + K+ + F + + RPM TV+
Sbjct: 207 AFKYFSKAYGVPSAYIWEINTEEEGTPEQIKTLVEKLRQTKVPSLFVESSVDDRPMKTVS 266


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_16700  PF06580       41     5e-06    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 41.4 bits (97), Expect = 5e-06
Identities = 20/81 (24%), Positives = 31/81 (38%), Gaps = 24/81 (29%)

Query: 321 LIENSLMHG------GGTVALRTRVTGNQAVIEVTDEGPGVPPDLGARIFERTISGRNST 374
L+EN + HG GG + L+ +EV + G + + ST
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALK-----------NTKEST 311

Query: 375 GIGLAVARDLAEADGGRLELL 395
G GL R+ RL++L
Sbjct: 312 GTGLQNVRE-------RLQML 325


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_16705  HTHFIS        98     5e-26    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 98.4 bits (245), Expect = 5e-26
Identities = 38/128 (29%), Positives = 64/128 (50%)

Query: 2 TRVLLAEDDASISEPLARALRREGYEVEVREDGPTALDAGLQGGIDLVVLDLGLPGMDGL 61
+L+A+DDA+I L +AL R GY+V + + T G DLVV D+ +P +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 EVARRLRAEGHTAPILVLTARADEVDTVVGLDAGADDYVTKPFRLAELLARVRALLRRGA 121
++ R++ P+LV++A+ + + + GA DY+ KPF L EL+ + L
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 122 TEPAPQPA 129
P+
Sbjct: 124 RRPSKLED 131


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_16715  cloacin       30     0.015    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 30.5 bits (68), Expect = 0.015
Identities = 16/64 (25%), Positives = 25/64 (39%)

Query: 169 GGGSTTAGGTTTGSTTGSTTAGGTTTAGGTTTGSTTAGGTTTAGGTTAGGTTAAGGTTSG 228
G +T+G G T G + +G ++ + GG+ + G GG
Sbjct: 11 TGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGN 70

Query: 229 SGGG 232
SGGG
Sbjct: 71 SGGG 74


145  C5746_16925  C5746_16960  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
C5746_16925  -1      12       1.263830    hypothetical protein
C5746_16930  0       13       0.954767    peptide ABC transporter ATP-binding protein
C5746_16940  1       16       1.209931    MFS transporter
C5746_16945  4       16       1.828950    DUF485 domain-containing protein
C5746_16950  3       16       0.512758    cation acetate symporter
C5746_16955  3       14       0.417738    cellulose-binding protein
C5746_16960  1       14       2.035443    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_17030  ACRIFLAVINRP  31     0.019    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 31.3 bits (71), Expect = 0.019
Identities = 33/165 (20%), Positives = 68/165 (41%), Gaps = 24/165 (14%)

Query: 319 VLIEALFLGIVGSVLGVGAGVGLAV-GLMKLMGAMGMDLSTKDLTVAWTTPVVGLALGIV 377
L+ LFL + + L V + + G ++ A G ++T LT+ LA+G++
Sbjct: 352 FLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINT--LTMFGMV----LAIGLL 405

Query: 378 V----TVVAAYIPARRAGKISPMAALRDAGTPADGRAGRVRAAI-GLVLTLAGGAALLAA 432
V VV K+ P A + +++ A+ G+ + L+ +A
Sbjct: 406 VDDAIVVVENVERVMMEDKLPPKEATEKS-------MSQIQGALVGIAMVLSAVFIPMAF 458

Query: 433 TRADKSSEGSLFLGVGVVLTLIGFIVIGPLLAGFVVRVLSALMLR 477
S G+++ +T++ + + L+A + L A +L+
Sbjct: 459 F---GGSTGAIYRQFS--ITIVSAMALSVLVALILTPALCATLLK 498


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_17040  TCRTETB       103    7e-26    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 103 bits (258), Expect = 7e-26
Identities = 80/400 (20%), Positives = 157/400 (39%), Gaps = 22/400 (5%)

Query: 6 LAAIDGTIVSTAVPQIVGDLGGF-SVFSWLFSGYLLAVTVSLPVYGKLSDTFGRKPVLVA 64
+ ++ +++ ++P I D + +W+ + ++L ++ VYGKLSD G K +L+
Sbjct: 25 FSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLF 84

Query: 65 GIVLFLFGSVLCAAAWNMAA-LIAFRVVQGLGGGALQGTVQTIAADLYPLKERPKIQAKL 123
GI++ FGSV+ + + LI R +QG G A V + A P + R K +
Sbjct: 85 GIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLI 144

Query: 124 STVWATSAVAGPVIGGLFAVYADWRWIFLINLPVGALALWLVVRHLHEPARTRRAAGAPR 183
++ A GP IGG+ A Y W +L+ +P+ + + L + +
Sbjct: 145 GSIVAMGEGVGPAIGGMIAHYIHWS--YLLLIPMITIITVPFLMKLLKKEVRIKG----- 197

Query: 184 PRIDWAGALAVFATGTLLLTALVQGGVAWPWFSAPSLGLLAGSVVLGALTVVIERRAAEP 243
D G + + + ++ S+ L SV+ + V R+ +P
Sbjct: 198 -HFDIKGIILMSVGIVFFMLFT----------TSYSISFLIVSVLSFLIFVKHIRKVTDP 246

Query: 244 VIPGWVWRRRTIASVNLALGAMGLLMVAPTVFLPTYAQSVLGLGPIAAGF-VLSVMTLSW 302
+ + + L G + + +P + V L G ++ T+S
Sbjct: 247 FVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSV 306

Query: 303 PVSAAFSNRVYNRIGFRRTAIIGMSCALLILLAFPLLPYPGEPWQPALIMLLLGAALGLF 362
+ + +R G IG++ + L L + +I+ +LG L
Sbjct: 307 IIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLG-GLSFT 365

Query: 363 QLPLIVGVQSTVGWAERGTTTASVLFCRQVGQSVGAALFG 402
+ + V S++ E G + + F + + G A+ G
Sbjct: 366 KTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVG 405


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_17045  PERTACTIN     30     0.003    Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 30.5 bits (68), Expect = 0.003
Identities = 17/49 (34%), Positives = 20/49 (40%), Gaps = 1/49 (2%)

Query: 15 PGRLPKPSRRHQPPPPHDPEHLPPWQSSKPSPPPERPSAPTTPVPAPGR 63
P P P QP P P+ P Q +P PP+R P P GR
Sbjct: 569 PAPKPAPQPGPQPGPQ-PPQPPQPPQPPQPPQPPQRQPEAPAPQPPAGR 616



Score = 28.9 bits (64), Expect = 0.008
Identities = 15/48 (31%), Positives = 21/48 (43%)

Query: 5 SPPPSPYSSWPGRLPKPSRRHQPPPPHDPEHLPPWQSSKPSPPPERPS 52
+PP + PG P P P PP P+ P Q +P P+ P+
Sbjct: 567 APPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPPA 614


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_17055  IGASERPTASE   33     0.002    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 32.7 bits (74), Expect = 0.002
Identities = 24/155 (15%), Positives = 43/155 (27%), Gaps = 6/155 (3%)

Query: 119 GTAEETVAEARKEAAEVREQAESAMAETRRRTASVLAHQEQEHSERWKTAEREVAEAEAA 178
E EA+ + E A + + + Q E E + E A+ E
Sbjct: 1063 AQNREVAKEAKSNVKANTQTNEVAQSGSETKET-----QTTETKETATVEKEEKAKVETE 1117

Query: 179 QAAHHDELTEQAEARLAEARRALARTEEAARHGQEDAEAQGAELIAAARVREERVVRETE 238
+ ++T Q + E + E AR + + E+ +ET
Sbjct: 1118 KTQEVPKVTSQVSPK-QEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETS 1176

Query: 239 RILREHEEGREEVQAHMAHVRNSLAALTGRVTPAE 273
+ + V + V N P
Sbjct: 1177 SNVEQPVTESTTVNTGNSVVENPENTTPATTQPTV 1211


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_17060  PERTACTIN     32     0.010    Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 32.0 bits (72), Expect = 0.010
Identities = 26/75 (34%), Positives = 30/75 (40%), Gaps = 1/75 (1%)

Query: 377 GVHHAATMLADPSMGGPGALQPPGPPGVPGAPQPPGPPGPPAP-PAPPGSTPPPGGGVHH 435
+ A A PG P PP P PQPP PP PP P P PP G +
Sbjct: 561 SLVGAKAPPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPPAGRELSA 620

Query: 436 AATMLAGPGQVGPSA 450
AA G VG ++
Sbjct: 621 AANAAVNTGGVGLAS 635



Score = 30.1 bits (67), Expect = 0.048
Identities = 19/60 (31%), Positives = 20/60 (33%)

Query: 440 LAGPGQVGPSAPQPPGPPGMPQGGAPGPVPPNPHTPPPPAYGYPQAPTGQPTVGPGYQAV 499
L G P P P P PP P PP P P+AP QP G A
Sbjct: 562 LVGAKAPPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPPAGRELSAA 621


146  C5746_17065  C5746_17125  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
C5746_17065  -2      10       0.339775    DNA-binding response regulator
C5746_17070  -3      10       -0.381009   N-acetyltransferase
C5746_17075  -2      12       -0.565916   galactokinase
C5746_17080  -1      13       -1.009243   UDP-glucose 4-epimerase GalE
C5746_17085  0       15       -1.828014   galactose-1-phosphate uridylyltransferase
C5746_17095  1       21       0.190963    Na+/galactose cotransporter
C5746_17105  0       20       0.402750    hypothetical protein
C5746_17110  0       16       0.291161    helix-turn-helix transcriptional regulator
C5746_17115  -1      14       -0.070503   hypothetical protein
C5746_17120  -1      13       -0.173399   hypothetical protein
C5746_17125  1       16       -0.649880   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_17170  HTHFIS        47     4e-08    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 46.7 bits (111), Expect = 4e-08
Identities = 27/158 (17%), Positives = 53/158 (33%), Gaps = 27/158 (17%)

Query: 1 MVRIRVLVVDDHRIFAESLAAALAAEPDVDVAAAGSGPAALRCLERAAAEGRGYDVMLVD 60
M +LV DD L AL+ DV + R + D+++ D
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRA-GYDVRITSNAATLWRWIAAGD-----GDLVVTD 54

Query: 61 AELGILAAARGRRAPVPVPRSGENGPVDGISLVAGVRSGQPSVHTVVLAEKDDPLRAALA 120
+ + L+ ++ +P + +V++ ++ + A A
Sbjct: 55 VVMP---------------------DENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKA 93

Query: 121 LQAGASGWVAKDCSLQRLLTVIRGVLRDETHLPPALLT 158
+ GA ++ K L L+ +I L + P L
Sbjct: 94 SEKGAYDYLPKPFDLTELIGIIGRALAEPKRRPSKLED 131


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_17175  SACTRNSFRASE  35     3e-05    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 35.3 bits (81), Expect = 3e-05
Identities = 16/67 (23%), Positives = 27/67 (40%), Gaps = 3/67 (4%)

Query: 64 TWAHWLHVDQLWVDARHRGCGLGSRLLAEAERVAGADRACTRSRLETWGFQAP--DFYRK 121
W + ++ + V +R G+G+ LL +A A + C LET FY K
Sbjct: 85 NWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFC-GLMLETQDINISACHFYAK 143

Query: 122 RGYEVSG 128
+ +
Sbjct: 144 HHFIIGA 150


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_17185  NUCEPIMERASE  151    1e-45    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 151 bits (384), Expect = 1e-45
Identities = 85/334 (25%), Positives = 144/334 (43%), Gaps = 44/334 (13%)

Query: 7 KYLVTGGAGYVGSVVAQHLLEAGHAVTVLDDLSTGF-------REGVPA--GAEFIEGRI 57
KYLVTG AG++G V++ LLEAGH V +D+L+ + R + A G +F + +
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 58 QDAAK----WLDPSYDGVLHFAAYSQVGESVVDPEKYWVNNVGGSVALLAAMREAGVRTL 113
D + ++ V V S+ +P Y +N+ G + +L R ++ L
Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQHL 121

Query: 114 VFSSTAATYGEPVTTPITESDATA-PTNPYGATKLAVDHMITGEAAAHGLAAVSLRYFNV 172
+++S+++ YG P + D+ P + Y ATK A + M + +GL A LR+F V
Sbjct: 122 LYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFFTV 181

Query: 173 AGAYGSCGERHSPESHLIPLVLQVALGQRESISVYGDDYPTPDGTCVRDYIHVADLAEAH 232
G +G P+ L + G +SI VY G RD+ ++ D+AEA
Sbjct: 182 YGPWG------RPDMALFKFTKAMLEG--KSIDVYN------YGKMKRDFTYIDDIAEAI 227

Query: 233 LLALDAATEGE----------------HLVCNLGNGNGFSVREVIETVREVTGHPVPETA 276
+ D + + V N+GN + + + I+ + + G +
Sbjct: 228 IRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNM 287

Query: 277 APRRAGDPAVLVASAATARERLGWTPSRADLTGI 310
P + GD A E +G+TP G+
Sbjct: 288 LPLQPGDVLETSADTKALYEVIGFTPETTVKDGV 321


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_17205  HTHFIS        41     3e-06    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 40.6 bits (95), Expect = 3e-06
Identities = 26/135 (19%), Positives = 52/135 (38%), Gaps = 6/135 (4%)

Query: 4 RLMVVDDHRLLAEALASALKLRGHRVLAAAAPTSGAAELVVSRAPEVCLFGTAAPAEPGA 63
++V DD + L AL G+ V + + + + ++ + P E
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSN-AATLWRWIAAGDGDLVVTDVVMPDENA- 62

Query: 64 FDPIVRIRRERPQVAVVVLGPVPSPRGIAAAFAAGAAGYV----RHDERMEGVERAMVKA 119
FD + RI++ RP + V+V+ + A GA Y+ E + + RA+ +
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122

Query: 120 RAGEAAVAPQLLQGA 134
+ + + G
Sbjct: 123 KRRPSKLEDDSQDGM 137


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_17210  PERTACTIN  33  0.005  Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 32.8 bits (74), Expect = 0.005
Identities = 18/47 (38%), Positives = 19/47 (40%)

Query: 25 GGFGAPTPPPADPFGKQPVTPPAGGFGAPQTPSPAGTPQTPPQQPGP 71
G P P PA G QP P PQ P P PQ P+ P P
Sbjct: 564 GAKAPPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAP 610



Score = 30.8 bits (69), Expect = 0.019
Identities = 19/49 (38%), Positives = 21/49 (42%)

Query: 56 PSPAGTPQTPPQQPGPGYGYPQGQPPQPGYGYPQGQPPQPGYGYPQSPV 104
P+P PQ PQ P P QPPQP P+ PQP G S
Sbjct: 573 PAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPPAGRELSAA 621


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_17215  TONBPROTEIN  31  0.010  Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 31.1 bits (70), Expect = 0.010
Identities = 18/79 (22%), Positives = 24/79 (30%), Gaps = 3/79 (3%)

Query: 1 MTQPPSQQPPQGGFGAPQEPPQGSPQPPQGPPPGYGSPQTPGQAPGQAPGYGYPQQPGPY 60
M P +PPQ P P P+P P P + P P +P
Sbjct: 49 MVTPADLEPPQAV--QPPPEPVVEPEPEPEPIP-EPPKEAPVVIEKPKPKPKPKPKPVKK 105

Query: 61 NQQQPGPYAQQPGPYNQQP 79
Q+QP + P
Sbjct: 106 VQEQPKRDVKPVESRPASP 124


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_17220  TONBPROTEIN  35  6e-04  Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 35.0 bits (80), Expect = 6e-04
Identities = 20/101 (19%), Positives = 25/101 (24%), Gaps = 2/101 (1%)

Query: 3 QPPGQQPPQGGFGAPY-DPPPTPIQPPPSPPGYGSAPPPPGQAPGPYGAPGQAAGPYGAP 61
P QPP P +P P P P +P P P P P + P
Sbjct: 57 PPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKP 116

Query: 62 AQPGPYGVPQPGPYGAPPQPGPYNAPPQPGPYDAPTRPGPY 102
+ P P A A + P
Sbjct: 117 VESRP-ASPFENTAPARLTSSTATAATSKPVTSVASGPRAL 156



Score = 33.4 bits (76), Expect = 0.002
Identities = 28/120 (23%), Positives = 34/120 (28%), Gaps = 12/120 (10%)

Query: 1 MTQPPGQQPPQGGFGAPYDPPPTPIQPPPSPPGYGSAPPPPGQAPGPYGAPGQAAGPYGA 60
M P +PPQ A PP ++P P P P PP +AP P P
Sbjct: 49 MVTPADLEPPQ----AVQPPPEPVVEPEPEPE---PIPEPPKEAPVVIEKPKPKPKP--- 98

Query: 61 PAQPGPYGVPQPGPYGAPPQPGPYNAPPQPGPYDAPTRPGPYNAPTQPGSYGQPPQPQPG 120
+P P Q P A P A A T P+
Sbjct: 99 --KPKPVKKVQEQPKRDVKPVESRPASPFENTAPARLTSSTATAATSKPVTSVASGPRAL 156


147  C5746_17970  C5746_18015  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
C5746_17970  5  9  0.933418  hypothetical protein
C5746_17975  5  10  1.642543  glycosyl transferase family 2
C5746_17980  4  11  2.541763  long-chain fatty acid--CoA ligase
C5746_17985  4  11  2.596810  3-oxoacyl-ACP reductase
C5746_17990  2  11  3.186597  hypothetical protein
C5746_17995  0  10  3.145178  TetR family transcriptional regulator
C5746_18000  0  11  2.158920  hypothetical protein
C5746_18005  -1  13  1.984957  sodium:proton antiporter
C5746_18010  -1  16  1.814674  DUF4229 domain-containing protein
C5746_18015  0  16  1.059780  GNAT family N-acetyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_18080  cloacin  30  0.013  Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 29.7 bits (66), Expect = 0.013
Identities = 22/84 (26%), Positives = 30/84 (35%), Gaps = 4/84 (4%)

Query: 76 TGATGASGTNGADGATGPV--GPTGATGASGTNGADGATGPVGPTGATGASGTNGADGAT 133
+G G GA +G + GPTG G + G + P G G SG+ G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWG--GGSGSGIHWGGG 59

Query: 134 GPVGPTGATGAAGTNGADGATGAT 157
G G G +G G +
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSA 83



Score = 28.5 bits (63), Expect = 0.027
Identities = 24/92 (26%), Positives = 34/92 (36%), Gaps = 4/92 (4%)

Query: 97 TGATGASGTNGADGATGPV--GPTGATGASGTNGADGATGPVGPTGATGAAGTNGADGAT 154
+G G GA +G + GPTG G + G + P G G +G+ G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWG--GGSGSGIHWGGG 59

Query: 155 GATGPTGATGPAGPVGPSQQANSNVTSVPAGG 186
G G G +G + S V + A G
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_18100  DHBDHDRGNASE  86  4e-21  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 85.9 bits (212), Expect = 4e-21
Identities = 54/193 (27%), Positives = 83/193 (43%), Gaps = 7/193 (3%)

Query: 199 AAPLTGRTALVTGAARGIGASVASVLARDGAQVICLDIPRSADELKRTAERLGA---TAL 255
A + G+ A +TGAA+GIG +VA LA GA + +D E ++ + A A
Sbjct: 3 AKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAF 62

Query: 256 PLDITADDAADRIA---AASPDGLDILVHNAGITRDRRLANMPPDRWASVIEVNLGSVLR 312
P D+ A D I +DILV+ AG+ R + ++ + W + VN V
Sbjct: 63 PADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFN 122

Query: 313 TTDALLMSGTVNRGGRIVATASIAGIAGNNGQTNYAAGKAGIIGLVRSLAPRAAADHGVT 372
+ ++ R G IV S YA+ KA + + L A++ +
Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLG-LELAEYNIR 181

Query: 373 VNAVAPGFIETKM 385
N V+PG ET M
Sbjct: 182 CNIVSPGSTETDM 194


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_18110  HTHTETR  74  3e-18  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 73.5 bits (180), Expect = 3e-18
Identities = 31/204 (15%), Positives = 62/204 (30%), Gaps = 14/204 (6%)

Query: 1 MPRAVREQ------QMMDAAVRTFGQRGYRAASMDEIAELAGVSKPLVYLYLNSKEELFT 54
M R +++ ++D A+R F Q+G + S+ EIA+ AGV++ +Y + K +LF+
Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60

Query: 55 ACIQREAKALVAAVRSGVEPELPADRQLWAGLRAFFTHTAKNPDGW----AVLHRQARTH 110
+ + + + + + ++ +
Sbjct: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120

Query: 111 GEPFATEVMVMRDEIVAFVTGLIGAAAREAHRDPAL-PDRDVAGLAQALVGAAESL-AGW 168
GE V + + I + L D A + G L W
Sbjct: 121 GE--MAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENW 178

Query: 169 ANDTPGVSAKEAAATLMNFAWAGL 192
K+ A +
Sbjct: 179 LFAPQSFDLKKEARDYVAILLEMY 202


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_18115  OUTRMMBRANEA  25  0.049  Outer membrane protein A signature.
		>OUTRMMBRANEA#Outer membrane protein A signature.

Length = 346

Score = 24.9 bits (54), Expect = 0.049
Identities = 13/24 (54%), Positives = 15/24 (62%), Gaps = 1/24 (4%)

Query: 33 TAIALAVLLAA-PSTASAAPGPAT 55
TAIA+AV LA + A AAP T
Sbjct: 4 TAIAIAVALAGFATVAQAAPKDNT 27


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_18135  SACTRNSFRASE  30  0.004  Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 29.5 bits (66), Expect = 0.004
Identities = 11/54 (20%), Positives = 19/54 (35%), Gaps = 2/54 (3%)

Query: 95 VMVHPRHQGKGYGRDLMDAAAQAARGMEGMEAIRL-TCRGGTGVDRFYTSCGYK 147
+ V ++ KG G L+ A + A+ + L T FY +
Sbjct: 95 IAVAKDYRKKGVGTALLHKAIEWAKE-NHFCGLMLETQDINISACHFYAKHHFI 147


148  C5746_18545  C5746_18650  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
C5746_18545  0  33  -1.012124  hypothetical protein
C5746_18550  2  33  -1.167280  hypothetical protein
C5746_18555  2  36  -2.010560  adenine glycosylase
C5746_18560  1  36  -2.681161  SigE family RNA polymerase sigma factor
C5746_18565  1  24  -1.397036  hypothetical protein
C5746_18570  2  26  -1.548246  DNA-binding response regulator
C5746_18575  1  17  -1.711409  two-component sensor histidine kinase
C5746_18580  0  18  -1.190873  MFS transporter
C5746_18600  0  23  -0.211097  TetR family transcriptional regulator
C5746_18605  0  25  1.386721  peptidase
C5746_18610  -1  22  0.919315  ATP-dependent Clp protease ATP-binding subunit
C5746_18615  -2  30  1.186318  hypothetical protein
C5746_18620  -1  11  2.079648  Lsr2 family protein
C5746_18625  0  13  1.595496  amino-acid N-acetyltransferase
C5746_18630  -1  12  1.443763  CopY family transcriptional repressor
C5746_18635  0  12  0.939150  hypothetical protein
C5746_18645  -1  13  1.063339  type III pantothenate kinase
C5746_18650  0  15  1.700205  nicotinate-nucleotide diphosphorylase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_18690  cloacin  31  0.006  Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 30.8 bits (69), Expect = 0.006
Identities = 35/123 (28%), Positives = 46/123 (37%), Gaps = 18/123 (14%)

Query: 26 SGGDKKNTDDSRPGGKGPAHAITPGASPSGPAISQQPGGRDES----DGSGSGGGDDSGT 81
SGGD + + G + G G A S G E+ GSGSG G+
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGA-SDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 82 GAGSDGGASDGGNGADGTDTGSGSDTGAGGASASGATIGQQVPAGSSLPDCAAGALQLSL 141
G G+ GG + G G+ G SA A + PA L AG L +S+
Sbjct: 61 GHGNGGGNGNSGGGS----------GTGGNLSAVAAPVAFGFPA---LSTPGAGGLAVSI 107

Query: 142 STE 144
S
Sbjct: 108 SAG 110


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_18715  HTHFIS  88  2e-22  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 88.3 bits (219), Expect = 2e-22
Identities = 35/121 (28%), Positives = 55/121 (45%), Gaps = 1/121 (0%)

Query: 1 MAETHVLFVEDDDVIREATQLALERDGFVVTAMPDGLSGLDAFRADRPDIALLDVMVPGL 60
M +L +DD IR AL R G+ V + + A D+ + DV++P
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 DGVSLCRRIRDE-STVPVIMLSARADSIDVVLGLEAGADDYVTKPFDGAVLVARIRAVLR 119
+ L RI+ +PV+++SA+ + + E GA DY+ KPFD L+ I L
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 120 R 120

Sbjct: 121 E 121


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_18720  TYPE3OMBPROT  31  0.013  Type III secretion system outer membrane B protein ...
		>TYPE3OMBPROT#Type III secretion system outer membrane B protein family signature.

Length = 538

Score = 30.8 bits (69), Expect = 0.013
Identities = 24/109 (22%), Positives = 44/109 (40%), Gaps = 9/109 (8%)

Query: 186 VREAVGGVVQDETDELARAVDA-------LTDALNERIEAERRVTADIAHELRTPVTGLL 238
+R V + + RAV A ++ AL R E + + +L+ T LL
Sbjct: 223 IRHGVISAYGLKKNSSERAVAARNKAEELVSAALYSRPELLSQALSGKTVDLKIVSTSLL 282

Query: 239 TAAELLPPGRPTELVRDRAQAMRTLVEDVLEVARLDSASERAELQEIEL 287
T L G +++D+ A++ L E +L + L+E+ +
Sbjct: 283 TPTSLT--GGEESMLKDQVNALKGLNSKRGEPTKLLIRNSDGLLKEVSV 329


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_18725  TCRTETB  155  6e-44  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 155 bits (394), Expect = 6e-44
Identities = 96/410 (23%), Positives = 182/410 (44%), Gaps = 18/410 (4%)

Query: 30 VMLALMITMLLAMLDNLIVGTAMPTIVGDLGGLEH-LSWVVTAYTLATAASTPIWGKLGD 88
+++ L I ++L+ +++ ++P I D +WV TA+ L + T ++GKL D
Sbjct: 15 ILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSD 74

Query: 89 MYGRKGIFLTSIVLFLIGSVLSGMAQDMGQ-LIGFRAVQGLGAGGLMVGVMAIIGDLVPP 147
G K + L I++ GSV+ + LI R +QG GA VM ++ +P
Sbjct: 75 QLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPK 134

Query: 148 RERGKYQGMMAGVMAIAMIGGPLVGGTITDHLGWRWSFYINLPLGAVALAMVTAVLHLPK 207
RGK G++ ++A+ GP +GG I ++ WS+ + +P+ + + + + L K
Sbjct: 135 ENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIH--WSYLLLIPMITI-ITVPFLMKLLKK 191

Query: 208 RERTEAKVDYLGAGLLTLGITAIVLVTTWGGSEYDWNSAVIMELMAIGVASLVGFFFVET 267
R + D G L+++GI +L TT Y + + + V S + F
Sbjct: 192 EVRIKGHFDIKGIILMSVGIVFFMLFTT----SYSIS------FLIVSVLSFLIFVKHIR 241

Query: 268 KAAEPIIPLHIFRNRNFTLMSVVGFMSGFVMFGAVLFLPLFQQSVQGASATNSG-LLLLP 326
K +P + + +N F + + G + + G V +P + V S G +++ P
Sbjct: 242 KVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFP 301

Query: 327 MLLSMMVVSLIAGRVTTSTGKYKIFPVVGSVLMVTGLFLLSQMDTGTTRFTSGIYMAVLG 386
+S+++ I G + G + +G + S + T+ F + I + VLG
Sbjct: 302 GTMSVIIFGYIGGILVDRRGPLYVL-NIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLG 360

Query: 387 AGMGFLMQITMLVAQNSVEMKDMGVASSATTLFRTLGSSFGVAIMGALFT 436
G+ F + + +S++ ++ G S L G+AI+G L +
Sbjct: 361 -GLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLS 409


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_18730  HTHTETR  78  3e-20  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 78.1 bits (192), Expect = 3e-20
Identities = 35/100 (35%), Positives = 50/100 (50%), Gaps = 2/100 (2%)

Query: 1 MGSTPQPRRGNTRQRIQDVALELFAEQGYEKTSLREISERLDVTKAALYYHFKTKEDILV 60
M + TRQ I DVAL LF++QG TSL EI++ VT+ A+Y+HFK K D+
Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60

Query: 61 SIFDDLNRPVEELIE--WGRGQPHNLETKKEILRRYSEAL 98
I++ + EL + L +EIL E+
Sbjct: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLEST 100


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_18740  HTHFIS  33  0.006  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 32.9 bits (75), Expect = 0.006
Identities = 34/179 (18%), Positives = 58/179 (32%), Gaps = 24/179 (13%)

Query: 499 EESSRLLRMEDELHKRVIGQKDAIKAL---SQAIRRTRAGLKDPKRPGGSFIFAGPSGVG 555
R L ++ L S A++ L + + + G SG G
Sbjct: 113 GIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTG 172

Query: 556 KTELSKTLAEFLFGDEDALIALDMSEFSEKHTVSRLFGSPPGYVGYEEGGQLTEKVRRKP 615
K +++ L ++ +A++M+ S LFG E G T R
Sbjct: 173 KELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGH--------EKGAFTGAQTRST 224

Query: 616 --FS-----VVLFDEVEKAHPDIFNSLLQILEDG---RLTDSQGRVVDFKNTVIIMTTN 664
F + DE+ D LL++L+ G + D + I+ TN
Sbjct: 225 GRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVR---IVAATN 280


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_18760  SACTRNSFRASE  38  5e-06  Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 38.4 bits (89), Expect = 5e-06
Identities = 19/74 (25%), Positives = 32/74 (43%), Gaps = 5/74 (6%)

Query: 70 DARVIGCGALHVMWEDLAEVRTLAVDHSIRGAGVGHQVLDKLLQTARWLGVRRVFCLTFE 129
+ IG + W A + +AV R GVG +L K ++ A+ + T +
Sbjct: 73 ENNCIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQD 132

Query: 130 VD-----FFAKHGF 138
++ F+AKH F
Sbjct: 133 INISACHFYAKHHF 146


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_18770  PERTACTIN  32  0.001  Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 32.4 bits (73), Expect = 0.001
Identities = 22/90 (24%), Positives = 35/90 (38%)

Query: 100 PVAAALLERCQALAERSVRARLAGRSAPPTAPAPRSRLLALPGGRAADAEPERPKTPAAP 159
P + + Q + LA + R RL A G+ + + P P
Sbjct: 515 PASGNTMLLVQTPRGSAATFTLANKDGKVDIGTYRYRLAANGNGQWSLVGAKAPPAPKPA 574

Query: 160 PRPAPPPERPAPKPSEVFPPRRPVPPPQQR 189
P+P P P P+P + P +P PPQ++
Sbjct: 575 PQPGPQPGPQPPQPPQPPQPPQPPQPPQRQ 604


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_18775  PF03309  333  e-118  Bvg accessory factor
		>PF03309#Bvg accessory factor

Length = 271

Score = 333 bits (855), Expect = e-118
Identities = 137/269 (50%), Positives = 190/269 (70%), Gaps = 12/269 (4%)

Query: 1 MLLTIDVGNTHTVLGLFDGE----EIVEHWRISTDARRTADELAVLLQGLMGMHPLLGEE 56
MLL IDV NTHTV+GL G ++V+ WRI T+ TADELA+ + GL+G
Sbjct: 1 MLLAIDVRNTHTVVGLISGSGDHAKVVQQWRIRTEPEVTADELALTIDGLIGDDA----- 55

Query: 57 LGDGIEGIAICSTVPAVLHELREVTRRYYGDVPAVLVEPGIKTGVPILMDNPKEVGADRI 116
+ + G + STVP+VLHE+R + +Y+ +VP VL+EPG++TG+P+L+DNPKEVGADRI
Sbjct: 56 --ERLTGASGLSTVPSVLHEVRVMLEQYWPNVPHVLIEPGVRTGIPLLVDNPKEVGADRI 113

Query: 117 INAVAAVDLYGGPAIVVDFGTATTFDAVSARGEYTGGVIAPGIEISVEALGVKGAQLRKI 176
+N +AA YG AIVVDFG++ D VSA+GE+ GG IAPG+++S +A + A LR++
Sbjct: 114 VNCLAAYHKYGTAAIVVDFGSSICVDVVSAKGEFLGGAIAPGVQVSSDAAAARSAALRRV 173

Query: 177 ELARPRSVIGKNTVEAMQSGIIYGFAGQVDGVVARMKKELAADPD-DVTVIATGGLAPMV 235
EL RPRSVIGKNTVE MQ+G ++GFAG VDG+V R++ ++ DV V+ATG AP+V
Sbjct: 174 ELTRPRSVIGKNTVECMQAGAVFGFAGLVDGLVNRIRDDVDGFSGADVAVVATGHTAPLV 233

Query: 236 LGESSVIDEHEPWLTLIGLRLVYERNVSR 264
L + ++ ++ LTL GLRLV+ERN +
Sbjct: 234 LPDLRTVEHYDRHLTLDGLRLVFERNRAN 262


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_18780  RTXTOXIND  29  0.029  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 29.0 bits (65), Expect = 0.029
Identities = 9/33 (27%), Positives = 15/33 (45%), Gaps = 1/33 (3%)

Query: 134 HVEDGDRVAPGQKLLTVT-TRTRDLLTGERSAL 165
V++G+ V G LL +T +S+L
Sbjct: 111 IVKEGESVRKGDVLLKLTALGAEADTLKTQSSL 143


149  C5746_19165  C5746_19235  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
C5746_19165  0  17  -0.961170  glutamyl-tRNA amidotransferase
C5746_19170  0  11  -0.890752  metallophosphoesterase
C5746_19180  -2  9  -1.212481  *MFS transporter
C5746_19185  -1  9  -1.477870  glycosyltransferase
C5746_19195  -1  9  -0.283142  glycosyltransferase
C5746_19200  -2  10  -1.154235  NAD-dependent dehydratase
C5746_19205  -3  8  -0.387579  hypothetical protein
C5746_19210  0  11  -0.234830  hypothetical protein
C5746_19215  0  12  0.251055  peptidoglycan-binding protein
C5746_19220  -1  10  0.410293  ABC transporter ATP-binding protein
C5746_19225  -2  9  0.818349  ABC transporter permease
C5746_19230  -1  9  1.811835  two-component sensor histidine kinase
C5746_19235  -1  10  1.440692  DNA-binding response regulator
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_19290  FbpA_PF05833  29  0.005  Fibronectin-binding protein
		>FbpA_PF05833#Fibronectin-binding protein

Length = 577

Score = 29.5 bits (66), Expect = 0.005
Identities = 9/44 (20%), Positives = 19/44 (43%), Gaps = 1/44 (2%)

Query: 51 EVQKVIAKEAKKRREAAEAF-AQGGRTEQAEREKAEGELLDAYL 93
++QK++ + + + + E + K GELL A +
Sbjct: 303 DLQKIVMNNINRCTKKDKILNNTLKKCEDKDIFKLYGELLTANI 346


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_19310  TCRTETB  127  8e-34  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 127 bits (320), Expect = 8e-34
Identities = 83/421 (19%), Positives = 174/421 (41%), Gaps = 28/421 (6%)

Query: 17 VLLTLAAGQFLMALDSSVMNVSIATVADDVGTTVTGIQGAITAYTLVMAMFMIPGGKAGA 76
+L+ L F L+ V+NVS+ +A+D TA+ L ++ GK
Sbjct: 15 ILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSD 74

Query: 77 LIGRKRAFVIGCGIYGCGSLITALAPNLPVLLLGWSFLEGVGAALILPAIVALVAGNFAT 136
+G KR + G I GS+I + + LL+ F++G GAA ++ +VA
Sbjct: 75 QLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPK 134

Query: 137 ERRPAAYGLVAAAGAVAIALGPLIGGVATTYFSWRWVFAGEVLVVLGILVLARRIADAAS 196
E R A+GL+ + A+ +GP IGG+ Y W ++ ++ ++ + L + +
Sbjct: 135 ENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVR 194

Query: 197 DERPRIDLVGAALSALGLGIFVYGVLRSDEWGWFRPKPDAPSWLGVSLVVWLMLAGLLLV 256
+ D+ G L ++G+ F+ S + L+V ++
Sbjct: 195 IKGH-FDIKGIILMSVGIVFFMLFT-TSYSISF--------------LIVSVLSFL---- 234

Query: 257 WLFLRREARLVEQRREPLIHPSMLENKQLTGGLTMFFFQYLVQMGVFFVVPLYLSVALGL 316
+F++ ++ +P + P + +N G+ + G +VP + L
Sbjct: 235 -IFVKHIRKV----TDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQL 289

Query: 317 SALTTGAR-LLPLSITLLAAAVLIPRFFPDVSPRRVVRLGVLALLSGAVALMAALDADAG 375
S G+ + P +++++ + P V+ +GV L L A+ +
Sbjct: 290 STAEIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVS--FLTASFLLETT 347

Query: 376 AEIVTIPLLLIGLGMGSLASQLGSVTVSAVPEQQSAEVGGVQNAVTNLGASLGTALAGSI 435
+ +TI ++ + G+ + + ++ S++ +Q++ + N + L G A+ G +
Sbjct: 348 SWFMTIIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGL 407

Query: 436 M 436
+
Sbjct: 408 L 408


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_19330  NUCEPIMERASE  187  6e-59  Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 187 bits (477), Expect = 6e-59
Identities = 89/366 (24%), Positives = 138/366 (37%), Gaps = 64/366 (17%)

Query: 1 MRVLVTGGAGFIGSHIVKTLISRGHEPVVLDSL-------LPTAHRSVTGPPELGGAAWV 53
M+ LVTG AGFIG H+ K L+ GH+ V +D+L L A + P G +
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQP---GFQFH 57

Query: 54 RGDVRDRDVVRQ--ALSGMDAVCHQAAMVGLGKDFADAPDYVGCNDLGTAVLLAAMAETG 111
+ D+ DR+ + A + V + + + Y N G +L
Sbjct: 58 KIDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNK 117

Query: 112 VERLVLAGSMVVYGEGRYECPSHGWVRPGPRAEQDLVSGRFEPLCPQCGAELSPGLVGED 171
++ L+ A S VYG R P + +D
Sbjct: 118 IQHLLYASSSSVYGLNRKM----------PFST-------------------------DD 142

Query: 172 APADPRNVYATTKLAQEHLAAAWARTTGGRAVSLRYHNVYGPRMPRDTPYAGVASLFRSA 231
+ P ++YA TK A E +A ++ G A LR+ VYGP D F A
Sbjct: 143 SVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFFTVYGPWGRPDMAL----FKFTKA 198

Query: 232 LARGEAPRVFEDGGQRRDFVHVRDVAAANAVALEAI-------DVRVPGALTS------Y 278
+ G++ V+ G +RDF ++ D+A A + I V S Y
Sbjct: 199 MLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIPHADTQWTVETGTPAASIAPYRVY 258

Query: 279 NTGSGDPHTVGEMAAALAAGHGGPSPVVTGEFRLGDVRHITADSSRLKSELGWRPEVGFA 338
N G+ P + + AL G + + GDV +AD+ L +G+ PE
Sbjct: 259 NIGNSSPVELMDYIQALEDALGIEAKKNMLPLQPGDVLETSADTKALYEVIGFTPETTVK 318

Query: 339 EGMAEF 344
+G+ F
Sbjct: 319 DGVKNF 324


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_19335  PF07299  41  4e-07  Fibronectin-binding protein (FBP)
		>PF07299#Fibronectin-binding protein (FBP)

Length = 219

Score = 41.4 bits (97), Expect = 4e-07
Identities = 28/120 (23%), Positives = 50/120 (41%), Gaps = 16/120 (13%)

Query: 2 EPLSEKQIRSSFVNCTKGEAARLRLPLDFDELPWDDLDFLGWVDPGAPLRAHLVVSRAQG 61
+ ++ + ++ F +A +L+LP D +EL +L +L W+D G+ R ++ +
Sbjct: 92 QEVTAQTLKKLF-----PKAKKLKLP-DMEELDMKELSYLSWIDKGSS-RKFIIAKNDKN 144

Query: 62 PL-GISLRIPSVRRTSAVKSSMCQICLTGHASSGVTLLAAPLAGARGREGNTVGIYICAD 120
G+ S K S+C +C H V + + G G YIC D
Sbjct: 145 KFVGLQGTF-----QSLNKKSICSLC---HGHEEVGMFLVEIKGDIPGTFVKKGNYICKD 196


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_19340  FLGMOTORFLIG  28  0.021  Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 28.2 bits (63), Expect = 0.021
Identities = 13/41 (31%), Positives = 19/41 (46%)

Query: 91 NSKWKRDPKALKAQEKCASLSLPIPESVLKEQRPELSEEEI 131
K D AL ++K A L + I + + LS+EEI
Sbjct: 5 KEKEILDVSALTGKQKAAILLVSIGSEISSKVFKYLSQEEI 45


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_19360  PF06580  35  7e-04  Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 34.8 bits (80), Expect = 7e-04
Identities = 16/106 (15%), Positives = 34/106 (32%), Gaps = 21/106 (19%)

Query: 431 LRQVVGNLVVNAVRVTAPGGTVNLALVRDGDLAVIQVRDTGKGIPPEDLPHLFDRFWRAD 490
++ +V N + + + GG + L +D ++V +TG
Sbjct: 260 VQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNT------------ 307

Query: 491 AARGRATGGSGLGLSIAR---QIVTDHRGRIDVESTVGVGTTFSVV 533
+G GL R Q++ +I + G ++
Sbjct: 308 ------KESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLI 347


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_19365  HTHFIS  81  8e-20  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 80.6 bits (199), Expect = 8e-20
Identities = 40/164 (24%), Positives = 76/164 (46%), Gaps = 10/164 (6%)

Query: 3 ARILVAEDDVKQARLIGIYLEREGNEVQIVADGRSAIDRARSSRPDLIVLDVMMPNVDGL 62
A ILVA+DD ++ L R G +V+I ++ + + DL+V DV+MP+ +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 63 DVCRILRAE-SQVPILLLTARTTEEDMLLGLDLGADDYMAKPYSPRELTARV-RALLRRS 120
D+ ++ +P+L+++A+ T + + GA DY+ KP+ EL + RAL
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 121 KAAGRTGPAVLTAGDLEIDTARFE------VRVAG--VPVVLTG 156
+ + L +A + R+ + +++TG
Sbjct: 124 RRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITG 167


150  C5746_19330  C5746_19390  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
C5746_19330  -1  16  1.267246  glutamine amidotransferase
C5746_19335  -2  13  1.295193  MarR family transcriptional regulator
C5746_19340  0  14  1.276794  serine/threonine protein kinase
C5746_19345  -1  12  1.209423  hypothetical protein
C5746_19350  0  12  1.191232  transcriptional regulator
C5746_19355  0  13  0.886800  DUF397 domain-containing protein
C5746_19360  -2  12  0.434134  hypothetical protein
C5746_19365  2  12  0.693991  hypothetical protein
C5746_19370  1  11  1.085540  translation initiation factor IF-2
C5746_19375  1  13  1.797955  DNA polymerase III subunit beta
C5746_19380  1  14  1.333082  PucR family transcriptional regulator
C5746_19385  1  14  1.335534  TetR family transcriptional regulator
C5746_19390  1  13  0.408897  NmrA family transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_19455  AEROLYSIN  28  0.028  Aerolysin signature.
		>AEROLYSIN#Aerolysin signature.

Length = 493

Score = 28.1 bits (62), Expect = 0.028
Identities = 12/34 (35%), Positives = 19/34 (55%)

Query: 157 PTEPVAFAREVFARLGVYEGEKLDAWYRLFHDSD 190
PT PV + L + +G+++D +RL HDS
Sbjct: 100 PTNPVTGEIPTLSALDIPDGDEVDVQWRLVHDSA 133


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_19465  PF03544  38  9e-05  Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 37.6 bits (87), Expect = 9e-05
Identities = 21/111 (18%), Positives = 29/111 (26%), Gaps = 2/111 (1%)

Query: 314 PAAPTALQDAAPPATKPPAQQPPGGVQAMPAVTTATPPPSQAPAPAPTPTPTPAYGYPSM 373
P Q + P +PP VQ P P P P P P P
Sbjct: 41 IELPAPAQPISVTMVAPADLEPPQAVQPPPEPV-VEPEPEPEPIPEPPKEAPVVIEKPKP 99

Query: 374 PAQPHPGQGQPYGYGYP-AAPQQQAPQPPGSGPTPSYGPTYPTPVPQPEPA 423
+P P + P + P P P+ + +P
Sbjct: 100 KPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPV 150


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_19485  INTIMIN  30  0.036  Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 30.0 bits (67), Expect = 0.036
Identities = 50/290 (17%), Positives = 79/290 (27%), Gaps = 28/290 (9%)

Query: 288 YVYKQDTTKPVRTYDFPNTGNSSGADTLEDAGLAWAPDTSRLFAVTVNSYGVRSLRVLTD 347
YV V + GNSS L +TV S G V
Sbjct: 517 YVQGGSNVYKVTARAYDRNGNSSNNVLL---------------TITVLSNG---QVVDQV 558

Query: 348 AVKSATTVTVNAPAKSERGKKLTVTGRVKSEGAFPVGAKATVTRTDIESPKGKTLPAVTL 407
V T +A A + +T T VK G + L A +
Sbjct: 559 GVTDFTADKTSAKADGT--EAITYTATVKKNGVAQANVPVSFNIVS----GTAVLSANSA 612

Query: 408 KSDGSFSFTDTPTAG--GQVTYKVSYAGDVAHSAASGSDKVAVSRAATSLTLNRNKALYS 465
++GS T T + GQV A + A+ V A+ + +K
Sbjct: 613 NTNGSGKATVTLKSDKPGQVVVSAKTAEMTSALNANAVIFV-DQTKASITEIKADKTTAV 671

Query: 466 YGTDVSFTAHLGTTYKNRTVELWVDPFGPDKPKKLVKTGKVNAKGNLSTTV-DMTRDTTV 524
+ T + ++ V F K T K + G T+ T ++
Sbjct: 672 ANGQDAITYTVKVMKGDKPVSNQEVTFTTTLGKLSNSTEKTDTNGYAKVTLTSTTPGKSL 731

Query: 525 TAVFKGDGRYASKTVTSTAYAKVRVSTSVSKQYKTAKIGSTPYAYFHKKT 574
+ D K + + + + T G P +
Sbjct: 732 VSARVSDVAVDVKAPEVEFFTTLTIDDGNIEIVGTGVKGKLPTVWLQYGQ 781


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_19490  HTHTETR  27  0.033  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 27.3 bits (60), Expect = 0.033
Identities = 14/83 (16%), Positives = 36/83 (43%)

Query: 1 MRPRSSYKAPPSQSLIYSAVLQSMAEHGPRRMNMALAARIADTDRQLLYRNWPDRNVLVR 60
M ++ +A ++ I L+ ++ G ++ A+ A R +Y ++ D++ L
Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60

Query: 61 ESAESELQRVLFVASDLRSEQEG 83
E E + + + +++ G
Sbjct: 61 EIWELSESNIGELELEYQAKFPG 83


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_19515  HTHTETR  67  7e-16  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 66.6 bits (162), Expect = 7e-16
Identities = 25/93 (26%), Positives = 46/93 (49%), Gaps = 2/93 (2%)

Query: 4 RRPRAKAADKRQRLMAAAARVLHEQGVERTTIADIAQAADVPAGNVYYYFKTKGELVEAA 63
R+ + +A + RQ ++ A R+ +QGV T++ +IA+AA V G +Y++FK K +L
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 64 LSEHTRHLEALTGRL--DQLTDPRERLKGLVDA 94
++ L DP L+ ++
Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREILIH 95


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_19520  NUCEPIMERASE  29  0.014  Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 29.4 bits (66), Expect = 0.014
Identities = 29/126 (23%), Positives = 46/126 (36%), Gaps = 35/126 (27%)

Query: 2 IVVTGATGNVGRALVRMLTDAGVPVTAV-------------ARHITDADMPPGVRATAAD 48
+VTGA G +G + + L +AG V + AR A PG + D
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQ--PGFQFHKID 60

Query: 49 LAGPAGLRVAFD--GAEALFLLVAG-------DDPHG-----------ILEVAKTAGVRK 88
LA G+ F E +F+ ++PH ILE + ++
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120

Query: 89 VVLLSS 94
++ SS
Sbjct: 121 LLYASS 126


151  C5746_19920  C5746_19960  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
C5746_19920  -1  11  3.358556  MFS transporter
C5746_19925  -1  10  2.996515  TetR family transcriptional regulator
C5746_19930  -3  10  1.889744  DUF1876 domain-containing protein
C5746_19935  -4  12  1.284077  GNAT family N-acetyltransferase
C5746_19940  -3  12  0.757301  alpha/beta hydrolase
C5746_19945  -2  16  0.526967  peroxiredoxin
C5746_19950  -3  18  0.295014  hypothetical protein
C5746_19960  -1  10  0.404826  flotillin
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_20045  TCRTETB  150  2e-42  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 150 bits (381), Expect = 2e-42
Identities = 87/410 (21%), Positives = 160/410 (39%), Gaps = 18/410 (4%)

Query: 24 RRWTMLALICAAQFMLVLDVTVVNVALPDMAVDLDLGRTALTWVVTAYTLCFAGLMLLGG 83
R +L +C F VL+ V+NV+LPD+A D + + WV TA+ L F+ + G
Sbjct: 11 RHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYG 70

Query: 84 RLADVLGARRTLLAGLVVFTAASLVCGLAGSGAT-LIGGRIAQGVGAALLSPAALSLVTT 142
+L+D LG +R LL G+++ S++ + S + LI R QG GAA + +V
Sbjct: 71 KLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVAR 130

Query: 143 SFHGKERNRALGVWAAIGGTGSAIGVLAGGALTSGPGWQWVFYVNVPVGIALLVAVPAFV 202
+ R +A G+ +I G +G GG + W ++ +P + ++ VP +
Sbjct: 131 YIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLL--LIP--MITIITVPFLM 186

Query: 203 P-PRPERPSAGRLDVAGALLATAGTGSLVYGLVTAGDFGWSAAWTLVPVVGAAALYAAFA 261
+ E G D+ G +L + G + T+ + L ++ F
Sbjct: 187 KLLKKEVRIKGHFDIKGIILMSVGIVFFMLF-TTSYSISFLIVSVLSFLI--------FV 237

Query: 262 AVERVAREPLMDLRMFTRRPVLAGAFLMLFATALLIAFFFLGSVFLQHGRGFGPMRTGLV 321
R +P +D + P + G + F + ++ G V
Sbjct: 238 KHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSV 297

Query: 322 FL-PVAVAVGIGAHIGSRLVGTVGSRATAVGAMVIAAGGCLPLTRVGADSSVYGGLLPGL 380
+ P ++V I +IG LV G + + L + + +S + + +
Sbjct: 298 IIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWF--MTIII 355

Query: 381 AVAAFGLGAVFVTATATALGMVAHEEAGLASGVVNTFHEVGGSIGVAVGS 430
GL + + +EAG ++N + G+A+
Sbjct: 356 VFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVG 405


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
C5746_20050  HTHTETR  62  6e-14  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 62.0 bits (150), Expect = 6e-14
Identities = 31/177 (17%), Positives = 52/177 (29%), Gaps = 15/177 (8%)

Query: 5 PPPAGPRAEAKRQAIVKAAREAFLREGF-GVGMDAIAAEAGVSKVTVYNHFGSKEALFTA 63
A+ RQ I+ A F ++G + IA AGV++ +Y HF K LF+
Sbjct: 2 ARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61

Query: 64 VITGALDEPLGGPSTKALAGLPDAADLRTAFLNTARAWVSAVRGNADVIALRNLVAAELH 123
+ + D R + V + R L+ +
Sbjct: 62 IWELSESNIGELELEYQAKFPGDPLS-------VLREILIHVLESTVTEERRRLLMEIIF 114

Query: 124 RFPELAGAW------QHHGPAGHHPAVADALRSLAERGRLDIP-DLETAIIQLYALL 173
E G Q + + + L+ E L A I + +
Sbjct: 115 HKCEFVGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYI 171


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_20060  SACTRNSFRASE  49     5e-10    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 48.8 bits (116), Expect = 5e-10
Identities = 19/75 (25%), Positives = 34/75 (45%), Gaps = 3/75 (4%)

Query: 74 RARIEDVVVDTEARGQGIAALLTQEALTLAREAGARTVDLTSRPDRAAANRLYERLGFR- 132
A IED+ V + R +G+ L +A+ A+E + L ++ +A Y + F
Sbjct: 89 YALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFII 148

Query: 133 -ARESTVYR-FPMDG 145
A ++ +Y FP
Sbjct: 149 GAVDTMLYSNFPTAN 163


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
C5746_20080  IGASERPTASE  46     2e-07    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 46.2 bits (109), Expect = 2e-07
Identities = 40/253 (15%), Positives = 81/253 (32%), Gaps = 13/253 (5%)

Query: 206 AARAKQEADIAEAIAKRASE---QARLKAAEEIAIAERTFYLKQAEI---KAETEAAAAK 259
A +KQE+ E + A+E Q R A E A + + E+ +ET+
Sbjct: 1041 AENSKQESKTVEKNEQDATETTAQNREVAKE--AKSNVKANTQTNEVAQSGSETKETQTT 1098

Query: 260 ANAAGPLAEAARQQEVLAEQEKVAQRQAALTDRELDTKVRKPADAARYQAEQEAEARRIA 319
E + +V E+ + + + ++ K + + + QAE E
Sbjct: 1099 ETKETATVEKEEKAKVETEKTQEVPKVTS----QVSPKQEQS-ETVQPQAEPARENDPTV 1153

Query: 320 QVKEAEADAERSRLTGQGEKLHRSALADAVRIEGEAEAAAIAAKGAAEAEAMQKKADAFA 379
+KE ++ + T Q K S + V + + +
Sbjct: 1154 NIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNS 1213

Query: 380 QYGDAAVLQMLVEVLPHVVAKASEPLSAIDKMTVISTDGASQLSRTVTDNVAQGMELLSS 439
+ + + V S+ D+ TV D S + V + + ++
Sbjct: 1214 ESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTNAVLSDARAKAQFVAL 1273

Query: 440 TTGVDLAALLKNL 452
G ++ + L
Sbjct: 1274 NVGKAVSQHISQL 1286



Score = 38.9 bits (90), Expect = 5e-05
Identities = 44/311 (14%), Positives = 97/311 (31%), Gaps = 26/311 (8%)

Query: 175 SLSGQGLILDAFQIQDITTEGSYLEDLGRPEAARAKQEAD---IAEAIAKRASEQARLKA 231
SL G + L A++ + G Y DL PE + Q D I +A +
Sbjct: 956 SLVGNTVDLGAWKYKLRNVNGRY--DLYNPEVEKRNQTVDTTNITTPNNIQADVPSVPSN 1013

Query: 232 AEEIAIAERTFYLKQAEIKAETEAAAAKANAAGPLAEAARQQEVL-------AEQEKVAQ 284
EEIA + E A A + +AE ++Q+ A +
Sbjct: 1014 NEEIA--------RVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQN 1065

Query: 285 RQAALTDRELDTKVRKPADAARYQAEQEAEARRIAQVKEAEADAERSRLTGQGEKLHRSA 344
R+ A + + + A+ +E + + E++++ + +
Sbjct: 1066 REVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKV 1125

Query: 345 LADAVRIEGEAEAAAIAAKGAAEAEAMQKKADAFAQYGDAAVLQMLVEVLPHVVAKASEP 404
+ + ++E A+ A E + + +Q A + + V +
Sbjct: 1126 TSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTE 1185

Query: 405 LSAIDKMTVISTDGASQLSRTVTDNVAQGMELLSSTTG-VDLAALLKNLKGGAASATSAD 463
+ ++ + + + T V + ++ N++ S+
Sbjct: 1186 STTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSS---- 1241

Query: 464 LPDAAPAASAN 474
D + A +
Sbjct: 1242 -NDRSTVALCD 1251


152  C5746_20110  C5746_20130  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_20110  -1      13       2.263749   hypothetical protein
C5746_20115  3       12       0.366111   two-component sensor histidine kinase
C5746_20120  1       12       1.689697   DNA-binding response regulator
C5746_20125  0       11       2.358663   methionine--tRNA ligase
C5746_20130  0       11       2.145092   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
C5746_20225  IGASERPTASE  80     1e-17    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 80.5 bits (198), Expect = 1e-17
Identities = 55/294 (18%), Positives = 84/294 (28%), Gaps = 20/294 (6%)

Query: 153 AAEVEAPAEPQPAEPVVAPTAEAEPQPAPVVAETEPEPVAAPAEAEPEPAAVAAEAEVEA 212
+ P Q P + P +A + PV PA A P +
Sbjct: 994 TTNITTPNNIQADVP-------SVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQ 1046

Query: 213 PSETEPKAETAAVEAEPAAELEPVEAEAPTEAEP---EPVAAPADEAEAPAEPKAEPAT- 268
S+T K E A E EA++ +A E + ++ E E AT
Sbjct: 1047 ESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATV 1106

Query: 269 -AEVKAPAETEPDAEPAAVEAEPEP-----AAELEPVEAEAPAEAEVKAPTETEPDAEPA 322
E KA ETE E V ++ P E + V A
Sbjct: 1107 EKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTA 1166

Query: 323 AVE--AEPEPAAELEPVEVEAPAEAEPEPVAAPADEAEAPAEPKAEPATAEVKAPTEPQP 380
E A+ + +PV V P + A +P ++ +
Sbjct: 1167 DTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRS 1226

Query: 381 VAAEAEVKAPAETEPKAEIATGTAPEATPTSEPALPLARVKSHAPGLVDSYKAA 434
V + PA T + + L AR K+ ++ KA
Sbjct: 1227 VRSVPHNVEPATTSSNDRSTVALCDLTSTNTNAVLSDARAKAQFV-ALNVGKAV 1279



Score = 77.8 bits (191), Expect = 7e-17
Identities = 52/224 (23%), Positives = 79/224 (35%), Gaps = 13/224 (5%)

Query: 201 PAAVAAEAEVEAPSETEPKAETAAVEAEPAAELEPVEAEAPTEAEPEPVAAPADEAEAPA 260
P V+ + T P A V + P+ E + P P A P++ E A
Sbjct: 983 PEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAP-ATPSETTETVA 1041

Query: 261 E-PKAEPATAEVKA--PAETEPDAEPAAVEAEPEPAAELEPVE-AEAPAEAEVKAPTETE 316
E K E T E ET A EA+ A + E A++ +E + TET+
Sbjct: 1042 ENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETK 1101

Query: 317 PDAEPAAVEAEPEPAAELEPVEVEAPAEAEPEPVAAPADEAEAPAEPKAEPATAE-VKAP 375
E A VE E + E E + ++ P ++ + AEP E +K P
Sbjct: 1102 ---ETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEP 1158

Query: 376 TEPQPVAAEAEVKAPAETEPKA--EIATGTAPEATPTSEPALPL 417
A+ + PA+ + T + T S P
Sbjct: 1159 QSQT--NTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPE 1200



Score = 73.9 bits (181), Expect = 1e-15
Identities = 58/306 (18%), Positives = 89/306 (29%), Gaps = 31/306 (10%)

Query: 96 TVPAQGSAPRPKPKPAEPEAEVAEPVAAKPDEVEVEPVATEPEAAPTAEVKPEAAPAAAE 155
TV + P A+ DE V P A + T E E + ++
Sbjct: 991 TVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSE-TTETVAENSKQESK 1049

Query: 156 VEAPAEPQPAEPVVAPTAEAEPQPAPVVAETEPEPVAAPAEAEPEPAAVAAEAEVEAPSE 215
E E Q V E + A E A +E + +E
Sbjct: 1050 TVEKNEQDATET--------TAQNREVAKEAKSNVKANTQTNEV--AQSGSETKETQTTE 1099

Query: 216 TEPKAETAAVEAEPAAELEPVE-AEAPTEAEPEPVAAPADEAEAPAEPKAEPAT------ 268
T+ ETA VE E A++E + E P P E +P+AEPA
Sbjct: 1100 TK---ETATVEKEEKAKVETEKTQEVPKVTSQVS---PKQEQSETVQPQAEPARENDPTV 1153

Query: 269 --AEVKAPAETEPDAEPAAVEAEPEPAAELEPVEAEAPAEAEVKAPTETEPD-AEPAAVE 325
E ++ T D E A E + + V+ P T P +P
Sbjct: 1154 NIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNS 1213

Query: 326 AEPEPAAELEPVEVEA-PAEAEPEPVAAPADEAEAPAE---PKAEPATAEVKAPTEPQPV 381
V + P EP ++ A + ++ +A + +
Sbjct: 1214 ESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTNAVLSDARAKAQFVAL 1273

Query: 382 AAEAEV 387
V
Sbjct: 1274 NVGKAV 1279



Score = 65.5 bits (159), Expect = 5e-13
Identities = 55/253 (21%), Positives = 83/253 (32%), Gaps = 20/253 (7%)

Query: 82 SDLVAAAFDKASIATVPAQGSAPRPKPKPAEPEAEVAEPVAAKPDEVEVEPVATEPEAAP 141
+D+ + + IA V AP P P PA P +E E VA + E +A
Sbjct: 1005 ADVPSVPSNNEEIARVD---EAPVPPPAPATP-SETTETVAENSKQESKTVEKNEQDATE 1060

Query: 142 TAEVKPEAAPAAAEVEAPAEPQPAE--PVVAPTAEAEPQPAPVVAETEPEPVA-APAEAE 198
T E A A A Q E + T E + A E E A E
Sbjct: 1061 TTAQNREVAKEAKS-NVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKT 1119

Query: 199 PEPAAVAAEA--EVEAPSETEPKAETAAVEAEPAAELEPVEAEAPTEAEPEPVAAPADEA 256
E V ++ + E +P+AE A E +P ++ +++ T A+ E PA E
Sbjct: 1120 QEVPKVTSQVSPKQEQSETVQPQAEPAR-ENDPTVNIKEPQSQTNTTADTE---QPAKET 1175

Query: 257 EAPAEPKAEPATAEVKAPAETEPDAEPAAVEAEPEPAAELEPVEAE------APAEAEVK 310
+ E +T + E +P +E V+
Sbjct: 1176 SSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVE 1235

Query: 311 APTETEPDAEPAA 323
T + D A
Sbjct: 1236 PATTSSNDRSTVA 1248



Score = 61.2 bits (148), Expect = 1e-11
Identities = 44/270 (16%), Positives = 82/270 (30%), Gaps = 24/270 (8%)

Query: 16 AEQDAPTATVPPQTERTAPAESEPVTVTVNESAAETASVRETASATKASVPAPASHRSAT 75
+ D P+ VP E A + PV + +ET S ++ + + AT
Sbjct: 1003 IQADVPS--VPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKT-VEKNEQDAT 1059

Query: 76 GDSGLASDLVAAAFDKASIATVPAQGSAPRPKPKPAEPEAEVAEPVAAKPDEVEVEPVAT 135
+ ++ A T + + + K + K ++ +VE T
Sbjct: 1060 ETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKT 1119

Query: 136 EPEAAPTAEVKPEAAPAAAEVEAPAEPQPAEPVVAPTAEAEPQPAPVVAETEPEPVAAPA 195
+ T++V P+ Q V P AE + P V E P
Sbjct: 1120 QEVPKVTSQVSPK--------------QEQSETVQPQAEPARENDPTVNIKE------PQ 1159

Query: 196 EAEPEPAAVAAEAEVEAPSETEPKAETAAVEAEPAAELEPVEAEAPTEAEPEPVAAPADE 255
A A+ + + +P E+ V + P E P +P + +++
Sbjct: 1160 SQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENP-ENTTPATTQPTVNSESSNK 1218

Query: 256 AEAPAEPKAEPATAEVKAPAETEPDAEPAA 285
+ V+ + D A
Sbjct: 1219 PKNRHRRSVRSVPHNVEPATTSSNDRSTVA 1248


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_20230  PF06580  34     0.001    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 33.7 bits (77), Expect = 0.001
Identities = 21/109 (19%), Positives = 37/109 (33%), Gaps = 19/109 (17%)

Query: 274 RLLLAQLVANLLANAVTYNVPDGTVEVSLVTAGDGVLLEVRNTGPVVDAADIPGLFEPFR 333
+L+ LV N + + + G + + V LEV NTG +
Sbjct: 257 PMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSL-------------- 302

Query: 334 RGEGKDRMGRGSGLGLSIVRS-IAVAHGG-TVTAVPGPEGGLAVTVRLP 380
+G GL VR + + +G + +G + V +P
Sbjct: 303 ---ALKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
C5746_20235  HTHFIS  78     5e-19    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 78.3 bits (193), Expect = 5e-19
Identities = 36/148 (24%), Positives = 66/148 (44%), Gaps = 1/148 (0%)

Query: 10 RVLVAEDEEILAELVATGLRRAGFAVDTVYSGDAALAYLGLHDYDVVVLDRDLPRVHGDD 69
+LVA+D+ + ++ L RAG+ V + ++ D D+VV D +P + D
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 70 VARGLVASGSRTRILMLTAAGSMEDRVAGLDLGADDYLGKPFEFPELVSRV-RALRRRSA 128
+ + + +L+++A + + + GA DYL KPF+ EL+ + RAL
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 129 RPVPPQLERHGIRLDTVRRTASREGREL 156
RP + + R A +E +
Sbjct: 125 RPSKLEDDSQDGMPLVGRSAAMQEIYRV 152


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_20245  PF06776  31     0.018    Invasion associated locus B
		>PF06776#Invasion associated locus B

Length = 214

Score = 30.7 bits (69), Expect = 0.018
Identities = 14/36 (38%), Positives = 18/36 (50%)

Query: 2 RQVSRALTAALAAMALSAGAPGPAHAQPAAESIPGA 37
R +R + A A+ALS G A AQ A S+ G
Sbjct: 47 RNGARLMLAGAMAIALSFGWSDRADAQGAVRSVHGD 82


153  C5746_20225  C5746_20255  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_20225  2       13       2.332461   AfsR family transcriptional regulator
C5746_20230  -1      13       1.385519   hypothetical protein
C5746_20235  0       11       0.614480   alkyl hydroperoxide reductase
C5746_20240  0       12       0.305104   hypothetical protein
C5746_20245  -2      13       0.661934   carbon-nitrogen hydrolase
C5746_20250  0       19       -1.599517  hypothetical protein
C5746_20255  2       26       -3.550766  MFS transporter
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_20340  cloacin  32     0.018    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 31.6 bits (71), Expect = 0.018
Identities = 24/46 (52%), Positives = 26/46 (56%), Gaps = 9/46 (19%)

Query: 279 NGSGSGTGIERGHGGGSGHGNGTGIANSNGA-------LSVARPVA 317
G GSG+GI GGGSGHGNG G NS G +VA PVA
Sbjct: 46 WGGGSGSGI--HWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVA 89


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_20345  PF01540  31     0.005    Adhesin lipoprotein
		>PF01540#Adhesin lipoprotein

Length = 475

Score = 30.9 bits (69), Expect = 0.005
Identities = 16/47 (34%), Positives = 23/47 (48%)

Query: 9 TILAAATTAVLALALTACGGDDSGTKSAGPASDAAAAAASTDATDAK 55
T+ A TAVL +A +C D K+ +DAA A+ A + K
Sbjct: 10 TLCGIAATAVLPIATISCNDDKLAEKNGKEKADAALKQANALAEELK 56


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_20355  PF05272  28     0.023    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 27.7 bits (61), Expect = 0.023
Identities = 20/142 (14%), Positives = 39/142 (27%), Gaps = 23/142 (16%)

Query: 9 LFAVGDDYWIEDADGRKVFLVDGKAMRVRDTFELKDAQGRILVEIRQKLLSLRDTMLIER 68
L+ G+ Y+ D F + + V + + + R+ +
Sbjct: 736 LYLAGERYFPSPEDEEIYFRPEQELRLVETGVQGR----LWALLTREGAPAAEGA----- 786

Query: 69 DGEQLARIKRKRLSLLRNHYRVTLVDGTELDVS-------GKILDREFAIDYDGELLAQI 121
Q + + LV D G++ D ++
Sbjct: 787 --AQKGYSVNTTFVTIAD-----LVQALGADPGKSSPMLEGQVRDWLNENGWEYLRETSG 839

Query: 122 SRRWLTVRDTYGIDVVREDADA 143
RR +R V+ ED +A
Sbjct: 840 QRRRGYMRPQVWPPVIAEDKEA 861


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_20370  TCRTETA  42     3e-06    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 42.1 bits (99), Expect = 3e-06
Identities = 63/344 (18%), Positives = 104/344 (30%), Gaps = 33/344 (9%)

Query: 53 FHVNASALSTFSILQLLVYAGMQIPV----GLMVDRLGTKKVLTLGAVLFTLGQLGFALS 108
+ + + IL L +YA MQ G + DR G + VL + + A +
Sbjct: 35 LVHSNDVTAHYGIL-LALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATA 93

Query: 109 PSYGMALASRALLGCGDAMTFISVLRLGARWFPARRGPLIGQVAALFGMAGNLVSTLFIA 168
P + R + G A T A + A FG +A
Sbjct: 94 PFLWVLYIGRIVAGITGA-TGAVAGAYIADITD------GDERARHFGFMSACFGFGMVA 146

Query: 169 RALHG-----FGWTTTFVGTSAAGVLVLVPLLLFLKDHPEGHEPPPVEHAGAAYVRKQIA 223
+ G F F +A L + L + +G P A + A
Sbjct: 147 GPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWA 206

Query: 224 ASWREPGTRLGMWVHFTTQFPAMVFLLLWGLPFLVEAQGLSRSTAGELLTLVVLSNMAFG 283
M V F Q V LW + F + +T G L + +
Sbjct: 207 RGMT--VVAALMAVFFIMQLVGQVPAALWVI-FGEDRFHWDATTIGISLAAFGILHSLAQ 263

Query: 284 LVYGQIIARHHEARAPLALG-TVTVTALLWASTIFYPGDRAPMWLLIVLCVVLGVCGPAS 342
+ +A R L LG T + + R M I++ + G G +
Sbjct: 264 AMITGPVAARLGERRALMLGMIADGTGYILLAFA----TRGWMAFPIMVLLASGGIGMPA 319

Query: 343 MIGFDFARPANPPERQGTASG----IVNMGGFIASM--TTLFAV 380
+ + ERQG G + ++ + + T ++A
Sbjct: 320 LQAMLSRQVDE--ERQGQLQGSLAALTSLTSIVGPLLFTAIYAA 361


154  C5746_20300  C5746_20340  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_20300  -1      16       -2.033698  DNA-binding protein
C5746_20305  -2      15       -1.471135  branched-chain alpha-keto acid dehydrogenase
C5746_20310  0       16       -1.928129  alpha-ketoacid dehydrogenase subunit beta
C5746_20315  -2      11       0.371513   pyruvate dehydrogenase (acetyl-transferring) E1
C5746_20320  -2      10       0.605074   DNA-binding response regulator
C5746_20325  0       9        1.744347   hypothetical protein
C5746_20330  0       10       1.844494   aminoglycoside phosphotransferase
C5746_20335  -1      10       2.035846   serine/threonine protein kinase
C5746_20340  -1      9        1.800464   serine/threonine protein kinase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_20405  SHAPEPROTEIN  29     0.022    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 29.0 bits (65), Expect = 0.022
Identities = 27/108 (25%), Positives = 44/108 (40%), Gaps = 12/108 (11%)

Query: 11 STVLGRRLGGELQSLRVAAGKTQRQAAEVLSATPTKVVKMESGWVPMRDPDIAALCTLYG 70
S V R+ A G A ++L TP + + PM+D IA +
Sbjct: 37 SVVAIRQDRAGSPKSVAAVGH---DAKQMLGRTPGNIAAIR----PMKDGVIAD----FF 85

Query: 71 VSEPRILAHLLELAKTDRERRKVKGWWNETPTLATQVEYIAMEDAAVK 118
V+E ++L H ++ ++ R P ATQVE A+ ++A
Sbjct: 86 VTE-KMLQHFIKQVHSNSFMRPSPRVLVCVPVGATQVERRAIRESAQG 132


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_20415  PF05616  30     0.018    Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 30.5 bits (68), Expect = 0.018
Identities = 23/50 (46%), Positives = 26/50 (52%), Gaps = 8/50 (16%)

Query: 76 TTVDVGQVIIAVDVAPGSGD---AAPAPE--PAAQP--EPEPEAPKGRQP 118
TTVDV QVI D+ PGS + A P PE PA P P P G +P
Sbjct: 303 TTVDV-QVIPRPDLTPGSAEAPNAQPLPEVSPAENPANNPAPNENPGTRP 351


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
C5746_20430  HTHFIS  67     3e-15    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 67.2 bits (164), Expect = 3e-15
Identities = 25/116 (21%), Positives = 48/116 (41%), Gaps = 2/116 (1%)

Query: 6 KITVFLLDDHEVVRRGVHELLSVEDDIEVVGEAATAADALVRIPATRPDVAVLDVRLPDG 65
T+ + DD +R +++ LS V + AA I A D+ V DV +PD
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAG--YDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 66 SGVEVCREVRSRDENIKCLMLTSYADDEALFDAIMAGASGYVLKAIRGNELLNAVR 121
+ ++ ++ ++ L++++ A GA Y+ K EL+ +
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_20445  PF07132  34     0.001    Harpin protein (HrpN)
		>PF07132#Harpin protein (HrpN)

Length = 356

Score = 34.3 bits (78), Expect = 0.001
Identities = 21/62 (33%), Positives = 25/62 (40%)

Query: 470 GGSTDAGTGTADGGSGTADGGTGAADGGSGTADGGTGAADGGTGAADGGAGGAATGATTG 529
G G G GG G++ GG G G G G + G G+A GG G A GA
Sbjct: 62 GSMMGGGLGGGLGGLGSSLGGLGGGLLGGGLGGGLGSSLGSGLGSALGGGLGGALGAGMN 121

Query: 530 TS 531

Sbjct: 122 AM 123


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_20450  YERSSTKINASE  32     0.006    Yersinia serine/threonine protein kinase signature.
		>YERSSTKINASE#Yersinia serine/threonine protein kinase signature.

Length = 732

Score = 32.4 bits (73), Expect = 0.006
Identities = 43/167 (25%), Positives = 74/167 (44%), Gaps = 34/167 (20%)

Query: 79 QHTNIVSVFDTGEDELGGALMPY-------IVMEYVEGQPLGSVLQ--ADIRNYGAMPAD 129
+H N+ +V G A++PY ++M+ V+G L+ AD G + ++
Sbjct: 189 KHPNLANVH-------GMAVVPYGNRKEEALLMDEVDGWRCSDTLRTLADSWKQGKINSE 241

Query: 130 R---ALKVTADVLAALDTSHEM---GLVHRDIKPGNVMVTK-RGVVKVMDFGIARAMQSG 182
+K A L LD ++ + G+VH DIKPGNV+ + G V+D G+
Sbjct: 242 AYWGTIKFIAHRL--LDVTNHLAKAGVVHNDIKPGNVVFDRASGEPVVIDLGLHSRSGEQ 299

Query: 183 VTSMTQTGMVVGTPQYLSPEQALGR-GVDARSDLYSVGIMLFQLLTG 228
T++ + +PE +G G +SD++ V L + G
Sbjct: 300 PKGFTES--------FKAPELGVGNLGASEKSDVFLVVSTLLHCIEG 338


155  C5746_20525  C5746_20565  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_20525  -2      12       -2.199236  penicillin-binding protein
C5746_20530  -1      10       -2.488517  Stk1 family PASTA domain-containing Ser/Thr
C5746_20540  1       13       -2.704722  class E sortase
C5746_20545  1       11       -2.573230  class E sortase
C5746_20550  -1      13       -1.614190  aminodeoxychorismate/anthranilate synthase
C5746_20555  1       14       -0.531247  hypothetical protein
C5746_20560  -1      15       -0.301125  class E sortase
C5746_20565  1       15       0.052609   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
C5746_20645  BLACTAMASEA  34     0.001    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 33.6 bits (77), Expect = 0.001
Identities = 51/273 (18%), Positives = 84/273 (30%), Gaps = 67/273 (24%)

Query: 211 ETYPPGSTFKVVTAAAALENGLYTGID-----EPTKSPLPYRLPLTTGNLENEGNIPCEN 265
E +P STFKVV A L + L P++ +L + +
Sbjct: 60 ERFPMMSTFKVVLCGAVLARVDAGDEQLERKIHYRQQDLVDYSPVSEKHLADGMTV---- 115

Query: 266 ASLRDA-LRVSCNTVFGKISDDLGNQKMIDQANKFGFNKEIFTPVRADASVYPKDNRPQN 324
L A + +S N+ + +G + F ++I D +
Sbjct: 116 GELCAAAITMSDNSAANLLLATVGGPAGLTA-----FLRQI-----GDNVTRLDRWETEL 165

Query: 325 AMAGIGQASNRATPLQMAMVAAAVANDGKLMQPYMVAERKAPNLDVIYTHEKEQLSQPLS 384
A G A + TP MA + SQ LS
Sbjct: 166 NEALPGDARDTTTPASMAATLRKLL-----------------------------TSQRLS 196

Query: 385 GENAQKLQQMMETVVKNGTGQ---RAKIP-GVTIGGKTGTAQHGLNNSEKPYAWFISYAK 440
+ ++L Q M V + R+ +P G I KTG + G ++
Sbjct: 197 ARSQRQLLQWM---VDDRVAGPLIRSVLPAGWFIADKTGAGERGARGI-------VALLG 246

Query: 441 TDSGSPVAVAVVVEDGQA----NRDDISGGGLA 469
++ + V + + D A I+G G A
Sbjct: 247 PNNKAERIVVIYLRDTPASMAERNQQIAGIGAA 279


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_20650  YERSSTKINASE  37     3e-04    Yersinia serine/threonine protein kinase signature.
		>YERSSTKINASE#Yersinia serine/threonine protein kinase signature.

Length = 732

Score = 37.0 bits (85), Expect = 3e-04
Identities = 34/119 (28%), Positives = 55/119 (46%), Gaps = 27/119 (22%)

Query: 90 IVMEYVDG----STLRELLHSGR-------------KLLPERTLEMTVGILQALEYSHRA 132
++M+ VDG TLR L S + K + R L++T + +A
Sbjct: 212 LLMDEVDGWRCSDTLRTLADSWKQGKINSEAYWGTIKFIAHRLLDVT-------NHLAKA 264

Query: 133 GIVHRDIKPANVMLTR-TGQVKVMDFGIARAMGDS--GMTMTQTAAVIGTAQYLSPEQA 188
G+VH DIKP NV+ R +G+ V+D G+ G+ G T + A +G + E++
Sbjct: 265 GVVHNDIKPGNVVFDRASGEPVVIDLGLHSRSGEQPKGFTESFKAPELGVGNLGASEKS 323


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_20660  PF05616  32     0.004    Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 32.4 bits (73), Expect = 0.004
Identities = 28/102 (27%), Positives = 39/102 (38%), Gaps = 10/102 (9%)

Query: 107 FERDWYGQQT------PAPTSAPTHREQPVAHPFPLDDETVGLRTADTRRLVDRAMTTRQ 160
F RD G T P P P E P A P P E + TR
Sbjct: 295 FGRDSQGNTTVDVQVIPRPDLTPGSAEAPNAQPLP---EVSPAENPANNPAPNENPGTR- 350

Query: 161 PNSGPEPELGSASDAESFAESGTGPETADPEPRTGGRAERRR 202
PN P+P+L ++ ++ + GT P++ R GR + R
Sbjct: 351 PNPEPDPDLNPDANPDTDGQPGTRPDSPAVPDRPNGRHRKER 392


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
C5746_20665  HTHFIS  27     0.042    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 27.5 bits (61), Expect = 0.042
Identities = 11/54 (20%), Positives = 21/54 (38%), Gaps = 1/54 (1%)

Query: 1 MSA-RILVVDNYDSFVFNLVQYLYQLGAECEVLRNDEVTTAHAQDGFDGVLLSP 53
M+ ILV D+ + L Q L + G + + N G ++++
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTD 54


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_20680  PF07675  28     0.048    Cleaved Adhesin
		>PF07675#Cleaved Adhesin

Length = 1358

Score = 28.1 bits (62), Expect = 0.048
Identities = 18/50 (36%), Positives = 25/50 (50%), Gaps = 4/50 (8%)

Query: 111 LTGRSVA--VTLDDAPPNATANPGYPDPQPND-LVIHQQDLQAVVNALWQ 157
LTG +V VTL + PN T NP P+P PN + + + A W+
Sbjct: 598 LTGSAVGQKVTLKWSAPNGTPNPN-PNPNPNPGTTTLSESFENGIPASWK 646


156  C5746_21250  C5746_21335  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_21250  -1      13       -0.218052  PAS domain S-box protein
C5746_21255  -1      13       0.898287   hypothetical protein
C5746_21260  0       10       1.676953   **DUF2797 domain-containing protein
C5746_21265  -1      8        1.048556   amidohydrolase
C5746_21270  0       9        1.008387   DNA-binding response regulator
C5746_21275  -2      10       1.709509   two-component sensor histidine kinase
C5746_21280  -2      12       1.650347   glycosyl transferase
C5746_21285  -3      12       0.899119   glycosyl transferase
C5746_21290  -3      14       -0.352165  metal-dependent hydrolase
C5746_21295  -2      13       -0.839559  MFS transporter
C5746_21300  -1      14       0.694457   TetR family transcriptional regulator
C5746_21310  0       16       -0.791767  PPOX class F420-dependent oxidoreductase
C5746_21315  0       12       -0.558246  hypothetical protein
C5746_21320  0       13       0.046009   MFS transporter
C5746_21325  -2      14       1.070349   MarR family transcriptional regulator
C5746_21330  -3      14       1.265793   hypothetical protein
C5746_21335  -2      18       0.751256   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
C5746_21375  HTHFIS  39     8e-05    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 39.4 bits (92), Expect = 8e-05
Identities = 14/81 (17%), Positives = 29/81 (35%), Gaps = 2/81 (2%)

Query: 1224 PRVLLIEEHEEIALALTETLERRGMQVARASTDNEAVALATRMRPNLVVMDLMQVRRRRA 1283
+L+ ++ I L + L R G V S +LVV D++
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 1284 GIVDWLRANGQLNRTPLVVYT 1304
++ ++ P++V +
Sbjct: 64 DLLPRIKKARP--DLPVLVMS 82


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_21380  SSBTLNINHBTR  36     1e-05    Streptomyces subtilisin inhibitor signature.
		>SSBTLNINHBTR#Streptomyces subtilisin inhibitor signature.

Length = 144

Score = 36.4 bits (83), Expect = 1e-05
Identities = 42/128 (32%), Positives = 53/128 (41%), Gaps = 20/128 (15%)

Query: 5 LALTAIA---SLAALSAAAPAATAATGPLPLPLPLPLLHADDAGTRLTVTVSGTGDPAAE 61
L LTA A LA S A+PA A+ P L L + H + A T +
Sbjct: 11 LGLTATAVCGPLAGASLASPATAPASLYAPSALVLTVGHGESAATAAPLRAV-------- 62

Query: 62 GVFELKCGPAG-GSHPAAQQACDRLEELAGEGADPFA-PVPGDAMCTQQFGGPATARVTG 119
L C P G+HPAA AC EL DP A MCT+++ P V G
Sbjct: 63 ---TLTCAPTASGTHPAAAAAC---AELRAAHGDPSALAAEDSVMCTREYA-PVVVTVDG 115

Query: 120 TWRGRGID 127
W+GR +
Sbjct: 116 VWQGRRLS 123


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
C5746_21405  HTHFIS  94     2e-24    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 94.1 bits (234), Expect = 2e-24
Identities = 35/112 (31%), Positives = 61/112 (54%)

Query: 22 RVLVVDDEAPLAELLSMALRYEGWEVRSAGDGAGAVRMARDFRPDAVILDVMLPDTDGFA 81
+LV DD+A + +L+ AL G++VR + A R D V+ DV++PD + F
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 82 VLGRLRRELSEVPVLFLTARDAVEDRIAGLTAGGDDYVTKPFSLEEVVARLR 133
+L R+++ ++PVL ++A++ I G DY+ KPF L E++ +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_21410  PF06580  31     0.007    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 31.4 bits (71), Expect = 0.007
Identities = 25/131 (19%), Positives = 40/131 (30%), Gaps = 27/131 (20%)

Query: 361 HWRLELPDEPATVYGDPTRLHQVLVNLLANARTH--TPPGTTVTVRVRATAGHPWVTLEV 418
+ ++ V P L Q LV N H + ++ T + VTLEV
Sbjct: 241 QFENQINPAIMDVQ-VPPMLVQTLVE---NGIKHGIAQLPQGGKILLKGTKDNGTVTLEV 296

Query: 419 QDDGPGIPPRLLPHVFERFARGDASRSRHAGSTGLGLAIVHAVIAAHGG---RVGVASVP 475
++ G STG GL V + G ++ ++
Sbjct: 297 ENTGSLALKNT------------------KESTGTGLQNVRERLQMLYGTEAQIKLSEKQ 338

Query: 476 GHTVFAVHLPA 486
G V +P
Sbjct: 339 GKVNAMVLIPG 349


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_21420  cloacin  45     7e-07    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 45.5 bits (107), Expect = 7e-07
Identities = 33/145 (22%), Positives = 49/145 (33%), Gaps = 4/145 (2%)

Query: 504 GHRGSIATAGPS--GASSMGGPGGGGRGGPGGMGGGMRPPGQGNQQGGGMGQPPTGAQGG 561
GH + + G + G GGG G G P G G+ G G GG
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSG-WSSENNPWGGGSGSGIHWGGGSGHGNGG 66

Query: 562 NQQNGRQPGAMPGTGTAPGTAPG-GTGEDGTPGGGGMGGLLNGSSVSAEAEKLLEKNAGD 620
N G +A G TPG GG+ ++ ++SA ++ G
Sbjct: 67 GNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMAALKGP 126

Query: 621 YTWAAAAIGSQNAASYQLATGDPVM 645
+ + + Q+A DP M
Sbjct: 127 FKFGLWGVALYGVLPSQIAKDDPNM 151


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits      Score  E-value  Comments
C5746_21425  cdtoxinb  34     4e-04    Cytolethal distending toxin B signature.
		>cdtoxinb#Cytolethal distending toxin B signature.

Length = 269

Score = 34.2 bits (78), Expect = 4e-04
Identities = 25/86 (29%), Positives = 37/86 (43%), Gaps = 10/86 (11%)

Query: 201 PGRQKVRQILLGDFNAAPAAPELAPLWEELTDIEPGGPTYPAQDPVQRIDYVAVSKDSVR 260
P Q + ++LGDFN PA E+ E P Q + +DY AV+ +SV
Sbjct: 180 PVHQALNWMILGDFNREPADLEMNLTVPVRRASEIISPAAATQTSQRTLDY-AVAGNSVA 238

Query: 261 VRDAAVAETL---------ASDHRPV 277
R + + + +SDH PV
Sbjct: 239 FRPSPLQAGIVYGARRTQISSDHFPV 264


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_21430  TCRTETB  159    1e-45    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 159 bits (404), Expect = 1e-45
Identities = 88/408 (21%), Positives = 176/408 (43%), Gaps = 15/408 (3%)

Query: 22 ILGVICLAQLTVLLDNTVLNVAIPSLTQELDASTADVQWMINAYSLVQSGLLLTAGSSAD 81
IL +C+ +L+ VLNV++P + + + A W+ A+ L S G +D
Sbjct: 15 ILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSD 74

Query: 82 RYGRKKMLVAGLALFGIGSLVAGLAQSSTQ-LIAARAGMGIGGALLMTTTLAVVVQIFDD 140
+ G K++L+ G+ + GS++ + S LI AR G G A + VV +
Sbjct: 75 QLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPK 134

Query: 141 SERVKAIGIWSTVSSLGFAVGPLIGGVMLDHFWWGAIFLINIPVAVIGLVAVVRLVPESK 200
R KA G+ ++ ++G VGP IGG++ + W +L+ IP+ I V + + + +
Sbjct: 135 ENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWS--YLLLIPMITIITVPFLMKLLKKE 192

Query: 201 NPSGERPDLLGALLSTIGMASVVYAIISGPEHGWTSGRVLLTAFIGVAVLIGFVLWELHI 260
D+ G +L ++G+ + S + + ++ + FV +
Sbjct: 193 VRIKGHFDIKGIILMSVGIVFFMLFTTS---YSISF-LIVSVLSFLI-----FVKHIRKV 243

Query: 261 PYPMLDMHFFRNQKFIGAVAGAILVAFGMGGSLFLLTQQLQFVVGYGPLEAGLRTA-PLA 319
P +D +N F+ V ++ + G + ++ ++ V E G P
Sbjct: 244 TDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGT 303

Query: 320 LSVVALNLTGLGARLVPKLGTPVTIAAGMSLLAAGLAAIALLGGDGYGGMLLGLVVMGAG 379
+SV+ +G LV + G + G++ L+ + L M + +V + G
Sbjct: 304 MSVIIFGY--IGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGG 361

Query: 380 IALAMPAMANAIMSAIPPEKAGVGAGVNGTLAEFGNGLGVAILGAVLN 427
++ ++ + S++ ++AG G + + G G+AI+G +L+
Sbjct: 362 LSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLS 409


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_21435  TETREPRESSOR  75     3e-18    Tetracycline repressor protein signature.
		>TETREPRESSOR#Tetracycline repressor protein signature.

Length = 218

Score = 74.6 bits (183), Expect = 3e-18
Identities = 49/221 (22%), Positives = 88/221 (39%), Gaps = 24/221 (10%)

Query: 33 AGLDRDRITAASVRLLDAEGLAKFSMRRLAAELDVTAMSLYWYVDTKDDLLE-LALDSV- 90
A L+R+ + A++ LL+ G+ + R+LA +L + +LYW+V K LL+ LA++ +
Sbjct: 2 ARLNRESVIDAALELLNETGIDGLTTRKLAQKLGIEQPTLYWHVKNKRALLDALAVEILA 61

Query: 91 -YSEIAPPREDAHWHDRLRELAVSYRELLVRHGWVSPLAGHFLNMGPHSMLFSHAMQDVV 149
+ + + P W LR A+S+R L+R+ + + +
Sbjct: 62 RHHDYSLPAAGESWQSFLRNNAMSFRRALLRYRDGAKVHLG-TRPDEKQYDTVETQLRFM 120

Query: 150 RATGLPLHQQTGALSAVFQFVYGFGTIEAHFVQRSAESGVSQDEFFQQALGTIRSQPQLS 209
G L A+SAV F G + + + DE L
Sbjct: 121 TENGFSLRDGLYAISAVSHFTLGAVLEQQEHTAALTDRPAAPDE-------------NLP 167

Query: 210 RIVESSQNLMDARGGDTVEEMRDRDFTFALDLLIAGIEAMR 250
++ + +MD+ G ++ F L+ LI G E
Sbjct: 168 PLLREALQIMDSDDG-------EQAFLHGLESLIRGFEVQL 201


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_21450  TCRTETB  134    1e-35    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 134 bits (339), Expect = 1e-35
Identities = 91/423 (21%), Positives = 176/423 (41%), Gaps = 21/423 (4%)

Query: 24 SGAPMTHRQIMEALAGLMLGMFVAILSSTVVSNALPEIISDLGGGQSAYTWVVTASLLAM 83
S + + H QI+ L L F ++L+ V++ +LP+I +D ++ WV TA +L
Sbjct: 6 SQSNLRHNQILIWLCILS---FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTF 62

Query: 84 TATTPLWGKLSDLFSKKLLVQIALVIYVAGSVVAGMSTSS-GMLIACRVVQGIGVGGLSA 142
+ T ++GKLSD K L+ ++I GSV+ + S +LI R +QG G A
Sbjct: 63 SIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPA 122

Query: 143 LAQIVMAAMIAPRERGRYSGYLGAVFAVATVGGPLLGGVITDTSWMGWRWCFYVGVPFAI 202
L +V+A I RG+ G +G++ A+ GP +GG+I W + + +P
Sbjct: 123 LVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHY----IHWSYLLLIPMIT 178

Query: 203 VALVVLQKTLKLPVVKREVKVDWWGALFISAAVSLLLVWVTFAGDKYDWLSWQTGVMLAG 262
+ V L V+ + D G + +S + +++ T Y L
Sbjct: 179 IITVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVFFMLFTT----SYSIS------FLIV 228

Query: 263 SLILTLLFVFIESRASEPIIPLRLFRNRTITLTSLASLFVGIAMFAGTVFFSQYFQLARG 322
S++ L+FV + ++P + L +N + L + + +
Sbjct: 229 SVLSFLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQ 288

Query: 323 KSPTMSG-VMTIPMIAGLFVSSTVSGQIITRTGRWKAWLVSGGFLVTAGLGLLGTIRYDT 381
S G V+ P + + + G ++ R G + FL + L + T
Sbjct: 289 LSTAEIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLT-ASFLLETT 347

Query: 382 EYWHIAVFMAVMGLGIGMMMQNLVLAAQNQVAPADLGSASSVVTFFRSLGGAIGVSALGA 441
++ + + V+G G+ + + + + G+ S++ F L G++ +G
Sbjct: 348 SWFMTIIIVFVLG-GLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGG 406

Query: 442 VMA 444
+++
Sbjct: 407 LLS 409


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_21465  cloacin  34     5e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 33.9 bits (77), Expect = 5e-04
Identities = 30/88 (34%), Positives = 36/88 (40%), Gaps = 10/88 (11%)

Query: 47 AGAGPEGAPSGAVAPSGKVTLVPLDRGGKGGGSAGDRADGSGPSPAASAAGDGSGGTSTA 106
+G G +GA + SG + P G GG S DGSG S + G GSG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGAS-----DGSGWSSENNPWGGGSGSGIHW 56

Query: 107 SGAAGRPRGTTGPGAVGAGSGTGSSGGG 134
G +G G G SG GS GG
Sbjct: 57 GGGSGH-----GNGGGNGNSGGGSGTGG 79


157  C5746_21810  C5746_21985  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_21810  0       13       -0.737176  hypothetical protein
C5746_21815  -1      11       -0.686631  hypothetical protein
C5746_21820  -1      13       -0.942395  flotillin
C5746_21825  -2      13       -0.294474  *hypothetical protein
C5746_21830  1       10       -1.100551  peptidase
C5746_21835  0       10       -0.382309  alcohol dehydrogenase
C5746_21840  1       11       -0.098478  PrsW family intramembrane metalloprotease
C5746_21845  2       11       -0.405627  tRNA (guanosine(46)-N7)-methyltransferase TrmB
C5746_21850  2       11       -0.524538  L-2-hydroxyglutarate oxidase
C5746_21860  2       12       0.085135   sporulation protein
C5746_21865  3       17       2.052819   asparagine synthase
C5746_21870  4       15       0.457729   AfsR family transcriptional regulator
C5746_21875  3       16       -0.057845  RNA polymerase subunit sigma-24
C5746_21895  3       16       -0.092019  TetR family transcriptional regulator
C5746_21900  2       14       -0.565477  FAD-dependent oxidoreductase
C5746_21905  1       12       -0.665676  PAS sensor protein
C5746_21910  -1      9        -1.520148  MFS transporter
C5746_21915  -1      11       -0.990871  MarR family transcriptional regulator
C5746_21925  0       12       -1.133372  DNA-binding response regulator
C5746_21930  0       11       -1.217498  two-component sensor histidine kinase
C5746_21935  -2      11       -0.513311  ABC transporter permease
C5746_21940  -2      11       -0.771252  multidrug ABC transporter ATP-binding protein
C5746_21945  -2      11       -0.114357  GNAT family N-acetyltransferase
C5746_21955  0       12       0.313498   type VI secretion protein
C5746_21970  -1      11       1.036907   ATP/GTP-binding protein
C5746_21975  -1      10       1.488406   hypothetical protein
C5746_21980  -1      15       2.248209   hypothetical protein
C5746_21985  -1      10       3.721604   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_21950  TYPE3IMSPROT  28     0.011    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 28.2 bits (63), Expect = 0.011
Identities = 12/70 (17%), Positives = 22/70 (31%), Gaps = 6/70 (8%)

Query: 14 LWFFLWIMWLFLLFKVISDIFRSDDLGGW-GKAGWLIVALVLPYLGVLVYVIVRGK---- 68
+W + + LL I L G + +I + + + Y +
Sbjct: 154 IWIIIKGNLVTLLQLPTCGIECITPLLGQILRQLMVICTVGFVVISIADYAFEYYQYIKE 213

Query: 69 -SMGKRDIKE 77
M K +IK
Sbjct: 214 LKMSKDEIKR 223


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
C5746_21960  IGASERPTASE  31     0.012    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 30.8 bits (69), Expect = 0.012
Identities = 24/168 (14%), Positives = 51/168 (30%), Gaps = 9/168 (5%)

Query: 166 NLAAPHAAAVASQARIAEAKA-DQEASQREQ-QAAALKAEYERDTAIKRAGFLAETEQYN 223
A ++ + E A + A RE + A + T E
Sbjct: 1038 ETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQT 1097

Query: 224 ARAAQAGPLAQARASQEVIEEQTSLAERQAALAAQRLEAEVRRPADAEAYRQRTLAEAAR 283
+ + + ++ E+ + + + ++ ++ ++E +P AE R+ +
Sbjct: 1098 TETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQ-AEPARENDPTVNIK 1156

Query: 284 DRVKFEADGSAYTERTLAQAQADANSVRATSLRDGNQELIAANRIVEN 331
E T Q A S + + N +VEN
Sbjct: 1157 -----EPQSQTNTTADTEQP-AKETSSNVEQPVTESTTVNTGNSVVEN 1198


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
C5746_21975  IGASERPTASE  44     9e-07    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 43.9 bits (103), Expect = 9e-07
Identities = 31/152 (20%), Positives = 54/152 (35%), Gaps = 7/152 (4%)

Query: 135 SDNLPDAKSLPGVGSLFSGE-SEADKSAAAAAAAPLTTAGLTTAEAEQGATDAGETLRA- 192
D P P S + +E K + A TTA+ + A +A ++A
Sbjct: 1020 VDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKAN 1079

Query: 193 ----RILQQAEQQQDAAAAEAKAAQEKAAAEKAAAEAKKQQDAAEAKA-AAEKKKAEEEA 247
+ Q + ++ E K EKA E +K Q+ + + + K++ E
Sbjct: 1080 TQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETV 1139

Query: 248 KAKAEALRLAKLAASYAIPTSSYTITSTFGQA 279
+ +AE R + P S T+ Q
Sbjct: 1140 QPQAEPARENDPTVNIKEPQSQTNTTADTEQP 1171


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
C5746_21990  PERTACTIN  36     4e-04    Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 35.8 bits (82), Expect = 4e-04
Identities = 43/160 (26%), Positives = 57/160 (35%), Gaps = 16/160 (10%)

Query: 339 TPAEPLALSSMRARGMARDAARHRHTLAARENWASPYGPGSAPPYGPGSAPPYGPRSAPP 398
TP A ++ + D +R+ LAA N APP P+ AP
Sbjct: 526 TPRGSAATFTLANKDGKVDIGTYRYRLAANGNGQWSLVGAKAPP---------APKPAPQ 576

Query: 399 YGPRSAPPYGPAYAPPYGPAYAPPYGPGHGPGFASPGQHASPRHGPSAQVTAHAKAQAKA 458
GP+ P PP P PP P P +P P G A+A
Sbjct: 577 PGPQPGPQPPQPPQPPQPP--QPPQPPQRQPEAPAP----QPPAGRELSAAANAAVNTGG 630

Query: 459 HGRAAARAVAEYESFATSLASLRRQARRGAA-GPDFAERE 497
G A+ AE + + L LR G A G FA+R+
Sbjct: 631 VGLASTLWYAESNALSKRLGELRLNPDAGGAWGRGFAQRQ 670


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_22010  TYPE3OMGPROT  30     0.040    Type III secretion system outer membrane G protein ...
		>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature.

Length = 607

Score = 29.9 bits (67), Expect = 0.040
Identities = 19/88 (21%), Positives = 29/88 (32%), Gaps = 11/88 (12%)

Query: 577 LRRVLAGAGIHELPPGWGTPSLATSNATA----RTGLRAALPDLIALFDAPLLADAGLVE 632
+RVL G + W LR L D A +DA +V
Sbjct: 9 FKRVLTGTLLLLSSYSWAQELDWLPIPYVYVAKGESLRDLLTDFGANYDAT------VVV 62

Query: 633 ARVVRKALRAASEGEPLPLDGLAHLAAT 660
+ + + E + P D L H+A+
Sbjct: 63 SDKINDKVSGQFEHDN-PQDFLQHIASL 89


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
C5746_22020  TONBPROTEIN  53     8e-10    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 53.1 bits (127), Expect = 8e-10
Identities = 25/89 (28%), Positives = 29/89 (32%)

Query: 433 PAAVAPAVPTPAPPKPTPTPTPTPKPDAPAPPAPPAPSPTPPPKSTPMPTPTPTPTPTPT 492
P AV P P+P P P P P +AP P P P P PK P P
Sbjct: 58 PQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPV 117

Query: 493 PKPTPPPKPSTPPTPKPPAPAPTPPPSPP 521
P +T P + A P
Sbjct: 118 ESRPASPFENTAPARLTSSTATAATSKPV 146



Score = 46.9 bits (111), Expect = 9e-08
Identities = 32/96 (33%), Positives = 37/96 (38%), Gaps = 10/96 (10%)

Query: 434 AAVAPAVPTPAPPKPTP----TPTPTPKPDAPAPPAPPAPSPTPPPKSTPMPTPTPTPTP 489
+V + PAP +P TP P A PP P P P P+ P P P P
Sbjct: 30 TSVHQVIELPAPAQPISVTMVTPADLEPPQAVQPPPEPVVEPEPEPEPIPEP-PKEAPVV 88

Query: 490 TPTPKPTPPPKPSTPP----TPKP-PAPAPTPPPSP 520
PKP P PKP PK P + P SP
Sbjct: 89 IEKPKPKPKPKPKPVKKVQEQPKRDVKPVESRPASP 124



Score = 40.4 bits (94), Expect = 1e-05
Identities = 20/75 (26%), Positives = 25/75 (33%), Gaps = 5/75 (6%)

Query: 456 PKPDAPAPPAPPAPSPTPPPKSTPMPTPTPTPTPTPTPKPTPPPKPSTPPTPKPPAPAPT 515
P P P P+ PP++ P P+P P P P P P
Sbjct: 39 PAPAQPISVTMVTPADLEPPQAVQPPPEPVVE-----PEPEPEPIPEPPKEAPVVIEKPK 93

Query: 516 PPPSPPAPTVYRVSE 530
P P P V +V E
Sbjct: 94 PKPKPKPKPVKKVQE 108



Score = 35.3 bits (81), Expect = 5e-04
Identities = 18/96 (18%), Positives = 21/96 (21%), Gaps = 3/96 (3%)

Query: 422 QPKPEAGHAAKPAAVAPAVPTPAP---PKPTPTPTPTPKPDAPAPPAPPAPSPTPPPKST 478
P P PKP P P P P P P S
Sbjct: 65 PEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPVESRPASP 124

Query: 479 PMPTPTPTPTPTPTPKPTPPPKPSTPPTPKPPAPAP 514
T T + T P S P+ +
Sbjct: 125 FENTAPARLTSSTATAATSKPVTSVASGPRALSRNQ 160


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_22025  HTHTETR  58     3e-12    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 58.5 bits (141), Expect = 3e-12
Identities = 38/253 (15%), Positives = 69/253 (27%), Gaps = 61/253 (24%)

Query: 60 PLRVDAQRNLEHVLRAAREVFGELGY-GAPMEDVARRAKVGVGTVYRRFPSKDVLVRRIA 118
+ +AQ +H+L A +F + G + ++A+ A V G +Y F K L I
Sbjct: 4 KTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIW 63

Query: 119 EEETSRLTDQARTALGQEEEPWSALSRFLRTSVASGAGRLLPPQVLRVGVDTDADADDSN 178
E S + + + P VLR +
Sbjct: 64 ELSESNIGELELEYQAKFPG--------------------DPLSVLREILIH-------- 95

Query: 179 AAGESGSIAASVSDSGPGSARDETRVPQQRQGAGQADLRPADGRSTAETGIEDDGTGAGE 238
+ T ++R+ + + + E
Sbjct: 96 -------------------VLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRNLCLE 136

Query: 239 LLEIVGRLVDRARAAGELRRDVTVADV----------LLVIATAAPSLPDAAQQAAASSR 288
+ + + + A L D+ L+ AP D ++A
Sbjct: 137 SYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDLKKEARD--- 193

Query: 289 LLDILLEGLRSRP 301
+ ILLE P
Sbjct: 194 YVAILLEMYLLCP 206


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_22035  PF06580  30     0.020    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 30.2 bits (68), Expect = 0.020
Identities = 9/36 (25%), Positives = 16/36 (44%), Gaps = 6/36 (16%)

Query: 520 LVANSLKHGTPPM------RLGLRRTDRRLIIEVTD 549
LV N +KHG + L + + + +EV +
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVEN 298


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_22040  TCRTETA  51     3e-09    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 51.4 bits (123), Expect = 3e-09
Identities = 61/278 (21%), Positives = 106/278 (38%), Gaps = 10/278 (3%)

Query: 13 ALSAFGLGFTVPYLYVYVAQV--RDLGAGTAGVVLAVFAMAALAVLPFTGRAIDRRGPLP 70
AL A G+G +P L + + + G++LA++A+ A P G DR G P
Sbjct: 15 ALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRP 74

Query: 71 VLVVASAVASLGAAALGLSSGVTASVLSAAVLGAGTAVMQPALATMLVWCSSTATRTRAF 130
VL+V+ A A++ A + + + + V G A A A + + R R F
Sbjct: 75 VLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYI-ADITDGDERARHF 133

Query: 131 AMQFFLQNLGLGIGGLVGGQIVDTSRPETFTLLFLIEAAMFVVLGVVACTVRLSRTPSFS 190
G+ G ++GG + S F AA+ + + C +
Sbjct: 134 GFMSACFGFGMVAGPVLGGLMGGFSP----HAPFFAAAALNGLNFLTGCFLLPESHKGER 189

Query: 191 DVRPTDGSEAPGGLRALLSHRAMVQLCVLGFVLFFACYGQFESGL-AAYGTEAAGIQPST 249
+ R + L + F++ GQ + L +G + +T
Sbjct: 190 RPLRREALNPLASFRWARGMTVVAALMAVFFIMQL--VGQVPAALWVIFGEDRFHWDATT 247

Query: 250 LGIALAANTAVIVVAQFVVLRLVERRRRSRVIAWVGLI 287
+GI+LAA + +AQ ++ V R R +G+I
Sbjct: 248 IGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMI 285


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
C5746_22070  HTHFIS  60     9e-13    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 59.8 bits (145), Expect = 9e-13
Identities = 25/115 (21%), Positives = 43/115 (37%), Gaps = 2/115 (1%)

Query: 2 TTVLIADDQPMQRFGFRMLLESQDDMTVVGEAANGHEALSLVSRHHPDVALMDIRMPLLD 61
T+L+ADD R L + +N ++ D+ + D+ MP +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRI--TSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 62 GIEATRRIIRAGARTRVLIVTTFDLDRYAYDGLRAGASGFLIKDALPEELLSGIR 116
+ RI +A VL+++ + A GA +L K EL+ I
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_22075  PF06580  33     0.001    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 33.3 bits (76), Expect = 0.001
Identities = 64/359 (17%), Positives = 114/359 (31%), Gaps = 68/359 (18%)

Query: 37 LVLLMSVAFSAPLLWRRTHPLAVLAFMTPFSLANVWTGAIVQAAYLQEIAVFNIALRLPM 96
+ LM + + + M L V +V V N ++ +
Sbjct: 47 AISLMGLVLTHAYRSFIKRQGWLKLNMGQIILR-VLPACVVIGMV---WFVANTSIWRLL 102

Query: 97 RALAWAGAAVTAPLVVGMLRFPHGWDVLFVPHLWAFALVSLLGIVVRTRKEYTEALVDRA 156
+ A T PL + ++ ++V+ V +W SLL K Y +A
Sbjct: 103 AFINTKPVAFTLPLALSII-----FNVVVVTFMW-----SLLYFGWHFFKNYKQA----- 147

Query: 157 HRLELERDQQARLAAAAERTRIAREMHDIIGH----NLSVITGLADGGAYAAAKSPDRAA 212
Q ++A+ A+ ++ I H L+ I L + P +A
Sbjct: 148 ------EIDQWKMASMAQEAQLMALKAQINPHFMFNALNNIRALI-------LEDPTKAR 194

Query: 213 QALEAIGTTSRQALTELRRLLGVLRDHPAEADRTPQPTLDDIDTLLT--GVRAAG-LPVH 269
+ L ++ R +L L D L +D+ L ++ L
Sbjct: 195 EMLTSLSELMRYSLRYSNARQVSLADE-----------LTVVDSYLQLASIQFEDRLQFE 243

Query: 270 LHIRGRPPSYPPTAGRQLTVYRVVQEALTNTLKHGGTHPSLTAEVTLTYRATE--LEALI 327
I P +VQ + N +KHG ++ L + +
Sbjct: 244 NQI-------NPAIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEV 296

Query: 328 TDTGTTGAE--PEGSGQGITGMRERASLYDGTLEAGPLLPKAIATDSAAGGWRVRLRLP 384
+TG+ + E +G G+ +RER + GT EA I G + +P
Sbjct: 297 ENTGSLALKNTKESTGTGLQNVRERLQMLYGT-EAQ------IKLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_22090  SACTRNSFRASE  33     2e-04    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 33.4 bits (76), Expect = 2e-04
Identities = 16/78 (20%), Positives = 30/78 (38%), Gaps = 4/78 (5%)

Query: 101 AHVVGVFVRPEARGEGLAEELFRAGADWAWSLVEPRIERVRLYVHEDNARAAVLYRRIGF 160
A + + V + R +G+ L +WA E + L + N A Y + F
Sbjct: 90 ALIEDIAVAKDYRKKGVGTALLHKAIEWA---KENHFCGLMLETQDINISACHFYAKHHF 146

Query: 161 VATG-ESVPVPGDPSARE 177
+ +++ P+A E
Sbjct: 147 IIGAVDTMLYSNFPTANE 164


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
C5746_22095  TONBPROTEIN  43     9e-07    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 43.4 bits (102), Expect = 9e-07
Identities = 22/91 (24%), Positives = 30/91 (32%), Gaps = 2/91 (2%)

Query: 138 PSQRQEPRPDAQNPQQTEPQPTPEPEPRPTAHSPEAVPTPAPAPAPAPAPVEHTPPE--P 195
P Q +P P+ + EP+P PEP + P P P P P E + P
Sbjct: 57 PPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKP 116

Query: 196 ALPSPRTPLVVYGPAATRRPAVIQAVQEADG 226
P +P PA A +
Sbjct: 117 VESRPASPFENTAPARLTSSTATAATSKPVT 147



Score = 34.2 bits (78), Expect = 9e-04
Identities = 20/90 (22%), Positives = 28/90 (31%), Gaps = 2/90 (2%)

Query: 150 NPQQTEPQPTPEPEPRPTAHSPEAVPTPAPAPAPAPAPVEHTPPEPALPSPRTPLVVYGP 209
P EP +P P P P AP +E P+P V P
Sbjct: 51 TPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQP 110

Query: 210 AATRRPAVIQ--AVQEADGPALVVTSDPTV 237
+P + + E PA + +S T
Sbjct: 111 KRDVKPVESRPASPFENTAPARLTSSTATA 140



Score = 30.0 bits (67), Expect = 0.017
Identities = 18/61 (29%), Positives = 25/61 (40%), Gaps = 2/61 (3%)

Query: 136 PVPSQRQEPRPDAQNPQQTEPQPTPEPEPRPTAHSPEAVPTPAPAPAPAPAPVEHTPPEP 195
PV EP P + P++ P +P+P+P P+ V P PVE P P
Sbjct: 67 PVVEPEPEPEPIPEPPKE-APVVIEKPKPKPKP-KPKPVKKVQEQPKRDVKPVESRPASP 124

Query: 196 A 196

Sbjct: 125 F 125


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_22110  cloacin  39     3e-05    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 38.9 bits (90), Expect = 3e-05
Identities = 26/79 (32%), Positives = 32/79 (40%)

Query: 363 SGDSGSGGARPANAVSGGVAAHSSRNSGGGSGVGGGSVPSAAPPPRSGSGPTSGTPHGSR 422
SG G G A++ SG + + GG G S P GSG GS
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 423 SPRGGGSGSSGNNSTGGGG 441
GGG+G+SG S GG
Sbjct: 62 HGNGGGNGNSGGGSGTGGN 80



Score = 29.7 bits (66), Expect = 0.029
Identities = 21/54 (38%), Positives = 27/54 (50%), Gaps = 4/54 (7%)

Query: 356 SSRSGQNSGDSGSGGARPANAVSGGVAAH----SSRNSGGGSGVGGGSVPSAAP 405
S SG +S ++ GG + GG + H + NSGGGSG GG AAP
Sbjct: 34 SDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAP 87


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_22115  CHANLCOLICIN  30     0.013    Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 29.7 bits (66), Expect = 0.013
Identities = 20/54 (37%), Positives = 27/54 (50%), Gaps = 5/54 (9%)

Query: 81 GSGGGSGNGGSKPDSSSTAATGTKPVTGKNGSIPSGFAHDEQGAQSAASNFAVA 134
GSGGG G GGSK +SS+ K T + + EQ A++ A+ A A
Sbjct: 33 GSGGGGGKGGSKSESSAAIHATAKWSTAQLKKTQA-----EQAARAKAAAEAQA 81


158  C5746_23205  C5746_23240  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_23205  1       11       0.865430   N-acetylmuramoyl-L-alanine amidase
C5746_23210  -1      13       1.452225   prolyl aminopeptidase
C5746_23215  1       10       3.307948   hypothetical protein
C5746_23220  1       11       2.443661   MarR family transcriptional regulator
C5746_23225  2       10       2.402152   ABC transporter ATP-binding protein
C5746_23230  1       10       1.691792   multidrug ABC transporter permease
C5746_23235  1       11       1.710365   translation initiation factor IF-2
C5746_23240  -1      12       0.883704   AAA family ATPase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_23310  PF03544  43     2e-06    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 42.7 bits (100), Expect = 2e-06
Identities = 23/108 (21%), Positives = 29/108 (26%), Gaps = 8/108 (7%)

Query: 214 PAADPSADPSDPAATEPP-------APNDSTEPPAEVIPSDPASPSPSDTATVEPIPPVS 266
PA S PA EPP P EP E IP P +P +P P
Sbjct: 46 PAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPK-EAPVVIEKPKPKPKPK 104

Query: 267 PSPSAPASSSPAATLPPAPPSTVPKPPITSRAGWGADESMSPEAPEYT 314
P P P P + + + P +
Sbjct: 105 PKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPVTS 152



Score = 40.3 bits (94), Expect = 1e-05
Identities = 23/132 (17%), Positives = 32/132 (24%), Gaps = 11/132 (8%)

Query: 219 SADPSDPAATEPPAPNDSTEPPAEVIPSDPASPSPSDTATVEPIPPVSPSPSAPASSSPA 278
P+ P + AP D P A P P EP P P P A
Sbjct: 43 LPAPAQPISVTMVAPADLEPPQA-------VQPPPEPVVEPEPEPEPIPEPPKEAPVVIE 95

Query: 279 ATLPPAPPSTVPKPPITSRAGWGADESMSPEAPEYTDTVKAVFVHHTAGTSN----YSCA 334
P P P + P +P ++ +
Sbjct: 96 KPKPKPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVAS 155

Query: 335 DSAAIVRSVYAY 346
A+ R+ Y
Sbjct: 156 GPRALSRNQPQY 167



Score = 31.9 bits (72), Expect = 0.008
Identities = 11/91 (12%), Positives = 23/91 (25%), Gaps = 1/91 (1%)

Query: 210 AADIPAADPSADPSDPAATEPPAPNDSTEPPAEVIPSDPASPSPSDTATVEPIPPVSPSP 269
+ +P +P P + P P V+P+ SP
Sbjct: 70 PEPVVEPEPEPEPI-PEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVESRPASP 128

Query: 270 SAPASSSPAATLPPAPPSTVPKPPITSRAGW 300
+ + + ++ P + S
Sbjct: 129 FENTAPARPTSSTATAATSKPVTSVASGPRA 159


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_23335  ABC2TRNSPORT  66     9e-15    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 66.1 bits (161), Expect = 9e-15
Identities = 57/208 (27%), Positives = 95/208 (45%), Gaps = 13/208 (6%)

Query: 87 GGVPYLDFLAPGIIAQSAMFIAIFYGIMIIWER--DSGVLTKLMVTPTPRTALVTGKAFA 144
GGV Y FLA G++A SAM A F I + R ++ T +V G+
Sbjct: 61 GGVSYTAFLAAGMVATSAMTAATFETIYAAFGRMEGQRTWEAMLYTQLRLGDIVLGEMAW 120

Query: 145 AGVKAVIQAAVVIVIAALIGVGMTWNPLRLLGVVVVVLLASAFFSCLSMSIAGIVLTRDR 204
A KA + A + V+AA +G + L L V+ + LA F+ L M + + + D
Sbjct: 121 AATKAALAGAGIGVVAAALGYTQWLSLLYALPVIALTGLA---FASLGMVVTALAPSYDY 177

Query: 205 LMGIGQAITMPLFFASNALYPVALMPGWLQAISKVNPLSYEVDALRGLLIGTQA-----H 259
+ + P+ F S A++PV +P Q ++ PLS+ +D +R +++G H
Sbjct: 178 FIFYQTLVITPILFLSGAVFPVDQLPIVFQTAARFLPLSHSIDLIRPIMLGHPVVDVCQH 237

Query: 260 LGLDLAVLAFAALAGIVAAGSLLGRLAR 287
+G A+ + + ++ L RL R
Sbjct: 238 VG---ALCIYIVIPFFLSTALLRRRLLR 262


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_23345  SYCDCHAPRONE  44     2e-07    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 43.8 bits (103), Expect = 2e-07
Identities = 17/91 (18%), Positives = 34/91 (37%), Gaps = 3/91 (3%)

Query: 19 GRVDEAKAMLAKRLAQDPEDVRAWSQLGRCHEQLKEYEQAVEATGAALAIDPEYVNALII 78
G+ ++A + D D R + LG C + + +Y+ A+ + +D +
Sbjct: 50 GKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFH 109

Query: 79 RTYALRRLRRTDES---LAAAQEAVRLAPHY 106
L + E+ L AQE + +
Sbjct: 110 AAECLLQKGELAEAESGLFLAQELIADKTEF 140


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_23350  PF05272  30     0.031    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.7 bits (66), Expect = 0.031
Identities = 24/82 (29%), Positives = 35/82 (42%), Gaps = 14/82 (17%)

Query: 42 VAEAAVALQHAPGDAAARALMMRA-MGLPAPADDKAEKAEEPPKAEEPEQPAAHRPSTGF 100
V++AA + G + ++M A G PAP +PP+ E P +P +
Sbjct: 86 VSKAAAQVAREEGLESVAGIVMGAPAGAPAP---------KPPRPEPPPRPVVEKEC--- 133

Query: 101 DWAAAENEVAGAVPPRFVAPAP 122
W + AVPP F PAP
Sbjct: 134 -WETIQPVPEHAVPPSFWHPAP 154


159  C5746_24020  C5746_24080  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_24020  0       16       0.748229   serine hydrolase
C5746_24025  -1      14       0.301619   undecaprenyl-phosphate
C5746_24030  -1      15       0.821953   hypothetical protein
C5746_24040  -2      10       0.181134   DUF397 domain-containing protein
C5746_24045  -1      11       0.554166   transcriptional regulator
C5746_24050  0       10       0.071914   hypothetical protein
C5746_24055  -1      8        -0.043750  hypothetical protein
C5746_24060  -1      7        1.058854   TetR family transcriptional regulator
C5746_24065  0       12       0.411558   acyltransferase
C5746_24070  0       12       1.627501   serine/threonine protein kinase
C5746_24075  -1      12       2.039613   MFS transporter
C5746_24080  0       13       2.468240   glycoside hydrolase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
C5746_24150  BLACTAMASEA  51     8e-10    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 51.3 bits (123), Expect = 8e-10
Identities = 53/269 (19%), Positives = 96/269 (35%), Gaps = 32/269 (11%)

Query: 1 MHALDIDSGAQL-GVEADQLVCTASVHKLCLLVALYDRAAAGDLDLTEQVECSSDPRTPG 59
M +D+ SG L AD+ S K+ L A+ R AGD L ++
Sbjct: 42 MIEMDLASGRTLTAWRADERFPMMSTFKVVLCGAVLARVDAGDEQLERKIHYRQQDLVDY 101

Query: 60 PTGLAAMLDAARLSLRDLAYLMMAVSDNAAADLLLARI-GLDAVNGATARLGLDATRAVH 118
L +++ +L + +SDN+AA+LLLA + G + ++G + TR
Sbjct: 102 SPVSEKHLADG-MTVGELCAAAITMSDNSAANLLLATVGGPAGLTAFLRQIGDNVTRLDR 160

Query: 119 TFGAMLASVKEDAGPAGAQALADPHVVARLRALDPARSNRSTPRDMTRLLRAVWRDEACL 178
+ ++ DA + +TP M LR + +
Sbjct: 161 WETELNEALPGDA------------------------RDTTTPASMAATLRKLLTSQRLS 196

Query: 179 PEHGAAIRRVM-GLQVWPHRLASGFPFDDVLVAGKTGSLPT-LRNEVGVVEYPDGGRYAV 236
+ + M +V + S P +A KTG+ R V ++ + V
Sbjct: 197 ARSQRQLLQWMVDDRVAGPLIRSVLP-AGWFIADKTGAGERGARGIVALLGPNNKAERIV 255

Query: 237 AVFTRTARTAATLPAADAVIGTAARIAVD 265
++ R T A++ + I ++
Sbjct: 256 VIYLRD--TPASMAERNQQIAGIGAALIE 282


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits  Score  E-value  Comments
C5746_24165  SECA  25     0.027    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 24.8 bits (54), Expect = 0.027
Identities = 10/37 (27%), Positives = 18/37 (48%)

Query: 22 IELAPVDGSIKMRESDDPDVVVTTTVAKLRAFVLGVK 58
++ V + M D PD+V T K++A + +K
Sbjct: 407 LDTVVVPTNRPMIRKDLPDLVYMTEAEKIQAIIEDIK 443


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_24190  PF00577  26     0.037    Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 25.6 bits (56), Expect = 0.037
Identities = 8/26 (30%), Positives = 11/26 (42%), Gaps = 1/26 (3%)

Query: 47 GDA-VVTTGAFESQQAFQAAFAAVAR 71
GD V A S Q F +++V
Sbjct: 342 GDLQVTIKEADGSTQIFTVPYSSVPL 367


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_24195  HTHTETR  45     2e-08    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 45.4 bits (107), Expect = 2e-08
Identities = 20/78 (25%), Positives = 40/78 (51%)

Query: 5 RTPRENWVEEGLRVLASGGVDAVRVEALAKSLGVTKGGFYGYFADRDALLKEMLDTWERE 64
+ R++ ++ LR+ + GV + + +AK+ GVT+G Y +F D+ L E+ + E
Sbjct: 10 QETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESN 69

Query: 65 ATDDVLDRIEREGGDLMD 82
+ L+ + GD +
Sbjct: 70 IGELELEYQAKFPGDPLS 87


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_24205  YERSSTKINASE  33     0.005    Yersinia serine/threonine protein kinase signature.
		>YERSSTKINASE#Yersinia serine/threonine protein kinase signature.

Length = 732

Score = 32.8 bits (74), Expect = 0.005
Identities = 22/52 (42%), Positives = 30/52 (57%), Gaps = 5/52 (9%)

Query: 108 AGVVHRDLKPANLMLT-AGGEVKVLDFGIARYMSASAKSSKVMGTLAYMAPE 158
AGVVH D+KP N++ A GE V+D G+ S S + K T ++ APE
Sbjct: 264 AGVVHNDIKPGNVVFDRASGEPVVIDLGL---HSRSGEQPKGF-TESFKAPE 311


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_24210  TCRTETA  44     4e-07    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 44.4 bits (105), Expect = 4e-07
Identities = 64/354 (18%), Positives = 120/354 (33%), Gaps = 18/354 (5%)

Query: 34 IFSIVTTEILPIGL----LTSIGSDFAVSDGTA---GLMMTMPGLLAAVSAPLVTVVTAR 86
I S V + + IGL L + D S+ G+++ + L+ AP++ ++ R
Sbjct: 10 ILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDR 69

Query: 87 VDRRVMLCAFMLLLALADFLVAAATDYRLVLVSRVMVGITIGGFWSIGAGLAGRLVRPAS 146
RR +L + A+ ++A A ++ + R++ GIT G ++ +
Sbjct: 70 FGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGIT-GATGAVAGAYIADITDGDE 128

Query: 147 VGRATAVIFSAVPLGSVLGVPAGTFIGGLTGWRTAFVAMGVLTMGVLALMLLVIP----- 201
R + + G V G G +GG F A L ++P
Sbjct: 129 RARHFGFMSACFGFGMVAGPVLGGLMGGF-SPHAPFFAAAALNGLNFLTGCFLLPESHKG 187

Query: 202 ---ALPPTQVSRASVLRTVLGRTGTRFALLLTFLIVLSHFATYTYVTPFLEQVTRADPAL 258
L ++ + R G T + + F++ L F E D
Sbjct: 188 ERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATT 247

Query: 259 ITVFLLVYGAAGIVGNFVGGALVTRRPRVAVSLAAGLIATATALLPVLGTGEAGAVALLI 318
I + L +G + + V R +L G+IA T + + ++
Sbjct: 248 IGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIM 307

Query: 319 VWGVAYGAVPVCSQTWFAKAAPDAPEAASVLFTASFQA-TFALGALTGGAILDH 371
V + G Q ++ + + A+ + T +G L AI
Sbjct: 308 VLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAA 361


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
C5746_24215  adhesinmafb  32     0.003    Neisseria meningitidis: adhesin MafB signature.
		>adhesinmafb#Neisseria meningitidis: adhesin MafB signature.

Length = 467

Score = 31.6 bits (71), Expect = 0.003
Identities = 30/134 (22%), Positives = 42/134 (31%), Gaps = 10/134 (7%)

Query: 25 GVAGGVLSTIAVAGAAGPAQAEPVTQTIEMPTITAGLSTAVAASAQATEQVAQDLQTQAQ 84
GVA G L+ AG A + A + A+ V L + A
Sbjct: 230 GVAAGALNPFISAGEALGIGD--ILYGTRYAIDKAAMRNIAPLPAEGKFAVIGGLGSVAG 287

Query: 85 EDAAAATAAKTAKKAKAEAVRKAEAKKKAEAAAKAKAEAAERASRT-AARTTLSATSGSS 143
+ A + A E A A AA+ A AA+ +A SG
Sbjct: 288 FEKNTREAVDRWIQENPNAAETVE-------AVFNVAAAAKVAKLAKAAKPGKAAVSGDF 340

Query: 144 AGSAVASASSSSSS 157
A S + S S+
Sbjct: 341 ADSYKKKLALSDSA 354


160  C5746_24275  C5746_24310  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_24275  0       15       -0.987345  DNA-binding response regulator
C5746_24280  0       14       -0.399071  histidine kinase
C5746_24285  -1      13       -0.285399  histidine kinase
C5746_24290  -1      11       0.148932   transposase
C5746_24295  0       12       0.458291   resolvase
C5746_24300  -1      13       0.792940   NADH-quinone oxidoreductase subunit A
C5746_24305  -1      14       0.892579   hydroxyacid dehydrogenase
C5746_24310  -2      13       0.361120   dehydrogenase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
C5746_24415  HTHFIS  49     5e-09    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 48.7 bits (116), Expect = 5e-09
Identities = 24/89 (26%), Positives = 41/89 (46%), Gaps = 9/89 (10%)

Query: 2 RVVIAEDSVLLREGLTRLLTDLGHDVVAGVGDAEALIKTVRDLAGQDALPDVVVADVRMP 61
+++A+D +R L + L+ G+DV +A L + + D+VV DV MP
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRI-TSNAATLWRWIA-----AGDGDLVVTDVVMP 58

Query: 62 PTHTDEGVRAAVRLRKDYPGIGVLVLSQY 90
+ R++K P + VLV+S
Sbjct: 59 DEN---AFDLLPRIKKARPDLPVLVMSAQ 84


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_24420  PF06580  37     1e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 37.2 bits (86), Expect = 1e-04
Identities = 18/85 (21%), Positives = 32/85 (37%), Gaps = 11/85 (12%)

Query: 368 FTVSELLQNVSKHARATRAS-----VDVWRTADRLMLQVTDNGRGGADV---SSGSGLAG 419
V L++N KH A + + + L+V + G S+G+GL
Sbjct: 258 MLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGTGLQN 317

Query: 420 LTERLDAV---DGVLVVDSPQGGPT 441
+ ERL + + + + QG
Sbjct: 318 VRERLQMLYGTEAQIKLSEKQGKVN 342


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
C5746_24425  MALTOSEBP  35     6e-04    Maltose binding protein signature.
		>MALTOSEBP#Maltose binding protein signature.

Length = 396

Score = 34.7 bits (79), Expect = 6e-04
Identities = 42/180 (23%), Positives = 68/180 (37%), Gaps = 45/180 (25%)

Query: 108 PWLWSSLTDPVGWRAVLYSFIRLPWGVLTFTVTLVSLFVLWPVLPYIARLLANADRAMAR 167
PW WS++ + + VT++ F P P++ L A + A
Sbjct: 255 PWAWSNIDT----------------SKVNYGVTVLPTFKGQPSKPFVGVLSAGINAA--- 295

Query: 168 ALLSPSDELERRIAE--LESDRGVVVDTAAADLRRIERDLHDGAQARLVALAMGLGLAKE 225
SP+ EL + E L +D G L + +D GA A L +E
Sbjct: 296 ---SPNKELAKEFLENYLLTDEG---------LEAVNKDKPLGAVA--------LKSYEE 335

Query: 226 KLTDDPEAAARMVDEAHGEVKVALQELRDLARGIHPAVLT----DRGLDAALSAIASRCT 281
+L DP AA M + GE+ + ++ + AV+ + +D AL +R T
Sbjct: 336 ELAKDPRIAATMENAQKGEIMPNIPQMSAFWYAVRTAVINAASGRQTVDEALKDAQTRIT 395


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_24450  PF05616  33     0.002    Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 32.8 bits (74), Expect = 0.002
Identities = 24/80 (30%), Positives = 28/80 (35%), Gaps = 10/80 (12%)

Query: 297 PGSSDAPWHHARPAFDTTGGPQAEPPVREGAGRGEAPRSDTSVPEATPTADTTAPTDTPD 356
PGS++AP P P P E G P D P+ P A+ PD
Sbjct: 317 PGSAEAPNAQPLPEVSPAENPANNPAPNENPGTRPNPEPD---PDLNPDAN-------PD 366

Query: 357 QPEPPAQDPTPEPAPDRPAG 376
P P PDRP G
Sbjct: 367 TDGQPGTRPDSPAVPDRPNG 386



Score = 31.6 bits (71), Expect = 0.005
Identities = 21/61 (34%), Positives = 27/61 (44%), Gaps = 4/61 (6%)

Query: 312 DTTGGPQAEPPVREGAGRGEAPRSDTSVPEATPTADTTAPTDTPDQPEPPAQDPTPEPAP 371
+TT Q P G EAP + +PE +P + P + P E P P PEP P
Sbjct: 302 NTTVDVQVIPRPDLTPGSAEAPNAQ-PLPEVSPAEN---PANNPAPNENPGTRPNPEPDP 357

Query: 372 D 372
D
Sbjct: 358 D 358


161  C5746_24425  C5746_24455  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_24425  0       11       -0.262099  TetR family transcriptional regulator
C5746_24430  -1      12       -0.815017  MFS transporter
C5746_24435  1       14       -0.230352  UDP-N-acetylenolpyruvoylglucosamine reductase
C5746_24440  2       16       0.077049   epimerase
C5746_24445  3       18       0.773583   TetR family transcriptional regulator
C5746_24450  1       25       1.596106   aspartate aminotransferase
C5746_24455  2       28       0.084867   *preprotein translocase subunit SecE
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_24575  HTHTETR  61     5e-14    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 61.2 bits (148), Expect = 5e-14
Identities = 28/142 (19%), Positives = 60/142 (42%), Gaps = 6/142 (4%)

Query: 3 AEERRESVIRAAITEFARGGYAATSTEAIAKRVGVSQPYLFRLFPNKQAMFLAAAERCLA 62
A+E R+ ++ A+ F++ G ++TS IAK GV++ ++ F +K +F E +
Sbjct: 9 AQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSES 68

Query: 63 DTRKVFSDAAEGLEGDEA---LHAMAAAYQRLIVGDPDKLLMQMQTYAAVAAAEASGDHE 119
+ ++ + GD + + + + +LLM++ + E + +
Sbjct: 69 NIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQ 128

Query: 120 FGESTRAGWLQMWDEIHLALGA 141
R L+ +D I L
Sbjct: 129 AQ---RNLCLESYDRIEQTLKH 147


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_24580  TCRTETB  155    2e-44    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 155 bits (393), Expect = 2e-44
Identities = 87/400 (21%), Positives = 162/400 (40%), Gaps = 14/400 (3%)

Query: 23 FMAALDNLVVTTALPSIRESLGGELAELEWTVNAYTLTFAVLLMLGAALGDRFGRRRLFL 82
F + L+ +V+ +LP I A W A+ LTF++ + L D+ G +RL L
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 83 AGLTVFTGASAAAALSPGINE-LIAFRAVQGVGAAIMMPLTLTLLTAAVPPARRGTALGI 141
G+ + S + LI R +QG GAA L + ++ +P RG A G+
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143

Query: 142 FSAITGLAVAGGPLIGGSLTEHLSWQWIFWLNVPIGLVLLPLARLRLTESHAPDSRLDIP 201
+I + GP IGG + ++ W ++ + + I ++ +P L + DI
Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPM-ITIITVPFLMKLLKKEVRIKGHFDIK 202

Query: 202 GTLLVSAGLFGIVYGLVNADSDGWTSPTVLTALFAGAALIGGFIHHGFHAKNPMLPMRLF 261
G +L+S G+ + S + V F F+ H +P + L
Sbjct: 203 GIILMSVGIVFFMLF---TTSYSISFLIVSVLSF------LIFVKHIRKVTDPFVDPGLG 253

Query: 262 RSRAFFGINMAGLLMFLGMFGSIFLLSQYFQGVLGYSPTEAG-LRMLPWTGMPMLVAPIA 320
++ F + G ++F + G + ++ + V S E G + + P T ++ I
Sbjct: 254 KNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIG 313

Query: 321 GILSDRLGGRPVVVAGLALQAVGLALFAMVIGPDASYSSQLPGLIVGGVGMALYFAPAAG 380
GIL DR G V+ G+ +V + + + + ++ G++ +
Sbjct: 314 GILVDRRGPLYVLNIGVTFLSVSFLTASFL--LETTSWFMTIIIVFVLGGLSFTKTVIST 371

Query: 381 LVMSSVRPAEQGIASGANNALREVGGALGVAVLATVFSSQ 420
+V SS++ E G N + G+A++ + S
Sbjct: 372 IVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIP 411


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_24590  NUCEPIMERASE  28     0.032    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 27.8 bits (62), Expect = 0.032
Identities = 11/30 (36%), Positives = 18/30 (60%)

Query: 1 MRITVFGATGGIGREIVRQAVAAGHEVTAV 30
M+ V GA G IG + ++ + AGH+V +
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGI 30


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_24595  HTHTETR  68     2e-16    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 68.1 bits (166), Expect = 2e-16
Identities = 33/164 (20%), Positives = 60/164 (36%), Gaps = 2/164 (1%)

Query: 2 EQKPARVRIIDAAHQLMLTIGLARATTKEIARAAGCSEAALYKHFPSKEELFVAVLKERL 61
E + R I+D A +L G++ + EIA+AAG + A+Y HF K +LF + +
Sbjct: 8 EAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSE 67

Query: 62 PRLNPLLKRLIE-TPGVGERTVEQNLTEIARQAALFYEQSFPIAASLYAEPRLKERHNAA 120
+ L PG + + L + + + + + E
Sbjct: 68 SNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQ 127

Query: 121 MRELGTGPHMPIRGLDAYLRSEQAAGRVRADADTYAAASLLLGA 164
+ ++ L+ A + AD T AA ++ G
Sbjct: 128 QAQRNLCLES-YDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGY 170


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_24615  SECETRNLCASE  34     1e-05    Bacterial translocase SecE signature.
		>SECETRNLCASE#Bacterial translocase SecE signature.

Length = 127

Score = 34.5 bits (79), Expect = 1e-05
Identities = 15/39 (38%), Positives = 26/39 (66%), Gaps = 1/39 (2%)

Query: 41 LFYRQIVAELRKVVWPTRNQLTTYTSVVIVFVVVMIGLV 79
F R+ E+RKV+WPTR Q T +T++++ V ++ L+
Sbjct: 70 AFAREARTEVRKVIWPTR-QETLHTTLIVAAVTAVMSLI 107


162  C5746_24875  C5746_24920  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
C5746_24875  1       14       -2.242679   alanine racemase
C5746_24880  0       15       -1.476991   alpha/beta hydrolase
C5746_24885  -1      13       -1.378305   tRNA
C5746_24890  0       14       -1.353763   hypothetical protein
C5746_24895  0       13       -1.334728   hypothetical protein
C5746_24900  0       9        -0.982529   tRNA
C5746_24905  -2      12       -0.582055   ribosomal-protein-alanine N-acetyltransferase
C5746_24910  -3      9        -1.049914   tRNA
C5746_24915  -3      10       -0.237961   TetR/AcrR family transcriptional regulator
C5746_24920  -2      10       0.231547    MFS transporter
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
C5746_25015  ALARACEMASE  391    e-138    Alanine racemase signature.
		>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 391 bits (1006), Expect = e-138
Identities = 126/371 (33%), Positives = 178/371 (47%), Gaps = 21/371 (5%)

Query: 9 ARAEIDLAALRANVRVLRARAAGAQLMAVVKSDAYGHGAVPCARAALEAGATWLGTATPQ 68
+A +DL AL+ N+ ++R A A++ +VVK++AYGHG A +
Sbjct: 5 IQASLDLQALKQNLSIVRQAATHARVWSVVKANAYGHGIERIWSAIGATD--GFALLNLE 62

Query: 69 EALALRAAGLGGRV-MCWLWTPGGPWREAIEADIDVSVSGMWALREVVAAATEAGIPARV 127
EA+ LR G G + M + + + V W L+ + A +A P +
Sbjct: 63 EAITLRERGWKGPILMLEGFFHAQDLEIYDQHRLTTCVHSNWQLKALQNARLKA--PLDI 120

Query: 128 QLKADTGLGRNGCQPADWPELVAAARAAEDAGTVRITGLWSHFACADEPGHPSIAAQLNL 187
LK ++G+ R G QP ++ + V L SHFA A+ P I+ +
Sbjct: 121 YLKVNSGMNRLGFQP---DRVLTVWQQLRAMANVGEMTLMSHFAEAEHPDG--ISGAMAR 175

Query: 188 FRDMVAYAEKEGVDPEVRHIANSPATLTIPEAHFDLVRTGIAMYGISPSPELGTPADFGL 247
E R ++NS ATL PEAHFD VR GI +YG SPS + A+ GL
Sbjct: 176 IEQAAEGLECR------RSLSNSAATLWHPEAHFDWVRPGIILYGASPSGQWRDIANTGL 229

Query: 248 RPVMRLVASVALVKRVPAGHGVSYGHHYTTSTETTLGLVPVGYADGIPRHASGRAPVLVG 307
RPVM L + + V+ + AG V YG YT E +G+V GYADG PRHA PVLV
Sbjct: 230 RPVMTLSSEIIGVQTLKAGERVGYGGRYTARDEQRIGIVAAGYADGYPRHAPTGTPVLVD 289

Query: 308 GVRRRIAGRVAMDQFVVDLDGD-ETEVGTEAVLFGPGDRGEPSAEDWAQACDTIAYEIVT 366
GVR G V+MD VDL + +GT L+G E +D A A T+ YE++
Sbjct: 290 GVRTMTVGTVSMDMLAVDLTPCPQAGIGTPVELWGK----EIKIDDVAAAAGTVGYELMC 345

Query: 367 RISSRVPRVHL 377
++ RVP V +
Sbjct: 346 ALALRVPVVTV 356


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_25020  PF06057  35     3e-04    Type IV secretory pathway VirJ component
		>PF06057#Type IV secretory pathway VirJ component

Length = 243

Score = 35.2 bits (81), Expect = 3e-04
Identities = 19/114 (16%), Positives = 34/114 (29%), Gaps = 25/114 (21%)

Query: 176 QADGV-TVGID------------QLGRDLKAVIDAAAPE---GRLVLVGHSMGGMTMMAL 219
Q G VG + +D A+ID E +++L+G+S G + +
Sbjct: 75 QQQGWPVVGWSSLKYYWKQKDPKDVTQDTLAIIDKYQAEFGTQKVILIGYSFGAEVIPFV 134

Query: 220 AEQYPQLIRDRVAAVAFVGTS---------SGKLGEVTFGLPIAGVNAVRRVLP 264
+ P R V + S S + + V +
Sbjct: 135 LNEMPARYRKNVLGAVLLSPSQSSDFEIHVSEMVTSDNQSARYLTLPEVNKQTT 188


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
C5746_25025  HTHFIS  27     0.034    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 27.5 bits (61), Expect = 0.034
Identities = 14/40 (35%), Positives = 22/40 (55%), Gaps = 1/40 (2%)

Query: 28 ESPEQMQALGRRIAGVLRPGDLVMLTGELGAGKTTLTRGL 67
S MQ + R +A +++ +M+TGE G GK + R L
Sbjct: 142 RSA-AMQEIYRVLARLMQTDLTLMITGESGTGKELVARAL 180


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_25045  SACTRNSFRASE  36     2e-05    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 36.5 bits (84), Expect = 2e-05
Identities = 17/81 (20%), Positives = 32/81 (39%), Gaps = 1/81 (1%)

Query: 64 IVGYAGLAAA-GDLADVQTIGVTRDHWGGGLGSELLSDLLKHATAFECAEVLLEVRVDNT 122
+G + + A ++ I V +D+ G+G+ LL ++ A ++LE + N
Sbjct: 76 CIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINI 135

Query: 123 RAQKLYERFGFEPIGFRRGYY 143
A Y + F Y
Sbjct: 136 SACHFYAKHHFIIGAVDTMLY 156


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_25060  TETREPRESSOR  57     5e-12    Tetracycline repressor protein signature.
		>TETREPRESSOR#Tetracycline repressor protein signature.

Length = 218

Score = 57.2 bits (138), Expect = 5e-12
Identities = 39/211 (18%), Positives = 77/211 (36%), Gaps = 25/211 (11%)

Query: 34 RLDPDAMVATARRIIEEEGLDALSMRRVAKELGSTPMALYHYVQDKDEL---LMLTLSGT 90
RL+ ++++ A ++ E G+D L+ R++A++LG LY +V++K L L + +
Sbjct: 3 RLNRESVIDAALELLNETGIDGLTTRKLAQKLGIEQPTLYWHVKNKRALLDALAVEILAR 62

Query: 91 AAAFPRPELPEDPRERL------LAVAVHMHGILEQIPWVLDILALGELTDRNALWMVEE 144
+ P E + L A+ + ++ LG D VE
Sbjct: 63 HHDYSLPAAGESWQSFLRNNAMSFRRALLRYRDGAKV-------HLGTRPDEKQYDTVET 115

Query: 145 IIGSALACGLSPTRAVRAYRTIWSYVYGDLVFRRAAERRAENPPSRPYFPEMITEEDAAA 204
+ G S + A + + G ++ ++ + P+ P D
Sbjct: 116 QLRFMTENGFSLRDGLYAISAVSHFTLGAVLEQQEHTAALTDRPAAP---------DENL 166

Query: 205 LPRLTEIKDRWREYAADYEVAEELDAIIEGL 235
P L E + L+++I G
Sbjct: 167 PPLLREALQIMDSDDGEQAFLHGLESLIRGF 197


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_25065  TCRTETB  129    7e-35    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 129 bits (326), Expect = 7e-35
Identities = 93/406 (22%), Positives = 162/406 (39%), Gaps = 20/406 (4%)

Query: 11 LLVVALVVQFMVALDMSVVNVALPDMRRDLGFTPEGLLWVVNAYALAFGGLLMLGGRLAD 70
+L+ ++ F L+ V+NV+LPD+ D P WV A+ L F + G+L+D
Sbjct: 15 ILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSD 74

Query: 71 LIGGSRVLTAGLVLFGAASLAGGLAWSPGG-LIAARAAQGIGAAALAPVAFALIAIAFPA 129
+G R+L G+++ S+ G + S LI AR QG GAAA P ++ +
Sbjct: 75 QLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAF-PALVMVVVARYIP 133

Query: 130 GPARSRALGLWGMAGAAGGAVGVLAGGVLTDAASWRAVMLVNVPIVVFALVGAVRSGLES 189
R +A GL G A G VG GG++ W L+ +P++ V + L
Sbjct: 134 KENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWS--YLLLIPMITIITVPFLMK-LLK 190

Query: 190 RPARTGARLDVAGALLATAGTALLVLGLVRTSTHPWGSARTLSTLGVAVFLLVVFAAVEL 249
+ R D+ G +L + G +L T+++ L V+V ++F
Sbjct: 191 KEVRIKGHFDIKGIILMSVGIVFFMLF---TTSYSISF------LIVSVLSFLIFVKHIR 241

Query: 250 RAGTPLLRLGLLKGRPVLTANLFCLLLSSGQFAAF-YFASLYMQQVLGYGPTAAGAAFV- 307
+ P + GL K P + + C + G A F M+ V G+ +
Sbjct: 242 KVTDPFVDPGLGKNIPFMIG-VLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIF 300

Query: 308 PFSVGVVAGSVVATRTVAALGTRRLLAAGGALAALGLAGFAATAQADGSFLYSILGPSLV 367
P ++ V+ + V G +L G ++ + + + + + V
Sbjct: 301 PGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFL--LETTSWFMTIIIVFV 358

Query: 368 CGAGIGMCFVPLGTAATTGVASDETGMASGLLNSARQVGGSLGLAV 413
G G+ + T ++ + E G LLN + G+A+
Sbjct: 359 LG-GLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAI 403


163  C5746_25075  C5746_25105  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
C5746_25075  -1      13       1.513932    serine/threonine protein kinase
C5746_25080  -2      13       0.308284    serine/threonine protein kinase
C5746_25085  -1      14       1.216421    serine/threonine protein kinase
C5746_25090  -2      14       1.463149    succinic semialdehyde dehydrogenase
C5746_25095  -2      14       1.207638    cholesterol oxidase
C5746_25100  -1      15       0.573242    hypothetical protein
C5746_25105  -2      14       0.112792    peptidase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
C5746_25220  TONBPROTEIN  35     5e-04    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 35.3 bits (81), Expect = 5e-04
Identities = 37/177 (20%), Positives = 54/177 (30%), Gaps = 32/177 (18%)

Query: 214 GLLYASVEGSPPYDKGSAIATLTAVMTEPLDPPKNAGPLEEVIYGLLARDPDQRLDDAGA 273
GLLY SV + ++T V L+PP+ P E + +P+
Sbjct: 26 GLLYTSVHQVIELPAPAQPISVTMVTPADLEPPQAVQPPPEPV-----VEPE-------- 72

Query: 274 RALLNDVIHAPEKPDPVVLPPADATQIMALPGQAAEPGSKPASKSGESGEGARDRLRGAL 333
PE P I +P KP + RD
Sbjct: 73 ----------PEPEPIPEPPKEAPVVI---EKPKPKPKPKPKPVKKVQEQPKRDVKPVES 119

Query: 334 RSARNAKAAPAAAVGAAAASTTAPPLPSTPPSTAPSTGSSKPARPSTPAKPAAASAA 390
R A + A + ++ A+ A P T ++ P S P PA A A
Sbjct: 120 RPASPFENTAPARLTSSTAT-AATSKPVTSVASGPRALSRNQ-----PQYPARAQAL 170


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_25225  PF05616  32     0.017    Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 31.6 bits (71), Expect = 0.017
Identities = 28/84 (33%), Positives = 34/84 (40%), Gaps = 13/84 (15%)

Query: 214 PAYETDPTGPYAPPPAPAPPVAPAEVPAARGPAAIETGRATGDPYDDDDPYDDPHPGDRY 273
P + P AP P P V+PAE P A PA E +P D D D +P
Sbjct: 311 PRPDLTPGSAEAPNAQPLPEVSPAENP-ANNPAPNENPGTRPNPEPDPDLNPDANPDT-- 367

Query: 274 ADDPHDGVPGELEAVRPPVNPALP 297
DG PG P +PA+P
Sbjct: 368 -----DGQPGT-----RPDSPAVP 381


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_25230  YERSSTKINASE  42     6e-06    Yersinia serine/threonine protein kinase signature.
		>YERSSTKINASE#Yersinia serine/threonine protein kinase signature.

Length = 732

Score = 42.0 bits (98), Expect = 6e-06
Identities = 67/252 (26%), Positives = 102/252 (40%), Gaps = 48/252 (19%)

Query: 54 VAEADRIVLHARTQKEARAAARI-----THPGVVTVHD--VIEYDDRP--WIVMQYVDG- 103
VA+ +R + E A I HP + VH V+ Y +R ++M VDG
Sbjct: 161 VAKIERSIAEGHLFAELEAYKHIYKTAGKHPNLANVHGMAVVPYGNRKEEALLMDEVDGW 220

Query: 104 ------PSLADAAKESGEIEPR----EAARIGLHVLGALRAAHGAGVLHRDVKPGNVLLA 153
+LAD+ K+ G+I I +L AGV+H D+KPGNV+
Sbjct: 221 RCSDTLRTLADSWKQ-GKINSEAYWGTIKFIAHRLLDVTNHLAKAGVVHNDIKPGNVVFD 279

Query: 154 K-DGQVLLTDFGIAAIEGDSTITRTGELVGSIDYLAPERVRGG-DPGPASDLWSLGATLY 211
+ G+ ++ D G+ + G+ T + APE G SD++ + +TL
Sbjct: 280 RASGEPVVIDLGLHSRSGEQPKGFTES------FKAPELGVGNLGASEKSDVFLVVSTLL 333

Query: 212 TAVEGRSPFRRTSPISTMQAV--VTEEPP-----------PPGRAG---PLASVITALLR 255
+EG F + I Q + +T EP PG AG IT +L
Sbjct: 334 HCIEG---FEKNPEIKPNQGLRFITSEPAHVMDENGYPIHRPGIAGVETAYTRFITDILG 390

Query: 256 KDPEDRPSAAEA 267
+ RP + EA
Sbjct: 391 VSADSRPDSNEA 402


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_25250  cloacin  41     9e-06    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 41.2 bits (96), Expect = 9e-06
Identities = 30/101 (29%), Positives = 42/101 (41%), Gaps = 8/101 (7%)

Query: 478 SDNGDGGSGSTASGGASGSGSGSGSAGPATAGGSTTGAGATSNGSTTAGAATGGSTTAGS 537
S N +GG GG + GSG S GGS +G G G S
Sbjct: 17 SGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSG 76

Query: 538 TGGSLASTGT--------VALPVAAGAAVALAAGGVLYATA 570
TGG+L++ ++ P A G AV+++AG + A A
Sbjct: 77 TGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIA 117



Score = 30.1 bits (67), Expect = 0.030
Identities = 28/90 (31%), Positives = 35/90 (38%), Gaps = 9/90 (10%)

Query: 478 SDNGDGGSGSTASGGASGSG----SGSGSAGPATAGGSTTGAGATSNGSTTAGAATGGST 533
SD S + GG SGSG GSG G S G+G N S A G
Sbjct: 34 SDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFP 93

Query: 534 TAGSTGGSLASTGTVALPVAAGAAVALAAG 563
+ G G +A+ ++AGA A A
Sbjct: 94 ALSTPGA-----GGLAVSISAGALSAAIAD 118


164  C5746_25140  C5746_25185  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
C5746_25140  0       13       -0.453129   hypothetical protein
C5746_25145  -1      14       0.579985    hypothetical protein
C5746_25150  -2      13       1.178709    hypothetical protein
C5746_25155  -3      12       0.668602    histidine kinase
C5746_25160  0       13       1.411248    DNA-binding response regulator
C5746_25170  2       16       0.740263    GNAT family N-acetyltransferase
C5746_25175  1       17       0.210576    hypothetical protein
C5746_25180  0       17       0.742840    hypothetical protein
C5746_25185  -1      14       0.734862    glycoside hydrolase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_25275  TCRTETB  28     0.027    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 27.5 bits (61), Expect = 0.027
Identities = 13/43 (30%), Positives = 20/43 (46%), Gaps = 7/43 (16%)

Query: 72 SSAIPALVDLAL-------KNPGAFGYAIAIGELAVGIGTLIG 107
++A PALV + + AFG +I + G+G IG
Sbjct: 117 AAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIG 159


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_25290  PF06580  33     0.002    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 32.9 bits (75), Expect = 0.002
Identities = 15/80 (18%), Positives = 27/80 (33%), Gaps = 8/80 (10%)

Query: 347 NAAKYG----GEGGAVQVYAEVEGRTVFVSVRDRGPGFDLDSVPGDRMG---VRESIIGR 399
N K+G +GG + + + TV + V + G ++ G VRE +
Sbjct: 266 NGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGTGLQNVRERLQML 325

Query: 400 MQRNGGTARLRSVPGGGTEV 419
+L G +
Sbjct: 326 YGTEAQI-KLSEKQGKVNAM 344


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
C5746_25295  HTHFIS  44     2e-07    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 44.1 bits (104), Expect = 2e-07
Identities = 23/110 (20%), Positives = 42/110 (38%), Gaps = 8/110 (7%)

Query: 14 RVRVVLVDDHRMFRTGVQAEIGRTEETGVEVVGEAADVDQAVTVITATRPEVVLLDVHLP 73
+++ DD RT + + R G +V ++ I A ++V+ DV +P
Sbjct: 3 GATILVADDDAAIRTVLNQALSRA---GYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMP 58

Query: 74 GGGGVEVLRRCAPLMGAAENPVRFLALSVSDAAEDVIGVIRGGARGYVTK 123
++L R + A + L +S + I GA Y+ K
Sbjct: 59 DENAFDLLPR----IKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPK 104


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_25300  SACTRNSFRASE  29     0.011    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 29.1 bits (65), Expect = 0.011
Identities = 14/57 (24%), Positives = 25/57 (43%), Gaps = 3/57 (5%)

Query: 190 LAVAAAFRRRGIGAALCAHLTATAFELGCRVVWLEPGDADVERIYAGIGYRRVGEKL 246
+AVA +R++G+G AL A E + LE D ++ + Y + +
Sbjct: 95 IAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHF---YAKHHFII 148


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
C5746_25305  RTXTOXIND  39     3e-05    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.
Length = 478

Score = 38.7 bits (90), Expect = 3e-05
Identities = 26/207 (12%), Positives = 52/207 (25%), Gaps = 24/207 (11%)

Query: 15 SAARTAATLAIAGAATAAALPGSAHADPQLTPAQVRAEVDRLYHDAEVATEQYNGAKEKA 74
R L Q + + L EQ++ + +
Sbjct: 149 EQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIK------EQFSTWQNQK 202

Query: 75 TAAADAVDTLRDEAARRTERLNASRNALGSLAAAQYRTGGLDPAVQLALSSDPDQYLERA 134
+D R E R+N N + R + Q + +
Sbjct: 203 YQKELNLDKKRAERLTVLARINRYENLSRVE---KSRLDDFSSLLH-------KQAIAKH 252

Query: 135 SYVDRVGDRQAAEVDG-VERQVARIAQLRSQAGGKLAALASRQAELKKHKATIRTKLADA 193
+ +++ + + E + +++ Q+ S+ K I KL
Sbjct: 253 AVLEQ--ENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKN---EILDKLRQT 307

Query: 194 QRLLATLSPAERAAYESSDSAHSAGRA 220
+ L+ A S RA
Sbjct: 308 TDNIGLLT--LELAKNEERQQASVIRA 332


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
C5746_25315  IGASERPTASE  40     2e-05    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 39.7 bits (92), Expect = 2e-05
Identities = 25/161 (15%), Positives = 49/161 (30%), Gaps = 8/161 (4%)

Query: 110 AAQYRTGSIAPSTTFFLADDPQSYFDESQLMSRMTGQQQKAVTDFRTQQAKASKKRAEAV 169
+ IA + + E+ +Q+ + Q A + + V
Sbjct: 1009 SVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREV 1068

Query: 170 QSLETLTSTQTTLRTSKRNVQEKLGEARAMLSKLTA----EEKARLAALERKKE----AE 221
T + E + +K TA EEKA++ + ++ ++
Sbjct: 1069 AKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQ 1128

Query: 222 AKRKAQELARQQAAAEAEADRKAAEAAKETESGSGTGTGTG 262
K ++ Q AE + KE +S + T T
Sbjct: 1129 VSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTE 1169



Score = 29.6 bits (66), Expect = 0.025
Identities = 38/296 (12%), Positives = 77/296 (26%), Gaps = 36/296 (12%)

Query: 10 PRTRVRTTTPAVGLTTAALASVTLLSAQSATAAPAPKPGIEDVQKKVDDLYRQAGTATQQ 69
++ P+V +A V PAP E + ++ +++ T +
Sbjct: 999 TPNNIQADVPSVPSNNEEIARVDEAPVPP----PAPATPSETTETVAENSKQESKTVEKN 1054

Query: 70 YNRAKEA----------------STTQRAKVDALLEDVAERADKLNDARRTLGAYAAAQY 113
A E + TQ +V + E T+ A+
Sbjct: 1055 EQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKV 1114

Query: 114 RTGSIAPSTTFFLADDPQSYFDESQLMSRMTGQQQKAVTDFRTQQAKASKKRAEAVQSLE 173
T P+ E+ ++ + + Q++ + + E
Sbjct: 1115 ETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKE 1174

Query: 174 T------LTSTQTTLRTSKRNVQEKLGEARAMLSKLTAEEKARLAALERKKEAEAKRKAQ 227
T + TT+ T V+ A T + + + K +
Sbjct: 1175 TSSNVEQPVTESTTVNTGNSVVENPENTTPA-----TTQPTVNSESSNKPKNRHRRSVRS 1229

Query: 228 ELARQQAAAEAEADRKAAEAAKETESGSGTGTGTGTGSDSSATTKAEKALAFAAAQ 283
+ A + DR T + + SD+ A + A
Sbjct: 1230 VPHNVEPATTSSNDRSTVALCDLTSTNTNAVL-----SDARAKAQFVALNVGKAVS 1280


165  C5746_25590  C5746_25665  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
C5746_25590  2       19       -0.618750   BMP family ABC transporter substrate-binding
C5746_25595  5       15       0.919112    heme ABC transporter ATP-binding protein
C5746_25600  3       14       0.869338    sugar ABC transporter permease
C5746_25610  -1      12       0.744164    ABC transporter permease
C5746_25615  -1      12       0.598852    cytidine deaminase
C5746_25620  -2      11       -0.156663   thymidine phosphorylase
C5746_25625  -2      11       -0.598997   restriction endonuclease
C5746_25630  -2      11       -0.508548   peptidase S8
C5746_25635  -1      11       -0.252198   hypothetical protein
C5746_25645  -2      9        0.718197    hypothetical protein
C5746_25655  0       11       1.305253    RNA polymerase subunit sigma-70
C5746_25660  0       15       1.665765    hypothetical protein
C5746_25665  1       19       1.704759    MFS transporter
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_25710  LIPPROTEIN48  67     3e-14    Mycoplasma P48 major surface lipoprotein signature.
		>LIPPROTEIN48#Mycoplasma P48 major surface lipoprotein signature.

Length = 428

Score = 66.6 bits (162), Expect = 3e-14
Identities = 69/319 (21%), Positives = 118/319 (36%), Gaps = 50/319 (15%)

Query: 13 VTAALALTATACGESS--------------TESSGSDKGKMKIGMAYDV--------GGR 50
+ A L A +CG + T ++ + K +K + G
Sbjct: 14 IAAILPAVAVSCGNNDESNISFKEKDISKYTTTNANGKQVVKNAELLKLKPVLITDEGKI 73

Query: 51 GDNSFNDSAARGLDKAKSEFSAETKELTAKNGETPADREQRLESLAEGGYNPVIAVGFAY 110
D SFN SA L + E N E ++ E S G+ + GF +
Sbjct: 74 DDKSFNQSAFEALKAINKQTGIE-----INNVEPSSNFESAYNSALSAGHKIWVLNGFKH 128

Query: 111 KDAVDKIAAKY------PKVSFGLVD--SVATAKNVDSIVFTEEQGSYLAGVAAA--LKS 160
+ ++ + + ++ +D K S+ F ++ ++ G A A L
Sbjct: 129 QQSIKQYIDAHREELERNQIKIIGIDFDIETEYKWFYSLQFNIKESAFTTGYAIASWLSE 188

Query: 161 KDGQ---VGFIGGVDLPLIKKFAAGFQQGVLD-TKPKAKVQIQYLSTGSDLSGFGSPDKG 216
+D V GG P + F GF +G+L + +I + S SGF + +K
Sbjct: 189 QDESKRVVASFGGGAFPGVTTFNEGFAKGILYYNQKHKSSKIYHTSPVKLDSGFTAGEKM 248

Query: 217 KEAAKGMLDKGADVIFAAAGG--SGAGSIEAVAGK---KGAWSIGVDSDQALDPALSKYK 271
+L + S AG + KG + IGVDSDQ + + K
Sbjct: 249 NTVINNVLSSTPADVKYNPHVILSVAGPATFETVRLANKGQYVIGVDSDQ----GMIQDK 304

Query: 272 DTIMTSVVKNVDTGVFDLV 290
D I+TSV+K++ V++ +
Sbjct: 305 DRILTSVLKHIKQAVYETL 323


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_25715  BCTERIALGSPD  32     0.007    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.
Length = 660

Score = 31.8 bits (72), Expect = 0.007
Identities = 23/103 (22%), Positives = 36/103 (34%), Gaps = 16/103 (15%)

Query: 220 TTTKQLAELMVGSELPSPETRESTVTDIPMLTVE------GLRLTAT-DPDGVVRSVLDG 272
T A VG E+P ++T D TVE L++ + V ++
Sbjct: 447 TLDNMEATFNVGQEVPVLTGSQTTSGDNIFNTVERKTVGIKLKVKPQINEGDSVLLEIE- 505

Query: 273 IGFTIHKGEVLGIAGVEGNGQSELVDAIMGMRHLDTGVLTLDG 315
EV +A + S+L A R ++ VL G
Sbjct: 506 -------QEVSSVADAASSTSSDL-GATFNTRTVNNAVLVGSG 540


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_25725  PREPILNPTASE  29     0.026    Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.
Length = 290

Score = 29.4 bits (66), Expect = 0.026
Identities = 23/71 (32%), Positives = 33/71 (46%), Gaps = 5/71 (7%)

Query: 62 AALAMAVPIGLAGLGGLWAERAGV-VNIGLEGM----MILGTFFGAWAGWQTNPWLGVLA 116
+L AV +AG LW+ + G EGM L GAW GWQ P + +L+
Sbjct: 179 VSLGDAVIGAMAGYLVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLS 238

Query: 117 GVIGGALGGLL 127
++G +G L
Sbjct: 239 SLVGAFMGIGL 249


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
C5746_25735  RTXTOXIND  30     0.020    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.
Length = 478

Score = 29.8 bits (67), Expect = 0.020
Identities = 12/41 (29%), Positives = 16/41 (39%), Gaps = 3/41 (7%)

Query: 368 ELHAKPGDTVTAGQPLLTLHTDTPEKFDYALKALPSSYDIA 408
E+ K G++V G LL L T + SS A
Sbjct: 109 EIIVKEGESVRKGDVLLKL---TALGAEADTLKTQSSLLQA 146


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
C5746_25745  SUBTILISIN  233    6e-71    Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 233 bits (595), Expect = 6e-71
Identities = 111/304 (36%), Positives = 152/304 (50%), Gaps = 38/304 (12%)

Query: 188 VPQIGVPSAWKAGYTGKGVKIAVLDTGVDTTHPDLQGQILDTKNFTSS-----PDTKDRV 242
V I P+ W G+GVK+AVLDTG D HPDL+ +I+ +NFT KD
Sbjct: 26 VEMIQAPAVWNQT-RGRGVKVAVLDTGCDADHPDLKARIIGGRNFTDDDEGDPEIFKDYN 84

Query: 243 GHGTHVSSIAAGTGAKSGGKLKGVAPDAKLLEGKVLDDDGFGDDSGILAGMEWAVAQGAD 302
GHGTHV+ A T ++G GVAP+A LL KVL+ G G I+ G+ +A+ Q D
Sbjct: 85 GHGTHVAGTIAATENENGVV--GVAPEADLLIIKVLNKQGSGQYDWIIQGIYYAIEQKVD 142

Query: 303 IINLSLGGGDTPEIDPLEAAVNKLSADKGVLFAIAAGNEGAGAG---TVGSPGSADAALT 359
II++SLGG + ++ L AV K +L AAGNEG G +G PG + ++
Sbjct: 143 IISMSLGGPE--DVPELHEAVKKA-VASQILVMCAAGNEGDGDDRTDELGYPGCYNEVIS 199

Query: 360 VGAVDDNDELADFSSRGPRNGDGAIKPDVTAPGVDITAAAAPGSAIDQEVGQNPPGYLTI 419
VGA++ + ++FS+ + D+ APG DI + G Y T
Sbjct: 200 VGAINFDRHASEFSNSNN-------EVDLVAPGEDILSTVPGG------------KYATF 240

Query: 420 SGTSMATPHVAGAAALLKQQ-----HPQWKYAELKGALTASTKPGAYTPFEQGSGRIAVD 474
SGTSMATPHVAGA AL+KQ EL L T P +P +G+G + +
Sbjct: 241 SGTSMATPHVAGALALIKQLANASFERDLTEPELYAQLIKRTIPLGNSPKMEGNGLLYLT 300

Query: 475 KAIA 478

Sbjct: 301 AVEE 304


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_25770  PF06776  32     0.001    Invasion associated locus B
		>PF06776#Invasion associated locus B

Length = 214

Score = 32.2 bits (73), Expect = 0.001
Identities = 13/46 (28%), Positives = 17/46 (36%), Gaps = 1/46 (2%)

Query: 11 RARRALALSVTGLMAAPALVLGTGGSAQAASCTTST-GPYQKQVEK 55
ARR A + A AL G A A S G +Q + +
Sbjct: 44 LARRNGARLMLAGAMAIALSFGWSDRADAQGAVRSVHGDWQIRCDT 89


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_25775  TCRTETB  61     4e-12    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 60.7 bits (147), Expect = 4e-12
Identities = 64/366 (17%), Positives = 117/366 (31%), Gaps = 49/366 (13%)

Query: 68 LPAISASLGATAGQASWTVSAATGALALCVLPMSALSERFGRRQMMTASLTVAVLVGLLV 127
LP I+ +W +A ++ LS++ G ++++ + + ++
Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIG 96

Query: 128 PFAPS-LGWLIALRAVQGAALAGLPASAMAYLAEEVRPKALVAAIGLFVAGNSIGGMSGR 186
S LI R +QGA A PA M +A + + A GL + ++G G
Sbjct: 97 FVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGP 156

Query: 187 ILTGWVAQLWGWRAALGAVGLLAVACAVVFHFMIPRARNFTPGTLNPKALAKTVGEHLSD 246
+ G +A W L + + + + R G + K + +
Sbjct: 157 AIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVR--IKGHFDIKGIILMSVGIVFF 214

Query: 247 PLLRRLYAIGALFMTVFGAVYTVIGYRLVEAPF--------------------------- 279
L Y+I L ++V + V R V PF
Sbjct: 215 MLFTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAG 274

Query: 280 -------------SLPQGVVGSIFL--VYLVGTVSSAAAGRLVARLGRRGALYLAVSTTA 324
L +GS+ + + + G LV R G L + V+ +
Sbjct: 275 FVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLS 334

Query: 325 AGLL-LSLADQLAAVL--LGLVLITAGFFAGHAVASSSVSRTATKGRAQASA-LYQSAYY 380
L S + + + +V + G V S+ VS + + A A L +
Sbjct: 335 VSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSF 394

Query: 381 VGSSAG 386
+ G
Sbjct: 395 LSEGTG 400


166  C5746_25710  C5746_25735  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
C5746_25710  0       20       -1.180963   two-component sensor histidine kinase
C5746_25715  1       18       -0.675982   DNA-binding response regulator
C5746_25720  -2      16       -0.946521   SigE family RNA polymerase sigma factor
C5746_25725  -1      14       -0.362778   uridine kinase
C5746_25730  0       11       -0.147798   hypothetical protein
C5746_25735  0       10       0.022079    aldehyde dehydrogenase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_25815  PF06580  41     7e-06    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 41.0 bits (96), Expect = 7e-06
Identities = 16/107 (14%), Positives = 36/107 (33%), Gaps = 24/107 (22%)

Query: 388 ILANLIGNALKHGGSP------VRVSVRTEGDELVIEVRDHGPGIPEDVLPHVFDRFYKA 441
++ L+ N +KHG + + + + + +EV + G ++
Sbjct: 259 LVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNT----------- 307

Query: 442 SASRPRSEGSGLGLSIAMEN-AHIHGGDITAANSPDGDGAVFVLRLP 487
E +G GL E ++G + S ++ +P
Sbjct: 308 ------KESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
C5746_25820  HTHFIS  96     6e-25    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 95.7 bits (238), Expect = 6e-25
Identities = 40/139 (28%), Positives = 74/139 (53%), Gaps = 4/139 (2%)

Query: 7 LLLIEDDDAIRTALELSLSRQGHRVATAATGEDGLKLLREQRPDLIVLDVMLPGIDGFEV 66
+L+ +DD AIRT L +LSR G+ V + + + DL+V DV++P + F++
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 67 CRRIRRTD-QLPIILLTARSDDIDVVVGLESGADDYVVKP-VQGRVLDARIRAV--LRRG 122
RI++ LP+++++A++ + + E GA DY+ KP ++ RA+ +R
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRR 125

Query: 123 ERESTDSAAFGNVVIDRSA 141
+ D + G ++ RSA
Sbjct: 126 PSKLEDDSQDGMPLVGRSA 144


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_25825  PF05272  30     0.014    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.7 bits (66), Expect = 0.014
Identities = 15/57 (26%), Positives = 23/57 (40%), Gaps = 6/57 (10%)

Query: 59 MAVVDAPVGANGGSGGG------AAYGEVIGERKPPAQAEDAETAFTAYVRERRASL 109
+A V +P A GG+GGG + P +D E F ++ + A L
Sbjct: 384 LADVSSPTAAAGGAGGGEPPKKRDPSAGAGTDPGGPGGGDDGEDPFGEWLDDEVARL 440


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_25840  DNABINDINGHU  27     0.025    Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 27.3 bits (61), Expect = 0.025
Identities = 11/28 (39%), Positives = 16/28 (57%)

Query: 87 KDQFVREVADAEGLSKSKAAAVVDAAID 114
K + +VA+A L+K +AA VDA
Sbjct: 4 KQDLIAKVAEATELTKKDSAAAVDAVFS 31


167  C5746_26210  C5746_26250  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
C5746_26210  -1      11       1.381541    response regulator
C5746_26215  -1      12       1.431168    hybrid sensor histidine kinase/response
C5746_26220  0       14       1.917158    hypothetical protein
C5746_26225  0       14       1.329409    hypothetical protein
C5746_26230  2       16       1.824840    AraC family transcriptional regulator
C5746_26235  1       17       1.329311    hypothetical protein
C5746_26240  -1      16       0.501830    TetR family transcriptional regulator
C5746_26245  -1      16       -1.154359   siderophore-interacting protein
C5746_26250  -1      17       -1.668282   N-acyl-D-amino-acid deacylase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
C5746_26335  HTHFIS  67     7e-15    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 66.8 bits (163), Expect = 7e-15
Identities = 31/153 (20%), Positives = 56/153 (36%), Gaps = 13/153 (8%)

Query: 12 SSILIVDDMEENLVALEAVLGSLA-QVVRAQSGEEALKAMLRQEFAVVLIDVMMPGMNGF 70
++IL+ DD L L V + + + + +V+ DV+MP N F
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 71 ETAANIKGLDQTKDVPIILLTGAAVDPNYAYRGYTVGAADFLIKPFDPWLLRTKVNVFLD 130
+ IK D+P+++++ A + GA D+L KPFD L +
Sbjct: 64 DLLPRIKKAR--PDLPVLVMSAQN-TFMTAIKASEKGAYDYLPKPFD---LTELIG---- 113

Query: 131 LHRKNRQLVDQAEQLKRLLTADGGHPGIPSAAP 163
R L + + +L + +
Sbjct: 114 --IIGRALAEPKRRPSKLEDDSQDGMPLVGRSA 144


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
C5746_26340  HTHFIS  87     2e-19    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 86.8 bits (215), Expect = 2e-19
Identities = 39/137 (28%), Positives = 62/137 (45%), Gaps = 3/137 (2%)

Query: 1264 RILIVDDDIRNVFALTHVLGRVGISVKYAENGREGLEVLDRTPDVSLVLMDIMMPEMDGY 1323
IL+ DDD L L R G V+ N + LV+ D++MP+ + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGD-GDLVVTDVVMPDENAF 63

Query: 1324 EMIAAIRRTPRFAGLPVIALTAKAMPGDREKAIESGANDYVPKPVDVDRLLSVICRLLDP 1383
+++ I++ LPV+ ++A+ KA E GA DY+PKP D+ L+ +I R L
Sbjct: 64 DLLPRIKK--ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 1384 QCTSPEPRPDPSDNARS 1400
P D S +
Sbjct: 122 PKRRPSKLEDDSQDGMP 138


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
C5746_26375  V8PROTEASE  41     3e-06    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 41.1 bits (96), Expect = 3e-06
Identities = 28/161 (17%), Positives = 53/161 (32%), Gaps = 15/161 (9%)

Query: 71 PALVLSNGHCMESGFPGPGEVVFNQPSTRSFTLLNASGSGVGTLRASKIAYGTMTDTDIS 130
+L+N H +++ P + + N G A +I + D++
Sbjct: 111 KDTLLTNKHVVDATHGDPHALKAFPSAI------NQDNYPNGGFTAEQI-TKYSGEGDLA 163

Query: 131 VYQLTRTYAQIESSYGIKALELNTAHPVQ-GTAITVVSGYWKRTYACNVDGFVYRLKEGE 189
+ + + +K ++ Q ITV Y + +G+
Sbjct: 164 IVKFSPNEQNKHIGEVVKPATMSNNAETQVNQNITVTG------YPGDKPVATMWESKGK 217

Query: 190 WTWKDSVRYTSACQTIGGTSGSPVIDNATGKVVAVNNTGNE 230
T+ T GG SGSPV N +V+ ++ G
Sbjct: 218 ITYLKGEAMQYDLSTTGGNSGSPVF-NEKNEVIGIHWGGVP 257


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_26380  TETREPRESSOR  58     3e-12    Tetracycline repressor protein signature.
		>TETREPRESSOR#Tetracycline repressor protein signature.

Length = 218

Score = 58.0 bits (140), Expect = 3e-12
Identities = 44/231 (19%), Positives = 85/231 (36%), Gaps = 38/231 (16%)

Query: 38 LSADAIVTAAIAVADEDGMAALSMRAVGERLGRTAMALYTYVPSKSELLDLMYDAVHAEL 97
L+ ++++ AA+ + +E G+ L+ R + ++LG LY +V +K LLD + + A
Sbjct: 4 LNRESVIDAALELLNETGIDGLTTRKLAQKLGIEQPTLYWHVKNKRALLDALAVEILARH 63

Query: 98 PTRYAGIDN--WRTGLTDWAHDTLAFLLRHPWVLQVSQARPVLGPHEYTGLDTLVRLLNG 155
W++ L + A LLR+ +V +Y ++T +R +
Sbjct: 64 HDYSLPAAGESWQSFLRNNAMSFRRALLRYRDGAKVHLGT-RPDEKQYDTVETQLRFMTE 122

Query: 156 TGLDASLRRRLVGTLTHFVRGCAGTVAEARQAAAVTGESDEEWWFARSALLGEVAPDFAD 215
G + ++HF G V E ++ A D
Sbjct: 123 NGFSLRDGLYAISAVSHFTLGA---VLEQQEHTA----------------------ALTD 157

Query: 216 RYPSLSAMESEGTPQQPPSAPGEEAMPYLEREARE-TFAVGLDVMLDGIDA 265
R P + EA+ ++ + E F GL+ ++ G +
Sbjct: 158 R---------PAAPDENLPPLLREALQIMDSDDGEQAFLHGLESLIRGFEV 199


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
C5746_26390  UREASE  45     5e-07    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 45.1 bits (107), Expect = 5e-07
Identities = 23/73 (31%), Positives = 35/73 (47%), Gaps = 9/73 (12%)

Query: 1 MDVVIRNALVIDGTGTPAHRADVAIGDGRIIEIHPEHAPGPRP-------TGRRTLDADG 53
+D VI NAL++D G +AD+ + DGRI I P +P G + +G
Sbjct: 68 VDTVITNALILDHWGI--VKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAGEG 125

Query: 54 LALSPGFIDMHAH 66
++ G +D H H
Sbjct: 126 KIVTAGGMDSHIH 138


168  C5746_26870  C5746_26920  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_26870  4       17       -1.497236  translational GTPase TypA
C5746_26875  3       17       -1.992413  hypothetical protein
C5746_26880  4       16       -1.174985  peptide ABC transporter substrate-binding
C5746_26885  6       13       -1.868333  ABC transporter permease
C5746_26890  4       8        0.206790   peptide ABC transporter permease
C5746_26895  4       9        0.854676   methionine ABC transporter ATP-binding protein
C5746_26900  4       9        0.658931   peptide ABC transporter ATP-binding protein
C5746_26905  3       9        0.808505   dipeptide/oligopeptide/nickel ABC transporter
C5746_26910  2       11       1.007672   peptide ABC transporter ATP-binding protein
C5746_26915  2       12       1.368795   ABC transporter
C5746_26920  0       11       0.505105   ABC transporter permease
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
C5746_27015  TCRTETOQM  154    4e-42    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 154 bits (391), Expect = 4e-42
Identities = 89/463 (19%), Positives = 169/463 (36%), Gaps = 73/463 (15%)

Query: 7 IRNVAIVAHVDHGKTTLVDAMLRQAGAFAAHAAENLDERMMDSNDLEREKGITILAKNTA 66
I N+ ++AHVD GKTTL +++L +GA + + D+ LER++GITI T+
Sbjct: 3 IINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITS 62

Query: 67 VKYHPKDGGDPITINIIDTPGHADFGGEVERGLSMVDAVVLLVDASEGPLPQTRFVLRKA 126
++ +NIIDTPGH DF EV R LS++D +LL+ A +G QTR +
Sbjct: 63 FQWEN------TKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHAL 116

Query: 127 LAAKMPVILCINKTDR-------------------------------------PDSRIAE 149
+P I INK D+ +S +
Sbjct: 117 RKMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWD 176

Query: 150 VVDETYDLFLDLDADEDQIEFPIVYACARDGVASLTKPEDGTV-------PQDSENLEPF 202
V E D L+ +E + + ++ +++ ++
Sbjct: 177 TVIEGNDDLLEKYMSGKSLEALELEQEESIRFH------NCSLFPVYHGSAKNNIGIDNL 230

Query: 203 FSTILSHVPAPEYDDEAPLQAHVTNLDADNFLGRIALCRVEQGELRKGQTVTWIKRDGTM 262
I + + + ++ L V ++ R+A R+ G L +V +++
Sbjct: 231 IEVITNKFYSSTHRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKE--- 287

Query: 263 SNVRITELLMTEALTRKPAEKAGPGDICAIAGIPDIMIGETLADPENPIALPLITVDEPA 322
++ITE+ + +KA G+I + + + L D + I P
Sbjct: 288 -KIKITEMYTSINGELCKIDKAYSGEIVILQN-EFLKLNSVLGDTKLLPQRERIENPLPL 345

Query: 323 ISMTIGTNTSPLVGKGGKGHKVTARQVKDRLDRELIGNVSLRVLDTERPDAWEVQGRGEL 382
+ T+ + + D L + LR + G++
Sbjct: 346 LQTTVEPS-----------KPQQREMLLDALLEISDSDPLLRYYVDSATHEIILSFLGKV 394

Query: 383 ALAILVEQMRRE-GFELTVGKPEVVTKQVDGKTHEPIERMTID 424
+ + ++ + E+ + +P V+ + K E + +
Sbjct: 395 QMEVTCALLQEKYHVEIEIKEPTVIYMERPLKKAEYTIHIEVP 437



Score = 40.2 bits (94), Expect = 2e-05
Identities = 16/89 (17%), Positives = 33/89 (37%), Gaps = 1/89 (1%)

Query: 408 KQVDGKTHEPIERMTIDSPEEHLGAITQLMATRKGRMETMTNHGSGWVRMEWIVPSRGLI 467
K+ + EP I +P+E+L + V + +P+R +
Sbjct: 529 KKAGTELLEPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQLKN-NEVILSGEIPARCIQ 587

Query: 468 GFRTEFLTQTRGTGIAHSIFEGHEPWFGE 496
+R++ T G + + +G+ GE
Sbjct: 588 EYRSDLTFFTNGRSVCLTELKGYHVTTGE 616


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
C5746_27040  HTHFIS  29     0.040    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 28.6 bits (64), Expect = 0.040
Identities = 10/19 (52%), Positives = 15/19 (78%)

Query: 55 TLAVLGESGSGKSVTAQAI 73
TL + GESG+GK + A+A+
Sbjct: 162 TLMITGESGTGKELVARAL 180


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
C5746_27045  HTHFIS  29     0.027    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 29.4 bits (66), Expect = 0.027
Identities = 26/95 (27%), Positives = 33/95 (34%), Gaps = 11/95 (11%)

Query: 9 GSLDSTPNVTDVVEVEAADETAAVAAIEAPVERGEPILQVRNLVKHFPLSQGILFKRQIG 68
G+ D P D+ E+ A P + + LV Q I R +
Sbjct: 97 GAYDYLPKPFDLTELIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIY--RVLA 154

Query: 69 AVKAVDGVSFDLYQGETLGIVGESGCGKSTVARLL 103
+ D TL I GESG GK VAR L
Sbjct: 155 RLMQTDL---------TLMITGESGTGKELVARAL 180


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_27050  BCTERIALGSPD  30     0.024    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 29.5 bits (66), Expect = 0.024
Identities = 22/85 (25%), Positives = 28/85 (32%), Gaps = 8/85 (9%)

Query: 245 DLYGNPRHPYTRALLSAVPEATADEAPARERIRLAGDVPSPVNPPSGCRFRTRCWKATDK 304
D+YG +L V A A A V S P G TR T+
Sbjct: 86 DVYGFAVINMNNGVLKVVRSKDAKTA--------AVPVASDAAPGIGDEVVTRVVPLTNV 137

Query: 305 CASEAPPLVRVEGSREGHLTACHYP 329
A + PL+R G + HY
Sbjct: 138 AARDLAPLLRQLNDNAGVGSVVHYE 162


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
C5746_27070  RTXTOXIND  29     0.027    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 29.0 bits (65), Expect = 0.027
Identities = 7/41 (17%), Positives = 18/41 (43%)

Query: 181 AFWPVLLSILVSPEENTPVWLTVVSLISVMTAFGWASIARL 221
F P L ++ +P P + + ++ AF + + ++
Sbjct: 40 EFLPAHLELIETPVSRRPRLVAYFIMGFLVIAFILSVLGQV 80


169  C5746_26975  C5746_27020  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_26975  -2      12       -1.418229  hypothetical protein
C5746_26980  -1      12       -1.676629  sulfate ABC transporter ATP-binding protein
C5746_26985  1       10       0.111346   ABC transporter
C5746_26990  0       12       0.251761   histidine kinase
C5746_26995  0       11       0.454079   DNA-binding response regulator
C5746_27000  0       15       -1.634636  class F sortase
C5746_27005  -1      16       -2.915445  hypothetical protein
C5746_27010  -1      16       -3.028151  GTP-binding protein
C5746_27015  -2      18       -5.008279  hypothetical protein
C5746_27020  -2      16       -5.288449  GNAT family N-acetyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_27130  cloacin  35     6e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 35.5 bits (81), Expect = 6e-04
Identities = 33/132 (25%), Positives = 48/132 (36%), Gaps = 4/132 (3%)

Query: 121 SALDEFGHGGPAAGPRPTPNLGVRTPGPSGPTAGPVTGASSLTPNLDGAG---GPGPGGP 177
S D GH A N G G G + +S P G+G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 178 RGPGGPAGPQGPGA-SGGVPPRMSDDTAVLTPQFAAPASAGPGGNVSGDTLTSGIPVVPP 236
G GG G G G+ +GG ++ A P + P + G ++S L++ I +
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMA 121

Query: 237 EHRSPSLFPSAG 248
+ P F G
Sbjct: 122 ALKGPFKFGLWG 133


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_27145  PF06580  29     0.022    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 29.4 bits (66), Expect = 0.022
Identities = 49/325 (15%), Positives = 99/325 (30%), Gaps = 45/325 (13%)

Query: 54 GALGLLVFVGAYLVLVFRHTSKALDRRAVHTTIAFLGALAAV--LSLTLGAAWLVLFVYV 111
G L F A L + S + I+ +G + S WL L +
Sbjct: 21 GVYTLTGFGFASLYGSPKLHS-----MIFNIAISLMGLVLTHAYRSFIKRQGWLKLNMGQ 75

Query: 112 AVSVGATMPLRNARWLIPVVVSALVCIGLTVDHPREITTAL----VFPALFGGFAMTGVR 167
+ + S + P T L +F + F + +
Sbjct: 76 IILRVLPACVVIGMVWFVANTSIWRLLAFINTKPVAFTLPLALSIIFNVVVVTFMWSLLY 135

Query: 168 QLIRTTIQLREARATVAQLAANEERLRLARDLHDLLGHSL--SLITLKSELAGRMLPGHP 225
++A ++A+ + +L + H + +L +++ ++ P
Sbjct: 136 FGWHFFKNYKQAEIDQWKMASMAQEAQLMALKAQINPHFMFNALNNIRA-----LILEDP 190

Query: 226 EQAAAQVADIEQVSRQALVDVRSAVTGYRRPTLPGELAGARTALAAAGI--------TAD 277
+A + + ++ R +L + +L EL + L A I
Sbjct: 191 TKAREMLTSLSELMRYSLRYSNARQV-----SLADELTVVDSYLQLASIQFEDRLQFENQ 245

Query: 278 VPADAPDGLPEKPEEVLAWALREAVTNVVRH---SSARHCTVILTPRQTLDGRALELTVA 334
+ +V ++ V N ++H + ++L + D + L V
Sbjct: 246 INPAI------MDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTK--DNGTVTLEVE 297

Query: 335 DDGVGAAGTKP---GNGLTGITERL 356
+ G A G GL + ERL
Sbjct: 298 NTGSLALKNTKESTGTGLQNVRERL 322


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
C5746_27150  HTHFIS  64     2e-14    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 64.1 bits (156), Expect = 2e-14
Identities = 26/135 (19%), Positives = 51/135 (37%), Gaps = 2/135 (1%)

Query: 1 MSMIRLLLAEDQSMVREALAALLGLEPDIEVVAQVARGDEVLAAAHEHRIDVALLDIEMP 60
M+ +L+A+D + +R L L V + + D+ + D+ MP
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAG--YDVRITSNAATLWRWIAAGDGDLVVTDVVMP 58

Query: 61 GMTGIDAAAALRRELPAVKVVVVTTFGRPGYLRRAMESGADAFLVKDAPAAQLAAAVRKV 120
D +++ P + V+V++ +A E GA +L K +L + +
Sbjct: 59 DENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118

Query: 121 LAGERVIDPTLAAAA 135
LA + L +
Sbjct: 119 LAEPKRRPSKLEDDS 133


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_27160  PF03544  28     0.040    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 27.6 bits (61), Expect = 0.040
Identities = 23/132 (17%), Positives = 35/132 (26%), Gaps = 10/132 (7%)

Query: 20 LGRALMWP---------AVAAGVGMLVIYNSIETSVDDKPPAPPAAVAPAAVPEVLGSHA 70
L R WP AV AG+ ++ IE +P + VAPA +
Sbjct: 10 LPRRFPWPTLLSVCIHGAVVAGLLYTSVHQVIELPAPAQPISV-TMVAPADLEPPQAVQP 68

Query: 71 APRPVTPSAPAVGPAMSRSVPTRLQIPSLAVKAPFTDLSIGADGRLNPPPPNDSNLVGWF 130
P PV P P + I K + + +
Sbjct: 69 PPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVESRPASP 128

Query: 131 KGGVTPGERGAA 142
P ++
Sbjct: 129 FENTAPARPTSS 140


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
C5746_27170  TCRTETOQM  631    0.0      Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 631 bits (1630), Expect = 0.0
Identities = 221/667 (33%), Positives = 345/667 (51%), Gaps = 32/667 (4%)

Query: 1 MHTLNLGILAHVDAGKTSLTERLLHTAGVIDEIGSVDDGSTRTDSLALERQRGITIKSAV 60
M +N+G+LAHVDAGKT+LTE LL+ +G I E+GSVD G+TRTD+ LERQRGITI++ +
Sbjct: 1 MKIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGI 60

Query: 61 VSFAIDDITVNLIDTPGHPDFIAEVERVLNVLDGAVLVISAVEGVQAQTRVLMRTLQRLR 120
SF ++ VN+IDTPGH DF+AEV R L+VLDGA+L+ISA +GVQAQTR+L L+++
Sbjct: 61 TSFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMG 120

Query: 121 IPTLIFVNKVDRGGAQDESLLRSISEKLTPAVLAMGSV-DGPGGRDARCTPYTAADARFT 179
IPT+ F+NK+D+ G ++ + I EKL+ ++ V P + T +T ++
Sbjct: 121 IPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYP---NMCVTNFTESEQ--- 174

Query: 180 DRLTELLADHDDALLAAYVENAAPLPYSRLREALSTQTRQALVHPVFFGSAITGAGVDAL 239
+ + + +D LL Y+ + L + S + + PV+ GSA G+D L
Sbjct: 175 ---WDTVIEGNDDLLEKYMSGKSLEA-LELEQEESIRFHNCSLFPVYHGSAKNNIGIDNL 230

Query: 240 ISGVRELLPVGEGDADGPVSGTVFKVERGPAGEKIAYVRIFSGTVRTRDRLPFGRGEGRD 299
I + + G VFK+E +++AY+R++SG + RD + R ++
Sbjct: 231 IEVITNKFYSSTHRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSV---RISEKE 287

Query: 300 EGKVTAISVFDRGSDVREAAVGAGRIAKLRGLGGIRVGDAVGVSDTTAPGHW--FAPPTL 357
+ K+T + G + +G I L+ +++ +G + P L
Sbjct: 288 KIKITEMYTSINGELCKIDKAYSGEIVILQN-EFLKLNSVLGDTKLLPQRERIENPLPLL 346

Query: 358 ESVVVPCAPASRGELHFALAQLAEQDPLINLRQDDIRKEVSVSLYGEVQKEVIQATLADE 417
++ V P P R L AL ++++ DPL+ D E+ +S G+VQ EV A L ++
Sbjct: 347 QTTVEPSKPQQREMLLDALLEISDSDPLLRYYVDSATHEIILSFLGKVQMEVTCALLQEK 406

Query: 418 FGIDVTFRETTTICLERPNGSGAAYEVGDQDPNPFLATIGLRIDPAPIGSGIEYRLEVEL 477
+ +++ +E T I +ERP + PNPF A+IGL + P P+GSG++Y V L
Sbjct: 407 YHVEIEIKEPTVIYMERPLKKAEYTIHIEVPPNPFWASIGLSVSPLPLGSGMQYESSVSL 466

Query: 478 GSMPFSLMRAVEQTVGETLQQGIHGWQVTDCVVTMTHSGYWPRQSHSHAVFDKSMSSTAG 537
G + S AV + + +QG++GW VTDC + + Y+ S ST
Sbjct: 467 GYLNQSFQNAVMEGIRYGCEQGLYGWNVTDCKICFKYGLYY------------SPVSTPA 514

Query: 538 DFRNLTPLVLMSALKEAGTTVYEPMHRFRLELPADLLGPLLPVLAHLRAVPGTPAVQGAT 597
DFR L P+VL LK+AGT + EP F++ P + L A ++
Sbjct: 515 DFRMLAPIVLEQVLKKAGTELLEPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQLKNNE 574

Query: 598 CVLEGEIPAARVHELQQQLPALTRGEGVLESGFDRYRAVVGTPPGRPRTDRDPLNRKEYL 657
+L GEIPA + E + L T G V + Y G P +P R P +R + +
Sbjct: 575 VILSGEIPARCIQEYRSDLTFFTNGRSVCLTELKGYHVTTGEPVCQP---RRPNSRIDKV 631

Query: 658 LHTVRRI 664
+ +I
Sbjct: 632 RYMFNKI 638


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_27180  SACTRNSFRASE  47     2e-08    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 46.9 bits (111), Expect = 2e-08
Identities = 17/70 (24%), Positives = 31/70 (44%), Gaps = 3/70 (4%)

Query: 255 VGRCVVDGRWAGFMAVE---VGPEYRRRGLATAVMTALARKALDEGASAAWLQVETDNEG 311
+GR + W G+ +E V +YR++G+ TA++ A + L+ + N
Sbjct: 77 IGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINIS 136

Query: 312 ARALYERMGF 321
A Y + F
Sbjct: 137 ACHFYAKHHF 146


170  C5746_27730  C5746_27780  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_27730  0       13       0.352805   NAD(P)-dependent oxidoreductase
C5746_27735  -1      12       0.975239   amino acid transporter
C5746_27740  0       12       0.083622   TetR family transcriptional regulator
C5746_27745  2       11       0.008556   MFS transporter
C5746_27750  1       11       -0.560218  hypothetical protein
C5746_27755  2       16       -1.142052  superoxide dismutase, Ni
C5746_27760  1       16       -0.294474  nickel-type superoxide dismutase maturation
C5746_27765  2       16       0.738630   zf-CGNR multi-domain protein
C5746_27775  0       12       1.258291   SAM-dependent methyltransferase
C5746_27780  0       14       1.792089   SigE family RNA polymerase sigma factor
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_27860  NUCEPIMERASE  28     0.046    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 27.8 bits (62), Expect = 0.046
Identities = 11/22 (50%), Positives = 16/22 (72%)

Query: 8 VAVTGASGAVGGRVAQRLVRAG 29
VTGA+G +G V++RL+ AG
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAG 24


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_27870  HTHTETR  50     7e-10    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 50.0 bits (119), Expect = 7e-10
Identities = 21/168 (12%), Positives = 47/168 (27%), Gaps = 10/168 (5%)

Query: 12 RPGGRTARVRESVLRAAGDALAEHGFDRLDLADVARRAEVGKTTVYRRWSTPTGLIADLL 71
+ R+ +L A ++ G L ++A+ A V + +Y + + L +++
Sbjct: 4 KTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIW 63

Query: 72 DDMAEQSSPRTDT------GSLTEDLRANARLVVTTLTDPRQGALFKSVIAAATCDPRTT 125
+ G LR V+ + + L +I
Sbjct: 64 ELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEM 123

Query: 126 EALHRFYAIRIKEW----SGCVTEAVERGELPAGTDADEVIRAVSAPL 169
+ + E + +E LPA + +
Sbjct: 124 AVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYI 171


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_27875  TCRTETB  116    3e-30    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 116 bits (291), Expect = 3e-30
Identities = 88/413 (21%), Positives = 180/413 (43%), Gaps = 16/413 (3%)

Query: 29 RQKLVLALLLGAQFMIAVDFSILNVALPVVGEGLGFSLSHLQWIATAFALAAAGFTLLFG 88
R +L L F ++ +LNV+LP + + W+ TAF L + T ++G
Sbjct: 11 RHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYG 70

Query: 89 RVADLVGRKRLFLGGMVVLGLSSALGGLATSP-EVLLTARVLQGLATAAVTPAGLALLTT 147
+++D +G KRL L G+++ S +G + S +L+ AR +QG AA PA + ++
Sbjct: 71 KLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAF-PALVMVVVA 129

Query: 148 AFKEGPLRERALGLNGALMSAGFTAGAILGGLLTDLLSWRWAFFVNVPVAALVVFLAPAV 207
+ R +A GL G++++ G G +GG++ + W++ + +P+ ++
Sbjct: 130 RYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIH--WSYLLLIPMITIITVPFLMK 187

Query: 208 ITDSRPARRPRLDVPGAVTVTGGLLLLVYGLTQAGESGWTTPTTLVALLAGLALLLGFWS 267
+ + D+ G + ++ G++ + T + +++ L+ L+ F
Sbjct: 188 LLKKEVRIKGHFDIKGIILMSVGIVFFMLFTTSY--------SISFLIVSVLSFLI-FVK 238

Query: 268 IEKRAASPLVPVHILKRRSVIWGNTVGLIAFVTETSLVFLLTLYLQEVLGYSPLATGLAF 327
++ P V + K + G G I F T V ++ +++V S G
Sbjct: 239 HIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVI 298

Query: 328 GVLG-IGTVIGGAFGGRAVGRYGSRMTIVTGGVIQAVATLCLVALGTSGAWIWLLLVATF 386
G + +I G GG V R G + G +V+ L L + +W +++
Sbjct: 299 IFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFV 358

Query: 387 IGGIGNMLVIVGFMVTATSGLPDEEQGLATGLATMTQQVGITMGIPVMSAVVT 439
+GG+ ++ +V+++ L +E G L T + GI ++ +++
Sbjct: 359 LGGLSFTKTVISTIVSSS--LKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLS 409


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_27890  TYPE3OMOPROT  28     0.020    Type III secretion system outer membrane O protein ...
		>TYPE3OMOPROT#Type III secretion system outer membrane O protein family signature.

Length = 303

Score = 27.7 bits (61), Expect = 0.020
Identities = 23/83 (27%), Positives = 35/83 (42%), Gaps = 8/83 (9%)

Query: 51 DWLLVQYGATV-RPGDVVILRHPFQQDLLIVKRAAERRRGGW-----WVLADNPFAGGDS 104
+WLL Q R G L +P +Q + + AE+R W W+ +P G +
Sbjct: 12 EWLLAQTATECQRHGREATLEYPTRQGMWVRLSDAEKRWSAWIKPGDWLEHVSPALAGAA 71

Query: 105 TDYGTVPEELVLARVRARYRPLK 127
G E LV+ + A RP +
Sbjct: 72 VSAGA--EHLVVPWLAATERPFE 92


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_27905  PF06917  27     0.040    Periplasmic pectate lyase
		>PF06917#Periplasmic pectate lyase

Length = 555

Score = 27.2 bits (60), Expect = 0.040
Identities = 11/26 (42%), Positives = 14/26 (53%), Gaps = 4/26 (15%)

Query: 50 RRHGRLDNIEAYARKTLINTYIAACR 75
R+ R+DN A A TL IAA +
Sbjct: 492 HRYFRIDNPIALALLTL----IAAKQ 513


171  C5746_27820  C5746_27860  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias   Product
C5746_27820  1       10       1.430125  helix-turn-helix domain-containing protein
C5746_27825  0       12       2.151575  peptidase
C5746_27830  1       14       2.235170  PadR family transcriptional regulator
C5746_27835  2       12       2.373031  hypothetical protein
C5746_27840  2       11       2.604603  hypothetical protein
C5746_27845  4       11       3.157950  hypothetical protein
C5746_27850  1       14       2.019808  serine protease
C5746_27855  0       14       0.900891  AAA family ATPase
C5746_27860  -1      13       0.487932  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
C5746_27950  HTHFIS  27     0.003    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 27.5 bits (61), Expect = 0.003
Identities = 13/67 (19%), Positives = 25/67 (37%), Gaps = 1/67 (1%)

Query: 3 EATDLAARAGDRDPRVGLRAVAALRRLLEQLEAVQVRSA-RVQGWSWQEIAAELGVSRQA 61
+A + R L R+L ++E + +A + + A LG++R
Sbjct: 406 QAVEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNT 465

Query: 62 VHKKYGR 68
+ KK
Sbjct: 466 LRKKIRE 472


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
C5746_27960  TYPE4SSCAGA  29     0.049    Type IV secretion system CagA exotoxin signature.
		>TYPE4SSCAGA#Type IV secretion system CagA exotoxin signature.

Length = 1147

Score = 28.5 bits (63), Expect = 0.049
Identities = 21/100 (21%), Positives = 46/100 (46%), Gaps = 2/100 (2%)

Query: 137 GRKSDWESAFGDLGDFGDKETWRAAKEELRKA--KQEWKEQARRAKDESRRAREDAQQAR 194
G+ ++ A D + G+ + + A+++L K+ K+E E+ K ES+ ++ +A+
Sbjct: 586 GKTLNFNKAVADAKNTGNYDEVKKAQKDLEKSLRKREHLEKEVEKKLESKSGNKNKMEAK 645

Query: 195 RQAKEAQDRAREQMQNAARQVQEHFARGDWPSGVREGLAE 234
QA +D + A + A G++ L++
Sbjct: 646 AQANSQKDEIFALINKEANRDARAIAYAQNLKGIKRELSD 685


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
C5746_27980  PERTACTIN  31     0.003    Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 31.2 bits (70), Expect = 0.003
Identities = 20/58 (34%), Positives = 23/58 (39%)

Query: 136 QTPPQAAASPATQAAPPMPPPPSSPPGHPPAPTAPPAAVPGASAGNPDAGASTGAPAH 193
+ PP +P P PP P PP P PP P A A P AG A A+
Sbjct: 566 KAPPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPPAGRELSAAAN 623


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
C5746_27985  V8PROTEASE  40     3e-05    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 39.6 bits (92), Expect = 3e-05
Identities = 33/182 (18%), Positives = 55/182 (30%), Gaps = 42/182 (23%)

Query: 46 GFLGSGFFIAPSWVLTCAHVAMEGEGRHVNVVFKPDPGSAETAVEGAVVVALPERRHAAA 105
F+ SG + +LT HV G + A P +
Sbjct: 101 TFIASGVVVGKDTLLTNKHVVDATHGDPHALK------------------AFPSAINQDN 142

Query: 106 GPGGV----RVPAPSGGWPAPDLALIRL--LRPVEHPCVYVTERPAGMFG----GGSVLY 155
P G ++ SG DLA+++ +H V ++
Sbjct: 143 YPNGGFTAEQITKYSGE---GDLAIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQNITV 199

Query: 156 TGWT-DGGSGQLTRFSGRCQVMGTFGDWAEDDEQMRLDGDRMYPGLSGGPVVDLARGEVV 214
TG+ D + G+ + M+ D G SG PV + + EV+
Sbjct: 200 TGYPGDKPVATMWESKGKITYLKGEA--------MQYDLS-TTGGNSGSPVFN-EKNEVI 249

Query: 215 GV 216
G+
Sbjct: 250 GI 251


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_27995  PF05616  39     1e-04    Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 39.0 bits (90), Expect = 1e-04
Identities = 27/82 (32%), Positives = 35/82 (42%), Gaps = 2/82 (2%)

Query: 45 PLLSAGPAEAGRPAGRPAPQVTGDEAPDPAPDDDEPAEQRDLGDADRRIALYPVPRTASP 104
P L+ G AEA P +P P+V+ E P P +E R + D + P T
Sbjct: 313 PDLTPGSAEA--PNAQPLPEVSPAENPANNPAPNENPGTRPNPEPDPDLNPDANPDTDGQ 370

Query: 105 GGPRPDGPAGRGRSAARGGRAR 126
G RPD PA R R + R
Sbjct: 371 PGTRPDSPAVPDRPNGRHRKER 392


172  C5746_28450  C5746_28510  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_28450  -1      11       -1.522073  cellulose-binding protein
C5746_28455  -1      10       -0.319474  hypothetical protein
C5746_28460  -1      10       -0.734496  methylmalonyl-CoA epimerase
C5746_28465  0       8        -1.235865  acetyl-CoA C-acyltransferase
C5746_28470  -1      12       -1.465635  methylmalonyl Co-A mutase-associated GTPase
C5746_28475  -1      10       -1.393093  serine/threonine protein kinase
C5746_28480  -3      12       0.961712   hypothetical protein
C5746_28490  -2      9        0.581758   peptidase M4
C5746_28495  -1      10       1.447043   DNA-binding response regulator
C5746_28500  0       12       2.338838   two-component sensor histidine kinase
C5746_28505  -1      12       1.715789   MarR family transcriptional regulator
C5746_28510  2       12       -0.308692  Ser or Arg-related nuclear matrix protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
C5746_28550  IGASERPTASE  43     1e-06    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 42.7 bits (100), Expect = 1e-06
Identities = 36/249 (14%), Positives = 71/249 (28%), Gaps = 16/249 (6%)

Query: 42 SLEKRIEELHLETQNAQAQVNDAEPSYAGLGARVEKILRLAEEEAKDLREEARRAAE--Q 99
+ E E +++ + A R +EAK + + E Q
Sbjct: 1031 ATPSETTETVAENSKQESKTVEKNEQDA---TETTAQNREVAKEAKSNVKANTQTNEVAQ 1087

Query: 100 HRELAESAAQQVRNDAESFAAERKAKAEDEGVRIVEKAKGEAT--SLRTEAQKDAAQKRE 157
+ + + E KAK E E + V K + + ++E + A+
Sbjct: 1088 SGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPAR 1147

Query: 158 EADALFEETRAKAAQAAADFETNLAKRRDQSERDLASRQAKAEKRLAEIEH--------R 209
E D ++ AK + + + +E+
Sbjct: 1148 ENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATT 1207

Query: 210 AEQLRLEAEKLRTDAERRARQTVETAQRQSEDIVADANAKADRIRSESERELAALTNRRD 269
+ E+ + RR+ ++V + D + A S A L++ R
Sbjct: 1208 QPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVA-LCDLTSTNTNAVLSDARA 1266

Query: 270 SINAQLTNV 278
NV
Sbjct: 1267 KAQFVALNV 1275


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
C5746_28555  IGASERPTASE  62     1e-11    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 62.4 bits (151), Expect = 1e-11
Identities = 54/387 (13%), Positives = 116/387 (29%), Gaps = 38/387 (9%)

Query: 192 NETRQRLGSEAESARADAEAILLRARK-DAERLLNAASSQAQEATSHAEQLRTSTTAETE 250
R L D A + R + L + + T + T + +
Sbjct: 947 KAQRDHLNVSLVGNTVDLGAWKYKLRNVNGRYDLYNPEVEKRNQTVDTTNITTPNNIQAD 1006

Query: 251 QTRQQTTELNRA---------------AEQRMQEAETQLREARLAAEKVTSEAKEA-AVK 294
+ A +E AE +E++ EK +A E A
Sbjct: 1007 VPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESK-TVEKNEQDATETTAQN 1065

Query: 295 RLAAAESQNEQRTRTARSEIARLVGEA-------TKDAESLKAEAEQ-ALADARAEADRL 346
R A E+++ + T +E+A+ E TK+ +++ E + + E ++
Sbjct: 1066 REVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKV 1125

Query: 347 KSEAAEKARTAAAEDAAAQLAKAARAAEEVLTKASEDAK-STTRAASEEADRIRREAETE 405
S+ + K + A+ A+ + S+ + T ++E + TE
Sbjct: 1126 TSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTE 1185

Query: 406 ADRLRG--EAAEQADQLKGAAKDDTKEYRAKTVELQEEARRLRGEAEQLRSEAVAEGERI 463
+ + E + A T + R +R + + +
Sbjct: 1186 STTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSND-- 1243

Query: 464 RGEARREAVQQIEEGAKTAEELLSKAKADAEE--LRTTAGTESERVRTEATERATALRKQ 521
R V + + +LS A+A A+ L + E
Sbjct: 1244 -----RSTVALCDLTSTNTNAVLSDARAKAQFVALNVGKAVSQHISQLEMNNEGQYNVWV 1298

Query: 522 AEEALERARAEAEQLRTESEEQARSVT 548
+ ++ + + ++ R S+ +
Sbjct: 1299 SNTSMNKNYSSSQYRRFSSKSTQTQLG 1325



Score = 52.8 bits (126), Expect = 1e-08
Identities = 36/266 (13%), Positives = 81/266 (30%), Gaps = 26/266 (9%)

Query: 433 AKTVELQEEARRLRGEAEQLRSEAVAEGERIRGEARREAVQQIEEGAKTAEELLSKAKAD 492
++T E E + + + + E E +EA ++ +T E ++++ ++
Sbjct: 1034 SETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNE--VAQSGSE 1091

Query: 493 AEELRTTAGTES------ERVRTEATERATALRKQAEEALERARAEAEQLRTESEEQARS 546
+E +TT E+ E+ + E + + ++ + ++ ++E Q + E +
Sbjct: 1092 TKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDP 1151

Query: 547 VTTVAEQAAAELREETERAVAARQAKAADELTRLHTEAETRVTTAEQALNDARSEAE--- 603
+ E + A + ++ T T + E N + +
Sbjct: 1152 TVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTV 1211

Query: 604 ----------RIRRETNEESERLRAEAAERLRALQEQAETEAERLRDEAAADASRSRAEG 653
R RR + T A +A S A
Sbjct: 1212 NSESSNKPKNRHRRSVRSVPHNVEPATTS-----SNDRSTVALCDLTSTNTNAVLSDARA 1266

Query: 654 ESAAVRLRSEAAAEAERLKSEAQESA 679
++ V L A + E
Sbjct: 1267 KAQFVALNVGKAVSQHISQLEMNNEG 1292



Score = 50.8 bits (121), Expect = 4e-08
Identities = 38/310 (12%), Positives = 94/310 (30%), Gaps = 10/310 (3%)

Query: 685 EAAAAAERVGTEAAEALAAAQEEANRRRREAEETLDAARTEANQERERAREQSEELLASA 744
E + V T Q + EE ++ E +A
Sbjct: 984 EVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAEN 1043

Query: 745 RKRVEQAQAEAQRLVEEADSRAMELVSAAEQTAQQVRDSVNGLQEQAEAEIAGLRSAAEH 804
K+ + VE+ + A E + + A++ + +V + E +G +
Sbjct: 1044 SKQESKT-------VEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQ 1096

Query: 805 VAERTKSEAQEEADR--VRSDAHAERDRASEDAARIRREAQEESEAAKAMAERTVSDAIA 862
E ++ E+ ++ V ++ E + + + + +++ A+ E + I
Sbjct: 1097 TTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIK 1156

Query: 863 ESERLRADTAEYSQRMRTEASDALASAEQDAARSRAEAREDANRMRSDAATQ-ADRLVGE 921
E + TA+ Q + +S+ + + + + + A TQ
Sbjct: 1157 EPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESS 1216

Query: 922 ATSEAERIRTESTQQATQLAEEATQQATRLVDEATQQATRVADEATDGAERLRAEAAATV 981
+ R+ + + V +T +D + + A
Sbjct: 1217 NKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTNAVLSDARAKAQFVALNVG 1276

Query: 982 ASAQEHAART 991
+ +H ++
Sbjct: 1277 KAVSQHISQL 1286



Score = 46.2 bits (109), Expect = 9e-07
Identities = 48/322 (14%), Positives = 97/322 (30%), Gaps = 26/322 (8%)

Query: 63 PAYDSADIGYQAEQLLRNAQIQAEQLRTDAERELRDARAQTQRILQEHAEHQARLQAELH 122
P + + + IQA+ + E AR + A E
Sbjct: 983 PEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEI-ARVDEA-PVPPPAPATPSETTETV 1040

Query: 123 NEAVQRRQQLDQELAERRQTVESHVNENVAWAEQLRARTESQARRL-------LEESRAE 175
E +++ E E+ T + N VA + + +Q + E E
Sbjct: 1041 AEN-SKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTE 1099

Query: 176 AEQSLAAARAEAGRLANETRQ---RLGSEAESARADAEAILLRARKDAER--LLNAASSQ 230
+++ + E ++ E Q ++ S+ + +E + +A E +N Q
Sbjct: 1100 TKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQ 1159

Query: 231 AQEATSHAEQL---RTSTTAETEQTRQQTTE-LNRAAE--QRMQEAETQLREARLAAEKV 284
+Q T+ + TS+ E T T N E + A TQ ++ K
Sbjct: 1160 SQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKP 1219

Query: 285 TSEAKEAAVKRLAAAESQNEQRTRTARSEIARLVGEATKDAESLKAEAEQALADARAEAD 344
+ + + E + + L T S Q +A +
Sbjct: 1220 KNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTNAVLSDARAKAQFVA-----LN 1274

Query: 345 RLKSEAAEKARTAAAEDAAAQL 366
K+ + ++ + +
Sbjct: 1275 VGKAVSQHISQLEMNNEGQYNV 1296



Score = 46.2 bits (109), Expect = 1e-06
Identities = 49/330 (14%), Positives = 109/330 (33%), Gaps = 26/330 (7%)

Query: 441 EARRLRGEAEQLRSEAVAEGERIRGEARREAVQQIEEG--AKTAEELLSK-AKADAEELR 497
E R + + + + + + E + +++E A S+ + AE +
Sbjct: 986 EKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSK 1045

Query: 498 TTAGTESERVRTEATERATALRKQAEEALERARAEAEQLRT--ESEEQARSVTTVAEQAA 555
+ T E+ +ATE R+ A+EA +A + E + TT ++ A
Sbjct: 1046 QESKTV-EKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETA 1104

Query: 556 AELREETERAVAARQAKAADELTRLHTEAETRVTTAEQALNDARSEAERIRRETNEESE- 614
+EE + + + T++ + +SE + + E E++
Sbjct: 1105 TVEKEEKAKVETEKTQEVPK-------------VTSQVSPKQEQSETVQPQAEPARENDP 1151

Query: 615 RLRAEAAERLRALQEQAETEAERLRDEAAADASRSRAEGESAAVRLRSEAAAEAERLKSE 674
+ + + E A+ + S +V E A +
Sbjct: 1152 TVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTV 1211

Query: 675 AQESADRVRSEAAAAAERVGTEAAEALAAAQEEANRRRREAEETLDAARTEANQERERAR 734
ES+++ ++ + V A ++ + R D T N AR
Sbjct: 1212 NSESSNKPKNRHRRSVRSVPHNVEPATTSSND------RSTVALCDLTSTNTNAVLSDAR 1265

Query: 735 EQSEELLASARKRVEQAQAEAQRLVEEADS 764
+++ + + K V Q ++ + E +
Sbjct: 1266 AKAQFVALNVGKAVSQHISQLEMNNEGQYN 1295



Score = 45.1 bits (106), Expect = 2e-06
Identities = 50/336 (14%), Positives = 104/336 (30%), Gaps = 27/336 (8%)

Query: 846 SEAAKAMAERTVSDAIAESERLRADTAEYSQRMRTEASDALASAEQDAARSRAEAREDAN 905
+++A V+D E + S+ R + +L D + + R
Sbjct: 917 TKSATGNFTLQVADKTGEPNHNELTLFDASKAQRDHLNVSLVGNTVDLGAWKYKLRNVNG 976

Query: 906 RMRSDAATQADRLVGEATSEAERIRTESTQQATQLAEEATQQATRLVDEATQQATRVADE 965
R R T + I T + QA + + + VDEA
Sbjct: 977 RYDLYNPEVEKR---NQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVP------- 1026

Query: 966 ATDGAERLRAEAAATVASAQEHAARTREESEQVRADAEAAAEQMRAEARQEADRLLDEAR 1025
A +E TVA + ++T E++EQ + A ++ EA+
Sbjct: 1027 --PPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNE 1084

Query: 1026 EAAAKRRADAAEQADQLINKAQEEALRAATEAEEQADTMVGVARKEAVRITSEATVEGNS 1085
A + + + E+ +A E E+ E ++TS+ + +
Sbjct: 1085 VAQSGSETKETQTTETKETATVEKEEKAKVETEKTQ---------EVPKVTSQVSPKQ-- 1133

Query: 1086 LVERARTDADELLVGARRDATAIRERAEELRTRIESEIEELHDRARRETSEQM-KTAGER 1144
E++ T + D T + + ++ E+ + + ++
Sbjct: 1134 --EQSETVQPQAEPARENDPTVNIKEPQSQTNT-TADTEQPAKETSSNVEQPVTESTTVN 1190

Query: 1145 VDNLMKAATEQRDDAAAKAKELLADANSEASKVRIA 1180
N + E A + +N ++ R +
Sbjct: 1191 TGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRS 1226


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
C5746_28570  HTHFIS  28     0.042    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 28.3 bits (63), Expect = 0.042
Identities = 15/68 (22%), Positives = 26/68 (38%)

Query: 9 EQARAGRPRAVARLISLVEGASPQLREVMAALAPLTGNAYVVGLTGSPGVGKSTSTSALV 68
+ R + ++ + G S ++E+ LA L + +TG G GK AL
Sbjct: 122 PKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALH 181

Query: 69 SAYRRAGK 76
+R
Sbjct: 182 DYGKRRNG 189


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_28575  YERSSTKINASE  47     2e-07    Yersinia serine/threonine protein kinase signature.
		>YERSSTKINASE#Yersinia serine/threonine protein kinase signature.

Length = 732

Score = 47.4 bits (112), Expect = 2e-07
Identities = 30/81 (37%), Positives = 41/81 (50%), Gaps = 8/81 (9%)

Query: 133 AGIVHRDVKPGNVML--ASDGPRLIDFGIARDAGATPLTTTSRMVGSPAFMSPEHVAGSG 190
AG+VH D+KPGNV+ AS P +ID G+ +G P T +F +PE G+
Sbjct: 264 AGVVHNDIKPGNVVFDRASGEPVVIDLGLHSRSGEQPKGFTE------SFKAPELGVGNL 317

Query: 191 RVVPASDVFCLASVLCYAATG 211
SDVF + S L + G
Sbjct: 318 GASEKSDVFLVVSTLLHCIEG 338


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
C5746_28590  HTHFIS  79     3e-19    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 79.1 bits (195), Expect = 3e-19
Identities = 30/118 (25%), Positives = 57/118 (48%)

Query: 2 RLLIVEDEKRLAMSLAGGLAAEGFAVDVVHDGLEGLHRAAEGAYDLVVLDIMLPGMNGYR 61
+L+ +D+ + L L+ G+ V + + A G DLVV D+++P N +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 VCAALRAAGHEMPILMLTAKDGEYDEAEGLDTGADDYLTKPFSYVVLVARIRALLRRR 119
+ ++ A ++P+L+++A++ + + GA DYL KPF L+ I L
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_28605  PRTACTNFAMLY  28     0.035    Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 28.5 bits (63), Expect = 0.035
Identities = 21/72 (29%), Positives = 26/72 (36%), Gaps = 1/72 (1%)

Query: 177 VMGGLR-SLTGIGGTSGEEHQFEFVGAGTVLLQSTEILMPEQPTGATPAQAGVPGGAGQP 235
V+G +L G T G + V LQ I + P G VPGGA
Sbjct: 222 VLGASELTLDGGHITGGRAAGVAAMQGAVVHLQRATIRRGDAPAGGAVPGGAVPGGAVPG 281

Query: 236 GSAPRLPGQLGD 247
G P G + D
Sbjct: 282 GFGPGGFGPVLD 293


173  C5746_28560  C5746_28625  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_28560  0       8        1.596335   TetR family transcriptional regulator
C5746_28565  0       8        1.833374   co-chaperone YbbN
C5746_28570  0       8        2.456454   cholesterol esterase
C5746_28575  -1      9        2.078942   hypothetical protein
C5746_28580  0       12       1.418995   hypothetical protein
C5746_28585  1       16       0.545225   hypothetical protein
C5746_28595  0       16       -0.335957  pyruvate kinase
C5746_28605  2       16       -0.667773  acetate kinase
C5746_28615  4       13       1.032215   phosphate acetyltransferase
C5746_28620  5       13       0.756855   6-phosphofructokinase
C5746_28625  5       14       0.754937   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_28655  HTHTETR  60     2e-13    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 60.0 bits (145), Expect = 2e-13
Identities = 32/202 (15%), Positives = 73/202 (36%), Gaps = 13/202 (6%)

Query: 11 RTGRPRSAAADEAILEATRASLVDLGWSKLTMSDVATRAGVAKTTLYRRWAGKNELVVDA 70
R + + + IL+ G S ++ ++A AGV + +Y + K++L +
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 71 VA-------VLFDELELPDLGSLSADVQAVVLQFAALLERPETQTALMAVV---AESTRD 120
L E + G + ++ +++ E + LM ++ E +
Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGE 122

Query: 121 EALRARIRDSIVDRQKRLVLQGRQRAQERGELPVEQDEAVAATTDDLIFDVIAGAVVHRA 180
A+ + + ++ + Q + E LP + AA ++ I+G + +
Sbjct: 123 MAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAA---IIMRGYISGLMENWL 179

Query: 181 LVSAEPVDEDWARRFTVLLLAG 202
+ AR + +LL
Sbjct: 180 FAPQSFDLKKEARDYVAILLEM 201


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_28675  ACRIFLAVINRP  30     0.006    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 29.8 bits (67), Expect = 0.006
Identities = 16/68 (23%), Positives = 32/68 (47%), Gaps = 16/68 (23%)

Query: 67 SLIIGVLLITLGLTMWFHHIVRVFAG-VAAILLALISIPVANIGGFLI----GF---LFS 118
+L ++L+ L ++ +F + A L+ I++PV +G F I G+ +
Sbjct: 343 TLFEAIMLVFL--------VMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLT 394

Query: 119 LLGGALSI 126
+ G L+I
Sbjct: 395 MFGMVLAI 402


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_28680  CHANLCOLICIN  36     3e-04    Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 35.8 bits (82), Expect = 3e-04
Identities = 35/155 (22%), Positives = 62/155 (40%), Gaps = 10/155 (6%)

Query: 99 SPSATDSGDKGDTSTPKPS------ATPSASDAAKPETKAADAPAAAAPSASPTPTKSTN 152
+P + SG G K AT S A +T+A A A A + + K+
Sbjct: 28 TPDGSGSGGGGGKGGSKSESSAAIHATAKWSTAQLKKTQAEQAARAKAAAEAQAKAKANR 87

Query: 153 PLDPLGVGDALKDLFDGPDKETASPSPSATTASPKPSQSDAAEPAEKPAEKATDAVKETA 212
+ LKD+ + + AS +PSAT + + + AE KA + ++ A
Sbjct: 88 D----ALTQRLKDIVNEALRHNASRTPSATELAHANNAAMQAEDERLRLAKAEEKARKEA 143

Query: 213 DKTAAAIRDAADKAGKSVEELDESAKGLDTKKDED 247
+ A ++A + + E E+ + L + E+
Sbjct: 144 EAAEKAFQEAEQRRKEIEREKAETERQLKLAEAEE 178


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_28695  ACETATEKNASE  485    e-174    Acetate kinase family signature.
		>ACETATEKNASE#Acetate kinase family signature.

Length = 400

Score = 485 bits (1251), Expect = e-174
Identities = 185/401 (46%), Positives = 255/401 (63%), Gaps = 9/401 (2%)

Query: 16 RVLVLNSGSSSVKYQLLDMSDRSRLAVGLVERIGEETSRLVHTPLAGSGAESRERIGPIA 75
++LV+N GSSS+KYQL++ D + LA GL ERIG S L H + E + +
Sbjct: 2 KILVINCGSSSLKYQLIESKDGNVLAKGLAERIGINDSLLTHN----ANGEKIKIKKDMK 57

Query: 76 DHEAALKAAAGELAADGLGLDSP--ALAAIGHRVVHGGLRFTEPVVIDDEVLKEIERLVP 133
DH+ A+K L G+ + A+GHRVVHGG FT V+I D+VLK I +
Sbjct: 58 DHKDAIKLVLDALVNSDYGVIKDMSEIDAVGHRVVHGGEYFTSSVLITDDVLKAITDCIE 117

Query: 134 VAPLHNPANIVGIRTAQALRPDLPQVAVFDTAFHTTMPEYAARYAIDVETADAHRIRRYG 193
+APLHNPANI GI+ + PD+P VAVFDTAFH TMP+YA Y I E ++IR+YG
Sbjct: 118 LAPLHNPANIEGIKACTQIMPDVPMVAVFDTAFHQTMPDYAYLYPIPYEYYTKYKIRKYG 177

Query: 194 FHGTSHAYVSRKAAELLGRTPEEVNVIVLHLGNGASASAVAGGRCVETSMGLTPLEGLVM 253
FHGTSH YVS++AAE+L + E + +I HLGNG+S +AV G+ ++TSMG TPLEGL M
Sbjct: 178 FHGTSHKYVSQRAAEILNKPIESLKIITCHLGNGSSIAAVKNGKSIDTSMGFTPLEGLAM 237

Query: 254 GTRSGDIDPAVTFHLKRVAGMSTDEIDVLLNKKSGLVGLCG-DNDMREIRRR-IDEGDER 311
GTRSG IDP++ +L +S +E+ +LNKKSG+ G+ G +D R++ GD+R
Sbjct: 238 GTRSGSIDPSIISYLMEKENISAEEVVNILNKKSGVYGISGISSDFRDLEDAAFKNGDKR 297

Query: 312 AALAFDIYVHRLKKYIGAYSAVLGRVDAVVFTAGVGENSAPVREAAIAGLEEFGLAVDAD 371
A LA +++ +R+KK IG+Y+A +G VD +VFTAG+GEN +RE + GLE G +D +
Sbjct: 298 AQLALNVFAYRVKKTIGSYAAAMGGVDVIVFTAGIGENGPEIREFILDGLEFLGFKLDKE 357

Query: 372 LNAARSGAPRLISPDHARVAVAVVPTDEELEIAVQTFALIG 412
N G +IS ++V V VVPT+EE IA T ++
Sbjct: 358 KN-KVRGEEAIISTADSKVNVMVVPTNEEYMIAKDTEKIVE 397


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_28710  PF05272  33     0.002    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 32.7 bits (74), Expect = 0.002
Identities = 22/46 (47%), Positives = 27/46 (58%), Gaps = 2/46 (4%)

Query: 94 VHGQRAATARA-VEDEGGTEEEAGEVPGAPAGE-SPEPPAGPPPAR 137
+HG + + A A V E G E AG V GAPAG +P+PP PP R
Sbjct: 81 IHGLKVSKAAAQVAREEGLESVAGIVMGAPAGAPAPKPPRPEPPPR 126


174  C5746_28720  C5746_28755  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_28720  1       12       -0.185350  1,4-alpha-glucan branching enzyme
C5746_28725  -1      12       -0.096743  maltokinase
C5746_28730  -2      17       -2.036262  maltose alpha-D-glucosyltransferase
C5746_28735  -1      23       -2.019914  alpha-1,4-glucan--maltose-1-phosphate
C5746_28740  -1      24       -2.164284  DUF3417 domain-containing protein
C5746_28745  0       26       -2.701308  peptidase M4
C5746_28750  -1      22       -2.921812  GntR family transcriptional regulator
C5746_28755  -1      22       -4.119308  MFS transporter
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_28810  PF03544  33     0.003    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 33.4 bits (76), Expect = 0.003
Identities = 17/85 (20%), Positives = 24/85 (28%), Gaps = 7/85 (8%)

Query: 11 PAATEVPQVPLLTPAETAPPAAPAKAAT------PPSSEPAAPAKRKKPAGERAAKTGDG 64
PA E PQ P P + P E P + KP + +
Sbjct: 57 PADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKR 116

Query: 65 -AAPPRPRRGTGSQGVRQARPLGNG 88
P R + + ARP +
Sbjct: 117 DVKPVESRPASPFENTAPARPTSST 141


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
C5746_28835  THERMOLYSIN  258    6e-80    Thermolysin metalloprotease (M4) family signature.
		>THERMOLYSIN#Thermolysin metalloprotease (M4) family signature.

Length = 544

Score = 258 bits (659), Expect = 6e-80
Identities = 176/573 (30%), Positives = 251/573 (43%), Gaps = 76/573 (13%)

Query: 19 AALLAVGVQTGTATATPGSATAAATAGANR-----GALAKQLTPSQRAELIREADASKAA 73
A L A+G+ G G++ + N ++ L EL+ +
Sbjct: 5 AMLGAIGLAFGLMAWPFGASAKGKSMVWNEQWKTPSFVSGSLLGRCSQELVYRY-LDQEK 63

Query: 74 TAKELGLGSQEKLVVRDVIQDNDGTTHTRYERTFAGLPVLGGDLVVQETKAGATE-SVTK 132
+LG ++E+L + D G T R+E+ A +G LV + S T
Sbjct: 64 NTFQLGGQARERLSLIGNKLDELGHTVMRFEQAIAASLCMGAVLVAHVNDGELSSLSGTL 123

Query: 133 ASKVSSGQLKAVDTTADVAPAVAQKQALGLAKADGSKKTAADRAP-------RKVVWMAQ 185
+ LK A++ +QA +AK D + + +R R V++ +
Sbjct: 124 IPNLDKRTLK-------TEAAISIQQAEMIAKQDVADRVTKERPAAEEGKPTRLVIYPDE 176

Query: 186 GKPQLAYETVVGGLQEDGTPNELHVITDASTGAKLYEWQGVEN---------------GT 230
P+LAYE V L P + DA+ G L +W ++ G
Sbjct: 177 ETPRLAYEVNVRFLTPV--PGNWIYMIDAADGKVLNKWNQMDEAKPGGAQPVAGTSTVGV 234

Query: 231 GNTQYNGQVTLGTAPS-----YTLTDTGRGNH-KTYNLNHGSSGTGTLFTNSTDVWGNGN 284
G Q + T S Y L D RG+ TY+ + + G+L+ + + +
Sbjct: 235 GRGVLGDQKYINTTYSSYYGYYYLQDNTRGSGIFTYDGRNRTVLPGSLWADGDNQFFASY 294

Query: 285 PSNAETAAADAHYGAAETWDYYKNVHGRTGIRGDGVGAYSRVHYGNAYVNAFWQDSCFCM 344
+ AA DAHY A +DYYKNVHGR G S VHYG Y NAFW S M
Sbjct: 295 ----DAAAVDAHYYAGVVYDYYKNVHGRLSYDGSNAAIRSTVHYGRGYNNAFWNGSQ--M 348

Query: 345 TYGDGEG-NLKPLT-SLDVAAHEMSHGVTAATAKLVYSGESGGLNEATSDIFAAGVEFYS 402
YGDG+G P + +DV HE++H VT TA LVY ESG +NEA SDIF VEFY+
Sbjct: 349 VYGDGDGQTFLPFSGGIDVVGHELTHAVTDYTAGLVYQNESGAINEAMSDIFGTLVEFYA 408

Query: 403 NTAEDPGDYLVGEKI---DINGDGTPLRYMDKPSKDGASKDA--WYSGIG-NIDVHYSSG 456
N D+ +GE I + GD LR M P+K G Y+G N VH +SG
Sbjct: 409 NRNP---DWEIGEDIYTPGVAGDA--LRSMSDPAKYGDPDHYSKRYTGTQDNGGVHTNSG 463

Query: 457 PANHFFYLLSEGSGAKVINGVSYDSPTSDGLPVTGIGRAKAEQIWFKALATKFTSTTNYA 516
N YLLS+G G+ VTGIGR K +I+++AL T T+N++
Sbjct: 464 IINKAAYLLSQGG-------------VHYGVSVTGIGRDKMGKIFYRALVYYLTPTSNFS 510

Query: 517 AARTGTLAVAGELYGTTSAEYKAVGDAWAAINV 549
R + A +LYG+TS E +V A+ A+ V
Sbjct: 511 QLRAACVQAAADLYGSTSQEVNSVKQAFNAVGV 543


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_28840  TETREPRESSOR  66     4e-15    Tetracycline repressor protein signature.
		>TETREPRESSOR#Tetracycline repressor protein signature.

Length = 218

Score = 66.5 bits (162), Expect = 4e-15
Identities = 47/213 (22%), Positives = 77/213 (36%), Gaps = 22/213 (10%)

Query: 86 LSRGRIVRAAIELADAEGLPAVSMRRVATTLSTSTMALYRHVPGKAELVRLMSDEVFGER 145
L+R ++ AA+EL + G+ ++ R++A L LY HV K L+ ++ E+
Sbjct: 4 LNRESVIDAALELLNETGIDGLTTRKLAQKLGIEQPTLYWHVKNKRALLDALAVEILARH 63

Query: 146 PLGTVP---RDWRSGLEVAARWLRSVYGRHPWMAQATASFTRPTASPHAMRYTEWVLHAL 202
++P W+S L A R R+ A+ TRP E L +
Sbjct: 64 HDYSLPAAGESWQSFLRNNAMSFRRALLRYRDGAKVHLG-TRPD--EKQYDTVETQLRFM 120

Query: 203 RGTGLSPHTMLHIHLTLFAHVQGLAMGADSEAQARQDTGLSDVEWRVRNEPQFNAISASG 262
G S L+ ++ +H +GA E Q + + R A
Sbjct: 121 TENGFSLRDGLYA-ISAVSH---FTLGAVLEQQEHT----AALTDRP-------AAPDEN 165

Query: 263 DYPFLNSLFEHDEFELDLDSLFEFGLQRTLDGI 295
P L D + F GL+ + G
Sbjct: 166 LPPLLREAL-QIMDSDDGEQAFLHGLESLIRGF 197


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_28845  TCRTETB  170    1e-49    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 170 bits (432), Expect = 1e-49
Identities = 85/394 (21%), Positives = 158/394 (40%), Gaps = 14/394 (3%)

Query: 5 LFVSMDVSILFYALPAIGADLEPGSTQQLWILDIYGFVLAGLLITMGALGDRIGRRTVLI 64
F ++ +L +LP I D W+ + + G L D++G + +L+
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 65 TGTVLFAAASVAAAYAQSPGA-LIAARALLGVGGACLMPSTLALVRNLFHDPRQRARAVA 123
G ++ SV S + LI AR + G G A P+ + +V + R +A
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAA-AFPALVMVVVARYIPKENRGKAFG 142

Query: 124 LWTTVMATGISVGPVVSGALLEHFWWGAVFLVNLPAMALLLVLAPLLLPESRTPGEGRFD 183
L +++A G VGP + G + + W + L+ + + + L LL E R G FD
Sbjct: 143 LIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVRIKGH--FD 200

Query: 184 ILSAVLSLAALLPLIHGIKEVAKHGYQPLPALGITAGLALGFVFLRRQARLVHPMVDLAL 243
I +L ++ + + + + L +F++ ++ P VD L
Sbjct: 201 IKGIILMSVGIVFFMLFTTSYSI-SFLIVSVLSF-------LIFVKHIRKVTDPFVDPGL 252

Query: 244 LRRRAFGGPVLVNLLAMAATVGFAAFFSQYVQSVLGKSPFEAAMWSLVP-SLGVVVCAPA 302
+ F VL + GF + ++ V S E + P ++ V++
Sbjct: 253 GKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYI 312

Query: 303 GGALARRFDRGYVMGGGFLVSAAGFLSLTRIGTQSPLWMTLAGSAVYAGGLVSAMTLANE 362
GG L R YV+ G + FL+ + + + +MT+ V GGL T+ +
Sbjct: 313 GGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVL-GGLSFTKTVIST 371

Query: 363 LALGAAPPERAGSAAAVLESGQELGGALGMALLG 396
+ + + AG+ ++L L G+A++G
Sbjct: 372 IVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVG 405


175  C5746_28785  C5746_28815  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_28785  -2      11       -0.410147  hypothetical protein
C5746_28795  -2      10       -0.281479  sensor histidine kinase
C5746_28800  -2      9        -0.341160  DNA-binding response regulator
C5746_28810  -2      8        -0.667297  glycogen debranching enzyme GlgX
C5746_28815  -3      7        -0.796687  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_28875  ABC2TRNSPORT  33     8e-04    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 33.0 bits (75), Expect = 8e-04
Identities = 53/261 (20%), Positives = 91/261 (34%), Gaps = 15/261 (5%)

Query: 2 TTANPAKAMDASTPWAAVLRTETRLFLRE-PASLFWILVFPTALLTILGLIPSFRDPDDA 60
TA P +++ W AV R + + ASL L P L+ + GL
Sbjct: 6 VTALPGGSLN----WIAVWRRNYIAWKKAALASLLGHLAEP--LIYLFGLGAGLGVMVGR 59

Query: 61 LGGRRVIDLYVPVSVLL-AMIMAGLQAMPPVLTGYRERGILRRMSTTPVRPSALLTAQIA 119
+GG V AM A + + + M T +R ++ ++A
Sbjct: 60 VGGVSYTAFLAAGMVATSAMTAATFETIYAAFGRMEGQRTWEAMLYTQLRLGDIVLGEMA 119

Query: 120 LHGAAALGSAALVMAVGRIAFGVTLPGQPFGYVLALLLSTASVLV-LGALLCALSRTTKA 178
A + A + V A Y L ++ T LG ++ AL+ +
Sbjct: 120 WAATKAALAGAGIGVVA--AALGYTQWLSLLYALPVIALTGLAFASLGMVVTALAPSYDY 177

Query: 179 SAAISSVVNLVMMFSAGVWIPVQSMPDTLRRIVEVTPFGAASQALDRAASGGWPG---WA 235
++V ++F +G PV +P + P + S L R G P
Sbjct: 178 FIFYQTLVITPILFLSGAVFPVDQLPIVFQTAARFLPL-SHSIDLIRPIMLGHPVVDVCQ 236

Query: 236 ELGVMALWAALVTLLATRLFR 256
+G + ++ + L+T L R
Sbjct: 237 HVGALCIYIVIPFFLSTALLR 257


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_28880  PF06580  35     4e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 35.2 bits (81), Expect = 4e-04
Identities = 35/183 (19%), Positives = 68/183 (37%), Gaps = 26/183 (14%)

Query: 250 LQVVTSIADTDPALARQHLDRAAALARHSLGEARRSVHDLVPAA--LEHDDLPGALKKTV 307
L + ++ DP AR+ L + L R+SL + V A L D L
Sbjct: 179 LNNIRALILEDPTKAREMLTSLSELMRYSLRYSN---ARQVSLADELTVVDSYLQLASI- 234

Query: 308 TGWGERHGVRAEFTVTGTVEPVHDEIAATLLRIAEEALANAGRHA-----GASRVGITLS 362
+ R +F + ++ L++ E N +H ++ + +
Sbjct: 235 -----QFEDRLQFENQINPAIMDVQVPPMLVQTLVE---NGIKHGIAQLPQGGKILLKGT 286

Query: 363 FMGDELTLDVRDDGCGFDPVALPPYSGAGGFGLGGMRARAERVAG---TVEVETEPGRGT 419
+TL+V + G +AL + G GL +R R + + G +++ + G+
Sbjct: 287 KDNGTVTLEVENTG----SLALKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVN 342

Query: 420 AVC 422
A+
Sbjct: 343 AMV 345


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
C5746_28885  HTHFIS  70     2e-16    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 70.2 bits (172), Expect = 2e-16
Identities = 37/160 (23%), Positives = 58/160 (36%), Gaps = 8/160 (5%)

Query: 9 ITLVVVDDHPVVRDGLRGMFDSAPGFQVLGEASNGVEGVDLVARLDPDVVLMDLRMPGGG 68
T++V DD +R L A G+ V SN +A D D+V+ D+ MP
Sbjct: 4 ATILVADDDAAIRTVLNQALSRA-GYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 69 GVAAIAELTGRGARSKVLVLTTYDTDSDTLPAIEAGATGYLLKDAPRDELFNAVRAAADG 128
+ + VLV++ +T + A E GA YL K EL + A
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 129 RTVLSPAVASRLVSRVRTPAAPGNDSLSAREREVLELVAK 168
+ G SA +E+ ++A+
Sbjct: 122 PKRR---PSKLEDDSQDGMPLVGR---SAAMQEIYRVLAR 155


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
C5746_28895  BACINVASINB  32     0.004    Salmonella/Shigella invasin protein B signature.
		>BACINVASINB#Salmonella/Shigella invasin protein B signature.

Length = 593

Score = 32.4 bits (73), Expect = 0.004
Identities = 19/80 (23%), Positives = 28/80 (35%)

Query: 75 DGADSVATSGALKISAKQGKLTTVKVADPKGTEVEGKIAADGTSWTPDQHLAAATKYKVH 134
D A SV + K++ Q KL ++ ADP + E + G T +
Sbjct: 151 DTAKSVYDAATKKLTQAQNKLQSLDPADPGYAQAEAAVEQAGKEATEAKEALDKATDATV 210

Query: 135 AVAKDAKGRESAKDTSFTTL 154
DAK + D T
Sbjct: 211 KAGTDAKAKAEKADNILTKF 230


176  C5746_29295  C5746_29325  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_29295  -2      16       -0.068524  urease subunit alpha
C5746_29300  -2      14       0.642040   agmatine deiminase
C5746_29305  -3      15       1.476547   transcriptional regulator
C5746_29310  -2      14       1.168476   alpha-ketoglutarate transporter
C5746_29315  -1      14       2.029125   citramalate synthase
C5746_29320  0       14       3.378023   hypothetical protein
C5746_29325  1       13       3.410234   histidine kinase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
C5746_29370  UREASE  773    0.0      Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 773 bits (1997), Expect = 0.0
Identities = 267/571 (46%), Positives = 362/571 (63%), Gaps = 11/571 (1%)

Query: 8 AMSIDRHEYASVHGPRAGDRVRLGDSGLTVRVESDAQQYGEEFLAGFGKTARDGLHLKAA 67
+ + R YA++ GP GD+VRL D+ L + VE D +GEE G GK RDG+ ++
Sbjct: 2 SYRMSRAAYANMFGPTVGDKVRLADTELFIEVEKDFTTHGEEVKFGGGKVIRDGMG-QSQ 60

Query: 68 AVRD--TCDVVISNVLVIDAVQGIRKVSIGIREGRISAIGRAGNPDTLDGVDVVVGTGTT 125
R+ D VI+N L++D GI K IG+++GRI+AIG+AGNPD GV ++VG GT
Sbjct: 61 VTREGGAVDTVITNALILDH-WGIVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTE 119

Query: 126 IVSGEGLIATAGAVDTHVHLLSPRIMEASLASGVTTIIGQEFGPVWGVGVNS----PWAL 181
+++GEG I TAG +D+H+H + P+ +E +L SG+T ++G GP G + PW +
Sbjct: 120 VIAGEGKIVTAGGMDSHIHFICPQQIEEALMSGLTCMLGGGTGPAHGTLATTCTPGPWHI 179

Query: 182 RHAFNAFDAWPVNIGFLGRGSSSHEAPLVEALAEGGACGFKVHEDMGAHTRALDTALRVA 241
A DA+P+N+ F G+G++S LVE + GGA K+HED G A+D L VA
Sbjct: 180 ARMIEAADAFPMNLAFAGKGNASLPGALVEMV-LGGATSLKLHEDWGTTPAAIDCCLSVA 238

Query: 242 EEHDVQVALHSDGLNECLSVEDTLRVLEGRTIHAFHIEGCGGGHVPNVLKMAGVPNVIGS 301
+E+DVQV +H+D LNE VEDT+ ++GRTIHA+H EG GGGH P+++++ G PNVI S
Sbjct: 239 DEYDVQVMIHTDTLNESGFVEDTIAAIKGRTIHAYHTEGAGGGHAPDIIRICGQPNVIPS 298

Query: 302 STNPTLPFGRDAVAEHYGMIVSVHDLKTDLPGDAAMARDRIRAGTMGAEDVLHDLGAIGI 361
STNPT P+ + +AEH M++ H L +P D A A RIR T+ AED+LHD+GA I
Sbjct: 299 STNPTRPYTVNTLAEHLDMLMVCHHLSPTIPEDIAFAESRIRKETIAAEDILHDIGAFSI 358

Query: 362 TSSDAQGMGRAGETVRRTFAMAGKMKAELGPMDGDGPADDNARVLRYIAKLTINPAIAHG 421
SSD+Q MGR GE RT+ A KMK + G + + +DN RV RYIAK TINPAIAHG
Sbjct: 359 ISSDSQAMGRVGEVAIRTWQTADKMKRQRGRLKEETGDNDNFRVKRYIAKYTINPAIAHG 418

Query: 422 LAHEIGSIETGKLADIVLWRPEFFGAKPQLVLKSGFPAYGVTGDPNAATDTCEPLVLGPQ 481
L+HEIGS+E GK AD+VLW P FFG KP +VL G A GDPNA+ T +P+ P
Sbjct: 419 LSHEIGSLEVGKRADLVLWNPAFFGVKPDMVLLGGTIAAAPMGDPNASIPTPQPVHYRPM 478

Query: 482 FGAYGATAADLSVAFVSQAATQLG-ADLMPTRRRRVAVRGTR-GIGPGDLVHNSRTGEVA 539
FGAYG + + SV FVSQA+ G A + + VAV+ TR GIG ++HNS T +
Sbjct: 479 FGAYGRSRTNSSVTFVSQASLDAGLAGRLGVAKELVAVQNTRGGIGKASMIHNSLTPHIE 538

Query: 540 VDAHSGLVTLDGDPLRSEPADSVSLNRLYFL 570
VD + V DG+ L EPA + + + YFL
Sbjct: 539 VDPETYEVRADGELLTCEPATVLPMAQRYFL 569


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_29380  HTHTETR  91     8e-25    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 90.8 bits (225), Expect = 8e-25
Identities = 24/172 (13%), Positives = 59/172 (34%), Gaps = 5/172 (2%)

Query: 3 PAPRRRNTAPPREEVLAAAMATIAERGLDGLTMAGLGREVGMSSGHLLYYFRTKDELLLQ 62
++ R+ +L A+ +++G+ ++ + + G++ G + ++F+ K +L +
Sbjct: 2 ARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61

Query: 63 TLEWSEGRLGAQRRTLLSGRV-TVRERLDAYIDLYLPDGHRDPHWTLWLEVWNRSQNADD 121
E SE +G + L + L + L +E+
Sbjct: 62 IWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVG 121

Query: 122 DA---RARQAAIEGAWHRDLVALLAEGASRGELRT-VDAERFATRLRALLDG 169
+ + Q + + + L L + R A +R + G
Sbjct: 122 EMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISG 173


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_29385  TCRTETA  39     2e-05    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 39.4 bits (92), Expect = 2e-05
Identities = 59/335 (17%), Positives = 114/335 (34%), Gaps = 64/335 (19%)

Query: 37 FFPEGNETANLMNTMGIFAVGFFMRPVGGWLLGRIGDRKGRKAALTLTVTLMSASAVLIA 96
+ TA+ + ++A M+ +LG + DR GR+ L +++ + ++A
Sbjct: 35 LVHSNDVTAHYGILLALYA---LMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMA 91

Query: 97 IAPTYDVAGYGGVAVLLVARLLQGLSVGGEYAASATYLTEASAPHRR----GFASSFQYV 152
AP + VL + R++ G++ G A + Y+ + + R GF S+
Sbjct: 92 TAPF--------LWVLYIGRIVAGIT-GATGAVAGAYIADITDGDERARHFGFMSACFGF 142

Query: 153 SMTAGQLIGLGLQIILQRNMSDAALHSWGWRIPFIVGALGAAIVFYLRRSMLETEVYAES 212
M AG ++G GL + + PF A + F +L E
Sbjct: 143 GMVAGPVLG-GL------------MGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKGER 189

Query: 213 GAAEQEDRGTLKAL-WQHKRE--------AFLVMALTMGGTVAYYTYTTYLTKFLSKSAG 263
+E L + W F++ + + + + + + G
Sbjct: 190 RPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIG 249

Query: 264 MEKSTASLVSFCALFVFMCIQPLA-----------GLLSDRIGRRPLLITFAVGSTFLTV 312
+ S A+ +L M P+A G+++D G ++ ++
Sbjct: 250 I--SLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTG---YILLAFATRGWMAF 304

Query: 313 PIMTMLKHAGTFWPAFGLALLALVVVTGYTSINAC 347
PIM +L G PA + S
Sbjct: 305 PIMVLLASGGIGMPA----------LQAMLSRQVD 329


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_29400  BCTERIALGSPC  31     0.023    Bacterial general secretion pathway protein C signa...
		>BCTERIALGSPC#Bacterial general secretion pathway protein C signature.

Length = 272

Score = 30.7 bits (69), Expect = 0.023
Identities = 19/90 (21%), Positives = 35/90 (38%), Gaps = 2/90 (2%)

Query: 138 VVAGPQGIRYTHP--DPALIGKHLVGPYKEALAGHGFTKSFSGSQGPSLNSVVPVKRLDG 195
+ P + PA + V L G K+ +G+ S S +P L+
Sbjct: 36 RIGLPDNAPVSSVQITPAQARQQPVTLNDFTLFGVSPEKNKAGALDASQMSNLPPSTLNL 95

Query: 196 SVAGVVSVGLTDKSVNSVAARSIPFSIAVG 225
S+ GV++ +S+ ++ + FS V
Sbjct: 96 SLTGVMAGDDDSRSIAIISKDNEQFSRGVN 125


177  C5746_29695  C5746_29730  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_29695  -2      11       0.284848   chromosome segregation protein SMC
C5746_29700  1       13       -0.981618  MFS transporter
C5746_29705  1       15       -1.556707  LLM class flavin-dependent oxidoreductase
C5746_29710  1       13       -1.409626  allantoin permease
C5746_29715  2       12       -0.754917  signal recognition particle-docking protein
C5746_29720  1       15       -1.414900  DNA primase
C5746_29725  0       14       0.535162   hypothetical protein
C5746_29730  -1      14       0.182585   ammonia channel protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
C5746_29770  GPOSANCHOR  47     3e-07    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 47.4 bits (112), Expect = 3e-07
Identities = 42/275 (15%), Positives = 84/275 (30%), Gaps = 12/275 (4%)

Query: 219 QADLRDARLRLLADDLVRLHTALRSEIADEAALKQRREAAEAELKSALAREAELEDEVRR 278
+ + + L + A +A L++ E A + A+ LE E
Sbjct: 128 ALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAA 187

Query: 279 LGPRLQRAQQTWYELSQLAERVRGTISLADARVKSAGAAPEEERRGTGFRDPESMEREAA 338
L R ++ + I +A + A + E+
Sbjct: 188 LEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADL------------EKALE 235

Query: 339 RIREQEAELEAALEAAEHALDDTAAHRAELERELAAEERRLKDAARAIADRREGLARLNG 398
A ++ E A +AELE+ L + I A L
Sbjct: 236 GAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEA 295

Query: 399 QVNAARSRAGSAKAEIDRLAAARDEAQERAVAAQEEYEQLKAEVEGLDAGDAELGERHEA 458
+ ++ A L D ++E + E+++L+ + + +A L +A
Sbjct: 296 EKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDA 355

Query: 459 AKRELAEAEAALSAAREAATAAERKRAAVAARHEA 493
++ + EA E +E R ++ +A
Sbjct: 356 SREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDA 390



Score = 45.4 bits (107), Expect = 1e-06
Identities = 53/331 (16%), Positives = 113/331 (34%), Gaps = 12/331 (3%)

Query: 162 EEAAGVLKHRKRKEKALRKLDAMGANLARVQDLTDELRRQLKPLGRQAAVARRAAVIQAD 221
+ G + ++ L+A A LA + ++ + + +
Sbjct: 127 KALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKA 186

Query: 222 LRDARLRLLADDLVRLHTALRSEIADEAALKQRREAAEAELKSALAREAELEDEVRRLGP 281
+AR L L ++ A L+ + A A +
Sbjct: 187 ALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSA 246

Query: 282 RLQRAQQTWYELSQLAERVRGTISLADARVKSAGAAPEEERRGTGFRDPESMEREAARIR 341
+++ + L + + A + A + + ++E E A +
Sbjct: 247 KIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEA-----EKAALEAEKADLE 301

Query: 342 EQEAELEAALEAAEHALDDTAAHRAELERELA--AEERRLKDAARAIADR-----REGLA 394
Q L A ++ LD + + +LE E E+ ++ +A+R R RE
Sbjct: 302 HQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKK 361

Query: 395 RLNGQVNAARSRAGSAKAEIDRLAAARDEAQERAVAAQEEYEQLKAEVEGLDAGDAELGE 454
+L + + ++A L D ++E ++ E+ +++ L+ + EL E
Sbjct: 362 QLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQVEKALEEANSKLAALEKLNKELEE 421

Query: 455 RHEAAKRELAEAEAALSAAREAATAAERKRA 485
+ ++E AE +A L A +A K+A
Sbjct: 422 SKKLTEKEKAELQAKLEAEAKALKEKLAKQA 452



Score = 39.3 bits (91), Expect = 1e-04
Identities = 55/344 (15%), Positives = 111/344 (32%), Gaps = 21/344 (6%)

Query: 171 RKRKEKALRKLDAMGANLARVQDLTDELRRQLKPLGRQAAVARRAAVIQADLRDARLRLL 230
+ EKAL + + + L+ + A A+ + A+++ L
Sbjct: 157 KADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTL 216

Query: 231 ADDLVRLHTALRSEIADEAALKQRREAAEAELKSALAREAELEDEVRRLGPRLQRAQQTW 290
+ L A A++K+ A +A LE L L+ A
Sbjct: 217 EAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFS 276

Query: 291 YELSQLAERVRGTISLAD---ARVKSAGAAPEEERRGTGFRDPESMEREAARIREQEAEL 347
S + + + + A ++ R+ RD ++ ++ + +L
Sbjct: 277 TADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLR-RDLDASREAKKQLEAEHQKL 335

Query: 348 EAALEAAEHA-------LDDTAAHRAELERELAAEERRLKDAARAIADRREGLARLNGQV 400
E + +E + LD + + +LE E E + K + + R L
Sbjct: 336 EEQNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAK 395

Query: 401 NAARSRAGSAKAEIDRLAAARDEAQERAVAAQEEYEQLKAEVEGLDAGDAELGERHEAAK 460
A +++ L E +E ++E +L+A++E +A K
Sbjct: 396 KQVEKALEEANSKLAALEKLNKELEESKKLTEKEKAELQAKLEA----------EAKALK 445

Query: 461 RELAEAEAALSAAREAATAAERKRAAVAARHEALALGLRRKDGT 504
+LA+ L+ R + + A G + GT
Sbjct: 446 EKLAKQAEELAKLRAGKASDSQTPDAKPGNKAVPGKGQAPQAGT 489



Score = 31.2 bits (70), Expect = 0.026
Identities = 56/370 (15%), Positives = 116/370 (31%), Gaps = 30/370 (8%)

Query: 696 ELTEGQRRAGERRSASAGLVEELGERRRAAEREKSGVAQQLGRLAGQARGAAGEAERMTA 755
+ + S + + L + E S ++L + A + + + A
Sbjct: 61 KFEIENNTLKLKNSDLSFNNKALKDHNDELTEELSNAKEKLRKNDKSLSEKASKIQELEA 120

Query: 756 SAARAQEALERATEEAEELAERLLVAEEMPVEEEPNTSVRDRLAADGANARQTEMEARLQ 815
A ++ALE A + + ++ + + E+ + + L A
Sbjct: 121 RKADLEKALEGAMNFSTADSAKI---KTLEAEKAALAARKADLEKALEGAM----NFSTA 173

Query: 816 VRTHEERVKALAGRADSLDRGARAEREARARAEQRRARLRHEAAVASAVASGARQLLAHV 875
+ ++A ++ E + A + +
Sbjct: 174 DSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKA----- 228

Query: 876 EVSVVRAERERDAAEAAKAERERELAAERNQGRELKSELDKLTDSVHRGEVLGAEKRLRI 935
E+ + A ++ + L++ +L ++ +I
Sbjct: 229 -----DLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKI 283

Query: 936 EQLEAKALEELGVEPAGLAAEYGPNQLVPPSPAAEGEELPEDPEHPRNQPKPFVRAEQ-- 993
+ LEA+ L E A L + A + L D + R K Q
Sbjct: 284 KTLEAE-KAALEAEKADLEHQSQVLN-------ANRQSLRRDLDASREAKKQLEAEHQKL 335

Query: 994 EKRLKSAERAYQQLGKVNPLALEEFSALEERHKFLSEQLEDLKKTRADL---IQVIREVD 1050
E++ K +E + Q L + + E LE H+ L EQ + + +R L + RE
Sbjct: 336 EEQNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAK 395

Query: 1051 ERVEQVFTEA 1060
++VE+ EA
Sbjct: 396 KQVEKALEEA 405


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_29775  TCRTETA  39     3e-05    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 39.0 bits (91), Expect = 3e-05
Identities = 24/90 (26%), Positives = 37/90 (41%), Gaps = 1/90 (1%)

Query: 75 IGAATAGRIADRIGRIRCMQIASVLFTISAVGSALPFALWDLAMWRIIGGFAIGMASVIG 134
A G ++DR GR + ++ + A LW L + RI+ G +V G
Sbjct: 58 ACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAG 117

Query: 135 PAYIAEVSPPAYRGRLGSFQQAAIVIGIAI 164
AYIA+++ R R F A G+
Sbjct: 118 -AYIADITDGDERARHFGFMSACFGFGMVA 146


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
C5746_29790  TONBPROTEIN  40     6e-06    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 40.4 bits (94), Expect = 6e-06
Identities = 41/192 (21%), Positives = 71/192 (36%), Gaps = 18/192 (9%)

Query: 17 VISGLVVSSRKKKQLPPAPSSTPTIT---------PPAEPHVGEEAETPREEPRRTIEEV 67
V++GL+ +S + PAP+ ++T P A E P EP E +
Sbjct: 23 VVAGLLYTSVHQVIELPAPAQPISVTMVTPADLEPPQAVQPPPEPVVEPEPEP----EPI 78

Query: 68 APPSVEAPAEEAPETVEPEAPAAPALEVPEPTAGRLVRLRARLARSQNSLGKGLLTLLSR 127
P EAP +P+ P +V E + + +R A + LT +
Sbjct: 79 PEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPVESRPASPFENTAPARLTSSTA 138

Query: 128 DNL--DEDTWEEIEDTLLTADVGVAPTQELVERLRERVRVLGTRTPEGLRSLLREELLTL 185
T L+ + P + R+ +V+V TP+G + ++L+
Sbjct: 139 TAATSKPVTSVASGPRALSRNQPQYPARAQALRIEGQVKVKFDVTPDG--RVDNVQILSA 196

Query: 186 LGTD-FDRAVKT 196
+ F+R VK
Sbjct: 197 KPANMFEREVKN 208


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_29805  CHANLCOLICIN  37     2e-04    Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 36.6 bits (84), Expect = 2e-04
Identities = 12/36 (33%), Positives = 20/36 (55%)

Query: 276 AASGAVAGLVAITPSGGAVSPLGAIAVGAIAGVLCA 311
AA V+ +VA+ S A + LG + + G+LC+
Sbjct: 471 AADAGVSYVVALLFSLLAGTTLGIWGIAIVTGILCS 506


178  C5746_29955  C5746_29995  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_29955  3       15       -1.636181  ABC transporter ATP-binding protein
C5746_29960  5       19       -1.713004  aspartate aminotransferase family protein
C5746_29965  3       16       -1.699777  Lrp/AsnC family transcriptional regulator
C5746_29970  4       21       -2.672097  gamma-aminobutyraldehyde dehydrogenase
C5746_29975  2       16       -3.129356  polyamine ABC transporter substrate-binding
C5746_29980  -1      10       -2.639642  hypothetical protein
C5746_29985  -1      11       -2.077151  peptidase
C5746_29990  -1      12       -2.263206  DNA-binding response regulator
C5746_29995  0       12       -1.520708  sensor histidine kinase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_30045  PHAGEIV  30     0.009    Gene IV protein signature.
		>PHAGEIV#Gene IV protein signature.

Length = 426

Score = 29.9 bits (67), Expect = 0.009
Identities = 21/96 (21%), Positives = 36/96 (37%), Gaps = 11/96 (11%)

Query: 130 LERLDIGVCAKKRPHTLLQGQRQRIAVARALAASPSVIFADEPTAALHRADRTHVLRTL- 188
+ER ++G+ P + G ++A + S S +D T R++
Sbjct: 312 VERQNVGISMSVFPVAMAGGNIVLDITSKADSLSSSTQASDVITNQ----------RSIA 361

Query: 189 TSAARSHGITVVLATHDAEVATLADRTVPLLDGRPV 224
T+ G T++L T D VP L P+
Sbjct: 362 TTVNLRDGQTLLLGGLTDYKNTSQDSGVPFLSKIPL 397


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_30060  ARGREPRESSOR  29     0.031    Bacterial arginine repressor signature.
		>ARGREPRESSOR#Bacterial arginine repressor signature.

Length = 149

Score = 28.7 bits (64), Expect = 0.031
Identities = 12/43 (27%), Positives = 22/43 (51%), Gaps = 4/43 (9%)

Query: 170 AAGNTVVIKPSDTTPASTVLIAEIIGQILPKGVFNVICGDRDT 212
+A + +V+K T P + I ++ + + + ICGD DT
Sbjct: 89 SASHLIVLK---TMPGNAQAIGALMDNLDWEEIMGTICGD-DT 127


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
C5746_30075  BLACTAMASEA  31     0.011    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 30.5 bits (69), Expect = 0.011
Identities = 17/51 (33%), Positives = 17/51 (33%), Gaps = 3/51 (5%)

Query: 81 RGGSGVHDLESNRPADR---DARFRAGSVTKVFTAAVVLQLAGEGRLDLNR 128
R G DL S R D RF S KV VL G L R
Sbjct: 39 RVGMIEMDLASGRTLTAWRADERFPMMSTFKVVLCGAVLARVDAGDEQLER 89


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
C5746_30080  HTHFIS  56     2e-11    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 56.4 bits (136), Expect = 2e-11
Identities = 18/115 (15%), Positives = 44/115 (38%), Gaps = 3/115 (2%)

Query: 3 IRVVVADDQELVRSGFAMILDAQPDIEVVAEAGDGAAAVDAVRRLGPEVALLDIRMPGTD 62
++VADD +R+ L ++ +V + A + ++ + D+ MP +
Sbjct: 4 ATILVADDDAAIRTVLNQAL-SRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 63 GIEACRTINAE-TGCRTVMLTTFDSDEYVYEALHAGASGFLLKDVRRDDLVHAVR 116
+ I ++++ ++ +A GA +L K +L+ +
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_30085  PF06580  28     0.049    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 28.3 bits (63), Expect = 0.049
Identities = 18/90 (20%), Positives = 30/90 (33%), Gaps = 10/90 (11%)

Query: 304 IVQEALTNVVRHAAAD-----TVFVQLDYGTGALIITVTDDGCGFAGGV----GMGLIGI 354
+VQ + N ++H A + ++ G + + V + G G GL +
Sbjct: 259 LVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGTGLQNV 318

Query: 355 GER-AAAHGGTADTGAGPGGRGFRVRVTIP 383
ER +G A V IP
Sbjct: 319 RERLQMLYGTEAQIKLSEKQGKVNAMVLIP 348


179  C5746_30430  C5746_30460  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_30430  -2      15       -0.188612  3-oxoacyl-ACP reductase
C5746_30435  -1      18       -0.531903  DNA glycosylase
C5746_30440  0       14       -0.669390  DEAD/DEAH box helicase
C5746_30445  0       13       -0.278351  AraC family transcriptional regulator
C5746_30450  0       12       -0.276480  branched-chain amino acid transporter AzlD
C5746_30455  1       25       -0.295851  hypothetical protein
C5746_30460  0       21       -0.874575  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_30585  DHBDHDRGNASE  129    1e-38    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 129 bits (325), Expect = 1e-38
Identities = 77/260 (29%), Positives = 125/260 (48%), Gaps = 11/260 (4%)

Query: 3 LTAYDLTGRSAFITGAAGGIGRAGAILLAAAGATVHCADRDEKGLLETRNLIAEAGGTSH 62
+ A + G+ AFITGAA GIG A A LA+ GA + D + + L + + + +
Sbjct: 1 MNAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAE 60

Query: 63 IHTLDVTDPGRIKAAV----AAAGNLDILAAVAGIMHTSSVLETADDDLDRVLSVNFKGV 118
DV D I G +DIL VAG++ + +D++ + SVN GV
Sbjct: 61 AFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGV 120

Query: 119 LYACQEVARSMIARGAPGSLITMASGAVDAASPGLLCYSAAKAAVVQLTKTLATELGPHS 178
A + V++ M+ R + GS++T+ S + Y+++KAA V TK L EL ++
Sbjct: 121 FNASRSVSKYMMDRRS-GSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYN 179

Query: 179 IRVNAVAPGWIRTPM-----TARHDAEQQQRAEATMARIS-PLGRVGEPEDVAHTLVYLA 232
IR N V+PG T M + AEQ + + PL ++ +P D+A +++L
Sbjct: 180 IRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLV 239

Query: 233 SDASAFMTGQILRPNGGVAM 252
S + +T L +GG +
Sbjct: 240 SGQAGHITMHNLCVDGGATL 259


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_30595  PF05946  30     0.046    Toxin-coregulated pilus subunit TcpA
		>PF05946#Toxin-coregulated pilus subunit TcpA

Length = 199

Score = 30.3 bits (68), Expect = 0.046
Identities = 13/48 (27%), Positives = 24/48 (50%), Gaps = 2/48 (4%)

Query: 1366 GVVTRGAVQAEGVEGGFSATYRILSAFEDNGQARRGYV--VEGLGAAQ 1411
G+V+ G + ++ + F T + +F N A + + V+GL AQ
Sbjct: 72 GLVSLGKISSDEAKNPFIGTNMNIFSFPRNAAANKAFAISVDGLTQAQ 119


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
C5746_30610  RTXTOXINA  26     0.034    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 26.1 bits (57), Expect = 0.034
Identities = 19/65 (29%), Positives = 32/65 (49%), Gaps = 5/65 (7%)

Query: 44 ALLAALTAQQTFGDGQHLT----LDARGAGLAAAALALVLRAPFLVVVGSAVVVTAGV-R 98
+LLAA + D T L + +G++AAA ++ AP +VG+ + +G+
Sbjct: 352 SLLAAFHKETGAIDASLTTISTVLASVSSGISAAATTSLVGAPVSALVGAVTGIISGILE 411

Query: 99 ALGQA 103
A QA
Sbjct: 412 ASKQA 416


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_30620  PF07520  30     0.021    Virulence protein SrfB
		>PF07520#Virulence protein SrfB

Length = 1041

Score = 29.6 bits (66), Expect = 0.021
Identities = 12/59 (20%), Positives = 25/59 (42%)

Query: 5 GTSPLNRAEQFIWLTARALEQRRFAHHFLKGSAEAVETALAAYLNEDGGYGHALEPDLR 63
S L+ ++++W L+ RF +H + A +LNE G ++ ++
Sbjct: 378 TLSGLSSPKRYLWDDDAVLQDWRFQNHHDPNNLPRPVRAAMRHLNEAGDVLAQVKTEIG 436


180  C5746_30600  C5746_30645  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_30600  0       16       0.705526   hypothetical protein
C5746_30605  0       16       -1.126669  diaminopimelate epimerase
C5746_30610  -1      21       -1.816799  bifunctional (p)ppGpp
C5746_30615  -1      17       -2.211739  zinc metalloprotease
C5746_30620  0       15       -1.751016  GTPase HflX
C5746_30625  0       13       -1.324078  hypothetical protein
C5746_30630  -1      13       -1.137623  aspartate aminotransferase family protein
C5746_30635  -3      14       0.172701   iron transporter
C5746_30640  -2      13       0.474242   GNAT family N-acetyltransferase
C5746_30645  -2      16       1.140558   IucA/IucC family protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_30735  ACRIFLAVINRP  27     0.037    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 26.7 bits (59), Expect = 0.037
Identities = 14/51 (27%), Positives = 24/51 (47%), Gaps = 9/51 (17%)

Query: 43 ARRLRIWQLAPIVMLAAVGSLMFAFPLAFEFGDGGAV-----VAMLGLLIS 88
A R+R L PI+M + ++ PLA G G + ++G ++S
Sbjct: 966 AVRMR---LRPILMTSLA-FILGVLPLAISNGAGSGAQNAVGIGVMGGMVS 1012


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
C5746_30750  MICOLLPTASE  30     0.026    Microbial collagenase metalloprotease (M9) signature.
		>MICOLLPTASE#Microbial collagenase metalloprotease (M9) signature.

Length = 1104

Score = 30.1 bits (67), Expect = 0.026
Identities = 15/45 (33%), Positives = 19/45 (42%), Gaps = 1/45 (2%)

Query: 310 WYVESIMVHELAHQWFGDSVSPRAWSDLWLN-EGHASWYEARYAE 353
+ +E + HE H G V P W EG +WYE AE
Sbjct: 494 YTLEELFRHEFTHYLQGRYVVPGMWGQGEFYQEGVLTWYEEGTAE 538


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_30755  BINARYTOXINA  32     0.005    Clostridial binary toxin A signature.
		>BINARYTOXINA#Clostridial binary toxin A signature.

Length = 454

Score = 32.3 bits (73), Expect = 0.005
Identities = 21/83 (25%), Positives = 35/83 (42%), Gaps = 5/83 (6%)

Query: 1 MTSSSSLPQD--AQDAQSATENVTDSLTESLRADALM--EEDVAWSHEIDGERDGEQFDR 56
+ +SS P AQD Q A+ +TD D L E + W + + ER + D
Sbjct: 16 LILTSSFPSYTYAQDLQIASNYITDRAFIERPEDFLKDKENAIQWEKK-EAERVEKNLDT 74

Query: 57 SERAALRRVAGLSTELEDVTEVE 79
E+ AL S ++ + ++
Sbjct: 75 LEKEALELYKKDSEQISNYSQTR 97


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_30775  PF04183  240    2e-73    IucA / IucC family
		>PF04183#IucA / IucC family

Length = 580

Score = 240 bits (615), Expect = 2e-73
Identities = 90/349 (25%), Positives = 122/349 (34%), Gaps = 37/349 (10%)

Query: 177 RRAPGAPAEADLFLTAEQSLLLGHPLHPTPKSREGLSESESRLYSPELHGSFPLHWMAVD 236
RR A +L Q LL GHP K R G + Y+PE +F LHW+AV
Sbjct: 113 RRGLSASDLINLNADRLQCLLSGHPKFVFNKGRRGWGKEALERYAPEYANTFRLHWLAVK 172

Query: 237 RSVLATDSA--------WTDGGRPVSATELITSHAEGLQLPDNTTPIPLHPWQARELGHR 288
R + T P + + L N P+P+HPWQ ++
Sbjct: 173 REHMIWRCDNEMDIHQLLTAAMDPQEFAR-FSQVWQENGLDHNWLPLPVHPWQWQQKIAT 231

Query: 289 PAVTALLDAGLLHDLGPHGKHWHPTSSVRTVHRPGAE--LMLKLSLGVRITNSRRENLRK 346
+ A G + LG G W S+RT+ L +KL L + T+ R +
Sbjct: 232 DFI-ADFAEGRMVSLGEFGDQWLAQQSLRTLTNASRRGGLDIKLPLTIYNTSCYRGIPGR 290

Query: 347 ELHRGVEVHRLLRSGLAAQWHAVHPGFDIVRDPAWLAVDGP--------DGEPVAGLDVM 398
+ G R L+ A V G I+ +PA V L V+
Sbjct: 291 YIAAGPLASRWLQQVFATDATLVQSGAVILGEPAAGYVSHEGYAALARAPYRYQEMLGVI 350

Query: 399 LRHNPF---GPYDDAVCIAALTAPRPWPGRTGMHSRIAEIVISLAAATGRAVGAVATEWF 455
R NP P + V +A L LA A G A W
Sbjct: 351 WRENPCRWLKPDESPVLMATLME----CDENNQ---------PLAGAYIDRSGLDAETWL 397

Query: 456 LRYLDRVVRPVLWLDAHAGVALEAHQQNTLVLLDPHGWPVGGRYRDNQG 504
+ VV P+ L GVAL AH QN + + G P +D QG
Sbjct: 398 TQLFRVVVVPLYHLLCRYGVALIAHGQNITLAMKE-GVPQRVLLKDFQG 445


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_30785  PF04183  515    e-179    IucA / IucC family
		>PF04183#IucA / IucC family

Length = 580

Score = 515 bits (1329), Expect = e-179
Identities = 201/612 (32%), Positives = 284/612 (46%), Gaps = 42/612 (6%)

Query: 22 LNRTVWDKAAARLLAKMLGQFAYEEVIEPVRQADGGDTYTLGLDDGGTLSFSARRGVYGS 81
+N WD RL+AKML + YE+V Q D L G F A RG++G
Sbjct: 1 MNHKDWDLVNRRLVAKMLSELEYEQVFHAESQGDDRYCINLP---GAQWRFIAERGIWGW 57

Query: 82 WHIAPDSVRETCGSARESPAESMREGVRGTPAEPLRDRNTDTVPFRDPLQFLARARHILG 141
I ++R EP+ L + + +L
Sbjct: 58 LWIDAQTLR--------------------CADEPV-----------LAQTLLMQLKQVLS 86

Query: 142 IDGATLGHLIREITTTLAADARLDHT--ALCADRLADLDYAELEGHQTGHPWLVANKGRL 199
+ AT+ ++++ TL D +L L A L +L+ L+ +GHP V NKGR
Sbjct: 87 MSDATVAEHMQDLYATLLGDLQLLKARRGLSASDLINLNADRLQCLLSGHPKFVFNKGRR 146

Query: 200 GFSATDASRYAPEARTPVRLPWIAVSTRIAAYRGVTGLATPEQLYVQELDPSVRDSFAAV 259
G+ RYAPE RL W+AV +R + QL +DP F+ V
Sbjct: 147 GWGKEALERYAPEYANTFRLHWLAVKREHMIWRCDNEM-DIHQLLTAAMDPQEFARFSQV 205

Query: 260 LRARGLDPGSYLCLPVHPWQWDEWIVPVFAPAIAAGDIVPLHTDADLRLPQQSIRTFTNV 319
+ GLD +L LPVHPWQW + I F A G +V L D L QQS+RT TN
Sbjct: 206 WQENGLDHN-WLPLPVHPWQWQQKIATDFIADFAEGRMVSLGEFGDQWLAQQSLRTLTNA 264

Query: 320 ARPDRHTVKLPLSILNTLVWRGLPTERTLAAPAVTAWVQGLRDRDAFLRDTCQVILLGEV 379
+R +KLPL+I NT +RG+P A P + W+Q + DA L + VIL GE
Sbjct: 265 SRRGGLDIKLPLTIYNTSCYRGIPGRYIAAGPLASRWLQQVFATDATLVQSGAVIL-GEP 323

Query: 380 ASVTVEHPLYDHLPETPYQFKEILGAIWREPLQPRLAPGERARTLASLLHTDPQGRAFTA 439
A+ V H Y L PY+++E+LG IWRE L P E +A+L+ D +
Sbjct: 324 AAGYVSHEGYAALARAPYRYQEMLGVIWRENPCRWLKPDESPVLMATLMECDENNQPLAG 383

Query: 440 ELVARSGLTPTAWLTRLFAALLPPLLHFLYRYGTVFSPHGENAIVVFDNQDVPVRLAIKD 499
+ RSGL WLT+LF ++ PL H L RYG HG+N + VP R+ +KD
Sbjct: 384 AYIDRSGLDAETWLTQLFRVVVVPLYHLLCRYGVALIAHGQNITLAMKE-GVPQRVLLKD 442

Query: 500 FVDDVNISARPLPEHESMPQDVRRILLTEEPSFLTQFIHAGLFIGVFRFLSPLCEEQLGV 559
F D+ + PE +S+PQ+VR + +L + G F+ V RF+SPL +LGV
Sbjct: 443 FQGDMRLVKEEFPEMDSLPQEVRDVTSRLSADYLIHDLQTGHFVTVLRFISPL-MVRLGV 501

Query: 560 PEDDFWSLVRAEILRYHARFPELKERFEMFDMLTPRIERLCLNRNRLHVDGYRDRPRRPH 619
PE F+ L+ A + Y + P++ ERF +F + P+I R+ LN +L D R
Sbjct: 502 PERRFYQLLAAVLSDYMKKHPQMSERFALFSLFRPQIIRVVLNPVKLTW-PDLDGGSRML 560

Query: 620 AAIHGTVPNPLH 631
+ NPL
Sbjct: 561 PNYLEDLQNPLW 572


181  C5746_30865  C5746_30900  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_30865  1       16       -1.004400  thymidine kinase
C5746_30870  1       18       -1.198606  hypothetical protein
C5746_30875  1       17       -1.368827  hypothetical protein
C5746_30880  1       13       -2.222627  hypothetical protein
C5746_30885  1       11       -2.937368  hypothetical protein
C5746_30890  1       10       -2.467679  hypothetical protein
C5746_30895  0       15       -1.864529  hypothetical protein
C5746_30900  -1      14       -1.890669  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
C5746_31010  ALARACEMASE  28     0.037    Alanine racemase signature.
		>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 27.8 bits (62), Expect = 0.037
Identities = 11/35 (31%), Positives = 22/35 (62%)

Query: 159 VGGEMVVEGEQVVVGDVSSPATEVGYEVLCRRHHR 193
+G + + G+++ + DV++ A VGYE++C R
Sbjct: 316 IGTPVELWGKEIKIDDVAAAAGTVGYELMCALALR 350


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_31025  PF07132  60     8e-11    Harpin protein (HrpN)
		>PF07132#Harpin protein (HrpN)

Length = 356

Score = 60.5 bits (146), Expect = 8e-11
Identities = 52/168 (30%), Positives = 63/168 (37%), Gaps = 7/168 (4%)

Query: 209 AAIGGALGGPFALLGGGLGKLLGNLTGKGLGKILGNDAGNVLGKGLAGGGKGAGGAAGGA 268
+ +GG LGG LG LG L G L G GLG LG+ G+ LG L GG GA GA A
Sbjct: 63 SMMGGGLGGGLGGLGSSLGGLGGGLLGGGLGGGLGSSLGSGLGSALGGGLGGALGAGMNA 122

Query: 269 GKAAGGLGKGAGGAGAGGLGKGAGGAGAGAAGKGAGGLGKGAGGAGAGAAGKGAGGLGKG 328
+ +G A LG G G G K A +G
Sbjct: 123 MNPSAMMGSLLFSALEDLLGGGMSQQQGGLFG------NKQPSSPEISAYTQGVNDALSA 176

Query: 329 AGGAGAGAAGKGAGAAGAGAGKGAGSAGAGAGKGAGAA-GAGLGKGAG 375
G G G G +GAGA G+ G +G+ AG
Sbjct: 177 ILGNGLSQTKGQTSPLQLGNNGLQGLSGAGAFNQLGSTLGMSVGQKAG 224



Score = 50.5 bits (120), Expect = 1e-07
Identities = 49/190 (25%), Positives = 66/190 (34%), Gaps = 23/190 (12%)

Query: 248 NVLGKGLAGGGKGAGGAAGGAGKAAGGLGKGAGGAGAGGLGKGAGGAGAGAAGKGAGGLG 307
+++ + G GG GG G LG GG GGLG G G + G GG
Sbjct: 53 DIMTTMMFMGSMMGGGLGGGLGGLGSSLGGLGGGLLGGGLGGGLGSSLGSGLGSALGGGL 112

Query: 308 KGAGGAGAGAAGKGA-----------GGLGKGAGGAGAGAAGKGAGAAGAGAGKGAGSAG 356
GA GAG A A LG G G G ++ + G
Sbjct: 113 GGALGAGMNAMNPSAMMGSLLFSALEDLLGGGMSQQQGGLFGNKQPSSPEISAYTQGVND 172

Query: 357 AGAGKGAGAAGAGLGKGAGAAGDGAAGAGGRAGAGAAGQAISEESARKLGERLGNIMGRT 416
A + G GL + G G G G AG + +LG LG +G+
Sbjct: 173 A----LSAILGNGLSQTKGQTSPLQLGNNGLQGLSGAG------AFNQLGSTLGMSVGQK 222

Query: 417 N--ESLNHVG 424
+ LN++
Sbjct: 223 AGLQELNNIS 232


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
C5746_31035  RTXTOXIND  27     0.043    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 26.7 bits (59), Expect = 0.043
Identities = 15/71 (21%), Positives = 28/71 (39%), Gaps = 7/71 (9%)

Query: 4 FQEQLAEAMAGLAEQTKKIQQVQEELARASASATSKDRMVTASVNAHGEILSFKFHTEGY 63
F+ ++ + L + T I + EL A + ++ A V+ ++ K HTEG
Sbjct: 296 FKNEILDK---LRQTTDNIGLLTLEL--AKNEERQQASVIRAPVS--VKVQQLKVHTEGG 348

Query: 64 RTMPGAQLAEV 74
L +
Sbjct: 349 VVTTAETLMVI 359


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_31045  cloacin  35     0.001    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 34.7 bits (79), Expect = 0.001
Identities = 25/87 (28%), Positives = 34/87 (39%), Gaps = 12/87 (13%)

Query: 364 AGGDGAGGDGGGTGGKGGGNLDLGAPPPVTGPSGSTSGSNLPKNNN---------LGLGP 414
+GGDG G + G G N G P + G++ GS NN + G
Sbjct: 2 SGGDGRGHNTGAHSTSGNIN---GGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGG 58

Query: 415 NPVTGPSGGTSGTGNPPGGGGSLTVPA 441
G GG +G G GG+L+ A
Sbjct: 59 GSGHGNGGGNGNSGGGSGTGGNLSAVA 85



Score = 29.7 bits (66), Expect = 0.045
Identities = 19/61 (31%), Positives = 25/61 (40%), Gaps = 7/61 (11%)

Query: 648 NPAAPPTQGNVSTQSLGALGRGGAGGGTGTGTPFMP-------PMGMGGGGAPGGGGNGG 700
N A T GN++ G GGA G+G + P + GGG G GG G
Sbjct: 10 NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNG 69

Query: 701 D 701
+
Sbjct: 70 N 70



Score = 29.7 bits (66), Expect = 0.045
Identities = 17/47 (36%), Positives = 20/47 (42%), Gaps = 1/47 (2%)

Query: 360 FGNGAGGDGAGGDGGGTGGKGGGNLDLGAPPPVTG-PSGSTSGSNLP 405
G G+G G G G GGNL A P G P+ ST G+
Sbjct: 57 GGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGL 103


182  C5746_30930  C5746_31005  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_30930  0       11       0.778614   secretion protein EccC
C5746_30935  0       10       0.933792   hypothetical protein
C5746_30940  0       9        0.313734   sporulation protein
C5746_30945  -1      11       0.067798   hypothetical protein
C5746_30950  -1      10       0.435558   glyoxalase/bleomycin resistance/extradiol
C5746_30955  -1      11       0.074622   sulfurtransferase
C5746_30960  -1      11       0.116905   DUF3071 domain-containing protein
C5746_30965  -1      12       0.479026   hypothetical protein
C5746_30970  -1      12       2.291092   FAD-linked oxidoreductase
C5746_30975  -2      12       2.562863   MFS transporter
C5746_30980  -2      14       2.169050   ferrochelatase
C5746_30985  -2      10       1.439389   inositol monophosphatase
C5746_30990  0       10       1.777829   DNA-binding response regulator
C5746_30995  0       10       1.704317   two-component sensor histidine kinase
C5746_31000  2       7        0.137142   DUF4193 domain-containing protein
C5746_31005  2       7        0.031783   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_31075  PF05272  35     0.002    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 35.4 bits (81), Expect = 0.002
Identities = 19/46 (41%), Positives = 27/46 (58%)

Query: 686 QGARRVGASAVAGQVVPYRTEWTAPRLPAPDPAPQPVAEEESEESL 731
Q AR G +VAG V+ AP+ P P+P P+PV E+E E++
Sbjct: 92 QVAREEGLESVAGIVMGAPAGAPAPKPPRPEPPPRPVVEKECWETI 137


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
C5746_31090  SUBTILISIN  129    6e-36    Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 129 bits (326), Expect = 6e-36
Identities = 77/304 (25%), Positives = 125/304 (41%), Gaps = 20/304 (6%)

Query: 107 QVPWAQSFLGIDRAWELSRGAGVKVAVIGTGADLRRVPALDGRVTGGPDVVAGGT----V 162
++P + W +RG GVKVAV+ TG D P L R+ GG + +
Sbjct: 21 EIPRGVEMIQAPAVWNQTRGRGVKVAVLDTGCDADH-PDLKARIIGGRNFTDDDEGDPEI 79

Query: 163 RDDCVGYGTFLAGIVAGGQQQGLEAAGVAPEAEVIAVRATSKQGATTPEALAKGIRAATG 222
D G+GT +AG +A + GVAPEA+++ ++ +KQG+ + + +GI A
Sbjct: 80 FKDYNGHGTHVAGTIAATEN-ENGVVGVAPEADLLIIKVLNKQGSGQYDWIIQGIYYAIE 138

Query: 223 SGAQIIHVALSVPSAPKQLKDAVLAAQKAGALVVAPASAREWESGAGASQAENGPAYPAA 282
II ++L P +L +AV A + LV+ A G G + + YP
Sbjct: 139 QKVDIISMSLGGPEDVPELHEAVKKAVASQILVMCAA----GNEGDGDDRT-DELGYPGC 193

Query: 283 LPGVLAVSSVGPGGEPEAPGTGKGRRSQPRLSAPGVSVTGPGPGGRGQFTGRGDAVAGAF 342
V++V ++ ++ L APG + PGG+ G ++A
Sbjct: 194 YNEVISVGAINFDRH---ASEFSNSNNEVDLVAPGEDILSTVPGGKYATFS-GTSMATPH 249

Query: 343 VAGTAALVLS-----YHPRLTAEQVAHRLESTAYGAVGDTPDPRIGLGIVDPVRALSAVL 397
VAG AL+ + LT ++ +L GL + V LS +
Sbjct: 250 VAGALALIKQLANASFERDLTEPELYAQLIKRTIPLGNSPKMEGNGLLYLTAVEELSRIF 309

Query: 398 PEER 401
+R
Sbjct: 310 DTQR 313


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_31095  PF05272  30     0.049    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.4 bits (68), Expect = 0.049
Identities = 17/62 (27%), Positives = 24/62 (38%), Gaps = 6/62 (9%)

Query: 606 LVFAGPPGTGKTTVARLYGSILASLGVLRSGHLVEVSRADLVAQIIGGTAIKTTE--TFN 663
+V G G GK+T+ L L H + D QI G A + +E F
Sbjct: 599 VVLEGTGGIGKSTLIN----TLVGLDFFSDTHFDIGTGKDSYEQIAGIVAYELSEMTAFR 654

Query: 664 KA 665
+A
Sbjct: 655 RA 656


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_31135  PF05616  31     0.005    Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 31.3 bits (70), Expect = 0.005
Identities = 19/74 (25%), Positives = 27/74 (36%), Gaps = 3/74 (4%)

Query: 215 PATSTDPGQERTPGQESGATPGTDSQGGQSRNPDSGQGDGSSADPVPSPTPSDNTGGQTN 274
P PG P + P + NP + G+ +P P P + + T+
Sbjct: 311 PRPDLTPGSAEAPNAQP--LPEVSPAENPANNPAPNENPGTRPNPEPDPDLNPDANPDTD 368

Query: 275 PPPATTP-SPAAPG 287
P T P SPA P
Sbjct: 369 GQPGTRPDSPAVPD 382



Score = 29.3 bits (65), Expect = 0.023
Identities = 23/74 (31%), Positives = 33/74 (44%), Gaps = 18/74 (24%)

Query: 233 ATPGTDSQGGQSRN------PD--SGQGDGSSADPVP----------SPTPSDNTGGQTN 274
AT G DSQG + + PD G + +A P+P +P P++N G + N
Sbjct: 293 ATFGRDSQGNTTVDVQVIPRPDLTPGSAEAPNAQPLPEVSPAENPANNPAPNENPGTRPN 352

Query: 275 PPPATTPSPAAPGD 288
P P +P A D
Sbjct: 353 PEPDPDLNPDANPD 366


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_31145  TCRTETA  48     4e-08    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 47.9 bits (114), Expect = 4e-08
Identities = 73/321 (22%), Positives = 126/321 (39%), Gaps = 36/321 (11%)

Query: 28 LSMMGIGVV-----TMVSQLTGRYGLA---GALSATLAMSAAVIGPLISRLVDRHGQRRV 79
L +GIG++ ++ L + G L A A+ P++ L DR G+R V
Sbjct: 16 LDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPV 75

Query: 80 LRPATLVALAAVAGLLICAQQRLADWTLFVFAAGAGCVPSVGSMIRARWAEIYRGSPRQL 139
L + +A AAV ++ L W L++ AG + G++ A A+I G R
Sbjct: 76 LLVS--LAGAAVDYAIMATAPFL--WVLYIGRIVAGITGATGAVAGAYIADITDGDERAR 131

Query: 140 HTAYSWESIVDEVCFIFGPIISIGLSTAWFPEAGPLLAAGFL-----LVGVFWLTAQRAT 194
H + + S + GP++ GL + P A P AA L L G F L
Sbjct: 132 H--FGFMSACFGFGMVAGPVLG-GLMGGFSPHA-PFFAAAALNGLNFLTGCFLLPESHKG 187

Query: 195 EPVPHPREQHTRGSALRS----RGLQVLVVTFVATGAIFGAIDVVTVAFAEEQGHKAAAS 250
E P RE ++ R + L+ F + + V F E++ H A +
Sbjct: 188 ERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATT 247

Query: 251 LVLAVYALGSCLAGAVFGLLHLKGKPSTRW------LVGVCAMAVSMIPLQLAGNLP--F 302
+ +++ A G + ++ + + G + R ++G+ A I L A F
Sbjct: 248 IGISLAAFG--ILHSLAQAM-ITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAF 304

Query: 303 LAVALFVAGLAIAPTMVTTMA 323
+ L +G P + ++
Sbjct: 305 PIMVLLASGGIGMPALQAMLS 325


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
C5746_31160  HTHFIS  76     3e-18    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 75.6 bits (186), Expect = 3e-18
Identities = 35/119 (29%), Positives = 54/119 (45%), Gaps = 1/119 (0%)

Query: 2 RVLVVEDEQLLADAVATGLRREAMAVDVVYDGAAALERVEVNDYDVVVLDRDLPLVHGDD 61
+LV +D+ + + L R V + + A + D D+VV D +P + D
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 VCRRIVELGMPTRVLMLTASGDVSDRVEGLELGADDYLPKPFAFTELTARV-RALGRRT 119
+ RI + VL+++A ++ E GA DYLPKPF TEL + RAL
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_31165  PF06580  36     3e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 35.6 bits (82), Expect = 3e-04
Identities = 22/104 (21%), Positives = 40/104 (38%), Gaps = 23/104 (22%)

Query: 310 LVQNAVRYNV---PEGGWVEVTTEPRSGQAVLVVSNTGPVVPAYEIDNLFEPFRRLRTER 366
LV+N +++ + P+GG + + +G L V NTG L +
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGS----------------LALKN 306

Query: 367 TGSDKGVGLGLSIARSVARAHGG--RIVAEPREGGGLVMRVSLP 408
T G GL ++ + +G +I ++G V +P
Sbjct: 307 TKESTGTGL-QNVRERLQMLYGTEAQIKLSEKQGKVNA-MVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
C5746_31175  RTXTOXINA  31     0.007    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 31.1 bits (70), Expect = 0.007
Identities = 13/42 (30%), Positives = 21/42 (50%)

Query: 146 ADKLIAGASKATATVGAGIGAAAMMPVPPAMLAELAAEITGV 187
D + S A+V +GI AAA + A ++ L +TG+
Sbjct: 364 IDASLTTISTVLASVSSGISAAATTSLVGAPVSALVGAVTGI 405


183  C5746_31075  C5746_31100  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_31075  0       14       1.892518   histidine kinase
C5746_31080  -1      14       2.687938   DNA-binding response regulator
C5746_31085  -2      14       2.541128   OB-fold tRNA/helicase-type nucleic acid binding
C5746_31090  -3      13       2.080485   DUF3159 domain-containing protein
C5746_31095  -2      13       1.006236   potassium transporter TrkA
C5746_31100  0       15       1.089031   potassium transporter TrkA
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_31240  PF06580  37     2e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 37.2 bits (86), Expect = 2e-04
Identities = 22/119 (18%), Positives = 43/119 (36%), Gaps = 28/119 (23%)

Query: 720 VDVDPGLLERAVAN-----IVENAVKY--SPCAE--RVTVAASALGGRVELRVADRGPGV 770
++P +++ V +VEN +K+ + + ++ + + G V L V + G
Sbjct: 244 NQINPAIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLA 303

Query: 771 PDDAKERIFEPFQRYGDAPRGAGVGLGLAVARGFVESMGG---TLGAEDTPGGGLTMVL 826
+ KE G GL R ++ + G + + G MVL
Sbjct: 304 LKNTKE--------------STGTGLQNVRER--LQMLYGTEAQIKLSEKQGKVNAMVL 346


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
C5746_31245  HTHFIS  96     4e-25    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 96.1 bits (239), Expect = 4e-25
Identities = 35/120 (29%), Positives = 64/120 (53%), Gaps = 1/120 (0%)

Query: 2 TRVLVVDDEPQIVRALVINLKARKYEVDAAPDGATALQLAAARHPDVVVLDLGLPDMDGV 61
+LV DD+ I L L Y+V + AT + AA D+VV D+ +PD +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 EVIKGLR-GWTRVPILVLSARHTSDEKVEALDAGADDYVTKPFGMDELLARLRASVRRAE 120
+++ ++ +P+LV+SA++T ++A + GA DY+ KPF + EL+ + ++ +
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_31260  NUCEPIMERASE  30     0.007    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 29.8 bits (67), Expect = 0.007
Identities = 12/31 (38%), Positives = 20/31 (64%), Gaps = 1/31 (3%)

Query: 1 MRVAIAG-AGAVGRSIAGELLENGHEVLLVD 30
M+ + G AG +G ++ LLE GH+V+ +D
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGID 31


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_31265  NUCEPIMERASE  29     0.011    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 29.4 bits (66), Expect = 0.011
Identities = 10/31 (32%), Positives = 17/31 (54%), Gaps = 1/31 (3%)

Query: 1 MHIVIMG-CGRVGAALAQTLEQQGHTVAVID 30
M ++ G G +G +++ L + GH V ID
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGID 31


184  C5746_31335  C5746_31365  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_31335  0       12       -0.074222  TIGR02680 family protein
C5746_31340  1       13       1.563730   TIGR02679 family protein
C5746_31345  1       12       1.370620   hypothetical protein
C5746_31350  1       13       1.003732   DNA-binding response regulator
C5746_31355  0       13       1.437127   two-component sensor histidine kinase
C5746_31360  1       12       2.346864   hypothetical protein
C5746_31365  1       12       2.228949   4'-phosphopantetheinyl transferase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
C5746_31525  GPOSANCHOR  41     3e-05    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 41.2 bits (96), Expect = 3e-05
Identities = 45/265 (16%), Positives = 90/265 (33%), Gaps = 16/265 (6%)

Query: 744 AREQARRTRIGELHVEREALEAEREQLAEALRAVTGRREELDAELAAVPGDDDLRHAHAR 803
AR+ + A A+ + L A+ R+ EL+ L + A+
Sbjct: 155 ARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGA--MNFSTADSAK 212

Query: 804 VTAAADALQRARARQRERAAELDRAADEADHTATELSE-TAAAMNLPTDREGLASVHQAL 862
+ AR+ + L+ A + + + ++ A L + L +
Sbjct: 213 IKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGA 272

Query: 863 ADHTAHLAALWPALR---ERADAEHAVTGDQAETGRAELLVSELTERTTEAARRAASADE 919
+ + +A L +AE A Q++ A + R +A+R A E
Sbjct: 273 MNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANR---QSLRRDLDASREAKKQLE 329

Query: 920 ----HLRTLRSTVGAAVAELQRRLEETAGAARVCESEQQRAQADLRDADRRASHAEGRIE 975
L A+ L+R L+ + A + E+E Q+ + + ++ ++
Sbjct: 330 AEHQKLEEQNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLD 389

Query: 976 RLNED---VTDATAARERAVAALQR 997
E V A +AAL++
Sbjct: 390 ASREAKKQVEKALEEANSKLAALEK 414



Score = 36.6 bits (84), Expect = 8e-04
Identities = 33/135 (24%), Positives = 60/135 (44%), Gaps = 4/135 (2%)

Query: 744 AREQARRTRIGELHVEREALEAEREQLAEALRAVTGRREELDAELAAVPGDDDLRHAHAR 803
A + A +L + + L A R+ L L A +++L+AE + + + A +R
Sbjct: 288 AEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEA-SR 346

Query: 804 VTAAADALQRARARQRERAAELDRAADEADHTATELSETAAAMNLPTDREGLASVHQALA 863
+ D L +R +++ AE + E + +E S + +L RE V +AL
Sbjct: 347 QSLRRD-LDASREAKKQLEAEHQKL--EEQNKISEASRQSLRRDLDASREAKKQVEKALE 403

Query: 864 DHTAHLAALWPALRE 878
+ + LAAL +E
Sbjct: 404 EANSKLAALEKLNKE 418


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
C5746_31545  HTHFIS  51     1e-09    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 50.6 bits (121), Expect = 1e-09
Identities = 33/174 (18%), Positives = 62/174 (35%), Gaps = 17/174 (9%)

Query: 2 RIDVVIADDQQLVRAGFRMILEAQPGIDVVGEAADGVECVDLARRLRPQVVLADIRMPRL 61
+++ADD +R L + G DV ++ +V+ D+ MP
Sbjct: 3 GATILVADDDAAIRTVLNQALS-RAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 62 DGLEVTRQLAGPGVADPMNVVVITTFDQDDYVRTALRNGACGFLLKDATPALLVEAVHAA 121
+ ++ ++ P V+V++ + A GA +L K P L E +
Sbjct: 61 NAFDLLPRIKKARPDLP--VLVMSAQNTFMTAIKASEKGAYDYLPK---PFDLTELIGII 115

Query: 122 ARGDALVSPAVTVRLLKRLEQAELGTASGGAEAFGLSGRELDIVRLVA-VGRTN 174
R + KR + G G S +I R++A + +T+
Sbjct: 116 GR---------ALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTD 160


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_31555  ACRIFLAVINRP  53     3e-09    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 53.3 bits (128), Expect = 3e-09
Identities = 32/153 (20%), Positives = 60/153 (39%), Gaps = 9/153 (5%)

Query: 181 IGFALAAVILVMTFGSLIAAGLPLLTAVVGVGVSIAAIKALSATFDLNSNTSALATM-LG 239
L +++ + ++ A +P + V + + A + A F + NT + M L
Sbjct: 346 EAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAA----FGYSINTLTMFGMVLA 401

Query: 240 IAVGIDYALFIVSR-YRSERAAGYGAGEAAARANGTAGSAVVFAGLTVVIA---LAGLAV 295
I + +D A+ +V R EA ++ A+V + + +A
Sbjct: 402 IGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGG 461

Query: 296 VNLPILTAMGLAAGGAVVIAVLVAVTLVPALLA 328
I + A+ ++VLVA+ L PAL A
Sbjct: 462 STGAIYRQFSITIVSAMALSVLVALILTPALCA 494



Score = 49.8 bits (119), Expect = 3e-08
Identities = 32/170 (18%), Positives = 66/170 (38%), Gaps = 23/170 (13%)

Query: 530 ALVVGLAVLVLILVFRSVLVPLKAALGFLLSVLAALGALVAVFQWGWLKDLVGLDQTGPI 589
++V L + + + R+ L+P A + LG + +G+ +
Sbjct: 348 IMLVFLVMYLFLQNMRATLIPTIAV------PVVLLGTFAILAAFGY---------SINT 392

Query: 590 MSLMPILMVGIVFGLAMDYQVFLVTRM-REAYVNGADARTAIESGFRHSAKVVTAAALIM 648
+++ ++ + GL +D + +V + R + + A E + A+++
Sbjct: 393 LTMFGMV---LAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVL 449

Query: 649 IAVF---AGFVGSQDAMLKSMGFGLAIAVFFDAFIVRMTIVPAALALLGK 695
AVF A F GS A+ + + A+ V + + PA A L K
Sbjct: 450 SAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVL-VALILTPALCATLLK 498



Score = 32.9 bits (75), Expect = 0.005
Identities = 36/194 (18%), Positives = 68/194 (35%), Gaps = 10/194 (5%)

Query: 150 DALADGRDRGLTIEASGDALAAPEGSASAEA-IGFALAAVILVM--TFGSLIAAGLPLLT 206
+ LA G+ + +G + A A + + V L + + S +L
Sbjct: 844 ENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSVMLV 903

Query: 207 AVVGVGVSIAAIKALSATFDLNSNTSALATMLGIAVGIDYALFIVSRYRSERAA-GYGAG 265
+G+ + + A + N + + I + A+ IV + G G
Sbjct: 904 VPLGI---VGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVV 960

Query: 266 EAAARANGTAGSAVVFAGLTVVIALAGLAVVNLP---ILTAMGLAAGGAVVIAVLVAVTL 322
EA A ++ L ++ + LA+ N A+G+ G +V A L+A+
Sbjct: 961 EATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFF 1020

Query: 323 VPALLAFAPERVRG 336
VP +G
Sbjct: 1021 VPVFFVVIRRCFKG 1034


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_31570  ENTSNTHTASED  159    6e-51    Enterobactin synthetase component D signature.
		>ENTSNTHTASED#Enterobactin synthetase component D signature.

Length = 234

Score = 159 bits (404), Expect = 6e-51
Identities = 55/197 (27%), Positives = 87/197 (44%), Gaps = 13/197 (6%)

Query: 24 LFPGEAELIRNSVEGRRKEFTTARWCARRALGELGVAPAPILKGERGAPIWPGGVVGSMT 83
L+ + +R++ R+ E R A AL E+GV P + G++ P+WP G+ GS++
Sbjct: 31 LWLPHHDRLRSAGRKRKAEHLAGRIAAVHALREVGVRTVPGM-GDKRQPLWPDGLFGSIS 89

Query: 84 HCAGYRAAAVARRADVLTVGIDAEPHEALPDGVHESIALATELQRELELRRTWPEIHWDR 143
HCA A ++R+ +GID E + + ++ +R++ P
Sbjct: 90 HCATTALAVISRQ----RIGIDIEKIMSQHTATELAPSIIDSDERQILQASLLPFPLALT 145

Query: 144 LLFSAKESVYKAWFPLTHRWLGFEQADIVLHSNGSFTAGLLISTMEPDLAAGATATPTAF 203
L FSAKESVYKA+ GF A + + + LL P AA A T
Sbjct: 146 LAFSAKESVYKAF-SDRVTLPGFNSAKVTSLTATHISLHLL-----PAFAAT-MAERT-V 197

Query: 204 SGRWLVADGLMVTAIVA 220
W D ++T + A
Sbjct: 198 RTEWFQRDNSVITLVSA 214


185  C5746_32260  C5746_32280  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_32260  1       9        1.204634   DNA-binding response regulator
C5746_32265  1       10       1.272200   sensor histidine kinase
C5746_32270  -1      11       0.845695   peptide hydrolase
C5746_32275  -1      14       0.530834   hypothetical protein
C5746_32280  -2      13       1.448799   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
C5746_32525  HTHFIS  39     1e-05    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 38.7 bits (90), Expect = 1e-05
Identities = 19/84 (22%), Positives = 36/84 (42%), Gaps = 4/84 (4%)

Query: 2 RVVIAEDNVLLSNGLELLLTAKGFHVAAITQDAPGFVAAVTEHRPDVTIVDVRLPPTFRD 61
+++A+D+ + L L+ G+ V T +A + D+ + DV +P +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMP---DE 60

Query: 62 EGIHAAIHARRQHPGLPVLVLSQY 85
++ P LPVLV+S
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQ 84


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_32530  PF06580  43     1e-06    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 42.5 bits (100), Expect = 1e-06
Identities = 11/67 (16%), Positives = 28/67 (41%), Gaps = 6/67 (8%)

Query: 318 RTTVRVAREGDRLSIEVTDDGVGGADETL---GTGIAGIRHRVLAL---DGTVHIHSPSG 371
+ ++ ++ +++EV + G T GTG+ +R R+ L + + + G
Sbjct: 280 KILLKGTKDNGTVTLEVENTGSLALKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQG 339

Query: 372 GPTTITV 378
+ +
Sbjct: 340 KVNAMVL 346


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_32540  HTHTETR  42     1e-07    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 41.5 bits (97), Expect = 1e-07
Identities = 16/72 (22%), Positives = 25/72 (34%), Gaps = 1/72 (1%)

Query: 32 PRRCSHRRHAARARLL-TADELFCAEGVRSVRIDRVIAHAGAAKATLYNAFGTKDGLVRA 90
R+ R +L A LF +GV S + + AG + +Y F K L
Sbjct: 2 ARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61

Query: 91 YLQARHAATAEH 102
+ + E
Sbjct: 62 IWELSESNIGEL 73


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_32545  NUCEPIMERASE  36     9e-06    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 36.3 bits (84), Expect = 9e-06
Identities = 24/89 (26%), Positives = 32/89 (35%), Gaps = 12/89 (13%)

Query: 16 LVTGVTSGIGRAVAKRLAADGMSVVV-----AGRNARRGAETVKEITTDGGRARFVEADL 70
LVTG IG V+KRL G VV + ++ + G +F + DL
Sbjct: 4 LVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPG--FQFHKIDL 61

Query: 71 EDPAGIDRLAAEVGCS-----ASRSGRRY 94
D G+ L A R RY
Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRY 90


186  C5746_34110  C5746_34145  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_34110  -1      20       -2.043561  TetR family transcriptional regulator
C5746_34115  0       18       -2.071645  hydrolase
C5746_34120  0       21       -3.005186  hypothetical protein
C5746_34125  0       17       -1.249653  two-component sensor histidine kinase
C5746_34130  0       17       -0.678944  DNA-binding response regulator
C5746_34140  2       21       -0.940729  hypothetical protein
C5746_34145  2       23       -0.562258  peptidoglycan-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_34420  TETREPRESSOR  70     8e-17    Tetracycline repressor protein signature.
		>TETREPRESSOR#Tetracycline repressor protein signature.

Length = 218

Score = 69.6 bits (170), Expect = 8e-17
Identities = 46/204 (22%), Positives = 79/204 (38%), Gaps = 14/204 (6%)

Query: 17 VTLDAILDAATEIADDRGLDAVTFRVVADRLGVSPMAIHRTTGGIDALQHALVSRIVGE- 75
+ ++++DAA E+ ++ G+D +T R +A +LG+ ++ AL AL I+
Sbjct: 4 LNRESVIDAALELLNETGIDGLTTRKLAQKLGIEQPTLYWHVKNKRALLDALAVEILARH 63

Query: 76 VTRSVHWP-DDWCGVVRLFADTLHDLLMRHPVILEAH--RRASLVGPGADDVALRVVDAL 132
S+ + W +R A + L+R+ + H R + LR +
Sbjct: 64 HDYSLPAAGESWQSFLRNNAMSFRRALLRYRDGAKVHLGTRPDEKQYDTVETQLRF---M 120

Query: 133 RTAGLDEEGAVYAYGALHDFVTG-------HVAIRLGRGDPEQLRLPPERRAMSVFADHH 185
G +YA A+ F G H A R LPP R D
Sbjct: 121 TENGFSLRDGLYAISAVSHFTLGAVLEQQEHTAALTDRPAAPDENLPPLLREALQIMDSD 180

Query: 186 DYDRRFAYGLDLVIGGIAAAAAPV 209
D ++ F +GL+ +I G +
Sbjct: 181 DGEQAFLHGLESLIRGFEVQLTAL 204


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_34435  FLGFLIH  31     0.006    Flagellar assembly protein FliH signature.
		>FLGFLIH#Flagellar assembly protein FliH signature.

Length = 228

Score = 31.3 bits (70), Expect = 0.006
Identities = 42/157 (26%), Positives = 64/157 (40%), Gaps = 15/157 (9%)

Query: 232 ADALQASDAELRVQEAKARRFVADVSHELRTPLAAMTMVATVLEEDADQLPPDAARAARA 291
A L+ AE + Q+A + + E +T L A+ V A +L A AAR
Sbjct: 77 AQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTLDALDSVI------ASRLMQMALEAARQ 130

Query: 292 VGAET-----ARLSRLVEDLMEISRFDAKAVRLNAAETDLA---DTVRASLALRGWTDRV 343
V +T + L + ++ L++ + +L DL D + A+L+L GW R
Sbjct: 131 VIGQTPTVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRVDDMLGATLSLHGWRLRG 190

Query: 344 QTHLHE-GVRAVVDRRRIDVIVANLVGNALRHGAPPV 379
LH G + D +D VA R AP V
Sbjct: 191 DPTLHPGGCKVSADEGDLDASVATRWQELCRLAAPGV 227


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
C5746_34440  HTHFIS  98     9e-26    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 97.6 bits (243), Expect = 9e-26
Identities = 39/140 (27%), Positives = 63/140 (45%), Gaps = 5/140 (3%)

Query: 2 PHVLLIEDDASVRDGMELVLRRHGYGVDTAATGEQALALLAGERGSRVELAVLDLMLPGM 61
+L+ +DDA++R + L R GY V + +A G L V D+++P
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGD---LVVTDVVMPDE 60

Query: 62 DGFEVCRRIRARSATLPVIMLTARGDDSDIVTGLEAGADDYVVKPVTAPVLEARIRAAL- 120
+ F++ RI+ LPV++++A+ + E GA DY+ KP L I AL
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 121 -RRAEPSARQRSDADLAGLV 139
+ PS + D LV
Sbjct: 121 EPKRRPSKLEDDSQDGMPLV 140


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
C5746_34450  MICOLLPTASE  30     0.022    Microbial collagenase metalloprotease (M9) signature.
		>MICOLLPTASE#Microbial collagenase metalloprotease (M9) signature.

Length = 1104

Score = 29.7 bits (66), Expect = 0.022
Identities = 19/96 (19%), Positives = 33/96 (34%)

Query: 79 VTVAASEGKTLTMGQALYELNDKPVTLLYGPVPMFREMKAGDRGSDVLQLERNLRDLGYG 138
+TV + G T + + + DKPV ++ P KA + ++ L + Y
Sbjct: 837 LTVTDNNGGINTESKKIKVVEDKPVEVINESEPNNDFEKANQIAKSNMLVKGTLSEEDYS 896

Query: 139 ANLYVDARYDENTEAAVKQWQKSLNRETTGKVGKGD 174
Y D N + + T K G +
Sbjct: 897 DKYYFDVAKKGNVKITLNNLNSVGITWTLYKEGDLN 932


187  C5746_34660  C5746_34680  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_34660  0       13       -0.533976  3-oxoacyl-ACP reductase
C5746_34665  -1      15       -1.887156  hypothetical protein
C5746_34670  0       11       -2.444274  sensor histidine kinase
C5746_34675  0       12       -2.509430  DNA-binding response regulator
C5746_34680  1       11       -2.579971  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_34935  DHBDHDRGNASE  105    2e-29    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 105 bits (263), Expect = 2e-29
Identities = 73/256 (28%), Positives = 118/256 (46%), Gaps = 15/256 (5%)

Query: 20 GLRGKSFLVTGGSRGLGYAVTRVLVAEGASVAI--CGRDADTLARASSELRRDFDAQVVT 77
G+ GK +TG ++G+G AV R L ++GA +A + +S +
Sbjct: 5 GIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAF-- 62

Query: 78 GAVDVLAPDELEDFTRQAGRELGGLDGLVANIGGKRGSGLLDS-TREDWQATWELNGGHA 136
DV +++ T + RE+G +D LV N+ G GL+ S + E+W+AT+ +N
Sbjct: 63 -PADVRDSAAIDEITARIEREMGPIDILV-NVAGVLRPGLIHSLSDEEWEATFSVNSTGV 120

Query: 137 VRTVRSALDSL--RPGGSVVIVASISGWKPRFP-AQYGAAKASEIYLAGALCQELAPYGV 193
RS + R GS+V V S PR A Y ++KA+ + L ELA Y +
Sbjct: 121 FNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNI 180

Query: 194 RVNTVSPGS--ILLPGKSW---DRLRRRDSATYEAYAERNPNGRLLNADEVARVIAFLLS 248
R N VSPGS + W + + + E + P +L ++A + FL+S
Sbjct: 181 RCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVS 240

Query: 249 PASGAVNGAHIPVDGG 264
+G + ++ VDGG
Sbjct: 241 GQAGHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_34950  PF06580  38     5e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 37.9 bits (88), Expect = 5e-05
Identities = 16/87 (18%), Positives = 32/87 (36%), Gaps = 12/87 (13%)

Query: 343 LLTNAAKHA-----RADRVWIDIRHEREALRISVGDDGCGGATV---SAGTGLGGIESRI 394
L+ N KH + ++ + + + + V + G S GTGL + R+
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGTGLQNVRERL 322

Query: 395 AAF---DGVLSVNSPQGGPTMTTMEIP 418
+ + ++ QG + IP
Sbjct: 323 QMLYGTEAQIKLSEKQGKVN-AMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
C5746_34955  HTHFIS  41     2e-06    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 40.6 bits (95), Expect = 2e-06
Identities = 20/84 (23%), Positives = 36/84 (42%), Gaps = 4/84 (4%)

Query: 2 RIVLAEDLFLLREGLVRLLTAHGFEIAAAVDNGPELLDATVEHRPDVALVDVRLPPTFTD 61
I++A+D +R L + L+ G+++ N L D+ + DV +P +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPD---E 60

Query: 62 EGLRAAIEARRRIPGLPVLVLSQY 85
++ P LPVLV+S
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQ 84


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_34960  ACRIFLAVINRP  42     7e-06    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 42.1 bits (99), Expect = 7e-06
Identities = 33/159 (20%), Positives = 56/159 (35%), Gaps = 13/159 (8%)

Query: 506 VVIPLVLVVVFVILIALLRAVVAPVLLMATVVLSYFAALGISWQLFQHVLGFPAVDTQVM 565
++ + VVVF+ L AL + PV +M V L L + LF
Sbjct: 874 ALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAAT-LFNQKNDVYF------ 926

Query: 566 LIGFLFLVALGVDYNIFLIHRIREDVGHHGH--RSGVL-SGLTSTGGVITGAGAVLAATF 622
++G L + L I ++ ++ + G L + ++ + A +
Sbjct: 927 MVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVL 986

Query: 623 --AALTSAPQVAFIEIGIVVAIGVLIDTFLVRSVLVPAL 659
A A A +GI V G++ T L VP
Sbjct: 987 PLAISNGAGSGAQNAVGIGVMGGMVSATLLAI-FFVPVF 1024



Score = 32.1 bits (73), Expect = 0.009
Identities = 20/121 (16%), Positives = 36/121 (29%), Gaps = 26/121 (21%)

Query: 564 VMLIGFLFLVALGVDYNIFL---------------------IHRIREDVGHHGHRSGVLS 602
V+L F L A G N + R+ + +
Sbjct: 375 VLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMME-DKLPPKEATEK 433

Query: 603 GLTSTGGVITGAGAVLAATF---AALTSAPQVAFIEIGIVVAIGVLIDTFLVRSVLVPAL 659
++ G + G VL+A F A + + + I + + + + L PAL
Sbjct: 434 SMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALI-LTPAL 492

Query: 660 A 660

Sbjct: 493 C 493


188  C5746_35595  C5746_35620  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_35595  -2      13       -0.923527  TetR/AcrR family transcriptional regulator
C5746_35600  -2      12       -1.893753  AMP-dependent synthetase
C5746_35605  -1      12       -0.837486  acyl-CoA dehydrogenase
C5746_35610  0       10       -0.485021  MFS transporter
C5746_35615  -1      9        0.005528   TetR family transcriptional regulator
C5746_35620  0       10       0.838199   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_35960  HTHTETR  58     2e-12    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 57.7 bits (139), Expect = 2e-12
Identities = 30/189 (15%), Positives = 63/189 (33%), Gaps = 19/189 (10%)

Query: 7 RRVRRTHRALRAALVDLVLEKGFHALSVEEIAERADVARATFYAHYRDKEDLLLGIVRDL 66
+ + T + + + L ++G + S+ EIA+ A V R Y H++DK DL I
Sbjct: 7 QEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI---W 63

Query: 67 AEDRERLLPAVEQAQAQGFTGLP------VLYIFRHAEQE---RRVYQVILRG------- 110
+ + QA+ ++++ E R + ++I
Sbjct: 64 ELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEM 123

Query: 111 EGDGRALREFSAIICKRVETVFRERAAQLGVTPRIPFDVIARAWTGELIGVLTWWVENDT 170
+A R R+E + + + A G + G++ W+
Sbjct: 124 AVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQ 183

Query: 171 GYSAAEITG 179
+ +
Sbjct: 184 SFDLKKEAR 192


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_35975  TCRTETB  38     5e-05    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 38.3 bits (89), Expect = 5e-05
Identities = 33/161 (20%), Positives = 71/161 (44%), Gaps = 10/161 (6%)

Query: 23 RYNSRRAWLITALIVAFMVINYADKSVLGLAAVPIMDELHISNSTYGLISSSFYLLFSLS 82
R+N WL ++ F V+N + VL ++ I ++ + ++ ++++F L FS+
Sbjct: 11 RHNQILIWL--CILSFFSVLN---EMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIG 65

Query: 83 GLLVGFASSRIS-SRMLLFTLTVLWAVAQLPVLFVAAVPTLIAGRVLLGAAEGPAAPMSM 141
+ G S ++ R+LLF + + + + + + LI R + GA + M
Sbjct: 66 TAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVM 125

Query: 142 HALYKWFPTTRR----GLPSALQIGGAALGTLISAPLLTWL 178
+ ++ P R GL ++ G +G I + ++
Sbjct: 126 VVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYI 166


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_35980  HTHTETR  55     8e-12    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 55.4 bits (133), Expect = 8e-12
Identities = 25/166 (15%), Positives = 56/166 (33%), Gaps = 7/166 (4%)

Query: 2 AERREELLRAAVEQIEVRGVAAVRIADVAAVLGVSNALVLYHFSTKEKLVAAAFAHAAEA 61
E R+ +L A+ +GV++ + ++A GV+ + +HF K L + + +
Sbjct: 10 QETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESN 69

Query: 62 DLAHLRKLLSRRTSAVR-RLRAAVRWY-APTGQAKGWRLWIEGWAASLRD----PALRNV 115
+ ++ LR + T + RL +E ++
Sbjct: 70 IGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQA 129

Query: 116 AGDLDQQWKAELAEVIEEGAAAGEFHCD-DPMSVAWRLTALLDGLA 160
+L + + + ++ A D A + + GL
Sbjct: 130 QRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLM 175


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
C5746_35985  SUBTILISIN  208    5e-62    Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 208 bits (531), Expect = 5e-62
Identities = 103/312 (33%), Positives = 153/312 (49%), Gaps = 39/312 (12%)

Query: 201 TLDKSVPQIGADRAHRAGITGKGTRVAVLDTGYDRDHPDLKSAVVASQDFTEDGD----- 255
+ + V I A G+G +VAVLDTG D DHPDLK+ ++ ++FT+D +
Sbjct: 21 EIPRGVEMIQAPAVWNQT-RGRGVKVAVLDTGCDADHPDLKARIIGGRNFTDDDEGDPEI 79

Query: 256 VQDMQGHGTHVSSIIAGSGAASEGRYAGVAPGAELVEGKVLNNDGYGYDSWILAGMQWAV 315
+D GHGTHV+ IA + + GVAP A+L+ KVLN G G WI+ G+ +A+
Sbjct: 80 FKDYNGHGTHVAGTIAATENENGV--VGVAPEADLLIIKVLNKQGSGQYDWIIQGIYYAI 137

Query: 316 DQDVKVVNMSLGSRVASDGTDPLSAAVDKLSAEHGTLFVIAAGNSGD-----RTISAPGA 370
+Q V +++MSLG + L AV K L + AAGN GD + PG
Sbjct: 138 EQKVDIISMSLGG---PEDVPELHEAVKKA-VASQILVMCAAGNEGDGDDRTDELGYPGC 193

Query: 371 ADAALTVGSVTKSGEMSAFTSRGPRQGRPAVKPEISAPGSDIVAARAAGTLDDAAVTERY 430
+ ++VG++ S F++ + ++ APG DI++ G Y
Sbjct: 194 YNEVISVGAINFDRHASEFSNSNN-------EVDLVAPGEDILSTVPGGK---------Y 237

Query: 431 ASLSGTSMASPHVAGAAALLAQR-----HPDWSGARLKAALIGSAAPVDGATVSDTGSGL 485
A+ SGTSMA+PHVAGA AL+ Q D + L A LI P+ + G+GL
Sbjct: 238 ATFSGTSMATPHVAGALALIKQLANASFERDLTEPELYAQLIKRTIPLGNSP-KMEGNGL 296

Query: 486 TDVPAALAAAVV 497
+ A + +
Sbjct: 297 LYLTAVEELSRI 308


189  C5746_36255  C5746_36290  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_36255  -1      11       -0.583933  TetR family transcriptional regulator
C5746_36260  -1      12       -0.303694  alpha/beta hydrolase
C5746_36265  -2      14       -1.368757  peptidase
C5746_36270  -1      14       -0.465316  hypothetical protein
C5746_36275  -2      14       0.326190   ATP-binding protein
C5746_36280  0       12       1.353011   histidine kinase
C5746_36285  -3      12       1.748511   fused response regulator/phosphatase
C5746_36290  -2      12       0.946302   short-chain dehydrogenase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_36745  HTHTETR  42     3e-07    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 42.3 bits (99), Expect = 3e-07
Identities = 24/184 (13%), Positives = 53/184 (28%), Gaps = 13/184 (7%)

Query: 3 RAGLTTERVVEAAADLADSIGFDKVTISALARGFGVKDASLYSHVRNLHDLRTRVALLAA 62
A T + +++ A L G ++ +A+ GV ++Y H ++ DL + + L+
Sbjct: 8 EAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSE 67

Query: 63 GEFIDRIAAAVAGRAGKDALVAFADAYRAFALERPGRYEATQMRIDPAVAARSSAYHRTI 122
+ A G V + R+ + + +
Sbjct: 68 SNIGELELEYQAKFPGDPLSVLREILIHVLES----TVTEERRRLLMEIIFHKCEFVGEM 123

Query: 123 ETTGAMLRAYGLAEPDLTDAVRLLRSTFHGYCALEASGGFGAPRDVQTSWERSVGALHFL 182
R + L+ + A + + G + L
Sbjct: 124 AVVQQAQRNL--CLESYDRIEQTLK-------HCIEAKMLPADLMTRRAAIIMRGYISGL 174

Query: 183 LEHW 186
+E+W
Sbjct: 175 MENW 178


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
C5746_36765  IGASERPTASE  33     0.002    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 33.1 bits (75), Expect = 0.002
Identities = 22/99 (22%), Positives = 31/99 (31%), Gaps = 1/99 (1%)

Query: 226 APKAGSKTGSTPAAQPKTAPRQKAKPRHQTTEPKQQPATSQHQAAKPRHQTTEPKQRPAT 285
AP S+T T A K + K TE Q + AK + A
Sbjct: 1029 APATPSETTETVAENSKQESKTVEKNEQDATETTAQ-NREVAKEAKSNVKANTQTNEVAQ 1087

Query: 286 SQHQAAKPQHRTTEPKQHPEKAESHKPQRAEKHSDTLVA 324
S + + Q T+ EK E K + + V
Sbjct: 1088 SGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVT 1126



Score = 31.6 bits (71), Expect = 0.008
Identities = 16/91 (17%), Positives = 29/91 (31%), Gaps = 5/91 (5%)

Query: 242 KTAPRQKAKPRHQTTEPKQQPATSQHQAAKPRHQTTEPKQRPATSQHQAAKPQHR----- 296
+T + + E K + T + Q PKQ + + A+P
Sbjct: 1094 ETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTV 1153

Query: 297 TTEPKQHPEKAESHKPQRAEKHSDTLVAPVA 327
+ Q + Q A++ S + PV
Sbjct: 1154 NIKEPQSQTNTTADTEQPAKETSSNVEQPVT 1184


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_36770  PF03544  31     0.006    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 31.1 bits (70), Expect = 0.006
Identities = 18/90 (20%), Positives = 25/90 (27%)

Query: 298 ADAEPGVGGIVEPAVDPARPAETQKVVILPEPAPPTPAAAVETIAETFSAAPVMPLERRL 357
EP V EP P P E V+ P+P P V+ + + +
Sbjct: 68 PPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVESRPAS 127

Query: 358 GQVEFGPGRSPHSGTAARRGGVSAEPAAPA 387
P R S A A+
Sbjct: 128 PFENTAPARPTSSTATAATSKPVTSVASGP 157


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_36780  PF06580  44     1e-06    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 43.7 bits (103), Expect = 1e-06
Identities = 19/119 (15%), Positives = 36/119 (30%), Gaps = 20/119 (16%)

Query: 401 LPEPTGDATALSMVWQNLIGNAVK--FRRPDVPCRITVGCVRDGDDWHFSVADNGIGIAP 458
+ D M+ Q L+ N +K + +I + +D V + G
Sbjct: 246 INPAIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTG----- 300

Query: 459 EFSQKVFVIFQRLHGREQYEGTGIGLA-LCRKIVEFHGGRIWLDPEAAEG-TCVRFTLP 515
L + E TG GL + ++ +G + +G +P
Sbjct: 301 -----------SLALKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
C5746_36785  HTHFIS  56     1e-10    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 55.6 bits (134), Expect = 1e-10
Identities = 30/136 (22%), Positives = 57/136 (41%), Gaps = 4/136 (2%)

Query: 44 ILLVEDDAGDALLVEEMLADSELDSALTWCRTLTEARRFLAGCRTPCCVLLDLHLSDVYG 103
IL+ +DDA ++ + L+ + D + R++A V+ D+ + D
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRI--TSNAATLWRWIAA-GDGDLVVTDVVMPDENA 62

Query: 104 LDAVTQIVESAPDAAIVVLTGRAEADTGLSAVATGAQDYLVKGRLDPEALGRSVRYALQR 163
D + +I ++ PD ++V++ + T + A GA DYL K D L + AL
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKP-FDLTELIGIIGRALAE 121

Query: 164 KQVERAAAALRANRLM 179
+ + + M
Sbjct: 122 PKRRPSKLEDDSQDGM 137


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_36790  DHBDHDRGNASE  37     1e-04    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 36.6 bits (84), Expect = 1e-04
Identities = 23/91 (25%), Positives = 36/91 (39%), Gaps = 4/91 (4%)

Query: 153 LTGRRALLTGGRAKIGMYIGLRLLRDGAHTTITTRFPNDAIRRFKAMPDSDEWIHRLKIV 212
+ G+ A +TG IG + L GAH P + ++
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFP-- 63

Query: 213 GIDLRDPAQVVALADSVAAE-GPLDILINNA 242
D+RD A + + + E GP+DIL+N A
Sbjct: 64 -ADVRDSAAIDEITARIEREMGPIDILVNVA 93


190  C5746_36350  C5746_36380  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_36350  1       12       -0.300975  aldo/keto reductase
C5746_36355  1       11       0.346034   L-rhamnose mutarotase
C5746_36360  -1      11       1.352483   amidohydrolase
C5746_36365  -1      10       1.658582   DNA-binding response regulator
C5746_36370  0       10       1.647861   hypothetical protein
C5746_36375  0       10       1.477668   cation acetate symporter
C5746_36380  1       10       2.065040   sensor histidine kinase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
C5746_36855  TYPE3IMSPROT  30     0.014    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 29.7 bits (67), Expect = 0.014
Identities = 16/64 (25%), Positives = 23/64 (35%), Gaps = 7/64 (10%)

Query: 249 YTAAPLALL----DRALRIKAVTEGHGVPLRAAALHYPLAHPAVAGVLVGTRSPDEVRDA 304
T PL + ++ + E GVP+ PLA LV P E +A
Sbjct: 277 ETPLPLVTFKYTDAQVQTVRKIAEEEGVPILQRI---PLARALYWDALVDHYIPAEQIEA 333

Query: 305 AALL 308
A +
Sbjct: 334 TAEV 337


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
C5746_36870  HTHFIS  49     9e-09    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 48.7 bits (116), Expect = 9e-09
Identities = 29/143 (20%), Positives = 49/143 (34%), Gaps = 15/143 (10%)

Query: 3 RVLAVDDEEPALEEL-LYLLRADPRIRSAEGATGATEALRRIGGAVDAGPDDPSAIDVVF 61
+L DD+ L L RA +R A + G D+V
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDG------------DLVV 52

Query: 62 LDIHMAGLTGLDVAQLLAGFAAPPLIVFVTAHEGF--AVHAFDLKAVDYVLKPVRRERLA 119
D+ M D+ + ++ ++A F A+ A + A DY+ KP L
Sbjct: 53 TDVVMPDENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELI 112

Query: 120 EAVRRVAEQVGDRSAPVLDTAND 142
+ R + R + + D + D
Sbjct: 113 GIIGRALAEPKRRPSKLEDDSQD 135


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_36875  TCRTETA  26     0.042    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 25.9 bits (57), Expect = 0.042
Identities = 22/73 (30%), Positives = 34/73 (46%), Gaps = 13/73 (17%)

Query: 35 TALGGAYVRSLMRSQLRAGLTAFTVLAAVVGTLPLVFEALHSAAL------VWAVLGFAA 88
A+ V + QL+ L A T L ++VG PL+F A+++A++ W + G A
Sbjct: 321 QAMLSRQVDEERQGQLQGSLAALTSLTSIVG--PLLFTAIYAASITTWNGWAW-IAGAAL 377

Query: 89 Y----PPLTLAAW 97
Y P L W
Sbjct: 378 YLLCLPALRRGLW 390


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_36885  PF06580  212    2e-67    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 212 bits (540), Expect = 2e-67
Identities = 70/229 (30%), Positives = 114/229 (49%), Gaps = 26/229 (11%)

Query: 175 QLELAELDRSR--TQLIEAEIRALRAQISPHFIFNSLAAIASFVRTDPEQARELLLEFAD 232
+ AE+D+ + + EA++ AL+AQI+PHF+FN+L I + + DP +ARE+L ++
Sbjct: 143 NYKQAEIDQWKMASMAQEAQLMALKAQINPHFMFNALNNIRALILEDPTKAREMLTSLSE 202

Query: 233 FTRYSFR-SHGDFTTLADELHSIDQYLALVRARFGERLAVTLQVAPEVLPVALPFLCLQP 291
RYS R S+ +LADEL +D YL L +F +RL Q+ P ++ V +P + +Q
Sbjct: 203 LMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPMLVQT 262

Query: 292 LVENAVKHGLEGAVTLPRAPGSVRAGETPTRITIRALDAGSEAEVVIEDDGTGMDPQRLR 351
LVEN +KHG+ +I ++ + +E+ G+
Sbjct: 263 LVENGIKHGIAQL-------------PQGGKILLKGTKDNGTVTLEVENTGSLALK---- 305

Query: 352 RILRGEGGKSTGIGLLNVDERLRQVYGDDYGLVIETGIGAGMKVTVRLP 400
+STG GL NV ERL+ +YG + + + G V +P
Sbjct: 306 -----NTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKV-NAMVLIP 348


191  C5746_37310  C5746_37325  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
C5746_37310  -1      12       1.830204   DUF1345 domain-containing protein
C5746_37315  -1      9        0.632121   argininosuccinate synthase
C5746_37320  -1      9        0.670600   two-component sensor histidine kinase
C5746_37325  -1      9        0.784307   DNA-binding response regulator
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
C5746_37930  RTXTOXINA  34     3e-04    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 34.2 bits (78), Expect = 3e-04
Identities = 12/36 (33%), Positives = 20/36 (55%)

Query: 6 ALSAVPRLVGSAVVGALIGAVVGVLTNTPLGILAGI 41
L++V + +A +L+GA V L GI++GI
Sbjct: 374 VLASVSSGISAAATTSLVGAPVSALVGAVTGIISGI 409


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
C5746_37935  MICOLLPTASE  30     0.031    Microbial collagenase metalloprotease (M9) signature.
		>MICOLLPTASE#Microbial collagenase metalloprotease (M9) signature.

Length = 1104

Score = 29.7 bits (66), Expect = 0.031
Identities = 22/159 (13%), Positives = 50/159 (31%), Gaps = 28/159 (17%)

Query: 79 EEGLAALTCGAFHIRSGGRAYFNTTPLGRAVTGTLLVRAMLEDNVQIWGDGSTFKGNDIE 138
EEG A A R+ G + G + R L + + +
Sbjct: 533 EEGTAEFF--AGSTRTDGIKPRKSVTQG--LAYDRNNRMSLYGVLH-----AKYGSW--- 580

Query: 139 RFYRYGLLA-----NPHLRIYKPWLD----------ADFVTELGGRKEMSEWLLAHDLPY 183
FY YG N ++ ++ + D++ + +++ +
Sbjct: 581 DFYNYGFALSNYMYNNNMGMFNKMTNYIKNNDVSGYKDYIASMSSDYGLNDKYQDYMDSL 640

Query: 184 RDSTEK-AYSTDANIWGATHEAKTLEHLDTGVETVEPIM 221
++ + ++ + HEAK + + ++ V I
Sbjct: 641 LNNIDNLDVPLVSDEYVNGHEAKDINEITNDIKEVSNIK 679


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
C5746_37940  PF06580  38     6e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 37.9 bits (88), Expect = 6e-05
Identities = 14/59 (23%), Positives = 22/59 (37%), Gaps = 2/59 (3%)

Query: 330 ERLTITVEDDGRTGGNHAHPGTGHGLIGMRERASTVGGCLYA--GERPEGGFTVTAELP 386
+T+ VE+ G + TG GL +RER + G +G +P
Sbjct: 290 GTVTLEVENTGSLALKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
C5746_37945  HTHFIS  65     2e-14    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 64.9 bits (158), Expect = 2e-14
Identities = 25/118 (21%), Positives = 43/118 (36%), Gaps = 4/118 (3%)

Query: 3 IRVLLADDQALLRGTFRMLFDSTDDMETVGEASNGQEALELARAERPHVVLMDIRMPEMD 62
+L+ADD A +R + V SN A +V+ D+ MP+ +
Sbjct: 4 ATILVADDDAAIRTVLNQAL--SRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 63 GLDATRLISESEDLSAVKVLILTTFEDDEHVAEALRAGASGFLGKGARPEELLDAVRT 120
D I + + VL+++ +A GA +L K EL+ +
Sbjct: 62 AFDLLPRI--KKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117



 
Contact Sachin Pundhir for Bugs/Comments.