PredictBias

Identification of genomic and pathogenicity islands in prokaryotic genomes
 
A) Input parameters
Genome: 2263.gbk
Threshold dinucleotide bias: 2
Threshold codon bias: 4
Threshold %GC bias: 3
E-value (RPSBlast): 0.05
Genome (non-pathogenic):
 
C) Potential GIs and PAIs in NC_007606
S.No  Start      End        Bias  Virulence  Insertion elements  Prediction
1     SDY_0010   SDY_0019   Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
SDY_0010   2       27       -0.450898  hypothetical protein
SDY_0011   2       25       -0.589636  hypothetical protein
SDY_0012   0       23       -0.441697  hypothetical protein
SDY_0013   0       22       -0.914055  molecular chaperone DnaK
SDY_0014   -2      16       -2.632286  molecular chaperone DnaJ
SDY_0015   0       21       -4.340386  Gef protein
SDY_0017a  0       18       -4.095268  insertion element iso-IS1N protein InsA
SDY_0018   0       17       -3.675240  pH-dependent sodium/proton antiporter
SDY_0019   -1      19       -4.553865  transcriptional activator NhaR
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SDY_0011   PF07201       29     0.018    Hypersensitivity response secretion protein HrpJ
		>PF07201#Hypersensitivity response secretion protein HrpJ

Length = 293

Score = 28.7 bits (64), Expect = 0.018
Identities = 9/51 (17%), Positives = 24/51 (47%)

Query: 138 LHAVDAKVNELEELLPLLMKDKLLAKGVSHLLSSQLTRILRTHAAMSVLGH 188
+ V+ +VN+ +P L + + +++ +S L +S + + A +
Sbjct: 80 VSDVEEQVNQYLSKVPELEQKQNVSELLSLLSNSPNISLSQLKAYLEGKSE 130
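
The "Score = ..., Expect = ..." summary lines in the alignment blocks can be checked against the E-value (RPSBlast) cutoff of 0.05 listed under the input parameters. The following is a minimal Python sketch and not part of PredictBias itself: the function names `parse_score_line` and `passes_cutoff` are hypothetical, and the sketch assumes a hit is kept when its E-value is at or below the cutoff. It also handles the abbreviated exponent form (for example `e-159`) that appears in some blocks.

```python
import re

# Matches BLAST-style summary lines such as
# "Score = 28.7 bits (64), Expect = 0.018" or
# "Score = 440 bits (1133), Expect = e-159".
SCORE_LINE = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)"
)


def parse_score_line(line):
    """Return (bit_score, raw_score, e_value), or None if the line does not match."""
    match = SCORE_LINE.search(line)
    if match is None:
        return None
    bits, raw, expect = match.groups()
    # The report abbreviates 1e-159 as "e-159"; float() needs the leading digit.
    if expect.startswith("e"):
        expect = "1" + expect
    return float(bits), int(raw), float(expect)


def passes_cutoff(line, cutoff=0.05):
    """Assumption: a hit is retained when its E-value is <= the cutoff."""
    parsed = parse_score_line(line)
    return parsed is not None and parsed[2] <= cutoff
```

For the block above, `passes_cutoff("Score = 28.7 bits (64), Expect = 0.018")` evaluates to `True`, since 0.018 is below the 0.05 cutoff.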


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SDY_0013   SHAPEPROTEIN  142    7e-40    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 142 bits (361), Expect = 7e-40
Identities = 83/387 (21%), Positives = 149/387 (38%), Gaps = 84/387 (21%)

Query: 5 IGIDLGTTNSCVAIMDGTTPRVLENAEGDRTTPSIIAYTQDGET------LVGQPAKRQA 58
+ IDLGT N+ + + + E PS++A QD VG AK+
Sbjct: 13 LSIDLGTANTLIYVKGQGIV-LNE--------PSVVAIRQDRAGSPKSVAAVGHDAKQML 63

Query: 59 VTNPQNTLFAIKRLIGRRFQDEEVQRDVSIMPFKIIAADNGDAWVEVKGQKMAPPQISAE 118
P N + AI+ + +D I F + +
Sbjct: 64 GRTPGN-IAAIRPM-----------KDGVIADFFVTEK------------------MLQH 93

Query: 119 VLKKMKKTAEDYLGEPVTEAVITVPAYFNDAQRQATKDAGRIAGLEVKRIINEPTAAALA 178
+K++ + P ++ VP +R+A +++ + AG +I EP AAA+
Sbjct: 94 FIKQVHS---NSFMRPSPRVLVCVPVGATQVERRAIRESAQGAGAREVFLIEEPMAAAIG 150

Query: 179 YGL--DKGTGNRTIAVYDLGGGTFDISIIEIDEVDGEKTFEVLATNGDTHLGGEDFDSRL 236
GL + TG+ V D+GGGT ++++I ++ V + +GG+ FD +
Sbjct: 151 AGLPVSEATGS---MVVDIGGGTTEVAVISLNGV---------VYSSSVRIGGDRFDEAI 198

Query: 237 INYLVEEFKKDQGIDLRNDPLAMQRLKEAAEKAKIELSSA----QQTDVNLPYITADATG 292
INY+ + G + AE+ K E+ SA + ++ +
Sbjct: 199 INYVRRNYGSLIG-------------EATAERIKHEIGSAYPGDEVREIEVRGRNLAEGV 245

Query: 293 PKHMNIKVTRAKLESLVEDLVNRSIEPLKVALQD-AGLSVSDIDD--VILVGGQTRMPMV 349
P+ + + LE+L E + + + VAL+ SDI + ++L GG + +
Sbjct: 246 PRGFTLN-SNEILEALQEP-LTGIVSAVMVALEQCPPELASDISERGMVLTGGGALLRNL 303

Query: 350 QKKVAEFFGKEPRKDVNPDEAVAIGAA 376
+ + E G +P VA G
Sbjct: 304 DRLLMEETGIPVVVAEDPLTCVARGGG 330


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SDY_0015   HOKGEFTOXIC   59     2e-16    Hok/Gef cell toxic protein family signature.
		>HOKGEFTOXIC#Hok/Gef cell toxic protein family signature.

Length = 52

Score = 59.5 bits (144), Expect = 2e-16
Identities = 17/46 (36%), Positives = 30/46 (65%)

Query: 23 HKAMIVALIVICITAVVAAQVTRKDLCEVHIRTGQTEIAVFTAYES 68
+++ ++++C+T ++ +TRK LCE+ R G E+A F AYES
Sbjct: 5 RSSLVWCVLIVCLTLLIFTYLTRKSLCEIRYRDGYREVAAFMAYES 50


2     SDY_0084a  SDY_0094   Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
SDY_0084a  -1      17       3.368745   insertion element iso-IS1N protein InsA
SDY_0085   0       15       3.515714   insertion element IS1 protein InsA
SDY_0086   -1      16       3.840479   insertion element iso-IS1d protein InsB
SDY_0088   0       16       4.472090   L-ribulose-5-phosphate 4-epimerase
SDY_0089   0       16       4.132357   L-arabinose isomerase
SDY_0090   1       15       4.184618   ribulokinase
SDY_0092   0       15       3.496405   hypothetical protein
SDY_0093   1       15       3.253231   thiamine ABC transporter ATP-binding protein
SDY_0094   1       16       3.201204   thiamine ABC transporter permease
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SDY_0094   PF06580       34     0.001    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 34.5 bits (79), Expect = 0.001
Identities = 19/80 (23%), Positives = 30/80 (37%), Gaps = 5/80 (6%)

Query: 4 RRQPLIPGWLILGVSAATLVVAVALAAFLALWWNAPQGDWSAVWRDS-YLWHVVRFSFWQ 62
R GWL L + L V A +W+ A +++WR ++
Sbjct: 60 RSFIKRQGWLKLNMGQIILRVLPACVVIGMVWFVAN----TSIWRLLAFINTKPVAFTLP 115

Query: 63 AFLSALLSVVPAIFLARALY 82
LS + +VV F+ LY
Sbjct: 116 LALSIIFNVVVVTFMWSLLY 135


3     SDY_0219   SDY_0252   Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
SDY_0219   -2      22       -3.732227  D,D-heptose 1,7-bisphosphate phosphatase
SDY_0226   -2      24       -3.763945  ***2,5-diketo-D-gluconate reductase B
SDY_0227   -1      24       -3.153102  LysR family transcriptional regulator
SDY_0228   -1      23       -2.843289  hypothetical protein
SDY_0230   -1      22       -2.923276  membrane-bound lytic murein transglycosylase D
SDY_0231   -1      24       -3.776938  hydroxyacylglutathione hydrolase
SDY_0232   -2      20       -3.269815  hypothetical protein
SDY_0233   -3      18       -3.029468  ribonuclease H
SDY_0234   -3      23       -3.294244  DNA polymerase III subunit epsilon
SDY_0236   0       24       -3.463382  *insertion element iso-IS1d protein InsB
SDY_0237   0       22       -2.359451  insertion element IS1 protein InsA
SDY_0238   0       20       -2.400301  hypothetical protein
SDY_0241a  -1      15       -0.443327  insertion element iso-IS1N protein InsA
SDY0242a   0       14       0.201113   insertion element iso-IS1N protein InsA
SDY_0242   0       15       -0.307963  insertion element iso-IS1n protein InsB
SDY_0245   1       16       -1.196660  hypothetical protein
SDY_0246   2       14       -2.456163  C-lysozyme inhibitor
SDY_0248   3       17       -1.885798  phosphoheptose isomerase
SDY_0249   3       19       -2.486921  amidotransferase
SDY_0250   2       25       -3.664786  hypothetical protein
SDY_0251   3       27       -3.229773  hypothetical protein
SDY_0252   3       26       -3.051275  damage-inducible protein J
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SDY_0251   ENTSNTHTASED  27     0.012    Enterobactin synthetase component D signature.
		>ENTSNTHTASED#Enterobactin synthetase component D signature.

Length = 234

Score = 26.5 bits (58), Expect = 0.012
Identities = 6/23 (26%), Positives = 10/23 (43%)

Query: 45 AVYKDHPLQGSWKGYRDAHVEPD 67
+VYK + + G+ A V
Sbjct: 153 SVYKAFSDRVTLPGFNSAKVTSL 175


4     SDY_0263a  SDY_0276   Y     N          N                   Genomic Island
LocusTag   DNBias  CDNBias  %GCBias    Product
SDY_0263a  1       14       3.522902   insertion element iso-IS1N protein InsA
SDY_0261   1       16       3.672392   insertion element IS1 protein InsB
SDY_0267   1       16       3.761076   insertion element iso-IS1d protein InsB
SDY_0268   1       13       3.957974   membrane protein FdrA
SDY_0269   1       13       4.163297   hypothetical protein
SDY_0270   1       17       3.593877   carboxylase
SDY_0272   1       16       2.075332   insertion element iso-IS1d protein InsB
SDY_0273   2       19       2.171729   insertion element IS1 protein InsA
SDY_0274   2       18       2.255597   phosphoribosylaminoimidazole carboxylase ATPase
SDY_0275   2       19       1.747161   AIR carboxylase, catalytic subunit
SDY_0276   2       17       1.012558   UDP-2,3-diacylglucosamine hydrolase
5     SDY_0444   SDY_0458   Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
SDY_0444   2       20       2.198886   ferrochelatase
SDY_0445   3       25       2.236509   adenylate kinase
SDY_0446   3       20       2.391122   heat shock protein 90
SDY_0447   3       15       3.812532   recombination protein RecR
SDY_0448   4       14       3.316225   hypothetical protein
SDY_0449   3       14       1.572321   DNA polymerase III subunits gamma and tau
SDY_0450   1       13       -0.655511  adenine phosphoribosyltransferase
SDY_0451   1       11       -0.479273  hypothetical protein
SDY_0452   0       14       0.138339   primosomal replication protein N''
SDY_0453   1       13       -0.689974  hypothetical protein
SDY_0454   0       14       -0.901188  potassium efflux protein KefA
SDY_0455   1       14       -1.365642  DNA-binding transcriptional repressor AcrR
SDY_0456   2       15       -1.092784  acridine efflux pump
SDY_0457   1       15       -1.630472  acridine efflux pump
SDY_0458   0       26       -5.485857  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SDY_0446   FRAGILYSIN    32     0.009    Fragilysin metallopeptidase (M10C) enterotoxin signat...
		>FRAGILYSIN#Fragilysin metallopeptidase (M10C) enterotoxin signature.

Length = 405

Score = 31.6 bits (71), Expect = 0.009
Identities = 24/108 (22%), Positives = 46/108 (42%), Gaps = 12/108 (11%)

Query: 422 RMKEGQEK--IYYITADSYAAAKSSPHLELLRKKGIEVLLLSDRIDEWMMNYLTEFDGKP 479
R+ G++K +I D +A + + G + ++ + + MMN + EF P
Sbjct: 99 RLFNGRDKDSTSFILGDEFAVLR-------FYRNGESISYIAYK-EAQMMNEIAEFYAAP 150

Query: 480 FQSVSKV--DESLEKLADEVDESAKEAEKALTPFIDRVKALLGERVKD 525
F+ + E+ E + D SA + ++ ID+ K +L D
Sbjct: 151 FKKTRAINEKEAFECIYDSRTRSAGKDIVSVKINIDKAKKILNLPECD 198


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SDY_0449   IGASERPTASE   40     4e-05    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 39.7 bits (92), Expect = 4e-05
Identities = 40/251 (15%), Positives = 77/251 (30%), Gaps = 31/251 (12%)

Query: 404 PLPETTSQVLAARQQLQRVQGATKAKKSEPAA----ATCARPVNNAALERLASVTDRVQA 459
P E +Q + + + P+ AR + A + A T
Sbjct: 983 PEVEKRNQTVDTTN----ITTPNNIQADVPSVPSNNEEIARV-DEAPVPPPAPATPSETT 1037

Query: 460 RPVPSALEKAPAKKEAYRWKATTPVMQQKE--------VVATPKALKKA---LEHEKTPE 508
V ++ E AT Q +E V A + + A E ++T
Sbjct: 1038 ETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQT 1097

Query: 509 LAAKLAA---------EAIERDPWAAQVSQLSLPKLVEQVALNAWKE-ERDNAVCLRLRS 558
K A E+ +V+ PK + + E R+N + ++
Sbjct: 1098 TETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKE 1157

Query: 559 SQRHLNNRGAQQKLAEALS-MLKGSTVELTIVEDDNPAVRTPLEWRQSIYEEKLAQARES 617
Q N ++ A+ S ++ E T V N V P + + + +
Sbjct: 1158 PQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSN 1217

Query: 618 IIADNNIQTLR 628
+ + +++R
Sbjct: 1218 KPKNRHRRSVR 1228


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SDY_0454   RTXTOXIND     32     0.015    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 31.7 bits (72), Expect = 0.015
Identities = 19/125 (15%), Positives = 40/125 (32%), Gaps = 6/125 (4%)

Query: 28 QNTAFARASSNGDLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDKIDRVKEE 87
N RA L + + L L+ + A L++ ++ E
Sbjct: 207 LNLDKKRAERLTVLARINRYENLSRVEKSR--LDDFSSLLHKQAIAKHAVLEQENKYVEA 264

Query: 88 TVQLRQKVAEAPEKMRQATAALTALSDVDND--EETRKIL--STLSLRQLETRVAQALDD 143
+LR ++ + + +A V E L +T ++ L +A+ +
Sbjct: 265 VNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEER 324

Query: 144 LQNAQ 148
Q +
Sbjct: 325 QQASV 329


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SDY_0455   HTHTETR       222    5e-76    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 222 bits (567), Expect = 5e-76
Identities = 215/215 (100%), Positives = 215/215 (100%)

Query: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60
MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS
Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60

Query: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120
EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV
Sbjct: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120

Query: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLF 180
GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLF
Sbjct: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLF 180

Query: 181 APQSFDLKKEARDYVAILLEMYLLCPTLRNPATNE 215
APQSFDLKKEARDYVAILLEMYLLCPTLRNPATNE
Sbjct: 181 APQSFDLKKEARDYVAILLEMYLLCPTLRNPATNE 215


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SDY_0456   RTXTOXIND     44     7e-07    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 44.0 bits (104), Expect = 7e-07
Identities = 34/209 (16%), Positives = 73/209 (34%), Gaps = 17/209 (8%)

Query: 100 TYQATYDSAKGDLAKAQAAANIAQLTVNRYQKLLGTQYISKQEYDQALADAQQANAAVTA 159
+ Y A +L + + Q+ Q +++ ++ L +Q +
Sbjct: 256 EQENKYVEAVNELR--VYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGL 313

Query: 160 AKAAVETARINLAYTKVTSPISGRIGKSNV-TEGALVQNGQATALATVQQLDPIYVDVTQ 218
+ + + +P+S ++ + V TEG +V + T + V + D + V
Sbjct: 314 LTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAE-TLMVIVPEDDTLEVTALV 372

Query: 219 SSNDFLRLKQELANGML-----KQENGK--AKVSLITSDGIKFPQDGTLEFSDVTVDQTT 271
+ D + + G KV I D I+ + G + +++++
Sbjct: 373 QNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNVIISIEENC 432

Query: 272 GSITLRAIFPNPDHTLLPGMFVRARLEEG 300
S + I L GM V A ++ G
Sbjct: 433 LSTGNKNIP------LSSGMAVTAEIKTG 455



Score = 34.4 bits (79), Expect = 8e-04
Identities = 24/125 (19%), Positives = 43/125 (34%), Gaps = 13/125 (10%)

Query: 49 PLQITTELPGR-TSAYRIAEVRPQVSGIILKRNFKEGSDIEAGVSLYQIDPATYQATYDS 107
++I G+ T + R E++P + I+ + KEG + G L ++ +A
Sbjct: 79 QVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEA---- 134

Query: 108 AKGDLAKAQAAANIAQLTVNRYQKLLGTQYISKQEYDQALADAQQANAAVTAAKAAVETA 167
D K Q++ A+L RYQ L E ++
Sbjct: 135 ---DTLKTQSSLLQARLEQTRYQILS-----RSIELNKLPELKLPDEPYFQNVSEEEVLR 186

Query: 168 RINLA 172
+L
Sbjct: 187 LTSLI 191


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SDY_0457   ACRIFLAVINRP  1365   0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 1365 bits (3534), Expect = 0.0
Identities = 798/1033 (77%), Positives = 913/1033 (88%), Gaps = 1/1033 (0%)

Query: 1 MPNFFIDRPIFAWVIAIIIMLAGGLAILKLPVAQYPTIAPPAVTISASYPGADAKTVQDT 60
M NFFI RPIFAWV+AII+M+AG LAIL+LPVAQYPTIAPPAV++SA+YPGADA+TVQDT
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 61 VTQVIEQNMNGIDNLMYMSSNSDSTGTVQITLTFESGTDADIAQVQVQNKLQLAMPLLPQ 120
VTQVIEQNMNGIDNLMYMSS SDS G+V ITLTF+SGTD DIAQVQVQNKLQLA PLLPQ
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 121 EVQQQGVSVEKSSSSFLMVVGVINTDGTMTQEDISDYVATNMKDAISRTSGVGDVQLFGS 180
EVQQQG+SVEKSSSS+LMV G ++ + TQ+DISDYVA+N+KD +SR +GVGDVQLFG+
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 181 QYAMRIWMNPNELNKFQLTPVDVITAIKAQNAQVAAGQLGGTPPVKGQQLNASIIAQTRL 240
QYAMRIW++ + LNK++LTPVDVI +K QN Q+AAGQLGGTP + GQQLNASIIAQTR
Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240

Query: 241 TSTEEFGKILLKVNQDGSRVLLRDVAKIELGGENYDIIAEFNGQPASGLGIKLATGANAL 300
+ EEFGK+ L+VN DGS V L+DVA++ELGGENY++IA NG+PA+GLGIKLATGANAL
Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300

Query: 301 DTAAAIRAELAKMEPFFPSGLKIVYPYDTTPFVKISIHEVVKTLVEAIILVFLVIYLFLQ 360
DTA AI+A+LA+++PFFP G+K++YPYDTTPFV++SIHEVVKTL EAI+LVFLV+YLFLQ
Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360

Query: 361 NFRATLIPTIAVPVVLLGTFAVLAAFGFSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420
N RATLIPTIAVPVVLLGTFA+LAAFG+SINTLTMFGMVLAIGLLVDDAIVVVENVERVM
Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420

Query: 421 AEEGLPPKEATRKSMGQIQGALVGIAMVLSAVFVPMAFFGGSTGAIYRQFSITIVSAMAL 480
E+ LPPKEAT KSM QIQGALVGIAMVLSAVF+PMAFFGGSTGAIYRQFSITIVSAMAL
Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480

Query: 481 SVLVALILTPALCATMLKPIAKGDHGEGKKGFFGWFNRMFEKSTHHYTDSVGGILRSTGR 540
SVLVALILTPALCAT+LKP++ H E K GFFGWFN F+ S +HYT+SVG IL STGR
Sbjct: 481 SVLVALILTPALCATLLKPVSAE-HHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539

Query: 541 YLVLYLIIVVGMAYLFVRLPSSFLPDEDQGVFMTMVQLPAGATQERTQKVLNEVTHYYLT 600
YL++Y +IV GM LF+RLPSSFLP+EDQGVF+TM+QLPAGATQERTQKVL++VT YYL
Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599

Query: 601 KEKNNVESVFAVNGFGFAGRGQNTGIAFVSLKDWADRPGEENKVEAITMRATRAFSQIKD 660
EK NVESVF VNGF F+G+ QN G+AFVSLK W +R G+EN EA+ RA +I+D
Sbjct: 600 NEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRD 659

Query: 661 AMVFAFNLPAIVELGTATGFDFELIDQAGLGHEKLTQARNQLLAEAAKHSDMLTSVRPNG 720
V FN+PAIVELGTATGFDFELIDQAGLGH+ LTQARNQLL AA+H L SVRPNG
Sbjct: 660 GFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNG 719

Query: 721 LEDTPQFKIDIDQEKAQALGVSINDINTTLGAAWGGSYVNDFIDRGRVKKVYVMSEAKYR 780
LEDT QFK+++DQEKAQALGVS++DIN T+ A GG+YVNDFIDRGRVKK+YV ++AK+R
Sbjct: 720 LEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFR 779

Query: 781 MLPDDIGNWYVRAADGQMVPFSAFSSSRWEYGSPRLERYNGLPSMEILGQAAPGKSTGEA 840
MLP+D+ YVR+A+G+MVPFSAF++S W YGSPRLERYNGLPSMEI G+AAPG S+G+A
Sbjct: 780 MLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDA 839

Query: 841 MELMEQLASKLPTGVGYDWTGMSYQERLSGNQAPSLYAISLIVVFLCLAALHESWSIPFS 900
M LME LASKLP G+GYDWTGMSYQERLSGNQAP+L AIS +VVFLCLAAL+ESWSIP S
Sbjct: 840 MALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVS 899

Query: 901 VMLVVPLGVIGALLAATFRGLTNDVYFQVGLLTTIGLSAKNAILIVEFAKDLMDKEGKGL 960
VMLVVPLG++G LLAAT NDVYF VGLLTTIGLSAKNAILIVEFAKDLM+KEGKG+
Sbjct: 900 VMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGV 959

Query: 961 IEATLDAVRMRLRPILMTSLAFILGVMPLVISTGAGSGAQNAVGTGVMGGMVTATVLAIF 1020
+EATL AVRMRLRPILMTSLAFILGV+PL IS GAGSGAQNAVG GVMGGMV+AT+LAIF
Sbjct: 960 VEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIF 1019

Query: 1021 FVPVFFVVVRHRF 1033
FVPVFFVV+R F
Sbjct: 1020 FVPVFFVVIRRCF 1032


6     SDY_0493   SDY_0504   Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
SDY_0493   3       19       -1.118035  hypothetical protein
SDY_0494   6       22       -1.163052  carboxylate-amine ligase
SDY_0495a  10      31       -1.441748  toxic polypeptide
SDY_0495   9       30       -1.096614  insertion element IS2 transposase InsD
SDY_0496   8       29       -1.374296  insertion sequence element IS2 repressor TnpA
SDY_0497   5       24       -0.963603  hypothetical protein
SDY_0498   0       18       1.714071   hypothetical protein
SDY_0499   -1      18       2.677045   insertion element IS1 protein InsA
SDY_0500   0       19       3.154412   insertion element iso-IS1d protein InsB
SDY_0501   0       18       3.037657   delta-aminolevulinic acid dehydratase
SDY_0502   0       22       3.233923   taurine dioxygenase
SDY_0503   2       21       3.277011   taurine transporter subunit
SDY_0504   2       17       1.315763   taurine ABC transporter ATP-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SDY_0495a  HOKGEFTOXIC   56     4e-15    Hok/Gef cell toxic protein family signature.
		>HOKGEFTOXIC#Hok/Gef cell toxic protein family signature.

Length = 52

Score = 55.6 bits (134), Expect = 4e-15
Identities = 17/50 (34%), Positives = 26/50 (52%)

Query: 1 MLAKYALVAVIVLCLTVPGFTLLVGDSLCEFTVKERDIEFRAVLAYEPKK 50
+ + V+++CLT+ FT L SLCE ++ E A +AYE K
Sbjct: 3 LPRSSLVWCVLIVCLTLLIFTYLTRKSLCEIRYRDGYREVAAFMAYESGK 52


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SDY_0497   PRTACTNFAMLY  30     0.016    Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 30.0 bits (67), Expect = 0.016
Identities = 22/93 (23%), Positives = 37/93 (39%), Gaps = 9/93 (9%)

Query: 214 TINGNGDNDNTASIEAGQNEVDNNGDHVAAATGNYKVRIDNATGAGSIADYNGNELIYVN 273
T+ G+G + G ++ A+G +++ + N+ GS L+
Sbjct: 477 TLAGSGLFRMNVFADLGLSDKLVVMQD---ASGQHRLWVRNS---GSEPASANTLLLVQT 530

Query: 274 DKNSNATFSAVN---KADLGAYTYQAEQRGNTV 303
S ATF+ N K D+G Y Y+ GN
Sbjct: 531 PLGSAATFTLANKDGKVDIGTYRYRLAANGNGQ 563


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SDY_0498   PRTACTNFAMLY  80     2e-19    Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 79.7 bits (196), Expect = 2e-19
Identities = 53/228 (23%), Positives = 94/228 (41%), Gaps = 8/228 (3%)

Query: 2 VGVDTKIDGNNAKWIVGAAAGFAKGDMN---DRSGQVDQDSQTAYIYSSAHFANNVF-VD 57
+G D + +W +G AG+ +GD D G D Y + + A++ F +D
Sbjct: 677 LGADHAVAVAGGRWHLGGLAGYTRGDRGFTGDGGGHTDSVHVGGY---ATYIADSGFYLD 733

Query: 58 GSLSYSHFNNDLSASMSNGTYVDGSTNSDAWGFGLKAGYDFKLGDAGYVTPYGSISGLFQ 117
+L S ND + S+G V G + G L+AG F D ++ P ++
Sbjct: 734 ATLRASRLENDFKVAGSDGYAVKGKYRTHGVGASLEAGRRFTHADGWFLEPQAELAVFRA 793

Query: 118 SGDDYQLSNDMKVDGQSYDSMRYELGVDAGYTFTYSEDQALTPYFKQAYVYD-DSNNDND 176
G Y+ +N ++V + S+ LG++ G + + + PY K + + + D
Sbjct: 794 GGGAYRAANGLRVRDEGGSSVLGRLGLEVGKRIELAGGRQVQPYIKASVLQEFDGAGTVH 853

Query: 177 VNGDSIDNGTEGSAVRVGLGTQFSFTKNFSAYTDANYLGGGDVDQDWS 224
NG + G+ +GLG + + S Y Y G + W+
Sbjct: 854 TNGIAHRTELRGTRAELGLGMAAALGRGHSLYASYEYSKGPKLAMPWT 901


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SDY_0501   BINARYTOXINB  30     0.019    Binary toxin B family signature.
		>BINARYTOXINB#Binary toxin B family signature.

Length = 764

Score = 29.7 bits (66), Expect = 0.019
Identities = 19/69 (27%), Positives = 30/69 (43%)

Query: 265 DIVRELRERTELPIGAYQVSGEYAMIKFAALAGAIDEEKVVLESLGSIKRAGADLIFSYF 324
+ EL + +L + QV G A F +D E L I+ A +IF+
Sbjct: 466 NQFLELEKTKQLRLDTDQVYGNIATYNFENGRVRVDTGSNWSEVLPQIQETTARIIFNGK 525

Query: 325 ALDLAEKKI 333
L+L E++I
Sbjct: 526 DLNLVERRI 534


7     SDY_0518   SDY_0535   Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
SDY_0518   2       15       2.677045   ferric enterobactin transport protein FepE
SDY_0519   1       16       4.850151   iron-enterobactin ABC transporter ATP-binding
SDY_0520   0       16       4.895619   iron-enterobactin ABC transporter permease
SDY_0521   0       16       4.346534   iron-enterobactin ABC transporter permease
SDY_0522   -1      16       3.882038   enterobactin exporter EntS
SDY_0523   -2      16       3.686980   iron-enterobactin ABC transporter
SDY_0524   -2      18       3.695705   isochorismate hydroxymutase
SDY_0525   -1      19       3.548623   enterobactin synthase subunit E
SDY_0526   0       18       2.006240   2,3-dihydro-2,3-dihydroxybenzoate synthetase
SDY_0527   1       19       1.279363   2,3-dihydroxybenzoate-2,3-dehydrogenase
SDY_0528   -1      19       -0.000104  hypothetical protein
SDY_0530   -2      21       -1.287821  hypothetical protein
SDY_0531   -2      20       -1.379059  hypothetical protein
SDY_0534   0       24       -3.008358  hypothetical protein
SDY_0535   2       22       -2.187275  insertion sequence element IS600 transposase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SDY_0522   TCRTETA       33     0.002    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 33.3 bits (76), Expect = 0.002
Identities = 81/393 (20%), Positives = 144/393 (36%), Gaps = 38/393 (9%)

Query: 27 FISIVSLGLLGVAVPVQIQMMTHSTWQV---GLSVTLTGGAMFVGLMVGGVLADRYERKK 83
+ V +GL+ +P ++ + HS G+ + L F V G L+DR+ R+
Sbjct: 15 ALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRP 74

Query: 84 VILLARGTCGIGFIGLCLNALL--PEPSLLAIYLLGLWDGFFASLGVTALLAATPALVGR 141
V+L + G ++ + P L +Y+ + G + G A A +
Sbjct: 75 VLL-------VSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVA-GAYIADITDG 126

Query: 142 ENLMQAGAITMLTVRLGSVNSPMIGGLLLAIGGVAWNYGLAAAGTFITLLPLLSLPALPP 201
+ + G V P++GGL+ GG + + AA L L LP
Sbjct: 127 DERARHFGFMSACFGFGMVAGPVLGGLM---GGFSPHAPFFAAAALNGLNFLTGCFLLPE 183

Query: 202 PPQPREHPLK----SLLAGFRFLLASPLVGGIALLGGLLTMAS----AVRVLYPALADNW 253
+ PL+ + LA FR+ +V + + ++ + A+ V++ D +
Sbjct: 184 SHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFG--EDRF 241

Query: 254 QMSAAQIGFLYAAIP-LGAAIGALTSGKLAHSARPGLLMLLSTLGS---FLAIGLFGLMP 309
A IG AA L + A+ +G +A ++L + ++ +
Sbjct: 242 HWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGW 301

Query: 310 MWILGVVCLALFGWLSAVSSLLQYTMLQTQTPEVMLGRINGLWTAQNVTGDAIGAALLGG 369
M +V LA G ML Q E G++ G A +G L
Sbjct: 302 MAFPIMVLLASGGIGMPALQ----AMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTA 357

Query: 370 LGAMMTPVASASASGFGLLIIGVLLLLVLVELR 402
+ A + + +G+ + L LL L LR
Sbjct: 358 IYA----ASITTWNGWAWIAGAALYLLCLPALR 386


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SDY_0523   FERRIBNDNGPP  63     1e-13    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 63.4 bits (154), Expect = 1e-13
Identities = 45/217 (20%), Positives = 84/217 (38%), Gaps = 17/217 (7%)

Query: 105 EPSAEAVAAQMPDLILISATGGDSALALYDQLSTIAPTLIINYDDKSWQAL-----LTQL 159
EP+ E + P ++ SA G S + L+ IAP N+ D LT++
Sbjct: 86 EPNLELLTEMKPSFMVWSAGYGPS----PEMLARIAPGRGFNFSDGKQPLAMARKSLTEM 141

Query: 160 GEITGHEKQAAERIAQFDKQLAAAKEQIKLPPQPVTAIVYTAAAHSANLWTPESAQGQML 219
++ + A +AQ++ + + K + + ++ P S ++L
Sbjct: 142 ADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDPRHMLVFGPNSLFQEIL 201

Query: 220 EQLGFTPAKLPAGLNASQSQGKRHDIIQLGGENLAAGLNGESLFLFAGDQKDADAIYANP 279
++ G NA Q + + + LAA + + L + KD DA+ A P
Sbjct: 202 DEYGIP--------NAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDHDNSKDMDALMATP 253

Query: 280 LLAHLPAVQNKQVYALGTETFRLDYYSAMQVLDRLNS 316
L +P V+ + + F SAM + L++
Sbjct: 254 LWQAMPFVRAGRFQRVPAVWFYGATLSAMHFVRVLDN 290


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SDY_0526   ISCHRISMTASE  440    e-159    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 440 bits (1133), Expect = e-159
Identities = 144/299 (48%), Positives = 193/299 (64%), Gaps = 18/299 (6%)

Query: 1 MAIPKLQAYALPESHDITQNKVDWAFEPQRAALLIHDMQDYFVSFWGENCPMMEQVIANI 60
MAIP +Q Y +P + D+ QNKV W +P RA LLIHDMQ+YFV + + ++ ANI
Sbjct: 1 MAIPAIQPYQMPTASDMPQNKVSWVPDPNRAVLLIHDMQNYFVDAFTAGASPVTELSANI 60

Query: 61 AALRDYCKQHNIPVYYTAQPKEQSDEDRALLNDMWGPGLTRSPEQQKVVDRLTPDADDTV 120
L++ C Q IPV YTAQP Q+ +DRALL D WGPGL P ++K++ L P+ DD V
Sbjct: 61 RKLKNQCVQLGIPVVYTAQPGSQNPDDRALLTDFWGPGLNSGPYEEKIITELAPEDDDLV 120

Query: 121 LVKWRYSAFHRSPLEQMLKESGRNQLIITGVYAHIGCMTTATDAFMRDIKPFMVADALAD 180
L KWRYSAF R+ L +M+++ GR+QLIITG+YAHIGC+ TA +AFM DIK F V DA+AD
Sbjct: 121 LTKWRYSAFKRTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVAD 180

Query: 181 FSRDEHLMSLKYVAGRSGRVVMTEELL------PAPIPASKA-----------ALREVIL 223
FS ++H M+L+Y AGR VMT+ LL PA + + A +R+ I
Sbjct: 181 FSLEKHQMALEYAAGRCAFTVMTDSLLDQLQNAPADVQKTSANTGKKNVFTCENIRKQIA 240

Query: 224 PLLDESDEPFDDD-NLIDYGLDSVRMMALAARWRKVHGDIDFVMLGKNPTIDAWWKLLS 281
LL E+ E D +L+D GLDSVR+M L +WR+ ++ FV L + PTI+ W KLL+
Sbjct: 241 ELLQETPEDITDQEDLLDRGLDSVRIMTLVEQWRREGAEVTFVELAERPTIEEWQKLLT 299


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SDY_0527   DHBDHDRGNASE  363    e-130    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 363 bits (934), Expect = e-130
Identities = 110/258 (42%), Positives = 149/258 (57%), Gaps = 20/258 (7%)

Query: 5 GKNVWVTGAGKGIGYATALAFVKAGAKVTGFD---------------QAFTQEQYPFATE 49
GK ++TGA +GIG A A GA + D +A E +P
Sbjct: 8 GKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFP---- 63

Query: 50 VMDVADAAQVAQVCQRLLAETERLDALVNAAGILRMGATDQLSKEDWQQTFAVNVGGAFN 109
DV D+A + ++ R+ E +D LVN AG+LR G LS E+W+ TF+VN G FN
Sbjct: 64 -ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFN 122

Query: 110 LFQQTMNQFRRQRGGAIVTVASDAAHTPRIGMSAYGASKAALKSLALSVGLELAGSGVRC 169
+ +R G+IVTV S+ A PR M+AY +SKAA +GLELA +RC
Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 170 NVVSPGSTDTDMQRTLWVSDDAEEQRIRGFGEQFKLGIPLGKIARPQEIANTILFLASDL 229
N+VSPGST+TDMQ +LW ++ EQ I+G E FK GIPL K+A+P +IA+ +LFL S
Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242

Query: 230 ASHITLQDIVVDGGSTLG 247
A HIT+ ++ VDGG+TLG
Sbjct: 243 AGHITMHNLCVDGGATLG 260


8     SDY_0568   SDY_0594   Y     N          Y                   Genomic Island
LocusTag   DNBias  CDNBias  %GCBias    Product
SDY_0568   1       24       -3.483790  hypothetical protein
SDY_0569   0       20       -2.295101  insertion element iso-IS1d protein InsB
SDY_0570   1       21       -2.847923  insertion element IS1 protein InsA
SDY_0572a  2       21       -4.168086  insertion element iso-IS1N protein InsA
SDY_0577   2       24       -4.796558  ****bacteriophage protein
SDY_0578   2       27       -4.915152  hypothetical protein
SDY_0579   2       30       -5.606824  bacteriophage protein
SDY_0580   3       26       -3.551278  hypothetical protein
SDY_0581   1       22       -3.539392  hypothetical protein
SDY_0582   2       23       -2.744490  hypothetical protein
SDY_0585   2       23       -1.873719  insertion sequence element IS600 transposase
SDY_0586   2       18       -1.897913  insertion sequence element IS911 transposase
SDY_0587   -1      14       -0.144448  insertion sequence element IS911 integrase core
SDY_0588   0       14       -0.659568  glutamate/aspartate ABC transporter permease
SDY_0590   1       13       -0.455562  insertion sequence element IS600 transposase
SDY_0591   1       14       0.242518   insertion sequence element IS600 integrase core
SDY_0592   1       17       0.624676   glutamate/aspartate ABC transporter
SDY_0593   1       16       1.896678   apolipoprotein N-acyltransferase
SDY_0594   2       16       1.308560   transport protein
9     SDY_0619   SDY_0636   Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
SDY_0619   2       20       -2.120380  insertion element iso-IS1d protein InsA
SDY_0623   0       21       -0.391130  ferric uptake regulator
SDY_0624   0       17       0.646738   flavodoxin FldA
SDY_0625   0       17       1.298017   LexA regulated protein
SDY_0626   -1      16       1.192843   hypothetical protein
SDY_0627   -2      15       1.589679   replication initiation regulator SeqA
SDY_0628   1       23       6.341928   phosphoglucomutase
SDY_0629   2       27       7.010749   DNA-binding transcriptional activator KdpE
SDY_0631   4       29       6.936822   potassium-transporting ATPase subunit C
SDY_0634   4       31       6.935837   potassium ion accessory transporter KdpF
SDY_0635   3       30       6.634103   hypothetical protein
SDY_0636   3       28       6.130377   rhs element protein RhsC
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SDY_0629   HTHFIS        91     1e-23    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 91.4 bits (227), Expect = 1e-23
Identities = 35/125 (28%), Positives = 58/125 (46%), Gaps = 1/125 (0%)

Query: 2 TNVLIVEDEQAIRRFLRTALEGDGMRVFEAETLQRGLLEAATRKPDLIILDLGLPDGDGI 61
+L+ +D+ AIR L AL G V A DL++ D+ +PD +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 EFIRDLRQWSA-VPVIVLSARSEESDKIAALDAGADDYLSKPFGIGELQARMRVALRRHS 120
+ + +++ +PV+V+SA++ I A + GA DYL KPF + EL + AL
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 121 ATAAP 125
+
Sbjct: 124 RRPSK 128


10    SDY_0675   SDY_0687   Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
SDY_0675   2       19       0.010075   glutamate mutase L
SDY_0676   3       21       -0.260117  methylaspartate mutase subunit S
SDY_0676a  3       21       -0.016112  insertion element iso-IS1N protein InsA
SDY_0680   1       20       -0.273154  cytochrome d terminal oxidase polypeptide
SDY_0681   1       17       -0.717175  cytochrome d terminal oxidase polypeptide
SDY_0682   7       18       -1.281663  outer membrane lipoprotein
SDY_0683   3       18       -0.601460  hypothetical protein
SDY_0684   3       21       -0.514833  acyl-CoA thioester hydrolase
SDY_0685   3       20       -0.647739  colicin uptake protein TolQ
SDY_0686   4       20       -0.713360  colicin uptake protein TolR
SDY_0687   3       18       -1.091046  cell envelope integrity inner membrane protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SDY_0675   PF03309       33     0.002    Bvg accessory factor
		>PF03309#Bvg accessory factor

Length = 271

Score = 33.2 bits (76), Expect = 0.002
Identities = 10/53 (18%), Positives = 23/53 (43%)

Query: 3 IVSVDIGSTWTKAALFTREGDALTLVNHVLTPTTTHHLAKGFFSSLNQVLNVD 55
++++D+ +T T L + GD +V T A +++ ++ D
Sbjct: 2 LLAIDVRNTHTVVGLISGSGDHAKVVQQWRIRTEPEVTADELALTIDGLIGDD 54


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SDY_0687   IGASERPTASE   60     8e-12    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 60.5 bits (146), Expect = 8e-12
Identities = 34/199 (17%), Positives = 69/199 (34%), Gaps = 8/199 (4%)

Query: 99 EQERLKQLEKERLAAQEQKKQAEEAAKQAELKQKQAEEAAAKAAADAKAKAEADAKAAEE 158
E E+ Q QA+ + E A A ++ E
Sbjct: 984 EVEKRNQTVDTTNITTPNNIQADVPSVP----SNNEEIARVDEAPVPPPAPATPSETTET 1039

Query: 159 AAK--KAAADAKKKAEAEAAKAAAEAQKKAEAAAAALKKKAEAAEAA--AAEARKKAATE 214
A+ K + +K E +A + A+ ++ A+ A + +K + E A +E ++ TE
Sbjct: 1040 VAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTE 1099

Query: 215 AAEKAKAEAEKKAAAEKAAADKKAAAEKAAADKKAAEKAAAEKAAADKKAAAEKAAADKK 274
E A E E+KA E + + K+ + +A ++ + +
Sbjct: 1100 TKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQ 1159

Query: 275 AAAAKAAAEKAAAAKAAAE 293
+ A + A + ++
Sbjct: 1160 SQTNTTADTEQPAKETSSN 1178



Score = 57.4 bits (138), Expect = 7e-11
Identities = 30/236 (12%), Positives = 85/236 (36%), Gaps = 11/236 (4%)

Query: 68 QSQESSAKRSDEQRKMKEQQAAEELREKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQA 127
Q+ S ++E+ ++ +E ++ + +K E+ A +
Sbjct: 1004 QADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKN--EQDATET 1061

Query: 128 ELKQKQ-AEEAAAKAAAD------AKAKAEADAKAAEEAAKKAAADAKKKAEAEAAKAAA 180
+ ++ A+EA + A+ A++ +E E + A + ++KA+ E K
Sbjct: 1062 TAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQE 1121

Query: 181 EAQKKAEAAAAALKKKAEAAEAAAAEARKKAATEAAEKAKAEAEKKAAAEKAAADKKAAA 240
+ ++ + ++++E + A AR+ T ++ +++ A E+ A + +
Sbjct: 1122 VPKVTSQVSPK--QEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNV 1179

Query: 241 EKAAADKKAAEKAAAEKAAADKKAAAEKAAADKKAAAAKAAAEKAAAAKAAAEADD 296
E+ + + + A ++ K + ++ +
Sbjct: 1180 EQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVE 1235



Score = 56.6 bits (136), Expect = 1e-10
Identities = 28/228 (12%), Positives = 75/228 (32%), Gaps = 2/228 (0%)

Query: 66 RMQSQESSAKRSDEQRKMKEQQAAEELREKQAAEQERLKQLEKERLAAQEQKKQAEEAAK 125
R ++E+ + + + Q+ E +E Q E + +EKE A E +K E
Sbjct: 1066 REVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKV 1125

Query: 126 QAELKQKQAEEAAAKAAADAKAKAEADAKAAEEAAK--KAAADAKKKAEAEAAKAAAEAQ 183
+++ KQ + + A+ + + E ++ A + E + +
Sbjct: 1126 TSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTE 1185

Query: 184 KKAEAAAAALKKKAEAAEAAAAEARKKAATEAAEKAKAEAEKKAAAEKAAADKKAAAEKA 243
++ + E A + + + K + ++ ++ +++
Sbjct: 1186 STTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRS 1245

Query: 244 AADKKAAEKAAAEKAAADKKAAAEKAAADKKAAAAKAAAEKAAAAKAA 291
+D +A A+ A + A ++ ++ +
Sbjct: 1246 TVALCDLTSTNTNAVLSDARAKAQFVALNVGKAVSQHISQLEMNNEGQ 1293



Score = 55.8 bits (134), Expect = 2e-10
Identities = 32/265 (12%), Positives = 86/265 (32%), Gaps = 14/265 (5%)

Query: 51 DAVMVDSGAVVEQYKRMQSQESSAKRSDEQRKMKEQQAAE-ELREKQAAEQER------L 103
D V A + ++ ++K+ + + EQ A E + ++ A++ +
Sbjct: 1021 DEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANT 1080

Query: 104 KQLEKERLAAQEQKKQAEEAAKQAELKQKQAEEAAAKAAADAKAKAEADAKAAEEAAKKA 163
+ E + ++ ++ Q K+ +K+ KA + + E ++ + K+
Sbjct: 1081 QTNEVAQSGSETKETQ-TTETKETATVEKE-----EKAKVETEKTQEVPKVTSQVSPKQE 1134

Query: 164 AADA-KKKAEAEAAKAAAEAQKKAEAAAAALKKKAEAAEAAAAEARKKAATEAAEKAKAE 222
++ + +AE K+ ++ + A+ ++ +
Sbjct: 1135 QSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNS 1194

Query: 223 AEKKAAAEKAAADKKAAAEKAAADKKAAEKAAAEKAAADKKAAAEKAAADKKAAAAKAAA 282
+ A + +++ K + + + + A + A +
Sbjct: 1195 VVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTS 1254

Query: 283 EKAAAAKAAAEADDIFGELSSGKNA 307
A + A A F L+ GK
Sbjct: 1255 TNTNAVLSDARAKAQFVALNVGKAV 1279


11  SDY_0715  SDY_0729  Y  N  Y  Genomic Island
LocusTag   DNBias  CDNBias  %GCBias    Product
SDY_0715        5       31  -4.836231  insertion sequence element IS600 transposase
SDY_0716        2       27  -3.544285  insertion element iso-IS1n protein InsB
SDY_0718a       2       31  -3.341871  insertion element iso-IS1N protein InsA
SDY_0720        2       27  -3.766472  insertion sequence element IS911 transposase
SDY_0722        2       27  -2.937631  insertion sequence element IS911 transposase
SDY_0723        2       27  -2.754758  insertion sequence element IS600 transposase
SDY_0724        2       27  -1.453173  insertion sequence element IS600 integrase core
SDY_0725        2       26  -1.270045  bacteriophage protein
SDY_0726        2       26  -1.340167  bacteriophage protein
SDY_0727        2       27  -0.443742  insertion sequence element IS600 integrase core
SDY_0728        1       26  -0.144999  insertion sequence element IS600 transposase
SDY_0729        2       25   0.679432  insertion element IS2 transposase InsD
12  SDY_0828  SDY_0845  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
SDY_0828        2       15   3.523447  dithiobiotin synthetase
SDY_0829        2       14   2.090649  biotin biosynthesis protein BioC
SDY_0830        1       14   1.421644  8-amino-7-oxononanoate synthase
SDY_0831        1       16   0.225985  biotin synthase
SDY_0832        1       19  -0.303863  adenosylmethionine-8-amino-7-oxononanoate
SDY_0833        3       24  -1.717977  kinase inhibitor protein
SDY_0834        5       26  -1.930845  invasion plasmid antigen
SDY_0835        0       17  -1.280519  insertion sequence element IS600 transposase
SDY_0836       -1       13  -1.102538  insertion sequence element IS600 integrase core
SDY_0838       -1       11  -1.051500  transferase
SDY_0839        0       10  -1.750558  insertion element iso-IS1n protein InsB
SDY_0839a      -1       13  -1.057960  insertion element iso-IS1N protein InsA
SDY_0841       -2       16  -0.904244  D-alanyl-D-alanine carboxypeptidase
SDY_0842       -1       15  -0.759451  DNA-binding transcriptional repressor DeoR
SDY_0843        2       20  -0.902256  undecaprenyl pyrophosphate phosphatase
SDY_0845        3       28  -1.145512  insertion sequence element IS911 transposase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SDY_0841   BLACTAMASEA      46  1e-07    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 45.9 bits (109), Expect = 1e-07
Identities = 42/201 (20%), Positives = 65/201 (32%), Gaps = 34/201 (16%)

Query: 16 AFLFLFAPTAFAAEQTVEAPSVDARAW----------ILMDYASGKVLAEGNADEKLDPA 65
+ L A A P + I MD ASG+ L ADE+
Sbjct: 7 CIISLLATLPLAV-HASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRADERFPMM 65

Query: 66 SLTKIMTSYVVGQALKADKIKLTDMVTVGKDAWVTGNPALRGSSVMFLKPGDQVSVADLN 125
S K++ V + A +L + + V +P V D ++V +L
Sbjct: 66 STFKVVLCGAVLARVDAGDEQLERKIHYRQQDLVDYSP------VSEKHLADGMTVGELC 119

Query: 126 KGVIIQSGNDACIALADYVAGSQESFIGLMNGYAKKLGLTNTT---FQTVHGLDAPGQF- 181
I S N A L V G + + +++G T ++T PG
Sbjct: 120 AAAITMSDNSAANLLLATVGGPAG-----LTAFLRQIGDNVTRLDRWETELNEALPGDAR 174

Query: 182 --STARDMA------LLGKAL 194
+T MA L + L
Sbjct: 175 DTTTPASMAATLRKLLTSQRL 195


13  SDY_0868  SDY_0878  Y  N  Y  Genomic Island
LocusTag   DNBias  CDNBias  %GCBias    Product
SDY_0868        0       17   3.424460  insertion sequence element IS600 transposase
SDY_0870       -1       19   3.957140  ferredoxin-type protein
SDY_0871       -1       20   3.457104  assembly protein for periplasmic nitrate
SDY_0872       -1       17   3.446775  nitrate reductase catalytic subunit
SDY_0873       -1       16   3.059142  quinol dehydrogenase periplasmic subunit
SDY_0874        0       14   2.289939  quinol dehydrogenase membrane subunit
SDY_0875        1       16   2.149994  citrate reductase cytochrome c-type subunit
SDY_0876        0       15   1.837409  cytochrome c-type protein NapC
SDY_0877        0       17   3.411020  cytochrome c biogenesis protein CcmA
SDY_0878        1       20   3.312223  heme exporter protein B
14  SDY_0910  SDY_0915  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
SDY_0910        2       22  -2.671951  insertion element iso-IS1n protein InsB
SDY_0912a       2       25  -2.989571  insertion element iso-IS1N protein InsA
SDY_0912        1       25  -3.412842  hypothetical protein
SDY_0913a       0       26  -3.086670  insertion element iso-IS1N protein InsA
SDY_0913        0       22  -3.929940  insertion element iso-IS1n protein InsB
SDY_0914        0       23  -3.880704  FimH-like protein
SDY_0915        0       22  -3.534096  fimbrial-like protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SDY_0912   PF00577         236  5e-74    Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 236 bits (603), Expect = 5e-74
Identities = 125/277 (45%), Positives = 180/277 (64%), Gaps = 6/277 (2%)

Query: 4 RSNDSYTSKKNYAWMTSNTSIDNEGHTTQNLGLTETLLDDGNLSYSVQQGYNSEGKTANG 63
S+ +A + + S D G T G+ TLL+D NLSYSVQ GY G +G
Sbjct: 605 WLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSG 664

Query: 64 S---ASMDYKGAFADARVGYNYSDNGSQQQLNYALSGSLVAHSQGITLGQSLGETNVLIA 120
S A+++Y+G + +A +GY++SD+ +QL Y +SG ++AH+ G+TLGQ L +T VL+
Sbjct: 665 STGYATLNYRGGYGNANIGYSHSDD--IKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVK 722

Query: 121 VPGAENTRVANSTGLKTDWRGYTVVPYATSYRENRIALDAASLKRNVDLENAVVNVVPTK 180
PGA++ +V N TG++TDWRGY V+PYAT YRENR+ALD +L NVDL+NAV NVVPT+
Sbjct: 723 APGAKDAKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTR 782

Query: 181 GALVLAEFNAHAGARVLMKTTKQGIPLRFGAIATLDGVQTNSGIIDDDGSLYMAGLPAKG 240
GA+V AEF A G ++LM T PL FGA+ T + Q +SGI+ D+G +Y++G+P G
Sbjct: 783 GAIVRAEFKARVGIKLLMTLTHNNKPLPFGAMVTSESSQ-SSGIVADNGQVYLSGMPLAG 841

Query: 241 TITVRWGEASDQICHISYQLTEQQINSAITRMDAICR 277
+ V+WGE + C +YQL + +T++ A CR
Sbjct: 842 KVQVKWGEEENAHCVANYQLPPESQQQLLTQLSAECR 878


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SDY_0914   CLENTEROTOXN     32  0.004    Clostridium enterotoxin signature.
		>CLENTEROTOXN#Clostridium enterotoxin signature.

Length = 319

Score = 31.6 bits (71), Expect = 0.004
Identities = 13/48 (27%), Positives = 22/48 (45%)

Query: 294 VGVVVTDSQNNIISPAGGTLPLSIPDDADSIARMNVYPVSTTGVPPET 341
+ V TD + I+ A T L++ D +S N+Y ++ P T
Sbjct: 188 LTVPSTDIEKEILDLAAATERLNLTDALNSNPAGNLYDWRSSNSYPWT 235


15  SDY_0978  SDY_1042  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
SDY_0978        2       15   1.071562  glucose-1-phosphatase/inositol phosphatase
SDY_0979        1       17   2.054022  hypothetical protein
SDY_0980        1       17   3.127570  TrpR binding protein WrbA
SDY_0980a       0       16   3.277604  hypothetical protein
SDY_0981       -1       16   3.160308  transport protein
SDY_0982        0       16   2.896959  hypothetical protein
SDY_0983        0       15   2.125397  hypothetical protein
SDY_0984       -1       15   1.880031  acetyltransferase
SDY_0985       -1       15   1.259728  hypothetical protein
SDY_0987       -1       15   1.025966  hypothetical protein
SDY_0988        0       18  -0.497979  tet operon regulator
SDY_0990        0       16  -1.511471  major sodium/proline symporter
SDY_0993        1       21  -3.527675  insertion element IS1 protein InsA
SDY_0994        1       23  -3.626508  insertion element iso-IS1d protein InsB
SDY_0995a       3       21  -3.811251  insertion element iso-IS1N protein InsA
SDY_0997        2       17  -2.991030  curli assembly protein CsgE
SDY_0998        0       15  -1.347842  curli assembly protein CsgF
SDY_0999        0       13  -0.980554  curli production assembly/transport protein
SDY_0999a       0       16  -1.008876  insertion element iso-IS1N protein InsA
SDY_1000        1       17  -1.964389  insertion element iso-IS1n protein InsB
SDY_1001        3       18  -2.203688  hypothetical protein
SDY_1002        2       18  -2.280587  oxidoreductase
SDY_1003        3       19  -2.359451  hydrolase
SDY_1004        4       23  -1.103324  dehydrogenase
SDY_1006        8       26   0.182344  *hypothetical protein
SDY_1007        7       26   2.445744  hypothetical protein
SDY_1008        7       26   3.040159  hypothetical protein
SDY_1009        9       28   2.852462  hypothetical protein
SDY_1010        9       29   3.250464  structural protein
SDY_1011        8       27   2.349221  hypothetical protein
SDY_1012        7       24  -0.539631  hypothetical protein
SDY_1013        5       22  -0.592667  radC-like protein YeeS
SDY_1014        4       23  -0.293979  hypothetical protein
SDY_1015        3       24  -0.356180  hypothetical protein
SDY_1016        1       25  -1.511471  hypothetical protein
SDY_1017        1       24  -1.221690  hypothetical protein
SDY_1018        3       19  -0.767351  insertion sequence element IS600 integrase core
SDY_1019        3       18  -0.763706  insertion element IS2 transposase InsD
SDY_1020        2       15  -0.211303  insertion sequence element IS2 repressor TnpA
SDY_1021        1       15   2.172410  insertion element iso-IS1n protein InsB
SDY_1021a       1       16   2.441909  insertion element iso-IS1N protein InsA
SDY_1022        1       16   2.246648  outer membrane receptor FepA
SDY_1023        0       17   3.513689  hypothetical protein
SDY_1024        1       17   3.845765  ferric enterochelin esterase
SDY_1025        0       16   1.961537  ABC transporter protein
SDY_1026        0       20  -3.085377  glycosyl transferase
SDY_1030        1       19  -4.473422  insertion sequence element IS911 transposase
SDY_1031        1       18  -3.744616  insertion element IS1 protein InsB
SDY_1031a       1       19  -4.029210  insertion element iso-IS1N protein InsA
SDY_1033        1       22  -4.701300  hypothetical protein
SDY_1034        2       24  -3.575996  sulfite oxidase subunit YedZ
SDY_1035        1       23  -3.015619  sulfite oxidase subunit YedY
SDY_1036        2       30  -4.086567  insertion element IS1 protein InsB
SDY_1038a       2       31  -4.486187  insertion element iso-IS1N protein InsA
SDY_1038        0       18  -2.765248  hypothetical protein
SDY_1039        0       17  -1.385606  transcriptional regulator YedW
SDY_1040        1       17   0.435580  insertion element iso-IS1n protein InsB
SDY_1042a       2       18   0.558926  insertion element iso-IS1N protein InsA
SDY_1042        2       18   0.666620  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SDY_0988   HTHTETR          65  2e-15    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 65.4 bits (159), Expect = 2e-15
Identities = 29/155 (18%), Positives = 60/155 (38%), Gaps = 8/155 (5%)

Query: 20 KKAILSAALDTFSQFGFHGTRLEQIAELAGVSKTNLLYYFPSKEALYIAVLRQILDIWLA 79
++ IL AL FSQ G T L +IA+ AGV++ + ++F K L+ +
Sbjct: 13 RQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGE 72

Query: 80 PLKAFREDF--APLAAIKEYIRLKLEVSRDYPQASRLFCM-----EMLAGAPLLMDELTG 132
++ F PL+ ++E + LE + + L + E + ++
Sbjct: 73 LELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRN 132

Query: 133 DLKALIDEKSALIAGWVKSGKL-APIDPQHLIFMI 166
D + +++ L A + + ++
Sbjct: 133 LCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIM 167


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SDY_1018   TYPE3IMSPROT     28  0.032    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 28.2 bits (63), Expect = 0.032
Identities = 7/28 (25%), Positives = 11/28 (39%), Gaps = 3/28 (10%)

Query: 238 YKPAAADIPVAS---DNPAHYADAIRYN 262
+ ++ +S NP H A I Y
Sbjct: 247 SRNMRENVKRSSVVVANPTHIAIGILYK 274


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SDY_1039   HTHFIS           82  2e-20    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 82.2 bits (203), Expect = 2e-20
Identities = 30/117 (25%), Positives = 60/117 (51%), Gaps = 1/117 (0%)

Query: 2 KILLIEDNQRTQEWVTQGLSEAGYVIDAVSDGRDGLYLALKDDYALIILDIMLPGMDGWQ 61
IL+ +D+ + + Q LS AGY + S+ D L++ D+++P + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 ILQTLRTA-KQTPVICLTARDSVDDRVRGLDSGANDYLVKPFSFSELLARVRAQLRQ 117
+L ++ A PV+ ++A+++ ++ + GA DYL KPF +EL+ + L +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121


16  SDY_1053  SDY_1069a  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
SDY_1053        2       23  -0.315426  hypothetical protein
SDY_1053a       1       23  -0.351419  insertion element iso-IS1N protein InsA
SDY_1054        2       24  -3.369552  insertion element IS1 protein InsB
SDY_1055        2       25  -3.624431  insertion element iso-IS1d protein InsB
SDY_1056        3       27  -3.878606  insertion element IS1 protein InsA
SDY_1057        3       24  -3.240429  insertion element iso-IS1n protein InsB
SDY_1057a       2       26  -4.070761  insertion element iso-IS1N protein InsA
SDY_1060        2       24  -2.835641  hypothetical protein
SDY_1061a       1       22  -1.252384  insertion element iso-IS1N protein InsA
SDY_1061        2       21  -1.403428  insertion element IS1 protein InsB
SDY_1063        3       24  -1.981783  insertion sequence element IS600 integrase core
SDY_1064        2       24  -1.905364  insertion sequence element IS600 transposase
SDY_1065        1       21  -1.250514  tail fiber protein
SDY_1066        2       20  -2.162859  tail fiber assembly protein
SDY_1067        2       21  -3.288537  phage tail fiber protein
SDY_1068        2       24  -3.247429  insertion sequence element IS600 transposase
SDY_1069        0       19  -2.036112  insertion sequence element IS600 integrase core
SDY_1070        0       23  -2.789448  insertion element IS1 protein InsB
SDY_1069a       0       25  -3.426774  insertion element iso-IS1N protein InsA
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SDY_1060   LUXSPROTEIN      31  0.001    Bacterial autoinducer-2 (AI-2) production protein Lu...
		>LUXSPROTEIN#Bacterial autoinducer-2 (AI-2) production protein LuxS signature.

Length = 171

Score = 31.4 bits (71), Expect = 0.001
Identities = 18/66 (27%), Positives = 30/66 (45%), Gaps = 7/66 (10%)

Query: 37 TKEHLLPHFL-EHLGNNHLDI------GVGTGFYLTHVPESSLISLMDLNEASLNAASTR 89
T EHL F+ HL + ++I G TGFY++ + S + D A++
Sbjct: 54 TLEHLYAGFMRNHLNGDSVEIIDISPMGCRTGFYMSLIGTPSEQQVADAWIAAMEDVLKV 113

Query: 90 AGESKI 95
++KI
Sbjct: 114 ENQNKI 119


17  SDY_1151  SDY_1166  Y  N  Y  Genomic Island
LocusTag   DNBias  CDNBias  %GCBias    Product
SDY_1151        2       21   2.017546  hypothetical protein
SDY_1152        2       19   1.645717  dATP pyrophosphohydrolase
SDY_1153        2       21   1.810152  aspartyl-tRNA synthetase
SDY_1154        1       26   2.124325  hypothetical protein
SDY_1155        2       28   2.167827  insertion element IS110 transposase
SDY_1156        1       27   0.276208  insertion element IS1 protein InsA
SDY_1158        0       25   0.307216  insertion sequence element IS911 transposase
SDY_1161        1       29  -2.541872  insertion element IS1 4 protein InsB
SDY_1162        1       31  -3.527836  insertion element IS2 transposase InsD
SDY_1163        2       31  -5.859451  insertion sequence element IS2 repressor TnpA
SDY_1164        1       22  -5.635189  insertion element IS1 protein InsA
SDY_1165       -1       20  -4.691057  insertion element iso-IS1n protein InsB
SDY_1166       -1       18  -3.435016  hypothetical protein
18  SDY_1186  SDY_1210  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
SDY_1186        0       21  -3.038463  insertion sequence element IS600 transposase
SDY_1187       -1       18  -2.863807  insertion sequence element IS600 integrase core
SDY_1189       -2       18  -2.359451  flagellar motor protein MotA
SDY_1190        0       20  -3.034266  transcriptional activator FlhC
SDY_1191       -1       20  -2.624384  transcriptional activator FlhD
SDY_1192        0       21  -2.615861  universal stress protein UspC
SDY_1193        0       21  -2.004301  trehalose-6-phosphate synthase
SDY_1195        2       26  -0.995030  insertion sequence element IS911 integrase core
SDY_1196        1       25  -2.144674  insertion sequence element IS911 transposase
SDY_1197        1       23  -1.087027  insertion sequence element IS600 integrase core
SDY_1198        2       27  -3.001930  insertion sequence element IS600 transposase
SDY_1199        3       26  -2.268177  insertion element IS1 protein InsA
SDY_1200        3       26  -2.322901  insertion element iso-IS1d protein InsB
SDY_1200a       4       21  -1.481195  insertion element iso-IS1N protein InsA
SDY_1201        3       22  -1.843987  insertion element iso-IS1n protein InsB
SDY_1202        2       20  -2.061166  hypothetical protein
SDY_1203        0       17  -1.248340  insertion element IS1 protein InsB
SDY_1203a       0       17  -1.268181  insertion element iso-IS1N protein InsA
SDY_1204        0       17  -1.288022  ATPase
SDY_1205        0       20  -3.311832  cell division topological specificity factor
SDY_1206        0       18  -3.703256  cell division inhibitor MinD
SDY_1207        0       21  -4.986605  septum formation inhibitor
SDY_1208       -2       19  -4.472127  insertion element iso-IS1n protein InsB
SDY_1208a      -3       19  -5.199885  insertion element iso-IS1N protein InsA
SDY_1209       -3       20  -5.050315  hypothetical protein
SDY_1210       -2       14  -3.443788  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SDY_1189   PF05844          33  0.001    YopD protein
		>PF05844#YopD protein

Length = 295

Score = 33.1 bits (75), Expect = 0.001
Identities = 12/28 (42%), Positives = 22/28 (78%), Gaps = 2/28 (7%)

Query: 76 MDLLALLYRLMAKSRQMGMFSLERDIEN 103
++LL +L+R+ K+R++G+ L+RD EN
Sbjct: 74 VELLLILFRIAQKARELGV--LQRDNEN 99


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SDY_1202   PF00577          29  0.025    Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 29.4 bits (66), Expect = 0.025
Identities = 22/137 (16%), Positives = 36/137 (26%), Gaps = 20/137 (14%)

Query: 170 SVLTNAKADATRIDNGGVMDVAGNATNTIING--GTQNIYNHGIATGTNINSGTKNIKSG 227
+A + NG + + G N ++ + TG + +G
Sbjct: 614 WRHASASYSMSHDLNGRM------TNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTG 667

Query: 228 GKADTTNISSGSKQA-VEKGGTATGSNIRAGGTLIVHTGGIAHGVYLDMGSALVA----- 281
G+ G ++ H G+ G L+ LV
Sbjct: 668 YATLNYRGGYGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAK 727

Query: 282 ------NTGAGTDIDGY 292
TG TD GY
Sbjct: 728 DAKVENQTGVRTDWRGY 744


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SDY_1204   PRTACTNFAMLY    241  1e-69    Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 241 bits (617), Expect = 1e-69
Identities = 223/883 (25%), Positives = 343/883 (38%), Gaps = 97/883 (10%)

Query: 70 NNGGTLDVREKGSATGIQQSSQGAL-VATTRATRVTGTRADGVAFSIEQGAANNILLANG 128
NN + E+ IQ S G + A+ +V+G +A G+ + A + NG
Sbjct: 37 NNQSIVKTGERQHGIHIQGSDPGGVRTASGTTIKVSGRQAQGILL---ENPAAELQFRNG 93

Query: 129 GVLT----VESDTSSDKTQVNTGGREIVKTKATATGTTLTGGEQ----IVEGVANETTIN 180
V + + V ++V AT T + V G + +I
Sbjct: 94 SVTSSGQLSDDGIRRFLGTVTVKAGKLVADHATLANVGDTWDDDGIALYVAGEQAQASIA 153

Query: 181 DGGIQTVSANGEAIKTKINEGGTLTVNDNGKATDIVQN--------SGAALQTSTANGIE 232
D +Q + + D G +Q+ S L+ + +
Sbjct: 154 DSTLQGAGGVQIERGANVTVQR-SAIVDGGLHIGALQSLQPEDLPPSRVVLRDTNVTAVP 212

Query: 233 ISGTHQY------------GTFSIAGNLATNMLLENGGNLLVLAGTEAHDSTVG---KGG 277
SG G G A ++ L A D+ G GG
Sbjct: 213 ASGAPAAVSVLGASELTLDGGHITGGRAAGVAAMQGAVVHLQRATIRRGDAPAGGAVPGG 272

Query: 278 AMQN------LGQDSATKVNSG--GQYTLGRSKDEFQPLARAEDLQVA-----GGTAIVY 324
A+ G V G G G S + Q + A +L A G V
Sbjct: 273 AVPGGAVPGGFGPGGFGPVLDGWYGVDVSGSSVELAQSIVEAPELGAAIRVGRGARVTVS 332

Query: 325 AGTLA--DASVSGATGSLSLMTPRDNVTPVKLEGAIRI----------PDSATLTIGNGV 372
G+L+ +V G+ P+ + L+ P+ LT+ G
Sbjct: 333 GGSLSAPHGNVIETGGARRFA-PQAAPLSITLQAGAHAQGKALLYRVLPEPVKLTLTGGA 391

Query: 373 DTTLADLTA----------ASRGNVWLNSNNSCAG---------------TSNCEYRVNS 407
D D+ A +V L S G V +
Sbjct: 392 DA-QGDIVATELPSIPGTSIGPLDVALASQARWTGATRAVDSLSIDNATWVMTDNSNVGA 450

Query: 408 L-LLNDGDVYLSAPATTNGIYNTLTTSELFGSGNFYLHTNVAGSRGDQLVVNNNATGNFK 466
L L +DG V PA G + LT + L GSG F ++ D+LVV +A+G +
Sbjct: 451 LRLASDGSVDFQQPAEA-GRFKVLTVNTLAGSGLFRMNVFADLGLSDKLVVMQDASGQHR 509

Query: 467 IFVQDTGVSPQSDDAMTLVKT-GGGDASFTLGNTGGFVDLGTYEYVLKSDGNSNWNLTNN 525
++V+++G P S + + LV+T G A+FTL N G VD+GTY Y L ++GN W+L
Sbjct: 510 LWVRNSGSEPASANTLLLVQTPLGSAATFTLANKDGKVDIGTYRYRLAANGNGQWSLVGA 569

Query: 526 VNPNPNPNPNPNPNPNPNPNPNPTPD-PTPTPVPEKRITPSTAAVLNMA--ATLPLVFDV 582
P P P P P P P P P P P+ P P P + ++ + A +N ++
Sbjct: 570 KAP-PAPKPAPQPGPQPPQPPQPQPEAPAPQPPAGRELSAAANAAVNTGGVGLASTLWYA 628

Query: 583 ELNSIRERLNIMKASPHNNNVWGAMYNTRNNVTTDAGAGFEQTLTGMTVGIDSRNDIPEG 642
E N++ +RL ++ +P WG + R + AG F+Q + G +G D + G
Sbjct: 629 ESNALSKRLGELRLNPDAGGAWGRGFAQRQQLDNRAGRRFDQKVAGFELGADHAVAVAGG 688

Query: 643 IATLGAFMGYSHSHIGFDRGGHGSVDSYSLGGYASWEHESGFYLDGVVKLNRFESNVAGK 702
LG GY+ GF G G DS +GGYA++ +SGFYLD ++ +R E++
Sbjct: 689 RWHLGGLAGYTRGDRGFTGDGGGHTDSVHVGGYATYIADSGFYLDATLRASRLENDFKVA 748

Query: 703 MSSGGAANGSYHSNGLGGHIETGMRFT-DGNWNLTPYASLTGFTADNPEYHLSNGMESKS 761
S G A G Y ++G+G +E G RFT W L P A L F A Y +NG+ +
Sbjct: 749 GSDGYAVKGKYRTHGVGASLEAGRRFTHADGWFLEPQAELAVFRAGGGAYRAANGLRVRD 808

Query: 762 VDTRSIYRELGATLSYNMRLGNGMEVEPWLKAAVRKEFVDDNRVKVNSDGNFINDLSGRR 821
S+ LG + + L G +V+P++KA+V +EF V N + +L G R
Sbjct: 809 EGGSSVLGRLGLEVGKRIELAGGRQVQPYIKASVLQEFDGAGTVHTNGIAH-RTELRGTR 867

Query: 822 GIYQAGIKASFSSTLSGHLGVGYSHGAGVESPWNAVAGVNWSF 864
G+ A+ S + YS G + PW AG +S+
Sbjct: 868 AELGLGMAAALGRGHSLYASYEYSKGPKLAMPWTFHAGYRYSW 910


19  SDY_1282  SDY_1320  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
SDY_1282       -2       18  -3.737723  nitrate reductase 1 cytochrome b(NR) subunit
SDY_1284       -1       27  -5.730369  *formyltetrahydrofolate deformylase
SDY_1285        0       27  -5.404070  hypothetical protein
SDY_1287       -1       19  -3.552316  response regulator of RpoS
SDY_1288       -1       19  -3.360252  UTP-glucose-1-phosphate uridylyltransferase
SDY_1289        1       28  -2.051145  global DNA-binding transcriptional dual
SDY_1290        2       25  -1.735963  thymidine kinase
SDY_1291        2       27  -0.680165  insertion element iso-IS1n protein InsB
SDY_1292        3       29  -0.640111  insertion sequence element IS4 transposase InsG
SDY_1293        2       35  -0.991030  insertion element IS1 protein InsA
SDY_1295        2       33  -0.960134  bifunctional acetaldehyde-CoA/alcohol
SDY_1294a       1       24  -0.351419  insertion element iso-IS1N protein InsA
SDY_1296        1       14  -2.225163  insertion element iso-IS1n protein InsB
SDY_1297        2       14  -3.391443  insertion element IS1 protein InsA
SDY_1298        1       11  -2.747564  insertion element iso-IS1d protein InsB
SDY_1299        0       11  -2.602405  insertion element IS1 protein InsB
SDY_1301a       0       12  -2.376830  insertion element iso-IS1N protein InsA
SDY_1301        0       12  -2.557352  hypothetical protein
SDY_1302        1       14  -1.597040  oligopeptide transporter permease
SDY_1303        0       15  -2.618952  hypothetical protein
SDY_1304        0       18  -3.189520  oligopeptide ABC transporter ATP-binding
SDY_1305        0       21  -3.515827  hypothetical protein
SDY_1306       -1       22  -2.915370  dsDNA-mimic protein
SDY_1307       -2       25  -3.160033  cardiolipin synthetase
SDY_1308        0       28  -4.454767  voltage-gated potassium channel
SDY_1309        1       28  -0.675139  hypothetical protein
SDY_1310        1       27  -0.852772  insertion sequence element IS2 repressor TnpA
SDY_1311        0       24  -1.843683  insertion element IS2 transposase InsD
SDY_1312        1       21  -3.341649  DNA invertase-like protein
SDY_1313        1       17  -2.703248  insertion sequence element IS911 transposase
SDY_1316        1       20  -3.485577  transporter
SDY_1317        2       22  -4.547353  acyl-CoA thioester hydrolase
SDY_1318        2       18  -3.930556  intracellular septation protein A
SDY_1319        2       21  -3.281437  hypothetical protein
SDY_1320       -1       22  -3.851675  outer membrane protein W
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SDY_1285   SECA             54  2e-11    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 54.1 bits (130), Expect = 2e-11
Identities = 16/28 (57%), Positives = 19/28 (67%)

Query: 125 IDGTRPQFGRNDPCPCGSGKKIKKCCGQ 152
+ GRNDPCPCGSGKK K+C G+
Sbjct: 872 AQTGERKVGRNDPCPCGSGKKYKQCHGR 899


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SDY_1287   HTHFIS           90  7e-22    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 89.5 bits (222), Expect = 7e-22
Identities = 40/152 (26%), Positives = 64/152 (42%), Gaps = 3/152 (1%)

Query: 10 ILIVEDEQVFRSLLDSWFSSLGATTVLAADGVDALELLGGFTPDLMICDIAMPRMNGLKL 69
IL+ +D+ R++L+ S G + ++ + DL++ D+ MP N L
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 70 LEHIRNRGDQTPVLVISATENMADIAKALRLGVEDVLLKPVKDLNRLREMVFACLYPSMF 129
L I+ PVLV+SA KA G D L KP DL L ++ L +
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPF-DLTELIGIIGRAL--AEP 122

Query: 130 NSRVEEEERLFRDWDAMVDNPAAAAKLLQELQ 161
R + E +D +V AA ++ + L
Sbjct: 123 KRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLA 154


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SDY_1305   HTHFIS           31  0.008    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 30.6 bits (69), Expect = 0.008
Identities = 9/16 (56%), Positives = 11/16 (68%)

Query: 55 VVGESGCGKSTFARAI 70
+ GESG GK ARA+
Sbjct: 165 ITGESGTGKELVARAL 180


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SDY_1309   adhesinmafb      31  5e-04    Neisseria meningitidis: adhesin MafB signature.
		>adhesinmafb#Neisseria meningitidis: adhesin MafB signature.

Length = 467

Score = 31.2 bits (70), Expect = 5e-04
Identities = 16/57 (28%), Positives = 20/57 (35%), Gaps = 2/57 (3%)

Query: 41 GPMPAVDSNDPGAAGFTGSTVIAEFESLEAAQAWADADPYVAAGVYEHVSVKPFKKV 97
P+PA G GS E + EA W +P A V +V KV
Sbjct: 268 APLPA--EGKFAVIGGLGSVAGFEKNTREAVDRWIQENPNAAETVEAVFNVAAAAKV 322


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SDY_1316   TONBPROTEIN     255  2e-88    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 255 bits (653), Expect = 2e-88
Identities = 234/239 (97%), Positives = 234/239 (97%)

Query: 1 MTLDLPRRFPWPTLLSVCIHGAVVAGLLYTSVHQVIELPAPAQPISVTMVAPADLEPPQA 60
MTLDLPRRFPWPTLLSVCIHGAVVAGLLYTSVHQVIELPAPAQPISVTMV PADLEPPQA
Sbjct: 1 MTLDLPRRFPWPTLLSVCIHGAVVAGLLYTSVHQVIELPAPAQPISVTMVTPADLEPPQA 60

Query: 61 VQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPVESR 120
VQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPVESR
Sbjct: 61 VQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPVESR 120

Query: 121 PASPFENTAPTRPTSSTATAATSKPVTSVASGPRALSRNQPQYPARAQALRIEGQVKVKF 180
PASPFENTAP R TSSTATAATSKPVTSVASGPRALSRNQPQYPARAQALRIEGQVKVKF
Sbjct: 121 PASPFENTAPARLTSSTATAATSKPVTSVASGPRALSRNQPQYPARAQALRIEGQVKVKF 180

Query: 181 DVTPDGRVDNVQILLAKPANMFEREVKNAMRRWRYEPGKSGSGIVVNILFKINGTTEIQ 239
DVTPDGRVDNVQIL AKPANMFEREVKNAMRRWRYEPGK GSGIVVNILFKINGTTEIQ
Sbjct: 181 DVTPDGRVDNVQILSAKPANMFEREVKNAMRRWRYEPGKPGSGIVVNILFKINGTTEIQ 239


20  SDY_1380  SDY_1392  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
SDY_1380        3       23  -2.505533  integrase
SDY_1381        2       23  -1.513785  insertion element IS1 protein InsA
SDY_1382        3       27  -4.125833  exonuclease
SDY_1383        3       32  -5.731507  recombination protein Bet
SDY_1384        2       29  -4.679527  host-nuclease inhibitor protein Gam
SDY_1385        3       30  -4.147225  hypothetical protein
SDY_1386        4       31  -4.430457  insertion sequence element IS600 transposase
SDY_1389        4       30  -3.537431  Shiga toxin subunit A
SDY_1390        3       24  -0.696374  Shiga toxin subunit B
SDY_1391        4       22   0.185352  hypothetical protein
SDY_1392        4       28   1.675240  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SDY_1389   SHIGARICIN      120  3e-34    Ribosome inactivating protein family signature.
		>SHIGARICIN#Ribosome inactivating protein family signature.

Length = 289

Score = 120 bits (303), Expect = 3e-34
Identities = 49/283 (17%), Positives = 112/283 (39%), Gaps = 40/283 (14%)

Query: 3 IIIFRVLTFFFVIFSVNVVAKE----FTLDFSTAKTYVDSLNVIRSAIGTPLQTISSGGT 58
+I F V + + + A E F L +T+ +Y ++ +R A+ +
Sbjct: 1 MIRFLVFSLLILTLFLTAPAVEGDVSFRLSGATSSSYGVFISNLRKALPYERKL-----Y 55

Query: 59 SLLMIDSGTGDNLFAVDVRGIDPEEGRFNNLRLIVERNNLYVTGFVNRTNNVFYRFADF- 117
+ ++ S + + + + + + ++ N+YV G+ + Y F +
Sbjct: 56 DIPLLRSTLPGSQRYALIHLTNYADE---TISVAIDVTNVYVMGYRA--GDTSYFFNEAS 110

Query: 118 ----SHVTFPGTTA-VTLSGDSSYTTLQRVAGISRTGMQINRHSLTTSYLDLMSHSGTSL 172
+ F VTL +Y LQ AG R + + +L ++ L ++
Sbjct: 111 ATEAAKYVFKDAKRKVTLPYSGNYERLQIAAGKIRENIPLGLPALDSAITTLFYYNA--- 167

Query: 173 TQSVARAMLRFVTVTAEALRFRQIQRGFRTTLDDLSGRSYVMTAEDVDLTLNWGRLSSVL 232
S A A++ + T+EA R++ I++ +D +++ + + L +W LS +
Sbjct: 168 -NSAASALMVLIQSTSEAARYKFIEQQIGKRVDK----TFLPSLAIISLENSWSALSKQI 222

Query: 233 PDYHGQDSV----------RVGRISFGSINA--ILGSVALILN 263
+ + R++ +++A + ++AL+LN
Sbjct: 223 QIASTNNGQFETPVVLINAQNQRVTITNVDAGVVTSNIALLLN 265


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SDY_1390   FLGMOTORFLIM     26  0.024    Flagellar motor switch protein FliM signature.
		>FLGMOTORFLIM#Flagellar motor switch protein FliM signature.

Length = 344

Score = 26.0 bits (57), Expect = 0.024
Identities = 7/36 (19%), Positives = 17/36 (47%)

Query: 38 DTFTVKVGDKELFTNRWNLQSLLLSAQITGMTVTIK 73
D F + +G+++ F + + ++AQI +
Sbjct: 293 DPFVLSIGNRKKFLCQPGVVGKKIAAQILERIESTS 328


21  SDY_1405  SDY_1410  Y  N  N  Genomic Island
LocusTag   DNBias  CDNBias  %GCBias    Product
SDY_1405       -1       18  -3.663350  insertion element iso-IS1n protein InsB
SDY_1407a       0       19  -5.120595  insertion element iso-IS1N protein InsA
SDY_1408a       0       19  -4.765173  insertion element iso-IS1N protein InsA
SDY_1407        0       20  -4.709462  insertion element iso-IS1n protein InsB
SDY_1408        0       20  -4.563627  LysR family transcriptional regulator
SDY_1409        0       14  -3.859023  transport periplasmic protein
SDY_1410       -1       13  -3.387884  hypothetical protein
22  SDY_1436  SDY_1454  Y  N  Y  Genomic Island
LocusTag   DNBias  CDNBias  %GCBias    Product
SDY_1436        2       19  -1.679179  antitermination protein Q
SDY_1439        0       20  -1.732064  **insertion element IS1 protein InsA
SDY_1440        0       20  -1.617003  insertion element iso-IS1d protein InsB
SDY_1440a       2       20  -2.305338  insertion element iso-IS1N protein InsA
SDY_1442        3       17  -2.084423  insertion element iso-IS1n protein InsB
SDY_1443        3       20  -1.800167  hypothetical protein
SDY_1444        0       23   0.234035  insertion sequence element IS600 transposase
SDY_1446        3       24   1.140549  insertion element IS1 protein InsB
SDY_1466a       3       25   0.791234  insertion element iso-IS1N protein InsA
SDY_1448        2       25   0.163072  hypothetical protein
SDY_1449        2       25  -0.825059  insertion sequence element IS2 repressor TnpA
SDY_1450        1       20  -0.931738  insertion element IS2 transposase InsD
SDY_1450a       1       20  -1.974717  phage lysis protein
SDY_1451a       1       21  -2.739390  prophage protein
SDY_1452        1       22  -2.909931  insertion sequence element IS600 transposase
SDY_1454        2       21  -2.359451  iron ABC transporter permease
23  SDY_1468a  SDY_1498  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
SDY_1468a       0       25  -3.200797  insertion element iso-IS1N protein InsA
SDY_1468        0       27  -3.027645  insertion element IS1 protein InsB
SDY_1469        0       25  -3.240861  insertion element IS1 protein InsA
SDY_1470       -1       20  -3.114738  insertion element iso-IS1d protein InsB
SDY_1471        0       20  -4.233557  hypothetical protein
SDY_1472        1       18  -3.865953  hypothetical protein
SDY_1473        0       17  -2.732984  insertion element iso-IS1n protein InsB
SDY_1469a       1       21  -3.785959  insertion element iso-IS1N protein InsA
SDY_1474        1       17  -3.298908  3-dehydroquinate dehydratase
SDY_1475        1       20  -3.509597  quinate/shikimate dehydrogenase
SDY_1476        1       23  -2.915007  insertion element iso-IS1n protein InsB
SDY_1476a       0       26  -2.628268  insertion element iso-IS1N protein InsA
SDY_1477        2       23  -4.995299  hypothetical protein
SDY_1478        3       20  -3.715113  insertion element iso-IS1d protein InsB
SDY_1479        4       24  -4.419376  insertion element IS1 protein InsA
SDY_1481        5       19  -3.646855  insertion sequence element IS600 transposase
SDY_1482        4       18  -3.016769  insertion element IS1 protein InsA
SDY_1483        2       16  -3.062634  transport system permease
SDY_1484        1       23  -1.664007  insertion element IS1 protein InsB
SDY_1482a       1       23  -2.049040  insertion element iso-IS1N protein InsA
SDY_1485        0       22  -2.489382  hypothetical protein
SDY_1486       -1       19  -2.852548  aldehyde reductase
SDY_1487       -1       16  -3.096045  hypothetical protein
SDY_1488        0       17  -3.659278  glyceraldehyde-3-phosphate dehydrogenase
SDY_1489        1       20  -3.756724  methionine sulfoxide reductase B
SDY_1490        1       21  -3.953839  hypothetical protein
SDY_1491        0       22  -4.604349  oxidoreductase
SDY_1492        0       23  -4.951020  transport protein
SDY_1493       -1       22  -5.402392  oxidoreductase
SDY_1494       -1       20  -4.334902  insertion element IS1 protein InsA
SDY_1495       -1       17  -4.137962  kinase
SDY_1498        0       18  -3.484716  DeoR-type transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SDY_1469a  ACRIFLAVINRP  27  0.007  Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 27.5 bits (61), Expect = 0.007
Identities = 12/40 (30%), Positives = 20/40 (50%), Gaps = 6/40 (15%)

Query: 50 GIKELLTEM-AFNGAGV-----RDTARTLKIGINTVIRTL 83
IK L E+ F G+ DT +++ I+ V++TL
Sbjct: 305 AIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTL 344


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SDY_1487  INVEPROTEIN  30  0.016  Salmonella/Shigella invasion protein E (InvE) signat...
		>INVEPROTEIN#Salmonella/Shigella invasion protein E (InvE) signature.

Length = 372

Score = 29.7 bits (66), Expect = 0.016
Identities = 19/81 (23%), Positives = 34/81 (41%), Gaps = 13/81 (16%)

Query: 165 ETTSALHTYFNVGDIAKVSVSGLGNRFIDKVNDAKED-----------VLTDGIQTFPDR 213
E ++AL + N D K S S L N F ++V + + V ++ F +
Sbjct: 57 EMSAALAQFRNRRDYEKKS-SNLSNSF-ERVLEDEALPKAKQILKLISVHGGALEDFLRQ 114

Query: 214 TDRVYLNPQDCSVINDEALNR 234
++ +P D ++ E L R
Sbjct: 115 ARSLFPDPSDLVLVLRELLRR 135


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SDY_1492  TCRTETB  31  0.011  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 31.0 bits (70), Expect = 0.011
Identities = 33/142 (23%), Positives = 48/142 (33%), Gaps = 23/142 (16%)

Query: 71 MFLGALVGGIIGDKTGRRNAFILYEAIHIASMVVGAFSPNMDF-LIACRFVMGVGLGALL 129
+G V G + D+ G + + I+ V+G + LI RF+ G G A
Sbjct: 62 FSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFP 121

Query: 130 VTLFAGFTEYMPGRNR----GTWSSRVSFIGNWSYPLCSLIAMGLTPLISA----EWNWR 181
+ Y+P NR G S V+ + G+ P I +W
Sbjct: 122 ALVMVVVARYIPKENRGKAFGLIGSIVA------------MGEGVGPAIGGMIAHYIHWS 169

Query: 182 VQLLIPAILSLIATALAWRYFP 203
LLIP I I T
Sbjct: 170 YLLLIPMI--TIITVPFLMKLL 189


24  SDY_1552  SDY_1597  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
SDY_1552  2  24  -2.343502  transport protein
SDY_1553  2  24  -1.875189  multidrug efflux system protein MdtJ
SDY_1554  1  22  -0.363443  multidrug efflux system protein MdtI
SDY_1555  -1  17  -0.242438  hypothetical protein
SDY_1556  0  14  -0.198221  acid shock protein
SDY_1556a  0  11  0.307216  insertion element iso-IS1N protein InsA
SDY_1560  0  13  0.353295  insertion element iso-IS1d protein InsB
SDY_1561  0  14  0.273064  LysR family transcriptional regulator
SDY_1562  2  14  0.378926  NAGC-like transcriptional regulator
SDY_1563  1  17  -0.848281  dithiobiotin synthetase
SDY_1564  1  19  -1.441009  voltage-gated ClC-type chloride channel ClcB
SDY_1566  0  20  -2.790200  DMSO reductase anchor subunit
SDY_1567  -1  21  -3.395012  oxidoreductase Fe-S subunit
SDY_1570  -1  23  -4.089631  hypothetical protein
SDY_1571  -1  21  -4.348841  hypothetical protein
SDY_1572  2  23  -2.179687  hypothetical protein
SDY_1572a  3  25  -1.088898  insertion element iso-IS1N protein InsA
SDY_1574  2  16  -3.154680  insertion element IS1 protein InsB
SDY_1576  1  17  -3.314865  hypothetical protein
SDY_1577  0  16  -3.321293  hypothetical protein
SDY_1580  0  16  -3.180949  insertion element iso-IS1n protein InsB
SDY_1580a  1  21  -3.055883  insertion element iso-IS1N protein InsA
SDY_1582  0  21  -3.097559  transport protein
SDY_1584  1  21  -2.468241  hypothetical protein
SDY_1585  1  23  -3.447848  hypothetical protein
SDY_1586  1  23  -3.447300  3-hydroxy acid dehydrogenase
SDY_1587  1  22  -3.281059  dipeptidyl carboxypeptidase II
SDY_1588  0  18  -3.443127  competence damage-inducible protein A
SDY_1589  0  17  -3.303316  hypothetical protein
SDY_1590  0  17  -3.492137  hypothetical protein
SDY_1591  0  15  -2.513902  insertion element IS1 protein InsA
SDY_1592  0  16  -2.625207  insertion element iso-IS1d protein InsB
SDY_1593  2  19  -2.748052  MFS-type transporter YdeE
SDY_1594  2  20  -2.803357  O-acetylserine/cysteine export protein
SDY_1596  1  18  -2.655125  DNA-binding transcriptional activator MarA
SDY_1597  2  18  -2.184296  DNA-binding transcriptional repressor MarR
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SDY_1555  V8PROTEASE  133  2e-39  V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 133 bits (337), Expect = 2e-39
Identities = 38/247 (15%), Positives = 78/247 (31%), Gaps = 53/247 (21%)

Query: 17 AFVFADKPDVAKSAN------NEVSTLFFDHDDRVPVNDTTQSPWDAVGQLET---ASGN 67
P + K N E + + ++DR + DTT + V ++
Sbjct: 43 QSSKQQTPKIQKGGNLKPLEQREHANVILPNNDRHQITDTTNGHYAPVTYIQVEAPTGTF 102

Query: 68 LCTATLIAPNLALTAGHCLLTPPKGKADKAVALRFV------SNKGLWRYDIHDI---EG 118
+ + ++ + LT H + AL+ N + I G
Sbjct: 103 IASGVVVGKDTLLTNKHVV----DATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSG 158

Query: 119 RVGPTLGKRLKADGDGWIVPPAAAPWDFGLIVLRNPPSGITPLPLFEGDKAALTAALKAA 178
+ K + ++ + P + A
Sbjct: 159 EGDLAIVK-FSPNEQN-----------------KHIGEVVKPATM-------SNNAETQV 193

Query: 179 GRKVTQAGYPEDH-LDTLYSHQNCEVTGWAQTSVMSHQCDTLPGDSGSPLMLHTNDGWQL 237
+ +T GYP D + T++ + ++T + + M + T G+SGSP+ N+ ++
Sbjct: 194 NQNITVTGYPGDKPVATMWESKG-KIT-YLKGEAMQYDLSTTGGNSGSPVF---NEKNEV 248

Query: 238 IGVQSSA 244
IG+
Sbjct: 249 IGIHWGG 255


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SDY_1556  IGASERPTASE  27  0.021  IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 26.6 bits (58), Expect = 0.021
Identities = 16/91 (17%), Positives = 29/91 (31%), Gaps = 4/91 (4%)

Query: 12 AMGLSSAAFAAETATTPAPTATTTKAAPAKTTHHKKQHKAAPAQKAQAAKKHHKNTKAEQ 71
+ + A T T A T + + + +K T+ Q
Sbjct: 1061 TTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQ 1120

Query: 72 KAPEQKAQAAKKHAGKHSHQQPAKPAAQPAA 102
+ P+ +Q + K + Q P A+PA
Sbjct: 1121 EVPKVTSQVSPKQEQSETVQ----PQAEPAR 1147


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SDY_1582  TCRTETB  47  1e-07  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 46.8 bits (111), Expect = 1e-07
Identities = 33/117 (28%), Positives = 54/117 (46%), Gaps = 16/117 (13%)

Query: 45 GAFIFGKMGDRIGRKKVLFITITMMGICTTLIGVLPTYAQIGVFAPILLVTLRIIQGLGA 104
G ++GK+ D++G K++L I + + + V ++ + + A R IQG GA
Sbjct: 65 GTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMA-------RFIQGAGA 117

Query: 105 GAEISGAGTMLAEYAPKGKR----GIISSFVAMGTNCGTLSTTAI-----WAFIFFI 152
A + ++A Y PK R G+I S VAMG G I W+++ I
Sbjct: 118 AAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLI 174


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SDY_1586  DHBDHDRGNASE  100  9e-28  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 100 bits (251), Expect = 9e-28
Identities = 70/244 (28%), Positives = 114/244 (46%), Gaps = 16/244 (6%)

Query: 2 IVLVTGATAGFGECITRRFIQQGHKVIATGRRQERLQELKDELGDNLYIAQ---LDVRNR 58
I +TGA G GE + R QG + A E+L+++ L A+ DVR+
Sbjct: 10 IAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDS 69

Query: 59 AAIEEMLASLPAEWSNIDILVNNAGLALGMEPAHKASIEDWETMIDTNNKGLVYMTRAVL 118
AAI+E+ A + E IDILVN AG+ L H S E+WE N+ G+ +R+V
Sbjct: 70 AAIDEITARIEREMGPIDILVNVAGV-LRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 119 PGMVERNHGHIINIGSTAGSWPYAGGNVYGATKAFVRQFSLNLRTDLHGTAVRVTDIEPG 178
M++R G I+ +GS P Y ++KA F+ L +L +R + PG
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188

Query: 179 LVGGTEFSNVRFKGDDGKAE------KTYQNTVALT----PEDVSEAV-WWVSTLPAHVN 227
T+ + ++G + +T++ + L P D+++AV + VS H+
Sbjct: 189 ST-ETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHIT 247

Query: 228 INTL 231
++ L
Sbjct: 248 MHNL 251


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SDY_1593  TCRTETA  43  2e-06  Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 42.5 bits (100), Expect = 2e-06
Identities = 41/239 (17%), Positives = 81/239 (33%), Gaps = 18/239 (7%)

Query: 7 RSTSALLASSLLLTIGRGATLPFMTIYLSRQYSLSVDLI---GYAMTIALTIGVVFSLGF 63
R +L++ L +G G +P + L R S D+ G + + + +
Sbjct: 5 RPLIVILSTVALDAVGIGLIMPVLPGLL-RDLVHSNDVTAHYGILLALYALMQFACAPVL 63

Query: 64 GILADKFDKKRYMLLAITAFTSGFIAIPLVNNVTLVVLFFALINCAYSVFATVLKAWFAD 123
G L+D+F ++ +L+++ + + + ++ + + + A A+ AD
Sbjct: 64 GALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAG-AYIAD 122

Query: 124 NLSSTSKTKIFSINYTMLNIGWTIGPPLGTLLVMQSINLPFWLAAICSAFPMLFIQIWVK 183
+ + F G GP LG L+ S + PF+ AA + L +
Sbjct: 123 ITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLP 182

Query: 184 RSEK---------IIATETGSVWSPKVLLQDKALLWFTCSGFLASFVSGAFASCISQYV 233
S K + W + A L F+ V A+ +
Sbjct: 183 ESHKGERRPLRREALNPLASFRW--ARGMTVVAALMAV--FFIMQLVGQVPAALWVIFG 237


25  SDY_1654  SDY_1695  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
SDY_1654  -1  24  -4.897174  hypothetical protein
SDY_1654a  1  28  -5.884478  insertion element iso-IS1N protein InsA
SDY_1657  1  27  -5.659576  adhesin
SDY_1658  0  24  -4.999378  oxidoreductase
SDY_1659  -2  22  -7.969607  hypothetical protein
SDY_1660  -2  22  -6.789328  transcriptional regulator YdeO
SDY_1661  0  20  -3.179892  hypothetical protein
SDY_1662  2  24  -4.474255  insertion element IS1 protein InsB
SDY_1661a  1  20  -0.871624  insertion element iso-IS1N protein InsA
SDY_1663  2  22  -0.993710  hypothetical protein
SDY_1667a  2  24  -0.852729  insertion element iso-IS1N protein InsA
SDY_1667  3  24  -1.234422  insertion element iso-IS1n protein InsB
SDY_1668  3  25  -1.319229  hypothetical protein
SDY_1669  0  17  0.550827  hypothetical protein
SDY_1670  -2  16  0.086032  hypothetical protein
SDY_1672  -1  17  0.105273  insertion sequence element IS600 transposase
SDY_1673  -2  16  -0.120376  hypothetical protein
SDY_1674  -1  15  0.218233  hypothetical protein
SDY_1675  -1  14  0.246412  oxidase
SDY_1676  1  17  -1.833135  inner membrane protein
SDY_1678  1  22  -4.100745  insertion sequence element IS600 transposase
SDY_1679  0  20  -4.678292  hypothetical protein
SDY_1680  0  20  -3.364017  insertion element iso-IS1n protein InsB
SDY_1682a  0  26  -4.182769  insertion element iso-IS1N protein InsA
SDY_1682  0  24  -3.231681  hypothetical protein
SDY_1682b  -2  24  -2.398483  insertion element iso-IS1N protein InsA
SDY_1684  0  24  -1.682413  hypothetical protein
SDY_1685  0  23  -0.705727  hypothetical protein
SDY_1686  1  23  -0.593652  hypothetical protein
SDY_1687  0  23  -1.744794  hypothetical protein
SDY_1688  0  22  -2.012478  AraC family transcriptional regulator
SDY_1689  0  20  -1.891131  amino acid/amine transport protein
SDY_1690  -1  20  -3.082477  hypothetical protein
SDY_1691  -1  19  -3.813055  hypothetical protein
SDY_1692  -1  20  -4.117861  hypothetical protein
SDY_1693  1  18  -4.311832  insertion element iso-IS1n protein InsB
SDY_1695a  1  18  -4.407070  insertion element iso-IS1N protein InsA
SDY_1695  1  18  -4.454689  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SDY_1663  ECOLIPORIN  143  2e-45  E.coli/Salmonella-type porin signature.
		>ECOLIPORIN#E.coli/Salmonella-type porin signature.

Length = 383

Score = 143 bits (363), Expect = 2e-45
Identities = 65/113 (57%), Positives = 83/113 (73%), Gaps = 11/113 (9%)

Query: 1 MTTYG------DGYISNKAQSFEVVAQYQFDFGLRPSLAYLKSKGRDLGR----YGDQDM 50
MT YG DG ++NK Q+FEV AQYQFDFGLRP++++L SKG+DL D+D+
Sbjct: 271 MTPYGKTDKGYDGGVANKTQNFEVTAQYQFDFGLRPAVSFLMSKGKDLTYNNVNGDDKDL 330

Query: 51 IEYIDVGATYFFNKNMSTYVDYKINLIDESD-FTRAVDIRTDNIVATGITYQF 102
++Y DVGATY+FNKN STYVDYKINL+D+ D F + I TD+IVA G+ YQF
Sbjct: 331 VKYADVGATYYFNKNFSTYVDYKINLLDDDDPFYKDAGISTDDIVALGMVYQF 383


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SDY_1669  IGASERPTASE  32  0.011  IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 31.6 bits (71), Expect = 0.011
Identities = 19/137 (13%), Positives = 42/137 (30%), Gaps = 4/137 (2%)

Query: 439 REAESVPQDESAPQPEPVDPVAQHRESMQGMNREQLLEQYADADMAHEGDTSAVHRREAA 498
E + + P P P P + +E + + D + +EA
Sbjct: 1014 NEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAK 1073

Query: 499 SQLLNELDEQAKRQAVMDELKAKPR----PELLEEYRKLSLKEGRTDTEEQQLQAIRDVL 554
S + Q+ + + + +E+ K ++ +T + +
Sbjct: 1074 SNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQ 1133

Query: 555 RPQREARPEAQPQPENA 571
+P+A+P EN
Sbjct: 1134 EQSETVQPQAEPAREND 1150


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SDY_1682  TCRTETA  26  0.038  Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 26.3 bits (58), Expect = 0.038
Identities = 13/47 (27%), Positives = 24/47 (51%)

Query: 37 FAGLLSDRFGRRPFIMLGMCFYMAFFLGILQTNNIIIAYVFGFLAGM 83
G LSDRFGRRP +++ + + + + + Y+ +AG+
Sbjct: 62 VLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGI 108


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SDY_1687  PRTACTNFAMLY  29  0.009  Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 28.9 bits (64), Expect = 0.009
Identities = 17/62 (27%), Positives = 26/62 (41%)

Query: 49 QGLSIGIIILTIGVMAPIASGTLPPSTLIHSFLNWKSLVAIAVGVIVYWLGGRGVTLMGS 108
Q +I L IG + + LPPS ++ N ++ A V LG +TL G
Sbjct: 174 QRSAIVDGGLHIGALQSLQPEDLPPSRVVLRDTNVTAVPASGAPAAVSVLGASELTLDGG 233

Query: 109 QL 110
+
Sbjct: 234 HI 235


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SDY_1695a  ACRIFLAVINRP  27  0.007  Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 27.5 bits (61), Expect = 0.007
Identities = 12/40 (30%), Positives = 20/40 (50%), Gaps = 6/40 (15%)

Query: 50 GIKELLTEM-AFNGAGV-----RDTARTLKIGINTVIRTL 83
IK L E+ F G+ DT +++ I+ V++TL
Sbjct: 305 AIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTL 344


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SDY_1695  HTHTETR  30  6e-04  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 30.0 bits (67), Expect = 6e-04
Identities = 9/37 (24%), Positives = 17/37 (45%), Gaps = 5/37 (13%)

Query: 4 LSWIIFGLIAGILAKWIMPG-----KDGGGFFMTILL 35
+ I+ G I+G++ W+ K ++ ILL
Sbjct: 163 AAIIMRGYISGLMENWLFAPQSFDLKKEARDYVAILL 199


26  SDY_1761  SDY_1768  Y  N  Y  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias  Product
SDY_1761  1  20  -3.362161  insertion element iso-IS1n protein InsB
SDY_4668  1  22  -4.011103  insertion element iso-IS1N protein InsA
SDY_1762  1  22  -3.951043  hypothetical protein
SDY_1763  2  25  -6.676562  hypothetical protein
SDY_1764  0  23  -5.712655  cytochrome b561
SDY_1768  -1  24  -4.725042  insertion sequence element IS600 transposase
27  SDY_1966  SDY_1978  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
SDY_1966  0  19  -3.054723  hypothetical protein
SDY_1967  -1  15  -1.951953  hypothetical protein
SDY_1968  -2  16  -3.231681  23S rRNA methyltransferase
SDY_1969  2  21  -4.730585  cold shock-like protein CspC
SDY_1970  0  23  -5.221404  hypothetical protein
SDY_4680  -3  19  -4.070656  insertion element iso-IS1N protein InsA
SDY_1971  -2  20  -3.299938  insertion element iso-IS1n protein InsB
SDY_1972  -2  17  -2.533760  hypothetical protein
SDY_1973  -2  13  -2.122657  hypothetical protein
SDY_1974  -1  13  -1.689476  hypothetical protein
SDY_1975  0  12  -1.643705  regulator
SDY_1976  0  14  -1.391051  transport protein
SDY_1977  1  14  -1.300298  heat shock protein HtpX
SDY_1978  2  15  -1.779246  carboxy-terminal protease
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SDY_1976  TCRTETB  108  7e-28  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 108 bits (272), Expect = 7e-28
Identities = 81/388 (20%), Positives = 166/388 (42%), Gaps = 14/388 (3%)

Query: 64 MAVLDGAIANVALPTIATDLHATPASSIWVVNAYQIAIVISLLSFSFLGDMFGYRRIYKC 123
+VL+ + NV+LP IA D + PAS+ WV A+ + I + L D G +R+
Sbjct: 25 FSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLF 84

Query: 124 GLVVFLLSSLFFALSDS-LQMLTLARVIQGFGGAALMSVNTALIRLIYPQRFLGRGMGIN 182
G+++ S+ + S +L +AR IQG G AA ++ ++ P+ G+ G+
Sbjct: 85 GIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLI 144

Query: 183 SFIVAVSSAAGPTIAAAILSIASWKWLFLINVPLGIIALLLAIRFLPPNGSRASKPRFDL 242
IVA+ GP I I W +L LI + + II + ++ L K FD+
Sbjct: 145 GSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPM-ITIITVPFLMKLLKKEVRI--KGHFDI 201

Query: 243 PRAVMNALTFGLLITALSGFAQGQSLTLIAAELVVMVVVGIFFIRRQLSLTVPLLPVDLL 302
++ + I F S++ L+V V+ + F++ +T P + L
Sbjct: 202 KGIIL----MSVGIVFFMLFTTSYSISF----LIVSVLSFLIFVKHIRKVTDPFVDPGLG 253

Query: 303 RIPLFSLSICTSVCSFCAQMLAMVSLPFYLQTVLGRSEVETG-LLLTPWPLATMVMAPLA 361
+ F + + F + +P+ ++ V S E G +++ P ++ ++ +
Sbjct: 254 KNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIG 313

Query: 362 GYLIERVHAGLLGALGLFIMAAGLFSLVLLPASPADINIIWPMILCGAGFGLFQSPNNHT 421
G L++R + +G+ ++ + L + + + ++ G ++ +
Sbjct: 314 GILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWF-MTIIIVFVLGGLSFTKTVISTI 372

Query: 422 IITSAPRERSGGASGMLGTARLLGQSSG 449
+ +S ++ +G +L L + +G
Sbjct: 373 VSSSLKQQEAGAGMSLLNFTSFLSEGTG 400


28  SDY_1987  SDY_2013  Y  N  Y  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias  Product
SDY_1987  2  23  -2.528288  hypothetical protein
SDY_1988  3  23  -2.830261  hypothetical protein
SDY_1991  6  26  -2.880284  insertion sequence element IS911 transposase
SDY_1992  6  26  -2.304172  insertion sequence element IS911 integrase core
SDY_1993  6  27  -3.304631  hypothetical protein
SDY_1998  8  32  -3.862887  ****hypothetical protein
SDY_1999  7  31  -3.643134  insertion sequence element IS600 transposase
SDY_2001  7  29  -3.254166  invasion plasmid antigen
SDY_4682  4  26  -3.028496  insertion element iso-IS1N protein InsA
SDY_2002  4  27  -3.435195  insertion element IS1 protein InsB
SDY_2003  4  21  -2.321885  invasion plasmid antigen
SDY_2006  0  13  -0.169399  hypothetical protein
SDY_2007  0  13  -0.353771  insertion element iso-IS1n protein InsB
SDY_4683  0  15  -0.429236  insertion element iso-IS1N protein InsA
SDY_2008  1  17  -2.341236  insertion sequence element IS600 transposase
SDY_2011  1  14  -2.156364  6-phosphogluconolactonase
SDY_2012  0  17  -3.097181  hypothetical protein
SDY_2013  -1  17  -3.200292  insertion element iso-IS1n protein InsB
29  SDY_2061  SDY_2075  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
SDY_2061  4  19  0.927561  50S ribosomal protein L32
SDY_2062  3  17  1.049106  hypothetical protein
SDY_2063  1  14  1.413727  insertion element IS1 protein InsB
SDY_2065  1  13  1.428428  23S rRNA pseudouridylate synthase C
SDY_2066  0  12  1.282845  hypothetical protein
SDY_2067  1  13  1.637995  ribonuclease E
SDY_2070  2  12  0.641143  flagellar rod assembly protein/muramidase FlgJ
SDY_2071  3  13  0.315173  flagellar basal body P-ring biosynthesis protein
SDY_2072  3  13  -0.183187  flagellar basal body L-ring protein
SDY_2075  2  14  -0.386542  flagellar hook protein FlgE
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SDY_2067  IGASERPTASE  66  6e-13  IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 66.2 bits (161), Expect = 6e-13
Identities = 49/288 (17%), Positives = 89/288 (30%), Gaps = 36/288 (12%)

Query: 513 PSEEEFTERKRPEQPALATFAMPDVPPAPT-PAEPAAPVVAPAPKAAPATPAAPAQPGLL 571
P E+ + DVP P+ E A AP P APATP+ +
Sbjct: 983 PEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTE---- 1038

Query: 572 SRFFGALKALFSSGEETKPSEQAAPKVEAKPERQQDRRKPRQNNRRDRNERRDTRSER-- 629
+ E +K + K E QN + + + ++
Sbjct: 1039 -----------TVAENSKQESKTVEKNEQDATE-----TTAQNREVAKEAKSNVKANTQT 1082

Query: 630 TEGSDNREENRRNRRQAQQQTAETRESRQQAEVTEKARTTDEQQAPRRERSRRRNDDKRQ 689
E + + E + + ++TA + + TEK + + + + + + Q
Sbjct: 1083 NEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQ 1142

Query: 690 AQ---QEAKALNVEEQSVQETEQEERVRPVQPRRKQRQLNQKVRYEQSV--AEEAVVAPV 744
A+ + +N++E Q + +P + + Q V +V V P
Sbjct: 1143 AEPARENDPTVNIKEPQSQTNTTADTEQPA--KETSSNVEQPVTESTTVNTGNSVVENPE 1200

Query: 745 VEETAAAEPIVQEAPA------PRTELVKVPLPVVAQTAPEQQEENNA 786
A +P V + R + VP V T A
Sbjct: 1201 NTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVA 1248



Score = 38.9 bits (90), Expect = 1e-04
Identities = 48/302 (15%), Positives = 83/302 (27%), Gaps = 35/302 (11%)

Query: 721 KQRQLNQKVRYEQSVAEEAVVAPVVEETAAAEPIVQEAPAPRTELVKVPLPVVAQTAPEQ 780
+ + NQ V A P V + + P+P A P +
Sbjct: 984 EVEKRNQTVDTTN--------ITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSE 1035

Query: 781 QEENNADNRDNGGMPRRSRRSPRHLRVSGQRRRRYRDERYPTQSPMPLTVACASPELASG 840
E A+N + R + + VA + E
Sbjct: 1036 TTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKET 1095

Query: 841 KVWIRYPIVRPQDVQVEEQREQEEVQVQPMVTEIPVAAAVEPVVSAPVVEEVAEVVEPPV 900
+ V+ EE+ + E + Q E+P + +E +E V+P
Sbjct: 1096 Q---TTETKETATVEKEEKAKVETEKTQ----EVPKVTS-----QVSPKQEQSETVQPQ- 1142

Query: 901 QVAEPQPEVVETTHPEVIAAAVTEQPQVITESDVAVAQEVAEHAEPVVEPQEETADIEEV 960
AEP E T + ++PQ T + Q E + V +P E+ +
Sbjct: 1143 --AEPARENDPTVN--------IKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTG 1192

Query: 961 AETAEVVVAEPEVVAQPAAPVVAEVATEVETVTAVKPEITVEHNHVTAPMTRAPAPEYVP 1020
E QP + + +V+ +V T + V
Sbjct: 1193 NSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPH----NVEPATTSSNDRSTVA 1248

Query: 1021 EA 1022

Sbjct: 1249 LC 1250


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SDY_2070  FLGFLGJ  511  0.0  Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 511 bits (1318), Expect = 0.0
Identities = 312/313 (99%), Positives = 312/313 (99%)

Query: 1 MISDSKLLASAAWDAQSLNELKAKAGEDPAANIRPVARQVEGMFVQMMLKSMRDALPKDG 60
MISDSKLLASAAWDAQSLNELKAKAGEDPAANIRPVARQVEGMFVQMMLKSMRDALPKDG
Sbjct: 1 MISDSKLLASAAWDAQSLNELKAKAGEDPAANIRPVARQVEGMFVQMMLKSMRDALPKDG 60

Query: 61 LFSSEHTRLYTSMYDQQIAQQMTAGKGLGLAEMMVKQMTPEQPLPEEPTPAAPMKFPLET 120
LFSSEHTRLYTSMYDQQIAQQMTAGKGLGLAEMMVKQMTPEQPLPEE TPAAPMKFPLET
Sbjct: 61 LFSSEHTRLYTSMYDQQIAQQMTAGKGLGLAEMMVKQMTPEQPLPEESTPAAPMKFPLET 120

Query: 121 VVRYQNQALSQLVQKAVPRNYDDSLPGDSKAFLAQLSLPAQLASQQSGVPHHLILAQAAL 180
VVRYQNQALSQLVQKAVPRNYDDSLPGDSKAFLAQLSLPAQLASQQSGVPHHLILAQAAL
Sbjct: 121 VVRYQNQALSQLVQKAVPRNYDDSLPGDSKAFLAQLSLPAQLASQQSGVPHHLILAQAAL 180

Query: 181 ESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRVYSSYL 240
ESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRVYSSYL
Sbjct: 181 ESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRVYSSYL 240

Query: 241 EALSDYVGLLTRNPRYAAVTTAASAEQGAQALQDAGYATDPHYARKLTNMIQQMKSISDK 300
EALSDYVGLLTRNPRYAAVTTAASAEQGAQALQDAGYATDPHYARKLTNMIQQMKSISDK
Sbjct: 241 EALSDYVGLLTRNPRYAAVTTAASAEQGAQALQDAGYATDPHYARKLTNMIQQMKSISDK 300

Query: 301 VSKTYSMNIDNLF 313
VSKTYSMNIDNLF
Sbjct: 301 VSKTYSMNIDNLF 313


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SDY_2071  FLGPRINGFLGI  424  e-151  Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 424 bits (1092), Expect = e-151
Identities = 157/363 (43%), Positives = 213/363 (58%), Gaps = 9/363 (2%)

Query: 4 FLSALILLLVTTAAQAERIRDLTSVQGVRQNSLIGYGLVVGLDGTGDQTTQTPFTTQTLN 63
F + L A RI+D+ S+Q R N LIGYGLVVGL GTGD +PFT Q++
Sbjct: 13 FSALPFLSTPPAQADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDSLRSSPFTEQSMR 72

Query: 64 NMLSQLGITVPTGTNMQLKNVAAVMVTASLPPFGRQGQTIDVVVSSMGNAKSLRGGTLLM 123
ML LGIT G + KN+AAVMVTA+LPPF G +DV VSS+G+A SLRGG L+M
Sbjct: 73 AMLQNLGITTQGGQS-NAKNIAAVMVTANLPPFASPGSRVDVTVSSLGDATSLRGGNLIM 131

Query: 124 TPLKGVDSQVYALAQGNILVGGAGASAGGSSVQVNQLNGGRITNGAVIERELPSQFGVGN 183
T L G D Q+YA+AQG ++V G A +++ R+ NGA+IERELPS+F
Sbjct: 132 TSLSGADGQIYAVAQGALIVNGFSAQGDAATLTQGVTTSARVPNGAIIERELPSKFKDSV 191

Query: 184 TLNLQLNDEDFSMAQQIADTINRVR----GYGSATVLDARTIQVRVPSGNSSQVRFLADI 239
L LQL + DFS A ++AD +N G A D++ I V+ P + R +A+I
Sbjct: 192 NLVLQLRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQEIAVQKPRV-ADLTRLMAEI 250

Query: 240 QNMQVNVTPQDAKVVINSRTGSVVMNREVTLDSCAVAQGNLSVTVNRQANVSQPDTPFGG 299
+N+ V T AKVVIN RTG++V+ +V + AV+ G L+V V V QP PF
Sbjct: 251 ENLTVE-TDTPAKVVINERTGTIVIGADVRISRVAVSYGTLTVQVTESPQVIQP-APFSR 308

Query: 300 GQTVVTPQTQIDLRQSGGSLQSVRSSASLNNVVRSLNALGATPMDLMSILQSMQSAGCLR 359
GQT V PQT I Q G + ++ L +V LN++G +++ILQ ++SAG L+
Sbjct: 309 GQTAVQPQTDIMAMQEGSKV-AIVEGPDLRTLVAGLNSIGLKADGIIAILQGIKSAGALQ 367

Query: 360 AKL 362
A+L
Sbjct: 368 AEL 370


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SDY_2072  FLGLRINGFLGH  346  e-124  Flagellar L-ring protein signature.
		>FLGLRINGFLGH#Flagellar L-ring protein signature.

Length = 232

Score = 346 bits (889), Expect = e-124
Identities = 231/232 (99%), Positives = 231/232 (99%)

Query: 1 MQKNAAHTYAISSLLVLSLTGCAWIPSTPLVQGATSAQPVLGPTPVANGSIFQSAQPINY 60
MQKNAAHTYAISSLLVLSLTGCAWIPSTPLVQGATSAQPV GPTPVANGSIFQSAQPINY
Sbjct: 1 MQKNAAHTYAISSLLVLSLTGCAWIPSTPLVQGATSAQPVPGPTPVANGSIFQSAQPINY 60

Query: 61 GYQPLFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYLQGLFGNA 120
GYQPLFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYLQGLFGNA
Sbjct: 61 GYQPLFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYLQGLFGNA 120

Query: 121 RADVEASGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRF 180
RADVEASGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRF
Sbjct: 121 RADVEASGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRF 180

Query: 181 SGVVNPRTISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSPM 232
SGVVNPRTISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSPM
Sbjct: 181 SGVVNPRTISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSPM 232


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SDY_2075  FLGHOOKAP1  41  5e-06  Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 41.5 bits (97), Expect = 5e-06
Identities = 17/49 (34%), Positives = 29/49 (59%)

Query: 354 TLTNGALEASNVDLSKELVNMIVAQRNYQSNAQTIKTQDQILNTLVNLR 402
L+N S V+L +E N+ Q+ Y +NAQ ++T + I + L+N+R
Sbjct: 498 QLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR 546



Score = 37.2 bits (86), Expect = 1e-04
Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 4/56 (7%)

Query: 6 AVSGLNAAATNLDVIGNNIANSATYGFKSGTASFAD----MFAGSKVGLGVKVAGI 57
A+SGLNAA L+ NNI++ G+ T A + AG VG GV V+G+
Sbjct: 7 AMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVSGV 62


30  SDY_2092  SDY_4689  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias  Product
SDY_2092  3  23  -2.400603  DNA damage-inducible protein I
SDY_2093  1  20  -1.933094  insertion element iso-IS1n protein InsB
SDY_4687  1  22  -2.054201  insertion element iso-IS1N protein InsA
SDY_2095  2  21  -3.449465  biofilm formation regulatory protein BssS
SDY_2096  2  20  -2.711210  N-methyltryptophan oxidase
SDY_2097  2  21  -3.925603  hypothetical protein
SDY_2098  1  22  -2.799205  insertion element IS1 protein InsB
SDY_4688  0  23  -1.876897  insertion element iso-IS1N protein InsA
SDY_2099  0  16  -1.733089  DNA-binding transcriptional activator YeiL
SDY_2100  2  13  -0.396515  insertion element IS1 protein InsB
SDY_4689  2  13  -0.332276  insertion element iso-IS1N protein InsA
31  SDY_2138  SDY_2148  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias  Product
SDY_2138  2  15  -2.040022  insertion element IS1 protein InsB
SDY_4692  3  15  -2.811712  insertion element iso-IS1N protein InsA
SDY_2140a  3  13  -1.567124  dihydropyrimidine dehydrogenase
SDY_4693  3  14  -0.895277  insertion element iso-IS1N protein InsA
SDY_2141  3  16  -1.160410  insertion element IS1 protein InsB
SDY_2142  3  16  -1.160410  hypothetical protein
SDY_2143  1  15  -0.623031  hypothetical protein
SDY_2144  2  17  0.909385  cytidine deaminase
SDY_2145  3  19  0.521797  hypothetical protein
SDY_2146  1  19  0.633680  hypothetical protein
SDY_2148  2  17  0.557216  insertion element IS1 protein InsB
32  SDY_4697  SDY_2217  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
SDY_4697  1  24  -5.317897  insertion element iso-IS1N protein InsA
SDY_2202  1  29  -6.910692  insertion element IS1 protein InsB
SDY_2203  2  39  -10.257054  UTP-glucose-1-phosphate uridylyltransferase
SDY_2204  5  50  -13.919291  dTDP-glucose-4,6-dehydratase
SDY_2205  7  54  -16.254379  dTDP-4-dehydrorhamnose reductase
SDY_2206  8  58  -19.098424  glucose-1-phosphate thymidylyltransferase
SDY_2207  7  48  -16.484864  dTDP-4-dehydrorhamnose 3,5-epimerase
SDY_2208  6  43  -15.172466  O-antigen transporter
SDY_2209  4  36  -12.930616  O-antigen polymerase
SDY_2210  2  24  -9.915452  rhamnosyl transferase II
SDY_2211  1  17  -6.809536  rhamnosyl transferase I
SDY_2212  -1  15  -2.812729  6-phosphogluconate dehydrogenase
SDY_2213  -2  14  -2.318801  UDP-glucose 6-dehydrogenase
SDY_2214  -2  17  0.098545  O-antigen chain length regulator
SDY_2215  0  24  2.015678  bifunctional phosphoribosyl-AMP
SDY_2216  0  21  2.973882  imidazole glycerol phosphate synthase subunit
SDY_2217  -1  21  3.096216  1-(5-phosphoribosyl)-5-[(5-
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SDY_2204  NUCEPIMERASE  179  1e-55  Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 179 bits (456), Expect = 1e-55
Identities = 84/360 (23%), Positives = 146/360 (40%), Gaps = 48/360 (13%)

Query: 1 MKILVTGGAGFIGSAVVRHIINNTQDSVVNVDKLT--YAGNL-ESLADVSDSKRYVFEHA 57
MK LVTG AGFIG V + ++ VV +D L Y +L ++ ++ + F
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAG-HQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKI 59

Query: 58 DICDAAAMARIFAQHQPDAVMHLAAESHVDRSITGPAAFIETNIVGTYVLLEAARNYWSA 117
D+ D M +FA + V V S+ P A+ ++N+ G +LE R+
Sbjct: 60 DLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHN--- 116

Query: 118 LDGDKKNSFRFHHISTDEVYGDLPHPDEVNNKEQLPLFTETTAYAPSSPYSASKASSDHL 177
+ S+ VYG ++P T+ + P S Y+A+K +++ +
Sbjct: 117 ------KIQHLLYASSSSVYGL---------NRKMPFSTDDSVDHPVSLYAATKKANELM 161

Query: 178 VRAWKRTYGLPTIVTNCSNNYGPYHFPEKLIPLVILNALEGKALPIYGKGDQIRDWLYVE 237
+ YGLP YGP+ P+ + LEGK++ +Y G RD+ Y++
Sbjct: 162 AHTYSHLYGLPATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYID 221

Query: 238 DHARALYIVV------------------TEGKAGETYNIGGHNEKKNIDVVLTICDLLDE 279
D A A+ + YNIG + + +D + + D L
Sbjct: 222 DIAEAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALG- 280

Query: 280 IVPKEKSYREQITYVADRPGHDRRYAIDAEKISRELGWKPQETFESGIRKTVGWYLSNTK 339
+ +K+ +PG + D + + +G+ P+ T + G++ V WY K
Sbjct: 281 -IEAKKNMLPL------QPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRDFYK 333


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SDY_2205  NUCEPIMERASE  47  3e-08  Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 47.1 bits (112), Expect = 3e-08
Identities = 31/172 (18%), Positives = 66/172 (38%), Gaps = 29/172 (16%)

Query: 1 MNILLFGKTGQVGWELQRALAPLGN-LIALDVHSTDY--------------------CGD 39
M L+ G G +G+ + + L G+ ++ +D + Y D
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 40 FSNPEGVAETVRSIRPDIIVNAAAHTAVDKAESEPEF---AQLLNATSVEAIAKAANEVG 96
++ EG+ + S + + + AV + P + L ++ + +
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNK-IQ 119

Query: 97 AWVIHYSTDYVFPGTGEIPWQEADATA-PLNVYGETKLAGEKALQEHCAKHL 147
+++ S+ V+ ++P+ D+ P+++Y TK A E L H HL
Sbjct: 120 H-LLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANE--LMAHTYSHL 168


33  SDY_2237  SDY_2262  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
SDY_2237  2  20  -1.303515  hypothetical protein
SDY_2238  3  27  -1.208746  insertion element iso-IS1n protein InsB
SDY_2239  3  31  -1.734862  hypothetical protein
SDY_2240  2  31  -2.166102  adenosylcobinamide kinase
SDY_2241  1  31  -2.369203  cobalamin synthase
SDY_2242  0  32  -3.028411  nicotinate-nucleotide--dimethylbenzimidazole
SDY_2243  0  31  -3.511213  hypothetical protein
SDY_2245  0  29  -3.172774  *nitrogen assimilation transcriptional regulator
SDY_2246  0  27  -2.835641  transcriptional regulator Cbl
SDY_2248  0  23  -2.275699  *hypothetical protein
SDY_2249  -1  23  -2.171689  hypothetical protein
SDY_2251  -2  22  -2.321543  *insertion element IS1 protein InsA
SDY_2252  -2  12  -1.318404  insertion element iso-IS1d protein InsB
SDY_2253  -2  14  -2.359451  AMP nucleosidase
SDY_2254  -1  14  -2.438227  shikimate transporter
SDY_2257  1  15  -3.229907  hypothetical protein
SDY_2258  2  15  -3.435063  plasmid stabilization protein
SDY_2259  2  15  -3.375059  hypothetical protein
SDY_2260  3  24  -4.959828  hypothetical protein
SDY_2261  3  22  -3.961752  lipid kinase
SDY_2262  2  19  -3.302324  galactitol utilization operon repressor
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SDY_2237   FbpA_PF05833     29  0.006    Fibronectin-binding protein
		>FbpA_PF05833#Fibronectin-binding protein

Length = 577

Score = 28.7 bits (64), Expect = 0.006
Identities = 13/83 (15%), Positives = 33/83 (39%), Gaps = 6/83 (7%)

Query: 38 RLFRRKNKLQREIQDVEKKIRDNQKRVLLLDNLSDYIKPGMSVEAIQGIITSMKGDYEDR 97
+++ NKL++ + +++ N++ + L ++ I + + I+ I E
Sbjct: 385 SYYKKYNKLKKSEEAANEQLLQNEEELNYLYSVLTNINNADNYDEIEEIKK------ELI 438

Query: 98 VDDYIIKNAELSKERRDISKKLK 120
YI ++ SK +
Sbjct: 439 ETGYIKFKKIYKSKKSKTSKPMH 461


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
SDY_2254   TCRTETB     32  0.005    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 32.2 bits (73), Expect = 0.005
Identities = 38/256 (14%), Positives = 94/256 (36%), Gaps = 18/256 (7%)

Query: 82 VIFGHFGDRLGRKRMLMLTVWMMGIATALIGILPSFSTIGWWAPILLVTLRAIQGFAVGG 141
++G D+LG KR+L+ + + + + + SF ++ I+ ++ A
Sbjct: 67 AVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSL----LIMARFIQGAGAAAFPA 122

Query: 142 EWGGAALLSVESAPKNKK-AFYSSGVQVGYGVGLLLSTGLVSLISMMTTDEQFLSWGWRI 200
+ + K S V +G GVG + + I W
Sbjct: 123 LVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYI------------HWSY 170

Query: 201 PFLFSIVLVLGALWVRNGMEESAEFEQQQHNQAAAKKRIPVIEALLRHPGAFLKIIALRL 260
L ++ ++ ++ +++ + + + ++ +L + + + +
Sbjct: 171 LLLIPMITIITVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSV 230

Query: 261 CELLTMYIVTAFALNYSTQNMGLPRELFLNIGLLVGGLSCLTIPCFAWLADRFGRRRVYI 320
L +++ + + GL + + IG+L GG+ T+ F + + +
Sbjct: 231 LSFL-IFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQL 289

Query: 321 TGALIGTLSAFPFFMA 336
+ A IG++ FP M+
Sbjct: 290 STAEIGSVIIFPGTMS 305


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
SDY_2260   LIPOLPP20     26  0.024    LPP20 lipoprotein precursor signature.
		>LIPOLPP20#LPP20 lipoprotein precursor signature.

Length = 175

Score = 26.3 bits (57), Expect = 0.024
Identities = 13/38 (34%), Positives = 24/38 (63%), Gaps = 1/38 (2%)

Query: 3 KGEMKKIAAISLISIFIMSGCAVHNDETSIGKFGLAYK 40
K ++KKI +S+++ ++ GC+ H ++ I K AYK
Sbjct: 2 KNQVKKILGMSVVAAMVIVGCS-HAPKSGISKSNKAYK 38


34  SDY_2274  SDY_2287  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias    %GCBias  Product
SDY_2274        2       17  -0.376262  transcriptional regulator
SDY_2277        2       18  -0.274242  phosphomethylpyrimidine kinase
SDY_2278        2       17  -1.198249  hydroxyethylthiazole kinase
SDY_2279        1       19  -2.317992  hypothetical protein
SDY_2280        2       23  -4.050272  nickel/cobalt efflux protein RcnA
SDY_2281        1       22  -3.908828  hypothetical protein
SDY_4701        0       22  -3.716383  insertion element iso-IS1N protein InsA
SDY_2285       -1       24  -4.026118  insertion element iso-IS1n protein InsB
SDY_4702       -1       22  -4.015080  insertion element iso-IS1N protein InsA
SDY_2287        0       19  -3.280086  outer membrane protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
SDY_2287   PF00577    544  0.0      Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 544 bits (1403), Expect = 0.0
Identities = 182/595 (30%), Positives = 286/595 (48%), Gaps = 14/595 (2%)

Query: 2 YTSSDIFDSVRFRGVRLFRDMQMLPNSKQNFTPRVQGIAQSNALVTIEQNGFVVYQKEVP 61
YT DIFD + FRG +L D MLP+S++ F P + GIA+ A VTI+QNG+ +Y VP
Sbjct: 267 YTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDIYNSTVP 326

Query: 62 PGPFAITDLQLAGGGADLDVSVKEADGSVTTYLVPYAAVPNMLQPGVSKYDFAAGRSHIE 121
PGPF I D+ AG DL V++KEADGS + VPY++VP + + G ++Y AG
Sbjct: 327 PGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSITAGEYRSG 386

Query: 122 GASKQSD-FVQAGYQYGFNNLLTLYGGSMVANNYYAFTLGTGWNT-RIGAISVDATKSHS 179
A ++ F Q+ +G T+YGG+ +A+ Y AF G G N +GA+SVD T+++S
Sbjct: 387 NAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGALSVDMTQANS 446

Query: 180 KQDNGDVFDGQSYQIAYNKFVSQTSTRFGLAAWRYSSRDYRTFNDHVWANNKDNYRRDEN 239
+ DGQS + YNK ++++ T L +RYS+ Y F D ++ ++
Sbjct: 447 TLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGYNIETQD 506

Query: 240 DVYDI----ADYYQNDFGRKNSFSANMSQSLPEGWGSVSLSTLWRDYWGRSGSSKDYQLS 295
V + DYY + ++ ++Q L ++ LS + YWG S + +Q
Sbjct: 507 GVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGR-TSTLYLSGSHQTYWGTSNVDEQFQAG 565

Query: 296 YSNNLRRISYTLAASQAYDENHHE-EKRFNIFISIPFD--WGDDVSTPRRQIYMSNSMTF 352
+ I++TL+ S + ++ + ++IPF D + R S SM+
Sbjct: 566 LNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSDSKSQWRHASASYSMSH 625

Query: 353 DDQGFASNNTGLSGTVGSRDQFNYGVNLSHQHQGN---ETTAGANLTWNAPVATVNGSYS 409
D G +N G+ GT+ + +Y V + G+ +T A L + N YS
Sbjct: 626 DLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYGNANIGYS 685

Query: 410 QSSTYRQAGASVSGGIVAWSGGVNLANRLSETFAVMNAPGIKDAYVNGQKYRTTNRNGVV 469
S +Q VSGG++A + GV L L++T ++ APG KDA V Q T+ G
Sbjct: 686 HSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDAKVENQTGVRTDWRGYA 745

Query: 470 VYDGMTPYRENHLMLDVSQSDSEAELRGNRKIAAPYRGAVVLVNFDTDQRKPWFIKALRA 529
V T YREN + LD + +L P RGA+V F + + L
Sbjct: 746 VLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKA-RVGIKLLMTLTH 804

Query: 530 DGQPLTFGYEVNDIHGHNIGVVGQGSQLFIRTNEVPPSVNVAINKQQGLSCTITF 584
+ +PL FG V + G+V Q+++ + V V +++ C +
Sbjct: 805 NNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGKVQVKWGEEENAHCVANY 859


35  SDY_2397  SDY_2422  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias    %GCBias  Product
SDY_2397        0       24  -6.211944  hypothetical protein
SDY_2398        0       27  -6.862818  insertion element iso-IS1d protein InsA
SDY_2398a       0       25  -6.486054  transposase IS1
SDY_2400       -1       19  -5.784109  insertion element iso-IS1n protein InsB
SDY_4707       -2       15  -5.736768  insertion element iso-IS1N protein InsA
SDY_2401       -2       13  -4.468632  hypothetical protein
SDY_4708        0       17  -0.226345  insertion element iso-IS1N protein InsA
SDY_2402        1       16  -0.249903  insertion element IS1 protein InsB
SDY_2404        0       14  -0.667532  arginine ABC transporter ATP-binding protein
SDY_2405        1       16  -0.932510  arginine ABC transporter substrate-binding
SDY_2406        0       18  -1.345455  arginine ABC transporter permease ArtQ
SDY_2407        0       20  -3.312657  arginine ABC transporter permease ArtM
SDY_2410        2       29  -3.874603  insertion sequence element IS600 integrase core
SDY_2411        3       29  -4.782684  insertion sequence element IS600 transposase
SDY_4709        2       29  -3.817588  insertion element iso-IS1N protein InsA
SDY_2414        2       30  -4.207851  DNA-binding protein Roi of bacteriophage
SDY_2415        3       35  -3.910494  antirepressor protein
SDY_2416        2       31  -2.636500  hypothetical protein
SDY_2417        1       28  -2.567958  recombination protein
SDY_2418        1       17  -0.691064  insertion sequence element IS600 integrase core
SDY_2419        0       14  -0.306563  insertion sequence element IS600 transposase
SDY_2420        1       18   0.466833  integrase
SDY_2421        2       22   0.779160  hypothetical protein
SDY_2422        2       24   1.000549  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
SDY_2398   HTHTETR     29  0.002    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 29.2 bits (65), Expect = 0.002
Identities = 11/43 (25%), Positives = 18/43 (41%), Gaps = 8/43 (18%)

Query: 52 THQKIIDMAM--------NGVGCRATARIMGVSLNTILHHLKN 86
T Q I+D+A+ + A+ GV+ I H K+
Sbjct: 12 TRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKD 54


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
SDY_2404   PF05272     30  0.010    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.0 bits (67), Expect = 0.010
Identities = 9/18 (50%), Positives = 12/18 (66%)

Query: 31 LVLLGPSGAGKSSLLRVL 48
+VL G G GKS+L+ L
Sbjct: 599 VVLEGTGGIGKSTLINTL 616


36  SDY_2472  SDY_2479  Y  N  N  Genomic Island
LocusTag   DNBias  CDNBias    %GCBias  Product
SDY_2472        0       26   3.458235  NADH dehydrogenase subunit N
SDY_2473        1       26   2.778786  NADH dehydrogenase subunit M
SDY_2474        0       27   3.306353  NADH dehydrogenase subunit L
SDY_2475        0       28   2.927371  NADH dehydrogenase subunit K
SDY_2476        0       27   3.040669  NADH dehydrogenase subunit J
SDY_2477        1       26   3.276314  NADH dehydrogenase subunit I
SDY_2478        0       25   3.104621  NADH dehydrogenase subunit H
SDY_2479        1       23   3.147098  NADH dehydrogenase subunit G
37  SDY_2551  SDY_2573  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias    %GCBias  Product
SDY_2551        2       20  -1.351067  transporter
SDY_2553        2       22  -0.804977  *insertion element iso-IS1d protein InsB
SDY_2554        1       20  -1.176655  insertion element IS1 protein InsA
SDY_2555        1       25  -3.703946  insertion element iso-IS1n protein InsB
SDY_2558        1       29  -5.121014  sucrose hydrolase
SDY_2559        0       30  -5.964523  sucrose operon repressor
SDY_2560        0       29  -6.587093  D-serine permease
SDY_2561        1       28  -6.273518  D-serine dehydratase
SDY_2562        2       30  -7.519065  multidrug resistance protein Y
SDY_2563        2       24  -4.987241  multidrug resistance protein K
SDY_4713        1       26  -3.413399  insertion element iso-IS1N protein InsA
SDY_2565        2       29  -4.649857  insertion element IS1 protein InsB
SDY_2567        3       30  -5.336390  insertion element IS1 protein InsB
SDY_4714        3       32  -5.675007  insertion element iso-IS1N protein InsA
SDY_2568        3       33  -5.993536  insertion element IS1 protein InsA
SDY_2569        2       35  -6.611748  hypothetical protein
SDY_2570        2       34  -6.434406  transporter YfdV
SDY_2571        1       29  -5.238511  oxalyl-CoA decarboxylase
SDY_2572       -1       18  -2.940209  formyl-coenzyme A transferase
SDY_2573        0       18  -3.455572  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
SDY_2562   TCRTETB    121  4e-32    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 121 bits (306), Expect = 4e-32
Identities = 92/404 (22%), Positives = 167/404 (41%), Gaps = 17/404 (4%)

Query: 19 VTIALSLATFMQMLDSTISNVAIPTISGFLGASTDEGTWVITSFGVANAIAIPVTGRLAQ 78
+ I L + +F +L+ + NV++P I+ WV T+F + +I V G+L+
Sbjct: 15 ILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSD 74

Query: 79 RIGELRLFLLSVTFFSLSSLMCSLS-TNLDVLIFFRVVQGLMAGPLIPLSQSLLLRNYPP 137
++G RL L + S++ + + +LI R +QG A L ++ R P
Sbjct: 75 QLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPK 134

Query: 138 EKRTFALALWSMTVIIAPICGPILGGYICDNFSWGWIFLINVPMGIIVLTLCLTLLKGRE 197
E R A L V + GP +GG I W +L+ +PM I+ L L +E
Sbjct: 135 ENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWS--YLLLIPMITIITVPFLMKLLKKE 192

Query: 198 TETSPVKMNLPGLTLLVLGVGGLQIMLDKGRDLDWFNSSTIIILTVVSVISLISLVIWES 257
++ G+ L+ +G+ + ML F +S I +VSV+S + V
Sbjct: 193 VRIKG-HFDIKGIILMSVGI--VFFML--------FTTSYSISFLIVSVLSFLIFVKHIR 241

Query: 258 TSENPILDLSLFKSRNFTIGIVSITCAYLFYSGAIILMPQLLQETMGYNAIWAGLAYAPI 317
+P +D L K+ F IG++ + +G + ++P ++++ + G
Sbjct: 242 KVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFP 301

Query: 318 GIMPLLIS-PLIGRYGNKIDMRLLVTFSFLMYAVCYYWRSVTFMPTIDFTGIILPQFFQG 376
G M ++I + G ++ ++ +V + S T F II+ G
Sbjct: 302 GTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGG 361

Query: 377 FAVACFFLPLTTISFSGLPDNKFANASSMSNFFRTLSGSVGTSL 420
+ ++TI S L + S+ NF LS G ++
Sbjct: 362 LSFTK--TVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAI 403


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
SDY_2563   RTXTOXIND     79  3e-18    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 79.1 bits (195), Expect = 3e-18
Identities = 64/414 (15%), Positives = 123/414 (29%), Gaps = 96/414 (23%)

Query: 13 RRKYLSLLAIVLFIAFSGAYAYWSMELEDMISTDDAYVT-GNADPISAQVSGSVTVVNHK 71
RR L I+ F+ + + ++E + + + G + I + V + K
Sbjct: 55 RRPRLVAYFIMGFLVIAFILSVLG-QVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVK 113

Query: 72 DTNYVRQGDILVSLDKTDATIALNKA---------------------------------- 97
+ VR+GD+L+ L A K
Sbjct: 114 EGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDE 173

Query: 98 ------------------KNNLANIVRQTNKLYLQDKQYSAEVASARIQ---YQQSLEDY 136
K + Q + L + AE + + Y+
Sbjct: 174 PYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVE 233

Query: 137 NRRV----PLAKQGVISKE----------TLEHTKDTLISSKAALNAAIQAYKANKALVM 182
R+ L + I+K + S + + I + K LV
Sbjct: 234 KSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVT 293

Query: 183 N-------TPLNR-QPQVVEAADATKEAWLALKRTDIKSPVTGYIAQRSVQ-VGETVSPG 233
L + + + + + I++PV+ + Q V G V+
Sbjct: 294 QLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTA 353

Query: 234 QSLMAVVPARQ-MWVNANFKETQLTDVRIGQSVNIISDLYGENVVFHGRVTGINMGTGNA 292
++LM +VP + V A + + + +GQ+ I + F G +G
Sbjct: 354 ETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVE------AFPYTRYGYLVGK--- 404

Query: 293 FSLLPAQNATGNWIKIVQRVPVEVSLDPKELMEH----PLRIGLSMTATIDTKD 342
+ + +V V +S++ L PL G+++TA I T
Sbjct: 405 VKNINLDAIEDQRLGLVFNVI--ISIEENCLSTGNKNIPLSSGMAVTAEIKTGM 456


38  SDY_2711  SDY_2724  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias    %GCBias  Product
SDY_2711        2       14   0.064281  4-hydroxy-3-methylbut-2-en-1-yl diphosphate
SDY_2712       -1       14   0.672503  hypothetical protein
SDY_2713        1       15   0.249708  hypothetical protein
SDY_2714        0       14   0.168929  nucleoside diphosphate kinase
SDY_2717       -1       13   1.937055  3-mercaptopyruvate sulfurtransferase
SDY_2718        0       15   1.960791  enhanced serine sensitivity protein SseB
SDY_2719        0       16   3.026187  aminopeptidase
SDY_2720        3       19   2.135708  hypothetical protein
SDY_2721        2       23   1.971088  [2FE-2S] ferredoxin, electron carrer protein
SDY_2722        2       23   1.968472  chaperone protein HscA
SDY_2723        2       23   0.443149  co-chaperone HscB
SDY_2724        2       27   0.793023  iron-sulfur cluster assembly protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
SDY_2718   STREPKINASE     29  0.015    Streptococcus streptokinase protein signature.
		>STREPKINASE#Streptococcus streptokinase protein signature.

Length = 440

Score = 29.3 bits (65), Expect = 0.015
Identities = 27/120 (22%), Positives = 52/120 (43%), Gaps = 21/120 (17%)

Query: 127 GNPLSSQEVLEGGESLILSE-----VAEPPAQMIDSLTTLFKTIKPVKRAFICSIKENEE 181
G+ ++SQE+L +S++ + E + ++ +F+TI P+ + F +K E+
Sbjct: 217 GDTITSQELLAQAQSILNKNHPGYTIYERDSSIVTHDNDIFRTILPMDQEFTYRVKNREQ 276

Query: 182 A-QPNLLIGIEADGDIEEIIQAAGSVATDTLPGDEPIDICQVKKGEKGISHFITEHIAPF 240
A + N G+ + + ++I V +KKGEK F H+ F
Sbjct: 277 AYRINKKSGLNEEINNTDLISEKYYV---------------LKKGEKPYDPFDRSHLKLF 321


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SDY_2722   SHAPEPROTEIN    114  5e-30    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 114 bits (288), Expect = 5e-30
Identities = 81/371 (21%), Positives = 144/371 (38%), Gaps = 74/371 (19%)

Query: 23 GIDLGTTNSLVATVRSGQAETLADHEGRHLLPSVVHYQQQGHS-------VGYDARTNAA 75
IDLGT N+L+ G + +E PSVV +Q VG+DA+
Sbjct: 14 SIDLGTANTLIYVKGQG----IVLNE-----PSVVAIRQDRAGSPKSVAAVGHDAK-QML 63

Query: 76 LDTANTISSVKRLMGRSLADIQQRYPHLPYQFQASENGLPMIETAAGLLNPVRVSADILK 135
T I++++ + +AD V+ +L+
Sbjct: 64 GRTPGNIAAIRPMKDGVIADF-------------------------------FVTEKMLQ 92

Query: 136 ALAARATEALAGE-LDGVVITVPAYFDDAQRQGTKDAARLAGLHVLRLLNEPTAAAIAYG 194
+ V++ VP +R+ +++A+ AG + L+ EP AAAI G
Sbjct: 93 HFIKQVHSNSFMRPSPRVLVCVPVGATQVERRAIRESAQGAGAREVFLIEEPMAAAIGAG 152

Query: 195 LDSGQEGVIAVYDLGGGTFDISILRLSRGVFEVLATGGDSALGGDDFDHLLADYIREQAG 254
L + V D+GGGT +++++ L+ V +GGD FD + +Y+R G
Sbjct: 153 LPVSEATGSMVVDIGGGTTEVAVISLNGVV-----YSSSVRIGGDRFDEAIINYVRRNYG 207

Query: 255 --IPDRSDNRVQRELLDAAIAAKIALSDADSVTVNVAG---WQG-----EISREQFNELI 304
I + + R++ E+ A + + V G +G ++ + E +
Sbjct: 208 SLIGEATAERIKHEI-------GSAYPGDEVREIEVRGRNLAEGVPRGFTLNSNEILEAL 260

Query: 305 APLVKRTLLACRRALKDAGVE-ADEVLE--VVMVGGSTRVPLVRERVGEFFGRPPLTSID 361
+ + A AL+ E A ++ E +V+ GG + + + E G P + + D
Sbjct: 261 QEPLTGIVSAVMVALEQCPPELASDISERGMVLTGGGALLRNLDRLLMEETGIPVVVAED 320

Query: 362 PDKVVAIGAAI 372
P VA G
Sbjct: 321 PLTCVARGGGK 331


39  SDY_2750  SDY_2794  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias    %GCBias  Product
SDY_2750        2       24  -3.248802  hypothetical protein
SDY_2751        3       26  -3.618928  DNA-binding transcriptional regulator
SDY_2752        5       30  -4.728528  DNA-invertase
SDY_2753        5       30  -4.769439  invasion plasmid antigen
SDY_2755        2       22  -2.247844  insertion sequence element IS600 transposase
SDY_4726       -1       18  -1.461247  insertion element iso-IS1N protein InsA
SDY_2761       -1       18  -1.961271  insertion element IS1 protein InsB
SDY_2762       -2       19  -2.440078  hypothetical protein
SDY_2763       -2       18  -2.277580  insertion element iso-IS1d protein InsA
SDY_2765       -2       17  -2.051791  phospho-2-dehydro-3-deoxyheptonate aldolase
SDY_2766       -1       18  -2.288830  bifunctional chorismate mutase/prephenate
SDY_2767        0       21  -3.253247  bifunctional chorismate mutase/prephenate
SDY_2768        3       28  -3.014330  translation inhibitor protein RaiA
SDY_2770        4       28  -2.327276  insertion sequence element IS600 transposase
SDY_2771        4       29  -2.263112  insertion sequence element IS600 integrase core
SDY_2772        4       26  -3.014473  hypothetical protein
SDY_2773        2       24  -2.290722  tail fiber protein
SDY_2774        3       20  -1.802098  insertion sequence element IS2 repressor TnpA
SDY_2776        1       18  -1.767622  insertion element IS1 protein InsB
SDY_2777        1       18  -0.671864  hypothetical protein
SDY_2780        0       17  -0.319866  50S ribosomal protein L19
SDY_2781        0       13  -0.155925  tRNA (guanine-N(1)-)-methyltransferase
SDY_2782        1       15  -0.595233  16S rRNA-processing protein RimM
SDY_2783        1       14  -0.630624  30S ribosomal protein S16
SDY_2784        2       10  -0.553324  signal recognition particle protein
SDY_2785        2       13  -1.213448  hypothetical protein
SDY_2786        1       13  -1.302049  hypothetical protein
SDY_2787        2       17  -1.900087  heat shock protein GrpE
SDY_2788        1       17  -1.657951  inorganic polyphosphate/ATP-NAD kinase
SDY_2789        1       18  -1.465564  recombination and repair protein
SDY_2790        1       20  -2.287301  hypothetical protein
SDY_2791        1       19  -1.820304  hypothetical protein
SDY_2792        1       20  -0.991560  hypothetical protein
SDY_2793        0       19  -1.071395  SsrA-binding protein
SDY_2794        2       21  -1.086724  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SDY_2752   CHLAMIDIAOM6     28  0.021    Chlamydia cysteine-rich outer membrane protein 6 si...
		>CHLAMIDIAOM6#Chlamydia cysteine-rich outer membrane protein 6 signature.

Length = 547

Score = 27.7 bits (61), Expect = 0.021
Identities = 16/42 (38%), Positives = 24/42 (57%), Gaps = 2/42 (4%)

Query: 51 TLSAGDTLVVWKLDRLGRSMR-HLVVLVEELRERGINFRSLT 91
T D +VWK+DRLG+ + + V V+ L+E G F + T
Sbjct: 153 TTPTADGKLVWKIDRLGQGEKSKITVWVKPLKE-GCCFTAAT 193


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
SDY_2790   BLACTAMASEA     26  0.032    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 26.3 bits (58), Expect = 0.032
Identities = 23/87 (26%), Positives = 36/87 (41%), Gaps = 11/87 (12%)

Query: 4 KTLTAAAAVLLMLTAGCSTLERVVYRPDINQGNYLTANDVSKIRV--GMTQQQVAYALGT 61
K + AVL + AG LER ++ Q + + + VS+ + GMT ++ A
Sbjct: 69 KVV-LCGAVLARVDAGDEQLERKIH---YRQQDLVDYSPVSEKHLADGMTVGELCAA--A 122

Query: 62 PLMSDPFGTNTWFYVFRQQPGHEGVTQ 88
MSD N + G G+T
Sbjct: 123 ITMSDNSAANL---LLATVGGPAGLTA 146


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SDY_2794   PRTACTNFAMLY     28  0.002    Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 28.1 bits (62), Expect = 0.002
Identities = 19/68 (27%), Positives = 28/68 (41%), Gaps = 1/68 (1%)

Query: 3 KEFVDDNRVKVNNDGNFVNDLSGRRGIYQAGIKASFSSTLSGHLGVEYSHGAGVESPWNA 62
+EF V N + +L G R G+ A+ S + EYS G + PW
Sbjct: 844 QEFDGAGTVHTNGIAH-RTELRGTRAELGLGMAAALGRGHSLYASYEYSKGPKLAMPWTF 902

Query: 63 VAGVNWSF 70
AG +S+
Sbjct: 903 HAGYRYSW 910


40  SDY_2838  SDY_2858  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias    %GCBias  Product
SDY_2838        2       22   0.044655  outer membrane protein assembly complex subunit
SDY_2839        5       27  -0.415772  insertion sequence element IS4 transposase InsG
SDY_2841        4       29  -1.950225  insertion sequence element ISSfl4 ORF2
SDY_2842        3       28  -2.522317  insertion sequence element IS600 integrase core
SDY_2843        2       25  -3.153102  insertion sequence element IS600 transposase
SDY_2844a       1       24  -3.078728  hypothetical protein
SDY_2845        0       23  -2.529326  insertion sequence element IS911 transposase
SDY_2846        1       23  -1.672952  hypothetical protein
SDY_2847        1       16  -1.391709  insertion element iso-IS1n protein InsB
SDY_4729        1       14  -1.081768  insertion element iso-IS1N protein InsA
SDY_2848        0       13  -1.426766  hypothetical protein
SDY_2849        1       14  -0.931738  insertion element iso-IS1d protein InsB
SDY_2850        2       17  -1.143692  insertion element IS1 protein InsA
SDY_2853        4       19  -1.872624  gamma-aminobutyrate transporter
SDY_2854        0       19  -2.479643  DNA-binding transcriptional regulator CsiR
SDY_2855        0       21  -3.564006  LysM domain/BON superfamily protein
SDY_2856        0       23  -3.214152  hypothetical protein
SDY_2857        1       23  -3.937488  hypothetical protein
SDY_2858        0       25  -3.739493  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
SDY_2844a  HOKGEFTOXIC     68  1e-19    Hok/Gef cell toxic protein family signature.
		>HOKGEFTOXIC#Hok/Gef cell toxic protein family signature.

Length = 52

Score = 67.5 bits (165), Expect = 1e-19
Identities = 38/51 (74%), Positives = 43/51 (84%)

Query: 1 MKLPGNALIWCVLIVCCTLLIFTLLTRNRLCEVRLKDGYREVTATMAYESG 51
MKLP ++L+WCVLIVC TLLIFT LTR LCE+R +DGYREV A MAYESG
Sbjct: 1 MKLPRSSLVWCVLIVCLTLLIFTYLTRKSLCEIRYRDGYREVAAFMAYESG 51


41  SDY_4735  SDY_2995  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias    %GCBias  Product
SDY_4735        3       24  -2.565212  insertion element iso-IS1N protein InsA
SDY_2983        2       25  -3.283045  insertion element iso-IS1n protein InsB
SDY_2986        1       25  -2.993969  insertion element iso-IS1n protein InsB
SDY_4736        1       28  -3.117027  insertion element iso-IS1N protein InsA
SDY_2987        2       31  -3.253106  insertion sequence element IS600 transposase
SDY_2990        2       32  -2.328530  insertion sequence element IS600 transposase
SDY_2992        1       28  -1.429269  hypothetical protein
SDY_2993        1       30  -0.875580  insertion element iso-IS1d protein InsB
SDY_2994        2       31  -1.404590  insertion element IS1 protein InsA
SDY_2995        2       30  -1.606340  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
SDY_2995   cloacin     30  6e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 30.5 bits (68), Expect = 6e-04
Identities = 16/30 (53%), Positives = 18/30 (60%)

Query: 57 GSSSSSSGGGSSGGGFSGGGGSSGGGGASG 86
GS S GG SG G GG G+SGGG +G
Sbjct: 49 GSGSGIHWGGGSGHGNGGGNGNSGGGSGTG 78



Score = 28.1 bits (62), Expect = 0.004
Identities = 12/23 (52%), Positives = 14/23 (60%)

Query: 63 SGGGSSGGGFSGGGGSSGGGGAS 85
SG G+ GG + GGGS GG S
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLS 82



Score = 27.0 bits (59), Expect = 0.011
Identities = 11/32 (34%), Positives = 14/32 (43%)

Query: 54 SRKGSSSSSSGGGSSGGGFSGGGGSSGGGGAS 85
S S G G G SGGG +GG ++
Sbjct: 52 SGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSA 83



Score = 26.2 bits (57), Expect = 0.016
Identities = 13/30 (43%), Positives = 16/30 (53%)

Query: 57 GSSSSSSGGGSSGGGFSGGGGSSGGGGASG 86
S ++ GGGS G GGG G GG +G
Sbjct: 40 SSENNPWGGGSGSGIHWGGGSGHGNGGGNG 69



Score = 25.8 bits (56), Expect = 0.027
Identities = 13/38 (34%), Positives = 16/38 (42%)

Query: 49 SKERASRKGSSSSSSGGGSSGGGFSGGGGSSGGGGASG 86
S E G S S G G +GGG + GGG+
Sbjct: 40 SSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGT 77



Score = 25.4 bits (55), Expect = 0.030
Identities = 12/30 (40%), Positives = 13/30 (43%)

Query: 54 SRKGSSSSSSGGGSSGGGFSGGGGSSGGGG 83
S G G +GGG GG SG GG
Sbjct: 50 SGSGIHWGGGSGHGNGGGNGNSGGGSGTGG 79


42  SDY_3091  SDY_3109  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias    %GCBias  Product
SDY_3091       -2       17   3.776536  insertion element iso-IS1n protein InsB
SDY_4737       -3       16   3.732503  insertion element iso-IS1N protein InsA
SDY_3092       -3       17   3.894487  type II secretion protein GspC
SDY_3093       -2       20   4.543249  type II secretion protein
SDY_3094       -1       18   5.005896  type II secretion protein
SDY_3095        0       18   4.056424  type II secretion protein
SDY_3096        0       15   3.346494  type II secretion protein
SDY_3097        1       15   3.549133  type II secretion protein
SDY_3098        2       16   3.448409  type II secretion protein
SDY_3099       -2       15   1.824046  type II secretion protein
SDY_3100       -2       13   0.572804  type II secretion protein
SDY_3101       -1       15  -0.115702  GspL-like protein
SDY_3104       -1       15  -1.112660  *insertion element IS1 protein InsA
SDY_3105       -1       15  -0.441621  insertion element iso-IS1d protein InsB
SDY_3107       -1       15  -0.465339  ornithine decarboxylase
SDY_3108       -1       15  -0.764153  nucleosides transporter
SDY_3109        2       16  -0.074215  murein transglycosylase C
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SDY_3092   BCTERIALGSPC    109  2e-30    Bacterial general secretion pathway protein C signa...
		>BCTERIALGSPC#Bacterial general secretion pathway protein C signature.

Length = 272

Score = 109 bits (273), Expect = 2e-30
Identities = 65/281 (23%), Positives = 114/281 (40%), Gaps = 36/281 (12%)

Query: 1 MFWLMLLIISAKVAHSLWRYFSFSAEYTVVSPSVNKPPRTDAKTFDKNDVQLISQQNWFG 60
+F+L++L+ ++A WR V S + P + + ND L FG
Sbjct: 18 LFYLLMLLFCQQLAMIFWR-IGLPDNAPVSSVQIT-PAQARQQPVTLNDFTL------FG 69

Query: 61 KY-QPVAAPV-KQPEPAPVAETRLNVVLRGIAFG---ARPGAVIEEGGKQQVYLQGERPG 115
+ A + + + + LN+ L G+ G +R A+I + +Q E
Sbjct: 70 VSPEKNKAGALDASQMSNLPPSTLNLSLTGVMAGDDDSRSIAIISKDNEQFSRGVNEEVP 129

Query: 116 SHNAVIEEINRDHVMLRYQGKIERLSLAEEERSTVAVTRQKAISDEAKQAVAEPAASAPV 175
+NA I I D V+L+YQG+ E L L +E S ++++ +Q
Sbjct: 130 GYNAKIVSIRPDRVVLQYQGRYEVLGLYSQEDSGSDGVPGAQVNEQLQQ----------- 178

Query: 176 ELPAAVRQALAKDPQKIFNYIQLTPVHKEG-IVGYAVKPGADRALFDASGFKEGDIAIAL 234
+ + +Y+ +P+ + + GY + PG F G ++ D+A+AL
Sbjct: 179 -----------RASTTMSDYVSFSPIMNDNKLQGYRLNPGPKSDSFYRVGLQDNDMAVAL 227

Query: 235 NQQDFTDPRAMIALMRQLPSMDSIQLTVLRKGARHDISIAL 275
N D D M ++ + + LTV R G R DI +
Sbjct: 228 NGLDLRDAEQAKKAMERMADVHNFTLTVERDGQRQDIYMEF 268


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SDY_3093   BCTERIALGSPD    516  e-179    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 516 bits (1331), Expect = e-179
Identities = 268/619 (43%), Positives = 395/619 (63%), Gaps = 37/619 (5%)

Query: 3 PGVQGKVSIRTMTPLNERQYYQLFLNLLEAQGYAVVPMENDVLKVVKSSAAKVEPLPLVG 62
P V+G +++R+ LNE QYYQ FL++L+ G+AV+ M N VLKVV+S AK +P+
Sbjct: 58 PSVRGTITVRSYDMLNEEQYYQFFLSVLDVYGFAVINMNNGVLKVVRSKDAKTAAVPVAS 117

Query: 63 EGSDNYAGDEMVTKVVPV-----RELAPILRQMIDSAGSGNVVNYDPSNVIMLTGRASVV 117
+ + GDE+VT+VVP+ R+LAP+LRQ+ D+AG G+VV+Y+PSNV+++TGRA+V+
Sbjct: 118 DAAPG-IGDEVVTRVVPLTNVAARDLAPLLRQLNDNAGVGSVVHYEPSNVLLMTGRAAVI 176

Query: 118 ERLTEVIQRVDHAGNRTEEVIPLDNASASEIARVLESLTKNSGENQ-PATLKSQIVADER 176
+RL +++RVD+AG+R+ +PL ASA+++ +++ L K++ ++ P ++ + +VADER
Sbjct: 177 KRLLTIVERVDNAGDRSVVTVPLSWASAADVVKLVTELNKDTSKSALPGSMVANVVADER 236

Query: 177 TNSVIVSGDPATRNKMRRLIRRLDSEMERSGNSQVFYLKYSKAEDLVDVLKQVSGTLTAA 236
TN+V+VSG+P +R ++ +I++LD + GN++V YLKY+KA DLV+VL +S T+ +
Sbjct: 237 TNAVLVSGEPNSRQRIIAMIKQLDRQQATQGNTKVIYLKYAKASDLVEVLTGISSTMQSE 296

Query: 237 KEEAEGTVGSGREVVSIAASKHSNALIVTAPQDIMQSLQSVIEQLDIRRAQVHVEALIVE 296
K+ A+ + + + I A +NALIVTA D+M L+ VI QLDIRR QV VEA+I E
Sbjct: 297 KQAAKPV-AALDKNIIIKAHGQTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAE 355

Query: 297 VAEGSNINFGVQWASKDAGLMQFANGTQIPIGTLGAAISQAKPQKGSTVISENGATTINP 356
V + +N G+QWA+K+AG+ QF N + +PI T A
Sbjct: 356 VQDADGLNLGIQWANKNAGMTQFTN-SGLPISTAIAG-------------------ANQY 395

Query: 357 DTNGDLST-LAQLLSGFSGTAVGVVKGDWMALVQAVKNDSSSNVLSTPSITTLDNQEAFF 415
+ +G +S+ LA LS F+G A G +G+W L+ A+ + + +++L+TPSI TLDN EA F
Sbjct: 396 NKDGTVSSSLASALSSFNGIAAGFYQGNWAMLLTALSSSTKNDILATPSIVTLDNMEATF 455

Query: 416 MVGQDVPVLTGSTVGSNNSNPFNTVERKKVGIMLKVTPQINEGNAVQMVIEQEVSKVEGQ 475
VGQ+VPVLTGS S N FNTVERK VGI LKV PQINEG++V + IEQEVS V
Sbjct: 456 NVGQEVPVLTGSQTTSG-DNIFNTVERKTVGIKLKVKPQINEGDSVLLEIEQEVSSVADA 514

Query: 476 TS-----LDVVFGERKLKTTVLANDGELIVLGGLMDDQAGESVAKVPLLGDIPLIGNLFK 530
S L F R + VL GE +V+GGL+D ++ KVPLLGDIP+IG LF+
Sbjct: 515 ASSTSSDLGATFNTRTVNNAVLVGSGETVVVGGLLDKSVSDTADKVPLLGDIPVIGALFR 574

Query: 531 STADKKEKRNLMVFIRPTILRDGMAADGVSQRKYNYMRAEQIYR--DEQGLSLMPHTAQP 588
ST+ K KRNLM+FIRPT++RD S +Y Q + E +++
Sbjct: 575 STSKKVSKRNLMLFIRPTVIRDRDEYRQASSGQYTAFNDAQSKQRGKENNDAMLNQDLLE 634

Query: 589 VLPAQNQALPPEVRAFLNA 607
+ P Q+ A +V A ++A
Sbjct: 635 IYPRQDTAAFRQVSAAIDA 653


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SDY_3095   BCTERIALGSPF    433  e-153    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 433 bits (1116), Expect = e-153
Identities = 218/400 (54%), Positives = 294/400 (73%), Gaps = 1/400 (0%)

Query: 1 MALFYYQALERNGRKTKGMIEADSERHARQLLRGKELIPVHI-EARMNASSGGMLQRRRH 59
MA ++YQAL+ G+K +G EADS R ARQLLR + L+P+ + E R + G
Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLR 60

Query: 60 AHRRVAAADLALFTCQLATLVQAAIPLETCLQAVSEQSEKLHVKSLGMALRSRIQEGYTL 119
R++ +DLAL T QLATLV A++PLE L AV++QSEK H+ L A+RS++ EG++L
Sbjct: 61 RKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSL 120

Query: 120 SDSLREHPRVFDSLFCSMVAAGEKSGHLDVVLNRLADYTEQRQRLKSRLLQAMLYPLVLL 179
+D+++ P F+ L+C+MVAAGE SGHLD VLNRLADYTEQRQ+++SR+ QAM+YP VL
Sbjct: 121 ADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLT 180

Query: 180 VVATSVVTILLAAVVPKIIEQFDHLGHALPATTRALIAMSDALQASGVYWLAGLLALLVL 239
VVA +VV+ILL+ VVPK++EQF H+ ALP +TR L+ MSDA++ G + L LLA +
Sbjct: 181 VVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMA 240

Query: 240 GQRLLKNPAMLLRWDKTLLRLPVTGRVARGLNTARFSRTLSILTASSVPLLEGIQTAAAV 299
+ +L+ + + + LL LP+ GR+ARGLNTAR++RTLSIL AS+VPLL+ ++ + V
Sbjct: 241 FRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDV 300

Query: 300 SANRYVEQQLLLAADRVREGSSLRAALAELRLFPPMMLYMIASGEQSGELETMLEQAAVN 359
+N Y +L LA D VREG SL AL + LFPPMM +MIASGE+SGEL++MLE+AA N
Sbjct: 301 MSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADN 360

Query: 360 QEREFDTQVGLALGLFEPALVVMMAGVVLFIVIAILEPML 399
Q+REF +Q+ LALGLFEP LVV MA VVLFIV+AIL+P+L
Sbjct: 361 QDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPIL 400


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SDY_3096   BCTERIALGSPG    218  2e-76    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 218 bits (556), Expect = 2e-76
Identities = 90/146 (61%), Positives = 109/146 (74%), Gaps = 3/146 (2%)

Query: 6 RTQKPRTGFTLLEVMVVIVILGVLASLVVPNLLGNKEKADRQKAISDIVALENALDMYRL 65
R + GFTLLE+MVVIVI+GVLASLVVPNL+GNKEKAD+QKA+SDIVALENALDMY+L
Sbjct: 2 RATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKL 61

Query: 66 DNGRYPTTEQGLEALIQQPANMADSRNYRTGGYIKRLPKDPWGNDYQYLSPGEKGLFDVY 125
DN YPTT QGLE+L++ P + NY GYIKRLP DPWGNDY ++PGE G +D+
Sbjct: 62 DNHHYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDLL 121

Query: 126 TLGADGQENGEGAGADIGNWNLQEFQ 151
+ G DG+ E DI NW L + +
Sbjct: 122 SAGPDGEMGTED---DITNWGLSKKK 144


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SDY_3097   BCTERIALGSPH     58  3e-13    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 58.0 bits (140), Expect = 3e-13
Identities = 32/185 (17%), Positives = 60/185 (32%), Gaps = 41/185 (22%)

Query: 1 MLVIFLIGLASAGVVQTFATASESPAKKAAQDFLTRFAQFKDWAVIDGQTLGVLIDPPGY 60
ML++ L+G+++ V+ F + + A + F + + + GQ GV + P +
Sbjct: 12 MLILLLMGVSAGMVLLAFPASRDDSAAQTLARFEAQLRFVQQRGLQTGQFFGVSVHPDRW 71

Query: 61 QFMQRRHGQWLPVSATRLSAQVTVPKQVQMLLQPGSDIWQKEYALELQRRRL----TLHD 116
QF+ + P D W L L+ R+ ++
Sbjct: 72 QFLVLEARDGADPA-------------------PADDGWSGYRWLPLRAGRVATSGSIAG 112

Query: 117 IELEL-----QKEAKKKTPQIRFSPFEPATPFTLRFYSAAQNACWAVKLAHDGALSLSQC 171
+L L + P + P TPF L L ++ +
Sbjct: 113 GKLNLAFAQGEAWTPGDNPDVLIFPGGEMTPFRLT-------------LGEAPGIAFNAR 159

Query: 172 DERMP 176
E +P
Sbjct: 160 GESLP 164


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SDY_3098   BCTERIALGSPH     33  1e-04    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 33.0 bits (75), Expect = 1e-04
Identities = 13/24 (54%), Positives = 18/24 (75%)

Query: 2 KRGFTLLEVMLALAIFALAATTVL 25
+RGFTLLE+ML L + ++A VL
Sbjct: 3 QRGFTLLEMMLILLLMGVSAGMVL 26


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
SDY_3108   TCRTETA     30  0.017    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 30.2 bits (68), Expect = 0.017
Identities = 36/239 (15%), Positives = 76/239 (31%), Gaps = 18/239 (7%)

Query: 174 SHMQLYIGAALSAILVLFTLTLPHIPVAKQQANQSWTTLLGLDAFALFKNKRMAIFFIFS 233
H + AAL+ + L L + + L+ A F+ R
Sbjct: 159 PHAPFFAAAALNGLNFLTGCFLLPES---HKGERRPLRREALNPLASFRWARGMTVVAAL 215

Query: 234 MLLGAELQITNMFGNTFLHSFDKDPMFASSFIVQHASIIMSISQISETLF-ILTIPFFLS 292
M + +Q+ F +D + + I ++ I +L + +
Sbjct: 216 MAVFFIMQLVGQVPAALWVIFGEDRFHWDATTI---GISLAAFGILHSLAQAMITGPVAA 272

Query: 293 RYGIKNVMMISIVAWILRFALFAYGDPTPFGTVLLVLSMIVYGCAFDFFNISGSVFVEKE 352
R G + +M+ ++A + L A+ + ++V + + + ++
Sbjct: 273 RLGERRALMLGMIADGTGYILLAF-----ATRGWMAFPIMVLLASGGIGMPALQAMLSRQ 327

Query: 353 VSPAIRASAQGMFLMMTNGFGCILGGIVSGKVVEMYTQNGITDWQ-TVWLIFAGYSVVL 410
V + QG +T+ L IV + IT W W+ A ++
Sbjct: 328 VDEERQGQLQGSLAALTS-----LTSIVGPLLFTAIYAASITTWNGWAWIAGAALYLLC 381


43  SDY_3306  SDY_3311  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
SDY_3306  0  15  -4.325191  formate acetyltransferase 3
SDY_3307  1  21  -6.496706  propionate/acetate kinase
SDY_3308  0  18  -6.294147  threonine/serine transporter TdcC
SDY_3309  1  24  -7.248548  threonine dehydratase
SDY_3310  1  23  -6.674756  DNA-binding transcriptional activator TdcA
SDY_3311  0  22  -4.291445  DNA-binding transcriptional activator TdcR
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SDY_3307  ACETATEKNASE  536  0.0  Acetate kinase family signature.
		>ACETATEKNASE#Acetate kinase family signature.

Length = 400

Score = 536 bits (1382), Expect = 0.0
Identities = 173/397 (43%), Positives = 254/397 (63%), Gaps = 11/397 (2%)

Query: 11 VLVINCGSSSIKFSVLDASDCEVLMSGIADGINSENAFLSVN-GGEPAP--LAHHSYEGA 67
+LVINCGSSS+K+ ++++ D VL G+A+ I ++ L+ N GE ++ A
Sbjct: 3 ILVINCGSSSLKYQLIESKDGNVLAKGLAERIGINDSLLTHNANGEKIKIKKDMKDHKDA 62

Query: 68 LKAIAFELEKRNLN-----DSVALIGHRIAHGGSIFTESAIITDEVIDNIRRVSPLAPLH 122
+K + L + + +GHR+ HGG FT S +ITD+V+ I LAPLH
Sbjct: 63 IKLVLDALVNSDYGVIKDMSEIDAVGHRVVHGGEYFTSSVLITDDVLKAITDCIELAPLH 122

Query: 123 NYANLSGIESAQQLFPGVTQVAVFDTSFHQTMAPEAYLYGLPWKYYEELGVRRYGFHGTS 182
N AN+ GI++ Q+ P V VAVFDT+FHQTM AYLY +P++YY + +R+YGFHGTS
Sbjct: 123 NPANIEGIKACTQIMPDVPMVAVFDTAFHQTMPDYAYLYPIPYEYYTKYKIRKYGFHGTS 182

Query: 183 HRYVSQRAHSLLNLAEDDSGLVVAHLGNGASICAVRNGQSVDTSMGMTPLEGLMMGTRSG 242
H+YVSQRA +LN + ++ HLGNG+SI AV+NG+S+DTSMG TPLEGL MGTRSG
Sbjct: 183 HKYVSQRAAEILNKPIESLKIITCHLGNGSSIAAVKNGKSIDTSMGFTPLEGLAMGTRSG 242

Query: 243 DVDFGAMSWVASQTNQSLGDLERVVNKESGLLGISGLSSDLR-VLEKAWHEGHERAQLAI 301
+D +S++ + N S ++ ++NK+SG+ GISG+SSD R + + A+ G +RAQLA+
Sbjct: 243 SIDPSIISYLMEKENISAEEVVNILNKKSGVYGISGISSDFRDLEDAAFKNGDKRAQLAL 302

Query: 302 KTFVHRIARHIAGHAASLRRLDGIIFTGGIGENSSLIRRLVIEHLAVLGVEIDTEMNNRS 361
F +R+ + I +AA++ +D I+FT GIGEN IR +++ L LG ++D E N
Sbjct: 303 NVFAYRVKKTIGSYAAAMGGVDVIVFTAGIGENGPEIREFILDGLEFLGFKLDKEKNKVR 362

Query: 362 NSFGERIVSSENARVICAVIPTNEEKMIALDAIHLGK 398
E I+S+ +++V V+PTNEE MIA D + +
Sbjct: 363 GE--EAIISTADSKVNVMVVPTNEEYMIAKDTEKIVE 397


44  SDY_3538  SDY_3556  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
SDY_3538  -2  19  -4.225008  multidrug efflux system protein MdtE
SDY_3539  2  20  -8.272100  hypothetical protein
SDY_3540  2  17  -4.342022  insertion element IS1 protein InsA
SDY_3541  2  14  -4.453692  insertion element iso-IS1d protein InsB
SDY_3542  1  21  -5.059637  acid-resistance membrane protein
SDY_3543  3  22  -3.385844  acid-resistance protein
SDY_3544  2  22  -1.377308  acid-resistance protein
SDY_3546  1  22  -0.539201  insertion element IS1 protein InsB
SDY_4756  0  23  -0.888505  insertion element iso-IS1N protein InsA
SDY_3547  0  23  -2.033128  hemin importer ATP-binding subunit
SDY_3548  -2  17  -1.751861  iron compound ABC transporter permease
SDY_3549  -3  17  -2.350228  hypothetical protein
SDY_3550  -1  18  -4.198790  hypothetical protein
SDY_3552  -1  19  -4.761439  periplasmic binding protein
SDY_3553  0  19  -5.149606  hypothetical protein
SDY_3554  -1  16  -3.980399  outer membrane heme/hemoglobin receptor
SDY_3555  -1  18  -4.924735  heme/hemoglobin transport protein
SDY_3556  1  16  -3.136946  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SDY_3538  RTXTOXIND  49  1e-08  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 49.4 bits (118), Expect = 1e-08
Identities = 41/218 (18%), Positives = 70/218 (32%), Gaps = 33/218 (15%)

Query: 96 LQAELNSAKGSLAKALSTASNARITFNRQASLLKTNYVSR-QDYDT-ARTQLNEAEANVT 153
+ + A L S + K Y Q + +L + N+
Sbjct: 257 QENKYVEAVNELRVYKSQLEQIE----SEILSAKEEYQLVTQLFKNEILDKLRQTTDNIG 312

Query: 154 VAKAAVEQATINLQYANVTSPITGASGKSSV-TVGALVTANQADSLVTVQRLDPIYVDLT 212
+ + + Q + + +P++ + V T G +VT + +V V D + V
Sbjct: 313 LLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAET-LMVIVPEDDTLEVTAL 371

Query: 213 QSVQDFLRMKEEVASGQIKQVQGSTPVQLNLE--NGKRY-SQTGTLK--FSDPTVDETTG 267
+D I + + +E RY G +K D D+ G
Sbjct: 372 VQNKD------------IGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLG 419

Query: 268 SVT--LRAI------FPNPNGDLLPGMYVTALVDEGSR 297
V + +I N N L GM VTA + G R
Sbjct: 420 LVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGMR 457



Score = 32.1 bits (73), Expect = 0.003
Identities = 25/118 (21%), Positives = 47/118 (39%), Gaps = 7/118 (5%)

Query: 52 PGRTVPY-EVAEIRPQVGGIIIKRNFI-EGDKVNQGDSLYQIDPAPLQAELNSAKGSLAK 109
G+ EI+P I+ K + EG+ V +GD L ++ +A+ + SL +
Sbjct: 87 NGKLTHSGRSKEIKPIENSIV-KEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQ 145

Query: 110 ALSTASNARITFNRQASLLKTNYVSRQDYDTARTQLNEAEANVTVAKAAVEQATINLQ 167
A + +I +R L K + D + N +E V + +++ Q
Sbjct: 146 ARLEQTRYQIL-SRSIELNKLPELKLPDEPYFQ---NVSEEEVLRLTSLIKEQFSTWQ 199


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SDY_3547  PF05272  28  0.030  Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.5 bits (63), Expect = 0.030
Identities = 10/23 (43%), Positives = 13/23 (56%), Gaps = 1/23 (4%)

Query: 28 EIVAIL-GPNGAGKSTLLRQLTG 49
+ +L G G GKSTL+ L G
Sbjct: 596 DYSVVLEGTGGIGKSTLINTLVG 618


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SDY_3552  FERRIBNDNGPP  32  0.003  Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 31.8 bits (72), Expect = 0.003
Identities = 40/210 (19%), Positives = 72/210 (34%), Gaps = 25/210 (11%)

Query: 55 VKRKKLFTAVLALSWAF--------SVTAAERIVVAGGSLTELIYAMGAGERVVGVDETT 106
+ R++L TA +ALS + RIV EL+ A+G GV +T
Sbjct: 7 ISRRRLLTA-MALSPLLWQMNTAHAAAIDPNRIVALEWLPVELLLALGIVP--YGVADTI 63

Query: 107 SY------PPETAKLPHIGYWKQLSSEGILSLRPDSVITWQDAGPQIVLDQL-RAQKVNV 159
+Y PP + +G + + E + ++P ++ GP + L R
Sbjct: 64 NYRLWVSEPPLPDSVIDVGLRTEPNLELLTEMKPSFMVWSAGYGP--SPEMLARIAPGRG 121

Query: 160 VTLPRVPATLEQMYANIRQLAKTLQVPEQGDALVTQINQRLERVQQNVAAKKAPVKAMFI 219
L ++ ++A L + + + Q + ++ + A +
Sbjct: 122 FNFSDGKQPLAMARKSLTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLTT 181

Query: 220 LSAGGSAPQ--VAGKGSVADAILSLAGAEN 247
L V G S+ IL G N
Sbjct: 182 LI---DPRHMLVFGPNSLFQEILDEYGIPN 208


45  SDY_3628  SDY_3641  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
SDY_3628  0  18  3.826305  insertion element IS1 protein InsA
SDY_3629  1  20  4.571659  3-oxoacyl-ACP synthase
SDY_3630  0  23  4.894811  holo-(acyl carrier protein) synthase 2
SDY_3631  0  24  4.847756  nickel periplasmic binding protein
SDY_3632  1  26  5.812432  nickel transporter permease NikB
SDY_3633  0  23  4.787515  nickel transporter permease NikC
SDY_3634  -1  22  3.298985  nickel transporter ATP-binding protein NikD
SDY_3635  -1  20  0.707216  nickel transporter ATP-binding protein NikE
SDY_3636  1  22  -1.788022  nickel responsive regulator
SDY_4761  4  25  -3.800373  insertion element iso-IS1N protein InsA
SDY_3639  4  24  -3.678396  insertion element iso-IS1n protein InsB
SDY_4762  2  25  -3.302847  insertion element iso-IS1N protein InsA
SDY_3641  2  28  -4.378351  PTS system transporter subunit IIA
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SDY_3635  HTHFIS  29  0.020  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 29.0 bits (65), Expect = 0.020
Identities = 10/34 (29%), Positives = 19/34 (55%)

Query: 25 QAVLNNVSLTLKSGETVALLGRSGCGKSTLARLL 58
Q + ++ +++ T+ + G SG GK +AR L
Sbjct: 147 QEIYRVLARLMQTDLTLMITGESGTGKELVARAL 180


46  SDY_3795  SDY_3800  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias  Product
SDY_3795  1  20  4.961888  argininosuccinate lyase
SDY_3796  2  22  4.896139  DNA-binding transcriptional regulator OxyR
SDY_3797  1  19  4.878363  soluble pyridine nucleotide transhydrogenase
SDY_4767  1  19  5.093124  insertion element iso-IS1N protein InsA
SDY_3799  -1  17  5.357476  insertion element iso-IS1n protein InsB
SDY_3800  -1  17  5.253430  rhs element protein RhsB
47  SDY_3857  SDY_3885  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
SDY_3857  -1  20  -3.277409  ribonuclease BN
SDY_3858  0  20  -3.553636  phosphatase
SDY_3862  0  21  -3.128105  insertion sequence element IS911 transposase
SDY_3863  2  19  -1.079473  hypothetical protein
SDY_4771  3  20  -0.204548  insertion element iso-IS1N protein InsA
SDY_3867  3  23  0.216307  shikimate transporter
SDY_3868  2  22  1.339833  insertion element IS1 protein InsA
SDY_3869  2  19  1.933580  insertion element iso-IS1d protein InsB
SDY_3871  2  17  1.722545  insertion sequence element IS4 transposase InsG
SDY_3872  2  21  1.726358  GTP-binding protein
SDY_3873  0  15  1.666238  glutamine synthetase
SDY_3874  0  13  0.984190  nitrogen regulation protein NR(II)
SDY_3875  0  14  -0.148567  nitrogen regulation protein NR(I)
SDY_3876  0  13  -1.150149  coproporphyrinogen III oxidase
SDY_3877  0  14  -1.620995  hypothetical protein
SDY_3878  -2  13  -1.688953  ribosome biogenesis GTP-binding protein YsxC
SDY_3879  -2  14  -1.732266  DNA polymerase I
SDY_3880  0  14  -3.783794  acyltransferase
SDY_3881  -2  16  -2.667757  insertion element iso-IS1n protein InsB
SDY_4772  -2  17  -3.053154  insertion element iso-IS1N protein InsA
SDY_3883  -2  18  -2.919810  insertion element iso-IS1d protein InsB
SDY_3884  0  20  -3.556504  insertion element IS1 protein InsA
SDY_3885  -1  17  -3.266480  protein disulfide isomerase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SDY_3872  TCRTETOQM  180  4e-51  Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family

signature.
Length = 639

Score = 180 bits (458), Expect = 4e-51
Identities = 97/445 (21%), Positives = 170/445 (38%), Gaps = 81/445 (18%)

Query: 4 KLRNIAIIAHVDHGKTTLVDKLLQQSGTFDSRAETQE--RVMDSNDLEKERGITILAKNT 61
K+ NI ++AHVD GKTTL + LL SG + D+ LE++RGITI T
Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61

Query: 62 AIKWNDYRINIVDTPGHADFGGEVERVMSMVDSVLLVVDAFDGPMPQTRFVTKKAFAYGL 121
+ +W + ++NI+DTPGH DF EV R +S++D +L++ A DG QTR + G+
Sbjct: 62 SFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGI 121

Query: 122 KPIVVINKVDRPGARPDWVVDQVFD-------------LFVNLDATDEQLD--------- 159
I INK+D+ G V + + L+ N+ T+
Sbjct: 122 PTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIEG 181

Query: 160 --------------------------------FPIVYASALNGIAGLDHEDMAEDMTPLY 187
FP+ + SA N I G+D+ L
Sbjct: 182 NDDLLEKYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNNI-GIDN---------LI 231

Query: 188 QAIVDHVPAPDVDLDGPFQMQISQLDYNSYVGVIGIGRIKRGKVKPNQQVTIIDSEGKTR 247
+ I + + ++ +++Y+ + R+ G + V I + E
Sbjct: 232 EVITNKFYSSTHRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKEKI-- 289

Query: 248 NAKVGKVLGHLGLERIETDLAEAGDIVAITGLGELNISDTVCDTQNVEALPALSVDEPTV 307
K+ ++ + E + D A +G+IV + L ++ + DT+ + + P +
Sbjct: 290 --KITEMYTSINGELCKIDKAYSGEIVILQNEF-LKLNSVLGDTKLLPQRERIENPLPLL 346

Query: 308 SMFFCVNTSPFCGKEGKFVTSRQILDRLNKELVHNVALRVEETEDADAFRVSGRGELHLS 367
+ + D L LR +S G++ +
Sbjct: 347 QTTVEPSKPQQREMLLDALLEISDSDPL---------LRYYVDSATHEIILSFLGKVQME 397

Query: 368 VLIENMRRE-GFELAVSRPKVIFRE 391
V ++ + E+ + P VI+ E
Sbjct: 398 VTCALLQEKYHVEIEIKEPTVIYME 422



Score = 32.5 bits (74), Expect = 0.005
Identities = 13/75 (17%), Positives = 29/75 (38%), Gaps = 1/75 (1%)

Query: 398 EPYENVTLDVEEQHQGSVMQALGERKGDLKNMNPDGKGRVRLDYVIPSRGLIGFRSEFMT 457
EPY + + +++ + ++ + V L IP+R + +RS+
Sbjct: 537 EPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQLKN-NEVILSGEIPARCIQEYRSDLTF 595

Query: 458 MTSGTGLLYSTFSHY 472
T+G + + Y
Sbjct: 596 FTNGRSVCLTELKGY 610


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SDY_3874  PF06580  28  0.041  Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 28.3 bits (63), Expect = 0.041
Identities = 34/190 (17%), Positives = 72/190 (37%), Gaps = 41/190 (21%)

Query: 171 IIEQADRLRNLVDRL---LGPQLPGTRVTE-SIHKVAERV---VTLVSMELPDNVRLIRD 223
I+E + R ++ L + L + + S+ V + L S++ D ++
Sbjct: 186 ILEDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQ 245

Query: 224 YDPSLPELAHDPDQIEQVLLN-IVRNALQ---ALGPEGGEIILRTRTAFQLTLHGERYRL 279
+P++ ++ Q+ +L+ +V N ++ A P+GG+I+L+
Sbjct: 246 INPAIMDV-----QVPPMLVQTLVENGIKHGIAQLPQGGKILLKGT------KDNGTVT- 293

Query: 280 AARIDVEDNGPGIPPHLQDTLFYPMVSGREGGTGLGLSIARNLIDQHSGK---IEFTSWP 336
++VE+ G + ++ TG GL R + G I+ +
Sbjct: 294 ---LEVENTGSLALKNTKE------------STGTGLQNVRERLQMLYGTEAQIKLSEKQ 338

Query: 337 GHTEFSVYLP 346
G V +P
Sbjct: 339 GKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SDY_3875  HTHFIS  602  0.0  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 602 bits (1553), Expect = 0.0
Identities = 206/478 (43%), Positives = 300/478 (62%), Gaps = 11/478 (2%)

Query: 1 MQRGIVWVVDDDSSIRWVLERALAGAGLTCTTFENGAEVLEALASKTPDVLLSDIRMPGM 60
M + V DDD++IR VL +AL+ AG N A + +A+ D++++D+ MP
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 DGLALLKQIKQRHPMLPVIIMTAHSDLDAAVSAYQQGAFDYLPKPFDIDEAVALVERAIS 120
+ LL +IK+ P LPV++M+A + A+ A ++GA+DYLPKPFD+ E + ++ RA++
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 121 HYQEQQQPRNVQLNGPTTDIIGEAPAMQDVFRIIGRLSRSSISVLINGESGTGKELVAHA 180
+ + ++G + AMQ+++R++ RL ++ ++++I GESGTGKELVA A
Sbjct: 121 EPKRRPSKLEDDSQDGM-PLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARA 179

Query: 181 LHRHSPRAKAPFIALNMAAIPKDLIESELFGHEKGAFTGANTIRQGRFEQADGGTLFLDE 240
LH + R PF+A+NMAAIP+DLIESELFGHEKGAFTGA T GRFEQA+GGTLFLDE
Sbjct: 180 LHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDE 239

Query: 241 IGDMPLDVQTRLLRVLADGQFYRVGGYAPVKVDVRIIAATHQNLEQRVQEGKFREDLFHR 300
IGDMP+D QTRLLRVL G++ VGG P++ DVRI+AAT+++L+Q + +G FREDL++R
Sbjct: 240 IGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYR 299

Query: 301 LNVIRVHLPPLRERREDIPRLARHFLQVAARELGVEAKLLHPETEAALTRLAWPGNVRQL 360
LNV+ + LPPLR+R EDIP L RHF+Q A +E G++ K E + WPGNVR+L
Sbjct: 300 LNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKE-GLDVKRFDQEALELMKAHPWPGNVREL 358

Query: 361 ENTCRWLTVMAAGQEVLIQDLPGELFESTVAESTSQMQPDSWATLLAQWADRALRS---- 416
EN R LT + + + + EL + S + ++Q + +R
Sbjct: 359 ENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFAS 418

Query: 417 -----GHQNLLSEAQPELERTLLTTALRHTQGHKQEAARLLGWGRNTLTRKLKELGME 469
L E+E L+ AL T+G++ +AA LLG RNTL +K++ELG+
Sbjct: 419 FGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVS 476


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SDY_3877  SECA  28  0.030  SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 27.5 bits (61), Expect = 0.030
Identities = 11/71 (15%), Positives = 28/71 (39%)

Query: 13 AKARRKTREELNQEARDRKRQKKRRGHAPGSRAAGGNTTSGCKGQNAPKDPRIGSKTPIP 72
+K + + EE+ + + R+ + +R ++ + + ++G P P
Sbjct: 827 SKVQVRMPEEVEELEQQRRMEAERLAQMQQLSHQDDDSAAAAALAAQTGERKVGRNDPCP 886

Query: 73 LGVTEKVTKQH 83
G +K + H
Sbjct: 887 CGSGKKYKQCH 897


48  SDY_4008  SDY_4034  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
SDY_4008  2  19  -1.348362  16S rRNA methyltransferase GidB
SDY_4009  3  32  -0.100415  F0F1 ATP synthase subunit I
SDY_4010  2  31  0.159018  ATP synthase F0F1 subunit A
SDY_4011  4  40  1.176505  ATP synthase F0F1 subunit C
SDY_4012  4  38  1.284584  ATP synthase F0F1 subunit B
SDY_4013  3  33  1.157068  ATP synthase F0F1 subunit delta
SDY_4014  3  34  1.324760  ATP synthase F0F1 subunit alpha
SDY_4015  2  25  2.084522  ATP synthase F0F1 subunit gamma
SDY_4016  3  25  2.019211  ATP synthase F0F1 subunit beta
SDY_4017  1  20  1.384039  ATP synthase F0F1 subunit epsilon
SDY_4018  0  21  1.431807  bifunctional N-acetylglucosamine-1-phosphate
SDY_4019  0  25  1.963316  glucosamine--fructose-6-phosphate
SDY_4020  0  26  2.014345  insertion sequence element IS1294 transposase
SDY_4021  -1  26  0.880405  hypothetical protein
SDY_4022  0  17  -5.377824  phosphate ABC transporter substrate-binding
SDY_4023  2  17  -7.891021  phosphate transporter permease PstC
SDY_4024  1  21  -9.590984  phosphate transporter permease PtsA
SDY_4025  1  23  -11.317784  phosphate transporter subunit
SDY_4026  3  31  -12.951229  transcriptional regulator PhoU
SDY_4027  6  41  -12.154466  hypothetical protein
SDY_4028  4  24  -2.797279  hypothetical protein
SDY_4029  6  22  -0.550240  insertion element IS1 protein InsA
SDY_4030  5  17  0.077794  insertion element iso-IS1d protein InsB
SDY_4775  3  17  0.171234  insertion element iso-IS1N protein InsA
SDY_4033  3  14  0.658054  hypothetical protein
SDY_4034  2  14  1.261401  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SDY_4012  IGASERPTASE  27  0.028  IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 27.3 bits (60), Expect = 0.028
Identities = 20/101 (19%), Positives = 37/101 (36%), Gaps = 18/101 (17%)

Query: 31 AAIEKRQKEIADGLASAERAHKDLDLAKASATDQLKKAKAEAQVIIEQ--ANKRRSQILD 88
+EK +++ + A K+ + T + A++ ++ Q K + +
Sbjct: 1049 KTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEK 1108

Query: 89 EAKAEAEQERTKIVA----------------QAQAEIEAER 113
E KA+ E E+T+ V Q QAE E
Sbjct: 1109 EEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPAREN 1149


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SDY_4018  RTXTOXINA  29  0.047  Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family

signature.
Length = 1024

Score = 29.2 bits (65), Expect = 0.047
Identities = 23/80 (28%), Positives = 31/80 (38%), Gaps = 10/80 (12%)

Query: 367 LGDAEIGDNVNIGAGTITCNYDGANKFKTIIGDDVFVGSDTQLVAPVTVGKGATIAAGTT 426
LGD + D V + AG+ N G DV T G AT A T
Sbjct: 616 LGDGD--DKVFLSAGSA--NIYAGK------GHDVVYYDKTDTGYLTIDGTKATEAGNYT 665

Query: 427 VTRNVGENALAISRVPQTQK 446
VTR +G + + V + Q+
Sbjct: 666 VTRVLGGDVKVLQEVVKEQE 685


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SDY_4033  OMADHESIN  64  3e-13  Yersinia outer membrane adhesin signature.
		>OMADHESIN#Yersinia outer membrane adhesin signature.

Length = 455

Score = 63.8 bits (154), Expect = 3e-13
Identities = 58/166 (34%), Positives = 93/166 (56%), Gaps = 3/166 (1%)

Query: 158 NYASALGVESEADGEKSLALGFKSKSGGIYSIALGAAANASATDAFAVGRESAASGTDSL 217
N ALG+E A G + + GI+SIA+GA A A+ A AVG S A+G +S+
Sbjct: 42 NADPALGLEYPVRPPVPGAGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSV 101

Query: 218 ALGRKSVASAANSIAIGAETEAAENATAVGNNAKAKGTNSMAMGLGSLADKVNTIALGNG 277
A+G S A +++ GA + A ++ A+G A T +A+G S AD N++A+G+
Sbjct: 102 AIGPLSKALGDSAVTYGAASTAQKDGVAIGARASTSDT-GVAVGFNSKADAKNSVAIGHS 160

Query: 278 SQALADN--AIAIGQGNKADGVDAIALGNGSQSRGLNTIALGTASN 321
S A++ +IAIG +K D +++++G+ S +R L +A GT
Sbjct: 161 SHVAANHGYSIAIGDRSKTDRENSVSIGHESLNRQLTHLAAGTKDT 206



Score = 41.8 bits (97), Expect = 4e-06
Identities = 62/207 (29%), Positives = 101/207 (48%), Gaps = 24/207 (11%)

Query: 34 KLLISALVAGGMFSS-FAYADNADGTPVVPAGHNSGNGWVAIGEGSTASQHTGPDGASTA 92
K+ +SA + +FSS +A+AD+ DG P + A S N A+G G
Sbjct: 6 KISVSAALISALFSSPYAFADDYDGIPNLTAVQISPNADPALGLEYPVRPPVPGAG---- 61

Query: 93 IGNLASALGKYSTSIGARSSAGGDASTALGVKASASGDRGIALGASSISEGNYSMALGVV 152
G +SA G S A+G A A+ +A+GA SI+ G S+A+G +
Sbjct: 62 ---------------GLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPL 106

Query: 153 AVAHGNYASALGVESEADGEKSLALGFKSKSGGIYSIALGAAANASATDAFAVGRES--A 210
+ A G+ A G S A + +A+G ++ + +A+G + A A ++ A+G S A
Sbjct: 107 SKALGDSAVTYGAASTAQ-KDGVAIGARASTSDT-GVAVGFNSKADAKNSVAIGHSSHVA 164

Query: 211 ASGTDSLALGRKSVASAANSIAIGAET 237
A+ S+A+G +S NS++IG E+
Sbjct: 165 ANHGYSIAIGDRSKTDRENSVSIGHES 191


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SDY_4034  OMADHESIN  62  5e-12  Yersinia outer membrane adhesin signature.
		>OMADHESIN#Yersinia outer membrane adhesin signature.

Length = 455

Score = 61.8 bits (149), Expect = 5e-12
Identities = 57/182 (31%), Positives = 96/182 (52%), Gaps = 31/182 (17%)

Query: 5 DARATGTIATAVGYNAYASGEQSLAVGPNSIADDDFSTAIGAQAKAFGHHSLALGAGSNT 64
+A A G + A+G A A+ ++AVG SIA S AIG +KA G ++ GA S T
Sbjct: 64 NASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAAS-T 122

Query: 65 ASDASIALGANSFATGAQSMSLGVASKTSAEAAIALGYNSFANGLNSMSLGQSSYAGKDN 124
A +A+GA + ++++ +A+G+NS A+ NS+++G SS+ ++
Sbjct: 123 AQKDGVAIGARA---------------STSDTGVAVGFNSKADAKNSVAIGHSSHVAANH 167

Query: 125 SVALGSDASADGLNSVALGAGSIAEYDNTVSVGSSTLQRKVVNMAAGIVSQTSTDAINGS 184
S+A+G S + +N+VS+G +L R++ ++AAG TDA+N +
Sbjct: 168 GY------------SIAIGDRSKTDRENSVSIGHESLNRQLTHLAAG---TKDTDAVNVA 212

Query: 185 QL 186
QL
Sbjct: 213 QL 214



Score = 38.0 bits (87), Expect = 1e-04
Identities = 27/80 (33%), Positives = 49/80 (61%)

Query: 85 SLGVASKTSAEAAIALGYNSFANGLNSMSLGQSSYAGKDNSVALGSDASADGLNSVALGA 144
+LG+ A G N+ A G++S+++G ++ A K +VA+G+ + A G+NSVA+G
Sbjct: 46 ALGLEYPVRPPVPGAGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGP 105

Query: 145 GSIAEYDNTVSVGSSTLQRK 164
S A D+ V+ G+++ +K
Sbjct: 106 LSKALGDSAVTYGAASTAQK 125



Score = 34.5 bits (78), Expect = 0.002
Identities = 39/149 (26%), Positives = 64/149 (42%), Gaps = 25/149 (16%)

Query: 621 NGLAFNDASASGVGATAVGYNAVASGASSVAIGQNSSSTVDTGIALGSSSVSSRVIAKGS 680
+G+A +++ AVG+N+ A +SVAIG +S + G + IA G
Sbjct: 126 DGVAIGARASTSDTGVAVGFNSKADAKNSVAIGHSSHVAANHGYS----------IAIGD 175

Query: 681 RDTSVTENGVAIGYGTTDGELLGALSIGDDGKYRQIINVADGSEAHDAVTVRQLQNAIGA 740
R + EN V+IG+ + + RQ+ ++A G++ DAV V QL+ I
Sbjct: 176 RSKTDRENSVSIGHESLN---------------RQLTHLAAGTKDTDAVNVAQLKKEIEK 220

Query: 741 VATTPTKYYHANSTAENSLAVGEDSLAMG 769
K N+ A + S +G
Sbjct: 221 TQENTNKRSAELLANANAYADNKSSSVLG 249



Score = 33.7 bits (76), Expect = 0.003
Identities = 39/156 (25%), Positives = 68/156 (43%), Gaps = 3/156 (1%)

Query: 129 GSDASADGLNSVALGAGSIAEYDNTVSVGSSTLQRKVVNMAAGIVSQTSTDAINGSQLYS 188
G +ASA G++S+A+GA + A V+VG+ ++ V ++A G +S+ D+ S
Sbjct: 62 GLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAAS 121

Query: 189 LSSNIANYFGGDASVSDDGVFTCPTYNINGTDYTNVGDALAAIDTSFEDALLWDENANGG 248
+ G AS SD GV + + +G + + D +
Sbjct: 122 TAQKDGVAIGARASTSDTGVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDR 181

Query: 249 TGAFSASHGKNDSKITNVLAGAVTETSTDAINSGQL 284
+ S H + ++T++ AG TDA+N QL
Sbjct: 182 ENSVSIGHESLNRQLTHLAAGT---KDTDAVNVAQL 214



Score = 31.0 bits (69), Expect = 0.020
Identities = 35/96 (36%), Positives = 54/96 (56%), Gaps = 15/96 (15%)

Query: 727 DAVTVRQLQNAIGAVATTPTKYYHANSTAENSLAVGEDSLAMGAKTVVNGNAGIGIGLNT 786
++V + L A+G A T Y A STA+ +D +A+GA+ + + G+ +G N+
Sbjct: 99 NSVAIGPLSKALGDSAVT----YGAASTAQ------KDGVAIGARASTS-DTGVAVGFNS 147

Query: 787 LVLADAINGIAIG--SNARANHANSIAMGNGSQTTR 820
DA N +AIG S+ ANH SIA+G+ S+T R
Sbjct: 148 KA--DAKNSVAIGHSSHVAANHGYSIAIGDRSKTDR 181


49  SDY_4049  SDY_4059  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
SDY_4049  1  15  -3.800935  L-threonine 3-dehydrogenase
SDY_4050  2  21  -6.747123  2-amino-3-ketobutyrate CoA ligase
SDY_4051  3  28  -9.854865  ADP-L-glycero-D-manno-heptose-6-epimerase
SDY_4052  5  36  -12.112795  ADP-heptose--LPS heptosyltransferase
SDY_4053  5  45  -15.485984  ADP-heptose--LPS heptosyltransferase
SDY_4054  5  47  -17.655304  lipid A-core:surface polymer ligase WaaL
SDY_4055  3  39  -15.113401  beta 1,4-galactosyltransferase WaaX
SDY_4056  4  33  -11.950063  UDP-galactose:(galactosyl) LPS
SDY_4057  3  31  -10.161803  lipopolysaccharide core biosynthesis protein
SDY_4058  3  25  -7.023571  lipopolysaccharide 1,2-glucosyltransferase
SDY_4059  3  22  -4.700762  lipopolysaccharide 1,3-galactosyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SDY_4051  NUCEPIMERASE  101  8e-27  Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 101 bits (253), Expect = 8e-27
Identities = 76/348 (21%), Positives = 127/348 (36%), Gaps = 67/348 (19%)

Query: 2 IIVTGGAGFIGSNIVKALNDKGITDILVVDNLKD--------------GTKFVNLVDLNI 47
+VTG AGFIG ++ K L + G ++ +DNL D +++
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAG-HQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 48 ADYMDKEDFLIQIMAGEDFGDVEAIFHEGACSSTTEWDGKYMMDNNYQYSK-------EL 100
AD + + + A F E +F + +Y ++N + Y+ +
Sbjct: 62 ADR----EGMTDLFASGHF---ERVFISPHRLAV-----RYSLENPHAYADSNLTGFLNI 109

Query: 101 LHYCLEREIP-FLYASSAATYGGRTSD-FIESREYEKPLNVYGYSKFLFDEYVRQILPEA 158
L C +I LYASS++ YG F + P+++Y +K +
Sbjct: 110 LEGCRHNKIQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLY 169

Query: 159 NSQIVGFRYFNVYGPREGHKGSMASVAFHLNTQLNNGESPKLFEGSENFKRDFVYVGDVA 218
G R+F VYGP + MA F + G+S ++ KRDF Y+ D+A
Sbjct: 170 GLPATGLRFFTVYGPWG--RPDMA--LFKFTKAMLEGKSIDVY-NYGKMKRDFTYIDDIA 224

Query: 219 DVNL------------WFLENGVSG-------IFNLGTGRAESFQAVADATLAY-HKKGQ 258
+ + W +E G ++N+G A + +
Sbjct: 225 EAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAK 284

Query: 259 IEYIPFPDKLKGRYQAFTQADLTNLRAA-GYDKPFKTVAEGVTEYMAW 305
+P G T AD L G+ P TV +GV ++ W
Sbjct: 285 KNMLPLQ---PGDVL-ETSADTKALYEVIGF-TPETTVKDGVKNFVNW 327


50  SDY_4107  SDY_4128  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
SDY_4107  0  32  3.775095  insertion element iso-IS1d protein InsB
SDY_4108  1  34  4.973882  insertion element IS1 protein InsA
SDY_4109  1  38  6.020568  hypothetical protein
SDY_4111  2  38  7.465005  phosphonate/organophosphate ester ABC
SDY_4112  1  39  7.364415  phosphonate/organophosphate ester ABC
SDY_4113  0  37  8.509211  phosphonate/organophosphate ester ABC
SDY_4114  -1  36  8.365560  phosphonate metabolism transcriptional regulator
SDY_4116  0  38  8.454339  carbon-phosphorus lyase complex subunit
SDY_4117  0  37  8.200807  phosphonate metabolism protein
SDY_4118  0  37  7.774436  phosphonate metabolism protein
SDY_4119  0  36  7.970349  phosphonate C-P lyase system protein PhnK
SDY_4120  1  34  7.467593  phosphonate ABC transporter ATP-binding protein
SDY_4121  2  28  5.003695  phosphonate metabolism protein
SDY_4122  2  21  4.293498  ribose 1,5-bisphosphokinase
SDY_4123  1  21  4.013829  aminoalkylphosphonic acid N-acetyltransferase
SDY_4124  1  21  3.833210  carbon-phosphorus lyase complex accessory
SDY_4125  1  19  3.123921  hypothetical protein
SDY_4126  1  18  3.029343  hypothetical protein
SDY_4127  2  19  4.061427  histidine kinase
SDY_4128  0  18  3.277933  ribose ABC transporter ATP-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SDY_4111  PF05272  29  0.020  Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.3 bits (65), Expect = 0.020
Identities = 12/22 (54%), Positives = 13/22 (59%)

Query: 32 MVALLGPSGSGKSTLLRHLSGL 53
V L G G GKSTL+ L GL
Sbjct: 598 SVVLEGTGGIGKSTLINTLVGL 619


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SDY_4120  PF05272  29  0.014  Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.3 bits (65), Expect = 0.014
Identities = 17/70 (24%), Positives = 25/70 (35%), Gaps = 8/70 (11%)

Query: 36 CVVLHGHSGSGKSTLLRSLYANYLPDEGQIQIKHGDEWVDLVTAPARKVVEI------RK 89
VVL G G GKSTL+ +L + I G + + + E+ R+
Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQIAGIV--AYELSEMTAFRR 655

Query: 90 TTVGWVSQFL 99
V F
Sbjct: 656 ADAEAVKAFF 665


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SDY_4123  SACTRNSFRASE  33  3e-04  Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 32.6 bits (74), Expect = 3e-04
Identities = 20/84 (23%), Positives = 32/84 (38%), Gaps = 5/84 (5%)

Query: 50 HLALLDGEVVGMIGLHLQFHLHHVNWIGEIQELVVMPQARGLNVGSKLLAWAEEEARQAG 109
L L+ +G I + + N I+++ V R VG+ LL A E A++
Sbjct: 68 FLYYLENNCIGRIKIRSNW-----NGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENH 122

Query: 110 AEMTELSTNVKRHDAHRFYLREGY 133
L T A FY + +
Sbjct: 123 FCGLMLETQDINISACHFYAKHHF 146


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SDY_4125  RTXTOXIND  26  0.032  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 25.9 bits (57), Expect = 0.032
Identities = 18/107 (16%), Positives = 40/107 (37%), Gaps = 8/107 (7%)

Query: 11 TLLTLTTVPAQADIIDDTIGNIQ--------QAINDAYNPDHGRDYEDSRDDGWQREVSD 62
LL LT + A+AD + +Q Q ++ + + + + + +Q +
Sbjct: 123 VLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEE 182

Query: 63 DRRRQYDDRRRQFEDRRRQLDDRQHQLDQERRQLEDEERRMEDEYGQ 109
+ R + QF + Q ++ LD++R + R+
Sbjct: 183 EVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENL 229


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SDY_4127  HTHFIS  58  6e-11  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 58.3 bits (141), Expect = 6e-11
Identities = 21/81 (25%), Positives = 44/81 (54%), Gaps = 2/81 (2%)

Query: 640 VLVLEDEAAVRQTICEQLHLLGYLTLEASSGEQALDLLAASAEIDIFISDLMLPGGMSGA 699
+LV +D+AA+R + + L GY S+ +AA + D+ ++D+++P +
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAA-GDGDLVVTDVVMP-DENAF 63

Query: 700 EVVNAARKLYPHLTLLLISGQ 720
+++ +K P L +L++S Q
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQ 84


51  SDY_4204  SDY_4221  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
SDY_4204  2  22  4.220079  low affinity tryptophan permease
SDY_4205  3  24  5.586285  insertion element IS2 transposase InsD
SDY_4207  1  19  5.412337  insertion sequence element IS2 repressor TnpA
SDY_4781  0  17  5.333328  insertion element iso-IS1N protein InsA
SDY_4209  0  17  5.333328  insertion element iso-IS1d protein InsB
SDY_4210  0  17  5.410861  rhs element protein RhsA
SDY_4211  0  15  2.406943  glutathione S-transferase
SDY_4212  -1  14  2.263500  selenocysteine synthase
SDY_4213  -1  12  2.575515  selenocysteinyl-tRNA-specific translation
SDY_4782  1  17  1.411041  insertion element iso-IS1N protein InsA
SDY_4215  2  15  2.299849  insertion element IS1 protein InsA
SDY_4216  1  17  2.173177  insertion element iso-IS1d protein InsB
SDY_4218  0  15  1.889633  phosphate-starvation-inducible protein PsiE
SDY_4220  2  21  3.649619  hypothetical protein
SDY_4221  2  18  1.672081  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SDY_4213  TCRTETOQM  58  5e-11  Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family

signature.
Length = 639

Score = 58.3 bits (141), Expect = 5e-11
Identities = 44/147 (29%), Positives = 69/147 (46%), Gaps = 18/147 (12%)

Query: 3 IATAGHVDHGKTTLLQAI---TGV------------NADRLPEEKKRGMTIDLGYAYWPQ 47
I HVD GKTTL +++ +G D E++RG+TI G +
Sbjct: 6 IGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITSFQW 65

Query: 48 PDGRVPGFIDVPGHEKFLSNMLAGVGGIDHALLVVACDDGVMAQTREHLAILQLTGNPML 107
+ +V ID PGH FL+ + + +D A+L+++ DGV AQTR L+ G P +
Sbjct: 66 ENTKV-NIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGIPTI 124

Query: 108 TVALTKADRVDEARVDEVERQVKEVLR 134
+ K D+ + V + +KE L
Sbjct: 125 -FFINKIDQNG-IDLSTVYQDIKEKLS 149


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SDY_4220  CHANLCOLICIN  30  0.006  Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 30.4 bits (68), Expect = 0.006
Identities = 21/95 (22%), Positives = 38/95 (40%), Gaps = 3/95 (3%)

Query: 20 AAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPRLANSWWPGAVISEELATAAALRQQQALL 79
A + + + LT + L D+V + N+ + A AA++ + L
Sbjct: 73 AKAAAEAQAKAKANRDALT--QRLKDIVNEALRHNASRTPSATELAHANNAAMQAEDERL 130

Query: 80 TRLAEQGADSSADDAAAINALRQQIQVLKVTGRQK 114
RLA+ + + AA A ++ Q K R+K
Sbjct: 131 -RLAKAEEKARKEAEAAEKAFQEAEQRRKEIEREK 164


52  SDY_4287  SDY_4297  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
SDY_4287  1  17  3.124535  hypothetical protein
SDY_4288  0  20  4.870310  oxidoreductase
SDY_4292  0  21  5.143530  *insertion sequence element IS911 transposase
SDY_4293  0  21  5.507353  iron-dicitrate ABC transporter ATP-binding
SDY_4294  1  24  5.585329  iron-dicitrate transporter subunit FecD
SDY_4295  1  25  4.846063  iron-dicitrate ABC transporter permease
SDY_4297  -1  23  3.254177  citrate-dependent iron transport, outer membrane
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SDY_4297  ECOLNEIPORIN  33  0.004  E.coli/Neisseria porin superfamily signature.
		>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature.

Length = 331

Score = 32.9 bits (75), Expect = 0.004
Identities = 19/89 (21%), Positives = 29/89 (32%), Gaps = 9/89 (10%)

Query: 546 GSFGTVQYSQIGKAVQSGNVEPEKARTWELGTRYDDGALTAEMGLFLINFNNQYDSNQTN 605
G F + NV EK + L + YD+ AL A + Q D+
Sbjct: 187 GFFVQYGGAYKRHHQVQENVNIEKYQIHRLVSGYDNDALYASV------AVQQQDAKLVE 240

Query: 606 DTVTARGKTRHTGLETQARYDLGTLTPTL 634
+ T + Y G +TP +
Sbjct: 241 E---NYSHNSQTEVAATLAYRFGNVTPRV 266


53  SDY_4330  SDY_4342  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
SDY_4330   2       19       -0.430747  homoserine O-succinyltransferase
SDY_4331   2       23       0.058344   insertion sequence element IS1 transposase
SDY_4332   3       22       0.049403   insertion element IS1 protein InsA
SDY_4333   2       22       0.049403   hypothetical protein
SDY_4334   2       22       0.035759   hypothetical protein
SDY_4335   1       18       -0.826725  insertion sequence element IS4 transposase InsG
SDY_4792   0       17       -2.611682  insertion element iso-IS1N protein InsA
SDY_4337   0       18       -2.758175  insertion element iso-IS1n protein InsB
SDY_4338   0       19       -3.509435  insertion element IS1 protein InsA
SDY_4339   0       16       -2.104280  insertion element iso-IS1d protein InsB
SDY_4340   -1      17       -0.355758  hypothetical protein
SDY_4341   1       20       0.321661   insertion sequence element IS600 transposase
SDY_4342   2       23       0.576472   insertion sequence element IS600 integrase core
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits           Score  E-value  Comments
SDY_4333   SACTRNSFRASE   31     0.001    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 30.7 bits (69), Expect = 0.001
Identities = 15/54 (27%), Positives = 20/54 (37%), Gaps = 5/54 (9%)

Query: 78 IDPDVCGCGVGRMLVEHALSMAPE-----LTTNVNEQNEQAVGFYKKVGFKVTG 126
+ D GVG L+ A+ A E L + N A FY K F +
Sbjct: 97 VAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFIIGA 150


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits           Score  E-value  Comments
SDY_4334   SHAPEPROTEIN   31     7e-04    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 31.3 bits (71), Expect = 7e-04
Identities = 23/62 (37%), Positives = 32/62 (51%), Gaps = 9/62 (14%)

Query: 37 IANFFVAEKVLQDLVLQLHPRSTWHSFLPAKRMDIVVSALEMNEGGLSQVEERILHEVVA 96
IA+FFV EK+LQ + Q+H S P+ R+ + V G +QVE R + E
Sbjct: 81 IADFFVTEKMLQHFIKQVHSNSF---MRPSPRVLVCVPV------GATQVERRAIRESAQ 131

Query: 97 GA 98
GA
Sbjct: 132 GA 133


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits           Score  E-value  Comments
SDY_4792   ACRIFLAVINRP   27     0.006    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 27.5 bits (61), Expect = 0.006
Identities = 12/40 (30%), Positives = 20/40 (50%), Gaps = 6/40 (15%)

Query: 50 GIKELLTEM-AFNGAGV-----RDTARTLKIGINTVIRTL 83
IK L E+ F G+ DT +++ I+ V++TL
Sbjct: 305 AIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTL 344


54  SDY_4798  SDY_4413  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
SDY_4798   2       16       1.888041   insertion element iso-IS1N protein InsA
SDY_4406   1       14       1.564654   insertion element iso-IS1n protein InsB
SDY_4407   3       19       1.746306   insertion element IS1 protein InsA
SDY_4408   3       19       1.676793   insertion element iso-IS1d protein InsB
SDY_4409   5       22       1.450073   23S rRNA (guanosine-2'-O-)-methyltransferase
SDY_4410   4       22       1.720331   exoribonuclease R
SDY_4411   4       21       1.251812   transcriptional repressor NsrR
SDY_4412   4       23       1.377470   adenylosuccinate synthetase
SDY_4413   2       17       1.177719   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
SDY_4410   RTXTOXIND   31     0.028    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 30.6 bits (69), Expect = 0.028
Identities = 12/55 (21%), Positives = 24/55 (43%), Gaps = 1/55 (1%)

Query: 165 VVPDDSRLSFDILIPPDQIMGARMGFVVVVELTQRPTRRTKAV-GKIVEVLGDNM 218
+VP+D L L+ I +G ++++ P R + GK+ + D +
Sbjct: 359 IVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAI 413


55  SDY_4469  SDY_4495  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
SDY_4469   2       25       -1.017379  insertion sequence element IS600 integrase core
SDY_4470   1       22       -1.077400  insertion sequence element IS911 integrase core
SDY_4805   -1      21       -4.236328  *insertion element iso-IS1N protein InsA
SDY_4473   0       28       -8.783845  insertion element IS1 protein InsB
SDY_4474   -1      23       -6.035096  transposase
SDY_4806   0       23       -5.707566  insertion element iso-IS1N protein InsA
SDY_4476   0       24       -5.332503  insertion element iso-IS1n protein InsB
SDY_4477   0       24       -5.055637  DNA-binding transcriptional activator EvgA
SDY_4478   0       23       -4.142969  hybrid sensory histidine kinase in two-component
SDY_4479   2       28       4.696510   insertion sequence element ISSfl4 transposase
SDY_4480   2       31       2.370553   insertion sequence element ISSfl4 ORF2
SDY_4481   2       30       0.982507   insertion sequence element ISSfl4 ORF2
SDY_4482   2       29       0.034166   hypothetical protein
SDY_4483   1       25       -0.730714  insertion element IS1 protein InsA
SDY_4484   2       22       -2.273629  insertion element iso-IS1d protein InsB
SDY_4485   2       22       -2.615336  insertion sequence element IS600 integrase core
SDY_4486   1       21       -3.162664  hypothetical protein
SDY_4807   1       22       -2.473718  insertion element iso-IS1N protein InsA
SDY_4487   1       22       -2.828252  insertion element iso-IS1n protein InsB
SDY_4488   2       22       -2.951167  hypothetical protein
SDY_4489   1       25       -2.672342  insertion element iso-IS1n protein InsB
SDY_4808   1       27       -2.755229  insertion element iso-IS1N protein InsA
SDY_4490   1       25       -2.505467  insertion sequence element IS600 integrase core
SDY_4491   1       22       -3.420772  insertion sequence element IS600 transposase
SDY_4492   0       19       -3.503242  hypothetical protein
SDY_4495   -1      19       -3.183893  insertion sequence element IS911 transposase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
SDY_4477   HTHFIS   49     3e-09    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 49.1 bits (117), Expect = 3e-09
Identities = 22/148 (14%), Positives = 53/148 (35%), Gaps = 31/148 (20%)

Query: 4 IIIDDHPLAIAAIRNLLIKNDIEILAELTEGGSAVQRVETLKPDIVIIDVDIPGVNGIQV 63
++ DD + L + ++ + + + + D+V+ DV +P N +
Sbjct: 7 LVADDDAAIRTVLNQALSRAGYDVRIT-SNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 64 LETLRKRQYSGIIIIVSAKNDHFYGKHCADAGANGFVSKKEGMNNIIAAIEAAKNGYCYF 123
L ++K + ++++SA+N + AI+A++ G +
Sbjct: 66 LPRIKKARPDLPVLVMSAQNT------------------------FMTAIKASEKGAYDY 101

Query: 124 ---PFSLNRFVGSLTSDQQKLDSLSKQE 148
PF L + + L ++
Sbjct: 102 LPKPFDLTE---LIGIIGRALAEPKRRP 126


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
SDY_4478   HTHFIS   79     2e-17    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 79.5 bits (196), Expect = 2e-17
Identities = 30/105 (28%), Positives = 51/105 (48%)

Query: 960 SILIADDHPTNRLLLKRQLNLLGYDVDEATDGVQALHKVSMQHYDLLITDVNMPNMDGFE 1019
+IL+ADD R +L + L+ GYDV ++ ++ DL++TDV MP+ + F+
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 1020 LTRKLREQNSSLPIWGLTANAQANEREKGLNCGMNLCLFKPLTLD 1064
L ++++ LP+ ++A K G L KP L
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLT 109


56  SDY_4581  SDY_4591  Y  N  Y  Genomic Island
LocusTag   DNBias  CDNBias  %GCBias    Product
SDY_4581   4       25       1.289838   insertion element iso-IS1d protein InsA
SDY_4582   4       26       1.948333   insertion element iso-IS1d protein InsB
SDY_4584   5       23       2.033072   insertion sequence element IS2 repressor TnpA
SDY_4585   6       25       2.324645   insertion element IS2 transposase InsD
SDY_4586   8       26       1.588388   insertion sequence element IS600 integrase core
SDY_4589   9       28       2.083549   hypothetical protein
SDY_4590   7       25       1.681678   DNA repair protein
SDY_4591   3       20       1.828154   hypothetical protein
57  SDY_0287  SDY_0303  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
SDY_0287   1       17       -0.991292  hypothetical protein
SDY_0288a  1       18       -0.698934  insertion element iso-IS1N protein InsA
SDY_0290   1       18       -0.747489  hypothetical protein
SDY_0291   3       24       -0.958783  peptidyl-prolyl cis-trans isomerase
SDY_0292   4       24       -1.315777  transcriptional regulator HU subunit beta
SDY_0293   3       23       -1.288132  DNA-binding ATP-dependent protease La
SDY_0294   0       20       -0.699067  ATP-dependent protease ATP-binding subunit ClpX
SDY_0295   -1      17       -0.569170  ATP-dependent Clp protease proteolytic subunit
SDY_0296   0       18       -0.458967  trigger factor
SDY_0298   0       9        0.076534   transcriptional regulator BolA
SDY_0299   1       10       0.027730   hypothetical protein
SDY_0300   0       11       -0.001727  muropeptide transporter
SDY_0301   0       11       -0.119906  insertion element iso-IS1n protein InsB
SDY_0303a  -1      14       -0.239310  insertion element iso-IS1N protein InsA
SDY_0303   -1      13       -0.049024  transport protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits      Score  E-value  Comments
SDY_0287   PF08280   28     0.018    M protein trans-acting positive regulator
		>PF08280#M protein trans-acting positive regulator

Length = 530

Score = 27.5 bits (61), Expect = 0.018
Identities = 24/138 (17%), Positives = 41/138 (29%), Gaps = 20/138 (14%)

Query: 1 MQTQIKVRGYHLDVYQHVNNARYL-------EFLEEARWDGLENSDSFHWMTAH------ 47
+Q I + Y N Y E++ + N FH +
Sbjct: 361 LQHFIPETNLFVSPYYKGNQKLYTSLKLIVEEWMAKLPGKRYLNHKHFHLFCHYVEQILR 420

Query: 48 ------NIAFVVVN-ININYRRPAVLSDLLTITSQLQQLNGKSGILSQVITLEPEGQVVA 100
+ FV N IN + + + + Q+ L+P+ +
Sbjct: 421 NIQPPLVVVFVASNFINAHLLTDSFPRYFSDKSIDFHSYYLLQDNVYQIPDLKPDLVITH 480

Query: 101 DALITFVCIDLKTQKALA 118
LI FV +L A+A
Sbjct: 481 SQLIPFVHHELTKGIAVA 498


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits           Score  E-value  Comments
SDY_0292   DNABINDINGHU   117    3e-38    Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 117 bits (294), Expect = 3e-38
Identities = 49/88 (55%), Positives = 67/88 (76%)

Query: 2 NKSQLIDKIAAGADISKAAAGRALDAIIASVTESLKEGDDVALVGFGTFAVKERAARTGR 61
NK LI K+A +++K + A+DA+ ++V+ L +G+ V L+GFG F V+ERAAR GR
Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62

Query: 62 NPQTGKEITIAAAKVPSFRAGKALKDAV 89
NPQTG+EI I A+KVP+F+AGKALKDAV
Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAV 90


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
SDY_0293   GPOSANCHOR   35     0.002    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 34.7 bits (79), Expect = 0.002
Identities = 34/133 (25%), Positives = 68/133 (51%), Gaps = 15/133 (11%)

Query: 191 ERLEYLMAMMESEIDLLQVEKRIRNRVKKQMEKSQREYYLNEQMKAIQKELGEMDDAPD- 249
LE A +E + +L R +++ ++ S+ +Q++A ++L E + +
Sbjct: 291 AALEAEKADLEHQSQVLNAN---RQSLRRDLDASREAK---KQLEAEHQKLEEQNKISEA 344

Query: 250 ENEALKRKIDAAKMPKEAKEKAEAELQKLKMMSPMS-AEATVVRGYIDWMVQVPWNARSK 308
++L+R +DA++ EAK++ EAE QKL+ + +S A +R +D + A+ +
Sbjct: 345 SRQSLRRDLDASR---EAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASRE----AKKQ 397

Query: 309 VKKDLRQAQEILD 321
V+K L +A L
Sbjct: 398 VEKALEEANSKLA 410


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
SDY_0294   HTHFIS   29     0.043    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 29.0 bits (65), Expect = 0.043
Identities = 16/73 (21%), Positives = 29/73 (39%), Gaps = 13/73 (17%)

Query: 60 ERSALPTPHEIRNHLDDYVIGQEQAKKVLAVAVYNHYKRLRNGDTSNGVELGKSNILLIG 119
E P+ E + ++G+ A + +Y RL D +++ G
Sbjct: 121 EPKRRPSKLEDDSQDGMPLVGRSAAMQ----EIYRVLARLMQTD---------LTLMITG 167

Query: 120 PTGSGKTLLAETL 132
+G+GK L+A L
Sbjct: 168 ESGTGKELVARAL 180


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits      Score  E-value  Comments
SDY_0299   PF06291   27     0.024    Lambda prophage Bor protein
		>PF06291#Lambda prophage Bor protein

Length = 102

Score = 26.9 bits (59), Expect = 0.024
Identities = 11/34 (32%), Positives = 18/34 (52%)

Query: 3 KKILFPLVALFMLAGCAKPPTTIEVSPTITLPQQ 36
KK+LF ++ GCA+ T+ PT P++
Sbjct: 7 KKMLFSAALAMLITGCAQQTFTVGNKPTAVTPKE 40


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits      Score  E-value  Comments
SDY_0300   TCRTETA   39     3e-05    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 39.0 bits (91), Expect = 3e-05
Identities = 71/347 (20%), Positives = 135/347 (38%), Gaps = 20/347 (5%)

Query: 62 KFLWSPLMDRYTPPFFGRRRGWLLATQILLLVAIAAMGFLEPGTQLRWMAALAVVIAFCS 121
+F +P++ + F RR LL + V A M W+ + ++A +
Sbjct: 56 QFACAPVLGALSDRF--GRRPVLLVSLAGAAVDYAIMAT----APFLWVLYIGRIVAGIT 109

Query: 122 ASQDIVFDAWKTDVLPAEERGAGAAISVLGYRLGMLVSGGLALWLADKWLGWQGMYWLMA 181
+ V A+ D+ +ER + GM+ L + ++ A
Sbjct: 110 GATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGG--FSPHAPFFAAA 167

Query: 182 AL-LIPCIIATLLAPEP--TDTIPVPKTLEQAVVAPLRDFFGRNNAWLILLLIVLYKLGD 238
AL + + L PE + P+ + + + A L+ + ++ +G
Sbjct: 168 ALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQ 227

Query: 239 AFAMSLTTTFLIRGVGFDAGEVGVVNKTLGLLATIVGALYGGILMQRLSLFRALLIFGIL 298
A +L F +DA +G+ G+L ++ A+ G + RL RAL+ G++
Sbjct: 228 VPA-ALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALM-LGMI 285

Query: 299 QGASNAGYWLLSITDKHLYSMGAAVFFENLCGGMGTSAFVALLMTLCNKSFSATQFALLS 358
A GY LL+ + + V GG+G A A+L ++ L+
Sbjct: 286 --ADGTGYILLAFATRGWMAFPIMVLL--ASGGIGMPALQAMLSRQVDEERQGQLQGSLA 341

Query: 359 ALSAVGRVYVGPVAGWFVEAHGWSTF--YLFSVAAAVPGLILLLVCR 403
AL+++ + VGP+ + A +T+ + + AA+ L L + R
Sbjct: 342 ALTSLTSI-VGPLLFTAIYAASITTWNGWAWIAGAALYLLCLPALRR 387


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits           Score  E-value  Comments
SDY_0303a  ACRIFLAVINRP   27     0.007    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 27.5 bits (61), Expect = 0.007
Identities = 12/40 (30%), Positives = 20/40 (50%), Gaps = 6/40 (15%)

Query: 50 GIKELLTEM-AFNGAGV-----RDTARTLKIGINTVIRTL 83
IK L E+ F G+ DT +++ I+ V++TL
Sbjct: 305 AIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTL 344


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits      Score  E-value  Comments
SDY_0303   TCRTETA   83     3e-19    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 82.6 bits (204), Expect = 3e-19
Identities = 77/362 (21%), Positives = 139/362 (38%), Gaps = 29/362 (8%)

Query: 18 GLGTVFSLRMLGMFMVLPVLTTY--GMALQGASEALIGIAIGIYGLTQAVFQIPFGLLSD 75
L TV L +G+ +++PVL + A GI + +Y L Q G LSD
Sbjct: 10 ILSTVA-LDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSD 68

Query: 76 RIGRKPLIVGGLAVFAAGSVIAALSDSIWGIILGRALQG-SGAIAAAVMALLSDLTREQN 134
R GR+P+++ LA A I A + +W + +GR + G +GA A A ++D+T
Sbjct: 69 RFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADITDGDE 128

Query: 135 RTKAMAFIGVSFGITFAIAMVLGPIITHKLG---LHALFWMIAILATTGIALTIWVVPNS 191
R + F+ FG MV GP++ +G HA F+ A L +++P S
Sbjct: 129 RARHFGFMSACFG----FGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPES 184

Query: 192 STHVLNRESGMVKGSFSKVLAEPRLLKLNFGIMCLHILLMSTFVA-LPGQLADAGFPTAE 250
+ + G+ + L+ F+ L GQ+ A +
Sbjct: 185 HKGERRPLRREALNPLAS-------FRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFG 237

Query: 251 HWKVYLATMLIAF--------GSVVPFIIYAEVKRKMKQVFVFCVGLIV-VAEIVLWNAQ 301
+ + I S+ +I V ++ + +G+I +L
Sbjct: 238 EDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFA 297

Query: 302 TQFWQLVVGVQLFFVAFNLMEALLPSLISKESPAGYKGTAMGVYSTSQFLGVAIGGSLGG 361
T+ W + + + + + L +++S++ +G G + L +G L
Sbjct: 298 TRGW-MAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFT 356

Query: 362 WI 363
I
Sbjct: 357 AI 358


58  SDY_0449  SDY_0457  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
SDY_0449   3       14       1.572321   DNA polymerase III subunits gamma and tau
SDY_0450   1       13       -0.655511  adenine phosphoribosyltransferase
SDY_0451   1       11       -0.479273  hypothetical protein
SDY_0452   0       14       0.138339   primosomal replication protein N''
SDY_0453   1       13       -0.689974  hypothetical protein
SDY_0454   0       14       -0.901188  potassium efflux protein KefA
SDY_0455   1       14       -1.365642  DNA-binding transcriptional repressor AcrR
SDY_0456   2       15       -1.092784  acridine efflux pump
SDY_0457   1       15       -1.630472  acridine efflux pump
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SDY_0449   IGASERPTASE   40     4e-05    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 39.7 bits (92), Expect = 4e-05
Identities = 40/251 (15%), Positives = 77/251 (30%), Gaps = 31/251 (12%)

Query: 404 PLPETTSQVLAARQQLQRVQGATKAKKSEPAA----ATCARPVNNAALERLASVTDRVQA 459
P E +Q + + + P+ AR + A + A T
Sbjct: 983 PEVEKRNQTVDTTN----ITTPNNIQADVPSVPSNNEEIARV-DEAPVPPPAPATPSETT 1037

Query: 460 RPVPSALEKAPAKKEAYRWKATTPVMQQKE--------VVATPKALKKA---LEHEKTPE 508
V ++ E AT Q +E V A + + A E ++T
Sbjct: 1038 ETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQT 1097

Query: 509 LAAKLAA---------EAIERDPWAAQVSQLSLPKLVEQVALNAWKE-ERDNAVCLRLRS 558
K A E+ +V+ PK + + E R+N + ++
Sbjct: 1098 TETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKE 1157

Query: 559 SQRHLNNRGAQQKLAEALS-MLKGSTVELTIVEDDNPAVRTPLEWRQSIYEEKLAQARES 617
Q N ++ A+ S ++ E T V N V P + + + +
Sbjct: 1158 PQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSN 1217

Query: 618 IIADNNIQTLR 628
+ + +++R
Sbjct: 1218 KPKNRHRRSVR 1228


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
SDY_0454   RTXTOXIND   32     0.015    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 31.7 bits (72), Expect = 0.015
Identities = 19/125 (15%), Positives = 40/125 (32%), Gaps = 6/125 (4%)

Query: 28 QNTAFARASSNGDLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDKIDRVKEE 87
N RA L + + L L+ + A L++ ++ E
Sbjct: 207 LNLDKKRAERLTVLARINRYENLSRVEKSR--LDDFSSLLHKQAIAKHAVLEQENKYVEA 264

Query: 88 TVQLRQKVAEAPEKMRQATAALTALSDVDND--EETRKIL--STLSLRQLETRVAQALDD 143
+LR ++ + + +A V E L +T ++ L +A+ +
Sbjct: 265 VNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEER 324

Query: 144 LQNAQ 148
Q +
Sbjct: 325 QQASV 329


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits      Score  E-value  Comments
SDY_0455   HTHTETR   222    5e-76    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 222 bits (567), Expect = 5e-76
Identities = 215/215 (100%), Positives = 215/215 (100%)

Query: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60
MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS
Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60

Query: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120
EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV
Sbjct: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120

Query: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLF 180
GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLF
Sbjct: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLF 180

Query: 181 APQSFDLKKEARDYVAILLEMYLLCPTLRNPATNE 215
APQSFDLKKEARDYVAILLEMYLLCPTLRNPATNE
Sbjct: 181 APQSFDLKKEARDYVAILLEMYLLCPTLRNPATNE 215


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
SDY_0456   RTXTOXIND   44     7e-07    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 44.0 bits (104), Expect = 7e-07
Identities = 34/209 (16%), Positives = 73/209 (34%), Gaps = 17/209 (8%)

Query: 100 TYQATYDSAKGDLAKAQAAANIAQLTVNRYQKLLGTQYISKQEYDQALADAQQANAAVTA 159
+ Y A +L + + Q+ Q +++ ++ L +Q +
Sbjct: 256 EQENKYVEAVNELR--VYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGL 313

Query: 160 AKAAVETARINLAYTKVTSPISGRIGKSNV-TEGALVQNGQATALATVQQLDPIYVDVTQ 218
+ + + +P+S ++ + V TEG +V + T + V + D + V
Sbjct: 314 LTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAE-TLMVIVPEDDTLEVTALV 372

Query: 219 SSNDFLRLKQELANGML-----KQENGK--AKVSLITSDGIKFPQDGTLEFSDVTVDQTT 271
+ D + + G KV I D I+ + G + +++++
Sbjct: 373 QNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNVIISIEENC 432

Query: 272 GSITLRAIFPNPDHTLLPGMFVRARLEEG 300
S + I L GM V A ++ G
Sbjct: 433 LSTGNKNIP------LSSGMAVTAEIKTG 455



Score = 34.4 bits (79), Expect = 8e-04
Identities = 24/125 (19%), Positives = 43/125 (34%), Gaps = 13/125 (10%)

Query: 49 PLQITTELPGR-TSAYRIAEVRPQVSGIILKRNFKEGSDIEAGVSLYQIDPATYQATYDS 107
++I G+ T + R E++P + I+ + KEG + G L ++ +A
Sbjct: 79 QVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEA---- 134

Query: 108 AKGDLAKAQAAANIAQLTVNRYQKLLGTQYISKQEYDQALADAQQANAAVTAAKAAVETA 167
D K Q++ A+L RYQ L E ++
Sbjct: 135 ---DTLKTQSSLLQARLEQTRYQILS-----RSIELNKLPELKLPDEPYFQNVSEEEVLR 186

Query: 168 RINLA 172
+L
Sbjct: 187 LTSLI 191


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits           Score  E-value  Comments
SDY_0457   ACRIFLAVINRP   1365   0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 1365 bits (3534), Expect = 0.0
Identities = 798/1033 (77%), Positives = 913/1033 (88%), Gaps = 1/1033 (0%)

Query: 1 MPNFFIDRPIFAWVIAIIIMLAGGLAILKLPVAQYPTIAPPAVTISASYPGADAKTVQDT 60
M NFFI RPIFAWV+AII+M+AG LAIL+LPVAQYPTIAPPAV++SA+YPGADA+TVQDT
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 61 VTQVIEQNMNGIDNLMYMSSNSDSTGTVQITLTFESGTDADIAQVQVQNKLQLAMPLLPQ 120
VTQVIEQNMNGIDNLMYMSS SDS G+V ITLTF+SGTD DIAQVQVQNKLQLA PLLPQ
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 121 EVQQQGVSVEKSSSSFLMVVGVINTDGTMTQEDISDYVATNMKDAISRTSGVGDVQLFGS 180
EVQQQG+SVEKSSSS+LMV G ++ + TQ+DISDYVA+N+KD +SR +GVGDVQLFG+
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 181 QYAMRIWMNPNELNKFQLTPVDVITAIKAQNAQVAAGQLGGTPPVKGQQLNASIIAQTRL 240
QYAMRIW++ + LNK++LTPVDVI +K QN Q+AAGQLGGTP + GQQLNASIIAQTR
Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240

Query: 241 TSTEEFGKILLKVNQDGSRVLLRDVAKIELGGENYDIIAEFNGQPASGLGIKLATGANAL 300
+ EEFGK+ L+VN DGS V L+DVA++ELGGENY++IA NG+PA+GLGIKLATGANAL
Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300

Query: 301 DTAAAIRAELAKMEPFFPSGLKIVYPYDTTPFVKISIHEVVKTLVEAIILVFLVIYLFLQ 360
DTA AI+A+LA+++PFFP G+K++YPYDTTPFV++SIHEVVKTL EAI+LVFLV+YLFLQ
Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360

Query: 361 NFRATLIPTIAVPVVLLGTFAVLAAFGFSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420
N RATLIPTIAVPVVLLGTFA+LAAFG+SINTLTMFGMVLAIGLLVDDAIVVVENVERVM
Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420

Query: 421 AEEGLPPKEATRKSMGQIQGALVGIAMVLSAVFVPMAFFGGSTGAIYRQFSITIVSAMAL 480
E+ LPPKEAT KSM QIQGALVGIAMVLSAVF+PMAFFGGSTGAIYRQFSITIVSAMAL
Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480

Query: 481 SVLVALILTPALCATMLKPIAKGDHGEGKKGFFGWFNRMFEKSTHHYTDSVGGILRSTGR 540
SVLVALILTPALCAT+LKP++ H E K GFFGWFN F+ S +HYT+SVG IL STGR
Sbjct: 481 SVLVALILTPALCATLLKPVSAE-HHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539

Query: 541 YLVLYLIIVVGMAYLFVRLPSSFLPDEDQGVFMTMVQLPAGATQERTQKVLNEVTHYYLT 600
YL++Y +IV GM LF+RLPSSFLP+EDQGVF+TM+QLPAGATQERTQKVL++VT YYL
Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599

Query: 601 KEKNNVESVFAVNGFGFAGRGQNTGIAFVSLKDWADRPGEENKVEAITMRATRAFSQIKD 660
EK NVESVF VNGF F+G+ QN G+AFVSLK W +R G+EN EA+ RA +I+D
Sbjct: 600 NEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRD 659

Query: 661 AMVFAFNLPAIVELGTATGFDFELIDQAGLGHEKLTQARNQLLAEAAKHSDMLTSVRPNG 720
V FN+PAIVELGTATGFDFELIDQAGLGH+ LTQARNQLL AA+H L SVRPNG
Sbjct: 660 GFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNG 719

Query: 721 LEDTPQFKIDIDQEKAQALGVSINDINTTLGAAWGGSYVNDFIDRGRVKKVYVMSEAKYR 780
LEDT QFK+++DQEKAQALGVS++DIN T+ A GG+YVNDFIDRGRVKK+YV ++AK+R
Sbjct: 720 LEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFR 779

Query: 781 MLPDDIGNWYVRAADGQMVPFSAFSSSRWEYGSPRLERYNGLPSMEILGQAAPGKSTGEA 840
MLP+D+ YVR+A+G+MVPFSAF++S W YGSPRLERYNGLPSMEI G+AAPG S+G+A
Sbjct: 780 MLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDA 839

Query: 841 MELMEQLASKLPTGVGYDWTGMSYQERLSGNQAPSLYAISLIVVFLCLAALHESWSIPFS 900
M LME LASKLP G+GYDWTGMSYQERLSGNQAP+L AIS +VVFLCLAAL+ESWSIP S
Sbjct: 840 MALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVS 899

Query: 901 VMLVVPLGVIGALLAATFRGLTNDVYFQVGLLTTIGLSAKNAILIVEFAKDLMDKEGKGL 960
VMLVVPLG++G LLAAT NDVYF VGLLTTIGLSAKNAILIVEFAKDLM+KEGKG+
Sbjct: 900 VMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGV 959

Query: 961 IEATLDAVRMRLRPILMTSLAFILGVMPLVISTGAGSGAQNAVGTGVMGGMVTATVLAIF 1020
+EATL AVRMRLRPILMTSLAFILGV+PL IS GAGSGAQNAVG GVMGGMV+AT+LAIF
Sbjct: 960 VEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIF 1019

Query: 1021 FVPVFFVVVRHRF 1033
FVPVFFVV+R F
Sbjct: 1020 FVPVFFVVIRRCF 1032


59  SDY_0495a  SDY_0501  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
SDY_0495a  10      31       -1.441748  toxic polypeptide
SDY_0495   9       30       -1.096614  insertion element IS2 transposase InsD
SDY_0496   8       29       -1.374296  insertion sequence element IS2 repressor TnpA
SDY_0497   5       24       -0.963603  hypothetical protein
SDY_0498   0       18       1.714071   hypothetical protein
SDY_0499   -1      18       2.677045   insertion element IS1 protein InsA
SDY_0500   0       19       3.154412   insertion element iso-IS1d protein InsB
SDY_0501   0       18       3.037657   delta-aminolevulinic acid dehydratase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SDY_0495a  HOKGEFTOXIC   56     4e-15    Hok/Gef cell toxic protein family signature.
		>HOKGEFTOXIC#Hok/Gef cell toxic protein family signature.

Length = 52

Score = 55.6 bits (134), Expect = 4e-15
Identities = 17/50 (34%), Positives = 26/50 (52%)

Query: 1 MLAKYALVAVIVLCLTVPGFTLLVGDSLCEFTVKERDIEFRAVLAYEPKK 50
+ + V+++CLT+ FT L SLCE ++ E A +AYE K
Sbjct: 3 LPRSSLVWCVLIVCLTLLIFTYLTRKSLCEIRYRDGYREVAAFMAYESGK 52


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits           Score  E-value  Comments
SDY_0497   PRTACTNFAMLY   30     0.016    Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 30.0 bits (67), Expect = 0.016
Identities = 22/93 (23%), Positives = 37/93 (39%), Gaps = 9/93 (9%)

Query: 214 TINGNGDNDNTASIEAGQNEVDNNGDHVAAATGNYKVRIDNATGAGSIADYNGNELIYVN 273
T+ G+G + G ++ A+G +++ + N+ GS L+
Sbjct: 477 TLAGSGLFRMNVFADLGLSDKLVVMQD---ASGQHRLWVRNS---GSEPASANTLLLVQT 530

Query: 274 DKNSNATFSAVN---KADLGAYTYQAEQRGNTV 303
S ATF+ N K D+G Y Y+ GN
Sbjct: 531 PLGSAATFTLANKDGKVDIGTYRYRLAANGNGQ 563


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits           Score  E-value  Comments
SDY_0498   PRTACTNFAMLY   80     2e-19    Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 79.7 bits (196), Expect = 2e-19
Identities = 53/228 (23%), Positives = 94/228 (41%), Gaps = 8/228 (3%)

Query: 2 VGVDTKIDGNNAKWIVGAAAGFAKGDMN---DRSGQVDQDSQTAYIYSSAHFANNVF-VD 57
+G D + +W +G AG+ +GD D G D Y + + A++ F +D
Sbjct: 677 LGADHAVAVAGGRWHLGGLAGYTRGDRGFTGDGGGHTDSVHVGGY---ATYIADSGFYLD 733

Query: 58 GSLSYSHFNNDLSASMSNGTYVDGSTNSDAWGFGLKAGYDFKLGDAGYVTPYGSISGLFQ 117
+L S ND + S+G V G + G L+AG F D ++ P ++
Sbjct: 734 ATLRASRLENDFKVAGSDGYAVKGKYRTHGVGASLEAGRRFTHADGWFLEPQAELAVFRA 793

Query: 118 SGDDYQLSNDMKVDGQSYDSMRYELGVDAGYTFTYSEDQALTPYFKQAYVYD-DSNNDND 176
G Y+ +N ++V + S+ LG++ G + + + PY K + + + D
Sbjct: 794 GGGAYRAANGLRVRDEGGSSVLGRLGLEVGKRIELAGGRQVQPYIKASVLQEFDGAGTVH 853

Query: 177 VNGDSIDNGTEGSAVRVGLGTQFSFTKNFSAYTDANYLGGGDVDQDWS 224
NG + G+ +GLG + + S Y Y G + W+
Sbjct: 854 TNGIAHRTELRGTRAELGLGMAAALGRGHSLYASYEYSKGPKLAMPWT 901


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits           Score  E-value  Comments
SDY_0501   BINARYTOXINB   30     0.019    Binary toxin B family signature.
		>BINARYTOXINB#Binary toxin B family signature.

Length = 764

Score = 29.7 bits (66), Expect = 0.019
Identities = 19/69 (27%), Positives = 30/69 (43%)

Query: 265 DIVRELRERTELPIGAYQVSGEYAMIKFAALAGAIDEEKVVLESLGSIKRAGADLIFSYF 324
+ EL + +L + QV G A F +D E L I+ A +IF+
Sbjct: 466 NQFLELEKTKQLRLDTDQVYGNIATYNFENGRVRVDTGSNWSEVLPQIQETTARIIFNGK 525

Query: 325 ALDLAEKKI 333
L+L E++I
Sbjct: 526 DLNLVERRI 534


60  SDY_0522  SDY_0527  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
SDY_0522   -1      16       3.882038   enterobactin exporter EntS
SDY_0523   -2      16       3.686980   iron-enterobactin ABC transporter
SDY_0524   -2      18       3.695705   isochorismate hydroxymutase
SDY_0525   -1      19       3.548623   enterobactin synthase subunit E
SDY_0526   0       18       2.006240   2,3-dihydro-2,3-dihydroxybenzoate synthetase
SDY_0527   1       19       1.279363   2,3-dihydroxybenzoate-2,3-dehydrogenase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits      Score  E-value  Comments
SDY_0522   TCRTETA   33     0.002    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 33.3 bits (76), Expect = 0.002
Identities = 81/393 (20%), Positives = 144/393 (36%), Gaps = 38/393 (9%)

Query: 27 FISIVSLGLLGVAVPVQIQMMTHSTWQV---GLSVTLTGGAMFVGLMVGGVLADRYERKK 83
+ V +GL+ +P ++ + HS G+ + L F V G L+DR+ R+
Sbjct: 15 ALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRP 74

Query: 84 VILLARGTCGIGFIGLCLNALL--PEPSLLAIYLLGLWDGFFASLGVTALLAATPALVGR 141
V+L + G ++ + P L +Y+ + G + G A A +
Sbjct: 75 VLL-------VSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVA-GAYIADITDG 126

Query: 142 ENLMQAGAITMLTVRLGSVNSPMIGGLLLAIGGVAWNYGLAAAGTFITLLPLLSLPALPP 201
+ + G V P++GGL+ GG + + AA L L LP
Sbjct: 127 DERARHFGFMSACFGFGMVAGPVLGGLM---GGFSPHAPFFAAAALNGLNFLTGCFLLPE 183

Query: 202 PPQPREHPLK----SLLAGFRFLLASPLVGGIALLGGLLTMAS----AVRVLYPALADNW 253
+ PL+ + LA FR+ +V + + ++ + A+ V++ D +
Sbjct: 184 SHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFG--EDRF 241

Query: 254 QMSAAQIGFLYAAIP-LGAAIGALTSGKLAHSARPGLLMLLSTLGS---FLAIGLFGLMP 309
A IG AA L + A+ +G +A ++L + ++ +
Sbjct: 242 HWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGW 301

Query: 310 MWILGVVCLALFGWLSAVSSLLQYTMLQTQTPEVMLGRINGLWTAQNVTGDAIGAALLGG 369
M +V LA G ML Q E G++ G A +G L
Sbjct: 302 MAFPIMVLLASGGIGMPALQ----AMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTA 357

Query: 370 LGAMMTPVASASASGFGLLIIGVLLLLVLVELR 402
+ A + + +G+ + L LL L LR
Sbjct: 358 IYA----ASITTWNGWAWIAGAALYLLCLPALR 386


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits           Score  E-value  Comments
SDY_0523   FERRIBNDNGPP   63     1e-13    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 63.4 bits (154), Expect = 1e-13
Identities = 45/217 (20%), Positives = 84/217 (38%), Gaps = 17/217 (7%)

Query: 105 EPSAEAVAAQMPDLILISATGGDSALALYDQLSTIAPTLIINYDDKSWQAL-----LTQL 159
EP+ E + P ++ SA G S + L+ IAP N+ D LT++
Sbjct: 86 EPNLELLTEMKPSFMVWSAGYGPS----PEMLARIAPGRGFNFSDGKQPLAMARKSLTEM 141

Query: 160 GEITGHEKQAAERIAQFDKQLAAAKEQIKLPPQPVTAIVYTAAAHSANLWTPESAQGQML 219
++ + A +AQ++ + + K + + ++ P S ++L
Sbjct: 142 ADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDPRHMLVFGPNSLFQEIL 201

Query: 220 EQLGFTPAKLPAGLNASQSQGKRHDIIQLGGENLAAGLNGESLFLFAGDQKDADAIYANP 279
++ G NA Q + + + LAA + + L + KD DA+ A P
Sbjct: 202 DEYGIP--------NAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDHDNSKDMDALMATP 253

Query: 280 LLAHLPAVQNKQVYALGTETFRLDYYSAMQVLDRLNS 316
L +P V+ + + F SAM + L++
Sbjct: 254 LWQAMPFVRAGRFQRVPAVWFYGATLSAMHFVRVLDN 290


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits           Score  E-value  Comments
SDY_0526   ISCHRISMTASE   440    e-159    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 440 bits (1133), Expect = e-159
Identities = 144/299 (48%), Positives = 193/299 (64%), Gaps = 18/299 (6%)

Query: 1 MAIPKLQAYALPESHDITQNKVDWAFEPQRAALLIHDMQDYFVSFWGENCPMMEQVIANI 60
MAIP +Q Y +P + D+ QNKV W +P RA LLIHDMQ+YFV + + ++ ANI
Sbjct: 1 MAIPAIQPYQMPTASDMPQNKVSWVPDPNRAVLLIHDMQNYFVDAFTAGASPVTELSANI 60

Query: 61 AALRDYCKQHNIPVYYTAQPKEQSDEDRALLNDMWGPGLTRSPEQQKVVDRLTPDADDTV 120
L++ C Q IPV YTAQP Q+ +DRALL D WGPGL P ++K++ L P+ DD V
Sbjct: 61 RKLKNQCVQLGIPVVYTAQPGSQNPDDRALLTDFWGPGLNSGPYEEKIITELAPEDDDLV 120

Query: 121 LVKWRYSAFHRSPLEQMLKESGRNQLIITGVYAHIGCMTTATDAFMRDIKPFMVADALAD 180
L KWRYSAF R+ L +M+++ GR+QLIITG+YAHIGC+ TA +AFM DIK F V DA+AD
Sbjct: 121 LTKWRYSAFKRTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVAD 180

Query: 181 FSRDEHLMSLKYVAGRSGRVVMTEELL------PAPIPASKA-----------ALREVIL 223
FS ++H M+L+Y AGR VMT+ LL PA + + A +R+ I
Sbjct: 181 FSLEKHQMALEYAAGRCAFTVMTDSLLDQLQNAPADVQKTSANTGKKNVFTCENIRKQIA 240

Query: 224 PLLDESDEPFDDD-NLIDYGLDSVRMMALAARWRKVHGDIDFVMLGKNPTIDAWWKLLS 281
LL E+ E D +L+D GLDSVR+M L +WR+ ++ FV L + PTI+ W KLL+
Sbjct: 241 ELLQETPEDITDQEDLLDRGLDSVRIMTLVEQWRREGAEVTFVELAERPTIEEWQKLLT 299


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SDY_0527  DHBDHDRGNASE  363    e-130    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 363 bits (934), Expect = e-130
Identities = 110/258 (42%), Positives = 149/258 (57%), Gaps = 20/258 (7%)

Query: 5 GKNVWVTGAGKGIGYATALAFVKAGAKVTGFD---------------QAFTQEQYPFATE 49
GK ++TGA +GIG A A GA + D +A E +P
Sbjct: 8 GKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFP---- 63

Query: 50 VMDVADAAQVAQVCQRLLAETERLDALVNAAGILRMGATDQLSKEDWQQTFAVNVGGAFN 109
DV D+A + ++ R+ E +D LVN AG+LR G LS E+W+ TF+VN G FN
Sbjct: 64 -ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFN 122

Query: 110 LFQQTMNQFRRQRGGAIVTVASDAAHTPRIGMSAYGASKAALKSLALSVGLELAGSGVRC 169
+ +R G+IVTV S+ A PR M+AY +SKAA +GLELA +RC
Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 170 NVVSPGSTDTDMQRTLWVSDDAEEQRIRGFGEQFKLGIPLGKIARPQEIANTILFLASDL 229
N+VSPGST+TDMQ +LW ++ EQ I+G E FK GIPL K+A+P +IA+ +LFL S
Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242

Query: 230 ASHITLQDIVVDGGSTLG 247
A HIT+ ++ VDGG+TLG
Sbjct: 243 AGHITMHNLCVDGGATLG 260


61  SDY_0806  SDY_0811  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
SDY_0806  -1      18       2.831573   ATP-dependent RNA helicase RhlE
SDY_0807  -1      19       2.528014   DNA-binding transcriptional regulator
SDY_0808  -1      18       2.595942   hypothetical protein
SDY_0809  -1      19       2.086556   ATPase
SDY_0810  -2      19       1.352501   hypothetical protein
SDY_0811  -1      18       0.751224   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SDY_0806  SECA  30     0.024    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 29.8 bits (67), Expect = 0.024
Identities = 20/67 (29%), Positives = 34/67 (50%), Gaps = 4/67 (5%)

Query: 246 QQVLVFTRTKHGANHLAEQLNKDGIRSAAIHG-NKSQGARTRALADFKSGDIRVLVATDI 304
Q VLV T + + ++ +L K GI+ ++ + A A A + + V +AT++
Sbjct: 450 QPVLVGTISIEKSELVSNELTKAGIKHNVLNAKFHANEAAIVAQAGYPAA---VTIATNM 506

Query: 305 AARGLDI 311
A RG DI
Sbjct: 507 AGRGTDI 513


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
SDY_0807  HTHTETR  73     6e-18    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 73.1 bits (179), Expect = 6e-18
Identities = 33/214 (15%), Positives = 77/214 (35%), Gaps = 17/214 (7%)

Query: 13 KGEQAKKQLIAAALAQFGEYGMNATT-REIAAQAGQNIAAITYYFGSKEDLYLACAQWIA 71
+ ++ ++ ++ AL F + G+++T+ EIA AG AI ++F K DL+ +
Sbjct: 8 EAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSE 67

Query: 72 DFIGEQFRPHAEEAERLFAQPQPDRAAIRELILRACRNMIKLLTQDDTVNLSKFISREQL 131
IGE E + P + +RE+++ + + + + + F E +
Sbjct: 68 SNIGELEL---EYQAKFPGDP---LSVLREILIHVLESTVTEERRRLLMEII-FHKCEFV 120

Query: 132 SPTAAYHLVHEQVISPLHSHLTRLIAAWTGCDANDTRMILHTHALIGEILAFRLGKETIL 191
A + + + + + +A L T + + G
Sbjct: 121 GEMAVVQQAQRNLCLESYDRIEQTLKH--CIEAKMLPADLMTRRAAIIMRGYISG----- 173

Query: 192 LRTGWTAFDEEKTELINQTVTCHIDLILQGLSQR 225
L W + + + ++ ++L+
Sbjct: 174 LMENWLFAPQSFD--LKKEARDYVAILLEMYLLC 205


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
SDY_0808  RTXTOXIND  62     6e-13    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 62.2 bits (151), Expect = 6e-13
Identities = 43/259 (16%), Positives = 92/259 (35%), Gaps = 25/259 (9%)

Query: 83 ALMQAKAGVSVAQAQYDLMLAGYRDEEIAQAAAAVKQAQAAYDYAQNFYNRQQGLWKSRT 142
Q + + +A+ +LA E + + + + L +
Sbjct: 201 QKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENK 260

Query: 143 ISA--NDLENARSSRDQAQATLKSAQDKLRQYRSGNREQ---DIAQAKASLEQAQAQLAQ 197
N+L +S +Q ++ + SA+++ + + + + Q ++ +LA+
Sbjct: 261 YVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAK 320

Query: 198 AELNLQDSTLIAPSDGTLLTRAV-EPGTVLNEGGTVFT-VSLTRPVWVRAYVDERNLDQA 255
E Q S + AP + V G V+ T+ V + V A V +++
Sbjct: 321 NEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFI 380

Query: 256 QPGRKVLLYTDGRPDKPYH---GQIGFVSPTAEFTPKTVETPDLRTDLVYRLRIVVT--- 309
G+ ++ + P Y G++ ++ A D R LV+ + I +
Sbjct: 381 NVGQNAIIKVEAFPYTRYGYLVGKVKNINLDA--------IEDQRLGLVFNVIISIEENC 432

Query: 310 ----DADDALRQGMPVTVQ 324
+ + L GM VT +
Sbjct: 433 LSTGNKNIPLSSGMAVTAE 451


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
SDY_0809  PF05272  33     0.005    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 32.7 bits (74), Expect = 0.005
Identities = 17/86 (19%), Positives = 25/86 (29%), Gaps = 13/86 (15%)

Query: 298 TPRFEDAFIDLLGGAGTSESPLGAILHTVEGTPGETVIEAKELTKKFGDFAATDHVNFAV 357
PR E + +LG P + + + K +
Sbjct: 547 VPRLEKWLVHVLGKTPDDYKP-------------RRLRYLQLVGKYILMGHVARVMEPGC 593

Query: 358 KRGEILGLLGPNGAGKSTTFKMMCGL 383
K + L G G GKST + GL
Sbjct: 594 KFDYSVVLEGTGGIGKSTLINTLVGL 619



Score = 29.3 bits (65), Expect = 0.047
Identities = 11/23 (47%), Positives = 13/23 (56%)

Query: 39 YVTGLVGPDGAGKTTLMRMLAGL 61
Y L G G GK+TL+ L GL
Sbjct: 597 YSVVLEGTGGIGKSTLINTLVGL 619


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SDY_0811  ABC2TRNSPORT  47     3e-08    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 47.2 bits (112), Expect = 3e-08
Identities = 36/146 (24%), Positives = 63/146 (43%), Gaps = 5/146 (3%)

Query: 197 AREREQGTLDQLLVSPLTTWQIFIGKAVPALIVATFQATIVLAIGIWAYQIPFAGSLALF 256
R Q T + +L + L I +G+ A A IG+ A + + L+L
Sbjct: 92 GRMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGA---GIGVVAAALGYTQWLSLL 148

Query: 257 YFTMVI--YGLSLVGFGLLISSLCSTQQQAFIGVFVFMMPAILLSGYVSPVENMPVWLQN 314
Y VI GL+ G+++++L + + + P + LSG V PV+ +P+ Q
Sbjct: 149 YALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPIVFQT 208

Query: 315 LTWINPIRHFTDITKQIYLKDASLDI 340
P+ H D+ + I L +D+
Sbjct: 209 AARFLPLSHSIDLIRPIMLGHPVVDV 234


62  SDY_1137  SDY_1149  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
SDY_1137  -1      12       -0.355430  phosphogluconate dehydratase
SDY_1138  -1      12       -1.048427  glucose-6-phosphate 1-dehydrogenase
SDY_1141  -1      15       -1.079876  pyruvate kinase
SDY_1142  -1      18       -1.243727  lipid A biosynthesis (KDO)2-(lauroyl)-lipid IVA
SDY_1143  -1      17       -1.095846  hypothetical protein
SDY_1144  -1      20       -2.497719  high-affinity zinc ABC transporter
SDY_1145  -1      20       -1.590220  high-affinity zinc ABC transporter ATPase
SDY_1146  -1      18       -0.991030  high-affinity zinc ABC transporter permease
SDY_1147  -2      14       -0.786645  Holliday junction DNA helicase RuvB
SDY_1148  0       19       0.021501   Holliday junction DNA helicase RuvA
SDY_1149  0       20       -0.116525  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SDY_1137  PHPHTRNFRASE  31     0.022    Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 30.5 bits (69), Expect = 0.022
Identities = 10/36 (27%), Positives = 19/36 (52%)

Query: 532 GGLLAKVRDGDIIRVNGQTGELTLLVDEAELAAREP 567
+ K++ GD++ V+G G + + E E+ A E
Sbjct: 207 KEVTEKIQHGDMVIVDGIEGIVIVNPTEEEVKAYEE 242


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SDY_1142  BCTERIALGSPF  30     0.013    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 29.8 bits (67), Expect = 0.013
Identities = 14/61 (22%), Positives = 20/61 (32%), Gaps = 8/61 (13%)

Query: 22 RHWGAWLGVAAMAGI-----ALTPPKFR---DPILARLGRFAGRLGKSSRRRALINLSLC 73
R +G W+ +A +AG L K R L L + R LS+
Sbjct: 224 RTFGPWMLLALLAGFMAFRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSIL 283

Query: 74 F 74

Sbjct: 284 N 284


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SDY_1144  ADHESNFAMILY  273    3e-93    Adhesin family signature.
		>ADHESNFAMILY#Adhesin family signature.

Length = 309

Score = 273 bits (699), Expect = 3e-93
Identities = 64/304 (21%), Positives = 115/304 (37%), Gaps = 25/304 (8%)

Query: 4 KKTLLFAALSAALWGGATQA---------ADAAVVASLKPVGFIASAIADGVTETEVLLP 54
KK L + A VVA+ + I IA + ++P
Sbjct: 2 KKLGTLLVLFLSAIILVACASGKKDTTSGQKLKVVATNSIIADITKNIAGDKIDLHSIVP 61

Query: 55 DGASEHDYSLRPSDVKRLQNADLVVWVGPEMEAFMQKPVSKLPEAKQVTIAQLEDVKPLL 114
G H+Y P DVK+ ADL+ + G +E +KL E + T E+
Sbjct: 62 IGQDPHEYEPLPEDVKKTSEADLIFYNGINLETGGNAWFTKLVENAKKT----ENKDYFA 117

Query: 115 MKSIHGDDDDHDHAEKSDEDHHHGDFNMHLWLSPEIARATAVAIHGKLVELMPQSRAKLD 174
+ EK ED H WL+ E A I +L P ++ +
Sbjct: 118 VSDGVDVIYLEGQNEKGKEDPH-------AWLNLENGIIFAKNIAKQLSAKDPNNKEFYE 170

Query: 175 ANLKDFEAQLASTETQVGNELA--PFKGKGYFVFHDAYGYFEKQFGLTPLGHFTVNPEIQ 232
NLK++ +L + + ++ P + K A+ YF K +G+ + +N E +
Sbjct: 171 KNLKEYTDKLDKLDKESKDKFNKIPAEKKLIVTSEGAFKYFSKAYGVPSAYIWEINTEEE 230

Query: 233 PGAQRLHEIRTQLVEQKATCVFAEPQFRPAVVESVARGTSVRMGT---LDPLGTNIKLGK 289
+++ + +L + K +F E +++V++ T++ + D + K G
Sbjct: 231 GTPEQIKTLVEKLRQTKVPSLFVESSVDDRPMKTVSQDTNIPIYAQIFTDSIAEQGKEGD 290

Query: 290 TSYS 293
+ YS
Sbjct: 291 SYYS 294


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
SDY_1145  PF05272  30     0.007    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.4 bits (68), Expect = 0.007
Identities = 12/31 (38%), Positives = 18/31 (58%), Gaps = 4/31 (12%)

Query: 27 LKPG----KILTLLGPNGAGKSTLVRVVLGL 53
++PG + L G G GKSTL+ ++GL
Sbjct: 589 MEPGCKFDYSVVLEGTGGIGKSTLINTLVGL 619


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SDY_1149  PilS_PF08805  29     0.007    PilS N terminal
		>PilS_PF08805#PilS N terminal

Length = 185

Score = 29.1 bits (65), Expect = 0.007
Identities = 12/46 (26%), Positives = 18/46 (39%)

Query: 29 AASNCWSNHVGIIIGHNGEDFLVAESRVPLSTITTLSRFIKRSSNQ 74
+A N W V I + F V E+ VP + ++ SS
Sbjct: 110 SAKNPWGGSVTITTSSDKYSFNVVEANVPQKNCMAMVNALRSSSAI 155


63  SDY_2022  SDY_2029  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
SDY_2022  -1      12       -1.584978  DNA-binding transcriptional regulator PhoP
SDY_2023  -1      12       -1.333459  sensor protein PhoQ
SDY_2024  -2      12       -1.345663  hypothetical protein
SDY_2025  -2      13       -1.778850  peptidase T
SDY_2026  -1      12       -1.231496  putrescine/spermidine ABC transporter ATPase
SDY_2027  -1      13       -1.093740  spermidine/putrescine ABC transporter permease
SDY_2028  -1      14       -0.630318  spermidine/putrescine ABC transporter permease
SDY_2029  -1      10       0.103437   spermidine/putrescine ABC transporter
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
SDY_2022  HTHFIS  87     5e-22    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 86.8 bits (215), Expect = 5e-22
Identities = 31/124 (25%), Positives = 62/124 (50%)

Query: 2 RVLVVEDNALLRHHLKVQIQDAGHQVDDAEDAKEADYYLNEHLPDIAIVDLGLPDEDGLS 61
+LV +D+A +R L + AG+ V +A ++ D+ + D+ +PDE+
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 LIRRWRSNDVSLPILVLTARESWQDKVEVLSAGADDYVTKPFHIEEVMARMQALMRRNSG 121
L+ R + LP+LV++A+ ++ ++ GA DY+ KPF + E++ + +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 122 LASQ 125
S+
Sbjct: 125 RPSK 128


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
SDY_2023  PF06580  29     0.046    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 29.1 bits (65), Expect = 0.046
Identities = 11/69 (15%), Positives = 22/69 (31%), Gaps = 20/69 (28%)

Query: 389 NACKYCLE------FVEISARQTDEHLYIVVEDDGPGIPLSKREVIFDRGQRVDTLRPGQ 442
N K+ + + + + + + + VE+ G + +E
Sbjct: 266 NGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKE--------------ST 311

Query: 443 GVGLAVARE 451
G GL RE
Sbjct: 312 GTGLQNVRE 320


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
SDY_2026  PF05272  30     0.016    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.0 bits (67), Expect = 0.016
Identities = 10/36 (27%), Positives = 19/36 (52%), Gaps = 1/36 (2%)

Query: 46 LTLLGPSGCGKTTVLRLIAGLE-TVDSGRIMLDNED 80
+ L G G GK+T++ + GL+ D+ + +D
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKD 634


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SDY_2029  CHLAMIDIAOMP  28     0.044    Chlamydia major outer membrane protein signature.
		>CHLAMIDIAOMP#Chlamydia major outer membrane protein signature.

Length = 393

Score = 28.4 bits (63), Expect = 0.044
Identities = 19/67 (28%), Positives = 28/67 (41%), Gaps = 8/67 (11%)

Query: 137 GVNGDAVDPKSVTSWADL------WKPEYKGSLLLTDDAREVFQMALRKLGYSGNTTDPK 190
G GD DP T+W D + ++ +L D + FQM + +GN T P
Sbjct: 42 GFGGDPCDP--CTTWCDAISMRMGYYGDFVFDRVLKTDVNKEFQMGDKPTSTTGNATAPT 99

Query: 191 EIEAAYN 197
+ A N
Sbjct: 100 TLTAREN 106


64  SDY_2067  SDY_2075  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
SDY_2067  1       13       1.637995   ribonuclease E
SDY_2070  2       12       0.641143   flagellar rod assembly protein/muramidase FlgJ
SDY_2071  3       13       0.315173   flagellar basal body P-ring biosynthesis protein
SDY_2072  3       13       -0.183187  flagellar basal body L-ring protein
SDY_2075  2       14       -0.386542  flagellar hook protein FlgE
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
SDY_2067  IGASERPTASE  66     6e-13    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 66.2 bits (161), Expect = 6e-13
Identities = 49/288 (17%), Positives = 89/288 (30%), Gaps = 36/288 (12%)

Query: 513 PSEEEFTERKRPEQPALATFAMPDVPPAPT-PAEPAAPVVAPAPKAAPATPAAPAQPGLL 571
P E+ + DVP P+ E A AP P APATP+ +
Sbjct: 983 PEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTE---- 1038

Query: 572 SRFFGALKALFSSGEETKPSEQAAPKVEAKPERQQDRRKPRQNNRRDRNERRDTRSER-- 629
+ E +K + K E QN + + + ++
Sbjct: 1039 -----------TVAENSKQESKTVEKNEQDATE-----TTAQNREVAKEAKSNVKANTQT 1082

Query: 630 TEGSDNREENRRNRRQAQQQTAETRESRQQAEVTEKARTTDEQQAPRRERSRRRNDDKRQ 689
E + + E + + ++TA + + TEK + + + + + + Q
Sbjct: 1083 NEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQ 1142

Query: 690 AQ---QEAKALNVEEQSVQETEQEERVRPVQPRRKQRQLNQKVRYEQSV--AEEAVVAPV 744
A+ + +N++E Q + +P + + Q V +V V P
Sbjct: 1143 AEPARENDPTVNIKEPQSQTNTTADTEQPA--KETSSNVEQPVTESTTVNTGNSVVENPE 1200

Query: 745 VEETAAAEPIVQEAPA------PRTELVKVPLPVVAQTAPEQQEENNA 786
A +P V + R + VP V T A
Sbjct: 1201 NTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVA 1248



Score = 38.9 bits (90), Expect = 1e-04
Identities = 48/302 (15%), Positives = 83/302 (27%), Gaps = 35/302 (11%)

Query: 721 KQRQLNQKVRYEQSVAEEAVVAPVVEETAAAEPIVQEAPAPRTELVKVPLPVVAQTAPEQ 780
+ + NQ V A P V + + P+P A P +
Sbjct: 984 EVEKRNQTVDTTN--------ITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSE 1035

Query: 781 QEENNADNRDNGGMPRRSRRSPRHLRVSGQRRRRYRDERYPTQSPMPLTVACASPELASG 840
E A+N + R + + VA + E
Sbjct: 1036 TTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKET 1095

Query: 841 KVWIRYPIVRPQDVQVEEQREQEEVQVQPMVTEIPVAAAVEPVVSAPVVEEVAEVVEPPV 900
+ V+ EE+ + E + Q E+P + +E +E V+P
Sbjct: 1096 Q---TTETKETATVEKEEKAKVETEKTQ----EVPKVTS-----QVSPKQEQSETVQPQ- 1142

Query: 901 QVAEPQPEVVETTHPEVIAAAVTEQPQVITESDVAVAQEVAEHAEPVVEPQEETADIEEV 960
AEP E T + ++PQ T + Q E + V +P E+ +
Sbjct: 1143 --AEPARENDPTVN--------IKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTG 1192

Query: 961 AETAEVVVAEPEVVAQPAAPVVAEVATEVETVTAVKPEITVEHNHVTAPMTRAPAPEYVP 1020
E QP + + +V+ +V T + V
Sbjct: 1193 NSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPH----NVEPATTSSNDRSTVA 1248

Query: 1021 EA 1022

Sbjct: 1249 LC 1250


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
SDY_2070  FLGFLGJ  511    0.0      Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 511 bits (1318), Expect = 0.0
Identities = 312/313 (99%), Positives = 312/313 (99%)

Query: 1 MISDSKLLASAAWDAQSLNELKAKAGEDPAANIRPVARQVEGMFVQMMLKSMRDALPKDG 60
MISDSKLLASAAWDAQSLNELKAKAGEDPAANIRPVARQVEGMFVQMMLKSMRDALPKDG
Sbjct: 1 MISDSKLLASAAWDAQSLNELKAKAGEDPAANIRPVARQVEGMFVQMMLKSMRDALPKDG 60

Query: 61 LFSSEHTRLYTSMYDQQIAQQMTAGKGLGLAEMMVKQMTPEQPLPEEPTPAAPMKFPLET 120
LFSSEHTRLYTSMYDQQIAQQMTAGKGLGLAEMMVKQMTPEQPLPEE TPAAPMKFPLET
Sbjct: 61 LFSSEHTRLYTSMYDQQIAQQMTAGKGLGLAEMMVKQMTPEQPLPEESTPAAPMKFPLET 120

Query: 121 VVRYQNQALSQLVQKAVPRNYDDSLPGDSKAFLAQLSLPAQLASQQSGVPHHLILAQAAL 180
VVRYQNQALSQLVQKAVPRNYDDSLPGDSKAFLAQLSLPAQLASQQSGVPHHLILAQAAL
Sbjct: 121 VVRYQNQALSQLVQKAVPRNYDDSLPGDSKAFLAQLSLPAQLASQQSGVPHHLILAQAAL 180

Query: 181 ESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRVYSSYL 240
ESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRVYSSYL
Sbjct: 181 ESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRVYSSYL 240

Query: 241 EALSDYVGLLTRNPRYAAVTTAASAEQGAQALQDAGYATDPHYARKLTNMIQQMKSISDK 300
EALSDYVGLLTRNPRYAAVTTAASAEQGAQALQDAGYATDPHYARKLTNMIQQMKSISDK
Sbjct: 241 EALSDYVGLLTRNPRYAAVTTAASAEQGAQALQDAGYATDPHYARKLTNMIQQMKSISDK 300

Query: 301 VSKTYSMNIDNLF 313
VSKTYSMNIDNLF
Sbjct: 301 VSKTYSMNIDNLF 313


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SDY_2071  FLGPRINGFLGI  424    e-151    Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 424 bits (1092), Expect = e-151
Identities = 157/363 (43%), Positives = 213/363 (58%), Gaps = 9/363 (2%)

Query: 4 FLSALILLLVTTAAQAERIRDLTSVQGVRQNSLIGYGLVVGLDGTGDQTTQTPFTTQTLN 63
F + L A RI+D+ S+Q R N LIGYGLVVGL GTGD +PFT Q++
Sbjct: 13 FSALPFLSTPPAQADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDSLRSSPFTEQSMR 72

Query: 64 NMLSQLGITVPTGTNMQLKNVAAVMVTASLPPFGRQGQTIDVVVSSMGNAKSLRGGTLLM 123
ML LGIT G + KN+AAVMVTA+LPPF G +DV VSS+G+A SLRGG L+M
Sbjct: 73 AMLQNLGITTQGGQS-NAKNIAAVMVTANLPPFASPGSRVDVTVSSLGDATSLRGGNLIM 131

Query: 124 TPLKGVDSQVYALAQGNILVGGAGASAGGSSVQVNQLNGGRITNGAVIERELPSQFGVGN 183
T L G D Q+YA+AQG ++V G A +++ R+ NGA+IERELPS+F
Sbjct: 132 TSLSGADGQIYAVAQGALIVNGFSAQGDAATLTQGVTTSARVPNGAIIERELPSKFKDSV 191

Query: 184 TLNLQLNDEDFSMAQQIADTINRVR----GYGSATVLDARTIQVRVPSGNSSQVRFLADI 239
L LQL + DFS A ++AD +N G A D++ I V+ P + R +A+I
Sbjct: 192 NLVLQLRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQEIAVQKPRV-ADLTRLMAEI 250

Query: 240 QNMQVNVTPQDAKVVINSRTGSVVMNREVTLDSCAVAQGNLSVTVNRQANVSQPDTPFGG 299
+N+ V T AKVVIN RTG++V+ +V + AV+ G L+V V V QP PF
Sbjct: 251 ENLTVE-TDTPAKVVINERTGTIVIGADVRISRVAVSYGTLTVQVTESPQVIQP-APFSR 308

Query: 300 GQTVVTPQTQIDLRQSGGSLQSVRSSASLNNVVRSLNALGATPMDLMSILQSMQSAGCLR 359
GQT V PQT I Q G + ++ L +V LN++G +++ILQ ++SAG L+
Sbjct: 309 GQTAVQPQTDIMAMQEGSKV-AIVEGPDLRTLVAGLNSIGLKADGIIAILQGIKSAGALQ 367

Query: 360 AKL 362
A+L
Sbjct: 368 AEL 370


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SDY_2072  FLGLRINGFLGH  346    e-124    Flagellar L-ring protein signature.
		>FLGLRINGFLGH#Flagellar L-ring protein signature.

Length = 232

Score = 346 bits (889), Expect = e-124
Identities = 231/232 (99%), Positives = 231/232 (99%)

Query: 1 MQKNAAHTYAISSLLVLSLTGCAWIPSTPLVQGATSAQPVLGPTPVANGSIFQSAQPINY 60
MQKNAAHTYAISSLLVLSLTGCAWIPSTPLVQGATSAQPV GPTPVANGSIFQSAQPINY
Sbjct: 1 MQKNAAHTYAISSLLVLSLTGCAWIPSTPLVQGATSAQPVPGPTPVANGSIFQSAQPINY 60

Query: 61 GYQPLFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYLQGLFGNA 120
GYQPLFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYLQGLFGNA
Sbjct: 61 GYQPLFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYLQGLFGNA 120

Query: 121 RADVEASGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRF 180
RADVEASGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRF
Sbjct: 121 RADVEASGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRF 180

Query: 181 SGVVNPRTISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSPM 232
SGVVNPRTISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSPM
Sbjct: 181 SGVVNPRTISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSPM 232


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
SDY_2075  FLGHOOKAP1  41     5e-06    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 41.5 bits (97), Expect = 5e-06
Identities = 17/49 (34%), Positives = 29/49 (59%)

Query: 354 TLTNGALEASNVDLSKELVNMIVAQRNYQSNAQTIKTQDQILNTLVNLR 402
L+N S V+L +E N+ Q+ Y +NAQ ++T + I + L+N+R
Sbjct: 498 QLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR 546



Score = 37.2 bits (86), Expect = 1e-04
Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 4/56 (7%)

Query: 6 AVSGLNAAATNLDVIGNNIANSATYGFKSGTASFAD----MFAGSKVGLGVKVAGI 57
A+SGLNAA L+ NNI++ G+ T A + AG VG GV V+G+
Sbjct: 7 AMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVSGV 62


65  SDY_2394  SDY_2398  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
SDY_2394  0       15       2.232624   dTDP-glucose enzyme
SDY_2395  1       18       1.834097   nucleotide di-P-sugar epimerase or dehydratase
SDY_2396  0       17       1.595351   regulator
SDY_2397  0       24       -6.211944  hypothetical protein
SDY_2398  0       27       -6.862818  insertion element iso-IS1d protein InsA
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SDY_2394  NUCEPIMERASE  56     1e-10    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 55.9 bits (135), Expect = 1e-10
Identities = 29/125 (23%), Positives = 52/125 (41%), Gaps = 17/125 (13%)

Query: 4 RILVLGASGYIGQHLVRTLSQQGHQILA---------AARHVDRLAKLQLANVSCHKVDL 54
+ LV GA+G+IG H+ + L + GHQ++ + RL L HK+DL
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 55 SWPDNLPALLQD--IDTVYFLVH------SMGESGDFIAQERQVALNVRDALREVPVKQL 106
+ + + L + V+ H S+ + LN+ + R ++ L
Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQHL 121

Query: 107 IFLSS 111
++ SS
Sbjct: 122 LYASS 126


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SDY_2395  NUCEPIMERASE  78     4e-18    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 77.5 bits (191), Expect = 4e-18
Identities = 71/363 (19%), Positives = 124/363 (34%), Gaps = 65/363 (17%)

Query: 13 MKVLVTGATSGLGRNAVEFLCQKGISVRA---------TGRNEAMSKLLEKMGAEFVPAD 63
MK LVTGA +G + + L + G V +A +LL + G +F D
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 64 LTELVSSQAKVMLAGIDTLWHCS-------SFTSPWGTQQAFDLANVRATRRLGEWAIAW 116
L + + ++ S +P A+ +N+ + E
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPH----AYADSNLTGFLNILEGCRHN 116

Query: 117 GVRNLIHISSPSLYFDYHHHRDIKEDFRPHRFANEFARSKAASEKVINMLSQANPQTRFT 176
+++L++ SS S+Y + D + +A +K A+E + + S T
Sbjct: 117 KIQHLLYASSSSVYGL-NRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSH-LYGLPAT 174

Query: 177 ILRPQSLFGPHDK--VFIPRLAHMMHHYGSILLPHGGSALVDMTYYENAVHAMWLASQEA 234
LR +++GP + + + + M SI + + G D TY ++ A+
Sbjct: 175 GLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVI 234

Query: 235 CDKLPS--------------GRVYNITNGEHRTLRSIVQKLIDELNIDCRIRSVPYPMLD 280
RVYNI N L +Q L D L I+ + +P D
Sbjct: 235 PHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNMLPLQPGD 294

Query: 281 MIARSMERLGRKSAKEPPLTHYGVSKLNFDFTLDITRAQEELGYQPVITLDEGIEKTAAW 340
+ T D E +G+ P T+ +G++ W
Sbjct: 295 V----------------LETS-----------ADTKALYEVIGFTPETTVKDGVKNFVNW 327

Query: 341 LRD 343
RD
Sbjct: 328 YRD 330


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
SDY_2396  ECOLIPORIN  29     0.025    E.coli/Salmonella-type porin signature.
		>ECOLIPORIN#E.coli/Salmonella-type porin signature.

Length = 383

Score = 28.7 bits (64), Expect = 0.025
Identities = 20/54 (37%), Positives = 27/54 (50%), Gaps = 9/54 (16%)

Query: 2 RRVFWLIAVALLLAGCAGEKGIVEKEGYQLDTRRQAQAAYPRIKVLVIHYTADD 55
R+V L+ ALL AG A I K+G +LD Y ++ L HY +DD
Sbjct: 3 RKVLALVIPALLAAGAAHAAEIYNKDGNKLDL-------YGKVDGL--HYFSDD 47


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
SDY_2398  HTHTETR  29     0.002    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 29.2 bits (65), Expect = 0.002
Identities = 11/43 (25%), Positives = 18/43 (41%), Gaps = 8/43 (18%)

Query: 52 THQKIIDMAM--------NGVGCRATARIMGVSLNTILHHLKN 86
T Q I+D+A+ + A+ GV+ I H K+
Sbjct: 12 TRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKD 54


66  SDY_2876  SDY_2884  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
SDY_2876  -1      15       0.691288   permease
SDY_2877  -1      13       0.554456   hypothetical protein
SDY_2878  -1      11       0.426064   hypothetical protein
SDY_2879  -2      10       0.037004   transcriptional repressor MprA
SDY_2880  -2      10       0.416829   multidrug resistance secretion protein
SDY_2882  -2      10       0.174407   hypothetical protein
SDY_2883  0       12       0.570867   hypothetical protein
SDY_2884  0       14       0.242093   S-ribosylhomocysteinase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
SDY_2876  TCRTETB  45     3e-07    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 44.9 bits (106), Expect = 3e-07
Identities = 31/166 (18%), Positives = 70/166 (42%), Gaps = 4/166 (2%)

Query: 34 LDTIARNFSLSASSAGFIVTAAQLGYAAGLLFLVPLGDMFERRRLIVSMTLLAAGGMLIT 93
L IA +F+ +S ++ TA L ++ G L D +RL++ ++ G +I
Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIG 96

Query: 94 ASSQSLAMMILSTAL---TGLFSVVAQILVPLAATLASPDKRGKVVGTIMSGLLLGILLA 150
S +++ G + A ++V + A + RGK G I S + +G +
Sbjct: 97 FVGHSFFSLLIMARFIQGAGAAAFPALVMV-VVARYIPKENRGKAFGLIGSIVAMGEGVG 155

Query: 151 RTVAGLLANLGGWRTVFWVASVLMALMALALWRGLPQMKSETHLNY 196
+ G++A+ W + + + + + + +++ + H +
Sbjct: 156 PAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDI 201


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
SDY_2879  PF05272  28     0.017    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.5 bits (63), Expect = 0.017
Identities = 23/94 (24%), Positives = 36/94 (38%), Gaps = 12/94 (12%)

Query: 23 PYQEILLTRLCMHMQSKLLENRNKMLKAQGINETLFMALITLESQENHSIQPSELSCALG 82
P QE+ L + + L R A+G + + T + ++L ALG
Sbjct: 756 PEQELRLVETGVQGRLWALLTREGAPAAEGAAQKGYSVNTTFVTI-------ADLVQALG 808

Query: 83 -----SSRTNATRIADELEKRGWIERRESDNDRR 111
SS ++ D L + GW RE+ RR
Sbjct: 809 ADPGKSSPMLEGQVRDWLNENGWEYLRETSGQRR 842


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
SDY_2880  RTXTOXIND  76     2e-17    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 76.4 bits (188), Expect = 2e-17
Identities = 66/412 (16%), Positives = 122/412 (29%), Gaps = 97/412 (23%)

Query: 25 LLLTLLFIIIAVAIGIYWFLVLRHFEETDDA----YVAGNQIQIMSQVSGSVTKVWADNT 80
L FI+ + I VL E A +G +I + V ++
Sbjct: 57 PRLVAYFIMGFLVIAFILS-VLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEG 115

Query: 81 DFVKEGDVLVTLDPTDARQAFDKA------------------------------------ 104
+ V++GDVL+ L A K
Sbjct: 116 ESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPY 175

Query: 105 ----------------KTALASSVRQTHQLMINSKQLQANIE--VQKIALAKAQSDYNRR 146
K ++ Q +Q +N + +A + +I + S +
Sbjct: 176 FQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKS 235

Query: 147 VL-----LGNANLIGREELQHARDAVTSAQAQLDVAIQQYNANQAMILGTKLEDQPAVQQ 201
L L + I + + + A +L V Q ++ IL K E Q Q
Sbjct: 236 RLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQL 295

Query: 202 AATEVRNSW------------------LALERTRIVSPMTGYVSRRAVQ-PGAQISPTTP 242
E+ + + + I +P++ V + V G ++
Sbjct: 296 FKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAET 355

Query: 243 LMAVVPA-TNMWVDANFKETQIANMRIGQPVTITTDIYGDDVKY---TGKVVGLDMGTGS 298
LM +VP + V A + I + +GQ I + + +Y GKV + +
Sbjct: 356 LMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAF-PYTRYGYLVGKVKNI-----N 409

Query: 299 AFSLLPAQNATGNWIKVVQRLPVRIELDQKQLEQYPLRIGLSTLVSVNTTNR 350
++ G V+ + + PL G++ + T R
Sbjct: 410 LDAIE--DQRLGLVFNVIISIEENCLST--GNKNIPLSSGMAVTAEIKTGMR 457


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
SDY_2884  LUXSPROTEIN  292    e-105    Bacterial autoinducer-2 (AI-2) production protein Lu...
		>LUXSPROTEIN#Bacterial autoinducer-2 (AI-2) production protein LuxS signature.

Length = 171

Score = 292 bits (750), Expect = e-105
Identities = 131/170 (77%), Positives = 148/170 (87%)

Query: 2 PLLDSFTVDHTRMEAPAVRVAKTMNTPHGDAITVFDLRFCVPNKEVMPERGIHTLEHLFA 61
PLLDSFTVDHTRM APAVRVAKTM TP GD ITVFDLRF PNK+++ E+GIHTLEHL+A
Sbjct: 1 PLLDSFTVDHTRMNAPAVRVAKTMQTPKGDTITVFDLRFTAPNKDILSEKGIHTLEHLYA 60

Query: 62 GFMRNHLNGNGVEIIDISPMGCRTGFYMSLIGTPDEQRVADAWKAAMEDVLKVQDQNQIP 121
GFMRNHLNG+ VEIIDISPMGCRTGFYMSLIGTP EQ+VADAW AAMEDVLKV++QN+IP
Sbjct: 61 GFMRNHLNGDSVEIIDISPMGCRTGFYMSLIGTPSEQQVADAWIAAMEDVLKVENQNKIP 120

Query: 122 ELNVYQCGTYQMHSLQEAQDIARSILERDVRINSNEELALPKEKLQELHI 171
ELN YQCGT MHSL EA+ IA++ILE V +N N+ELALP+ L+EL I
Sbjct: 121 ELNEYQCGTAAMHSLDEAKQIAKNILEVGVAVNKNDELALPESMLRELRI 170


67  SDY_3089  SDY_3098  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
SDY_3089  -2      12       1.230421   hypothetical protein
SDY_3090  -3      12       2.480248   acyl-CoA synthetase
SDY_3091  -2      17       3.776536   insertion element iso-IS1n protein InsB
SDY_4737  -3      16       3.732503   insertion element iso-IS1N protein InsA
SDY_3092  -3      17       3.894487   type II secretion protein GspC
SDY_3093  -2      20       4.543249   type II secretion protein
SDY_3094  -1      18       5.005896   type II secretion protein
SDY_3095  0       18       4.056424   type II secretion protein
SDY_3096  0       15       3.346494   type II secretion protein
SDY_3097  1       15       3.549133   type II secretion protein
SDY_3098  2       16       3.448409   type II secretion protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SDY_3089  NUCEPIMERASE  64     9e-14    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 64.4 bits (157), Expect = 9e-14
Identities = 67/337 (19%), Positives = 115/337 (34%), Gaps = 58/337 (17%)

Query: 4 TVAVTGATGFIGKYIIDNLLVRGFHVRALT----------RTARAHV--NDNLTWVRGSL 51
VTGA GFIG ++ LL G V + + AR + + + L
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 52 EDTHSLSELVA--GASAVVHCAGQ--VRGHKEE--IFTHCNVDGSLHLMQAAKESGFCQR 105
D +++L A V + VR E + N+ G L++++ + + Q
Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKI-QH 120

Query: 106 FLFISSLA---------------ARHPELSWYANSKHVAEQRLTAMADEITLGV----FR 146
L+ SS + HP +S YA +K E L A G+ R
Sbjct: 121 LLYASSSSVYGLNRKMPFSTDDSVDHP-VSLYAATKKANE--LMAHTYSHLYGLPATGLR 177

Query: 147 PTAVYGP-GDKELKPLF--DWMLRGLLPRL-GAPDTQLSFLHVTDFAQAVGQWLSAETVQ 202
VYGP G ++ ML G + + F ++ D A+A+ + + V
Sbjct: 178 FFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAI---IRLQDVI 234

Query: 203 TQTYELCDGVAGSYDWQRVQQLAADARCGSVRMVGIPLPVLTCLADISTALSRLAGKEPM 262
G+ + S P+ ++ + + AL A K +
Sbjct: 235 PHADTQWTVETGTPAASIAPYRVYNIGNSS------PVELMDYIQALEDALGIEAKKNML 288

Query: 263 LTRSKIRELTHADWSASNNRISEDINWFPGISLEHAL 299
+ T AD A E I + P +++ +
Sbjct: 289 PLQPGDVLETSADTKALY----EVIGFTPETTVKDGV 321


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SDY_3092  BCTERIALGSPC  109    2e-30    Bacterial general secretion pathway protein C signa...
		>BCTERIALGSPC#Bacterial general secretion pathway protein C signature.

Length = 272

Score = 109 bits (273), Expect = 2e-30
Identities = 65/281 (23%), Positives = 114/281 (40%), Gaps = 36/281 (12%)

Query: 1 MFWLMLLIISAKVAHSLWRYFSFSAEYTVVSPSVNKPPRTDAKTFDKNDVQLISQQNWFG 60
+F+L++L+ ++A WR V S + P + + ND L FG
Sbjct: 18 LFYLLMLLFCQQLAMIFWR-IGLPDNAPVSSVQIT-PAQARQQPVTLNDFTL------FG 69

Query: 61 KY-QPVAAPV-KQPEPAPVAETRLNVVLRGIAFG---ARPGAVIEEGGKQQVYLQGERPG 115
+ A + + + + LN+ L G+ G +R A+I + +Q E
Sbjct: 70 VSPEKNKAGALDASQMSNLPPSTLNLSLTGVMAGDDDSRSIAIISKDNEQFSRGVNEEVP 129

Query: 116 SHNAVIEEINRDHVMLRYQGKIERLSLAEEERSTVAVTRQKAISDEAKQAVAEPAASAPV 175
+NA I I D V+L+YQG+ E L L +E S ++++ +Q
Sbjct: 130 GYNAKIVSIRPDRVVLQYQGRYEVLGLYSQEDSGSDGVPGAQVNEQLQQ----------- 178

Query: 176 ELPAAVRQALAKDPQKIFNYIQLTPVHKEG-IVGYAVKPGADRALFDASGFKEGDIAIAL 234
+ + +Y+ +P+ + + GY + PG F G ++ D+A+AL
Sbjct: 179 -----------RASTTMSDYVSFSPIMNDNKLQGYRLNPGPKSDSFYRVGLQDNDMAVAL 227

Query: 235 NQQDFTDPRAMIALMRQLPSMDSIQLTVLRKGARHDISIAL 275
N D D M ++ + + LTV R G R DI +
Sbjct: 228 NGLDLRDAEQAKKAMERMADVHNFTLTVERDGQRQDIYMEF 268


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SDY_3093  BCTERIALGSPD  516    e-179    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D

signature.
Length = 660

Score = 516 bits (1331), Expect = e-179
Identities = 268/619 (43%), Positives = 395/619 (63%), Gaps = 37/619 (5%)

Query: 3 PGVQGKVSIRTMTPLNERQYYQLFLNLLEAQGYAVVPMENDVLKVVKSSAAKVEPLPLVG 62
P V+G +++R+ LNE QYYQ FL++L+ G+AV+ M N VLKVV+S AK +P+
Sbjct: 58 PSVRGTITVRSYDMLNEEQYYQFFLSVLDVYGFAVINMNNGVLKVVRSKDAKTAAVPVAS 117

Query: 63 EGSDNYAGDEMVTKVVPV-----RELAPILRQMIDSAGSGNVVNYDPSNVIMLTGRASVV 117
+ + GDE+VT+VVP+ R+LAP+LRQ+ D+AG G+VV+Y+PSNV+++TGRA+V+
Sbjct: 118 DAAPG-IGDEVVTRVVPLTNVAARDLAPLLRQLNDNAGVGSVVHYEPSNVLLMTGRAAVI 176

Query: 118 ERLTEVIQRVDHAGNRTEEVIPLDNASASEIARVLESLTKNSGENQ-PATLKSQIVADER 176
+RL +++RVD+AG+R+ +PL ASA+++ +++ L K++ ++ P ++ + +VADER
Sbjct: 177 KRLLTIVERVDNAGDRSVVTVPLSWASAADVVKLVTELNKDTSKSALPGSMVANVVADER 236

Query: 177 TNSVIVSGDPATRNKMRRLIRRLDSEMERSGNSQVFYLKYSKAEDLVDVLKQVSGTLTAA 236
TN+V+VSG+P +R ++ +I++LD + GN++V YLKY+KA DLV+VL +S T+ +
Sbjct: 237 TNAVLVSGEPNSRQRIIAMIKQLDRQQATQGNTKVIYLKYAKASDLVEVLTGISSTMQSE 296

Query: 237 KEEAEGTVGSGREVVSIAASKHSNALIVTAPQDIMQSLQSVIEQLDIRRAQVHVEALIVE 296
K+ A+ + + + I A +NALIVTA D+M L+ VI QLDIRR QV VEA+I E
Sbjct: 297 KQAAKPV-AALDKNIIIKAHGQTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAE 355

Query: 297 VAEGSNINFGVQWASKDAGLMQFANGTQIPIGTLGAAISQAKPQKGSTVISENGATTINP 356
V + +N G+QWA+K+AG+ QF N + +PI T A
Sbjct: 356 VQDADGLNLGIQWANKNAGMTQFTN-SGLPISTAIAG-------------------ANQY 395

Query: 357 DTNGDLST-LAQLLSGFSGTAVGVVKGDWMALVQAVKNDSSSNVLSTPSITTLDNQEAFF 415
+ +G +S+ LA LS F+G A G +G+W L+ A+ + + +++L+TPSI TLDN EA F
Sbjct: 396 NKDGTVSSSLASALSSFNGIAAGFYQGNWAMLLTALSSSTKNDILATPSIVTLDNMEATF 455

Query: 416 MVGQDVPVLTGSTVGSNNSNPFNTVERKKVGIMLKVTPQINEGNAVQMVIEQEVSKVEGQ 475
VGQ+VPVLTGS S N FNTVERK VGI LKV PQINEG++V + IEQEVS V
Sbjct: 456 NVGQEVPVLTGSQTTSG-DNIFNTVERKTVGIKLKVKPQINEGDSVLLEIEQEVSSVADA 514

Query: 476 TS-----LDVVFGERKLKTTVLANDGELIVLGGLMDDQAGESVAKVPLLGDIPLIGNLFK 530
S L F R + VL GE +V+GGL+D ++ KVPLLGDIP+IG LF+
Sbjct: 515 ASSTSSDLGATFNTRTVNNAVLVGSGETVVVGGLLDKSVSDTADKVPLLGDIPVIGALFR 574

Query: 531 STADKKEKRNLMVFIRPTILRDGMAADGVSQRKYNYMRAEQIYR--DEQGLSLMPHTAQP 588
ST+ K KRNLM+FIRPT++RD S +Y Q + E +++
Sbjct: 575 STSKKVSKRNLMLFIRPTVIRDRDEYRQASSGQYTAFNDAQSKQRGKENNDAMLNQDLLE 634

Query: 589 VLPAQNQALPPEVRAFLNA 607
+ P Q+ A +V A ++A
Sbjct: 635 IYPRQDTAAFRQVSAAIDA 653


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SDY_3095  BCTERIALGSPF  433    e-153    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F

signature.
Length = 408

Score = 433 bits (1116), Expect = e-153
Identities = 218/400 (54%), Positives = 294/400 (73%), Gaps = 1/400 (0%)

Query: 1 MALFYYQALERNGRKTKGMIEADSERHARQLLRGKELIPVHI-EARMNASSGGMLQRRRH 59
MA ++YQAL+ G+K +G EADS R ARQLLR + L+P+ + E R + G
Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLR 60

Query: 60 AHRRVAAADLALFTCQLATLVQAAIPLETCLQAVSEQSEKLHVKSLGMALRSRIQEGYTL 119
R++ +DLAL T QLATLV A++PLE L AV++QSEK H+ L A+RS++ EG++L
Sbjct: 61 RKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSL 120

Query: 120 SDSLREHPRVFDSLFCSMVAAGEKSGHLDVVLNRLADYTEQRQRLKSRLLQAMLYPLVLL 179
+D+++ P F+ L+C+MVAAGE SGHLD VLNRLADYTEQRQ+++SR+ QAM+YP VL
Sbjct: 121 ADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLT 180

Query: 180 VVATSVVTILLAAVVPKIIEQFDHLGHALPATTRALIAMSDALQASGVYWLAGLLALLVL 239
VVA +VV+ILL+ VVPK++EQF H+ ALP +TR L+ MSDA++ G + L LLA +
Sbjct: 181 VVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMA 240

Query: 240 GQRLLKNPAMLLRWDKTLLRLPVTGRVARGLNTARFSRTLSILTASSVPLLEGIQTAAAV 299
+ +L+ + + + LL LP+ GR+ARGLNTAR++RTLSIL AS+VPLL+ ++ + V
Sbjct: 241 FRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDV 300

Query: 300 SANRYVEQQLLLAADRVREGSSLRAALAELRLFPPMMLYMIASGEQSGELETMLEQAAVN 359
+N Y +L LA D VREG SL AL + LFPPMM +MIASGE+SGEL++MLE+AA N
Sbjct: 301 MSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADN 360

Query: 360 QEREFDTQVGLALGLFEPALVVMMAGVVLFIVIAILEPML 399
Q+REF +Q+ LALGLFEP LVV MA VVLFIV+AIL+P+L
Sbjct: 361 QDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPIL 400


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SDY_3096  BCTERIALGSPG  218    2e-76    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G

signature.
Length = 145

Score = 218 bits (556), Expect = 2e-76
Identities = 90/146 (61%), Positives = 109/146 (74%), Gaps = 3/146 (2%)

Query: 6 RTQKPRTGFTLLEVMVVIVILGVLASLVVPNLLGNKEKADRQKAISDIVALENALDMYRL 65
R + GFTLLE+MVVIVI+GVLASLVVPNL+GNKEKAD+QKA+SDIVALENALDMY+L
Sbjct: 2 RATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKL 61

Query: 66 DNGRYPTTEQGLEALIQQPANMADSRNYRTGGYIKRLPKDPWGNDYQYLSPGEKGLFDVY 125
DN YPTT QGLE+L++ P + NY GYIKRLP DPWGNDY ++PGE G +D+
Sbjct: 62 DNHHYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDLL 121

Query: 126 TLGADGQENGEGAGADIGNWNLQEFQ 151
+ G DG+ E DI NW L + +
Sbjct: 122 SAGPDGEMGTED---DITNWGLSKKK 144


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SDY_3097  BCTERIALGSPH  58     3e-13    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H

signature.
Length = 170

Score = 58.0 bits (140), Expect = 3e-13
Identities = 32/185 (17%), Positives = 60/185 (32%), Gaps = 41/185 (22%)

Query: 1 MLVIFLIGLASAGVVQTFATASESPAKKAAQDFLTRFAQFKDWAVIDGQTLGVLIDPPGY 60
ML++ L+G+++ V+ F + + A + F + + + GQ GV + P +
Sbjct: 12 MLILLLMGVSAGMVLLAFPASRDDSAAQTLARFEAQLRFVQQRGLQTGQFFGVSVHPDRW 71

Query: 61 QFMQRRHGQWLPVSATRLSAQVTVPKQVQMLLQPGSDIWQKEYALELQRRRL----TLHD 116
QF+ + P D W L L+ R+ ++
Sbjct: 72 QFLVLEARDGADPA-------------------PADDGWSGYRWLPLRAGRVATSGSIAG 112

Query: 117 IELEL-----QKEAKKKTPQIRFSPFEPATPFTLRFYSAAQNACWAVKLAHDGALSLSQC 171
+L L + P + P TPF L L ++ +
Sbjct: 113 GKLNLAFAQGEAWTPGDNPDVLIFPGGEMTPFRLT-------------LGEAPGIAFNAR 159

Query: 172 DERMP 176
E +P
Sbjct: 160 GESLP 164


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SDY_3098  BCTERIALGSPH  33     1e-04    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H

signature.
Length = 170

Score = 33.0 bits (75), Expect = 1e-04
Identities = 13/24 (54%), Positives = 18/24 (75%)

Query: 2 KRGFTLLEVMLALAIFALAATTVL 25
+RGFTLLE+ML L + ++A VL
Sbjct: 3 QRGFTLLEMMLILLLMGVSAGMVL 26


68  SDY_3410  SDY_3417  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
SDY_3410  -2      17       -1.184457  serine endoprotease
SDY_3411  -2      12       -1.125255  serine endoprotease
SDY_3412  -1      12       -0.915557  malate dehydrogenase
SDY_3413  -2      14       -1.253203  arginine repressor ArgR
SDY_3414  -2      14       -0.675287  hypothetical protein
SDY_3415  -1      12       0.264810   hypothetical protein
SDY_3416  -3      11       0.894158   p-hydroxybenzoic acid efflux subunit AaeB
SDY_3417  -2      9        0.953391   p-hydroxybenzoic acid efflux subunit AaeA
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
SDY_3410  V8PROTEASE  72     6e-16    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 72.0 bits (176), Expect = 6e-16
Identities = 32/184 (17%), Positives = 63/184 (34%), Gaps = 38/184 (20%)

Query: 90 GLGSGVIINASKGYVLTNNHVINQAQKISIQL------------NDGREFDAKLIGSDDQ 137
+ SGV++ K +LTN HV++ L +G ++ +
Sbjct: 102 FIASGVVVG--KDTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGE 159

Query: 138 SDIALLQIQN-------PSKLTQIAIADSDKLRVGDFAVAVGNPFGLGQTATSGIVSALG 190
D+A+++ + ++++ + +V G P V+ +
Sbjct: 160 GDLAIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQNITVTGYPGDKP-------VATMW 212

Query: 191 RSGLNLEGLEN-FIQTDASINRGNSGGALLNLNGELIGINTAILAPGGGSVGIGFAIPSN 249
S + L+ +Q D S GNSG + N E+IGI+ G+
Sbjct: 213 ESKGKITYLKGEAMQYDLSTTGGNSGSPVFNEKNEVIGIHWG---------GVPNEFNGA 263

Query: 250 MART 253
+
Sbjct: 264 VFIN 267


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
SDY_3411  V8PROTEASE  53     8e-10    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 52.7 bits (126), Expect = 8e-10
Identities = 31/160 (19%), Positives = 59/160 (36%), Gaps = 26/160 (16%)

Query: 77 RTLGSGVIMDQRGYIITNKHVINDADQIIVALQ------------DGRVFEALLVGSDSL 124
+ SGV++ + ++TNKHV++ AL+ +G +
Sbjct: 101 TFIASGVVVG-KDTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGE 159

Query: 125 TDLAVLKI-------NATGGLPTIPINARRVPHIGDVVLAIGNPYNLGQTITQGIISATG 177
DLA++K + + ++ + + G P + T + G
Sbjct: 160 GDLAIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQNITVTGYPGD-KPVATMW--ESKG 216

Query: 178 RIGLNPTGRQNFLQTDASINHGNSGGALVNSLGELMGINT 217
+I + +Q D S GNSG + N E++GI+
Sbjct: 217 KI---TYLKGEAMQYDLSTTGGNSGSPVFNEKNEVIGIHW 253


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SDY_3412  DHBDHDRGNASE  28     0.049    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 27.7 bits (61), Expect = 0.049
Identities = 37/167 (22%), Positives = 61/167 (36%), Gaps = 27/167 (16%)

Query: 3 VAVLGAAGGIGQALALLLKTQLPSGSELSLYDIAPVTPGVAVDLSHIPTAVKIKGFSGED 62
+ GAA GIG+A+A L G+ ++ D P V S A + F +
Sbjct: 11 AFITGAAQGIGEAVARTL---ASQGAHIAAVDYNP-EKLEKVVSSLKAEARHAEAFPADV 66

Query: 63 ATPA------------LEGADVVLISAGVARK------PGMDRSDLFNVNAGIVKNLVQQ 104
A + D+++ AGV R + F+VN+ V N +
Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRS 126

Query: 105 VAKTCPK----ACIGIITNPVNTT-VAIAAEVLKKAGVYDKNKLFGV 146
V+K + + + +NP ++AA KA K G+
Sbjct: 127 VSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGL 173


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SDY_3413  ARGREPRESSOR  169    4e-57    Bacterial arginine repressor signature.
		>ARGREPRESSOR#Bacterial arginine repressor signature.

Length = 149

Score = 169 bits (430), Expect = 4e-57
Identities = 44/141 (31%), Positives = 71/141 (50%), Gaps = 5/141 (3%)

Query: 15 KALLKEEKFSSQGEIVAALQEQGFDNINQSKVSRMLTKFGAVRTRNAKMEMVYCLPAELG 74
+ ++ + +Q E+V L++ G+ N+ Q+ VSR + + V+ Y LPA+
Sbjct: 11 REIITANEIETQDELVDILKKDGY-NVTQATVSRDIKELHLVKVPTNNGSYKYSLPADQR 69

Query: 75 VPTTSSPLKNLV---LDIDYNDAVVVIHTSPGAAQLIARLLDSLGKAEGILGTIAGDDTI 131
S ++L+ + ID ++V+ T PG AQ I L+D+L E I+GTI GDDTI
Sbjct: 70 FNPLSKLKRSLMDAFVKIDSASHLIVLKTMPGNAQAIGALMDNLDWEE-IMGTICGDDTI 128

Query: 132 FTTPANGFTVKDLYEAILELF 152
K + + ILEL
Sbjct: 129 LIICRTHDDTKVVQKKILELL 149


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
SDY_3417  RTXTOXIND  54     2e-10    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 54.4 bits (131), Expect = 2e-10
Identities = 29/163 (17%), Positives = 59/163 (36%), Gaps = 16/163 (9%)

Query: 6 RKFSRTAITVVLVILAFIAIFNAWVYYTE----SPWTRDARFSADVVAIAPDVSGLITQV 61
SR V I+ F+ I + + S I P + ++ ++
Sbjct: 51 TPVSRRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEI 110

Query: 62 NVHDNQLVKKGQVLFTIDQPR-------YQKALEEAQADVAYYQVLAQEKRQEAGRRNRL 114
V + + V+KG VL + Q +L +A+ + YQ+L++ E + L
Sbjct: 111 IVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRS--IELNKLPEL 168

Query: 115 GVQAMSREEIDQANNVL---QTVLHQLAKAQATRDLAKLDLER 154
+ + VL + Q + Q + +L+L++
Sbjct: 169 KLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDK 211



Score = 51.4 bits (123), Expect = 2e-09
Identities = 28/147 (19%), Positives = 54/147 (36%), Gaps = 15/147 (10%)

Query: 100 LAQEKRQEAGRRNRLGVQ-AMSREEIDQANNVLQT-VLHQLAKAQAT-------RDLAKL 150
E R + ++ + ++EE + + +L +L + +
Sbjct: 264 AVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEE 323

Query: 151 DLERTVIRAPADGWVTNLNVYT-GEFITRGSTAVALVKQNSFY-VLAYMEETKLEGVRPG 208
+ +VIRAP V L V+T G +T T + +V ++ V A ++ + + G
Sbjct: 324 RQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVG 383

Query: 209 YRAEIT----PLGSNKVLKGTVDSVAA 231
A I P L G V ++
Sbjct: 384 QNAIIKVEAFPYTRYGYLVGKVKNINL 410


69  SDY_3507  SDY_3514  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
SDY_3507  1       13       -0.641238  hypothetical protein
SDY_3508  0       15       0.979018   FKBP-type peptidylprolyl isomerase
SDY_3509  0       14       2.597661   hypothetical protein
SDY_3510  -1      13       2.445431   FKBP-type peptidylprolyl isomerase
SDY_3511  0       15       2.157683   hypothetical protein
SDY_3512  0       16       2.153886   NEM-activable K+/H+ antiporter
SDY_3513  0       18       1.399235   glutathione-regulated potassium-efflux system
SDY_3514  -1      14       0.782952   ABC transporter ATP-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SDY_3507  ACRIFLAVINRP  29     0.024    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 28.7 bits (64), Expect = 0.024
Identities = 14/62 (22%), Positives = 29/62 (46%), Gaps = 1/62 (1%)

Query: 164 ASSVEDLVTQTLEFTIEEVNADRNV-SNNAKNRQIVLNLYEKGIFDIKDAINQVADRLNI 222
A +V+D VTQ +E + ++ + S + + + L + D A QV ++L +
Sbjct: 54 AQTVQDTVTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQL 113

Query: 223 SK 224
+
Sbjct: 114 AT 115


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SDY_3508  INFPOTNTIATR  132    4e-40    Macrophage infectivity potentiator signature.
		>INFPOTNTIATR#Macrophage infectivity potentiator signature.

Length = 233

Score = 132 bits (334), Expect = 4e-40
Identities = 79/226 (34%), Positives = 124/226 (54%), Gaps = 9/226 (3%)

Query: 28 AAKPATTADSKAAFKNDDQKSAYALGASLGRYMENSLKEQEKLGIKLDKDQLIAGVQDAF 87
A A A + D K +Y++GA LG K + GI ++ D L G+QD
Sbjct: 14 AMSTAMAATDATSLTTDKDKLSYSIGADLG-------KNFKNQGIDINPDVLAKGMQDGM 66

Query: 88 A-DKSKLSDQEIEQTLQAFEARVKSSAQAKMEKDAADNEAKGKEYREKFAKEKGVKTSST 146
+ + L++++++ L F+ + + A+ K A +N+AKG + + G+ +
Sbjct: 67 SGAQLILTEEQMKDVLSKFQKDLMAKRSAEFNKKAEENKAKGDAFLSANKSKPGIVVLPS 126

Query: 147 GLVYQVVEAGKGEAPKDSDTVVVNYKGTLIDGKEFDNSYIRGEPLSFRLDGVIPGWTEGL 206
GL Y++++AG G P SDTV V Y GTLIDG FD++ G+P +F++ VIPGWTE L
Sbjct: 127 GLQYKIIDAGTGAKPGKSDTVTVEYTGTLIDGTVFDSTEKAGKPATFQVSQVIPGWTEAL 186

Query: 207 KNIKKGGKIKLVIPPELAYGKAGVPG-IPPNSTLVFDVELLDVKPA 251
+ + G ++ +P +LAYG V G I PN TL+F + L+ VK A
Sbjct: 187 QLMPAGSTWEVFVPADLAYGPRSVGGPIGPNETLIFKIHLISVKKA 232


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SDY_3513  ISCHRISMTASE  28     0.024    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 27.7 bits (61), Expect = 0.024
Identities = 32/140 (22%), Positives = 52/140 (37%), Gaps = 26/140 (18%)

Query: 12 YAHPESQDSVANRVLLKPATQLSNVTAHDLYAHYPDFFID-----------IPREQALLR 60
Y P + D N+V P + + HD+ ++ D F I + +
Sbjct: 9 YQMPTASDMPQNKVSWVPDPNRAVLLIHDMQNYFVDAFTAGASPVTELSANIRKLKNQCV 68

Query: 61 EHEVIVFQRPLYTYSCPALLKEWLDRVLSRGFASGPGGNQLAGKYWRSVITTGEPESA-- 118
+ + P+ + P DR L F GPG N +G Y +IT PE
Sbjct: 69 QLGI-----PVVYTAQPGSQNP-DDRALLTDFW-GPGLN--SGPYEEKIITELAPEDDDL 119

Query: 119 ----YRYDALNRYPMSDVLR 134
+RY A R + +++R
Sbjct: 120 VLTKWRYSAFKRTNLLEMMR 139


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
SDY_3514  GPOSANCHOR  35     0.001    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 34.7 bits (79), Expect = 0.001
Identities = 30/152 (19%), Positives = 55/152 (36%), Gaps = 22/152 (14%)

Query: 504 KVEPFDGDLEDYQQWLSDVQKQENQTDDAPKENANSAQARKDQKRREAELRAQTQPLRKE 563
+ D + ++ E + D ++ R+ +R R + L E
Sbjct: 272 AMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAE 331

Query: 564 IARLEKEME---------------------KLNAQLAQAEEKLGDSELYDQSRKAELTAC 602
+LE++ + +L A+ + EE+ SE QS + +L A
Sbjct: 332 HQKLEEQNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDAS 391

Query: 603 LQQQASAKSGLEECEMAWLEALEQLEQMLLEG 634
+ + + LEE L ALE+L + L E
Sbjct: 392 REAKKQVEKALEEANSK-LAALEKLNKELEES 422



Score = 30.4 bits (68), Expect = 0.025
Identities = 14/119 (11%), Positives = 36/119 (30%), Gaps = 8/119 (6%)

Query: 513 EDYQQWLSDVQKQENQTDDAPKENANSAQARKDQKRREAELRAQTQPLRKEIARLEKEME 572
+ + ++ + E A A + D ++ + +++
Sbjct: 127 KALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFST-------ADSAKIK 179

Query: 573 KLNAQLAQAEEKLGDSELYDQSRKAELTACLQQQASAKSGLEECEMAWLEALEQLEQML 631
L A+ A E + + E + TA + + ++ A LE+ +
Sbjct: 180 TLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAA-LAARKADLEKALEGA 237


70  SDY_3834  SDY_3838  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
SDY_3834  -1      13       0.450788   DNA-binding transcriptional regulator CpxR
SDY_3835  -1      13       -0.041923  two-component sensor protein
SDY_3836  0       16       -0.671190  hypothetical protein
SDY_3837  0       15       -0.093644  2-keto-3-deoxygluconate permease
SDY_3838  0       14       0.306109   superoxide dismutase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
SDY_3834  HTHFIS  92     9e-24    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 92.2 bits (229), Expect = 9e-24
Identities = 35/117 (29%), Positives = 62/117 (52%), Gaps = 2/117 (1%)

Query: 3 KILLVDDDRELTSLLKELLEMEGFNVIVAHDGEQALDLL-DDSIDLLLLDVMMPKKNGID 61
IL+ DDD + ++L + L G++V + + + DL++ DV+MP +N D
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 TLKALRQTH-QTPVIMLTARGSELDRVLGLELGADDYLPKPFNDRELVARIRAILRR 117
L +++ PV++++A+ + + + E GA DYLPKPF+ EL+ I L
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
SDY_3835  PF06580  29     0.031    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 29.4 bits (66), Expect = 0.031
Identities = 19/108 (17%), Positives = 38/108 (35%), Gaps = 28/108 (25%)

Query: 354 LENIVRNALRY------SHTKIEVGFAVDKDGITITVDDDGPGVSPEDREQIFRPFYRTD 407
++ +V N +++ KI + D +T+ V++ G +E
Sbjct: 260 VQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKE---------- 309

Query: 408 EARDRESGGTGLGLAIVETAIQQHRGW---VKAEDSPLGGLRLVIWLP 452
TG GL V +Q G +K + G + ++ +P
Sbjct: 310 --------STGTGLQNVRERLQMLYGTEAQIKLSEKQ-GKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SDY_3837  ACRIFLAVINRP  29     0.021    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 29.4 bits (66), Expect = 0.021
Identities = 36/163 (22%), Positives = 66/163 (40%), Gaps = 17/163 (10%)

Query: 170 HVFVGAVLPFLVGFA-LGNLDPELREFFSKAVQTLIPF-FAFALGNTID-LTVIAQTGLL 226
+F +L FLV + L N+ L + V L F A G +I+ LT+ +
Sbjct: 343 TLFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAI 402

Query: 227 GILLGVAVIIVTGIPLIIADKLIGGGDGTAGIAASSSAGAAV--ATPVLIAEMVPA---- 280
G+L+ A+++V + ++ + A + S A+ VL A +P
Sbjct: 403 GLLVDDAIVVVENVERVMMED--KLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFG 460

Query: 281 ------FKPMAPAATSLVATAVIVTSILVPILTSIWSRKIKAR 317
++ + S +A +V+V IL P L + + + A
Sbjct: 461 GSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAE 503


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
SDY_3838  UREASE  27     0.049    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 27.4 bits (61), Expect = 0.049
Identities = 11/27 (40%), Positives = 13/27 (48%), Gaps = 1/27 (3%)

Query: 2 SYTLPSLPYAYDALEPHFDKQTMEIHH 28
S T P+ PY + L H D M HH
Sbjct: 298 SSTNPTRPYTVNTLAEHLD-MLMVCHH 323


71  SDY_3872  SDY_3877  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
SDY_3872  2       21       1.726358   GTP-binding protein
SDY_3873  0       15       1.666238   glutamine synthetase
SDY_3874  0       13       0.984190   nitrogen regulation protein NR(II)
SDY_3875  0       14       -0.148567  nitrogen regulation protein NR(I)
SDY_3876  0       13       -1.150149  coproporphyrinogen III oxidase
SDY_3877  0       14       -1.620995  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
SDY_3872  TCRTETOQM  180    4e-51    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family

signature.
Length = 639

Score = 180 bits (458), Expect = 4e-51
Identities = 97/445 (21%), Positives = 170/445 (38%), Gaps = 81/445 (18%)

Query: 4 KLRNIAIIAHVDHGKTTLVDKLLQQSGTFDSRAETQE--RVMDSNDLEKERGITILAKNT 61
K+ NI ++AHVD GKTTL + LL SG + D+ LE++RGITI T
Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61

Query: 62 AIKWNDYRINIVDTPGHADFGGEVERVMSMVDSVLLVVDAFDGPMPQTRFVTKKAFAYGL 121
+ +W + ++NI+DTPGH DF EV R +S++D +L++ A DG QTR + G+
Sbjct: 62 SFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGI 121

Query: 122 KPIVVINKVDRPGARPDWVVDQVFD-------------LFVNLDATDEQLD--------- 159
I INK+D+ G V + + L+ N+ T+
Sbjct: 122 PTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIEG 181

Query: 160 --------------------------------FPIVYASALNGIAGLDHEDMAEDMTPLY 187
FP+ + SA N I G+D+ L
Sbjct: 182 NDDLLEKYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNNI-GIDN---------LI 231

Query: 188 QAIVDHVPAPDVDLDGPFQMQISQLDYNSYVGVIGIGRIKRGKVKPNQQVTIIDSEGKTR 247
+ I + + ++ +++Y+ + R+ G + V I + E
Sbjct: 232 EVITNKFYSSTHRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKEKI-- 289

Query: 248 NAKVGKVLGHLGLERIETDLAEAGDIVAITGLGELNISDTVCDTQNVEALPALSVDEPTV 307
K+ ++ + E + D A +G+IV + L ++ + DT+ + + P +
Sbjct: 290 --KITEMYTSINGELCKIDKAYSGEIVILQNEF-LKLNSVLGDTKLLPQRERIENPLPLL 346

Query: 308 SMFFCVNTSPFCGKEGKFVTSRQILDRLNKELVHNVALRVEETEDADAFRVSGRGELHLS 367
+ + D L LR +S G++ +
Sbjct: 347 QTTVEPSKPQQREMLLDALLEISDSDPL---------LRYYVDSATHEIILSFLGKVQME 397

Query: 368 VLIENMRRE-GFELAVSRPKVIFRE 391
V ++ + E+ + P VI+ E
Sbjct: 398 VTCALLQEKYHVEIEIKEPTVIYME 422



Score = 32.5 bits (74), Expect = 0.005
Identities = 13/75 (17%), Positives = 29/75 (38%), Gaps = 1/75 (1%)

Query: 398 EPYENVTLDVEEQHQGSVMQALGERKGDLKNMNPDGKGRVRLDYVIPSRGLIGFRSEFMT 457
EPY + + +++ + ++ + V L IP+R + +RS+
Sbjct: 537 EPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQLKN-NEVILSGEIPARCIQEYRSDLTF 595

Query: 458 MTSGTGLLYSTFSHY 472
T+G + + Y
Sbjct: 596 FTNGRSVCLTELKGY 610


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
SDY_3874  PF06580  28     0.041    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 28.3 bits (63), Expect = 0.041
Identities = 34/190 (17%), Positives = 72/190 (37%), Gaps = 41/190 (21%)

Query: 171 IIEQADRLRNLVDRL---LGPQLPGTRVTE-SIHKVAERV---VTLVSMELPDNVRLIRD 223
I+E + R ++ L + L + + S+ V + L S++ D ++
Sbjct: 186 ILEDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQ 245

Query: 224 YDPSLPELAHDPDQIEQVLLN-IVRNALQ---ALGPEGGEIILRTRTAFQLTLHGERYRL 279
+P++ ++ Q+ +L+ +V N ++ A P+GG+I+L+
Sbjct: 246 INPAIMDV-----QVPPMLVQTLVENGIKHGIAQLPQGGKILLKGT------KDNGTVT- 293

Query: 280 AARIDVEDNGPGIPPHLQDTLFYPMVSGREGGTGLGLSIARNLIDQHSGK---IEFTSWP 336
++VE+ G + ++ TG GL R + G I+ +
Sbjct: 294 ---LEVENTGSLALKNTKE------------STGTGLQNVRERLQMLYGTEAQIKLSEKQ 338

Query: 337 GHTEFSVYLP 346
G V +P
Sbjct: 339 GKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
SDY_3875  HTHFIS  602    0.0      FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 602 bits (1553), Expect = 0.0
Identities = 206/478 (43%), Positives = 300/478 (62%), Gaps = 11/478 (2%)

Query: 1 MQRGIVWVVDDDSSIRWVLERALAGAGLTCTTFENGAEVLEALASKTPDVLLSDIRMPGM 60
M + V DDD++IR VL +AL+ AG N A + +A+ D++++D+ MP
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 DGLALLKQIKQRHPMLPVIIMTAHSDLDAAVSAYQQGAFDYLPKPFDIDEAVALVERAIS 120
+ LL +IK+ P LPV++M+A + A+ A ++GA+DYLPKPFD+ E + ++ RA++
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 121 HYQEQQQPRNVQLNGPTTDIIGEAPAMQDVFRIIGRLSRSSISVLINGESGTGKELVAHA 180
+ + ++G + AMQ+++R++ RL ++ ++++I GESGTGKELVA A
Sbjct: 121 EPKRRPSKLEDDSQDGM-PLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARA 179

Query: 181 LHRHSPRAKAPFIALNMAAIPKDLIESELFGHEKGAFTGANTIRQGRFEQADGGTLFLDE 240
LH + R PF+A+NMAAIP+DLIESELFGHEKGAFTGA T GRFEQA+GGTLFLDE
Sbjct: 180 LHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDE 239

Query: 241 IGDMPLDVQTRLLRVLADGQFYRVGGYAPVKVDVRIIAATHQNLEQRVQEGKFREDLFHR 300
IGDMP+D QTRLLRVL G++ VGG P++ DVRI+AAT+++L+Q + +G FREDL++R
Sbjct: 240 IGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYR 299

Query: 301 LNVIRVHLPPLRERREDIPRLARHFLQVAARELGVEAKLLHPETEAALTRLAWPGNVRQL 360
LNV+ + LPPLR+R EDIP L RHF+Q A +E G++ K E + WPGNVR+L
Sbjct: 300 LNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKE-GLDVKRFDQEALELMKAHPWPGNVREL 358

Query: 361 ENTCRWLTVMAAGQEVLIQDLPGELFESTVAESTSQMQPDSWATLLAQWADRALRS---- 416
EN R LT + + + + EL + S + ++Q + +R
Sbjct: 359 ENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFAS 418

Query: 417 -----GHQNLLSEAQPELERTLLTTALRHTQGHKQEAARLLGWGRNTLTRKLKELGME 469
L E+E L+ AL T+G++ +AA LLG RNTL +K++ELG+
Sbjct: 419 FGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVS 476


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SDY_3877  SECA  28     0.030    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 27.5 bits (61), Expect = 0.030
Identities = 11/71 (15%), Positives = 28/71 (39%)

Query: 13 AKARRKTREELNQEARDRKRQKKRRGHAPGSRAAGGNTTSGCKGQNAPKDPRIGSKTPIP 72
+K + + EE+ + + R+ + +R ++ + + ++G P P
Sbjct: 827 SKVQVRMPEEVEELEQQRRMEAERLAQMQQLSHQDDDSAAAAALAAQTGERKVGRNDPCP 886

Query: 73 LGVTEKVTKQH 83
G +K + H
Sbjct: 887 CGSGKKYKQCH 897


72  SDY_4120  SDY_4150  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
SDY_4120  1       34       7.467593   phosphonate ABC transporter ATP-binding protein
SDY_4121  2       28       5.003695   phosphonate metabolism protein
SDY_4122  2       21       4.293498   ribose 1,5-bisphosphokinase
SDY_4123  1       21       4.013829   aminoalkylphosphonic acid N-acetyltransferase
SDY_4124  1       21       3.833210   carbon-phosphorus lyase complex accessory
SDY_4125  1       19       3.123921   hypothetical protein
SDY_4126  1       18       3.029343   hypothetical protein
SDY_4127  2       19       4.061427   histidine kinase
SDY_4128  0       18       3.277933   ribose ABC transporter ATP-binding protein
SDY_4130  -1      16       2.552716   ribose ABC transporter substrate-binding
SDY_4131  0       15       2.029184   hypothetical protein
SDY_4132  1       17       2.242079   hypothetical protein
SDY_4133  -2      17       1.007874   kinase
SDY_4134  -2      16       -0.111272  transcriptional regulator
SDY_4135  -2      16       -0.418906  insertion element iso-IS1n protein InsB
SDY_4776  -1      16       -0.188488  insertion element iso-IS1N protein InsA
SDY_4136  -1      17       -0.818089  hypothetical protein
SDY_4137  -1      18       -1.765097  hypothetical protein
SDY_4139  2       20       -1.676800  insertion element IS1 protein InsA
SDY_4140  1       16       -0.284484  insertion element iso-IS1d protein InsB
SDY_4141  1       11       -0.248825  transport protein
SDY_4142  0       12       0.373953   hypothetical protein
SDY_4143  -1      12       1.829340   DNA-binding protein
SDY_4144  -1      13       2.472392   hypothetical protein
SDY_4146  -1      13       2.516321   cryptic adenine deaminase
SDY_4147  -1      14       2.422847   sugar phosphate antiporter
SDY_4148  0       16       2.869089   regulatory protein UhpC
SDY_4149  0       16       2.230760   sensory histidine kinase UhpB
SDY_4150  1       16       1.903475   DNA-binding transcriptional activator UhpA
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
SDY_4120  PF05272  29     0.014    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.3 bits (65), Expect = 0.014
Identities = 17/70 (24%), Positives = 25/70 (35%), Gaps = 8/70 (11%)

Query: 36 CVVLHGHSGSGKSTLLRSLYANYLPDEGQIQIKHGDEWVDLVTAPARKVVEI------RK 89
VVL G G GKSTL+ +L + I G + + + E+ R+
Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQIAGIV--AYELSEMTAFRR 655

Query: 90 TTVGWVSQFL 99
V F
Sbjct: 656 ADAEAVKAFF 665


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SDY_4123  SACTRNSFRASE  33     3e-04    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 32.6 bits (74), Expect = 3e-04
Identities = 20/84 (23%), Positives = 32/84 (38%), Gaps = 5/84 (5%)

Query: 50 HLALLDGEVVGMIGLHLQFHLHHVNWIGEIQELVVMPQARGLNVGSKLLAWAEEEARQAG 109
L L+ +G I + + N I+++ V R VG+ LL A E A++
Sbjct: 68 FLYYLENNCIGRIKIRSNW-----NGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENH 122

Query: 110 AEMTELSTNVKRHDAHRFYLREGY 133
L T A FY + +
Sbjct: 123 FCGLMLETQDINISACHFYAKHHF 146


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
SDY_4125  RTXTOXIND  26     0.032    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 25.9 bits (57), Expect = 0.032
Identities = 18/107 (16%), Positives = 40/107 (37%), Gaps = 8/107 (7%)

Query: 11 TLLTLTTVPAQADIIDDTIGNIQ--------QAINDAYNPDHGRDYEDSRDDGWQREVSD 62
LL LT + A+AD + +Q Q ++ + + + + + +Q +
Sbjct: 123 VLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEE 182

Query: 63 DRRRQYDDRRRQFEDRRRQLDDRQHQLDQERRQLEDEERRMEDEYGQ 109
+ R + QF + Q ++ LD++R + R+
Sbjct: 183 EVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENL 229


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
SDY_4127  HTHFIS  58     6e-11    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 58.3 bits (141), Expect = 6e-11
Identities = 21/81 (25%), Positives = 44/81 (54%), Gaps = 2/81 (2%)

Query: 640 VLVLEDEAAVRQTICEQLHLLGYLTLEASSGEQALDLLAASAEIDIFISDLMLPGGMSGA 699
+LV +D+AA+R + + L GY S+ +AA + D+ ++D+++P +
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAA-GDGDLVVTDVVMP-DENAF 63

Query: 700 EVVNAARKLYPHLTLLLISGQ 720
+++ +K P L +L++S Q
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQ 84


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
SDY_4130  SUBTILISIN  29     0.024    Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 29.0 bits (65), Expect = 0.024
Identities = 15/65 (23%), Positives = 24/65 (36%), Gaps = 5/65 (7%)

Query: 55 KLAGDNVKVTLVSSGYDLGQQVSQIDNFIAANVDMIIL---NAADSKGIGPAVKRAKDAG 111
L +KV + I I VD+I + D + AVK+A +
Sbjct: 111 DLLI--IKVLNKQGSGQYDWIIQGIYYAIEQKVDIISMSLGGPEDVPELHEAVKKAVASQ 168

Query: 112 IVVVA 116
I+V+
Sbjct: 169 ILVMC 173


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SDY_4131  BCTERIALGSPD  30     0.013    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D

signature.
Length = 660

Score = 29.5 bits (66), Expect = 0.013
Identities = 19/118 (16%), Positives = 43/118 (36%), Gaps = 7/118 (5%)

Query: 116 GGGNLIVELWNADSNEQTADSDVTVVIDGCRQKHTAGTQLRLSPGESICLPPGLYHSFWA 175
G ++++E+ S+ A S + + T + + GE++ + GL +
Sbjct: 496 EGDSVLLEIEQEVSSVADAASSTSSDLGATFNTRTVNNAVLVGSGETVVVG-GLLDKSVS 554

Query: 176 ET-----GFGDV-LVGEVSSVNDDDHDNHFLQPLDRYNLINEDEPAQLVLCNKYRQFR 227
+T GD+ ++G + L R +I + + + +Y F
Sbjct: 555 DTADKVPLLGDIPVIGALFRSTSKKVSKRNLMLFIRPTVIRDRDEYRQASSGQYTAFN 612


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
SDY_4134  HTHFIS  92     1e-23    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 92.2 bits (229), Expect = 1e-23
Identities = 36/120 (30%), Positives = 56/120 (46%), Gaps = 1/120 (0%)

Query: 8 KPVVLVVDDDTAICALLQDVLSEHVFTVSVCHTGQEAILRIEGDPDIALVVLDMMLPDTN 67
+LV DDD AI +L LS + V + I LVV D+++PD N
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGD-GDLVVTDVVMPDEN 61

Query: 68 GLRVLQQIQKLRPTLPVVMLTGMGSKSDVVVGLEMGADDYICKPFTPRVVVARLKAVLRR 127
+L +I+K RP LPV++++ + + E GA DY+ KPF ++ + L
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
SDY_4136  INTIMIN  48     2e-08    Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 47.8 bits (113), Expect = 2e-08
Identities = 45/230 (19%), Positives = 80/230 (34%), Gaps = 21/230 (9%)

Query: 9 GVVEVSGTDKNETGNWSEESDGVYTTTRTAKIAGDRHYATLKLSTWSSAQQSDAYAIRES 68
VSGT + + G T T + G + + K + +SA ++A +
Sbjct: 597 SFNIVSGTAVLSANSANTNGSGKATVTLKSDKPG-QVVVSAKTAEMTSALNANAVIFVDQ 655

Query: 69 GAVLAYSSIVTDKTTYTAGGAIKVTVTLKDSY-ENLVGGQRDAINLAIQLPNTKTESIAW 127
+ + I DKTT A G +T T+K + V Q + + TE
Sbjct: 656 TKA-SITEIKADKTTAVANGQDAITYTVKVMKGDKPVSNQEVTFTTTLGKLSNSTEK--- 711

Query: 128 NEDQKGIYTATYTALLLGTGLKAQLQMSGWANALTSNDYSISGDAASAQIVAMQVTTGNP 187
D G T T+ G L + ++S A + + + + + GN
Sbjct: 712 -TDTNGYAKVTLTSTTPGKSLVSA-RVSDVAVDVKAPEVEFFTT--------LTIDDGNI 761

Query: 188 DVLANGSDRHTVNVRVE-DQFGNVLPEQTVTFTVT----KGAAVFANAGQ 232
+++ G V ++ Q +T A+V A++GQ
Sbjct: 762 EIVGTGVKGKLPTVWLQYGQVNLKASGGNGKYTWRSANPAIASVDASSGQ 811



Score = 37.0 bits (85), Expect = 7e-05
Identities = 56/247 (22%), Positives = 88/247 (35%), Gaps = 35/247 (14%)

Query: 24 WSEESDGVYTTTRTAKIAGDRH-----YATLKLSTWSSAQQSDAYAIRESGAVLAYSSIV 78
+ + VY T A DR+ L ++ S+ Q D + +
Sbjct: 517 YVQGGSNVYKVTARAY---DRNGNSSNNVLLTITVLSNGQVVDQVGV---------TDFT 564

Query: 79 TDKTTYTAGG--AIKVTVTLKDSYENLVGGQRD-AINLAIQLPNTKTESIAWNEDQKGIY 135
DKT+ A G AI T T+K + I + + + N + G
Sbjct: 565 ADKTSAKADGTEAITYTATVKKNGVAQANVPVSFNIVSGTAVLSANS----ANTNGSGKA 620

Query: 136 TATYTALLLGTGLKAQLQMSGWANALTSNDYSISGDAASAQIVAMQVTTGNPDVLANGSD 195
T T + G + + + + + AS + TT +ANG D
Sbjct: 621 TVTLKSDKPGQVVVSAKTAEMTSALNANAVIFVDQTKASITEIKADKTT----AVANGQD 676

Query: 196 RHTVNVRVEDQFGNVLPEQTVTFTVTKGAAVFANAGQSADIRTDAHGMAEVDLSSTVADA 255
T V+V + + Q VTFT T S + +TD +G A+V L+ST
Sbjct: 677 AITYTVKV-MKGDKPVSNQEVTFTTT-----LGKLSNSTE-KTDTNGYAKVTLTSTTPGK 729

Query: 256 STVEAKV 262
S V A+V
Sbjct: 730 SLVSARV 736


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
SDY_4137  INTIMIN  296    8e-93    Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 296 bits (759), Expect = 8e-93
Identities = 104/492 (21%), Positives = 193/492 (39%), Gaps = 41/492 (8%)

Query: 1 MALFGKDERQNDPHAITAGLSYTPVPLISFSAEQRQGKQGENDTRIGMELTLQPGHSLQK 60
+ALF D+ Q++P A T G++YTP+PL++ + R G END M+ Q +
Sbjct: 358 VALFNSDKLQSNPGAATVGVNYTPIPLVTMGIDYRHGTGNENDLLYSMQFRYQFDKPWSQ 417

Query: 61 QLDPAEVAARRSLVGSRYDLVDRNNNIVLEYLKKELVRLTLTDPLKGKPGEVKSLVSSLQ 120
Q++P V R+L GSRYDLV RNNNI+LEY K++++ L + + G + + ++
Sbjct: 418 QIEPQYVNELRTLSGSRYDLVQRNNNIILEYKKQDILSLNIPHDINGTERSTQKIQLIVK 477

Query: 121 TKYALKGYDIEAASLQSAGGKVAVSG----KDIQVTIPPYRFTAMPETDNIYPIAVTAED 176
+KY L + ++L+S GG++ SG +D Q +P Y + N+Y + A D
Sbjct: 478 SKYGLDRIVWDDSALRSQGGQIQHSGSQSAQDYQAILPAY----VQGGSNVYKVTARAYD 533

Query: 177 SKGNFSRREE-SMVVVEKPTLSLADSTLSVDLQILLADGKSTSMLTYTA------RDSSG 229
GN S ++ V+ + A T +TYTA +
Sbjct: 534 RNGNSSNNVLLTITVLSNGQVVDQVGVTDFTADKTSAKADGTEAITYTATVKKNGVAQAN 593

Query: 230 KPIPGMTLKTQAKGLQDFALSEWKDNGNGTYTQIVTAGKTSGALSLMPQFNGDDIAKTPA 289
P+ + G + + NG+G T + + K + A
Sbjct: 594 VPVSFNIV----SGTAVLSANSANTNGSGKATVTLKSDKPGQVVVSAKTAEMTSALNANA 649

Query: 290 LIAIVANTASRADSTIETDQDNYVAGKPIVVKVTLRDD-NGNGVTGRKELLKQTVKVDNT 348
+I + AS + I+ D+ VA + T++ V+ ++ T+ +
Sbjct: 650 VIFVDQTKASI--TEIKADKTTAVANGQDAITYTVKVMKGDKPVSNQEVTFTTTLGKLSN 707

Query: 349 KADAVSAWTEESEGIYKASYTAHLIGDKLTA------QLTMPGWQTKHSDAFSIAGDKDT 402
+ ++ G K + T+ G L + + + + + +I
Sbjct: 708 STE-----KTDTNGYAKVTLTSTTPGKSLVSARVSDVAVDVKAPEVEFFTTLTIDDGNIE 762

Query: 403 AKIAAMQITANNTVARRDHNTVAVTVRDVHQNLLQGQNVTFTVVNGAAVFADPNGGIVTT 462
++ + + + + T+ N A D + G VT
Sbjct: 763 IVGTGVKGKLPTVWLQYGQVNLKASGGNG--------KYTWRSANPAIASVDASSGQVTL 814

Query: 463 DKDGIASVNLAS 474
+ G ++++ S
Sbjct: 815 KEKGTTTISVIS 826



Score = 33.9 bits (77), Expect = 0.002
Identities = 30/151 (19%), Positives = 55/151 (36%), Gaps = 21/151 (13%)

Query: 355 AWTEESEGIYKASYTAHLIGDKLT--AQLTM----PGWQTKHSDAFSIAGDKDTAKIAAM 408
A+ + +YK + A+ + LT+ G DK +AK
Sbjct: 516 AYVQGGSNVYKVTARAYDRNGNSSNNVLLTITVLSNGQVVDQVGVTDFTADKTSAK---- 571

Query: 409 QITANNTVARRDHNTVAVTVRDVHQNLLQGQNVTFTVVNGAAVFADPNGGIVTTDKDGIA 468
A+ T A T TV+ V+F +V+G A + T+ G A
Sbjct: 572 ---ADGTEAI----TYTATVKKNGVAQA-NVPVSFNIVSGTA---VLSANSANTNGSGKA 620

Query: 469 SVNLASDQAVNSLIKAEINGSSQSVEVSFTL 499
+V L SD+ ++ A+ + ++ + +
Sbjct: 621 TVTLKSDKPGQVVVSAKTAEMTSALNANAVI 651


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
SDY_4141  TCRTETA  38     5e-05    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 37.9 bits (88), Expect = 5e-05
Identities = 33/208 (15%), Positives = 71/208 (34%), Gaps = 13/208 (6%)

Query: 33 IIVEFLPVSLLTP----MAQDLGISEGVA---GQSVTVTAFVAMFASLFITQTIQATDRR 85
+ ++ + + L+ P + +DL S V G + + A + + + RR
Sbjct: 14 VALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRR 73

Query: 86 YVVILFAVLLTISCLLVSFANSFSLLLIGRACLGLALGGFWAMSASLTMRLVPPRTVPKA 145
V+++ + +++ A +L IGR G+ G A++ + + +
Sbjct: 74 PVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGIT-GATGAVAGAYIADITDGDERARH 132

Query: 146 LSVIFGAVSIALVIAAPLGSFLGELIGWRNVFNAAAVMG----VLCIFWIIKSLPSLPGK 201
+ +V LG +G F AAA + + F + +S
Sbjct: 133 FGFMSACFGFGMVAGPVLGGLMGG-FSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRP 191

Query: 202 PSHQKQNTFRLLQRPGVMAGMIAIFMSF 229
+ N + M + A+ F
Sbjct: 192 LRREALNPLASFRWARGMTVVAALMAVF 219


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
SDY_4146  UREASE  37     1e-04    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 37.4 bits (87), Expect = 1e-04
Identities = 28/105 (26%), Positives = 41/105 (39%), Gaps = 17/105 (16%)

Query: 22 AVSRGDVVADYIIDNVSILDLINGGEISGPIVIKGRYIAGVG----------AEYTDAPA 71
V+R D +I N ILD + G + I +K IA +G P
Sbjct: 60 QVTREGGAVDTVITNALILD--HWGIVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPG 117

Query: 72 LQRIDARGATAVPGFIDAHLHIESSMMTPVTFETATLPRGLTTVI 116
+ I G G +D+H+H + P E A L GLT ++
Sbjct: 118 TEVIAGEGKIVTAGGMDSHIH----FICPQQIEEA-LMSGLTCML 157


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
SDY_4147  TCRTETB  35     6e-04    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 34.9 bits (80), Expect = 6e-04
Identities = 61/372 (16%), Positives = 133/372 (35%), Gaps = 32/372 (8%)

Query: 49 FNIAQNDMISTYGLSMTQLGMIGLGFSITYGVGKTLVSYYADGKNTKQFLPFMLILSAIC 108
N++ D+ + + + F +T+ +G + +D K+ L F +I++ C
Sbjct: 33 LNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIIN--C 90

Query: 109 MLGFSASMGSGSVSLFLMIAFYALSGFFQSTGGSCSYSTI----TKWTPRRKRGTFLGFW 164
+G SL +M + F Q G + + + ++ P+ RG G
Sbjct: 91 FGSVIGFVGHSFFSLLIM------ARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLI 144

Query: 165 NISHNLGGA-----GAAGVALFGANYLFDGHVIGMFIFPSIIALIVGFIGLRYGSDSPES 219
+G G +YL +I + P ++ L+ + ++ D
Sbjct: 145 GSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDIKGI 204

Query: 220 YGLGKAEELFGEEISEEDKETESTDMTKWQIFVEYVLK--NKVIWLLCFANI-FLYVVRI 276
+ F + + + IFV+++ K + + NI F+ V
Sbjct: 205 ILMSVGIVFFMLFTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLC 264

Query: 277 GIDQWSTVYAFQELKLSKAVAIQGFTLFEAG------AMVGTLLWGWLSDLANGRRG--L 328
G + TV F + + + E G + +++G++ + RRG
Sbjct: 265 GGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGILVDRRGPLY 324

Query: 329 VACIALALIIA---TLGVYQHASNEYIYLASLFALGFLVFGPQLLIGVAAVGFVPKKAIG 385
V I + + T ++ ++ + +F LG L F ++ + + ++A G
Sbjct: 325 VLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVISTIVSSSLKQQEA-G 383

Query: 386 AADGIKGTFAYL 397
A + ++L
Sbjct: 384 AGMSLLNFTSFL 395


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
SDY_4148  TCRTETB  39     4e-05    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 38.7 bits (90), Expect = 4e-05
Identities = 65/408 (15%), Positives = 137/408 (33%), Gaps = 60/408 (14%)

Query: 30 RHILLTIWLGYALFY--FTRKSFNAAVPEILANGVLSRSDIGLLATLFYITYGVLKFVSG 87
RH + IWL F+ N ++P+I + + + T F +T+ + V G
Sbjct: 11 RHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYG 70

Query: 88 IVSDRSNARYFMGTGLIATGIINILFGFSTSLWAFAVLWVLNAFFQGWGS---PVCARLL 144
+SD+ + + G+I +++ S F L ++ F QG G+ P ++
Sbjct: 71 KLSDQLGIKRLLLFGIIINCFGSVIGFVGHS---FFSLLIMARFIQGAGAAAFPALVMVV 127

Query: 145 TAWY-SRTERGGWWALWNTAHNVGGALIPIVMAAAALHYGWRAGMMIAGCMAIVVGIFLC 203
A Y + RG + L + +G + P + A + W ++ M ++ +
Sbjct: 128 VARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHW--SYLLLIPMITIITVPF- 184

Query: 204 WRLRDRPQALGLPAVGEWRHDALEIAQQQEGAGLTRKEILTKYVLLNPYIWLLSFCYVLV 263
+ L +I G L I+ + Y VL
Sbjct: 185 --------LMKLLKKEVRIKGHFDIK----GIILMSVGIVFFMLFTTSYSISFLIVSVLS 232

Query: 264 YVV-----RAAINDWGNLYMSETLGVDLVTANTAVTMFELGGFI-----------GALVA 307
+++ R + + + + + + + + + GF+ A
Sbjct: 233 FLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTA 292

Query: 308 GWGSDKLFNGNRGPMNLIFAAGILL-SVGSLWLMPFASYVMQATCFFTIGFFVFGPQMLI 366
GS +F G + + GIL+ G L+++ + + F T F + +
Sbjct: 293 EIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFL-SVSFLTASFLLETTSWFM 351

Query: 367 ---------GMAAAECS---------HKEAAGAATGFVGLFAYLGASL 396
G++ + ++ AGA + ++L
Sbjct: 352 TIIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGT 399


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
SDY_4149  PF06580  40     2e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 39.8 bits (93), Expect = 2e-05
Identities = 28/142 (19%), Positives = 56/142 (39%), Gaps = 11/142 (7%)

Query: 366 LRPRQLDDLTLEQAIRSLMREMELEGRGIVRHLEWRIDESALSENQRVTLFRVCQEGLNN 425
LR ++L + + ++L L++ + + +V + Q + N
Sbjct: 208 LRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPM-LVQTLVEN 266

Query: 426 IVKHA-----DASAVTLQGWQQDERLMLVIEDDGSGLPPGSGQ-QGFGLTGMRERVTALG 479
+KH + L+G + + + L +E+ GS + + G GL +RER+ L
Sbjct: 267 GIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGTGLQNVRERLQMLY 326

Query: 480 G---TLHISCLHG-TRVSVSLP 497
G + +S G V +P
Sbjct: 327 GTEAQIKLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
SDY_4150  HTHFIS  60     4e-13    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 60.2 bits (146), Expect = 4e-13
Identities = 29/174 (16%), Positives = 59/174 (33%), Gaps = 20/174 (11%)

Query: 2 ITVALIDDHLIVRSGFAQLLGLEPDLQVVAEFGSGREALAGLPGRGVQVCICDISMPDIS 61
T+ + DD +R+ Q L V + + + + D+ MPD +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRA-GYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 62 GLELLSQLPK---GMATIMLSVHDSPALVEQALNAGARGFLSKRCSPDELIVAVHTVATG 118
+LL ++ K + +++S ++ +A GA +L K ELI +
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR---- 117

Query: 119 GCYLTPDIAIKLASGRQDPLTKRERQVAEKLAQG---MAVKEIAAELGLSPKTV 169
A+ R L + + + + + A L + T+
Sbjct: 118 --------ALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTL 163


73  SDY_4328  SDY_4792  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
SDY_4328  0       15       0.048405   isocitrate lyase
SDY_4330  2       19       -0.430747  homoserine O-succinyltransferase
SDY_4331  2       23       0.058344   insertion sequence element IS1 transposase
SDY_4332  3       22       0.049403   insertion element IS1 protein InsA
SDY_4333  2       22       0.049403   hypothetical protein
SDY_4334  2       22       0.035759   hypothetical protein
SDY_4335  1       18       -0.826725  insertion sequence element IS4 transposase InsG
SDY_4792  0       17       -2.611682  insertion element iso-IS1N protein InsA
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SDY_4328  BINARYTOXINB  32     0.004    Binary toxin B family signature.
		>BINARYTOXINB#Binary toxin B family signature.

Length = 764

Score = 32.3 bits (73), Expect = 0.004
Identities = 14/58 (24%), Positives = 23/58 (39%)

Query: 289 ETSTPDLELARRFAQAIHAKYPGKLLAYNCSPSFNWQKNLDDKTIASFQQQLSDMGYK 346
ET+ PD+ L A P L Y + N D +T + + QL+++
Sbjct: 544 ETTKPDMTLKEALKIAFGFNEPNGNLQYQGKDITEFDFNFDQQTSQNIKNQLAELNAT 601


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SDY_4333  SACTRNSFRASE  31     0.001    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 30.7 bits (69), Expect = 0.001
Identities = 15/54 (27%), Positives = 20/54 (37%), Gaps = 5/54 (9%)

Query: 78 IDPDVCGCGVGRMLVEHALSMAPE-----LTTNVNEQNEQAVGFYKKVGFKVTG 126
+ D GVG L+ A+ A E L + N A FY K F +
Sbjct: 97 VAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFIIGA 150


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SDY_4334  SHAPEPROTEIN  31     7e-04    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.
Length = 347

Score = 31.3 bits (71), Expect = 7e-04
Identities = 23/62 (37%), Positives = 32/62 (51%), Gaps = 9/62 (14%)

Query: 37 IANFFVAEKVLQDLVLQLHPRSTWHSFLPAKRMDIVVSALEMNEGGLSQVEERILHEVVA 96
IA+FFV EK+LQ + Q+H S P+ R+ + V G +QVE R + E
Sbjct: 81 IADFFVTEKMLQHFIKQVHSNSF---MRPSPRVLVCVPV------GATQVERRAIRESAQ 131

Query: 97 GA 98
GA
Sbjct: 132 GA 133


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SDY_4792  ACRIFLAVINRP  27     0.006    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 27.5 bits (61), Expect = 0.006
Identities = 12/40 (30%), Positives = 20/40 (50%), Gaps = 6/40 (15%)

Query: 50 GIKELLTEM-AFNGAGV-----RDTARTLKIGINTVIRTL 83
IK L E+ F G+ DT +++ I+ V++TL
Sbjct: 305 AIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTL 344



 
Contact Sachin Pundhir for Bugs/Comments.