PredictBias

identification of genomic and pathogenicity islands in prokaryotic genome
 
A) Input parameters
Genome: 2238.gbk
Threshold dinucleotide bias: 2
Threshold codon bias: 4
Threshold %GC bias: 3
E-value (RPSBlast): 0.05
Genome (non-pathogenic): (blank)
 
C) Potential GIs and PAIs in NC_008322
S.No  Start         End           Bias  Virulence  Insertion elements  Prediction
1     Shewmr7_0083  Shewmr7_0102  Y     Y          N                   Pathogenicity Island (biased-composition)

LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr7_0083  2       19       -0.624526  hypothetical protein
Shewmr7_0084  2       17       0.456398   hypothetical protein
Shewmr7_0085  2       17       0.546762   transcriptional regulator
Shewmr7_0086  2       19       0.724470   hypothetical protein
Shewmr7_0087  2       17       1.229657   transport protein
Shewmr7_0088  1       17       2.750065   ABC transporter-like protein
Shewmr7_0089  -1      15       3.714778   TAP domain-containing protein
Shewmr7_0090  -1      17       4.310819   hypothetical protein
Shewmr7_0091  -2      17       4.149179   GntR family transcriptional regulator
Shewmr7_0092  -1      18       3.588823   ABC transporter-like protein
Shewmr7_0093  0       20       3.741340   hypothetical protein
Shewmr7_0094  0       22       3.324796   AMP-dependent synthetase and ligase
Shewmr7_0095  1       20       3.185507   MORN repeat-containing protein
Shewmr7_0096  1       18       3.675002   hypothetical protein
Shewmr7_0097  1       19       4.582696   thioesterase superfamily protein
Shewmr7_0098  1       21       5.451021   HPP family protein
Shewmr7_0099  -2      21       5.306510   hypothetical protein
Shewmr7_0100  -2      23       5.462259   thioesterase superfamily protein
Shewmr7_0101  -2      19       4.449181   MerR family transcriptional regulator
Shewmr7_0102  -1      18       3.489856   NAD(P)H dehydrogenase (quinone)
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0089  PF06291       29     0.017    Lambda prophage Bor protein
		>PF06291#Lambda prophage Bor protein

Length = 102

Score = 28.9 bits (64), Expect = 0.017
Identities = 20/72 (27%), Positives = 32/72 (44%), Gaps = 7/72 (9%)

Query: 3 NPMKRIINALMLAMLATGCSDPNPTQTNQTAEQAKAEKAVQVLNLKQQLAPITQRYFALR 62
N MK+++ + LAML TGC+ QT + A + + ++ I Q+
Sbjct: 4 NKMKKMLFSAALAMLITGCA----QQTFTVGNKPTAVTPKETITHHFFVSGIGQKKTVDA 59

Query: 63 PEIATYYGVAEN 74
+I G AEN
Sbjct: 60 AKIC---GGAEN 68
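
The statistics line of each alignment can be read programmatically. The following is a minimal sketch, assuming the text layout shown in this report ("Identities = n/m (p%), ..."); the function name `parse_alignment_stats` is hypothetical and not part of PredictBias or BLAST. Integer division is used because it reproduces the percentages printed on the example line.

```python
import re

# Matches fields such as "Identities = 20/72"; the printed "(27%)" is recomputed.
_STAT = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+)")


def parse_alignment_stats(line):
    """Return {field: (count, aligned_length, percent)} for one statistics line.

    Hypothetical helper: percent is computed with integer division, which
    matches the values printed on the example line below.
    """
    stats = {}
    for name, count, length in _STAT.findall(line):
        count, length = int(count), int(length)
        stats[name] = (count, length, 100 * count // length)
    return stats


# Statistics line copied from the Shewmr7_0089 / PF06291 alignment above.
line = "Identities = 20/72 (27%), Positives = 32/72 (44%), Gaps = 7/72 (9%)"
stats = parse_alignment_stats(line)
for name, (count, length, percent) in stats.items():
    print(f"{name}: {count}/{length} = {percent}%")
```

Alignments without gaps omit the "Gaps" field, so the returned dictionary then has only the "Identities" and "Positives" keys.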


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0091  TOXICSSTOXIN  29     0.018    Staphylococcal toxic shock syndrome toxin signature.
		>TOXICSSTOXIN#Staphylococcal toxic shock syndrome toxin signature.

Length = 234

Score = 28.8 bits (64), Expect = 0.018
Identities = 12/30 (40%), Positives = 16/30 (53%)

Query: 205 LRKLLKQTYDLPKSHFYTSSYWKIGCNEGE 234
+R L Q + L +S T YWKI N+G
Sbjct: 173 IRHQLTQIHGLYRSSDKTGGYWKITMNDGS 202


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0092  UREASE        44     1e-06    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 43.6 bits (103), Expect = 1e-06
Identities = 23/56 (41%), Positives = 31/56 (55%), Gaps = 8/56 (14%)

Query: 348 LAGLTLNAAKALGIEENVGSLVVGKQADFCLWDIATPAQLAYSYGVNPCKDVVKNG 403
+A T+N A A G+ +GSL VGK+AD LW+ PA +GV P V+ G
Sbjct: 406 IAKYTINPAIAHGLSHEIGSLEVGKRADLVLWN---PAF----FGVKP-DMVLLGG 453



Score = 31.6 bits (72), Expect = 0.007
Identities = 18/61 (29%), Positives = 29/61 (47%), Gaps = 6/61 (9%)

Query: 23 YGAITNAAIAVKDGKIAWLGPRSE---LPAFDVL---SIPVYRGKGGWITPGLIDAHTHL 76
+ I A I +KDG+IA +G P ++ V G+G +T G +D+H H
Sbjct: 80 HWGIVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAGEGKIVTAGGMDSHIHF 139

Query: 77 V 77
+
Sbjct: 140 I 140


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0102  TCRTETOQM     50     2e-08    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family

signature.
Length = 639

Score = 50.2 bits (120), Expect = 2e-08
Identities = 32/101 (31%), Positives = 48/101 (47%), Gaps = 16/101 (15%)

Query: 8 HVDHGKSTLIRALT---------------GMNTDRLPEEKRRGMTIDLGYAFMPLQDGTR 52
HVD GK+TL +L TD E++RG+TI G + T+
Sbjct: 11 HVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITSFQW-ENTK 69

Query: 53 LAFIDVPGHEKFINNMLVGVSHVRHALLVLACDDGVMPQTR 93
+ ID PGH F+ + +S + A+L+++ DGV QTR
Sbjct: 70 VNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTR 110


2     Shewmr7_0175  Shewmr7_0223  Y     Y          Y                   Pathogenicity Island (biased-composition)

LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr7_0175  3       23       -2.260013  peptidase M14, carboxypeptidase A
Shewmr7_0176  2       22       -2.100741  hypothetical protein
Shewmr7_0177  3       27       -2.508072  hypothetical protein
Shewmr7_0178  6       38       -3.182685  sodium:dicarboxylate symporter
Shewmr7_0179  7       57       -3.332612  hypothetical protein
Shewmr7_0180  8       59       -2.806129  hypothetical protein
Shewmr7_0181  9       56       -3.417569  hypothetical protein
Shewmr7_0182  10      52       -2.158569  OmpA/MotB domain-containing protein
Shewmr7_0183  12      54       -1.243720  hypothetical protein
Shewmr7_0184  12      54       -1.030533  transporter AbgT
Shewmr7_0185  13      53       -0.893400  phosphoenolpyruvate carboxykinase
Shewmr7_0186  12      50       -0.987428  Hsp33-like chaperonin
Shewmr7_0187  12      51       -0.929959  RNA-binding S4 domain-containing protein
Shewmr7_0188  12      53       -0.745327  general secretion pathway protein C
Shewmr7_0189  9       53       -1.375106  general secretion pathway protein D
Shewmr7_0190  9       52       -1.429401  hypothetical protein
Shewmr7_0191  9       51       -1.441095  type II secretion system protein E (GspE)
Shewmr7_0192  10      60       -0.815459  general secretion pathway protein F
Shewmr7_0193  7       53       -0.679285  general secretion pathway protein G
Shewmr7_0194  8       53       -0.416082  general secretion pathway protein H
Shewmr7_0195  8       50       -0.455688  hypothetical protein
Shewmr7_0196  6       50       -0.822696  general secretion pathway protein I
Shewmr7_0197  7       51       -0.608362  hypothetical protein
Shewmr7_0198  7       48       -2.026511  general secretion pathway protein J
Shewmr7_0199  8       46       -1.769515  hypothetical protein
Shewmr7_0200  8       45       -2.734463  general secretion pathway protein K
Shewmr7_0201  8       43       -3.423294  hypothetical protein
Shewmr7_0202  8       46       -2.977011  general secretion pathway protein L
Shewmr7_0203  5       45       -2.810928  general secretion pathway M protein
Shewmr7_0204  5       44       -1.772178  type II secretion system protein N
Shewmr7_0205  5       44       -1.465050  hypothetical protein
Shewmr7_0206  5       43       -0.922216  HAD family hydrolase
Shewmr7_0207  6       45       -0.594780  hypothetical protein
Shewmr7_0208  7       43       -0.693525  ADP-ribose diphosphatase NudE
Shewmr7_0209  5       40       -2.169331  3'(2'),5'-bisphosphate nucleotidase
Shewmr7_0210  5       39       -2.666203  GCN5-related N-acetyltransferase
Shewmr7_0211  4       39       -2.965338  TetR family transcriptional regulator
Shewmr7_0212  4       42       -2.964071  phospholipid/glycerol acyltransferase
Shewmr7_0213  5       42       -2.914699  tRNA 2-selenouridine synthase
Shewmr7_0214  7       41       -2.455055  selenophosphate synthetase
Shewmr7_0215  11      47       -0.981143  Delta-9 acyl-phospholipid desaturase
Shewmr7_0216  9       43       -0.223204  DNA-binding transcriptional repressor FabR
Shewmr7_0217  8       40       -0.417448  tRNA (uracil-5-)-methyltransferase
Shewmr7_0218  7       33       -0.817980  glutamate racemase
Shewmr7_0219  4       26       -0.126873  RNP-1 like RNA-binding protein
Shewmr7_0220  0       23       -1.595040  hypothetical protein
Shewmr7_0222  0       23       -2.231285  **UDP-N-acetylenolpyruvoylglucosamine reductase
Shewmr7_0223  1       24       -3.090630  biotin--protein ligase
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0180  TCRTETOQM     83     2e-19    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family

signature.
Length = 639

Score = 83.0 bits (205), Expect = 2e-19
Identities = 55/199 (27%), Positives = 92/199 (46%), Gaps = 5/199 (2%)

Query: 13 VNVGTIGHVDHGKTTLTAAI--SHVLAKTYGGEAKDFSQIDNAPEERERGITINTSHIEY 70
+N+G + HVD GKTTLT ++ + G K ++ DN ER+RGITI T +
Sbjct: 4 INIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITSF 63

Query: 71 DTPTRHYAHVDCPGHADYVKNMITGAAQMDGAILVVASTDGPMPQTREHILLSRQVGVPF 130
+D PGH D++ + + +DGAIL++++ DG QTR R++G+P
Sbjct: 64 QWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGIP- 122

Query: 131 IIVFMNKCDMVDDAELLELVEMEVRELLSEYDFPGDDLPVIQGSALKALEGEPEWEAKII 190
I F+NK D L V +++E LS + + + +W+ I
Sbjct: 123 TIFFINKIDQN--GIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIE 180

Query: 191 ELAEALDSYIPEPERDIDK 209
+ L+ Y+ + +
Sbjct: 181 GNDDLLEKYMSGKSLEALE 199


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0181  SECETRNLCASE  118    8e-38    Bacterial translocase SecE signature.
		>SECETRNLCASE#Bacterial translocase SecE signature.

Length = 127

Score = 118 bits (297), Expect = 8e-38
Identities = 63/125 (50%), Positives = 90/125 (72%), Gaps = 2/125 (1%)

Query: 1 MTTNTENQ--TNSLDIVKWGLAILLLAAAVIGNQMYSETSAVIRALAVIVAFAIAGFIAL 58
M+ NTE Q L+ +KW + + LL A++GN +Y + +RALAV++ A AG +AL
Sbjct: 1 MSANTEAQGSGRGLEAMKWVVVVALLLVAIVGNYLYRDIMLPLRALAVVILIAAAGGVAL 60

Query: 59 QTEKGKKALAFARESQIEVRKVVWPTRQEALNTTFIVLAATGILALVLWGLDAVLMHIVN 118
T KGK +AFARE++ EVRKV+WPTRQE L+TT IV A T +++L+LWGLD +L+ +V+
Sbjct: 61 LTTKGKATVAFAREARTEVRKVIWPTRQETLHTTLIVAAVTAVMSLILWGLDGILVRLVS 120

Query: 119 FITGV 123
FITG+
Sbjct: 121 FITGL 125


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0191  TCRTETOQM     602    0.0      Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family

signature.
Length = 639

Score = 602 bits (1554), Expect = 0.0
Identities = 172/680 (25%), Positives = 295/680 (43%), Gaps = 71/680 (10%)

Query: 9 RYRNIGICAHVDAGKTTTTERVLFYTGLSHKIGEVHDGAATTDWMVQEQERGITITSAAV 68
+ NIG+ AHVDAGKTT TE +L+ +G ++G V G TD + E++RGITI +
Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61

Query: 69 TTFWRGMDAQFTEHRINIIDTPGHVDFTIEVERSLRVLDGAVVVFCGSSGVEPQSETVWR 128
+ W ++NIIDTPGH+DF EV RSL VLDGA+++ GV+ Q+ ++
Sbjct: 62 SFQWEN-------TKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFH 114

Query: 129 QADKYRVPRLVFVNKMDRAGADFERVVKQIRTRLGATCVPIQLNIGAEENFTGVIDLIKM 188
K +P + F+NK+D+ G D V + I+ +L A V Q V M
Sbjct: 115 ALRKMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQ----------KVELYPNM 164

Query: 189 KAINWNEADQGMTFTYEEIPASLAAKAAEMHEYLVEAAAEASDELMDKYLEEGTLSEDEI 248
N+ E++Q + E +D+L++KY+ +L E+
Sbjct: 165 CVTNFTESEQ------------------------WDTVIEGNDDLLEKYMSGKSLEALEL 200

Query: 249 KKALRQRTINNEIVLATCGSAFKNKGVQAVLDAVVEFLPAPVDVPPIKGIDDDEQEVERP 308
++ R N + GSA N G+ +++ + +
Sbjct: 201 EQEESIRFHNCSLFPVYHGSAKNNIGIDNLIEVITNKFYSSTH----------------- 243

Query: 309 SDDNAPFAALAFKIATDPFVGTLTFIRVYSGVLESGSGVYNSVKQKRERIGRIVQMHAND 368
+ FKI L +IR+YSGVL V S K+K +I + +
Sbjct: 244 -RGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKEK-IKITEMYTSINGE 301

Query: 369 RTELKEVRAGDIAAAIG--LK-EVTTGDTLCDPDHKVILERMEFPEPVITIAVEPKSKAD 425
++ + +G+I LK GDT P ER+E P P++ VEP
Sbjct: 302 LCKIDKAYSGEIVILQNEFLKLNSVLGDTKLLPQR----ERIENPLPLLQTTVEPSKPQQ 357

Query: 426 QDKMGIALQKLAAEDPSFRVETDEESSQTLISGMGELHLDIIVDRMRREFGVECNVGKPQ 485
++ + AL +++ DP R D + + ++S +G++ +++ ++ ++ VE + +P
Sbjct: 358 REMLLDALLEISDSDPLLRYYVDSATHEIILSFLGKVQMEVTCALLQEKYHVEIEIKEPT 417

Query: 486 VAYRETIRASVEAEGKFVRQSGGRGQFGHVWLKLEPNEEGAGYEFINAIVGGVVPREFIP 545
V Y E R +AE + + + L + P G+G ++ +++ G + + F
Sbjct: 418 VIYME--RPLKKAEYTIHIEVPPNPFWASIGLSVSPLPLGSGMQYESSVSLGYLNQSFQN 475

Query: 546 AVDKGIQEQMKNGVLAGFPVLDVKVTLFDGSYHDVDSNEMAFKIAGSMGFKKGALEANPV 605
AV +GI+ + G L G+ V D K+ G Y+ S F++ + ++ +A
Sbjct: 476 AVMEGIRYGCEQG-LYGWNVTDCKICFKYGLYYSPVSTPADFRMLAPIVLEQVLKKAGTE 534

Query: 606 LLEPCMKVEVTTPENYMGDVVGDLNRRRGLIEGMDDGFGGIKIVHAVVPLSEMFGYATDL 665
LLEP + ++ P+ Y+ D + I I+ +P + Y +DL
Sbjct: 535 LLEPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQLK-NNEVILSGEIPARCIQEYRSDL 593

Query: 666 RSATQGRASYSMEFLKYSDA 685
T GR+ E Y
Sbjct: 594 TFFTNGRSVCLTELKGYHVT 613


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0192  TCRTETOQM     83     3e-19    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family

signature.
Length = 639

Score = 83.0 bits (205), Expect = 3e-19
Identities = 55/199 (27%), Positives = 92/199 (46%), Gaps = 5/199 (2%)

Query: 13 VNVGTIGHVDHGKTTLTAAI--SHVLAKTYGGEAKDFSQIDNAPEERERGITINTSHIEY 70
+N+G + HVD GKTTLT ++ + G K ++ DN ER+RGITI T +
Sbjct: 4 INIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITSF 63

Query: 71 DTPTRHYAHVDCPGHADYVKNMITGAAQMDGAILVVASTDGPMPQTREHILLSRQVGVPF 130
+D PGH D++ + + +DGAIL++++ DG QTR R++G+P
Sbjct: 64 QWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGIP- 122

Query: 131 IIVFMNKCDMVDDAELLELVEMEVRELLSEYDFPGDDLPVIQGSALKALEGEPEWEAKII 190
I F+NK D L V +++E LS + + + +W+ I
Sbjct: 123 TIFFINKIDQN--GIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIE 180

Query: 191 ELAEALDSYIPEPERDIDK 209
+ L+ Y+ + +
Sbjct: 181 GNDDLLEKYMSGKSLEALE 199


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0205  PF06872       27     0.018    EspG protein
		>PF06872#EspG protein

Length = 398

Score = 26.6 bits (58), Expect = 0.018
Identities = 11/30 (36%), Positives = 17/30 (56%)

Query: 75 PVTGKADRVGFRFEDGKKVRFFKSNSELVK 104
P + RV +F DG +R +NSEL++
Sbjct: 87 PAHNELGRVYAKFSDGSSLRISVTNSELIE 116


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0214  SECYTRNLCASE  464    e-164    Preprotein translocase SecY subunit signature.
		>SECYTRNLCASE#Preprotein translocase SecY subunit signature.

Length = 437

Score = 464 bits (1196), Expect = e-164
Identities = 180/428 (42%), Positives = 258/428 (60%), Gaps = 14/428 (3%)

Query: 16 SELKARLLFVIGAIIVFRAGSFVPIPGIDAAVLAELFAQQKGT--ILGMFNMFSGGALSR 73
+L+ +LLF + I+V+R G+ +PIPG+D + + + G + G+ NMFSGGAL +
Sbjct: 12 PDLRKKLLFTLAIIVVYRVGTHIPIPGVDYKNVQQCVREASGNQGLFGLVNMFSGGALLQ 71

Query: 74 ASIFALGIMPYISASIIMQLLTVVHPALAELKKEGESGRKKISQYTRWGTLVLGTFQSIG 133
+IFALGIMPYI+ASII+QLLTVV P L LKKEG++G KI+QYTR+ T+ L Q G
Sbjct: 72 ITIFALGIMPYITASIILQLLTVVIPRLEALKKEGQAGTAKITQYTRYLTVALAILQGTG 131

Query: 134 IATGLPN--------LVPGLVVNIGFGFYFVAVVSLVTGTMFLMWLGEQITERGIGNGIS 185
+ + + +V + V+ + GT +MWLGE IT+RGIGNG+S
Sbjct: 132 LVATARSAPLFGRCSVGGQIVPDQSIFTTITMVICMTAGTCVVMWLGELITDRGIGNGMS 191

Query: 186 ILIFAGIVAGLPSAIGQTAEQARQGDLNVLVLLLLAVIIFAVTYFVVFVERGQRRIVVNY 245
IL+F I A PSA+ +Q + ++AV + + VVFVE+ QRRI V Y
Sbjct: 192 ILMFISIAATFPSALWAIKKQGTLAGGWIEFGTVIAVGLI-MVALVVFVEQAQRRIPVQY 250

Query: 246 AKRQQGRKVFAAQSTHLPLKINMAGVIPPIFASSIILFPGTLAQWFGQNESMSWLSDFSL 305
AKR GR+ + ST++PLK+N AGVIP IFASS++ P +AQ+ G N + +L
Sbjct: 251 AKRMIGRRSYGGTSTYIPLKVNQAGVIPVIFASSLLYIPALVAQFAGGNSGWKSWVEQNL 310

Query: 306 AVSPGQPLYSLLYAAAIIFFCFFYTALVFNPRETADNLKKSGAFIPGIRPGEQTSRYIDK 365
P+Y + Y I+FF FFY A+ FNP E ADN+KK G FIPGIR G T+ Y+
Sbjct: 311 T-KGDHPIYIVTYFLLIVFFAFFYVAISFNPEEVADNMKKYGGFIPGIRAGRPTAEYLSY 369

Query: 366 VMTRLTLAGALYITFICLIPEFMLIAWKV--QFYFGGTSLLIMVVVIMDFMAQVQTHMMS 423
V+ R+T G+LY+ I L+P L+ + F FGGTS+LI+V V ++ + Q+++ +
Sbjct: 370 VLNRITWPGSLYLGLIALVPTMALVGFGASQNFPFGGTSILIIVGVGLETVKQIESQLQQ 429

Query: 424 HQYESVMK 431
YE ++
Sbjct: 430 RNYEGFLR 437


3     Shewmr7_0279  Shewmr7_0284  Y     Y          N                   Pathogenicity Island (biased-composition)

LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr7_0279  2       17       0.058692   hypothetical protein
Shewmr7_0280  2       17       0.157075   redoxin domain-containing protein
Shewmr7_0281  2       20       -0.080627  hypothetical protein
Shewmr7_0282  3       21       -1.050637  competence/damage-inducible protein CinA
Shewmr7_0283  2       21       -2.350932  hypothetical protein
Shewmr7_0284  2       20       -2.602857  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0282  ECOLNEIPORIN  52     1e-09    E.coli/Neisseria porin superfamily signature.
		>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature.

Length = 331

Score = 52.1 bits (125), Expect = 1e-09
Identities = 56/323 (17%), Positives = 117/323 (36%), Gaps = 40/323 (12%)

Query: 48 LSLYGSVRPTLVYSDSSLTDD---------WDVGDALSRIGIKGKTEFAPGWALLAQGEW 98
++LYG+++ + S S + + D S+IG KG+ + G + Q E
Sbjct: 21 VTLYGTIKAGVETSRSVAHNGAQAASVETGTGIVDLGSKIGFKGQEDLGNGLKAIWQVEQ 80

Query: 99 KINLNGDGSFGDARLAFAGVSSPWGQFTLGRQRPVQYSLVAEYTDIFNNANSPFAYNQES 158
K ++ G S R +F G+ +G+ +GR S++ + DI N +S Y +
Sbjct: 81 KASIAGTDSGWGNRQSFIGLKGGFGKLRVGR----LNSVLKDTGDI-NPWDSKSDYLGVN 135

Query: 159 PFFTDNLALYQAKI---GYFTLMGAAQF--KSEEANSGADMLNAGIGFDWQQLHLGLSYL 213
L + + L G+ Q+ ++ +AG + +
Sbjct: 136 KIAEPEARLISVRYDSPEFAGLSGSVQYALNDNAGRHNSESYHAGFNYKNGGFFVQYGGA 195

Query: 214 EQDTFESNLNTGKENTTGFAMAYSF-TNGVYLAVAYQYKDYQL--------NQGEDREGY 264
+ + N E + + + +Y +VA Q +D +L +Q E
Sbjct: 196 YKRHHQVQENVNIEKYQIHRLVSGYDNDALYASVAVQQQDAKLVEENYSHNSQTEVAATL 255

Query: 265 SLDSALALPLGPDYKLKLGYFQ-FD---DGVSDPSSLNYQGVNTTIEWSPTPNVRLHAEY 320
+ P ++ Y F D + + + V ++S + + A +
Sbjct: 256 AYRFGNVTP-------RVSYAHGFKGSFDATNYNNDYDQVVVGAEYDFSKRTSALVSAGW 308

Query: 321 LYQDND-NRDTDNSIAIGVRYDF 342
L + ++ + +G+R+ F
Sbjct: 309 LQEGKGESKFVSTAGGVGLRHKF 331


4     Shewmr7_0335  Shewmr7_0347  Y     Y          N                   Pathogenicity Island (biased-composition)

LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr7_0335  1       22       -3.644842  hypothetical protein
Shewmr7_0336  0       22       -3.188983  hypothetical protein
Shewmr7_0337  2       22       -3.995650  hypothetical protein
Shewmr7_0338  2       23       -4.831749  hypothetical protein
Shewmr7_0339  1       21       -4.534707  porin
Shewmr7_0340  1       21       -3.510999  hypothetical protein
Shewmr7_0341  1       23       -3.688506  putrescine transporter
Shewmr7_0342  2       25       -4.097474  ornithine decarboxylase
Shewmr7_0343  4       27       -3.314968  hypothetical protein
Shewmr7_0345  4       26       -1.294917  PBP family phospholipid-binding protein
Shewmr7_0346  3       20       -0.637115  hypothetical protein
Shewmr7_0347  3       20       -0.118405  AraC family transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0340  PHPHTRNFRASE  28     0.047    Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase

signature.
Length = 572

Score = 28.2 bits (63), Expect = 0.047
Identities = 22/96 (22%), Positives = 43/96 (44%), Gaps = 22/96 (22%)

Query: 70 RPAVQLTQEGLDAISELQQTPKGNLRISVPMVFGRLYIAPLIAEFLKRYPDIQLQMQMDD 129
+ + TQ L A+ L+ + GNL++ PM+ + E Q + M +
Sbjct: 367 KQDIFRTQ--LRAL--LRASTYGNLKVMFPMI-------ATLEEL------RQAKAIMQE 409

Query: 130 KTTDLIAGGFDLA--IRIG---ELPDSSLIARKIAP 160
+ L++ G D++ I +G E+P +++ A A
Sbjct: 410 EKDKLLSEGVDVSDSIEVGIMVEIPSTAVAANLFAK 445


5     Shewmr7_0421  Shewmr7_0447  Y     Y          Y                   Pathogenicity Island (biased-composition)

LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr7_0421  0       19       3.278697   hypothetical protein
Shewmr7_0422  0       18       2.955209   hypothetical protein
Shewmr7_0423  1       18       2.634333   hypothetical protein
Shewmr7_0424  2       18       1.879409   superfamily II helicase
Shewmr7_0425  1       15       -3.825042  phage transcriptional regulator, AlpA
Shewmr7_0426  1       20       -5.055821  hypothetical protein
Shewmr7_0427  0       21       -5.707208  hypothetical protein
Shewmr7_0428  -1      16       -4.414626  phage integrase family protein
Shewmr7_0429  0       18       -4.401185  hypothetical protein
Shewmr7_0430  0       22       -5.158379  ribonuclease PH
Shewmr7_0431  -1      19       0.636099   orotate phosphoribosyltransferase
Shewmr7_0432  -3      15       1.094354   GTP cyclohydrolase I
Shewmr7_0433  -1      14       1.700677   peptidase S9 prolyl oligopeptidase
Shewmr7_0434  1       14       1.318169   hypothetical protein
Shewmr7_0435  1       15       1.520956   nucleoid occlusion protein
Shewmr7_0436  2       16       1.553352   deoxyuridine 5'-triphosphate
Shewmr7_0437  0       17       1.645891   phosphopantothenate-cysteine ligase /
Shewmr7_0438  0       21       1.264003   DNA repair protein RadC
Shewmr7_0439  -1      21       0.644169   50S ribosomal protein L28
Shewmr7_0440  -1      18       0.079945   50S ribosomal protein L33
Shewmr7_0441  -1      17       -1.315110  N-acetylglutamate synthase
Shewmr7_0442  -1      13       -0.861990  hypothetical protein
Shewmr7_0443  2       18       -1.575104  transporter DMT superfamily protein
Shewmr7_0444  2       20       -1.461697  hypothetical protein
Shewmr7_0445  2       21       -1.496914  ATP-dependent DNA helicase RecQ
Shewmr7_0446  1       18       -0.899245  tetratricopeptide domain-containing protein
Shewmr7_0447  2       17       -0.493913  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0421  RTXTOXIND     40     1e-05    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 39.8 bits (93), Expect = 1e-05
Identities = 45/221 (20%), Positives = 81/221 (36%), Gaps = 36/221 (16%)

Query: 109 AQIHELEKQLSQLELNNLSLNAEILTQLQQRIDVAAEGVTRQNGLLDSFERYQRKGVVPT 168
Q + Q Q ELN AE LT L + ++ LD F K +
Sbjct: 193 EQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSR-LDDFSSLLHKQAIAK 251

Query: 169 ----------ADMAAVLQAHTASKMALE----QAKVDLMQARQAQKTELLAGPIAQSKYN 214
+ L+ + + +E AK + Q K E+L + Q+ N
Sbjct: 252 HAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEIL-DKLRQTTDN 310

Query: 215 ---VELQLARLKAQESQLDIKALTPTRVVDV-------LVQAGEHIVEDRPLVLLSGREA 264
+ L+LA+ + ++ I+A +V + +V E ++ P +
Sbjct: 311 IGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPE-----DDT 365

Query: 265 AVIFAYLEPKYLEYTAIGQEATIKLP--NGTR---LRGEIS 300
+ A ++ K + + +GQ A IK+ TR L G++
Sbjct: 366 LEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVK 406


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0424  NUCEPIMERASE  70     9e-16    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 69.8 bits (171), Expect = 9e-16
Identities = 53/234 (22%), Positives = 88/234 (37%), Gaps = 31/234 (13%)

Query: 3 NIMVTGATGLLGRAVVKQLTAAGHRVIA---------TGFSRAEAGI--------HRLDL 45
+VTGA G +G V K+L AGH+V+ +A + H++DL
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 46 TQAAEVEAFIAREQPEVIVHCAAERRPDVSERSPEHALALNLSASQTLAEVAKTHQ-AWL 104
+ A E + S +P NL+ + E + ++ L
Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQHL 121

Query: 105 LYISTDYVF-DGTTPPYAEDAEPN-PVNFYGASKLQGETCVLSTDNGFAV----LRLPIL 158
LY S+ V+ P++ D + PV+ Y A+K E + + + + LR +
Sbjct: 122 LYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFFTV 181

Query: 159 YGEVTQLNESAVLVLINQLLDGRPQRV----DHWAIRAPTSTADIANAIAKLIQ 208
YG + A+ +L+G+ V R T DIA AI +L
Sbjct: 182 YGP-WGRPDMALFKFTKAMLEGKSIDVYNYGKMK--RDFTYIDDIAEAIIRLQD 232


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0426  HTHFIS        87     2e-22    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 86.8 bits (215), Expect = 2e-22
Identities = 26/110 (23%), Positives = 57/110 (51%)

Query: 3 RLLIIEDDQALAGVLARRLTRHGFECRLSHDASNALLVAREFCPTHILLDMKLAEANGLG 62
+L+ +DD A+ VL + L+R G++ R++ +A+ ++ D+ + + N
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 63 LIVPLRNLLPKVIMVLLTGYASIATAVEAIRLGADNYLAKPVDTQTLLAA 112
L+ ++ P + +++++ + TA++A GA +YL KP D L+
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGI 114



Score = 48.3 bits (115), Expect = 3e-09
Identities = 19/58 (32%), Positives = 29/58 (50%), Gaps = 2/58 (3%)

Query: 116 NSQASALPEDEIDDSPLTPKRLEWEHIQQVLNANQGNVSATARQLGMHRRTLQRKLLK 173
S ALP + D L +E+ I L A +GN A LG++R TL++K+ +
Sbjct: 417 ASFGDALPPSGLYDRVL--AEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRE 472


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0428  DHBDHDRGNASE  43     3e-07    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 43.1 bits (101), Expect = 3e-07
Identities = 37/195 (18%), Positives = 74/195 (37%), Gaps = 29/195 (14%)

Query: 55 LEEEIKQLSQNIPQLDWLINCIGMLHTEDKGPEKSLQALDGDFLQHNIQLNTLPSMMLAK 114
++E ++ + + +D L+N G+L G SL + + +N+ ++
Sbjct: 72 IDEITARIEREMGPIDILVNVAGVLRP---GLIHSLSDEE---WEATFSVNSTGVFNASR 125

Query: 115 HFETALKRSVSVRFAVVSAKVGSISDNRLGGWYSYRASKAALNMFLKTLSIEWQRSMKHC 174
+ S V + + + +Y +SKAA MF K L +E C
Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMA---AYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 175 VVLALHPGTTDTRLSKP------------------FQQNVPKEKLFTPEYVAQCLVSIIA 216
+++ PG+T+T + F+ +P +KL P +A ++ +++
Sbjct: 183 NIVS--PGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVS 240

Query: 217 NATPAQTGSFLAYDG 231
T L DG
Sbjct: 241 GQAGHITMHNLCVDG 255


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0433  HTHFIS        31     0.015    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 30.6 bits (69), Expect = 0.015
Identities = 11/36 (30%), Positives = 18/36 (50%), Gaps = 3/36 (8%)

Query: 49 TPKNILMIGPTGVGKTEIAR---RLAKLANAPFIKV 81
T +++ G +G GK +AR K N PF+ +
Sbjct: 159 TDLTLMITGESGTGKELVARALHDYGKRRNGPFVAI 194


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0435  BCTERIALGSPG  48     2e-09    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G

signature.
Length = 145

Score = 48.0 bits (114), Expect = 2e-09
Identities = 18/59 (30%), Positives = 33/59 (55%)

Query: 1 MSRLHTSKGFTLIELVVVIIILGILAVVAAPRFINLSQDAHDARAKAAFAAFTSGVKLY 59
M +GFTL+E++VVI+I+G+LA + P + + A +A + A + + +Y
Sbjct: 1 MRATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMY 59


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0436  HTHFIS        84     3e-21    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 83.7 bits (207), Expect = 3e-21
Identities = 28/125 (22%), Positives = 51/125 (40%)

Query: 7 LYLVDDDEAILDSLGFMLGQFGYQVQTFNSGRNFLAQCPLTQAGCVILDSRMPEITGQEV 66
+ + DDD AI L L + GY V+ ++ V+ D MP+ ++
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 67 QQKLLETQSPLGVIFLTGHGDLPMALSAFRKGACDFFQKPVSGKALVQAIEKAHRESQAN 126
++ + + L V+ ++ A+ A KGA D+ KP L+ I +A E +
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRR 125

Query: 127 FEQQS 131
+
Sbjct: 126 PSKLE 130


6     Shewmr7_0497  Shewmr7_0512  Y     N          N                   Genomic Island

LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr7_0497  1       19       -3.347670  hypothetical protein
Shewmr7_0498  1       27       -6.626740  hypothetical protein
Shewmr7_0499  1       23       -5.512745  hypothetical protein
Shewmr7_0500  2       27       -7.768272  DedA family protein
Shewmr7_0501  2       24       -7.903306  putative manganese-dependent inorganic
Shewmr7_0502  2       17       -4.237831  hypothetical protein
Shewmr7_0503  1       16       -0.724152  TonB-dependent receptor
Shewmr7_0504  0       18       2.082116   hypothetical protein
Shewmr7_0505  0       18       2.440810   hypothetical protein
Shewmr7_0506  0       19       2.613001   ATP-dependent protease La
Shewmr7_0507  -1      17       3.139808   pseudouridine synthase
Shewmr7_0508  -2      18       3.114389   hypothetical protein
Shewmr7_0509  -1      18       2.863704   glycosyl transferase family protein
Shewmr7_0510  -1      18       2.916199   hypothetical protein
Shewmr7_0511  0       17       2.776125   hypothetical protein
Shewmr7_0512  0       18       3.057128   hypothetical protein

7     Shewmr7_0554  Shewmr7_0568  Y     Y          N                   Pathogenicity Island (biased-composition)

LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr7_0554  0       27       -3.789611  primosomal protein N'
Shewmr7_0555  1       26       -4.030895  hypothetical protein
Shewmr7_0556  0       26       -5.113222  hypothetical protein
Shewmr7_0557  1       23       -5.016449  50S ribosomal protein L31
Shewmr7_0558  1       19       -4.368490  malate dehydrogenase
Shewmr7_0559  0       15       -2.906974  regulatory protein CsrD
Shewmr7_0560  0       12       -1.851874  regulatory protein CsrD
Shewmr7_0561  -1      11       0.305142   biogenesis protein MshI
Shewmr7_0562  -1      14       2.024826   hypothetical protein
Shewmr7_0563  -1      20       3.428710   hypothetical protein
Shewmr7_0564  0       23       4.335900   MSHA biogenesis protein MshJ
Shewmr7_0565  0       22       3.699118   MSHA biogenesis protein MshK
Shewmr7_0566  -1      23       3.997651   hypothetical protein
Shewmr7_0567  -2      22       3.669887   pilus (MSHA type) biogenesis protein MshL
Shewmr7_0568  -2      21       3.045964   MSHA biogenesis protein MshM
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0560  INTIMIN       32     0.027    Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 31.6 bits (71), Expect = 0.027
Identities = 41/175 (23%), Positives = 67/175 (38%), Gaps = 18/175 (10%)

Query: 1038 YDPNGQFNDLVKGETANEVFSYTITDEIGAT--STTEVTISVVGINAAP-VAVADTAVTT 1094
YD + N+++ ++ S I +I T ST ++ + V + D+A+ +
Sbjct: 435 YDLVQRNNNIILEYKKQDILSLNIPHDINGTERSTQKIQLIVKSKYGLDRIVWDDSALRS 494

Query: 1095 KSGSIQIDLLANDTD------------ADGDTLTITAIDVGSLKGKVTNNNDGTVTYSPN 1142
+ G IQ + D ++ +T A D G +NN T+T N
Sbjct: 495 QGGQIQHSGSQSAQDYQAILPAYVQGGSNVYKVTARAYDR---NGNSSNNVLLTITVLSN 551

Query: 1143 GQFGHLYQGQSATETFTYTISDGDAEMTASVTVTINGEGQAPVEPEKEGSSGGSL 1197
GQ T T +DG +T + TV NG QA V SG ++
Sbjct: 552 GQVVDQVGVTDFTADKTSAKADGTEAITYTATVKKNGVAQANVPVSFNIVSGTAV 606


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0564  HTHFIS        33     0.002    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 32.9 bits (75), Expect = 0.002
Identities = 37/145 (25%), Positives = 62/145 (42%), Gaps = 17/145 (11%)

Query: 30 RTVIRSLILGLLCSGHVLLEGLPGTAKTRSVKAL------ANALAISFGRIQFTPDLLPS 83
+ + R L + +++ G GT K +AL N ++ DL+ S
Sbjct: 147 QEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIES 206

Query: 84 DVTGTE--VLHEAEGKSTLRFQP---GPVFNQIVLADEINRAPAKVQAALLEAMAEGTIT 138
++ G E A+ +ST RF+ G +F DEI P Q LL + +G T
Sbjct: 207 ELFGHEKGAFTGAQTRSTGRFEQAEGGTLF-----LDEIGDMPMDAQTRLLRVLQQGEYT 261

Query: 139 -VAGQTHVLPELFMVLATQNPIEQE 162
V G+T + ++ +V AT ++Q
Sbjct: 262 TVGGRTPIRSDVRIVAATNKDLKQS 286


8     Shewmr7_0616  Shewmr7_0626  Y     Y          N                   Pathogenicity Island (biased-composition)

LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr7_0616  2       18       4.013805   phosphoribosylaminoimidazole-succinocarboxamide
Shewmr7_0617  -1      13       3.689084   hypothetical protein
Shewmr7_0618  -2      14       3.446334   pentapeptide repeat-containing protein
Shewmr7_0619  0       24       3.489565   twin-arginine translocation pathway signal
Shewmr7_0620  0       21       2.288387   hypothetical protein
Shewmr7_0621  0       27       -4.174429  4Fe-4S ferredoxin iron-sulfur binding
Shewmr7_0622  -1      31       -5.267703  polysulfide reductase, NrfD
Shewmr7_0623  -1      33       -6.170332  hypothetical protein
Shewmr7_0624  -2      30       -6.070307  hypothetical protein
Shewmr7_0625  0       27       -7.904257  transcriptional repressor protein MetJ
Shewmr7_0626  0       27       -7.542251  cystathionine gamma-synthase
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0616  DHBDHDRGNASE  49     4e-09    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 48.9 bits (116), Expect = 4e-09
Identities = 53/257 (20%), Positives = 98/257 (38%), Gaps = 31/257 (12%)

Query: 5 IIITGVGKRIGYALAKHLLAQGHKVIG-----TYRSHYPSIDELQSLGATLIQCDFYDNA 59
ITG + IG A+A+ L +QG + S + ++ A D D+A
Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDSA 70

Query: 60 RLQTLIEQL-SQYPKIRAIIHNASDWLPDNSPSLAAHEVMQRMMQVHVSVPYQMNLALAS 118
+ + ++ + I +++ A P SL+ E + V+ + + + +++
Sbjct: 71 AIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEW-EATFSVNSTGVFNASRSVSK 129

Query: 119 QLRAGAEGEIG--ASDIIHFTDYVAEKGSAKHMAYAASKAALDNLTLSFAAQLAP-GVKV 175
+ G I S+ V A AYA+SKAA T +LA ++
Sbjct: 130 YMMDRRSGSIVTVGSNPAG----VPRTSMA---AYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 176 NAIAPAMI-------LFNPSDDEAYRQKTLAKAI-----LPKEAGNQEIIALVDYLLASR 223
N ++P L+ + K + L K A +I V +L++ +
Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242

Query: 224 --YVTGRSHNVDGGRHL 238
++T + VDGG L
Sbjct: 243 AGHITMHNLCVDGGATL 259


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0619  2FE2SRDCTASE  28     0.019    Ferric iron reductase signature.
		>2FE2SRDCTASE#Ferric iron reductase signature.

Length = 262

Score = 28.1 bits (62), Expect = 0.019
Identities = 18/73 (24%), Positives = 29/73 (39%), Gaps = 9/73 (12%)

Query: 73 FQDSVARLSDFEFGFMPLLPEEEEPLSQRVEALSLWTQSFLTGIAIIQPKLNKASAEVRE 132
+A SD + P++ E +PL +SLW Q + I ++ P L A +
Sbjct: 66 LSSLLAVYSDHIYRNQPMMIRENKPL------ISLWAQWY---IGLMVPPLMLALLTQEK 116

Query: 133 VIKDLAEIAQVEF 145
+ E EF
Sbjct: 117 ALDVSPEHFHAEF 129


9     Shewmr7_0654  Shewmr7_0667  Y     Y          N                   Pathogenicity Island (biased-composition)

LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr7_0654  0       12       3.121947   hypothetical protein
Shewmr7_0655  2       12       3.808511   peptidase M16 domain-containing protein
Shewmr7_0656  2       11       3.855508   hypothetical protein
Shewmr7_0657  2       11       3.576369   hypothetical protein
Shewmr7_0658  0       13       3.495615   hypothetical protein
Shewmr7_0659  -2      14       3.143214   hypothetical protein
Shewmr7_0660  -2      12       2.415144   MltD domain-containing protein
Shewmr7_0661  -2      14       1.490530   hypothetical protein
Shewmr7_0662  -1      14       1.654987   serine/threonine protein kinase
Shewmr7_0663  -1      13       2.032860   Sel1 domain-containing protein
Shewmr7_0664  0       16       1.527472   hypothetical protein
Shewmr7_0665  1       15       2.698222   alcohol dehydrogenase
Shewmr7_0666  0       19       3.584909   hypothetical protein
Shewmr7_0667  1       19       3.592286   aldose 1-epimerase
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0657  DHBDHDRGNASE  117    1e-33    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 117 bits (294), Expect = 1e-33
Identities = 76/258 (29%), Positives = 120/258 (46%), Gaps = 6/258 (2%)

Query: 34 LKGKVGLITGSTSGIGLATAHVLAEQGCHLILHGLMPEAEGQCLAADFAEQYHINTFFSN 93
++GK+ ITG+ GIG A A LA QG H+ PE + +++ AE H F
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAF--P 63

Query: 94 ADLRDPESIHAFMDAGVNALGSIDILVNNAGIQHTENVAHFPIDKWNDIIAINLSSAFHT 153
AD+RD +I +G IDILVN AG+ + ++W ++N + F+
Sbjct: 64 ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNA 123

Query: 154 IQQVVPAMAEKRWGRIINIASVHGLVASVNKAAYCAAKHGIVGLTKVVAIECAEQGITVN 213
+ V M ++R G I+ + S V + AAY ++K V TK + +E AE I N
Sbjct: 124 SRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCN 183

Query: 214 AICPGWVDTPLINK-QIEAIASNKGLSYDEAKYQLVTAKQPLPEMLDPRQIGEFVLFLCS 272
+ PG +T + + + + + ++ PL ++ P I + VLFL S
Sbjct: 184 IVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGI---PLKKLAKPSDIADAVLFLVS 240

Query: 273 SAARGITGASLAMDGAWT 290
A IT +L +DG T
Sbjct: 241 GQAGHITMHNLCVDGGAT 258


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr7_0662  TCRTETB  30     0.022    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 29.8 bits (67), Expect = 0.022
Identities = 26/107 (24%), Positives = 40/107 (37%), Gaps = 4/107 (3%)

Query: 252 GIVGTIAGILYSRKQPLRLPIIRLSGLLIFLTVLGLSFGSAPWLQTLCAI-VLGFCIFLP 310
I G I GIL R+ PL + I ++ L + + W T+ + VLG F
Sbjct: 307 IIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTK 366

Query: 311 VTALVSIPHELPKMTSQKITVIFSLFWSISYLISTLVLWLFGKLVDI 357
+ L Q+ SL S+L + + G L+ I
Sbjct: 367 TVISTIVSSSL---KQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSI 410


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr7_0663  HTHFIS  39     9e-05    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 39.0 bits (91), Expect = 9e-05
Identities = 17/89 (19%), Positives = 34/89 (38%), Gaps = 6/89 (6%)

Query: 1052 ISVLVIDNDELMLKAISSLLLGWGCHVLTARDKACAELQLAQQVLPKLIIADYHLDDDQN 1111
++LV D+D + ++ L G V + A + L++ D + D+ N
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSN-AATLWRWIAAGDGDLVVTDVVMPDE-N 61

Query: 1112 GVDLVQSLLTHPVFSRQRPTCIICSADPS 1140
DL+ + R ++ SA +
Sbjct: 62 AFDLLPRIKKA----RPDLPVLVMSAQNT 86


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr7_0664  HTHFIS  66     1e-14    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 65.6 bits (160), Expect = 1e-14
Identities = 41/167 (24%), Positives = 68/167 (40%), Gaps = 8/167 (4%)

Query: 1 MSQIKVAIADDHPLFRTALTQAVLKNVNTADVLEAENFQELISIVENNPDIELIFLDLHM 60
M+ + +ADD RT L QA L DV N L + +L+ D+ M
Sbjct: 1 MTGATILVADDDAAIRTVLNQA-LSRAG-YDVRITSNAATLWRWIAAGD-GDLVVTDVVM 57

Query: 61 PGNEGFTGLTLLQNHFPDIAVIMVSSDDQPEIIRKAINFGASAFIPKSASLTQISTAIAT 120
P F L ++ PD+ V+++S+ + KA GA ++PK LT++ I
Sbjct: 58 PDENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117

Query: 121 VLEGEVWLPEHTDINVDQQ-----TAAEHQRLAKQLAQLTPQQYTVL 162
L P + + +A Q + + LA+L T++
Sbjct: 118 ALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLM 164


10  Shewmr7_0686  Shewmr7_0748  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias    %GCBias  Product
Shewmr7_0686       2       28  -2.054790  XRE family transcriptional regulator
Shewmr7_0687       2       24  -0.230216  HipA domain-containing protein
Shewmr7_0688      -1       15   0.517129  hypothetical protein
Shewmr7_0689      -2       14   0.070028  hypothetical protein
Shewmr7_0690       0       14  -0.129879  putative outer membrane adhesin like protein
Shewmr7_0691       1       12   0.227195  hypothetical protein
Shewmr7_0692       1       13   0.125288  LysR family transcriptional regulator
Shewmr7_0693       1       18  -1.971434  sulfatase
Shewmr7_0696       2       18  -3.200379  hypothetical protein
Shewmr7_0697       1       16  -1.832414  radical SAM domain-containing protein
Shewmr7_0698       1       20  -2.223843  ATPase
Shewmr7_0699       5       24  -1.233870  hypothetical protein
Shewmr7_0700       6       26   0.158739  hypothetical protein
Shewmr7_0701       8       32   3.389327  hypothetical protein
Shewmr7_0702       9       30   2.464926  von Willebrand factor, type A
Shewmr7_0703       8       32   1.527911  von Willebrand factor, type A
Shewmr7_0704       9       34   0.142497  hypothetical protein
Shewmr7_0705       6       31  -3.512182  putative nonspecific acid phosphatase precursor
Shewmr7_0706       5       28  -3.230489  hypothetical protein
Shewmr7_0707       4       24   1.240383  4-carboxymuconolactone decarboxylase
Shewmr7_0708       5       25   1.737926  hypothetical protein
Shewmr7_0709       5       25   1.701042  hypothetical protein
Shewmr7_0710       6       26   2.301620  hypothetical protein
Shewmr7_0711       6       25   1.789613  catalase/peroxidase HPI
Shewmr7_0712       6       22   1.580584  hypothetical protein
Shewmr7_0713       6       22  -4.130005  hypothetical protein
Shewmr7_0714       3       21  -2.380775  ABC transporter-like protein
Shewmr7_0715       3       22  -0.310857  hypothetical protein
Shewmr7_0716       2       20  -0.159363  hypothetical protein
Shewmr7_0717       2       19   1.382907  thioredoxin domain-containing protein
Shewmr7_0718       4       19   2.158185  peptidase S8/S53 subtilisin kexin sedolisin
Shewmr7_0719       4       20   2.594806  hypothetical protein
Shewmr7_0720       3       23   3.076105  cold-shock DNA-binding protein family protein
Shewmr7_0721       3       36   2.459053  diguanylate cyclase with PAS/PAC sensor
Shewmr7_0722       5       31   1.939300  hypothetical protein
Shewmr7_0723       5       32   2.600169  hypothetical protein
Shewmr7_0724       4       31   1.696976  hypothetical protein
Shewmr7_0725       5       32   1.539355  alkaline phosphatase
Shewmr7_0726       5       31   0.785861  hypothetical protein
Shewmr7_0727       6       31   1.598103  alkaline phosphatase
Shewmr7_0728       4       35   3.858548  hypothetical protein
Shewmr7_0729       6       34   2.814374  hypothetical protein
Shewmr7_0730       4       34   2.994459  hypothetical protein
Shewmr7_0731       4       36   2.942221  AraC family transcriptional regulator
Shewmr7_0732       5       39   3.133152  lysine exporter protein LysE/YggA
Shewmr7_0733       6       38   3.862655  hypothetical protein
Shewmr7_0734       6       39   3.130001  hypothetical protein
Shewmr7_0735       4       30  -1.477948  hypothetical protein
Shewmr7_0736       4       30  -1.083263  isochorismatase hydrolase
Shewmr7_0737       3       27  -1.963659  nitrogen regulatory protein P-II
Shewmr7_0738       2       26  -1.398557  Rh family protein/ammonium transporter
Shewmr7_0739       2       26  -2.756920  hypothetical protein
Shewmr7_0740       1       23  -4.255077  hypothetical protein
Shewmr7_0741       2       15   0.046734  TPR repeat-containing protein
Shewmr7_0742       0       14  -0.842896  hypothetical protein
Shewmr7_0743       0       15  -1.467908  phospho-2-dehydro-3-deoxyheptonate aldolase
Shewmr7_0744       0       18  -2.713366  hypothetical protein
Shewmr7_0745      -1       21  -3.205261  ABC transporter-like protein
Shewmr7_0746       0       15  -1.451616  hypothetical protein
Shewmr7_0747       1       15  -1.916018  hypothetical protein
Shewmr7_0748       2       15  -2.185846  ferredoxin-dependent glutamate synthase
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits        Score  E-value  Comments
Shewmr7_0691  V8PROTEASE  48     9e-08    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 47.7 bits (113), Expect = 9e-08
Identities = 17/51 (33%), Positives = 26/51 (50%), Gaps = 1/51 (1%)

Query: 650 SVPVNFLS-SVDTTGGNSGSPVFNGKGELVGLNFDSTYEAITKDWFFNPTI 699
+ + + TTGGNSGSPVFN K E++G+++ F N +
Sbjct: 220 YLKGEAMQYDLSTTGGNSGSPVFNEKNEVIGIHWGGVPNEFNGAVFINENV 270


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits         Score  E-value  Comments
Shewmr7_0693  ALARACEMASE  431    e-154    Alanine racemase signature.
		>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 431 bits (1110), Expect = e-154
Identities = 155/350 (44%), Positives = 214/350 (61%), Gaps = 6/350 (1%)

Query: 6 RAEISSSALQNNLAVLRQQASRSQVMAVVKANGYGHGLLNVANCLHTADGFGLARLEEAL 65
+A + AL+ NL+++RQ A+ ++V +VVKAN YGHG+ + + + DGF L LEEA+
Sbjct: 6 QASLDLQALKQNLSIVRQAATHARVWSVVKANAYGHGIERIWSAIGATDGFALLNLEEAI 65

Query: 66 ELRAGGVKARLLLLEGFFRSTDLPLLVAHDIDTVVHHESQIEMLEQATLSKPVTVWLKVD 125
LR G K +L+LEGFF + DL + H + T VH Q++ L+ A L P+ ++LKV+
Sbjct: 66 TLRERGWKGPILMLEGFFHAQDLEIYDQHRLTTCVHSNWQLKALQNARLKAPLDIYLKVN 125

Query: 126 SGMHRLGVTPEQFAQVYARLTACDNVAKPIHLMTHFACADEPENNYTQVQMQTFNQLTAD 185
SGM+RLG P++ V+ +L A NV + LM+HFA A+ P+ M Q
Sbjct: 126 SGMNRLGFQPDRVLTVWQQLRAMANV-GEMTLMSHFAEAEHPD--GISGAMARIEQAAEG 182

Query: 186 LPGFRTLANSAGALYWPKSQGDWIRPGIALYGVSPVT--GDCGANHGLIPAMNLVSRLIA 243
L R+L+NSA L+ P++ DW+RPGI LYG SP D AN GL P M L S +I
Sbjct: 183 LECRRSLSNSAATLWHPEAHFDWVRPGIILYGASPSGQWRDI-ANTGLRPVMTLSSEIIG 241

Query: 244 VRDHKAGQPVGYGCYWTAKQDTRLGVVAIGYGDGYPRNAPEGTPVWVNGRRVPIVGRVSM 303
V+ KAG+ VGYG +TA+ + R+G+VA GY DGYPR+AP GTPV V+G R VG VSM
Sbjct: 242 VQTLKAGERVGYGGRYTARDEQRIGIVAAGYADGYPRHAPTGTPVLVDGVRTMTVGTVSM 301

Query: 304 DMLTVDLGADAADQVGDEALLWGAALPVEEVAEHIGTIAYELVTKLTPRV 353
DML VDL +G LWG + +++VA GT+ YEL+ L RV
Sbjct: 302 DMLAVDLTPCPQAGIGTPVELWGKEIKIDDVAAAAGTVGYELMCALALRV 351


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0715  FIMREGULATRY  44     5e-09    Escherichia coli: P pili regulatory PapB protein si...
		>FIMREGULATRY#Escherichia coli: P pili regulatory PapB protein signature.

Length = 104

Score = 44.2 bits (104), Expect = 5e-09
Identities = 23/84 (27%), Positives = 39/84 (46%), Gaps = 1/84 (1%)

Query: 3 TLIQGCETAQQFEILLKLTSISSQAKKDALRAYLVDGLPAKRAYARYGVTQQHFSEALAT 62
L+ G + F +L+ ++SI S A++ YLV G K +Y + +FS L
Sbjct: 22 VLLPGSMSEMHFFLLIGISSIHSDRVILAMKDYLVGGHSRKEVCEKYQMNNGYFSTTLGR 81

Query: 63 LNQKADLAMQYAALQKTKAENNFD 86
L + LA + A ++ + FD
Sbjct: 82 LIRLNALAARLAPYYTDES-SAFD 104


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0734  BCTERIALGSPF  26     0.031    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 25.6 bits (56), Expect = 0.031
Identities = 9/20 (45%), Positives = 14/20 (70%)

Query: 8 LLVIIGIVGMLLTVVVPLIA 27
+V I +V +LL+VVVP +
Sbjct: 180 TVVAIAVVSILLSVVVPKVV 199


11  Shewmr7_0842  Shewmr7_0855  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias    %GCBias  Product
Shewmr7_0842       0       26  -5.959639  hypothetical protein
Shewmr7_0843      -1       26  -5.956557  hypothetical protein
Shewmr7_0844      -1       26  -6.350932  DegS serine peptidase
Shewmr7_0845       0       20  -4.120207  peptidase Do
Shewmr7_0846       1       18  -3.138709  hypothetical protein
Shewmr7_0847       1       22  -1.552221  AFG1 family ATPase
Shewmr7_0848       1       25  -0.892803  50S ribosomal protein L13
Shewmr7_0849       1       20  -0.287107  30S ribosomal protein S9
Shewmr7_0850       0       19   0.410673  hypothetical protein
Shewmr7_0851      -1       15  -0.668993  adenylosuccinate synthetase
Shewmr7_0852       0       14  -0.757584  Sel1 domain-containing protein
Shewmr7_0853       4       11  -0.499365  hypothetical protein
Shewmr7_0854       4       12  -0.310757  RNAse R
Shewmr7_0855       2       15  -0.180882  23S rRNA (guanosine-2'-O-)-methyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits        Score  E-value  Comments
Shewmr7_0845  OMPADOMAIN  82     4e-19    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 81.9 bits (202), Expect = 4e-19
Identities = 45/164 (27%), Positives = 73/164 (44%), Gaps = 18/164 (10%)

Query: 104 TNADNQQNSLLISAN---NQAMSKEDQEKYSDYQVVGMAATISSTDEMMFGSGSAEATAQ 160
T DN SL +S +A +V + +++F A +
Sbjct: 176 TRPDNGMLSLGVSYRFGQGEAAPVVAPAPAPAPEVQ--TKHFTLKSDVLFNFNKATLKPE 233

Query: 161 AKQKLQKLAAIYKDSK---QNILITGHTDASGSESLNQTLSEARARYVATLFNQAGIPTE 217
+ L +L + + ++++ G+TD GS++ NQ LSE RA+ V GIP +
Sbjct: 234 GQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGLSERRAQSVVDYLISKGIPAD 293

Query: 218 RLFFQGAGESQPVAANTNETGK---------AKNRRVEIVEIEG 252
++ +G GES PV NT + K A +RRVEI E++G
Sbjct: 294 KISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEI-EVKG 336


12  Shewmr7_0909  Shewmr7_0922  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias    %GCBias  Product
Shewmr7_0909       0       16  -3.675124  hypothetical protein
Shewmr7_0910       1       19  -4.783153  hypothetical protein
Shewmr7_0911       1       22  -5.474096  TP901 family phage tail tape measure protein
Shewmr7_0912       1       20  -4.979561  hypothetical protein
Shewmr7_0913       1       21  -5.153202  putative bacteriophage protein
Shewmr7_0914       0       19  -4.565864  hypothetical protein
Shewmr7_0915      -1       15  -3.766385  hypothetical protein
Shewmr7_0916      -2       15  -1.324477  hypothetical protein
Shewmr7_0917      -2       15  -0.748789  hypothetical protein
Shewmr7_0918      -2       13  -0.198755  hypothetical protein
Shewmr7_0919      -2       14   0.446421  hypothetical protein
Shewmr7_0920       1       16   0.343518  phage integrase family protein
Shewmr7_0921       0       20   0.790563  tRNA-dihydrouridine synthase A
Shewmr7_0922       2       20   0.557099  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0911  ANTHRAXTOXNA  29     0.017    Anthrax toxin LF subunit signature.
		>ANTHRAXTOXNA#Anthrax toxin LF subunit signature.

Length = 800

Score = 28.9 bits (64), Expect = 0.017
Identities = 20/102 (19%), Positives = 41/102 (40%), Gaps = 14/102 (13%)

Query: 48 HFVLASTTEGKALIAESTENYAFNTQKILNASHVV-------VLCTRTQLDEAHLLQVL- 99
FV E LI + ++YA N+++ + + ++ LD L ++
Sbjct: 140 RFVFEKKRETPKLII-NIKDYAINSEQSKEVYYEIGKGISLDIISKDKSLDP-EFLNLIK 197

Query: 100 ----EQEAKDGRFANDEAKQAQHNGRSFFANMHKNELKDAQH 137
+ ++ D F+ ++ + N +S N K L + QH
Sbjct: 198 SLSDDSDSSDLLFSQKFKEKLELNNKSIDINFIKENLTEFQH 239


13  Shewmr7_0967  Shewmr7_0978  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias    %GCBias  Product
Shewmr7_0967       2       16   1.448461  membrane-flanked domain-containing protein
Shewmr7_0968       0       16  -0.005151  TonB-dependent receptor
Shewmr7_0969      -1       14  -0.461710  hypothetical protein
Shewmr7_0970      -1       15  -0.275022  glutathione S-transferase domain-containing
Shewmr7_0971      -1       16  -0.671603  hypothetical protein
Shewmr7_0972       1       21  -1.604976  hypothetical protein
Shewmr7_0973       2       24  -1.784262  hypothetical protein
Shewmr7_0974       4       41  -2.043302  hypothetical protein
Shewmr7_0975       3       34  -2.159274  endothelin-converting protein 1
Shewmr7_0976       3       30  -2.889191  hypothetical protein
Shewmr7_0977       3       24  -2.795017  ribosomal small subunit pseudouridine synthase
Shewmr7_0978       2       20  -2.695329  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits         Score  E-value  Comments
Shewmr7_0972  LUXSPROTEIN  273    1e-97    Bacterial autoinducer-2 (AI-2) production protein Lu...
		>LUXSPROTEIN#Bacterial autoinducer-2 (AI-2) production protein LuxS signature.

Length = 171

Score = 273 bits (699), Expect = 1e-97
Identities = 131/168 (77%), Positives = 150/168 (89%)

Query: 2 PLLDSFTVDHTRMNAPAVRVAKHMSTPKGDAITVFDLRFCAPNKDILSERGIHTLEHLFA 61
PLLDSFTVDHTRMNAPAVRVAK M TPKGD ITVFDLRF APNKDILSE+GIHTLEHL+A
Sbjct: 1 PLLDSFTVDHTRMNAPAVRVAKTMQTPKGDTITVFDLRFTAPNKDILSEKGIHTLEHLYA 60

Query: 62 GFMRDHLNGSDVEIIDISPMGCRTGFYMSLIGEPSERQVADAWLASMEDVLKVVEQSEIP 121
GFMR+HLNG VEIIDISPMGCRTGFYMSLIG PSE+QVADAW+A+MEDVLKV Q++IP
Sbjct: 61 GFMRNHLNGDSVEIIDISPMGCRTGFYMSLIGTPSEQQVADAWIAAMEDVLKVENQNKIP 120

Query: 122 ELNEYQCGTYQMHSLEQAQDIARNIIAAGVSVNRNDDLKLSDEILGQL 169
ELNEYQCGT MHSL++A+ IA+NI+ GV+VN+ND+L L + +L +L
Sbjct: 121 ELNEYQCGTAAMHSLDEAKQIAKNILEVGVAVNKNDELALPESMLREL 168


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits        Score  E-value  Comments
Shewmr7_0973  ECOLIPORIN  30     0.033    E.coli/Salmonella-type porin signature.
		>ECOLIPORIN#E.coli/Salmonella-type porin signature.

Length = 383

Score = 29.9 bits (67), Expect = 0.033
Identities = 55/244 (22%), Positives = 87/244 (35%), Gaps = 48/244 (19%)

Query: 417 EHVSINREVKQAFAGLDEADFSDSDWMPQLGVLYDAGDWRFSTDIRRAWTAASAGNT--- 473
E N + AFAGL D+ D+ GVLYD W TD+ + S
Sbjct: 87 EGEGANSWTRLAFAGLKFGDYGSFDYGRNYGVLYDVEGW---TDMLPEFGGDSYTYADNY 143

Query: 474 -TQEAQVSLHYQVSAQYAREGIKADLRAYVQ-----EFDNLHVDCDSYSMCADERLLTQE 527
T A Y+ + + G+ L +Q E + + + + +
Sbjct: 144 MTGRANGVATYRNTDFF---GLVDGLNFALQYQGKNESQSADDVNIGTNNRNNGDDIRYD 200

Query: 528 NIPDVLTYGVELGLGYRWDLGGVELPLGLNYQYLSAEYQTSTCTDVQ----GCVLEGDRL 583
N G G+ +D+G +G + A Y TS T+ Q G + GD+
Sbjct: 201 N-------GDGFGISTTYDIG-----MGFS---AGAAYTTSDRTNEQVNAGGTIAGGDKA 245

Query: 584 -AWLPEHQLQLSAGIKYAQYRLNLEAAYQSERDFSQFGSEQERISGQW-----RVDLAAN 637
AW +AG+KY + L Y R+ + +G + G ++ A
Sbjct: 246 DAW--------TAGLKYDANNIYLATMYSETRNMTPYGKTDKGYDGGVANKTQNFEVTAQ 297

Query: 638 YDFD 641
Y FD
Sbjct: 298 YQFD 301


14  Shewmr7_0992  Shewmr7_1001  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias    %GCBias  Product
Shewmr7_0992       2       24  -1.141859  invasion gene expression up-regulator, SirB
Shewmr7_0993       2       17   0.059049  hypothetical protein
Shewmr7_0994       2       16  -0.088322  hypothetical protein
Shewmr7_0995       2       19   0.558243  2-dehydro-3-deoxyphosphooctonate aldolase
Shewmr7_0996       1       14   1.483304  hypothetical protein
Shewmr7_0997       0       17   1.002115  IS4 family transposase
Shewmr7_0998      -1       16   1.523590  ammonium transporter
Shewmr7_0999       2       25   1.534248  hypothetical protein
Shewmr7_1000       2       25   1.536529  nitrogen regulatory protein P-II
Shewmr7_1001       2       24   0.861010  2-dehydropantoate 2-reductase
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0995  SHAPEPROTEIN  142    9e-40    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 142 bits (361), Expect = 9e-40
Identities = 78/385 (20%), Positives = 144/385 (37%), Gaps = 79/385 (20%)

Query: 5 IGIDLGTTNSCVAVLDGGK-----ARVLENAEGDRTTPSIIAYTDDETIVGQPAKRQAVT 59
+ IDLGT N+ + V G + V + + S+ A VG AK+
Sbjct: 13 LSIDLGTANTLIYVKGQGIVLNEPSVVAIRQDRAGSPKSVAA-------VGHDAKQMLGR 65

Query: 60 NPNNTFFAIKRLIGRRFKDDEVQRDVNIMPFKIIQADNGDAWVESRGNKMAPPQVSAEIL 119
P N AI+ + +D I F + + + +
Sbjct: 66 TPGN-IAAIRPM-----------KDGVIADFFVTEK------------------MLQHFI 95

Query: 120 KKMKKTAEDFLGEEVTEAVITVPAYFNDSQRQATKDAGRIAGLEVKRIINEPTAAALAYG 179
K++ + ++ VP +R+A +++ + AG +I EP AAA+ G
Sbjct: 96 KQVHS---NSFMRPSPRVLVCVPVGATQVERRAIRESAQGAGAREVFLIEEPMAAAIGAG 152

Query: 180 IDKKQGDNIVAVYDLGGGTFDISIIEIDSNDGDQTFEVLATNGDTHLGGEDFDNRLINYL 239
+ + + V D+GGGT ++++I ++ + + +GG+ FD +INY+
Sbjct: 153 LPVSEATGSM-VVDIGGGTTEVAVISLNG---------VVYSSSVRIGGDRFDEAIINYV 202

Query: 240 ADEFKKEQGLDLRKDPLAMQRLKEAAEKAKIELSST----NQTEVNLPYITADATGPKHL 295
+ G + AE+ K E+ S E+ + P+
Sbjct: 203 RRNYGSLIG-------------EATAERIKHEIGSAYPGDEVREIEVRGRNLAEGVPRGF 249

Query: 296 VVKITRAKLESLVEDLIIRTLEPLKVALADA--DLSVSDINE--VILVGGQTRMPKVQEA 351
+ + LE+L E + + + VAL +L+ SDI+E ++L GG + +
Sbjct: 250 TLN-SNEILEALQEP-LTGIVSAVMVALEQCPPELA-SDISERGMVLTGGGALLRNLDRL 306

Query: 352 VSNFFGKEPRKDVNPDEAVAVGAAI 376
+ G +P VA G
Sbjct: 307 LMEETGIPVVVAEDPLTCVARGGGK 331


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_1001  INFPOTNTIATR  155    2e-49    Macrophage infectivity potentiator signature.
		>INFPOTNTIATR#Macrophage infectivity potentiator signature.

Length = 233

Score = 155 bits (392), Expect = 2e-49
Identities = 77/205 (37%), Positives = 121/205 (59%), Gaps = 9/205 (4%)

Query: 6 STVEQQASYGVGRQMGEQLAANSFEGVDI--AAVQAGLADAFAGVESAVSMQDMQVAFTE 63
+T + + SY +G +G+ +G+DI + G+ D +G + ++ + M+ ++
Sbjct: 28 TTDKDKLSYSIGADLGKNFKN---QGIDINPDVLAKGMQDGMSGAQLILTEEQMKDVLSK 84

Query: 64 ISRRIQAAQ----EQAAAAASAEGEAFLAENAKRAGVIVTDSGLQYEVLVQGSGAKPSYE 119
+ + A + + A A+G+AFL+ N + G++V SGLQY+++ G+GAKP
Sbjct: 85 FQKDLMAKRSAEFNKKAEENKAKGDAFLSANKSKPGIVVLPSGLQYKIIDAGTGAKPGKS 144

Query: 120 DTVRTHYHGSFINGDVFDSSVVRGQPAEFPVSGVIAGWTEALQLMPVGTKLKLFVPHHLA 179
DTV Y G+ I+G VFDS+ G+PA F VS VI GWTEALQLMP G+ ++FVP LA
Sbjct: 145 DTVTVEYTGTLIDGTVFDSTEKAGKPATFQVSQVIPGWTEALQLMPAGSTWEVFVPADLA 204

Query: 180 YGERGAGASIPPYSTLVFEVELLDI 204
YG R G I P TL+F++ L+ +
Sbjct: 205 YGPRSVGGPIGPNETLIFKIHLISV 229


15  Shewmr7_1015  Shewmr7_1039  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias    %GCBias  Product
Shewmr7_1015       2       25  -4.068229  UDP-N-acetylmuramate
Shewmr7_1016       3       30  -3.563802  hypothetical protein
Shewmr7_1017       3       30  -3.132473  aromatic acid decarboxylase
Shewmr7_1018       4       30  -4.033721  hypoxanthine phosphoribosyltransferase
Shewmr7_1019       3       32  -3.744309  ABC transporter-like protein
Shewmr7_1020       3       31  -4.644674  ABC-2 type transporter
Shewmr7_1021       5       32  -4.076407  protease domain-containing protein
Shewmr7_1022       5       30  -3.388384  hypothetical protein
Shewmr7_1023       5       30  -3.421731  DNA-binding transcriptional regulator AsnC
Shewmr7_1024       5       31  -2.666503  ribosomal large subunit pseudouridine synthase
Shewmr7_1025       5       31  -2.973862  peptidase U32
Shewmr7_1026       5       31  -2.871338  hypothetical protein
Shewmr7_1027       6       30  -2.804762  abortive infection protein
Shewmr7_1028       5       32  -4.955474  hypothetical protein
Shewmr7_1029       5       35  -6.641863  cupin 2 domain-containing protein
Shewmr7_1030       5       36  -6.928362  hypothetical protein
Shewmr7_1031       6       37  -6.937897  twin-arginine translocation pathway signal
Shewmr7_1032       5       39  -8.326092  hypothetical protein
Shewmr7_1033       4       38  -8.485037  Fe-S metabolism associated SufE
Shewmr7_1034       4       39  -8.930802  cysteine desulfurase
Shewmr7_1035       4       38  -8.158314  methyl-accepting chemotaxis sensory transducer
Shewmr7_1036       4       37  -8.402984  hypothetical protein
Shewmr7_1037       5       36  -8.760331  DEAD/DEAH box helicase domain-containing
Shewmr7_1038       5       37  -7.071097  hypothetical protein
Shewmr7_1039       1       22  -3.728471  cysteine/glutathione ABC transporter
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr7_1018  PF00577  39     5e-05    Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 39.4 bits (92), Expect = 5e-05
Identities = 25/180 (13%), Positives = 54/180 (30%), Gaps = 29/180 (16%)

Query: 484 GRLTYRYTASNFGHGWVGQQQSLRADL-WNWKFATVSASFNHSSSSGNNITLSVSGSLG- 541
L+Y G G + A L + + + ++HS + VSG +
Sbjct: 645 NNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYGNANIGYSHSDDI-KQLYYGVSGGVLA 703

Query: 542 -RTGLLRTPQRSQAQLELSTCKDLNLNGICEPTEPEVESIKATVAGQEVVTP-----AII 595
G+ + + + P + K V Q V A++
Sbjct: 704 HANGVTLGQPLNDTVVLVKA--------------PGAKDAK--VENQTGVRTDWRGYAVL 747

Query: 596 GSLTPYQRYTIQVGSSFNLSPSYKAVESTRLI---RGGINKLRLPLTEIREVEGQLDRDG 652
T Y+ + + ++ L+ + + + RG I + ++ L +
Sbjct: 748 PYATEYRENRVALDTN-TLADNVDLDNAVANVVPTRGAIVRAEFKARVGIKLLMTLTHNN 806


16  Shewmr7_1085  Shewmr7_1101  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias    %GCBias  Product
Shewmr7_1085       2       19  -0.755119  response regulator receiver/unknown
Shewmr7_1086       4       29  -0.499835  peptidase M16 domain-containing protein
Shewmr7_1087       5       33  -0.912412  hypothetical protein
Shewmr7_1088       5       32  -0.331100  peptidase M16 domain-containing protein
Shewmr7_1089       6       33  -0.161872  hypothetical protein
Shewmr7_1090       4       26  -0.542916  hypothetical protein
Shewmr7_1091       4       30  -0.748196  ErfK/YbiS/YcfS/YnhG family protein
Shewmr7_1092       2       22  -1.583715  hypothetical protein
Shewmr7_1093       2       22  -1.200933  anaerobic nitric oxide reductase transcription
Shewmr7_1094       1       21  -1.998104  putative nitric oxide reductase (subunit B)
Shewmr7_1095       1       26  -2.087635  hypothetical protein
Shewmr7_1096       3       39  -2.204395  potassium/proton antiporter
Shewmr7_1097       3       32  -2.234395  lipid A biosynthesis lauroyl (or palmitoleoyl)
Shewmr7_1098       4       33  -1.553415  bifunctional heptose 7-phosphate kinase/heptose
Shewmr7_1099       2       33  -1.433017  hypothetical protein
Shewmr7_1100       3       39  -1.284853  TetR family transcriptional regulator
Shewmr7_1101       2       25  -0.684083  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits      Score  E-value  Comments
Shewmr7_1087  adhesinb  31     0.003    Adhesin B signature.
		>adhesinb#Adhesin B signature.

Length = 310

Score = 31.4 bits (71), Expect = 0.003
Identities = 18/95 (18%), Positives = 34/95 (35%), Gaps = 16/95 (16%)

Query: 142 REARRTFEVIAEELDIVIQKNGTMAFDNAIIAY----EPLWAVGTGKSATPEQAQEVHAF 197
+EA+ F I E +++ G F AY +W + T + TP+Q + +
Sbjct: 186 KEAKEKFNNIPGEKKMIVTSEG--CFKYFSKAYNVPSAYIWEINTEEEGTPDQIKTLVEK 243

Query: 198 IRKRLSEVSPFIGENIRILYGGSVTPSNAADLFAQ 232
+RK + L+ S ++
Sbjct: 244 LRKT----------KVPSLFVESSVDDRPMKTVSK 268


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits        Score  E-value  Comments
Shewmr7_1088  SECGEXPORT  121    3e-39    Protein-export SecG membrane protein signature.
		>SECGEXPORT#Protein-export SecG membrane protein signature.

Length = 110

Score = 121 bits (304), Expect = 3e-39
Identities = 63/110 (57%), Positives = 83/110 (75%)

Query: 1 MYEVLVVVYLLVALGLIGLILIQQGKGADMGASFGAGASGTLFGSSGSGNFLTRTTAILA 60
MYE L+VV+L+VA+GL+GLI++QQGKGADMGASFGAGAS TLFGSSGSGNF+TR TA+LA
Sbjct: 1 MYEALLVVFLIVAIGLVGLIMLQQGKGADMGASFGAGASATLFGSSGSGNFMTRMTALLA 60

Query: 61 IAFFTLSLLIGNLSANHAKNEDAWKNLGSDEQVTQPVDQATEKSETKIPD 110
FF +SL++GN+++N W+NL + + Q A K + IP+
Sbjct: 61 TLFFIISLVLGNINSNKTNKGSEWENLSAPAKTEQTQPAAPAKPTSDIPN 110


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
Shewmr7_1091  TCRTETOQM  69     4e-14    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 69.1 bits (169), Expect = 4e-14
Identities = 38/133 (28%), Positives = 57/133 (42%), Gaps = 18/133 (13%)

Query: 392 IMGHVDHGKTSLLDYIRRAKVAAGEAG------------------GITQHIGAYHVETDN 433
++ HVD GKT+L + + A E G GIT G + +N
Sbjct: 8 VLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITSFQWEN 67

Query: 434 GMITFLDTPGHAAFTAMRARGAKATDIVVLVVAADDGVMPQTIEAIQHAKAGNVPLIVAV 493
+ +DTPGH F A R D +L+++A DGV QT + +P I +
Sbjct: 68 TKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGIPTIFFI 127

Query: 494 NKMDKPEADIDRV 506
NK+D+ D+ V
Sbjct: 128 NKIDQNGIDLSTV 140


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_1097  SYCDCHAPRONE  29     0.013    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 29.1 bits (65), Expect = 0.013
Identities = 11/75 (14%), Positives = 20/75 (26%), Gaps = 3/75 (4%)

Query: 69 NEQRARFHYDRGVIYDSVGLRLMARIDFMQALKLQPDLADAYNFLGIYYTQEGEYDSAYE 128
+ +RF G ++G +A + + Q+GE A
Sbjct: 66 DHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAES 125

Query: 129 AFDGVLEL---SPNY 140
EL +
Sbjct: 126 GLFLAQELIADKTEF 140


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
Shewmr7_1098  TCRTETOQM  198    3e-58    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 198 bits (504), Expect = 3e-58
Identities = 107/461 (23%), Positives = 210/461 (45%), Gaps = 47/461 (10%)

Query: 10 KRRTFAIISHPDAGKTTITEKVLLFGNALQKAGTV-KGKKSGQHAKSDWMEMEKDRGISI 68
K +++H DAGKTT+TE +L A+ + G+V KG ++D +E+ RGI+I
Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGT-----TRTDNTLLERQRGITI 56

Query: 69 TTSVMQFPYGGALVNLLDTPGHEDFSEDTYRTLTAVDSCLMVIDSAKGVEDRTIKLMEVT 128
T + F + VN++DTPGH DF + YR+L+ +D +++I + GV+ +T L
Sbjct: 57 QTGITSFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHAL 116

Query: 129 RLRDTPIVTFMNKLDRDIRDPIELMDEVEDVLNIACAPITWPIGSGKEFKGVYHILRDEV 188
R P + F+NK+D++ D + ++++ L+ +++ +V
Sbjct: 117 RKMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAEI------------------VIKQKV 158

Query: 189 VLYQSGMGHTIQERRVIEGIDNPELDKAIGSYAADLR-DEMELVRGASNEFDHQAFLKGE 247
LY + E + + D + Y + + +EL + S F
Sbjct: 159 ELYPNMCVTNFTESEQWDTVIEGN-DDLLEKYMSGKSLEALELEQEES-----IRFHNCS 212

Query: 248 LTPVFFGTALGNFGVDHILDGIVEWAPKPLPRESDARMIMPDEEKFTGFVFKIQANMDPK 307
L PV+ G+A N G+D++++ I R + + G VFKI+ K
Sbjct: 213 LFPVYHGSAKNNIGIDNLIEVITNKFYSSTHR---------GQSELCGKVFKIE--YSEK 261

Query: 308 HRDRVAFMRVCSGRYEQGMKMHHVRIGKDVNVSDALTFMAGDRERAEVAYPGDIIGLHNH 367
R R+A++R+ SG + K + +++ T + G+ + + AY G+I+ L N
Sbjct: 262 -RQRLAYIRLYSGVLHLRDSVRISEKEK-IKITEMYTSINGELCKIDKAYSGEIVILQNE 319

Query: 368 GTIRIGDTFTQGEKFRFTGVPNFAPEMFR-RIRLRDPLKQKQLLKGLVQLSEEG-AVQVF 425
+++ + + + + P +++ LL L+++S+ ++ +
Sbjct: 320 F-LKLNSVLGDTKLLPQRERIENPLPLLQTTVEPSKPQQREMLLDALLEISDSDPLLRYY 378

Query: 426 RPLDTNDLIVGAVGVLQFEVVVGRLKSEYNVEAIYEGISVS 466
T+++I+ +G +Q EV L+ +Y+VE + +V
Sbjct: 379 VDSATHEIILSFLGKVQMEVTCALLQEKYHVEIEIKEPTVI 419


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits        Score  E-value  Comments
Shewmr7_1101  CHANNELTSX  82     3e-20    Nucleoside-specific channel-forming protein Tsx signa...
		>CHANNELTSX#Nucleoside-specific channel-forming protein Tsx signature.

Length = 294

Score = 81.6 bits (201), Expect = 3e-20
Identities = 77/240 (32%), Positives = 109/240 (45%), Gaps = 23/240 (9%)

Query: 60 YLEMEFGGRSGIFDLYGYVDVFNLANESSKDGDKNPGSGTSKLFMKFAPRVSIDALTGKD 119
YLE E + FD YGY+D +S K + S LFM+ PR SID LT D
Sbjct: 58 YLEYEAFAKKDWFDFYGYIDAPVFFGGNSTA--KGIWNKGSPLFMEIEPRFSIDKLTNTD 115

Query: 120 LSFGPIQEVYFSTLFNWD-GLNGEGVNSTFW-GVGADVNVPWLGKTGMNLYGYYD----- 172
LSFGP +E YF+ + +D G N ST++ G+G D++ +N+Y Y
Sbjct: 116 LSFGPFKEWYFANNYIYDMGRNDSQEQSTWYMGLGTDIDTGLPMSLSLNVYAKYQWQNYG 175

Query: 173 -MNAKEWNGYQFSANWFKPFYFFDNKSFLSFQGYIDYQFGAD------EDKTAFVPKTSN 225
N EW+GY+F +F P S LS+ G+ ++ +G+D D +TSN
Sbjct: 176 ASNENEWDGYRFKVKYFVPLTDLWGGS-LSYIGFTNFDWGSDLGDDNFYDLNGKHARTSN 234

Query: 226 --GGNIFFGL---YWHSDRYALGYGLKG-FKDVYLLEDGAGALALESTGWSHYLSATYKF 279
+ L +WH A + G + D L G G ++ STGW Y Y F
Sbjct: 235 SIASSHILALNYAHWHYSIVARYFHNGGQWADDAKLNFGDGPFSVRSTGWGGYFVVGYNF 294


17  Shewmr7_1117  Shewmr7_1130  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias    %GCBias  Product
Shewmr7_1117      -2       22   3.135715  sulfate adenylyltransferase subunit 1
Shewmr7_1118      -1       18   2.431792  TrkA domain-containing protein
Shewmr7_1119       2       11   0.797738  adenylylsulfate kinase
Shewmr7_1120       1       12  -0.341265  hypothetical protein
Shewmr7_1121       0       12  -0.770230  major facilitator transporter
Shewmr7_1122       2       16  -1.619789  hypothetical protein
Shewmr7_1123       2       18  -2.066353  protoporphyrinogen oxidase
Shewmr7_1124       2       14  -1.698994  hypothetical protein
Shewmr7_1125      -2       17  -4.368362  putative lipoprotein
Shewmr7_1126      -3       14  -2.115730  hypothetical protein
Shewmr7_1127      -2       14  -0.375098  DSBA oxidoreductase
Shewmr7_1128      -1       13   1.067698  hypothetical protein
Shewmr7_1129       0       13   1.398794  heat shock protein DnaJ domain-containing
Shewmr7_1130       2       17   1.995640  dihydropteridine reductase
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits         Score  E-value  Comments
Shewmr7_1124  SALSPVBPROT  44     1e-05    Salmonella virulence plasmid 65kDa B protein signature.
		>SALSPVBPROT#Salmonella virulence plasmid 65kDa B protein signature.

Length = 591

Score = 43.6 bits (102), Expect = 1e-05
Identities = 46/186 (24%), Positives = 71/186 (38%), Gaps = 32/186 (17%)

Query: 295 GQASYHIPIDLPPGRNGVQPSISLSYNSQGGNGILGVGWSLNAGSSISRCGATFAQDGFT 354
G AS +P+ + R G P+++L Y+S GGNG GVGWS S Q
Sbjct: 34 GLASITLPLPISAER-GFAPALALHYSSGGGNGPFGVGWSCATMSIARSTSHGVPQ---- 88

Query: 355 RAVTFNASTDRLCLDGQRLIVASG--------------------SYGASNAEYRTEMDSF 394
+N S + L DG+ L+ SY + + RTE F
Sbjct: 89 ----YNDSDEFLGPDGEVLVQTLSTGDAPNPVTCFAYGDVSFPQSYTVTRYQPRTESS-F 143

Query: 395 VKVVQHGNINDSNSSFTVYKPDGSRATYGANANSRFV-PSGLSTALSWKVTQESYSNGAN 453
++ ++ + + ++ +G G A +R P S W V +ES +
Sbjct: 144 YRLEYWVGNSNGDDFWLLHDSNGILHLLGKTAAARLSDPQAASHTAQWLV-EESVTPAGE 202

Query: 454 TIDYQY 459
I Y Y
Sbjct: 203 HIYYSY 208


18  Shewmr7_1232  Shewmr7_1246  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias    %GCBias  Product
Shewmr7_1232       3       18   0.357663  hypothetical protein
Shewmr7_1233       2       19   0.593774  xanthine/uracil/vitamin C permease
Shewmr7_1234       1       20   0.509755  gamma-glutamyl kinase
Shewmr7_1235       2       21   0.818137  gamma-glutamyl phosphate reductase
Shewmr7_1236       0       18   0.393125  hypothetical protein
Shewmr7_1237       0       18  -0.926250  ybaK/ebsC protein
Shewmr7_1238       0       17  -0.648506  hypothetical protein
Shewmr7_1239       0       18  -1.379113  hypothetical protein
Shewmr7_1240       0       18  -1.824967  molecular chaperone DnaK
Shewmr7_1241       0       16  -2.236767  chaperone protein DnaJ
Shewmr7_1242       0       20  -3.342534  hypothetical protein
Shewmr7_1243       0       23  -3.784893  DEAD/DEAH box helicase domain-containing
Shewmr7_1244      -2       21  -4.005253  GCN5-related N-acetyltransferase
Shewmr7_1245      -1       24  -5.193525  peptidase M48, Ste24p
Shewmr7_1246       0       18  -3.146992  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_1238  DHBDHDRGNASE  52     6e-10    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 51.6 bits (123), Expect = 6e-10
Identities = 40/189 (21%), Positives = 75/189 (39%), Gaps = 10/189 (5%)

Query: 12 VLITGASSGIGLQLAKDYLAAGWHVIACGRDKAKLDALAETVLIGA---TCISFDINERS 68
ITGA+ GIG +A+ + G H+ A + KL+ + ++ A D+ + +
Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDSA 70

Query: 69 QVQENALRIKDLLAQCACQLDLVILNAGGCEYIDDAKHFDDRLFERVVHTNLIAMGYCLG 128
+ E RI+ + +D+++ N G D +E N +
Sbjct: 71 AIDEITARIEREMGP----IDILV-NVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 129 AFLPLMP--RGARLALMSSSATYLAFPRAEAYGASKAGVQYLAASLRLDLAQHGISVSVI 186
+ M R + + S+ + AY +SKA L L+LA++ I +++
Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185

Query: 187 CPGFVATPL 195
PG T +
Sbjct: 186 SPGSTETDM 194


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_1245  TYPE3IMSPROT  24     0.049    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 24.3 bits (53), Expect = 0.049
Identities = 7/18 (38%), Positives = 14/18 (77%), Gaps = 1/18 (5%)

Query: 53 ERRVRDAREDGRLEPKSR 70
+++RDAR+ G++ KS+
Sbjct: 11 PKKIRDARKKGQV-AKSK 27


19  Shewmr7_1295  Shewmr7_1303  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias     %GCBias  Product
Shewmr7_1295       2       14    0.896731  hypothetical protein
Shewmr7_1296       0       13    0.651984  hypothetical protein
Shewmr7_1297       1       14    0.966650  hypothetical protein
Shewmr7_1298       2       19    1.799013  TonB-dependent siderophore receptor
Shewmr7_1299       2       17    1.476883  hypothetical protein
Shewmr7_1300       2       19    1.341491  hypothetical protein
Shewmr7_1301       2       21    1.434670  Ferritin, Dps family protein
Shewmr7_1302       2       19    1.782240  hypothetical protein
Shewmr7_1303       2       17    1.790938  SSU ribosomal protein S18P alanine
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_1297  SYCDCHAPRONE  32     0.001    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 32.2 bits (73), Expect = 0.001
Identities = 29/156 (18%), Positives = 57/156 (36%), Gaps = 11/156 (7%)

Query: 76 PELEDVHVAMAYYYQTVGDLVRTEQAYQDAINTKDASGDSMNNFGVFLCQQKQYDKAEKM 135
+ ++ +AM + + G + + + + + Q +Y+ A K+
Sbjct: 6 TDTQEYQLAMESFLKGGGTI-------AMLNEISSDTLEQLYSLAFNQYQSGKYEDAHKV 58

Query: 136 FLAAIEMPKYTRTASSYEN-LGICSRDAGQTEKARQYFQMALKYDPRRSVSLLELAELGL 194
F A + Y S + LG C + GQ + A + D + AE L
Sbjct: 59 FQALCVLDHYD---SRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFHAAECLL 115

Query: 195 DKGDYVDAQNQLARYHQVAAQTPESLTLGIKIEQAL 230
KG+ +A++ L ++ A E L ++ L
Sbjct: 116 QKGELAEAESGLFLAQELIADKTEFKELSTRVSSML 151


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_1303  TCRTETOQM     33     0.003    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 32.9 bits (75), Expect = 0.003
Identities = 38/159 (23%), Positives = 67/159 (42%), Gaps = 35/159 (22%)

Query: 199 IKLAIIGKPNVGKSTLTNRIL----GEERVVVYDEPGTTRDSIYIPMER----------- 243
I + ++ + GK+TLT +L + D+ T D+ + +R
Sbjct: 4 INIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITSF 63

Query: 244 --DGREYVIIDTAGVRRRSKVHEVIEKFSVIKTLKAVEDANVVLLIIDAREGVAEQDLGL 301
+ + IIDT G + + E V ++L ++ A +L+I A++GV Q L
Sbjct: 64 QWENTKVNIIDTPG-----HMDFLAE---VYRSLSVLDGA---ILLISAKDGVQAQTRIL 112

Query: 302 LGFALNAGRALVIAVNKWD--GID-----QGIKDRVKSE 333
G + +NK D GID Q IK+++ +E
Sbjct: 113 FHALRKMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAE 151


20  Shewmr7_1322  Shewmr7_1341  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias     %GCBias  Product
Shewmr7_1322      -1       22   -4.123900  hypothetical protein
Shewmr7_1323       0       20   -4.499081  methylation site containing protein
Shewmr7_1324       0       20   -4.838842  apolipoprotein N-acyltransferase
Shewmr7_1325       0       22   -4.701603  hypothetical protein
Shewmr7_1326      -2       21   -4.220278  CBS domain-containing protein
Shewmr7_1327       0       20   -2.895633  putative metalloprotease
Shewmr7_1328       1       18   -1.519483  PhoH family protein
Shewmr7_1329       3       18   -1.025467  (dimethylallyl)adenosine tRNA
Shewmr7_1330       2       17   -0.603886  hypothetical protein
Shewmr7_1331       1       17   -0.112308  hypothetical protein
Shewmr7_1332       0       17    0.325858  2-octaprenyl-3-methyl-6-methoxy-1,4-benzoquinol
Shewmr7_1333      -2       12    1.064311  peptidyl-tRNA hydrolase
Shewmr7_1334      -2       13   -0.068269  GTP-dependent nucleic acid-binding protein EngD
Shewmr7_1335      -1       15   -0.845990  *******hypothetical protein
Shewmr7_1336       1       20   -1.476175  hypothetical protein
Shewmr7_1337       2       21   -2.082658  hypothetical protein
Shewmr7_1338       3       25   -3.160392  hypothetical protein
Shewmr7_1339       4       33   -4.988723  hypothetical protein
Shewmr7_1340       4       31   -4.068136  hypothetical protein
Shewmr7_1341       2       26   -2.978958  transcription elongation factor GreA
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_1323  FRAGILYSIN    27     0.041    Fragilysin metallopeptidase (M10C) enterotoxin signat...
		>FRAGILYSIN#Fragilysin metallopeptidase (M10C) enterotoxin signature.

Length = 405

Score = 26.6 bits (58), Expect = 0.041
Identities = 15/48 (31%), Positives = 23/48 (47%), Gaps = 1/48 (2%)

Query: 1 MKRYLFIVAALLLTGCAAK-DKYVQWEDVPPSSFPKLTAIGYAPLATQ 47
+K L + A LL C+ + D D P ++ L ++ Y LATQ
Sbjct: 12 VKLLLMLGTAALLAACSNEADSLTTSIDAPVTASIDLQSVSYTDLATQ 59


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_1327  HTHFIS        63     4e-13    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 62.5 bits (152), Expect = 4e-13
Identities = 24/128 (18%), Positives = 52/128 (40%), Gaps = 12/128 (9%)

Query: 180 HIMVIDDSAVARKQIIRALESLNLQIDTAKDGREALDKLKTIAAEMNNVAEEIPLIISDI 239
I+V DD A R + +AL + + + A + L+++D+
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIA---------AGDGDLVVTDV 55

Query: 240 EMPEMDGYTLTAEIRDDPKLKHIKVVLHTSLSGVFNQAMVQKVGANDFIAK-FNPDELAA 298
MP+ + + L I+ + V++ ++ + + GA D++ K F+ EL
Sbjct: 56 VMPDENAFDLLPRIKK--ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIG 113

Query: 299 AVNKHLSL 306
+ + L+
Sbjct: 114 IIGRALAE 121


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_1330  FLGHOOKAP1    33     3e-04    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 33.0 bits (75), Expect = 3e-04
Identities = 9/38 (23%), Positives = 18/38 (47%)

Query: 99 NVNVMEEMADMISASRSYQMNVQVAEAAKSMLQQTLGM 136
VN+ EE ++ + Y N QV + A ++ + +
Sbjct: 508 GVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545



Score = 29.9 bits (67), Expect = 0.003
Identities = 16/67 (23%), Positives = 29/67 (43%), Gaps = 6/67 (8%)

Query: 5 SIFDVAGSGMSAQSVRLNTTASNIANADSVSSSIDKTYRSRHPIFEAEMAKAQSQQQASQ 64
S+ + A SG++A LNT ++NI++ + Y + I + +
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYN------VAGYTRQTTIMAQANSTLGAGGWVGN 55

Query: 65 GVAVKGI 71
GV V G+
Sbjct: 56 GVYVSGV 62


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_1332  FLGHOOKAP1    40     1e-05    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 40.3 bits (94), Expect = 1e-05
Identities = 15/35 (42%), Positives = 22/35 (62%)

Query: 2 SFNIALSGISAAQKDLNTTANNIANANTIGFKESR 36
N A+SG++AAQ LNT +NNI++ N G+
Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAGYTRQT 37



Score = 37.6 bits (87), Expect = 8e-05
Identities = 12/49 (24%), Positives = 25/49 (51%)

Query: 405 SISSSALEQSNIDLTTELVDLISAQRNFQANSRTLEVNNTLQQTVLQIR 453
+S+ S ++L E +L Q+ + AN++ L+ N + ++ IR
Sbjct: 498 QLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR 546


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_1334  FLGHOOKAP1    45     2e-07    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 44.6 bits (105), Expect = 2e-07
Identities = 19/119 (15%), Positives = 41/119 (34%), Gaps = 4/119 (3%)

Query: 145 EDATSITVSAEGEVSVKTPGAAENQVVGQLSMSDFINPSGLDPMGQNLYTETG---ASGT 201
D I +++E + + + Q + + +L ++ G A+
Sbjct: 427 TDEAKIAMASEEDAGDSDNRNGQALLDLQSNSKTVGGAKSFNDAYASLVSDIGNKTATLK 486

Query: 202 PIQGTASLDGMGAIRQGALETSNVNVTEELVNLIESQRIYEMNSKVISAVDQMLAYVNQ 260
T + + S VN+ EE NL Q+ Y N++V+ + + +
Sbjct: 487 TSSATQGNV-VTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALIN 544



Score = 35.7 bits (82), Expect = 2e-04
Identities = 9/36 (25%), Positives = 20/36 (55%)

Query: 5 LWISKTGLDAQQTDIAVISNNVANASTVGYKKSRAV 40
+ + +GL+A Q + SNN+++ + GY + +
Sbjct: 4 INNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTI 39


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_1335  FLGLRINGFLGH  147    2e-46    Flagellar L-ring protein signature.
		>FLGLRINGFLGH#Flagellar L-ring protein signature.

Length = 232

Score = 147 bits (373), Expect = 2e-46
Identities = 78/228 (34%), Positives = 113/228 (49%), Gaps = 20/228 (8%)

Query: 4 YFILAVALL-LTACSSTSKKPIADDPFYAPVYPEAPPTKIAATGSIYQDSQAA-----SL 57
Y I ++ +L LT C+ P+ A P P A GSI+Q +Q L
Sbjct: 9 YAISSLLVLSLTGCAWIPSTPLVQGATSAQPVPGPTP---VANGSIFQSAQPINYGYQPL 65

Query: 58 YSDIRAHKVGDIITIVLKEATQAKKSAGNQIKKGSDMSLDPIYAGGSNVS------IGGV 111
+ D R +GD +TIVL+E A KS+ + + G V G
Sbjct: 66 FEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNF-----GFDTVPRYLQGLFGNA 120

Query: 112 PLDLRYKDSMNTKRESDADQSNSLDGSISANVMQVLNNGNLVVRGEKWISINNGDEFIRV 171
D+ + A+ SN+ G+++ V QVL NGNL V GEK I+IN G EFIR
Sbjct: 121 RADVEASGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRF 180

Query: 172 TGIVRSQDIKPDNTIDSTRMANARIQYSGTGTFADAQKVGWLSQFFMS 219
+G+V + I NT+ ST++A+ARI+Y G G +AQ +GWL +FF++
Sbjct: 181 SGVVNPRTISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLN 228


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_1336  FLGPRINGFLGI  379    e-133    Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 379 bits (974), Expect = e-133
Identities = 161/367 (43%), Positives = 224/367 (61%), Gaps = 14/367 (3%)

Query: 5 LILAVAMLAFSLPSQAE--RIKDIANVQGVRNNQLIGYGLVVGLPGTGEKTR---YTEQT 59
L+ + + P+QA+ RIKDIA++Q R+NQLIGYGLVVGL GTG+ R +TEQ+
Sbjct: 11 LVFSALPFLSTPPAQADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDSLRSSPFTEQS 70

Query: 60 FTTMLKNFGINLPDNFRPKIKNVAVVAVHADMPAFIKPGQELDVTVSSLGEAKSLRGGTL 119
ML+N GI + KN+A V V A++P F PG +DVTVSSLG+A SLRGG L
Sbjct: 71 MRAMLQNLGITTQGG-QSNAKNIAAVMVTANLPPFASPGSRVDVTVSSLGDATSLRGGNL 129

Query: 120 LQTFLKGVDGNVYAIAQGSLVVSGFSADGLDGSKVIQNTPTVGRIPNGAIVERSVATPFS 179
+ T L G DG +YA+AQG+L+V+GFSA G D + + Q T R+PNGAI+ER + + F
Sbjct: 130 IMTSLSGADGQIYAVAQGALIVNGFSAQG-DAATLTQGVTTSARVPNGAIIERELPSKFK 188

Query: 180 TGDYLTFNLRRSDFSTAQRMADAINDL----LGPDMARPLDATSVQVSAPRDVSQRVSFL 235
L LR DFSTA R+AD +N G +A P D+ + V PR V+ +
Sbjct: 189 DSVNLVLQLRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQEIAVQKPR-VADLTRLM 247

Query: 236 ATLENIEVEPADESAKVIVNSRTGTIVVGQNVKLLPAAVTHGGLTVTIAEATQVSQPNAL 295
A +EN+ VE D AKV++N RTGTIV+G +V++ AV++G LTV + E+ QV QP
Sbjct: 248 AEIENLTVET-DTPAKVVINERTGTIVIGADVRISRVAVSYGTLTVQVTESPQVIQPAPF 306

Query: 296 ANGQTTVTSNSTINASESNRRMFMFNPGTTLDELVRAVNLVGAAPSDVLAILEALKVAGA 355
+ GQT V + I A + ++ + G L LV +N +G ++AIL+ +K AGA
Sbjct: 307 SRGQTAVQPQTDIMAMQEGSKVAIVE-GPDLRTLVAGLNSIGLKADGIIAILQGIKSAGA 365

Query: 356 LHGELII 362
L EL++
Sbjct: 366 LQAELVL 372


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_1337  FLGFLGJ       152    2e-45    Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 152 bits (386), Expect = 2e-45
Identities = 70/168 (41%), Positives = 101/168 (60%), Gaps = 2/168 (1%)

Query: 206 QILPTAAFRETQKTLKFGSREEFLATLYPHAEKAAKALGTQPEVLLAQSALETGWGQKIV 265
Q++ A R +L S+ FLA L A+ A++ G ++LAQ+ALE+GWGQ+ +
Sbjct: 131 QLVQKAVPRNYDDSLPGDSKA-FLAQLSLPAQLASQQSGVPHHLILAQAALESGWGQRQI 189

Query: 266 RGNNGAPSHNLFNIKADRRWQGDKANVSTLEFEQGIAVRQKADFRVYADFEHSFNDFVSF 325
R NG PS+NLF +KA W+G ++T E+E G A + KA FRVY+ + + +D+V
Sbjct: 190 RRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRVYSSYLEALSDYVGL 249

Query: 326 IAEGERYQAAKKVAASPTQFIRALQDAGYATDPKYAEKVIKVMQSISE 373
+ RY AA AAS Q +ALQDAGYATDP YA K+ ++Q +
Sbjct: 250 LTRNPRY-AAVTTAASAEQGAQALQDAGYATDPHYARKLTNMIQQMKS 296



Score = 86.3 bits (213), Expect = 4e-21
Identities = 39/93 (41%), Positives = 61/93 (65%), Gaps = 3/93 (3%)

Query: 12 DLGGLDSLRAQAQKDEKGALKKVAQQFEGVFVQMLMKSMRDANAVFESDSPLNSQYTKFY 71
D L+ L+A+A +D ++ VA+Q EG+FVQM++KSMRDA D +S++T+ Y
Sbjct: 14 DAQSLNELKAKAGEDPAANIRPVARQVEGMFVQMMLKSMRDALP---KDGLFSSEHTRLY 70

Query: 72 EQMRDQQLSVDLSDKGVLGLADMMVQQLSPESS 104
M DQQ++ ++ LGLA+MMV+Q++PE
Sbjct: 71 TSMYDQQIAQQMTAGKGLGLAEMMVKQMTPEQP 103


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_1338  FLGHOOKAP1    215    3e-64    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 215 bits (549), Expect = 3e-64
Identities = 125/455 (27%), Positives = 195/455 (42%), Gaps = 19/455 (4%)

Query: 4 DLLNIARTGVLASQSQLGVTSNNIANANTAGYHRQVATQTTLDSQRLGNSFYGTGTYVSD 63
L+N A +G+ A+Q+ L SNNI++ N AGY RQ +S + G G YVS
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVSG 61

Query: 64 VKRIYNDYAARELRIGQTTLSAAEASYGKLSELDQVFSQIGKIVPQSLNDLFSGLNSLAD 123
V+R Y+ + +LR QT S A Y ++S++D + S + + D F+ L +L
Sbjct: 62 VQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLVS 121

Query: 124 LPADLGIRSSTLNDAKQLANSLNQMQSTLNGQLTQTNDQITGMTKRINEISTELANLNLE 183
D R + + ++ L N L Q Q N I +IN + ++A+LN +
Sbjct: 122 NAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLNDQ 181

Query: 184 LMKSPNQDAM-----LLDKQDALVQELSQYAQVNVIPLENGAKSIMLGGAIMLVSGEV-- 236
+ + A LLD++D LV EL+Q V V + G +I + LV G
Sbjct: 182 ISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTAR 241

Query: 237 PMSVSTATGDPFPNELQLMSSIGSQSVRVDPNKLGGQLGALFEYREQTLVPAGLELDQLA 296
++ ++ DP + + + G LG + +R Q L L QLA
Sbjct: 242 QLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQLA 301

Query: 297 LGVADNFNKLQAQGFDLNGQVGTDIFKDINDPLMSIGRVAGFSGNTGNATLGVNIDDTSA 356
L A+ FN GFD NG G D F + V + N G+ +G + D SA
Sbjct: 302 LAFAEAFNTQHKAGFDANGDAGEDFFA------IGKPAVLQNTKNKGDVAIGATVTDASA 355

Query: 357 LSGGSYELSF--TAPATYELRDTQTGTITPLTLNGTKLEGGAGFSIDIKAGAMASGDRFA 414
+ Y++SF L T T+TP +G G A D F
Sbjct: 356 VLATDYKISFDNNQWQVTRLASNTTFTVTPDANGKVAFDGLELTFT----GTPAVNDSFT 411

Query: 415 IRPTAGAANGIEVVMTDPKGIAAAAPKITPDAANS 449
++P + A ++V++TD IA A+ + D+ N
Sbjct: 412 LKPVSDAIVNMDVLITDEAKIAMASEEDAGDSDNR 446



Score = 82.7 bits (204), Expect = 1e-18
Identities = 55/217 (25%), Positives = 80/217 (36%), Gaps = 20/217 (9%)

Query: 430 TDPKGIAAAAPKITPDAANSGNTQVKVTQITNRSAANFPTTGSELTIQLDTTAVPPTFEA 489
T KG A +T +A +F ++T T
Sbjct: 338 TKNKGDVAIGATVTDASAVLATDY----------KISFDNNQWQVTRLASNT-TFTVTPD 386

Query: 490 FDVNGASLGAPVAYTPPSISAFGFTFEVDSSAAAAGDKFTFDLS---------FAEGDNT 540
+ A G + +T FT + S A D D + + DN
Sbjct: 387 ANGKVAFDGLELTFTGTPAVNDSFTLKPVSDAIVNMDVLITDEAKIAMASEEDAGDSDNR 446

Query: 541 NALAMAKLSETKVMNGGKSTLADVFEQTKQDIGSQTKAAEVRVGAADAIYQQAYARVESE 600
N A+ L GG + D + DIG++T + + Q + +S
Sbjct: 447 NGQALLDLQSNSKTVGGAKSFNDAYASLVSDIGNKTATLKTSSATQGNVVTQLSNQQQSI 506

Query: 601 SGVNLDEEAANLMRFQQAYQASARIMSTAQQIFDTLL 637
SGVNLDEE NL RFQQ Y A+A+++ TA IFD L+
Sbjct: 507 SGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALI 543


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_1339  FLAGELLIN     51     5e-09    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 50.8 bits (121), Expect = 5e-09
Identities = 33/233 (14%), Positives = 83/233 (35%), Gaps = 4/233 (1%)

Query: 20 QTATSKILDQLSSGKKVNTSGDDPVAALGIDNLNQRNALVDQFMKNIDYATNHLQQTESQ 79
Q++ S +++LSSG ++N++ DD + + Q +N + + Q TE
Sbjct: 21 QSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQASRNANDGISIAQTTEGA 80

Query: 80 LGQADALISSMKDLMLQGSNGSQTSEERQTIADDLRKSLDQLLTIANTKDESGNYLFAGN 139
L + + + +++L +Q +NG+ + + ++I D++++ L+++ ++N +G + + +
Sbjct: 81 LNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEIDRVSNQTQFNGVKVLSQD 140

Query: 140 KTETLPFQFDANGKIVYQGDSGVHSAIIASGIQLNTNVAGDTAFIKSPNAMGDYSVNYSS 199
+ + I ++ G NV G +V
Sbjct: 141 NQMKIQVGANDGETITIDLQKIDVKSLGLDG----FNVNGPKEATVGDLKSSFKNVTGYD 196

Query: 200 SQQGEFSVTSAKLDGVTPSLSDYQINFLDDGAGGINVEVTDTATPANVISAAA 252
+ + ++ D T N +
Sbjct: 197 TYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDDAENNTAVDL 249


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_1340  FLAGELLIN     207    2e-63    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 207 bits (528), Expect = 2e-63
Identities = 162/509 (31%), Positives = 228/509 (44%), Gaps = 46/509 (9%)

Query: 2 AITVNTNVTSLKSQKNLNGANSALQTSMERLSSGLRINSAKDDAAGLQISNRLTSQINGL 61
A +NTN SL +Q NLN + S+L +++ERLSSGLRINSAKDDAAG I+NR TS I GL
Sbjct: 1 AQVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGL 60

Query: 62 DVAQRNANDGISIAQTAEGAMQTSTDILQRMRDLSLQSANGSNSTEDRAAMQKELAALQT 121
A RNANDGISIAQT EGA+ + LQR+R+LS+Q+ NG+NS D ++Q E+
Sbjct: 61 TQASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLE 120

Query: 122 ELTRIADTTSFGGQKLLDGTYGTQKFQVGANANETISVTLMDVSSNKLGNNTISGAGSVL 181
E+ R+++ T F G K+L K QVGAN ETI++ L + LG + + G
Sbjct: 121 EIDRVSNQTQFNGVKVLSQD-NQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKE 179

Query: 182 GVAATDTLSNSVAFTAGDIKVNGKTVAVAAADTATSLADKINATGSGVKAEAKLSTTIEG 241
S V V A V A
Sbjct: 180 ATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTD 239

Query: 242 -------LSSTDTGTLTVYDAEGNADSYDLSTYN---------------------GDAKT 273
+ T T AE A + + G+ K
Sbjct: 240 DAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKV 299

Query: 274 LASDLGKAGYDVSYDADTGKIGFSATGVQGIEISGGVV---------------GGTVSLG 318
+ G+ D G A +Q + V L
Sbjct: 300 STTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLE 359

Query: 319 GNVADDTNTNVSVASSLTLSSPDKFTVTDDGTADLGEILSGGTSELNKVSDIDINTAEGA 378
N A + ++V + ++ VT G + + G S L ++ +
Sbjct: 360 ANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLI--NEDAAAAKKST 417

Query: 379 QDAISVIDAAIAGIDSSRSDLGAVQNRMSFTINNLNNISTNVSDARSRIQDVDFAKETAT 438
+ ++ ID+A++ +D+ RS LGA+QNR I NL N TN++ ARSRI+D D+A E +
Sbjct: 418 ANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSN 477

Query: 439 MTKQQILSQTSSAMLAQANQIPQVALSLL 467
M+K QIL Q +++LAQANQ+PQ LSLL
Sbjct: 478 MSKAQILQQAGTSVLAQANQVPQNVLSLL 506


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_1341  FLAGELLIN     211    1e-64    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 211 bits (537), Expect = 1e-64
Identities = 163/509 (32%), Positives = 230/509 (45%), Gaps = 46/509 (9%)

Query: 2 AITVNTNVTSLGSQKNLNKANSALQTSMERLSSGLRINSAKDDAAGLQISNRLTSQINGL 61
A +NTN SL +Q NLNK+ S+L +++ERLSSGLRINSAKDDAAG I+NR TS I GL
Sbjct: 1 AQVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGL 60

Query: 62 DVAQRNANDGISIAQTAEGAMQTSTDILQRMRDLSLQSANGSNSADDRAAMQKEISSLQT 121
A RNANDGISIAQT EGA+ + LQR+R+LS+Q+ NG+NS D ++Q EI
Sbjct: 61 TQASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLE 120

Query: 122 ELTRIADTTSFGGQKLLDGTYGTQKFQVGSNANETISISLGDVSSNKLGNNTISGAGSVL 181
E+ R+++ T F G K+L K QVG+N ETI+I L + LG + + G
Sbjct: 121 EIDRVSNQTQFNGVKVLSQD-NQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKE 179

Query: 182 GVAATDTLSNSVAFTAGDIKVNGKTVAVAAADTATSLADKINATGSGVKAEAKLSTTIEG 241
S V V A V A
Sbjct: 180 ATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTD 239

Query: 242 -------LSSTDTGTLTVYDAEGNADSYDLSTYN---------------------GDAKT 273
+ T T AE A + + G+ K
Sbjct: 240 DAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKV 299

Query: 274 LASDLGKAGYDVSYDADTGKIGFSATGVQGIEISGGVV---------------GGTVSLG 318
+ G+ D G A +Q + V L
Sbjct: 300 STTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLE 359

Query: 319 GNVADDTNTNVSVASSLTLSSPDKFTVTDDGTADLGEILSGGTSELNKVSDIDINTAKGA 378
N A + ++V + ++ VT G + + G S L ++ K
Sbjct: 360 ANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLI--NEDAAAAKKST 417

Query: 379 QDAISVIDAAIAGIDSQRADLGAVQNRMNFTINNLSNISTNVSDARSRVQDVDFAKETAQ 438
+ ++ ID+A++ +D+ R+ LGA+QNR + I NL N TN++ ARSR++D D+A E +
Sbjct: 418 ANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSN 477

Query: 439 MTKQQILSQTSSAMLAQANQLPQVALSLL 467
M+K QIL Q +++LAQANQ+PQ LSLL
Sbjct: 478 MSKAQILQQAGTSVLAQANQVPQNVLSLL 506


21  Shewmr7_1373  Shewmr7_1405  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias     %GCBias  Product
Shewmr7_1373       0       24   -3.050383  phosphopentomutase
Shewmr7_1374      -1       19   -3.227563  purine nucleoside phosphorylase
Shewmr7_1375      -1       18   -3.124567  membrane protein
Shewmr7_1376      -1       18   -3.621179  hypothetical protein
Shewmr7_1377      -1       21   -3.633073  phosphoserine phosphatase
Shewmr7_1378       0       23   -4.271211  alpha/beta hydrolase fold domain-containing
Shewmr7_1379      -1       24   -4.855638  type IV pilus assembly PilZ
Shewmr7_1380      -1       27   -5.372933  DNA repair protein RadA
Shewmr7_1381      -1       30   -6.763212  type IV pilus assembly PilZ
Shewmr7_1382      -1       32   -7.220980  DNA-binding transcriptional regulator TorR
Shewmr7_1383       1       39   -9.649189  TMAO reductase system periplasmic protein TorT
Shewmr7_1384       2       41  -10.760499  periplasmic sensory protein associated with the
Shewmr7_1385       3       45  -11.204269  multi-sensor hybrid histidine kinase
Shewmr7_1386       4       51  -11.980723  hypothetical protein
Shewmr7_1387       4       54  -14.092606  chaperone protein TorD
Shewmr7_1388       5       54  -15.131407  trimethylamine-N-oxide reductase TorA
Shewmr7_1389       4       56  -16.401096  hypothetical protein
Shewmr7_1390       4       55  -16.370123  trimethylamine-N-oxide reductase c-type
Shewmr7_1391       3       52  -15.453255  hypothetical protein
Shewmr7_1392       3       48  -15.002715  periplasmic nitrate reductase NapE
Shewmr7_1393       3       43  -11.821618  hypothetical protein
Shewmr7_1394       2       40   -9.379662  xanthine/uracil/vitamin C permease
Shewmr7_1395       1       33   -7.081080  CBS domain-containing protein
Shewmr7_1396       0       29   -5.698375  phospholipid/glycerol acyltransferase
Shewmr7_1397      -1       27   -5.237583  metallophosphoesterase
Shewmr7_1398       0       26   -4.384827  phage integrase family protein
Shewmr7_1399       1       25   -4.585316  YD repeat-containing protein
Shewmr7_1400       0       24   -5.247497  hypothetical protein
Shewmr7_1401       0       26   -5.442525  hypothetical protein
Shewmr7_1402       0       27   -5.662392  hypothetical protein
Shewmr7_1403       2       30   -5.637001  hypothetical protein
Shewmr7_1404       0       28   -5.092980  antibiotic biosynthesis monooxygenase
Shewmr7_1405      -1       21   -3.827714  nucleoside recognition domain-containing
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_1376  TYPE3IMSPROT  54     7e-12    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 53.6 bits (129), Expect = 7e-12
Identities = 18/87 (20%), Positives = 32/87 (36%), Gaps = 3/87 (3%)

Query: 8 TQQAVALSYD-GKH-APKVVASGEGLVADEIIALAKASGVYIHQDPHLSNFL-RLLELGE 64
T A+ + Y G+ P V + +A+ GV I Q L+ L +
Sbjct: 265 THIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIAEEEGVPILQRIPLARALYWDALVDH 324

Query: 65 EIPKELYLLIAELIAFVYMLDGKFPEQ 91
IP E AE++ ++ + +
Sbjct: 325 YIPAEQIEATAEVLRWLERQNIEKQHS 351


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_1377  IGASERPTASE   38     2e-04    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 37.7 bits (87), Expect = 2e-04
Identities = 32/181 (17%), Positives = 64/181 (35%), Gaps = 12/181 (6%)

Query: 230 LTPQAELLNTSKPSIQAQVAPDNKAAVEVTTSSATDNPTSKNNASALSIQTSLQANTEPK 289
+ P A + A+ + VE AT+ T++N A +++++ANT+
Sbjct: 1025 VPPPAPATPSETTETVAENSKQESKTVEKNEQDATE-TTAQNREVAKEAKSNVKANTQTN 1083

Query: 290 LTTVNQKLIPEIQPQEMAKTQQ-----KAPSLEINLTEAKNVASNTLPTRD--ENIRNTA 342
+ E Q E +T KA E V S P ++ E ++ A
Sbjct: 1084 EVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQA 1143

Query: 343 SAMLP----TNAMKSEAASSSLAKSALSSNELPLNLKPLAAEAQLTEKTNKASESTISVN 398
N + ++ +++ A + + E N++ E+ N E+ +
Sbjct: 1144 EPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTT 1203

Query: 399 E 399

Sbjct: 1204 P 1204


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_1378  VACJLIPOPROT  230    5e-78    VacJ lipoprotein signature.
		>VACJLIPOPROT#VacJ lipoprotein signature.

Length = 251

Score = 230 bits (588), Expect = 5e-78
Identities = 91/257 (35%), Positives = 134/257 (52%), Gaps = 16/257 (6%)

Query: 9 LLGFALLPKVYGAEATVPDTTPKETASAVKITYDDPRDPLEGFNRAMWDFNYLYLDRYIY 68
L AL + A+ + DPLEGFNR M++FN+ LD YI
Sbjct: 5 LSALALGTTLLVGCASSGTDQQGRS------------DPLEGFNRTMYNFNFNVLDPYIV 52

Query: 69 RPIAHGYNDYLPLPAKTGINNFVQNLEEPSSLVNNALQGKWGWAANAGGRFTVNTTIGLL 128
RP+A + DY+P PA+ G++NF NLEEP+ +VN LQG RF +NT +G+
Sbjct: 53 RPVAVAWRDYVPQPARNGLSNFTGNLEEPAVMVNYFLQGDPYQGMVHFTRFFLNTILGMG 112

Query: 129 GVFDVADMMGMPRKQDE---FNEVLGYYGVPNGPYFMAPFAGPYVVRELASDWVDGLYFP 185
G DVA M ++ E F LG+YGV GPY PF G + +R+ D D LY
Sbjct: 113 GFIDVAGMANPKLQRTEPHRFGSTLGHYGVGYGPYVQLPFYGSFTLRDDGGDMADALYPV 172

Query: 186 LSELTVWQSIVKWGLKSLHARASAIDQERLVDNALDPYTFVKDAYLQHMDYKVYDGNV-P 244
LS LT S+ KW L+ + RA +D + L+ + DPY V++AY Q D+ G + P
Sbjct: 173 LSWLTWPMSVGKWTLEGIETRAQLLDSDGLLRQSSDPYIMVREAYFQRHDFIANGGELKP 232

Query: 245 QKQEDDELLDQYMQELE 261
Q+ + + + +++++
Sbjct: 233 QENPNAQAIQDDLKDID 249


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_1379  HTHFIS        95     2e-23    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 94.9 bits (236), Expect = 2e-23
Identities = 29/101 (28%), Positives = 47/101 (46%)

Query: 7 SILLVEDDPVFRQIVATFLSGRGAEVVQACDGEQGLSIFKQQRFDIILADLSMPKLGGLD 66
+IL+ +DD R ++ LS G +V + D+++ D+ MP D
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 67 MLKEMSKLEPLVPSIVISGNNVMADVVEALRVGACDYLVKP 107
+L + K P +P +V+S N ++A GA DYL KP
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKP 105


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_1385  NUCEPIMERASE  68     8e-15    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 67.5 bits (165), Expect = 8e-15
Identities = 48/223 (21%), Positives = 82/223 (36%), Gaps = 36/223 (16%)

Query: 6 KVLITGGTGSFGKQFIKTILERYPNVKRIVIFSRDELKQS-ELRLKY---PQKDYPQLRF 61
K L+TG G G K +LE V I D L ++ LK P +F
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGI-----DNLNDYYDVSLKQARLELLAQPGFQF 56

Query: 62 FIGDVRDRNRMVQ--ACEGIDVIIHAAAIKQVDTAEYNPTECIRTNVDGAENVIHAALQC 119
D+ DR M A + + + V + NP +N+ G N++
Sbjct: 57 HKIDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHN 116

Query: 120 GVKDVVALST---------------DKACAPINLYGATKLTSDKLFTAANNIKGSRDIRF 164
++ ++ S+ D P++LY ATK ++ + +++ G +
Sbjct: 117 KIQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYG---LPA 173

Query: 165 SVVRYGNVMGSRGS---VIPFFLKKRDEGVLPIT---HEEMTR 201
+ +R+ V G G + F K EG I + +M R
Sbjct: 174 TGLRFFTVYGPWGRPDMALFKFTKAMLEGK-SIDVYNYGKMKR 215


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_1399  NUCEPIMERASE  182    4e-57    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 182 bits (463), Expect = 4e-57
Identities = 89/356 (25%), Positives = 146/356 (41%), Gaps = 53/356 (14%)

Query: 1 MRILVTGGSGFIGSALVRLLIQATNCHVLNIDKLTYASHP---DALIGISNHPRYQFVKA 57
M+ LVTG +GFIG + + L++A + V+ ID L A + + P +QF K
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGH-QVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKI 59

Query: 58 DICDGARLDILFEQFKPNIVMHLAAETHVDRSIEGPAAFIQNNILGTFTLLEAARRYWTQ 117
D+ D + LF V V S+E P A+ +N+ G +LE R
Sbjct: 60 DLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHN--- 116

Query: 118 LDSLQKLQFRFHHVSTDEVFGSLADTGLFSETSAYD-PSSPYSASKASTDHLVRAWHRTY 176
+Q L + S+ V+G L FS + D P S Y+A+K + + + + Y
Sbjct: 117 --KIQHLLY----ASSSSVYG-LNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLY 169

Query: 177 GLPIVITNCSNNYGPFQYPEKLIPLMVNHALQGKSLPIYGNGQQVRDWLYVDDHVKALYL 236
GLP YGP+ P+ + L+GKS+ +Y G+ RD+ Y+DD +A+
Sbjct: 170 GLPATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIR 229

Query: 237 V------------------ATRGQLGQTYNIGGCCERTNLAVVQQICLLLEELVPTHPQS 278
+ A + YNIG + +Q LE+ +
Sbjct: 230 LQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQ----ALEDAL------ 279

Query: 279 LAMKGDGFAALIEHVVDRPGHDIR--YAIDSSKIQHELGWQPLESFESGLRRAVEW 332
G A + +PG D+ A D+ + +G+ P + + G++ V W
Sbjct: 280 ------GIEAKKNMLPLQPG-DVLETSA-DTKALYEVIGFTPETTVKDGVKNFVNW 327


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_1401  PF06291       31     0.001    Lambda prophage Bor protein
		>PF06291#Lambda prophage Bor protein

Length = 102

Score = 30.8 bits (69), Expect = 0.001
Identities = 17/46 (36%), Positives = 25/46 (54%), Gaps = 3/46 (6%)

Query: 13 QTVKMNMKLSQISLALLALMITACSEPAKTVANEPVAAPHQDTQTN 58
Q KM L +LA+L IT C++ TV N+P A ++T T+
Sbjct: 2 QDNKMKKMLFSAALAML---ITGCAQQTFTVGNKPTAVTPKETITH 44


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_1402  PF06580       48     5e-08    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 47.6 bits (113), Expect = 5e-08
Identities = 35/198 (17%), Positives = 74/198 (37%), Gaps = 38/198 (19%)

Query: 266 NTMQDGLGLIERNLSRAAELV--------HNFKRTAADQSVLERERFNLKTYIFQIFSSL 317
N + + LI + ++A E++ ++ + + A Q L E + +Y+ + S
Sbjct: 177 NALNNIRALILEDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQ-LAS-- 233

Query: 318 RPLMR-KKNIALNVELDDDIFIESYPGAIAQIFTNLVANSFRHGFPESFTGDKIITIRVQ 376
++ + + +++ I P + Q LV N +HG + G I ++
Sbjct: 234 ---IQFEDRLQFENQINPAIMDVQVPPMLVQT---LVENGIKHGIAQLPQG-GKILLKGT 286

Query: 377 KQDSNICMQYQDNGVGMTDEVKLKAFEPFFTTARKDGGTGLGMSIIYNLVTQKLHG---T 433
K + + ++ ++ G K TG G+ + + Q L+G
Sbjct: 287 KDNGTVTLEVENTGSLALKNTK--------------ESTGTGLQNVRERL-QMLYGTEAQ 331

Query: 434 IMLTSSPYQGVKVEIQIP 451
I L+ V + IP
Sbjct: 332 IKLSEKQ-GKVNAMVLIP 348


22  Shewmr7_1464  Shewmr7_1476  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias     %GCBias  Product
Shewmr7_1464       2       16    2.615664  (p)ppGpp synthetase I, SpoT/RelA
Shewmr7_1465       2       16    3.545558  hypothetical protein
Shewmr7_1466       1       15    3.789505  hypothetical protein
Shewmr7_1467       1       12    3.385757  YD repeat-containing protein
Shewmr7_1468       0       11    1.787343  hypothetical protein
Shewmr7_1469       0       11    1.442203  phage integrase family protein
Shewmr7_1470      -1       13    0.333220  GCN5-related N-acetyltransferase
Shewmr7_1471       0       24   -5.823400  hypothetical protein
Shewmr7_1472      -1       21   -5.818088  nucleoside triphosphate pyrophosphohydrolase
Shewmr7_1473      -1       23   -6.664113  CTP synthetase
Shewmr7_1474       0       26   -7.574477  CTP synthetase
Shewmr7_1475      -1       23   -6.979605  phosphopyruvate hydratase
Shewmr7_1476      -1       20   -5.932139  cell division protein FtsB
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_1467  IGASERPTASE   50     4e-08    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 49.7 bits (118), Expect = 4e-08
Identities = 32/191 (16%), Positives = 66/191 (34%), Gaps = 17/191 (8%)

Query: 482 QQQGQNQQQGDQQSSQNDQAQDQSQEQQSQQQNNSDQADKKPSQEQSTSSEQNDPEQGAQ 541
N Q D S ++ + + + A PS+ T +E + E
Sbjct: 996 NITTPNNIQADVPSVPSNNEE----IARVDEAPVPPPAPATPSETTETVAENSKQESKTV 1051

Query: 542 DKQQASDENAKQDQQDAQQEQQQAEQQANQQNGADNNAEDKEDPASNEAKMQAKVE-DDK 600
+K + ++ +E + + Q N + + ++ + E K A VE ++K
Sbjct: 1052 EKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEK 1111

Query: 601 SKAKQEQQQAVAQKADKEK-----------QAQADKKPDTAVESVEA-PPSNSEPLPAEM 648
+K + E+ Q V + + QA+ ++ D V E +N+ +
Sbjct: 1112 AKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQP 1171

Query: 649 QRALRGVSEDP 659
+ E P
Sbjct: 1172 AKETSSNVEQP 1182



Score = 47.0 bits (111), Expect = 2e-07
Identities = 38/224 (16%), Positives = 84/224 (37%), Gaps = 15/224 (6%)

Query: 430 AALDKQPEFPQAKANLELAEKLLNQQQSQQNADNQDKQSQGDQNQQGQDQNDQQQGQNQQ 489
A +D+ P P A A + S+Q + +K Q Q++ ++ ++
Sbjct: 1018 ARVDEAPVPPPAPA-TPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNV 1076

Query: 490 QGDQQSSQNDQAQDQSQEQQSQQQNNSDQADKKPSQEQSTSSEQNDPEQGAQDKQQASDE 549
+ + Q+++ Q+ +++E Q+ + + +K+ + T Q P+ +Q +
Sbjct: 1077 KANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQS 1136

Query: 550 NAKQDQQDAQQE--------QQQAEQ--QANQQNGADNNAEDKEDPASNEAKMQAKVEDD 599
Q Q + +E + Q++ A+ + A + + E P + V
Sbjct: 1137 ETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTE----STTVNTG 1192

Query: 600 KSKAKQEQQQAVAQKADKEKQAQADKKPDTAVESVEAPPSNSEP 643
S + + A ++K + SV + P N EP
Sbjct: 1193 NSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEP 1236



Score = 40.4 bits (94), Expect = 2e-05
Identities = 25/198 (12%), Positives = 63/198 (31%), Gaps = 7/198 (3%)

Query: 488 QQQGDQQSSQNDQAQDQSQEQQSQQQNNSDQADKKPSQEQSTSSEQNDPEQGAQDKQQAS 547
+ + Q+ + Q S+ + E + +
Sbjct: 984 EVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPS---ETTETV 1040

Query: 548 DENAKQDQQDAQQEQQQAEQQANQQNGADNNAEDKEDPASNEAKMQAKVEDDKSKAKQEQ 607
EN+KQ+ + ++ +Q A + Q A+ A+ + A+ + + + +
Sbjct: 1041 AENSKQESKTVEKNEQDATETTAQNREVAKEAK-SNVKANTQTNEVAQSGSETKETQTTE 1099

Query: 608 QQAVAQKADKEKQ-AQADKKPDTAVESVEAPPSNSEPLPAEMQRALRGVSEDPQVLLRNK 666
+ A +EK + +K + + + P + +A DP V ++
Sbjct: 1100 TKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQS-ETVQPQAEPARENDPTVNIKEP 1158

Query: 667 MQLEYQKRRQNGQISRDN 684
Q + Q +++
Sbjct: 1159 -QSQTNTTADTEQPAKET 1175


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_1471  HTHFIS           34  8e-04    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 33.7 bits (77), Expect = 8e-04
Identities = 35/139 (25%), Positives = 53/139 (38%), Gaps = 19/139 (13%)

Query: 28 LIALIANG--HLLVEGPPGLAKT---RAVKALCDGVEGDFHRIQ---FTPDLLPADLTG- 78
++A + L++ G G K RA+ G F I DL+ ++L G
Sbjct: 152 VLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGH 211

Query: 79 -----TDIYRSQTGTFEFEAGPIFHNLILADEINRAPAKVQSALLEAMAEGQVT-VGKNS 132
T TG FE G + DEI P Q+ LL + +G+ T VG +
Sbjct: 212 EKGAFTGAQTRSTGRFEQAEG----GTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRT 267

Query: 133 YKLPPLFLVMATQNPLENE 151
+ +V AT L+
Sbjct: 268 PIRSDVRIVAATNKDLKQS 286


23  Shewmr7_1615  Shewmr7_1626  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias     %GCBias  Product
Shewmr7_1615       0       29   -7.023464  type IV pilus biogenesis/stability protein PilW
Shewmr7_1616       5       56  -12.819845  hypothetical protein
Shewmr7_1619       6       59  -14.049434  hypothetical protein
Shewmr7_1620       4       48  -11.650045  4-hydroxy-3-methylbut-2-en-1-yl diphosphate
Shewmr7_1622       3       43  -10.611360  histidyl-tRNA synthetase
Shewmr7_1623       2       39   -9.636594  hypothetical protein
Shewmr7_1624       2       29   -7.344452  outer membrane protein assembly complex subunit
Shewmr7_1625       1       17   -4.389660  outer membrane protein assembly complex subunit
Shewmr7_1626       1       15   -3.567802  GTP-binding protein EngA
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_1620  DNABINDINGHU     89  4e-27    Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 89.0 bits (221), Expect = 4e-27
Identities = 36/88 (40%), Positives = 52/88 (59%)

Query: 2 NKAQLIQRIATSLEQSQASTKPVVEQILQQIHIALSEGEKVFLPQFGTFELRFHLPKSGR 61
NK LI ++A + E ++ + V+ + + L++GEKV L FG FE+R + GR
Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62

Query: 62 NPQTGETIEIAGFNQPSFKAATALKKAI 89
NPQTGE I+I P+FKA ALK A+
Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAV 90


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_1623  SUBTILISIN      114  7e-30    Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 114 bits (286), Expect = 7e-30
Identities = 69/289 (23%), Positives = 120/289 (41%), Gaps = 61/289 (21%)

Query: 258 SSYVVDDNGNPVPNEYKVCVHGHGTAVGGTIASIRGNGVGVSSVLGSNNAELVYVKVLDS 317
++ DD G+P + +GHGT V GTIA+ N GV V + A+L+ +KVL+
Sbjct: 67 RNFTDDDEGDPEIFKDY---NGHGTHVAGTIAA-TENENGVVGV--APEADLLIIKVLNK 120

Query: 318 CNDGAFLSDIIKGIHWSVGDHFDGVTDISSPVDVINLSLGGMGNGGLCDVGFNAMADAVA 377
G II+GI++++ VD+I++SLGG + +AV
Sbjct: 121 QGSGQ-YDWIIQGIYYAIEQK----------VDIISMSLGG-------PEDVPELHEAVK 162

Query: 378 YANSKGAVVVASTGNSALEA----TAATPVSCHGIITAAANTSNGELAPFSNYYNSRKNI 433
A + +V+ + GN P + +I+ A + + FSN N ++
Sbjct: 163 KAVASQILVMCAAGNEGDGDDRTDELGYPGCYNEVISVGAINFDRHASEFSNSNNE-VDL 221

Query: 434 SAIGQDLLTPFVNTSVYVSRNGVGGVEGDCNKITNCYAYYTGTSLSAPVISSAVALIKME 493
A G+D+L+ YA ++GTS++ P ++ A+ALIK
Sbjct: 222 VAPGEDILSTV---------------------PGGKYATFSGTSMATPHVAGALALIKQL 260

Query: 494 NPS-----LKAEQIFDILYNTA------SEYNTNEVGNKTALYKLSKNT 531
+ L +++ L + N + TA+ +LS+
Sbjct: 261 ANASFERDLTEPELYAQLIKRTIPLGNSPKMEGNGLLYLTAVEELSRIF 309


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_1626  YERSSTKINASE     32  0.012    Yersinia serine/threonine protein kinase signature.
		>YERSSTKINASE#Yersinia serine/threonine protein kinase signature.

Length = 732

Score = 32.0 bits (72), Expect = 0.012
Identities = 19/47 (40%), Positives = 25/47 (53%), Gaps = 1/47 (2%)

Query: 182 QVLDGIIHSHANQVLHRDIKPDNILVDD-EGRVHVIDFGISKLMGEQ 227
++LD H V+H DIKP N++ D G VID G+ GEQ
Sbjct: 253 RLLDVTNHLAKAGVVHNDIKPGNVVFDRASGEPVVIDLGLHSRSGEQ 299


24  Shewmr7_1653  Shewmr7_1658  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias     %GCBias  Product
Shewmr7_1653       2       30   -5.172606  hypothetical protein
Shewmr7_1654       1       30   -4.927904  hypothetical protein
Shewmr7_1655       0       28   -4.296577  FlgN family protein
Shewmr7_1656      -1       28   -4.413585  anti-sigma-28 factor, FlgM
Shewmr7_1657      -1       26   -4.387598  flagellar basal body P-ring biosynthesis protein
Shewmr7_1658      -1       27   -4.268354  flagellar basal body P-ring biosynthesis protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_1654  PF06580          37  2e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 36.8 bits (85), Expect = 2e-04
Identities = 20/102 (19%), Positives = 34/102 (33%), Gaps = 23/102 (22%)

Query: 356 LMENAFRLCISQ------VEVSARFNEQGDFELIVEDDGPGVEENLRQKIIQRGVRADTQ 409
L+EN + I+Q + + + G L VE+ G +N ++
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTK-DNGTVTLEVENTGSLALKNTKE------------ 309

Query: 410 SPGQGIGLA-VCDEIVSSYGGYLSIE-ESHLEGARFRITIPA 449
G GL V + + YG I+ + IP
Sbjct: 310 --STGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIPG 349


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_1655  HTHFIS           79  4e-19    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 78.7 bits (194), Expect = 4e-19
Identities = 32/124 (25%), Positives = 59/124 (47%), Gaps = 1/124 (0%)

Query: 1 MSIMRILVVEDDLILSHHLKVQLSDLGNQVQVALTAKEGFFQATNYPIDVAIVDLGLPDQ 60
M+ ILV +DD + L LS G V++ A + D+ + D+ +PD+
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 DGISLIQQLREEGVKAPILILTARVNWQDKVEGLNAGADDYLVKPFQKEELVARLD-ALV 119
+ L+ ++++ P+L+++A+ + ++ GA DYL KPF EL+ + AL
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 120 RRSA 123

Sbjct: 121 EPKR 124


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_1658  INTIMIN          42  1e-05    Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 42.4 bits (99), Expect = 1e-05
Identities = 45/184 (24%), Positives = 78/184 (42%), Gaps = 16/184 (8%)

Query: 54 ATPAEVQATVVDSKTGPKAGIVVTFKLDNDELGTFTPSTGTQLTDSSGVAKIKLDTGSLA 113
ATV + +A + V+F + + GT S + T+ SG A + L +
Sbjct: 575 TEAITYTATVKKNGV-AQANVPVSFNIVS---GTAVLSANSANTNGSGKATVTLKSDKP- 629

Query: 114 GAGSVTASIGTGESASMGFYSKGDGAINPGTGNKLKLSLVNAQEQAITSISSATPGIVKA 173
G V+A SA + I ++ K S+ + T++++ I
Sbjct: 630 GQVVVSAKTAEMTSA-----LNANAVIFV---DQTKASITEIKADKTTAVANGQDAITYT 681

Query: 174 LYTNSSDEPLVGKVITFTSTLGKFQPESGTALTDAQGLAKIAITAGTVAGAGKIIAKVDD 233
+ D+P+ + +TFT+TLGK + T TD G AK+ +T+ T G + A+V D
Sbjct: 682 VKVMKGDKPVSNQEVTFTTTLGK--LSNSTEKTDTNGYAKVTLTSTTP-GKSLVSARVSD 738

Query: 234 TESE 237
+
Sbjct: 739 VAVD 742



Score = 37.0 bits (85), Expect = 5e-04
Identities = 60/305 (19%), Positives = 99/305 (32%), Gaps = 48/305 (15%)

Query: 379 TGMPTTNISATQPGKVTV---ALVDKDSTPLVGKVVSFSSTLGNFLPTQGTALTDALGRA 435
T SA G + A V K+ VSF+ G + + +A T+ G+A
Sbjct: 561 TDFTADKTSAKADGTEAITYTATVKKNGVAQANVPVSFNIVSGTAVLSANSANTNGSGKA 620

Query: 436 SITLTAGSIEGAGEITATYG-----TAKAIIGFVTAGDEIDPVEASPEISFDIYDCNGVA 490
++TL + T A A+I I ++A NG
Sbjct: 621 TVTLKSDKPGQVVVSAKTAEMTSALNANAVIFVDQTKASITEIKADK----TTAVANGQD 676

Query: 491 AWDKALKNFEVCKTTDNITNDKPGIIGAKVTRSGSTQALQQVLISAATTIGAISPSSGTA 550
A +K + DKP + +VT + TT+G +S S+
Sbjct: 677 AITYTVKVMK---------GDKP-VSNQEVTFT--------------TTLGKLSNSTEK- 711

Query: 551 ITNAEGKAILDLYANGNVGAGEISLKVKDVTATKAFEIGRVNISLKLETSLGGNLLPAGG 610
T+ G A + L + G +S +V DV A ++ + ++ + G
Sbjct: 712 -TDTNGYAKVTLTST-TPGKSLVSARVSDV----AVDVKAPEVEFFTTLTIDDGNIEIVG 765

Query: 611 STI---LDVTVLNPDGS--LATGQPFTLVFTSECQASNKAIIDSPVITNGGKGYATYRST 665
+ + L L A+G + S A S +T KG T
Sbjct: 766 TGVKGKLPTVWLQYGQVNLKASGGNGKYTWRSANPAIASVDASSGQVTLKEKGTTTISVI 825

Query: 666 GCETQ 670
+ Q
Sbjct: 826 SSDNQ 830


25  Shewmr7_1704  Shewmr7_1711  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias     %GCBias  Product
Shewmr7_1704       5       30   -1.012305  flagellar biosynthesis sigma factor
Shewmr7_1705       7       37   -0.472327  response regulator receiver protein
Shewmr7_1706       7       39   -0.076748  chemotaxis phosphatase, CheZ
Shewmr7_1707       7       42   -0.198677  CheA signal transduction histidine kinases
Shewmr7_1708       7       46   -0.209348  chemotaxis-specific methylesterase
Shewmr7_1709       6       42   -0.135495  hypothetical protein
Shewmr7_1710       4       39   -0.070913  hypothetical protein
Shewmr7_1711       4       38   -1.249536  cobyrinic acid a,c-diamide synthase
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_1704  RTXTOXIND        41  6e-06    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 41.0 bits (96), Expect = 6e-06
Identities = 13/53 (24%), Positives = 29/53 (54%)

Query: 72 GLIEAINVEEGDRVQKGQILAVIDAKRQQYDLDRSEAEVKIIEQELNRLKKMS 124
+++ I V+EG+ V+KG +L + A + D ++++ + E R + +S
Sbjct: 105 SIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILS 157



Score = 39.8 bits (93), Expect = 1e-05
Identities = 35/202 (17%), Positives = 80/202 (39%), Gaps = 24/202 (11%)

Query: 91 LAVIDAKRQ----QYDLDRSEAEVKIIEQELNRLK---KMSNKEFIS--ADSMAKLEYNL 141
AV++ + + +L +++++ IE E+ K ++ + F + D + + N+
Sbjct: 252 HAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNI 311

Query: 142 QAAIAKRDLAELQVKESHVVSPINGIIAKRYVKAGNMAKEFGD-LFYIV-NQDELHGIVH 199
+ E + + S + +P++ + + V + L IV D L
Sbjct: 312 GLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTAL 371

Query: 200 LPEQQLTSLRLGQEAQV-FS--NQQSKNAIHAKVLRISP--VVDPQSGT-FKVTLAVP-- 251
+ + + + +GQ A + + KV I+ + D + G F V +++
Sbjct: 372 VQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNVIISIEEN 431

Query: 252 -----NQDAHLKAGMFTRVELK 268
N++ L +GM E+K
Sbjct: 432 CLSTGNKNIPLSSGMAVTAEIK 453


26  Shewmr7_1759  Shewmr7_1798  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias     %GCBias  Product
Shewmr7_1759      -1       17   -3.992130  hexapaptide repeat-containing transferase
Shewmr7_1760       0       17   -4.129829  hypothetical protein
Shewmr7_1761       2       29   -6.991407  putative lipoprotein
Shewmr7_1762       2       32   -8.611013  hypothetical protein
Shewmr7_1763       3       37  -10.625965  hypothetical protein
Shewmr7_1764       4       41  -11.484160  hypothetical protein
Shewmr7_1765       3       34   -8.477839  amidohydrolase
Shewmr7_1766       2       28   -7.042588  hypothetical protein
Shewmr7_1767       2       21   -4.216677  histone family protein nucleoid-structuring
Shewmr7_1768       2       20   -2.568090  electron transfer flavoprotein beta-subunit
Shewmr7_1769       5       23    1.127853  electron transfer flavoprotein subunit alpha
Shewmr7_1770       4       23    1.125919  Na+/H+ antiporter NhaC
Shewmr7_1771       2       24    1.056475  peptidyl-dipeptidase Dcp
Shewmr7_1772       2       20   -0.770914  hypothetical protein
Shewmr7_1773       0       16   -2.950617  hypothetical protein
Shewmr7_1774       2       21   -3.262366  hypothetical protein
Shewmr7_1775       2       21   -4.490297  thymidine kinase
Shewmr7_1776       2       20   -4.251824  hypothetical protein
Shewmr7_1777       3       20   -4.938443  two component, sigma54 specific, Fis family
Shewmr7_1778       2       19   -5.693727  periplasmic sensor signal transduction histidine
Shewmr7_1779       2       17   -5.057359  TRAP dicarboxylate transporter, DctM subunit
Shewmr7_1780       1       19   -5.632336  hypothetical protein
Shewmr7_1781       2       19   -6.176018  tripartite ATP-independent periplasmic
Shewmr7_1782       2       19   -6.168906  TRAP dicarboxylate transporter, DctP subunit
Shewmr7_1783       2       23   -6.555928  hypothetical protein
Shewmr7_1784       4       23   -6.116948  hypothetical protein
Shewmr7_1785       4       26   -6.484809  ******glutamyl-tRNA synthetase
Shewmr7_1786       5       26   -6.024591  *********ABC transporter-like protein
Shewmr7_1787       4       26   -4.891422  DNA-3-methyladenine glycosylase II /
Shewmr7_1788       4       30   -5.228040  methylated-DNA--protein-cysteine
Shewmr7_1789       4       30   -5.381551  DEAD/DEAH box helicase domain-containing
Shewmr7_1790       2       29   -5.190202  dual specificity protein phosphatase
Shewmr7_1791       3       29   -6.864122  phospholipase D/transphosphatidylase
Shewmr7_1792       4       28   -7.893150  response regulator receiver modulated CheW
Shewmr7_1793       5       29   -8.258731  serine/threonine transporter SstT
Shewmr7_1794       7       30   -8.602892  hypothetical protein
Shewmr7_1795       7       28   -9.024497  hypothetical protein
Shewmr7_1796       7       27   -8.381276  twin-arginine translocation pathway signal
Shewmr7_1797       6       29   -7.623074  hypothetical protein
Shewmr7_1798       2       22   -3.453655  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_1773  FLGPRINGFLGI     30  0.002    Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 29.9 bits (67), Expect = 0.002
Identities = 11/27 (40%), Positives = 18/27 (66%), Gaps = 1/27 (3%)

Query: 20 VIEKELANVDPKDANAITLNFRDPDYS 46
+IE+EL + KD+ + L R+PD+S
Sbjct: 178 IIERELPS-KFKDSVNLVLQLRNPDFS 203


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_1779  ARGREPRESSOR     31  0.005    Bacterial arginine repressor signature.
		>ARGREPRESSOR#Bacterial arginine repressor signature.

Length = 149

Score = 30.6 bits (69), Expect = 0.005
Identities = 15/46 (32%), Positives = 26/46 (56%), Gaps = 1/46 (2%)

Query: 14 VNTVHRQWMILKYLSRTAERKTTEELRNYLRSEGVNQTQRTIQRDL 59
+N R I + ++ E +T +EL + L+ +G N TQ T+ RD+
Sbjct: 1 MNKGQRHIKIREIITA-NEIETQDELVDILKKDGYNVTQATVSRDI 45


27  Shewmr7_1815  Shewmr7_1822  Y  N  N  Genomic Island
LocusTag      DNBias  CDNBias     %GCBias  Product
Shewmr7_1815       2       20    0.070751  peptidase S8/S53 subtilisin kexin sedolisin
Shewmr7_1816       2       18   -0.151254  hypothetical protein
Shewmr7_1817       3       17    0.038254  rhodanese domain-containing protein
Shewmr7_1818       3       16    0.668878  hypothetical protein
Shewmr7_1819       3       16    0.830523  hypothetical protein
Shewmr7_1820       2       17    1.633576  hypothetical protein
Shewmr7_1821       3       18    1.297671  acriflavin resistance protein
Shewmr7_1822       2       15    1.268041  RND family efflux transporter MFP subunit
28  Shewmr7_1851  Shewmr7_1862  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias     %GCBias  Product
Shewmr7_1851       2       20    3.173337  major facilitator transporter
Shewmr7_1852       1       16    1.346330  ATP-NAD/AcoX kinase
Shewmr7_1853       1       16    0.293564  hypothetical protein
Shewmr7_1854       0       18   -2.881905  tRNA/rRNA methyltransferase (SpoU)
Shewmr7_1855       0       21   -4.330289  FAD dependent oxidoreductase
Shewmr7_1856       1       17   -4.006720  3-oxoacyl-[acyl-carrier-protein] synthase I
Shewmr7_1857       1       15   -4.102090  D-isomer specific 2-hydroxyacid dehydrogenase,
Shewmr7_1860       2       15   -1.462990  aspartate semialdehyde dehydrogenase
Shewmr7_1861       4       17   -1.110115  hypothetical protein
Shewmr7_1862       3       18    0.754312  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_1856  HTHFIS           31  0.009    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 30.6 bits (69), Expect = 0.009
Identities = 9/43 (20%), Positives = 18/43 (41%)

Query: 104 DEINRASPKTQSALLEAMAEQQISVDGVTHRLPNPFFVIATQN 146
DEI Q+ LL + + + + G + + ++A N
Sbjct: 238 DEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATN 280


29  Shewmr7_1897  Shewmr7_1902  Y  N  N  Genomic Island
LocusTag      DNBias  CDNBias     %GCBias  Product
Shewmr7_1897       0       14   -3.154726  hypothetical protein
Shewmr7_1898       0       16   -4.991103  hypothetical protein
Shewmr7_1899       0       16   -5.054377  tryptophan synthase subunit alpha
Shewmr7_1900       2       17   -5.160658  tryptophan synthase subunit beta
Shewmr7_1901       1       15   -4.384869  bifunctional indole-3-glycerol phosphate
Shewmr7_1902      -1       14   -4.255883  anthranilate phosphoribosyltransferase
30  Shewmr7_2014  Shewmr7_2028  Y  N  N  Genomic Island
LocusTag      DNBias  CDNBias     %GCBias  Product
Shewmr7_2014       3       33   -3.423163  histone family protein DNA-binding protein
Shewmr7_2015       3       36   -2.894142  hypothetical protein
Shewmr7_2016       3       32   -2.582805  hypothetical protein
Shewmr7_2017       2       31   -2.178058  peptidase S8/S53 subtilisin kexin sedolisin
Shewmr7_2018       3       30   -2.688522  hypothetical protein
Shewmr7_2019       1       21   -2.345971  hypothetical protein
Shewmr7_2020       0       13   -0.934631  hypothetical protein
Shewmr7_2021       0       14   -1.191240  ECF subfamily RNA polymerase sigma-24 factor
Shewmr7_2022       5       18   -1.356696  serine/threonine protein kinase
Shewmr7_2023       4       17   -1.985681  ABC transporter-like protein
Shewmr7_2024       1       17   -2.186219  hypothetical protein
Shewmr7_2025       0       16   -2.221139  hypothetical protein
Shewmr7_2026      -1       21   -3.911934  hypothetical protein
Shewmr7_2028       0       21   -4.110283  hypothetical protein
31  Shewmr7_2049  Shewmr7_2075  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias     %GCBias  Product
Shewmr7_2049       3       28   -1.617920  uracil-xanthine permease
Shewmr7_2050       4       33   -2.373052  hypothetical protein
Shewmr7_2051       1       27   -2.064449  hypothetical protein
Shewmr7_2052       1       29   -1.546823  DNA replication initiation factor
Shewmr7_2053       0       26   -1.285389  metal dependent phosphohydrolase
Shewmr7_2054       2       27   -0.621607  prolyl 4-hydroxylase subunit alpha
Shewmr7_2055       0       16    0.165792  periplasmic sensor signal transduction histidine
Shewmr7_2056       0       15    0.921600  hypothetical protein
Shewmr7_2057       2       21    0.254701  two component transcriptional regulator
Shewmr7_2058       3       23   -0.014177  hypothetical protein
Shewmr7_2059       4       24    0.261641  sodium:dicarboxylate symporter
Shewmr7_2060       1       19   -0.349995  Ig domain-containing protein
Shewmr7_2061       2       18   -0.768469  hypothetical protein
Shewmr7_2062       3       20   -2.033311  succinylarginine dihydrolase
Shewmr7_2063      -1       12   -0.931293  DNA topoisomerase I
Shewmr7_2064      -3       14   -0.541238  hypothetical protein
Shewmr7_2065      -3       15   -0.661345  hypothetical protein
Shewmr7_2066      -3       15   -0.484406  transcriptional regulator CysB
Shewmr7_2067      -3       15   -0.135217  two component LuxR family transcriptional
Shewmr7_2068      -2       15   -0.109132  thioesterase superfamily protein
Shewmr7_2069      -1       16   -1.164482  phospho-2-dehydro-3-deoxyheptonate aldolase
Shewmr7_2070       1       24   -3.208255  hypothetical protein
Shewmr7_2071       1       24   -3.005712  FAD linked oxidase domain-containing protein
Shewmr7_2072       1       22   -2.773851  hypothetical protein
Shewmr7_2073       0       21   -2.316723  MarR family transcriptional regulator
Shewmr7_2074       1       19   -1.703662  peptidase S8/S53 subtilisin kexin sedolisin
Shewmr7_2075       2       23   -1.196151  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_2049  DHBDHDRGNASE     94  7e-25    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 94.0 bits (233), Expect = 7e-25
Identities = 73/258 (28%), Positives = 119/258 (46%), Gaps = 14/258 (5%)

Query: 10 QGKNVVVVGGTSGINLAIANAFALAGANVAVASRSQDKIDAAV--LQLKQSNPDGIHLGV 67
+GK + G GI A+A A GA++A + +K++ V L+ + + +
Sbjct: 7 EGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPA-- 64

Query: 68 SFDVRDLAAVERGFDTIASEFGFIDVLVSGAAGNFPATAAKLSANGFKAVMDIDLLGSFQ 127
DVRD AA++ I E G ID+LV+ A P LS ++A ++ G F
Sbjct: 65 --DVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFN 122

Query: 128 VLK-TAYPLLRRPQGNIIQISAPQASIAMPMQAHVCAAKAGVDMLTRTLAIEWGCEGIRI 186
+ + ++ R G+I+ + + A + A ++KA M T+ L +E IR
Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 187 NSIIPGPIANTEGFNRLAPSAALQQQVAQS-------VPLKRNGEGQDIANAAMFLGSEY 239
N + PG ++ A +Q + S +PLK+ + DIA+A +FL S
Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242

Query: 240 ASYITGVVLPVDGGWSLG 257
A +IT L VDGG +LG
Sbjct: 243 AGHITMHNLCVDGGATLG 260


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_2053  DNABINDINGHU    113  1e-36    Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 113 bits (284), Expect = 1e-36
Identities = 31/89 (34%), Positives = 56/89 (62%), Gaps = 1/89 (1%)

Query: 2 TKSELIEKLATRQSQLSAKEVEGAIKEMLEQMATTLESGDRIEIRGFGSFSLHYRAPRTG 61
K +LI K+A ++L+ K+ A+ + +++ L G+++++ GFG+F + RA R G
Sbjct: 3 NKQDLIAKVA-EATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKG 61

Query: 62 RNPKTGSSVELEGKYVPHFKPGKELRERV 90
RNP+TG ++++ VP FK GK L++ V
Sbjct: 62 RNPQTGEEIKIKASKVPAFKAGKALKDAV 90


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_2071  SYCDCHAPRONE     25  0.033    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 25.3 bits (55), Expect = 0.033
Identities = 7/34 (20%), Positives = 17/34 (50%)

Query: 1 MTDINQVIDQMPEEVYERLRSAAELGKWEDGTVL 34
+ +N++ E++Y + + GK+ED +
Sbjct: 25 IAMLNEISSDTLEQLYSLAFNQYQSGKYEDAHKV 58


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_2074  HTHFIS           92  9e-24    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 92.2 bits (229), Expect = 9e-24
Identities = 36/128 (28%), Positives = 63/128 (49%), Gaps = 7/128 (5%)

Query: 2 SRILLVDDDPLFRVWLTDALKTQGHEVECAINGIEGLKRIRSFMPDIIMLDLIMPQMDGF 61
+ IL+ DDD R L AL G++V N + I + D+++ D++MP + F
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 SLLE---ARDCMTPIMMLSARDNEEDRIRCYELGADDFLTKPFSIKELLVRLHALERRLI 118
LL P++++SA++ I+ E GA D+L KPF + EL+ + R +
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELI----GIIGRAL 119

Query: 119 SRPPEQMA 126
+ P + +
Sbjct: 120 AEPKRRPS 127


32  Shewmr7_2174  Shewmr7_2189  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias     %GCBias  Product
Shewmr7_2174       2       20   -0.126192  hypothetical protein
Shewmr7_2175       1       20    0.721378  glutamate dehydrogenase (NAD)
Shewmr7_2176       1       20    1.331063  dihydroorotate dehydrogenase 2
Shewmr7_2177       1       20    2.686818  hypothetical protein
Shewmr7_2178       1       19    3.067523  bifunctional acetaldehyde-CoA/alcohol
Shewmr7_2179       0       21    2.832829  hypothetical protein
Shewmr7_2180       0       20    2.619586  methyl-accepting chemotaxis sensory transducer
Shewmr7_2181       1       15    2.986737  hypothetical protein
Shewmr7_2182       1       16    2.771944  *phage integrase family protein
Shewmr7_2183       2       16    2.444710  hypothetical protein
Shewmr7_2184       2       15    1.894114  integrase catalytic subunit
Shewmr7_2185       1       15    1.825471  hypothetical protein
Shewmr7_2186      -1       13    1.508998  phage transcriptional regulator, AlpA
Shewmr7_2187       1       15   -0.038763  hypothetical protein
Shewmr7_2188       2       15    1.065929  hypothetical protein
Shewmr7_2189       2       15    0.993785  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_2189  V8PROTEASE       45  8e-08    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 45.4 bits (107), Expect = 8e-08
Identities = 26/205 (12%), Positives = 65/205 (31%), Gaps = 15/205 (7%)

Query: 19 IIRHDVDDAKYQAAAQSDNSTVSFLGLYKGDEIVLGTGSLIDKQWIVTAAHVANELMVGN 78
I+ ++ D + + V+++ + + +G ++ K ++T HV + G+
Sbjct: 70 ILPNN-DRHQITDTTNGHYAPVTYIQVEAPTGTFIASGVVVGKDTLLTNKHVV-DATHGD 127

Query: 79 KVQFKTDFYPIKDVIKHPLWKERHFPYDIALVQLASPIEDATLARL--NHASTETGKIAT 136
F + +P + + S D + + N + G++
Sbjct: 128 PHA-LKAFPSAINQDNYPNGG-----FTAEQITKYSGEGDLAIVKFSPNEQNKHIGEVVK 181

Query: 137 FVGRGDYGHGLVGVAGADKQLRAAHNAVVGVQEQWLQFIFDRDANALPLEGISGPGDSGG 196
+ + V + V + I A+ + + G+SG
Sbjct: 182 PATMSN--NAETQVNQNITVTGYPGDKPVATMWESKGKITYLKGEAMQYDLSTTGGNSGS 239

Query: 197 PAYLKSADSVCLIGVSSWQNAESIN 221
P + + + +IG+ N
Sbjct: 240 PVFNEKNE---VIGIHWGGVPNEFN 261


33  Shewmr7_2210  Shewmr7_2229  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias     %GCBias  Product
Shewmr7_2210      -1       12   -3.993866  hypothetical protein
Shewmr7_2211      -1       14   -4.814900  hypothetical protein
Shewmr7_2212       0       16   -6.818360  membrane-bound metal-dependent hydrolase
Shewmr7_2213       4       21   -9.198016  hypothetical protein
Shewmr7_2214       4       19   -9.390561  hypothetical protein
Shewmr7_2215       3       19   -8.989344  hypothetical protein
Shewmr7_2216       0       13   -4.298104  hypothetical protein
Shewmr7_2217       0       10   -2.497645  hypothetical protein
Shewmr7_2218      -1       17   -1.075009  phage integrase family protein
Shewmr7_2219      -2       13   -0.611617  hypothetical protein
Shewmr7_2220      -1       13   -0.949058  hypothetical protein
Shewmr7_2221      -1       13   -1.516670  hypothetical protein
Shewmr7_2222      -1       13   -1.860348  hypothetical protein
Shewmr7_2223      -1       14   -2.369319  resolvase domain-containing protein
Shewmr7_2224      -1       16   -3.516215  hypothetical protein
Shewmr7_2225      -1       21   -4.276516  L-serine ammonia-lyase
Shewmr7_2226       0       21   -3.457755  beta-hexosaminidase
Shewmr7_2227       0       23   -4.221417  hypothetical protein
Shewmr7_2228      -1       22   -3.379221  hypothetical protein
Shewmr7_2229      -2       24   -3.152421  acylphosphatase
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_2224  GPOSANCHOR       34  0.002    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 33.9 bits (77), Expect = 0.002
Identities = 20/68 (29%), Positives = 37/68 (54%), Gaps = 7/68 (10%)

Query: 594 ALEEAKIQ-QAIAEQEAIAAQAKAA--EEAALAKAKAEVEAEAERQRL----EQEEQMKA 646
ALEEA + A+ + ++K +E A +AK E EA+A +++L E+ +++A
Sbjct: 401 ALEEANSKLAALEKLNKELEESKKLTEKEKAELQAKLEAEAKALKEKLAKQAEELAKLRA 460

Query: 647 SEQSQPET 654
+ S +T
Sbjct: 461 GKASDSQT 468



Score = 32.0 bits (72), Expect = 0.010
Identities = 16/61 (26%), Positives = 29/61 (47%), Gaps = 4/61 (6%)

Query: 611 AAQAKAAEEAALAKAKAEVE-AEAERQRLEQEEQMKASEQSQPETGSQEAIATSD-ESLA 668
+ +AK EA K + + + +EA RQ L + + AS +++ + A S +L
Sbjct: 356 SREAKKQLEAEHQKLEEQNKISEASRQSLRR--DLDASREAKKQVEKALEEANSKLAALE 413

Query: 669 K 669
K
Sbjct: 414 K 414



Score = 31.6 bits (71), Expect = 0.010
Identities = 21/93 (22%), Positives = 34/93 (36%), Gaps = 23/93 (24%)

Query: 597 EAKIQQAIAEQEAIAAQAKAAEEAALAKAKAEVEA-EAERQRLEQ--------------- 640
A+ + + A+ KAA EA A + + + A RQ L +
Sbjct: 273 MNFSTADSAKIKTLEAE-KAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAE 331

Query: 641 ----EEQMKASEQSQPETGSQEAIATSDESLAK 669
EEQ K SE S+ + + S E+ +
Sbjct: 332 HQKLEEQNKISEASR--QSLRRDLDASREAKKQ 362


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_2225  HTHTETR          39  7e-06    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 38.8 bits (90), Expect = 7e-06
Identities = 13/75 (17%), Positives = 29/75 (38%)

Query: 21 WEQRRDYLTQVALRSLRGHKTFDLCRSHLVQVSQISKGTIYNHFTTEADLIVAVASAQYD 80
++ R ++ VALR + + + +++G IY HF ++DL +
Sbjct: 9 AQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSES 68

Query: 81 EWLCAAKQDALRYPD 95
+ ++P
Sbjct: 69 NIGELELEYQAKFPG 83


34  Shewmr7_2295  Shewmr7_2314  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias     %GCBias  Product
Shewmr7_2295      -1       20   -4.904902  hypothetical protein
Shewmr7_2296       0       31   -7.195666  sulfatase
Shewmr7_2297       0       29   -6.539536  diguanylate cyclase
Shewmr7_2298       0       29   -6.469768  MarR family transcriptional regulator
Shewmr7_2299       0       29   -6.836129  secretion protein HlyD family protein
Shewmr7_2300       0       29   -6.457684  EmrB/QacA family drug resistance transporter
Shewmr7_2301       0       26   -5.869145  CoA-binding domain-containing protein
Shewmr7_2302       1       24   -5.320955  methyl-accepting chemotaxis sensory transducer
Shewmr7_2303       2       26   -7.700587  anti-sigma-factor antagonist
Shewmr7_2304       3       28   -8.758961  response regulator receiver protein
Shewmr7_2305       3       29   -9.027382  response regulator receiver protein
Shewmr7_2307       4       27   -9.168706  CheA signal transduction histidine kinases
Shewmr7_2308       5       27   -7.711298  CheW protein
Shewmr7_2309       4       28   -9.526710  methyl-accepting chemotaxis sensory transducer
Shewmr7_2310       2       21   -7.887517  MCP methyltransferase, CheR-type
Shewmr7_2311       0       18   -6.779390  chemoreceptor glutamine deamidase CheD
Shewmr7_2312      -1       16   -6.196723  response regulator receiver modulated CheB
Shewmr7_2313      -1       16   -4.161120  response regulator receiver modulated
Shewmr7_2314      -1       12   -3.443104  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_2299  HTHFIS           66  7e-15    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 66.0 bits (161), Expect = 7e-15
Identities = 27/119 (22%), Positives = 50/119 (42%), Gaps = 2/119 (1%)

Query: 10 DKATILLIDDHPMLRNGVKQLIGMADNLCIVAEASCGKDGIILATQLDPDLILLDLNMPE 69
ATIL+ DD +R + Q + A + D DL++ D+ MP+
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSN--AATLWRWIAAGDGDLVVTDVVMPD 59

Query: 70 FNGLETLTKLRECELSSRIIVFTVSNYEGDIVNAFKYGVDGYLLKDMEPEDLLQSIQQA 128
N + L ++++ ++V + N + A + G YL K + +L+ I +A
Sbjct: 60 ENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_2301  TCRTETB          29  0.033    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 29.5 bits (66), Expect = 0.033
Identities = 17/56 (30%), Positives = 29/56 (51%), Gaps = 1/56 (1%)

Query: 126 NTTPFSIFVIIALLCGFGGANF-ASSMANISFFYPKDKQGSALGLNGGLGNLGVSV 180
+ FS+ ++ + G G A F A M ++ + PK+ +G A GL G + +G V
Sbjct: 99 GHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGV 154


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_2309  OMPADOMAIN       28  0.039    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 28.0 bits (62), Expect = 0.039
Identities = 23/90 (25%), Positives = 33/90 (36%), Gaps = 12/90 (13%)

Query: 89 LKLNVPFTDKTN-FAVSGGFLADLD----IGNEARDKEVNPYVKFNVDYNINTNWAV--- 140
KL P TD + + GG + D + + D V+P V+Y I A
Sbjct: 102 AKLGYPITDDLDIYTRLGGMVWRADTKSNVYGKNHDTGVSPVFAGGVEYAITPEIATRLE 161

Query: 141 ---VIGFNQSFSSDY-LDQYAVSLGVKYRF 166
+ + D +SLGV YRF
Sbjct: 162 YQWTNNIGDAHTIGTRPDNGMLSLGVSYRF 191


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_2312  PF00577          33  0.005    Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 33.3 bits (76), Expect = 0.005
Identities = 43/293 (14%), Positives = 84/293 (28%), Gaps = 54/293 (18%)

Query: 392 LGLGVTGSDL---GTITTAGLAFDLPWDLSLQST--------YNYASSGEIYFNTTLDLG 440
L + +T ++ G + ++ SL + Y Y++SG F T
Sbjct: 437 LSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSR 496

Query: 441 LLTTSFQEYKGGSDRNLASTLYGYGHFK---RAYIGSSFNFFDLGQARLSGA----YNFN 493
+ + + G T Y + + + + LSG+ + +
Sbjct: 497 MNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQTYWGTS 556

Query: 494 QSDVTSINYENYNLSFGLGRQINNYARIDFNLGYTTNSLEDGFELDKLDGTLSVTINLD- 552
D + + I++ L Y+ D++ L+V I
Sbjct: 557 NVDE------QFQAGLN-----TAFEDINWTLSYSLTKNAWQKGRDQMLA-LNVNIPFSH 604

Query: 553 ---PQDSISYRSTVHGYNRQ-----VSSIRNTLSASSLIDGE-NYSDYGEVSHVYNNVSE 603
+R Y+ + + + L D +YS + + S
Sbjct: 605 WLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSG 664

Query: 604 SSTRVDASYSGNYRNDYLHANGLVIADTQGKRGASLGLDSSQIFANGKGYVTA 656
S+ +Y G Y + G S D Q++ G V A
Sbjct: 665 STGYATLNYRGGY------------GNANI--GYSHSDDIKQLYYGVSGGVLA 703


35  Shewmr7_2342  Shewmr7_2348  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias     %GCBias  Product
Shewmr7_2342       2       17   -1.886254  electron transport complex protein RnfC
Shewmr7_2343       2       18   -1.695040  electron transport complex protein RnfB
Shewmr7_2344       2       18   -2.462347  electron transport complex protein RnfB
Shewmr7_2345       3       21   -3.135343  Na(+)-translocating NADH-quinone reductase
Shewmr7_2346       1       20   -3.921103  diguanylate cyclase/phosphodiesterase
Shewmr7_2347       3       21   -3.716608  *****hypothetical protein
Shewmr7_2348       2       21   -3.061872  excinuclease ABC subunit B
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_2344  HTHFIS           44  3e-07    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 44.4 bits (105), Expect = 3e-07
Identities = 19/130 (14%), Positives = 49/130 (37%), Gaps = 18/130 (13%)

Query: 180 MPGKKVLIVDDSSTARRQVRETLGQLGIEIIEASDGLQALHLLQKWRDEGKNVAQELLMM 239
M G +L+ DD + R + + L + G ++ S+ + ++
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWI---AAGDGDLV------ 51

Query: 240 ITDAEMPEMDGYKLTYEVRNDKAMAD-LFITLNTSLSGSFNNAMVE--KVGCDRFISK-F 295
+TD MP+ + + L ++ + L ++ + ++ + G ++ K F
Sbjct: 52 VTDVVMPDENAFDLLPRIKKARPDLPVLVMSAQNTF-----MTAIKASEKGAYDYLPKPF 106

Query: 296 QPDLLVEVVQ 305
L+ ++
Sbjct: 107 DLTELIGIIG 116


36  Shewmr7_2393  Shewmr7_2402  Y  N  Y  Genomic Island
LocusTag      DNBias  CDNBias     %GCBias  Product
Shewmr7_2393       1       21    3.857954  hypothetical protein
Shewmr7_2394       1       21    3.756966  glyceraldehyde-3-phosphate dehydrogenase
Shewmr7_2395       1       21    3.835475  ***aromatic amino acid aminotransferase
Shewmr7_2396       1       15    2.664359  hypothetical protein
Shewmr7_2397      -1       13    1.658068  bax protein
Shewmr7_2398      -1       14   -2.018596  hypothetical protein
Shewmr7_2399      -1       16   -2.920798  hypothetical protein
Shewmr7_2400      -3       19   -3.472369  hypothetical protein
Shewmr7_2401      -2       20   -3.574568  C32 tRNA thiolase
Shewmr7_2402       0       20   -3.509227  universal stress protein UspE
37  Shewmr7_2514  Shewmr7_2535  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias     %GCBias  Product
Shewmr7_2514       2       14   -0.954777  acyl-CoA dehydrogenase domain-containing
Shewmr7_2515       2       14   -0.530485  hypothetical protein
Shewmr7_2516       1       14   -0.245608  short chain dehydrogenase
Shewmr7_2517       1       13    0.211205  orotidine 5'-phosphate decarboxylase
Shewmr7_2518       0       13    0.568003  hypothetical protein
Shewmr7_2519       2       15    1.449699  hypothetical protein
Shewmr7_2520      -1       16    0.250023  hypothetical protein
Shewmr7_2521      -1       16    0.046374  integration host factor subunit beta
Shewmr7_2522      -2       16   -1.273783  30S ribosomal protein S1
Shewmr7_2523      -1       15   -2.838597  cytidylate kinase
Shewmr7_2524       0       16   -3.787106  3-phosphoshikimate 1-carboxyvinyltransferase
Shewmr7_2525       1       20   -3.498020  aromatic amino acid aminotransferase
Shewmr7_2526       1       22   -3.886335  phosphoserine aminotransferase
Shewmr7_2527       2       22   -4.065002  DNA gyrase subunit A
Shewmr7_2528       5       34   -5.034298  3-demethylubiquinone-9 3-methyltransferase
Shewmr7_2529       6       38   -4.950807  HAD family hydrolase
Shewmr7_2530       6       43   -6.227139  ribonucleotide-diphosphate reductase subunit
Shewmr7_2531       5       41   -6.601559  ribonucleotide-diphosphate reductase subunit
Shewmr7_2532       1       30   -5.241028  ferredoxin
Shewmr7_2533       0       24   -4.303823  hypothetical protein
Shewmr7_2534      -2       20   -3.843751  hypothetical protein
Shewmr7_2535      -2       18   -3.670635  integrase catalytic subunit
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits           Score  E-value  Comments
Shewmr7_2519  RTXTOXIND         45  2e-06    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 44.8 bits (106), Expect = 2e-06
Identities = 19/194 (9%), Positives = 50/194 (25%), Gaps = 30/194 (15%)

Query: 525 EDLPTDLQLQAAQDAEALALDNLNKARAEYRGLQKQLETQQHQANELATVLGDKVELSLE 584
L + Q + A + + R ++ + +E + E+
Sbjct: 128 TALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRL 187

Query: 585 SHSQTLERLMQQAKQADAAAQALQQLQQQIKTLQQQESTLAQQLELERE----------- 633
+ E+ Q L + + + T+ + + +E+
Sbjct: 188 TSLIK-EQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHK 246

Query: 634 ------RYREQEGKVERLSGQFAEKALRIPEEYRTLDVLNQAIAHNQQQLEQI-----KR 682
EQE K + ++ + + I +++ + +
Sbjct: 247 QAIAKHAVLEQENKYVEAVNELRVYKSQLEQ-------IESEILSAKEEYQLVTQLFKNE 299

Query: 683 QIDVLRTAQQQATE 696
+D LR
Sbjct: 300 ILDKLRQTTDNIGL 313



Score = 40.6 bits (95), Expect = 3e-05
Identities = 36/214 (16%), Positives = 72/214 (33%), Gaps = 25/214 (11%)

Query: 192 DTLKAKAADIRNLVKEQRARRDGILQTAALTSDDELTAELSRIEPEFAAATAAKEQSVAA 251
DTLK +++ ++ +++ R + L+ EL P+ E+ V
Sbjct: 135 DTLKTQSSLLQARLEQTRYQ--------ILSRSIELNKLPELKLPDEPYFQNVSEEEVLR 186

Query: 252 HLAALKQRDSAQQLFAEFTRLQELQAEALSLNEQQAQIATQTTRLEVAKQALRV------ 305
+ +K+ Q + + + L++++A+ T R+ + RV
Sbjct: 187 LTSLIKE-----QFSTWQNQKYQKELN---LDKKRAERLTVLARINRYENLSRVEKSRLD 238

Query: 306 --KPLLDNALSREQEASVAAAQRDSAQLTLDAAKLALSHAETAAQEIIPLEHKLREVEQQ 363
LL + + A L K L E+ E++L +
Sbjct: 239 DFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAK-EEYQLVTQLFK 297

Query: 364 NSHLSALVPQLAEFASLEQALAQAKEILQHTKLQ 397
N L L L LA+ +E Q + ++
Sbjct: 298 NEILDKLRQTTDNIGLLTLELAKNEERQQASVIR 331



Score = 30.2 bits (68), Expect = 0.044
Identities = 28/200 (14%), Positives = 68/200 (34%), Gaps = 16/200 (8%)

Query: 617 LQQQESTLAQQLELERERYREQEGKVERLSGQFAEKALRIPEEYRTLDVLNQAIAHNQQQ 676
+ +S+L Q LE+ RY+ +E + + + + + + + ++Q
Sbjct: 136 TLKTQSSLLQAR-LEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQ 194

Query: 677 LEQIKRQIDVLRTAQQQATEQSVAAQTALSAAIERCHATADLQAQAQQTLLTALDNAGFI 736
+ Q + + + A I R + ++ + ++L + I
Sbjct: 195 FSTWQNQKYQKELNLDKKRAERLTVL----ARINRYENLSRVEKS-RLDDFSSLLHKQAI 249

Query: 737 DRDALREALLTDEQMQTLAEGIETYHRQCALNQSQLSQLTTKLSESTLPDLDALEALLTE 796
+ A+ EQ E + + +SQL Q+ +++ + + E
Sbjct: 250 AKHAVL------EQENKYVE----AVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNE 299

Query: 797 RLAQLKTAEEVWSQLNTRLT 816
L +L+ + L L
Sbjct: 300 ILDKLRQTTDNIGLLTLELA 319


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits           Score  E-value  Comments
Shewmr7_2529  SYCDCHAPRONE      29  0.021    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 29.1 bits (65), Expect = 0.021
Identities = 11/51 (21%), Positives = 21/51 (41%)

Query: 198 YFNQKKYKKAVGVLEVMVPLFPDDGRLWVQLAQFYLMVEDYDKSLATYDLA 248
+ KY+ A V + + L D R ++ L + YD ++ +Y
Sbjct: 46 QYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYG 96


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits           Score  E-value  Comments
Shewmr7_2530  PF03544          108  2e-31    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 108 bits (272), Expect = 2e-31
Identities = 37/169 (21%), Positives = 64/169 (37%), Gaps = 10/169 (5%)

Query: 39 TPVIEITMDRQDSKAQNKPRVVPKPPPPPEQPQKPDTTPPDTSSNID----TSMSFNMGG 94
P E + K PKP P P+ P S N
Sbjct: 75 EPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAP 134

Query: 95 VEAGGHGTGGFKLGNMMTRDGDATPIVRIEPQYPIAAARDGKEGWVQLRFTINELGGVDD 154
+ + + R +PQYP A EG V+++F + G VD+
Sbjct: 135 ARPTSSTATAATSKPVTSVASGPRALSRNQPQYPARAQALRIEGQVKVKFDVTPDGRVDN 194

Query: 155 VEIINAEPKRLFDKEAIRALKKWKYKPKIVDGKPLKQPGMTVQLDFTLD 203
V+I++A+P +F++E A+++W+Y+P G+ V + F ++
Sbjct: 195 VQILSAKPANMFEREVKNAMRRWRYEP------GKPGSGIVVNILFKIN 237


38  Shewmr7_2555  Shewmr7_2562  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias     %GCBias  Product
Shewmr7_2555       2       16   -0.429784  hypothetical protein
Shewmr7_2556       2       18   -0.548242  hypothetical protein
Shewmr7_2557       3       21   -0.694985  gonadoliberin III-like protein
Shewmr7_2558       4       23   -1.110192  hypothetical protein
Shewmr7_2559       5       29   -1.283619  alpha-L-glutamate ligase-like protein
Shewmr7_2560       4       29   -0.986632  hypothetical protein
Shewmr7_2561       3       25   -0.726948  hypothetical protein
Shewmr7_2562       2       32   -1.440151  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits           Score  E-value  Comments
Shewmr7_2555  HTHFIS            31  0.010    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 30.6 bits (69), Expect = 0.010
Identities = 9/16 (56%), Positives = 14/16 (87%)

Query: 38 LVGESGSGRSLLARAI 53
+ GESG+G+ L+ARA+
Sbjct: 165 ITGESGTGKELVARAL 180


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits           Score  E-value  Comments
Shewmr7_2556  HTHFIS            29  0.021    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 29.0 bits (65), Expect = 0.021
Identities = 13/28 (46%), Positives = 16/28 (57%)

Query: 42 TLAIVGEAGSGKSTLARILVGAEPRSGG 69
TL I GE+G+GK +AR L R G
Sbjct: 162 TLMITGESGTGKELVARALHDYGKRRNG 189


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits           Score  E-value  Comments
Shewmr7_2560  DNABINDINGHU     119  4e-39    Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 119 bits (300), Expect = 4e-39
Identities = 53/88 (60%), Positives = 69/88 (78%)

Query: 2 NKSELIEKIASGADISKAAAGRALDSFIAAVTEGLKEGDKISLVGFGTFEVRERAERTGR 61
NK +LI K+A +++K + A+D+ +AV+ L +G+K+ L+GFG FEVRERA R GR
Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62

Query: 62 NPQTGEEIKIAAAKIPAFKAGKALKDAV 89
NPQTGEEIKI A+K+PAFKAGKALKDAV
Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAV 90


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits           Score  E-value  Comments
Shewmr7_2561  PF05272           33  0.006    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 32.7 bits (74), Expect = 0.006
Identities = 15/86 (17%), Positives = 35/86 (40%), Gaps = 6/86 (6%)

Query: 286 AEATVVRSYVDWMTSVPWSQRSKIKRDLAKAQEVLDTDHYGLEKVKDRILEYLAVQSRVR 345
A+ V + DW+ + W + ++++ L D+ +++ + V
Sbjct: 527 ADMNRVHPFRDWVKAQQWDEVPRLEKWLVHVLGKTPDDYKPRRLRYLQLVGKYILMGHVA 586

Query: 346 QLKGP------ILCLVGPPGVGKTSL 365
++ P + L G G+GK++L
Sbjct: 587 RVMEPGCKFDYSVVLEGTGGIGKSTL 612


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits           Score  E-value  Comments
Shewmr7_2562  HTHFIS            30  0.017    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 30.2 bits (68), Expect = 0.017
Identities = 14/70 (20%), Positives = 28/70 (40%), Gaps = 13/70 (18%)

Query: 64 KLPTPHELRAHLDDYVIGQDRAKKVLSVAVYNHYKRLRNSSPKDGVELGKSNILLIGPTG 123
+ P+ E + ++G+ A +Y RL + +++ G +G
Sbjct: 124 RRPSKLEDDSQDGMPLVGRSAA----MQEIYRVLARLMQT---------DLTLMITGESG 170

Query: 124 SGKTLLAETL 133
+GK L+A L
Sbjct: 171 TGKELVARAL 180


39  Shewmr7_2572  Shewmr7_2578  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias     %GCBias  Product
Shewmr7_2572       2       14   -0.183787  thiamine biosynthesis protein ThiC
Shewmr7_2573       4       19   -0.420049  hypothetical protein
Shewmr7_2574       5       19   -0.232388  hemolysin III family channel protein
Shewmr7_2575       5       21   -0.166279  NUDIX hydrolase
Shewmr7_2576       6       21    0.114045  hypothetical protein
Shewmr7_2577       6       24    0.034875  alpha-glucosidase
Shewmr7_2578       5       18    0.789707  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits           Score  E-value  Comments
Shewmr7_2572  TCRTETOQM         39  6e-05    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 39.1 bits (91), Expect = 6e-05
Identities = 37/148 (25%), Positives = 58/148 (39%), Gaps = 47/148 (31%)

Query: 14 NAGKSTLFNAL---TGANQQVG---------NW------SGVTVEKKTGHFTLNGADVYL 55
+AGK+TL +L +GA ++G + G+T++ F V +
Sbjct: 13 DAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITSFQWENTKVNI 72

Query: 56 TDLPGIYDLLPAGNSCDCSLDEQIAQQYLAEQRVDGIINLVDA-------TNIERHLYLT 108
D PG D +A+ Y + +DG I L+ A T I L
Sbjct: 73 IDTPGHMDF--------------LAEVYRSLSVLDGAILLISAKDGVQAQTRI-----LF 113

Query: 109 AQLRELSIPMVVVLNKIDAAIKRGIRVD 136
LR++ IP + +NKID GI +
Sbjct: 114 HALRKMGIPTIFFINKIDQN---GIDLS 138


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits           Score  E-value  Comments
Shewmr7_2576  INTIMIN           34  0.003    Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 33.5 bits (76), Expect = 0.003
Identities = 23/114 (20%), Positives = 40/114 (35%), Gaps = 13/114 (11%)

Query: 325 PSISSASMDANGTVTVAVTLSNPSTGTV--------YSDSADKLKFISDLRVYAN----W 372
S S+ D NG V +T + P V A +++F + L +
Sbjct: 705 LSNSTEKTDTNGYAKVTLTSTTPGKSLVSARVSDVAVDVKAPEVEFFTTLTIDDGNIEIV 764

Query: 373 GTSFDYSTRSARSIRLPESTPVSGSNGTYTYTISGLTVPAGTEADHGGLAIQGR 426
GT + + SG NG YT+ + A +A G + ++ +
Sbjct: 765 GTGVKGKLPTVWLQYGQVNLKASGGNGKYTWRSAN-PAIASVDASSGQVTLKEK 817



Score = 31.2 bits (70), Expect = 0.014
Identities = 22/81 (27%), Positives = 34/81 (41%), Gaps = 5/81 (6%)

Query: 325 PSISSASMDANGTVTVAVTLSNPSTGTVYSDSADKLKFISDLRVYANWGTSFDYSTRSAR 384
S +SA+ + +G TV + P V SA + S L AN D + S
Sbjct: 607 LSANSANTNGSGKATVTLKSDKPGQVVV---SAKTAEMTSALN--ANAVIFVDQTKASIT 661

Query: 385 SIRLPESTPVSGSNGTYTYTI 405
I+ ++T V+ TYT+
Sbjct: 662 EIKADKTTAVANGQDAITYTV 682


40  Shewmr7_2589  Shewmr7_2598  Y  N  N  Genomic Island
LocusTag      DNBias  CDNBias     %GCBias  Product
Shewmr7_2589      -1       19   -3.180360  hypothetical protein
Shewmr7_2590       1       19    0.773989  hypothetical protein
Shewmr7_2591       1       19    0.878855  hypothetical protein
Shewmr7_2592       3       18    1.161591  hypothetical protein
Shewmr7_2593       2       17    2.668740  PqiA family integral membrane protein
Shewmr7_2594       1       17    3.308218  RND efflux system outer membrane lipoprotein
Shewmr7_2595       1       18    2.823479  acriflavin resistance protein
Shewmr7_2596       1       18    2.776179  RND family efflux transporter MFP subunit
Shewmr7_2597       1       15    3.649068  hypothetical protein
Shewmr7_2598       0       15    3.728799  LysR family transcriptional regulator
41  Shewmr7_2636  Shewmr7_2642  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias     %GCBias  Product
Shewmr7_2636      -1       14    3.369755  glucan biosynthesis protein G
Shewmr7_2637      -2       16    4.840415  periplasmic sensor signal transduction histidine
Shewmr7_2638      -1       26    5.709940  hypothetical protein
Shewmr7_2639       1       26    5.745376  two component transcriptional regulator
Shewmr7_2640       0       25    5.648412  ApbE family lipoprotein
Shewmr7_2641       1       23    6.120723  hypothetical protein
Shewmr7_2642       2       18    3.149001  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits           Score  E-value  Comments
Shewmr7_2637  SUBTILISIN       143  8e-41    Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 143 bits (362), Expect = 8e-41
Identities = 71/210 (33%), Positives = 98/210 (46%), Gaps = 24/210 (11%)

Query: 126 AGMKVCIIDSGLDSSNPDFNWNNITG----DNDPGTGNWFQNGGPHGTHVAGTIGAADNN 181
G+KV ++D+G D+ +PD I G D+D G F++ HGTHVAGTI A +N
Sbjct: 41 RGVKVAVLDTGCDADHPDLKARIIGGRNFTDDDEGDPEIFKDYNGHGTHVAGTIAATENE 100

Query: 182 IGVVGMAPGVPMHIVKVFNASGWGYSSDLAYAANKCSNAGAKIISMSLGGGAANNTEKNA 241
GVVG+AP + I+KV N G G + IISMSLGG A
Sbjct: 101 NGVVGVAPEADLLIIKVLNKQGSGQYDWIIQGIYYAIEQKVDIISMSLGGPEDVPELHEA 160

Query: 242 FDAFTAAGGLVVAAAGNDGNSVRS-----YPAGYPSVMMIGANDANNNIADFSQYPSCVS 296
A+ LV+ AAGN+G+ YP Y V+ +GA + + + ++FS
Sbjct: 161 VKKAVASQILVMCAAGNEGDGDDRTDELGYPGCYNEVISVGAINFDRHASEFSNS----- 215

Query: 297 GRGKKAVNDDGICVEVTAGGVDTLSTYPAG 326
V++ A G D LST P G
Sbjct: 216 ----------NNEVDLVAPGEDILSTVPGG 235



Score = 53.3 bits (128), Expect = 8e-10
Identities = 19/70 (27%), Positives = 27/70 (38%), Gaps = 7/70 (10%)

Query: 447 YGFMSGTSMATPAVSGMAALVWSN-----HSQCTGTQIRKALKATAMDAGTVGKDNYFGY 501
Y SGTSMATP V+G AL+ T ++ L + G G
Sbjct: 237 YATFSGTSMATPHVAGALALIKQLANASFERDLTEPELYAQLIKRTIPLG--NSPKMEGN 294

Query: 502 GIVNAKAADA 511
G++ A +
Sbjct: 295 GLLYLTAVEE 304


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits           Score  E-value  Comments
Shewmr7_2640  ABC2TRNSPORT      40  1e-05    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 39.5 bits (92), Expect = 1e-05
Identities = 44/166 (26%), Positives = 78/166 (46%), Gaps = 23/166 (13%)

Query: 186 GVILTMTMVMFT----SAAIVREREQGNMEFLITTPVRPLELMLGKI----TPYVLVGFV 237
G++ T M T AA R Q E ++ T +R +++LG++ T L G
Sbjct: 72 GMVATSAMTAATFETIYAAFGRMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGAG 131

Query: 238 QLAIILTAGH-----LLFAVPIRGGLDSIALAAMLFICASLTLGLVISTIAKTQLQSMQM 292
+ G+ LL+A+P+ IAL + F +LG+V++ +A + +
Sbjct: 132 IGVVAAALGYTQWLSLLYALPV------IALTGLAFA----SLGMVVTALAPSYDYFIFY 181

Query: 293 TVFILLPSILLSGFMFPYEAMPVAAQWIAEALPATHFMRMSRAIVL 338
++ P + LSG +FP + +P+ Q A LP +H + + R I+L
Sbjct: 182 QTLVITPILFLSGAVFPVDQLPIVFQTAARFLPLSHSIDLIRPIML 227


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits           Score  E-value  Comments
Shewmr7_2642  RTXTOXIND         59  6e-12    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 59.5 bits (144), Expect = 6e-12
Identities = 36/180 (20%), Positives = 61/180 (33%), Gaps = 11/180 (6%)

Query: 17 PSPRGWGKLLASLLGAALLLQLTACGDESPRVLGTV--ERDRLTLTAPVGELIKRVNVVE 74
PR + L A +L + + G + + ++K + V E
Sbjct: 55 RRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKE 114

Query: 75 GQQVQAGEVLLELDSTAAQARLGQRQAELKQA-------QAKLDEAVTGARSEDIDKARA 127
G+ V+ G+VLL+L + A+A + Q+ L QA Q E
Sbjct: 115 GESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEP 174

Query: 128 ALDGANASVKEARQNFERTQ--QLFKTKVLSQADLDAARAARDTSLAKQAEAEQSLRLLQ 185
+ + + Q K + +LD RA R T LA+ E R+ +
Sbjct: 175 YFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEK 234



Score = 49.8 bits (119), Expect = 6e-09
Identities = 30/242 (12%), Positives = 77/242 (31%), Gaps = 35/242 (14%)

Query: 73 VEGQQVQAGEVLLELDSTAAQARLGQRQAELKQAQAKLDEAVTGARSEDIDKARAALDGA 132
V ++V L++ + Q + Q++ L + +A + A ++
Sbjct: 179 VSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRA------------ERLTVLARINRY 226

Query: 133 NASVKEARQNFERTQQLFKTKVLS--------------QADLDAARAARDTSLAKQAEAE 178
+ + + L + ++ +L ++ + ++ A+
Sbjct: 227 ENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAK 286

Query: 179 QSLRLLQNGTRSEQLEQARAAVEAAMAGVAQEQKALKDLSLVAAK-PA---VVDTLPWRV 234
+ +L+ ++E L++ R + + K + + P V
Sbjct: 287 EEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTE 346

Query: 235 GDRVAAGSQLIGLLAIEHPY-VRVYLPATWLDRVKAGSQVKILVDG----RTQPIAGTVR 289
G V L+ ++ + V + + + G I V+ R + G V+
Sbjct: 347 GGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVK 406

Query: 290 NI 291
NI
Sbjct: 407 NI 408


42  Shewmr7_2654  Shewmr7_2661  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias     %GCBias  Product
Shewmr7_2654       0       18    3.039521  hydrogenase assembly chaperone hypC/hupF
Shewmr7_2655      -1       14    1.868371  hydrogenase expression/formation protein HypD
Shewmr7_2656       0       13    2.446074  hydrogenase expression/formation protein HypE
Shewmr7_2657      -2       14    3.030819  hydrogenase nickel insertion protein HypA
Shewmr7_2658      -2       15    3.393971  lipid A biosynthesis (KDO)2-(lauroyl)-lipid IVA
Shewmr7_2659      -2       15    3.677020  lipid A biosynthesis (KDO)2-(lauroyl)-lipid IVA
Shewmr7_2660      -2       15    3.163070  integration host factor subunit alpha
Shewmr7_2661      -2       15    3.089808  phenylalanyl-tRNA synthetase subunit beta
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits           Score  E-value  Comments
Shewmr7_2655  FbpA_PF05833      27  0.029    Fibronectin-binding protein
		>FbpA_PF05833#Fibronectin-binding protein

Length = 577

Score = 26.8 bits (59), Expect = 0.029
Identities = 5/42 (11%), Positives = 16/42 (38%)

Query: 75 NAEDVKQLTRNHLAYVEEQISKLQNLRSQLQQMVSECQGGEQ 116
++ +K + + V I++ L + +C+ +
Sbjct: 293 KSDRLKSKSSDLQKIVMNNINRCTKKDKILNNTLKKCEDKDI 334


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits           Score  E-value  Comments
Shewmr7_2658  DHBDHDRGNASE     107  5e-30    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 107 bits (267), Expect = 5e-30
Identities = 74/263 (28%), Positives = 126/263 (47%), Gaps = 22/263 (8%)

Query: 3 LKDKVVVITGGAGGLGLAMAHNFAQAGAKLALIDVDQEKLERACADLGS-ATEVQGYALD 61
++ K+ ITG A G+G A+A A GA +A +D + EKLE+ + L + A + + D
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 62 ITDEEDVVAGFAYILEDFGKINVLVNNAGILRDGMLIKAKDGKVTDRMSFDQFQSVINVN 121
+ D + A I + G I++LVN AG+LR G+ +S +++++ +VN
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGL---------IHSLSDEEWEATFSVN 116

Query: 122 LTGTFLCGREAAAAMIESGQAGVIVNISSLAKAGNVGQSNYAASKAGVAAMSVGWAKELA 181
TG F R + M++ ++ S+ A + YA+SKA + ELA
Sbjct: 117 STGVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELA 176

Query: 182 RYNIRSAAVAPGVIATEMTAAMKPE----------ALERLEKLVPVGRLGQAEEIASTVR 231
YNIR V+PG T+M ++ + +LE + +P+ +L + +IA V
Sbjct: 177 EYNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVL 236

Query: 232 FIIEND--YVNGRVFEVDGGIRL 252
F++ ++ VDGG L
Sbjct: 237 FLVSGQAGHITMHNLCVDGGATL 259


43  Shewmr7_2729  Shewmr7_2751  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias     %GCBias  Product
Shewmr7_2729       2       14    2.864970  glutamine amidotransferase, class-II
Shewmr7_2730       2       13    3.189765  acyl-CoA dehydrogenase
Shewmr7_2731       2       13    3.313817  acyl-CoA dehydrogenase
Shewmr7_2732       1       13    3.168483  sodium/hydrogen exchanger
Shewmr7_2733       1       13    2.983913  TetR family transcriptional regulator
Shewmr7_2734       1       13    2.151501  LysR family transcriptional regulator
Shewmr7_2735       1       13    2.332337  hypothetical protein
Shewmr7_2736       0       15    0.530395  hypothetical protein
Shewmr7_2737       2       15   -0.411328  hypothetical protein
Shewmr7_2738       1       17    0.326881  hypothetical protein
Shewmr7_2739      -1       17    0.856741  tetraheme cytochrome c
Shewmr7_2740      -2       18    0.887912  hypothetical protein
Shewmr7_2741      -2       18    0.778697  flavocytochrome c
Shewmr7_2742      -2       17    0.182960  hypothetical protein
Shewmr7_2743      -1       18   -1.957709  alcohol dehydrogenase
Shewmr7_2744       1       19   -3.364787  diguanylate cyclase/phosphodiesterase with
Shewmr7_2745       1       19   -3.413552  **hypothetical protein
Shewmr7_2746       2       18   -4.487421  DNA polymerase III subunit epsilon
Shewmr7_2747       1       15   -4.770645  ribonuclease H
Shewmr7_2748       1       14   -4.486469  LysR family transcriptional regulator
Shewmr7_2749       1       14   -4.086270  type 11 methyltransferase
Shewmr7_2750       0       13   -3.416436  hydroxyacylglutathione hydrolase
Shewmr7_2751       0       14   -3.219430  MltD domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits           Score  E-value  Comments
Shewmr7_2733  RTXTOXIND         35  0.004    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 35.2 bits (81), Expect = 0.004
Identities = 25/142 (17%), Positives = 46/142 (32%), Gaps = 7/142 (4%)

Query: 1070 HQQFLVIPQQYGDTVSALMAEQAKMASLGIAIPESLQRSMELFHQHQAQTLKSHAEFMQL 1129
L +Y + V+ L ++++ + I + + + + + L +
Sbjct: 251 KHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQ-TTD 309

Query: 1130 QTSSSQAALALLGQMPAPQVQASTQAAPVAVTAATPVAPAQAPVVQALAAEPKAAVVPMS 1189
LA + + QAS APV+V + VV AE +VP
Sbjct: 310 NIGLLTLELAKNEE----RQQASVIRAPVSVKVQQLKVHTEGGVVTT--AETLMVIVPED 363

Query: 1190 EPQVQQPQVQQPQVAQPQVAQP 1211
+ VQ + V Q
Sbjct: 364 DTLEVTALVQNKDIGFINVGQN 385


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits           Score  E-value  Comments
Shewmr7_2742  HTHFIS           371  e-127    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 371 bits (953), Expect = e-127
Identities = 138/374 (36%), Positives = 198/374 (52%), Gaps = 35/374 (9%)

Query: 103 KPIDMSLLCETLADFAQHLVANTSTRIRPFASELDQYGLLVGSSLPMHRLYRTIRRVSAA 162
KP D+ L +A R + LVG S M +YR + R+
Sbjct: 104 KPFDL----TELIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQT 159

Query: 163 ESNVLIIGESGAGKELVANTIHLASPRVNKPYIAINCGALSPELVDSELFGHVKGSFTGA 222
+ ++I GESG GKELVA +H R N P++AIN A+ +L++SELFGH KG+FTGA
Sbjct: 160 DLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGA 219

Query: 223 NRDHQGVFEQAEGGTLFLDEVTEMPLEHQVKLLRVLENNEYRPVGSPKVLKANVRIVAAT 282
G FEQAEGGTLFLDE+ +MP++ Q +LLRVL+ EY VG ++++VRIVAAT
Sbjct: 220 QTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAAT 279

Query: 283 NRDPLVAIEQGQLREDLYFRLAHFPIQVPPLRERGEDIVGLAKHFLAYRNAAEKQSKVFS 342
N+D +I QG REDLY+RL P+++PPLR+R EDI L +HF+ K F
Sbjct: 280 NKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLDVKRFD 339

Query: 343 PSSLEAIAAHTWPGNVRELKHAIERAYILADHE-ITPEHLQ-----------LTPSLDKE 390
+LE + AH WPGNVREL++ + R L + IT E ++ + + +
Sbjct: 340 QEALELMKAHPWPGNVRELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARS 399

Query: 391 ATAEENVVIPQGMR-------------------LEELEKIAIYQALETSQGNKTDTAEQL 431
+ + + + MR L E+E I AL ++GN+ A+ L
Sbjct: 400 GSLSISQAVEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLL 459

Query: 432 GISVKTLYNKLSKY 445
G++ TL K+ +
Sbjct: 460 GLNRNTLRKKIREL 473


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits           Score  E-value  Comments
Shewmr7_2749  NEISSPPORIN       58  2e-11    Neisseria sp. porin signature.
		>NEISSPPORIN#Neisseria sp. porin signature.

Length = 348

Score = 57.7 bits (139), Expect = 2e-11
Identities = 44/192 (22%), Positives = 86/192 (44%), Gaps = 16/192 (8%)

Query: 26 MKKEILTASVFAISALPVLADESPSVYGRLDLSVTHSELSSTVYSGTSGVKVGESGTYLE 85
MKK ++ ++ALPV A ++YG + V S V+ +G+ +
Sbjct: 1 MKKSLIA---LTLAALPVAAMADVTLYGAIKAGVQTYRSVEHTDGKVSKVE---TGSEIA 54

Query: 86 NNSSNIGVKGKSAISDGINVVYKMEFGVNNTSNRANDSSKVFSARNTYLGVETAYGTLLV 145
+ S IG KG+ + +G+ V+++E S ++ + + +++G++ +GT+
Sbjct: 55 DFGSKIGFKGQEDLGNGLKAVWQLE---QGASVAGTNTG--WGNKQSFVGLKGGFGTIRA 109

Query: 146 GRNDTVFKTAEGKVDIFGTTNADINQL-VSGQTRSAD---GVWYYSPKLFGLMDINATYL 201
G ++ K V+ + + N L +SG + V Y SP+ G + Y
Sbjct: 110 GSLNSPLKNTGANVNAWESGKFTGNVLEISGMAQREHRYLSVRYDSPEFAGFSG-SVQYA 168

Query: 202 LQDNYGADNELY 213
+DN G++ E Y
Sbjct: 169 PKDNSGSNGESY 180


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits           Score  E-value  Comments
Shewmr7_2750  TCRTETB           35  5e-04    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 35.2 bits (81), Expect = 5e-04
Identities = 23/115 (20%), Positives = 48/115 (41%), Gaps = 7/115 (6%)

Query: 75 AFFFTYAIGKFSNGFLADYANIGRFMSVSLMLSSITCMAMGMGVAGLFFVILWGMNGWFQ 134
AF T++IG G L+D I R + ++++ + +G + +I M + Q
Sbjct: 57 AFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLI---MARFIQ 113

Query: 135 SVGSAP----SCVSIFQWYSPKQRGSVYSVWGGSRNIGEAISWILTASLVSFFGW 185
G+A V + ++ + RG + + G +GE + + + + W
Sbjct: 114 GAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHW 168


44  Shewmr7_2848  Shewmr7_2855  Y  N  Y  Genomic Island
LocusTag      DNBias  CDNBias     %GCBias  Product
Shewmr7_2848      -2       20   -3.134993  *diguanylate cyclase
Shewmr7_2849       0       21   -2.126644  hypothetical protein
Shewmr7_2850      -1       14   -2.318721  hypothetical protein
Shewmr7_2851      -1       16   -3.671470  hypothetical protein
Shewmr7_2852       0       18   -3.893880  NAD synthetase
Shewmr7_2853      -1       22   -5.271796  inosine-guanosine kinase
Shewmr7_2854      -1       20   -3.897847  ferrochelatase
Shewmr7_2855       0       19   -3.591766  adenylate kinase
45  Shewmr7_2916  Shewmr7_2929  Y  N  N  Genomic Island
LocusTag      DNBias  CDNBias     %GCBias  Product
Shewmr7_2916       2       18    0.055785  hypothetical protein
Shewmr7_2917       2       18   -0.105741  peptidyl-dipeptidase Dcp
Shewmr7_2918       0       13   -0.354246  hypothetical protein
Shewmr7_2919      -2       15   -4.776858  GCN5-related N-acetyltransferase
Shewmr7_2920      -2       15   -4.990408  nitroreductase
Shewmr7_2921      -1       12   -4.580015  DoxX family protein
Shewmr7_2922      -1       15   -5.415260  hypothetical protein
Shewmr7_2923       0       14   -4.961768  hypothetical protein
Shewmr7_2924       0       15   -4.506038  aminoglycoside phosphotransferase
Shewmr7_2925       2       14    0.352250  nicotinamide mononucleotide transporter PnuC
Shewmr7_2926       1       13    0.775134  hypothetical protein
Shewmr7_2927       1       16   -0.020448  TonB-dependent receptor
Shewmr7_2928       1       16   -0.268265  hypothetical protein
Shewmr7_2929       2       16   -0.894703  hypothetical protein
46  Shewmr7_3164  Shewmr7_3188  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias     %GCBias  Product
Shewmr7_3164       3       26   -1.108027  cytochrome C family protein
Shewmr7_3165       5       26   -1.800231  hypothetical protein
Shewmr7_3166       5       28   -1.627670  outer membrane protein
Shewmr7_3167       5       28   -1.731946  hypothetical protein
Shewmr7_3168       2       25   -1.220392  decaheme cytochrome c MtrF
Shewmr7_3169      -1       24   -0.825245  hypothetical protein
Shewmr7_3170       0       21    0.412429  decaheme cytochrome c
Shewmr7_3171       2       21    1.713793  hypothetical protein
Shewmr7_3172       1       19    1.989311  decaheme cytochrome c
Shewmr7_3173       1       19    2.797477  hypothetical protein
Shewmr7_3174       0       19    2.932495  cytochrome C family protein
Shewmr7_3175       1       18    3.328015  hypothetical protein
Shewmr7_3176       2       20    3.669180  outer membrane protein precursor MtrB
Shewmr7_3177       2       22    3.675292  hypothetical protein
Shewmr7_3178       1       24    4.719167  hypothetical protein
Shewmr7_3179      -1       21    4.639363  hypothetical protein
Shewmr7_3180       0       16    2.619524  phage SPO1 DNA polymerase domain-containing
Shewmr7_3181       2       14    0.895385  vibriolysin
Shewmr7_3182       2       12   -0.053728  hypothetical protein
Shewmr7_3183       2       12   -1.419633  CdaR family transcriptional regulator
Shewmr7_3184       3       16   -5.309655  catalase domain-containing protein
Shewmr7_3185       3       38  -10.388160  hypothetical protein
Shewmr7_3186       3       21   -5.430461  gluconate transporter
Shewmr7_3187       2       17   -3.787215  hypothetical protein
Shewmr7_3188       2       15   -2.757040  glycerate kinase
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits           Score  E-value  Comments
Shewmr7_3167  BCTERIALGSPG      33  4e-04    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 32.6 bits (74), Expect = 4e-04
Identities = 12/46 (26%), Positives = 23/46 (50%), Gaps = 2/46 (4%)

Query: 12 QTGFTLIELMISLT-LGLVVMLGASQIFVSVNKAYVETQRFSQLQG 56
Q GFTL+E+M+ + +G++ L + + KA + S +
Sbjct: 7 QRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAV-SDIVA 51


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits           Score  E-value  Comments
Shewmr7_3168  BCTERIALGSPG      30  0.002    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 29.9 bits (67), Expect = 0.002
Identities = 13/23 (56%), Positives = 17/23 (73%), Gaps = 2/23 (8%)

Query: 4 RKQKGFSLIEIMVTSFIVAFGIL 26
KQ+GF+L+EIMV IV G+L
Sbjct: 5 DKQRGFTLLEIMVV--IVIIGVL 25


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits           Score  E-value  Comments
Shewmr7_3169  BCTERIALGSPG      38  2e-06    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 38.3 bits (89), Expect = 2e-06
Identities = 17/51 (33%), Positives = 30/51 (58%)

Query: 3 TKKILGFTLTELMVVVAIVAIIAGIAAPSFASMIRENTARTQVNELLALTN 53
T K GFTL E+MVV+ I+ ++A + P+ + + V++++AL N
Sbjct: 4 TDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALEN 54


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits           Score  E-value  Comments
Shewmr7_3170  BCTERIALGSPG      42  9e-08    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 41.8 bits (98), Expect = 9e-08
Identities = 18/55 (32%), Positives = 29/55 (52%), Gaps = 3/55 (5%)

Query: 3 LSKIKVNTGFTLIELMIAIAIVGILASIALPSYQEHVRNTRRTDARD---ALSNA 54
+ GFTL+E+M+ I I+G+LAS+ +P+ + + A AL NA
Sbjct: 1 MRATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENA 55


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits           Score  E-value  Comments
Shewmr7_3174  FERRIBNDNGPP      38  2e-05    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 38.4 bits (89), Expect = 2e-05
Identities = 46/196 (23%), Positives = 74/196 (37%), Gaps = 19/196 (9%)

Query: 4 RRFI-ALGLSLALLPI---AAMAEPAKRIIALSPHAVEMLYAIGAGESIVAATDYADY-- 57
RR + A+ LS L + A A RI+AL VE+L A+G D +Y
Sbjct: 10 RRLLTAMALSPLLWQMNTAHAAAIDPNRIVALEWLPVELLLALGI--VPYGVADTINYRL 67

Query: 58 ----PEAAKKIPSIGGYYGIQIERVLELNPDLIVVWDTGNKA--EDINQL-KSLGFKLYS 110
P + +G +E + E+ P + VW G E + ++ GF +S
Sbjct: 68 WVSEPPLPDSVIDVGLRTEPNLELLTEMKPSFM-VWSAGYGPSPEMLARIAPGRGFN-FS 125

Query: 111 SSPKMLEDVAKEIEELGALTGRTEQASQVAADYRNQLLQLRSENAAKSE-PKVFYQLWST 169
+ L K + E+ L A A Y + + ++ + P + L
Sbjct: 126 DGKQPLAMARKSLTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDP 185

Query: 170 PLMTV-AKNSWIQQII 184
M V NS Q+I+
Sbjct: 186 RHMLVFGPNSLFQEIL 201


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits           Score  E-value  Comments
Shewmr7_3182  PF05272           30  0.017    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.7 bits (66), Expect = 0.017
Identities = 10/58 (17%), Positives = 22/58 (37%), Gaps = 4/58 (6%)

Query: 14 ATSANMALQVSQLSWAIEGKTILSEISFALPKG----EMLGLIGPNGAGKSSLLRCLY 67
T + + + + ++ ++ + G + L G G GKS+L+ L
Sbjct: 560 KTPDDYKPRRLRYLQLVGKYILMGHVARVMEPGCKFDYSVVLEGTGGIGKSTLINTLV 617


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits           Score  E-value  Comments
Shewmr7_3184  BCTERIALGSPD      32  0.022    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 31.8 bits (72), Expect = 0.022
Identities = 14/71 (19%), Positives = 31/71 (43%), Gaps = 5/71 (7%)

Query: 354 SGLEPLTIDAQTLFVNVGERTN---VTGSAKFLKLIKEGKFEQALDVAREQVESGAQIID 410
+P+ + + + +TN VT + + ++ + LD+ R QV A I +
Sbjct: 298 QAAKPVAALDKNIIIKAHGQTNALIVTAAPDVMNDLE--RVIAQLDIRRPQVLVEAIIAE 355

Query: 411 INMDEGMLDGV 421
+ +G+ G+
Sbjct: 356 VQDADGLNLGI 366


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits           Score  E-value  Comments
Shewmr7_3187  CHANLCOLICIN      31  0.001    Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 30.8 bits (69), Expect = 0.001
Identities = 21/74 (28%), Positives = 33/74 (44%), Gaps = 5/74 (6%)

Query: 33 LTQGLNEFAMQQKQTELARQQAAKERKIIEYQQQQIAMQQAAEQQRIAQQNEAARIRKAE 92
LTQ L + + + +R +A E AMQ E+ R+A+ E AR ++AE
Sbjct: 90 LTQRLKDIVNEALRHNASRTPSATELAHANNA----AMQAEDERLRLAKAEEKAR-KEAE 144

Query: 93 AWRKYYIVPEDCKN 106
A K + E +
Sbjct: 145 AAEKAFQEAEQRRK 158


47  Shewmr7_3292  Shewmr7_3310  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias     %GCBias  Product
Shewmr7_3292       0       17    3.001268  fumarylacetoacetate (FAA) hydrolase
Shewmr7_3293       0       16    2.351173  transcriptional regulator, TyrR
Shewmr7_3294      -1       16    0.919676  pterin-4-alpha-carbinolamine dehydratase
Shewmr7_3295       0       16   -1.649885  phenylalanine 4-monooxygenase
Shewmr7_3296       1       15   -2.858077  UDP-glucose pyrophosphorylase
Shewmr7_3297       1       16   -3.316406  UDP-galactose 4-epimerase
Shewmr7_3298       2       16   -3.507883  ferredoxin-type protein NapF
Shewmr7_3299       2       14   -2.227221  hypothetical protein
Shewmr7_3300       1       10   -0.940097  LysR family transcriptional regulator
Shewmr7_3302      -2       10    0.417612  decaheme cytochrome c
Shewmr7_3303      -3       10    1.390572  hypothetical protein
Shewmr7_3304      -2       10    1.526159  hypothetical protein
Shewmr7_3305      -2       14    2.129723  hypothetical protein
Shewmr7_3306      -2       14    1.965418  hypothetical protein
Shewmr7_3307      -2       14    1.959934  fructokinase
Shewmr7_3308       1       19    3.107415  3'(2'),5'-bisphosphate nucleotidase
Shewmr7_3309       1       21    2.843114  dTDP-4-dehydrorhamnose reductase
Shewmr7_3310       2       22    2.857322  putative hydrolase
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_3292  INVEPROTEIN   27     0.031    Salmonella/Shigella invasion protein E (InvE) signat...
		>INVEPROTEIN#Salmonella/Shigella invasion protein E (InvE) signature.

Length = 372

Score = 27.0 bits (59), Expect = 0.031
Identities = 22/81 (27%), Positives = 33/81 (40%), Gaps = 13/81 (16%)

Query: 24 GEEYMNAKQLGHFKTILEAWRNQLREEVDRTLSHMQDEAANFPDPVDRAAQEEEFSLELR 83
E AKQ+ ++ L + + + S FPDP D E LR
Sbjct: 88 DEALPKAKQILKLISVH---GGALEDFLRQARSL-------FPDPSDLVLVLREL---LR 134

Query: 84 ARDRERKLIKKIEKTLQKIEE 104
+D E + KK+E L+ +EE
Sbjct: 135 RKDLEEIVRKKLESLLKHVEE 155


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_3293  PF04605       32     0.001    Virulence-associated protein D (VapD)
		>PF04605#Virulence-associated protein D (VapD)

Length = 125

Score = 31.8 bits (72), Expect = 0.001
Identities = 10/52 (19%), Positives = 17/52 (32%), Gaps = 2/52 (3%)

Query: 65 AADDILRTLEAYGFEWDDEVLYQSDRT--EAYQAKLDELLAKDNAYFCQCSR 114
I + + GFE Y S E ++ L K + +C +
Sbjct: 27 PYSLIKKFMLENGFEHRQYSGYTSKEPINERRVIRIVNKLTKKFTWLGECVK 78


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_3297  LPSBIOSNTHSS  30     0.008    Lipopolysaccharide core biosynthesis protein signat...
		>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein signature.

Length = 166

Score = 29.8 bits (67), Expect = 0.008
Identities = 7/26 (26%), Positives = 14/26 (53%)

Query: 34 HQGHITLVKEAAKKCDHVVVSIFVNP 59
GH+ +++ + D V V++ NP
Sbjct: 13 TFGHLDIIERGCRLFDQVYVAVLRNP 38


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_3305  OUTRMMBRANEA  28     0.041    Outer membrane protein A signature.
		>OUTRMMBRANEA#Outer membrane protein A signature.

Length = 346

Score = 28.4 bits (63), Expect = 0.041
Identities = 19/107 (17%), Positives = 38/107 (35%), Gaps = 18/107 (16%)

Query: 213 ISFNKGCYMGQETVARMKYRGGNKRALYILHGTT-SLNINLETGIEIELEDGYRKGGQII 271
++ G MG + + RM Y+G + Y G + + ++ D Y + G
Sbjct: 66 VNPYVGFEMGYDWLGRMPYKGSVENGAYKAQGVQLTAKLGYPITDDL---DIYTRLG--- 119

Query: 272 EFVQRGNQVLLTAVLANDTQNDAKLRFADDEQSSLRIQALPYSLEDE 318
V DT+++ + D S + + Y++ E
Sbjct: 120 -----------GMVWRADTKSNVYGKNHDTGVSPVFAGGVEYAITPE 155


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_3306  HTHFIS        90     4e-22    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 90.3 bits (224), Expect = 4e-22
Identities = 39/159 (24%), Positives = 65/159 (40%), Gaps = 6/159 (3%)

Query: 1 MDKATILVVDDTPENIDILVGILG-EDYKVKVAIDGPRALALVAKTLPDLILLDVMMPGM 59
M ATILV DD +L L Y V++ + +A DL++ DV+MP
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 60 NGYEVCKLLKQEPLTCHIPVIFVTALSEVADETQGFELGAVDYITKPVSAPVVKARVRTH 119
N +++ +K+ +PV+ ++A + + E GA DY+ KP + +
Sbjct: 61 NAFDLLPRIKKARP--DLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118

Query: 120 LALYDQKRLLEQQVKERTQEL--EETRF-EIIRRLGRAA 155
LA ++ + + L EI R L R
Sbjct: 119 LAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLM 157


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_3307  HTHFIS        78     9e-17    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 78.3 bits (193), Expect = 9e-17
Identities = 34/139 (24%), Positives = 61/139 (43%), Gaps = 5/139 (3%)

Query: 1283 SILVADDNATARDIMRTTLESMGFRVDTVRSGEEAVTRCSQQEYAVALIDWKMPNLDGIE 1342
+ILVADD+A R ++ L G+ V + + + + + D MP+ + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 1343 TAKQIKQLAKNAPRILMVSAHATQEFLSQIEAL--GLAGYISKPISASRLLDGIMNSLGR 1400
+IK+ + P ++M SA T + I+A G Y+ KP + L+ I +L
Sbjct: 65 LLPRIKKARPDLPVLVM-SAQNTFM--TAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 1401 AGVLPVRRNSESIDPKLLL 1419
P + +S D L+
Sbjct: 122 PKRRPSKLEDDSQDGMPLV 140



Score = 69.5 bits (170), Expect = 6e-14
Identities = 26/103 (25%), Positives = 44/103 (42%), Gaps = 2/103 (1%)

Query: 1425 RILLVEDNEMNLEVATEFLEQVGIILSIATNGQIALDKLAQQSFDLVLMDCQMPVMDGYQ 1484
IL+ +D+ V + L + G + I +N +A DLV+ D MP + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 1485 ATQAIRKRPELAELPVIAMTANAMAGDKEMCLKAGMNDHIAKP 1527
I+K +LPV+ M+A + G D++ KP
Sbjct: 65 LLPRIKKA--RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKP 105


48  Shewmr7_3362  Shewmr7_3406  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias    %GCBias  Product
Shewmr7_3362       2       15  -0.786036  hypothetical protein
Shewmr7_3363       1       11   0.122635  4'-phosphopantetheinyl transferase
Shewmr7_3364       0       12   1.216424  transcriptional regulator
Shewmr7_3365      -1       12   1.997170  beta-ketoacyl synthase
Shewmr7_3366      -2       12   1.850126  omega-3 polyunsaturated fatty acid synthase
Shewmr7_3367      -3       13   1.941302  Beta-hydroxyacyl-(acyl-carrier-protein)
Shewmr7_3368      -3       13   1.801880  2-nitropropane dioxygenase, NPD
Shewmr7_3369      -1       14  -0.012374  hypothetical protein
Shewmr7_3370      -3       15  -3.245359  XRE family transcriptional regulator
Shewmr7_3371      -3       15  -3.644380  hypothetical protein
Shewmr7_3372      -2       13  -3.605021  hypothetical protein
Shewmr7_3373      -2       16  -4.460231  hypothetical protein
Shewmr7_3374      -2       14  -4.749212  hypothetical protein
Shewmr7_3375      -2       12  -2.276858  hypothetical protein
Shewmr7_3376      -2       17   0.746908  hypothetical protein
Shewmr7_3377       2       19   2.326316  sigma-54 dependent trancsriptional regulator
Shewmr7_3378       0       18   2.827094  hypothetical protein
Shewmr7_3379      -2       23   3.698195  transport-associated
Shewmr7_3380      -2       27   4.822165  hypothetical protein
Shewmr7_3381      -2       29   5.329756  hypothetical protein
Shewmr7_3382      -1       31   5.738693  hypothetical protein
Shewmr7_3383       2       35   6.779948  phosphoglycerate mutase
Shewmr7_3384       1       31   5.289365  hypothetical protein
Shewmr7_3385       1       28   3.474673  porin
Shewmr7_3386       0       21  -0.735014  hypothetical protein
Shewmr7_3387       1       24  -3.094651  major facilitator transporter
Shewmr7_3388       3       27  -4.947666  putative phosphoglycerate transport regulatory
Shewmr7_3389       5       32  -7.962682  hypothetical protein
Shewmr7_3390       5       31  -7.415677  periplasmic sensor signal transduction histidine
Shewmr7_3391       2       24  -5.164509  two component, sigma54 specific, Fis family
Shewmr7_3392       1       20  -4.768132  hypothetical protein
Shewmr7_3393      -1       13  -2.157277  hypothetical protein
Shewmr7_3394      -1       13  -1.714020  cyclic nucleotide-binding protein
Shewmr7_3395      -1       15   0.085134  fumarylacetoacetate (FAA) hydrolase
Shewmr7_3397      -1       14  -0.548602  succinylglutamate desuccinylase/aspartoacylase
Shewmr7_3398      -1       15  -1.217843  MarR family transcriptional regulator
Shewmr7_3399      -1       16  -2.386229  PhnA protein
Shewmr7_3400       2       24  -5.411891  hypothetical protein
Shewmr7_3401       1       16  -0.330421  TetR family transcriptional regulator
Shewmr7_3402       0       15   2.020004  glutathione S-transferase domain-containing
Shewmr7_3403      -1       16   2.739260  glutathione S-transferase domain-containing
Shewmr7_3404      -1       16   2.477105  Fmu (Sun) domain-containing protein
Shewmr7_3405      -1       16   3.183595  hypothetical protein
Shewmr7_3406      -1       16   3.394639  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_3364  PF06580       42     4e-06    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 42.2 bits (99), Expect = 4e-06
Identities = 27/148 (18%), Positives = 56/148 (37%), Gaps = 19/148 (12%)

Query: 405 NQLTEINEGVSTAYVQLRELL----STFRLTIKEPNLKN-AMEAMLEQLRANTDI----- 454
N L I + + RE+L R +++ N + ++ L + + +
Sbjct: 177 NALNNIRALILEDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQF 236

Query: 455 --KIHLDYKLSPQWLEAKQHIHILQITREATLNAIKHANASR----VIIRCYKDDNGMVN 508
++ + +++P ++ + ++Q E N IKH A I+ DNG V
Sbjct: 237 EDRLQFENQINPAIMDVQVPPMLVQTLVE---NGIKHGIAQLPQGGKILLKGTKDNGTVT 293

Query: 509 ISVSDNGIGIGYLKERDQHFGIGIMHER 536
+ V + G + G+ + ER
Sbjct: 294 LEVENTGSLALKNTKESTGTGLQNVRER 321


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_3365  HTHFIS        64     3e-14    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 63.7 bits (155), Expect = 3e-14
Identities = 27/159 (16%), Positives = 62/159 (38%), Gaps = 9/159 (5%)

Query: 6 SVLVVDDHPLLRKGICQLIASDPDFSLFGEVGGGLDALSAVATDEPDIVLLDLNMKGMTG 65
++LV DD +R + Q S + + +A + D+V+ D+ M
Sbjct: 5 TILVADDDAAIRTVLNQ-ALSRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 66 LDTLNAMRQEGVTSRIVILTVSDAKQDVIRLLRAGADGYLLKDTEPDLLLDKLKNAMLGH 125
D L +++ +++++ + I+ GA YL K + L+ + A+
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL--- 119

Query: 126 RVISDEVEEYLYELKNAADEQEWISSLTPRELQILQQLA 164
E + +L++ + + + + +I + LA
Sbjct: 120 ----AEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLA 154


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_3369  CARBMTKINASE  31     0.009    Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 30.9 bits (70), Expect = 0.009
Identities = 18/81 (22%), Positives = 27/81 (33%), Gaps = 5/81 (6%)

Query: 202 DYSAALLAEALKASAVEIWTDVAGIYTTDPRLAPNAHPIAEISFNEAAEMATFGAKVLHP 261
D + LAE + A I TDV G + E+ E + G
Sbjct: 216 DLAGEKLAEEVNADIFMILTDVNGAALY--YGTEKEQWLREVKVEELRKYYEEGH--FKA 271

Query: 262 ATILPAVRQQIQVFVGSSKEP 282
++ P V I+ F+ E
Sbjct: 272 GSMGPKVLAAIR-FIEWGGER 291


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_3371  HTHFIS        83     2e-20    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 82.6 bits (204), Expect = 2e-20
Identities = 31/131 (23%), Positives = 61/131 (46%), Gaps = 1/131 (0%)

Query: 1 MQNPHILIVEDEAVTRNTLRSIFEAEGYVVTEANDGAEMHKAMQENKINLVVMDINLPGK 60
M IL+ +D+A R L GY V ++ A + + + +LVV D+ +P +
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 NGLLLARELREIN-NIGLIFLTGRDNEVDKILGLEIGADDYITKPFNPRELTIRARNLLT 119
N L +++ ++ ++ ++ ++ + I E GA DY+ KPF+ EL L
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 120 RVNSAGNEVEE 130
+++E+
Sbjct: 121 EPKRRPSKLED 131


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_3374  CARBMTKINASE  29     0.027    Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 29.0 bits (65), Expect = 0.027
Identities = 7/40 (17%), Positives = 18/40 (45%), Gaps = 2/40 (5%)

Query: 218 SQSMSEITPAQTTQIKEPVTSFVVNKVVEAKNPEIAQPTK 257
Q++ + +++ V + + +V+ +P PTK
Sbjct: 94 QQALKNELRKR--GMEKKVVTIITQTIVDKNDPAFQNPTK 131


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_3382  GPOSANCHOR    53     5e-09    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 53.1 bits (127), Expect = 5e-09
Identities = 53/316 (16%), Positives = 101/316 (31%), Gaps = 10/316 (3%)

Query: 598 EYAASEQELRIRLSKAEEALQSAQELQVEAESQLVAINGELDKLSRELTFARTAYKNSRD 657
EL LS A+E L+ + E S++ + L + L A
Sbjct: 82 ALKDHNDELTEELSNAKEKLRKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSA 141

Query: 658 DLRRLFDEKRSEQDKINKALAERKAFAQQRLTQLDGELKQLKHQHQLWLEEQKEQALEAR 717
++ L EK + KA K + + E ++ LE
Sbjct: 142 KIKTLEAEK---AALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKA 198

Query: 718 MEKQAYWQEVIGALDNQLGQIKATIDARRESAKAEQKACETWYKNELKSRGVDEDNILKL 777
+E + A L KA + AR+ + + + + E L
Sbjct: 199 LEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAAL 258

Query: 778 KQQIRELETKISRAEQRRSEVLRFDDWY-----QHTWLLRKPKLQTQLSDVKR-AASEID 831
+ + ELE + A + + Q+Q+ + R +
Sbjct: 259 EARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDL 318

Query: 832 QQLKAKTQEVKTRRQQLETERKASDAAQIEASENLTKLRAVMRKLAELKLPANNEEAQGS 891
+ ++++ Q+LE + K S+A++ +L R ++L E + E+ + S
Sbjct: 319 DASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQL-EAEHQKLEEQNKIS 377

Query: 892 LGERLRQGEDLLLKRD 907
R DL R+
Sbjct: 378 EASRQSLRRDLDASRE 393



Score = 30.8 bits (69), Expect = 0.036
Identities = 49/346 (14%), Positives = 112/346 (32%), Gaps = 27/346 (7%)

Query: 360 WRADMENLSERHKLQTEKHQDIEAAYNARRSKIGEQLNRELEGLHADQDKQRGARDKQRE 419
+ + + K+ D+ A + N EL ++ ++ DK
Sbjct: 55 VQERADKFEIENNTLKLKNSDLSFNNKAL-----KDHNDELTEELSNAKEKLRKNDKSLS 109

Query: 420 VARADLDALEAQWRSQMDAGKASFSEQEYQFKLNAAELKLRVDGVTYTEEEKLSLAIFDE 479
+ + LEA + D KA + +A L + + ++
Sbjct: 110 EKASKIQELEA---RKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADL----EK 162

Query: 480 RIHRADEEQESCNAKVERLTGEERKLRAKRDQANEALRIASLRVNERQTALDELHHMLFP 539
+ A + +AK++ L E+ L A++ + +AL A + L
Sbjct: 163 ALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAA 222

Query: 540 QSHTL--LEFLRKEAQGWEQSLGKVIAPELLHRTDLHPSVTGTSDTLFGVHLDLKAIDVP 597
+ LE + A + + I + L +
Sbjct: 223 LAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKA------------LE 270

Query: 598 EYAASEQELRIRLSKAEEALQSAQELQVEAESQLVAINGELDKLSRELTFARTAYKNSRD 657
++ E + + + + E Q +N L R+L +R A K
Sbjct: 271 GAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEA 330

Query: 658 DLRRLFDEKRSEQDKINKALAERKAFAQQRLTQLDGELKQLKHQHQ 703
+ ++L ++ + + ++L +++ QL+ E ++L+ Q++
Sbjct: 331 EHQKLEEQNKI-SEASRQSLRRDLDASREAKKQLEAEHQKLEEQNK 375


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_3386  TCRTETB       33     0.002    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 32.9 bits (75), Expect = 0.002
Identities = 38/175 (21%), Positives = 72/175 (41%), Gaps = 17/175 (9%)

Query: 79 GVLIARIGYLRGIIFGLCTMATGCLLFYPASSMEQYALFLLALFVLASGITILQVSANPF 138
G L ++G R ++FG+ G ++ + S ++L ++A F+ +G
Sbjct: 70 GKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSF--FSLLIMARFIQGAGAAAFPALVMVV 127

Query: 139 VARLGPERTAASRLNLAQALNSLGHTLGPLFGSLLIFGAAAGTHEAVQLPYLLLAAVIGI 198
VAR P+ L ++ ++G +GP G ++ + YLLL +I I
Sbjct: 128 VARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIA--------HYIHWSYLLLIPMITI 179

Query: 199 IAVGFIILGGKVKQTDMGVDHRHSGSLLSHKRLLLGALAIFLYVGAEVSIGSFLV 253
I V F++ K+ + + R G +L+ +F + SFL+
Sbjct: 180 ITVPFLM---KL----LKKEVRIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLI 227



Score = 29.1 bits (65), Expect = 0.033
Identities = 15/102 (14%), Positives = 40/102 (39%), Gaps = 5/102 (4%)

Query: 75 SPLAGVLIARIGYLRGIIFGLCTMATGCLL--FYPASSMEQYALFLLALFVLASGITILQ 132
+ G+L+ R G L + G+ ++ L F ++ + ++ + G++ +
Sbjct: 310 GYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVL---GGLSFTK 366

Query: 133 VSANPFVARLGPERTAASRLNLAQALNSLGHTLGPLFGSLLI 174
+ V+ ++ A + ++L + L G L+
Sbjct: 367 TVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLL 408


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_3391  PF00577       781    0.0      Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 781 bits (2019), Expect = 0.0
Identities = 308/854 (36%), Positives = 480/854 (56%), Gaps = 38/854 (4%)

Query: 22 ADDYFNPEFIEGGDGKSKSVDLSHFENIDGQIPGRYFVQIFVNGERKESTEIVFFHSQTS 81
A+ YFNP F+ DLS FEN PG Y V I++N + ++ F T
Sbjct: 45 AELYFNPRFLADDPQAV--ADLSRFENGQELPPGTYRVDIYLNNGYMATRDVTF---NTG 99

Query: 82 SAKGDLTPCLSLSQLKSWGVNVPKFFEESVYKAE-CVDLT-RIDQALSVFKMNDMKLVLS 139
++ + PCL+ +QL S G+N ++ + CV LT I A + + +L L+
Sbjct: 100 DSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVPLTSMIHDATAQLDVGQQRLNLT 159

Query: 140 FPQSAMTNVARGTVSPNRWDDGINSFLLNYDLTANSRWQNQTGQRTDNYYVNLRPGINFD 199
PQ+ M+N ARG + P WD GIN+ LLNY+ + NS QN+ G + Y+NL+ G+N
Sbjct: 160 IPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSV-QNRIGGNSHYAYLNLQSGLNIG 218

Query: 200 AWRIRSNITWSKSVTNWSSGTDDQNDAEFDIIYSYASRSFAGLKSRLTVGDAYTSADIFN 259
AWR+R N TWS + ++ SSG+ ++ + I ++ R L+SRLT+GD YT DIF+
Sbjct: 219 AWRLRDNTTWSYNSSDSSSGSKNK----WQHINTWLERDIIPLRSRLTLGDGYTQGDIFD 274

Query: 260 SVSFRGIQLESDEDMLPYSLRGYAPVVRGIARTNAEVQIHQNGRKIYSTFVYPGNFEISD 319
++FRG QL SD++MLP S RG+APV+ GIAR A+V I QNG IY++ V PG F I+D
Sbjct: 275 GINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDIYNSTVPPGPFTIND 334

Query: 320 LFATAGGGDLTVTVVESDGNTQRYEVPFASLPVLRREGSLKYSATSGIFRNSDASVNDVA 379
++A GDL VT+ E+DG+TQ + VP++S+P+L+REG +YS T+G +R+ +A
Sbjct: 335 IYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSITAGEYRSGNAQQEKPR 394

Query: 380 FSQGTVSYGLPKGLTLYGGAQFSSKYRAVAAGLGTNLGKLGAISFDVTRSWSDFIERVSE 439
F Q T+ +GLP G T+YGG Q + +YRA G+G N+G LGA+S D+T++ S +
Sbjct: 395 FFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGALSVDMTQANSTLPDDSQH 454

Query: 440 SGNSYKLRYSKYFSESGTNFSMAGYQYSTKGYWVLSDVLNQTSYY------DELAQIRFD 493
G S + Y+K +ESGTN + GY+YST GY+ +D D + Q++
Sbjct: 455 DGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGYNIETQDGVIQVKPK 514

Query: 494 IPTTNNLYTKPKQRFELVLTQGAGSWGSLSVAAAFEEHRESNRNIQSLNTSYNNSFGSLS 553
NL + + +L +TQ G +L ++ + + + ++ + N +F ++
Sbjct: 515 FTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQTYWGTSNVDEQFQAGLNTAFEDIN 574

Query: 554 YGLGFSYSSRTALSFNGITSVQDKDDMIFSFNISAPIEELFG---VSQPVYSSMASSIK- 609
+ L +S + + Q D + + N++ P SQ ++S + S+
Sbjct: 575 WTLSYSLTK---------NAWQKGRDQMLALNVNIPFSHWLRSDSKSQWRHASASYSMSH 625

Query: 610 -SGGSVRSNAIINGSMLEGRSLNWGLYTSY---DDQSHDYDAGVNLDLKTRYGEFSSGFV 665
G + + A + G++LE +L++ + T Y D + L+ + YG + G+
Sbjct: 626 DLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYGNANIGYS 685

Query: 666 YDSGGRRFNYGARGSLLAHSNGATFGQQMGETVALVAVPDVEDIAIENQIGIKTDANGYA 725
+ ++ YG G +LAH+NG T GQ + +TV LV P +D +ENQ G++TD GYA
Sbjct: 686 HSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDAKVENQTGVRTDWRGYA 745

Query: 726 ILPYVTPYRNNFVSLDPRTFGHNVEVNGSAAKAVPTRGAVVLVEFNSRIGERALVTLQKS 785
+LPY T YR N V+LD T NV+++ + A VPTRGA+V EF +R+G + L+TL
Sbjct: 746 VLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKARVGIKLLMTLTH- 804

Query: 786 DGSYVPFGATVIQSDNNDKINIVSDFGQVYLAGLPKQGTLFVKWGRGDNEQCQFDYQIPD 845
+ +PFGA V S+++ IV+D GQVYL+G+P G + VKWG +N C +YQ+P
Sbjct: 805 NNKPLPFGAMV-TSESSQSSGIVADNGQVYLSGMPLAGKVQVKWGEEENAHCVANYQLPP 863

Query: 846 KKIEK-LNFLNAIC 858
+ ++ L L+A C
Sbjct: 864 ESQQQLLTQLSAEC 877


49  Shewmr7_3423  Shewmr7_3429  Y  N  N  Genomic Island
LocusTag      DNBias  CDNBias    %GCBias  Product
Shewmr7_3423       3       31  -2.233631  recombination associated protein
Shewmr7_3424       5       35  -2.961882  peptidase M61 domain-containing protein
Shewmr7_3425       5       31  -2.605361  hypothetical protein
Shewmr7_3426       5       28  -2.181181  TPR repeat-containing protein
Shewmr7_3427       4       29  -2.011518  hypothetical protein
Shewmr7_3428       3       22  -1.383266  diguanylate cyclase
Shewmr7_3429       2       18  -0.269723  hypothetical protein
50  Shewmr7_3501  Shewmr7_3521  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias    %GCBias  Product
Shewmr7_3501       2       14  -1.703988  thioesterase superfamily protein
Shewmr7_3503       0       13  -0.834112  isocitrate lyase
Shewmr7_3504      -2       15   0.232299  malate synthase
Shewmr7_3505      -3       18   0.987319  TonB-dependent receptor
Shewmr7_3506      -1       18   1.739705  hypothetical protein
Shewmr7_3507      -1       17   1.833355  potassium efflux system protein
Shewmr7_3508      -2       20   4.003871  diguanylate cyclase
Shewmr7_3509      -1       19   4.593918  hypothetical protein
Shewmr7_3510      -1       20   4.677939  uroporphyrin-III C/tetrapyrrole
Shewmr7_3511       0       20   5.035518  SmpA/OmlA domain-containing protein
Shewmr7_3512       0       21   5.323433  hypothetical protein
Shewmr7_3513      -1       19   5.311948  hypothetical protein
Shewmr7_3514       0       16   3.887124  cyclase/dehydrase
Shewmr7_3515       0       17   2.723142  SsrA-binding protein
Shewmr7_3516       2       17   2.233197  phage integrase family protein
Shewmr7_3517       1       18   2.635422  hypothetical protein
Shewmr7_3518       2       18   3.589361  phage transcriptional regulator, AlpA
Shewmr7_3519       1       19   3.771407  hypothetical protein
Shewmr7_3520       0       16   3.850768  hypothetical protein
Shewmr7_3521       1       16   3.937831  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_3505  HTHFIS        355    e-121    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 355 bits (913), Expect = e-121
Identities = 136/479 (28%), Positives = 229/479 (47%), Gaps = 48/479 (10%)

Query: 18 LLVLDPEQSLPE-CSEELKQAAWNCLKAVSAAEALVLLQKYDLRVAIAFIN--DTNQVLL 74
+LV D + ++ ++ L +A ++ +AA + D + + + D N L
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 75 ANEIAIIQTEYPSLHWIAVTD-STLEQHCSWLSAANFIDYYHRPFDWGRFADTLGHAWGM 133
+ I+ P L + ++ +T + DY +PFD +G A
Sbjct: 66 ---LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAY-DYLPKPFDLTELIGIIGRALAE 121

Query: 134 AQLTVAKKGKSAPTEVLTTIKGDNPLLQQLRQRLHKFSLSDDTVLLSGETGSGKGLCAKT 193
+ +K + + G + +Q++ + L + +D T++++GE+G+GK L A+
Sbjct: 122 PKRRPSKLEDDSQD--GMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARA 179

Query: 194 LHSLSKRRDGPFITVNCGALPIGLIHSALFGHEKGAFTDADKRYIGHLEQANGGTLFLDE 253
LH KRR+GPF+ +N A+P LI S LFGHEKGAFT A R G EQA GGTLFLDE
Sbjct: 180 LHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDE 239

Query: 254 IADLPLDLQVNLLHVLDDKQIMRIGGNVPIKVDCRLLFASHQDLEVAIDEGRFREDLYHR 313
I D+P+D Q LL VL + +GG PI+ D R++ A+++DL+ +I++G FREDLY+R
Sbjct: 240 IGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYR 299

Query: 314 INVLRLHVPSLRQYSDEVMLLAEDFLRE-NTDSNVQFHFSDDARCAMKHYNWPGNVRELR 372
+NV+ L +P LR ++++ L F+++ + F +A MK + WPGNVREL
Sbjct: 300 LNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLDVKRFDQEALELMKAHPWPGNVRELE 359

Query: 373 NRIRRAMVLSDDSKITAQLLGLDQLPSRAGQDLARCRV---------------------- 410
N +RR L IT +++ + + +
Sbjct: 360 NLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASF 419

Query: 411 ---------------EHEAEVLLKAISDHKHNISAAARSLNISRATFYRLLKKCQIKMP 454
E E ++L A++ + N AA L ++R T + +++ + +
Sbjct: 420 GDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVSVY 478


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_3507  SACTRNSFRASE  40     9e-07    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 39.9 bits (93), Expect = 9e-07
Identities = 20/72 (27%), Positives = 30/72 (41%), Gaps = 5/72 (6%)

Query: 81 ASIGRVVVSPAGRGKGLAMPLMQRAIDAALSTWPAAGIQIGAQDYLKS---FYQKLGFNA 137
A I + V+ R KG+ L+ +AI+ A G+ + QD S FY K F
Sbjct: 90 ALIEDIAVAKDYRKKGVGTALLHKAIEWAKEN-HFCGLMLETQDINISACHFYAKHHFII 148

Query: 138 CS-EMYLEDGIP 148
+ + L P
Sbjct: 149 GAVDTMLYSNFP 160


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_3508  TCRTETB       120    1e-31    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 120 bits (303), Expect = 1e-31
Identities = 89/421 (21%), Positives = 177/421 (42%), Gaps = 19/421 (4%)

Query: 18 SEYERGSRRSWIAVFGGLIGAFMAILDIQITNASMKEIQGSLGATLEEGSWISTAYLVAE 77
+ Y + + R + I +F ++L+ + N S+ +I +W++TA+++
Sbjct: 3 TSYSQSNLRHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTF 62

Query: 78 MIAIPLSGWLSTGLSVRRYLLWTTAAFIFASILCSISWN-LEAMIAFRALQGFFGGALIP 136
I + G LS L ++R LL+ F S++ + + +I R +QG A
Sbjct: 63 SIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPA 122

Query: 137 LAFRLILEFLPENKRAVGMALFGVTATFAPSIGPTLGGWLTEHFSWHYLFYINVQPGLLV 196
L ++ ++P+ R L G +GP +GG + + W YL I + ++
Sbjct: 123 LVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMI--TII 180

Query: 197 MAMLAYGLEKRPVVWDKLKNADLAGIVTMALGMGCLEVVLEEGNRKDWFGSDLIRNLAII 256
L K+ V + D+ GI+ M++G+ + F + + I+
Sbjct: 181 TVPFLMKLLKKEVRIKG--HFDIKGIILMSVGIVFFML----------FTTSYSISFLIV 228

Query: 257 AAVNLVLFVWIQLKRKDPLVNLRLLGKRDFVLSTIAYFLLGMALFGAIYLIPLYLSQVHD 316
+ ++ ++FV K DP V+ L F++ + ++ + G + ++P + VH
Sbjct: 229 SVLSFLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQ 288

Query: 317 YTPLEIGEVIMWMGFPQLLVL-PLVPRLMQRFDGRYLAAFGFFMFALSYYMNSQMTADYA 375
+ EIG VI++ G +++ + L+ R Y+ G ++S+ S +
Sbjct: 289 LSTAEIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFL--LET 346

Query: 376 GPQMIASQVVRALG-QPFILVPIGMLATAHLKPHENPSASTVLNVMRNLGGAFGIALVAT 434
+ +V LG F I + ++ LK E + ++LN L GIA+V
Sbjct: 347 TSWFMTIIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGG 406

Query: 435 L 435
L
Sbjct: 407 L 407


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_3509  RTXTOXIND     95     1e-23    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 94.9 bits (236), Expect = 1e-23
Identities = 38/290 (13%), Positives = 92/290 (31%), Gaps = 28/290 (9%)

Query: 71 LAQLEDNQFSAKVSQAEASLASAKADLQTLAAKVELQHALISQASAGVVAAQADKLRAEQ 130
+ + + S + ++ + ++ + A A + + +
Sbjct: 176 FQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKS 235

Query: 131 QLTRAKKLKVSNYSSQDDVDQLQAGFDSAAAGLDEAKA--------LLVAKERELAVFN- 181
+L L ++ V + + + A L K+ +L AKE V
Sbjct: 236 RLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQL 295

Query: 182 ------AQLNQAGSVVEQSNAALELAKIQLNDTRVTAPFSGVIGKRGAM-VGQYVQPGQA 234
+L Q + L + + + + AP S + + G V +
Sbjct: 296 FKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAET 355

Query: 235 LYSLVPDGAV-WITANFKETQIQHMQPGQSVQVSLDAFPDKTF---TGVIDSLSPASGAK 290
L +VP+ +TA + I + GQ+ + ++AFP + G + +++ +
Sbjct: 356 LMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDA--- 412

Query: 291 FSLLPAENATGNFTKIVQRIPVRIRLDLSEEEARVVPGLSAVVKVDTASH 340
+ G ++ I + + G++ ++ T
Sbjct: 413 ----IEDQRLGLVFNVIISIEENCLST-GNKNIPLSSGMAVTAEIKTGMR 457



Score = 58.3 bits (141), Expect = 1e-11
Identities = 24/128 (18%), Positives = 48/128 (37%), Gaps = 2/128 (1%)

Query: 59 VTDNQHVRKGELLAQLEDNQFSAKVSQAEASLASAKADLQTLAAKVELQHALISQASAGV 118
V + + VRKG++L +L A + ++SL A+ + L +
Sbjct: 112 VKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSR-SIELNKLPELKL 170

Query: 119 VAAQADKLRAEQQLTRAKKLKVSNYSS-QDDVDQLQAGFDSAAAGLDEAKALLVAKEREL 177
+ +E+++ R L +S+ Q+ Q + D A A + E
Sbjct: 171 PDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLS 230

Query: 178 AVFNAQLN 185
V ++L+
Sbjct: 231 RVEKSRLD 238


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_3511  MECHCHANNEL   174    1e-59    Bacterial mechano-sensitive ion channel signature.
		>MECHCHANNEL#Bacterial mechano-sensitive ion channel signature.

Length = 136

Score = 174 bits (443), Expect = 1e-59
Identities = 89/136 (65%), Positives = 111/136 (81%), Gaps = 1/136 (0%)

Query: 1 MSLIKEFKAFASRGNVIDMAVGIIIGAAFGKIVSSFVADIIMPPIGIILGGVNFSDLSIV 60
MS+IKEF+ FA RGNV+D+AVG+IIGAAFGKIVSS VADIIMPP+G+++GG++F ++
Sbjct: 1 MSIIKEFREFAMRGNVVDLAVGVIIGAAFGKIVSSLVADIIMPPLGLLIGGIDFKQFAVT 60

Query: 61 LQAAQGDAPAVVIAYGKFIQTVIDFTIIAFAIFMGLKAINSLKRKQEEAPKAPPAPTKDQ 120
L+ AQGD PAVV+ YG FIQ V DF I+AFAIFM +K IN L RK+EE P A PAPTK++
Sbjct: 61 LRDAQGDIPAVVMHYGVFIQNVFDFLIVAFAIFMAIKLINKLNRKKEE-PAAAPAPTKEE 119

Query: 121 ELLSEIRDLLKAQQEK 136
LL+EIRDLLK Q +
Sbjct: 120 VLLTEIRDLLKEQNNR 135


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_3513  ACRIFLAVINRP  653    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 653 bits (1687), Expect = 0.0
Identities = 224/1077 (20%), Positives = 440/1077 (40%), Gaps = 72/1077 (6%)

Query: 9 AIKNRLLVVLALLAVVAASVAMLPKLNLDAFPDVTNVQVTINTAAEGLAAEEVEKLISYP 68
I+ + + + ++ A + +L + +P + V+++ G A+ V+ ++
Sbjct: 5 FIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQV 64

Query: 69 VESAMYALPAVTEVRSLS-RTGLSIVTVVFAEGTDIYFARQQVFEQLQAAREMIPSGVGV 127
+E M + + + S S G +T+ F GTD A+ QV +LQ A ++P V
Sbjct: 65 IEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQQ 124

Query: 128 PEIGPNTSGLGQIYQYILRADPSSGINAAELRSLNDYLVKLIMMPVGGVTDVLSFGGEVR 187
I S + +D + G ++ VK + + GV DV FG +
Sbjct: 125 QGISVEKSSSSYLMVAGFVSD-NPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQ-Y 182

Query: 188 QYQVQVDPNKLRAYGLSMAQVSEALESNNRNAGGWFMDQGQEQLVVRGYGMLPAGEAGLA 247
++ +D + L Y L+ V L+ N + G L + +
Sbjct: 183 AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLG-GTPALPGQQLNASIIAQTRFK 241

Query: 248 AIAQIPLTEVR----GTPVRVGDIAKVDFGSEIRVGAVTMTRRDEAGQAQALGEVVAGVV 303
+ +R G+ VR+ D+A+V+ G E + G+ +
Sbjct: 242 NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARIN-----GK-----PAAGLGI 291

Query: 304 LKRMGANTKATIDDIDARINLIEQALPKGVSFEVFYDQADLVDKAVTTVRDALLMAFVFI 363
GAN T I A++ ++ P+G+ YD V ++ V L A + +
Sbjct: 292 KLATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLV 351

Query: 364 VVILALFLVNIRATLLVLLSIPVSIGLALMVMSYYGLSANLMSLGGLAVAIGMLVDGSVV 423
+++ LFL N+RATL+ +++PV + +++ +G S N +++ G+ +AIG+LVD ++V
Sbjct: 352 FLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIV 411

Query: 424 MVENIFKHLTQPDRRHLAQARSRVDGEVDPYHGDEDGSVRAVEADNSMAVRIMLAAKEVC 483
+VEN+ + + + + EA + ++
Sbjct: 412 VVENVERVM-------------------------MEDKLPPKEA-------TEKSMSQIQ 439

Query: 484 SPIFFATAIIIVVFAPLFALEGVEGKLFQPMAVSIILAMISALLVALIAVPALAVYLFK- 542
+ ++ VF P+ G G +++ +++I+ AM ++LVALI PAL L K
Sbjct: 440 GALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKP 499

Query: 543 ---------RGVVLKESVVLAPLDSAYRKLLSATLARPKLVMISALLMFAMSMVLLPRLG 593
G + + Y + L ++ L+ A +VL RL
Sbjct: 500 VSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRLP 559

Query: 594 TEFVPELEEGTINLRVTLAPTASLGTSLDVAPKLEAMLLEFPEVEYALSRIGAPELGGDP 653
+ F+PE ++G + L A+ + V ++ L+ + S
Sbjct: 560 SSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNEKANVE-SVFTVNGFSFSG 618

Query: 654 EPVSNIEVYIGLKPIEEWQSASSRLE--LQRLMEEKLSVFPGLLLTFSQPIATRVDELLS 711
+ + ++ LKP EE + E + R E + G ++ F+ P + EL +
Sbjct: 619 QAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDGFVIPFNMP---AIVELGT 675

Query: 712 GVKAQLA-IKLFGPDLDVLSEKGQVLTDLVAKIPGAV-DVSLEQVSGEAQLVVRPDRSQL 769
I G D L++ L + A+ P ++ V + AQ + D+ +
Sbjct: 676 ATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKA 735

Query: 770 ARYGISVDQVMSLVSQGIGGASAGQVIDGNARYDINLRLAAEYRSSPDVIKDLLLSGSNG 829
G+S+ + +S +GG ID + ++ A++R P+ + L + +NG
Sbjct: 736 QALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRMLPEDVDKLYVRSANG 795

Query: 830 ATVRLGEVASVEVEMAPPNIRRDDVQRRVVVQANVA-GRDMGSVVKDIYALVPQADLPAG 888
V + P + R + + +Q A G G + + L + LPAG
Sbjct: 796 EMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAMALMENLASK--LPAG 853

Query: 889 YTVIVGGQYENQQRAQQKLMLVVPISIALIALLLYFSFGAVKQVLLIMANVPLALIGGIV 948
G ++ + + +V IS ++ L L + + + +M VPL ++G ++
Sbjct: 854 IGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLL 913

Query: 949 ALFVSGTYLSVPSSIGFITLFGVAVLNGVVLVDSINQ-RRQSGESLYDSVYEGTVGRLRP 1007
A + V +G +T G++ N +++V+ + G+ + ++ RLRP
Sbjct: 914 AATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRP 973

Query: 1008 VLMTALTSALGLIPILVSSGVGSEIQKPLAVVIIGGLFSSTALTLLVLPTLYRWLYR 1064
+LMT+L LG++P+ +S+G GS Q + + ++GG+ S+T L + +P + + R
Sbjct: 974 ILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVIRR 1030



Score = 108 bits (270), Expect = 8e-26
Identities = 85/550 (15%), Positives = 186/550 (33%), Gaps = 68/550 (12%)

Query: 10 IKNRLLVVLALLAVVAASVAMLPKLNLDAFPDVTNVQVTIN-TAAEGLAAEEVEKLI--- 65
+ + +L +VA V + +L P+ G E +K++
Sbjct: 534 LGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQV 593

Query: 66 -SYPVESAMYALPAVTEVRSLSRTGLS----IVTVVFAEGTDIYFARQQVFEQLQAAREM 120
Y +++ + +V V S +G + + V + + A+
Sbjct: 594 TDYYLKNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKME 653

Query: 121 ---IPSGVGVPEIGPNTSGLGQIYQYILRADPSSGINAAELRSLNDYLVKLIMMPVGGVT 177
I G +P P LG + +G+ L + L+ + +
Sbjct: 654 LGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLV 713

Query: 178 DVLSFGGEVR-QYQVQVDPNKLRAYGLSMAQVSEALES--NNRNAGGWFMDQGQEQLVVR 234
V G E Q++++VD K +A G+S++ +++ + + + ++L V+
Sbjct: 714 SVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQ 773

Query: 235 GYGMLPAGEA-GLAAIAQIPLTEVRGTPVRVGDIAKVDFGSEIRVGAVTMTRRDEAGQAQ 293
A + ++ + G V + G+ + R + +
Sbjct: 774 ----ADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVY----GSPRLERYNGLPSME 825

Query: 294 ALGEVVAGVVLKRMGANTKATIDDIDARINLIEQALPKGVSFEVFYDQADLVDKAVTTVR 353
GE G D A + + LP G+ ++ + + +
Sbjct: 826 IQGEAAPGTSS-----------GDAMALMENLASKLPAGIGYD-WTGMSYQERLSGNQAP 873

Query: 354 DALLMAFVFIVVILALFLVNIRATLLVLLSIPVSIGLALMVMSYYGLSANLMSLGGLAVA 413
+ ++FV + + LA + + V+L +P+ I L+ + + ++ + GL
Sbjct: 874 ALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTT 933

Query: 414 IGMLVDGSVVMVENIFKHLTQPDRRHLAQARSRVDGEVDPYHGDEDGSVRAVEADNSMAV 473
IG+ ++++VE A+ +G+ VEA
Sbjct: 934 IGLSAKNAILIVE-------------FAKDLMEKEGK------------GVVEA------ 962

Query: 474 RIMLAAKEVCSPIFFATAIIIVVFAPLFALEGVEGKLFQPMAVSIILAMISALLVALIAV 533
++A + PI + I+ PL G + + ++ M+SA L+A+ V
Sbjct: 963 -TLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFV 1021

Query: 534 PALAVYLFKR 543
P V + +
Sbjct: 1022 PVFFVVIRRC 1031



Score = 98.4 bits (245), Expect = 8e-23
Identities = 67/347 (19%), Positives = 141/347 (40%), Gaps = 16/347 (4%)

Query: 735 VLTDLVAKIPGAVDVSLEQVSGEAQLVVRPDRSQLARYGISVDQVMSLVSQGIGGASAGQ 794
+ D ++++ G DV L + + + D L +Y ++ V++ + +AGQ
Sbjct: 161 NVKDTLSRLNGVGDVQL--FGAQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQ 218

Query: 795 VIDGNARYDINLRLA----AEYRSSPDVIKDLLLSGSNGATVRLGEVASVEVEMAPPNIR 850
+ A L + +++ + K L S+G+ VRL +VA VE+ N+
Sbjct: 219 LGGTPALPGQQLNASIIAQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVI 278

Query: 851 -RDDVQRRVVVQANVA-GRDMGSVVKDIYALVP--QADLPAGYTVIVGGQYENQQRAQQK 906
R + + + +A G + K I A + Q P G V+ Y+ Q
Sbjct: 279 ARINGKPAAGLGIKLATGANALDTAKAIKAKLAELQPFFPQGMKVLY--PYDTTPFVQLS 336

Query: 907 LMLVVP---ISIALIALLLYFSFGAVKQVLLIMANVPLALIGGIVALFVSGTYLSVPSSI 963
+ VV +I L+ L++Y ++ L+ VP+ L+G L G ++ +
Sbjct: 337 IHEVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMF 396

Query: 964 GFITLFGVAVLNGVVLVDSINQRRQS-GESLYDSVYEGTVGRLRPVLMTALTSALGLIPI 1022
G + G+ V + +V+V+++ + ++ + ++ A+ + IP+
Sbjct: 397 GMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPM 456

Query: 1023 LVSSGVGSEIQKPLAVVIIGGLFSSTALTLLVLPTLYRWLYRHDKSP 1069
G I + ++ I+ + S + L++ P L L + +
Sbjct: 457 AFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAE 503
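
Each alignment in this report opens with a "Score = ..., Expect = ..." line followed by an "Identities = ..." line. A minimal Python sketch of reading those two lines into numbers is given here. The names `Hsp`, `parse_hsp`, and `passes_evalue` are hypothetical, and treating the 0.05 E-value input parameter as an inclusive cutoff is an assumption, not something the report states.

```python
import re
from typing import NamedTuple

# Patterns for the two summary lines that precede every alignment block.
SCORE_RE = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*([\d.eE+-]+)"
)
COUNT_RE = re.compile(r"(Identities|Positives|Gaps)\s*=\s*(\d+)/(\d+)")


class Hsp(NamedTuple):
    bits: float       # bit score, e.g. 108.0
    raw_score: int    # raw score in parentheses, e.g. 270
    evalue: float     # Expect value, e.g. 8e-26
    identities: int
    positives: int
    gaps: int         # 0 when the report omits the Gaps field
    length: int       # alignment length (the denominator)


def parse_hsp(score_line: str, counts_line: str) -> Hsp:
    """Parse one 'Score = ...' line and its 'Identities = ...' line."""
    m = SCORE_RE.search(score_line)
    if m is None:
        raise ValueError(f"not a score line: {score_line!r}")
    counts = {
        name: (int(n), int(total))
        for name, n, total in COUNT_RE.findall(counts_line)
    }
    if "Identities" not in counts or "Positives" not in counts:
        raise ValueError(f"not an identities line: {counts_line!r}")
    identities, length = counts["Identities"]
    positives, _ = counts["Positives"]
    gaps, _ = counts.get("Gaps", (0, length))  # Gaps is absent for ungapped hits
    return Hsp(float(m.group(1)), int(m.group(2)), float(m.group(3)),
               identities, positives, gaps, length)


def passes_evalue(hsp: Hsp, threshold: float = 0.05) -> bool:
    """Assumed rule: keep a hit when its E-value is at or below the threshold."""
    return hsp.evalue <= threshold
```

For example, `parse_hsp("Score = 108 bits (270), Expect = 8e-26", "Identities = 85/550 (15%), Positives = 186/550 (33%), Gaps = 68/550 (12%)")` yields a record whose `identities / length` ratio can be recomputed at full precision instead of relying on the rounded percentage.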


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_3514  RTXTOXIND     56     9e-11    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 56.4 bits (136), Expect = 9e-11
Identities = 32/138 (23%), Positives = 57/138 (41%), Gaps = 9/138 (6%)

Query: 157 EVAKAQAEYINAAAEWNRVRR---MSESAVSVSRRMQAQVDAELKRAILEAIKMTDEQIR 213
V + + +Y+ A E + ES + ++ V K IL+ ++ T + I
Sbjct: 253 AVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIG 312

Query: 214 TLE----STPEAIGSYQLLAPIDGRVQQ-DIAMLGQVFTAGTPLMQLT-DESHLWVEAQL 267
L E + + AP+ +VQQ + G V T LM + ++ L V A +
Sbjct: 313 LLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALV 372

Query: 268 TPAQAANVNVGGPALVQV 285
+NVG A+++V
Sbjct: 373 QNKDIGFINVGQNAIIKV 390



Score = 42.1 bits (99), Expect = 3e-06
Identities = 28/149 (18%), Positives = 55/149 (36%), Gaps = 5/149 (3%)

Query: 101 SLSNLNLDTMATATLVVDRDRTATLAPQLDVRVQARHVVPGQEVKKGEPLLTLGG----A 156
L + + A L R+ + P + V+ V G+ V+KG+ LL L A
Sbjct: 76 VLGQVEIVATANGKLTHS-GRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEA 134

Query: 157 EVAKAQAEYINAAAEWNRVRRMSESAVSVSRRMQAQVDAELKRAILEAIKMTDEQIRTLE 216
+ K Q+ + A E R + +S S D + + E + + +
Sbjct: 135 DTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQ 194

Query: 217 STPEAIGSYQLLAPIDGRVQQDIAMLGQV 245
+ YQ +D + + + +L ++
Sbjct: 195 FSTWQNQKYQKELNLDKKRAERLTVLARI 223


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_3515  RTXTOXIND     31     0.010    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 31.0 bits (70), Expect = 0.010
Identities = 21/167 (12%), Positives = 52/167 (31%), Gaps = 17/167 (10%)

Query: 80 EVQAQIARQQQAELAIAAADRAVYNPEL-GLNYQNADTDTYSLGLSQTLDWGDKRGVATR 138
+ + QA L + EL L + Y +S+ + +
Sbjct: 135 DTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQ 194

Query: 139 LAQLEAQILLADISLERSQMLAERLLALAEQAQSNKALTFAEQQLRFTQAQLNIAEQRFA 198
+ + Q +++L++ + AE+ + E R +++L+
Sbjct: 195 FSTWQNQKYQKELNLDKKR---------AERLTVLARINRYENLSRVEKSRLDDFSSLLH 245

Query: 199 AGDLS-----DVELQLLKL--ELASNTADYAMAEQAALVADGKVIEL 238
++ + E + ++ EL + E L A + +
Sbjct: 246 KQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLV 292


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_3521  RTXTOXIND     29     0.006    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 29.4 bits (66), Expect = 0.006
Identities = 9/23 (39%), Positives = 11/23 (47%)

Query: 126 GIVGAIWVKDGDDVAFDQPLFTL 148
IV I VK+G+ V L L
Sbjct: 105 SIVKEIIVKEGESVRKGDVLLKL 127


51  Shewmr7_3562  Shewmr7_3583  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr7_3562  1       24       3.082181   DEAD/DEAH box helicase domain-containing
Shewmr7_3563  1       21       2.824886   hypothetical protein
Shewmr7_3564  1       19       3.082990   redoxin domain-containing protein
Shewmr7_3565  1       19       3.712100   hypothetical protein
Shewmr7_3566  0       17       3.470727   hypothetical protein
Shewmr7_3567  0       16       2.789148   hypothetical protein
Shewmr7_3568  0       16       3.217690   hypothetical protein
Shewmr7_3569  -1      15       2.784936   hypothetical protein
Shewmr7_3570  0       14       0.754400   PfpI family intracellular peptidase
Shewmr7_3571  0       11       -1.057785  hypothetical protein
Shewmr7_3572  0       12       -0.627463  hypothetical protein
Shewmr7_3573  0       12       0.397246   hypothetical protein
Shewmr7_3574  1       12       -0.205627  hypothetical protein
Shewmr7_3575  2       16       -0.205661  carboxypeptidase Taq
Shewmr7_3576  1       15       0.876038   GCN5-related N-acetyltransferase
Shewmr7_3577  1       13       3.018798   hypothetical protein
Shewmr7_3578  0       14       3.062832   hypothetical protein
Shewmr7_3579  -1      15       3.081211   hypothetical protein
Shewmr7_3580  -1      18       3.821796   RDD domain-containing protein
Shewmr7_3581  -1      15       3.559813   permease YjgP/YjgQ family protein
Shewmr7_3582  0       15       3.488907   permease YjgP/YjgQ family protein
Shewmr7_3583  -1      17       3.148827   leucyl aminopeptidase
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_3565  FRAGILYSIN    25     0.025    Fragilysin metallopeptidase (M10C) enterotoxin signat...
		>FRAGILYSIN#Fragilysin metallopeptidase (M10C) enterotoxin signature.

Length = 405

Score = 25.4 bits (55), Expect = 0.025
Identities = 12/32 (37%), Positives = 18/32 (56%)

Query: 1 MKTPLIMLVLSASLLTSACAELACSARTDIDP 32
MK ++L+L + L +AC+ A S T ID
Sbjct: 9 MKNVKLLLMLGTAALLAACSNEADSLTTSIDA 40


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_3572  INFPOTNTIATR  70     4e-18    Macrophage infectivity potentiator signature.
		>INFPOTNTIATR#Macrophage infectivity potentiator signature.

Length = 233

Score = 70.4 bits (172), Expect = 4e-18
Identities = 39/96 (40%), Positives = 51/96 (53%), Gaps = 2/96 (2%)

Query: 12 GEGKEAVKGALITTQYRGFLQDGTQFDSSYDRGQAFQCVIGTGRVIKGWDQGLMGMKVGG 71
G G + K +T +Y G L DGT FDS+ G+ +VI GW + L M G
Sbjct: 136 GTGAKPGKSDTVTVEYTGTLIDGTVFDSTEKAGKP--ATFQVSQVIPGWTEALQLMPAGS 193

Query: 72 KRKLFVPAHLAYGERQIGAHIKPNSDLTFEIELLEV 107
++FVPA LAYG R +G I PN L F+I L+ V
Sbjct: 194 TWEVFVPADLAYGPRSVGGPIGPNETLIFKIHLISV 229


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_3577  TCRTETA       32     0.003    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 32.5 bits (74), Expect = 0.003
Identities = 53/286 (18%), Positives = 101/286 (35%), Gaps = 30/286 (10%)

Query: 15 LFVPVTGLSLFALASGYLMSLIPLSLSYFELSPDLAP---WLASIFYLGLLLGAPCIAPI 71
L V ++ ++L A+ G +M ++P L S D+ L +++ L AP + +
Sbjct: 7 LIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGAL 66

Query: 72 VARIGHSKAFILFINILLCSVVVMVLLPQGGIWL--ASRLVAGVAVAGIFVVVESWLLMA 129
R G + +L +++ +V ++ +W+ R+VAG+ A V +++
Sbjct: 67 SDRFG--RRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGA-TGAVAGAYIADI 123

Query: 130 DTQKQRAKRLGLYMTALYG-GTAIGQLAVDYLGTRGNLPYLVVIGLLAAASLPALLVKRG 188
+RA+ G +M+A +G G G + +G L +
Sbjct: 124 TDGDERARHFG-FMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLP 182

Query: 189 QPQSSEQHSIALSDLKNLSKPAVVGCLVSGLLLGPIYGLLPVYVSQDMGFAQQTGQFMAL 248
+ E+ + L L+ + + L+ V+ Q GQ A
Sbjct: 183 ESHKGERRPLRREALNPLAS------FRWARGMTVVAALMAVFFI-----MQLVGQVPAA 231

Query: 249 IIMGGMIVQPLVSYLSPRFQKSALMAAFCLIGAAALFLLTQKSLVG 294
+ V + RF A L L L Q + G
Sbjct: 232 L---------WVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITG 268


52  Shewmr7_3637  Shewmr7_3642  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr7_3637  5       25       2.933395   methylthioadenosine nucleosidase
Shewmr7_3638  6       26       2.743778   cobalamin biosynthesis protein CbiB
Shewmr7_3639  5       25       2.360823   hypothetical protein
Shewmr7_3640  5       24       2.104173   hypothetical protein
Shewmr7_3641  5       24       2.048030   hypothetical protein
Shewmr7_3642  6       24       2.033729   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_3642  CABNDNGRPT    86     1e-18    NodO calcium binding signature.
		>CABNDNGRPT#NodO calcium binding signature.

Length = 479

Score = 86.2 bits (213), Expect = 1e-18
Identities = 49/232 (21%), Positives = 75/232 (32%), Gaps = 20/232 (8%)

Query: 4489 GGGQSGIITNSSGHEVVASGANNKSYTNSSAQVVNGGDGNDHIETGKGDDVIYAGKTGSA 4548
G +G + + +A+ + GD + D A + A
Sbjct: 235 GADYNGHYGGAPMIDDIAAIQ----RLYGANMTTRTGDSVYGFNSNTDRDFYTATDSSKA 290

Query: 4549 NYGTDDQLELSVNTLLTHHIMTGNITGNDRMVDNDGLLLANDVSSQRADVVNGGSGNDQI 4608
+ + + N N + + + G +
Sbjct: 291 LIFSV--WDAGGTDTFDFSGYSNNQRINLNEGSFSDV-----GGLKGNVSIAHGVTIENA 343

Query: 4609 YGQSGSDILYGHSGNDYIDGGNHNDALRGGEGNDTLIGGLGDDVLRGDSGNDTFLWRYAD 4668
G SG+DIL G+S ++ + GG ND L GG G DTL GG G D SG D+ +
Sbjct: 344 IGGSGNDILVGNSADNILQGGAGNDVLYGGAGADTLYGGAGRDTFVYGSGQDSTV----- 398

Query: 4669 ADQGTDHIMDFNVRDDKLDLSDLLQGETANTLESYLNFSLDNGSTVIDIDAN 4720
D I DF DK+DLS F+ ++ DA
Sbjct: 399 --AAYDWIADFQKGIDKIDLSAF--RNEGQLSFVQDQFTGKGQEVMLQWDAA 446


53  Shewmr7_3657  Shewmr7_3662  Y  N  N  Genomic Island
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr7_3657  -1      12       3.306158   hypothetical protein
Shewmr7_3658  0       14       3.601203   hypothetical protein
Shewmr7_3659  0       13       4.538086   iron-sulfur cluster insertion protein ErpA
Shewmr7_3660  0       12       3.896257   hypothetical protein
Shewmr7_3661  -1      15       4.164254   hypothetical protein
Shewmr7_3662  0       15       3.164990   Cl- channel, voltage-gated family protein
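
The island summary rows in this report pair the Bias and Virulence flags with a prediction label in a consistent way. A minimal Python sketch of that mapping follows. It is inferred only from the rows shown in this report, the function name `classify_region` is hypothetical, and returning `None` for a region with neither flag set is an assumption (no such row appears here). The Insertion elements flag does not change the label in any row shown, so the sketch ignores it.

```python
from typing import Optional


def classify_region(bias: bool, virulence: bool) -> Optional[str]:
    """Map the Bias / Virulence flags of a region to the report's label.

    Inferred from the summary rows in this report, not from the tool's
    source code.
    """
    if bias and virulence:
        return "Pathogenicity Island (biased-composition)"
    if bias:
        return "Genomic Island"
    if virulence:
        return "Pathogenicity Island (unbiased-composition)"
    # Assumption: a region with neither signal is not reported at all.
    return None
```

Under this reading, region 53 (Y, N) maps to "Genomic Island" and region 59 (N, Y) to "Pathogenicity Island (unbiased-composition)", matching the labels printed in the report.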
54  Shewmr7_3676  Shewmr7_3695  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr7_3676  -1      23       3.023617   hypothetical protein
Shewmr7_3677  -1      22       3.928296   putative DNA-binding/iron metalloprotein/AP
Shewmr7_3678  -1      22       4.004376   30S ribosomal protein S21
Shewmr7_3679  0       23       4.192718   GatB/Yqey domain-containing protein
Shewmr7_3680  1       21       3.997651   DNA primase
Shewmr7_3681  1       19       4.128047   RNA polymerase, sigma 70 subunit, RpoD
Shewmr7_3682  1       17       3.512829   *methyl-accepting chemotaxis sensory transducer
Shewmr7_3683  2       20       3.546896   hypothetical protein
Shewmr7_3684  2       20       3.727057   amino acid/peptide transporter
Shewmr7_3685  0       19       3.811823   luciferase family protein
Shewmr7_3686  0       20       4.621122   hypothetical protein
Shewmr7_3687  0       19       3.246186   4-aminobutyrate aminotransferase
Shewmr7_3688  1       18       3.180762   succinate semialdehyde dehydrogenase
Shewmr7_3689  0       20       1.437352   FAD dependent oxidoreductase
Shewmr7_3690  0       17       -3.819945  binding-protein-dependent transport systems
Shewmr7_3691  0       18       -4.212120  binding-protein-dependent transport systems
Shewmr7_3692  2       24       -5.864692  hypothetical protein
Shewmr7_3693  3       22       -4.981673  spermidine/putrescine ABC transporter ATPase
Shewmr7_3694  1       21       -4.022742  extracellular solute-binding protein
Shewmr7_3695  1       20       -3.378238  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_3685  ACRIFLAVINRP  41     2e-05    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 41.4 bits (97), Expect = 2e-05
Identities = 48/228 (21%), Positives = 85/228 (37%), Gaps = 30/228 (13%)

Query: 622 ELAPPEQSTESETSSDNTNLGNDSHGAIVLLGG---IEDIAALKARFAEV----PQVQLI 674
++A E E+ N + I L G ++ A+KA+ AE+ PQ +
Sbjct: 264 DVARVELGGENYNVIARINGKPAAGLGIKLATGANALDTAKAIKAKLAELQPFFPQGMKV 323

Query: 675 DKVADISTVMGHYRLLTLKLLGLALVIALLLFSLSFGVKRAALVV--AVPALSALLTLAI 732
D + + +K L A+++ L+ L RA L+ AVP + L T AI
Sbjct: 324 LYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVP-VVLLGTFAI 382

Query: 733 LGLVGSPLSLFHALALILVFGIGIDYS---------------LFFASAAQHG-KAVMMAV 776
L G ++ ++L G+ +D + L A + + A+
Sbjct: 383 LAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGAL 442

Query: 777 FMSACSTLLAFGLLAFSQTQA---IHYFGLTLSLGIGFTFVLSPLILT 821
A F +AF F +T+ + + +++ LILT
Sbjct: 443 VGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVA-LILT 489



Score = 31.3 bits (71), Expect = 0.018
Identities = 22/117 (18%), Positives = 43/117 (36%), Gaps = 17/117 (14%)

Query: 694 LLGLA-LVIALLLFSLSFGVKRAALVVAVPALSALLTLAILGLVGSPLSLFHALALILVF 752
L+ ++ +V+ L L +L V+ V L + L L ++ + L+
Sbjct: 875 LVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTI 934

Query: 753 GIGIDYSLFFASAA-----QHGKAVMMAV-----------FMSACSTLLAFGLLAFS 793
G+ ++ A + GK V+ A M++ + +L LA S
Sbjct: 935 GLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAIS 991


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_3687  BONTOXILYSIN  31     0.006    Bontoxilysin signature.
		>BONTOXILYSIN#Bontoxilysin signature.

Length = 1196

Score = 30.6 bits (69), Expect = 0.006
Identities = 16/61 (26%), Positives = 29/61 (47%), Gaps = 11/61 (18%)

Query: 137 VYDGNTLSSEQSLLLGDEFKAEYLMAMMQLIYWPEQSIKSHLEGGELVTGLCDAIPCRQF 196
+YD N LS D + +L A++ L+ + I + + G +L++ + AIP
Sbjct: 63 IYDSNFLSQ-------DSERENFLQAIIILL----KRINNTISGKQLLSLISTAIPFPYG 111

Query: 197 Y 197
Y
Sbjct: 112 Y 112


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_3690  DHBDHDRGNASE  107    4e-30    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 107 bits (267), Expect = 4e-30
Identities = 71/248 (28%), Positives = 114/248 (45%), Gaps = 15/248 (6%)

Query: 5 VLVTGSSRGIGKAIALKLAAAGYDIALHYHSNQAAADASAAELSALGVNVSLLKFDVADR 64
+TG+++GIG+A+A LA+ G IA N + + L A + DV D
Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIA-AVDYNPEKLEKVVSSLKAEARHAEAFPADVRDS 69

Query: 65 AAVKAAIEADIEANGAYYGVILNAGINRDNAFPAMSETEWDSVIHTNLDGFYNVIHPCVM 124
AA+ G ++ AG+ R ++S+ EW++ N G +N V
Sbjct: 70 AAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRS-VS 128

Query: 125 PMVQGRKGGRIITLASVSGIAGNRGQVNYSASKAGLIGATKALSLELAKRKITVNCIAPG 184
+ R+ G I+T+ S Y++SKA + TK L LELA+ I N ++PG
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188

Query: 185 LIETDM----------VADIPKDMVEQL---VPMRRMGKPNEIAALAAFLMSDDAAYITR 231
ETDM + K +E +P++++ KP++IA FL+S A +IT
Sbjct: 189 STETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHITM 248

Query: 232 QVISVNGG 239
+ V+GG
Sbjct: 249 HNLCVDGG 256


55  Shewmr7_3777  Shewmr7_3792  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr7_3777  2       23       1.980232   hypothetical protein
Shewmr7_3778  2       20       1.016571   hypothetical protein
Shewmr7_3779  2       17       0.630943   glutamate--cysteine ligase
Shewmr7_3780  1       15       0.278478   peptidase M16 domain-containing protein
Shewmr7_3781  0       14       0.331837   hypothetical protein
Shewmr7_3782  0       12       0.183709   hypothetical protein
Shewmr7_3783  -1      10       0.232946   sodium:dicarboxylate symporter
Shewmr7_3784  0       12       0.724091   hypothetical protein
Shewmr7_3785  0       14       0.913837   hypothetical protein
Shewmr7_3786  1       22       1.233045   hypothetical protein
Shewmr7_3787  2       22       1.244586   sodium ion-translocating decarboxylase subunit
Shewmr7_3788  3       24       1.063601   oxaloacetate decarboxylase
Shewmr7_3789  3       28       1.092472   sodium pump decarboxylases subunit gamma
Shewmr7_3790  4       35       1.334576   CDP-diacylglycerol--serine
Shewmr7_3791  4       35       1.309468   hypothetical protein
Shewmr7_3792  4       33       1.120639   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_3781  HTHFIS        77     1e-18    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 76.8 bits (189), Expect = 1e-18
Identities = 31/129 (24%), Positives = 61/129 (47%)

Query: 2 RLLLIEDDTDLVARLIPALNKAGYTVEHANNGIDGAFLGEEEAFEAVILDLGLPGKPGLS 61
+L+ +DD + L AL++AGY V +N + V+ D+ +P +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 VLSQWRQKGLLMPVLILTARDAWHERVDGLKAGADDYLGKPFHVEELLARLEALIRRHFG 121
+L + ++ +PVL+++A++ + + + GA DYL KPF + EL+ + +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 122 RADNLLQHA 130
R L +
Sbjct: 125 RPSKLEDDS 133


56  Shewmr7_3842  Shewmr7_3849  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr7_3842  3       10       2.327226   hypothetical protein
Shewmr7_3843  3       10       2.360144   Dna-J like membrane chaperone protein
Shewmr7_3844  3       12       1.507964   nucleotidyl transferase
Shewmr7_3845  4       13       1.555355   aminoglycoside phosphotransferase
Shewmr7_3846  4       14       1.079688   organic solvent tolerance protein
Shewmr7_3847  6       16       0.170212   hypothetical protein
Shewmr7_3848  1       14       -2.226625  PpiC-type peptidyl-prolyl cis-trans isomerase
Shewmr7_3849  2       19       -3.448030  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_3847  IGASERPTASE   75     3e-16    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 75.5 bits (185), Expect = 3e-16
Identities = 46/270 (17%), Positives = 94/270 (34%), Gaps = 35/270 (12%)

Query: 17 DEVVEQTPVSTPSQTEQAEALAKQQAEEARLAAEKAAAEQALADKLAAEKAEAERIAVEQ 76
++ V+ T ++TP+ + EE +A A A +E E V +
Sbjct: 989 NQTVDTTNITTPNNIQADVPSVPSNNEE--IARVDEAPVPPPAPATPSETTET----VAE 1042

Query: 77 AAKAQAEAEALRIAEEQAARLAEQQAAEA---ARLAAEQAQAEQLAA-EQAEAERVAAEQ 132
+K +++ EQ A E R A++A++ A + E + +E
Sbjct: 1043 NSKQESKTVEKN----------EQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSET 1092

Query: 133 AAKAQAEA-EAQRVAEEQAARLAEQQAAEAARLAAE------QAQAEQLAAEQAEAERVA 185
E E V +E+ A++ ++ E ++ ++ Q++ Q AE E
Sbjct: 1093 KETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAE-PARENDP 1151

Query: 186 AEQAAKAQAEAEAEAEAQRVAEEQAA----LLAEQQAAEAARLAAEQAQAEQLAAEQAEA 241
+ Q++ A+ ++ A+E ++ + E E + A Q
Sbjct: 1152 TVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTV 1211

Query: 242 ERVAAEAQAERAAEQVQAEQPLEQQPEPQA 271
++ R V++ EP
Sbjct: 1212 NSESSNKPKNRHRRSVRSVP---HNVEPAT 1238



Score = 73.6 bits (180), Expect = 1e-15
Identities = 45/263 (17%), Positives = 94/263 (35%), Gaps = 14/263 (5%)

Query: 17 DEVVEQTPV-STPS-QTEQAEALAKQQAEEARLAAEKAAAEQALADKLAAEKAEAERIAV 74
DE P +TPS TE +KQ+++ E+ A E ++ A++A++ A
Sbjct: 1021 DEAPVPPPAPATPSETTETVAENSKQESKTVE-KNEQDATETTAQNREVAKEAKSNVKAN 1079

Query: 75 EQAAK-AQAEAEALRIAEEQAARLAEQQAAEAARLAAEQAQAE-QLAAE------QAEAE 126
Q + AQ+ +E + A + E A++ E+ Q ++ ++ Q+E
Sbjct: 1080 TQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETV 1139

Query: 127 RVAAEQAAKAQAEAEAQRVAEEQAARLAEQQAAEAARLAAEQAQAEQLAAEQAEAERVAA 186
+ AE A + + + +Q A+ EQ E +
Sbjct: 1140 QPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENP 1199

Query: 187 EQAAKAQAEAEAEAEAQRVAEEQA--ALLAEQQAAEAARLAAEQAQAEQLA-AEQAEAER 243
E A + +E+ + + ++ + E A ++ L
Sbjct: 1200 ENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTNA 1259

Query: 244 VAAEAQAERAAEQVQAEQPLEQQ 266
V ++A+A+ + + + Q
Sbjct: 1260 VLSDARAKAQFVALNVGKAVSQH 1282



Score = 65.9 bits (160), Expect = 3e-13
Identities = 43/222 (19%), Positives = 84/222 (37%), Gaps = 20/222 (9%)

Query: 74 VEQAAKAQAEAEALRIAEEQAARLAEQQAAEAARLAAEQAQAEQLAAEQAEAERVAAEQA 133
+ QA+ ++ E+ AR+ +A A ++ + AE ++ E E+
Sbjct: 997 ITTPNNIQADVPSVPSNNEEIARV--DEAPVPPPAPATPSETTETVAENSKQESKTVEKN 1054

Query: 134 AKAQAEAEAQ--RVAEEQAARL-AEQQAAEAARLAAEQAQAEQLAAEQAEAERVAAEQAA 190
+ E AQ VA+E + + A Q E A+ +E + + ++
Sbjct: 1055 EQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKE------------ 1102

Query: 191 KAQAEAEAEAEAQRVAEEQAALLAEQQAAEAARLAAEQAQAEQLAAEQAEAERVAAEAQA 250
A E E +A+ + ++ + Q + + + Q QAE A + + E Q+
Sbjct: 1103 TATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAE--PARENDPTVNIKEPQS 1160

Query: 251 ERAAEQVQAEQPLEQQPEPQAKPVKESFFARLKRGLMRTSEN 292
+ EQP ++ +PV ES ++ EN
Sbjct: 1161 QTNTTADT-EQPAKETSSNVEQPVTESTTVNTGNSVVENPEN 1201



Score = 57.4 bits (138), Expect = 1e-10
Identities = 31/167 (18%), Positives = 63/167 (37%), Gaps = 11/167 (6%)

Query: 120 AEQAEAERVAAEQAAKAQAEAEAQRVAEEQAARLAEQQAAEAARLAAEQAQAEQLAAEQA 179
E+ +A+ V +A A ++ + AE
Sbjct: 985 VEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAE-- 1042

Query: 180 EAERVAAEQAAKAQAEAEAEAEAQRVAEE-QAALLAEQQAAEAARLAAE----QAQAEQL 234
+++ + Q E A+ + VA+E ++ + A Q E A+ +E Q +
Sbjct: 1043 NSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKE 1102

Query: 235 AAEQAEAERVAAEAQAERAAEQVQAE-QPLEQQPE---PQAKPVKES 277
A + E+ E + + +V ++ P ++Q E PQA+P +E+
Sbjct: 1103 TATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPAREN 1149


57  Shewmr7_3864  Shewmr7_3894  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr7_3864  2       16       1.923425   transposase
Shewmr7_3865  1       15       1.228296   integrase catalytic subunit
Shewmr7_3866  2       17       1.347596   uracil-DNA glycosylase
Shewmr7_3867  2       18       1.202773   hypothetical protein
Shewmr7_3868  1       20       0.689785   hypothetical protein
Shewmr7_3869  2       17       0.871181   hypothetical protein
Shewmr7_3870  2       16       1.396611   lysine exporter protein LysE/YggA
Shewmr7_3871  1       16       1.223142   hypothetical protein
Shewmr7_3872  1       16       0.298414   peptidase S10, serine carboxypeptidase
Shewmr7_3873  2       17       -0.677629  hypothetical protein
Shewmr7_3874  1       17       -1.232620  thiol:disulfide interchange protein
Shewmr7_3875  -1      19       -2.481537  hypothetical protein
Shewmr7_3876  0       22       -3.425318  Fis family transcriptional regulator
Shewmr7_3877  -2      18       -1.663063  4Fe-4S ferredoxin iron-sulfur binding
Shewmr7_3878  -1      14       0.143233   hypothetical protein
Shewmr7_3879  -2      12       1.654866   methionine aminopeptidase, type I
Shewmr7_3880  -3      12       2.696867   AMP-dependent synthetase and ligase
Shewmr7_3881  -2      12       3.172877   lipid ABC transporter ATPase/inner membrane
Shewmr7_3882  -1      11       1.035924   hypothetical protein
Shewmr7_3883  1       14       -0.862220  hypothetical protein
Shewmr7_3884  2       14       -1.611823  FAD dependent oxidoreductase
Shewmr7_3885  2       19       -2.542604  hypothetical protein
Shewmr7_3886  3       29       -5.185526  hypothetical protein
Shewmr7_3887  3       31       -5.702436  extracellular solute-binding protein
Shewmr7_3888  3       30       -5.127044  UspA domain-containing protein
Shewmr7_3889  4       37       -8.627319  UspA domain-containing protein
Shewmr7_3890  5       35       -8.509776  hypothetical protein
Shewmr7_3891  6       35       -8.580930  aldehyde dehydrogenase
Shewmr7_3892  6       34       -9.127433  TetR family transcriptional regulator
Shewmr7_3893  3       19       -2.126252  curli production assembly/transport component
Shewmr7_3894  3       19       -2.055700  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_3864  PF05616       31     0.012    Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 30.9 bits (69), Expect = 0.012
Identities = 19/73 (26%), Positives = 33/73 (45%), Gaps = 7/73 (9%)

Query: 344 IAAVITYERNAWGNNTGDA--VQAKDVDAHKSGGTNSEPVATTPPPATTDAPKPATEPAA 401
+ V T+ R++ GN T D + D+ + N++P+ P + A PA PA
Sbjct: 289 VQVVATFGRDSQGNTTVDVQVIPRPDLTPGSAEAPNAQPL-----PEVSPAENPANNPAP 343

Query: 402 SVDPASLPTLSHD 414
+ +P + P D
Sbjct: 344 NENPGTRPNPEPD 356


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_3869  60KDINNERMP   29     0.029    60kDa inner membrane protein signature.
		>60KDINNERMP#60kDa inner membrane protein signature.

Length = 548

Score = 28.7 bits (64), Expect = 0.029
Identities = 16/82 (19%), Positives = 34/82 (41%), Gaps = 8/82 (9%)

Query: 18 FIRYPYVLLTKLI--GLPRYIWLRRAMLLLLILTVVVFVILVKLGLWQMDRAAEKTELLA 75
FI P L K I + + + ++I+T +V I+ L Q A+ L
Sbjct: 335 FISQPLFKLLKWIHSFVGNWGFS------IIIITFIVRGIMYPLTKAQYTSMAKMRMLQP 388

Query: 76 QMEARQSAAALNPEQLIAELAK 97
+++A + + +++ E+
Sbjct: 389 KIQAMRERLGDDKQRISQEMMA 410


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_3883  IGASERPTASE   34     3e-04    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 33.9 bits (77), Expect = 3e-04
Identities = 23/167 (13%), Positives = 56/167 (33%), Gaps = 8/167 (4%)

Query: 8 PQVPIATTNVATDSARVDNQL------KPVVIPPQAATKGHEERAFNPQNERTADQTQQQ 61
P P TT ++++ +++ Q E ++ N +T + Q
Sbjct: 1030 PATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSG 1089

Query: 62 ARLLEQNQQQVQDKQQQQQSSQQQSQQQQEKQAPIVAADRALPKTLKVPVRGPAALQRKD 121
+ E + ++ ++ + + + ++ ++ P V + + PK + P A ++
Sbjct: 1090 SETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVS-PKQEQSETVQPQAEPARE 1148

Query: 122 IRLKVGQNTPRPANTTAKTTTAAARQ-PMQGESPLFYQQVGQRIGQY 167
V P+ T T A++ E P+
Sbjct: 1149 NDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSV 1195


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_3894  HTHFIS        28     0.011    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 28.3 bits (63), Expect = 0.011
Identities = 8/54 (14%), Positives = 20/54 (37%)

Query: 34 LLYEIEKGFDVEIIQLLANELNWPINKFVQLIGFSRSTFNRRMKRHRLTSQESE 87
L + + +I K L+G +R+T ++++ ++ S
Sbjct: 428 LYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVSVYRSS 481


58  Shewmr7_4008  Shewmr7_4020  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr7_4008  2       12       -0.884476  hypothetical protein
Shewmr7_4009  1       14       -1.908463  acriflavin resistance protein
Shewmr7_4010  1       14       -0.963517  radical SAM domain-containing protein
Shewmr7_4011  1       17       -1.224696  putative PAS/PAC sensor protein
Shewmr7_4012  2       22       -1.071856  hypothetical protein
Shewmr7_4013  2       24       -1.548591  hypothetical protein
Shewmr7_4014  4       32       -1.644652  hypothetical protein
Shewmr7_4015  6       39       -0.486886  lysine decarboxylase transcriptional regulator,
Shewmr7_4016  9       47       -1.190609  cytochrome c
Shewmr7_4017  8       49       -1.160469  hypothetical protein
Shewmr7_4018  5       42       -1.872818  hypothetical protein
Shewmr7_4019  5       40       -2.362328  hypothetical protein
Shewmr7_4020  3       27       -3.453329  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_4008  TCRTETA       29     0.037    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 29.0 bits (65), Expect = 0.037
Identities = 21/100 (21%), Positives = 40/100 (40%), Gaps = 8/100 (8%)

Query: 21 PDADKADRFNPRNRIYVRAIDGL-WSSVRRRMGWVAMLFFLILPWIPWGDRQAVWFNLGE 79
P++ K +R P R + + W+ + + +FF++ + A+W GE
Sbjct: 182 PESHKGER-RPLRREALNPLASFRWARGMTVVAALMAVFFIM--QLVGQVPAALWVIFGE 238

Query: 80 QKFHVFGLTIWPQDLTLLAALFMIAAFALFFVTTYLGRVW 119
+FH TI LAA ++ + A +T +
Sbjct: 239 DRFHWDATTIG----ISLAAFGILHSLAQAMITGPVAARL 274


59  Shewmr7_0051  Shewmr7_0060  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr7_0051  -1      16       1.832774   phosphoglyceromutase
Shewmr7_0052  -2      14       0.199515   rhodanese domain-containing protein
Shewmr7_0053  -1      13       -0.156872  preprotein translocase subunit SecB
Shewmr7_0054  -2      14       -1.202177  NAD(P)H-dependent glycerol-3-phosphate
Shewmr7_0055  -1      15       0.344224   NAD(P)H-dependent glycerol-3-phosphate
Shewmr7_0056  0       13       0.247414   hypothetical protein
Shewmr7_0057  1       14       -0.184476  hypothetical protein
Shewmr7_0058  2       16       0.525391   tetratricopeptide domain-containing protein
Shewmr7_0059  0       10       0.045434   TrkH family potassium uptake protein
Shewmr7_0060  -1      12       0.020297   TrkA domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0051  HTHFIS        92     1e-23    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 91.8 bits (228), Expect = 1e-23
Identities = 36/112 (32%), Positives = 60/112 (53%), Gaps = 1/112 (0%)

Query: 4 KVLVVDDEPQIHTFMRISLEAEGFEYLSATSIATALKQYQSHQPHLIVLDLGLPDGDGIE 63
+LV DD+ I T + +L G++ ++ AT + + L+V D+ +PD + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 64 LLHGLRQQDK-TPVLVLTARDQEEEKIRLLEAGANDYLSKPFGIRELIVRIK 114
LL +++ PVLV++A++ I+ E GA DYL KPF + ELI I
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0054  PF06580       202    4e-62    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 202 bits (514), Expect = 4e-62
Identities = 59/205 (28%), Positives = 109/205 (53%), Gaps = 13/205 (6%)

Query: 351 EQLQEMTRKAEFTALQSKINPHFLFNALNAISSLIRIRPQQARELIANLADYLRYNLDKG 410
++ M ++A+ AL+++INPHF+FNALN I +LI P +ARE++ +L++ +RY+L
Sbjct: 152 WKMASMAQEAQLMALKAQINPHFMFNALNNIRALILEDPTKAREMLTSLSELMRYSLRYS 211

Query: 411 D-ELIDIQEEVKQVRDYVAIEQARYGDKLEVVFDVDD--VHFCVPCLLLQPLVENAILHG 467
+ + + +E+ V Y+ + ++ D+L+ ++ + VP +L+Q LVEN I HG
Sbjct: 212 NARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPMLVQTLVENGIKHG 271

Query: 468 IQPRSAPGRVTIEVKKLDAGIRVAVRDTGYGISQEVIDGVAAGRIESSSIGLTNVHQRVK 527
I G++ ++ K + + + V +TG ES+ GL NV +R++
Sbjct: 272 IAQLPQGGKILLKGTKDNGTVTLEVENTG--------SLALKNTKESTGTGLQNVRERLQ 323

Query: 528 LLYGE--GLQLKRLNPGTEVSFYLP 550
+LYG ++L +P
Sbjct: 324 MLYGTEAQIKLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0055  HTHFIS        68     3e-15    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 67.5 bits (165), Expect = 3e-15
Identities = 24/132 (18%), Positives = 59/132 (44%), Gaps = 6/132 (4%)

Query: 3 KAIIVEDEYLAREELE-YLVKSHSEIDIVASFEDGLEAFKYLQDHEVDVVFLDIQIPSID 61
++ +D+ R L L ++ ++ I ++ + + D+V D+ +P +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRW---IAAGDGDLVVTDVVMPDEN 61

Query: 62 GLLLAKNLHKSTHPPHVVFVTAHKEF--AVEAFELEAFDYILKPYNEPRIISLLQKIEQV 119
L + K+ V+ ++A F A++A E A+DY+ KP++ +I ++ +
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 120 GRQAPKPQHEAA 131
++ P + +
Sbjct: 122 PKRRPSKLEDDS 133


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0056  TCRTETA       34     7e-04    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 34.4 bits (79), Expect = 7e-04
Identities = 65/350 (18%), Positives = 127/350 (36%), Gaps = 42/350 (12%)

Query: 43 PVSQVAFVFGLL----SLSLAVASSMAGKLQERFGVRNVTLGAGVLLGLGFLLTAQASNL 98
+ V +G+L +L + + G L +RFG R V L + + + + A A L
Sbjct: 37 HSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFL 96

Query: 99 MMLYLCAGILVGFADGTGY--------LMTLSNCVKWFPERKGLISALAIGAYGLGSLGF 150
+LY+ I+ G TG + + F G +SA +G G +
Sbjct: 97 WVLYI-GRIVAGITGATGAVAGAYIADITDGDERARHF----GFMSA----CFGFGMVAG 147

Query: 151 KYINMLLLENTGLETTFQLWGLIAMALVLCGGMLMKDA------PAQSAASQQAESRDFT 204
+ L+ F + L G L+ ++ P + A S F
Sbjct: 148 PVLGGLMGG-FSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLAS--FR 204

Query: 205 LAEAMRKPQYWMLALMFLSACMSG----LYVIGVAKDIGEKMVDLPVLVAANAVAVIAMA 260
A M ++A+ F+ + L+VI + + +AA + +
Sbjct: 205 WARGMT-VVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGI----LH 259

Query: 261 NLCGRLVLGILSDKIPRIRVISLAQIITLVGMVLLLFIPLNANLFFVAVACVAFSFGGTI 320
+L ++ G ++ ++ R + L I G +LL F + F + +A G +
Sbjct: 260 SLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFA-TRGWMAFPIMVLLAS-GGIGM 317

Query: 321 TVYPSLVSDFFGLNNLTKNYGVIYLGFGIGSIIGS-IVASLFGGFIATFN 369
+++S + G + + SI+G + +++ I T+N
Sbjct: 318 PALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAASITTWN 367


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0060  TCRTETB       119    1e-31    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 119 bits (301), Expect = 1e-31
Identities = 77/400 (19%), Positives = 160/400 (40%), Gaps = 20/400 (5%)

Query: 30 FLAAVDQTLLATATPAIVEDLGGLR-QASWITIGYMLAMAASVPIYGWLGDNFGRAKILM 88
F + +++ +L + P I D +W+ +ML + +YG L D G ++L+
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 89 IALVVFALGSIVSA-SASTMDHMIAGRILQGLGGGGLMSLSQSLVGELVPIRQRARFQGY 147
+++ GS++ S +I R +QG G +L +V +P R + G
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143

Query: 148 FAAMFTLASVGGPVIGGFVVHAYSWHWLFWANIPLV-MLAVWRLNRLHKQSVKPVRQGRF 206
++ + GP IGG + H W +L IP++ ++ V L +L K+ V+ +G F
Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHYIHWSYLL--LIPMITIITVPFLMKLLKKEVR--IKGHF 199

Query: 207 DLLGVLLFPTIITALLYWLSVAGQDFAWLSTTSLGFMGFICVGALVLLWWERRRESPFLP 266
D+ G++L I + + + F +S S + + R+ PF+
Sbjct: 200 DIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFL----------IFVKHIRKVTDPFVD 249

Query: 267 LDLLANKAIYMPLFTAALFAACLFAMIFFLPIYLQVGLHTNPAKTGLLLM-PMTFGIVTG 325
L N + + + + + +P ++ + A+ G +++ P T ++
Sbjct: 250 PGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIF 309

Query: 326 STIAGRLLSRDVAPKWLPTFGMGLAFIGLLLIGLVPPNANLIGALGV-LVGIGLGTVMPS 384
I G L+ R P ++ G+ + L + + + + V GL
Sbjct: 310 GYIGGILVDR-RGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTV 368

Query: 385 VQLVVQSVSGKARLSQITAMVSLSRSMGAAIGTALFSLLL 424
+ +V S + ++++ + + G A+ LL
Sbjct: 369 ISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLL 408


60  Shewmr7_0150  Shewmr7_0157  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr7_0150  0       17       0.908721   chorismate lyase
Shewmr7_0151  0       19       1.120887   flagellar basal body-associated protein
Shewmr7_0152  -1      19       1.477506   flagellar basal body-associated protein
Shewmr7_0153  0       17       0.545085   putative SAM-dependent methyltransferase
Shewmr7_0154  0       16       0.142270   hypothetical protein
Shewmr7_0155  0       14       0.754711   hypothetical protein
Shewmr7_0156  -1      12       0.365231   hypothetical protein
Shewmr7_0157  -1      13       0.881253   molybdopterin biosynthesis protein MoeB
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0150  BCTERIALGSPC  182    2e-58    Bacterial general secretion pathway protein C signa...
		>BCTERIALGSPC#Bacterial general secretion pathway protein C signature.
Length = 272

Score = 182 bits (464), Expect = 2e-58
Identities = 72/295 (24%), Positives = 137/295 (46%), Gaps = 33/295 (11%)

Query: 8 IAKAAGIPHKPLSQIVFWFGFILSLLLAAQITWKLVPTTSSPTAWSPTAVTTTGKGAGQI 67
I+K + + +I+F+ +L A I W++ ++P + +V T A Q
Sbjct: 3 ISKLPPLSPSVIRRILFYLLMLLFCQQLAMIFWRIGLPDNAPVS----SVQITPAQARQ- 57

Query: 68 DMAGLQQLALFGKADAKSDKPKVEVVETVTDAPKTSLSIQLTGVVASTADQKGLAVIESS 127
L LFG + K+ ++ +++ P ++L++ LTGV+A D + +A+I
Sbjct: 58 QPVTLNDFTLFGVSPEKNKAGALDA-SQMSNLPPSTLNLSLTGVMAGDDDSRSIAIISKD 116

Query: 128 GSQETYSLGDKIKGTSASLKEVYADRIIITNAGRYETLMLDGLVYTSQSPANQQLQKAKG 187
Q + + +++ G +A + + DR+++ GRYE L L +
Sbjct: 117 NEQFSRGVNEEVPGYNAKIVSIRPDRVVLQYQGRYEVLGLYSQEDSGSDGVPG------- 169

Query: 188 EKAEVVSRVDQRKNTEISQELAESRSELLADPSKITDYIAISPVRQGENVVGYRLNPGKD 247
A+V ++ QR + ++DY++ SP+ + GYRLNPG
Sbjct: 170 --AQVNEQLQQR------------------ASTTMSDYVSFSPIMNDNKLQGYRLNPGPK 209

Query: 248 VNLFKQAGFKPNDLAKSINGYDLTVMSQALEMMSQLPELTEVSIMVEREGQLVEI 302
+ F + G + ND+A ++NG DL QA + M ++ ++ ++ VER+GQ +I
Sbjct: 210 SDSFYRVGLQDNDMAVALNGLDLRDAEQAKKAMERMADVHNFTLTVERDGQRQDI 264


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0151  BCTERIALGSPD  598    0.0      Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.
Length = 660

Score = 598 bits (1542), Expect = 0.0
Identities = 326/678 (48%), Positives = 444/678 (65%), Gaps = 31/678 (4%)

Query: 6 IRRKLIAGVVAGATMLTSQFVWSEQYAANFKGTDIQEFINIVGKNLNKTIIVDPTIRGKI 65
IR + ++ A + +E+++A+FKGTDIQEFIN V KNLNKT+I+DP++RG I
Sbjct: 7 IRSFSLTLLIFAALLFRP--AAAEEFSASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTI 64

Query: 66 NVRSYDLLNDEQYYQFFLNVLQVYGYAIVEMENNVIKVIKDKDAKTAAIRVANDNDPGLG 125
VRSYD+LN+EQYYQFFL+VL VYG+A++ M N V+KV++ KDAKTAA+ VA+D PG+G
Sbjct: 65 TVRSYDMLNEEQYYQFFLSVLDVYGFAVINMNNGVLKVVRSKDAKTAAVPVASDAAPGIG 124

Query: 126 DEMVTRIVALYNTEAKQLAPLLRQLNDNAGGGNVVNYDPSNVLMLSGRAAVVNKLVEIVR 185
DE+VTR+V L N A+ LAPLLRQLNDNAG G+VV+Y+PSNVL+++GRAAV+ +L+ IV
Sbjct: 125 DEVVTRVVPLTNVAARDLAPLLRQLNDNAGVGSVVHYEPSNVLLMTGRAAVIKRLLTIVE 184

Query: 186 RVDKQGDTSVQVVPLEYASAGEMVRIIDTLYRATANQAQLPGQAPKVVADERINAVVVSG 245
RVD GD SV VPL +ASA ++V+++ L + T+ A VVADER NAV+VSG
Sbjct: 185 RVDNAGDRSVVTVPLSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSG 244

Query: 246 DEKSRQRVVELIHRLDAEQASTGNTKVRYLRYAKAEDLVEVLTGFAQKLEGEKDPSAQAG 305
+ SRQR++ +I +LD +QA+ GNTKV YL+YAKA DLVEVLTG + ++ EK +
Sbjct: 245 EPNSRQRIIAMIKQLDRQQATQGNTKVIYLKYAKASDLVEVLTGISSTMQSEKQAAKPVA 304

Query: 306 GKRRNEINIMAHTDTNALVISAEPDQMRTIESVINQLDIRRAQVLVEAIIVEVAEGDNVG 365
+N I I AH TNAL+++A PD M +E VI QLDIRR QVLVEAII EV + D +
Sbjct: 305 ALDKN-IIIKAHGQTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLN 363

Query: 366 FGVQWAAKAGGGTQFNNLGPTIGEIGAGIWQAQGEDGTTVCTENGTCTENPDSRGDVTLL 425
G+QWA K G TQF N G I AG Q + + + L
Sbjct: 364 LGIQWANKNAGMTQFTNSGLPISTAIAGANQYNKDGTVS------------------SSL 405

Query: 426 AQALGKVNGMAWGVAMGDFGALIQAVSADTNSNVLATPSITTLDNQEASFIVGDEVPILT 485
A AL NG+A G G++ L+ A+S+ T +++LATPSI TLDN EA+F VG EVP+LT
Sbjct: 406 ASALSSFNGIAAGFYQGNWAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVPVLT 465

Query: 486 GSTASSNNSNPFQTVERKEVGVKLKVVPQINEGNAVKLAIEQEVSGVNG-----NTGVDI 540
GS +++ N F TVERK VG+KLKV PQINEG++V L IEQEVS V ++ +
Sbjct: 466 GSQ-TTSGDNIFNTVERKTVGIKLKVKPQINEGDSVLLEIEQEVSSVADAASSTSSDLGA 524

Query: 541 SFATRRLTTTVMADSGQIVVLGGLINEEVQESVQKVPFLGDIPVLGHLFKSSSSKKTKKN 600
+F TR + V+ SG+ VV+GGL+++ V ++ KVP LGDIPV+G LF+S+S K +K+N
Sbjct: 525 TFNTRTVNNAVLVGSGETVVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRN 584

Query: 601 LMIFIKPTIIRDGVTMEGIAGRKYNYFRALQLEQ--QERGVNLMPNTQVPILEEWNQSEY 658
LM+FI+PT+IRD + +Y F Q +Q +E ++ + I Q
Sbjct: 585 LMLFIRPTVIRDRDEYRQASSGQYTAFNDAQSKQRGKENNDAMLNQDLLEIYP--RQDTA 642

Query: 659 LPPEVNDILDRYKEGKGL 676
+V+ +D + G L
Sbjct: 643 AFRQVSAAIDAFNLGGNL 660


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0153  BCTERIALGSPF  504    0.0      Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.
Length = 408

Score = 504 bits (1299), Expect = 0.0
Identities = 228/407 (56%), Positives = 305/407 (74%), Gaps = 1/407 (0%)

Query: 1 MPAFEYKALDAKGKQLKGVIEADTARHARSQLRDQRMMPLEILPVTEKEAKAKGTGFS-P 59
M + Y+ALDA+GK+ +G EAD+AR AR LR++ ++PL + + K+ TG S
Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLR 60

Query: 60 FKRGISVAELALITRQIATLVAAGLPIEECLKAVGQQCEKARLASMIMAVRSRVVEGYSL 119
K +S ++LAL+TRQ+ATLVAA +P+EE L AV +Q EK L+ ++ AVRS+V+EG+SL
Sbjct: 61 RKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSL 120

Query: 120 ADSLAEFPHIFDDLYRAMVASGEKSGHLEVVLNRLADYTERRQQLKSKLQQAMIYPIMLT 179
AD++ FP F+ LY AMVA+GE SGHL+ VLNRLADYTE+RQQ++S++QQAMIYP +LT
Sbjct: 121 ADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLT 180

Query: 180 IVAIGVVSVLLAAVVPKVVGQFEHMGAELPATTRFLIAASDFVQSYGLLVVLIIGILLVV 239
+VAI VVS+LL+ VVPKVV QF HM LP +TR L+ SD V+++G ++L + +
Sbjct: 181 VVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMA 240

Query: 240 FQRLLKSPIFKMKYHTFLLKMPVVGRVSKGLNTARFARTLSILSASSVPLLDGMRIASEV 299
F+ +L+ ++ +H LL +P++GR+++GLNTAR+ARTLSIL+AS+VPLL MRI+ +V
Sbjct: 241 FRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDV 300

Query: 300 LQNVRVRAAVDDATARVREGTSLSTALTNTKLFPAMMLYMIASGEKSGQLEDMLERAADN 359
+ N R + AT VREG SL AL T LFP MM +MIASGE+SG+L+ MLERAADN
Sbjct: 301 MSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADN 360

Query: 360 QDREFESNVTLALGVFEPALVVSMAGVVLFIVMAILQPILALNNLIS 406
QDREF S +TLALG+FEP LVVSMA VVLFIV+AILQPIL LN L+S
Sbjct: 361 QDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLMS 407


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0154  BCTERIALGSPG  230    2e-81    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.
Length = 145

Score = 230 bits (588), Expect = 2e-81
Identities = 98/144 (68%), Positives = 118/144 (81%)

Query: 1 MQMNKKHKGFTLLEVMVVIVILGILASMVVPNLMGNKDKADQQKAVSDIVALENALDMYK 60
M+ K +GFTLLE+MVVIVI+G+LAS+VVPNLMGNK+KAD+QKAVSDIVALENALDMYK
Sbjct: 1 MRATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYK 60

Query: 61 LDNGIYPTTEQGLEALVQKPTISPEPRNYRDEGYVKRLPQDPWRNNYLLLSPGENSKLDI 120
LDN YPTT QGLE+LV+ PT+ P NY EGY+KRLP DPW N+Y+L++PGE+ D+
Sbjct: 61 LDNHHYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDL 120

Query: 121 FSAGPDGQPGTEDDIGNWNLQNFQ 144
SAGPDG+ GTEDDI NW L +
Sbjct: 121 LSAGPDGEMGTEDDITNWGLSKKK 144


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0155  BCTERIALGSPH  61     2e-14    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.
Length = 170

Score = 61.5 bits (149), Expect = 2e-14
Identities = 35/154 (22%), Positives = 56/154 (36%), Gaps = 39/154 (25%)

Query: 1 MGLTAAAVTMSIGNSGPQQALEKTAQQFIAATELVLDETVLSGQFIGIVVEKTSYQFVYY 60
MG++A V ++ S A + T +F A V + +GQF G+ V +QF+
Sbjct: 18 MGVSAGMVLLAFPASRDDSAAQ-TLARFEAQLRFVQQRGLQTGQFFGVSVHPDRWQFLVL 76

Query: 61 KDG---------------KWNPLEKDRILSEKQMEPGVVINLVLDGLPLVQEDEQDESWF 105
+ +W PL R+ + + G L Q E+W
Sbjct: 77 EARDGADPAPADDGWSGYRWLPLRAGRVATSGS----------IAGGKLNLAFAQGEAW- 125

Query: 106 DEPLIEPSAEDKKKHPEPQILLFPSGEMSAFELS 139
P +L+FP GEM+ F L+
Sbjct: 126 ------------TPGDNPDVLIFPGGEMTPFRLT 147


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0156  PilS_PF08805  31     0.001    PilS N terminal
		>PilS_PF08805#PilS N terminal

Length = 185

Score = 30.7 bits (69), Expect = 0.001
Identities = 11/32 (34%), Positives = 17/32 (53%)

Query: 5 KGMTLLEVIVALAVFAIAAVSITKSLGEQMAN 36
KG TL+EV++ + V + A S K +N
Sbjct: 26 KGATLMEVLLVVGVIVVLAASAYKLYSMVQSN 57


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0157  BCTERIALGSPG  36     6e-05    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.
Length = 145

Score = 35.6 bits (82), Expect = 6e-05
Identities = 15/41 (36%), Positives = 28/41 (68%), Gaps = 3/41 (7%)

Query: 3 LKQTNAQKGFTLLEMLIAIAIFAMLGLAANAVLSTVLTNDE 43
++ T+ Q+GFTLLE+++ I I +G+ A+ V+ ++ N E
Sbjct: 1 MRATDKQRGFTLLEIMVVIVI---IGVLASLVVPNLMGNKE 38


61  Shewmr7_0242  Shewmr7_0250  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr7_0242  0       18       -4.715973  50S ribosomal protein L23
Shewmr7_0243  0       19       -4.251203  50S ribosomal protein L2
Shewmr7_0244  -1      19       -3.152606  30S ribosomal protein S19
Shewmr7_0245  -1      17       -2.864175  50S ribosomal protein L22
Shewmr7_0246  -1      16       -2.550244  30S ribosomal protein S3
Shewmr7_0247  -1      16       -1.464629  50S ribosomal protein L16
Shewmr7_0248  -2      15       -1.110742  50S ribosomal protein L29
Shewmr7_0249  -1      13       -0.522219  30S ribosomal protein S17
Shewmr7_0250  -2      12       -0.502278  50S ribosomal protein L14
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0242  SHAPEPROTEIN  44     5e-07    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.
Length = 347

Score = 44.0 bits (104), Expect = 5e-07
Identities = 32/156 (20%), Positives = 58/156 (37%), Gaps = 34/156 (21%)

Query: 199 VDIGANMTTFSVVESGETTFIREQAFGGELFTQSILSFYGMSY------EQAEKAKIE-- 250
VDIG T +V+ + GG+ F ++I+++ +Y AE+ K E
Sbjct: 164 VDIGGGTTEVAVISLNGVVYSSSVRIGGDRFDEAIINYVRRNYGSLIGEATAERIKHEIG 223

Query: 251 -------------------GDLPRNY------MFEVLSPFQTQLLQQIKRTLQIYCTSSG 285
+PR + + E L T ++ + L+
Sbjct: 224 SAYPGDEVREIEVRGRNLAEGVPRGFTLNSNEILEALQEPLTGIVSAVMVALEQCPPELA 283

Query: 286 KDKVDY-LVLCGGTSKLEGMANLLTNELGVHTIIAD 320
D + +VL GG + L + LL E G+ ++A+
Sbjct: 284 SDISERGMVLTGGGALLRNLDRLLMEETGIPVVVAE 319


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0246  BCTERIALGSPD  249    1e-75    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.
Length = 660

Score = 249 bits (638), Expect = 1e-75
Identities = 99/411 (24%), Positives = 189/411 (45%), Gaps = 38/411 (9%)

Query: 306 GDITLRLDDVPWDQALDLILQTKGLDKRIEGNILMVAPSEELAIRESQNLKNKQEVKELA 365
GD ++ + W A D++ L+K + L + + E N
Sbjct: 190 GDRSVVTVPLSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGEPNSR 249

Query: 366 PLYSEYLQ----------------INYAKATDIAELLKGADSSLLSPRG----------- 398
++ + YAKA+D+ E+L G S++ S +
Sbjct: 250 QRIIAMIKQLDRQQATQGNTKVIYLKYAKASDLVEVLTGISSTMQSEKQAAKPVAALDKN 309

Query: 399 -SVAVDERTNTVLVKDTAEIIENIHRLVEVLDIPIRQVLIESRMVTVKDDVSEDLGIRWG 457
+ +TN ++V +++ ++ R++ LDI QVL+E+ + V+D +LGI+W
Sbjct: 310 IIIKAHGQTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLGIQWA 369

Query: 458 VTDQQGSKGTSGTLEGAGSIATGTVPSLDNRLNVNLPAAVTNPTSIAFHVAKLADGTILD 517
+ ++ T+ L + +IA + D ++ +L +A+++ IA +
Sbjct: 370 NKNAGMTQFTNSGLPISTAIAGANQYNKDGTVSSSLASALSSFNGIAAGFYQGN----WA 425

Query: 518 LELSALEQENKGEIIASPRITTSNQKAAYIEQGVEIPYV-----QSTSSGATSVTFKKAV 572
+ L+AL K +I+A+P I T + A G E+P + S + +V K
Sbjct: 426 MLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVPVLTGSQTTSGDNIFNTVERKTVG 485

Query: 573 LSLRVTPQITPDNRVILDLEITQDSQGKT-VDTPTGPAVAIDTQRIGTQVLVDNGETIVL 631
+ L+V PQI + V+L++E S T + +T+ + VLV +GET+V+
Sbjct: 486 IKLKVKPQINEGDSVLLEIEQEVSSVADAASSTSSDLGATFNTRTVNNAVLVGSGETVVV 545

Query: 632 GGIYQQNLISRVSKVPILGDIPLVGFLFRNTTDKNERQELLIFVTPKIVNE 682
GG+ +++ KVP+LGDIP++G LFR+T+ K ++ L++F+ P ++ +
Sbjct: 546 GGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRNLMLFIRPTVIRD 596



Score = 46.8 bits (111), Expect = 3e-07
Identities = 33/175 (18%), Positives = 75/175 (42%), Gaps = 14/175 (8%)

Query: 275 SLNFQNISVRTVLQIIADYNNFNLVTSDTVEGDITLR-LDDVPWDQALDLILQTKGLDKR 333
S +F+ ++ + ++ N ++ +V G IT+R D + +Q L
Sbjct: 31 SASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTITVRSYDMLNEEQYYQFFLSVL----D 86

Query: 334 IEGNILMVAPSEELAIRESQNLKNKQEVK--ELAP-----LYSEYLQINYAKATDIAELL 386
+ G ++ + L + S++ K + AP + + + + A D+A LL
Sbjct: 87 VYGFAVINMNNGVLKVVRSKDAKTAAVPVASDAAPGIGDEVVTRVVPLTNVAARDLAPLL 146

Query: 387 KGADSSLLSPRGSVAVDERTNTVLVKDTAEIIENIHRLVEVLDIPIRQVLIESRM 441
+ + + + GSV E +N +L+ A +I+ + +VE +D + ++ +
Sbjct: 147 RQLNDN--AGVGSVVHYEPSNVLLMTGRAAVIKRLLTIVERVDNAGDRSVVTVPL 199


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0247  PF05272       31     0.002    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 31.2 bits (70), Expect = 0.002
Identities = 16/64 (25%), Positives = 25/64 (39%), Gaps = 8/64 (12%)

Query: 9 LVGPMGAGKSTIGRHLAQML-----HLEFHDSDQEIEQRTGADIAWVFDVEGEEGFRRRE 63
L G G GKST+ L + H + EQ G +++ FRR +
Sbjct: 601 LEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQIAG---IVAYELSEMTAFRRAD 657

Query: 64 AQVI 67
A+ +
Sbjct: 658 AEAV 661


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0249  PF05272       30     0.027    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.7 bits (66), Expect = 0.027
Identities = 16/65 (24%), Positives = 24/65 (36%)

Query: 7 ALVERLHHVASYSDQLLVLVGAHGSGKTTLLTALATDFDESNAALVICPMHADNAEIRRK 66
V R+ D +VL G G GK+TL+ L S+ I +I
Sbjct: 583 GHVARVMEPGCKFDYSVVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQIAGI 642

Query: 67 ILVQL 71
+ +L
Sbjct: 643 VAYEL 647


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0250  TYPE3IMSPROT  34     6e-04    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.
Length = 354

Score = 34.0 bits (78), Expect = 6e-04
Identities = 20/90 (22%), Positives = 31/90 (34%), Gaps = 17/90 (18%)

Query: 164 IGYEKAFEQIRTGDVIYCDPP-------YAPLSTTASFTTYVGAGFSLDDQALLARYSRH 216
I E ++ V+ +P Y T T+ D Q R
Sbjct: 245 IQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYT----DAQVQ---TVRK 297

Query: 217 MALEQRIPVVISNHDIPLTRELYRGAHLAK 246
+A E+ +P++ IPL R LY A +
Sbjct: 298 IAEEEGVPIL---QRIPLARALYWDALVDH 324


62  Shewmr7_0421  Shewmr7_0428  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr7_0421  0       19       3.278697   hypothetical protein
Shewmr7_0422  0       18       2.955209   hypothetical protein
Shewmr7_0423  1       18       2.634333   hypothetical protein
Shewmr7_0424  2       18       1.879409   superfamily II helicase
Shewmr7_0425  1       15       -3.825042  phage transcriptional regulator, AlpA
Shewmr7_0426  1       20       -5.055821  hypothetical protein
Shewmr7_0427  0       21       -5.707208  hypothetical protein
Shewmr7_0428  -1      16       -4.414626  phage integrase family protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0421  RTXTOXIND     40     1e-05    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.
Length = 478

Score = 39.8 bits (93), Expect = 1e-05
Identities = 45/221 (20%), Positives = 81/221 (36%), Gaps = 36/221 (16%)

Query: 109 AQIHELEKQLSQLELNNLSLNAEILTQLQQRIDVAAEGVTRQNGLLDSFERYQRKGVVPT 168
Q + Q Q ELN AE LT L + ++ LD F K +
Sbjct: 193 EQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSR-LDDFSSLLHKQAIAK 251

Query: 169 ----------ADMAAVLQAHTASKMALE----QAKVDLMQARQAQKTELLAGPIAQSKYN 214
+ L+ + + +E AK + Q K E+L + Q+ N
Sbjct: 252 HAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEIL-DKLRQTTDN 310

Query: 215 ---VELQLARLKAQESQLDIKALTPTRVVDV-------LVQAGEHIVEDRPLVLLSGREA 264
+ L+LA+ + ++ I+A +V + +V E ++ P +
Sbjct: 311 IGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPE-----DDT 365

Query: 265 AVIFAYLEPKYLEYTAIGQEATIKLP--NGTR---LRGEIS 300
+ A ++ K + + +GQ A IK+ TR L G++
Sbjct: 366 LEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVK 406


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0424  NUCEPIMERASE  70     9e-16    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 69.8 bits (171), Expect = 9e-16
Identities = 53/234 (22%), Positives = 88/234 (37%), Gaps = 31/234 (13%)

Query: 3 NIMVTGATGLLGRAVVKQLTAAGHRVIA---------TGFSRAEAGI--------HRLDL 45
+VTGA G +G V K+L AGH+V+ +A + H++DL
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 46 TQAAEVEAFIAREQPEVIVHCAAERRPDVSERSPEHALALNLSASQTLAEVAKTHQ-AWL 104
+ A E + S +P NL+ + E + ++ L
Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQHL 121

Query: 105 LYISTDYVF-DGTTPPYAEDAEPN-PVNFYGASKLQGETCVLSTDNGFAV----LRLPIL 158
LY S+ V+ P++ D + PV+ Y A+K E + + + + LR +
Sbjct: 122 LYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFFTV 181

Query: 159 YGEVTQLNESAVLVLINQLLDGRPQRV----DHWAIRAPTSTADIANAIAKLIQ 208
YG + A+ +L+G+ V R T DIA AI +L
Sbjct: 182 YGP-WGRPDMALFKFTKAMLEGKSIDVYNYGKMK--RDFTYIDDIAEAIIRLQD 232


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0426  HTHFIS        87     2e-22    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 86.8 bits (215), Expect = 2e-22
Identities = 26/110 (23%), Positives = 57/110 (51%)

Query: 3 RLLIIEDDQALAGVLARRLTRHGFECRLSHDASNALLVAREFCPTHILLDMKLAEANGLG 62
+L+ +DD A+ VL + L+R G++ R++ +A+ ++ D+ + + N
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 63 LIVPLRNLLPKVIMVLLTGYASIATAVEAIRLGADNYLAKPVDTQTLLAA 112
L+ ++ P + +++++ + TA++A GA +YL KP D L+
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGI 114



Score = 48.3 bits (115), Expect = 3e-09
Identities = 19/58 (32%), Positives = 29/58 (50%), Gaps = 2/58 (3%)

Query: 116 NSQASALPEDEIDDSPLTPKRLEWEHIQQVLNANQGNVSATARQLGMHRRTLQRKLLK 173
S ALP + D L +E+ I L A +GN A LG++R TL++K+ +
Sbjct: 417 ASFGDALPPSGLYDRVL--AEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRE 472


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0428  DHBDHDRGNASE  43     3e-07    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.
Length = 261

Score = 43.1 bits (101), Expect = 3e-07
Identities = 37/195 (18%), Positives = 74/195 (37%), Gaps = 29/195 (14%)

Query: 55 LEEEIKQLSQNIPQLDWLINCIGMLHTEDKGPEKSLQALDGDFLQHNIQLNTLPSMMLAK 114
++E ++ + + +D L+N G+L G SL + + +N+ ++
Sbjct: 72 IDEITARIEREMGPIDILVNVAGVLRP---GLIHSLSDEE---WEATFSVNSTGVFNASR 125

Query: 115 HFETALKRSVSVRFAVVSAKVGSISDNRLGGWYSYRASKAALNMFLKTLSIEWQRSMKHC 174
+ S V + + + +Y +SKAA MF K L +E C
Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMA---AYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 175 VVLALHPGTTDTRLSKP------------------FQQNVPKEKLFTPEYVAQCLVSIIA 216
+++ PG+T+T + F+ +P +KL P +A ++ +++
Sbjct: 183 NIVS--PGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVS 240

Query: 217 NATPAQTGSFLAYDG 231
T L DG
Sbjct: 241 GQAGHITMHNLCVDG 255


63  Shewmr7_0464  Shewmr7_0479  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr7_0464  -2      15       1.880816   UDP-N-acetylmuramoylalanyl-D-glutamate--2,
Shewmr7_0465  -3      14       2.133526   UDP-N-acetylmuramoyl-tripeptide--D-alanyl-D-
Shewmr7_0466  -3      14       1.885428   phospho-N-acetylmuramoyl-pentapeptide-
Shewmr7_0467  -2      15       2.009081   UDP-N-acetylmuramoyl-L-alanyl-D-glutamate
Shewmr7_0468  -3      16       0.938749   cell division protein FtsW
Shewmr7_0469  -1      17       0.068023   undecaprenyldiphospho-muramoylpentapeptide
Shewmr7_0470  -1      19       -1.757378  hypothetical protein
Shewmr7_0471  0       26       -2.472937  UDP-N-acetylmuramate--L-alanine ligase
Shewmr7_0472  1       26       -1.903027  polypeptide-transport-associated
Shewmr7_0473  0       20       -1.929605  cell division protein FtsA
Shewmr7_0474  0       17       -1.247161  cell division protein FtsZ
Shewmr7_0475  0       18       -0.441147  UDP-3-O-[3-hydroxymyristoyl] N-acetylglucosamine
Shewmr7_0476  0       17       -0.185559  hypothetical protein
Shewmr7_0477  0       15       0.368216   peptidase M23B
Shewmr7_0478  -1      11       0.884286   preprotein translocase subunit SecA
Shewmr7_0479  -3      14       2.308299   ***hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0464  RTXTOXIND     27     0.041    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.
Length = 478

Score = 27.5 bits (61), Expect = 0.041
Identities = 7/61 (11%), Positives = 29/61 (47%), Gaps = 4/61 (6%)

Query: 39 IESLWKQQYSTAQQLKALEQENQISMQQIDLYQQRLAMDPNQDYRQRLNLLQQQNQQIDA 98
++ ++ + ++ E +++ ++D + L ++ + +L+Q+N+ ++A
Sbjct: 209 LDKKRAERLTVLARINRYENLSRVEKSRLDDFSS-LL---HKQAIAKHAVLEQENKYVEA 264

Query: 99 Q 99

Sbjct: 265 V 265


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0466  BCTERIALGSPD  188    2e-54    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.
Length = 660

Score = 188 bits (480), Expect = 2e-54
Identities = 78/318 (24%), Positives = 145/318 (45%), Gaps = 28/318 (8%)

Query: 237 ELKETLSAIIGDTGGGRQVVVT--PQAGLVTIRAYPNELRQVRAFLNSAESHLQRQVILE 294
++ A + +++ Q + + A P+ + + + + + QV++E
Sbjct: 292 TMQSEKQAAKPVAALDKNIIIKAHGQTNALIVTAAPDVMNDLERVIAQLDIR-RPQVLVE 350

Query: 295 AKILEVTLSDGYQQGIQWDNVLGHV---GNTNINFGTSAGAGLS----DKITASLGGVTS 347
A I EV +DG GIQW N + N+ + T+ +++SL S
Sbjct: 351 AIIAEVQDADGLNLGIQWANKNAGMTQFTNSGLPISTAIAGANQYNKDGTVSSSLASALS 410

Query: 348 ------LSIKGSDFNTMISLLDTQGDVDVLSSPRVTASNNQKAVIKVGTDEYFVTDVSST 401
++ +++ L + D+L++P + +N +A VG + +T S
Sbjct: 411 SFNGIAAGFYQGNWAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVPVLT--GSQ 468

Query: 402 TVAGTTPVTTPQVELTPFFSGIALDVTPQIDKDGNVLLHVHPSVIDVKEQTKDIKVSSES 461
T +G T + + GI L V PQI++ +VLL + V V + SS S
Sbjct: 469 TTSGDNIFNTVERKTV----GIKLKVKPQINEGDSVLLEIEQEVSSVADAA-----SSTS 519

Query: 462 LELPLAQSEIRESDTVIRAASGDVVVIGGLMKSENTEVVSQVPLLGDIPLVGELFKNRSK 521
+L + R + + SG+ VV+GGL+ ++ +VPLLGDIP++G LF++ SK
Sbjct: 520 SDLGATFN-TRTVNNAVLVGSGETVVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSK 578

Query: 522 QKKKTELIIMLKPTVVGN 539
+ K L++ ++PTV+ +
Sbjct: 579 KVSKRNLMLFIRPTVIRD 596


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0468  IGASERPTASE   39     5e-05    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 38.9 bits (90), Expect = 5e-05
Identities = 26/169 (15%), Positives = 57/169 (33%), Gaps = 15/169 (8%)

Query: 89 IDTSPLETETTTSTEPTAEM-----------AQVSPSSTEVPAKDMSASAESAPRVAQSP 137
+DT+ + T + + A V P + P++ AE++ + +++
Sbjct: 992 VDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTV 1051

Query: 138 RVNAAQV-EPTSEPITESVAASPSTDTPNKAVLTQEQAQTQAEPQQVAVKVNQADVNANQ 196
N E T++ + A + + + E Q K +
Sbjct: 1052 EKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEK 1111

Query: 197 SEVKITQTEPKASEPFVPAATQASSQTTPQASSQASPQAQSTGQMAIRE 245
++V +TE P V + + + QA P ++ + I+E
Sbjct: 1112 AKV---ETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKE 1157


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0470  BCTERIALGSPF  301    e-101    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.
Length = 408

Score = 301 bits (773), Expect = e-101
Identities = 117/407 (28%), Positives = 206/407 (50%), Gaps = 6/407 (1%)

Query: 1 MPVYQYRGRSGQGQAVTGQLDAASEGAAADMLLARGIIPLEVKVAKEAK----SFTLAQL 56
M Y Y+ QG+ G +A S A +L RG++PL V + + S L+
Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLR 60

Query: 57 FSSKVGLDELQIFTRQMYSLTRSGIPILRAIAGLSETAHSQRMKDALNDISEQLTAGRPL 116
++ +L + TRQ+ +L + +P+ A+ +++ + + + + ++ G L
Sbjct: 61 RKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSL 120

Query: 117 SSAMNQHPDVFDSLFVSMVHVGENTGKLEDAFIQLSGYIEREQETRRRIKAAMRYPIFVL 176
+ AM P F+ L+ +MV GE +G L+ +L+ Y E+ Q+ R RI+ AM YP VL
Sbjct: 121 ADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPC-VL 179

Query: 177 IAIALAMV-ILNIMVIPKFAEMFSRFGADLPWATKVLIGTSNLFVNYWPLMLIILLGTII 235
+A+A+V IL +V+PK E F LP +T+VL+G S+ + P ML+ LL +
Sbjct: 180 TVVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFM 239

Query: 236 GIRYWHHTEKGEKQWDQWKLHIPAVGSIIERSTLSRYCRSFSMMLSAGVPMTQALSLVAD 295
R EK + + LH+P +G I +RY R+ S++ ++ VP+ QA+ + D
Sbjct: 240 AFRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGD 299

Query: 296 AVDNAYMHDKIVGMRRGIESGESMLRVSNQSKLFTPLVLQMVAVGEETGQIDQLLNDAAD 355
+ N Y ++ + G S+ + Q+ LF P++ M+A GE +G++D +L AAD
Sbjct: 300 VMSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAAD 359

Query: 356 FYEGEVDYDLKNLTAKLEPILIGIVAVIVLVLALGIYLPMWDMLNVV 402
+ E + EP+L+ +A +VL + L I P+ + ++
Sbjct: 360 NQDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLM 406


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0472  BCTERIALGSPG  43     1e-07    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.
Length = 145

Score = 42.6 bits (100), Expect = 1e-07
Identities = 15/36 (41%), Positives = 27/36 (75%)

Query: 4 KQDGFSLIELVIVIVILGLLAATAIPRFLNVTDDAE 39
KQ GF+L+E+++VIVI+G+LA+ +P + + A+
Sbjct: 6 KQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKAD 41


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0473  BCTERIALGSPG  50     3e-10    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.
Length = 145

Score = 49.5 bits (118), Expect = 3e-10
Identities = 18/53 (33%), Positives = 31/53 (58%), Gaps = 4/53 (7%)

Query: 1 MKRQQGFTLIELVVVIIILGILAVTAAPKFINLQSDARA----SAIQGMKGAI 49
+Q+GFTL+E++VVI+I+G+LA P + + A S I ++ A+
Sbjct: 4 TDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENAL 56


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0474  BCTERIALGSPH  41     3e-07    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.
Length = 170

Score = 41.5 bits (97), Expect = 3e-07
Identities = 26/80 (32%), Positives = 41/80 (51%), Gaps = 1/80 (1%)

Query: 8 RQFGFTLVELVTTIILIGILSVTVLPRLFSQSSYSAFSLRNEFMAELRQVQQKALNNTDR 67
RQ GFTL+E++ ++L+G+ + VL + SA F A+LR VQQ+ L +
Sbjct: 2 RQRGFTLLEMMLILLLMGVSAGMVLLAFPASRDDSAAQTLARFEAQLRFVQQRGLQT-GQ 60

Query: 68 CYRVVVSATGYQVSQFASRD 87
+ V V +Q +RD
Sbjct: 61 FFGVSVHPDRWQFLVLEARD 80


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0475  BCTERIALGSPH  36     2e-05    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.
Length = 170

Score = 36.5 bits (84), Expect = 2e-05
Identities = 15/57 (26%), Positives = 31/57 (54%), Gaps = 4/57 (7%)

Query: 23 QQGFTLIELVVGMLVIAIAIVM-LSSMLFPQADRAAKTLHRVKSA-ELA--HSVMNE 75
Q+GFTL+E+++ +L++ ++ M L + + D AA+TL R ++ +
Sbjct: 3 QRGFTLLEMMLILLLMGVSAGMVLLAFPASRDDSAAQTLARFEAQLRFVQQRGLQTG 59


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0476  BCTERIALGSPG  35     1e-04    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.
Length = 145

Score = 34.9 bits (80), Expect = 1e-04
Identities = 12/24 (50%), Positives = 19/24 (79%)

Query: 8 RMARSKRGFTLVEMVTVILILGIL 31
R +RGFTL+E++ VI+I+G+L
Sbjct: 2 RATDKQRGFTLLEIMVVIVIIGVL 25


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0479  SHAPEPROTEIN  557    0.0      Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.
Length = 347

Score = 557 bits (1437), Expect = 0.0
Identities = 315/348 (90%), Positives = 333/348 (95%), Gaps = 1/348 (0%)

Query: 1 MFKKLRGIFSNDLSIDLGTANTLIYVRDEGIVLNEPSVVAIRGERSSSGQKSVAAVGTEA 60
M KK RG+FSNDLSIDLGTANTLIYV+ +GIVLNEPSVVAIR +R+ S KSVAAVG +A
Sbjct: 1 MLKKFRGMFSNDLSIDLGTANTLIYVKGQGIVLNEPSVVAIRQDRAGS-PKSVAAVGHDA 59

Query: 61 KQMLGRTPGNIQAIRPMKDGVIADFYVTEKMLQHFIKQVHNNSFFRPSPRVLVCVPVGAT 120
KQMLGRTPGNI AIRPMKDGVIADF+VTEKMLQHFIKQVH+NSF RPSPRVLVCVPVGAT
Sbjct: 60 KQMLGRTPGNIAAIRPMKDGVIADFFVTEKMLQHFIKQVHSNSFMRPSPRVLVCVPVGAT 119

Query: 121 QVERRAIRESAMGAGAREVYLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAIISLN 180
QVERRAIRESA GAGAREV+LIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVA+ISLN
Sbjct: 120 QVERRAIRESAQGAGAREVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLN 179

Query: 181 GVVYSSSVRIGGDKFDDAIINYVRRNYGSLIGEATAERIKHTIGTAYPGDEVLEIEVRGR 240
GVVYSSSVRIGGD+FD+AIINYVRRNYGSLIGEATAERIKH IG+AYPGDEV EIEVRGR
Sbjct: 180 GVVYSSSVRIGGDRFDEAIINYVRRNYGSLIGEATAERIKHEIGSAYPGDEVREIEVRGR 239

Query: 241 NLAEGVPRSFTLNSNEILEALQEPLSGIVSAVMVALEQSPPELASDISERGMVLTGGGAL 300
NLAEGVPR FTLNSNEILEALQEPL+GIVSAVMVALEQ PPELASDISERGMVLTGGGAL
Sbjct: 240 NLAEGVPRGFTLNSNEILEALQEPLTGIVSAVMVALEQCPPELASDISERGMVLTGGGAL 299

Query: 301 LRDLDRLLMQETGIPVMVADDPLTCVARGGGKALEMIDMHGGDLFSEE 348
LR+LDRLLM+ETGIPV+VA+DPLTCVARGGGKALEMIDMHGGDLFSEE
Sbjct: 300 LRNLDRLLMEETGIPVVVAEDPLTCVARGGGKALEMIDMHGGDLFSEE 347


64  Shewmr7_0544  Shewmr7_0551  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr7_0544  -3      14       1.837331   hypothetical protein
Shewmr7_0545  -2      12       1.088125   hypothetical protein
Shewmr7_0546  -1      12       1.097325   hypothetical protein
Shewmr7_0547  -2      12       1.017984   SPFH domain-containing protein/band 7 family
Shewmr7_0548  -2      12       1.322079   SPFH domain-containing protein/band 7 family
Shewmr7_0549  -1      13       1.042586   alkylhydroperoxidase
Shewmr7_0550  -2      14       0.772586   AraC family transcriptional regulator
Shewmr7_0551  -1      15       -0.012815  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0544  RTXTOXINA     29     0.031    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.
Length = 1024

Score = 29.2 bits (65), Expect = 0.031
Identities = 11/41 (26%), Positives = 21/41 (51%), Gaps = 4/41 (9%)

Query: 124 LAAGLSSSGALVVAFGTAISDSSQLHLSPMAVAQLAQRGEY 164
A GLS+S A +A++ L +SP++ +A + +
Sbjct: 296 AAQGLSTSAAAAGLIASAVT----LAISPLSFLSIADKFKR 332


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0545  FLGHOOKAP1    35     6e-04    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 35.3 bits (81), Expect = 6e-04
Identities = 20/110 (18%), Positives = 39/110 (35%), Gaps = 15/110 (13%)

Query: 412 GEILGIE---HKQELVDLHRANGRNVVQGDAADTDFWEKLDKAPNLELVLLAMPHHAGNL 468
+I+G+E ++ ANG ++VQG A + + + +A
Sbjct: 209 NQIVGVEVSVQDGGTYNITMANGYSLVQGSTARQ--LAAVPSSADPSRTTVAYVDGTAG- 265

Query: 469 FAVEQLKKLNYQGKLSAIV--------QYGDDAASLRASGVHSVYNLYEA 510
+E +KL G L I+ Q + L + + ++A
Sbjct: 266 -NIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQLALAFAEAFNTQHKA 314


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0550  ACRIFLAVINRP  501    e-163    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 501 bits (1292), Expect = e-163
Identities = 207/1046 (19%), Positives = 443/1046 (42%), Gaps = 53/1046 (5%)

Query: 3 IAEYSIRHKVISWMFVLLLLVGGGVSFTGLGQLEFPEFTIKEALVITAYPGASPEQVEEE 62
+A + IR + +W+ ++L++ G ++ L ++P V YPGA + V++
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 63 VTLPLEDALQQLDAVKHVTSI-NSAGLSQIQIEIKENYDKTSLPQVWDEVRRKVNDTAGQ 121
VT +E + +D + +++S +SAG I + + T +V+ K+
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQ---SGTDPDIAQVQVQNKLQLATPL 117

Query: 122 LPPGTSTPKVFDDFGD---VYGILFNLSGPDYSNRELSNYAD-YLRREIVLVPGVKKVSV 177
LP + + + F P + ++S+Y ++ + + GV V +
Sbjct: 118 LPQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQL 177

Query: 178 AGSVTEQVVIEISQQRLSALGLDQSYIYGLINNQNVVSNAGSLVVGDN------RIRIHP 231
G+ + I + L+ L + + QN AG L I
Sbjct: 178 FGA-QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIA 236

Query: 232 TGEFSSVQDLARLIVSPPGSTELIYLGDIANIEKDYDETPDVLYHNRGETALSLGISFSS 291
F + ++ ++ + ++ L D+A +E + + N G+ A LGI ++
Sbjct: 237 QTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARIN-GKPAAGLGIKLAT 295

Query: 292 GVNVVEVGKSVSQRLAELESQRPIGMNLDTVYNQSLAVDDTVNGFLINLLESIAIVIAVL 351
G N ++ K++ +LAEL+ P GM + Y+ + V +++ + L E+I +V V+
Sbjct: 296 GANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVM 355

Query: 352 LLFMG-LRSGLLMGLILLLTILGTFIVMKVLGIELQLISLGALIIALGMLVDNAIVVTEG 410
LF+ +R+ L+ + + + +LGTF ++ G + +++ +++A+G+LVD+AIVV E
Sbjct: 356 YLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVEN 415

Query: 411 ILIGLRRGKTR-LEAAKQIVSQTQWPLLGATVIAIIAFAPIGLSQNAAGEFCRSLFQVLM 469
+ + K EA ++ +SQ Q L+G ++ F P+ + G R ++
Sbjct: 416 VERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIV 475

Query: 470 ISLFISWITAITLTPFFCHLLFKDAPADDDEEQDPYKGWF-------FSLYRVSLTFALR 522
++ +S + A+ LTP C L K A+ E + + GWF + Y S+ L
Sbjct: 476 SAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILG 535

Query: 523 FRLASIVLVGVMLVSAVIGFGHIKNVFFPASNTPIFFVDIWMPEGTDIKGTERFTADIEK 582
+++ +++ V+ F + + F P + +F I +P G + T++ +
Sbjct: 536 STGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTD 595

Query: 583 LLLKQAEEQHSGLKHLTSVIG-------QGAQRFILPYQPEKGYPAYAQLIIEMEDLASL 635
LK + ++ + +V G Q A + +P + +
Sbjct: 596 YYLKNEKA---NVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSA--EAVIHRA 650

Query: 636 KVYMPELETLLNQRFPQAQYRFKNMENGPSPAAKIEARFYGDDPEVLRALGAQAEAIFNA 695
K+ + ++ F + ++ + + +A
Sbjct: 651 KMELGKIRDGFVIPFNMP--AIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQH 708

Query: 696 EPSMDGIRHDWRNQVPLIRPQLENAQARETGISKQDLDNALLINFSGKQIGLYRETSHLL 755
S+ +R + + +++ +A+ G+S D++ + G + + + +
Sbjct: 709 PASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVK 768

Query: 756 PIVARAPAEERLQADSLWKLQIWSTEHNTFVPATQVVSQFETQWENPLVKRRDRMRMLAV 815
+ +A A+ R+ + + KL + + + VP + + + +P ++R + + + +
Sbjct: 769 KLYVQADAKFRMLPEDVDKLYV-RSANGEMVPFSAFTT-SHWVYGSPRLERYNGLPSMEI 826

Query: 816 LADPKLGSD-ETADSVLRKVKDKVEAISLPTGYHLEWGGEYETAGEAQTAVFSSIPMGYL 874
+ G+ A +++ + K LP G +W G + + + + ++
Sbjct: 827 QGEAAPGTSSGDAMALMENLASK-----LPAGIGYDWTGMSYQERLSGNQAPALVAISFV 881

Query: 875 VMFLITVFLFNSVRQPLVIWFTVPLALIGVSAGLLLFDAPFSFMALLGLLSLSGMVIKNG 934
V+FL L+ S P+ + VPL ++GV LF+ ++GLL+ G+ KN
Sbjct: 882 VVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNA 941

Query: 935 IVLVDQIN-LELGEGKPAYAALVDSSVSRVRPVLMAAITTMLGMIPLIPDAFFGS----- 988
I++V+ L EGK A + + R+RP+LM ++ +LG++PL GS
Sbjct: 942 ILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNA 1001

Query: 989 MAITIIFGLGFASLLTLIVLPVMYSL 1014
+ I ++ G+ A+LL + +PV + +
Sbjct: 1002 VGIGVMGGMVSATLLAIFFVPVFFVV 1027



Score = 74.1 bits (182), Expect = 2e-15
Identities = 43/219 (19%), Positives = 94/219 (42%), Gaps = 13/219 (5%)

Query: 814 AVLADPKLGSDETADSVLRKVKDKVEAI--SLPTGYHLEWGGEYETAGEAQTA---VFSS 868
A KL + A + +K K+ + P G ++ Y+T Q + V +
Sbjct: 286 AAGLGIKLATGANALDTAKAIKAKLAELQPFFPQG--MKVLYPYDTTPFVQLSIHEVVKT 343

Query: 869 IPMGYLVMFLITVFLFNSVRQPLVIWFTVPLALIGVSAGLLLFDAPFSFMALLGLLSLSG 928
+ +++FL+ ++R L+ VP+ L+G A L F + + + G++ G
Sbjct: 344 LFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIG 403

Query: 929 MVIKNGIVLVDQINLELGEGKPAYAALVDSSVSRVR-PVLMAAITTMLGMIPL-----IP 982
+++ + IV+V+ + + E K + S+S+++ ++ A+ IP+
Sbjct: 404 LLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGST 463

Query: 983 DAFFGSMAITIIFGLGFASLLTLIVLPVMYSLAFNIKPN 1021
A + +ITI+ + + L+ LI+ P + +
Sbjct: 464 GAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSA 502


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
Shewmr7_0551  RTXTOXIND  42     3e-06    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 42.1 bits (99), Expect = 3e-06
Identities = 44/269 (16%), Positives = 91/269 (33%), Gaps = 49/269 (18%)

Query: 25 SSVQATAIRPVKLFEVVQLEGGDFRTFPAR--VSANSRAELSFRISGELTDLALVEGQ-- 80
+V A R L V + DF + + ++ ++ E + + +L + + Q
Sbjct: 217 LTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLE 276

Query: 81 QIRQGSLLAKLDDRDAHNNLMTREAEHELLAADFQRKTELLKRKLISQAEFDSTQAQLKS 140
QI L AK E++L+ F+ + + Q +
Sbjct: 277 QIESEILSAKE--------------EYQLVTQLFKNEI----LDKLRQT-----TDNIGL 313

Query: 141 AKAALAAARDQLSYTKLIAPFSGTVAKRLVDNH-QIVQANQGILTL-QNNNLLDVSIQVP 198
LA ++ + + AP S V + V +V + ++ + ++ L+V+ V
Sbjct: 314 LTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQ 373

Query: 199 EAMAASLNTYVQQQNFTAKVRFSALAGMEF---DAKFKEYSTQVTPGTQ---AYEVVFSL 252
+ A ++ A + K K + + + V+ S+
Sbjct: 374 NKDIGFI-----NVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNVIISI 428

Query: 253 PQP------KDIQLLPGMSAELTLALVKT 275
+ K+I L GM+ A +KT
Sbjct: 429 EENCLSTGNKNIPLSSGMAVT---AEIKT 454



Score = 29.0 bits (65), Expect = 0.035
Identities = 11/84 (13%), Positives = 30/84 (35%), Gaps = 5/84 (5%)

Query: 68 SGELTDLALVEGQQIRQGSLLAKLDDRDAHNNLMTREAEHELLAADFQRKTELLKRKLIS 127
+ + ++ + EG+ +R+G +L KL A + + ++ + R + L
Sbjct: 104 NSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTR-----YQILSR 158

Query: 128 QAEFDSTQAQLKSAKAALAAARDQ 151
E + + ++
Sbjct: 159 SIELNKLPELKLPDEPYFQNVSEE 182


65  Shewmr7_0612  Shewmr7_0619  N        Y        N        Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias   Product
Shewmr7_0612   0      18       1.616809  hypothetical protein
Shewmr7_0613   0      19       1.042218  type 11 methyltransferase
Shewmr7_0614   0      19       1.554814  RNA-directed DNA polymerase (Reverse
Shewmr7_0615   1      18       2.979552  hypothetical protein
Shewmr7_0616   2      18       4.013805  phosphoribosylaminoimidazole-succinocarboxamide
Shewmr7_0617  -1      13       3.689084  hypothetical protein
Shewmr7_0618  -2      14       3.446334  pentapeptide repeat-containing protein
Shewmr7_0619   0      24       3.489565  twin-arginine translocation pathway signal
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0612  NUCEPIMERASE  37     2e-05    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 37.5 bits (87), Expect = 2e-05
Identities = 29/123 (23%), Positives = 44/123 (35%), Gaps = 23/123 (18%)

Query: 1 MKIAVLGASGWIGGTILNEALSRGHEVVAL-----VRDPS-------KLGETAAEVRSVD 48
MK V GA+G+IG + L GH+VV + D S L + + +D
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 49 LT-QPLKADTFA--GVDVVI---AAVGARAEQNHGIVAKTVN-----NLLAVLPQAKVPR 97
L + D FA + V + R + N N+L K+
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120

Query: 98 LLW 100
LL+
Sbjct: 121 LLY 123


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0613  ARGREPRESSOR  147    2e-48    Bacterial arginine repressor signature.
		>ARGREPRESSOR#Bacterial arginine repressor signature.

Length = 149

Score = 147 bits (372), Expect = 2e-48
Identities = 43/150 (28%), Positives = 71/150 (47%), Gaps = 5/150 (3%)

Query: 6 NQDDLVRIFKAILKEERFGSQSEIVAALQAEGFSNINQSKVSRMLSKFGAVRTRNAKQEM 65
N+ + I+ +Q E+V L+ +G+ N+ Q+ VSR + + V+
Sbjct: 2 NKGQRHIKIREIITANEIETQDELVDILKKDGY-NVTQATVSRDIKELHLVKVPTNNGSY 60

Query: 66 VYCLPAELGVPTAGSPLKNLV---LDVDHNQAMIVVRTSPGAAQLIARLLDSIGKPEGIL 122
Y LPA+ ++L+ + +D +IV++T PG AQ I L+D++ E I+
Sbjct: 61 KYSLPADQRFNPLSKLKRSLMDAFVKIDSASHLIVLKTMPGNAQAIGALMDNLDWEE-IM 119

Query: 123 GTIAGDDTIFICPSSIQDIADTLETIKSLF 152
GTI GDDTI I + D + I L
Sbjct: 120 GTICGDDTILIICRTHDDTKVVQKKILELL 149


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0616  DHBDHDRGNASE  49     4e-09    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 48.9 bits (116), Expect = 4e-09
Identities = 53/257 (20%), Positives = 98/257 (38%), Gaps = 31/257 (12%)

Query: 5 IIITGVGKRIGYALAKHLLAQGHKVIG-----TYRSHYPSIDELQSLGATLIQCDFYDNA 59
ITG + IG A+A+ L +QG + S + ++ A D D+A
Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDSA 70

Query: 60 RLQTLIEQL-SQYPKIRAIIHNASDWLPDNSPSLAAHEVMQRMMQVHVSVPYQMNLALAS 118
+ + ++ + I +++ A P SL+ E + V+ + + + +++
Sbjct: 71 AIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEW-EATFSVNSTGVFNASRSVSK 129

Query: 119 QLRAGAEGEIG--ASDIIHFTDYVAEKGSAKHMAYAASKAALDNLTLSFAAQLAP-GVKV 175
+ G I S+ V A AYA+SKAA T +LA ++
Sbjct: 130 YMMDRRSGSIVTVGSNPAG----VPRTSMA---AYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 176 NAIAPAMI-------LFNPSDDEAYRQKTLAKAI-----LPKEAGNQEIIALVDYLLASR 223
N ++P L+ + K + L K A +I V +L++ +
Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242

Query: 224 --YVTGRSHNVDGGRHL 238
++T + VDGG L
Sbjct: 243 AGHITMHNLCVDGGATL 259


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0619  2FE2SRDCTASE  28     0.019    Ferric iron reductase signature.
		>2FE2SRDCTASE#Ferric iron reductase signature.

Length = 262

Score = 28.1 bits (62), Expect = 0.019
Identities = 18/73 (24%), Positives = 29/73 (39%), Gaps = 9/73 (12%)

Query: 73 FQDSVARLSDFEFGFMPLLPEEEEPLSQRVEALSLWTQSFLTGIAIIQPKLNKASAEVRE 132
+A SD + P++ E +PL +SLW Q + I ++ P L A +
Sbjct: 66 LSSLLAVYSDHIYRNQPMMIRENKPL------ISLWAQWY---IGLMVPPLMLALLTQEK 116

Query: 133 VIKDLAEIAQVEF 145
+ E EF
Sbjct: 117 ALDVSPEHFHAEF 129


66  Shewmr7_0657  Shewmr7_0676  N        Y        N        Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr7_0657   2      11        3.576369  hypothetical protein
Shewmr7_0658   0      13        3.495615  hypothetical protein
Shewmr7_0659  -2      14        3.143214  hypothetical protein
Shewmr7_0660  -2      12        2.415144  MltD domain-containing protein
Shewmr7_0661  -2      14        1.490530  hypothetical protein
Shewmr7_0662  -1      14        1.654987  serine/threonine protein kinase
Shewmr7_0663  -1      13        2.032860  Sel1 domain-containing protein
Shewmr7_0664   0      16        1.527472  hypothetical protein
Shewmr7_0665   1      15        2.698222  alcohol dehydrogenase
Shewmr7_0666   0      19        3.584909  hypothetical protein
Shewmr7_0667   1      19        3.592286  aldose 1-epimerase
Shewmr7_0668   0      17        2.323532  galactokinase
Shewmr7_0669  -1      17        1.822406  sodium/hydrogen exchanger
Shewmr7_0670  -1      14        1.906313  thiol:disulfide interchange protein precursor
Shewmr7_0672  -1      14        0.888273  thiol:disulfide interchange protein precursor
Shewmr7_0673  -2      13        0.580753  CutA1 divalent ion tolerance protein
Shewmr7_0674  -1      14        0.524647  FxsA cytoplasmic membrane protein
Shewmr7_0675   0      16        0.231822  hypothetical protein
Shewmr7_0676   1      23       -0.115481  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0657  DHBDHDRGNASE  117    1e-33    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 117 bits (294), Expect = 1e-33
Identities = 76/258 (29%), Positives = 120/258 (46%), Gaps = 6/258 (2%)

Query: 34 LKGKVGLITGSTSGIGLATAHVLAEQGCHLILHGLMPEAEGQCLAADFAEQYHINTFFSN 93
++GK+ ITG+ GIG A A LA QG H+ PE + +++ AE H F
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAF--P 63

Query: 94 ADLRDPESIHAFMDAGVNALGSIDILVNNAGIQHTENVAHFPIDKWNDIIAINLSSAFHT 153
AD+RD +I +G IDILVN AG+ + ++W ++N + F+
Sbjct: 64 ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNA 123

Query: 154 IQQVVPAMAEKRWGRIINIASVHGLVASVNKAAYCAAKHGIVGLTKVVAIECAEQGITVN 213
+ V M ++R G I+ + S V + AAY ++K V TK + +E AE I N
Sbjct: 124 SRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCN 183

Query: 214 AICPGWVDTPLINK-QIEAIASNKGLSYDEAKYQLVTAKQPLPEMLDPRQIGEFVLFLCS 272
+ PG +T + + + + + ++ PL ++ P I + VLFL S
Sbjct: 184 IVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGI---PLKKLAKPSDIADAVLFLVS 240

Query: 273 SAARGITGASLAMDGAWT 290
A IT +L +DG T
Sbjct: 241 GQAGHITMHNLCVDGGAT 258


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr7_0662  TCRTETB  30     0.022    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 29.8 bits (67), Expect = 0.022
Identities = 26/107 (24%), Positives = 40/107 (37%), Gaps = 4/107 (3%)

Query: 252 GIVGTIAGILYSRKQPLRLPIIRLSGLLIFLTVLGLSFGSAPWLQTLCAI-VLGFCIFLP 310
I G I GIL R+ PL + I ++ L + + W T+ + VLG F
Sbjct: 307 IIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTK 366

Query: 311 VTALVSIPHELPKMTSQKITVIFSLFWSISYLISTLVLWLFGKLVDI 357
+ L Q+ SL S+L + + G L+ I
Sbjct: 367 TVISTIVSSSL---KQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSI 410


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr7_0663  HTHFIS  39     9e-05    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 39.0 bits (91), Expect = 9e-05
Identities = 17/89 (19%), Positives = 34/89 (38%), Gaps = 6/89 (6%)

Query: 1052 ISVLVIDNDELMLKAISSLLLGWGCHVLTARDKACAELQLAQQVLPKLIIADYHLDDDQN 1111
++LV D+D + ++ L G V + A + L++ D + D+ N
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSN-AATLWRWIAAGDGDLVVTDVVMPDE-N 61

Query: 1112 GVDLVQSLLTHPVFSRQRPTCIICSADPS 1140
DL+ + R ++ SA +
Sbjct: 62 AFDLLPRIKKA----RPDLPVLVMSAQNT 86


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr7_0664  HTHFIS  66     1e-14    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 65.6 bits (160), Expect = 1e-14
Identities = 41/167 (24%), Positives = 68/167 (40%), Gaps = 8/167 (4%)

Query: 1 MSQIKVAIADDHPLFRTALTQAVLKNVNTADVLEAENFQELISIVENNPDIELIFLDLHM 60
M+ + +ADD RT L QA L DV N L + +L+ D+ M
Sbjct: 1 MTGATILVADDDAAIRTVLNQA-LSRAG-YDVRITSNAATLWRWIAAGD-GDLVVTDVVM 57

Query: 61 PGNEGFTGLTLLQNHFPDIAVIMVSSDDQPEIIRKAINFGASAFIPKSASLTQISTAIAT 120
P F L ++ PD+ V+++S+ + KA GA ++PK LT++ I
Sbjct: 58 PDENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117

Query: 121 VLEGEVWLPEHTDINVDQQ-----TAAEHQRLAKQLAQLTPQQYTVL 162
L P + + +A Q + + LA+L T++
Sbjct: 118 ALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLM 164


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_0668  OUTRMMBRANEA  28     0.048    Outer membrane protein A signature.
		>OUTRMMBRANEA#Outer membrane protein A signature.

Length = 346

Score = 27.6 bits (61), Expect = 0.048
Identities = 6/16 (37%), Positives = 6/16 (37%)

Query: 27 PPPEPPAPPPVVMKSF 42
P P P V K F
Sbjct: 200 VAPAPAPAPEVQTKHF 215


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr7_0669  HTHFIS  83     2e-20    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 83.0 bits (205), Expect = 2e-20
Identities = 28/134 (20%), Positives = 60/134 (44%)

Query: 23 ILLVEDEQDLAQMIMVNLTALNFRVFHAASLHQANALLQAKRIDLVLLDRMLPDGDGLLL 82
IL+ +D+ + ++ L+ + V ++ + A DLV+ D ++PD + L
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 83 CQQLRNDGQQMPVMLLTARDGEADTVLGLESGADDYMTKPFSVLELRARTKALLRRHLSA 142
+++ +PV++++A++ + E GA DY+ KPF + EL L
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRR 125

Query: 143 APTRQLIEFEGLRI 156
+ +G+ +
Sbjct: 126 PSKLEDDSQDGMPL 139


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits        Score  E-value  Comments
Shewmr7_0675  V8PROTEASE  74     6e-17    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 74.3 bits (182), Expect = 6e-17
Identities = 35/177 (19%), Positives = 66/177 (37%), Gaps = 33/177 (18%)

Query: 81 GSLQGLGSGVIMSKEGYILTNYHVIKKADEIVVALQ------------DGRKFTSEVVGF 128
+ + SGV++ K +LTN HV+ AL+ +G ++ +
Sbjct: 98 PTGTFIASGVVVGK-DTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKY 156

Query: 129 DPETDLSVLKIE--------GDNLPTVPVNLDSPPQVGDVVLAIGNPYNLGQTITQGIIS 180
E DL+++K G+ + ++ ++ QV + G P + T
Sbjct: 157 SGEGDLAIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQNITVTGYPGD-KPVATMW--E 213

Query: 181 ATGRNGLSSGYLDFLQTDAAINAGNSGGALIDTNGSLIGINTAAFQVGGEGGGHGIN 237
+ G+ G +Q D + GNSG + + +IGI+ G + N
Sbjct: 214 SKGKITYLKGE--AMQYDLSTTGGNSGSPVFNEKNEVIGIHWG-------GVPNEFN 261


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits        Score  E-value  Comments
Shewmr7_0676  V8PROTEASE  77     1e-17    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 77.0 bits (189), Expect = 1e-17
Identities = 42/192 (21%), Positives = 70/192 (36%), Gaps = 34/192 (17%)

Query: 89 RGLGSGVIIDADKGYIVTNNHVIDGADDIQVGLH------------DGREVKAKLIGTDS 136
+ SGV++ K ++TN HV+D L +G ++
Sbjct: 101 TFIASGVVVG--KDTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSG 158

Query: 137 ESDIALLQIEA--------KNLVAIKTSDSDELRVGDFAVAIGNPFGLGQTVTSGIVSAL 188
E D+A+++ + + S++ E +V G P V+ +
Sbjct: 159 EGDLAIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQNITVTGYPGDKP-------VATM 211

Query: 189 GRSGLGIEMLEN-FIQTDAAINSGNSGGALVNLKGELIGINTAIVAPGGGNVGIGFAIPA 247
S I L+ +Q D + GNSG + N K E+IGI+ G N G
Sbjct: 212 WESKGKITYLKGEAMQYDLSTTGGNSGSPVFNEKNEVIGIHWG----GVPNEFNGAVFIN 267

Query: 248 NMVKNLVAQIAE 259
V+N + Q E
Sbjct: 268 ENVRNFLKQNIE 279


67  Shewmr7_1080  Shewmr7_1091  N        Y        N        Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr7_1080  -1      12        0.733299  putative tricarboxylic transport membrane
Shewmr7_1081   0      14        0.326732  hypothetical protein
Shewmr7_1082   1      14        0.006808  hypothetical protein
Shewmr7_1083   1      15        0.156950  signal transduction histidine kinase regulating
Shewmr7_1084   1      16       -0.097775  hypothetical protein
Shewmr7_1085   2      19       -0.755119  response regulator receiver/unknown
Shewmr7_1086   4      29       -0.499835  peptidase M16 domain-containing protein
Shewmr7_1087   5      33       -0.912412  hypothetical protein
Shewmr7_1088   5      32       -0.331100  peptidase M16 domain-containing protein
Shewmr7_1089   6      33       -0.161872  hypothetical protein
Shewmr7_1090   4      26       -0.542916  hypothetical protein
Shewmr7_1091   4      30       -0.748196  ErfK/YbiS/YcfS/YnhG family protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_1080  SECFTRNLCASE  78     1e-17    Bacterial translocase SecF protein signature.
		>SECFTRNLCASE#Bacterial translocase SecF protein signature.

Length = 333

Score = 78.0 bits (192), Expect = 1e-17
Identities = 31/172 (18%), Positives = 82/172 (47%), Gaps = 4/172 (2%)

Query: 442 VTIVEERTIGPTLGAENIQNGFAALGLGMGITLLFMALWYR-RLGWVANVALIANMVILF 500
+ I ++GP + E + +L + + ++ + + + A VAL+ ++++
Sbjct: 135 LKITSFESVGPKVSGELVWTAVWSLLAATVVIMFYIWVRFEWQFALGAVVALVHDVLLTV 194

Query: 501 GLLALIPGAVLTLPGIAGLVLTVGMAVDTNVLIFERIKDKLKEGRSFALA--IDTGFDSA 558
GL A++ L +A L+ G +++ V++F+R+++ L + ++ L ++ +
Sbjct: 195 GLFAVL-QLKFDLTTVAALLTITGYSINDTVVVFDRLRENLIKYKTMPLRDVMNLSVNET 253

Query: 559 FSTIFDANFTTMITAVVLYSIGNGPIQGFALTLGLGLLTSMFTGIFASRALI 610
S TT++ V + G I+GF + G+ T ++ ++ ++ ++
Sbjct: 254 LSRTVMTGMTTLLALVPMLIWGGDVIRGFVFAMVWGVFTGTYSSVYVAKNIV 305


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_1081  SECFTRNLCASE  238    2e-79    Bacterial translocase SecF protein signature.
		>SECFTRNLCASE#Bacterial translocase SecF protein signature.

Length = 333

Score = 238 bits (608), Expect = 2e-79
Identities = 91/299 (30%), Positives = 153/299 (51%), Gaps = 20/299 (6%)

Query: 2 KNLNLTKWRYVSSAISIFLMLASLTIIGMKGFNWGLDFTGGVVTEVQLDRRITSSELQPL 61
N + +W++ + +I +M+AS+ + + G N+G+DF GG + I +
Sbjct: 12 TNFDFFRWQWATFGAAIVMMIASVILPLVIGLNFGIDFKGGTTIRTESTTAIDVGVYRAA 71

Query: 62 LNAAYQQEVTVISASEP--------------------GRWVLRYADTAQSNVDIAQTLAP 101
L +V + +P G N A
Sbjct: 72 LEPLELGDVIISEVRDPSFREDQHVAMIRIQMQEDGQGAEGQGAQGQELVNKVETALTAV 131

Query: 102 LGEVQVLNTSIVGPQVGKELAEQGGLALLVAMLAILGYLSYRFEWRLASGALFALVHDVI 161
+++ + VGP+V EL +LL A + I+ Y+ RFEW+ A GA+ ALVHDV+
Sbjct: 132 DPALKITSFESVGPKVSGELVWTAVWSLLAATVVIMFYIWVRFEWQFALGAVVALVHDVL 191

Query: 162 FVLAFFALTQMEFNLTVLAAVLAILGYSLNDSIIIADRIRELLIAKPKLAIQEINNQAIV 221
+ FA+ Q++F+LT +AA+L I GYS+ND++++ DR+RE LI + ++++ N ++
Sbjct: 192 LTVGLFAVLQLKFDLTTVAALLTITGYSINDTVVVFDRLRENLIKYKTMPLRDVMNLSVN 251

Query: 222 ATFSRTMVTSGTTLMTVGALWIMGGGPLEGFSIAMFIGILTGTFSSISVGTSLPEFLGL 280
T SRT++T TTL+ + + I GG + GF AM G+ TGT+SS+ V ++ F+GL
Sbjct: 252 ETLSRTVMTGMTTLLALVPMLIWGGDVIRGFVFAMVWGVFTGTYSSVYVAKNIVLFIGL 310


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr7_1084  HTHFIS  34     0.002    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 34.0 bits (78), Expect = 0.002
Identities = 23/82 (28%), Positives = 32/82 (39%), Gaps = 18/82 (21%)

Query: 198 VLMVGPPGTGKTLLAKAIAGESK---VPFFT-----ISGSDFVEMFVGV------GASRV 243
+++ G GTGK L+A+A+ K PF I G GA
Sbjct: 163 LMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTR 222

Query: 244 RD-MFEQAKKSAPCIIFIDEID 264
FEQA+ +F+DEI
Sbjct: 223 STGRFEQAEGGT---LFLDEIG 241


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits      Score  E-value  Comments
Shewmr7_1087  adhesinb  31     0.003    Adhesin B signature.
		>adhesinb#Adhesin B signature.

Length = 310

Score = 31.4 bits (71), Expect = 0.003
Identities = 18/95 (18%), Positives = 34/95 (35%), Gaps = 16/95 (16%)

Query: 142 REARRTFEVIAEELDIVIQKNGTMAFDNAIIAY----EPLWAVGTGKSATPEQAQEVHAF 197
+EA+ F I E +++ G F AY +W + T + TP+Q + +
Sbjct: 186 KEAKEKFNNIPGEKKMIVTSEG--CFKYFSKAYNVPSAYIWEINTEEEGTPDQIKTLVEK 243

Query: 198 IRKRLSEVSPFIGENIRILYGGSVTPSNAADLFAQ 232
+RK + L+ S ++
Sbjct: 244 LRKT----------KVPSLFVESSVDDRPMKTVSK 268


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits        Score  E-value  Comments
Shewmr7_1088  SECGEXPORT  121    3e-39    Protein-export SecG membrane protein signature.
		>SECGEXPORT#Protein-export SecG membrane protein signature.

Length = 110

Score = 121 bits (304), Expect = 3e-39
Identities = 63/110 (57%), Positives = 83/110 (75%)

Query: 1 MYEVLVVVYLLVALGLIGLILIQQGKGADMGASFGAGASGTLFGSSGSGNFLTRTTAILA 60
MYE L+VV+L+VA+GL+GLI++QQGKGADMGASFGAGAS TLFGSSGSGNF+TR TA+LA
Sbjct: 1 MYEALLVVFLIVAIGLVGLIMLQQGKGADMGASFGAGASATLFGSSGSGNFMTRMTALLA 60

Query: 61 IAFFTLSLLIGNLSANHAKNEDAWKNLGSDEQVTQPVDQATEKSETKIPD 110
FF +SL++GN+++N W+NL + + Q A K + IP+
Sbjct: 61 TLFFIISLVLGNINSNKTNKGSEWENLSAPAKTEQTQPAAPAKPTSDIPN 110


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
Shewmr7_1091  TCRTETOQM  69     4e-14    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 69.1 bits (169), Expect = 4e-14
Identities = 38/133 (28%), Positives = 57/133 (42%), Gaps = 18/133 (13%)

Query: 392 IMGHVDHGKTSLLDYIRRAKVAAGEAG------------------GITQHIGAYHVETDN 433
++ HVD GKT+L + + A E G GIT G + +N
Sbjct: 8 VLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITSFQWEN 67

Query: 434 GMITFLDTPGHAAFTAMRARGAKATDIVVLVVAADDGVMPQTIEAIQHAKAGNVPLIVAV 493
+ +DTPGH F A R D +L+++A DGV QT + +P I +
Sbjct: 68 TKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGIPTIFFI 127

Query: 494 NKMDKPEADIDRV 506
NK+D+ D+ V
Sbjct: 128 NKIDQNGIDLSTV 140


68  Shewmr7_1144  Shewmr7_1150  N        Y        N        Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr7_1144   0      13        2.100980  LysR family transcriptional regulator
Shewmr7_1145  -1      13        1.921028  LrgA family protein
Shewmr7_1146  -1      12        1.427737  LrgB family protein
Shewmr7_1147  -2      13        0.298615  GCN5-related N-acetyltransferase
Shewmr7_1148   0      10        0.372862  carbon starvation protein CstA
Shewmr7_1149   0      10        0.334383  glucan biosynthesis protein D
Shewmr7_1150   0      12       -0.466892  phosphate transporter
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr7_1144  HTHTETR  54     3e-11    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 53.9 bits (129), Expect = 3e-11
Identities = 35/185 (18%), Positives = 65/185 (35%), Gaps = 9/185 (4%)

Query: 23 LAKALEVFWRKGFEGTSLTDLTQAMGINKPSLYAAFGNKEQLFLKAIELYEQLPCAFFYP 82
L AL +F ++G TSL ++ +A G+ + ++Y F +K LF + EL E
Sbjct: 17 LDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELE 76

Query: 83 SLEK--ETAYQVAESMLYGAATNLVDKNHPQGCLIVQGALACSEAGQAIKETLITRRRDG 140
K V +L + V + + + + C G+ R
Sbjct: 77 YQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHK-CEFVGEMAVVQQAQRNLCL 135

Query: 141 EL--ALCERFQRAKDEGDLPADADPLLLAR----YLGTVLQGMAVQATNGICPNELRKVA 194
E + + + + LPAD A Y+ +++ + E R
Sbjct: 136 ESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDLKKEARDYV 195

Query: 195 ELVLA 199
++L
Sbjct: 196 AILLE 200


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
Shewmr7_1145  RTXTOXIND  45     3e-07    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 45.2 bits (107), Expect = 3e-07
Identities = 23/123 (18%), Positives = 48/123 (39%), Gaps = 5/123 (4%)

Query: 64 SVTLVPRVSGYIASVNFKEGALVKKGDVLFHIDASVFEAEVARLKADLASALSAE---QL 120
S + P + + + KEG V+KGDVL + A EA+ + ++ L A + Q+
Sbjct: 96 SKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQI 155

Query: 121 ATNDLERARKLFAQKAVSAELLDTRESNKRQTTAAVASVKAALLR--AELDLDYTQVRAP 178
+ +E + + + E + T+ + + + +L+ + RA
Sbjct: 156 LSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAE 215

Query: 179 IDG 181

Sbjct: 216 RLT 218



Score = 39.4 bits (92), Expect = 2e-05
Identities = 21/102 (20%), Positives = 41/102 (40%), Gaps = 10/102 (9%)

Query: 101 EAEVARLKADLASALSAEQLATNDLERARKLFAQKAVSAELLDTRESNKRQTTAAVASVK 160
E+ K+ L S A + + +LF + + +L RQTT + +
Sbjct: 265 VNELRVYKSQLEQIESEILSAKEEYQLVTQLF-KNEILDKL--------RQTTDNIGLLT 315

Query: 161 AALLRAELDLDYTQVRAPIDGRASYANV-TAGNYVSAGQSVL 201
L + E + +RAP+ + V T G V+ ++++
Sbjct: 316 LELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLM 357


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_1146  ACRIFLAVINRP  1036   0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 1036 bits (2681), Expect = 0.0
Identities = 419/1043 (40%), Positives = 640/1043 (61%), Gaps = 18/1043 (1%)

Query: 2 LSQFFIKRPIFAAVLSLLFFITGAIAVWQLPITEYPEVVPPTVVVTANYPGANPKVIAET 61
++ FFI+RPIFA VL+++ + GA+A+ QLP+ +YP + PP V V+ANYPGA+ + + +T
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 62 VASPLEQEINGVEDMLYMSSQATSDGRMTLTITFAIGTDVDRAQTQVQSRVDRAMPRLPQ 121
V +EQ +NG+++++YMSS + S G +T+T+TF GTD D AQ QVQ+++ A P LPQ
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 122 EVQRLGIVTEKSSPDLTMVVHLLSPDNRYDMLYLSNYAALNVKDELARIKGVGAVRLFGA 181
EVQ+ GI EKSS MV +S + +S+Y A NVKD L+R+ GVG V+LFG
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFG- 179

Query: 182 GEYSLRIWLDPNKVSALGMSPAEIIAAVREQNQQAAAGSLGAQPSGNA-DFQLLINVKGR 240
+Y++RIWLD + ++ ++P ++I ++ QN Q AAG LG P+ I + R
Sbjct: 180 AQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239

Query: 241 LTELSEFEDIIIKVGQNGEVIRLKDVARVELGATSYALRSLLDNKDAVAIPVFQASGSNA 300
EF + ++V +G V+RLKDVARVELG +Y + + ++ K A + + A+G+NA
Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299

Query: 301 IQISDDVRAEMARLAKSFPEGLQYEIVYDPTVFVRGSIHAVVKTLLEAVLLVVLVVVLFL 360
+ + ++A++A L FP+G++ YD T FV+ SIH VVKTL EA++LV LV+ LFL
Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359

Query: 361 QTWRASIIPLVAVPVSLVGTFAFMHLMGFSLNALSLFGLVLAIGIVVDDAIVVVENVERN 420
Q RA++IP +AVPV L+GTFA + G+S+N L++FG+VLAIG++VDDAIVVVENVER
Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419

Query: 421 IAS-GLSPIAATQKAMKEVTGPIVATTLVLAAVFIPTAFMSGLTGQFYKQFALTITISTF 479
+ L P AT+K+M ++ G +V +VL+AVFIP AF G TG Y+QF++TI +
Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479

Query: 480 ISAINSLTLSPALSALLLKGHDAPKDALTRLMDKLFGGWLFTPFNRLFNRASEGYGYLVR 539
+S + +L L+PAL A LLK A G F FN F+ + Y V
Sbjct: 480 LSVLVALILTPALCATLLKPVSAE--------HHENKGGFFGWFNTTFDHSVNHYTNSVG 531

Query: 540 KVIRFGGIIGLVYLGMVALTGVQFVNTPTGYVPGQDKQYLVAFAQLPDAASLERTDAVIK 599
K++ G L+Y +VA V F+ P+ ++P +D+ + QLP A+ ERT V+
Sbjct: 532 KILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLD 591

Query: 600 KMSDIALNH--PGVAHSIAFPGLSINGFTNSPNSGVVFVALDDFELRKRPELSANAIAGQ 657
+++D L + V G S +G + N+G+ FV+L +E R E SA A+ +
Sbjct: 592 QVTDYYLKNEKANVESVFTVNGFSFSG--QAQNAGMAFVSLKPWEERNGDENSAEAVIHR 649

Query: 658 LNQQFAGIQDAFIAIFPPPPVQGLGTIGGFRLQIQDRANLGYEALYQVTQQVMYKAWADP 717
+ I+D F+ F P + LGT GF ++ D+A LG++AL Q Q++ A P
Sbjct: 650 AKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHP 709

Query: 718 -QLAGIFSSYQVNVPQLELDIDRTKAKQQAVSLDQIFQTLQTYMGSTYVNDFNRFGRTYQ 776
L + + + Q +L++D+ KA+ VSL I QT+ T +G TYVNDF GR +
Sbjct: 710 ASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKK 769

Query: 777 VNMQADEAFRQSPQQISQLKVPNVNGDMIPLGSFINVSQSAGPDRVMHYNGFTTAEINGG 836
+ +QAD FR P+ + +L V + NG+M+P +F G R+ YNG + EI G
Sbjct: 770 LYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGE 829

Query: 837 PAPGVSSGQAQAAIEKILAETLPIGMTYEWTELTYQQILAGNTGLLVFPLVILLVFMVLA 896
APG SSG A A +E + ++ LP G+ Y+WT ++YQ+ L+GN + + ++VF+ LA
Sbjct: 830 AAPGTSSGDAMALMENLASK-LPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLA 888

Query: 897 AQYESLSLPLAIILIIPMTLLSALSGVLIYGGDNNIFTQIGLIVLVGLATKNAILIVEFA 956
A YES S+P++++L++P+ ++ L ++ N+++ +GL+ +GL+ KNAILIVEFA
Sbjct: 889 ALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFA 948

Query: 957 KEKQDH-GMEVMESILEAARLRLRPILMTSIAFIMGVVPMVFSTGAGAEMRQAMGVAVFA 1015
K+ + G V+E+ L A R+RLRPILMTS+AFI+GV+P+ S GAG+ + A+G+ V
Sbjct: 949 KDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMG 1008

Query: 1016 GMIGVTLFGLILTPLFYYALAKR 1038
GM+ TL + P+F+ + +
Sbjct: 1009 GMVSATLLAIFFVPVFFVVIRRC 1031


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_1149  ACRIFLAVINRP  777    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 777 bits (2008), Expect = 0.0
Identities = 308/1032 (29%), Positives = 516/1032 (50%), Gaps = 28/1032 (2%)

Query: 3 LSDVSVKRPVVAIVLSLLLCVFGLVSFTKLSVREMPDVESPVVTVSTSYSGASAAIMESQ 62
+++ ++RP+ A VL+++L + G ++ +L V + P + P V+VS +Y GA A ++
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 63 ITKTLEDELTGISGIDEITSTT-RNGSSRITVKFLLGWNLTEGVSDVRDAVARAQRRLPE 121
+T+ +E + GI + ++ST+ GS IT+ F G + V++ + A LP+
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 122 DANDPVVSKDNGSGEPSVYVNLSSSVMDRTQ--LTDYAQRVLEDRFSLISGVSSISISGG 179
+ +S + S + S TQ ++DY ++D S ++GV + + G
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 180 LYKVMYVKLRPEQMAGRNVTVTDITNALRKENVETPGGQVRNDTTV------MSVRTKRL 233
Y + + L + + +T D+ N L+ +N + GQ+ + S+ +
Sbjct: 181 QYAMR-IWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239

Query: 234 YYTPKDFDYLVVRTASDGTPIYLKDVADVAVGAQNENSTFKSDGIVNLSLGVITQSDANP 293
+ P++F + +R SDG+ + LKDVA V +G +N N + +G LG+ + AN
Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299

Query: 294 LIVAQEVHKEVDRIQDFLPEGTSLVVDFDSTVFIDRSINEVYNTLYVTGALVVLVLYIFI 353
L A+ + ++ +Q F P+G ++ +D+T F+ SI+EV TL+ LV LV+Y+F+
Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359

Query: 354 GQARATLIPAVTVPVSLISAFIAANMFGYSINLLTLMALILAIGLVVDDAIVVVENIFHH 413
RATLIP + VPV L+ F FGYSIN LT+ ++LAIGL+VDDAIVVVEN+
Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419

Query: 414 I-ERGEEPLLAAYKGTREVGFAVVATTAVLVMVFLPISFMEGMVGLLFTEFSVMLAVSVL 472
+ E P A K ++ A+V VL VF+P++F G G ++ +FS+ + ++
Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479

Query: 473 FSSLIALTLTPVLSSKLLKANVK-----PNRFNRFVDSGFARMEKVYRVGVTHAIRFRLL 527
S L+AL LTP L + LLK F + ++ F Y V +
Sbjct: 480 LSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539

Query: 528 APLVILACVGGSVWLMQQVPSQLAPQEDRGVLFAFVKGAEGTSYNRMTANMDIVEDRLMP 587
L+ V G V L ++PS P+ED+GV ++ G + R +D V D +
Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599

Query: 588 LLGQGVLRSFSVQAPAFGGRAGDQTGFVIMQLEDWEHRHVTAQQALGIIS---NALKDIP 644
V F+V +F G+ G + L+ WE R+ A +I L I
Sbjct: 600 NEKANVESVFTVNGFSFSGQ-AQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIR 658

Query: 645 DVMVRPM-MPGFRGQ-SSEPVQFVL---GGSDYAELFKWAQVLKEEANASP-MMEGADLD 698
D V P MP ++ F L G + L + L A P + +
Sbjct: 659 DGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPN 718

Query: 699 YAETTPELIVTVDKERAAELGISVDEVSQTLEVMLGGRKETTYVDRGEEYDVYLRGDENS 758
E T + + VD+E+A LG+S+ +++QT+ LGG ++DRG +Y++ D
Sbjct: 719 GLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKF 778

Query: 759 FNNVGDLSQIYMRSAKGELVTLDTVTHIEEVASAQKLSHTNKQKSITLKANISKGYTLGE 818
D+ ++Y+RSA GE+V T V + +L N S+ ++ + G + G+
Sbjct: 779 RMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGD 838

Query: 819 ALKFLDNKAIELLPKDISIGYTGESKDFKENQSSILIVFGLALLVAYLVLAAQFESFINP 878
A+ ++N A + LP I +TG S + + + + ++ +V +L LAA +ES+ P
Sbjct: 839 AMALMENLASK-LPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIP 897

Query: 879 LVVMFTVPMGVFGGFLGLLVTSQGINIYSQIGMIMLIGMVTKNGILIVEFANQLRDR-GL 937
+ VM VP+G+ G L + +Q ++Y +G++ IG+ KN ILIVEFA L ++ G
Sbjct: 898 VSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGK 957

Query: 938 ALDKAIIDASTRRLRPILMTAFTTLVGAVPLIFSSGAGSESRIAVGTVVFFGMAFATFVT 997
+ +A + A RLRPILMT+ ++G +PL S+GAGS ++ AVG V GM AT +
Sbjct: 958 GVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLA 1017

Query: 998 LFVIPAMYRLIS 1009
+F +P + +I
Sbjct: 1018 IFFVPVFFVVIR 1029


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
Shewmr7_1150  RTXTOXIND  51     3e-09    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 51.4 bits (123), Expect = 3e-09
Identities = 21/108 (19%), Positives = 46/108 (42%), Gaps = 3/108 (2%)

Query: 53 PLTQSISLIGKLA-ADRAVVIAPQVTGKIKQIAVTSNQAVKKGQLLIELDDMKAQAAVAE 111
+ + GKL + R+ I P +K+I V ++V+KG +L++L + A+A +
Sbjct: 79 QVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLK 138

Query: 112 ANAFLNDETRKLREFEKLISRNAITQTEIDAQKASVDIARARLASAQA 159
+ L +L + I +I ++ K + ++ +
Sbjct: 139 TQSSL--LQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEV 184


69  Shewmr7_1222  Shewmr7_1231  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr7_1222  -2      12       0.760858   hypothetical protein
Shewmr7_1223  -2      14       0.401453   hypothetical protein
Shewmr7_1224  -1      12       -0.236994  bacterioferritin
Shewmr7_1225  -3      9        -0.233750  bacterioferritin
Shewmr7_1226  -3      12       -0.390243  methyl-accepting chemotaxis sensory transducer
Shewmr7_1227  -3      12       -0.438623  hypothetical protein
Shewmr7_1228  -2      12       -0.476977  DNA polymerase IV
Shewmr7_1229  -2      13       -0.548379  M20C family Xaa-His dipeptidase
Shewmr7_1230  0       14       -0.265559  peptidase M17, leucyl aminopeptidase
Shewmr7_1231  0       17       0.437319   TonB-dependent siderophore receptor
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr7_1222  HTHFIS  74     9e-18    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 74.5 bits (183), Expect = 9e-18
Identities = 20/116 (17%), Positives = 52/116 (44%), Gaps = 2/116 (1%)

Query: 18 IRVGLVEDQQLVRQGIASLIAISQHIEVSWQAENGQEALKRLQTDAVDVLLSDIRMPVLN 77
+ + +D +R + ++ + + N + + D++++D+ MP N
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRI--TSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 78 GISLLKQLRAAQNPIPVIMLTTFDDSELFLNSLQAGANGFLLKDVSLDKLLEAIET 133
LL +++ A+ +PV++++ + + + + GA +L K L +L+ I
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr7_1223  PF06580  43     2e-06    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 42.5 bits (100), Expect = 2e-06
Identities = 20/97 (20%), Positives = 45/97 (46%), Gaps = 14/97 (14%)

Query: 293 LVLQEGISNAVRHG-----HANQLTLSMQEEQAELIICLKDNGQGISQSA--SQGVGLSS 345
+++Q + N ++HG ++ L ++ + + +++ G ++ S G GL +
Sbjct: 258 MLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGTGLQN 317

Query: 346 MQERLSPFHGSARLQANHAGVDSSCTQGC-SLMIRLP 381
++ERL +G A + S QG + M+ +P
Sbjct: 318 VRERLQMLYG------TEAQIKLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_1224  ISCHRISMTASE  37     2e-05    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 37.3 bits (86), Expect = 2e-05
Identities = 34/125 (27%), Positives = 52/125 (41%), Gaps = 18/125 (14%)

Query: 8 KTALLIIDMQQ---GLFYADAPPFNREQVLNNINLLIAKAREAGAPIWAVRHTG---PE- 60
+ LLI DMQ F A A P ++ NI L + + G P+ G P+
Sbjct: 30 RAVLLIHDMQNYFVDAFTAGASPV--TELSANIRKLKNQCVQLGIPVVYTAQPGSQNPDD 87

Query: 61 --------GSPIAAGTANWQLIESLAINPQLDNIFDKTKPSCFYQTGLAEALAHEGVSEL 112
G + +G ++I LA D + K + S F +T L E + EG +L
Sbjct: 88 RALLTDFWGPGLNSGPYEEKIITELAPEDD-DLVLTKWRYSAFKRTNLLEMMRKEGRDQL 146

Query: 113 VIVGM 117
+I G+
Sbjct: 147 IITGI 151


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr7_1228  HTHTETR  53     5e-11    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 53.1 bits (127), Expect = 5e-11
Identities = 18/68 (26%), Positives = 33/68 (48%)

Query: 2 RNAEFDRAQVLRGAMAAFMHKGYTKTSMQDLTQATGLHPGSIYCAFSNKRGLLIAAIEQY 61
+ A+ R +L A+ F +G + TS+ ++ +A G+ G+IY F +K L E
Sbjct: 7 QEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELS 66

Query: 62 QQDRNEQF 69
+ + E
Sbjct: 67 ESNIGELE 74


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr7_1230  HTHFIS  32     0.012    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 31.7 bits (72), Expect = 0.012
Identities = 13/39 (33%), Positives = 17/39 (43%), Gaps = 6/39 (15%)

Query: 339 LFGYVENATFRGTVFTDFSLIRPGSLHKANGGVLLMDAI 377
LFG+ + A FT G +A GG L +D I
Sbjct: 208 LFGHEKGA------FTGAQTRSTGRFEQAEGGTLFLDEI 240


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_1231  VACCYTOTOXIN  32     0.012    Helicobacter pylori vacuolating cytotoxin signature.
		>VACCYTOTOXIN#Helicobacter pylori vacuolating cytotoxin signature.

Length = 1291

Score = 31.9 bits (72), Expect = 0.012
Identities = 17/70 (24%), Positives = 29/70 (41%)

Query: 576 SMGVSIYPNDGSNADTLLRNADAAMYRAKSEGRNNFAFYTESLTKQSIEHLKLQSALYGA 635
S G S + N ++ ++ N + +Y ++ F F + L +SAL
Sbjct: 1067 SYGYSSFSNQANSLNSGANNTNFGVYSRIFANQHEFDFEAQGALGSDQSSLNFKSALLRD 1126

Query: 636 LEQNALYLMY 645
L Q+ YL Y
Sbjct: 1127 LNQSYNYLAY 1136


70  Shewmr7_1309  Shewmr7_1317  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr7_1309  1       21       0.341071   rare lipoprotein A
Shewmr7_1310  0       8        0.178636   hypothetical protein
Shewmr7_1311  -1      10       1.095386   lytic murein transglycosylase B
Shewmr7_1312  -2      15       1.536241   hypothetical protein
Shewmr7_1313  -3      15       1.489536   rod shape-determining protein RodA
Shewmr7_1314  -2      17       1.666507   peptidoglycan glycosyltransferase
Shewmr7_1315  -2      16       1.689386   rRNA large subunit methyltransferase
Shewmr7_1316  -2      16       0.199447   iojap-like protein
Shewmr7_1317  -2      16       -0.576635  nicotinate-nucleotide adenylyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits        Score  E-value  Comments
Shewmr7_1309  OMS28PORIN  31     0.019    OMS28 porin signature.
		>OMS28PORIN#OMS28 porin signature.

Length = 257

Score = 31.3 bits (70), Expect = 0.019
Identities = 32/138 (23%), Positives = 61/138 (44%), Gaps = 11/138 (7%)

Query: 138 VLDDFAKADVLFKRTEPAPFKSVNVLAEGRRAL------EVANVEMGLALAEDEIDYLVE 191
++ D AK V+ + K ++AEG + V + +++A E +L+E
Sbjct: 102 LMSDVAKGTVVASQEATIVAKCSGMVAEGANKVVEMSKKAVQETQKAVSVA-GEATFLIE 160

Query: 192 NFVRLNRNPNDIELMMFAQ--ANSEHCRHKIFNADWTIDGEAQ-PKSLFKMIKNTFETTP 248
+ LN++PN+ EL + + A E + + ++ +D Q + + M+ +
Sbjct: 161 KQIMLNKSPNNKELELTKEEFAKVEQVKETLMASERALDETVQEAQKVLNMVNGLNPSNK 220

Query: 249 DHVLSAYKDNAAVMEGSV 266
D VL A KD A + V
Sbjct: 221 DQVL-AKKDVAKAISNVV 237


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits         Score  E-value  Comments
Shewmr7_1313  IGASERPTASE  34     0.002    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 34.3 bits (78), Expect = 0.002
Identities = 35/264 (13%), Positives = 85/264 (32%), Gaps = 35/264 (13%)

Query: 393 QASVQSIEQQASKAQRIAKQNGEEAQALMQQTDQIATAIEEMSTSIRDVANHAQDGANQS 452
+V+ EQ A++ ++ +EA++ ++ Q AQ G+
Sbjct: 1048 SKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEV--------------AQSGSETK 1093

Query: 453 QQVDLAAKEGQQQQTQVVQDLLKLSQQLSSSHQAVEKVSQE-SEAISKVTEVINSIAEQT 511
+ KE + + + Q + QE SE + E +
Sbjct: 1094 ETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAE-----PARE 1148

Query: 512 NLLALNAAIEAARAGEQGRGFAVVADEVRTLAQRTQSSI---LEISQTIDKLQSQVKTTT 568
N +N ++ + A+ T S++ + S T++ S V+
Sbjct: 1149 NDPTVNIKEPQSQTNTTA--------DTEQPAKETSSNVEQPVTESTTVNTGNSVVENPE 1200

Query: 569 SQMAQSHQLGIASANQGEETGKQLEEITRRIGELAISSRNIASATEQQSSVAQEITHNLH 628
+ + Q + S + + + + + +++ +S+VA + +
Sbjct: 1201 NTTPATTQPTVNSESSNKPKNRHRRSVRSV----PHNVEPATTSSNDRSTVALCDLTSTN 1256

Query: 629 QISELANEGEHRAAETVNSANDLS 652
+ L++ +N +S
Sbjct: 1257 TNAVLSDARAKAQFVALNVGKAVS 1280


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_1315  ACRIFLAVINRP  378    e-116    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 378 bits (971), Expect = e-116
Identities = 204/1046 (19%), Positives = 423/1046 (40%), Gaps = 52/1046 (4%)

Query: 1 MIKAFVENGRLVSLVIALLLVAGFGAISSLPRTEDPHITNRFASVITPYPGASAERVEAL 60
M F+ ++ +L++AG AI LP + P I SV YPGA A+ V+
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 61 VTEVLENQLRRLEEIKLIQSTS-RPGISVIQLELKDTVKDTDPVWSR--ARDLLADTRNT 117
VT+V+E + ++ + + STS G I L + TDP ++ ++ L
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQS---GTDPDIAQVQVQNKLQLATPL 117

Query: 118 LPDGIQTPTL-DDQVGYAYTAILSLVWNNSSQPRVDMLNRYAKELQSRLRLLSGTDFVKL 176
LP +Q + ++ +Y + V +N + D+ + A ++ L L+G V+L
Sbjct: 118 LPQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQL 177

Query: 177 YGAPEEEILVQLDGYKMSQLQLTPGTIAKILSSADSKIAAGEINN------NNFRAFVEV 230
+GA + + + LD +++ +LTP + L + +IAAG++ A +
Sbjct: 178 FGA-QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIA 236

Query: 231 SGELDSQSRIRQVPLKVDTQGQIIRLGDIAHISRQPKTPADSIALVDGEQGVFVAARMLN 290
+ +V L+V++ G ++RL D+A + + + IA ++G+ + ++
Sbjct: 237 QTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENY-NVIARINGKPAAGLGIKLAT 295

Query: 291 NTRVDIWQGQVKQLVDEFNQELPANIKVQWLFEQNSYTSDRLGGLIINLLQGFVIILAVL 350
+K + E P +KV + ++ + + ++ L + +++ V+
Sbjct: 296 GANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVM 355

Query: 351 LLTLG-LRNAIIVALSLPLTALFTLACMKYIGLPIHQMSVTGLVVALGIMVDNAIVIVDA 409
L L +R +I +++P+ L T A + G I+ +++ G+V+A+G++VD+AIV+V+
Sbjct: 356 YLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVEN 415

Query: 410 IAQRRQ-QGMNRLRAVSETLHHLWLPLAGSTITTILAFAPIVLMPGAAGEFVGGIAMSVM 468
+ + + A +++ + L G + F P+ G+ G +++++
Sbjct: 416 VERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIV 475

Query: 469 FALLGSYVISHTLIAGLAGRF-----SIEGKNPAWYQHGINV---PLVSGYFQASLRFAL 520
A+ S +++ L L + +N + N V+ Y S+ L
Sbjct: 476 SAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHY-TNSVGKIL 534

Query: 521 NRPLLSATLIGVIPLLGFYASGKMTEQFFPPSDRDMFQIELYLAPHVSLENTLNQVQLMD 580
+ +I ++ F P D+ +F + L + E T + +
Sbjct: 535 GSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVT 594

Query: 581 KQL--HQIEGITQVDWVVGGNTPSFYYNLTQRQQGATNYAQAMVK-----ASDFERANTL 633
++ + V V G + Q A +K D A +
Sbjct: 595 DYYLKNEKANVESVFTVNGFSFSG--------QAQNAGMAFVSLKPWEERNGDENSAEAV 646

Query: 634 IPELQQTLDK---AFPEAQVLVRKLEQGPPFNAPVELM-IFGPNLETLRSLGDEVRNILA 689
I + L K F + +E G EL+ G + L +++ + A
Sbjct: 647 IHRAKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAA 706

Query: 690 ATP-DVLHTRATLSAGAPKVWLQVNEDASLISGLTLTDIARQVQMATTGVIGGSVLEQSE 748
P ++ R + L+V+++ + G++L+DI + + A G +++
Sbjct: 707 QHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGR 766

Query: 749 SLPIRVRLGDTSREQASRLSEIQLVTPSGTAVPLSALSHNEVQVSRGAIPRRNGQRVNTI 808
+ V+ R + ++ + + +G VP SA + + + R NG I
Sbjct: 767 VKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEI 826

Query: 809 EAYIVSGVLPAQVLNDVKDKVAGISLPAGYRIEIGGESAKRNEAVGNLLSNLILVVTLLL 868
+ G + +++ + LPAG + G S + + + + + ++
Sbjct: 827 QGEAAPGTSSGDAMALMENLAS--KLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVF 884

Query: 869 ATVVLSFNSFRLTAIILLSALQSAGLGLLAVYVFGYPFGFPVIIALLGLMGLAINAAIVI 928
+ + S+ + ++L LLA +F ++ LL +GL+ AI+I
Sbjct: 885 LCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILI 944

Query: 929 LAELEDTDNARA-GDKEVIITTVSGCGRHITSTTITTVGGFIPLII---AGGGFWPPFAI 984
+ +D G E + V R I T++ + G +PL I AG G I
Sbjct: 945 VEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGI 1004

Query: 985 AIAGGTLLTTLLSLVWVPTMYLLLMK 1010
+ GG + TLL++ +VP ++++ +
Sbjct: 1005 GVMGGMVSATLLAIFFVPVFFVVIRR 1030


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
Shewmr7_1316  RTXTOXIND  42     2e-06    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 42.1 bits (99), Expect = 2e-06
Identities = 25/106 (23%), Positives = 45/106 (42%), Gaps = 5/106 (4%)

Query: 75 SGKLSELTVDSGAKVTQGQVLAKLDTRLLDAEHQEIQASLAQTQADVDLATSTLNRNLEL 134
+ + E+ V G V +G VL KL +A+ + Q+SL Q + + L+R++EL
Sbjct: 104 NSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQ-TRYQILSRSIEL 162

Query: 135 KKSGYVSEQLLDENRTQLASLEAGKKRLLASLQANQLKRDKSQLLA 180
K +L + ++ + L SL Q ++Q
Sbjct: 163 NK----LPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQ 204



Score = 40.6 bits (95), Expect = 8e-06
Identities = 29/166 (17%), Positives = 58/166 (34%), Gaps = 22/166 (13%)

Query: 101 RLLDAEHQ--EIQASLAQTQADVDLATSTLNR---NLELKKSGYVSEQL--LDENRTQLA 153
+L+ E++ E L ++ ++ S + +L + +E L L + +
Sbjct: 253 AVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIG 312

Query: 154 SLEAGKKRLLASLQANQLKRDKSQLLAPFNGIISQRQ-HNLGEVVEAGSPVFILVGSVNT 212
L L N+ ++ S + AP + + Q + H G VV + ++V +T
Sbjct: 313 LLTL-------ELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDT 365

Query: 213 -EAYIGVPVAVAQQFVNGQNVTV--SVHNQQ----FTAKIAGISAE 251
E V GQN + K+ I+ +
Sbjct: 366 LEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLD 411


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr7_1317  HTHTETR  71     2e-17    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 70.8 bits (173), Expect = 2e-17
Identities = 40/197 (20%), Positives = 68/197 (34%), Gaps = 5/197 (2%)

Query: 11 RSEQKKQQVLVAAIDLFCRQGFPHTSMDEVAKQAGVSKQTVYSHYGSKDDLFVAAIE--S 68
+++ +Q +L A+ LF +QG TS+ E+AK AGV++ +Y H+ K DLF E
Sbjct: 8 EAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSE 67

Query: 69 KCVGHNLNADLLSDPSQPEATLTEFALQFGEMIVSPEAITVFKACVAQSESHP---EVSR 125
+G P P + L E + E V+ E + + V +
Sbjct: 68 SNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQ 127

Query: 126 LFFEAGPQHMLAMLTKYLIAVEALGLYRFPQPHHCAVRLCLMLFGELKLRLELGLETEPL 185
+ + L + A + L ++ L
Sbjct: 128 QAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDL 187

Query: 186 LGEREQYIRGCAEMFLK 202
E Y+ EM+L
Sbjct: 188 KKEARDYVAILLEMYLL 204


71  Shewmr7_1327  Shewmr7_1370  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr7_1327  0       20       -2.895633  putative metalloprotease
Shewmr7_1328  1       18       -1.519483  PhoH family protein
Shewmr7_1329  3       18       -1.025467  (dimethylallyl)adenosine tRNA
Shewmr7_1330  2       17       -0.603886  hypothetical protein
Shewmr7_1331  1       17       -0.112308  hypothetical protein
Shewmr7_1332  0       17       0.325858   2-octaprenyl-3-methyl-6-methoxy-1,4-benzoquinol
Shewmr7_1333  -2      12       1.064311   peptidyl-tRNA hydrolase
Shewmr7_1334  -2      13       -0.068269  GTP-dependent nucleic acid-binding protein EngD
Shewmr7_1335  -1      15       -0.845990  *******hypothetical protein
Shewmr7_1336  1       20       -1.476175  hypothetical protein
Shewmr7_1337  2       21       -2.082658  hypothetical protein
Shewmr7_1338  3       25       -3.160392  hypothetical protein
Shewmr7_1339  4       33       -4.988723  hypothetical protein
Shewmr7_1340  4       31       -4.068136  hypothetical protein
Shewmr7_1341  2       26       -2.978958  transcription elongation factor GreA
Shewmr7_1342  0       20       -1.835950  hypothetical protein
Shewmr7_1343  0       18       -1.554776  preprotein translocase subunit SecD
Shewmr7_1344  -1      12       0.109777   preprotein translocase subunit SecF
Shewmr7_1345  -1      10       0.334974   preprotein translocase subunit SecF
Shewmr7_1346  -1      12       0.583387   hypothetical protein
Shewmr7_1347  -1      13       1.175842   23S rRNA methyltransferase J
Shewmr7_1348  -1      13       0.935952   membrane protease FtsH catalytic subunit
Shewmr7_1349  -1      12       0.772816   hypothetical protein
Shewmr7_1350  0       13       0.636307   dihydropteroate synthase
Shewmr7_1351  -1      14       0.577453   phosphoglucosamine mutase
Shewmr7_1352  -1      12       0.746143   triosephosphate isomerase
Shewmr7_1353  0       13       -0.394712  preprotein translocase subunit SecG
Shewmr7_1354  -1      14       -0.655571  **hypothetical protein
Shewmr7_1355  -2      16       -0.639954  transcription elongation factor NusA
Shewmr7_1356  -1      16       -0.706610  translation initiation factor IF-2
Shewmr7_1357  -1      16       -0.029607  ribosome-binding factor A
Shewmr7_1358  -2      13       -0.226987  tRNA pseudouridine synthase B
Shewmr7_1359  -2      13       -0.150126  30S ribosomal protein S15
Shewmr7_1360  -2      13       0.004325   diguanylate cyclase/phosphodiesterase
Shewmr7_1361  -1      13       -0.428815  hypothetical protein
Shewmr7_1362  -1      15       -0.856093  polynucleotide phosphorylase/polyadenylase
Shewmr7_1363  0       17       -1.598892  lipoprotein NlpI
Shewmr7_1364  1       16       -1.136507  lipoprotein NlpI
Shewmr7_1365  0       18       -1.079859  peptide chain release factor 3
Shewmr7_1366  1       20       -0.785722  TatD-related deoxyribonuclease
Shewmr7_1367  0       17       -0.229533  nucleoside transporter
Shewmr7_1368  0       19       -0.337577  hypothetical protein
Shewmr7_1369  -1      18       -0.062147  nucleoside-specific channel-forming protein,
Shewmr7_1370  -3      23       -0.620601  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr7_1327  HTHFIS  63     4e-13    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 62.5 bits (152), Expect = 4e-13
Identities = 24/128 (18%), Positives = 52/128 (40%), Gaps = 12/128 (9%)

Query: 180 HIMVIDDSAVARKQIIRALESLNLQIDTAKDGREALDKLKTIAAEMNNVAEEIPLIISDI 239
I+V DD A R + +AL + + + A + L+++D+
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIA---------AGDGDLVVTDV 55

Query: 240 EMPEMDGYTLTAEIRDDPKLKHIKVVLHTSLSGVFNQAMVQKVGANDFIAK-FNPDELAA 298
MP+ + + L I+ + V++ ++ + + GA D++ K F+ EL
Sbjct: 56 VMPDENAFDLLPRIKK--ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIG 113

Query: 299 AVNKHLSL 306
+ + L+
Sbjct: 114 IIGRALAE 121


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits        Score  E-value  Comments
Shewmr7_1330  FLGHOOKAP1  33     3e-04    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 33.0 bits (75), Expect = 3e-04
Identities = 9/38 (23%), Positives = 18/38 (47%)

Query: 99 NVNVMEEMADMISASRSYQMNVQVAEAAKSMLQQTLGM 136
VN+ EE ++ + Y N QV + A ++ + +
Sbjct: 508 GVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545



Score = 29.9 bits (67), Expect = 0.003
Identities = 16/67 (23%), Positives = 29/67 (43%), Gaps = 6/67 (8%)

Query: 5 SIFDVAGSGMSAQSVRLNTTASNIANADSVSSSIDKTYRSRHPIFEAEMAKAQSQQQASQ 64
S+ + A SG++A LNT ++NI++ + Y + I + +
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYN------VAGYTRQTTIMAQANSTLGAGGWVGN 55

Query: 65 GVAVKGI 71
GV V G+
Sbjct: 56 GVYVSGV 62


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits        Score  E-value  Comments
Shewmr7_1332  FLGHOOKAP1  40     1e-05    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 40.3 bits (94), Expect = 1e-05
Identities = 15/35 (42%), Positives = 22/35 (62%)

Query: 2 SFNIALSGISAAQKDLNTTANNIANANTIGFKESR 36
N A+SG++AAQ LNT +NNI++ N G+
Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAGYTRQT 37



Score = 37.6 bits (87), Expect = 8e-05
Identities = 12/49 (24%), Positives = 25/49 (51%)

Query: 405 SISSSALEQSNIDLTTELVDLISAQRNFQANSRTLEVNNTLQQTVLQIR 453
+S+ S ++L E +L Q+ + AN++ L+ N + ++ IR
Sbjct: 498 QLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR 546


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits        Score  E-value  Comments
Shewmr7_1334  FLGHOOKAP1  45     2e-07    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 44.6 bits (105), Expect = 2e-07
Identities = 19/119 (15%), Positives = 41/119 (34%), Gaps = 4/119 (3%)

Query: 145 EDATSITVSAEGEVSVKTPGAAENQVVGQLSMSDFINPSGLDPMGQNLYTETG---ASGT 201
D I +++E + + + Q + + +L ++ G A+
Sbjct: 427 TDEAKIAMASEEDAGDSDNRNGQALLDLQSNSKTVGGAKSFNDAYASLVSDIGNKTATLK 486

Query: 202 PIQGTASLDGMGAIRQGALETSNVNVTEELVNLIESQRIYEMNSKVISAVDQMLAYVNQ 260
T + + S VN+ EE NL Q+ Y N++V+ + + +
Sbjct: 487 TSSATQGNV-VTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALIN 544



Score = 35.7 bits (82), Expect = 2e-04
Identities = 9/36 (25%), Positives = 20/36 (55%)

Query: 5 LWISKTGLDAQQTDIAVISNNVANASTVGYKKSRAV 40
+ + +GL+A Q + SNN+++ + GY + +
Sbjct: 4 INNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTI 39


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_1335  FLGLRINGFLGH  147    2e-46    Flagellar L-ring protein signature.
		>FLGLRINGFLGH#Flagellar L-ring protein signature.

Length = 232

Score = 147 bits (373), Expect = 2e-46
Identities = 78/228 (34%), Positives = 113/228 (49%), Gaps = 20/228 (8%)

Query: 4 YFILAVALL-LTACSSTSKKPIADDPFYAPVYPEAPPTKIAATGSIYQDSQAA-----SL 57
Y I ++ +L LT C+ P+ A P P A GSI+Q +Q L
Sbjct: 9 YAISSLLVLSLTGCAWIPSTPLVQGATSAQPVPGPTP---VANGSIFQSAQPINYGYQPL 65

Query: 58 YSDIRAHKVGDIITIVLKEATQAKKSAGNQIKKGSDMSLDPIYAGGSNVS------IGGV 111
+ D R +GD +TIVL+E A KS+ + + G V G
Sbjct: 66 FEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNF-----GFDTVPRYLQGLFGNA 120

Query: 112 PLDLRYKDSMNTKRESDADQSNSLDGSISANVMQVLNNGNLVVRGEKWISINNGDEFIRV 171
D+ + A+ SN+ G+++ V QVL NGNL V GEK I+IN G EFIR
Sbjct: 121 RADVEASGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRF 180

Query: 172 TGIVRSQDIKPDNTIDSTRMANARIQYSGTGTFADAQKVGWLSQFFMS 219
+G+V + I NT+ ST++A+ARI+Y G G +AQ +GWL +FF++
Sbjct: 181 SGVVNPRTISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLN 228


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_1336  FLGPRINGFLGI  379    e-133    Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 379 bits (974), Expect = e-133
Identities = 161/367 (43%), Positives = 224/367 (61%), Gaps = 14/367 (3%)

Query: 5 LILAVAMLAFSLPSQAE--RIKDIANVQGVRNNQLIGYGLVVGLPGTGEKTR---YTEQT 59
L+ + + P+QA+ RIKDIA++Q R+NQLIGYGLVVGL GTG+ R +TEQ+
Sbjct: 11 LVFSALPFLSTPPAQADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDSLRSSPFTEQS 70

Query: 60 FTTMLKNFGINLPDNFRPKIKNVAVVAVHADMPAFIKPGQELDVTVSSLGEAKSLRGGTL 119
ML+N GI + KN+A V V A++P F PG +DVTVSSLG+A SLRGG L
Sbjct: 71 MRAMLQNLGITTQGG-QSNAKNIAAVMVTANLPPFASPGSRVDVTVSSLGDATSLRGGNL 129

Query: 120 LQTFLKGVDGNVYAIAQGSLVVSGFSADGLDGSKVIQNTPTVGRIPNGAIVERSVATPFS 179
+ T L G DG +YA+AQG+L+V+GFSA G D + + Q T R+PNGAI+ER + + F
Sbjct: 130 IMTSLSGADGQIYAVAQGALIVNGFSAQG-DAATLTQGVTTSARVPNGAIIERELPSKFK 188

Query: 180 TGDYLTFNLRRSDFSTAQRMADAINDL----LGPDMARPLDATSVQVSAPRDVSQRVSFL 235
L LR DFSTA R+AD +N G +A P D+ + V PR V+ +
Sbjct: 189 DSVNLVLQLRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQEIAVQKPR-VADLTRLM 247

Query: 236 ATLENIEVEPADESAKVIVNSRTGTIVVGQNVKLLPAAVTHGGLTVTIAEATQVSQPNAL 295
A +EN+ VE D AKV++N RTGTIV+G +V++ AV++G LTV + E+ QV QP
Sbjct: 248 AEIENLTVET-DTPAKVVINERTGTIVIGADVRISRVAVSYGTLTVQVTESPQVIQPAPF 306

Query: 296 ANGQTTVTSNSTINASESNRRMFMFNPGTTLDELVRAVNLVGAAPSDVLAILEALKVAGA 355
+ GQT V + I A + ++ + G L LV +N +G ++AIL+ +K AGA
Sbjct: 307 SRGQTAVQPQTDIMAMQEGSKVAIVE-GPDLRTLVAGLNSIGLKADGIIAILQGIKSAGA 365

Query: 356 LHGELII 362
L EL++
Sbjct: 366 LQAELVL 372


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr7_1337  FLGFLGJ  152    2e-45    Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 152 bits (386), Expect = 2e-45
Identities = 70/168 (41%), Positives = 101/168 (60%), Gaps = 2/168 (1%)

Query: 206 QILPTAAFRETQKTLKFGSREEFLATLYPHAEKAAKALGTQPEVLLAQSALETGWGQKIV 265
Q++ A R +L S+ FLA L A+ A++ G ++LAQ+ALE+GWGQ+ +
Sbjct: 131 QLVQKAVPRNYDDSLPGDSKA-FLAQLSLPAQLASQQSGVPHHLILAQAALESGWGQRQI 189

Query: 266 RGNNGAPSHNLFNIKADRRWQGDKANVSTLEFEQGIAVRQKADFRVYADFEHSFNDFVSF 325
R NG PS+NLF +KA W+G ++T E+E G A + KA FRVY+ + + +D+V
Sbjct: 190 RRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRVYSSYLEALSDYVGL 249

Query: 326 IAEGERYQAAKKVAASPTQFIRALQDAGYATDPKYAEKVIKVMQSISE 373
+ RY AA AAS Q +ALQDAGYATDP YA K+ ++Q +
Sbjct: 250 LTRNPRY-AAVTTAASAEQGAQALQDAGYATDPHYARKLTNMIQQMKS 296



Score = 86.3 bits (213), Expect = 4e-21
Identities = 39/93 (41%), Positives = 61/93 (65%), Gaps = 3/93 (3%)

Query: 12 DLGGLDSLRAQAQKDEKGALKKVAQQFEGVFVQMLMKSMRDANAVFESDSPLNSQYTKFY 71
D L+ L+A+A +D ++ VA+Q EG+FVQM++KSMRDA D +S++T+ Y
Sbjct: 14 DAQSLNELKAKAGEDPAANIRPVARQVEGMFVQMMLKSMRDALP---KDGLFSSEHTRLY 70

Query: 72 EQMRDQQLSVDLSDKGVLGLADMMVQQLSPESS 104
M DQQ++ ++ LGLA+MMV+Q++PE
Sbjct: 71 TSMYDQQIAQQMTAGKGLGLAEMMVKQMTPEQP 103


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits        Score  E-value  Comments
Shewmr7_1338  FLGHOOKAP1  215    3e-64    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 215 bits (549), Expect = 3e-64
Identities = 125/455 (27%), Positives = 195/455 (42%), Gaps = 19/455 (4%)

Query: 4 DLLNIARTGVLASQSQLGVTSNNIANANTAGYHRQVATQTTLDSQRLGNSFYGTGTYVSD 63
L+N A +G+ A+Q+ L SNNI++ N AGY RQ +S + G G YVS
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVSG 61

Query: 64 VKRIYNDYAARELRIGQTTLSAAEASYGKLSELDQVFSQIGKIVPQSLNDLFSGLNSLAD 123
V+R Y+ + +LR QT S A Y ++S++D + S + + D F+ L +L
Sbjct: 62 VQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLVS 121

Query: 124 LPADLGIRSSTLNDAKQLANSLNQMQSTLNGQLTQTNDQITGMTKRINEISTELANLNLE 183
D R + + ++ L N L Q Q N I +IN + ++A+LN +
Sbjct: 122 NAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLNDQ 181

Query: 184 LMKSPNQDAM-----LLDKQDALVQELSQYAQVNVIPLENGAKSIMLGGAIMLVSGEV-- 236
+ + A LLD++D LV EL+Q V V + G +I + LV G
Sbjct: 182 ISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTAR 241

Query: 237 PMSVSTATGDPFPNELQLMSSIGSQSVRVDPNKLGGQLGALFEYREQTLVPAGLELDQLA 296
++ ++ DP + + + G LG + +R Q L L QLA
Sbjct: 242 QLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQLA 301

Query: 297 LGVADNFNKLQAQGFDLNGQVGTDIFKDINDPLMSIGRVAGFSGNTGNATLGVNIDDTSA 356
L A+ FN GFD NG G D F + V + N G+ +G + D SA
Sbjct: 302 LAFAEAFNTQHKAGFDANGDAGEDFFA------IGKPAVLQNTKNKGDVAIGATVTDASA 355

Query: 357 LSGGSYELSF--TAPATYELRDTQTGTITPLTLNGTKLEGGAGFSIDIKAGAMASGDRFA 414
+ Y++SF L T T+TP +G G A D F
Sbjct: 356 VLATDYKISFDNNQWQVTRLASNTTFTVTPDANGKVAFDGLELTFT----GTPAVNDSFT 411

Query: 415 IRPTAGAANGIEVVMTDPKGIAAAAPKITPDAANS 449
++P + A ++V++TD IA A+ + D+ N
Sbjct: 412 LKPVSDAIVNMDVLITDEAKIAMASEEDAGDSDNR 446



Score = 82.7 bits (204), Expect = 1e-18
Identities = 55/217 (25%), Positives = 80/217 (36%), Gaps = 20/217 (9%)

Query: 430 TDPKGIAAAAPKITPDAANSGNTQVKVTQITNRSAANFPTTGSELTIQLDTTAVPPTFEA 489
T KG A +T +A +F ++T T
Sbjct: 338 TKNKGDVAIGATVTDASAVLATDY----------KISFDNNQWQVTRLASNT-TFTVTPD 386

Query: 490 FDVNGASLGAPVAYTPPSISAFGFTFEVDSSAAAAGDKFTFDLS---------FAEGDNT 540
+ A G + +T FT + S A D D + + DN
Sbjct: 387 ANGKVAFDGLELTFTGTPAVNDSFTLKPVSDAIVNMDVLITDEAKIAMASEEDAGDSDNR 446

Query: 541 NALAMAKLSETKVMNGGKSTLADVFEQTKQDIGSQTKAAEVRVGAADAIYQQAYARVESE 600
N A+ L GG + D + DIG++T + + Q + +S
Sbjct: 447 NGQALLDLQSNSKTVGGAKSFNDAYASLVSDIGNKTATLKTSSATQGNVVTQLSNQQQSI 506

Query: 601 SGVNLDEEAANLMRFQQAYQASARIMSTAQQIFDTLL 637
SGVNLDEE NL RFQQ Y A+A+++ TA IFD L+
Sbjct: 507 SGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALI 543


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
Shewmr7_1339  FLAGELLIN  51     5e-09    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 50.8 bits (121), Expect = 5e-09
Identities = 33/233 (14%), Positives = 83/233 (35%), Gaps = 4/233 (1%)

Query: 20 QTATSKILDQLSSGKKVNTSGDDPVAALGIDNLNQRNALVDQFMKNIDYATNHLQQTESQ 79
Q++ S +++LSSG ++N++ DD + + Q +N + + Q TE
Sbjct: 21 QSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQASRNANDGISIAQTTEGA 80

Query: 80 LGQADALISSMKDLMLQGSNGSQTSEERQTIADDLRKSLDQLLTIANTKDESGNYLFAGN 139
L + + + +++L +Q +NG+ + + ++I D++++ L+++ ++N +G + + +
Sbjct: 81 LNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEIDRVSNQTQFNGVKVLSQD 140

Query: 140 KTETLPFQFDANGKIVYQGDSGVHSAIIASGIQLNTNVAGDTAFIKSPNAMGDYSVNYSS 199
+ + I ++ G NV G +V
Sbjct: 141 NQMKIQVGANDGETITIDLQKIDVKSLGLDG----FNVNGPKEATVGDLKSSFKNVTGYD 196

Query: 200 SQQGEFSVTSAKLDGVTPSLSDYQINFLDDGAGGINVEVTDTATPANVISAAA 252
+ + ++ D T N +
Sbjct: 197 TYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDDAENNTAVDL 249


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
Shewmr7_1340  FLAGELLIN  207    2e-63    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 207 bits (528), Expect = 2e-63
Identities = 162/509 (31%), Positives = 228/509 (44%), Gaps = 46/509 (9%)

Query: 2 AITVNTNVTSLKSQKNLNGANSALQTSMERLSSGLRINSAKDDAAGLQISNRLTSQINGL 61
A +NTN SL +Q NLN + S+L +++ERLSSGLRINSAKDDAAG I+NR TS I GL
Sbjct: 1 AQVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGL 60

Query: 62 DVAQRNANDGISIAQTAEGAMQTSTDILQRMRDLSLQSANGSNSTEDRAAMQKELAALQT 121
A RNANDGISIAQT EGA+ + LQR+R+LS+Q+ NG+NS D ++Q E+
Sbjct: 61 TQASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLE 120

Query: 122 ELTRIADTTSFGGQKLLDGTYGTQKFQVGANANETISVTLMDVSSNKLGNNTISGAGSVL 181
E+ R+++ T F G K+L K QVGAN ETI++ L + LG + + G
Sbjct: 121 EIDRVSNQTQFNGVKVLSQD-NQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKE 179

Query: 182 GVAATDTLSNSVAFTAGDIKVNGKTVAVAAADTATSLADKINATGSGVKAEAKLSTTIEG 241
S V V A V A
Sbjct: 180 ATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTD 239

Query: 242 -------LSSTDTGTLTVYDAEGNADSYDLSTYN---------------------GDAKT 273
+ T T AE A + + G+ K
Sbjct: 240 DAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKV 299

Query: 274 LASDLGKAGYDVSYDADTGKIGFSATGVQGIEISGGVV---------------GGTVSLG 318
+ G+ D G A +Q + V L
Sbjct: 300 STTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLE 359

Query: 319 GNVADDTNTNVSVASSLTLSSPDKFTVTDDGTADLGEILSGGTSELNKVSDIDINTAEGA 378
N A + ++V + ++ VT G + + G S L ++ +
Sbjct: 360 ANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLI--NEDAAAAKKST 417

Query: 379 QDAISVIDAAIAGIDSSRSDLGAVQNRMSFTINNLNNISTNVSDARSRIQDVDFAKETAT 438
+ ++ ID+A++ +D+ RS LGA+QNR I NL N TN++ ARSRI+D D+A E +
Sbjct: 418 ANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSN 477

Query: 439 MTKQQILSQTSSAMLAQANQIPQVALSLL 467
M+K QIL Q +++LAQANQ+PQ LSLL
Sbjct: 478 MSKAQILQQAGTSVLAQANQVPQNVLSLL 506


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
Shewmr7_1341  FLAGELLIN  211    1e-64    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 211 bits (537), Expect = 1e-64
Identities = 163/509 (32%), Positives = 230/509 (45%), Gaps = 46/509 (9%)

Query: 2 AITVNTNVTSLGSQKNLNKANSALQTSMERLSSGLRINSAKDDAAGLQISNRLTSQINGL 61
A +NTN SL +Q NLNK+ S+L +++ERLSSGLRINSAKDDAAG I+NR TS I GL
Sbjct: 1 AQVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGL 60

Query: 62 DVAQRNANDGISIAQTAEGAMQTSTDILQRMRDLSLQSANGSNSADDRAAMQKEISSLQT 121
A RNANDGISIAQT EGA+ + LQR+R+LS+Q+ NG+NS D ++Q EI
Sbjct: 61 TQASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLE 120

Query: 122 ELTRIADTTSFGGQKLLDGTYGTQKFQVGSNANETISISLGDVSSNKLGNNTISGAGSVL 181
E+ R+++ T F G K+L K QVG+N ETI+I L + LG + + G
Sbjct: 121 EIDRVSNQTQFNGVKVLSQD-NQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKE 179

Query: 182 GVAATDTLSNSVAFTAGDIKVNGKTVAVAAADTATSLADKINATGSGVKAEAKLSTTIEG 241
S V V A V A
Sbjct: 180 ATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTD 239

Query: 242 -------LSSTDTGTLTVYDAEGNADSYDLSTYN---------------------GDAKT 273
+ T T AE A + + G+ K
Sbjct: 240 DAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKV 299

Query: 274 LASDLGKAGYDVSYDADTGKIGFSATGVQGIEISGGVV---------------GGTVSLG 318
+ G+ D G A +Q + V L
Sbjct: 300 STTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLE 359

Query: 319 GNVADDTNTNVSVASSLTLSSPDKFTVTDDGTADLGEILSGGTSELNKVSDIDINTAKGA 378
N A + ++V + ++ VT G + + G S L ++ K
Sbjct: 360 ANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLI--NEDAAAAKKST 417

Query: 379 QDAISVIDAAIAGIDSQRADLGAVQNRMNFTINNLSNISTNVSDARSRVQDVDFAKETAQ 438
+ ++ ID+A++ +D+ R+ LGA+QNR + I NL N TN++ ARSR++D D+A E +
Sbjct: 418 ANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSN 477

Query: 439 MTKQQILSQTSSAMLAQANQLPQVALSLL 467
M+K QIL Q +++LAQANQ+PQ LSLL
Sbjct: 478 MSKAQILQQAGTSVLAQANQVPQNVLSLL 506


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr7_1345  HTHFIS  448    e-157    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 448 bits (1153), Expect = e-157
Identities = 172/480 (35%), Positives = 263/480 (54%), Gaps = 19/480 (3%)

Query: 7 RILLVGTPSERLSRLCCIFEFLGEQIEII-STEKLSSCLQDTRYRALVLTTDNM----SV 61
IL+ + + L G + I + L + LV+T M +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGD-LVVTDVVMPDENAF 63

Query: 62 EALKSLANQYPWQPILL---FGNVGDLQVSNVLG---QIEEPLNYPQLTELLHFCQVYGQ 115
+ L + P P+L+ ++ G + +P + +L ++ +
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 116 VKRPQVPTSANQTKLFRSLVGRSEGIANVRHLISQVATSDATVLVLGQSGTGKEVVARNI 175
+ ++ + LVGRS + + +++++ +D T+++ G+SGTGKE+VAR +
Sbjct: 124 RRPSKLEDDSQD---GMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARAL 180

Query: 176 HYLSERRDGPFIPVNCGAIPPELLESELFGHEKGSFTGAISSRKGRFELAEGGTLFLDEI 235
H +RR+GPF+ +N AIP +L+ESELFGHEKG+FTGA + GRFE AEGGTLFLDEI
Sbjct: 181 HDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEI 240

Query: 236 GDMPLQMQVKLLRVLQERVFERVGGTKTINADVRVVAATHRDLESMITGNEFREDLYYRL 295
GDMP+ Q +LLRVLQ+ + VGG I +DVR+VAAT++DL+ I FREDLYYRL
Sbjct: 241 GDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRL 300

Query: 296 NVFPIEMPALSDRKDDVPLLLQELVSRVYNEGRGKVRFTQRAIESLKEHAWSGNVRELSN 355
NV P+ +P L DR +D+P L++ V + EG RF Q A+E +K H W GNVREL N
Sbjct: 301 NVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLDVKRFDQEALELMKAHPWPGNVRELEN 360

Query: 356 LVERLTILYPGGLVDVNDLPIKYRHIDVPEYSIELSEEQQERDALASIFTSEEPVEIPET 415
LV RLT LYP ++ + + R ++P+ IE + + +++ EE +
Sbjct: 361 LVRRLTALYPQDVITREIIENELRS-EIPDSPIEKAAARSGSLSISQAV--EENMRQYFA 417

Query: 416 RFPSELPPEGVNLKDLLAELEIDMIRQALEQQDNVVARAAEMLGIRRTTLVEKMRKYGMT 475
F LPP G+ +LAE+E +I AL +AA++LG+ R TL +K+R+ G++
Sbjct: 418 SFGDALPPSGL-YDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVS 476


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr7_1347  HTHFIS  463    e-163    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 463 bits (1193), Expect = e-163
Identities = 170/483 (35%), Positives = 250/483 (51%), Gaps = 43/483 (8%)

Query: 1 MSEAKLLLVEDDASLREALLDTLMLAQYDCIDVASGEEAIIALKQHQFDLVISDVQMQGI 60
M+ A +L+ +DDA++R L L A YD ++ + DLV++DV M
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 GGLGLLNYLQQHHPKLPVLLMTAYATIGSAVSAIKLGAVDYLAKPFAPEVLLNQVSRYLP 120
LL +++ P LPVL+M+A T +A+ A + GA DYL KPF L+ + R L
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 121 LKQSSDQPVVAD-----------EKSLALLSLAQRVAASDASVMILGPSGSGKEVLARYI 169
+ + D + + R+ +D ++MI G SG+GKE++AR +
Sbjct: 121 EPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARAL 180

Query: 170 HQHSSRADQAFVAINCAAIPENMLEATLFGYEKGAFTGAYQACPGKFELAQGGTLLLDEI 229
H + R + FVAIN AAIP +++E+ LFG+EKGAFTGA G+FE A+GGTL LDEI
Sbjct: 181 HDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEI 240

Query: 230 SEMDLGLQAKLLRVLQEREVERLGGRKTIKLDVRVLATSNRDLKAVVAAGQFREDLYYRI 289
+M + Q +LLRVLQ+ E +GGR I+ DVR++A +N+DLK + G FREDLYYR+
Sbjct: 241 GDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRL 300

Query: 290 NVFPLTWPALNQRPADILPLARHLLAKHAKALNVLDLPEFDEAACRRLLSHRWPGNVREL 349
NV PL P L R DI L RH + + K LD+ FD+ A + +H WPGNVREL
Sbjct: 301 NVVPLRLPPLRDRAEDIPDLVRHFVQQAEK--EGLDVKRFDQEALELMKAHPWPGNVREL 358

Query: 350 DNVVQRALILRAGAVITANDIIIDAQDVPLSSDD-------------------------- 383
+N+V+R L VIT I + + S
Sbjct: 359 ENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFAS 418

Query: 384 -AEYVNEPEGLGEELKAQEHVIILETLAQCQGSRKLVAEKLGISARTLRYKMARMRDMGI 442
+ + L E+ +IL L +G++ A+ LG++ TLR K +R++G+
Sbjct: 419 FGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKK---IRELGV 475

Query: 443 QLP 445
+
Sbjct: 476 SVY 478


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits         Score  E-value  Comments
Shewmr7_1348  FLGHOOKFLIE  59     6e-15    Flagellar hook-basal body complex protein FliE signa...
		>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE signature.

Length = 103

Score = 59.3 bits (143), Expect = 6e-15
Identities = 32/101 (31%), Positives = 54/101 (53%)

Query: 12 MQSLQGEIKPSFGISPNNIVQQVNNTSGADFGQLLSQAIGNVSGLQSTSSNLATRLEMGD 71
+Q ++G I + + Q+ F L A+ +S Q+ + A + +G+
Sbjct: 3 IQGIEGVISQLQATAMSARAQESLPQPTISFAGQLHAALDRISDTQTAARTQAEKFTLGE 62

Query: 72 TTVSLSDTVIAREKASVAFEATVQVRNKLVEAYKEIMSMPV 112
V+L+D + +KASV+ + +QVRNKLV AY+E+MSM V
Sbjct: 63 PGVALNDVMTDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV 103


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_1349  FLGMRINGFLIF  297    4e-96    Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 297 bits (762), Expect = 4e-96
Identities = 160/567 (28%), Positives = 263/567 (46%), Gaps = 57/567 (10%)

Query: 26 LGGVDMMRQITMILALAICLALAVFVMIWAQEPEYRPL-GKMETQEMVQVLDVLDKNKIK 84
L + +I +I+A + +A+ V +++WA+ P+YR L + Q+ ++ L + I
Sbjct: 16 LNRLRANPRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQDGGAIVAQLTQMNIP 75

Query: 85 YQIDVD--VVKVPEDKYQEVKMMLSRAGIDSAAASSKDFLTQDSGFGVSQRMEQARLKHS 142
Y+ ++VP DK E+++ L++ G+ A + L Q FG+SQ EQ + +
Sbjct: 76 YRFANGSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQ-EKFGISQFSEQVNYQRA 134

Query: 143 QEENLARAIEQLQSVSRAKVILALPKENVFARNTAQPSATVVINTRRG-GLGQGEVDAIV 201
E LAR IE L V A+V LA+PK ++F R PSA+V + G L +G++ A+V
Sbjct: 135 LEGELARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRALDEGQISAVV 194

Query: 202 DIVASAVQGLEPSRVTVTDSNGRLLNSGSQDGVSARARRELELVQQKEAEYRTKIDSILS 261
+V+SAV GL P VT+ D +G LL + G +L+ E+ + +I++ILS
Sbjct: 195 HLVSSAVAGLPPGNVTLVDQSGHLLTQSNTSGRDL-NDAQLKFANDVESRIQRRIEAILS 253

Query: 262 PILGPDNFTSQVDVSMDFTAVEQTAKRFNPDLPSLRSEMTVENNST-----GGSTGGIPG 316
PI+G N +QV +DF EQT + ++P+ + ++ + + G GG+PG
Sbjct: 254 PIVGNGNVHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNISEQVGAGYPGGVPG 313

Query: 317 ALSNQPP---------------MESNIPQEA-DKATESVTAGNSHREATRNFELDTTISH 360
ALSNQP N PQ + + S ++ R T N+E+D TI H
Sbjct: 314 ALSNQPAPPNEAPIATPPTNQQNAQNTPQTSTSTNSNSAGPRSTQRNETSNYEVDRTIRH 373

Query: 361 TRQQIGVVRRVSVSVAVDFKPGAAGENGQVARVARTEQELTNIRRLLEGAVGFSAQRGDV 420
T+ +G + R+SV+V V++K A G+ + T ++ I L A+GFS +RGD
Sbjct: 374 TKMNVGDIERLSVAVVVNYKTLADGKP-----LPLTADQMKQIEDLTREAMGFSDKRGDT 428

Query: 421 LEVVTVPFMDQLVEDVPAPELWEQPWFWRAVKLGVGALVILV----LILAVVRPMLKRLI 476
L VV PF + W+Q F + L++LV L VRP L R +
Sbjct: 429 LNVVNSPF-SAVDNTGGELPFWQQQSFIDQLLAAGRWLLVLVVAWILWRKAVRPQLTRRV 487

Query: 477 YPDNVNMPEDSRLGNELAEIEDQYAADTLGMLNTKEAEYSYADDGSIL---IPNLHKDDD 533
E E+ E + D + +
Sbjct: 488 E----EAKAAQEQAQVRQETEE-------------AVEVRLSKDEQLQQRRANQRLGAEV 530

Query: 534 MIKAIRALVANEPELSTQVVKNWLQDN 560
M + IR + N+P + V++ W+ ++
Sbjct: 531 MSQRIREMSDNDPRVVALVIRQWMSND 557


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_1350  FLGMOTORFLIG  290    9e-99    Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 290 bits (743), Expect = 9e-99
Identities = 109/348 (31%), Positives = 194/348 (55%), Gaps = 5/348 (1%)

Query: 1 MAENKSKDAAEAPSFNIKDLSGIEKTAILLLSLSEADAASILKHLEPKQVQKVGMAMAAM 60
M E K K+ + + L+G +K AILL+S+ ++ + K+L ++++ + +A +
Sbjct: 1 MEEKKEKEILD-----VSALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKL 55

Query: 61 EDFGQEKVIGVHKLFLDDIQKYSSIGFNSEEFVRKALTAALGEDKAGNLIEQIIMGSGAK 120
E E V F + + I ++ R+ L +LG KA ++I + ++
Sbjct: 56 ETITSELKDNVLLEFKELMMAQEFIQKGGIDYARELLEKSLGTQKAVDIINNLGSALQSR 115

Query: 121 GLDSLKWMDARQVATIIQNEHPQIQTIVLSYLEPDQAAEIFGQFPENTRLDLMMRIANLE 180
+ ++ D + IQ EHPQ ++LSYL+P +A+ I P + ++ RIA ++
Sbjct: 116 PFEFVRRADPANILNFIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMD 175

Query: 181 EVQPAALQELNDIMEKQFAGQGGAQAAKMGGLKAAANIMNYLDTGVESQLMETMRETDEE 240
P ++E+ ++EK+ A GG+ I+N D E ++E++ E D E
Sbjct: 176 RTSPEVVREVERVLEKKLASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPE 235

Query: 241 MAQQIQDLMFVFENLIDVDDRGIQTLLREVQQDVLMKALKGTDDQLKDKILGNMSKRAAE 300
+A++I+ MFVFE+++ +DDR IQ +LRE+ L KALK D +++KI NMSKRAA
Sbjct: 236 LAEEIKKKMFVFEDIVLLDDRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAAS 295

Query: 301 LLRDDLEAMGPIRISEVEIAQKEILSIARRLSDSGEIMLGGGGGDEFL 348
+L++D+E +GP R +VE +Q++I+S+ R+L + GEI++ GG ++ L
Sbjct: 296 MLKEDMEFLGPTRRKDVEESQQKIVSLIRKLEEQGEIVISRGGEEDVL 343


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr7_1351  FLGFLIH  88     2e-22    Flagellar assembly protein FliH signature.
		>FLGFLIH#Flagellar assembly protein FliH signature.

Length = 228

Score = 87.5 bits (216), Expect = 2e-22
Identities = 59/201 (29%), Positives = 104/201 (51%), Gaps = 4/201 (1%)

Query: 54 APKAVAAETIAPPTMAEIEDIRAQAEEEGFN---EGKTQGYAEGLEQGRLEGLEQGHTEG 110
AP I P IE+ E++ + QGY G+ +GR +G +QG+ EG
Sbjct: 16 APPQAEFVPIVEPEETIIEEAEPSLEQQLAQLQMQAHEQGYQAGIAEGRQQGHKQGYQEG 75

Query: 111 LAQGHEQGLEAGLAEAKTLIQRFEGLLSQFEKPLQLLDGDIEHSLMTLTMALAKSVIGHE 170
LAQG EQGL ++ + R + L+S+F+ L LD I LM + + A+ VIG
Sbjct: 76 LAQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTLDALDSVIASRLMQMALEAARQVIGQT 135

Query: 171 LKTHPEQILSALRLGVESLPIKEQSVSIRMHPDDVALVEQLYTSTQLNRNQWQLEAEPSL 230
++ ++ ++ P+ +R+HPDD+ V+ + +T L+ + W+L +P+L
Sbjct: 136 PTVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRVDDMLGAT-LSLHGWRLRGDPTL 194

Query: 231 NPGDCIISSQRSLVDLTLSSR 251
+PG C +S+ +D ++++R
Sbjct: 195 HPGGCKVSADEGDLDASVATR 215


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr7_1353  FLGFLIJ  43     5e-08    Flagellar FliJ protein signature.
		>FLGFLIJ#Flagellar FliJ protein signature.

Length = 147

Score = 42.9 bits (100), Expect = 5e-08
Identities = 36/145 (24%), Positives = 71/145 (48%)

Query: 1 MANADPLLLVLKLALDAEEQAALLLKSAQLECQKRQNQLDALNNYRLDYMKQMQSQQGQA 60
MA L + LA E AA LL + CQ+ + QL L +Y+ +Y + S
Sbjct: 1 MAEHGALATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAG 60

Query: 61 ISASHYHQFHRFIRQIDEAIAQQNRVVADGEKQKNYRQQHWLDKQKKRKAVELLLDNKEK 120
I+++ + + +FI+ +++AI Q + + ++ + W +K+++ +A + L + +
Sbjct: 61 ITSNRWINYQQFIQTLEKAITQHRQQLNQWTQKVDIALNSWREKKQRLQAWQTLQERQST 120

Query: 121 KRQALELKKEQKMTDEFASQQFFRR 145
E + +QK DEFA + R+
Sbjct: 121 AALLAENRLDQKKMDEFAQRAAMRK 145


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits         Score  E-value  Comments
Shewmr7_1354  FLGHOOKFLIK  50     1e-08    Flagellar hook-length control protein signature.
		>FLGHOOKFLIK#Flagellar hook-length control protein signature.

Length = 375

Score = 50.2 bits (119), Expect = 1e-08
Identities = 37/132 (28%), Positives = 63/132 (47%), Gaps = 5/132 (3%)

Query: 459 MKQQLVTMVSQGIQQAEIRLDPPELGHMLVKVQVHGDQTQVQFHVTQAQTRDVVEQAIPR 518
+ Q + QG Q AE+RL P +LG + + ++V +Q Q+Q R +E A+P
Sbjct: 244 LSQHISLFTRQGQQSAELRLHPQDLGEVQISLKVDDNQAQIQMVSPHQHVRAALEAALPV 303

Query: 519 LRELLQEQGMQLADSHVSQGDQGQRREGGFGEAGGSSGGNVDDFSAEELD-----LGLNQ 573
LR L E G+QL S++S +++ + N + + E+ D + L
Sbjct: 304 LRTQLAESGIQLGQSNISGESFSGQQQAASQQQQSQRTANHEPLAGEDDDTLPVPVSLQG 363

Query: 574 ATSLNSGIDYYA 585
+ NSG+D +A
Sbjct: 364 RVTGNSGVDIFA 375


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_1356  FLGMOTORFLIM  249    7e-83    Flagellar motor switch protein FliM signature.
		>FLGMOTORFLIM#Flagellar motor switch protein FliM signature.

Length = 344

Score = 249 bits (637), Expect = 7e-83
Identities = 88/327 (26%), Positives = 165/327 (50%), Gaps = 12/327 (3%)

Query: 1 MSDLLSQDEIDALLHGVDDVEEDNELDAAGLEARS----YDFSSQDRIVRGRMPTLEIVN 56
M+++LSQDEID LL + + E DA + YDF D+ + +M TL +++
Sbjct: 1 MTEVLSQDEIDQLLTAISSGDASIE-DARPISDTRKITLYDFRRPDKFSKEQMRTLSLMH 59

Query: 57 ERFARHLRISMFNMMRRAAEVSINGVQMLKFGEYVHTLFVPTSLNMVRFHPLKGTALITM 116
E FAR S+ +R V + V L + E++ ++ P++L ++ PLKG A++ +
Sbjct: 60 ETFARLTTTSLSAQLRSMVHVHVASVDQLTYEEFIRSIPTPSTLAVITMDPLKGNAVLEV 119

Query: 117 EARLVFILVDNFFGGDGRFHAKIEGREFTPTERRIVQLLLKIIFEDYKDAWAPVMDVEFD 176
+ + F ++D FGG G+ R+ T E +++ ++ I + +++W V+D+
Sbjct: 120 DPSITFSIIDRLFGGTGQAAKVQ--RDLTDIENSVMEGVIVRILANVRESWTQVIDLRPR 177

Query: 177 YLDSEVNPAMANIVSPTEVVVINSFHIEVDGGGGDFHITMPYSMIEPIRELLDAG--VQS 234
E NP A IV P+E+VV+ + +V G + +PY IEPI L + S
Sbjct: 178 LGQIETNPQFAQIVPPSEMVVLVTLETKVGEEEGMMNFCIPYITIEPIISKLSSQFWFSS 237

Query: 235 DKQDTDMRWSQALHDEIMDVKVGFDATVVEHELTLKDVMNFKAGDIIPIE---LPEYIMM 291
++ + ++ L D++ V + A V L+++D++ + GDII + + + ++
Sbjct: 238 VRRSSTTQYMGVLRDKLSTVDMDVVAEVGSLRLSVRDILGLRVGDIIRLHDTHVGDPFVL 297

Query: 292 KIEDLPTYRCKMGRSRDNLALKIYEKI 318
I + + C+ G +A +I E+I
Sbjct: 298 SIGNRKKFLCQPGVVGKKIAAQILERI 324


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_1357  FLGMOTORFLIN  111    9e-35    Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 111 bits (278), Expect = 9e-35
Identities = 54/122 (44%), Positives = 81/122 (66%)

Query: 2 STDDDWAAAMAEQALEEANAIDLDELVDDSQPISKAEAAKLDTILDIPVTISMEVGRSYI 61
+ DD WA A+ EQ + +D I+DIPV +++E+GR+ +
Sbjct: 14 ALDDLWADALNEQKATTTKSAADAVFQQLGGGDVSGAMQDIDLIMDIPVKLTVELGRTRM 73

Query: 62 SIRNLLQLNQGSVVELDRVAGEPLDVMVNGTLIAHGEVVVVNDKFGIRLTDVISQTERIK 121
+I+ LL+L QGSVV LD +AGEPLD+++NG LIA GEVVVV DK+G+R+TD+I+ +ER++
Sbjct: 74 TIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYGVRITDIITPSERMR 133

Query: 122 KL 123
+L
Sbjct: 134 RL 135


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_1359  FLGBIOSNFLIP  274    1e-95    Escherichia coli: Flagellar biosynthetic protein Fl...
		>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature.

Length = 245

Score = 274 bits (702), Expect = 1e-95
Identities = 121/240 (50%), Positives = 175/240 (72%)

Query: 8 LIGFSTLLFAASVGAADGVLPAVTVKTAADGSTEYSVTMQILLLMTSLSFIPAMVIMLTS 67
L+ + +L A LP +T + G +S+ +Q L+ +TSL+FIPA+++M+TS
Sbjct: 4 LLSVAPVLLWLITPLAFAQLPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILLMMTS 63

Query: 68 FTRIIVVLSILRQAIGLQQTPSNQVLIGMSLFMTFFIMAPVFDKIYDQGVKPYIDEQLTL 127
FTRII+V +LR A+G P NQVL+G++LF+TFFIM+PV DKIY +P+ +E++++
Sbjct: 64 FTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEEKISM 123

Query: 128 QQAFDKGKEPLRAFMLGQVRTTDLKTFIDISGYQNINSPEEAPMSVLVPAFITSELKTAF 187
Q+A +KG +PLR FML Q R DL F ++ + PE PM +L+PA++TSELKTAF
Sbjct: 124 QEALEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSELKTAF 183

Query: 188 QIGFMLFVPFLVLDLVVASILMAMGMMMLSPMIVSLPFKIMLFVLVDGWGLVMGTLANSF 247
QIGF +F+PFL++DLV+AS+LMA+GMMM+ P ++LPFK+MLFVLVDGW L++G+LA SF
Sbjct: 184 QIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLAQSF 243


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_1360  TYPE3IMQPROT  47     1e-10    Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature.

Length = 86

Score = 47.1 bits (112), Expect = 1e-10
Identities = 20/73 (27%), Positives = 39/73 (53%)

Query: 4 EALIDIFREALAVIVMMVSAIVLPGLGIGLIVAVFQAATSINEQTLSFLPRLIVTLLALM 63
+ L+ +AL +++++ + IGL+V +FQ T + EQTL F +L+ L L
Sbjct: 2 DDLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLF 61

Query: 64 VMGHWLVQTLMDF 76
++ W + L+ +
Sbjct: 62 LLSGWYGEVLLSY 74


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_1361  TYPE3IMRPROT  123    3e-36    Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature.

Length = 261

Score = 123 bits (311), Expect = 3e-36
Identities = 94/243 (38%), Positives = 142/243 (58%), Gaps = 1/243 (0%)

Query: 15 YMWPLFRVTSMLMVMVVFGATTTPTRVRLLLAVAITLAIAPVLPPVKDAELFSLSAVFIT 74
Y WPL RV +++ + + P RV+L LA+ IT AIAP LP D +FS A+++
Sbjct: 16 YFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLP-ANDVPVFSFFALWLA 74

Query: 75 AQQIIIGVAMGFVTQMVMQTFVLTGQIIGMQTSLGFASMVDPGSGQQTPVIGNFFLLLAT 134
QQI+IG+A+GF Q G+IIG+Q L FA+ VDP S PV+ +LA
Sbjct: 75 VQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHLNMPVLARIMDMLAL 134

Query: 135 LIFLAVDGHLLMIRMLVASFETLPISNQGLTLTSYRSLAEWGSYMFGAALTMSLSAIIAL 194
L+FL +GHL +I +LV +F TLPI + L ++ +L + GS +F L ++L I L
Sbjct: 135 LLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLIFLNGLMLALPLITLL 194

Query: 195 LLVNLSFGVMTRAAPQLNIFSIGFPITMIGGLLILWLTLTPVMAHFEEVWASAQLLLCDI 254
L +NL+ G++ R APQL+IF IGFP+T+ G+ ++ + + E +++ LL DI
Sbjct: 195 LTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFCEHLFSEIFNLLADI 254

Query: 255 LGL 257
+
Sbjct: 255 ISE 257


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_1362  TYPE3IMSPROT  338    e-117    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 338 bits (868), Expect = e-117
Identities = 97/347 (27%), Positives = 178/347 (51%), Gaps = 2/347 (0%)

Query: 6 SGERSEEPTGRRLEQAREKGQVARSKELGTATVLLSAATGLYMLGPGIAKALSNVFERVF 65
SGE++E+PT +++ AR+KGQVA+SKE+ + ++++ + L L + S + +
Sbjct: 2 SGEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLM--LI 59

Query: 66 TMDRAAIFDTNQMFNVWGVVGSEIGWPLLKIMLLIVVVAFIGNVSLGGMNFSTQAMMPKA 125
+++ + + + V V E + ++ + ++A +V G S +A+ P
Sbjct: 60 PAEQSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDI 119

Query: 126 SKMSPIAGFKRMFGVQALVELTKGIAKFSVVAIAAYLLLSHYFNDILLLSADHLPGNVHH 185
K++PI G KR+F +++LVE K I K +++I ++++ +L L +
Sbjct: 120 KKINPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPL 179

Query: 186 ALDLLVWMFILLCSSVLVIVVIDVPFQIWNHNKQLKMTKQEVKDEYKDTEGKPEVKGRVR 245
+L + ++ +VI + D F+ + + K+LKM+K E+K EYK+ EG PE+K + R
Sbjct: 180 LGQILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRR 239

Query: 246 QMQRELAQRRMMAEVPNADVIVVNPEHYAVAIKYDVKRSAAPFVIAKGVDEVAFKIREVA 305
Q +E+ R M V + V+V NP H A+ I Y + P V K D +R++A
Sbjct: 240 QFHQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIA 299

Query: 306 RAHNIAIVSAPPLARAIYHTTKLEQQIPEGLFTAVAQVLAYVFQLRQ 352
+ I+ PLARA+Y ++ IP A A+VL ++ +
Sbjct: 300 EEEGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQNI 346


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr7_1363  HTHFIS  31     0.027    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 30.6 bits (69), Expect = 0.027
Identities = 25/158 (15%), Positives = 51/158 (32%), Gaps = 19/158 (12%)

Query: 485 VVDAATVVATHISQILTNNAAKLLGYEEVQQLMDMLAKHSPKLVDGFIPDV-MPLGNVVK 543
V D + T ++Q L+ + L +A LV + DV MP N
Sbjct: 8 VADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLV---VTDVVMPDENAFD 64

Query: 544 VMQNLLNEGVSVR--------DLRTIVQTL----LEYGTKSNDTEVLTAAVRIAL---KR 588
++ + + T ++ +Y K D L + AL KR
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 589 MIVQEISGPELEIPVITLAPELEQMLHQSMQATGGDGP 626
+ + +P++ + ++++ + D
Sbjct: 125 RPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLT 162


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr7_1364  PF05272  31     0.013    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.8 bits (69), Expect = 0.013
Identities = 9/25 (36%), Positives = 12/25 (48%)

Query: 238 VKQGGVVALVGPTGVGKTTSLAKLA 262
K V L G G+GK+T + L
Sbjct: 593 CKFDYSVVLEGTGGIGKSTLINTLV 617


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr7_1367  HTHFIS  90     3e-24    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 89.9 bits (223), Expect = 3e-24
Identities = 33/105 (31%), Positives = 51/105 (48%), Gaps = 3/105 (2%)

Query: 6 KILIVDDFSTMRRIIKNLLRDLGFNNTQEADDGSTALPMLQKGDFDFVVTDWNMPGMQGI 65
IL+ DD + +R ++ L G++ + +T + GD D VVTD MP
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 66 DLLKAIRADDSLKHLPVLMVTAEAKREQIIAAAQAGVNGYVVKPF 110
DLL I+ LPVL+++A+ I A++ G Y+ KPF
Sbjct: 64 DLLPRIKKAR--PDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPF 106


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr7_1369  PF06580  43     2e-06    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 43.3 bits (102), Expect = 2e-06
Identities = 14/77 (18%), Positives = 32/77 (41%), Gaps = 10/77 (12%)

Query: 469 DLDKNLVEALADPLV--HLVRNSVDHGIEMPNDREASGKPRTGTITLSASQEGDHILLKI 526
++ +++ P++ LV N + HGI P+ G I L +++ + L++
Sbjct: 245 QINPAIMDVQVPPMLVQTLVENGIKHGIA--------QLPQGGKILLKGTKDNGTVTLEV 296

Query: 527 EDDGAGMDPEKLKQIAI 543
E+ G+ +
Sbjct: 297 ENTGSLALKNTKESTGT 313


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr7_1370  HTHFIS  64     2e-13    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 64.5 bits (157), Expect = 2e-13
Identities = 28/135 (20%), Positives = 56/135 (41%), Gaps = 7/135 (5%)

Query: 2 AIKVLVVDDSSFFRRRVSEIVNQDPELEVIATASNGAEAVKMAAELNPQVITMDIEMPVM 61
+LV DD + R +++ +++ + SN A + A + ++ D+ MP
Sbjct: 3 GATILVADDDAAIRTVLNQALSR--AGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 62 DGITAVREIMAKCP-TPILMFSSLTHDGAKATLDALDAGALDFLPKRF--EDIATNKDDA 118
+ + I P P+L+ S+ + + A + GA D+LPK F ++ A
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSA--QNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118

Query: 119 ILLLQQRVKALGRRR 133
+ ++R L
Sbjct: 119 LAEPKRRPSKLEDDS 133


72  Shewmr7_1376  Shewmr7_1379  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr7_1376  -1      18       -3.621179  hypothetical protein
Shewmr7_1377  -1      21       -3.633073  phosphoserine phosphatase
Shewmr7_1378  0       23       -4.271211  alpha/beta hydrolase fold domain-containing
Shewmr7_1379  -1      24       -4.855638  type IV pilus assembly PilZ
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_1376  TYPE3IMSPROT  54     7e-12    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 53.6 bits (129), Expect = 7e-12
Identities = 18/87 (20%), Positives = 32/87 (36%), Gaps = 3/87 (3%)

Query: 8 TQQAVALSYD-GKH-APKVVASGEGLVADEIIALAKASGVYIHQDPHLSNFL-RLLELGE 64
T A+ + Y G+ P V + +A+ GV I Q L+ L +
Sbjct: 265 THIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIAEEEGVPILQRIPLARALYWDALVDH 324

Query: 65 EIPKELYLLIAELIAFVYMLDGKFPEQ 91
IP E AE++ ++ + +
Sbjct: 325 YIPAEQIEATAEVLRWLERQNIEKQHS 351


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits         Score  E-value  Comments
Shewmr7_1377  IGASERPTASE  38     2e-04    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 37.7 bits (87), Expect = 2e-04
Identities = 32/181 (17%), Positives = 64/181 (35%), Gaps = 12/181 (6%)

Query: 230 LTPQAELLNTSKPSIQAQVAPDNKAAVEVTTSSATDNPTSKNNASALSIQTSLQANTEPK 289
+ P A + A+ + VE AT+ T++N A +++++ANT+
Sbjct: 1025 VPPPAPATPSETTETVAENSKQESKTVEKNEQDATE-TTAQNREVAKEAKSNVKANTQTN 1083

Query: 290 LTTVNQKLIPEIQPQEMAKTQQ-----KAPSLEINLTEAKNVASNTLPTRD--ENIRNTA 342
+ E Q E +T KA E V S P ++ E ++ A
Sbjct: 1084 EVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQA 1143

Query: 343 SAMLP----TNAMKSEAASSSLAKSALSSNELPLNLKPLAAEAQLTEKTNKASESTISVN 398
N + ++ +++ A + + E N++ E+ N E+ +
Sbjct: 1144 EPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTT 1203

Query: 399 E 399

Sbjct: 1204 P 1204


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_1378  VACJLIPOPROT  230    5e-78    VacJ lipoprotein signature.
		>VACJLIPOPROT#VacJ lipoprotein signature.

Length = 251

Score = 230 bits (588), Expect = 5e-78
Identities = 91/257 (35%), Positives = 134/257 (52%), Gaps = 16/257 (6%)

Query: 9 LLGFALLPKVYGAEATVPDTTPKETASAVKITYDDPRDPLEGFNRAMWDFNYLYLDRYIY 68
L AL + A+ + DPLEGFNR M++FN+ LD YI
Sbjct: 5 LSALALGTTLLVGCASSGTDQQGRS------------DPLEGFNRTMYNFNFNVLDPYIV 52

Query: 69 RPIAHGYNDYLPLPAKTGINNFVQNLEEPSSLVNNALQGKWGWAANAGGRFTVNTTIGLL 128
RP+A + DY+P PA+ G++NF NLEEP+ +VN LQG RF +NT +G+
Sbjct: 53 RPVAVAWRDYVPQPARNGLSNFTGNLEEPAVMVNYFLQGDPYQGMVHFTRFFLNTILGMG 112

Query: 129 GVFDVADMMGMPRKQDE---FNEVLGYYGVPNGPYFMAPFAGPYVVRELASDWVDGLYFP 185
G DVA M ++ E F LG+YGV GPY PF G + +R+ D D LY
Sbjct: 113 GFIDVAGMANPKLQRTEPHRFGSTLGHYGVGYGPYVQLPFYGSFTLRDDGGDMADALYPV 172

Query: 186 LSELTVWQSIVKWGLKSLHARASAIDQERLVDNALDPYTFVKDAYLQHMDYKVYDGNV-P 244
LS LT S+ KW L+ + RA +D + L+ + DPY V++AY Q D+ G + P
Sbjct: 173 LSWLTWPMSVGKWTLEGIETRAQLLDSDGLLRQSSDPYIMVREAYFQRHDFIANGGELKP 232

Query: 245 QKQEDDELLDQYMQELE 261
Q+ + + + +++++
Sbjct: 233 QENPNAQAIQDDLKDID 249


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr7_1379  HTHFIS  95     2e-23    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 94.9 bits (236), Expect = 2e-23
Identities = 29/101 (28%), Positives = 47/101 (46%)

Query: 7 SILLVEDDPVFRQIVATFLSGRGAEVVQACDGEQGLSIFKQQRFDIILADLSMPKLGGLD 66
+IL+ +DD R ++ LS G +V + D+++ D+ MP D
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 67 MLKEMSKLEPLVPSIVISGNNVMADVVEALRVGACDYLVKP 107
+L + K P +P +V+S N ++A GA DYL KP
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKP 105


73  Shewmr7_1572  Shewmr7_1578  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr7_1572  1       12       1.178372   hypothetical protein
Shewmr7_1573  1       12       1.452253   pyrroline-5-carboxylate reductase
Shewmr7_1574  -1      14       0.984944   hypothetical protein
Shewmr7_1575  0       17       0.739274   pilus retraction ATPase PilT
Shewmr7_1576  -2      14       -0.420074  twitching motility protein
Shewmr7_1577  -2      13       -0.264337  glutathione peroxidase
Shewmr7_1578  -2      14       -0.253641  ferrochelatase
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr7_1572  PF06580  28     0.045    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 27.9 bits (62), Expect = 0.045
Identities = 15/103 (14%), Positives = 35/103 (33%), Gaps = 7/103 (6%)

Query: 21 GFGLIKRKGLRTFVFIPLMINLVLFAAVIYVAIGQLDVLFTWMNAQLPEYLSWLNF---- 76
G+G+ L F F L + L + + +AI + ++ T + WL
Sbjct: 19 GWGVY---TLTGFGFASLYGSPKLHSMIFNIAISLMGLVLTHAYRSFIKRQGWLKLNMGQ 75

Query: 77 LLWPLAVTTMLVMLAFVFSSVMNWLAAPFNGLLAEKVEQLLTG 119
++ + +++ + + ++ W F L
Sbjct: 76 IILRVLPACVVIGMVWFVANTSIWRLLAFINTKPVAFTLPLAL 118


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits        Score  E-value  Comments
Shewmr7_1573  GPOSANCHOR  44     3e-06    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 43.9 bits (103), Expect = 3e-06
Identities = 36/226 (15%), Positives = 86/226 (38%), Gaps = 12/226 (5%)

Query: 188 RENLERLGDIRSELAKQLEKLSQQAKAAKQYRELKQAERKTHAELLVMRYQELQSQMASL 247
+ +LE+ + + + +A K E +QAE + E + +++ +L
Sbjct: 157 KADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTL 216

Query: 248 SEQISSLELQQAAAQSLAQTGELESTELQLTLSQLAEQEQQAVEAYYLTGTEIAKLEQQL 307
+ ++L ++A + + ST + L ++ A+LE+ L
Sbjct: 217 EAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEA-------RQAELEKAL 269

Query: 308 QSQKQRDAQLHTQLEQLSEQITQNQAKLAAYQASFQALEAELSQLAPQHELQQEMLDELQ 367
+ +++ L + +A+ A + Q L A L + +E +L+
Sbjct: 270 EGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLE 329

Query: 368 A-----QWEMSVSRTEAQSESARVLAATVAQHKLQLELHRSKLAHQ 408
A + + +S QS + A+ A+ +L+ E + + ++
Sbjct: 330 AEHQKLEEQNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQNK 375



Score = 42.4 bits (99), Expect = 1e-05
Identities = 49/283 (17%), Positives = 95/283 (33%), Gaps = 12/283 (4%)

Query: 227 KTHAELLVMRYQELQSQMASLSEQISSLELQQAAAQSLAQTGELESTELQLTLSQLAEQE 286
K L + L+ L+E++S+ + + + EL+ + L +
Sbjct: 70 KLKNSDLSFNNKALKDHNDELTEELSNAKEKLRKNDKSLSEKASKIQELEARKADLEKAL 129

Query: 287 QQAVEAYYLTGTEIAKLEQQLQSQKQRDAQLHTQLEQLSEQITQNQAKLAAYQASFQALE 346
+ A+ T + + L+++K A LE+ E A A + LE
Sbjct: 130 EGAMNFS----TADSAKIKTLEAEKAALAARKADLEKALEGAM---NFSTADSAKIKTLE 182

Query: 347 AELSQLAPQHELQQEMLDELQAQWEMSVSRTEAQSESARVLAATVAQHKLQLELHRSKLA 406
AE + L + ++ L+ ++ + LAA A + LE +
Sbjct: 183 AEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFST 242

Query: 407 HQQQLNAHKTQLHQEQQQELASLNAHALEDSSASLNDEIAQLEQALAEQVEINQEFESTL 466
+ A L ALE + + A+++ AE+ + E
Sbjct: 243 ADSAKIKTLEAEKAALEARQAELEK-ALEGAMNFSTADSAKIKTLEAEKAALEAE----K 297

Query: 467 AADTHALDLARGEFEQLSQRLTSMRARFELVEQWLAKQEELSD 509
A H + + L + L + R + +E K EE +
Sbjct: 298 ADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNK 340



Score = 41.2 bits (96), Expect = 2e-05
Identities = 41/293 (13%), Positives = 103/293 (35%), Gaps = 9/293 (3%)

Query: 616 AKQDNSQSLVQLSKEQIQLSEAIADCEQAKAIQQARLDELAQQLTQVRDSLSQGTKRLHQ 675
A + + +L ++ + + + + L ++ + LS ++L +
Sbjct: 44 ATRSQTDTLEKVQERADKFEIENNTLKLKNSDLSFNNKALKDHNDELTEELSNAKEKLRK 103

Query: 676 LQLDKATKSTQLNNAEIQAKQREAKRGQLAETVARTHAELAELAEQLILLAEQEDELAEA 735
+ K++++ E + E A++ L + LA ++ +L +A
Sbjct: 104 NDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKA 163

Query: 736 LEISLEQQQQQSQDAQGDMARHQALKAQIGDAERRLTSLNASLQSIATRMAVSTEQIELQ 795
LE ++ S + A AL+A+ + E+ L + + ++ +
Sbjct: 164 LEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAAL 223

Query: 796 RVRVSELVHSKEILSA--QLANVAAQEGDQQTAQLSEQLAQLLNRQQSQQQALQSLRSQQ 853
R ++L + E + + + + A L + A+L + + ++
Sbjct: 224 AARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKI 283

Query: 854 SSLTETLNSIGLKQKQELGKLEGLTQSLSTLKLRREGLKGQADSQLVALSEQQ 906
+L + E LE +Q L+ R+ L+ D+ A + +
Sbjct: 284 KTLEAEKA----ALEAEKADLEHQSQVLNA---NRQSLRRDLDASREAKKQLE 329



Score = 40.4 bits (94), Expect = 4e-05
Identities = 37/253 (14%), Positives = 89/253 (35%), Gaps = 4/253 (1%)

Query: 146 QGTISRLIESKPQDLRTFIEEAAGISRYKERRRETENRIRHTRENLERLGDIRSELAKQL 205
+S E ++ ++ E+A+ I + R+ + E + L +
Sbjct: 91 TEELSNAKEKLRKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEK 150

Query: 206 EKLSQQAKAAKQYRELKQAERKTHAELLVMRYQELQSQMASLSEQISSLELQQAAAQSLA 265
L+ + ++ E + + + L+++ A+L + + LE A + +
Sbjct: 151 AALAARKADLEKALEGAMNFSTADSA----KIKTLEAEKAALEARQAELEKALEGAMNFS 206

Query: 266 QTGELESTELQLTLSQLAEQEQQAVEAYYLTGTEIAKLEQQLQSQKQRDAQLHTQLEQLS 325
+ L+ + LA ++ +A ++++ + A L + +L
Sbjct: 207 TADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELE 266

Query: 326 EQITQNQAKLAAYQASFQALEAELSQLAPQHELQQEMLDELQAQWEMSVSRTEAQSESAR 385
+ + A A + LEAE + L + + L A + +A E+ +
Sbjct: 267 KALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKK 326

Query: 386 VLAATVAQHKLQL 398
L A + + Q
Sbjct: 327 QLEAEHQKLEEQN 339



Score = 32.7 bits (74), Expect = 0.008
Identities = 28/199 (14%), Positives = 55/199 (27%)

Query: 616 AKQDNSQSLVQLSKEQIQLSEAIADCEQAKAIQQARLDELAQQLTQVRDSLSQGTKRLHQ 675
K +L K +A LA + + +L
Sbjct: 184 EKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTA 243

Query: 676 LQLDKATKSTQLNNAEIQAKQREAKRGQLAETVARTHAELAELAEQLILLAEQEDELAEA 735
T + E + + E A++ L + L ++ +L
Sbjct: 244 DSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQ 303

Query: 736 LEISLEQQQQQSQDAQGDMARHQALKAQIGDAERRLTSLNASLQSIATRMAVSTEQIELQ 795
++ +Q +D + L+A+ E + AS QS+ + S E +
Sbjct: 304 SQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQL 363

Query: 796 RVRVSELVHSKEILSAQLA 814
+L +I A
Sbjct: 364 EAEHQKLEEQNKISEASRQ 382


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr7_1575  HTHTETR  36     3e-04    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 35.8 bits (82), Expect = 3e-04
Identities = 16/106 (15%), Positives = 38/106 (35%), Gaps = 7/106 (6%)

Query: 504 MLDRMGMKSATNLAQAIEAAKTTTLPRFLYALGIREVGEATAANLAT---HFGSLEALRV 560
M + ++ ++ A + + + + E+ +A HF L
Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60

Query: 561 ATIEQLIQVEDIGEVVAQHVAHFFAQPHNL--EVIDALITAGVNWP 604
E +IGE+ ++ A F P ++ E++ ++ + V
Sbjct: 61 EIWEL--SESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEE 104


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_1578  DHBDHDRGNASE  29     0.023    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 28.9 bits (64), Expect = 0.023
Identities = 10/45 (22%), Positives = 18/45 (40%), Gaps = 1/45 (2%)

Query: 7 EQKSVAVVGCG-WFGFALAKHLVQAGYRVTGAKRHTEELAPLTEA 50
E K + G G A+A+ L G + + E+L + +
Sbjct: 7 EGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSS 51


74  Shewmr7_1620  Shewmr7_1627  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
Shewmr7_1620  4       48       -11.650045  4-hydroxy-3-methylbut-2-en-1-yl diphosphate
Shewmr7_1622  3       43       -10.611360  histidyl-tRNA synthetase
Shewmr7_1623  2       39       -9.636594   hypothetical protein
Shewmr7_1624  2       29       -7.344452   outer membrane protein assembly complex subunit
Shewmr7_1625  1       17       -4.389660   outer membrane protein assembly complex subunit
Shewmr7_1626  1       15       -3.567802   GTP-binding protein EngA
Shewmr7_1627  -1      13       -1.087995   exodeoxyribonuclease VII large subunit
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_1620  DNABINDINGHU  89     4e-27    Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 89.0 bits (221), Expect = 4e-27
Identities = 36/88 (40%), Positives = 52/88 (59%)

Query: 2 NKAQLIQRIATSLEQSQASTKPVVEQILQQIHIALSEGEKVFLPQFGTFELRFHLPKSGR 61
NK LI ++A + E ++ + V+ + + L++GEKV L FG FE+R + GR
Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62

Query: 62 NPQTGETIEIAGFNQPSFKAATALKKAI 89
NPQTGE I+I P+FKA ALK A+
Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAV 90


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits        Score  E-value  Comments
Shewmr7_1623  SUBTILISIN  114    7e-30    Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 114 bits (286), Expect = 7e-30
Identities = 69/289 (23%), Positives = 120/289 (41%), Gaps = 61/289 (21%)

Query: 258 SSYVVDDNGNPVPNEYKVCVHGHGTAVGGTIASIRGNGVGVSSVLGSNNAELVYVKVLDS 317
++ DD G+P + +GHGT V GTIA+ N GV V + A+L+ +KVL+
Sbjct: 67 RNFTDDDEGDPEIFKDY---NGHGTHVAGTIAA-TENENGVVGV--APEADLLIIKVLNK 120

Query: 318 CNDGAFLSDIIKGIHWSVGDHFDGVTDISSPVDVINLSLGGMGNGGLCDVGFNAMADAVA 377
G II+GI++++ VD+I++SLGG + +AV
Sbjct: 121 QGSGQ-YDWIIQGIYYAIEQK----------VDIISMSLGG-------PEDVPELHEAVK 162

Query: 378 YANSKGAVVVASTGNSALEA----TAATPVSCHGIITAAANTSNGELAPFSNYYNSRKNI 433
A + +V+ + GN P + +I+ A + + FSN N ++
Sbjct: 163 KAVASQILVMCAAGNEGDGDDRTDELGYPGCYNEVISVGAINFDRHASEFSNSNNE-VDL 221

Query: 434 SAIGQDLLTPFVNTSVYVSRNGVGGVEGDCNKITNCYAYYTGTSLSAPVISSAVALIKME 493
A G+D+L+ YA ++GTS++ P ++ A+ALIK
Sbjct: 222 VAPGEDILSTV---------------------PGGKYATFSGTSMATPHVAGALALIKQL 260

Query: 494 NPS-----LKAEQIFDILYNTA------SEYNTNEVGNKTALYKLSKNT 531
+ L +++ L + N + TA+ +LS+
Sbjct: 261 ANASFERDLTEPELYAQLIKRTIPLGNSPKMEGNGLLYLTAVEELSRIF 309


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_1626  YERSSTKINASE  32     0.012    Yersinia serine/threonine protein kinase signature.
		>YERSSTKINASE#Yersinia serine/threonine protein kinase signature.

Length = 732

Score = 32.0 bits (72), Expect = 0.012
Identities = 19/47 (40%), Positives = 25/47 (53%), Gaps = 1/47 (2%)

Query: 182 QVLDGIIHSHANQVLHRDIKPDNILVDD-EGRVHVIDFGISKLMGEQ 227
++LD H V+H DIKP N++ D G VID G+ GEQ
Sbjct: 253 RLLDVTNHLAKAGVVHNDIKPGNVVFDRASGEPVVIDLGLHSRSGEQ 299


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr7_1627  PF05272  30     0.018    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.7 bits (66), Expect = 0.018
Identities = 11/30 (36%), Positives = 15/30 (50%), Gaps = 4/30 (13%)

Query: 26 LESGA----PIALVGPNGAGKTTLFSLLCG 51
+E G + L G G GK+TL + L G
Sbjct: 589 MEPGCKFDYSVVLEGTGGIGKSTLINTLVG 618


75  Shewmr7_1697  Shewmr7_1704  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr7_1697  -2      15       0.168823   flagellar biosynthesis protein FliP
Shewmr7_1698  -2      12       0.409175   flagellar biosynthetic protein FliQ
Shewmr7_1699  -2      10       0.014808   flagellar biosynthesis protein FliR
Shewmr7_1700  -2      10       -0.251492  flagellar biosynthesis protein FlhB
Shewmr7_1701  -2      10       -0.230868  flagellar biosynthesis protein FlhA
Shewmr7_1702  -1      13       0.255754   flagellar biosynthesis regulator FlhF
Shewmr7_1703  1       19       0.116556   cobyrinic acid a,c-diamide synthase
Shewmr7_1704  5       30       -1.012305  flagellar biosynthesis sigma factor
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr7_1697  TCRTETB  63     8e-13    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 63.0 bits (153), Expect = 8e-13
Identities = 68/394 (17%), Positives = 147/394 (37%), Gaps = 35/394 (8%)

Query: 31 SKQRDTRLMWALCVASVVVYINLYLMQGMLPLIAEHFAVSGSKATLILSVTSFSLAFSLL 90
S R +++ LC+ S +N ++ LP IA F + + + + +
Sbjct: 8 SNLRHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTA 67

Query: 91 IYAVVSDRIGRHTPIVVSLWLLALSNLL-LIWAGDFNALVYVRFLQGVLLAAVPAIAMAY 149
+Y +SD++G ++ + + +++ + F+ L+ RF+QG AA PA+ M
Sbjct: 68 VYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVV 127

Query: 150 FKEQLSPSTMLKAAGIYIMANSIGGIVGRLLGGVMSQFLSWQESMWLLFLVTLAGVALTS 209
+ KA G+ ++G VG +GG+++ ++ W + L+ ++T+ V
Sbjct: 128 VARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWS-YLLLIPMITIITVPFLM 186

Query: 210 YLLPSGADAQ--------------------AVSSGQATSPTLSKRARLL--QDIYGFSHH 247
LL + +S + +S + L+ + I +
Sbjct: 187 KLLKKEVRIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFLIFVKHIRKVTDP 246

Query: 248 LTDPQM--RLAYAIG---GVTFMMMVNQFSFIQLHLMAAPYEWSRFQA--TLIFLCYSSG 300
DP + + + IG G V F + ++M ++ S + +IF S
Sbjct: 247 FVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSV 306

Query: 301 TVASYFTAKWLAKFGQHKLYQWSWCLMLLGSL---LTLFDTTFTICLGFLMTACGFFLTH 357
+ Y + + G + + + L L T++ + + + G T
Sbjct: 307 IIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTK 366

Query: 358 SCCNSFVAMRAS-RDRAKATSLYLCCYYLGAALG 390
+ ++ V+ ++ SL +L G
Sbjct: 367 TVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTG 400


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_1702  ACRIFLAVINRP  505    e-164    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 505 bits (1301), Expect = e-164
Identities = 216/1047 (20%), Positives = 450/1047 (42%), Gaps = 72/1047 (6%)

Query: 3 LTRLAIKRPVTTSMFFFAILLFGLASSRLLPLEMFPGIDIPQIVVQVPYKGSTPAEVERD 62
+ I+RP+ + +++ G + LP+ +P I P + V Y G+ V+
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 63 ITKVLEESLATMGGIDELESESSQEG-AEIEINMKWGENVATKSLEAREKIDAVRHLLPK 121
+T+V+E+++ + + + S S G I + + G + ++ + K+ LLP+
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 122 DVERVFIRQFSTADMPVLTIRISSDRELSGAFDLLD---KQLKRPLERVEGVSKVNLYGV 178
+V++ I ++ ++ SD + D+ D +K L R+ GV V L+G
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFG- 179

Query: 179 EQKQIEVRINANRLAASGYSATELQTRLGRENFVLSAGTL------RESNLVYQVSPKGE 232
Q + + ++A+ L + ++ +L +N ++AG L L + +
Sbjct: 180 AQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239

Query: 233 FRNLEDIKALVLLPGLT-----LDDVADVQFALPERVEGRHLDKHYAVGLDVFKESGANL 287
F+N E+ + L L DVA V+ ++ A GL + +GAN
Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299

Query: 288 VEVSDRVLEVIELAKQDQQF-QGIRLFIMEDQASGVKSSLSDLLLSGLIGALLSFIVLYL 346
++ + + +LA+ F QG+++ D V+ S+ +++ + +L F+V+YL
Sbjct: 300 LDTAKAIKA--KLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYL 357

Query: 347 FLRNFKMTLIVVSSVPISIGMTLAAMYLLGYSLNILSMMGLLLAVGMLIDNAVVVTESVL 406
FL+N + TLI +VP+ + T A + GYS+N L+M G++LA+G+L+D+A+VV E+V
Sbjct: 358 FLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVE 417

Query: 407 QEKQGKATNSAEDNENAVMTGVDKVSLAVLAGTMTTAIVFLPNIFGVKVELTIFLEHVAI 466
+ E A + ++ A++ M + VF+P F I+ + +I
Sbjct: 418 RVMMEDKLPPKE----ATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQ-FSI 472

Query: 467 AICISLAASLLVAKTLIPLMLTKFHFDIAPEKAPGK-------------LQNFYNRSLNW 513
I ++A S+LVA L P + ++ E K N Y S+
Sbjct: 473 TIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGK 532

Query: 514 VLLRPWRSGLISVAILVSTALPLSMVKQDQEDSQSKERIYINYQVEGRHNLNVTEAMVSQ 573
+L R LI I+ + + + + Q+ T+ ++ Q
Sbjct: 533 ILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQ 592

Query: 574 MEEYLYNNKEEFHIDSVYSYY-------APDDASSVILLK--KDLPMPLDELKKKIRSGF 624
+ +Y N++ +++SV++ A + + + LK ++ + + I
Sbjct: 593 VTDYYLKNEKA-NVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAK 651

Query: 625 PKFSIAKPQFGWGNDNSGVRVTLTGRST--------------SELIHLSEQVLPLLS-NI 669
+ K + G+ + + G +T L Q+L + + +
Sbjct: 652 MEL--GKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHP 709

Query: 670 KGLVDVRSEVNGAQQEVVIRINRQMAARLDLKLNEVASSISMALRGSPLRSFRHDPNGEL 729
LV VR + + ++++ A L + L+++ +IS AL G+ + F D
Sbjct: 710 ASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFI-DRGRVK 768

Query: 730 RIEMAYEKEWQKSLEKLKQLPIVRIDQRLYTLDNLASIEILPRFDTIKHYNRQTSLSIGA 789
++ + + +++ E + +L + + + + + ++ YN S+ I
Sbjct: 769 KLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQG 828

Query: 790 NLDK-LTTEEAQTKIKQVMENVRFPNGYNYSLRGGFERQDEDESVMAINMLLAIAMIYIV 848
++ +A ++ + + P G Y G ++ + + ++ ++++
Sbjct: 829 EAAPGTSSGDAMALMENLASKL--PAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLC 886

Query: 849 MAALFESLLLPTAIITSILFSITGVFWALLLTGTPMSVMAMIGILILMGIVVNNGIVLVD 908
+AAL+ES +P +++ + I GV A L V M+G+L +G+ N I++V+
Sbjct: 887 LAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVE 946

Query: 909 QINQ-MTPELDKLSDTIREVCITRLRPVLMTVGTTVLGLVPLAMGETQIGGGGPPYSPMA 967
M E + + RLRP+LMT +LG++PLA+ G G + +
Sbjct: 947 FAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAIS---NGAGSGAQNAVG 1003

Query: 968 IAIIGGLSFSTVTSLYLVPLCYQLLYR 994
I ++GG+ +T+ +++ VP+ + ++ R
Sbjct: 1004 IGVMGGMVSATLLAIFFVPVFFVVIRR 1030


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_1703  ACRIFLAVINRP  665    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 665 bits (1716), Expect = 0.0
Identities = 262/1094 (23%), Positives = 473/1094 (43%), Gaps = 100/1094 (9%)

Query: 7 SVKRPVTVWMFMLAIMLFGMVGFSRLAVKLLPDLSYPTLTIRTLYDGAAPVEVEQLVSKP 66
++RP+ W+ + +M+ G + +L V P ++ P +++ Y GA V+ V++
Sbjct: 5 FIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQV 64

Query: 67 IEEAVGVVKGLRKISSISRS-GMSDVVLEFEWGTTMDMASLDVREKLDTI--ALPLDVKK 123
IE+ + + L +SS S S G + L F+ GT D+A + V+ KL LP +V++
Sbjct: 65 IEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQQ 124

Query: 124 PLLLRFNPNLDPIMRLALSVPNASEAELKQMRTYAEEELKRRLEALSGVAAVRLSGGLEQ 183
+ + +M N + Y +K L L+GV V+L G +
Sbjct: 125 QGISVEKSSSSYLMVAGFVSDNPGT-TQDDISDYVASNVKDTLSRLNGVGDVQLFGA-QY 182

Query: 184 EVHIQLNQEKLSQLNLNADDIKRRIYEENINLSAGKVIQGD------REYLVRTLNQFNS 237
+ I L+ + L++ L D+ ++ +N ++AG++ + +F +
Sbjct: 183 AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKN 242

Query: 238 LEELGQVIVYRDAQ-TLVRLFEVATITDAFKERSDITRIGSQESIELAIYKEGDANTVAV 296
EE G+V + ++ ++VRL +VA + + + I RI + + L I AN +
Sbjct: 243 PEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALDT 302

Query: 297 AKKLRDELVKINQD-PKQNKLEVIYDQSEFIESAVSEVTSSALMGSILSMLVIYLFLRNI 355
AK ++ +L ++ P+ K+ YD + F++ ++ EV + +L LV+YLFL+N+
Sbjct: 303 AKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNM 362

Query: 356 IPTLIISISIPFSVIATFNMMYFADISLNIMSLGGIALAIGLLVDNAIVVLENIDRC-RS 414
TLI +I++P ++ TF ++ S+N +++ G+ LAIGLLVD+AIVV+EN++R
Sbjct: 363 RATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMME 422

Query: 415 EGMSKLDAAVTGTKEVAGAIFASTMTTLAVFVPLVFVDGIAGALFSDQALTVTFALLASL 474
+ + +A ++ GA+ M AVF+P+ F G GA++ ++T+ A+ S+
Sbjct: 423 DKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSV 482

Query: 475 LVALTSIPMLASREGFTALPELIKKTPKEKPTTKLGKLKHYSATVFSFPIVLLFSYIPSA 534
LVAL P L + L+K E K
Sbjct: 483 LVALILTPALCAT--------LLKPVSAEHHENK-------------------------- 508

Query: 535 LLTLALVIGRFFSWLLGLVMRPLSSGFNFVYHVIESVYHKLLAMALRKQVATLLLTIGIT 594
G FF W FN + + Y + L LL+ I
Sbjct: 509 --------GGFFGW------------FNTTFDHSVNHYTNSVGKILGSTGRYLLIYALIV 548

Query: 595 GACISLLPRLGMELIPPMNQGEFYVEILLPPGTAVGETDKVLQQLAMSI--KDRPEVKHA 652
+ L RL +P +QG F I LP G T KVL Q+ ++ V+
Sbjct: 549 AGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNEKANVESV 608

Query: 653 YSQAGSGGLMTSDTARGGENWGRLQVVL---SDHTAYHQVTQVLRDTARRIPELEAKIEQ 709
++ G + +N G V L + + + A+ KI
Sbjct: 609 FTVNGFS------FSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMEL---GKIRD 659

Query: 710 PELFSFKTPLEIEL---SGYDLHLLKHSADNLVKALSASDRFA-----------DVNTSL 755
+ F P +EL +G+D L+ + A ++ V +
Sbjct: 660 GFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNG 719

Query: 756 RDGQPELSIRFDHARLAALGMDAPTVANRIAQRVGGTVASQYTVRDRKIDILVRSELDER 815
+ + + D + ALG+ + I+ +GGT + + R R + V+++ R
Sbjct: 720 LEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFR 779

Query: 816 DQISDIDALIINPNSQQPIALSAVAEVSLQLGPSAINRISQQRVALVSANLAYG-DLSDA 874
D+D L + + + + SA G + R + + A G DA
Sbjct: 780 MLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDA 839

Query: 875 VAEAQQILSAQVLPASVQARFGGQNEEMEHSFQSLKIALILAVFLVYLVMASQFESLLHP 934
+A + + S LPA + + G + + S + ++ +V+L +A+ +ES P
Sbjct: 840 MALMENLASK--LPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIP 897

Query: 935 LLILFAVPMALAGSVLGLYITQTHLSVVVFIGLIMLAGIVVNNAIVLVDRINQL-RTEGV 993
+ ++ VP+ + G +L + V +GL+ G+ NAI++V+ L EG
Sbjct: 898 VSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGK 957

Query: 994 DKLEAIKVAAKSRLRPIMMTTLTTTLGLLPMALGLGDGSEVRAPMAITVIFGLSLSTLLT 1053
+EA +A + RLRPI+MT+L LG+LP+A+ G GS + + I V+ G+ +TLL
Sbjct: 958 GVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLA 1017

Query: 1054 LIVIPVLYALFDRK 1067
+ +PV + + R
Sbjct: 1018 IFFVPVFFVVIRRC 1031


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
Shewmr7_1704  RTXTOXIND  41     6e-06    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 41.0 bits (96), Expect = 6e-06
Identities = 13/53 (24%), Positives = 29/53 (54%)

Query: 72 GLIEAINVEEGDRVQKGQILAVIDAKRQQYDLDRSEAEVKIIEQELNRLKKMS 124
+++ I V+EG+ V+KG +L + A + D ++++ + E R + +S
Sbjct: 105 SIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILS 157



Score = 39.8 bits (93), Expect = 1e-05
Identities = 35/202 (17%), Positives = 80/202 (39%), Gaps = 24/202 (11%)

Query: 91 LAVIDAKRQ----QYDLDRSEAEVKIIEQELNRLK---KMSNKEFIS--ADSMAKLEYNL 141
AV++ + + +L +++++ IE E+ K ++ + F + D + + N+
Sbjct: 252 HAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNI 311

Query: 142 QAAIAKRDLAELQVKESHVVSPINGIIAKRYVKAGNMAKEFGD-LFYIV-NQDELHGIVH 199
+ E + + S + +P++ + + V + L IV D L
Sbjct: 312 GLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTAL 371

Query: 200 LPEQQLTSLRLGQEAQV-FS--NQQSKNAIHAKVLRISP--VVDPQSGT-FKVTLAVP-- 251
+ + + + +GQ A + + KV I+ + D + G F V +++
Sbjct: 372 VQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNVIISIEEN 431

Query: 252 -----NQDAHLKAGMFTRVELK 268
N++ L +GM E+K
Sbjct: 432 CLSTGNKNIPLSSGMAVTAEIK 453


76  Shewmr7_1872  Shewmr7_1887  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr7_1872  1       16       0.774414   beta-lactamase domain-containing protein
Shewmr7_1873  1       17       0.780308   methyl-accepting chemotaxis sensory transducer
Shewmr7_1874  1       17       -1.133263  hypothetical protein
Shewmr7_1875  2       17       -1.339079  hypothetical protein
Shewmr7_1876  2       16       -1.486700  2Fe-2S iron-sulfur cluster binding
Shewmr7_1877  0       18       -3.738706  twin-arginine translocation pathway signal
Shewmr7_1878  1       14       -2.152195  hypothetical protein
Shewmr7_1879  1       15       -1.642314  hypothetical protein
Shewmr7_1880  1       16       -0.025975  hypothetical protein
Shewmr7_1881  1       17       0.244313   methyl-accepting chemotaxis sensory transducer
Shewmr7_1882  1       17       0.068094   hypothetical protein
Shewmr7_1883  1       16       -0.008907  hypothetical protein
Shewmr7_1884  0       18       -1.028037  GCN5-related N-acetyltransferase
Shewmr7_1885  0       17       -1.179297  hypothetical protein
Shewmr7_1886  -1      12       -0.555881  hypothetical protein
Shewmr7_1887  -1      13       -0.458677  type 11 methyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
Shewmr7_1872  RTXTOXINA  31     0.010    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 30.7 bits (69), Expect = 0.010
Identities = 11/39 (28%), Positives = 16/39 (41%)

Query: 95 LINSLISKITQLNQGTEAFSSTLADFGLQLQTKHDVGTL 133
LIN L+ + LN +FS L G L + +
Sbjct: 187 LINQLVDTVASLNNNVNSFSQQLNTLGSVLSNTKHLNGV 225


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
Shewmr7_1874  RTXTOXIND  87     7e-21    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 86.8 bits (215), Expect = 7e-21
Identities = 44/286 (15%), Positives = 96/286 (33%), Gaps = 26/286 (9%)

Query: 71 VENQRVEKGQVLFRLDDAMFKVMVDKASAKLAQVKTDLAVLKASYHEKQAEITLAETKLA 130
V + V + L + + ++ + L + + + + A + + + +++L
Sbjct: 179 VSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLD 238

Query: 131 FAEKEQKRQENLIGKHFV--SESQLEDARQNTDIARQNIQTLQKDLHRIAESLGGSP-DF 187
+ + I KH V E++ +A + + ++ ++ ++ E F
Sbjct: 239 --DFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLF 296

Query: 188 PIEQHPSYLEALAQLNE-------AKLDLSRVEIKAPVSGVVSQLP--KLGQYVNVGAIA 238
E + + + I+APVS V QL G V
Sbjct: 297 KNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETL 356

Query: 239 LALV-ADHALWIEANFTETDLTHVKPGQKVNIHIDTFPDNRW---QGTVESLSPATGAEF 294
+ +V D L + A D+ + GQ I ++ FP R+ G V++++
Sbjct: 357 MVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDA---- 412

Query: 295 SLIPAQNATGNWVKIAQRVPVRIAIDTVLPEAPLRAGLSAVVDIDT 340
G + + PL +G++ +I T
Sbjct: 413 ---IEDQRLGLVFNVIISIEENCLST-GNKNIPLSSGMAVTAEIKT 454



Score = 57.1 bits (138), Expect = 4e-11
Identities = 22/138 (15%), Positives = 43/138 (31%), Gaps = 12/138 (8%)

Query: 50 VKADKVPVSAQVAGNVDNLYVVENQRVEKGQVLFRLDDAMFKVMVDKASAKLAQVKTDLA 109
+ V + V E + V KG VL +L + K + L Q + +
Sbjct: 92 HSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQT 151

Query: 110 VLKASY----HEKQAEITLAETK--LAFAEKEQKRQENLIGKHFVSESQLEDARQNTDIA 163
+ K E+ L + +E+E R +LI + Q +
Sbjct: 152 RYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLI------KEQFSTWQNQKYQK 205

Query: 164 RQNIQTLQKDLHRIAESL 181
N+ + + + +
Sbjct: 206 ELNLDKKRAERLTVLARI 223


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr7_1875  TCRTETB  135    1e-36    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 135 bits (340), Expect = 1e-36
Identities = 85/390 (21%), Positives = 174/390 (44%), Gaps = 15/390 (3%)

Query: 51 LDTTIANVALPHMQGSMGATQDQISWVLTSYIVAAAIFMPLTGFLTARLGRKRVFMWAVV 110
L+ + NV+LP + +WV T++++ +I + G L+ +LG KR+ ++ ++
Sbjct: 28 LNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGII 87

Query: 111 GFTIASMLCGAAQNLEQIVLF-RLLQGVFGASLVPLSQSVLLDSYPPERHGSAMALWGVG 169
S++ + +++ R +QG A+ L V+ P E G A L G
Sbjct: 88 INCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSI 147

Query: 170 VMVGPILGPSLGGWLTEYYNWRWVFYINLPFGLLAWFGLAAYVKETPLDHSRKFDLLGFA 229
V +G +GP++GG + Y +W + + +P + + + + FD+ G
Sbjct: 148 VAMGEGVGPAIGGMIAHYIHWS--YLLLIPMITIITVPFLMKLLKKEVRIKGHFDIKGII 205

Query: 230 MLSLAIGALQMLLDRGESLDWFSSREIVIEAIIAGMAFYLFVAHIFTHKHPFIEPGLFKD 289
++S+ I + F++ + I++ ++F +FV HI PF++PGL K+
Sbjct: 206 LMSVGIVFFML----------FTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGLGKN 255

Query: 290 RNFSVGLIFIFIIGIILLATMALLPPFMQNLLGYPVIDVGY-LLAPRGVGTMIAMMTVGK 348
F +G++ II + ++++P M+++ ++G ++ P + +I G
Sbjct: 256 IPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGI 315

Query: 349 LAGKVDVRYQIFLGLMLTILSLWEMTGFNTNITSWDIVRTGIIQGLGLGFIFVPLSTITF 408
L + Y + +G+ +S F TSW + + GL F +STI
Sbjct: 316 LVDRRGPLYVLNIGVTFLSVSFLTA-SFLLETTSWFMTIIIVFVLGGLSFTKTVISTIVS 374

Query: 409 ATLAAKYRNEGTALFSLMRNIGSSIGISVV 438
++L + G +L + + GI++V
Sbjct: 375 SSLKQQEAGAGMSLLNFTSFLSEGTGIAIV 404


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr7_1879  HTHFIS  81     3e-18    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 80.6 bits (199), Expect = 3e-18
Identities = 32/104 (30%), Positives = 51/104 (49%)

Query: 8 ILVIDDDLVTNQILTAFIHSKGWGVITCCNLEEAYEEINQQNIELILLDYYLPDGTALTL 67
ILV DDD +L + G+ V N + I + +L++ D +PD A L
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 68 LERLRYREPTVPVIVISADNEYQKILSCFRLGALDFIIKPINLE 111
L R++ P +PV+V+SA N + + GA D++ KP +L
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLT 109


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr7_1880  HTHFIS  86     7e-23    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 85.7 bits (212), Expect = 7e-23
Identities = 29/122 (23%), Positives = 53/122 (43%), Gaps = 3/122 (2%)

Query: 1 MSK-KILIVDDSAAIRQMVEATLKSANYQVVLAKDGREALDLCGGQRFDFILTDQNMPRM 59
M+ IL+ DD AAIR ++ L A Y V + + D ++TD MP
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 60 DGLTLIKSLRAMTAFMRTPIVMLTTEAGEDMKAQGRAAGATGWMVKPFDPQKLLAITAKV 119
+ L+ ++ P+++++ + + GA ++ KPFD +L+ I +
Sbjct: 61 NAFDLLPRIKKA--RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118

Query: 120 LG 121
L
Sbjct: 119 LA 120


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr7_1881  PF06580  39     6e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 38.7 bits (90), Expect = 6e-05
Identities = 15/70 (21%), Positives = 32/70 (45%), Gaps = 10/70 (14%)

Query: 452 EIDKGMIEKLVDPLT--HLVRNSLDHGIEKPEKRLAAGKSEAGVLSLKASQRGGSIVIAV 509
+I+ +++ V P+ LV N + HGI + + G + LK ++ G++ + V
Sbjct: 245 QINPAIMDVQVPPMLVQTLVENGIKHGIAQ--------LPQGGKILLKGTKDNGTVTLEV 296

Query: 510 HDDGGGLNRE 519
+ G +
Sbjct: 297 ENTGSLALKN 306


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr7_1886  HTHFIS  67     2e-14    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 66.8 bits (163), Expect = 2e-14
Identities = 29/107 (27%), Positives = 49/107 (45%), Gaps = 7/107 (6%)

Query: 3 IKVLVVDDSALIRNLLGKMIE-ADSELSLVGMAADAYMAKDMVNQHRPDVITLDIEMPKV 61
+LV DD A IR +L + + A ++ + AA + D++ D+ MP
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATL---WRWIAAGDGDLVVTDVVMPDE 60

Query: 62 DGLTFLDRLMKARPTAVVMISSLTEEG-ADATFNALALGAVDFIPKP 107
+ L R+ KARP V++ ++ + A GA D++PKP
Sbjct: 61 NAFDLLPRIKKARPDLPVLV--MSAQNTFMTAIKASEKGAYDYLPKP 105


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr7_1887  HTHFIS  69     2e-14    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 69.1 bits (169), Expect = 2e-14
Identities = 36/199 (18%), Positives = 70/199 (35%), Gaps = 19/199 (9%)

Query: 255 KILLVDDQQSMVDYFSSLLRSHGLMVKGMTKPEQVLPTLEQFEPDLFIFDLYMPDVNGLE 314
IL+ DD ++ + L G V+ + + + + DL + D+ MPD N +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 315 LAKMIRQLDKYSSSPILVLSSDDTMQNKVSIIQAGSDDLISKQTAP--SLFVTQVISRAQ 372
L I++ P+LV+S+ +T + + G+ D + K + +
Sbjct: 65 LLPRIKKARPD--LPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122

Query: 373 RGHDIRSSASRDSLTGLLNHTQILVAARRCFNLAKRINSSVCIAMLDLDHFKQVNDTYGH 432
+ + L+ + + R + + ++ I G
Sbjct: 123 KRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMIT--------------GE 168

Query: 433 SGGDKVLLAFAHLLQQSLR 451
SG K L+A A L R
Sbjct: 169 SGTGKELVARA-LHDYGKR 186



Score = 56.0 bits (135), Expect = 2e-10
Identities = 26/123 (21%), Positives = 54/123 (43%), Gaps = 1/123 (0%)

Query: 131 RIAIIEDDNNVGAMITKQLHEFGFNVQHFLNFTDFLEIQNTSPFDLILLDLILPDYTEEA 190
I + +DD + ++ + L G++V+ N DL++ D+++PD
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 191 LFTAATEFEKHNTRVFVLSSRGDFEMRLLAIRANVSEYFVKPAETTLLVRKIHQWLKMSE 250
L + + + V V+S++ F + A +Y KP + T L+ I + L +
Sbjct: 65 LLPRIKK-ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 251 KQP 253
++P
Sbjct: 124 RRP 126


77  Shewmr7_2224  Shewmr7_2231  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr7_2224  -1      16       -3.516215  hypothetical protein
Shewmr7_2225  -1      21       -4.276516  L-serine ammonia-lyase
Shewmr7_2226  0       21       -3.457755  beta-hexosaminidase
Shewmr7_2227  0       23       -4.221417  hypothetical protein
Shewmr7_2228  -1      22       -3.379221  hypothetical protein
Shewmr7_2229  -2      24       -3.152421  acylphosphatase
Shewmr7_2230  -1      22       -2.603412  hypothetical protein
Shewmr7_2231  -1      20       -2.032182  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits        Score  E-value  Comments
Shewmr7_2224  GPOSANCHOR  34     0.002    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 33.9 bits (77), Expect = 0.002
Identities = 20/68 (29%), Positives = 37/68 (54%), Gaps = 7/68 (10%)

Query: 594 ALEEAKIQ-QAIAEQEAIAAQAKAA--EEAALAKAKAEVEAEAERQRL----EQEEQMKA 646
ALEEA + A+ + ++K +E A +AK E EA+A +++L E+ +++A
Sbjct: 401 ALEEANSKLAALEKLNKELEESKKLTEKEKAELQAKLEAEAKALKEKLAKQAEELAKLRA 460

Query: 647 SEQSQPET 654
+ S +T
Sbjct: 461 GKASDSQT 468



Score = 32.0 bits (72), Expect = 0.010
Identities = 16/61 (26%), Positives = 29/61 (47%), Gaps = 4/61 (6%)

Query: 611 AAQAKAAEEAALAKAKAEVE-AEAERQRLEQEEQMKASEQSQPETGSQEAIATSD-ESLA 668
+ +AK EA K + + + +EA RQ L + + AS +++ + A S +L
Sbjct: 356 SREAKKQLEAEHQKLEEQNKISEASRQSLRR--DLDASREAKKQVEKALEEANSKLAALE 413

Query: 669 K 669
K
Sbjct: 414 K 414



Score = 31.6 bits (71), Expect = 0.010
Identities = 21/93 (22%), Positives = 34/93 (36%), Gaps = 23/93 (24%)

Query: 597 EAKIQQAIAEQEAIAAQAKAAEEAALAKAKAEVEA-EAERQRLEQ--------------- 640
A+ + + A+ KAA EA A + + + A RQ L +
Sbjct: 273 MNFSTADSAKIKTLEAE-KAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAE 331

Query: 641 ----EEQMKASEQSQPETGSQEAIATSDESLAK 669
EEQ K SE S+ + + S E+ +
Sbjct: 332 HQKLEEQNKISEASR--QSLRRDLDASREAKKQ 362


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr7_2225  HTHTETR  39     7e-06    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 38.8 bits (90), Expect = 7e-06
Identities = 13/75 (17%), Positives = 29/75 (38%)

Query: 21 WEQRRDYLTQVALRSLRGHKTFDLCRSHLVQVSQISKGTIYNHFTTEADLIVAVASAQYD 80
++ R ++ VALR + + + +++G IY HF ++DL +
Sbjct: 9 AQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSES 68

Query: 81 EWLCAAKQDALRYPD 95
+ ++P
Sbjct: 69 NIGELELEYQAKFPG 83


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr7_2230  HTHFIS  29     0.048    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 29.0 bits (65), Expect = 0.048
Identities = 10/31 (32%), Positives = 15/31 (48%), Gaps = 1/31 (3%)

Query: 39 KWDKEVEVLIVGSGFAGLAAGIEAIRKGAKD 69
K ++ VL++ S I+A KGA D
Sbjct: 71 KARPDLPVLVM-SAQNTFMTAIKASEKGAYD 100


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_2231  NUCEPIMERASE  30     0.013    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 29.8 bits (67), Expect = 0.013
Identities = 12/28 (42%), Positives = 18/28 (64%), Gaps = 2/28 (7%)

Query: 151 ILVTGASGGVGS-VAVTLLANAGYRVIA 177
LVTGA+G +G V+ LL G++V+
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEA-GHQVVG 29


78  Shewmr7_2278  Shewmr7_2285  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr7_2278  -1      18       3.292873   transglutaminase domain-containing protein
Shewmr7_2279  1       17       1.990883   hypothetical protein
Shewmr7_2280  0       17       1.735958   ATPase
Shewmr7_2281  1       14       1.037148   SEFIR domain-containing protein
Shewmr7_2282  0       15       0.823325   IS4 family transposase
Shewmr7_2283  0       15       1.098281   hypothetical protein
Shewmr7_2284  -1      16       -0.046835  hypothetical protein
Shewmr7_2285  -1      18       1.098669   diguanylate phosphodiesterase
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits        Score  E-value  Comments
Shewmr7_2278  OMPADOMAIN  70     4e-16    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 70.3 bits (172), Expect = 4e-16
Identities = 27/92 (29%), Positives = 42/92 (45%), Gaps = 2/92 (2%)

Query: 142 ELALGMNVQFRTGSSELESHFLPQLDNVVKVMKRSSESN--LELKGYADRRGDLAYNQAL 199
L +V F + L+ LD + + + + + GY DR G AYNQ L
Sbjct: 214 HFTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGL 273

Query: 200 SEQRLLEVRGYLIKQGVAPERITTQAFGARMP 231
SE+R V YLI +G+ ++I+ + G P
Sbjct: 274 SERRAQSVVDYLISKGIPADKISARGMGESNP 305


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr7_2279  HTHFIS  63     7e-14    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 63.3 bits (154), Expect = 7e-14
Identities = 29/134 (21%), Positives = 54/134 (40%), Gaps = 5/134 (3%)

Query: 3 RIAIVEDEAAIRENYKDVLQQHGYSVQTYADRPSAMLAFNTRLPDLAIIDIGLGNEIDGG 62
I + +D+AAIR L + GY V+ ++ + DL + D+ + +E
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE--NA 62

Query: 63 FMLCQSLRAMSNTLPIIFLTARDSDFDTVCGLRLGADDYLSKEVSFPH---LTARLAALF 119
F L ++ LP++ ++A+++ + GA DYL K + R A
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122

Query: 120 RRSELATSQTPQEN 133
+R Q+
Sbjct: 123 KRRPSKLEDDSQDG 136


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_2283  SHAPEPROTEIN  43     2e-06    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 42.8 bits (101), Expect = 2e-06
Identities = 31/131 (23%), Positives = 56/131 (42%), Gaps = 22/131 (16%)

Query: 144 AEAVLA---EKGVAAEGLTSITHAVIGRPVNFQGIGGEESNRQAEAILTLAAKRAGFVDV 200
E +L ++ + + ++ PV + + AI +A+ AG +V
Sbjct: 87 TEKMLQHFIKQVHSNSFMRPSPRVLVCVPVGATQV-------ERRAIRE-SAQGAGAREV 138

Query: 201 AFLFEPLAAGMDYEASLSADQTVLVVDVGGGTTDCSVVKMGPKHQASFDRSADCLGHSGQ 260
+ EP+AA + +S +VVD+GGGTT+ +V+ + + S
Sbjct: 139 FLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLN-----------GVVYSSSV 187

Query: 261 RIGGNDLDIAL 271
RIGG+ D A+
Sbjct: 188 RIGGDRFDEAI 198


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_2285  SHAPEPROTEIN  29     0.039    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 29.3 bits (66), Expect = 0.039
Identities = 16/36 (44%), Positives = 24/36 (66%)

Query: 137 NIVIDIGGGSTEVVLGQKNTPTHLSSLRCGCVSFNE 172
++V+DIGGG+TEV + N + SS+R G F+E
Sbjct: 161 SMVVDIGGGTTEVAVISLNGVVYSSSVRIGGDRFDE 196


79Shewmr7_2555Shewmr7_2566N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
Shewmr7_2555216-0.429784hypothetical protein
Shewmr7_2556218-0.548242hypothetical protein
Shewmr7_2557321-0.694985gonadoliberin III-like protein
Shewmr7_2558423-1.110192hypothetical protein
Shewmr7_2559529-1.283619alpha-L-glutamate ligase-like protein
Shewmr7_2560429-0.986632hypothetical protein
Shewmr7_2561325-0.726948hypothetical protein
Shewmr7_2562232-1.440151hypothetical protein
Shewmr7_2563129-1.486845hypothetical protein
Shewmr7_2564124-1.339980LysR family transcriptional regulator
Shewmr7_2565-116-0.903699SrpA-like protein
Shewmr7_2566018-1.577905methyl-accepting chemotaxis protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr7_2555  HTHFIS  31     0.010    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 30.6 bits (69), Expect = 0.010
Identities = 9/16 (56%), Positives = 14/16 (87%)

Query: 38 LVGESGSGRSLLARAI 53
+ GESG+G+ L+ARA+
Sbjct: 165 ITGESGTGKELVARAL 180


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr7_2556  HTHFIS  29     0.021    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 29.0 bits (65), Expect = 0.021
Identities = 13/28 (46%), Positives = 16/28 (57%)

Query: 42 TLAIVGEAGSGKSTLARILVGAEPRSGG 69
TL I GE+G+GK +AR L R G
Sbjct: 162 TLMITGESGTGKELVARALHDYGKRRNG 189


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_2560  DNABINDINGHU  119    4e-39    Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 119 bits (300), Expect = 4e-39
Identities = 53/88 (60%), Positives = 69/88 (78%)

Query: 2 NKSELIEKIASGADISKAAAGRALDSFIAAVTEGLKEGDKISLVGFGTFEVRERAERTGR 61
NK +LI K+A +++K + A+D+ +AV+ L +G+K+ L+GFG FEVRERA R GR
Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62

Query: 62 NPQTGEEIKIAAAKIPAFKAGKALKDAV 89
NPQTGEEIKI A+K+PAFKAGKALKDAV
Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAV 90


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr7_2561  PF05272  33     0.006    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 32.7 bits (74), Expect = 0.006
Identities = 15/86 (17%), Positives = 35/86 (40%), Gaps = 6/86 (6%)

Query: 286 AEATVVRSYVDWMTSVPWSQRSKIKRDLAKAQEVLDTDHYGLEKVKDRILEYLAVQSRVR 345
A+ V + DW+ + W + ++++ L D+ +++ + V
Sbjct: 527 ADMNRVHPFRDWVKAQQWDEVPRLEKWLVHVLGKTPDDYKPRRLRYLQLVGKYILMGHVA 586

Query: 346 QLKGP------ILCLVGPPGVGKTSL 365
++ P + L G G+GK++L
Sbjct: 587 RVMEPGCKFDYSVVLEGTGGIGKSTL 612


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr7_2562  HTHFIS  30     0.017    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 30.2 bits (68), Expect = 0.017
Identities = 14/70 (20%), Positives = 28/70 (40%), Gaps = 13/70 (18%)

Query: 64 KLPTPHELRAHLDDYVIGQDRAKKVLSVAVYNHYKRLRNSSPKDGVELGKSNILLIGPTG 123
+ P+ E + ++G+ A +Y RL + +++ G +G
Sbjct: 124 RRPSKLEDDSQDGMPLVGRSAA----MQEIYRVLARLMQT---------DLTLMITGESG 170

Query: 124 SGKTLLAETL 133
+GK L+A L
Sbjct: 171 TGKELVARAL 180


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_2566  ENTEROTOXINA  30     0.026    Heat-labile enterotoxin A chain signature.
		>ENTEROTOXINA#Heat-labile enterotoxin A chain signature.

Length = 258

Score = 29.6 bits (66), Expect = 0.026
Identities = 21/71 (29%), Positives = 29/71 (40%), Gaps = 15/71 (21%)

Query: 275 NFFTIRDVLGHYDPET----------VRYFLLSGHYRSQINYSEENLKQARAALERLYTA 324
N F + DVLG Y P + Y + G YR +E L + R +R Y
Sbjct: 111 NMFNVNDVLGVYSPHPYEQEVSALGGIPYSQIYGWYRVNFGVIDERLHRNREYRDRYYR- 169

Query: 325 IKDVDLTVAPA 335
+L +APA
Sbjct: 170 ----NLNIAPA 176


80        Shewmr7_2637        Shewmr7_2647        N        Y        N        Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr7_2637  -2      16       4.840415   periplasmic sensor signal transduction histidine
Shewmr7_2638  -1      26       5.709940   hypothetical protein
Shewmr7_2639  1       26       5.745376   two component transcriptional regulator
Shewmr7_2640  0       25       5.648412   ApbE family lipoprotein
Shewmr7_2641  1       23       6.120723   hypothetical protein
Shewmr7_2642  2       18       3.149001   hypothetical protein
Shewmr7_2643  1       17       1.823682   putative lipoprotein
Shewmr7_2644  1       17       1.370677   hypothetical protein
Shewmr7_2645  1       17       0.467249   redoxin domain-containing protein
Shewmr7_2646  1       17       0.495855   hypothetical protein
Shewmr7_2647  1       17       -0.630203  hydrogenase (NiFe) small subunit HydA
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits        Score  E-value  Comments
Shewmr7_2637  SUBTILISIN  143    8e-41    Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 143 bits (362), Expect = 8e-41
Identities = 71/210 (33%), Positives = 98/210 (46%), Gaps = 24/210 (11%)

Query: 126 AGMKVCIIDSGLDSSNPDFNWNNITG----DNDPGTGNWFQNGGPHGTHVAGTIGAADNN 181
G+KV ++D+G D+ +PD I G D+D G F++ HGTHVAGTI A +N
Sbjct: 41 RGVKVAVLDTGCDADHPDLKARIIGGRNFTDDDEGDPEIFKDYNGHGTHVAGTIAATENE 100

Query: 182 IGVVGMAPGVPMHIVKVFNASGWGYSSDLAYAANKCSNAGAKIISMSLGGGAANNTEKNA 241
GVVG+AP + I+KV N G G + IISMSLGG A
Sbjct: 101 NGVVGVAPEADLLIIKVLNKQGSGQYDWIIQGIYYAIEQKVDIISMSLGGPEDVPELHEA 160

Query: 242 FDAFTAAGGLVVAAAGNDGNSVRS-----YPAGYPSVMMIGANDANNNIADFSQYPSCVS 296
A+ LV+ AAGN+G+ YP Y V+ +GA + + + ++FS
Sbjct: 161 VKKAVASQILVMCAAGNEGDGDDRTDELGYPGCYNEVISVGAINFDRHASEFSNS----- 215

Query: 297 GRGKKAVNDDGICVEVTAGGVDTLSTYPAG 326
V++ A G D LST P G
Sbjct: 216 ----------NNEVDLVAPGEDILSTVPGG 235



Score = 53.3 bits (128), Expect = 8e-10
Identities = 19/70 (27%), Positives = 27/70 (38%), Gaps = 7/70 (10%)

Query: 447 YGFMSGTSMATPAVSGMAALVWSN-----HSQCTGTQIRKALKATAMDAGTVGKDNYFGY 501
Y SGTSMATP V+G AL+ T ++ L + G G
Sbjct: 237 YATFSGTSMATPHVAGALALIKQLANASFERDLTEPELYAQLIKRTIPLG--NSPKMEGN 294

Query: 502 GIVNAKAADA 511
G++ A +
Sbjct: 295 GLLYLTAVEE 304


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_2640  ABC2TRNSPORT  40     1e-05    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 39.5 bits (92), Expect = 1e-05
Identities = 44/166 (26%), Positives = 78/166 (46%), Gaps = 23/166 (13%)

Query: 186 GVILTMTMVMFT----SAAIVREREQGNMEFLITTPVRPLELMLGKI----TPYVLVGFV 237
G++ T M T AA R Q E ++ T +R +++LG++ T L G
Sbjct: 72 GMVATSAMTAATFETIYAAFGRMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGAG 131

Query: 238 QLAIILTAGH-----LLFAVPIRGGLDSIALAAMLFICASLTLGLVISTIAKTQLQSMQM 292
+ G+ LL+A+P+ IAL + F +LG+V++ +A + +
Sbjct: 132 IGVVAAALGYTQWLSLLYALPV------IALTGLAFA----SLGMVVTALAPSYDYFIFY 181

Query: 293 TVFILLPSILLSGFMFPYEAMPVAAQWIAEALPATHFMRMSRAIVL 338
++ P + LSG +FP + +P+ Q A LP +H + + R I+L
Sbjct: 182 QTLVITPILFLSGAVFPVDQLPIVFQTAARFLPLSHSIDLIRPIML 227


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
Shewmr7_2642  RTXTOXIND  59     6e-12    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 59.5 bits (144), Expect = 6e-12
Identities = 36/180 (20%), Positives = 61/180 (33%), Gaps = 11/180 (6%)

Query: 17 PSPRGWGKLLASLLGAALLLQLTACGDESPRVLGTV--ERDRLTLTAPVGELIKRVNVVE 74
PR + L A +L + + G + + ++K + V E
Sbjct: 55 RRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKE 114

Query: 75 GQQVQAGEVLLELDSTAAQARLGQRQAELKQA-------QAKLDEAVTGARSEDIDKARA 127
G+ V+ G+VLL+L + A+A + Q+ L QA Q E
Sbjct: 115 GESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEP 174

Query: 128 ALDGANASVKEARQNFERTQ--QLFKTKVLSQADLDAARAARDTSLAKQAEAEQSLRLLQ 185
+ + + Q K + +LD RA R T LA+ E R+ +
Sbjct: 175 YFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEK 234



Score = 49.8 bits (119), Expect = 6e-09
Identities = 30/242 (12%), Positives = 77/242 (31%), Gaps = 35/242 (14%)

Query: 73 VEGQQVQAGEVLLELDSTAAQARLGQRQAELKQAQAKLDEAVTGARSEDIDKARAALDGA 132
V ++V L++ + Q + Q++ L + +A + A ++
Sbjct: 179 VSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRA------------ERLTVLARINRY 226

Query: 133 NASVKEARQNFERTQQLFKTKVLS--------------QADLDAARAARDTSLAKQAEAE 178
+ + + L + ++ +L ++ + ++ A+
Sbjct: 227 ENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAK 286

Query: 179 QSLRLLQNGTRSEQLEQARAAVEAAMAGVAQEQKALKDLSLVAAK-PA---VVDTLPWRV 234
+ +L+ ++E L++ R + + K + + P V
Sbjct: 287 EEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTE 346

Query: 235 GDRVAAGSQLIGLLAIEHPY-VRVYLPATWLDRVKAGSQVKILVDG----RTQPIAGTVR 289
G V L+ ++ + V + + + G I V+ R + G V+
Sbjct: 347 GGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVK 406

Query: 290 NI 291
NI
Sbjct: 407 NI 408


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr7_2643  HTHTETR  73     9e-18    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 72.7 bits (178), Expect = 9e-18
Identities = 24/151 (15%), Positives = 55/151 (36%), Gaps = 6/151 (3%)

Query: 31 SDARQRLITAAVSLFSERSYPTVSTREIARVAEVDAALIRYYFGSKAGLFEQMVRETLEP 90
+ RQ ++ A+ LFS++ + S EIA+ A V I ++F K+ LF ++ +
Sbjct: 10 QETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESN 69

Query: 91 VITRLREVSTAQAPNN---VGDLMQTYYRVMAPNPGLPRLIVRVLQESDGTEAYRIMLSV 147
+ E + + +++ L+ + + + ++
Sbjct: 70 IGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQA 129

Query: 148 FEQMLSLSRQWLEASF---VSAGILKEGLDP 175
+ S +E + + A +L L
Sbjct: 130 QRNLCLESYDRIEQTLKHCIEAKMLPADLMT 160


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
Shewmr7_2647  RTXTOXIND  33     0.007    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 32.9 bits (75), Expect = 0.007
Identities = 29/179 (16%), Positives = 63/179 (35%), Gaps = 12/179 (6%)

Query: 730 WLGQLKYDVSDWAASRPFFEQYLAYSQTMYALAPEDKDALMELSYAHNTLGS----VSMK 785
L Q +Y + + + + + E ++ L S + K
Sbjct: 147 RLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSE-EEVLRLTSLIKEQFSTWQNQKYQK 205

Query: 786 QQDFTKAQQDFEESLRLKLLALAKAPEDSQLIAD-VADTRSWLASAALSQGDLLSAINIH 844
+ + K + + L + + S++ + D S L A+++ +L N +
Sbjct: 206 ELNLDKKRAERLTVLA----RINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKY 261

Query: 845 IQLQQELGKNIKQPYIL--DRLSASHQILSELYDYQNQIDLSLRQAQLGLEAITRALEQ 901
++ EL Q + + LSA + ++N+I LRQ + +T L +
Sbjct: 262 VEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAK 320


81        Shewmr7_2776        Shewmr7_2779        N        Y        N        Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr7_2776  -1      9        -1.729824  peptidase M22, glycoprotease
Shewmr7_2777  0       10       -2.259536  delta-aminolevulinic acid dehydratase
Shewmr7_2778  0       11       -2.350018  bifunctional methionine sulfoxide reductase B/A
Shewmr7_2779  0       13       -1.831948  2OG-Fe(II) oxygenase
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr7_2776  PF06580  33     0.003    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 32.5 bits (74), Expect = 0.003
Identities = 20/105 (19%), Positives = 34/105 (32%), Gaps = 26/105 (24%)

Query: 327 LISNAIRY----TEPGGKITVQWRSVATGGLFSVTDTGEGIAPQHISRLTERFYRVDSAR 382
L+ N I++ GGKI ++ V +TG
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKN---------------- 306

Query: 383 SRQTGGSGLGLAIVKHALSHHHSE---LNISSELGKGSTFSFVIP 424
+G GL V+ L + + +S + GK + +IP
Sbjct: 307 --TKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAM-VLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr7_2777  HTHFIS  91     2e-23    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 91.4 bits (227), Expect = 2e-23
Identities = 34/130 (26%), Positives = 64/130 (49%), Gaps = 4/130 (3%)

Query: 3 ARILIVEDELAIREMLTFVMEQHGFTTSAAEDFDSAIALLKEPYPDLILLDWMFPGGSGI 62
A IL+ +D+ AIR +L + + G+ + + + DL++ D + P +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 63 QLAKRLKQDEFTRQIPIIMLTARGEEEDKVKGLEVGADDYITKPFSPKELVARIKAVL-- 120
L R+K+ +P+++++A+ +K E GA DY+ KPF EL+ I L
Sbjct: 64 DLLPRIKKAR--PDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 121 RRSAPTRLEE 130
+ P++LE+
Sbjct: 122 PKRRPSKLED 131


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_2778  ECOLNEIPORIN  81     2e-19    E.coli/Neisseria porin superfamily signature.
		>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature.

Length = 331

Score = 80.6 bits (199), Expect = 2e-19
Identities = 77/335 (22%), Positives = 127/335 (37%), Gaps = 33/335 (9%)

Query: 7 KTLLASALASATLASAYAAEPLTVYGKLNV---TAQSNDEKGDST------TTIQSNASR 57
K+L+A LA+ +A A +T+YG + T++S G T I S+
Sbjct: 3 KSLIALTLAALPVA---AMADVTLYGTIKAGVETSRSVAHNGAQAASVETGTGIVDLGSK 59

Query: 58 FGVKGDFELSSSLEAFYTVEYEVDTGAASSDNFKARNQFVGLKGAFGSFSVGRNDTLLKI 117
G KG +L + L+A + VE + A + + R F+GLKG FG VGR +++LK
Sbjct: 60 IGFKGQEDLGNGLKAIWQVEQKASI-AGTDSGWGNRQSFIGLKGGFGKLRVGRLNSVLK- 117

Query: 118 SQGNVDQFNDLSGDL--KSLFKGENRLGQTATYLSPSIGGFVFGATYAAEGDADQQAQDG 175
G+++ ++ S L + + E RL + Y SP G YA +A + +
Sbjct: 118 DTGDINPWDSKSDYLGVNKIAEPEARL-ISVRYDSPEFAGLSGSVQYALNDNAGRHNSES 176

Query: 176 FSLAAMYGDAKLKKSPFYAAIAYDSDVKGYEILRASVQGKIADLTLGGMYQQQEQTYKNA 235
+ Y + A + + I + + ++ +Y ++A
Sbjct: 177 YHAGFNYKNGGFFVQYGGAYKRHHQVQENVNIEKYQIHRLVSGYDNDALYASVAVQQQDA 236

Query: 236 LPV----NTDSVNGYLFSAAYDINAVTLKAQY----------QDMEDLGDSWSVGADYSL 281
V + +S + AY VT + Y + + D VGA+Y
Sbjct: 237 KLVEENYSHNSQTEVAATLAYRFGNVTPRVSYAHGFKGSFDATNYNNDYDQVVVGAEYDF 296

Query: 282 GKPTKVFAFYT--NRSMEASNDDDKYIAVGLEHKF 314
K T S VGL HKF
Sbjct: 297 SKRTSALVSAGWLQEGKGESKFVSTAGGVGLRHKF 331


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits  Score  E-value  Comments
Shewmr7_2779  SECA  30     0.010    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 30.2 bits (68), Expect = 0.010
Identities = 10/41 (24%), Positives = 24/41 (58%), Gaps = 1/41 (2%)

Query: 81 ESLEEKVALIEDEENRKLAKKEKDALKD-EIITSLLPRAFS 120
++E ++ + DEE + + + L+ E++ +L+P AF+
Sbjct: 29 NAMEPEMEKLSDEELKGKTAEFRARLEKGEVLENLIPEAFA 69


82        Shewmr7_2884        Shewmr7_2892        N        Y        N        Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr7_2884  0       14       0.897442   response regulator receiver modulated CheW
Shewmr7_2885  -1      12       1.449966   type 11 methyltransferase
Shewmr7_2886  -1      11       1.720346   peptidase S16, lon domain-containing protein
Shewmr7_2887  -1      11       1.634814   ECF subfamily RNA polymerase sigma-24 factor
Shewmr7_2888  -2      10       1.357076   anti-ECFsigma factor, ChrR
Shewmr7_2889  -1      14       1.006639   hypothetical protein
Shewmr7_2890  0       14       0.903570   hypothetical protein
Shewmr7_2891  0       15       -0.262718  hypothetical protein
Shewmr7_2892  0       12       -1.267826  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr7_2884  HTHTETR  57     3e-12    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 56.9 bits (137), Expect = 3e-12
Identities = 20/134 (14%), Positives = 46/134 (34%), Gaps = 4/134 (2%)

Query: 13 DKRQQLISTAFKLFYFQSVHGVGINQILQESAIAKKTLYHHFASKDELVEAVVQYRDQVF 72
+ RQ ++ A +LF Q V + +I + + + + +Y HF K +L + + +
Sbjct: 11 ETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNI 70

Query: 73 YQWLSER-VQTAETGKAGIRALFMALDDWFNQRVPQLCEFRGCFFINVSAEFTDASHPVH 131
+ E + + +R + + + + + F EF V
Sbjct: 71 GELELEYQAKFPGDPLSVLREILIHVLESTVTE-ERRRLLMEIIF--HKCEFVGEMAVVQ 127

Query: 132 RLCAEHKQRVADLM 145
+ D +
Sbjct: 128 QAQRNLCLESYDRI 141


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_2887  INFPOTNTIATR  137    1e-43    Macrophage infectivity potentiator signature.
		>INFPOTNTIATR#Macrophage infectivity potentiator signature.

Length = 233

Score = 137 bits (347), Expect = 1e-43
Identities = 66/132 (50%), Positives = 88/132 (66%), Gaps = 2/132 (1%)

Query: 25 KAAQENIRLGNEFLAQNKNQEGVKTTASGLQYQVLQQGTGTVHPKASDTVTVHYHGTLID 84
K A+EN G+ FL+ NK++ G+ SGLQY+++ GTG P SDTVTV Y GTLID
Sbjct: 99 KKAEENKAKGDAFLSANKSKPGIVVLPSGLQYKIIDAGTGA-KPGKSDTVTVEYTGTLID 157

Query: 85 GTVFDSSVERGEPIAFPLNRVIKGWTEGVQLMVEGDKYRFFIPSELAYGNRST-GKIGGG 143
GTVFDS+ + G+P F +++VI GWTE +QLM G + F+P++LAYG RS G IG
Sbjct: 158 GTVFDSTEKAGKPATFQVSQVIPGWTEALQLMPAGSTWEVFVPADLAYGPRSVGGPIGPN 217

Query: 144 SVLIFDVELLKV 155
LIF + L+ V
Sbjct: 218 ETLIFKIHLISV 229


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits         Score  E-value  Comments
Shewmr7_2888  MICOLLPTASE  47     3e-07    Microbial collagenase metalloprotease (M9) signature.
		>MICOLLPTASE#Microbial collagenase metalloprotease (M9) signature.

Length = 1104

Score = 47.4 bits (112), Expect = 3e-07
Identities = 43/210 (20%), Positives = 76/210 (36%), Gaps = 19/210 (9%)

Query: 542 WEIDADNGDILNAMHEGLGHGEGTTPPANKAPIANAGTDVTVTGTLDVTLNGSASRDPEN 601
++D + + + + G+ T NK P A +D +V ++ +G+ S+D +
Sbjct: 744 HKVDGNGNYVYDVVFHGMNTDTNTDVHVNKEPKAVIKSDSSVIVEEEINFDGTESKDEDG 803

Query: 602 AALSYQWSQVSGPSLSITNADMANAVVQLSATASEVVYVFSLRVTDPEGLSSTDTVTITH 661
+Y+W G ++ A A + + T Y L VTD G +T++ I
Sbjct: 804 EIKAYEWDFGDG-----EKSNEAKATHKYNKTGE---YEVKLTVTDNNGGINTESKKI-- 853

Query: 662 KAETANQAPVV--SAPASVTVEAGQSVSINATAT---DADGDSLTYAWTVPS----GVAA 712
K V+ S P + +A Q N + S Y + V +
Sbjct: 854 KVVEDKPVEVINESEPNNDFEKANQIAKSNMLVKGTLSEEDYSDKYYFDVAKKGNVKITL 913

Query: 713 SGQNSATLVVTAPAVTQSTQYSLSVLVSDG 742
+ NS + T Y L +DG
Sbjct: 914 NNLNSVGITWTLYKEGDLNNYVLYATGNDG 943


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
Shewmr7_2891  FLAGELLIN  32     0.007    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 31.9 bits (72), Expect = 0.007
Identities = 14/87 (16%), Positives = 33/87 (37%), Gaps = 4/87 (4%)

Query: 282 QLASAMEEMSSTIAEVAQNTQLTSTSINTAYDLCLKSSANMKANTQKVEQLAKSVADAAN 341
+A+ + + ++N + T + + N Q+V +L+ + N
Sbjct: 48 AIANRFTSNIKGLTQASRNANDGISIAQTTE----GALNEINNNLQRVRELSVQATNGTN 103

Query: 342 NAHQLNKEAERVASAMGEIDSIAEQTN 368
+ L + + + EID ++ QT
Sbjct: 104 SDSDLKSIQDEIQQRLEEIDRVSNQTQ 130


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits  Score  E-value  Comments
Shewmr7_2892  SECA  32     0.007    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 31.8 bits (72), Expect = 0.007
Identities = 37/173 (21%), Positives = 67/173 (38%), Gaps = 42/173 (24%)

Query: 220 SQVVYPVEQRRKRELLSELIGK-KNWQQVLVFTATRDAADTLVKELNLDGIPSEVVHGEK 278
+VY E + + ++ ++ + Q VLV T + + ++ + EL GI V++
Sbjct: 424 PDLVYMTEAEKIQAIIEDIKERTAKGQPVLVGTISIEKSELVSNELTKAGIKHNVLN--- 480

Query: 279 AQGSRRRALREFMSGKV-RVLVATEVAARGLDI---------------PSLEYVVNFDLP 322
A+ A +G V +AT +A RG DI P+ E +
Sbjct: 481 AKFHANEAAIVAQAGYPAAVTIATNMAGRGTDIVLGGSWQAEVAALENPTAEQIEKIKAD 540

Query: 323 FLAED---------YV-----H---RI-----GRTGRAGKSGVAISFVSREEE 353
+ ++ H RI GR+GR G +G + ++S E+
Sbjct: 541 WQVRHDAVLEAGGLHIIGTERHESRRIDNQLRGRSGRQGDAGSSRFYLSMEDA 593


83        Shewmr7_3012        Shewmr7_3024        N        Y        N        Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr7_3012  -1      9        2.135612   OmpA/MotB domain-containing protein
Shewmr7_3013  -1      12       2.169208   hypothetical protein
Shewmr7_3014  -2      11       2.292804   ribonuclease T
Shewmr7_3015  -2      11       2.158655   alkyl hydroperoxide reductase/ Thiol specific
Shewmr7_3016  -2      11       1.338324   Na+/H+ antiporter NhaC
Shewmr7_3017  -2      11       0.378862   hypothetical protein
Shewmr7_3018  -2      12       -1.386522  RDD domain-containing protein
Shewmr7_3019  1       20       -3.874815  uracil phosphoribosyltransferase
Shewmr7_3020  1       25       -4.903277  phosphoribosylaminoimidazole synthetase
Shewmr7_3021  1       27       -5.455009  phosphoribosylglycinamide formyltransferase
Shewmr7_3022  1       29       -5.884073  UMP phosphatase
Shewmr7_3023  2       31       -5.689557  peptidase M28
Shewmr7_3024  3       33       -5.837319  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr7_3012  PF00577  33     8e-04    Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 33.3 bits (76), Expect = 8e-04
Identities = 14/101 (13%), Positives = 37/101 (36%), Gaps = 3/101 (2%)

Query: 104 VSYDVTLN--RYNYSGESDLGYFEVTAGVEFSGFRV-AYWYTNDYGGTDLDYHYGEINYS 160
++Y+ + N + G S Y + +G+ +R+ + + +
Sbjct: 187 LNYNFSGNSVQNRIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHI 246

Query: 161 YEFVENWNLDLHYGYNVGDALDDGEGFDSYSDYSVGVSTEF 201
++E + L +GD G+ FD + ++++
Sbjct: 247 NTWLERDIIPLRSRLTLGDGYTQGDIFDGINFRGAQLASDD 287


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_3014  VACCYTOTOXIN  29     0.031    Helicobacter pylori vacuolating cytotoxin signature.
		>VACCYTOTOXIN#Helicobacter pylori vacuolating cytotoxin signature.

Length = 1291

Score = 29.2 bits (65), Expect = 0.031
Identities = 18/65 (27%), Positives = 29/65 (44%), Gaps = 6/65 (9%)

Query: 17 QNAAAKKRQSKP-RLMSLDALRGFDMFWILGGEALFGALLILTGWAGWQWGDTQMHH--- 72
Q A K KP ++ + A +GF+ + L+ +LL GW WG+ H+
Sbjct: 67 QAEEANKTPDKPDKVWRIQAGKGFNE-FPNKEYDLYKSLLSSKIDGGWDWGNAARHYWVK 125

Query: 73 -SEWN 76
+WN
Sbjct: 126 DGQWN 130


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr7_3015  UREASE  31     0.008    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 31.2 bits (71), Expect = 0.008
Identities = 13/26 (50%), Positives = 18/26 (69%)

Query: 334 PAEFLGIAESVGRLAVGQRADLVLLD 359
PA G++ +G L VG+RADLVL +
Sbjct: 413 PAIAHGLSHEIGSLEVGKRADLVLWN 438


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_3018  TRNSINTIMINR  31     0.021    Translocated intimin receptor (Tir) signature.
		>TRNSINTIMINR#Translocated intimin receptor (Tir) signature.

Length = 549

Score = 31.2 bits (70), Expect = 0.021
Identities = 18/45 (40%), Positives = 25/45 (55%), Gaps = 1/45 (2%)

Query: 5 IAATAILLALGLTACSDAP-KTNAVPSSSTAEQAKPNQLTQAQLQ 48
+AAT I AL LT D P T+ +++ AE A +QLTQ +
Sbjct: 248 LAATGIAQALALTPEPDDPTTTDPDQAANAAESATKDQLTQEAFK 292


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_3020  BINARYTOXINA  30     0.009    Clostridial binary toxin A signature.
		>BINARYTOXINA#Clostridial binary toxin A signature.

Length = 454

Score = 30.4 bits (68), Expect = 0.009
Identities = 15/48 (31%), Positives = 25/48 (52%), Gaps = 2/48 (4%)

Query: 247 YFFVSPKRPELAAAILAGLENMISDGSFDEMFNRELKIDKLYRDAQFE 294
Y+F SP++ I +N IS F+E+ +E DKL++ F+
Sbjct: 133 YYFESPEKFAFNKEIRTENQNEISLEKFNEL--KETIQDKLFKQDGFK 178


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits         Score  E-value  Comments
Shewmr7_3022  THERMOLYSIN  31     0.005    Thermolysin metalloprotease (M4) family signature.
		>THERMOLYSIN#Thermolysin metalloprotease (M4) family signature.

Length = 544

Score = 30.8 bits (69), Expect = 0.005
Identities = 20/97 (20%), Positives = 35/97 (36%), Gaps = 10/97 (10%)

Query: 35 LQVQEFMSAATSFPIVFTKNNQTGEFVSIAITA----------LKPNTNKLLKNGQWQSR 84
L F ++A +V+ + +T FVS ++ L N GQ + R
Sbjct: 16 LMAWPFGASAKGKSMVWNEQWKTPSFVSGSLLGRCSQELVYRYLDQEKNTFQLGGQARER 75

Query: 85 YLPIQVQLYPLGMTHVDEEKIILGIDINNSSVAENET 121
I +L LG T + E+ I + + +
Sbjct: 76 LSLIGNKLDELGHTVMRFEQAIAASLCMGAVLVAHVN 112


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_3024  ACRIFLAVINRP  30     0.046    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 30.2 bits (68), Expect = 0.046
Identities = 28/134 (20%), Positives = 50/134 (37%), Gaps = 29/134 (21%)

Query: 66 SVVDAVTAEDIGKFPDGDVGESLGRIPGVAVNRQFGQGQQVSIRGASSQLTSTLLNGHSV 125
S T +DI + +V ++L R+ GV + FG + I
Sbjct: 144 SDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQYAMRI----------------- 186

Query: 126 ASTGWFDQQAIDRSFNYSLLPPEMVGGIQVYKSSQADIAEGGIGGT-VIVKTRKPLDLEA 184
W D + Y L P +++ + K IA G +GGT + + + A
Sbjct: 187 ----WLDADLL---NKYKLTPVDVINQL---KVQNDQIAAGQLGGTPALPGQQLNASIIA 236

Query: 185 NSVFLSAKGDYGTV 198
+ F + + ++G V
Sbjct: 237 QTRFKNPE-EFGKV 249


84        Shewmr7_3167        Shewmr7_3174        N        Y        N        Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr7_3167  5       28       -1.731946  hypothetical protein
Shewmr7_3168  2       25       -1.220392  decaheme cytochrome c MtrF
Shewmr7_3169  -1      24       -0.825245  hypothetical protein
Shewmr7_3170  0       21       0.412429   decaheme cytochrome c
Shewmr7_3171  2       21       1.713793   hypothetical protein
Shewmr7_3172  1       19       1.989311   decaheme cytochrome c
Shewmr7_3173  1       19       2.797477   hypothetical protein
Shewmr7_3174  0       19       2.932495   cytochrome C family protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_3167  BCTERIALGSPG  33     4e-04    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 32.6 bits (74), Expect = 4e-04
Identities = 12/46 (26%), Positives = 23/46 (50%), Gaps = 2/46 (4%)

Query: 12 QTGFTLIELMISLT-LGLVVMLGASQIFVSVNKAYVETQRFSQLQG 56
Q GFTL+E+M+ + +G++ L + + KA + S +
Sbjct: 7 QRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAV-SDIVA 51


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_3168  BCTERIALGSPG  30     0.002    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 29.9 bits (67), Expect = 0.002
Identities = 13/23 (56%), Positives = 17/23 (73%), Gaps = 2/23 (8%)

Query: 4 RKQKGFSLIEIMVTSFIVAFGIL 26
KQ+GF+L+EIMV IV G+L
Sbjct: 5 DKQRGFTLLEIMVV--IVIIGVL 25


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_3169  BCTERIALGSPG  38     2e-06    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 38.3 bits (89), Expect = 2e-06
Identities = 17/51 (33%), Positives = 30/51 (58%)

Query: 3 TKKILGFTLTELMVVVAIVAIIAGIAAPSFASMIRENTARTQVNELLALTN 53
T K GFTL E+MVV+ I+ ++A + P+ + + V++++AL N
Sbjct: 4 TDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALEN 54


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_3170  BCTERIALGSPG  42     9e-08    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 41.8 bits (98), Expect = 9e-08
Identities = 18/55 (32%), Positives = 29/55 (52%), Gaps = 3/55 (5%)

Query: 3 LSKIKVNTGFTLIELMIAIAIVGILASIALPSYQEHVRNTRRTDARD---ALSNA 54
+ GFTL+E+M+ I I+G+LAS+ +P+ + + A AL NA
Sbjct: 1 MRATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENA 55


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_3174  FERRIBNDNGPP  38     2e-05    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 38.4 bits (89), Expect = 2e-05
Identities = 46/196 (23%), Positives = 74/196 (37%), Gaps = 19/196 (9%)

Query: 4 RRFI-ALGLSLALLPI---AAMAEPAKRIIALSPHAVEMLYAIGAGESIVAATDYADY-- 57
RR + A+ LS L + A A RI+AL VE+L A+G D +Y
Sbjct: 10 RRLLTAMALSPLLWQMNTAHAAAIDPNRIVALEWLPVELLLALGI--VPYGVADTINYRL 67

Query: 58 ----PEAAKKIPSIGGYYGIQIERVLELNPDLIVVWDTGNKA--EDINQL-KSLGFKLYS 110
P + +G +E + E+ P + VW G E + ++ GF +S
Sbjct: 68 WVSEPPLPDSVIDVGLRTEPNLELLTEMKPSFM-VWSAGYGPSPEMLARIAPGRGFN-FS 125

Query: 111 SSPKMLEDVAKEIEELGALTGRTEQASQVAADYRNQLLQLRSENAAKSE-PKVFYQLWST 169
+ L K + E+ L A A Y + + ++ + P + L
Sbjct: 126 DGKQPLAMARKSLTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDP 185

Query: 170 PLMTV-AKNSWIQQII 184
M V NS Q+I+
Sbjct: 186 RHMLVFGPNSLFQEIL 201


85        Shewmr7_3199        Shewmr7_3205        N        Y        N        Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr7_3199  0       14       -1.087245  AzlC family protein
Shewmr7_3200  0       14       -1.285384  branched-chain amino acid transport
Shewmr7_3201  0       14       -0.973473  major facilitator transporter
Shewmr7_3202  0       15       -0.797377  N-acetylglucosamine 6-phosphate deacetylase
Shewmr7_3203  -1      15       -0.085340  N-acetylglucosamine kinase
Shewmr7_3204  -1      15       0.356839   sugar isomerase (SIS)
Shewmr7_3205  -2      14       0.727361   tagatose-bisphosphate aldolase noncatalytic
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits         Score  E-value  Comments
Shewmr7_3199  BACINVASINC  29     0.048    Salmonella/Shigella invasin protein C signature.
		>BACINVASINC#Salmonella/Shigella invasin protein C signature.

Length = 409

Score = 29.1 bits (64), Expect = 0.048
Identities = 37/184 (20%), Positives = 69/184 (37%), Gaps = 4/184 (2%)

Query: 229 IKNMADSLNDIAKGEGDLTKRLSVKGEDEIAQLGQAFNLFVDKLQTIIGDVANATAKVKS 288
IKN+ + N + G + S+ + NL L++ G A + +K+
Sbjct: 223 IKNVLNGQNSVKLGAEGVDSLKSLNMKK--TGTDATKNLNDATLKSNAGTSATESLGIKN 280

Query: 289 AANAIHDQTKVMSSQLLSHNNETDQVVTAITEMSSTASEVAQNTTQVAEATHAATGDVAN 348
+ I + + + S+ L ++ +M+ + Q T + G +A
Sbjct: 281 SNKQISPEHQAILSKRLESVESDIRLEQNTMDMTRIDARKMQMTGDLIMKNSVTVGGIAG 340

Query: 349 AQRCVDASLEEIAGLMAQINNAAGSIKS--LSEQSQKINSVLSVIGGIAEQTNLLALNAA 406
A R A+ E ++Q+NN S S E S+K S++ + E N +A
Sbjct: 341 ASRQYAATQERSEQQISQVNNRVASTASDEARESSRKSTSLIQEMLKTMESINQSKASAL 400

Query: 407 IEAA 410
A
Sbjct: 401 AAIA 404


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits        Score  E-value  Comments
Shewmr7_3201  SUBTILISIN  109    9e-28    Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 109 bits (274), Expect = 9e-28
Identities = 60/265 (22%), Positives = 94/265 (35%), Gaps = 80/265 (30%)

Query: 201 GPELIQANKIWSGEATDSVPYKGEGIIVGIVDTGINSDHRSFADVGDDGYDHTNPWNAGN 260
G E+IQA +W+ +G G+ V ++DTG ++DH
Sbjct: 25 GVEMIQAPAVWNQ-------TRGRGVKVAVLDTGCDADHPDLKA---------------- 61

Query: 261 YVGDCTKSGFETMCNDKLIGVRSYPVITDNFTSGVYGATRPAVGEDYQGHGTHVASTAAG 320
++IG R++ + DY GHGTHVA T A
Sbjct: 62 ----------------RIIGGRNFTDDDEGDPEIFK---------DYNGHGTHVAGTIAA 96

Query: 321 NVLLDVDYVNPDSGTEASDGSVIKPKLFPRMSGVAPHANIVAYQVCHPSNAINAGCPGEA 380
++G + GVAP A+++ +V + +
Sbjct: 97 T--------ENENG----------------VVGVAPEADLLIIKVLNK----QGSGQYDW 128

Query: 381 LIAGIEDAINDGVDVINFSIGGQDSNPWADDVELAFLSAREAGISVAVAAGNSGQPVGYK 440
+I GI AI VD+I+ S+GG + P + A A + I V AAGN G
Sbjct: 129 IIQGIYYAIEQKVDIISMSLGGPEDVPELHE---AVKKAVASQILVMCAAGNEGDGDDRT 185

Query: 441 EYFGRIDHASPWLMNVAASTHAREV 465
+ G +++V A R
Sbjct: 186 DELGYPG-CYNEVISVGAINFDRHA 209



Score = 58.7 bits (142), Expect = 7e-11
Identities = 32/173 (18%), Positives = 59/173 (34%), Gaps = 41/173 (23%)

Query: 594 DGRYDNGLAGYGLSDWLAKGSNHMLTISGTTIERTMDPERADWLAAFSSRGPSPSTPEAL 653
+G D+ G N ++++ +R + FS+
Sbjct: 178 EGDGDDRTDELGYPGCY----NEVISVGAINFDRHA--------SEFSNSNNEVD----- 220

Query: 654 IPAVAAPGVDIYAAFADEHPFSSSAASGDFAFLSGTSMASPHVAGSMALLRQAQPSWSAT 713
+ APG DI + G +A SGTSMA+PHVAG++AL++Q +
Sbjct: 221 ---LVAPGEDILSTVPG----------GKYATFSGTSMATPHVAGALALIKQLANASFER 267

Query: 714 EIQSALAMTAENKVQYYRLDDKNGDVALASTYRA-GTGRINVANAVNAGFVMD 765
++ + L ++ + G G + + + D
Sbjct: 268 DLTEPELYAQL----------IKRTIPLGNSPKMEGNGLLYLTAVEELSRIFD 310


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr7_3204  HTHFIS  85     4e-21    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 84.9 bits (210), Expect = 4e-21
Identities = 32/134 (23%), Positives = 61/134 (45%), Gaps = 4/134 (2%)

Query: 13 ILLIEDDLPLAELVCTYLTQEGYKLIHLDNAEDALIRQDTDDFDLIICDVMLPGQDGFSI 72
IL+ +DD + ++ L++ GY + NA D DL++ DV++P ++ F +
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 73 YPQLAASYP-CPIIFLTALDNHADQIRGLNLGACDYLLKPVVPP---LLLARIKANLRKQ 128
P++ + P P++ ++A + I+ GA DYL KP ++ R A +++
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRR 125

Query: 129 QSSSSRSKLKLHDL 142
S L
Sbjct: 126 PSKLEDDSQDGMPL 139


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr7_3205  PF06580  30     0.019    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 29.8 bits (67), Expect = 0.019
Identities = 11/76 (14%), Positives = 25/76 (32%), Gaps = 8/76 (10%)

Query: 275 YQPKLNIAFHYEPQQAHFDPIAMTIAIQNLIQNAMRFA------QHEIHVHFYRQGNINR 328
++ +L P M + Q L++N ++ +I + +
Sbjct: 236 FEDRLQFENQINPAIMDVQVPPMLV--QTLVENGIKHGIAQLPQGGKILLKGTKDNGTVT 293

Query: 329 ISVEDDGPGFEGKGKK 344
+ VE+ G K+
Sbjct: 294 LEVENTGSLALKNTKE 309


86        Shewmr7_3231        Shewmr7_3235        N        Y        N        Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr7_3231  -1      13       0.712737   hypothetical protein
Shewmr7_3232  -1      12       0.844028   transcriptional regulator PhoU
Shewmr7_3233  -1      16       1.096300   phosphate transporter ATP-binding protein
Shewmr7_3234  -1      16       0.820221   phosphate ABC transporter inner membrane subunit
Shewmr7_3235  -2      18       1.067790   binding-protein-dependent transport systems
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr7_3231  HTHFIS  79     1e-18    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 79.5 bits (196), Expect = 1e-18
Identities = 31/169 (18%), Positives = 60/169 (35%), Gaps = 23/169 (13%)

Query: 10 TLLLVDDEPVNLRVLKQVLHQ-DYHLIFAKSGEEALRLAQTELPSLILLDIMMPNMTGLE 68
T+L+ DD+ VL Q L + Y + + R L++ D++MP+ +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 69 VCQLLKNIPETQSIPVIFVTALNDEHDEAAGFAVGGVDYIVKPISATIVKARVKTHLSLV 128
+ +K +PV+ ++A N G DY+ KP T + + L+
Sbjct: 65 LLPRIKK--ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122

Query: 129 QADELRRTR---------------LQVIQRLGRAAEYKDN-----ETGT 157
+ + ++ + L R + E+GT
Sbjct: 123 KRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGT 171


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr7_3232  HTHFIS  65     1e-12    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 64.9 bits (158), Expect = 1e-12
Identities = 27/122 (22%), Positives = 46/122 (37%), Gaps = 5/122 (4%)

Query: 785 TLLVVDDIQQNIDLLSVWLTRQGHKVITARDGEQALLRMQKADIDITLMDLQMPVMDGLT 844
T+LV DD +L+ L+R G+ V + + D D+ + D+ MP +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 845 AAKMRREQEAESQLPHMPIIALTASVLEQDKSAAEQAGMDGFANKPIDFALLTREIARVL 904
+ P +P++ ++A A + G + KP D L I R L
Sbjct: 65 L-----LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119

Query: 905 QL 906

Sbjct: 120 AE 121


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
Shewmr7_3234  RTXTOXIND  42     3e-06    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 42.1 bits (99), Expect = 3e-06
Identities = 24/121 (19%), Positives = 47/121 (38%), Gaps = 7/121 (5%)

Query: 106 DYEADLMQAEATLAQATAALNEEIARGEVAKIEFKGYDKGLPPELGLRIPQLKKEQANVK 165
+ E ++A L + L + + AK E++ + E+ + +L++ N+
Sbjct: 256 EQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEI---LDKLRQTTDNIG 312

Query: 166 YAQAALARAQRNLERTVIRAPFDGIIKARNV-DLGQYVTLGTNLGELY---DTRIAEIRL 221
LA+ + + +VIRAP ++ V G VT L + DT +
Sbjct: 313 LLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALV 372

Query: 222 P 222

Sbjct: 373 Q 373



Score = 38.3 bits (89), Expect = 5e-05
Identities = 25/125 (20%), Positives = 53/125 (42%), Gaps = 11/125 (8%)

Query: 62 GVVTPKYKTQLVTEVQGRMLSISPQFVA-GGIVKKGDQLAQIEPSDYEADLMQAEATLAQ 120
G +T +++ + ++ + + V G V+KGD L ++ EAD ++ +++L Q
Sbjct: 88 GKLTHSGRSKEIKPIENSI--VKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQ 145

Query: 121 ATA------ALNEEIARGEVAKIEFKGYD--KGLPPELGLRIPQLKKEQANVKYAQAALA 172
A L+ I ++ +++ + + E LR+ L KEQ + Q
Sbjct: 146 ARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQK 205

Query: 173 RAQRN 177
+
Sbjct: 206 ELNLD 210


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_3235  ACRIFLAVINRP  500    e-162    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 500 bits (1290), Expect = e-162
Identities = 210/1043 (20%), Positives = 445/1043 (42%), Gaps = 49/1043 (4%)

Query: 11 FARNSVAANLLMWALLIGGLFSTVLINKEVFPSFNLNLLSITVAYPGAAPQEIEEGINIK 70
F R + A +L L++ G + + + +P+ +S++ YPGA Q +++ +
Sbjct: 5 FIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQV 64

Query: 71 IEEAIQDINGIKKVTSVA-SEGVGSITVEVEDDYDVQTVLDEAKLRLDAI-STFPVNIEK 128
IE+ + I+ + ++S + S G +IT+ + D + + +L P +++
Sbjct: 65 IEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQQ 124

Query: 129 PQIYKIEPENNVIWV----SVYGDMSLHDMKELAKS-VRDDLTQLPAVTRAKVTGVRDYE 183
I + ++ + V S + D+ + S V+D L++L V ++ G Y
Sbjct: 125 QGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFG-AQYA 183

Query: 184 IGIEVSEDKLREYGLTFSQVALAVQNSSIDLPGGSIRAEDG------DILLRTKGQAYTG 237
+ I + D L +Y LT V ++ + + G + + + + +
Sbjct: 184 MRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKNP 243

Query: 238 DDFANIVVTTRADGSRVMLPQVATIKDDFEERLEYTRFNGKPAAIIEVTSVNDQNALDIA 297
++F + + +DGS V L VA ++ E R NGKPAA + + NALD A
Sbjct: 244 EEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALDTA 303

Query: 298 QQVKDYVEKRRATLPANAKLDTWGDLTHYLKGRLNMMMSNMFYGALLVFVILALFL-DLK 356
+ +K + + + P K+ D T +++ ++ ++ +F +LVF+++ LFL +++
Sbjct: 304 KAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNMR 363

Query: 357 LAFWVMMGLPVCFLGTMLIMPLEPFSMTINMLTLFAFILVLGIVVDDAIVIGESAYSE-V 415
+ +PV LGT I+ F +IN LT+F +L +G++VDDAIV+ E+ +
Sbjct: 364 ATLIPTIAVPVVLLGTFAILAA--FGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMM 421

Query: 416 ERHGHSIDNVIRGAQKVAMPATFGVLTTIAAFIPMLMVSGPMGIIWKSIGMVVIMCLAFS 475
E + + ++ + A FIPM G G I++ + ++ +A S
Sbjct: 422 EDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALS 481

Query: 476 LVESKFILPAHLAHM-KFRKPGE---PTGFFGRFKDRFNNRVQHFIHHSYRNFLERCIQH 531
++ + + PA A + K GFFG F F++ V H Y N + + +
Sbjct: 482 VLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNH-----YTNSVGKILGS 536

Query: 532 RYNVVAAFIGVLILSVALVVSGKVRWVFFPDIPSDFIQVQLEMDEGSSEQNTLKVVQDIE 591
+ + ++ V L + ++ F P+ +++ G++++ T KV+ +
Sbjct: 537 TGRYLLIYALIVAGMVVLFL--RLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVT 594

Query: 592 EALYKMNAKMEKDNGSEVVKHSFINMSSRTSAFIFAELTKGEDREVDGET---IAAAWRE 648
+ K + + V SF ++ + F L E+R D + + +
Sbjct: 595 DYYLKNEKANVESVFT-VNGFSFSGQ-AQNAGMAFVSLKPWEERNGDENSAEAVIHRAKM 652

Query: 649 QLPELLSVKKLDFNAS-----GNGGGGGDISFRLTSSDLEELSAAARELKQKLATY-EGV 702
+L ++ + FN G G + L+ A +L A + +
Sbjct: 653 ELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASL 712

Query: 703 YDIADNFSSGSHEIRLKI-RPEAEALGLTLSDLARQVRYGFYGYEAQRILRNKEEIKVMV 761
+ N + + +L++ + +A+ALG++LSD+ + + G + K+ V
Sbjct: 713 VSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYV 772

Query: 762 RYPLEQRRTVGYLENMLIRTPQGKSVPFSTVAEVEKGESYASITRVDGKRAITIIANANK 821
+ + R ++ + +R+ G+ VPFS + R +G ++ I
Sbjct: 773 QADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGE--- 829

Query: 822 HKVEPSKVVNEIQKDFLPQLQAKYPK-IQTTLDGGSLDEQNAMVGLMQGFFFALFTIYAL 880
P + + L +K P I G S E+ + + ++
Sbjct: 830 --AAPGTSSGDAM-ALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLC 886

Query: 881 MAVPLKSYSQPLIIMSVIPFGIIGALFGHLIQGLAMSVLSLCGIVALAGVVVNDSLILVD 940
+A +S+S P+ +M V+P GI+G L + V + G++ G+ +++++V+
Sbjct: 887 LAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVE 946

Query: 941 FVNRARE-QGLSIKQAAVDSGCYRFRAIILTSLTTFVGLVPIILERSLQAQIVIPMATSL 999
F E +G + +A + + R R I++TSL +G++P+ + + + +
Sbjct: 947 FAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGV 1006

Query: 1000 AFGILFSTVVTLILVPLLYIILD 1022
G++ +T++ + VP+ ++++
Sbjct: 1007 MGGMVSATLLAIFFVPVFFVVIR 1029


87  Shewmr7_3305  Shewmr7_3313  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr7_3305  -2      14       2.129723   hypothetical protein
Shewmr7_3306  -2      14       1.965418   hypothetical protein
Shewmr7_3307  -2      14       1.959934   fructokinase
Shewmr7_3308  1       19       3.107415   3'(2'),5'-bisphosphate nucleotidase
Shewmr7_3309  1       21       2.843114   dTDP-4-dehydrorhamnose reductase
Shewmr7_3310  2       22       2.857322   putative hydrolase
Shewmr7_3311  0       22       2.504810   SNF2-like protein
Shewmr7_3312  0       25       2.750464   hypothetical protein
Shewmr7_3313  0       19       2.503866   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_3305  OUTRMMBRANEA  28     0.041    Outer membrane protein A signature.
		>OUTRMMBRANEA#Outer membrane protein A signature.

Length = 346

Score = 28.4 bits (63), Expect = 0.041
Identities = 19/107 (17%), Positives = 38/107 (35%), Gaps = 18/107 (16%)

Query: 213 ISFNKGCYMGQETVARMKYRGGNKRALYILHGTT-SLNINLETGIEIELEDGYRKGGQII 271
++ G MG + + RM Y+G + Y G + + ++ D Y + G
Sbjct: 66 VNPYVGFEMGYDWLGRMPYKGSVENGAYKAQGVQLTAKLGYPITDDL---DIYTRLG--- 119

Query: 272 EFVQRGNQVLLTAVLANDTQNDAKLRFADDEQSSLRIQALPYSLEDE 318
V DT+++ + D S + + Y++ E
Sbjct: 120 -----------GMVWRADTKSNVYGKNHDTGVSPVFAGGVEYAITPE 155


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr7_3306  HTHFIS  90     4e-22    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 90.3 bits (224), Expect = 4e-22
Identities = 39/159 (24%), Positives = 65/159 (40%), Gaps = 6/159 (3%)

Query: 1 MDKATILVVDDTPENIDILVGILG-EDYKVKVAIDGPRALALVAKTLPDLILLDVMMPGM 59
M ATILV DD +L L Y V++ + +A DL++ DV+MP
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 60 NGYEVCKLLKQEPLTCHIPVIFVTALSEVADETQGFELGAVDYITKPVSAPVVKARVRTH 119
N +++ +K+ +PV+ ++A + + E GA DY+ KP + +
Sbjct: 61 NAFDLLPRIKKARP--DLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118

Query: 120 LALYDQKRLLEQQVKERTQEL--EETRF-EIIRRLGRAA 155
LA ++ + + L EI R L R
Sbjct: 119 LAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLM 157


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr7_3307  HTHFIS  78     9e-17    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 78.3 bits (193), Expect = 9e-17
Identities = 34/139 (24%), Positives = 61/139 (43%), Gaps = 5/139 (3%)

Query: 1283 SILVADDNATARDIMRTTLESMGFRVDTVRSGEEAVTRCSQQEYAVALIDWKMPNLDGIE 1342
+ILVADD+A R ++ L G+ V + + + + + D MP+ + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 1343 TAKQIKQLAKNAPRILMVSAHATQEFLSQIEAL--GLAGYISKPISASRLLDGIMNSLGR 1400
+IK+ + P ++M SA T + I+A G Y+ KP + L+ I +L
Sbjct: 65 LLPRIKKARPDLPVLVM-SAQNTFM--TAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 1401 AGVLPVRRNSESIDPKLLL 1419
P + +S D L+
Sbjct: 122 PKRRPSKLEDDSQDGMPLV 140



Score = 69.5 bits (170), Expect = 6e-14
Identities = 26/103 (25%), Positives = 44/103 (42%), Gaps = 2/103 (1%)

Query: 1425 RILLVEDNEMNLEVATEFLEQVGIILSIATNGQIALDKLAQQSFDLVLMDCQMPVMDGYQ 1484
IL+ +D+ V + L + G + I +N +A DLV+ D MP + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 1485 ATQAIRKRPELAELPVIAMTANAMAGDKEMCLKAGMNDHIAKP 1527
I+K +LPV+ M+A + G D++ KP
Sbjct: 65 LLPRIKKA--RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKP 105


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_3312  BCTERIALGSPG  53     3e-12    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 53.4 bits (128), Expect = 3e-12
Identities = 19/64 (29%), Positives = 38/64 (59%)

Query: 5 RKGFTLIELMIAVAIIGILAAIAIPSFNEYLKQGRRFDAQQYLVSSAQALERHYSRNGLY 64
++GFTL+E+M+ + IIG+LA++ +P+ ++ + A +V+ AL+ + N Y
Sbjct: 7 QRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLDNHHY 66

Query: 65 PASQ 68
P +
Sbjct: 67 PTTN 70


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_3313  BCTERIALGSPG  40     4e-07    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 40.3 bits (94), Expect = 4e-07
Identities = 16/31 (51%), Positives = 22/31 (70%)

Query: 2 ANRTNAGFTLVELMVAIAIIGILASIALPSY 32
A GFTL+E+MV I IIG+LAS+ +P+
Sbjct: 3 ATDKQRGFTLLEIMVVIVIIGVLASLVVPNL 33


88  Shewmr7_3364  Shewmr7_3371  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr7_3364  0       12       1.216424   transcriptional regulator
Shewmr7_3365  -1      12       1.997170   beta-ketoacyl synthase
Shewmr7_3366  -2      12       1.850126   omega-3 polyunsaturated fatty acid synthase
Shewmr7_3367  -3      13       1.941302   Beta-hydroxyacyl-(acyl-carrier-protein)
Shewmr7_3368  -3      13       1.801880   2-nitropropane dioxygenase, NPD
Shewmr7_3369  -1      14       -0.012374  hypothetical protein
Shewmr7_3370  -3      15       -3.245359  XRE family transcriptional regulator
Shewmr7_3371  -3      15       -3.644380  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr7_3364  PF06580  42     4e-06    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 42.2 bits (99), Expect = 4e-06
Identities = 27/148 (18%), Positives = 56/148 (37%), Gaps = 19/148 (12%)

Query: 405 NQLTEINEGVSTAYVQLRELL----STFRLTIKEPNLKN-AMEAMLEQLRANTDI----- 454
N L I + + RE+L R +++ N + ++ L + + +
Sbjct: 177 NALNNIRALILEDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQF 236

Query: 455 --KIHLDYKLSPQWLEAKQHIHILQITREATLNAIKHANASR----VIIRCYKDDNGMVN 508
++ + +++P ++ + ++Q E N IKH A I+ DNG V
Sbjct: 237 EDRLQFENQINPAIMDVQVPPMLVQTLVE---NGIKHGIAQLPQGGKILLKGTKDNGTVT 293

Query: 509 ISVSDNGIGIGYLKERDQHFGIGIMHER 536
+ V + G + G+ + ER
Sbjct: 294 LEVENTGSLALKNTKESTGTGLQNVRER 321


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr7_3365  HTHFIS  64     3e-14    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 63.7 bits (155), Expect = 3e-14
Identities = 27/159 (16%), Positives = 62/159 (38%), Gaps = 9/159 (5%)

Query: 6 SVLVVDDHPLLRKGICQLIASDPDFSLFGEVGGGLDALSAVATDEPDIVLLDLNMKGMTG 65
++LV DD +R + Q S + + +A + D+V+ D+ M
Sbjct: 5 TILVADDDAAIRTVLNQ-ALSRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 66 LDTLNAMRQEGVTSRIVILTVSDAKQDVIRLLRAGADGYLLKDTEPDLLLDKLKNAMLGH 125
D L +++ +++++ + I+ GA YL K + L+ + A+
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL--- 119

Query: 126 RVISDEVEEYLYELKNAADEQEWISSLTPRELQILQQLA 164
E + +L++ + + + + +I + LA
Sbjct: 120 ----AEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLA 154


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_3369  CARBMTKINASE  31     0.009    Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 30.9 bits (70), Expect = 0.009
Identities = 18/81 (22%), Positives = 27/81 (33%), Gaps = 5/81 (6%)

Query: 202 DYSAALLAEALKASAVEIWTDVAGIYTTDPRLAPNAHPIAEISFNEAAEMATFGAKVLHP 261
D + LAE + A I TDV G + E+ E + G
Sbjct: 216 DLAGEKLAEEVNADIFMILTDVNGAALY--YGTEKEQWLREVKVEELRKYYEEGH--FKA 271

Query: 262 ATILPAVRQQIQVFVGSSKEP 282
++ P V I+ F+ E
Sbjct: 272 GSMGPKVLAAIR-FIEWGGER 291


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr7_3371  HTHFIS  83     2e-20    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 82.6 bits (204), Expect = 2e-20
Identities = 31/131 (23%), Positives = 61/131 (46%), Gaps = 1/131 (0%)

Query: 1 MQNPHILIVEDEAVTRNTLRSIFEAEGYVVTEANDGAEMHKAMQENKINLVVMDINLPGK 60
M IL+ +D+A R L GY V ++ A + + + +LVV D+ +P +
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 NGLLLARELREIN-NIGLIFLTGRDNEVDKILGLEIGADDYITKPFNPRELTIRARNLLT 119
N L +++ ++ ++ ++ ++ + I E GA DY+ KPF+ EL L
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 120 RVNSAGNEVEE 130
+++E+
Sbjct: 121 EPKRRPSKLED 131


89  Shewmr7_3481  Shewmr7_3487  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr7_3481  1       15       -2.529760  hypothetical protein
Shewmr7_3482  1       15       -2.778476  hypothetical protein
Shewmr7_3483  1       15       -2.608877  extracellular solute-binding protein
Shewmr7_3484  1       17       -2.781192  hypothetical protein
Shewmr7_3485  1       15       -2.556476  cobalamin synthesis protein, P47K
Shewmr7_3486  1       16       -2.174068  DEAD/DEAH box helicase domain-containing
Shewmr7_3487  1       17       -1.313413  diguanylate cyclase/phosphodiesterase with
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr7_3481  HTHFIS  61     6e-13    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 61.4 bits (149), Expect = 6e-13
Identities = 24/107 (22%), Positives = 43/107 (40%), Gaps = 3/107 (2%)

Query: 146 RVLVVDDSRMARNVIKRTIGNLGIKQITEAEDGAQAIELMRNNMFDLIITDYNMPSVDGL 205
+LV DD R V+ + + G + A + DL++TD MP +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 206 ALTQFIRNESQQSHVPILMVSSEANDAHLSNVSQAGVNALCDKPFEP 252
L I+ + +P+L++S++ S+ G KPF+
Sbjct: 64 DLLPRIKKA--RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDL 108



Score = 47.9 bits (114), Expect = 2e-08
Identities = 30/155 (19%), Positives = 58/155 (37%), Gaps = 6/155 (3%)

Query: 10 SILIVEPSETQRRIIIKRLQQEGIISIQNAASLTQARELIARHKPDLIASAMYFEDGTAT 69
+IL+ + R ++ + L + G ++ ++ IA DL+ + + D A
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGY-DVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 70 DFLSYLRTNSEYKDIQFMLVSSECRREQLEIFRQSGVVAILPKPFNAEHLGKALNATIDL 129
D L ++ D+ +++S++ + G LPKPF+ L + +
Sbjct: 64 DLLPRIK--KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 130 LSHDELDLSHFDVHDVRVLVVDDSRM--ARNVIKR 162
L D D LV + M V+ R
Sbjct: 122 PKRRPSKLED-DSQDGMPLVGRSAAMQEIYRVLAR 155


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_3482  DNABINDINGHU  109    3e-35    Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 109 bits (274), Expect = 3e-35
Identities = 44/88 (50%), Positives = 66/88 (75%)

Query: 2 NKTELIAKIAENADITKAEAARALKSFEAAITESMKNGDKISIVGFGSFETSIRAARTGR 61
NK +LIAK+AE ++TK ++A A+ + +A++ + G+K+ ++GFG+FE RAAR GR
Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62

Query: 62 NPQTGKEIQIAEATVPKFKAGKTLRDSV 89
NPQTG+EI+I + VP FKAGK L+D+V
Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAV 90


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits        Score  E-value  Comments
Shewmr7_3483  OMPADOMAIN  33     7e-04    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 33.4 bits (76), Expect = 7e-04
Identities = 35/200 (17%), Positives = 66/200 (33%), Gaps = 54/200 (27%)

Query: 47 VAIQGGIDYSHDSGFYAGTWASNVDFGDDTSYELDLYAGYGGNITEDLSYDIGYLYYAYP 106
+ G HD+GF +N + + GY + + +++GY +
Sbjct: 30 TGAKLGWSQYHDTGFI-----NNNGPTHENQLGAGAFGGY--QVNPYVGFEMGYDWLGRM 82

Query: 107 DAEGSID-------------------------FGELHGAITWKWFEISYSHVINAGDDVA 141
+GS++ + L G + W + S+V D
Sbjct: 83 PYKGSVENGAYKAQGVQLTAKLGYPITDDLDIYTRLGGMV---WRADTKSNVYGKNHDTG 139

Query: 142 AEPLDNKDMSYLAATASFPLTDKLSLSLHYGYSSGDVVESWFDEDNYADYNITLSADTSM 201
P+ A + +T +++ L Y +++ N D + T+
Sbjct: 140 VSPV-------FAGGVEYAITPEIATRLEYQWTN-----------NIGDAH-TIGTRPDN 180

Query: 202 GTVSFMVSDTDLQGDDAKVV 221
G +S VS QG+ A VV
Sbjct: 181 GMLSLGVSYRFGQGEAAPVV 200


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr7_3485  HTHFIS  62     3e-12    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 61.8 bits (150), Expect = 3e-12
Identities = 27/102 (26%), Positives = 48/102 (47%), Gaps = 3/102 (2%)

Query: 3 LLLIDDDEVDRTAIIRALRQSKLAFNVIEANCAFDGLNLALERHFDGILLDYLLPDANGL 62
+L+ DDD RT + +AL S+ ++V + A D ++ D ++PD N
Sbjct: 6 ILVADDDAAIRTVLNQAL--SRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 63 EVLIKLNSMTQEQTVVLMLSRYEDEKLAQRCIELGAQDFLLK 104
++L ++ + VL++S A + E GA D+L K
Sbjct: 64 DLLPRIKKA-RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPK 104


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr7_3486  HTHFIS  44     5e-08    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 44.1 bits (104), Expect = 5e-08
Identities = 15/73 (20%), Positives = 28/73 (38%), Gaps = 6/73 (8%)

Query: 11 TILLVDDDDVDYMAVQRAMKQLRLLNPLVRARDGLEALAILTHSDAIKGSYLILLDLNMP 70
TIL+ DDD + +A+ + + + + L++ D+ MP
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGY--DVRITSNAATLWRWIAA----GDGDLVVTDVVMP 58

Query: 71 RMNGFEFLEHIRS 83
N F+ L I+
Sbjct: 59 DENAFDLLPRIKK 71


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr7_3487  PF06580  40     2e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 40.2 bits (94), Expect = 2e-05
Identities = 21/107 (19%), Positives = 37/107 (34%), Gaps = 22/107 (20%)

Query: 608 LVIRNLISNAIKH---HDKGEGVIKVICETSNHHYLFSVIDDGPGISSRFHGKVFQMFQT 664
++++ L+ N IKH G I + N V + G
Sbjct: 258 MLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGS---------------- 301

Query: 665 LRPRDEVEGSGLGLSLVKKTVESLGGK---IQLESEGRGCCFRFSWP 708
L ++ E +G GL V++ ++ L G I+L + P
Sbjct: 302 LALKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIP 348


90  Shewmr7_3505  Shewmr7_3515  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr7_3505  -3      18       0.987319   TonB-dependent receptor
Shewmr7_3506  -1      18       1.739705   hypothetical protein
Shewmr7_3507  -1      17       1.833355   potassium efflux system protein
Shewmr7_3508  -2      20       4.003871   diguanylate cyclase
Shewmr7_3509  -1      19       4.593918   hypothetical protein
Shewmr7_3510  -1      20       4.677939   uroporphyrin-III C/tetrapyrrole
Shewmr7_3511  0       20       5.035518   SmpA/OmlA domain-containing protein
Shewmr7_3512  0       21       5.323433   hypothetical protein
Shewmr7_3513  -1      19       5.311948   hypothetical protein
Shewmr7_3514  0       16       3.887124   cyclase/dehydrase
Shewmr7_3515  0       17       2.723142   SsrA-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr7_3505  HTHFIS  355    e-121    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 355 bits (913), Expect = e-121
Identities = 136/479 (28%), Positives = 229/479 (47%), Gaps = 48/479 (10%)

Query: 18 LLVLDPEQSLPE-CSEELKQAAWNCLKAVSAAEALVLLQKYDLRVAIAFIN--DTNQVLL 74
+LV D + ++ ++ L +A ++ +AA + D + + + D N L
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 75 ANEIAIIQTEYPSLHWIAVTD-STLEQHCSWLSAANFIDYYHRPFDWGRFADTLGHAWGM 133
+ I+ P L + ++ +T + DY +PFD +G A
Sbjct: 66 ---LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAY-DYLPKPFDLTELIGIIGRALAE 121

Query: 134 AQLTVAKKGKSAPTEVLTTIKGDNPLLQQLRQRLHKFSLSDDTVLLSGETGSGKGLCAKT 193
+ +K + + G + +Q++ + L + +D T++++GE+G+GK L A+
Sbjct: 122 PKRRPSKLEDDSQD--GMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARA 179

Query: 194 LHSLSKRRDGPFITVNCGALPIGLIHSALFGHEKGAFTDADKRYIGHLEQANGGTLFLDE 253
LH KRR+GPF+ +N A+P LI S LFGHEKGAFT A R G EQA GGTLFLDE
Sbjct: 180 LHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDE 239

Query: 254 IADLPLDLQVNLLHVLDDKQIMRIGGNVPIKVDCRLLFASHQDLEVAIDEGRFREDLYHR 313
I D+P+D Q LL VL + +GG PI+ D R++ A+++DL+ +I++G FREDLY+R
Sbjct: 240 IGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYR 299

Query: 314 INVLRLHVPSLRQYSDEVMLLAEDFLRE-NTDSNVQFHFSDDARCAMKHYNWPGNVRELR 372
+NV+ L +P LR ++++ L F+++ + F +A MK + WPGNVREL
Sbjct: 300 LNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLDVKRFDQEALELMKAHPWPGNVRELE 359

Query: 373 NRIRRAMVLSDDSKITAQLLGLDQLPSRAGQDLARCRV---------------------- 410
N +RR L IT +++ + + +
Sbjct: 360 NLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASF 419

Query: 411 ---------------EHEAEVLLKAISDHKHNISAAARSLNISRATFYRLLKKCQIKMP 454
E E ++L A++ + N AA L ++R T + +++ + +
Sbjct: 420 GDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVSVY 478


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_3507  SACTRNSFRASE  40     9e-07    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 39.9 bits (93), Expect = 9e-07
Identities = 20/72 (27%), Positives = 30/72 (41%), Gaps = 5/72 (6%)

Query: 81 ASIGRVVVSPAGRGKGLAMPLMQRAIDAALSTWPAAGIQIGAQDYLKS---FYQKLGFNA 137
A I + V+ R KG+ L+ +AI+ A G+ + QD S FY K F
Sbjct: 90 ALIEDIAVAKDYRKKGVGTALLHKAIEWAKEN-HFCGLMLETQDINISACHFYAKHHFII 148

Query: 138 CS-EMYLEDGIP 148
+ + L P
Sbjct: 149 GAVDTMLYSNFP 160


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr7_3508  TCRTETB  120    1e-31    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 120 bits (303), Expect = 1e-31
Identities = 89/421 (21%), Positives = 177/421 (42%), Gaps = 19/421 (4%)

Query: 18 SEYERGSRRSWIAVFGGLIGAFMAILDIQITNASMKEIQGSLGATLEEGSWISTAYLVAE 77
+ Y + + R + I +F ++L+ + N S+ +I +W++TA+++
Sbjct: 3 TSYSQSNLRHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTF 62

Query: 78 MIAIPLSGWLSTGLSVRRYLLWTTAAFIFASILCSISWN-LEAMIAFRALQGFFGGALIP 136
I + G LS L ++R LL+ F S++ + + +I R +QG A
Sbjct: 63 SIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPA 122

Query: 137 LAFRLILEFLPENKRAVGMALFGVTATFAPSIGPTLGGWLTEHFSWHYLFYINVQPGLLV 196
L ++ ++P+ R L G +GP +GG + + W YL I + ++
Sbjct: 123 LVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMI--TII 180

Query: 197 MAMLAYGLEKRPVVWDKLKNADLAGIVTMALGMGCLEVVLEEGNRKDWFGSDLIRNLAII 256
L K+ V + D+ GI+ M++G+ + F + + I+
Sbjct: 181 TVPFLMKLLKKEVRIKG--HFDIKGIILMSVGIVFFML----------FTTSYSISFLIV 228

Query: 257 AAVNLVLFVWIQLKRKDPLVNLRLLGKRDFVLSTIAYFLLGMALFGAIYLIPLYLSQVHD 316
+ ++ ++FV K DP V+ L F++ + ++ + G + ++P + VH
Sbjct: 229 SVLSFLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQ 288

Query: 317 YTPLEIGEVIMWMGFPQLLVL-PLVPRLMQRFDGRYLAAFGFFMFALSYYMNSQMTADYA 375
+ EIG VI++ G +++ + L+ R Y+ G ++S+ S +
Sbjct: 289 LSTAEIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFL--LET 346

Query: 376 GPQMIASQVVRALG-QPFILVPIGMLATAHLKPHENPSASTVLNVMRNLGGAFGIALVAT 434
+ +V LG F I + ++ LK E + ++LN L GIA+V
Sbjct: 347 TSWFMTIIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGG 406

Query: 435 L 435
L
Sbjct: 407 L 407


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
Shewmr7_3509  RTXTOXIND  95     1e-23    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 94.9 bits (236), Expect = 1e-23
Identities = 38/290 (13%), Positives = 92/290 (31%), Gaps = 28/290 (9%)

Query: 71 LAQLEDNQFSAKVSQAEASLASAKADLQTLAAKVELQHALISQASAGVVAAQADKLRAEQ 130
+ + + S + ++ + ++ + A A + + +
Sbjct: 176 FQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKS 235

Query: 131 QLTRAKKLKVSNYSSQDDVDQLQAGFDSAAAGLDEAKA--------LLVAKERELAVFN- 181
+L L ++ V + + + A L K+ +L AKE V
Sbjct: 236 RLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQL 295

Query: 182 ------AQLNQAGSVVEQSNAALELAKIQLNDTRVTAPFSGVIGKRGAM-VGQYVQPGQA 234
+L Q + L + + + + AP S + + G V +
Sbjct: 296 FKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAET 355

Query: 235 LYSLVPDGAV-WITANFKETQIQHMQPGQSVQVSLDAFPDKTF---TGVIDSLSPASGAK 290
L +VP+ +TA + I + GQ+ + ++AFP + G + +++ +
Sbjct: 356 LMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDA--- 412

Query: 291 FSLLPAENATGNFTKIVQRIPVRIRLDLSEEEARVVPGLSAVVKVDTASH 340
+ G ++ I + + G++ ++ T
Sbjct: 413 ----IEDQRLGLVFNVIISIEENCLST-GNKNIPLSSGMAVTAEIKTGMR 457



Score = 58.3 bits (141), Expect = 1e-11
Identities = 24/128 (18%), Positives = 48/128 (37%), Gaps = 2/128 (1%)

Query: 59 VTDNQHVRKGELLAQLEDNQFSAKVSQAEASLASAKADLQTLAAKVELQHALISQASAGV 118
V + + VRKG++L +L A + ++SL A+ + L +
Sbjct: 112 VKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSR-SIELNKLPELKL 170

Query: 119 VAAQADKLRAEQQLTRAKKLKVSNYSS-QDDVDQLQAGFDSAAAGLDEAKALLVAKEREL 177
+ +E+++ R L +S+ Q+ Q + D A A + E
Sbjct: 171 PDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLS 230

Query: 178 AVFNAQLN 185
V ++L+
Sbjct: 231 RVEKSRLD 238


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits         Score  E-value  Comments
Shewmr7_3511  MECHCHANNEL  174    1e-59    Bacterial mechano-sensitive ion channel signature.
		>MECHCHANNEL#Bacterial mechano-sensitive ion channel signature.

Length = 136

Score = 174 bits (443), Expect = 1e-59
Identities = 89/136 (65%), Positives = 111/136 (81%), Gaps = 1/136 (0%)

Query: 1 MSLIKEFKAFASRGNVIDMAVGIIIGAAFGKIVSSFVADIIMPPIGIILGGVNFSDLSIV 60
MS+IKEF+ FA RGNV+D+AVG+IIGAAFGKIVSS VADIIMPP+G+++GG++F ++
Sbjct: 1 MSIIKEFREFAMRGNVVDLAVGVIIGAAFGKIVSSLVADIIMPPLGLLIGGIDFKQFAVT 60

Query: 61 LQAAQGDAPAVVIAYGKFIQTVIDFTIIAFAIFMGLKAINSLKRKQEEAPKAPPAPTKDQ 120
L+ AQGD PAVV+ YG FIQ V DF I+AFAIFM +K IN L RK+EE P A PAPTK++
Sbjct: 61 LRDAQGDIPAVVMHYGVFIQNVFDFLIVAFAIFMAIKLINKLNRKKEE-PAAAPAPTKEE 119

Query: 121 ELLSEIRDLLKAQQEK 136
LL+EIRDLLK Q +
Sbjct: 120 VLLTEIRDLLKEQNNR 135


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_3513  ACRIFLAVINRP  653    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 653 bits (1687), Expect = 0.0
Identities = 224/1077 (20%), Positives = 440/1077 (40%), Gaps = 72/1077 (6%)

Query: 9 AIKNRLLVVLALLAVVAASVAMLPKLNLDAFPDVTNVQVTINTAAEGLAAEEVEKLISYP 68
I+ + + + ++ A + +L + +P + V+++ G A+ V+ ++
Sbjct: 5 FIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQV 64

Query: 69 VESAMYALPAVTEVRSLS-RTGLSIVTVVFAEGTDIYFARQQVFEQLQAAREMIPSGVGV 127
+E M + + + S S G +T+ F GTD A+ QV +LQ A ++P V
Sbjct: 65 IEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQQ 124

Query: 128 PEIGPNTSGLGQIYQYILRADPSSGINAAELRSLNDYLVKLIMMPVGGVTDVLSFGGEVR 187
I S + +D + G ++ VK + + GV DV FG +
Sbjct: 125 QGISVEKSSSSYLMVAGFVSD-NPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQ-Y 182

Query: 188 QYQVQVDPNKLRAYGLSMAQVSEALESNNRNAGGWFMDQGQEQLVVRGYGMLPAGEAGLA 247
++ +D + L Y L+ V L+ N + G L + +
Sbjct: 183 AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLG-GTPALPGQQLNASIIAQTRFK 241

Query: 248 AIAQIPLTEVR----GTPVRVGDIAKVDFGSEIRVGAVTMTRRDEAGQAQALGEVVAGVV 303
+ +R G+ VR+ D+A+V+ G E + G+ +
Sbjct: 242 NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARIN-----GK-----PAAGLGI 291

Query: 304 LKRMGANTKATIDDIDARINLIEQALPKGVSFEVFYDQADLVDKAVTTVRDALLMAFVFI 363
GAN T I A++ ++ P+G+ YD V ++ V L A + +
Sbjct: 292 KLATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLV 351

Query: 364 VVILALFLVNIRATLLVLLSIPVSIGLALMVMSYYGLSANLMSLGGLAVAIGMLVDGSVV 423
+++ LFL N+RATL+ +++PV + +++ +G S N +++ G+ +AIG+LVD ++V
Sbjct: 352 FLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIV 411

Query: 424 MVENIFKHLTQPDRRHLAQARSRVDGEVDPYHGDEDGSVRAVEADNSMAVRIMLAAKEVC 483
+VEN+ + + + + EA + ++
Sbjct: 412 VVENVERVM-------------------------MEDKLPPKEA-------TEKSMSQIQ 439

Query: 484 SPIFFATAIIIVVFAPLFALEGVEGKLFQPMAVSIILAMISALLVALIAVPALAVYLFK- 542
+ ++ VF P+ G G +++ +++I+ AM ++LVALI PAL L K
Sbjct: 440 GALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKP 499

Query: 543 ---------RGVVLKESVVLAPLDSAYRKLLSATLARPKLVMISALLMFAMSMVLLPRLG 593
G + + Y + L ++ L+ A +VL RL
Sbjct: 500 VSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRLP 559

Query: 594 TEFVPELEEGTINLRVTLAPTASLGTSLDVAPKLEAMLLEFPEVEYALSRIGAPELGGDP 653
+ F+PE ++G + L A+ + V ++ L+ + S
Sbjct: 560 SSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNEKANVE-SVFTVNGFSFSG 618

Query: 654 EPVSNIEVYIGLKPIEEWQSASSRLE--LQRLMEEKLSVFPGLLLTFSQPIATRVDELLS 711
+ + ++ LKP EE + E + R E + G ++ F+ P + EL +
Sbjct: 619 QAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDGFVIPFNMP---AIVELGT 675

Query: 712 GVKAQLA-IKLFGPDLDVLSEKGQVLTDLVAKIPGAV-DVSLEQVSGEAQLVVRPDRSQL 769
I G D L++ L + A+ P ++ V + AQ + D+ +
Sbjct: 676 ATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKA 735

Query: 770 ARYGISVDQVMSLVSQGIGGASAGQVIDGNARYDINLRLAAEYRSSPDVIKDLLLSGSNG 829
G+S+ + +S +GG ID + ++ A++R P+ + L + +NG
Sbjct: 736 QALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRMLPEDVDKLYVRSANG 795

Query: 830 ATVRLGEVASVEVEMAPPNIRRDDVQRRVVVQANVA-GRDMGSVVKDIYALVPQADLPAG 888
V + P + R + + +Q A G G + + L + LPAG
Sbjct: 796 EMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAMALMENLASK--LPAG 853

Query: 889 YTVIVGGQYENQQRAQQKLMLVVPISIALIALLLYFSFGAVKQVLLIMANVPLALIGGIV 948
G ++ + + +V IS ++ L L + + + +M VPL ++G ++
Sbjct: 854 IGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLL 913

Query: 949 ALFVSGTYLSVPSSIGFITLFGVAVLNGVVLVDSINQ-RRQSGESLYDSVYEGTVGRLRP 1007
A + V +G +T G++ N +++V+ + G+ + ++ RLRP
Sbjct: 914 AATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRP 973

Query: 1008 VLMTALTSALGLIPILVSSGVGSEIQKPLAVVIIGGLFSSTALTLLVLPTLYRWLYR 1064
+LMT+L LG++P+ +S+G GS Q + + ++GG+ S+T L + +P + + R
Sbjct: 974 ILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVIRR 1030



Score = 108 bits (270), Expect = 8e-26
Identities = 85/550 (15%), Positives = 186/550 (33%), Gaps = 68/550 (12%)

Query: 10 IKNRLLVVLALLAVVAASVAMLPKLNLDAFPDVTNVQVTIN-TAAEGLAAEEVEKLI--- 65
+ + +L +VA V + +L P+ G E +K++
Sbjct: 534 LGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQV 593

Query: 66 -SYPVESAMYALPAVTEVRSLSRTGLS----IVTVVFAEGTDIYFARQQVFEQLQAAREM 120
Y +++ + +V V S +G + + V + + A+
Sbjct: 594 TDYYLKNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKME 653

Query: 121 ---IPSGVGVPEIGPNTSGLGQIYQYILRADPSSGINAAELRSLNDYLVKLIMMPVGGVT 177
I G +P P LG + +G+ L + L+ + +
Sbjct: 654 LGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLV 713

Query: 178 DVLSFGGEVR-QYQVQVDPNKLRAYGLSMAQVSEALES--NNRNAGGWFMDQGQEQLVVR 234
V G E Q++++VD K +A G+S++ +++ + + + ++L V+
Sbjct: 714 SVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQ 773

Query: 235 GYGMLPAGEA-GLAAIAQIPLTEVRGTPVRVGDIAKVDFGSEIRVGAVTMTRRDEAGQAQ 293
A + ++ + G V + G+ + R + +
Sbjct: 774 ----ADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVY----GSPRLERYNGLPSME 825

Query: 294 ALGEVVAGVVLKRMGANTKATIDDIDARINLIEQALPKGVSFEVFYDQADLVDKAVTTVR 353
GE G D A + + LP G+ ++ + + +
Sbjct: 826 IQGEAAPGTSS-----------GDAMALMENLASKLPAGIGYD-WTGMSYQERLSGNQAP 873

Query: 354 DALLMAFVFIVVILALFLVNIRATLLVLLSIPVSIGLALMVMSYYGLSANLMSLGGLAVA 413
+ ++FV + + LA + + V+L +P+ I L+ + + ++ + GL
Sbjct: 874 ALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTT 933

Query: 414 IGMLVDGSVVMVENIFKHLTQPDRRHLAQARSRVDGEVDPYHGDEDGSVRAVEADNSMAV 473
IG+ ++++VE A+ +G+ VEA
Sbjct: 934 IGLSAKNAILIVE-------------FAKDLMEKEGK------------GVVEA------ 962

Query: 474 RIMLAAKEVCSPIFFATAIIIVVFAPLFALEGVEGKLFQPMAVSIILAMISALLVALIAV 533
++A + PI + I+ PL G + + ++ M+SA L+A+ V
Sbjct: 963 -TLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFV 1021

Query: 534 PALAVYLFKR 543
P V + +
Sbjct: 1022 PVFFVVIRRC 1031



Score = 98.4 bits (245), Expect = 8e-23
Identities = 67/347 (19%), Positives = 141/347 (40%), Gaps = 16/347 (4%)

Query: 735 VLTDLVAKIPGAVDVSLEQVSGEAQLVVRPDRSQLARYGISVDQVMSLVSQGIGGASAGQ 794
+ D ++++ G DV L + + + D L +Y ++ V++ + +AGQ
Sbjct: 161 NVKDTLSRLNGVGDVQL--FGAQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQ 218

Query: 795 VIDGNARYDINLRLA----AEYRSSPDVIKDLLLSGSNGATVRLGEVASVEVEMAPPNIR 850
+ A L + +++ + K L S+G+ VRL +VA VE+ N+
Sbjct: 219 LGGTPALPGQQLNASIIAQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVI 278

Query: 851 -RDDVQRRVVVQANVA-GRDMGSVVKDIYALVP--QADLPAGYTVIVGGQYENQQRAQQK 906
R + + + +A G + K I A + Q P G V+ Y+ Q
Sbjct: 279 ARINGKPAAGLGIKLATGANALDTAKAIKAKLAELQPFFPQGMKVLY--PYDTTPFVQLS 336

Query: 907 LMLVVP---ISIALIALLLYFSFGAVKQVLLIMANVPLALIGGIVALFVSGTYLSVPSSI 963
+ VV +I L+ L++Y ++ L+ VP+ L+G L G ++ +
Sbjct: 337 IHEVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMF 396

Query: 964 GFITLFGVAVLNGVVLVDSINQRRQS-GESLYDSVYEGTVGRLRPVLMTALTSALGLIPI 1022
G + G+ V + +V+V+++ + ++ + ++ A+ + IP+
Sbjct: 397 GMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPM 456

Query: 1023 LVSSGVGSEIQKPLAVVIIGGLFSSTALTLLVLPTLYRWLYRHDKSP 1069
G I + ++ I+ + S + L++ P L L + +
Sbjct: 457 AFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAE 503


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
Shewmr7_3514  RTXTOXIND  56     9e-11    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 56.4 bits (136), Expect = 9e-11
Identities = 32/138 (23%), Positives = 57/138 (41%), Gaps = 9/138 (6%)

Query: 157 EVAKAQAEYINAAAEWNRVRR---MSESAVSVSRRMQAQVDAELKRAILEAIKMTDEQIR 213
V + + +Y+ A E + ES + ++ V K IL+ ++ T + I
Sbjct: 253 AVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIG 312

Query: 214 TLE----STPEAIGSYQLLAPIDGRVQQ-DIAMLGQVFTAGTPLMQLT-DESHLWVEAQL 267
L E + + AP+ +VQQ + G V T LM + ++ L V A +
Sbjct: 313 LLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALV 372

Query: 268 TPAQAANVNVGGPALVQV 285
+NVG A+++V
Sbjct: 373 QNKDIGFINVGQNAIIKV 390



Score = 42.1 bits (99), Expect = 3e-06
Identities = 28/149 (18%), Positives = 55/149 (36%), Gaps = 5/149 (3%)

Query: 101 SLSNLNLDTMATATLVVDRDRTATLAPQLDVRVQARHVVPGQEVKKGEPLLTLGG----A 156
L + + A L R+ + P + V+ V G+ V+KG+ LL L A
Sbjct: 76 VLGQVEIVATANGKLTHS-GRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEA 134

Query: 157 EVAKAQAEYINAAAEWNRVRRMSESAVSVSRRMQAQVDAELKRAILEAIKMTDEQIRTLE 216
+ K Q+ + A E R + +S S D + + E + + +
Sbjct: 135 DTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQ 194

Query: 217 STPEAIGSYQLLAPIDGRVQQDIAMLGQV 245
+ YQ +D + + + +L ++
Sbjct: 195 FSTWQNQKYQKELNLDKKRAERLTVLARI 223


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
Shewmr7_3515  RTXTOXIND  31     0.010    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 31.0 bits (70), Expect = 0.010
Identities = 21/167 (12%), Positives = 52/167 (31%), Gaps = 17/167 (10%)

Query: 80 EVQAQIARQQQAELAIAAADRAVYNPEL-GLNYQNADTDTYSLGLSQTLDWGDKRGVATR 138
+ + QA L + EL L + Y +S+ + +
Sbjct: 135 DTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQ 194

Query: 139 LAQLEAQILLADISLERSQMLAERLLALAEQAQSNKALTFAEQQLRFTQAQLNIAEQRFA 198
+ + Q +++L++ + AE+ + E R +++L+
Sbjct: 195 FSTWQNQKYQKELNLDKKR---------AERLTVLARINRYENLSRVEKSRLDDFSSLLH 245

Query: 199 AGDLS-----DVELQLLKL--ELASNTADYAMAEQAALVADGKVIEL 238
++ + E + ++ EL + E L A + +
Sbjct: 246 KQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLV 292


91  Shewmr7_3521  Shewmr7_3529  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr7_3521   1      16        3.937831  hypothetical protein
Shewmr7_3522  -1      14       -1.186229  Abi family protein
Shewmr7_3523  -1      13       -1.351018  restriction modification system DNA specificity
Shewmr7_3524   0      13       -0.866225  N-6 DNA methylase
Shewmr7_3525  -1      10       -0.267888  hypothetical protein
Shewmr7_3526  -2      12        0.446915  hypothetical protein
Shewmr7_3527  -2      10        0.378470  hypothetical protein
Shewmr7_3528  -1      14        2.592367  EcoEI R domain-containing protein
Shewmr7_3529  -2      13        2.478519  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
Shewmr7_3521  RTXTOXIND  29     0.006    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 29.4 bits (66), Expect = 0.006
Identities = 9/23 (39%), Positives = 11/23 (47%)

Query: 126 GIVGAIWVKDGDDVAFDQPLFTL 148
IV I VK+G+ V L L
Sbjct: 105 SIVKEIIVKEGESVRKGDVLLKL 127


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_3522  DHBDHDRGNASE  52     2e-10    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 52.4 bits (125), Expect = 2e-10
Identities = 41/183 (22%), Positives = 80/183 (43%), Gaps = 5/183 (2%)

Query: 2 ILITGASSGLGAALAALYAKDNQALTLTGRDANRLHTVANALSPFSTQSISAISADLADE 61
ITGA+ G+G A+A A + + +L V ++L + + A AD+ D
Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHA-EAFPADVRDS 69

Query: 62 ASVEALFDGL---TDTPNTVIHCAGSGYFGALETQGTSEIQALLNNNVTSTILLVRELVK 118
A+++ + + + +++ AG G + + E +A + N T R + K
Sbjct: 70 AAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSK 129

Query: 119 RYK-QQAVKVVVVMSTAALAAKAGESTYCAAKWAVRGFIESVRLELKQSQMKLIAVYPGG 177
+++ +V V S A + + Y ++K A F + + LEL + ++ V PG
Sbjct: 130 YMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPGS 189

Query: 178 MDT 180
+T
Sbjct: 190 TET 192


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr7_3523  TCRTETA  51     4e-09    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 50.6 bits (121), Expect = 4e-09
Identities = 60/322 (18%), Positives = 107/322 (33%), Gaps = 22/322 (6%)

Query: 50 VAHVSYAISAYALGVVVGSPIIMVLGVRIKRRTLLIALAAMMAVANGLSALAPSLNWLVF 109
AH ++ YAL +P++ L R RR +L+ A AV + A AP L L
Sbjct: 42 TAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYI 101

Query: 110 FRFLSGLPHGAYFGVSMLLAASLVPPDMKARAVSRVIIGLTLATIVGVPFATWMGQTVGW 169
R ++G+ GA V+ A + D +AR + + G MG
Sbjct: 102 GRIVAGIT-GATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGG-FSP 159

Query: 170 RSGIGIVAIIAAVTAVMLYFLAPNVAVPQNASPKKELQTLKNREVWLTLGIAAIGFGGIF 229
+ A + + + FL P + + N +
Sbjct: 160 HAPFFAAAALNGLNFLTGCFLLPE---SHKGERRPLRREALNPLASFRWARGMTVVAALM 216

Query: 230 CVYTYLAETLIQVTQV------------EPFKIPVMIAVFGI-GATLGTLVCGWAADK-S 275
V+ ++ + + QV + I + +A FGI + ++ G A +
Sbjct: 217 AVF-FIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLG 275

Query: 276 ALAAAFWSLVLSTLVLALYPSLTGSYWALMPI-VFFVGSGIGLATIVQARLMDVAPDGQA 334
A ++ L + W PI V GIG+ + V + Q
Sbjct: 276 ERRALMLGMIADGTGYILL-AFATRGWMAFPIMVLLASGGIGMPALQAMLSRQVDEERQG 334

Query: 335 MTGALVQCAFNLANAIGPWVGS 356
+ +L + +GP + +
Sbjct: 335 QLQGSLAALTSLTSIVGPLLFT 356


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits         Score  E-value  Comments
Shewmr7_3524  IGASERPTASE  33     0.001    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 32.7 bits (74), Expect = 0.001
Identities = 20/80 (25%), Positives = 30/80 (37%), Gaps = 2/80 (2%)

Query: 145 PMAYDDTPVAVAPPVRVTTSMQYSPSEGRMVSNMPTNSATVISQTGASTARASTASTEQI 204
P + +T P + T+S P N NS + T ++E
Sbjct: 1158 PQSQTNTTADTEQPAKETSSNVEQPVTESTTVN-TGNSVVENPENTTPATTQPTVNSES- 1215

Query: 205 ANVPRARAARSVSSLPSNAR 224
+N P+ R RSV S+P N
Sbjct: 1216 SNKPKNRHRRSVRSVPHNVE 1235


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits        Score  E-value  Comments
Shewmr7_3529  SUBTILISIN  109    6e-28    Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 109 bits (275), Expect = 6e-28
Identities = 64/307 (20%), Positives = 95/307 (30%), Gaps = 99/307 (32%)

Query: 210 TDRGPEFIGADQMWQGTATQGGLPVKGEGMVVGIIDTGINTDHVAFADDEEYARLNPYKG 269
RG E I A +W T +G G+ V ++DTG + DH
Sbjct: 22 IPRGVEMIQAPAVWNQT--------RGRGVKVAVLDTGCDADHPDLKA------------ 61

Query: 270 QAIGDCGAFPELCNNKLVGLHSYPEITDVYAAPEFQTSSGAKKRIRPANAEDYAGHGSHT 329
+++G ++ + + P +DY GHG+H
Sbjct: 62 ---------------RIIGGRNFTDDDE----------------GDPEIFKDYNGHGTHV 90

Query: 330 ASTVAGNTLKDTPLQGFTGDKVSDGVDVPFTFPQTSGVAPRAHIIAYQVCWPGSSGDPYA 389
A T+A ++ GVAP A ++ +V SG
Sbjct: 91 AGTIAAT-----------ENENG-----------VVGVAPEADLLIIKVLNKQGSGQ--- 125

Query: 390 GCPESAILSAFEDAIADGVDAINFSIGGAENMPWGDPMELAFLSAREAGISVAAAAGNSG 449
I+ AI VD I+ S+GG E+ + A A + I V AAGN G
Sbjct: 126 ---YDWIIQGIYYAIEQKVDIISMSLGGPED---VPELHEAVKKAVASQILVMCAAGNEG 179

Query: 450 AFWTADH------SSPWVTTVGASTHDRKLK------AGIKSITAFEG-----TGKPTTA 492
V +VGA DR + + E G
Sbjct: 180 DGDDRTDELGYPGCYNEVISVGAINFDRHASEFSNSNNEVDLVAPGEDILSTVPGGKYAT 239

Query: 493 IQGTSFS 499
GTS +
Sbjct: 240 FSGTSMA 246



Score = 73.0 bits (179), Expect = 1e-15
Identities = 32/131 (24%), Positives = 48/131 (36%), Gaps = 32/131 (24%)

Query: 627 NNLATFSSLGPSKTNNTLVPDLTAPGVDIYAANADDQPFTNNPSASDWTFMSGTSMAAPH 686
+ + FS+ DL APG DI + + SGTSMA PH
Sbjct: 207 RHASEFSNSNNE-------VDLVAPGEDILSTVPG----------GKYATFSGTSMATPH 249

Query: 687 VTGAMTLLTQL-----HPDWTPAEIQSALMLTAGPVVLNTGYELIEPYYNFMAGAGAINV 741
V GA+ L+ QL D T E+ + L+ P+ + E G G + +
Sbjct: 250 VAGALALIKQLANASFERDLTEPELYAQLIKRTIPLGNSPKME----------GNGLLYL 299

Query: 742 ARAADTGLVMD 752
+ + D
Sbjct: 300 TAVEELSRIFD 310


92  Shewmr7_3760  Shewmr7_3773  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias   Product
Shewmr7_3760  -2      15       0.830486  hypothetical protein
Shewmr7_3761  -1      15       0.900530  transaldolase B
Shewmr7_3762  -1      16       0.789315  glucose-6-phosphate isomerase
Shewmr7_3763  -1      15       1.146134  hypothetical protein
Shewmr7_3764  -1      17       1.150425  hemerythrin HHE cation binding domain-containing
Shewmr7_3765  -2      14       1.403336  hypothetical protein
Shewmr7_3766   0      16       2.758065  ECF subfamily RNA polymerase sigma-24 factor
Shewmr7_3767  -1      16       3.077962  von Willebrand factor, type A
Shewmr7_3769   0      15       2.690738  hypothetical protein
Shewmr7_3770   0      16       2.926040  putative sulfate transporter YchM
Shewmr7_3771  -1      15       2.856241  phosphoribosylaminoimidazole carboxylase,
Shewmr7_3772   0      13       2.202972  phosphoribosylaminoimidazole carboxylase ATPase
Shewmr7_3773  -1      14       1.470382  diguanylate cyclase with GAF sensor
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr7_3760  HTHTETR  59     5e-13    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 58.9 bits (142), Expect = 5e-13
Identities = 25/68 (36%), Positives = 32/68 (47%)

Query: 1 MKTETQSTRQHILDVGYSLIVKQGFSCLGLAQLLKAAQVPKGSFYHYFKSKEQFGEALLT 60
K E Q TRQHILDV L +QG S L ++ KAA V +G+ Y +FK K +
Sbjct: 5 TKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWE 64

Query: 61 GYFEQYQA 68

Sbjct: 65 LSESNIGE 72


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr7_3763  PF06580  41     5e-06    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 40.6 bits (95), Expect = 5e-06
Identities = 35/187 (18%), Positives = 72/187 (38%), Gaps = 33/187 (17%)

Query: 167 LIIEQADRLRSLVDRL-------LGPQRPTQHSLHNIHQVVQKVYKLVEMALPTNIQLKR 219
LI+E + R ++ L L Q SL + VV +L + +Q +
Sbjct: 185 LILEDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFEN 244

Query: 220 DYDPSIPDIEMDPDQMQQAVLNILQNAVQALEHTGGEILIRTRTQHQVTIGSQRHKLVLT 279
+P+I D+++ P +Q V N +++ + L GG+IL++ + +T
Sbjct: 245 QINPAIMDVQVPPMLVQTLVENGIKHGIAQLPQ-GGKILLKGTKDNGT----------VT 293

Query: 280 LSIIDNGPGIPPELMDTLFYPMVTSREQGSGLGLSIAHNIARLHSG---RIDCVSSPGHT 336
L + + G + + ++ +G GL ++ G +I G
Sbjct: 294 LEVENTGSL------------ALKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKV 341

Query: 337 EFIISLP 343
++ +P
Sbjct: 342 NAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr7_3764  HTHFIS  557    0.0      FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 557 bits (1438), Expect = 0.0
Identities = 198/474 (41%), Positives = 295/474 (62%), Gaps = 12/474 (2%)

Query: 7 VWILDDDSSIRWVLEKALQGAKLSTASFAAAESLWQALEISQPRVIVSDIRMPGTDGLTL 66
+ + DDD++IR VL +AL A + A +LW+ + ++V+D+ MP + L
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 67 LERLQIHYPHIPVIIMTAHSDLDSAVSAYQAGAFEYLPKPFDIDEAISLVERALTHATEQ 126
L R++ P +PV++M+A + +A+ A + GA++YLPKPFD+ E I ++ RAL +
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE--PK 123

Query: 127 SPTPAVQETQVKTPEIIGEAPAMQEVFRAIGRLSRSSISVLINGQSGTGKELVAGALHKH 186
+++ ++G + AMQE++R + RL ++ ++++I G+SGTGKELVA ALH +
Sbjct: 124 RRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDY 183

Query: 187 SPRKDKPFIALNMAAIPKDLIESELFGHEKGAFTGAANVRQGRFEQANGGTLFLDEIGDM 246
R++ PF+A+NMAAIP+DLIESELFGHEKGAFTGA GRFEQA GGTLFLDEIGDM
Sbjct: 184 GKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDM 243

Query: 247 PLDVQTRLLRVLADGQFYRVGGHSAVQVDVRIIAATHQDLEQLVLKGGFREDLFHRLNVI 306
P+D QTRLLRVL G++ VGG + ++ DVRI+AAT++DL+Q + +G FREDL++RLNV+
Sbjct: 244 PMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVV 303

Query: 307 RVHLPPLSQRREDIPQLATHFLASAAKEIGVEAKILTKETAAKLSQLPWPGNVRQLENTC 366
+ LPPL R EDIP L HF+ A KE G++ K +E + PWPGNVR+LEN
Sbjct: 304 PLRLPPLRDRAEDIPDLVRHFVQQAEKE-GLDVKRFDQEALELMKAHPWPGNVRELENLV 362

Query: 367 RWLTVMASGQEILPQDLPPELLKEPASINPMAKGSQDWQSALTEWIDQKLSE-------- 418
R LT + I + + EL E ++ ++++ +++ + +
Sbjct: 363 RRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDA 422

Query: 419 -GNSDLLTEVQPAFERILLETALRHTQGHKQEAAKRLGWGRNTLTRKLKELSMD 471
S L V E L+ AL T+G++ +AA LG RNTL +K++EL +
Sbjct: 423 LPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVS 476


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits        Score  E-value  Comments
Shewmr7_3765  OMPADOMAIN  69     2e-16    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 69.2 bits (169), Expect = 2e-16
Identities = 50/202 (24%), Positives = 69/202 (34%), Gaps = 28/202 (13%)

Query: 1 MKKLSLVAVTLLSALVAGQASAATDNTGFYVGGAL-------NRVTVDAFDDSETGTGFG 53
MKK +A+ + A A A AA + +Y G L + E G G
Sbjct: 1 MKKT-AIAIAVALAGFATVAQAAPKDNTWYTGAKLGWSQYHDTGFINNNGPTHENQLGAG 59

Query: 54 VYGGYNFNEWFGLEANFF----ATADLGDSDVDISAGALTFTPKFTLQINDMFSAYAKVG 109
+GGY N + G E + + A + T K I D Y ++G
Sbjct: 60 AFGGYQVNPYVGFEMGYDWLGRMPYKGSVENGAYKAQGVQLTAKLGYPITDDLDIYTRLG 119

Query: 110 VA----SMAVNVDGLGFDEDFTGFGWTYGVGVNAAVTEHLNVRLSYDITT--GDLDADHS 163
NV G D TG + GV A+T + RL Y T GD
Sbjct: 120 GMVWRADTKSNVYGKNHD---TGVSPVFAGGVEYAITPEIATRLEYQWTNNIGDAHTI-- 174

Query: 164 YLNMKDIDTDMKQLAIGVHYQF 185
D L++GV Y+F
Sbjct: 175 -----GTRPDNGMLSLGVSYRF 191


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr7_3766  PF01206  27     0.006    SirA family protein
		>PF01206#SirA family protein

Length = 76

Score = 27.4 bits (61), Expect = 0.006
Identities = 6/35 (17%), Positives = 14/35 (40%)

Query: 92 PLLMWRSRVTCAQSGKVVIVECLDERKRRSLIRWC 126
P+L + + +G+V+ V D + +
Sbjct: 18 PILKAKKTLATMNAGEVLYVMATDPGSVKDFESFS 52


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr7_3769  HTHFIS  97     2e-25    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 96.8 bits (241), Expect = 2e-25
Identities = 46/163 (28%), Positives = 77/163 (47%), Gaps = 3/163 (1%)

Query: 2 SRILLIDDDLGLSELLGQLLELEGFKLTLAYDGKQGLELALAGDYDLILLDVMLPKLNGF 61
+ IL+ DDD + +L Q L G+ + + + AGD DL++ DV++P N F
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 EVLRALRQH-KQTPVLMLTARGDEIDRVVGLEIGADDYLPKPFNDRELIARIRAIIRRSH 120
++L +++ PVL+++A+ + + E GA DYLPKPF+ ELI I +
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 121 LTAQEIHATPAQEFGDLRLDPSRQEAYCNEQLIILTGTEFTLL 163
++ + + QE Y L L T+ TL+
Sbjct: 124 RRPSKLEDDSQDGMPLVGRSAAMQEIY--RVLARLMQTDLTLM 164


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr7_3770  PF06580  39     2e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 39.5 bits (92), Expect = 2e-05
Identities = 25/121 (20%), Positives = 52/121 (42%), Gaps = 12/121 (9%)

Query: 274 IAYEAEQLEQLIAELLELSRVKLSTNETKIRLGLAESLSQVLDDAEFEADQQGKKIT--I 331
I + + +++ L EL R L + + + LA+ L+ V + + Q ++
Sbjct: 186 ILEDPTKAREMLTSLSELMRYSLRYSNARQ-VSLADELTVVDSYLQLASIQFEDRLQFEN 244

Query: 332 DIDEAIELSHYPKSLSRAIENLLRNAIRYA------QSDIHLRASQANGQVQITIKDDGP 385
I+ AI P L ++ L+ N I++ I L+ ++ NG V + +++ G
Sbjct: 245 QINPAIMDVQVPPML---VQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGS 301

Query: 386 G 386

Sbjct: 302 L 302


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr7_3771  HTHFIS  344    e-114    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 344 bits (884), Expect = e-114
Identities = 120/375 (32%), Positives = 200/375 (53%), Gaps = 17/375 (4%)

Query: 259 FHRDSALHVQTQALALTQTKSSRTLQDKPLNQLGVRFRDPLLERAWQQANKVITKQIPLL 318
F + + +ALA + + S+ D + R ++ ++ +++ + L+
Sbjct: 106 FDLTELIGIIGRALAEPKRRPSKLEDDSQDGM-PLVGRSAAMQEIYRVLARLMQTDLTLM 164

Query: 319 VLGETGVGKEQFVKKLHAQSARRSQPLVAVNCAALPAELVESELFGYQAGAFTGANRTGF 378
+ GE+G GKE + LH RR+ P VA+N AA+P +L+ESELFG++ GAFTGA
Sbjct: 165 ITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRS- 223

Query: 379 IGKIRQAHGGFLFLDEIGEMPLAAQSRLLRVLQEREVVPVGSNQSFKVDIQIIAATHMDL 438
G+ QA GG LFLDEIG+MP+ AQ+RLLRVLQ+ E VG + D++I+AAT+ DL
Sbjct: 224 TGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDL 283

Query: 439 ESLVAQGLFRQDLFYRLNGLQVRLPALRERQ-DIERIIH---KLHRRHRSSAQTLCTELL 494
+ + QGLFR+DL+YRLN + +RLP LR+R DI ++ + + + E L
Sbjct: 284 KQSINQGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLDVKRFDQEAL 343

Query: 495 AQLMRYDWPGNLRELDNLMQVACLMAEGEAVLEITHLPDYLAQKLMNLAFEPQTLTEVVD 554
+ + WPGN+REL+NL++ + + V+ + + L ++ + E
Sbjct: 344 ELMKAHPWPGNVRELENLVRRLTALYPQD-VITREIIENELRSEIPDSPIEKAAARSGSL 402

Query: 555 AETTKHPHELSESSSATIDSLHGTINLN----------VLQAYRACEGNVSQCAKRLGIS 604
+ + + + ++ D+L + + +L A A GN + A LG++
Sbjct: 403 SISQAVEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLN 462

Query: 605 RNALYRKLKQLGIKD 619
RN L +K+++LG+
Sbjct: 463 RNTLRKKIRELGVSV 477


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr7_3773  DHBDHDRGNASE  96     6e-26    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 95.9 bits (238), Expect = 6e-26
Identities = 73/257 (28%), Positives = 117/257 (45%), Gaps = 16/257 (6%)

Query: 6 IALITGASRGLGKNAALTLAAQGVDIILTYQSNAAAAAEVVAEIEWHGRKAVALPLDVGD 65
IA ITGA++G+G+ A TLA+QG I N +VV+ ++ R A A P DV D
Sbjct: 10 IAFITGAAQGIGEAVARTLASQGAHI-AAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 66 SQSFSDFSQRVKTALEQTWQRDSFNYLVNNAGIGIHVPMAETSMEQFDTLMNIHVKGPFF 125
S + + + R++ + + LVN AG+ + S E+++ +++ G F
Sbjct: 69 SAAIDEITARIEREMGP------IDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFN 122

Query: 126 LTQALLPLLAD--NGSIINVSTGLTRFAVPGFGAYATMKGAVETMTKYWAKELGPRGIRV 183
++++ + D +GSI+ V + AYA+ K A TK EL IR
Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 184 NVLAPGAIETDF-------GGGAVRDNRQMNEFLAQQTALGRVGLPEDIGGAISVLLSPA 236
N+++PG+ ETD GA + + E L ++ P DI A+ L+S
Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242

Query: 237 AAWINAQRIEASGGMFL 253
A I + GG L
Sbjct: 243 AGHITMHNLCVDGGATL 259



 
Contact Sachin Pundhir for Bugs/Comments.